The current GATK version is 3.7-0
Examples: Monday, today, last week, Mar 26, 3/26/04

Howdy, Stranger!

It looks like you're new here. If you want to get involved, click one of these buttons!

Powered by Vanilla. Made with Bootstrap.
GATK 3.7 is here! Be sure to read the Version Highlights and optionally the full Release Notes.
Register now for the upcoming GATK Best Practices workshop, Feb 20-22 in Leuven, Belgium. Open to all comers! More info and signup at http://bit.ly/2i4mGxz

Which bundle should I use for bam files from Illumina's CASAVA pipeline?

ihleeihlee Member Posts: 2
edited January 2013 in Ask the GATK team

I have bam files generated by Illumina's CASAVA pipeline, and the reference genome version in the accompanying document is 'NCBI37_UCSC'. The chromosomes are all named after UCSC hg19 convention, but 'NCBI37' part in the name confuses me. I'm not sure which bundle files should I use with these bam files, b37 or hg19.

Post edited by Geraldine_VdAuwera on
Tagged:

Answers

  • ebanksebanks Broad InstituteMember, Administrator, Broadie, Moderator, Dev Posts: 701 admin

    You should ask Illumina as they provided these files for you. You need to make sure that Illumina used a standard reference - and if not then they need to provide you with the necessary reference file(s).

    Eric Banks, PhD -- Director, Data Sciences and Data Engineering, Broad Institute of Harvard and MIT

  • ihleeihlee Member Posts: 2

    The reference files did come with the bam files. I'm just not sure whether they're from NCBI build or UCSC build. I hoped someone with much experience in handling Illumina data with GATK might help me to solve this problem.

  • ebanksebanks Broad InstituteMember, Administrator, Broadie, Moderator, Dev Posts: 701 admin

    You should post in the Ask The Community section then. I've just moved this discussion there.

    Eric Banks, PhD -- Director, Data Sciences and Data Engineering, Broad Institute of Harvard and MIT

Sign In or Register to comment.