Download the bam file of NA12878 chr20

cardilloxcardillox Posts: 6Member

Dear all, I am really a newbie in NGS, so I apologize if I say something completely wrong. In order to start to manage this kind of data I am trying to follow the workflow described in http://www.broadinstitute.org/gatk/guide/topic?name=intro. It'suggested to download the "raw and realigned, recalibrated NA12878 test data from the GATK resource bundle". When i connect to the ftp through Filezilla I cannot find the bam file of NA12878 (/bundle/hg19) but only the vcf files. Is it correct? Should I download the vcf files?

Sorry again and thank you very much for your availability.

Francesco

Tagged:

Comments

  • Geraldine_VdAuweraGeraldine_VdAuwera Posts: 6,672Administrator, GATK Developer admin

    No, you want the bam files, but they're in the b37 directory (not hg19). They are aligned to the b37 reference (even though the name says hg19 -- that is a temporary glitch that will be corrected in the next release). If you want to work with hg19 you'll need to revert the files (using revertSam from Picard) and redo alignment. But if you have no preference (or external constraint) I recommend you work with b37-aligned files, since most of the resources we provide are made for b37 versions.

    I hope this helps, good luck!

    Geraldine Van der Auwera, PhD

  • cardilloxcardillox Posts: 6Member

    Thank you very much! I am downloading it and I am starting with introductory analysis.

    Hoping everything will so well!

    Thank you again

    Francesco

Sign In or Register to comment.