If you happen to see a question you know the answer to, please do chime in and help your fellow community members. We encourage our fourm members to be more involved, jump in and help out your fellow researchers with their questions. GATK forum is a community forum and helping each other with using GATK tools and research is the cornerstone of our success as a genomics research community.We appreciate your help!

Test-drive the GATK tools and Best Practices pipelines on Terra

Check out this blog post to learn how you can get started with GATK and try out the pipelines in preconfigured workspaces (with a user-friendly interface!) without having to install anything.

Indel calling in pooled samples

I am calling indel in pooled samples using this command:
java -jar -Xmx2g /PATH/2.1.13/GenomeAnalysisTK.jar -l INFO -T UnifiedGenotyper -I pool1.bam -I pool2.bam --out INDEL.vcf -R /reference.fa -glm INDEL

Currently i donot have any information of already known indels.
1.Do i need to first realign (RealignerTargetCreator and IndelRealigner) and then call indels even for pooled data?
2. How different will this be for calling indel on individual sample?

Looking forward for your suggesions.
with thanks sasha

Best Answers


  • sashasasha Member

    Thank you it did answer my question. In the second part i was just wondering if we have individual samples (instead of pool) then will it differ from calling indels? But i think in both cases we need to do the local realignment process (as you mentioned above).

  • sashasasha Member
    edited April 2013

    One more question related to this topic, Now that i have realigned (RealignerTargetCreator and IndelRealigner) and finally got a realigned.bam file by using this command:
    java -jar -Xmx2g /PATH/2.1.13/GenomeAnalysisTK.jar -l INFO -T IndelRealigner -I pool1.bam -I pool2.bam -I pool3.bam ... -L chr1 -targetIntervals pools_chr1.intervals -R reference.fa -o pools_realignedBam_chr1.bam

    So now i should use this pools_realignedBam_chr1.bam to call indels using:
    java -jar -Xmx2g /PATH/2.1.13/GenomeAnalysisTK.jar -l INFO -T UnifiedGenotyper -I pools_realignedBam_chr1.bam --out INDEL.vcf -R reference.fa -glm INDEL

    My question is that should i realign all pool1.bam pool2.bam … seperately? or its the right way by first realigning all pools together and then passing a single realigned file to unified genotyper??

    Looking forward for your guidance.

Sign In or Register to comment.