Holiday Notice:
The Frontline Support team will be offline December 17-18 due to an institute-wide retreat and offline December 22- January 1, while the institute is closed. Thank you for your patience during these next few weeks as we get to all of your questions. Happy Holidays!

GATK v4.0.11.0 HaplotypeCaller missing SNP

Hi GATK team,

GATK v4.011.0 haplotype caller is missed to call this SNP.

$java8 -Xmx10g -jar /gatk-4.0.11.0/gatk-package-4.0.11.0-local.jar \
    HaplotypeCaller \
    -R hg19.fa \
    -L Target.bed \
    -I sample.markDup.recalibrate.bam \
    -A Coverage \
    -A MappingQualityRankSumTest \
    -A ReadPosRankSumTest \
    -A BaseQualityRankSumTest \
    -G StandardAnnotation \
    --dbsnp hg19_dbSNP138.vcf \
    --min-base-quality-score 10 \
    --output sample.HaplotypeCaller.snp.indel.vcf \
    --stand-call-conf 10 \
    --bam-output sample.HaplotypeCaller.bam 

I have tried various parameters like (--allow-non-unique-kmers-in-ref, --kmer-size 10, 25, 35 ), but no luck.
All base & mapping qualities of the non-reference bases are looking good.

This is the output of the HaplotypeCaller GVCF.
chr20 62316953 . A . . END=62316953 GT:DP:GQ:MIN_DP:PL 0/0:98:0:98:0,0,416

But GATK v3.2 UnifiedGenotyper have detected this variant and pass the filters.
chr20 62316953 . A G 947.77 PASS ABHet=0.622;AC=1;AF=0.500;AN=2;BaseQRankSum=3.481;DP=98;Dels=0.00;FS=5.289;HaplotypeScore=2.3548;MLEAC=1;MLEAF=0.500;MQ=58.64;MQ0=0;MQRankSum=0.022;QD=9.67;ReadPosRankSum=-0.366;SB=-3.310e+02 GT:AD:DP:GQ:PL 0/1:61,37:98:99:976,0,1402

Any idea why this SNP was not called by HC?

Thanks for your help.

Regards
Shibu

Best Answer

Answers

  • bhanuGandhambhanuGandham Member, Administrator, Broadie, Moderator admin
    edited November 14

    Hi @shibujohn82

    Is the variant covered in the interval bed?
    Also it would be helpful to see the IGV view of bamout for the region.

    Regards
    Bhanu

  • Hi Bhanu,
    Yes, this variant is covered in the interval bed and there were no reads in --bamout file for this region. Have a look into the IGV screenshot.

    Thanks
    Shibu

  • bhanuGandhambhanuGandham Member, Administrator, Broadie, Moderator admin

    Hi @shibujohn82

    I agree with you this is weird. Would you please send us an IGV screenshot where we can actually see the reads with the alt base, just to make sure there's not something obvious where all the reads which support alt are very bad or something. Also would you please run just this region (so like -L 20:62316800-62317100) with the -debug flag, and send along the resulting stdout. Beyond that, if you are willing to send along your input bam (or even just chr20 of your input bam), we can try to do more debugging here. Instructions to send us your data can be found here.

    Regards
    Bhanu

  • Hi Bhanu,
    Have a look into the IGV screenshot with alt base.

    And I run pipeline with -L 20:62316800-62317100 and I can see this deletion.

    chr20 62316920 . AGCAGGGCTGGGGGCCTTACAGTCCTATAAGGTAGGGGCCACCTCCAGGAGGCAGGTGGAGGGCAGCCCTTGTTCCCCG A 798.73 . AC=1;AF=0.500;AN=2;BaseQRankSum=1.504;DP=133;ExcessHet=3.0103;FS=5.241;MLEAC=1;MLEAF=0.500;MQ=59.82;MQRankSum=1.038;QD=6.01;ReadPosRankSum=-4.317;SOR=1.079 GT:AD:DP:GQ:PL 0/1:65,68:133:99:836,0,19504

    Please find the attached the --debug stdout file and I have uploaded this file (shibujohn.chr20.bam.zip) into the ftp server.

    Regards
    Shibu

Sign In or Register to comment.