Test-drive the GATK tools and Best Practices pipelines on Terra


Check out this blog post to learn how you can get started with GATK and try out the pipelines in preconfigured workspaces (with a user-friendly interface!) without having to install anything.

GATK4,Cann't get right CalculateContamination result

yyjyyj Member
Question regarding CalculateContamination(GATK/4.1.2.0):

With CalculateContamination in tumor matched mode, I get:
contamination error
NaN 1.0

When I look at the tumor.table and normal. table files generated by Getpileupsummaries, I don't see any unusual data structure/value

What can be the problem?

Answers

  • bhanuGandhambhanuGandham Cambridge MAMember, Administrator, Broadie, Moderator admin

    Hi @yyj

    Can you please post the exact commands you used for Getpileupsummaries and CalculateContamination. Also please post the first few records of the Getpileupsummaries table.

  • yyjyyj Member
    @ bhanuGandham
    The Getpileupsummaries commands as follow:
    for sample in *.bam;
    do
    base=${sample%%.*};
    gatk4 GetPileupSummaries
    -I $sample
    -V ../../reference/somatic-hg38_af-only-gnomad.hg38.SNP_biallelic.vcf.gz
    -O ./contamination/"$base"_getpileupsummaies.table;
    done

    The CalculateContamination commands as follow:
    for sample in *.bam;
    do base=${sample%%.*};
    gatk4 GetPileupSummaries
    -I $sample
    -V ../../reference/somatic-hg38_af-only-gnomad.hg38.SNP_biallelic.vcf.gz
    -L ../../reference/S07604514_hs_hg38/S07604514_hs_hg38/S07604514_Padded.bed
    -O ./contamination/"$base"_getpileupsummaies.table;done
    The first 30 lines of getpileupsummaies.table:
    contig position ref_count alt_count other_alt_count allele_frequency
    chr1 12882 0 0 0 0.021
    chr1 13110 0 0 0 0.149
    chr1 13143 0 0 0 0.05
    chr1 13149 0 0 0 0.011
    chr1 13178 0 0 0 0.061
    chr1 13273 0 0 0 0.115
    chr1 13281 0 0 0 0.04
    chr1 13418 0 0 0 0.183
    chr1 13613 0 0 0 0.02
    chr1 13621 0 0 0 0.011
    chr1 13649 0 0 0 0.054
    chr1 13684 0 0 0 0.027
    chr1 13752 0 0 0 0.018
    chr1 13757 0 0 0 0.021
    chr1 14522 0 0 0 0.05
    chr1 14542 0 0 0 0.072
    chr1 14574 0 0 0 0.101
    chr1 14590 0 0 0 0.097
    chr1 14599 0 0 0 0.123
    chr1 14604 0 0 0 0.125
    chr1 14610 0 0 0 0.128
    chr1 14626 0 0 0 0.011
    chr1 14671 0 0 0 0.011
    chr1 14677 0 0 0 0.058
    chr1 14773 0 0 0 0.016
    chr1 14843 0 0 0 0.018
    chr1 14933 0 0 0 0.158
    chr1 14948 0 0 0 0.055
  • bhanuGandhambhanuGandham Cambridge MAMember, Administrator, Broadie, Moderator admin
    edited May 29

    @yyj
    1)
    --intervals is a required argument for GetPileupSummaries, see : https://software.broadinstitute.org/gatk/documentation/tooldocs/current/org_broadinstitute_hellbender_tools_walkers_contamination_GetPileupSummaries.php

    2)

    The CalculateContamination commands as follow:
    for sample in .bam;
    do base=${sample%%.
    };
    gatk4 GetPileupSummaries

    In this case you are using GetPileupSummaries instead of CalculateContamination. See: https://software.broadinstitute.org/gatk/documentation/tooldocs/current/org_broadinstitute_hellbender_tools_walkers_contamination_CalculateContamination.php

Sign In or Register to comment.