The current GATK version is 3.7-0
Examples: Monday, today, last week, Mar 26, 3/26/04

Howdy, Stranger!

It looks like you're new here. If you want to get involved, click one of these buttons!

You can opt in to receive email notifications, for example when your questions get answered or when there are new announcements, by following the instructions given here.

☞ Did you remember to?

1. Search using the upper-right search box, e.g. using the error message.
3. Include tool and Java versions.
4. Tell us whether you are following GATK Best Practices.
5. Include relevant details, e.g. platform, DNA- or RNA-Seq, WES (+capture kit) or WGS (PCR-free or PCR+), paired- or single-end, read length, expected average coverage, somatic data, etc.
6. For tool errors, include the error stacktrace as well as the exact command.
7. For format issues, include the result of running ValidateSamFile for BAMs or ValidateVariants for VCFs.
8. For weird results, include an illustrative example, e.g. attach IGV screenshots according to Article#5484.
9. For a seeming variant that is uncalled, include results of following Article#1235.

☞ Formatting tip!

Wrap blocks of code, error messages and BAM/VCF snippets--especially content with hashes (#)--with lines with three backticks ( ` ) each to make a code block as demonstrated here.
GATK 3.7 is here! Be sure to read the Version Highlights and optionally the full Release Notes.

Getting insertion counts

Member Posts: 4

Hello,

For matched tumor and normal pairs, we easily get insertion and deletion counts from the output of Somatic Indel Detector in GATK. However, when we run multiple samples from the same patient, sometimes calls are made in one sample but not another, so we might not have the numbers for all samples for all indel events. We can get the deletion counts from Depth of Coverage in GATK, but retrieving insertions is trickier.

Does you have a suggestion for how to solve this problem in an automated (ie non-IGV fashion)?

Additionally, as DepthofCoverage is being retired, what do you recommend that we use for getting SNP and deletion counts?

Thank you

Tagged:

Hi there,

DepthofCoverage is actually getting a reprieve -- we won't retire it until DiagnoseTargets is able to completely take over the DoC functionality.

Unfortunately we don't have experience with cancer / somatic mutations, so we can't really advise you on this topic. Perhaps someone in the user community can give you some pointers.

Geraldine Van der Auwera, PhD

• Member Posts: 4

I'm glad to hear that DoC will remain active for a while.
My other question does not require any knowledge of cancer or somatic mutations, so I apologize for not being concise. Reworded: Is there a GATK tool that I can use to get counts of specific indels? (Something like BaseCounts or DoC for indels.)
Thank you.

Do you mean counting in how many of the patient's samples a specific indel occurs? If so I don't think we have a specific tool to do that, but you could just call indels on the interval where the indel occurs, then use the variant manipulation tools to find out the counts. Does that make sense?

Geraldine Van der Auwera, PhD

• Member Posts: 4

Hi Geraldine,
I mean counting how many of the reads in a bam or sample does a specific indel occur. The issue is that while it may occur in that sample, it may be below the threshold of what UnifiedGenotyper would call. For example, if there's only 2 indels out of 634 reads, UnifiedGenotyper would likely not call that, but we still need to retrieve that data.
Thank you.

• Member Posts: 4

Thank you! I'll need to play around with the read filters a bit, but I think this will work.

• Member Posts: 61

Hi Geraldine,

In the DoC tool there is an option for counting the bases called --printBaseCounts and another for counting deletions called --includeDeletions but there is nothing for counting insertions! I have some ultra-deep sequencing data and I would like to count the bases, insertions and deletions per base. Is it possible to do this on GATK, if so which tool? In IGV if you mouse over the top it would show the coverage of C,G,T,A and Ins and Del for each base. I want to do basically the same thing IGV dose but for all regions printed in a form of DoC output, unfortunately, DoC dose it all except the insertions!

Reza