The current GATK version is 3.7-0
Examples: Monday, today, last week, Mar 26, 3/26/04

#### Howdy, Stranger!

It looks like you're new here. If you want to get involved, click one of these buttons!

You can opt in to receive email notifications, for example when your questions get answered or when there are new announcements, by following the instructions given here.

#### ☞ Got a problem?

1. Search using the upper-right search box, e.g. using the error message.
3. Include tool and Java versions.
4. Tell us whether you are following GATK Best Practices.
5. Include relevant details, e.g. platform, DNA- or RNA-Seq, WES (+capture kit) or WGS (PCR-free or PCR+), paired- or single-end, read length, expected average coverage, somatic data, etc.
6. For tool errors, include the error stacktrace as well as the exact command.
7. For format issues, include the result of running ValidateSamFile for BAMs or ValidateVariants for VCFs.
8. For weird results, include an illustrative example, e.g. attach IGV screenshots according to Article#5484.
9. For a seeming variant that is uncalled, include results of following Article#1235.

#### ☞ Formatting tip!

Wrap blocks of code, error messages and BAM/VCF snippets--especially content with hashes (#)--with lines with three backticks ( ` ) each to make a code block as demonstrated here.

Picard 2.10.4 has MAJOR CHANGES that impact throughput of pipelines. Default compression is now 1 instead of 5, and Picard now handles compressed data with the Intel Deflator/Inflator instead of JDK.
GATK version 4.beta.2 (i.e. the second beta release) is out. See the GATK4 BETA page for download and details.

# Best practice for variant calling on Ion Torrent data with GATK

Member
edited October 2012

Hi,

I wonder how well GATK works with Ion Torrent data. Is there any recommended practice to handle Ion Torrent data, especially SNP and indel calling, with GATK?

Thanks,
XZ

Post edited by Geraldine_VdAuwera on
Tagged:

This is a very good question. We do have reasonably extensive experience with GATK on Ion Torrent (PGM and Proton) and we've been meaning to put together a best practices doc for Ion Torrent. My two cents: we can do reasonably well for SNP but indels are I think impossible today given the homopolymer error mode of that sequencers. I have yet to see any compelling evidence that indels can be done effectively using any approach -- not just GATK -- including the example results on the Ion Community itself.

This is a very good question. We do have reasonably extensive experience with GATK on Ion Torrent (PGM and Proton) and we've been meaning to put together a best practices doc for Ion Torrent. My two cents: we can do reasonably well for SNP but indels are I think impossible today given the homopolymer error mode of that sequencers. I have yet to see any compelling evidence that indels can be done effectively using any approach -- not just GATK -- including the example results on the Ion Community itself.

• Member

Thanks for your prompt response, Mark. I understand that the variant caller developed by Ion Torrent uses a modified version of GATK, which includes a flow space modelling function to tackle the homopolymer issue. Do you have any experience with that? If I want to call INDELs with on Ion data with GATK now, would you recommend using the current INDEL pipeline -- indel realignment, UnifiedGenytyper with glm set to DINDEL and hard filtering?

All good questions -- things we'll address on the best practices entry.

• Member

Is there more known about using GATK for variant calling SNPS and Indels?

Niels, if you mean to ask about calling variants on Ion Torrent data, we have not yet had the time to produce that documentation.

Update to this thread: we are no longer working with Ion Torrent at all and will not produce best practices for dealing with this data type.

• IndiaMember

So no way we can use Ion Torrent data with GATK for variant calling ?
Thanks
Najeeb