The current GATK version is 3.8-0
Examples: Monday, today, last week, Mar 26, 3/26/04

Howdy, Stranger!

It looks like you're new here. If you want to get involved, click one of these buttons!

Get notifications!

You can opt in to receive email notifications, for example when your questions get answered or when there are new announcements, by following the instructions given here.

Got a problem?

1. Search using the upper-right search box, e.g. using the error message.
2. Try the latest version of tools.
3. Include tool and Java versions.
4. Tell us whether you are following GATK Best Practices.
5. Include relevant details, e.g. platform, DNA- or RNA-Seq, WES (+capture kit) or WGS (PCR-free or PCR+), paired- or single-end, read length, expected average coverage, somatic data, etc.
6. For tool errors, include the error stacktrace as well as the exact command.
7. For format issues, include the result of running ValidateSamFile for BAMs or ValidateVariants for VCFs.
8. For weird results, include an illustrative example, e.g. attach IGV screenshots according to Article#5484.
9. For a seeming variant that is uncalled, include results of following Article#1235.

Did we ask for a bug report?

Then follow instructions in Article#1894.

Formatting tip!

Wrap blocks of code, error messages and BAM/VCF snippets--especially content with hashes (#)--with lines with three backticks ( ``` ) each to make a code block as demonstrated here.

Jump to another community
Download the latest Picard release at
GATK version 4.beta.3 (i.e. the third beta release) is out. See the GATK4 beta page for download and details.

how to change the fixed Q20 clipping of reads during indel calling?

GATK 1.6 claims to apply a fixed Q20 threshold to clip ends of the reads for the indel caller:
3. Indel Calling with the Unified Genotyper

[...] while many of the parameters are common between indel and SNP
calling, some parameters have different meaning or operate
differently. For example, --min_base_quality_score has a fixed, well
defined operation for SNPs (bases at a particular location with base
quality lower than this threshold are ignored). However, indel calling
is by definition delocalized and haplotype-based, so this parameter
does not make sense. Instead, the indel caller will clip both ends of
the reads if their quality is below a certain threshold (Q20), up to
the point where there is a base in the read exceeding this threshold.

Also here:

--min_base_quality_score / -mbq ( int with default value 17 )

Minimum base quality required to consider a base for calling. The
minimum confidence needed in a given base for it to be used in variant
calling. Note that the base quality of a base is capped by the mapping
quality so that bases on reads with low mapping quality may get
filtered out depending on this value. Note too that this argument is
ignored in indel calling. In indel calling, low-quality ends of reads
are clipped off (with fixed threshold of Q20).

Can this "fixed threshold of Q20" be changed to another value when running the Unified Genotyper?

Best Answer


Sign In or Register to comment.