Filtering VCF passed to --knownSites or --known on-the-fly?

edited September 2012

My current workflow for analysing mouse exome-sequencing data (based on v4 of Best Practices) can require me to use slightly different VCFs as --knownSites or --known parameters in BQSR, indel realignment etc. Basically, I have a "master" VCF that I subset using SelectVariants. The choice of subset largely depends on the strain of the mice being sequenced but also on other things such as `AF`. It'd be great to be able to do this on-the-fly in conjunction with `--known` in tools that require knownSites, rather than having to create project-specific (or even tool-specific) VCFs.

Is there a way to do this that I've overlooked? Is this a feature that might be added to GATK?

Hi Pete,

If you mean something like the -L option for intervals, but that would select a subset of variants from within a VCF instead, then no, that's not currently a feature. If there were to be significant demand for such a feature we may consider it, but right now you shouldn't count on it, sorry. If you or someone else wants to implement the feature we'd certainly be happy to look at a patch.

Good luck!

Geraldine Van der Auwera, PhD

Hi Geraldine,

I'm thinking of options like in VariantFiltration, e.g. `--filterExpression "AB < 0.2 || MQ0 > 50"`. I doubt I'll have time myself to implement such a thing but I thought I'd raise it here in case it was something of interest to other GATK users. For now I'll continue to just work around it.

Thanks.
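
For anyone landing on this thread: since on-the-fly filtering is not available, the workaround is still to write a subset VCF first and pass that file to --knownSites or --known. The following is a minimal sketch of such a pre-filter step in plain Python. It is not GATK code and does not replace SelectVariants; the function names (`parse_info`, `subset_vcf`), the sample records and the `AF >= 0.1` threshold are all hypothetical, and it assumes the chosen INFO key holds a single numeric value.

```python
# Sketch only: keep VCF records whose INFO value for a given key meets a
# minimum threshold. Names and sample data are illustrative assumptions.

def parse_info(info_field):
    """Turn a VCF INFO string such as 'AF=0.25;DB' into a dict."""
    info = {}
    if info_field == ".":
        return info
    for entry in info_field.split(";"):
        key, _, value = entry.partition("=")
        info[key] = value  # flag entries without '=' map to ''
    return info


def subset_vcf(lines, key, min_value):
    """Yield header lines plus records where INFO[key] >= min_value.

    Records that lack the key, or whose value is not a single number
    (e.g. multi-allelic 'AF=0.1,0.2'), are dropped in this sketch.
    """
    for line in lines:
        if line.startswith("#"):
            yield line  # pass all header lines through unchanged
            continue
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 8:
            continue  # not a complete VCF record
        info = parse_info(fields[7])
        try:
            value = float(info[key])
        except (KeyError, ValueError):
            continue
        if value >= min_value:
            yield line


# Tiny made-up "master" VCF used only to exercise the function.
MASTER_VCF = (
    "##fileformat=VCFv4.1\n"
    "#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO\n"
    "1\t100\t.\tA\tG\t50\tPASS\tAF=0.05\n"
    "1\t200\t.\tC\tT\t50\tPASS\tAF=0.40;DB\n"
    "1\t300\t.\tG\tA\t50\tPASS\tDP=12\n"
)

if __name__ == "__main__":
    for kept in subset_vcf(MASTER_VCF.splitlines(keepends=True), "AF", 0.1):
        print(kept, end="")
```

In a real project the filtered lines would be written to a project-specific file and that file handed to the tool, which is the same extra step described above; the sketch only shows how little logic the subsetting itself involves.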