Bug Bulletin: The recent 3.2 release fixes many issues. If you run into a problem, please try the latest version before posting a bug report, as your problem may already have been solved.

Filtering VCF passed to --knownSites or --known on-the-fly?

PeteHaitchPeteHaitch Posts: 19Member
edited September 2012 in Ask the GATK team

My current workflow for analysing mouse exome-sequencing (based on v4 of Best Practices) can require me to use slightly different VCFs as --knownSites or --known parameters in BQSR, indel realignment etc. Basically, I have a "master" VCF that I subset using SelectVariants. The choice of subset largely depends on the strain of the mice being sequenced but also on other things such as AF'. It'd be great to be able to do this on-the-fly in conjunction with--known' in tools that required knownSites rather than having to create project-specific (or even tool-specific) VCFs.

Is there a way to do this that I've overlooked? Is this a feature that might be added to GATK?

Post edited by PeteHaitch on

Best Answer

  • Geraldine_VdAuweraGeraldine_VdAuwera Posts: 5,821 admin
    Answer ✓

    Hi Pete,

    If you mean something like the -L option for intervals, but that would select a subset of variants from within a VCF instead, then no, that's not currently a feature. If there were to be significant demand for such a feature we may consider it, but right now you shouldn't count on it, sorry. If you or someone else wants to implement the feature we'd certainly be happy to look at a patch.

    Good luck!


Sign In or Register to comment.