Test-drive the GATK tools and Best Practices pipelines on Terra
Check out this blog post to learn how you can get started with GATK and try out the pipelines in preconfigured workspaces (with a user-friendly interface!) without having to install anything.
VQSR on specific genomic region
Dear GATK Team,
I have exome-data of many individuals (>2000) called with the HaplotypeCaller, but only of a specific set of genes from the genome. I would like to apply the VQSR-tool to recalibrate my variants, but (as expected) I get back an error 'No data found'.
I know there is an option to 'pad' your data with other exomes, but then the generation method needs to be comparable to my dataset (which whole-exome-sequencing is not).
Alternatively, I was therefore wondering if there is an option to 'focus' the VQSR-tool only on specific regions of the genome/exome?
Because I know for sure that if only my regions would be considered in the recalibration I would have enough variants to create a recalibration model.
Thank you for your help in advance,