To celebrate the release of GATK 4.0, we are giving away free credits for running the GATK4 Best Practices pipelines in FireCloud, our secure online analysis portal. It’s first come first serve, so sign up now to claim your free credits worth $250. Sponsored by Google Cloud. Learn more at

Subsetting contigs from existing .g.vcf's


I've used HaplotypeCaller to call variants on whole genome CRAMs without specifying regions. Now, I'd like to subset this data to chromosomes 1-22 + X without redoing the calling. Is there a "proper" way to do this in GATK or I should dive into the structure of the vcf's and truncate the data?


Best Answer


Sign In or Register to comment.