Select a fraction of the variants on a specific chromosome
I am using the following command to select variants from my VCF file:
java -jar GenomeAnalysisTK.jar \
-T SelectVariants \
-R reference.fasta \
-V input.vcf \
-o output.vcf \
I am wondering whether the command above extracts 50% of the variants on chromosome_x, or whether it extracts 50% of all the variants in the VCF and then prints only those that fall on chromosome_x. I suspect the first behaviour is the correct one (it is what I need), but I am asking to be sure.
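To make the two possibilities concrete, here is a minimal Python sketch of both orders of operation on toy data. It is only an illustration of the question, not a description of how SelectVariants is implemented: the function names, the `(chromosome, position)` tuples standing in for VCF records, and the assumption that each record is kept by an independent random draw are all hypothetical.

```python
import random


def select_then_sample(records, chrom, fraction, seed=0):
    """Restrict to one chromosome first, then keep each record with probability `fraction`."""
    rng = random.Random(seed)
    on_chrom = [r for r in records if r[0] == chrom]
    return [r for r in on_chrom if rng.random() < fraction]


def sample_then_select(records, chrom, fraction, seed=0):
    """Keep each record of the whole file with probability `fraction`, then restrict to one chromosome."""
    rng = random.Random(seed)
    sampled = [r for r in records if rng.random() < fraction]
    return [r for r in sampled if r[0] == chrom]


# Toy data (hypothetical): 1000 records on chromosome_x, 3000 on chromosome_y.
records = [("chromosome_x", pos) for pos in range(1000)]
records += [("chromosome_y", pos) for pos in range(3000)]

first = select_then_sample(records, "chromosome_x", 0.5)
second = sample_then_select(records, "chromosome_x", 0.5)

# Number of chromosome_x records kept under each order of operations.
print(len(first), len(second))
```

Under the assumption of an independent draw per record, both orders keep each chromosome_x variant with probability 0.5, so the expected output size is the same either way. The two would only differ if the tool selected an exact count of records rather than sampling each one independently, which is the part I would like confirmed.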
Thanks in advance,