What is a good number of samples for detecting a variant? I have 15K GVCFs with a DP of 1000

ravichav (United States, Member)
edited December 2017 in Ask the GATK team


I have 15k GVCFs. To call variants, I understand I can run the CombineGVCFs step to merge the GVCFs in batches. For BAMs with coverage of 800-1000X, what is a good number of samples per batch for detecting a variant? Would variants called from batches of 500 samples have the same power to detect a variant as a joint call across all 15k samples together?

