The frontline support team will be unavailable to answer questions on April 15th and 17th 2019. We will be back soon after. Thank you for your patience and we apologize for any inconvenience!

Non-human truth and training resources

henriettevdzhenriettevdz South Africa Member

Good day,

I'm working on a bird whole genome and I need to set up truth and training resources. In the documentation it says the following and I just want to confirm that I understand this correctly:

"If you are working with non-human genomes, you will need to find or generate at least truth and training resource datasets with properties corresponding to those described below. To generate your own resource set, one idea is to first do an initial round of SNP calling and only use those SNPs which have the highest quality scores. These sites which have the most confidence are probably real and could be used as truth data to help disambiguate the rest of the variants in the call set."

Do I understand it correctly that in the arguments I will be using the following:
-resource: raw_variants ,known=false,training=true,truth=true,prior=15.0 raw_variants.vcf

So I will use the raw variant vcf file that I have created in the "Call variants" step previously as my truth site training resource?

Thanks a lot!

Issue · Github
by Sheila

Issue Number
Last Updated
Closed By

Best Answer


Sign In or Register to comment.