It looks like you're new here. If you want to get involved, click one of these buttons!
I am facing this error, when I try to validate the variants (ValidateVariants) of a vcf file which is produced through GATK just after UnifiedGenotyper. I am using GenomeAnalysisTK-2.3-6-gebbba25 and dbsnp_137.hg19.vcf. These variants are annotated by DepthOfCoverage, aplotypeScore, ,InbreedingCoeff and LowMQ ...
Basically, I generate two VCF files using UnifiedGenotyper separately, one for SNP and the other for INDEL.
the error for both is about the Allele Count (AC) tag: ##### ERROR MESSAGE: File F93.snp.vcf fails strict validation: the Allele Count (AC) tag is incorrect for the record at position chr1:1225579, 1 vs. 1
I appreciate your comments,
Answers
Can you upgrade to the latest version and see if the error persists?
Geraldine Van der Auwera, PhD
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •I tried GenomeAnalysisTK-2.3-9-ge5ebf34, the error still persists ! the Allele Count (AC) tag is incorrect for the record at position chrM:302, 2 vs. 2
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •Ok. Can you post the VCF record where the error occurs (chrM:302)?
Geraldine Van der Auwera, PhD
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •Can you please post the complete record? This is missing the format field and sample values.
Geraldine Van der Auwera, PhD
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •Ah, there you go -- the counts are indeed incorrect. The record includes 3 samples with the first allele but only 1 of the second, unlike what is reported in the AC tag. Did you exclude some samples after processing, perhaps?
Geraldine Van der Auwera, PhD
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •no ! I didn't ! I used UnifiedGenotyper separably for calling SNP and indels from 93 samples ; then I did filtration
for indel
for SNP:
and then I combined these two, using CombineVariants; as you see, I didn't remove any sample.
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •I see -- Just to be clear, are you getting the error when you run ValidateVariants on the files that come straight out of the UnifiedGenotyper, or on the combined VCF that results from CombineVariants?
Geraldine Van der Auwera, PhD
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •Ah, actually I was mistaken -- the sample that is 2/2 counts for two alleles, so the tag is in fact correct. We'll have a closer look at this.
Geraldine Van der Auwera, PhD
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •Surprisingly, I don't see the error again ! sounds like miracle :D , now I am getting something like File F93.all.vcf fails strict validation: the rsID rs35614524 for the record at position chr9:139565479 is not in dbSNP ... I haven't start to pray for it to get fix automatically
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •Hmm, that's odd. Well, the new version will be out very soon, which is more thoroughly tested -- hopefully you won't suffer these weird phantom bugs then.
Geraldine Van der Auwera, PhD
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •