The current GATK version is 3.6-0
Examples: Monday, today, last week, Mar 26, 3/26/04

Howdy, Stranger!

It looks like you're new here. If you want to get involved, click one of these buttons!

Powered by Vanilla. Made with Bootstrap.

VariantAnnotator on tiny VCF file

drmjcdrmjc Garvan Institute of Medical ResearchMember Posts: 14

Hi,
I have a tiny, 5000-line VCF file that I want to add dbSNP annotations to.
I'm surprised to see VariantAnnotator iterating along the millions of records in the dbSNP file, rather than the 5000 variants in the input VCF file. This will take 42mins on dbsnp 137...

Am I misunderstanding how this tool works, or just using it wrongly?

thanks, Mark

java -Xmx2g -jar /share/ClusterShare/software/contrib/gi/gatk/2.5/dist/GenomeAnalysisTK.jar -T VariantAnnotator -R /share/ClusterShare/biodata/contrib/gi/gatk-resource-bundle/2.5/hg19/ucsc.hg19.fasta --variant PG0000864-BLD_PGx_cleaned.vcf --dbsnp /share/ClusterShare/biodata/contrib/gi/gatk-resource-bundle/2.5/hg19/dbsnp_137.hg19.vcf --out PG0000864-BLD_PGx,GATK.vcf --validation_strictness SILENT

Best Answer

Answers

  • drmjcdrmjc Garvan Institute of Medical ResearchMember Posts: 14

    a colleague pointed out the -L flag which really sped things up. Perhaps I could rephrase the question: if you specify --variant, then should -L be implied?

  • ebanksebanks Broad InstituteMember, Administrator, Broadie, Moderator, Dev Posts: 698 admin

    Not for the Variant Annotator

    Eric Banks, PhD -- Director, Data Sciences and Data Engineering, Broad Institute of Harvard and MIT

  • drmjcdrmjc Garvan Institute of Medical ResearchMember Posts: 14

    Thanks for he quick response Eric, but just wondering if you could elaborate? if you only care about the variants within --variants my.vcf, then why look outside of these regions? I'm just trying to get my head around this.
    cheers, Mark

  • drmjcdrmjc Garvan Institute of Medical ResearchMember Posts: 14

    Thanks Geraldine, that makes perfect sense.

Sign In or Register to comment.