The current GATK version is 3.7-0
Examples: Monday, today, last week, Mar 26, 3/26/04

Howdy, Stranger!

It looks like you're new here. If you want to get involved, click one of these buttons!

Get notifications!


You can opt in to receive email notifications, for example when your questions get answered or when there are new announcements, by following the instructions given here.

Got a problem?


1. Search using the upper-right search box, e.g. using the error message.
2. Try the latest version of tools.
3. Include tool and Java versions.
4. Tell us whether you are following GATK Best Practices.
5. Include relevant details, e.g. platform, DNA- or RNA-Seq, WES (+capture kit) or WGS (PCR-free or PCR+), paired- or single-end, read length, expected average coverage, somatic data, etc.
6. For tool errors, include the error stacktrace as well as the exact command.
7. For format issues, include the result of running ValidateSamFile for BAMs or ValidateVariants for VCFs.
8. For weird results, include an illustrative example, e.g. attach IGV screenshots according to Article#5484.
9. For a seeming variant that is uncalled, include results of following Article#1235.

Did we ask for a bug report?


Then follow instructions in Article#1894.

Formatting tip!


Wrap blocks of code, error messages and BAM/VCF snippets--especially content with hashes (#)--with lines with three backticks ( ``` ) each to make a code block as demonstrated here.

Jump to another community
Picard 2.10.2 is now available. As of 2.10.0, Picard supports NovaSeq CBCL data. Download and read release notes at https://github.com/broadinstitute/picard/releases.
**GATK4-BETA.2** is here. That's TWO, as in the second beta release. Be sure to read about the known issues before test driving. See Article#9881 to start and https://github.com/broadinstitute/gatk/blob/master/README.md for details.

Difference in GenotypeConcordance between GATK and Picard tools

avinashsavinashs NYCMember
edited September 2016 in Ask the GATK team

Hi,

I am trying to compare Genotype calls between two multi-sample VCFs. I ran the GenotypeConcordance tools from both GATK and Picard (Picard version for just one of the samples). But I see that the GenotypeConcordance reported by the two tools are different.

Relevant portion of the results from the two tools below.


*** Picard ***
VARIANT_TYPE TRUTH_SAMPLE CALL_SAMPLE HET_SENSITIVITY HET_PPV HET_SPECIFICITY HOMVAR_SENSITIVITY HOMVAR_PPV HOMVAR_SPECIFICITY VAR_SENSITIVITY VAR_PPV VAR_SPECIFICITY GENOTYPE_CONCORDANCE NON_REF_GENOTYPE_CONCORDANCE
SNP HG00109 HG00109 0.436364 0.405042 ? 0.430914 0.386074 ? 0.434393 0.397989 0.612558 0.960685 0.960685
INDEL HG00109 HG00109 0.285897 0.233739 ? 0.310651 0.251152 ? 0.290306 0.236989 0.592559 0.66537 0.66537

**** GATK ****
Sample Non-Reference Sensitivity Non-Reference Discrepancy Overall_Genotype_Concordance
ALL 0.394 0.050 0.991
HG00109 0.405 0.042 0.993


I understand that Picard splits up the SNP and INDEL concordances. But since both of them are lower than the GATK reported concordance, I can't see any scenario where combining the two would make it equal to the GATK reported concordance.

Any suggestions on what the reason for the difference might be and which one is probably more accurate? Both tools were run with all default arguments and the two VCFs being compared were both generated by HaplotypeCaller with pretty much identical settings.

I found an old post suggesting that the Picard version is likely better and the GATK version might be deprecated soon ( http://gatkforums.broadinstitute.org/gatk/discussion/5795/genotype-concordance-output ). But since it is more than a year old, I don't know if it still holds true. The GATK version is slightly easier for my purposes (comparing multiple samples at one go). But want to check before starting to use it.

thanks!

Best Answer

Answers

Sign In or Register to comment.