The current GATK version is 3.7-0
Examples: Monday, today, last week, Mar 26, 3/26/04

Howdy, Stranger!

It looks like you're new here. If you want to get involved, click one of these buttons!

Powered by Vanilla. Made with Bootstrap.
GATK 3.7 is here! Be sure to read the Version Highlights and optionally the full Release Notes.
Register now for the upcoming GATK Best Practices workshop, Feb 20-22 in Leuven, Belgium. Open to all comers! More info and signup at http://bit.ly/2i4mGxz

GATK UGT -heterozygosity / -hets

kumar35885kumar35885 Member Posts: 2
edited October 2012 in Ask the GATK team

I have an inbred mouse strain that I am sequencing and there should be little to NO heterozygosity. Yet with the default settings of UGT -heterozygosity (which is 0.001) many homs are being called as hets. When 230/250 reads are alternate and 20/250 are reference, it calls a het, even though it should be homozygous alternate.

What do you recommendations for this setting for inbred animals?

thanks, GATK is great!

Vivek

Post edited by Geraldine_VdAuwera on

Best Answer

  • ebanksebanks Broad InstituteMember, Administrator, Broadie, Moderator, Dev Posts: 701 admin
    Accepted Answer

    The heterozygosity is the probability that the allele for a single chromosome is non-reference (which in humans is 1/1000 for SNPs). So even for an "inbred human" you would still expect a variant on a given chromosome once every 1000 bases. It is not the probability that a diploid sample is heterozygous at a given position.

    Eric Banks, PhD -- Director, Data Sciences and Data Engineering, Broad Institute of Harvard and MIT

Answers

  • ebanksebanks Broad InstituteMember, Administrator, Broadie, Moderator, Dev Posts: 701 admin
    Accepted Answer

    The heterozygosity is the probability that the allele for a single chromosome is non-reference (which in humans is 1/1000 for SNPs). So even for an "inbred human" you would still expect a variant on a given chromosome once every 1000 bases. It is not the probability that a diploid sample is heterozygous at a given position.

    Eric Banks, PhD -- Director, Data Sciences and Data Engineering, Broad Institute of Harvard and MIT

Sign In or Register to comment.