Genotyping a substitution using the HaplotypeCaller

LaurentLaurent Member, Collaborator Posts: 43 ✭✭
edited December 2012 in Ask the GATK team

Hi All,

I have the following substitution that I am trying to genotype in a deep coverage (>1000x) dataset:

I've aligned it using very relaxed BWA parameters and am now getting it correctly with the Haplotype caller, however it is currently genotyped as multiple indel/SNP events:
4 2558307 G GAGCTA
4 2558310 G C
4 2558311 ATGTGGG A
4 2558318 G A

Filling the blanks between the events above using the reference sequence gives exactly the substitution I am looking for however I'd like to genotype this as one substitution event. I've tried playing with the following options but I never got any results using them:
--fullHaplotype --genotypeFullActiveRegion --activeRegionIn 4:2558307-2558318 --activeRegionOut substitution.out

I am not sure if what I'm trying to do is feasible but would appreciate any advice.

Thanks a lot!

Post edited by Laurent on
Sign In or Register to comment.