Test-drive the GATK tools and Best Practices pipelines on Terra
Check out this blog post to learn how you can get started with GATK and try out the pipelines in preconfigured workspaces (with a user-friendly interface!) without having to install anything.
Seeing multiple calls for the same position
I am running a trio through the best practices pipeline for 3.7 and finding an unexpected result. I am seeing some positions that are being called multiple times with different calls. First, as output from HaplotypeCaller, I am seeing these results for position 78790153:
In each sample, the position 78790153 is represented by at least two lines, it's own line and at least one deletion prior. Of particular concern is that in the bottom case, the call at this position varies depending on which line you look at. It is heterozygous for a deletion at one point and homozygous reference at the others.
After Joint Genotyping, the variants come through in the result with that same position appearing twice still. (Order of samples top to bottom above is order of samples listed below first, second, third)
But, a new concern arises after applying family priors:
Here we see the first sample has her call from the deletion beginning at 78790143 changed from reference to heterozygous with no supporting evidence. This is obviously a very challenging spot because of the repeat, but is it deliberate for a position to be called multiple times and with different called alleles?