It looks like you're new here. If you want to get involved, click one of these buttons!
I ran the same sample through a pipeline using GATK twice and received different variants. I am trying to understand the reason behind this. My samples are from a MiSeq/capture kit run and downsampling could be one reason (given in one scenario that variant is called and in other it isn't) the variant is called at 32% when looked into the .bam files.
As I understand the UnifiedGenotyper downsamples my dataset randomly to 250, so I played around with -dcov parameter
But setting -dt to NONE could be computationally exhaustive for a big sample set. Is there an identifiable reason to why this is happening..?