This site is now read-only. You can find our new documentation site and support forum for posting questions here.
Be sure to read our welcome blog!
Filtering based on BaseQRankSum
I was plotting the distribution of BaseQRankSum and noticed a large number of variants with a BaseQRankSum outside of Z-score of +/- 2, which suggests that a lot of variants have significant base quality differences between the REF and ALT. I plotted the distribution of ClippingRankSum, MQRankSum, and ReadPosRankSum and the majority of variants had Z-scores inside +/- 2.
Is this typical and what is this suggestive of? I followed the best practices for DNA sequencing using GATK3.
I found this post (http://gatkforums.broadinstitute.org/discussion/2035/z-scores-for-baseqranksum), which is similar to what I'm asking but has a different distribution of Z-scores.
Thank you in advance.