BaseRecalibrator on tiny gene panel with deep coverage?
I am currently working on data from a tiny gene panel (~29 kb; amplicon-based target enrichment) with deep coverage (mean coverage well above 1000x in most samples).
I have read the basics of BaseRecalibrator here, and according to the rule of thumb quoted below, I should not apply BaseRecalibrator to this dataset:
"This procedure will not work well on a small number of aligned reads. We usually expect to see more than 100M bases per read group; as a rule of thumb, larger numbers will work better."
However, in the next paragraph it is stated:
"No excuses: You should almost always perform recalibration on your sequencing data."
Hence, what is the best choice here? Is this one of those "almost always" exceptions to the rule of thumb?
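For context, here is a rough back-of-the-envelope comparison of this panel against the 100M-base rule of thumb. It is only a sketch: it assumes the number of bases per read group is approximately target size times mean coverage, with one read group per sample and no off-target reads. The function names are hypothetical and are not part of GATK.

```python
# Rough estimate only. Assumptions (not from the GATK docs):
#   - bases per read group ~= target size (bp) * mean coverage
#   - one read group per sample, off-target reads ignored

PANEL_SIZE_BP = 29_000             # ~29 kb panel, from the question
RULE_OF_THUMB_BASES = 100_000_000  # "more than 100M bases per read group"


def approx_bases(target_size_bp, mean_coverage):
    """Approximate number of aligned bases over the target."""
    return target_size_bp * mean_coverage


def coverage_needed(target_size_bp, min_bases=RULE_OF_THUMB_BASES):
    """Mean coverage at which the target alone would reach min_bases."""
    return min_bases / target_size_bp


if __name__ == "__main__":
    # 1000x is the lower bound mentioned in the question; the other
    # values are illustrative only.
    for cov in (1000, 2000, 5000):
        bases = approx_bases(PANEL_SIZE_BP, cov)
        status = "meets" if bases >= RULE_OF_THUMB_BASES else "is below"
        print(f"{cov}x -> {bases / 1e6:.0f}M bases, {status} the rule of thumb")
    print(f"Coverage needed: ~{coverage_needed(PANEL_SIZE_BP):.0f}x")
```

Under these assumptions, a sample at exactly 1000x contributes about 29M bases, which is below the quoted threshold, and the panel would need a mean coverage of roughly 3450x to reach 100M bases in a single read group.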