Test-drive the GATK tools and Best Practices pipelines on Terra
Check out this blog post to learn how you can get started with GATK and try out the pipelines in preconfigured workspaces (with a user-friendly interface!) without having to install anything.
Recommendations for human populations very different from the human reference
I am working on genomes from human populations that are very different (more diverse) from the populations used to build the human reference genome. Thus we are worried about biasing our analyses by relying too much on the reference genome.
In particular, I have a question regarding the indels realigning step (RealignerTargetCreator and IndelRealigner): do you recommend to use the lists of known indels based on the human reference for populations very different from the reference? Or should I run this step without the -known option?