We’re moving the GATK website, docs and forum to a new platform. Read the full story and breakdown of key changes on this blog.
If you happen to see a question you know the answer to, please do chime in and help your fellow community members. We encourage our fourm members to be more involved, jump in and help out your fellow researchers with their questions. GATK forum is a community forum and helping each other with using GATK tools and research is the cornerstone of our success as a genomics research community.We appreciate your help!
Test-drive the GATK tools and Best Practices pipelines on Terra
Check out this blog post to learn how you can get started with GATK and try out the pipelines in preconfigured workspaces (with a user-friendly interface!) without having to install anything.
Running HaplotypeCaller with -nct

Hi!
I am trying to see if I can speed up the HaplotypeCaller tool using the -nct flag. The GATK correctly identifies that my machine has 16 processors, and I specified that the HaplotypeCaller uses 16 threads, i.e. -nct 16. However processing the same file roughly takes the same amount of time (3600sec approx). I tried it also using -nct 8, and -nct 4. None of these options seems to help the process finish faster.
Were there any suggestions or ways I could achieve some appreciable gains? Thank you for any insight anyhow.
Best Answer
-
Geraldine_VdAuwera Cambridge, MA admin
Hi there,
Gains from multithreading can be highly variable depending on platform. If that's not working well for you, there are other things you can try. You can parallelize using scatter-gather, and you can do things like limit the number of alternate alleles the program will consider, or tweak the graph pruning argument. These have been covered on this forum and in the documentation; try doing a search for terms like pruning and scatter-gather.
Answers
Hi there,
Gains from multithreading can be highly variable depending on platform. If that's not working well for you, there are other things you can try. You can parallelize using scatter-gather, and you can do things like limit the number of alternate alleles the program will consider, or tweak the graph pruning argument. These have been covered on this forum and in the documentation; try doing a search for terms like pruning and scatter-gather.
Thank you Geraldine, I will investigate all the above mentioned options, especially regarding the scatter-gather option.