We've moved!
This site is now read-only. You can find our new documentation site and support forum for posting questions here.
Be sure to read our welcome blog!

GATK4 BaseRecalibrator multithreading (not Spark)

I'm using GATK4 in a VM with 16 Intel Xeon E7-4860 processors (they can't support AVX), and 32 Gb RAM + 16 Gb swap (I can ask more).

Since GATK4 doesn't have the multithreading options (-nt and -nct) anymore, I oftenly cannot take advantage of all processors. Because of this, I have been trying the Spark version of the tools, but I don't really want to use them until you "aprove" them oficially.

For other tools I have been using a workaround dividing the genome in intervals and run the tools using the -L option on 16 parallel commands; and then merging the results. But BaseRecalibrator can't be applied like this.

Is there any option that I'm missing to be able to run BaseRecalibrator more efficiently?



Sign In or Register to comment.