To celebrate the release of GATK 4.0, we are giving away free credits for running the GATK4 Best Practices pipelines in FireCloud, our secure online analysis portal. It’s first come first serve, so sign up now to claim your free credits worth $250. Sponsored by Google Cloud. Learn more at https://software.broadinstitute.org/firecloud/documentation/freecredits

How to assign memory and CPU cores for genotypeGVCF?

wmywmy MarylandMember

We followed the workflow in http://gatkforums.broadinstitute.org/gatk/discussion/3893/calling-variants-on-cohorts-of-samples-using-the-haplotypecaller-in-gvcf-mode to run GATK hyplotypecaller for a large whole exome sequencing project (~6200 samples). However, in step 3, using genotypeGVCF to merge 31 gvcf became very slow. So, we decided to run each chromosome in parallel. But even for an individual chromosome, our cluster always run out of memory.
To avoid the memory issue, I want to know how much memory and CPU cores we should assign for an individual genotypeGVCF job? How to set java -Xmx and -nt in a command line?
Thanks,

Tagged:

Answers

Sign In or Register to comment.