Which memory and temporary storage need to run GenomeSTRIP (CNVDiscoveryPipeline)
I would like to run CNVDiscoveryPipeline of GenomeSTRIP on 87To of CRAM files (about 3000 human being). I understand that the step 5 of the pipeline need to use together data from several BAM files to improve the reliability of the CNV call. So I have several questions :
1. Which size of memory do we need to dedicate in our cluster to run CNVDiscoveryPipeline on all my samples ?
2. Which size of storage do we need to dedicate in our cluster to run CNVDiscoveryPipeline on all my samples (final output and temporary files) ?
3. Can we run it from CRAM files or do we need to run on it from BAM files ?
Your answers are going to help me to discuss with IT team to run it.