This site is now read-only. You can find our new documentation site and support forum for posting questions here.
Be sure to read our welcome blog!
the importance of sorting markduplicate output files
I discovered that several outputfiles from "markduplicates" (picard) was sent on in the "best-practice workflow (GATK4.1) " without sorting through BaseRecalibrator, BQSR (gatk ApplyBQSR) and Haplotypecaller to make g.vcf files of each sample. No obvious problems or error msg. The plan is to combine these one in a common vcf-file for variant calling. Is there a need to rerun and get these samples (coordinate) sorted after markduplicates and then rerun BaseRecalibrator, ApplyBQSR and Haplotypecallerto avoid errors?