The frontline support team will be unavailable to answer questions on April 15th and 17th 2019. We will be back soon after. Thank you for your patience and we apologize for any inconvenience!
How does Picard's MarkDuplicates handle unique molecular barcodes and PCR error?
I'm interested in using unique molecular barcodes to help distinguish what is PCR error in my samples. My current understanding of how MarkDuplicates chooses a "best-pair" is that it chooses this best pair based on a high sum of base quality scores. Sequences containing PCR errors can also have great base qualities. Has any additional logic been implemented to help distinguish reads containing PCR errors from the original template sequence when using unique molecular barcodes?