Fishy output from ReduceReads

blakeoft, Connecticut, Member, Posts: 9

I've been following the best practices guide and I've gotten some odd-looking output from ReduceReads. Here's a sample:

C100 16 chrM 4934 60 1M * 0 0 T 7 BD:Z:E RG:Z:JC01_L1 BI:Z:L RR:B:c,1 RS:A:1

The odd part is the CIGAR string. Is "1M" a reasonable CIGAR string? Also, before I ran ReduceReads, Picard tools' ValidateSamFile finished with no errors on the input, whereas validating the ReduceReads output gives warnings like this:

WARNING: Record 1, Read name 1, NM tag (nucleotide differences) is missing

That warning occurs for records 1 through 100, and then ValidateSamFile does not report any more.

Here is the command line I used for ReduceReads:

java -Xmx2g -jar $GATK -T ReduceReads -R $genomes/hg19.fa -I $alignments/$lane.dedup.realn.recal.bam -o $alignments/$lane.dedup.realn.recal.reduced.bam

Note that pwd is surrounded by backticks; I just don't know how to stop them from interrupting the code formatting.

Any advice?
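
On the narrow question of whether "1M" is a reasonable CIGAR string: as far as the SAM format goes, it is well formed (a single aligned base), and it is consistent with the one-base SEQ and QUAL fields in the sample record. Below is a minimal Python sketch of that check. It assumes the record is split on whitespace as pasted above (a real SAM file is tab-delimited), and the helper names parse_cigar and cigar_matches_seq are made up for illustration; they are not part of GATK or Picard.

```python
import re

# A non-empty CIGAR string per the SAM format: one or more <length><op>
# pairs, with op drawn from MIDNSHP=X. ("*" means "unavailable" and is
# not handled by this sketch.)
CIGAR_RE = re.compile(r"(?:\d+[MIDNSHP=X])+")
OP_RE = re.compile(r"(\d+)([MIDNSHP=X])")

# Operations that consume bases of the read sequence (SEQ).
CONSUMES_QUERY = set("MIS=X")


def parse_cigar(cigar):
    """Return a list of (length, op) pairs, or raise ValueError if malformed."""
    if not CIGAR_RE.fullmatch(cigar):
        raise ValueError(f"malformed CIGAR: {cigar!r}")
    return [(int(n), op) for n, op in OP_RE.findall(cigar)]


def cigar_matches_seq(cigar, seq):
    """True if the query-consuming operations add up to len(seq)."""
    query_len = sum(n for n, op in parse_cigar(cigar) if op in CONSUMES_QUERY)
    return query_len == len(seq)


# The sample record from the post, split on whitespace.
record = ("C100 16 chrM 4934 60 1M * 0 0 T 7 "
          "BD:Z:E RG:Z:JC01_L1 BI:Z:L RR:B:c,1 RS:A:1")
fields = record.split()
cigar, seq, qual = fields[5], fields[9], fields[10]

print(parse_cigar(cigar))             # [(1, 'M')]
print(cigar_matches_seq(cigar, seq))  # True
```

This only checks syntax and length consistency. It says nothing about whether a one-base record is what ReduceReads is expected to emit.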


Best Answer


  • blakeoft, Connecticut, Member, Posts: 9

    Thank you for your answer, Geraldine. I misunderstood what ReduceReads actually does. I went back and watched the presentation and it all makes sense now.