Test-drive the GATK tools and Best Practices pipelines on Terra


Check out this blog post to learn how you can get started with GATK and try out the pipelines in preconfigured workspaces (with a user-friendly interface!) without having to install anything.

BWA and fastqtosam failing with read names do not match errors

I'm using the GATK best practices to call public exome data. Out of over 600 exomes, most of the samples did fine with BWA mem alignment and fastqtosam. About 100 samples failed both steps (the same samples failed both). For fastqtosam an empty file is created. For BWA a very small file is created. The input for both these steps is paired end FASTQ files from a public repository. The error messages from both processes mention read names not matching, for instance this is from fastqtosam:

Exception in thread "main" picard.PicardException: In paired mode, read name 1 (SRR221230.sra.ncbi_enc.1) does not match read name 2 (SRR221230.sra.ncbi_enc.4)

Can anyone tell me what is causing this and if there is a way to recover the data for these samples?

Answers

  • bhanuGandhambhanuGandham Cambridge MAMember, Administrator, Broadie, Moderator admin

    Hi @nparmalee

    We are experiencing a high volume of questions at the moment, so we are focusing on answering GATK error/bug related questions at this time. This is about BWA question which do not support. This might be better suited for www.biostars.org or www.seqanswers.com.

Sign In or Register to comment.