
DepthOfCoverage for whole genome: skipped few regions

Neelam (Blacksburg), Member, Posts: 6

I used the following command to get coverage data for the entire genome:
java -Xmx2g -jar ~/GenomeAnalysisTK-2.5-2-gf57256b/GenomeAnalysisTK.jar -T DepthOfCoverage -R ~/Gmax_189.fa -o DoCov_dedup -I dedup.bam.list

Coverage was not reported for 36025109 bases. Many of the skipped bases are adjacent to one another, so they form large contiguous regions of the genome. I do not understand why these regions were skipped. If no reads mapped there, I would expect the coverage to be reported as 0.

I would appreciate some insight into this issue. Thank you.

Answers

  • Neelam (Blacksburg), Member, Posts: 6

    Thank you. Those are all strings of ambiguous bases. What happens with a single ambiguous base in the middle of canonical bases?

  • Neelam (Blacksburg), Member, Posts: 6

    I found that DepthOfCoverage does report ambiguous bases in the coverage output file.
    Attached are snapshots of (1) the coverage output for a small region on chromosome 1:
    image

    and (2) the alignments in this region:
    image

    Position 21956 is the start of an ambiguous region that is more than 8 kb long in the reference genome.
    How can I remove these positions from the depth output file without also removing the regions that simply have no mapped reads?
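
One way to approach the last question is to post-process the output instead of changing the DepthOfCoverage run. The following is only a minimal sketch: it finds the runs of N in the reference FASTA, then drops per-locus rows that fall inside those runs while keeping canonical positions with zero depth. It assumes the per-locus file is tab-separated with a first column of the form contig:position; the function names (`n_runs`, `in_runs`, `filter_per_locus`) are made up for illustration and are not part of GATK.

```python
import re
from bisect import bisect_right


def n_runs(fasta_lines):
    """Map each contig name to a sorted list of (start, end) runs of N,
    using 1-based inclusive coordinates."""
    runs, name, chunks = {}, None, []

    def runs_in(seq):
        return [(m.start() + 1, m.end()) for m in re.finditer(r"[Nn]+", seq)]

    for line in fasta_lines:
        line = line.strip()
        if line.startswith(">"):
            if name is not None:
                runs[name] = runs_in("".join(chunks))
            # Contig name is the first word after ">".
            name, chunks = line[1:].split()[0], []
        elif line:
            chunks.append(line)
    if name is not None:
        runs[name] = runs_in("".join(chunks))
    return runs


def in_runs(intervals, pos):
    """True if pos lies inside one of the sorted (start, end) intervals."""
    i = bisect_right(intervals, (pos, float("inf"))) - 1
    return i >= 0 and intervals[i][0] <= pos <= intervals[i][1]


def filter_per_locus(rows, runs):
    """Yield rows whose locus is not an ambiguous reference base.
    Rows whose first column is not contig:position (such as the header)
    are passed through unchanged."""
    for row in rows:
        locus = row.split("\t", 1)[0]
        contig, _, pos = locus.rpartition(":")
        if pos.isdigit() and in_runs(runs.get(contig, []), int(pos)):
            continue
        yield row


# Tiny in-memory demonstration with made-up data.
demo_fasta = [">chr1 demo", "ACGTNNNN", "NNACGT"]
demo_rows = ["Locus\tTotal_Depth", "chr1:4\t7", "chr1:5\t0", "chr1:11\t0"]
for kept in filter_per_locus(demo_rows, n_runs(demo_fasta)):
    print(kept)
```

In the demonstration, position 5 lies inside the N run and is dropped, while position 11 is a canonical base with depth 0 and is kept. For real data, the same functions could be given open file handles for the reference and the per-locus output. An alternative that may be worth testing is to write the N runs out as an interval list and exclude them at run time with the engine's -XL (--excludeIntervals) argument.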
