ComputeInsertSizeDistributions

Geraldine_VdAuwera, Cambridge, MA (Member, Administrator, Broadie)
edited September 2012 in GenomeSTRiP Documentation

1. Introduction

The ComputeInsertSizeDistributions walker traverses a set of BAM files to generate histograms of insert sizes.

The insert size histograms are stored in a binary file format, and many histograms can be stored in the same file. Each histogram is identified by a <Sample, Library, ReadGroup> triple, whose trailing components can be null. For example, if histograms are computed library-by-library (the default), then the ReadGroup in each triple will be null.
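To make the keying scheme concrete, here is a minimal in-memory sketch in Python. It is not the binary file format and not the walker's actual code: the names `HistogramKey` and `add_insert_size`, and the sample, library and read group values, are hypothetical and chosen only for illustration. It shows how several histograms can live side by side under <Sample, Library, ReadGroup> keys, and how the ReadGroup component ends up null (`None` here) when histograms are computed library-by-library.

```python
from collections import Counter
from typing import Dict, NamedTuple, Optional


class HistogramKey(NamedTuple):
    """Hypothetical key mirroring the <Sample, Library, ReadGroup> triple."""
    sample: str
    library: Optional[str] = None      # None stands in for a null component
    read_group: Optional[str] = None   # None when computed per library


def add_insert_size(
    histograms: Dict[HistogramKey, Counter],
    sample: str,
    library: str,
    read_group: str,
    insert_size: int,
    per_read_group: bool = False,
) -> HistogramKey:
    """Count one observed insert size in the histogram for its key.

    With per_read_group=False (modelling the library-by-library default),
    the read group is dropped from the key, so all read groups of a
    library share one histogram.
    """
    key = HistogramKey(sample, library, read_group if per_read_group else None)
    histograms.setdefault(key, Counter())[insert_size] += 1
    return key


# Illustrative values only: two read groups from the same library.
histograms: Dict[HistogramKey, Counter] = {}
add_insert_size(histograms, "sampleA", "lib1", "rg1", 350)
add_insert_size(histograms, "sampleA", "lib1", "rg2", 350)
add_insert_size(histograms, "sampleA", "lib1", "rg2", 410)
```

In this sketch both read groups collapse into the single key `("sampleA", "lib1", None)`. Passing `per_read_group=True` would instead produce one histogram per read group.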

See also MergeInsertSizeDistributions, ComputeInsertStatistics.

2. Inputs / Arguments

  • -I <bam-file> : The set of input BAM files.

  • -md <directory> : The metadata directory. Currently only used to check for a default list of excluded read groups.

  • -overwrite : If true (the default), overwrite the output file; otherwise, append to it.

  • -createEmpty : If true, create a zero-length output file if there are no paired reads in the input (default false).

3. Outputs

  • -O <histogram-file> : Location of the output binary histogram file.
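
As a rough illustration of how the arguments above fit together, the following Python sketch assembles an argument list from the documented flags. The helper `build_args` and all file names are hypothetical. Two details are assumptions not stated on this page: that `-I` is repeated once per BAM file, and that the boolean options take an explicit `true`/`false` value. The launcher command that would precede these arguments is deliberately left out, since it is not described here.

```python
from typing import List, Optional, Sequence


def build_args(
    bam_files: Sequence[str],
    output_file: str,
    metadata_dir: Optional[str] = None,
    overwrite: Optional[bool] = None,
    create_empty: Optional[bool] = None,
) -> List[str]:
    """Assemble ComputeInsertSizeDistributions arguments (hypothetical helper)."""
    if not bam_files:
        raise ValueError("at least one input BAM file is required")

    args: List[str] = []
    # ASSUMPTION: one -I per input BAM file.
    for bam in bam_files:
        args += ["-I", bam]
    if metadata_dir is not None:
        args += ["-md", metadata_dir]
    # ASSUMPTION: booleans are passed as explicit true/false values.
    # Leaving them as None omits the flag and relies on the documented defaults.
    if overwrite is not None:
        args += ["-overwrite", str(overwrite).lower()]
    if create_empty is not None:
        args += ["-createEmpty", str(create_empty).lower()]
    args += ["-O", output_file]
    return args


# Illustrative file names only.
example_args = build_args(
    ["a.bam", "b.bam"], "isd.hist.bin", metadata_dir="metadata", overwrite=False
)
```

Here `overwrite=False` models the append case: per the description above, the histograms would then be added to an existing output file rather than replacing it.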