If you happen to see a question you know the answer to, please do chime in and help your fellow community members. We appreciate your help!
Test-drive the GATK tools and Best Practices pipelines on Terra
Check out this blog post to learn how you can get started with GATK and try out the pipelines in preconfigured workspaces (with a user-friendly interface!) without having to install anything.
(howto) Visualize an alignment with IGV
Visualize sequence read alignment data (BAM or SAM) on IGV using this quick-start tutorial. The Integrative Genomics Viewer is a non-GATK tool developed at the Broad Institute that allows for interactive exploration of large genomic datasets.
- IGV downloaded to your desktop
- Coordinate-sorted and aligned BAM or SAM file
- Corresponding BAI index
- Matching reference genome to which the reads align. See IGV hosted genomes to check if IGV hosts a reference genome or this page for instructions on loading a
Download example data
- tutorial_6491.tar.gz contains a coordinated-sorted BAM and corresponding BAI. Most reads align to a 1 Mbp genomic interval on chromosome 10 (10:96,000,000–97,000,000) of the human GRCh37 reference assembly. Specifically, reads align to GATK bundle's
human_g1k_v37_decoy.fastathat corresponds to the
Human (1kg, b37+decoy)reference hosted by IGV.
- See Tutorial#2909 for instructions on coordinate-sorting and indexing alignment data.
- See the IGV website for downloads and the extensive user guide. For GATK users, we recommend the sections on Viewing Alignments and Viewing VCF files.
- This How to and its example data are referenced in a larger workflow on (How to) Efficiently map and clean up short read sequence data.
View aligned reads using IGV
To view aligned reads using the Integrative Genomics Viewer (IGV), the SAM or BAM file must be coordinate-sorted and indexed.
- Always load the reference genome first. Go to Genomes>Load Genome From Server or load from the drop-down menu in the upper left corner. Select
Human (1kg, b37+decoy).
- Load the data file. Go to File>Load from File and select
6491_snippet.bam. IGV automatically uses the corresponding
6491_snippet.baiindex in the same folder.
- Zoom in to see alignments. For our tutorial data, copy and paste
10:96,867,400-96,869,400into the textbox at the top and press Go. A 2 kbp region of chromosome 10 comes into view as shown in the screenshot above.
Alongside read data, IGV automatically generates a coverage track that sums the depth of reads for each genomic position.
Find a specific read and view as pairs
- Right-click on the alignment track and Select by name. Copy and paste
H0164ALXX140820:2:2107:7323:30703into the read name textbox and press OK. IGV will highlight two reads corresponding to this query name in bold red.
- Right-click on the alignment track and select View as pairs. The two highlighted reads will display in the same row connected by a line as shown in the screenshot.
Because IGV holds in memory a limited set of data overlapping with the genomic interval in view (this is what makes IGV fast), the select by name feature also applies only to the data that you call into view. For example, we know this read has a secondary alignment on contig hs37d5 (
If you jump to this new region, is the read also highlighted in red?
If you find IGV sluggish, download a Java Web Start
jnlp version of IGV that allows more memory. The highest memory setting as of this writing is 10 GB (RAM) for machines with 64-bit Java. For the tutorial example data, the typical 2 GB allocation is sufficient.
- To run the
jnlpversion of IGV, you may need to adjust your system's Java Control Panel settings, e.g. enable Java content in the browser. Also, when first opening the
jnlp, overcome Mac OS X's gatekeeper function by right-clicking the saved
jnlpand selecting Open with Java Web Start.
To change display settings, check out either the Alignment Preferences panel or the Alignment track Pop-up menu. For persistent changes to your IGV display settings, use the Preferences panel. For track-by-track changes, use the Pop-up menus.
Default Alignment Preferences settings are tuned to genomic sequence libraries. Go to View>Preferences and make sure the settings under the Alignments tab allows you to view reads of interest, e.g. duplicate reads.
- IGV saves any changes you make to these settings and applies them to future sessions.
- Some changes apply only to new sessions started after the change.
- To restore default preferences, delete or rename the
prefs.propertiesfile within your system's
igvfolder. IGV automatically generates a new
prefs.propertiesfile with default settings. See IGV's user guide for details.
After loading data, adjust viewing modes specific to track type by right-clicking on a track to pop up a menu of options. For alignment tracks, these options are described here.