If you happen to see a question you know the answer to, please do chime in and help your fellow community members. We encourage our fourm members to be more involved, jump in and help out your fellow researchers with their questions. GATK forum is a community forum and helping each other with using GATK tools and research is the cornerstone of our success as a genomics research community.We appreciate your help!
Test-drive the GATK tools and Best Practices pipelines on Terra
Check out this blog post to learn how you can get started with GATK and try out the pipelines in preconfigured workspaces (with a user-friendly interface!) without having to install anything.
Release notes for GATK version 2.8
GATK 2.8 was released on December 6, 2013. Highlights are listed below. Read the detailed version history overview here: http://www.broadinstitute.org/gatk/guide/version-history
Note that this release is relatively smaller than previous ones. We are working hard on some new tools and frameworks that we are hoping to make available to everyone for our next release.
- Fixed bug where indels in very long reads were sometimes being ignored and not used by the caller.
- Improved the indexing scheme for gVCF outputs using the reference calculation model.
- The reference calculation model now works with reduced reads.
- Fixed bug where an error was being generated at certain homozygous reference sites because the whole assembly graph was getting pruned away.
- Fixed bug for homozygous reference records that aren't GVCF blocks and were being treated incorrectly.
- Disable tranche plots in INDEL mode.
- Various VQSR optimizations in both runtime and accuracy. Some particular details include: for very large whole genome datasets with over 2M variants overlapping the training data randomly downsample the training set that gets used to build; annotations are ordered by the difference in means between known and novel instead of by their standard deviation; removed the training set quality score threshold; now uses 2 gaussians by default for the negative model; numBad argument has been removed and the cutoffs are now chosen by the model itself by looking at the LOD scores.
- Fixed bug where mapping quality was being treated as a byte instead of an int, which caused high MQs to be treated as negative.
- Added calculation for GC content.
- Added an option to filter the bases based on their quality scores.
- Fixed bug where annotation values were parsed as Doubles when they should be parsed as Integers due to implicit conversion; submitted by Michael McCowan.
- Changed the behavior for PL/AD fields when it encounters a record that has lost one or more alternate alleles: instead of stripping them out these fields now get fixed.
- SplitSamFile now produces an index with the BAM.
- Length metric updates to QualifyMissingIntervals.
- Provide close methods to clean up resources used while creating AlignmentContexts from BAM file regions; submitted by Brad Chapman.
- Picard jar updated to version 1.104.1628.
- Tribble jar updated to version 1.104.1628.
- Variant jar updated to version 1.104.1628.