Conference Talk (CPPCON) 2014 Sep 8: The Gamgee library for genomics data processing and analysis

Geraldine_VdAuwera, Cambridge, MA (Member, Administrator, Broadie)
edited October 2014 in Archive

Mauricio Carneiro presented this talk at CPPCON (C++ conference) in Bellevue, WA on September 8, 2014. His slide deck and a link to the video are available at this link if you're viewing this post in the forum, or below if you are viewing the presentation page already.

Abstract

Our group has defined the standards for DNA and RNA sequencing data processing and analysis for disease research and clinical applications. Over the last 5 years we have published our tools in the GATK (Genome Analysis Toolkit), which is written entirely in Java. With the scaling of next-generation sequencing and the immense amount of data that needs to be processed, we hit a performance wall and found ourselves limited by the language in our ability to optimize and to rewrite the algorithms in a way that conforms better to modern hardware.

Enter Gamgee, a free and open source C++14 library that offers much of the functionality of the GATK framework with the performance necessary to scale to the hundreds of petabytes of today's complex disease projects. We will show how the tools developed using the Gamgee library replaced legacy Java GATK tools in the production pipeline of the Broad Institute. We will also talk about how the algorithms have changed to take advantage of native libraries and modern hardware features such as SSE/AVX and GPUs.
