Heads up:
We’re moving the GATK website, docs and forum to a new platform. Read the full story and breakdown of key changes on this blog.
If you happen to see a question you know the answer to, please do chime in and help your fellow community members. We encourage our fourm members to be more involved, jump in and help out your fellow researchers with their questions. GATK forum is a community forum and helping each other with using GATK tools and research is the cornerstone of our success as a genomics research community.We appreciate your help!

Test-drive the GATK tools and Best Practices pipelines on Terra

Check out this blog post to learn how you can get started with GATK and try out the pipelines in preconfigured workspaces (with a user-friendly interface!) without having to install anything.

A simple explanation of MuTect2 (GATK3) on how it works

alaabadrealaabadre FranceMember
edited April 7 in Ask the GATK team

Hello GATK team,

As you all know, there are many blogs/docs explaining how MuTect2 works but with lots of technical and statistical details. People who don't specialize in these domains can't easily understand how MuTect2 works. For this reason, I would like to have a discussion on how MuTect2 works with a simple example.

Let's say that we have the following information:

Reference genome sequence in a given region:


The normal sample in the same region having the following reads:


And the tumor sample in the same region:


How does MuTect2 handles such situation ?
Could we go over each step by explaining simply what does MuTect2 does ?
I gave this example by randomly typing the sequence with a single variant. If there are other better situations to take into account that can explain all the decisions that MuTect2 does when comparing reads, I would be happy to hear them.

Let's not forget that there are also the filtering options (dbSNP membership or 1k mills genome) or the hard filters to take into account:

I got another situation in mind. Let's say for example that the same variant is found to be similar in the normal vs tumor sample but different to the reference genome. What happens in this case ?

Thanks in advance.



Sign In or Register to comment.