# SNP/INDEL calling in bisulfite data

Member

I have bisulfite-treated sequence data mapped with Bismark and Bowtie2, and I'd like to call SNPs and indels from it. I used Bis-SNP to call SNPs, but it doesn't call indels. Can I use GATK to call indels from the mapped data? Do you have any support for bisulfite data?
One more question, please: the data is a mix from 6 different people. Do you have any support for pooled data?

Hi there,

The UnifiedGenotyper can handle pooled data, yes. Have a look in the tech doc at the ploidy argument.
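As a rough illustration of what value the ploidy argument might take for a pool, here is a minimal sketch. It assumes every individual in the pool is diploid and contributed DNA in equal amounts; the function name `pool_ploidy` is hypothetical and not part of GATK. Check the tech doc for the exact argument name and any companion settings before relying on this.

```python
def pool_ploidy(n_individuals, ploidy_per_individual=2):
    """Total number of chromosome copies in a pooled sample.

    Assumption: all individuals share the same ploidy and are
    represented equally in the pool.
    """
    if n_individuals < 1 or ploidy_per_individual < 1:
        raise ValueError("counts must be positive")
    return n_individuals * ploidy_per_individual


# Six diploid people pooled into one sample
print(pool_ploidy(6))  # 12
```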

However, we have no experience with bisulfite data, so whether you can process it through GATK will depend on how the data is encoded. If the methylated bases are represented by something other than ACTG, then you will run into trouble. But if they are in separate tags, it should be okay.
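One way to check the encoding before trying GATK is to scan the read sequences for characters outside the usual alphabet. The sketch below works on plain-text SAM lines (for example, the output of converting a BAM to SAM) and needs no external library. The helper name `unexpected_bases` and the decision to tolerate `N` are assumptions made for illustration; the example records are made up.

```python
import io

ALLOWED_BASES = set("ACGTN")  # assumption: N is tolerated alongside A, C, G, T


def unexpected_bases(sam_lines):
    """Return the set of characters found in SAM SEQ fields that are
    not in ALLOWED_BASES."""
    found = set()
    for line in sam_lines:
        if line.startswith("@"):  # header line
            continue
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 11:  # not a complete alignment record
            continue
        seq = fields[9].upper()  # SEQ is the 10th mandatory column
        if seq == "*":  # sequence not stored
            continue
        found |= set(seq) - ALLOWED_BASES
    return found


# Made-up records: r1 keeps methylation calls in a separate tag,
# r2 has a non-ACGT character inside the sequence itself.
example = io.StringIO(
    "@HD\tVN:1.0\n"
    "r1\t0\tchr1\t100\t60\t8M\t*\t0\t0\tACGTTTGA\tIIIIIIII\tXM:Z:..z...h.\n"
    "r2\t0\tchr1\t120\t60\t8M\t*\t0\t0\tACGTMTGA\tIIIIIIII\n"
)
print(unexpected_bases(example))  # {'M'}
```

An empty set suggests the sequences themselves use only standard bases and any methylation information lives elsewhere, such as in optional tags.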

Good luck and please let us know how it goes!

• Member

Bisulfite data is a normal sequence, but with each unmethylated 'C' base converted to 'T', so the real sequence is not observed directly. I think I will have to make some modifications to the code. I was wondering where I can find an explanation of your indel detection model. I found some presentations on your site, but I could not find a good explanation.
Regards,
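To make the conversion concrete, here is a minimal in-silico sketch of what bisulfite treatment does to a forward-strand read, and why a caller that is unaware of it would struggle. The function names and the toy sequence are hypothetical, and only the forward strand is modeled.

```python
def bisulfite_convert(seq, methylated_positions=frozenset()):
    """Turn every unmethylated C into T; methylated Cs stay as C.

    Assumption: forward-strand read only, positions are 0-based.
    """
    return "".join(
        "T" if base == "C" and i not in methylated_positions else base
        for i, base in enumerate(seq)
    )


def ambiguous_ct_positions(reference, read):
    """Positions where the reference has C and the read has T.

    Each could be either bisulfite conversion or a real C>T SNP.
    """
    return [
        i for i, (ref_base, read_base) in enumerate(zip(reference, read))
        if ref_base == "C" and read_base == "T"
    ]


reference = "ACGTCCGA"
read = bisulfite_convert(reference, methylated_positions={1})
print(read)                                     # ACGTTTGA
print(ambiguous_ct_positions(reference, read))  # [4, 5]
```

From the read alone, the T bases at positions 4 and 5 cannot be told apart from genuine C>T variants, which is why a standard caller would likely need modification for this kind of data.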

Ah, I see. Yes, I think you'll need to make some modifications then.

What kind of details are you trying to find out about the indel discovery model? Mathematical, or more functional? If you want details of how the program operates and you're comfortable reading code, feel free to look at the code itself. It is freely accessible on our GitHub repository.

• Member

I have already downloaded the code but I am searching for the statistical model used in your system. It usually helps to look at the model before digging into the code. I have seen that your model is inspired by Dindel but I'd like to see if there are any differences.