It looks like you're new here. If you want to get involved, click one of these buttons!
Dear GATK team,
I understood from the best practice documentation that if I have no known indels list for the genome I can skip the realignment step. Is this correct, or shall I call Indels and then do realignment and then re-call SNPs and Indels with the UnifiedGenotyper?
Thanks for the help!
Geraldine_VdAuwera
Posts: 2,238 admin
Hi there,
You can run those tools on a merged bam file, or by passing several separate bam files in your command line (see the documentation if you're not sure how to do this).
There are no hard and fast numbers for file sizes -- it depends enormously on your dataset and on your computing hardware. I can only recommend you test a few different configurations and see what works for you.
Answers
I understand now that I need to run the RealignerTargetCreator and then the IndelRealigner tool. I am working on yeast barcoded targeted genome. The RealignerTargetCreator and the IndelRealigner are not read group aware - right? Would you suggest to run them on each sample separately - or on a merged file?
One more question regarding the work on merged bam files - what is the maximal size of merged bam you recommend to work with?
Many thanks for all the help!
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •Thanks!
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •