It looks like you're new here. If you want to get involved, click one of these buttons!
Hi,
I trying to detect variants in a population of haploid genomes with UnifiedGenotyper, using the -sample_ploidy option to specify the number of haploid genomes in the pool. I would like to be able to specify other options, but so far the ones I have tried do not seem to work. I have tried: -out_mode EMIT_ALL_SITES --sample_ploidy 50 -minIndelFrac 0.05 --genotype_likelihoods_model BOTH -pnrm EXACT_GENERAL_PLOIDY. Could you please tell me which, if any, of the other commands are available when calling variants on non-diploid samples? Specifically, I would really like to be able to call indels too.
Thanks for your time and help.
Shelbi
Answers
Hi Shelbi,
Could you tell me what you mean when you say it's not working? Are you getting an error or just no variant calls?
Geraldine Van der Auwera, PhD
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •Hi Geraldine,
Sorry I was vague. I didn't get any variant calls and the program didn't run very long. The output summary at the end seemed truncated as well. I pasted one of the outputs below. I am running the program right now using only the -sample_ploidy option, and it has been running for nearly two hours.
Shelbi
Your job looked like:
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •Let me know what results you get from the new run. It looks like you've got it running properly now.
Geraldine Van der Auwera, PhD
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •Just as a general recommendation/warning: if your 50 haploid genomes are truly pooled and then sequenced then the -ploidy 50 is correct. But if your samples are individually barcoded (or sequenced separately) and hence are marked as separate samples in your BAM then that would be incorrect: you should set -ploidy 1 and you'll get in your output your individual genotypes for each sample.
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •Yes, my samples are truly pooled, thanks for inquiring. The genomes are from the intracellular symbiont population of the marine bivalve, Solemya velum. So, essentially the symbionts should be dealt with in a manner similar to organelles.
As for running the UnifiedGenotyper with the -ploidy option, I have found that it works with -minIndelFrac, --genotype_likelihoods_model, and -pnrm. I know for sure that -refsample causes a problem. I'm rerunning it, adding an option at a time to find out which options I can specify. The above output is one where the program didn't output any variants. When the program works, the output is much longer, and details the progress meter for every contig and position. The above example is missing several contigs and most of the positions. The output of a successful run is very long, but I can post one if it would help.
Shelbi
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •Ah yes,
-refsampleis an argument that's only really meant for internal use; it will no longer be available in the next version since it is not useful for general users and apparently causes confusion.If it's working for you now then we don't need anything further. Good luck with your work!
Geraldine Van der Auwera, PhD
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •