It looks like you're new here. If you want to get involved, click one of these buttons!
I am trying to merge two vcfs (SNVs and INDELs) from the same sample. The problem appears to be that the INDEL vcf defines "combined_sample_name" but the SNV vcf does not. So when I merge I get two sample columns. How can I force GATK to treat them as a single sample?
I tried --assumeIdenticalSamples to do a "simple merge," but that made no difference.
As a side note, these vcfs are from an Ion Torrent machine.
Thanks!
java -jar $GATK \
-R $TSP_FILEPATH_GENOME_FASTA \
-T CombineVariants \
--assumeIdenticalSamples \
-V:${baseFolderName} ${i}/SNP_variants.vcf \
-V:${baseFolderName} ${i}/indel_variants.vcf \
-o ${RESULTS_DIR}/${baseFolderName}_variants.vcf
Answers
Hi there,
I'm afraid what you're trying to do is not possible. You'll need to fix the VCF files to have the same sample name and remove the combined_sample_name definition. (use VCFtools)
Geraldine Van der Auwera, PhD
- Spam
- Abuse
- Troll
0 · Off Topic Disagree Agree Like WTF ·