We’re moving the GATK website, docs and forum to a new platform. Read the full story and breakdown of key changes on this blog.
If you happen to see a question you know the answer to, please do chime in and help your fellow community members. We encourage our fourm members to be more involved, jump in and help out your fellow researchers with their questions. GATK forum is a community forum and helping each other with using GATK tools and research is the cornerstone of our success as a genomics research community.We appreciate your help!
Test-drive the GATK tools and Best Practices pipelines on Terra
Check out this blog post to learn how you can get started with GATK and try out the pipelines in preconfigured workspaces (with a user-friendly interface!) without having to install anything.
We will be out of the office for a Broad Institute event from Dec 10th to Dec 11th 2019. We will be back to monitor the GATK forum on Dec 12th 2019. In the meantime we encourage you to help out other community members with their queries.
Thank you for your patience!
Converting ModelSegments outputs for ABSOLUTE

I work on exome data to detect somatic variants. I identified the SNPs/Indels with Mutect2 and I followed the ModelSegments tutorial to detect CNAs. Now I want to convert the AF outputs I got with ModelSegments into AllelicCapSeg outputs to use them with ABSOLUTE.
For this purpose, I tried to use part of this pipeline on GitHub (unsupported/combine_tracks_postprocessing_cnv/combine_tracks.wdl) to convert the data with the function "convert_model_segments_to_alleliccapseg". I use as input the.af.param and.cr.seg files from ModelSegments.
However, I don't really understand in the script the utility of the following parameters to use as threshold values (Float ? maf90_threshold and Int ? min_hets_acs_results)?
Is there a better way to do this conversion?
Thank you
Best Answer
-
LeeTL1220 Arlington, MA ✭✭✭
@Lea_F Hello, this WDL is unsupported, because we have little evaluation data and we are going to do a different approach in the long-term ...
but for now ....
Are you using the blacklisted intervals when running ModelSegments? If so, try setting the maf90 threshold to 0.485. If not, use the blacklists as input to ModelSegments. The somatic CNV WDL (that runs ModelSegments) has some reasonable, public files in the preamble.
If your data is WGS, set the min_hets_acs_results to 10, otherwise, 2 is reasonable.
These suggestions are not based on a lot of evaluations, but should help your ModelSegments results.
To the best of my knowledge, there is no better conversion. User feedback thusfar has indicated that this conversion process produces the best ABSOLUTE results, even if it is heavy on heuristics.
Answers
Hi @Lea_F ,
This is one of the few workflows in the GATK repo we do not support. We can help you with questions on workflows outside of the "unsupported" directory.
@Lea_F Hello, this WDL is unsupported, because we have little evaluation data and we are going to do a different approach in the long-term ...
but for now ....
Are you using the blacklisted intervals when running ModelSegments? If so, try setting the maf90 threshold to 0.485. If not, use the blacklists as input to ModelSegments. The somatic CNV WDL (that runs ModelSegments) has some reasonable, public files in the preamble.
If your data is WGS, set the min_hets_acs_results to 10, otherwise, 2 is reasonable.
These suggestions are not based on a lot of evaluations, but should help your ModelSegments results.
To the best of my knowledge, there is no better conversion. User feedback thusfar has indicated that this conversion process produces the best ABSOLUTE results, even if it is heavy on heuristics.