The current GATK version is 3.7-0
Examples: Monday, today, last week, Mar 26, 3/26/04

Howdy, Stranger!

It looks like you're new here. If you want to get involved, click one of these buttons!

Did you remember to?


1. Search using the upper-right search box, e.g. using the error message.
2. Try the latest version of tools.
3. Include tool and Java versions.
4. Tell us whether you are following GATK Best Practices.
5. Include relevant details, e.g. platform, DNA- or RNA-Seq, WES (+capture kit) or WGS (PCR-free or PCR+), paired- or single-end, read length, expected average coverage, somatic data, etc.
6. For tool errors, include the error stacktrace as well as the exact command.
7. For format issues, include the result of running ValidateSamFile for BAMs or ValidateVariants for VCFs.
8. For weird results, include an illustrative example, e.g. attach IGV screenshots according to Article#5484.
9. For a seeming variant that is uncalled, include results of following Article#1235.

Did we ask for a bug report?


Then follow instructions in Article#1894.

Formatting tip!


Surround blocks of code, error messages and BAM/VCF snippets--especially content with hashes (#)--with lines with three backticks ( ``` ) each to make a code block.
Powered by Vanilla. Made with Bootstrap.
Picard 2.9.0 is now available. Download and read release notes here.
GATK 3.7 is here! Be sure to read the Version Highlights and optionally the full Release Notes.

PhaseByTransmission with more than just trio

mlindermmlinderm Member Posts: 29
edited December 2012 in Ask the GATK team

Is it possible to use PhaseByTransmission with families that are larger than a single trio? I have a family with four siblings. If I include all of the siblings in the PED I get:

PhaseByTransmission - Caution: Family BMD has 6 members; At the moment Phase By Transmission only supports trios and parent/child pairs. Family skipped.
ERROR MESSAGE: Bad input: No PED file passed or no trios found in PED file. Aborted.

And if I just include the one key trio with the proband, I get the following:

ERROR MESSAGE: Sample BMD006_R found in data sources but not in pedigree files with STRICT pedigree validation

There does not seem to be an accessible argument for relaxing the pedigree validation. Is there a way to use PhaseByTransmission with my larger family?

Post edited by Geraldine_VdAuwera on

Best Answer

  • LaurentLaurent Member, Broadie Posts: 43 ✭✭
    edited December 2012 Accepted Answer

    Hi Mlinderm,

    Currently PhaseByTransmission only supports trios so you won't be able to use the information about all 4 siblings jointly.

    Including just the trio of interest is the correct way to go for the moment, however if you leave the other siblings in the VCF file, you should either:

    • Add the children in the PED file but code them as unrelated individuals (they will simply be ignored)
    • Specify the flag --pedigreeValidationType SILENT. This flag lets the GATK run even if not all individuals are found in both the PED and VCF file.

    Cheers,
    Laurent

    Post edited by Geraldine_VdAuwera on

Answers

  • LaurentLaurent Member, Broadie Posts: 43 ✭✭
    edited December 2012 Accepted Answer

    Hi Mlinderm,

    Currently PhaseByTransmission only supports trios so you won't be able to use the information about all 4 siblings jointly.

    Including just the trio of interest is the correct way to go for the moment, however if you leave the other siblings in the VCF file, you should either:

    • Add the children in the PED file but code them as unrelated individuals (they will simply be ignored)
    • Specify the flag --pedigreeValidationType SILENT. This flag lets the GATK run even if not all individuals are found in both the PED and VCF file.

    Cheers,
    Laurent

    Post edited by Geraldine_VdAuwera on
  • trgalltrgall Member Posts: 13

    I would like to use PhaseByTransmission to identify Mendelian errors and phase the children in a family with two parents and four children. Would this be possible by creating 4 "trios", with 4 family IDs, but with the parents the same in each trio?

  • LaurentLaurent Member, Broadie Posts: 43 ✭✭

    Hi trgall,

    At the moment PhaseByTransmission only takes trios in. If you want to identify the mendelian errors in your case, you will unfortunately need to run it once per child. You cannot pass the same individuals with different family IDs as this would not respect the PED format. So Unfortunately for your purpose you'll need to run the tool 4 times. Note that if you use --pedigreeValidationType SILENT, you can leave all children in the VCF file and simply pass a different PED file for each child.

    Cheers,
    Laurent

  • mjcgeneticsmjcgenetics Member Posts: 1

    Maybe this is more a feature request than a question, but it would be nice if this particular app could intelligently break a pedigree into trios and phase each trio and then package it back up into the original VCF for us. Much better than requiring the user to split the VCF, split the PED, and run it once for each trio, and then sew it back up him or herself. For example, if we give it two parents and three children, it does each child's phasing and puts it all back into the original multisample VCF format which GATK itself produces during genotyping.

  • ajc8ajc8 Member Posts: 15

    I am getting the same error message - Sample lgsnd32563jz3 found in data sources but not in pedigree files with STRICT pedigree validation. However, the sample is in the pedigree file. I tried adding the flag --pedigreeValidationType SILENT to see what happened, and all the trios were excluded. Do you know what the problem could be?

Sign In or Register to comment.