The current GATK version is 3.8-0
Examples: Monday, today, last week, Mar 26, 3/26/04

#### Howdy, Stranger!

It looks like you're new here. If you want to get involved, click one of these buttons!

You can opt in to receive email notifications, for example when your questions get answered or when there are new announcements, by following the instructions given here.

#### ☞ Got a problem?

1. Search using the upper-right search box, e.g. using the error message.
3. Include tool and Java versions.
4. Tell us whether you are following GATK Best Practices.
5. Include relevant details, e.g. platform, DNA- or RNA-Seq, WES (+capture kit) or WGS (PCR-free or PCR+), paired- or single-end, read length, expected average coverage, somatic data, etc.
6. For tool errors, include the error stacktrace as well as the exact command.
7. For format issues, include the result of running ValidateSamFile for BAMs or ValidateVariants for VCFs.
8. For weird results, include an illustrative example, e.g. attach IGV screenshots according to Article#5484.
9. For a seeming variant that is uncalled, include results of following Article#1235.

#### ☞ Formatting tip!

Wrap blocks of code, error messages and BAM/VCF snippets--especially content with hashes (#)--with lines with three backticks (  ) each to make a code block as demonstrated here.

GATK version 4.beta.3 (i.e. the third beta release) is out. See the GATK4 beta page for download and details.

# PhaseByTransmission with more than just trio

Member
edited December 2012

Is it possible to use PhaseByTransmission with families that are larger than a single trio? I have a family with four siblings. If I include all of the siblings in the PED I get:

PhaseByTransmission - Caution: Family BMD has 6 members; At the moment Phase By Transmission only supports trios and parent/child pairs. Family skipped.
ERROR MESSAGE: Bad input: No PED file passed or no trios found in PED file. Aborted.


And if I just include the one key trio with the proband, I get the following:

ERROR MESSAGE: Sample BMD006_R found in data sources but not in pedigree files with STRICT pedigree validation


There does not seem to be an accessible argument for relaxing the pedigree validation. Is there a way to use PhaseByTransmission with my larger family?

Post edited by Geraldine_VdAuwera on
Tagged:

Hi Mlinderm,

Currently PhaseByTransmission only supports trios so you won't be able to use the information about all 4 siblings jointly.

Including just the trio of interest is the correct way to go for the moment, however if you leave the other siblings in the VCF file, you should either:

• Add the children in the PED file but code them as unrelated individuals (they will simply be ignored)
• Specify the flag --pedigreeValidationType SILENT. This flag lets the GATK run even if not all individuals are found in both the PED and VCF file.

Cheers,
Laurent

Post edited by Geraldine_VdAuwera on

Hi Mlinderm,

Currently PhaseByTransmission only supports trios so you won't be able to use the information about all 4 siblings jointly.

Including just the trio of interest is the correct way to go for the moment, however if you leave the other siblings in the VCF file, you should either:

• Add the children in the PED file but code them as unrelated individuals (they will simply be ignored)
• Specify the flag --pedigreeValidationType SILENT`. This flag lets the GATK run even if not all individuals are found in both the PED and VCF file.

Cheers,
Laurent

Post edited by Geraldine_VdAuwera on
• Member

I would like to use PhaseByTransmission to identify Mendelian errors and phase the children in a family with two parents and four children. Would this be possible by creating 4 "trios", with 4 family IDs, but with the parents the same in each trio?

Hi trgall,

At the moment PhaseByTransmission only takes trios in. If you want to identify the mendelian errors in your case, you will unfortunately need to run it once per child. You cannot pass the same individuals with different family IDs as this would not respect the PED format. So Unfortunately for your purpose you'll need to run the tool 4 times. Note that if you use --pedigreeValidationType SILENT, you can leave all children in the VCF file and simply pass a different PED file for each child.

Cheers,
Laurent

• Member

Maybe this is more a feature request than a question, but it would be nice if this particular app could intelligently break a pedigree into trios and phase each trio and then package it back up into the original VCF for us. Much better than requiring the user to split the VCF, split the PED, and run it once for each trio, and then sew it back up him or herself. For example, if we give it two parents and three children, it does each child's phasing and puts it all back into the original multisample VCF format which GATK itself produces during genotyping.

• Member

I am getting the same error message - Sample lgsnd32563jz3 found in data sources but not in pedigree files with STRICT pedigree validation. However, the sample is in the pedigree file. I tried adding the flag --pedigreeValidationType SILENT to see what happened, and all the trios were excluded. Do you know what the problem could be?