To celebrate the release of GATK 4.0, we are giving away free credits for running the GATK4 Best Practices pipelines in FireCloud, our secure online analysis portal. It’s first come first serve, so sign up now to claim your free credits worth $250. Sponsored by Google Cloud. Learn more at

Alternate allele filtering?

crop_bioinfocrop_bioinfo knoxvilleMember
edited November 2016 in Ask the GATK team

I am trying to call Het as: Ref allele 10-90% or Alt allele 10-90. I have four samples. If one sample is Het, that location is preserved if criteria for Het (as earlier) and Homozygous (> 90% of Alt or Ref allele) at other three locations are met. Is there any simple way to achieve this?

I was just trying SelectVariants to filter with these criteria and tried to see if things works. I used <5% alternate allele for first sample and still can see locations with > 5% alt allele. Here are the code and output. please let me know if I am doing wrong? Thanks

$java -jar $gatk -T SelectVariants -R 8771mapped_Pcap_211contigs.fasta -V snps_after_hardFilter_20x.vcf -o snps_after_hardFilter_20x_alt10_allHet.vcf -select 'vc.getGenotype("7290_R1").getAD().1 / vc.getGenotype("7290_R1").getDP() < 0.05'

I got alt allele of 23/92 = 0.25

#CHROM POS ID REF ALT QUAL FILTER INFO FORMAT 7290_R1 7291_R1 8566_R1 8771_R1 2 2212 . G A 592.90 PASSS AC=2;AF=0.250;AN=8;BaseQRankSum=0.845;ClippingRankSum=0.000;DP=257;ExcessHet=3.6798;FS=0.000;ML EAC=2;MLEAF=0.250;MQ=60.00;MQRankSum=0.000;QD=4.94;ReadPosRankSum=-2.023;SOR=0.687 GT:AD:DP:GQ:PL 0/1:69,23:92:99:518,0,2196 0/1:21,7:28:99:109,0,533 0/0:43,0:43:99:0,129,1439 0/0:91,2:93:99:0,223,2719



Sign In or Register to comment.