Bait bias

Geraldine_VdAuweraGeraldine_VdAuwera Cambridge, MAMember, Administrator, Broadie admin
edited March 2016 in Dictionary

Bait bias (single bait bias or reference bias artifact) is a type of artifact that affects data generated through hybrid selection methods.

These artifacts occur during or after the target selection step, and correlate with substitution rates that are biased or higher for sites having one base on the reference/positive strand relative to sites having the complementary base on that strand. For example, a G>T artifact during the target selection step might result in a higher (G>T)/(C>A) substitution rate at sites with a G on the positive strand (and C on the negative), relative to sites with the flip (C positive)/(G negative). This is known as the "G-Ref" artifact.

Post edited by dekling on


  • joneskm4joneskm4 NCIMember

    I think we are seeing this type of artifact in some of our exome sequencing data, but I'm a little unclear on what the actual cause is. What causes the substitution during the target selection step?

    Issue · Github
    by Sheila

    Issue Number
    Last Updated
    Closed By
  • deklingdekling Broad InstituteMember admin

    Don't hold my feet to the fire on this, but I believe the errors are introduced to the bait from sample handling. Essentially, guanines on the bait sequence are sensitive to oxidation from extraction agents, heat, etc. This can cause some guanine nucleotides to become 8-oxoguanine (8-OxoG, OxoG) nucleotides. These modified guanines can basepair with T instead of C as would normally be expected. Thus, during PCR, this error is propagated. Since the G is sensitive to oxidation, you will likely see a higher frequency of G ->A then C->T. Is this helpful?

Sign In or Register to comment.