Holiday Notice:
The Frontline Support team will be offline February 18 for President's Day but will be back February 19th. Thank you for your patience as we get to all of your questions!

Looking for picard per_target_coverage output field definitions

lracacholracacho OttawaMember

Hello, I ran Picard CollectHSmetrics with the optional per_target_coverage output and I am looking for the definitions of the following fields: "%GC", "mean_coverage", "normalized_coverage" (the definition for this field was already posted), "min_normalized coverage", "max_normalized coverage", "min_coverage", "max_coverage", "pct_0x" and "read_count".
Cheers

Tagged:

Best Answer

Answers

  • bhanuGandhambhanuGandham Member, Administrator, Broadie, Moderator admin

    HI @lracacho

    Here is a link to the document with the metrics definitions you are looking for: https://broadinstitute.github.io/picard/picard-metric-definitions.html

  • lracacholracacho OttawaMember

    Thank you for the quick reply!

    I have previously looked at the picard-metric-definitions but I could not find the field definitions specifically for the "per_target_coverage" output. In addition to regular HSmetrics, per_target_coverage and per_base_coverage are output options.

    I was able to locate a similar field to "pct_0x". Under the heading for tool TargetedPcrMetrics, ZERO_CVG_TARGETS_PCT is defined as "The fraction of targets that did not reach coverage=1 over any base." Is this the same definition used for the per_target_coverage field "pct_0x"?

    Cheers

  • lracacholracacho OttawaMember

    Thank you! I have one more question....What is the relationship between max_coverage and read_count? For example, for one target region I have a read count of 192 and a min_coverage of 12 and a max_coverage of 32. This field read_count can't be the average across the target region.

  • AdelaideRAdelaideR Unconfirmed, Member, Broadie, Moderator admin

    The coverage is relevant to the "active region" as defined by the HaplotypeCaller.

    MAX_TARGET_COVERAGE The maximum coverage of reads that mapped to target regions of an experiment. This can be affected by two other parameters, MINIMUM_MAPPING_QUALITY (default=20) and MINIMUM_BASE_QUALITY (default=20). So reads with a low mapping quality and bases with a low quality are not considered for the target coverage calculations. Is your mapping quality lower for some samples in this region?

    Take a look at this conversation to see if that answers the question.

Sign In or Register to comment.