Dynamically pass multiple inputs to Picard's MarkDuplicates (multiplexed data)
To pass multiple BAM files to MarkDuplicates, we use the following syntax:
java -jar picard.jar MarkDuplicates \
    INPUT=lane1.bam \
    INPUT=lane2.bam \
    OUTPUT=dedup.bam \
    METRICS_FILE=dedub_metrics.txt
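The pattern in the command above is one INPUT= argument per BAM file. Below is a minimal bash sketch of how that argument list is laid out. The helper name markdup_args is hypothetical (it is not part of Picard or GATK); it only prints one INPUT= token per file it is given, and the final line prints the command rather than running it. The file names are the ones from the example.

```shell
#!/usr/bin/env bash

# Hypothetical helper (not a Picard feature): print one INPUT=<file>
# token per argument, separated by spaces.
markdup_args() {
    local args=()
    local bam
    for bam in "$@"; do
        args+=("INPUT=${bam}")
    done
    echo "${args[*]}"
}

# Print (do not run) the full command for the two lanes from the example.
echo "java -jar picard.jar MarkDuplicates $(markdup_args lane1.bam lane2.bam) OUTPUT=dedup.bam METRICS_FILE=dedub_metrics.txt"
```

File names containing spaces would need the array passed to the command directly instead of being flattened into a string; the sketch leaves that out for brevity.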
However, this syntax requires us to know the number of inputs beforehand, which is not particularly practical. Is there a way to do this dynamically? For example, some GATK functions take files containing a list of input files (but