An array of files as an entity attribute in data model
I didn't see this in the documentation, but I might have missed it.
Is it possible to specify an array of files as a data model entity attribute?
I have a genetics data set with participants and samples. At the sample set level, our genotype files (VCF files) are split by chromosome, and I want to write a WDL that takes an array of VCF files (one file per chromosome) as input. I would like a row in my sample set table to have an attribute called "VCF_file_array" whose value is "[gs://PATH/chr1.vcf, gs://PATH/chr2.vcf]", etc.
I would then like the method configuration to assign the output of the WDL, which is also an array of files, to a new attribute. For example, if I am writing a VCF-to-GDS conversion workflow, the output would be written to a "GDS_file_array" attribute with the value "[gs://PATH/chr1.gds, gs://PATH/chr2.gds]".
Example WDLs and method configurations would be helpful.
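To make the question concrete, here is a rough, untested sketch of the kind of WDL I have in mind. The task and workflow names, the `convert_vcf_to_gds` command, and the docker image are all placeholders I made up for illustration. What I don't know is whether the data model will accept an array of files on either side.

```wdl
# Hypothetical sketch only: names, command, and docker image are placeholders.
task vcf_to_gds {
  File vcf
  # Derive the output name from the input, e.g. chr1.vcf -> chr1.gds
  String gds_name = basename(vcf, ".vcf") + ".gds"

  command {
    # Placeholder for the real conversion command
    convert_vcf_to_gds ${vcf} ${gds_name}
  }
  output {
    File gds = "${gds_name}"
  }
  runtime {
    docker: "PLACEHOLDER_IMAGE"
  }
}

workflow vcf_to_gds_by_chromosome {
  # One VCF per chromosome, ideally supplied from the sample set attribute
  Array[File] vcf_files

  # Convert each chromosome's VCF independently
  scatter (vcf in vcf_files) {
    call vcf_to_gds { input: vcf = vcf }
  }

  output {
    # Outputs gathered from the scatter form an array, one GDS per input VCF
    Array[File] gds_files = vcf_to_gds.gds
  }
}
```

In the method configuration, I would then hope to set the input `vcf_to_gds_by_chromosome.vcf_files` to `this.VCF_file_array` and the output `vcf_to_gds_by_chromosome.gds_files` to `this.GDS_file_array`, assuming array-valued attributes can be used that way.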