We've moved!
This site is now read-only. You can find our new documentation site and support forum for posting questions here.
Be sure to read our welcome blog!

GATK License and Redistributing in an AWS AMI

I have been asked to reach out to you to clarify the GATK license. We are hoping to package our CNV/SNP calling and annotation pipeline that includes GATK and make it available on an Amazon Machine Image (AMI) for use on the Amazon EC2 cloud for anyone who wants to replicate our protocol using their own data. Of all the tools that we use, only GATK contains license restrictions. It is our intent that our AMI be used by academic/non-commercial users, and it is not intended to be sold. We can list the AMI in the AWS Marketplace as a Bring Your Own License (BYOL) instance. However, there are no checks provided by Amazon that users do in fact have a license. Is this permissible?



  • Geraldine_VdAuweraGeraldine_VdAuwera Cambridge, MAMember, Administrator, Broadie admin

    Hi there,

    I think this should be okay, but let me check with our business/legal specialist to make sure. I'll get back to you with an answer as soon as I can.

  • francois.sabotfrancois.sabot Montpellier, FranceMember

    Dear Geraldine
    We have almost the same type of question, ie our current pipelining system is using GATK, and for an easier install of the whole framework, we would like propose a running version of GATK for our tool in a embedded download.
    We were planning to add a text saying that you cannot install it if you do not respect the original GATK license (with a link to it).

    Will it be ok using such a system ?



  • Geraldine_VdAuweraGeraldine_VdAuwera Cambridge, MAMember, Administrator, Broadie admin


    Hi Francois, we're currently working on developing a model to enable this sort of activity, and it would help to have some additional details, plus feedback on our proposed model. Would you mind discussing this with me over email? I am vdauwera at broadinstitute.org

  • rrtrrt Member

    Dear Geraldine,

    Do you have any update on this question?

    Some what related, do you know any GATK based cloud platform which also includes the modules from Broad's Genepattern?


  • Geraldine_VdAuweraGeraldine_VdAuwera Cambridge, MAMember, Administrator, Broadie admin

    Hi @rrt,

    Since there are actually two rather different questions addressed in this thread I'll give an update on both. In both cases, if you'd like to follow up or discuss in more detail, please email me with the details of your use case and I'll see what we can do to enable you to achieve your goal.

    Cloud platforms / BYOL

    Providing GATK analysis as a service on the cloud is not currently allowed but the situation is evolving rapidly and I hope to be able to give a more positive answer in the near future.

    Downloadable docker or equivalent for on-premises execution

    At this time we don't have a blanket policy in place, but we are able to grant individual requests to bundle GATK into a research pipeline on a case by case basis.

  • Geraldine_VdAuweraGeraldine_VdAuwera Cambridge, MAMember, Administrator, Broadie admin

    Regarding your other question, I'm not aware of any existing options, aside from a pilot project that we are running called FireCloud that includes GATK and other Broad cancer genome analysis tools. It's not yet accessible to researchers outside Broad due to pending regulatory approval, but the plan is to open it up to the general community. I can't guarantee that it will have everything GenePattern currently includes, but there is a feature for users to add tools.

Sign In or Register to comment.