Global Alliance for Genomics and Health Unveils New Genomics API Allowing for Seamless Sharing of Genetic Data

Posted: July 31, 2014 In: News Releases

Alliance’s Data Working Group Develops Version 0.5 of Genomics API to Enable Interoperability

Ontario, CA ⎜July 31, 2014 – The Global Alliance for Genomics and Health (GA4GH) today announced a new Application Programming Interface (API) developed by the Global Alliance’s Data Working Group that will allow DNA data providers and consumers to better share information and work together on a global scale.

This new open source Genomics API (http://ga4gh.org/#/api), referred to as Version 0.5, is a standard, open tool promoting data interoperability, and will be part of a suite of Genomics APIs being developed by the Global Alliance. The API enables the interoperable exchange of information contained in DNA sequence reads across multiple organizations and on multiple platforms.

The new Genomics API is one of the first products to be developed and distributed by the Global Alliance for Genomics and Health, which was formed only one year ago and is made up of over 200 of the world’s leading biomedical research institutions, healthcare providers, information technology and life science companies, funders of research, and disease and patient advocacy organizations.

“This new Genomics API is an exciting step toward interoperability in genomic data. It advances the Global Alliance’s mission of enabling the sharing of genomic and clinical data to improve human health,” said David Haussler, Co-Chair of the Global Alliance’s Data Working Group and Scientific Director of the UC Santa Cruz Genomics Institute. “Because this new API lets researchers work consistently with genomic data across institutions and platforms, it will help realize the benefits that come from large-scale genomic data sharing, allowing us to find the needle in the haystack for patients with rare diseases.”

Promoting the Global Alliance’s goals of transparency and collaboration, this new Genomics API Version 0.5 uses an open development process to allow the wider bioinformatics community to participate. While the Data Working Group has a core team of active developers, all interested developers from any institution can further engage with this platform by exploring sample apps, building implementations from scratch or from existing samples, or by providing feedback on the API and its documentation. The interface is managed in an open Global Alliance developer site at http://ga4gh.org.

The newly announced Genomics API Version 0.5 builds off of the successful Version 0.1, which was also developed by members of the Data Working Group and is in use by leading organizations, including the European Bioinformatics Institute (EMBL-EBI), the U.S. National Center for Biotechnology Information (NCBI), Google, Genome Savant, and Harvard Medical School’s Biomedical Cybernetics Laboratory, powering a growing community of applications. As analysis tools adopt the new API, researchers will be able to extend their own infrastructure to utilize cloud resources, such as those available from Amazon Web Services, Google Cloud Platform, and Microsoft Azure.

The GA4GH Genomics API is built on the file formats developed over the last five years for large-scale genomic sequencing projects, now also managed by the Global Alliance, but features cleaner models, with a modern, easy-to-use data description schema and a web-enabled interface.

“The ability to share data easily and securely is instrumental to the further development of large scale research projects to improve human health and combat major diseases like cancer,” said Dr. Tom Hudson, President and Scientific Director of the Ontario Institute for Cancer Research, a founding member of the Global Alliance. “The Genomics API is a major step toward ensuring that genomic data can be readily shared by qualified researchers around the world and will facilitate the development of new prevention and treatment strategies for patients.”

“Modern DNA sequencing, when coupled with modern data and cloud technology, can lead to breakthroughs in understanding and improving human health. This new Genomics API is a big step forward,” said David Glazer, co-chair of the Reads Task Team and Engineering Director for Google Cloud Platform and Google Genomics. “Google already supports Version 0.1 of the API, and we'll be adding support for Version 0.5 soon, as well as continuing to contribute to the Data Working Group.”

“The Global Alliance is breaking new ground in combining genomic sequencing and clinical care. Amazon Web Services is proud to support these efforts, and help in defining new operating models, such as the latest Genomics API,” said Matt Wood, General Manager of Data Science, Amazon Web Services, Inc. “We view these new APIs as a vital component for collaboration and development of next-generation tools that can run cost-effectively at massive scale.”

“Genome sequencing is transitioning from being a powerful research tool to making an enormous impact in clinical diagnostics and care.” Said Dr. Richard Durbin, Acting Head of Computational Genomics at the Wellcome Trust Sanger Institute and leader of the Genome Informatics group. “This API from the Global Alliance Data Working Group will enable genomic data processing to move beyond research file formats into modern computing and data architectures, facilitating controlled data sharing and the effective use of these new technologies for both clinical and research benefit.”

“We are using the Global Alliance’s work to enable apps for the TBResist initiative that bridge from raw sequence data to clinically useful phenotypes,” said Professor Gil Alterovitz, a faculty at the Harvard Medical School and director of the Biomedical Cybernetics Laboratory. “Also, the Substitutable Medical Applications and Reusable Technology (SMART) Genomics platform is using the Global Alliance interface to enable interoperability between electronic medical record information (HL7) and raw genetic sequence information.”

Other Working Groups of the Global Alliance for Genomics and Health are currently identifying best practices to integrate genomic data into clinical practice, reaching agreement on security protocols, and developing a framework to address ethics and regulatory considerations.

The Global Alliance for Genomics and Health is an international, non-profit alliance formed to help accelerate the potential of genomic medicine to advance human health. Bringing together over 200 leading institutions working in healthcare, research, disease and patient advocacy, life science, and information technology, partners in the Global Alliance are working together to create a common framework of standards and harmonized approaches to enable the responsible, voluntary, and secure sharing of genomic and clinical data. Learn more at: http://genomicsandhealth.org.