St. Jude Children's Research Hospital today launched St. Jude Cloud, an online data-sharing and collaboration platform that provides researchers access to the world's largest public repository of pediatric cancer genomics data. Developed as a partnership among St. Jude, DNAnexus and Microsoft, St. Jude Cloud provides accelerated data mining, analysis and visualization capabilities in a secure cloud-based environment.
"Sharing research and scientific discoveries is vital to advancing cures and saving lives, especially in rare diseases like pediatric cancer," said James R. Downing, M.D., St. Jude president and chief executive officer. "St. Jude has shared data and resources since its founding, and collaboration with researchers across the world is at the core of our mission. St. Jude Cloud offers researchers access to genomics data and analysis tools that will drive faster progress toward cures for catastrophic diseases of childhood."
St. Jude Cloud is a unique resource in the fight to advance cures for pediatric diseases, offering researchers:
- Largest Public Repository of Pediatric Cancer Genomics Data
The interactive data-sharing platform allows scientists to explore more than 5,000 whole-genome (WGS), 5,000 whole-exome (WES) and 1,200 RNA-Seq datasets from more than 5,000 pediatric cancer patients and survivors. By 2019, St. Jude expects to make 10,000 whole-genome sequences available on St. Jude Cloud.
These data have been generated from three large St. Jude-supported genomics initiatives: the St. Jude–Washington University Pediatric Cancer Genome Project, designed to understand the genetic origins of childhood cancers; the Genomes for Kids clinical trial, focused on moving whole-genome sequencing into the clinic; and the St. Jude Lifetime Cohort study (St. Jude LIFE), which conducts comprehensive clinical evaluations on thousands of pediatric cancer survivors throughout their lives.
Access to data is simple, fast and does not require downloading prior to exploration. Researchers may also upload their own data in a private, password-protected environment to explore using tools available on the St. Jude Cloud platform.
- Interactive Data Tools
In addition to high-quality next-generation sequencing data, St. Jude Cloud features a collection of bioinformatics tools to help both experts and non-specialists gain novel insights from genomics data. These tools include validated data analysis pipelines and interactive visualization tools that simplify discovery in large datasets. Data and results can be securely shared with collaborators within the platform.
The platform enables researchers to explore St. Jude data or their own results using innovative, interactive visualizations powered by ProteinPaint, the genomic visualization engine developed at St. Jude. The ProteinPaint visualizations allow users to rapidly navigate through the genome and identify genetic changes linked to cancer development. St. Jude Cloud tools also produce custom visualizations of the user's own research data for exploration or comparison with St. Jude-generated data.
- Verifying Research Discoveries Faster
A St. Jude scientist used St. Jude Cloud to replicate, in just a few days, experimental findings that originally took the research team more than two years to make. The original team discovered mutations connected to UV damage in a B-cell leukemia in work that was recently published in Nature. The intriguing finding led the team to ask whether other leukemia samples not included in the original study might have similar patterns of mutations. They turned to the high-quality datasets available in St. Jude Cloud, where the rapid computing capabilities of the platform enabled them to rediscover the same UV-linked mutational signature in pediatric B-cell leukemia patients. Identification of these additional samples will help researchers understand how UV damage could be linked to a blood cancer, and potentially point to new avenues for therapy. More details of this work will be presented at the AACR annual meeting in Chicago on Sunday, April 15 at 3 p.m.
- Collaboration to Advance Cures
The data available on St. Jude Cloud represent a key resource for understanding the genetic roots of childhood cancer. By opening access to these data, St. Jude's partnership with DNAnexus and Microsoft harnesses the collective power of the global research community to advance precision medicine for rare pediatric diseases like cancer.
"St. Jude Cloud is a powerful resource to drive global research and discovery forward," said Jinghui Zhang, Ph.D., chair of the St. Jude Department of Computational Biology and co-leader of the St. Jude Cloud project. "Providing genomic sequencing data to the global research community and making complex computational analysis pipelines easily accessible will lead to progress in eradicating childhood cancer. St. Jude has been committed to sequencing and understanding pediatric cancer genomes for nearly a decade, and we will continue to generate and share data with the research community in the future."
The data available through St. Jude Cloud are stored on Microsoft Azure, which can handle datasets on the massive scale required for large genomics studies such as those developed by St. Jude. Microsoft understands the complexities of large-scale genomics data and has processed half a petabyte of data for St. Jude Cloud to date.
"Health and technology partnerships are central to the advancement of scientific breakthroughs, allowing great minds and passionate hearts to work together with the common goal of ensuring one day, life-threatening diseases in children are no longer a reality," said Peter Lee, Ph.D., corporate vice president, AI + Research, Microsoft. "We are extremely proud to collaborate with our research partners at St. Jude and DNAnexus and address the challenges of technological limitations, such as storage and the speed of accessing vast amounts of pediatric cancer data, and look forward to the progress that St. Jude Cloud will bring."
DNAnexus, the biomedical informatics and data management company for St. Jude Cloud, leverages Azure to provide an open, flexible and secure cloud platform that supports the Microsoft Genomics service as well as other genomics analysis tools. Researchers around the world are able to access tools and diverse datasets in a secure and collaborative ecosystem.
"Collaboration fuels scientific advancements," said Richard Daly, chief executive officer at DNAnexus. "Whether you are working together across hallways or international borders, researchers need a secure space to foster collaboration and share data and tools. St. Jude has long been a leader in pediatric cancer and continues to develop new approaches to revolutionize the way medicine is practiced. We are proud to serve as the technology platform bringing together St. Jude researchers and their partners to advance cures for pediatric life-threatening diseases."