News Release

Predicting cell fates: Researchers develop AI solutions for next-gen biomedical research

Peer-Reviewed Publication

Helmholtz Munich (Helmholtz Zentrum München Deutsches Forschungszentrum für Gesundheit und Umwelt (GmbH))

CellRank

image: CellRank’s fate probabilities for lung regeneration; each cell is placed at a position that reflects its probability of reaching any terminal state. view more 

Credit: Helmholtz Munich / Marius Lange

Fabian Theis, Head of the Computational Health Center at Helmholtz Munich and Professor for Mathematical Modelling of Biological Systems at TUM: “It’s been a crazy 4 weeks, with many of our scientific stories and methods coming to fruition in that same time window. Our research groups focuses on using single cell genomics to understand the origin of disease in a mechanistic fashion – for this we leverage and develop machine learning approaches to better represent this complex data. In the three new paper, we worked on single cell data integration, trajectory learning and spatial resolution, respectively. Besides the applications shown in the papers, we expect to support the next generation of single-cell research towards disease understanding.”

Here are the latest solutions developed by Helmholtz Munich and TUM researchers:

Solving the data integration challenge
To see whether an observation one makes in a single dataset can be generalized, you can check whether the same can be observed in other datasets of the same system. In single-cell data, so-called batch effects complicate combining datasets in this manner. These are differences in the molecular profiles between samples as they were generated at a different time, in a different place, or from a different person. Overcoming these effects is a central challenge in single-cell genomics with more than 50 proposed solutions. But which one is the best?  A group of researchers around Malte Lücken carefully curated 86 datasets and compared 16 of the most popular data integration methods on 13 tasks. After over 55,000 hours of computation time and a detailed evaluation of 590 results, they built a guide for optimized data integration. This allows for improved observations on disease processes across datasets at a population scale.

Predicting cell states with open-source software
Many questions in biology revolve around continuous processes like development or regeneration. For any cell in such a process, single-cell RNA-sequencing measures gene expression. The method, however, is destructive to cells and scientists obtain only static snapshots. Thus, many algorithms have been developed to reconstruct continuous processes from snapshots of gene expression. A common limitation: These algorithms cannot tell us anything about the direction of the process. To overcome this limitation, Marius Lange and colleagues developed a new algorithm called CellRank. It estimates directed cell-state trajectories by combining previous reconstruction approaches with RNA velocity, a concept to estimate gene up- or down-regulation. Across in-vitro and in-vivo applications, CellRank correctly inferred fate outcomes and recovered previously known genes. In a lung regeneration example, CellRank predicted novel intermediate cell states on a dedifferentiation trajectory whose existence was validated experimentally. CellRank is an open-source software package that is already used by biologists and bioinformaticians around the world to analyze complex cellular dynamics in situations like cancer, reprogramming or regeneration.

Visualizing spatial omics analysis 
Recent years have seen a growing development of technologies to measure gene expression variation in tissue. The advantage of such technologies is that scientists can see cells in their context, thus being able to investigate principles of tissue organization and cellular communication. Researchers need flexible computational frameworks in order to store, integrate and visualize the growing diversity of such data. To tackle this challenge, Giovanni Palla, Hannah Spitzer, and colleagues developed a new computational framework, called Squidpy. It enables analysts and developers to handle spatial gene expression data. Squidpy integrates tools for gene expression and image analysis to efficiently manipulate and interactively visualize spatial omics data. Squidpy is extensible and can be interfaced with a variety of machine learning tools in the python ecosystem. Scientists around the world are already using it to analyze spatial molecular data.

Next to the three publications in Nature Methods, recent findings from the Computational Health team at Helmholtz Munich were also highlighted on the current cover of Nature Biotechnology: https://www.nature.com/nbt/volumes/40/issues/1 For more information please read the corresponding press release AI helps to spot single diseased cells.

About the people
Fabian Theis leads the Computational Health Center at Helmholtz Munich and coordinates the Helmholtz Artificial Intelligence Cooperation Unit (Helmholtz AI). He holds the chair for Mathematical Modelling of Biological Systems at the Technical University of Munich (TUM). Giovanni Palla and Marius Lange are PhD candidates at Helmholtz Munich and TUM. Malte Lücken and Hannah Spitzer are Postdocs at Helmholtz Munich.

Original publications
Lücken et al. 2021: Benchmarking atlas-level data integration in single-cell genomics. Nature Methods, DOI: 10.1038/s41592-021-01336-8.
Lange et al. 2022: CellRank for directed single-cell fate mapping. Nature Methods, DOI: 10.1038/s41592-021-01346-6.
Palla, Spitzer et al. 2022: Squidpy: a scalable framework for spatial omics analysis. Nature Methods, DOI: 10.1038/s41592-021-01358-2.

About Helmholtz Munich
Helmholtz Munich is a leading biomedical research center. Its mission is to develop breakthrough solutions for better health in a rapidly changing world. Interdisciplinary research teams focus on environmentally triggered diseases, especially the therapy and prevention of diabetes, obesity, allergies and chronic lung diseases. With the power of artificial intelligence and bioengineering, the researchers accelerate the translation to patients. Helmholtz Munich has more than 2,500 employees and is headquartered in Munich/Neuherberg. It is a member of the Helmholtz Association, with more than 43,000 employees and 18 research centers the largest scientific organization in Germany. More about Helmholtz Munich (Helmholtz Zentrum München Deutsches Forschungszentrum für Gesundheit und Umwelt GmbH): www.helmholtz-munich.de/en    


Disclaimer: AAAS and EurekAlert! are not responsible for the accuracy of news releases posted to EurekAlert! by contributing institutions or for the use of any information through the EurekAlert system.