Recent advances in imaging technology are transforming how scientists see the cellular universe, showing the form and movement of once grainy and blurred structures in stunning detail. But extracting the torrent of information contained in those images often surpasses the limits of existing computational and data analysis techniques, leaving scientists less than satisfied.
Now, researchers at the Howard Hughes Medical Institute's Janelia Research Campus have developed a way around that problem. They have created a new computational method to rapidly track the three-dimensional movements of cells in such data-rich images. Using the technique, the Janelia scientists can essentially automate much of the time-consuming process of reconstructing an animal's developmental building plan cell by cell.
Philipp Keller, a group leader at Janelia, led the team that developed the computational framework. He and his colleagues, including Janelia postdoc Fernando Amat, Janelia group leader Kristin Branson and former Janelia lab head Eugene Myers, who is now at the Max Plank Institute of Molecular Cell Biology and Genetics, have used the method to reconstruct cell lineage during development of the early nervous system in a fruit fly. Their method can be used to trace cell lineages in multiple organisms and efficiently processes data from multiple types of fluorescent microscopes.
The scientists describe their approach in a paper published online on July 20, 2014, in Nature Methods. Their open-source software can be downloaded for free at http://www.
In 2012, Keller developed the simultaneous multi-view (SiMView) light sheet microscope, which captures three-dimensional images with unprecedented speed and precision over periods of hours or days. The microscope's images can reveal the divisions and intricate rearrangements of individual cells as biological structures emerge in a developing embryo. Since then, Keller has been perfecting the system so he can use it to follow the development of an organism's early nervous system.
"We want to reconstruct the elemental building plan of animals, tracking each cell from very early development until late stages, so that we know everything that has happened in terms of cell movement and cell division," Keller says. "In particular, we want to understand how the nervous system forms. Ultimately, we would like to collect the developmental history of every cell in the nervous system and link that information to the cell's final function. For this purpose, we need to be able to follow individual cells on a fairly large scale and over a long period of time."
It takes more than a week for the nervous system to become functional in an embryonic mouse. Even in the fruit fly, the process takes a day. Following development for that long means Keller's team must image tens of thousands of cells at thousands of time points, and that adds up to terabytes of data. "We can get good image data sets, but if we want to reconstruct them, this is something that we can't really do without help from the computer," Keller says.
Amat, a bioinformatics specialist on Keller's team, and his colleagues have solved that problem with the new computational method that identifies and tracks dividing cells as quickly as their high-speed microscope can capture images. The process is largely automated, but incorporates a manual editing step to improve accuracy for a small percentage of cells that are difficult to track computationally.
Keller's team has been grappling with how to interpret this kind of imaging data since 2010. The problem was challenging not only because of the sheer volume of data his light sheet microscope produced, but also because of the data's complexity. Cells in a developing embryo have different shapes and behaviors and can be densely packed, making it difficult for a computer to identify and track individual cells. Inevitable variations in image quality further complicate the analysis.
Amat led the effort to develop an efficient solution. His first priority was to reduce the complexity of the data. His strategy was to first cluster the voxels (essentially three-dimensional pixels) that make up each image into larger units called supervoxels. Using a supervoxel as the smallest unit reduces an image's complexity a thousand-fold, Keller says.
Next, the program searches for ellipsoid shapes among groups of connected supervoxels, which it recognizes as cell nuclei. Once a cluster of supervoxels is identified as a cell nucleus, the computer uses that information to find the nucleus again in subsequent images. High-speed microscopy captures the images quickly enough that a single cell can't migrate very far from frame to frame. "We take advantage of that situation and use the solution from one time point as the starting point for the next point," Keller says.
"With this fairly fast, simple approach, we can solve easy cases fairly efficiently," Keller says. Those cases make up about 95 percent of the data. "In harder cases, where we might have mistakes, we use heavier machinery."
He explains that in instances where cells are harder to track - because image quality is poor or cells are crowded, for example - the computer draws on additional information. "We look at what all the cells in that neighborhood do a little bit into the future and a little bit into the past," Keller explains. Informative patterns usually emerge from that contextual information. The strategy takes more computing power than the initial tactics. "We don't want to do it for all the cells," Keller says. "But we try to crack these hard cases by gathering more information and making better informed decisions."
All of these steps can be carried out as quickly as images are acquired by the microscope, and the result is lineage information for every cell. "You know the path, you know where it is at a certain time point. You know it divided at a certain point, you know the daughter cells, you know what mother cell it came from," Keller says.
Finally, a human steps in to check the computer's work and fix any mistakes. A computer-generated "confidence score" for every cell at every time point guides the user to the small percentage of data most likely to require a human eye, making high overall accuracy possible without manual examination of each cell.
To test the power of the program, Keller's team collected images of the beginnings of the nervous system as it developed in an embryonic fruit fly. They used their method to trace the lineages of 295 neuroblasts (precursors of nerve cells) and discovered that it is possible to predict the future fate and function of many cells based on their early dynamic behavior.
Keller is eager to begin using the method to investigate a variety of questions about early development, and hopes that others will apply the approach to their own questions. To that end, the team took care to ensure that the technique can be used with a variety of data types. In addition to fruit flies, they successfully used the program to analyze images of zebrafish and mice, as well as data collected from a commercial light sheet microscope and a commercial confocal microscope.