"This is really the next stage in human genome research," says Whitehead Member Richard Young, who headed the project together with Whitehead Fellow Ernest Fraenkel and MIT Computer Scientist David Gifford.
Key to understanding how the genome is controlled are gene regulators, also known as transcription factors. These small molecules intermittently land on a region of DNA, close to a particular gene, and then switch that gene on. They can also influence the amount of protein that the gene will produce. Many diseases, such as diabetes and cancer, are associated with mutated gene regulators, which is one reason why scientists are so interested in them.
The problem is that very few of these regulators have been identified in any organism. Locating their landing sites is essential to identifying their function, and therein lies the rub: Gene regulators are hard to find. They typically just land on a small stretch of DNA, do their job, and then take off again. And owing to the vastness of the genome, locating just one gene regulator with conventional lab tools can take many years. The Whitehead/MIT team, in the September 2 issue of the journal Nature, report developing a method for scanning an entire genome and quickly identifying the precise landing sites for these regulators.
This work builds upon research reported by Young in the journal Science in October 2002, in which he mapped the general locations of approximately half the gene regulators in yeast.
"The results of the Science paper were pretty low-resolution," says Young. "We were only able to identify in a general way regions where these gene regulators landed. In this paper, we've located all 203 regulators in yeast," and using tools developed in Fraenkel's lab have also been able to nail down the exact landing points. As a result, scientists now can begin to understand how genes and their regulators "talk" to each other. According to Fraenkel, knowing these communication patterns ultimately will have a profound influence on our understanding of everything from infectious disease to cloning.
To eavesdrop on these cellular conversations, graduate student Chris Harbison from Young's lab and postdoctoral researcher Ben Gordon from Fraenkel's lab combined the latest biological tools with new computational methods.
Harbison took yeast cells and subjected them all to a dozen nutritional, chemical, and temperature environments. "We tried to come up with different conditions that a yeast cell would encounter in its natural habitat," says Harbison.
Gene regulators come out of hiding and do their job in response to environmental conditions, but they don't all respond to the same kinds of predicaments. Running the cells through a wide spectrum of stimuli was a way of waking up all the regulators--in a sense, shaking the bushes and then nabbing them once they're out.
Next, Harbison placed gene fragments associated with these regulators onto a series of microarrays--small dime-sized silicon or glass chips that contain thousands of pieces of DNA--which allowed him to come up with a list of approximate locations. Gordon and Fraenkel created computer algorithms that fused Harbison's data with data from other yeast species in order to find the exact landing points.
"The microarray information is like a noisy telephone call," says Fraenkel. "We needed to find the words in all the static before we could understand what they meant."
The next challenge is to scale the platform so it can tackle human cells, something that the researchers are gearing up to do. In fact, a paper from Young's lab in the journal Science last February anticipated this possibility. The team reported locating in human cells all the genome-wide landing sites of a handful of gene regulators associated with type 2 diabetes, revealing some surprising mechanisms of the disease. "That paper is just the beginning of what we'll soon be doing in human cells," Young declares.
Even though the yeast genome's 203 regulators are a far cry from the roughly 2,000 in human cells, Young explains, "now we have the technology and the concepts to get started on decoding the human genome."