WALNUT CREEK, Calif.-- Scientists at the U.S. Department of Energy (DOE) Joint Genome Institute (JGI) and several partner institutions have published the sequence and analysis of the complete genome of sorghum, a major food and fodder plant with high potential as a bioenergy crop. The genome data will aid scientists in optimizing sorghum and other crops not only for food and fodder use, but also for biofuels production. The comparative analysis of the sorghum genome appears in the January 29 edition of the journal Nature.
Prized for its drought resistance and high productivity, sorghum is currently the second most prevalent biofuels crop in the United States, behind corn. Grain sorghum produces the same amount of ethanol per bushel as corn while utilizing one-third less water. As the technology for producing "cellulosic" (whole plant fiber-based) biofuels matures, sorghum's rapid growth--rising from eight to 15 feet tall in one season--is likely to make it desirable as a cellulosic biofuels "feedstock."
"This is an important step on the road to the development of cost-effective biofuels made from nonfood plant fiber," said Anna C. Palmisano, DOE Associate Director of Science for Biological and Environmental Research. "Sorghum is an excellent candidate for biofuels production, with its ability to withstand drought and prosper on more marginal land. The fully sequenced genome will be an indispensable tool for researchers seeking to develop plant variants that maximize these benefits."
Plant DNA is often notoriously difficult to analyze because of large sections of repetitive sequence and sorghum was no different. Jeremy Schmutz of the DOE JGI partner HudsonAlpha Institute for Biotechnology (formerly the Stanford Human Genome Center) and John Bowers of the University of Georgia pointed to these complex repetitive regions as accounting for the significant size difference between the rice and sorghum genomes, while also suggesting a common overall genome structure for grasses.
"Sorghum will serve as a template genome to which the code of the other important biofuel feedstock grass genomes--switchgrass, Miscanthus, and sugarcane--will be compared," said Andrew Paterson, the publication's first author and Director of the Plant Genome Mapping Laboratory, University of Georgia.
Scientists and industry officials say that completion of the sorghum genome will aid with sequencing of numerous other related plants, including other key potential bioenergy crops.
"I expect our improved understanding of the sorghum genome to have a major impact on the development of improved bioenergy crops for the emerging biofuels and renewable power industries," said Neal Gutterson, President and Chief Executive Officer of Mendel Biotechnology.
Sorghum's is only the second grass genome to be completely sequenced to date, after rice. With approximately 730 million nucleotides, sorghum's genome is nearly 75 percent larger than the size of rice.
Researchers used the whole genome "shotgun" method of sequencing first pioneered in the Human Genome Project. In this method, short random DNA fragments are partially sequenced and then analyzed by powerful supercomputers to reconstruct the original genome sequence. The repetitive sections and the length of the sorghum genome made assembling this "puzzle" a highly challenging computational problem.
By comparing sorghum's assembled code with rice's, the scientists were able to provide a "reality check" for rice's previously published estimate of protein coding genes.
"We found that over 10,000 proposed rice genes are actually just fragments," said DOE JGI's Dan Rokhsar, the publication's co-corresponding author. "We are confident now that rice's gene count is similar to sorghum's at 30,000, typical of grasses."
Other major contributions to the sorghum project were made by the research groups of Joachim Messing of Rutgers University, Therese Mitros of the University of California, Berkeley, and Klaus Mayer of the Helmholtz Center in Munich.
The sorghum sequencing project was initiated in 2005 under the auspices of DOE JGI's Community Sequencing Program (CSP), funded by the Office of Biological and Environmental Research within the DOE Office of Science. The CSP was launched in 2004 to provide the scientific community at large with access to high-throughput sequencing at the DOE JGI Production Genomics Facility for larger projects of relevance to DOE missions. Sequencing project selection is based on scientific merit--judged through independent peer review--and relevance to issues in alternative energy production, global carbon cycling, and biogeochemistry. Information about the CSP's latest call for proposals can be found at: http://www.
This and other DOE JGI projects will be highlighted at the Institute's upcoming Fourth Annual User Meeting, March 25-27 (http://www.
The U.S. Department of Energy Joint Genome Institute, supported by the DOE Office of Science, unites the expertise of five DOE national laboratories--Lawrence Berkeley, Lawrence Livermore, Los Alamos, Oak Ridge, and Pacific Northwest--along with the HudsonAlpha Institute for Biotechnology to advance genomics in support of the DOE missions related to clean energy generation and environmental characterization and cleanup. DOE JGI's Walnut Creek, Calif., Production Genomics Facility provides integrated high-throughput sequencing and computational analysis that enable systems-based scientific approaches to these challenges. Additional information about DOE JGI can be found at: http://www.