In research published today by Nature, an international team describes the finest map of changes to the structure of human genomes and a resource they have developed for researchers worldwide to look at the role of these changes in human disease. They also identify 75 'jumping genes' - regions of our genome that can be found in more than one location in some individuals.
However, the team cautions that they have not found large numbers of candidates that might alter susceptibility to complex diseases such as diabetes or heart disease among the common structural variants. They suggest strategies for finding this 'dark matter' of genetic variation.
Human genomes differ because of single-letter variations in the genetic code and also because whole segments of the code might be deleted or multiplied in different human genomes. These larger, structural differences are called copy number variants (CNVs).
The new research to map and characterize CNVs is of a scale and a power unmatched to date, involving hundreds of human genomes, billions of data points and many thousands of CNVs.
"This study is more than ten times as powerful as our first map, published three years ago," explains Dr Matt Hurles from the Wellcome Trust Sanger Institute and a leader on the project, "and much more detailed than any other. Importantly, we have also assigned the CNVs to a specific genetic background so that they can be readily examined in disease studies carried out by others, such as the Wellcome Trust Case Control Consortium.
"Nevertheless, we have not found large numbers of common CNVs that we can tie strongly to disease. There remains much to be discovered and much to understand and our freely available genotyped collection will drive that discovery."
The results show that any two genomes differ by more than 1000 CNVs, or around 0.8% of a person's genome sequence. Most of these CNVs are deletions, with a minority being duplications.
Two consequences are particularly striking in this study of apparently healthy people. First, 75 regions have jumped around in the genomes of these samples: second, more than 250 genes can lose one of the two copies in our genome without obvious consequences and a further 56 genes can fuse together potentially to form new composite genes.
"This paper detailing common CNVs in different world populations, and providing the first glimpse into evolutionary biology of such class of human variation, is unquestionably one of the most important advances in human genome research since the completion of a reference human genome," says Professor James R. Lupski, Vice Chair of the department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas. "It complements the cataloguing of single nucleotide variation delineated in the HapMap Project and will both enable some new approaches to, and further augment other studies of, basic human biology relevant to health and disease."
"The genetic 'blueprint' of humans is the human genome," says Sir Mark Walport, Director of the Wellcome Trust. "But we are each unique as individuals, shaped by variation in both genome and environment. Understanding the variation amongst human genomes is key to understanding the inherited differences between each of us in health and disease. A whole new dimension has been added to our understanding of variation in the human genome by the identification of copy number variants."
The results also give, for the first time, a minimum measure of the rate of CNV mutation: at least one in 17 children will have a new CNV. In many cases, that CNV will have no obvious clinical consequences. However, for some the effects are severe. In those cases the data are captured in the DECIPHER database, a repository of clinical information about CNVs designed to aid the diagnosis of rare disorders in young children.
But CNVs are not only about here and now; they are also ancient legacies of how our ancestors adapted to their environments. Among the most impressive variations between populations are CNVs that modify the activity of the immune system, known to be evolving rapidly in human populations, and genes implicated in muscle function. The researchers propose that the consequences of these CNVs can be dissected in population studies.
The team scanned 42 million locations on the genomes of 40 people, half of European ancestry and half of West-African ancestry. The scale of the method meant they could detect CNVs as small as 450 bases occurring in one in 20 individuals.
However, the researchers concede that their map of common variants will not account for much of the 'dark matter' of the genome - the missing heritability where, despite diligent searches, genetic variants have not been found for common disease.
"CNV studies have made huge advances in the past few years, but we are still looking only at the most common CNVs," explains Dr Steve Scherer of the Hospital for Sick Children, Toronto. "We suspect that there are many CNVs that have real clinical consequences that occur in perhaps one in 50 or one in 100 people - below the level we have detected.
"Success in the hunt for the missing genetic causes of common disease has become possible in the last few years and we expect to find more as higher resolution searches become possible."
The research group have maximized the value of their research by not only mapping the CNVs, but by also genotyping them - assigning them to a specific genetic background that makes them readily useful in wider genetic studies, such as the Wellcome Trust Case Control Consortium.
"We were determined to develop not only the map, but also to provide the resources that help other researchers and clinical cytogeneticists most rapidly use our CNV results," comments Dr Charles Lee, one of the project leaders from Brigham and Women's Hospital and Harvard Medical School in Boston, USA. "Already, the data that we have generated is benefiting other large-scale studies such as the 1000 Genomes Projects as well as making an enormous difference in the accurate interpretation of clinical genetic diagnoses.
"Nonetheless, the human CNV story is far from over."
Notes to Editors
Conrad DF, Pinto D et al. (2009) Copy number variation in the human genome: mechanism, selection and disease. Nature
This research was supported by the Wellcome Trust, Canada Foundation of Innovation and Ontario Innovation Trust, Canadian Institutes of Health Research, Genome Canada/Ontario Genomics Institute, the McLaughlin Centre for Molecular Medicine, Ontario Ministry of Research and Innovation, the Hospital for Sick Children Foundation, the Department of Pathology at Brigham and Women's Hospital and the National Institutes of Health (NIH)
- Wellcome Trust Sanger Institute, Hinxton, Cambridge, UK
- The Centre for Applied Genomics and Program in Genetics and Genomic Biology, The Hospital for Sick Children, Toronto, Ontario, Canada
- Department of Pathology, Brigham and Women's Hospital and Harvard Medical School, Boston, USA
- Inserm UMR915, L'institut du thorax, Nantes, France
- Uppsala: Department of Genetics and Pathology, Rudbeck Laboratory Uppsala University, Uppsala, Sweden
- Department of Molecular Genetics, University of Toronto, Canada
The Wellcome Trust Sanger Institute, which receives the majority of its funding from the Wellcome Trust, was founded in 1992. The Institute is responsible for the completion of the sequence of approximately one-third of the human genome as well as genomes of model organisms and more than 90 pathogen genomes. In October 2006, new funding was awarded by the Wellcome Trust to exploit the wealth of genome data now available to answer important questions about health and disease.
The Wellcome Trust is the largest charity in the UK. It funds innovative biomedical research, in the UK and internationally, spending over £600 million each year to support the brightest scientists with the best ideas. The Wellcome Trust supports public debate about biomedical research and its impact on health and wellbeing.
The Hospital for Sick Children (SickKids), affiliated with the University of Toronto, is Canada's most research-intensive hospital and the largest centre dedicated to improving children's health in the country. As innovators in child health, SickKids improves the health of children by integrating care, research and teaching. Our mission is to provide the best in complex and specialized care by creating scientific and clinical advancements, sharing our knowledge and expertise and championing the development of an accessible, comprehensive and sustainable child health system. SickKids is committed to healthier children for a better world. www.sickkids.ca
Brigham and Women's Hospital is a 747-bed nonprofit teaching affiliate of Harvard Medical School and a founding member of Partners HealthCare System, an integrated health care delivery network. BWH is committed to excellence in patient care with expertise in virtually every specialty of medicine and surgery. The BWH medical preeminence dates back to 1832 and today that rich history in clinical care is coupled with its national leadership in quality improvement and patient safety initiatives, dedication to educating and training health care professionals, and strength in biomedical research. With $370M in funding and more than 500 research scientists, BWH is an acclaimed leader in clinical, basic and epidemiological investigation - including the landmark Nurses Health Study, Physicians Health Studies, and the Women's Health Initiative. www.brighamandwomens.org