Today, a team led by the Wellcome Trust Sanger Institute, together with colleagues in the USA and Switzerland, provide a measure of just how important regulatory region variation might be in a pilot study based on some 2% of the human genome. As many as 40 of 374 genes showed alteration in genetic activity that could be related to changes in DNA sequence called SNPs.
"We were amazed at the power of this study to detect associations between SNP variations and gene activity," commented Dr Manolis Dermitzakis, Investigator, Division of Informatics at the Wellcome Trust Sanger Institute. "We were even more amazed at the number of genes affected: more than 10% of our sample - or perhaps 3000 genes across the genome - could be subject to modification of activity in human populations due to common genetic variations."
The study combined the map of genetic variation developed through the HapMap with estimates of gene activity obtained from cell cultures from 60 individuals who provided samples for the HapMap. More than 630 genes were studied, of which 374 were active in the cell cultures. If gene activity in a cell culture was skewed from the average, it was investigated further.
These genes were correlated with more than 750,000 SNPs - sequence differences between individuals in the sample collection. A series of statistical tests were carried out to provide increased confidence in the association between gene activity and sequence variation.
"Our sample size of 60 individuals is relatively small," continued Dr Dermitzakis, "and we might expect not to detect rare variations. However, our pilot project gives us greater confidence to take on a genome-wide survey of gene activity."
A global map of sequence variation and gene activity will be an important tool in the interpretation of variation and disease. Such genome-wide association studies will be able to identify some regions of the genome with strong disease effects.
"The HapMap is proving to be useful in a wide range of applications," commented Dr Panos Deloukas, Senior Investigator, Division of Medical Genetics, Wellcome Trust Sanger Institute. "The journey for our biomedical research is from DNA sequence to individual people and individual disease. The HapMap is a bridge from sequence data to the differences in individuals."
The project focused on three regions of the human genome. The first, called the ENCODE regions, and about 30 million base-pairs of DNA, are being intensively studied around the world as a group of 'typical' human genome regions. The second was 35million base-pairs of chromosome 21 sequence: three copies of chromosome 21 lead to Down Syndrome. The third was a region of chromosome 20 - 10 million base-pairs - that is known to be associated with diabetes and obesity.
In comparison with gene sequences that contain the instructions to make proteins, regulatory regions that control genes are relatively poorly understood. Their structure is variable and their distance from the genes they control also varies among genes.
New tools are needed in the search of our genome for the sequences that contribute to disease, tools that will harness the massive amounts of DNA information and transform them into information of real biomedical utility. The methods described here, with the power of the HapMap data and the cell cultures available, will speed that transformation.
Notes for Editors
Stranger et al. (2005) Genome-wide associations of gene expression variation in humans. PLoS Genetics 1: pp. nos to come. DOI: 10.1371/journal.pgen.0010078
Dec 16, 2005
Dr Manolis Dermitzakis, Wellcome Trust Sanger Institute
Dr Panos Deloukas, Wellcome Trust Sanger Institute
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, UK
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, USA
- Department of Genetic Medicine and Development, University of Geneva Medical School, Geneva, Switzerland
- Illumina, Inc., San Diego, CA, USA
- Department of Oncology, University of Cambridge, Hutchison/MRC Research Centre, Hills Road, Cambridge CB2 2XZ, UK
- Program in Molecular and Computational Biology, University of Southern California, Los Angeles, CA 90089-2910
Dermitzakis Lab: http://www.
Deloukas Lab: http://www.
HapMap Project: http://www.
ENCODE Project: http://www.
The Wellcome Trust Sanger Institute was founded in 1992 as the focus for the UK sequencing effort of the human and mouse genomes. The Institute is responsible for the completion of the sequence of approximately one-third of the human genome and one-fifth of the mouse, The Institute is also a major contributor to the mapping and sequencing of the zebrafish genome and genomes of more than 90 disease-causing organisms, including TB and malaria. The Wellcome Trust Sanger Institute is based in Hinxton, Cambridge, UK.
The Wellcome Trust is an independent research-funding charity, established under the will of Sir Henry Wellcome in 1936. It is funded from a private endowment which is managed with long-term stability and growth in mind. The Trust's mission is to foster and promote research with the aim of improving human and animal health.
Don Powell Press Officer
Wellcome Trust Sanger Institute
Hinxton, Cambs, CB10 1SA, UK
Tel +44 (0)1223 494 956
Mobile +44 (0)7753 7753 97