Labels such as "European American", "white", or "Caucasian" are often viewed as representing a homogeneous category in gene mapping studies and census reports, but each of these labels actually groups together multiple populations, which have diverse origins due to the complex history of European immigration to the United States. In a recent study, published in the open-access journal PLoS Genetics, an international team of researchers provide the first genetic dissection of the population structure of European Americans, focusing on identifying the contributions from different genetic ancestries that are important for disease gene mapping.
This is a timely issue as the last year has seen a dramatic upswing in genetic association studies and the discovery of almost a hundred new risk factors for common genetic diseases such as cancer and diabetes. If the subtle population substructure that exists within European American populations is not understood and accounted for, genetic association studies can produce incorrect findings if disease cases are compared to healthy controls that on average have different ancestry.
By systematically examining data from four actual disease association studies in European Americans, this study describes and characterizes the majority of population substructure in European Americans that could lead to spurious associations. "Although our work is far from a complete description of European American population history, for the purpose of disease gene mapping studies it is adequate to measure how closely each person's genetic ancestry resembles three populations that can be roughly described as northwest European, southeast European, or Ashkenazi Jewish," says Dr. David Reich, one of the senior authors on the study, an Associate Professor of Genetics at Harvard Medical School and an Associate Member at the Broad Institute of Harvard and MIT. "With this approach, we can avoid most false-positive associations due to population substructure in European American disease gene mapping studies. Our previous work has addressed related challenges in studies of African Americans and Latino Americans."
Based on their discovery that ancestry from only three populations accounts for most of the potentially problematic substructure in European American disease association studies, the researchers scoured through published data sets to identify places in the genome where common DNA sequence variants differ substantially in frequency among these three ancestral populations and are therefore potentially informative for estimating genetic ancestry. The investigators then confirmed the utility of these genetic variants by testing them in DNA samples that their coauthors collected from the United Kingdom, Sweden, Poland, Spain, Italy, Greece and U.S. Ashkenazi Jews. "We identified 300 common genetic variants that have unusually different frequencies in the three ancestral populations: they are about 10 times more informative for predicting the ancestry of European Americans than random genetic variants", says lead author Dr. Alkes Price, a post-doctoral researcher at the Harvard Medical School Department of Genetics and the Broad Institute of Harvard and MIT. "We can thus correct for population substructure in European American disease association studies using just these 300 markers."
This panel of 300 markers should be valuable in targeted associated studies that follow up previously implicated candidate genes: by comparing the ancestry of disease cases to healthy controls using data from the panel of 300 markers, researchers can determine whether observed associations are genuine, and not false-positives due to population structure. The panel can also be used to match the ancestry of cases and controls prior to more comprehensive studies.
While the technology should provide a new tool in disease gene mapping studies, the researchers caution that the ability to roughly categorize individuals into populations with a small number of genetic markers is not useful in a clinical setting, nor does it completely eliminate the utility of self-described ethnicity. "Although these 300 markers give a reasonable estimate of the major components of genetic ancestry in European Americans, self-described ethnicity can still reflect environmental, social and cultural factors that may not be captured by estimating genetic ancestry," says Dr. Joel Hirschhorn, one of the senior authors of the study, an Associate Professor of Genetics at Children's Hospital Boston and Harvard Medical School, and a Senior Associate Member at the Broad Institute of Harvard and MIT, "Because the genetic differences between these populations are very small, the study is most important for helping in gene discovery efforts, which will lead to better understanding of human biology in health and disease, and hopefully improved care for all patients over the long term."
Published simultaneously in PLoS Genetics is an independent study led by Michael Seldin, in which Chao Tian and colleagues also present panels of markers that can be used to correct for population structure in European American disease association studies. A commentary jointly authored by Michael Seldin and Alkes Price on the practical application of the panels developed by the two groups accompanies these articles.
Dr. Alkes Price (email@example.com) (617-432-5994). Post-doctorial fellow, Harvard Medical School Department of Genetics and Broad Institute of Harvard and MIT
Dr. David Reich (firstname.lastname@example.org) (617-432-6548). Associate Professor, Harvard Medical School Department of Genetics and Broad Institute of Harvard and MIT
Dr. Joel Hirschhorn (email@example.com) (617-919-2129). Associate Professor, Divisions of Endocrinology and Genetics at Children's Hospital Boston, Harvard Medical School Department of Genetics and Broad Institute of Harvard and MIT
Title and full author list: "Discerning the Ancestry of European Americans in Genetic Association Studies"
Alkes L. Price*, Johannah Butler, Nick Patterson, Cristian Capelli, Vincenzo L. Pascali, Francesca Scarnicci, Andres Ruiz-Linares, Leif Groop, Angelica A. Saetta, Penelope Korkolopoulou, Uri Seligsohn, Alicja Waliszewska, Christine Schirmer, Kristin Ardlie, Alexis Ramos, James Nemesh, Lori Arbeitman, David B. Goldstein, David Reich*, Joel N. Hirschhorn*
* These three authors contributed equally
Related Research (the same embargo applies):
Tian C, Plenge RM, Ransom M, Lee A, Villoslada P, et al. (2008) Analysis and application of European genetic substructure using 300K SNP information. PLoS Genet 4(1): e4.
Press-only preview of related article: http://www.
URL after embargo: http://genetics.
Co-Authored Perspective (the same embargo applies):Seldin MF, Price AL (2008) Application of ancestry informative markers to association studies in European Americans. PLoS Genet 4(1): e5.
URL after embargo: http://genetics.
* About the Broad Institute of Harvard and MITThe Broad Institute of Harvard and MIT was founded in 2003 to bring the power of genomics to biomedicine. It pursues this mission by empowering creative scientists to construct new and robust tools for genomic medicine, to make them accessible to the global scientific community, and to apply them to the understanding and treatment of disease. The Institute is a research collaboration that involves faculty, professional staff and students from throughout the Harvard and MIT academic and medical communities. It is jointly governed by the two universities. Organized around Scientific Programs and Scientific Platforms, the unique structure of the Broad Institute enables scientists to collaborate on transformative projects across many scientific and medical disciplines. For further information about the Broad Institute visit http://www.
About Children's Hospital BostonChildren's Hospital Boston is home to the world's largest research enterprise based at a pediatric medical center, where its discoveries have benefited both children and adults since 1869. More than 500 scientists, including eight members of the National Academy of Sciences, 11 members of the Institute of Medicine and 12 members of the Howard Hughes Medical Institute comprise Children's research community. Founded as a 20-bed hospital for children, Children's Hospital Boston today is a 377-bed comprehensive center for pediatric and adolescent health care grounded in the values of excellence in patient care and sensitivity to the complex needs and diversity of children and families. Children's also is the primary pediatric teaching affiliate of Harvard Medical School. For more information about the hospital and its research visit: www.childrenshospital.org/newsroom.
About Harvard Medical SchoolHarvard Medical School has more than 7,000 full-time faculty working in eight academic departments based at the School's Boston quadrangle or in one of 47 academic departments at 18 Harvard teaching hospitals and research institutes. Those Harvard hospitals and research institutions include Beth Israel Deaconess Medical Center, Brigham and Women's Hospital, Cambridge Health Alliance, The CBR Institute for Biomedical Research, Children's Hospital Boston, Dana-Farber Cancer Institute, Forsyth Institute, Harvard Pilgrim Health Care, Joslin Diabetes Center, Judge Baker Children's Center, Massachusetts Eye and Ear Infirmary, Massachusetts General Hospital, Massachusetts Mental Health Center, McLean Hospital, Mount Auburn Hospital, Schepens Eye Research Institute, Spaulding Rehabilitation Hospital, VA Boston Healthcare System. For further information about Harvard Medical School visit http://hms.