601
|
Seldin MF, Shigeta R, Villoslada P, Selmi C, Tuomilehto J, Silva G, Belmont JW, Klareskog L, Gregersen PK. European population substructure: clustering of northern and southern populations. PLoS Genet 2006; 2:e143. [PMID: 17044734 PMCID: PMC1564423 DOI: 10.1371/journal.pgen.0020143] [Citation(s) in RCA: 178] [Impact Index Per Article: 9.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2006] [Accepted: 07/25/2006] [Indexed: 01/02/2023] Open
Abstract
Using a genome-wide single nucleotide polymorphism (SNP) panel, we observed population structure in a diverse group of Europeans and European Americans. Under a variety of conditions and tests, there is a consistent and reproducible distinction between “northern” and “southern” European population groups: most individual participants with southern European ancestry (Italian, Spanish, Portuguese, and Greek) have >85% membership in the “southern” population; and most northern, western, eastern, and central Europeans have >90% in the “northern” population group. Ashkenazi Jewish as well as Sephardic Jewish origin also showed >85% membership in the “southern” population, consistent with a later Mediterranean origin of these ethnic groups. Based on this work, we have developed a core set of informative SNP markers that can control for this partition in European population structure in a variety of clinical and genetic studies. Two unrelated persons in the human population have hundreds of thousands of base pair differences between them in DNA sequence. Previous studies have shown that a small proportion of these sequence differences correlate with a person's continental ancestry: broadly, Asia, Africa Oceana, America, or continental Europe. In the current study, DNA differences within a particular continental group, Europe, were examined. Overall, the analysis of sequence variation allowed the authors to distinguish individuals with northern European ancestry (Swedish, English, Irish, German, and Ukrainian) from individuals with southern European ancestry (Italian, Spanish, Portuguese, and Greek). Interestingly, Ashkenazi Jewish individuals tend to group together with individuals from southern European countries. This study is important because it provides a method of taking into account these differences when searching for genetic variations that are associated with particular human traits, such as disease susceptibility, response to drug treatment, or side effects from therapy. Specifically, these methods may allow scientists to uncover disease-associated genetic variations that might be hidden unless differences related to European ancestry are considered.
Collapse
Affiliation(s)
- Michael F Seldin
- Rowe Program in Human Genetics, Departments of Biological Chemistry and Medicine, University of California Davis, Davis, California, United States of America.
| | | | | | | | | | | | | | | | | |
Collapse
|
602
|
Price AL, Patterson NJ, Plenge RM, Weinblatt ME, Shadick NA, Reich D. Principal components analysis corrects for stratification in genome-wide association studies. Nat Genet 2006; 38:904-9. [PMID: 16862161 DOI: 10.1038/ng1847] [Citation(s) in RCA: 7199] [Impact Index Per Article: 378.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2006] [Accepted: 06/21/2006] [Indexed: 12/30/2022]
Abstract
Population stratification--allele frequency differences between cases and controls due to systematic ancestry differences-can cause spurious associations in disease studies. We describe a method that enables explicit detection and correction of population stratification on a genome-wide scale. Our method uses principal components analysis to explicitly model ancestry differences between cases and controls. The resulting correction is specific to a candidate marker's variation in frequency across ancestral populations, minimizing spurious associations while maximizing power to detect true associations. Our simple, efficient approach can easily be applied to disease studies with hundreds of thousands of markers.
Collapse
Affiliation(s)
- Alkes L Price
- Department of Genetics, Harvard Medical School, Boston, Massachusetts 02115, USA.
| | | | | | | | | | | |
Collapse
|
603
|
|
604
|
Fredman D, Sawyer SL, Strömqvist L, Mottagui-Tabar S, Kidd KK, Wahlestedt C, Chanock SJ, Brookes AJ. Nonsynonymous SNPs: validation characteristics, derived allele frequency patterns, and suggestive evidence for natural selection. Hum Mutat 2006; 27:173-86. [PMID: 16429399 DOI: 10.1002/humu.20289] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
We experimentally investigated more than 1,200 entries in dbSNP that would change amino-acids (nsSNPs), using various subsets of DNA samples drawn from 18 global populations (approximately 1,000 subjects in total). First, we mined the data for any SNP features that correlated with a high validation rate. Useful predictors of valid SNPs included multiple submissions to dbSNP, having a dbSNP validation statement, and being present in a low number of ESTs. Together, these features improved validation rates by almost 10-fold. Higher-abundance SNPs (e.g., T/C variants) also validated more frequently. Second, we considered derived alleles and noted a considerably (approximately 10%) increased average derived allele frequency (DAF) in Europeans vs. Africans, plus a further increase in some other populations. This was not primarily due to an SNP ascertainment bias, nor to the effects of natural selection. Instead, it can be explained as a drift-based, progressive increase in DAF that occurs over many generations and becomes exaggerated during population bottlenecks. This observation could be used as the basis for novel DAF-based tests for comparing demographic histories. Finally, we considered individual marker patterns and identified 37 SNPs with allele frequency variance or FST values consistent with the effects of population-specific natural selection. Four particularly striking clusters of these markers were apparent, and three of these coincide with genes/regions from among only several dozen such domains previously suggested by others to carry signatures of selection.
Collapse
Affiliation(s)
- David Fredman
- Center for Genomics and Bioinformatics, Karolinska Institute, Stockholm, Sweden
| | | | | | | | | | | | | | | |
Collapse
|
605
|
Zhang C, Bailey DK, Awad T, Liu G, Xing G, Cao M, Valmeekam V, Retief J, Matsuzaki H, Taub M, Seielstad M, Kennedy GC. A whole genome long-range haplotype (WGLRH) test for detecting imprints of positive selection in human populations. Bioinformatics 2006; 22:2122-8. [PMID: 16845142 DOI: 10.1093/bioinformatics/btl365] [Citation(s) in RCA: 50] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
MOTIVATION The identification of signatures of positive selection can provide important insights into recent evolutionary history in human populations. Current methods mostly rely on allele frequency determination or focus on one or a small number of candidate chromosomal regions per study. With the availability of large-scale genotype data, efficient approaches for an unbiased whole genome scan are becoming necessary. METHODS We have developed a new method, the whole genome long-range haplotype test (WGLRH), which uses genome-wide distributions to test for recent positive selection. Adapted from the long-range haplotype (LRH) test, the WGLRH test uses patterns of linkage disequilibrium (LD) to identify regions with extremely low historic recombination. Common haplotypes with significantly longer than expected ranges of LD given their frequencies are identified as putative signatures of recent positive selection. In addition, we have also determined the ancestral alleles of SNPs by genotyping chimpanzee and gorilla DNA, and have identified SNPs where the non-ancestral alleles have risen to extremely high frequencies in human populations, termed 'flipped SNPs'. Combining the haplotype test and the flipped SNPs determination, the WGLRH test serves as an unbiased genome-wide screen for regions under putative selection, and is potentially applicable to the study of other human populations. RESULTS Using WGLRH and high-density oligonucleotide arrays interrogating 116 204 SNPs, we rapidly identified putative regions of positive selection in three populations (Asian, Caucasian, African-American), and extended these observations to a fourth population, Yoruba, with data obtained from the International HapMap consortium. We mapped significant regions to annotated genes. While some regions overlap with genes previously suggested to be under positive selection, many of the genes have not been previously implicated in natural selection and offer intriguing possibilities for further study. AVAILABILITY the programs for the WGLRH algorithm are freely available and can be downloaded at http://www.affymetrix.com/support/supplement/WGLRH_program.zip.
Collapse
Affiliation(s)
- Chun Zhang
- Affymetrix Inc, 3380 Central Expressway, Santa Clara, CA 95051, USA
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
606
|
Kelley JL, Madeoy J, Calhoun JC, Swanson W, Akey JM. Genomic signatures of positive selection in humans and the limits of outlier approaches. Genome Res 2006; 16:980-9. [PMID: 16825663 PMCID: PMC1524870 DOI: 10.1101/gr.5157306] [Citation(s) in RCA: 159] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Identifying regions of the human genome that have been targets of positive selection will provide important insights into recent human evolutionary history and may facilitate the search for complex disease genes. However, the confounding effects of population demographic history and selection on patterns of genetic variation complicate inferences of selection when a small number of loci are studied. To this end, identifying outlier loci from empirical genome-wide distributions of genetic variation is a promising strategy to detect targets of selection. Here, we evaluate the power and efficiency of a simple outlier approach and describe a genome-wide scan for positive selection using a dense catalog of 1.58 million SNPs that were genotyped in three human populations. In total, we analyzed 14,589 genes, 385 of which possess patterns of genetic variation consistent with the hypothesis of positive selection. Furthermore, several extended genomic regions were found, spanning >500 kb, that contained multiple contiguous candidate selection genes. More generally, these data provide important practical insights into the limits of outlier approaches in genome-wide scans for selection, provide strong candidate selection genes to study in greater detail, and may have important implications for disease related research.
Collapse
Affiliation(s)
- Joanna L. Kelley
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Jennifer Madeoy
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - John C. Calhoun
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Willie Swanson
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
| | - Joshua M. Akey
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195, USA
- Corresponding author.E-mail ; fax (206) 685-7301
| |
Collapse
|
607
|
Sabeti PC, Schaffner SF, Fry B, Lohmueller J, Varilly P, Shamovsky O, Palma A, Mikkelsen TS, Altshuler D, Lander ES. Positive natural selection in the human lineage. Science 2006; 312:1614-20. [PMID: 16778047 DOI: 10.1126/science.1124309] [Citation(s) in RCA: 779] [Impact Index Per Article: 41.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
Abstract
Positive natural selection is the force that drives the increase in prevalence of advantageous traits, and it has played a central role in our development as a species. Until recently, the study of natural selection in humans has largely been restricted to comparing individual candidate genes to theoretical expectations. The advent of genome-wide sequence and polymorphism data brings fundamental new tools to the study of natural selection. It is now possible to identify new candidates for selection and to reevaluate previous claims by comparison with empirical distributions of DNA sequence variation across the human genome and among populations. The flood of data and analytical methods, however, raises many new challenges. Here, we review approaches to detect positive natural selection, describe results from recent analyses of genome-wide data, and discuss the prospects and challenges ahead as we expand our understanding of the role of natural selection in shaping the human genome.
Collapse
Affiliation(s)
- P C Sabeti
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | | | | | | | | | | | | | | | | | | |
Collapse
|
608
|
Biswas S, Akey JM. Genomic insights into positive selection. Trends Genet 2006; 22:437-46. [PMID: 16808986 DOI: 10.1016/j.tig.2006.06.005] [Citation(s) in RCA: 283] [Impact Index Per Article: 14.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2005] [Revised: 03/27/2006] [Accepted: 06/06/2006] [Indexed: 01/28/2023]
Abstract
The traditional way of identifying targets of adaptive evolution has been to study a few loci that one hypothesizes a priori to have been under selection. This approach is complicated because of the confounding effects that population demographic history and selection have on patterns of DNA sequence variation. In principle, multilocus analyses can facilitate robust inferences of selection at individual loci. The deluge of large-scale catalogs of genetic variation has stimulated many genome-wide scans for positive selection in several species. Here, we review some of the salient observations of these studies, identify important challenges ahead, consider the limitations of genome-wide scans for selection and discuss the potential significance of a comprehensive understanding of genomic patterns of selection for disease-related research.
Collapse
Affiliation(s)
- Shameek Biswas
- Department of Genome Sciences, University of Washington, 1705 NE Pacific, Seattle, WA 98195, USA
| | | |
Collapse
|
609
|
Stover PJ, Garza C. Nutrition and developmental biology--implications for public health. Nutr Rev 2006; 64:S60-71; discussion S72-91. [PMID: 16770956 DOI: 10.1111/j.1753-4887.2006.tb00248.x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] Open
Abstract
Recent advances in understanding genome-nutrient and nutrient-network interactions, and the modifying effects of genetic variation on their function, have strengthened interests in acute and long-lasting diet/ nutrition influences on health. Relationships between early and mid-gestational and perinatal conditions (including those related to maternal nutrition) and outcomes, and later-onset chronic diseases have received particular attention. Controlled animal experiments support views that responses with long-lasting effects to nutritional milieus are enabled by epigenetic and other metabolic adjustments during critical windows. Thus, underlying mechanisms are beginning to be understood. For example, chromatin remodeling during development can alter gene expression levels, fix or determine future set points critical to intra- and inter-organ communication networks, alter morphogenesis, initiate remodeling events, etc., all with lifelong consequences. These also may affect DNA mutation rates and thereby influence adult cancer and other risks. There is increasing evidence that nutrient-based strategies will be of value to the prevention or delay of onset of chronic diseases and that these strategies may require initiation during embryonic or fetal stages of development to achieve maximal benefit.
Collapse
Affiliation(s)
- Patrick J Stover
- Division of Nutritional Sciences, Cornell University, Ithaca, New York 14853, USA.
| | | |
Collapse
|
610
|
Fry AE, Trafford CJ, Kimber MA, Chan MS, Rockett KA, Kwiatkowski DP. Haplotype homozygosity and derived alleles in the human genome. Am J Hum Genet 2006; 78:1053-9. [PMID: 16685655 PMCID: PMC1474085 DOI: 10.1086/504160] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2005] [Accepted: 03/13/2006] [Indexed: 11/03/2022] Open
Abstract
Haplotype-based techniques are being used to estimate the relative age of alleles--particularly in screening loci for signals of recent positive selection--but does this approach capture even coarse age differences? Using simulations and empirical data from the International HapMap Project, we show that a simple pairwise metric of haplotype homozygosity gives significantly higher mean values for human single-nucleotide-polymorphism alleles that appear to be derived than for those that appear to be ancestral, as determined by comparison with the chimpanzee genome. Our results support the use of haplotype-based techniques, such as extended haplotypic homozygosity, to assess the age of alleles.
Collapse
Affiliation(s)
- Andrew E Fry
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, United Kingdom.
| | | | | | | | | | | |
Collapse
|
611
|
Abstract
The beneficial substitution of an allele shapes patterns of genetic variation at linked sites. Thus, in principle, adaptations can be mapped by looking for the signature of directional selection in polymorphism data. In practice, such efforts are hampered by the need for an accurate characterization of the demographic history of the species and of the effects of positive selection. In an attempt to circumvent these difficulties, researchers are increasingly taking a purely empirical approach, in which a large number of genomic regions are ordered by summaries of the polymorphism data, and loci with extreme values are considered to be likely targets of positive selection. We evaluated the reliability of the "empirical" approach, focusing on applications to human data and to maize. To do so, we considered a coalescent model of directional selection in a sensible demographic setting, allowing for selection on standing variation as well as on a new mutation. Our simulations suggest that while empirical approaches will identify several interesting candidates, they will also miss many--in some cases, most--loci of interest. The extent of the trade-off depends on the mode of positive selection and the demographic history of the population. Specifically, the false-discovery rate is higher when directional selection involves a recessive rather than a co-dominant allele, when it acts on a previously neutral rather than a new allele, and when the population has experienced a population bottleneck rather than maintained a constant size. One implication of these results is that, insofar as attributes of the beneficial mutation (e.g., the dominance coefficient) affect the power to detect targets of selection, genomic scans will yield an unrepresentative subset of loci that contribute to adaptations.
Collapse
Affiliation(s)
- Kosuke M. Teshima
- Department of Human Genetics, University of Chicago, Chicago, Illinois 60637, USA
- Corresponding authors.E-mail ; fax (773) 834-0505.E-mail ; fax (773) 834-0505
| | - Graham Coop
- Department of Human Genetics, University of Chicago, Chicago, Illinois 60637, USA
| | - Molly Przeworski
- Department of Human Genetics, University of Chicago, Chicago, Illinois 60637, USA
- Corresponding authors.E-mail ; fax (773) 834-0505.E-mail ; fax (773) 834-0505
| |
Collapse
|
612
|
Maitland ML, DiRienzo A, Ratain MJ. Interpreting Disparate Responses to Cancer Therapy: The Role of Human Population Genetics. J Clin Oncol 2006; 24:2151-7. [PMID: 16682733 DOI: 10.1200/jco.2005.05.2282] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Increasingly, investigators are recognizing differences in tumor biology, drug metabolism, toxicity, and therapeutic response among different patient populations receiving anticancer agents. These observations provide exciting opportunities to identify the factors most important for predicting individual variability in pharmacologically relevant phenotypes and consequently for personalizing the delivery of cancer therapy. Although pharmacogenomic differences may explain some of these disparities, rigorous investigation of both genetic and nongenetic differences is important to identify the variables most important for optimal selection and dosing of treatment for an individual patient. For example, pharmacogenetic tests currently used in cancer therapy, such as genotyping UGT1A1 to reduce the incidence of severe toxicity of irinotecan and sequencing epidermal growth factor receptor from tumors to identify somatic mutations conferring sensitivity to tyrosine kinase inhibitors, were developed without initial identification of interpopulation differences. Although interpopulation variability in toxicity and efficacy of these agents has been observed, the basis for these population differences remains only partially explained. Here, we review concepts of human population genetics to inform interpretations of disparate drug effects of cancer therapy across patient populations. Understanding these principles will help investigators better design clinical trials to identify the variables most relevant to subsequent individualization of a cancer therapy.
Collapse
Affiliation(s)
- Michael L Maitland
- Section of Hematology/Oncology, Department of Medicine, University of Chicago, Chicago, IL, USA
| | | | | |
Collapse
|
613
|
Kukita Y, Miyatake K, Stokowski R, Hinds D, Higasa K, Wake N, Hirakawa T, Kato H, Matsuda T, Pant K, Cox D, Tahira T, Hayashi K. Genome-wide definitive haplotypes determined using a collection of complete hydatidiform moles. Genome Res 2006; 15:1511-8. [PMID: 16251461 PMCID: PMC1310639 DOI: 10.1101/gr.4371105] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
We present genome-wide definitive haplotypes, determined using a collection of 74 Japanese complete hydatidiform moles, each carrying a genome derived from a single sperm. The haplotypes incorporate 281,439 common SNPs, genotyped with a high throughput array-based oligonucleotide hybridization technique. Comparison of haplotypes inferred from pseudoindividuals (constructed from randomized mole pairs) with those of moles showed some switch errors in resolution of phases by the computational inference method. The effects of these errors on local haplotype structure and selection of tag SNPs are discussed. We also show that definitive haplotypes of moles may be useful for elucidation of long-range haplotype structure, and should be more effective for detecting extended haplotype homozygosity indicative of positive selection.
Collapse
Affiliation(s)
- Yoji Kukita
- Division of Genome Analysis, Research Center for Genetic Information, Medical Institute of Bioregulation, Kyushu University, Fukuoka, Fukuoka 812-8582, Japan
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
614
|
Nielsen R, Williamson S, Kim Y, Hubisz MJ, Clark AG, Bustamante C. Genomic scans for selective sweeps using SNP data. Genome Res 2006; 15:1566-75. [PMID: 16251466 PMCID: PMC1310644 DOI: 10.1101/gr.4252305] [Citation(s) in RCA: 688] [Impact Index Per Article: 36.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Detecting selective sweeps from genomic SNP data is complicated by the intricate ascertainment schemes used to discover SNPs, and by the confounding influence of the underlying complex demographics and varying mutation and recombination rates. Current methods for detecting selective sweeps have little or no robustness to the demographic assumptions and varying recombination rates, and provide no method for correcting for ascertainment biases. Here, we present several new tests aimed at detecting selective sweeps from genomic SNP data. Using extensive simulations, we show that a new parametric test, based on composite likelihood, has a high power to detect selective sweeps and is surprisingly robust to assumptions regarding recombination rates and demography (i.e., has low Type I error). Our new test also provides estimates of the location of the selective sweep(s) and the magnitude of the selection coefficient. To illustrate the method, we apply our approach to data from the Seattle SNP project and to Chromosome 2 data from the HapMap project. In Chromosome 2, the most extreme signal is found in the lactase gene, which previously has been shown to be undergoing positive selection. Evidence for selective sweeps is also found in many other regions, including genes known to be associated with disease risk such as DPP10 and COL4A3.
Collapse
Affiliation(s)
- Rasmus Nielsen
- Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, New York 14853, USA.
| | | | | | | | | | | |
Collapse
|
615
|
Aly TA, Eller E, Ide A, Gowan K, Babu SR, Erlich HA, Rewers MJ, Eisenbarth GS, Fain PR. Multi-SNP analysis of MHC region: remarkable conservation of HLA-A1-B8-DR3 haplotype. Diabetes 2006; 55:1265-9. [PMID: 16644681 DOI: 10.2337/db05-1276] [Citation(s) in RCA: 71] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
Technology has become available to cost-effectively analyze thousands of single nucleotide polymorphisms (SNPs). We recently confirmed by genotyping a small series of class I alleles and microsatellite markers that the extended haplotype HLA-A1-B8-DR3 (8.1 AH) at the major histocompatibility complex (MHC) is a common and conserved haplotype. To further evaluate the region of conservation of the DR3 haplotypes, we genotyped 31 8.1 AHs and 29 other DR3 haplotypes with a panel of 656 SNPs spanning 4.8 Mb in the MHC region. This multi-SNP evaluation revealed a 2.9-Mb region that was essentially invariable for all 31 8.1 AHs. The 31 8.1 AHs were >99.9% identical for 384 consecutive SNPs of the 656 SNPs analyzed. Future association studies of MHC-linked susceptibility to type 1 diabetes will need to account for the extensive conservation of the 8.1 AH, since individuals who carry this haplotype provide no information about the differential effects of the alleles that are present on this haplotype.
Collapse
Affiliation(s)
- Theresa A Aly
- Barbara Davis Center for Childhood Diabetes, University of Colorado Health Sciences Center, Mail Stop B140, P.O. Box 6511, Aurora, CO 80045-6511, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
616
|
Ryan AW, Mapp J, Moyna S, Mattiangeli V, Kelleher D, Bradley DG, McManus R. Levels of interpopulation differentiation among different functional classes of immunologically important genes. Genes Immun 2006; 7:179-83. [PMID: 16222342 DOI: 10.1038/sj.gene.6364266] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
It has been postulated that gene function may influence the degree to which allele frequencies differ among populations. In order to evaluate this effect, genotypic data from resequencing studies of genes classified as cytokines, cytokine receptors, cell adhesion molecules, Toll-like receptors and coagulation proteins were analysed for genetic differentiation (FST) between population samples of European and African descent. FST values did not differ statistically among functional groups when all polymorphic sites were included in the analyses. However, analysis based on nonsynonymous SNPs alone suggested weak heterogeneity among functional classes (P=0.0424). Particularly high levels of differentiation were shown by individual nonsynonymous SNPs at some genes, most notably ICAM1 and some Toll-like receptors. These genes interact directly with pathogens, and may therefore have been subject to geographically localised natural selection. Such loci warrant particular attention in studies of genetic disease risk and local adaptation to environmental conditions.
Collapse
Affiliation(s)
- A W Ryan
- Department of Clinical Medicine, Trinity College, Trinity Centre for Health Sciences, St James's Hospital, Dublin, Ireland.
| | | | | | | | | | | | | |
Collapse
|
617
|
Lao O, Duijn KV, Kersbergen P, Knijff PD, Kayser M. Proportioning whole-genome single-nucleotide-polymorphism diversity for the identification of geographic population structure and genetic ancestry. Am J Hum Genet 2006; 78:680-90. [PMID: 16532397 PMCID: PMC1424693 DOI: 10.1086/501531] [Citation(s) in RCA: 127] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2005] [Accepted: 01/12/2006] [Indexed: 12/13/2022] Open
Abstract
The identification of geographic population structure and genetic ancestry on the basis of a minimal set of genetic markers is desirable for a wide range of applications in medical and forensic sciences. However, the absence of sharp discontinuities in the neutral genetic diversity among human populations implies that, in practice, a large number of neutral markers will be required to identify the genetic ancestry of one individual. We showed that it is possible to reduce the amount of markers required for detecting continental population structure to only 10 single-nucleotide polymorphisms (SNPs), by applying a newly developed ascertainment algorithm to Affymetrix GeneChip Mapping 10K SNP array data that we obtained from samples of globally dispersed human individuals (the Y Chromosome Consortium panel). Furthermore, this set of SNPs was able to recover the genetic ancestry of individuals from all four continents represented in the original data set when applied to an independent, much larger, worldwide population data set (Centre d'Etude du Polymorphisme Humain-Human Genome Diversity Project Cell Line Panel). Finally, we provide evidence that the unusual patterns of genetic variation we observed at the respective genomic regions surrounding the five most informative SNPs is in agreement with local positive selection being the explanation for the striking SNP allele-frequency differences we found between continental groups of human populations.
Collapse
Affiliation(s)
- Oscar Lao
- Department of Forensic Molecular Biology, Erasmus University Medical Centre Rotterdam, Rotterdam; Department of Biology, Netherlands Forensic Institute, The Hague; and Forensic Laboratory for DNA Research, Department of Human and Clinical Genetics, Leiden University Medical Centre, Leiden, The Netherlands
| | - Kate van Duijn
- Department of Forensic Molecular Biology, Erasmus University Medical Centre Rotterdam, Rotterdam; Department of Biology, Netherlands Forensic Institute, The Hague; and Forensic Laboratory for DNA Research, Department of Human and Clinical Genetics, Leiden University Medical Centre, Leiden, The Netherlands
| | - Paula Kersbergen
- Department of Forensic Molecular Biology, Erasmus University Medical Centre Rotterdam, Rotterdam; Department of Biology, Netherlands Forensic Institute, The Hague; and Forensic Laboratory for DNA Research, Department of Human and Clinical Genetics, Leiden University Medical Centre, Leiden, The Netherlands
| | - Peter de Knijff
- Department of Forensic Molecular Biology, Erasmus University Medical Centre Rotterdam, Rotterdam; Department of Biology, Netherlands Forensic Institute, The Hague; and Forensic Laboratory for DNA Research, Department of Human and Clinical Genetics, Leiden University Medical Centre, Leiden, The Netherlands
| | - Manfred Kayser
- Department of Forensic Molecular Biology, Erasmus University Medical Centre Rotterdam, Rotterdam; Department of Biology, Netherlands Forensic Institute, The Hague; and Forensic Laboratory for DNA Research, Department of Human and Clinical Genetics, Leiden University Medical Centre, Leiden, The Netherlands
| |
Collapse
|
618
|
Xue Y, Daly A, Yngvadottir B, Liu M, Coop G, Kim Y, Sabeti P, Chen Y, Stalker J, Huckle E, Burton J, Leonard S, Rogers J, Tyler-Smith C. Spread of an inactive form of caspase-12 in humans is due to recent positive selection. Am J Hum Genet 2006; 78:659-70. [PMID: 16532395 PMCID: PMC1424700 DOI: 10.1086/503116] [Citation(s) in RCA: 123] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2005] [Accepted: 02/01/2006] [Indexed: 11/03/2022] Open
Abstract
The human caspase-12 gene is polymorphic for the presence or absence of a stop codon, which results in the occurrence of both active (ancestral) and inactive (derived) forms of the gene in the population. It has been shown elsewhere that carriers of the inactive gene are more resistant to severe sepsis. We have now investigated whether the inactive form has spread because of neutral drift or positive selection. We determined its distribution in a worldwide sample of 52 populations and resequenced the gene in 77 individuals from the HapMap Yoruba, Han Chinese, and European populations. There is strong evidence of positive selection from low diversity, skewed allele-frequency spectra, and the predominance of a single haplotype. We suggest that the inactive form of the gene arose in Africa approximately 100-500 thousand years ago (KYA) and was initially neutral or almost neutral but that positive selection beginning approximately 60-100 KYA drove it to near fixation. We further propose that its selective advantage was sepsis resistance in populations that experienced more infectious diseases as population sizes and densities increased.
Collapse
Affiliation(s)
- Yali Xue
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambs CB10 1SA, United Kingdom
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
619
|
Voight BF, Kudaravalli S, Wen X, Pritchard JK. A map of recent positive selection in the human genome. PLoS Biol 2006; 4:e72. [PMID: 16494531 PMCID: PMC1382018 DOI: 10.1371/journal.pbio.0040072] [Citation(s) in RCA: 1872] [Impact Index Per Article: 98.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2005] [Accepted: 01/10/2006] [Indexed: 11/20/2022] Open
Abstract
The identification of signals of very recent positive selection provides information about the adaptation of modern humans to local conditions. We report here on a genome-wide scan for signals of very recent positive selection in favor of variants that have not yet reached fixation. We describe a new analytical method for scanning single nucleotide polymorphism (SNP) data for signals of recent selection, and apply this to data from the International HapMap Project. In all three continental groups we find widespread signals of recent positive selection. Most signals are region-specific, though a significant excess are shared across groups. Contrary to some earlier low resolution studies that suggested a paucity of recent selection in sub-Saharan Africans, we find that by some measures our strongest signals of selection are from the Yoruba population. Finally, since these signals indicate the existence of genetic variants that have substantially different fitnesses, they must indicate loci that are the source of significant phenotypic variation. Though the relevant phenotypes are generally not known, such loci should be of particular interest in mapping studies of complex traits. For this purpose we have developed a set of SNPs that can be used to tag the strongest ∼250 signals of recent selection in each population. Applying their newly developed method, the authors search International HapMap Project data representing three populations for signals of recent selection across the human genome.
Collapse
Affiliation(s)
- Benjamin F Voight
- 1Department of Human Genetics, University of Chicago, Chicago, Illinois, United States of America
| | - Sridhar Kudaravalli
- 1Department of Human Genetics, University of Chicago, Chicago, Illinois, United States of America
| | - Xiaoquan Wen
- 1Department of Human Genetics, University of Chicago, Chicago, Illinois, United States of America
| | - Jonathan K Pritchard
- 1Department of Human Genetics, University of Chicago, Chicago, Illinois, United States of America
| |
Collapse
|
620
|
Patin E, Barreiro LB, Sabeti PC, Austerlitz F, Luca F, Sajantila A, Behar DM, Semino O, Sakuntabhai A, Guiso N, Gicquel B, McElreavey K, Harding RM, Heyer E, Quintana-Murci L. Deciphering the ancient and complex evolutionary history of human arylamine N-acetyltransferase genes. Am J Hum Genet 2006; 78:423-36. [PMID: 16416399 PMCID: PMC1380286 DOI: 10.1086/500614] [Citation(s) in RCA: 72] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2005] [Accepted: 12/21/2005] [Indexed: 12/24/2022] Open
Abstract
The human N-acetyltransferase genes NAT1 and NAT2 encode two phase-II enzymes that metabolize various drugs and carcinogens. Functional variability at these genes has been associated with adverse drug reactions and cancer susceptibility. Mutations in NAT2 leading to the so-called slow-acetylation phenotype reach high frequencies worldwide, which questions the significance of altered acetylation in human adaptation. To investigate the role of population history and natural selection in shaping NATs variation, we characterized genetic diversity through the resequencing and genotyping of NAT1, NAT2, and the pseudogene NATP in a collection of 13 different populations with distinct ethnic backgrounds and demographic pasts. This combined study design allowed us to define a detailed map of linkage disequilibrium of the NATs region as well as to perform a number of sequence-based neutrality tests and the long-range haplotype (LRH) test. Our data revealed distinctive patterns of variability for the two genes: the reduced diversity observed at NAT1 is consistent with the action of purifying selection, whereas NAT2 functional variation contributes to high levels of diversity. In addition, the LRH test identified a particular NAT2 haplotype (NAT2*5B) under recent positive selection in western/central Eurasians. This haplotype harbors the mutation 341T-->C and encodes the "slowest-acetylator" NAT2 enzyme, suggesting a general selective advantage for the slow-acetylator phenotype. Interestingly, the NAT2*5B haplotype, which seems to have conferred a selective advantage during the past approximately 6,500 years, exhibits today the strongest association with susceptibility to bladder cancer and adverse drug reactions. On the whole, the patterns observed for NAT2 well illustrate how geographically and temporally fluctuating xenobiotic environments may have influenced not only our genome variability but also our present-day susceptibility to disease.
Collapse
Affiliation(s)
- Etienne Patin
- Centre National de la Recherche Scientifique, CNRS, FRE 2849, Unit of Molecular Prevention and Therapy of Human Diseases, Paris, France
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
621
|
Zhang K, Zhu J, Shendure J, Porreca GJ, Aach JD, Mitra RD, Church GM. Long-range polony haplotyping of individual human chromosome molecules. Nat Genet 2006; 38:382-7. [PMID: 16493423 DOI: 10.1038/ng1741] [Citation(s) in RCA: 93] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2005] [Accepted: 01/05/2006] [Indexed: 11/09/2022]
Abstract
We report a method for multilocus long-range haplotyping on human chromosome molecules in vitro based on the DNA polymerase colony (polony) technology. By immobilizing thousands of intact chromosome molecules within a polyacrylamide gel on a microscope slide and performing multiple amplifications from single molecules, we determined long-range haplotypes spanning a 153-Mb region of human chromosome 7 and found evidence of rare mitotic recombination events in human lymphocytes. Furthermore, the parallel nature of DNA polony technology allows efficient haplotyping on pooled DNAs from a population on one slide, with a throughput three orders of magnitudes higher than current molecular haplotyping methods. Linkage disequilibrium statistics established by our pooled DNA haplotyping method are more accurate than statistically inferred haplotypes. This haplotyping method is well suited for candidate gene-based association studies as well as for investigating the pattern of recombination in mammalian cells.
Collapse
Affiliation(s)
- Kun Zhang
- Department of Genetics, Harvard Medical School, Boston, Massachusetts 02115, USA.
| | | | | | | | | | | | | |
Collapse
|
622
|
Abstract
Genetic variation is known to affect food tolerances among human subpopulations and may also influence dietary requirements, giving rise to the new field of nutritional genomics and raising the possibility of individualizing nutritional intake for optimal health and disease prevention on the basis of an individual's genome. However, because gene-diet interactions are complex and poorly understood, the use of genomic knowledge to adjust population-based dietary recommendations is not without risk. Whereas current recommendations target most of the population to prevent nutritional deficiencies, inclusion of genomic criteria may indicate subpopulations that may incur differential benefit or risk from generalized recommendations and fortification policies. Current efforts to identify gene alleles that affect nutrient utilization have been enhanced by the identification of genetic variations that have expanded as a consequence of selection under extreme conditions. Identification of genetic variation that arose as a consequence of diet as a selective pressure helps to identify gene alleles that affect nutrient utilization. Understanding the molecular mechanisms underlying gene-nutrient interactions and their modification by genetic variation is expected to result in dietary recommendations and nutritional interventions that optimize individual health.
Collapse
Affiliation(s)
- Patrick J Stover
- Division of Nutritional Sciences, Cornell University, Ithaca NY 14853, USA.
| |
Collapse
|
623
|
Moore LT, McEvoy B, Cape E, Simms K, Bradley DG. A Y-chromosome signature of hegemony in Gaelic Ireland. Am J Hum Genet 2006; 78:334-8. [PMID: 16358217 PMCID: PMC1380239 DOI: 10.1086/500055] [Citation(s) in RCA: 66] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2005] [Accepted: 11/18/2005] [Indexed: 11/03/2022] Open
Abstract
Seventeen-marker simple tandem repeat genetic analysis of Irish Y chromosomes reveals a previously unnoted modal haplotype that peaks in frequency in the northwestern part of the island. It shows a significant association with surnames purported to have descended from the most important and enduring dynasty of early medieval Ireland, the Ui Neill. This suggests that such phylogenetic predominance is a biological record of past hegemony and supports the veracity of semimythological early genealogies. The fact that about one in five males sampled in northwestern Ireland is likely a patrilineal descendent of a single early medieval ancestor is a powerful illustration of the potential link between prolificacy and power and of how Y-chromosome phylogeography can be influenced by social selection.
Collapse
Affiliation(s)
- Laoise T Moore
- Smurfit Institute of Genetics, Trinity College, Dublin, Ireland
| | | | | | | | | |
Collapse
|
624
|
Wang ET, Kodama G, Baldi P, Moyzis RK. Global landscape of recent inferred Darwinian selection for Homo sapiens. Proc Natl Acad Sci U S A 2006; 103:135-40. [PMID: 16371466 PMCID: PMC1317879 DOI: 10.1073/pnas.0509691102] [Citation(s) in RCA: 218] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2005] [Indexed: 12/24/2022] Open
Abstract
By using the 1.6 million single-nucleotide polymorphism (SNP) genotype data set from Perlegen Sciences [Hinds, D. A., Stuve, L. L., Nilsen, G. B., Halperin, E., Eskin, E., Ballinger, D. G., Frazer, K. A. & Cox, D. R. (2005) Science 307, 1072-1079], a probabilistic search for the landscape exhibited by positive Darwinian selection was conducted. By sorting each high-frequency allele by homozygosity, we search for the expected decay of adjacent SNP linkage disequilibrium (LD) at recently selected alleles, eliminating the need for inferring haplotype. We designate this approach the LD decay (LDD) test. By these criteria, 1.6% of Perlegen SNPs were found to exhibit the genetic architecture of selection. These results were confirmed on an independently generated data set of 1.0 million SNP genotypes (International Human Haplotype Map Phase I freeze). Simulation studies indicate that the LDD test, at the megabase scale used, effectively distinguishes selection from other causes of extensive LD, such as inversions, population bottlenecks, and admixture. The approximately 1,800 genes identified by the LDD test were clustered according to Gene Ontology (GO) categories. Based on overrepresentation analysis, several predominant biological themes are common in these selected alleles, including host-pathogen interactions, reproduction, DNA metabolism/cell cycle, protein metabolism, and neuronal function.
Collapse
Affiliation(s)
- Eric T Wang
- Department of Biological Chemistry, College of Medicine, Donald Bren School of Information and Computer Sciences, and Institute of Genomics and Bioinformatics, University of California, Irvine, CA 92697, USA
| | | | | | | |
Collapse
|
625
|
Swallow DM. DNA test for hypolactasia premature. Gut 2006; 55:131; author reply 131-2. [PMID: 16344579 PMCID: PMC1856366] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [MESH Headings] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/08/2022]
|
626
|
Gardner M, González-Neira A, Lao O, Calafell F, Bertranpetit J, Comas D. Extreme population differences across Neuregulin 1 gene, with implications for association studies. Mol Psychiatry 2006; 11:66-75. [PMID: 16189508 DOI: 10.1038/sj.mp.4001749] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Neuregulin 1 (NRG1) is one of the most exciting candidate genes for schizophrenia in recent years since its first association with the disease in an Icelandic population. Since then, many association studies have analysed allele and haplotype frequencies in distinct populations yielding varying results: some have replicated the association, although with different alleles or haplotypes being associated, whereas others have failed to replicate the association. These contradictory results might be attributed to population differences in allele and haplotype frequencies. In order to approach this issue, we have typed 13 SNPs across this large 1.4 Mb gene, including two of the SNPs originally found associated with schizophrenia in the Icelandic population, the objective being to discover if the underlying cause of the association discrepancies to date may be due to population-specific genetic variation. The analyses have been performed in a total of 1088 individuals from 39 populations, covering most of the genetic diversity in the human species. Most of the SNPs analysed displayed differing frequencies according to geographical region. These allele differences are especially relevant in two SNPs located in a large intron of the gene, as shown by the extreme F(ST) values, which reveal genetic stratification correlated to broad continental areas. This finding may be indicative of the influence of some local selective forces on this gene. Furthermore, haplotype analysis reveals a clear clustering according to geographical areas. In summary, our findings suggest that NRG1 presents extreme population differences in allele and haplotype frequencies. We have given recommendations for taking this into account in future association studies since this diversity could give rise to erroneous results.
Collapse
Affiliation(s)
- M Gardner
- Unitat de Biologia Evolutiva, Departament de Ciències Experimentals i de la Salut, Universitat Pompeu Fabra, Doctor Aiguader 80, 08003 Barcelona, Catalonia, Spain
| | | | | | | | | | | |
Collapse
|
627
|
Hanchard NA, Rockett KA, Spencer C, Coop G, Pinder M, Jallow M, Kimber M, McVean G, Mott R, Kwiatkowski DP. Screening for recently selected alleles by analysis of human haplotype similarity. Am J Hum Genet 2006; 78:153-9. [PMID: 16385459 PMCID: PMC1380214 DOI: 10.1086/499252] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2005] [Accepted: 10/17/2005] [Indexed: 11/03/2022] Open
Abstract
There is growing interest in the use of haplotype-based methods for detecting recent selection. Here, we describe a method that uses a sliding window to estimate similarity among the haplotypes associated with any given single-nucleotide polymorphism (SNP) allele. We used simulations of natural selection to provide estimates of the empirical power of the method to detect recently selected alleles and found it to be comparable in power to the popular long-range haplotype test and more powerful than methods based on nucleotide diversity. We then applied the method to a recently selected allele--the sickle mutation at the HBB locus--and found it to have a signal of selection that was significantly stronger than that of simulated models both with and without strong selection. Using this method, we also evaluated >4,000 SNPs on chromosome 20, indicating the applicability of the method to regional data sets.
Collapse
Affiliation(s)
- Neil A Hanchard
- Wellcome Trust Centre for Human Genetics, Oxford, United Kingdom.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
628
|
Schirmer M, Toliat MR, Haberl M, Suk A, Kamdem LK, Klein K, Brockmöller J, Nürnberg P, Zanger UM, Wojnowski L. Genetic signature consistent with selection against the CYP3A4*1B allele in non-African populations. Pharmacogenet Genomics 2006; 16:59-71. [PMID: 16344723 DOI: 10.1097/01.fpc.0000182779.03180.ba] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
Cytochrome P450 3A enzymes (CYP3A) play a major role in the metabolism of steroid hormones, drugs and other chemicals, including many carcinogens. The individually variable CYP3A expression, which remains mostly unexplained, has been suggested to affect clinical phenotypes. We investigated the CYP3A locus in five ethnic groups. The degree of linkage disequilibrium (LD) differed among ethnic groups, but the most common alleles of the conserved LD regions were remarkably similar. Non-African haplotypes are few; for example, only four haplotypes account for 80% of common European Caucasian alleles. Large LD blocks of high frequencies were suggestive of selection. Accordingly, European Caucasian and Asian cohorts each contained a block of single nucleotide polymorphism (SNPs) with very high P excess values. The overlap between these blocks in these two groups contained only two of the investigated 26 SNPs and one of them was the CYP3A4*1B allele. The region centromeric of CYP3A4*1B exhibited high haplotype homozygosity in European Caucasians as opposed to African-Americans. CYP3A4*1B showed a moderate effect on CYP3A4 mRNA and protein expression, as well as on CYP3A activity assessed as Vmax of testosterone 6beta-hydroxylation in a liver bank. Our data are consistent with a functional relevance of CYP3A4*1B and with selection against this allele in non-African populations. The elimination of CYP3A4*1B involved different parts of the CYP3A locus in European Caucasians and Asians. Because CYP3A4 is involved in the vitamin D metabolism, rickets may have been the underlying selecting factor.
Collapse
Affiliation(s)
- Markus Schirmer
- Department of Pharmacology, Johannes Gutenberg University, Mainz, Germany
| | | | | | | | | | | | | | | | | | | |
Collapse
|
629
|
Abstract
Inherited genetic variation has a critical but as yet largely uncharacterized role in human disease. Here we report a public database of common variation in the human genome: more than one million single nucleotide polymorphisms (SNPs) for which accurate and complete genotypes have been obtained in 269 DNA samples from four populations, including ten 500-kilobase regions in which essentially all information about common DNA variation has been extracted. These data document the generality of recombination hotspots, a block-like structure of linkage disequilibrium and low haplotype diversity, leading to substantial correlations of SNPs with many of their neighbours. We show how the HapMap resource can guide the design and analysis of genetic association studies, shed light on structural variation and recombination, and identify loci that may have been subject to natural selection during human evolution.
Collapse
|
630
|
Lewinsky RH, Jensen TGK, Møller J, Stensballe A, Olsen J, Troelsen JT. T-13910 DNA variant associated with lactase persistence interacts with Oct-1 and stimulates lactase promoter activity in vitro. Hum Mol Genet 2005; 14:3945-53. [PMID: 16301215 DOI: 10.1093/hmg/ddi418] [Citation(s) in RCA: 119] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023] Open
Abstract
Two phenotypes exist in the human population with regard to expression of lactase in adults. Lactase non-persistence (adult-type hypolactasia and lactose intolerance) is characterized by a decline in the expression of lactase-phlorizin hydrolase (LPH) after weaning. In contrast, lactase-persistent individuals have a high LPH throughout their lifespan. Lactase persistence and non-persistence are associated with a T/C polymorphism at position -13,910 upstream the lactase gene. A nuclear factor binds more strongly to the T-13,910 variant associated with lactase persistence than the C-13,910 variant associated with lactase non-persistence. Oct-1 and glyceraldehyde-3-phosphate dehydrogenase were co-purified by DNA affinity purification using the sequence of the T-13,910 variant. Supershift analyses show that Oct-1 binds directly to the T-13,910 variant, and we suggest that GAPDH is co-purified due to interactions with Oct-1. Expression of Oct-1 stimulates reporter gene expression from the T and the C-13,910 variant/LPH promoter constructs only when it is co-expressed with HNF1alpha. Binding sites for other intestinal transcription factors (GATA-6, HNF4alpha, Fox and Cdx-2) were identified in the region of the -13,910 T/C polymorphism. Three of these sites are required for the enhancer activity of the -13,910 region. The data suggest that the binding of Oct-1 to the T-13,910 variant directs increased lactase promoter activity and this might provide an explanation for the lactase persistence phenotype in the human population.
Collapse
Affiliation(s)
- Rikke H Lewinsky
- Department of Medical Biochemistry and Genetics, Panum Institute, University of Copenhagen, Denmark
| | | | | | | | | | | |
Collapse
|
631
|
Carlson CS, Thomas DJ, Eberle MA, Swanson JE, Livingston RJ, Rieder MJ, Nickerson DA. Genomic regions exhibiting positive selection identified from dense genotype data. Genome Res 2005; 15:1553-65. [PMID: 16251465 PMCID: PMC1310643 DOI: 10.1101/gr.4326505] [Citation(s) in RCA: 205] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2005] [Accepted: 09/06/2005] [Indexed: 01/14/2023]
Abstract
The allele frequency spectrum of polymorphisms in DNA sequences can be used to test for signatures of natural selection that depart from the expected frequency spectrum under the neutral theory. We observed a significant (P = 0.001) correlation between the Tajima's D test statistic in full resequencing data and Tajima's D in a dense, genome-wide data set of genotyped polymorphisms for a set of 179 genes. Based on this, we used a sliding window analysis of Tajima's D across the human genome to identify regions putatively subject to strong, recent, selective sweeps. This survey identified seven Contiguous Regions of Tajima's D Reduction (CRTRs) in an African-descent population (AD), 23 in a European-descent population (ED), and 29 in a Chinese-descent population (XD). Only four CRTRs overlapped between populations: three between ED and XD and one between AD and ED. Full resequencing of eight genes within six CRTRs demonstrated frequency spectra inconsistent with neutral expectations for at least one gene within each CRTR. Identification of the functional polymorphism (and/or haplotype) responsible for the selective sweeps within each CRTR may provide interesting insights into the strongest selective pressures experienced by the human genome over recent evolutionary history.
Collapse
Affiliation(s)
- Christopher S Carlson
- Department of Genome Sciences, University of Washington, Seattle, Washington 98195-7730, USA.
| | | | | | | | | | | | | |
Collapse
|
632
|
Di Rienzo A, Hudson RR. An evolutionary framework for common diseases: the ancestral-susceptibility model. Trends Genet 2005; 21:596-601. [PMID: 16153740 DOI: 10.1016/j.tig.2005.08.007] [Citation(s) in RCA: 146] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2005] [Revised: 07/20/2005] [Accepted: 08/22/2005] [Indexed: 10/25/2022]
Abstract
Unlike rare mendelian diseases, which are due to new mutations (i.e. derived alleles), several alleles that increase the risk to common diseases are ancestral. Moreover, population genetics studies suggest that some derived alleles that protect against common diseases became advantageous recently. These observations can be explained within an evolutionary framework in which ancestral alleles reflect ancient adaptations to the lifestyle of ancient human populations, whereas the derived alleles were deleterious. However, with the shift in environment and lifestyle, the ancestral alleles now increase the risk of common diseases in modern populations. In this article, we develop an explicit evolutionary model and use population genetics simulations to investigate the expected haplotype structure and type of disease-association signals of ancestral risk alleles.
Collapse
Affiliation(s)
- Anna Di Rienzo
- Department of Human Genetics, University of Chicago, Chicago, IL 60637, USA.
| | | |
Collapse
|
633
|
Weir BS, Cardon LR, Anderson AD, Nielsen DM, Hill WG. Measures of human population structure show heterogeneity among genomic regions. Genome Res 2005; 15:1468-76. [PMID: 16251456 PMCID: PMC1310634 DOI: 10.1101/gr.4398405] [Citation(s) in RCA: 227] [Impact Index Per Article: 11.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2005] [Revised: 08/31/2005] [Indexed: 11/25/2022]
Abstract
Estimates of genetic population structure (F(ST)) were constructed from all autosomes in two large SNP data sets. The Perlegen data set contains genotypes on approximately 1 million SNPs segregating in all three samples of Americans of African, Asian, and European descent; and the Phase I HapMap data set contains genotypes on approximately 0.6 million SNPs segregating in all four samples from specific Caucasian, Chinese, Japanese, and Yoruba populations. Substantial heterogeneity of F(ST) values was found between segments within chromosomes, although there was similarity between the two data sets. There was also substantial heterogeneity among population-specific F(ST) values, with the relative sizes of these values often changing along each chromosome. Population-structure estimates are often used as indicators of natural selection, but the analyses presented here show that individual-marker estimates are too variable to be useful. There is inherent variation in these statistics because of variation in genealogy even among neutral loci, and values at pairs of loci are correlated to an extent that reflects the linkage disequilibrium between them. Furthermore, it may be that the best indications of selection will come from population-specific F(ST) values rather than the usually reported population-average values.
Collapse
Affiliation(s)
- Bruce S Weir
- Program in Statistical Genetics, Department of Statistics, North Carolina State University, Raleigh, North Carolina 27695-7566, USA.
| | | | | | | | | |
Collapse
|
634
|
|
635
|
Abstract
The completion of the International HapMap Project marks the start of a new phase in human genetics. The aim of the project was to provide a resource that facilitates the design of efficient genome-wide association studies, through characterising patterns of genetic variation and linkage disequilibrium in a sample of 270 individuals across four geographical populations. In total, over one million SNPs have been typed across these genomes, providing an unprecedented view of human genetic diversity. In this review we focus on what the HapMap Project has taught us about the structure of human genetic variation and the fundamental molecular and evolutionary processes that shape it.
Collapse
Affiliation(s)
- Gil McVean
- Department of Statistics, University of Oxford, Oxford, United Kingdom.
| | | | | |
Collapse
|
636
|
Evans PD, Gilbert SL, Mekel-Bobrov N, Vallender EJ, Anderson JR, Vaez-Azizi LM, Tishkoff SA, Hudson RR, Lahn BT. Microcephalin, a Gene Regulating Brain Size, Continues to Evolve Adaptively in Humans. Science 2005; 309:1717-20. [PMID: 16151009 DOI: 10.1126/science.1113722] [Citation(s) in RCA: 251] [Impact Index Per Article: 12.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
Abstract
The gene Microcephalin (MCPH1) regulates brain size and has evolved under strong positive selection in the human evolutionary lineage. We show that one genetic variant of Microcephalin in modern humans, which arose approximately 37,000 years ago, increased in frequency too rapidly to be compatible with neutral drift. This indicates that it has spread under strong positive selection, although the exact nature of the selection is unknown. The finding that an important brain gene has continued to evolve adaptively in anatomically modern humans suggests the ongoing evolutionary plasticity of the human brain. It also makes Microcephalin an attractive candidate locus for studying the genetics of human variation in brain-related phenotypes.
Collapse
Affiliation(s)
- Patrick D Evans
- Howard Hughes Medical Institute, Department of Human Genetics, University of Chicago, Chicago, IL 60637, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
637
|
Vasemägi A, Primmer CR. Challenges for identifying functionally important genetic variation: the promise of combining complementary research strategies. Mol Ecol 2005; 14:3623-42. [PMID: 16202085 DOI: 10.1111/j.1365-294x.2005.02690.x] [Citation(s) in RCA: 239] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
Strategies for the identification of functional genetic variation underlying phenotypic traits of ecological and evolutionary importance have received considerable attention in the literature recently. This paper aims to bring together and compare the relative strengths and limitations of various potentially useful research strategies for dissecting functionally important genetic variation in a wide range of organisms. We briefly explore the relative strengths and limitations of traditional and emerging approaches and evaluate their potential use in free-living populations. While it is likely that much of the progress in functional genetic analyses will rely on progress in traditional model species, it is clear that with prudent choices of methods and appropriate sampling designs, much headway can be also made in a diverse range of species. We suggest that combining research approaches targeting different functional and biological levels can potentially increase understanding the genetic basis of ecological and evolutionary processes both in model and non-model organisms.
Collapse
Affiliation(s)
- A Vasemägi
- Department of Biology, University of Turku, Finland
| | | |
Collapse
|
638
|
|
639
|
Campbell CD, Ogburn EL, Lunetta KL, Lyon HN, Freedman ML, Groop LC, Altshuler D, Ardlie KG, Hirschhorn JN. Demonstrating stratification in a European American population. Nat Genet 2005; 37:868-72. [PMID: 16041375 DOI: 10.1038/ng1607] [Citation(s) in RCA: 321] [Impact Index Per Article: 16.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2005] [Accepted: 05/31/2005] [Indexed: 11/08/2022]
Abstract
Population stratification occurs in case-control association studies when allele frequencies differ between cases and controls because of ancestry. Stratification may lead to false positive associations, although this issue remains controversial. Empirical studies have found little evidence of stratification in European-derived populations, but potentially significant levels of stratification could not be ruled out. We studied a European American panel discordant for height, a heritable trait that varies widely across Europe. Genotyping 178 SNPs and applying standard analytical methods yielded no evidence of stratification. But a SNP in the gene LCT that varies widely in frequency across Europe was strongly associated with height (P < 10(-6)). This apparent association was largely or completely due to stratification; rematching individuals on the basis of European ancestry greatly reduced the apparent association, and no association was observed in Polish or Scandinavian individuals. The failure of standard methods to detect this stratification indicates that new methods may be required.
Collapse
Affiliation(s)
- Catarina D Campbell
- Program in Genomics and Division of Endocrinology, Children's Hospital, 300 Longwood Avenue, Boston, Massachusetts 02115, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
640
|
Saunders MA, Slatkin M, Garner C, Hammer MF, Nachman MW. The extent of linkage disequilibrium caused by selection on G6PD in humans. Genetics 2005; 171:1219-29. [PMID: 16020776 PMCID: PMC1456824 DOI: 10.1534/genetics.105.048140] [Citation(s) in RCA: 96] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The gene coding for glucose-6-phosphate dehydrogenase (G6PD) is subject to positive selection by malaria in some human populations. The G6PD A- allele, which is common in sub-Saharan Africa, is associated with deficient enzyme activity and protection from severe malaria. To delimit the impact of selection on patterns of linkage disequilibrium (LD) and nucleotide diversity, we resequenced 5.1 kb at G6PD and approximately 2-3 kb at each of eight loci in a 2.5-Mb region roughly centered on G6PD in a diverse sub-Saharan African panel of 51 unrelated men (including 20 G6PD A-, 11 G6PD A+, and 20 G6PD B chromosomes). The signature of selection is evident in the absence of genetic variation at G6PD and at three neighboring loci within 0.9 Mb from G6PD among all individuals bearing G6PD A- alleles. A genomic region of approximately 1.6 Mb around G6PD was characterized by long-range LD associated with the A- alleles. These patterns of nucleotide variability and LD suggest that G6PD A- is younger than previous age estimates and has increased in frequency in sub-Saharan Africa due to strong selection (0.1 < s < 0.2). These results also show that selection can lead to nonrandom associations among SNPs over great physical and genetic distances, even in African populations.
Collapse
Affiliation(s)
- Matthew A Saunders
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona 85721, USA.
| | | | | | | | | |
Collapse
|
641
|
Abstract
OBJECTIVE Adult lactose intolerance, which affects the majority of the population in the world, has been associated with a single nucleotide polymorphism, C-13910T, located upstream of the lactase gene. MATERIAL AND METHODS Adult patients undergoing lactose tolerance tests with lactose challenge and plasma glucose measurements were included in the study comprising 44 Swedes and 7 non-Swedish individuals. A real-time PCR method was established for the genotyping. RESULTS Out of 51 patients 48 had concordant results on genotyping and lactose tolerance tests, e.g. -13910T/T and -13910C/T genotypes had high glucose elevations. All patients with the heterozygous genotype, -13910C/T, had high glucose elevations, and no gene-dose relationship was observed when comparing maximal glucose increases for cases with -13910C/T and -13910T/T genotypes. CONCLUSIONS Genotyping could replace lactose challenge as a first-stage screening test in adults of European descent, but should be used together with tolerance tests in children and patients where secondary lactose intolerance is suspected.
Collapse
Affiliation(s)
- Peter Ridefelt
- Department of Medical Sciences, Clinical Chemistry, University Hospital, Uppsala, Sweden.
| | | |
Collapse
|
642
|
|
643
|
Wang Z, Wang B, Tang K, Lee EJD, Chong SS, Lee CGL. A functional polymorphism within the MRP1 gene locus identified through its genomic signature of positive selection. Hum Mol Genet 2005; 14:2075-87. [PMID: 15944197 DOI: 10.1093/hmg/ddi212] [Citation(s) in RCA: 47] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023] Open
Abstract
Searching for genomic evidence of positive selection has been hailed as an attractive strategy for identifying functional polymorphisms. Here, we demonstrate the feasibility of identifying functional polymorphism at the MRP1 gene locus using this strategy. The 190 kDa MRP1 protein is an efflux pump that regulates the accumulation of xenobiotics and drugs in cells. Functional sequence variations within this gene might account, in part, for inter-individual and population differences in drug response. To identify single nucleotide polymorphisms (SNPs) within the MRP1 gene with potentially important functional significance, we scanned for genomic signatures of recent positive selection at this locus in approximately 480 individuals sampled from the Chinese, Malay, Indian, European-American and African-American populations. The genetic profile of SNPs at this locus revealed high haplotype diversity and weak linkage disequilibrium (LD). Despite this weak LD, major allele G of SNP 5'FR/G-260C contained within a high frequency haplotype exhibited extended haplotype homozygosity across 135 kb in European-Americans. Using two independent genomic tests, long-range haplotype (LRH) test and the F(ST) statistic, we found statistical evidence of positive selection for this allele in the European-American population. When this SNP was recapitulated in an in vitro MRP1 promoter-reporter assay, significantly lower activity was observed from the G-containing promoter when compared with the C-containing promoter in all four cell lines that we tested (P<0.01). These observations confirm the power of this strategy in identifying functionally different alleles of genes and suggest that the different alleles at this SNP locus in the MRP1 gene may account, in part, for inter-individual variations and population differences in drug response.
Collapse
Affiliation(s)
- Zihua Wang
- Department of Biochemistry, National University of Singapore, Singapore
| | | | | | | | | | | |
Collapse
|
644
|
Ronald J, Akey JM. Genome-wide scans for loci under selection in humans. Hum Genomics 2005; 2:113-25. [PMID: 16004726 PMCID: PMC3525256 DOI: 10.1186/1479-7364-2-2-113] [Citation(s) in RCA: 37] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2005] [Accepted: 01/31/2005] [Indexed: 11/10/2022] Open
Abstract
Natural selection, which can be defined as the differential contribution of genetic variants to future generations, is the driving force of Darwinian evolution. Identifying regions of the human genome that have been targets of natural selection is an important step in clarifying human evolutionary history and understanding how genetic variation results in phenotypic diversity, it may also facilitate the search for complex disease genes. Technological advances in high-throughput DNA sequencing and single nucleotide polymorphism genotyping have enabled several genome-wide scans of natural selection to be undertaken. Here, some of the observations that are beginning to emerge from these studies will be reviewed, including evidence for geographically restricted selective pressures (ie local adaptation) and a relationship between genes subject to natural selection and human disease. In addition, the paper will highlight several important problems that need to be addressed in future genome-wide studies of natural selection.
Collapse
Affiliation(s)
- James Ronald
- University of Washington, Seattle, Washington, USA
| | | |
Collapse
|
645
|
Coelho M, Luiselli D, Bertorelle G, Lopes AI, Seixas S, Destro-Bisol G, Rocha J. Microsatellite variation and evolution of human lactase persistence. Hum Genet 2005; 117:329-39. [PMID: 15928901 DOI: 10.1007/s00439-005-1322-z] [Citation(s) in RCA: 92] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2005] [Accepted: 04/08/2005] [Indexed: 12/27/2022]
Abstract
The levels of haplotype diversity within the lineages defined by two single-nucleotide polymorphisms (SNPs) (-13910 C/T and -22018 G/A) associated with human lactase persistence were assessed with four fast-evolving microsatellite loci in 794 chromosomes from Portugal, Italy, Fulbe from Cameroon, São Tomé and Mozambique. Age estimates based on the intraallelic microsatellite variation indicate that the -13910*T allele, which is more tightly associated with lactase persistence, originated in Eurasia before the Neolithic and after the emergence of modern humans outside Africa. We detected significant departures from neutrality for the -13910*T variant in geographically and evolutionary distant populations from southern Europe (Portuguese and Italians) and Africa (Fulbe) by using a neutrality test based on the congruence between the frequency of the allele and the levels of intraallelic variability measured by the number of mutations in adjacent microsatellites. This result supports the role of selection in the evolution of lactase persistence, ruling out possible confounding effects from recombination suppression and population history. Reevaluation of the available evidence on variation of the -13910 and -22018 loci indicates that lactase persistence probably originated from different mutations in Europe and most of Africa, even if 13910*T is not the causal allele, suggesting that selective pressure could have promoted the convergent evolution of the trait. Our study shows that a limited number of microsatellite loci may provide sufficient resolution to reconstruct key aspects of the evolutionary history of lactase persistence, providing an alternative to approaches based on large numbers of SNPs.
Collapse
Affiliation(s)
- Margarida Coelho
- Instituto de Patologia e Imunologia Molecular, Universidade do Porto (IPATIMUP), R. Dr. Roberto Frias s/n, 4200-465 Porto, Portugal
| | | | | | | | | | | | | |
Collapse
|
646
|
Pearce R, Malisa A, Kachur SP, Barnes K, Sharp B, Roper C. Reduced Variation Around Drug-Resistant dhfr Alleles in African Plasmodium falciparum. Mol Biol Evol 2005; 22:1834-44. [PMID: 15917494 DOI: 10.1093/molbev/msi177] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
Abstract
We have measured microsatellite diversity at 26 markers around the dhfr gene in pyrimethamine-sensitive and -resistant parasites collected in southeast Africa. Through direct comparison with diversity on sensitive chromosomes we have found significant loss of diversity across a region of 70 kb around the most highly resistant allele which is evidence of a selective sweep attributable to selection through widespread use of pyrimethamine (in combination with sulfadoxine) as treatment for malaria. Retrospective analysis through four years of direct and continuous selection from use of sulfadoxine-pyrimethamine as first-line malaria treatment on a Plasmodium falciparum population in KwaZulu Natal, South Africa, has revealed how recombination significantly narrowed the margins of the selective sweep over time. A deterministic model incorporating selection coefficients measured during the same interval indicates that the transition was toward a state of recombination-selection equilibrium. We compared loss of diversity around the same resistance allele in two populations at either extreme of the range of entomological inoculation rates (EIRs), namely, under one infective bite per year in Mpumalanga, South Africa, and more than one per day in southern Tanzania. EIRs determine effective recombination rates and are expected to profoundly influence the dimensions of the selective sweep. Surprisingly, the dimensions were broadly consistent across both populations. We conclude that despite different recombination rates and contrasting drug selection histories in neighboring countries, the region-wide movement of resistant parasites has played a key role in the establishment of resistance in these populations and the dimensions of the selective sweep are dominated by the influence of high initial starting frequencies.
Collapse
Affiliation(s)
- Richard Pearce
- London School of Hygiene and Tropical Medicine, Pathogen Molecular Biology Unit, Department of Infectious Tropical Diseases, London, United Kingdom.
| | | | | | | | | | | |
Collapse
|
647
|
Myles S, Bouzekri N, Haverfield E, Cherkaoui M, Dugoujon JM, Ward R. Genetic evidence in support of a shared Eurasian-North African dairying origin. Hum Genet 2005; 117:34-42. [PMID: 15806398 DOI: 10.1007/s00439-005-1266-3] [Citation(s) in RCA: 42] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2004] [Accepted: 12/23/2004] [Indexed: 02/08/2023]
Abstract
The process by which pastoralism and agriculture spread from the Fertile Crescent over the past 10,000 years has been the subject of intense investigation by geneticists, linguists and archaeologists. However, no consensus has been reached as to whether this Neolithic transition is best characterized by a demic diffusion (with a significant genetic input from migrating farmers) or a cultural diffusion (without substantial migration of farmers). Milk consumption and thus lactose tolerance are assumed to have spread with pastoralism and we propose that by looking at the relevant mutations in and around the lactase gene in human populations, we can gain insight into the origin(s) and spread of dairying. We genotyped the putatively causal allele for lactose tolerance (-13910T) and constructed haplotypes from several polymorphisms in and around the lactase gene (LCT) in three North African Berber populations and compared our results with previously published data. We found that the frequency of the -13910T allele predicts the frequency of lactose tolerance in several Eurasian and North African Berber populations but not in most sub-Saharan African populations. Our analyses suggest that contemporary Berber populations possess the genetic signature of a past migration of pastoralists from the Middle East and that they share a dairying origin with Europeans and Asians, but not with sub-Saharan Africans.
Collapse
Affiliation(s)
- Sean Myles
- Institute of Biological Anthropology, University of Oxford, UK.
| | | | | | | | | | | |
Collapse
|
648
|
|
649
|
Kelley J, de Bono B, Trowsdale J. IRIS: A database surveying known human immune system genes. Genomics 2005; 85:503-11. [PMID: 15780753 DOI: 10.1016/j.ygeno.2005.01.009] [Citation(s) in RCA: 48] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2004] [Revised: 01/19/2005] [Accepted: 01/20/2005] [Indexed: 11/29/2022]
Abstract
We have compiled an online database of known human defense genes: the Immunogenetic Related Information Source (IRIS). As of October 1, 2004, there are 1562 immune genes recorded in IRIS, representing 7% of the human genome. This resource contains searchable information including chromosomal location, sequence data, and a curated functional annotation for each entry. We used IRIS as a basis for analyzing the composition and characteristics of the immune genome, such as gene clustering, polymorphism, and relationship to disease. High protein sequence similarity correlated inversely with distance between immune genes, consistent with clustering of duplicated loci. We also found that, even though some immune genes exhibit high levels of polymorphism, such as MHC class I, the range of levels of polymorphism in immune genes is similar to that of nonimmune genes. Approximately 20% of immune genes have a known disease association. IRIS is available online at .
Collapse
Affiliation(s)
- James Kelley
- Department of Pathology, Immunology Division, University of Cambridge, Cambridge CB2 1QP, UK
| | | | | |
Collapse
|
650
|
Abstract
The Human Genome Project, in accomplishing its goal of sequencing one human genome, heralded a new era of research, a component of which is the systematic study of human genetic variation. Despite delays, the Human Genome Diversity Project has started to make progress in understanding the patterns of this variation and its causes, and also promises to provide important information for biomedical studies.
Collapse
Affiliation(s)
- L Luca Cavalli-Sforza
- L. Luca Cavalli-Sforza is at the Genetics Department, Stanford Medical School, Stanford, California 94305, USA.
| |
Collapse
|