51
|
Abstract
Comparison of the levels of nucleotide diversity in humans and apes may provide valuable information for inferring the demographic history of these species, the effect of social structure on genetic diversity, patterns of past migration, and signatures of past selection events. Previous DNA sequence data from both the mitochondrial and the nuclear genomes suggested a much higher level of nucleotide diversity in the African apes than in humans. Noting that the nuclear DNA data from the apes were very limited, we previously conducted a DNA polymorphism study in humans and another in chimpanzees and bonobos, using 50 DNA segments randomly chosen from the noncoding, nonrepetitive parts of the human genome. The data revealed that the nucleotide diversity (pi) in bonobos (0.077%) is actually lower than that in humans (0.087%) and that pi in chimpanzees (0.134%) is only 50% higher than that in humans. In the present study we sequenced the same 50 segments in 15 western lowland gorillas and estimated pi to be 0.158%. This is the highest value among the African apes but is only about two times higher than that in humans. Interestingly, available mtDNA sequence data also suggest a twofold higher nucleotide diversity in gorillas than in humans, but suggest a threefold higher nucleotide diversity in chimpanzees than in humans. The higher mtDNA diversity in chimpanzees might be due to the unique pattern in the evolution of chimpanzee mtDNA. From the nuclear DNA pi values, we estimated that the long-term effective population sizes of humans, bonobos, chimpanzees, and gorillas are, respectively, 10,400, 12,300, 21,300, and 25,200.
Collapse
Affiliation(s)
- Ning Yu
- Department of Ecology and Evolution, University of Chicago, Chicago, Illinois 60637, USA
| | | | | | | | | |
Collapse
|
52
|
Marth GT, Czabarka E, Murvai J, Sherry ST. The allele frequency spectrum in genome-wide human variation data reveals signals of differential demographic history in three large world populations. Genetics 2004; 166:351-72. [PMID: 15020430 PMCID: PMC1470693 DOI: 10.1534/genetics.166.1.351] [Citation(s) in RCA: 232] [Impact Index Per Article: 11.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
We have studied a genome-wide set of single-nucleotide polymorphism (SNP) allele frequency measures for African-American, East Asian, and European-American samples. For this analysis we derived a simple, closed mathematical formulation for the spectrum of expected allele frequencies when the sampled populations have experienced nonstationary demographic histories. The direct calculation generates the spectrum orders of magnitude faster than coalescent simulations do and allows us to generate spectra for a large number of alternative histories on a multidimensional parameter grid. Model-fitting experiments using this grid reveal significant population-specific differences among the demographic histories that best describe the observed allele frequency spectra. European and Asian spectra show a bottleneck-shaped history: a reduction of effective population size in the past followed by a recent phase of size recovery. In contrast, the African-American spectrum shows a history of moderate but uninterrupted population expansion. These differences are expected to have profound consequences for the design of medical association studies. The analytical methods developed for this study, i.e., a closed mathematical formulation for the allele frequency spectrum, correcting the ascertainment bias introduced by shallow SNP sampling, and dealing with variable sample sizes provide a general framework for the analysis of public variation data.
Collapse
Affiliation(s)
- Gabor T Marth
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20894, USA.
| | | | | | | |
Collapse
|
53
|
Lepretre AC, Patrat C, Jouannet P, Bienvenu T. Mutation analysis of the BOULE gene in men with non-obstructive azoospermia: identification of a novel polymorphic variant in the black population. ACTA ACUST UNITED AC 2004; 27:301-3. [PMID: 15379971 DOI: 10.1111/j.1365-2605.2004.00464.x] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
The BOULE gene is a member of the 'deleted in azoospermia'DAZ family. As in flies and worms, disruption of the BOULE homologues leads to meiotic defects during the first meiotic division, we screened the coding region of the BOULE gene from 40 infertile men with non-obstructive azoospermia or severe oligoasthenoteratozoospermia by denaturing high-pressure liquid chromatography and direct sequencing in order to make progress in elucidating the aetiology of male infertility. Two infertile patients were heterozygous for a novel G to C transversion in exon 2 resulting in a Q2E amino acid substitution. As these two infertile men were from African origin, screening of fertile African subjects identified this novel variant in two fertile male subjects suggesting that this novel Q2E substitution had non-pathologic role. Taking into account the size of our sample, we conclude that BOULE coding sequence mutations are not an important factor in the aetiology of azoospermia.
Collapse
Affiliation(s)
- Anne Claire Lepretre
- Laboratoire de Biochimie et Génétique Moléculaires, CECOS, Hopital Cochin (AP-HP), Universite Paris V, Paris, France
| | | | | | | |
Collapse
|
54
|
Wang X, Thomas SD, Zhang J. Relaxation of selective constraint and loss of function in the evolution of human bitter taste receptor genes. Hum Mol Genet 2004; 13:2671-8. [PMID: 15367488 DOI: 10.1093/hmg/ddh289] [Citation(s) in RCA: 121] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Bitter taste perception prevents mammals from ingesting poisonous substances because many toxins taste bitter and cause aversion. We hypothesize that human bitter taste receptor (TAS2R) genes might be relaxed from selective constraints because of the change in diet, use of fire and reliance on other means of toxin avoidance that emerged in human evolution. Here, we examine the intra-specific variations of all 25 genes of the human TAS2R repertoire. Our data show hallmarks of neutral evolution, including similar rates of synonymous (d(S)) and non-synonymous (d(N)) nucleotide changes among rare polymorphisms, common polymorphisms and substitutions; no variation in d(N)/d(S) among functional domains; segregation of pseudogene alleles within species and fixation of loss-of-function mutations. These results, together with previous findings of large numbers of loss-of-function mutations in olfactory, pheromonal and visual sensory genes in humans, suggest surprisingly reduced sensory capabilities of humans in comparison with many other mammals.
Collapse
Affiliation(s)
- Xiaoxia Wang
- Department of Ecology and Evolutionary Biology, University of Michigan, 830 North University Avenue, Ann Arbor, MI 48109, USA
| | | | | |
Collapse
|
55
|
Lopes AM, Calafell F, Amorim A. Microsatellite variation and evolutionary history of PCDHX/Y gene pair within the Xq21.3/Yp11.2 hominid-specific homology block. Mol Biol Evol 2004; 21:2092-101. [PMID: 15297598 DOI: 10.1093/molbev/msh218] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
To better understand the evolutionary dynamics of repetitive sequences in human sex chromosomes, we have analyzed seven new X/Y homologous microsatellites located within PCDHX/Y, one of the two recently described gene pairs in the Xq21.3/Yp11.2 hominid-specific homology block, in samples from Portugal and Mozambique. Sharp differences were observed on X/Y allele distributions, concerning both the presence of private alleles and a different modal repeat length for X-linked and Y-linked markers, and this difference was statistically significant. Higher diversity was found in X-linked microsatellites than in their Y chromosome counterparts; when comparing populations, Mozambicans showed more allele diversity for the X chromosome, but the contrary was true for the Y chromosome microsatellites. Evolutionary patterns, relying on intragenic PCDHX/Y SNPs, also revealed distinct scenarios for X and Y chromosomes. Greater microsatellite diversity was displayed by African X chromosomes within the most common haplotypes shared by both populations, whereas higher microsatellite diversity was found in Portugal for the ancestral Y chromosome haplotype. The most frequent PCDHY haplotype in Portuguese was the derived one, and it was not found in Mozambicans. TMRCA estimated by the rho parameter resulted in 13,700 years (7,500-20,000 years), which is consistent with a recent, post-Out-of-Africa origin for this haplotype. In conclusion, the newly described microsatellite loci generally displayed greater X-linked to Y-linked diversity and this pattern was also detected with slower evolving markers, with a remarkable differentiation between populations observed for Y chromosome haplotypes and, thus, greater divergence among Y chromosomes in human populations.
Collapse
Affiliation(s)
- Alexandra M Lopes
- IPATIMUP, Instituto de Patologia e Imunologia Molecular da Universidade do Porto, Portugal.
| | | | | |
Collapse
|
56
|
Pearson OM. Has the combination of genetic and fossil evidence solved the riddle of modern human origins? Evol Anthropol 2004. [DOI: 10.1002/evan.20017] [Citation(s) in RCA: 37] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
|
57
|
Bosch E, Hurles ME, Navarro A, Jobling MA. Dynamics of a human interparalog gene conversion hotspot. Genome Res 2004; 14:835-44. [PMID: 15123583 PMCID: PMC479110 DOI: 10.1101/gr.2177404] [Citation(s) in RCA: 62] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2003] [Accepted: 02/16/2004] [Indexed: 11/25/2022]
Abstract
Gene conversion between paralogs can alter their patterns of sequence identity, thus obscuring their evolutionary relationships and affecting their propensity to sponsor genomic rearrangements. The details of this important process are poorly understood in the human genome because allelic diversity complicates the interpretation of interparalog sequence differences. Here we exploit the haploid nature of the Y chromosome, which obviates complicating interallelic processes, together with its known phylogeny, to understand the dynamics of conversion between two directly repeated HERVs flanking the 780-kb AZFa region on Yq. Sequence analysis of a 787-bp segment of each of the HERVs in 36 Y chromosomes revealed one of the highest nucleotide diversities in the human genome, as well as evidence of a complex patchwork of highly directional gene conversion events. The rate of proximal-to-distal conversion events was estimated as 2.4 x 10(-4) to 1.2 x 10(-3) per generation (3.9 x 10(-7) to 1.9 x 10(-6) per base per generation), and the distal-to-proximal rate as about one-twentieth of this. Minimum observed conversion tract lengths ranged from 1 to 158 bp and maximum lengths from 19 to 1365 bp, with an estimated mean of 31 bp. Analysis of great ape homologs shows that conversion in this hotspot has a deep evolutionary history.
Collapse
Affiliation(s)
- Elena Bosch
- Department of Genetics, University of Leicester, Leicester LE1 7RH, UK
| | | | | | | |
Collapse
|
58
|
Wirth T, Wang X, Linz B, Novick RP, Lum JK, Blaser M, Morelli G, Falush D, Achtman M. Distinguishing human ethnic groups by means of sequences from Helicobacter pylori: lessons from Ladakh. Proc Natl Acad Sci U S A 2004; 101:4746-51. [PMID: 15051885 PMCID: PMC387319 DOI: 10.1073/pnas.0306629101] [Citation(s) in RCA: 119] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023] Open
Abstract
The history of mankind remains one of the most challenging fields of study. However, the emergence of anatomically modern humans has been so recent that only a few genetically informative polymorphisms have accumulated. Here, we show that DNA sequences from Helicobacter pylori, a bacterium that colonizes the stomachs of most humans and is usually transmitted within families, can distinguish between closely related human populations and are superior in this respect to classical human genetic markers. H. pylori from Buddhists and Muslims, the two major ethnic communities in Ladakh (India), differ in their population-genetic structure. Moreover, the prokaryotic diversity is consistent with the Buddhists having arisen from an introgression of Tibetan speakers into an ancient Ladakhi population. H. pylori from Muslims contain a much stronger ancestral Ladakhi component, except for several isolates with an Indo-European signature, probably reflecting genetic flux from the Near East. These signatures in H. pylori sequences are congruent with the recent history of population movements in Ladakh, whereas similar signatures in human microsatellites or mtDNA were only marginally significant. H. pylori sequence analysis has the potential to become an important tool for unraveling short-term genetic changes in human populations.
Collapse
Affiliation(s)
- Thierry Wirth
- Department of Molecular Biology, Max-Planck-Institut für Infektionsbiologie, Schumannstrasse 21/22, 10117 Berlin, Germany.
| | | | | | | | | | | | | | | | | |
Collapse
|
59
|
Hughes JF, Coffin JM. Human endogenous retrovirus K solo-LTR formation and insertional polymorphisms: implications for human and viral evolution. Proc Natl Acad Sci U S A 2004; 101:1668-72. [PMID: 14757818 PMCID: PMC341815 DOI: 10.1073/pnas.0307885100] [Citation(s) in RCA: 153] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
Abstract
Human endogenous retroviruses (HERVs) are a potential source of genetic diversity in the human genome. Although many of these elements have been inactivated over time by the accumulation of deleterious mutations or internal recombination leading to solo-LTR formation, several members of the HERV-K family have been identified that remain nearly intact and probably represent recent integration events. To determine whether HERV-K elements have caused recent changes in the human genome, we have undertaken a study of the level of HERV-K polymorphism that exists in the human population. By using a high-resolution unblotting technique, we analyzed 13 human-specific HERV-K elements in 18 individuals. We found that solo LTRs have formed at five of these loci. These results enable the estimation of HERV solo-LTR formation in the human genome and indicate that these events occur much more frequently than described in inbred mice. Detailed sequence analysis of one provirus shows that solo-LTR formation occurred at least three separate times in recent history. An unoccupied preintegration site also was present at this locus in two individuals, indicating that although the age of this provirus is estimated to be approximately 1.2 million years, it has not yet become fixed in the human population.
Collapse
Affiliation(s)
- Jennifer F Hughes
- Department of Molecular Biology and Microbiology and Program in Genetics, Tufts University School of Medicine, 136 Harrison Avenue, Boston, MA 02111, USA
| | | |
Collapse
|
60
|
Kohn MH, Fang S, Wu CI. Inference of Positive and Negative Selection on the 5′ Regulatory Regions of Drosophila Genes. Mol Biol Evol 2004; 21:374-83. [PMID: 14660692 DOI: 10.1093/molbev/msh026] [Citation(s) in RCA: 49] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open
Abstract
Both positive selection and negative selection have been shown to drive the evolution of coding regions. It is of interest to know if the corresponding 5' regions of genes may be subjected to selection of comparable intensities. For such a comparison, we chose the Accessory gland protein (Acp) genes as our test case. About 700 bp and 600 bp for the 5' and coding regions, respectively, of eight previously unstudied genes were sequenced from 21 isogenic lines of D. melanogaster and one line from D. simulans. The ratio of divergence at the amino-acid replacement sites (A) over that at the synonymous sites (S) was twice the ratio for common polymorphism. Interestingly, the 5' region shows the same trend, with the 5'/S divergence ratio being 1.8 times higher than the 5'/S ratio for common polymorphism. There are several possible explanations for the 5'/S ratios, including demography, negative selection, and positive selection. Under normal conditions, positive selection is the most likely explanation. If that is true, about 45 to 50 percent of all fixed differences at both the replacement and 5' sites were adaptive, even though the substitution rate in the former is only half that of the latter (K(A)/K(S) approximately 0.3 vs. K(5')/K(S) approximately 0.6). As previous analyses have indicated, the inclusion of slightly deleterious polymorphism confounds the inference of positive selection. The analysis of published polymorphism data covering 97 verified 5' regions of Drosophila suggests more pronounced selective constraint on the 5' untranslated region and the core promoter (together corresponding to approximately 200 bp in this data set) when compared to the more distal portion of the 5' region of genes.
Collapse
Affiliation(s)
- Michael H Kohn
- Department of Ecology & Evolution, The University of Chicago, USA.
| | | | | |
Collapse
|
61
|
Abstract
It has been suggested that recombination may be mutagenic, which, if true, would inflate intraspecies diversity and interspecies silent divergence in regions of high recombination. Here, we test this hypothesis comparing human/orangutan genome-wide non-coding divergence (K) to that in the pseudoautosomal genes which were reported to recombine much more frequently than the rest of the genome. We demonstrate that, compared to the average human/orangutan non-coding divergence (K=3%), the substitution rate is significantly elevated in the introns of SHOX (K=5.7%), PPP2R3L (K=8.7%) and ASMT (K=6.5%) genes located in the human and orangutan Xp/Yp pseudoautosomal region (p-PAR), where recombination is over 20-fold higher than the genomic average. On the other hand, human/orangutan non-coding divergence at the Xp/Yp pseudoautosomal boundary (K=3.5%) and in the SYBL1 gene (K=2.7%), located in the human Xq/Yq pseudoautosomal region (q-PAR), where recombination is known to be less frequent than in p-PAR, was not significantly higher than the genome average. The data are consistent with the hypothesis that recombination may be mutagenic.
Collapse
Affiliation(s)
- Dmitry A Filatov
- School of Biosciences, University of Birmingham, Edgbaston, Birmingham, B15 2TT, UK.
| | | |
Collapse
|
62
|
Tishkoff SA, Verrelli BC. Patterns of human genetic diversity: implications for human evolutionary history and disease. Annu Rev Genomics Hum Genet 2003; 4:293-340. [PMID: 14527305 DOI: 10.1146/annurev.genom.4.070802.110226] [Citation(s) in RCA: 234] [Impact Index Per Article: 11.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Since the completion of the human genome sequencing project, the discovery and characterization of human genetic variation is a principal focus for future research. Comparative studies across ethnically diverse human populations and across human and nonhuman primate species is important for reconstructing human evolutionary history and for understanding the genetic basis of human disease. In this review, we summarize data on patterns of human genetic diversity and the evolutionary forces (mutation, genetic drift, migration, and selection) that have shaped these patterns of variation across both human populations and the genome. African population samples typically have higher levels of genetic diversity, a complex population substructure, and low levels of linkage disequilibrium (LD) relative to non-African populations. We discuss these differences and their implications for mapping disease genes and for understanding how population and genomic diversity have been important in the evolution, differentiation, and adaptation of humans.
Collapse
Affiliation(s)
- Sarah A Tishkoff
- Department of Biology, University of Maryland, College Park, Maryland 20742, USA.
| | | |
Collapse
|
63
|
Kittles RA, Weiss KM. Race, ancestry, and genes: implications for defining disease risk. Annu Rev Genomics Hum Genet 2003; 4:33-67. [PMID: 14527296 DOI: 10.1146/annurev.genom.4.070802.110356] [Citation(s) in RCA: 121] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Geneticists are interested in finding genes associated with disease. Because of widespread health disparities, race is a variable that is often said to be relevant in this context. The idea is that members of a preconceived "race" share common ancestry that may include genetic risk factors. Human variation has been shaped by the long-term processes of population history, and population samples that reflect that history carry statistical information about shared genetic variation or "ancestry." But race is an elusive concept and a term difficult even to define rigorously. Unfortunately, these problems are neither new nor related to recent genetic knowledge. Race is also one of the most politically charged subjects in American life because its associated sociocultural component has notoriously led to categorical treatment that has been misleading and politically misused. There are ways in which the concept of race (whether or not the term is used) can be a legitimate tool in the search for disease-associated genes. But in that context race reflects deeply confounded cultural as well as biological factors, and a careful distinction must be made between race as a statistical risk factor and causal genetic variables.
Collapse
Affiliation(s)
- Rick A Kittles
- National Human Genome Center, Howard University, Washington, D.C. 20060, USA.
| | | |
Collapse
|
64
|
Yu N, Jensen-Seaman MI, Chemnick L, Kidd JR, Deinard AS, Ryder O, Kidd KK, Li WH. Low nucleotide diversity in chimpanzees and bonobos. Genetics 2003; 164:1511-8. [PMID: 12930756 PMCID: PMC1462640 DOI: 10.1093/genetics/164.4.1511] [Citation(s) in RCA: 119] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Comparison of the levels of nucleotide diversity in humans and apes may provide much insight into the mechanisms of maintenance of DNA polymorphism and the demographic history of these organisms. In the past, abundant mitochondrial DNA (mtDNA) polymorphism data indicated that nucleotide diversity (pi) is more than threefold higher in chimpanzees than in humans. Furthermore, it has recently been claimed, on the basis of limited data, that this is also true for nuclear DNA. In this study we sequenced 50 noncoding, nonrepetitive DNA segments randomly chosen from the nuclear genome in 9 bonobos and 17 chimpanzees. Surprisingly, the pi value for bonobos is only 0.078%, even somewhat lower than that (0.088%) for humans for the same 50 segments. The pi values are 0.092, 0.130, and 0.082% for East, Central, and West African chimpanzees, respectively, and 0.132% for all chimpanzees. These values are similar to or at most only 1.5 times higher than that for humans. The much larger difference in mtDNA diversity than in nuclear DNA diversity between humans and chimpanzees is puzzling. We speculate that it is due mainly to a reduction in effective population size (N(e)) in the human lineage after the human-chimpanzee divergence, because a reduction in N(e) has a stronger effect on mtDNA diversity than on nuclear DNA diversity. Sequence data from this article have been deposited with the GenBank Data libraries under accession nos. AY 275957-AY 277244.
Collapse
Affiliation(s)
- Ning Yu
- Department of Ecology and Evolution, University of Chicago, 1101 E. 57th Street, Chicago, IL 60637, USA
| | | | | | | | | | | | | | | |
Collapse
|
65
|
Rannala B, Yang Z. Bayes estimation of species divergence times and ancestral population sizes using DNA sequences from multiple loci. Genetics 2003; 164:1645-56. [PMID: 12930768 PMCID: PMC1462670 DOI: 10.1093/genetics/164.4.1645] [Citation(s) in RCA: 689] [Impact Index Per Article: 32.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The effective population sizes of ancestral as well as modern species are important parameters in models of population genetics and human evolution. The commonly used method for estimating ancestral population sizes, based on counting mismatches between the species tree and the inferred gene trees, is highly biased as it ignores uncertainties in gene tree reconstruction. In this article, we develop a Bayes method for simultaneous estimation of the species divergence times and current and ancestral population sizes. The method uses DNA sequence data from multiple loci and extracts information about conflicts among gene tree topologies and coalescent times to estimate ancestral population sizes. The topology of the species tree is assumed known. A Markov chain Monte Carlo algorithm is implemented to integrate over uncertain gene trees and branch lengths (or coalescence times) at each locus as well as species divergence times. The method can handle any species tree and allows different numbers of sequences at different loci. We apply the method to published noncoding DNA sequences from the human and the great apes. There are strong correlations between posterior estimates of speciation times and ancestral population sizes. With the use of an informative prior for the human-chimpanzee divergence date, the population size of the common ancestor of the two species is estimated to be approximately 20,000, with a 95% credibility interval (8000, 40,000). Our estimates, however, are affected by model assumptions as well as data quality. We suggest that reliable estimates have yet to await more data and more realistic models.
Collapse
Affiliation(s)
- Bruce Rannala
- Department of Medical Genetics, University of Alberta, Edmonton, Alberta T6G 2H7, Canada
| | | |
Collapse
|
66
|
Abstract
Despite the remarkable developments in molecular biology over the past three decades, anthropological genetics has had only limited impact on systematics in human evolution. Genetics offers the opportunity to objectively test taxonomies based on morphology and may be used to supplement conventional approaches to hominid systematics. Our analyses, examining chromosomes and 46 estimates of genetic distance, indicate there may have been only around 4 species on the direct line to modern humans and 5 species in total. This contrasts with current taxonomies recognising up to 23 species. The genetic proximity of humans and chimpanzees has been used to suggest these species are congeneric. Our analysis of genetic distances between them is consistent with this proposal. It is time that chimpanzees, living humans and all fossil humans be classified in Homo. The creation of new genera can no longer be a solution to the complexities of fossil morphologies. Published genetic distances between common chimpanzees and bonobos, along with evidence for interbreeding, suggest they should be assigned to a single species. The short distance between humans and chimpanzees also places a strict limit on the number of possible evolutionary 'side branches' that might be recognised on the human lineage. All fossil taxa were genetically very close to each other and likely to have been below congeneric genetic distances seen for many mammals. Our estimates of genetic divergence suggest that periods of around 2 million years are required to produce sufficient genetic distance to represent speciation. Therefore, Neanderthals and so-called H. erectus were genetically so close to contemporary H. sapiens they were unlikely to have been separate species. Thus, it is likely there was only one species of human (H. sapiens) for most of the last 2 million years. We estimate the divergence time of H. sapiens from 16 genetic distances to be around 1.7 Ma which is consistent with evidence for the earliest migration out of Africa. These findings call into question the mitochondrial "African Eve" hypothesis based on a far more recent origin for H. sapiens and show that humans did not go through a bottleneck in their recent evolutionary history. Given the large offset in evolutionary rates of molecules and morphology seen in human evolution, Homo species are likely to be characterised by high levels of morphological variation and low levels of genetic variability. Thus, molecular data suggest the limits for intraspecific morphological variation used by many palaeoanthropologists have been set too low. The role of phenotypic plasticity has been greatly underestimated in human evolution. We call into question the use of mtDNA for studies of human evolution. This DNA is under strong selection, which violates the assumption of selective neutrality. This issue should be addressed by geneticists, including a reassessment of its use for molecular clocks. There is a need for greater cooperation between palaeoanthropologists and anthropological geneticists to better understand human evolution and to bring palaeoanthropology into the mainstream of evolutionary biology.
Collapse
Affiliation(s)
- D Curnoe
- Department of Archaeology and Natural History, Research School of Pacific and Asian Studies, Australian National University, Canberra ACT 0200, Australia.
| | | |
Collapse
|
67
|
Zhao Z, Fu YX, Hewett-Emmett D, Boerwinkle E. Investigating single nucleotide polymorphism (SNP) density in the human genome and its implications for molecular evolution. Gene 2003; 312:207-13. [PMID: 12909357 DOI: 10.1016/s0378-1119(03)00670-x] [Citation(s) in RCA: 112] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
We investigated the single nucleotide polymorphism (SNP) density across the human genome and in different genic categories using two SNP databases: Celera's CgsSNP, which includes SNPs identified by comparing genomic sequences, and Celera's RefSNP, which includes SNPs from a variety of sources and is biased toward disease-associated genes. Based on CgsSNP, the average numbers of SNPs per 10 kb was 8.33, 8.44, and 8.09 in the human genome, in intergenic regions, and in genic regions, respectively. In genic regions, the SNP density in intronic, exonic and adjoining untranslated regions was 8.21, 5.28, and 7.51 SNPs per 10 kb, respectively. The pattern of SNP density based on RefSNP was different from that based on CgsSNP, emphasizing its utility for genotype-phenotype association studies but not for most population genetic studies. The number of SNPs per chromosome was correlated with chromosome length, but the density of SNPs estimated by CgsSNP was not significantly correlated with the GC content of the chromosome. Based on CgsSNP, the ratio of nonsense to missense mutations (0.027), the ratio of missense to silent mutations (1.15), and the ratio of non-synonymous to synonymous mutations (1.18) was less than half of that expected in a human protein coding sequence under the neutral mutation theory, reflecting a role for natural selection, especially purifying selection.
Collapse
Affiliation(s)
- Zhongming Zhao
- Human Genetics Center, 1200 Herman Pressler, Suite E447, University of Texas Health Science Center at Houston, Houston, TX 77030, USA
| | | | | | | |
Collapse
|
68
|
Dupanloup I, Pereira L, Bertorelle G, Calafell F, Prata MJ, Amorim A, Barbujani G. A recent shift from polygyny to monogamy in humans is suggested by the analysis of worldwide Y-chromosome diversity. J Mol Evol 2003; 57:85-97. [PMID: 12962309 DOI: 10.1007/s00239-003-2458-x] [Citation(s) in RCA: 74] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Abstract
Molecular genetic data contain information on the history of populations. Evidence of prehistoric demographic expansions has been detected in the mitochondrial diversity of most human populations and in a Y-chromosome STR analysis, but not in a previous study of 11 Y-chromosome SNPs in Europeans. In this paper, we show that mismatch distributions and tests of mutation/drift equilibrium based on up to 166 Y-chromosome SNPs, in 46 samples from all continents, also fail to support an increase of the male effective population size. Computer simulations show that the low nuclear versus mitochondrial mutation rates cannot explain these results. However, ascertainment bias, i.e., when only highly variable SNP sites are typed, may be concealing any Y SNPs evidence for a recent, but not an ancient, increase in male effective population sizes. The results of our SNP analyses can be reconciled with the expansion of male effective population sizes inferred from STR loci, and with mitochondrial evidence, by admitting that humans were essentially polygynous during much of their history. As a consequence, until recently only a few men may have contributed a large fraction of the Y-chromosome pool at every generation. The number of breeding males may have increased, and the variance of their reproductive success may have decreased, through a recent shift from polygyny to monogamy, which is supported by ethnological data and possibly accompanied the shift from mobile to sedentary communities.
Collapse
Affiliation(s)
- Isabelle Dupanloup
- Dipartimento di Biologia, Università di Ferrara, via L. Borsari 46, 1-44100 Ferrara, Italy
| | | | | | | | | | | | | |
Collapse
|
69
|
John PR, Makova K, Li WH, Jenkins T, Ramsay M. DNA polymorphism and selection at the melanocortin-1 receptor gene in normally pigmented southern African individuals. Ann N Y Acad Sci 2003; 994:299-306. [PMID: 12851329 DOI: 10.1111/j.1749-6632.2003.tb03193.x] [Citation(s) in RCA: 64] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Skin pigmentation is a polygenic multifactorial trait determined by the cumulative effects of multiple genetic variants and environmental factors. Melanocortin-1 receptor (MC1R) is one of the genes involved in pigmentation, and has been implicated in the red hair and pale skin phenotype in human Caucasoid individuals. The present study was undertaken to identify variation at the MC1R locus in normally pigmented individuals in two African populations, sub-Saharan Negroids (22 unrelated individuals) and the San (17 unrelated individuals). The study showed considerable MC1R gene sequence variation with the detection of eight synonymous and three nonsynonymous mutations. This is the first report of nonsynonymous mutations in African individuals in the MC1R gene: L99I was found in a single San individual, S47I was detected in a single Negroid individual, and F196L was detected in five Negroid individuals (5/44; 0.11). The functional significance of these mutations is not known. Three of the eight synonymous mutations found, L106L (CTG --> CTA), F300F (TTC --> TTT), and T314T (ACA --> ACG) (also known as A942G), have been reported previously. T314T was the only variant that showed a significant difference between the Negroid and San populations (0.477 and 0.059, respectively; P = 1.6 x 10(-5)). Its low frequency in the San may be the result of random genetic drift in a population of small size, or selection. Several tests of neutrality of the MC1R coding region in these and other African populations were significant, suggesting that purifying selection (functional constraint) had occurred at this gene locus in Africans. This demonstrates that although some nonsynonymous MC1R mutations are tolerated in individuals with dark skin, this gene has likely played a significant role in the maintenance of dark pigmentation in Africans and normal pigment variation in non-African populations.
Collapse
Affiliation(s)
- Premila R John
- Department of Human Genetics, School of Pathology, National Health Laboratory Service and University of the Witwatersrand, Johannesburg 2000, South Africa
| | | | | | | | | |
Collapse
|
70
|
Salisbury BA, Pungliya M, Choi JY, Jiang R, Sun XJ, Stephens JC. SNP and haplotype variation in the human genome. Mutat Res 2003; 526:53-61. [PMID: 12714183 DOI: 10.1016/s0027-5107(03)00014-9] [Citation(s) in RCA: 111] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
We have surveyed and summarized several aspects of DNA variability among humans. The variation described is the result of mutation followed by a combination of drift, migration and selection bringing the frequencies high enough to be observed. This paper describes what we have learned about how DNA variability differs among genes and populations. We sequenced functional regions of a set of 3950 genes. DNA was sampled from 82 unrelated humans: 20 African-Americans, 20 East Asians, 21 Caucasians, 18 Hispanic-Latinos and 3 Native Americans. Different aspects of variability showed a great deal of concordance. In particular, we studied patterns of single nucleotide polymorphism (SNP) allele and haplotype sharing among the four, large sample populations. We also examined how linkage disequilibrium (LD) between SNPs relates to physical distance in the different populations. It is clear from our findings that while many variants are common to all populations, many others have a more restricted distribution. Research that attempts to find genetic variants that explain phenotypic variants must be careful in their choice of study population.
Collapse
|
71
|
Subramanian S, Kumar S. Neutral substitutions occur at a faster rate in exons than in noncoding DNA in primate genomes. Genome Res 2003; 13:838-44. [PMID: 12727904 PMCID: PMC430942 DOI: 10.1101/gr.1152803] [Citation(s) in RCA: 98] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Point mutation rates in exons (synonymous sites) and noncoding (introns and intergenic) regions are generally assumed to be the same. However, comparative sequence analyses of synonymous substitutions in exons (81 genes) and that of long intergenic fragments (141.3 kbp) of human and chimpanzee genomes reveal a 30%-60% higher mutation rate in exons than in noncoding DNA. We propose a differential CpG content hypothesis to explain this fundamental, and seemingly unintuitive, pattern. We find that the increased exonic rate is the result of the relative overabundance of synonymous sites involved in CpG dinucleotides, as the evolutionary divergence in non-CpG sites is similar in noncoding DNA and synonymous sites of exons. Expectations and predictions of our hypothesis are confirmed in comparisons involving more distantly related species, including human-orangutan, human-baboon, and human-macaque. Our results suggest an underlying mechanism for higher mutation rate in GC-rich genomic regions, predict nonlinear accumulation of mutations in pseudogenes over time, and provide a possible explanation for the observed higher diversity of single nucleotide polymorphisms (SNPs) in the synonymous sites of exons compared to the noncoding regions.
Collapse
Affiliation(s)
- Sankar Subramanian
- Center for Evolutionary Functional Genomics, Arizona Biodesign Institute, Department of Biology, Arizona State University, Tempe, Arizona 85287-1501, USA
| | | |
Collapse
|
72
|
Nielsen R, Signorovitch J. Correcting for ascertainment biases when analyzing SNP data: applications to the estimation of linkage disequilibrium. Theor Popul Biol 2003; 63:245-55. [PMID: 12689795 DOI: 10.1016/s0040-5809(03)00005-4] [Citation(s) in RCA: 115] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
As large-scale sequencing efforts turn from single genome sequencing to polymorphism discovery, single nucleotide polymorphisms (SNPs) are becoming an increasingly important class of population genetic data. But because of the ascertainment biases introduced by many methods of SNP discovery, most SNP data cannot be analyzed using classical population genetic methods. Statistical methods must instead be developed that can explicitly take into account each method of SNP discovery. Here we review some of the current methods for analyzing SNPs and derive sampling distributions for single SNPs and pairs of SNPs for some common SNP discovery schemes. We also show that the ascertainment scheme has a large effect on the estimation of linkage disequilibrium and recombination, and describe some methods of correcting for ascertainment biases when estimating recombination rates from SNP data.
Collapse
Affiliation(s)
- Rasmus Nielsen
- Department of Biometrics, Cornell University, 439 Warren Hall, Ithaca, NY 14853-7801, USA.
| | | |
Collapse
|
73
|
Hawi Z, Lowe N, Kirley A, Gruenhage F, Nöthen M, Greenwood T, Kelsoe J, Fitzgerald M, Gill M. Linkage disequilibrium mapping at DAT1, DRD5 and DBH narrows the search for ADHD susceptibility alleles at these loci. Mol Psychiatry 2003; 8:299-308. [PMID: 12660802 DOI: 10.1038/sj.mp.4001290] [Citation(s) in RCA: 91] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
Abstract
Abnormalities in dopaminergic neurotransmission are now accepted as factors in predisposing to ADHD. Evidence of associations between dopamine transporter gene polymorphism and ADHD was first reported by Cook et al. We confirmed the DAT1 association and also identified two additional susceptibility loci at the DRD5 and DBH. Notably, none of the associated variants at these three genes are known to be expressed. Other variants within or closely mapped to the associated alleles are likely to be relevant. In this investigation, we analyse additional markers creating a high-density map across and flanking these genes, and measure intermarker linkage disequilibrium (LD). None of the newly examined markers were more strongly associated with ADHD. At DAT1, the pattern of intermarker LD and haplotype association with the phenotype between exon 9 and the 3' of the gene suggests that the functional variant at DAT1 may be located to this region. For DRD5, three markers, covering a region of approximately 68 kb including the single DRD5 exon are all associated with disease, and thus do not provide localizing information. However, the data for DBH point to a region close to the centre of the gene. Correlation between D' and physical distance was observed between markers at DAT1 and DRD5 for distances less than 50 kb. This was not the case for DBH, where LD breakdown was observed between the intron 5 and intron 9 polymorphisms although they are only 9 kb apart. Further genetic analysis is unlikely to refine the location of susceptibility variants and functional assessment of variants within associated regions is required.
Collapse
Affiliation(s)
- Z Hawi
- Department of Genetics and Psychiatry, Trinity College, Dublin, Ireland.
| | | | | | | | | | | | | | | | | |
Collapse
|
74
|
Zhang J, Webb DM, Podlaha O. Accelerated protein evolution and origins of human-specific features: Foxp2 as an example. Genetics 2002; 162:1825-35. [PMID: 12524352 PMCID: PMC1462353 DOI: 10.1093/genetics/162.4.1825] [Citation(s) in RCA: 173] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Genes responsible for human-specific phenotypes may have been under altered selective pressures in human evolution and thus exhibit changes in substitution rate and pattern at the protein sequence level. Using comparative analysis of human, chimpanzee, and mouse protein sequences, we identified two genes (PRM2 and FOXP2) with significantly enhanced evolutionary rates in the hominid lineage. PRM2 is a histone-like protein essential to spermatogenesis and was previously reported to be a likely target of sexual selection in humans and chimpanzees. FOXP2 is a transcription factor involved in speech and language development. Human FOXP2 experienced a >60-fold increase in substitution rate and incorporated two fixed amino acid changes in a broadly defined transcription suppression domain. A survey of a diverse group of placental mammals reveals the uniqueness of the human FOXP2 sequence and a population genetic analysis indicates possible adaptive selection behind the accelerated evolution. Taken together, our results suggest an important role that FOXP2 may have played in the origin of human speech and demonstrate a strategy for identifying candidate genes underlying the emergences of human-specific features.
Collapse
Affiliation(s)
- Jianzhi Zhang
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, Michigan 48109, USA.
| | | | | |
Collapse
|
75
|
Yang Z. Likelihood and Bayes estimation of ancestral population sizes in hominoids using data from multiple loci. Genetics 2002; 162:1811-23. [PMID: 12524351 PMCID: PMC1462394 DOI: 10.1093/genetics/162.4.1811] [Citation(s) in RCA: 153] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Polymorphisms in an ancestral population can cause conflicts between gene trees and the species tree. Such conflicts can be used to estimate ancestral population sizes when data from multiple loci are available. In this article I extend previous work for estimating ancestral population sizes to analyze sequence data from three species under a finite-site nucleotide substitution model. Both maximum-likelihood (ML) and Bayes methods are implemented for joint estimation of the two speciation dates and the two population size parameters. Both methods account for uncertainties in the gene tree due to few informative sites at each locus and make an efficient use of information in the data. The Bayes algorithm using Markov chain Monte Carlo (MCMC) enjoys a computational advantage over ML and also provides a framework for incorporating prior information about the parameters. The methods are applied to a data set of 53 nuclear noncoding contigs from human, chimpanzee, and gorilla published by Chen and Li. Estimates of the effective population size for the common ancestor of humans and chimpanzees by both ML and Bayes methods are approximately 12,000-21,000, comparable to estimates for modern humans, and do not support the notion of a dramatic size reduction in early human populations. Estimates published previously from the same data are several times larger and appear to be biased due to methodological deficiency. The divergence between humans and chimpanzees is dated at approximately 5.2 million years ago and the gorilla divergence 1.1-1.7 million years earlier. The analysis suggests that typical data sets contain useful information about the ancestral population sizes and that it is advantageous to analyze data of several species simultaneously.
Collapse
Affiliation(s)
- Ziheng Yang
- Galton Laboratory, Department of Biology, University College London, London WC1E 6BT, England.
| |
Collapse
|
76
|
Saunders MA, Hammer MF, Nachman MW. Nucleotide variability at G6pd and the signature of malarial selection in humans. Genetics 2002; 162:1849-61. [PMID: 12524354 PMCID: PMC1462360 DOI: 10.1093/genetics/162.4.1849] [Citation(s) in RCA: 96] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Glucose-6-phosphate dehydrogenase (G6PD) deficiency is the most common enzymopathy in humans. Deficiency alleles for this X-linked disorder are geographically correlated with historical patterns of malaria, and the most common deficiency allele in Africa (G6PD A-) has been shown to confer some resistance to malaria in both hemizygous males and heterozygous females. We studied DNA sequence variation in 5.1 kb of G6pd from 47 individuals representing a worldwide sample to examine the impact of selection on patterns of human nucleotide diversity and to infer the evolutionary history of the G6PD A- allele. We also sequenced 3.7 kb of a neighboring locus, L1cam, from the same set of individuals to study the effect of selection on patterns of linkage disequilibrium. Despite strong clinical evidence for malarial selection maintaining G6PD deficiency alleles in human populations, the overall level of nucleotide heterozygosity at G6pd is typical of other genes on the X chromosome. However, the signature of selection is evident in the absence of genetic variation among A- alleles from different parts of Africa and in the unusually high levels of linkage disequilibrium over a considerable distance of the X chromosome. In spite of a long-term association between Plasmodium falciparum and the ancestors of modern humans, patterns of nucleotide variability and linkage disequilibrium suggest that the A- allele arose in Africa only within the last 10,000 years and spread due to selection.
Collapse
Affiliation(s)
- Matthew A Saunders
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona 85721, USA.
| | | | | |
Collapse
|
77
|
Eyre-Walker A, Keightley PD, Smith NGC, Gaffney D. Quantifying the slightly deleterious mutation model of molecular evolution. Mol Biol Evol 2002; 19:2142-9. [PMID: 12446806 DOI: 10.1093/oxfordjournals.molbev.a004039] [Citation(s) in RCA: 171] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
We have attempted to quantify the frequency and effects of slightly deleterious mutations (SDMs), those that have selective effects close to the reciprocal of the effective population size of a species, by comparing the level of selective constraint in protein-coding genes of related species that have different present-day effective population sizes. In our two comparisons, the species with the smaller effective population size showed lower constraint, implying that SDMs had become fixed. The fixation of SDMs was supported by the observation of a higher fraction of radical to conservative amino acid substitutions in species with smaller effective population sizes. The fraction of strongly deleterious mutations (which rarely become fixed) is >70% in most species. Only approximately 10% or fewer of mutations seem to behave as SDMs, but SDMs could comprise a substantial fraction of mutations in protein-coding genes that have a chance of becoming fixed between species.
Collapse
Affiliation(s)
- Adam Eyre-Walker
- Centre for the Study of Evolution & School of Biological Sciences, University of Sussex, Brighton BN1 9QG, UK.
| | | | | | | |
Collapse
|
78
|
Abstract
DNA sequence data from humans can provide insight into the history of modern humans and the genetic variability in human populations. We report here a study of human DNA sequence variation at an X-linked noncoding region of 10,346 bp. The sample consists of 62 X chromosomes from Africa, Europe, and Asia. Forty-four polymorphic sites were found among the 62 sequences, resulting in 23 different haplotypes. Statistical analyses of the data led to the following inferences. (1) There is strong evidence of human population expansion in the relatively recent past, and this population expansion has had a significant effect on the pattern of polymorphism at this locus. (2) Non-African populations were unlikely to have been derived from a very small number of African lineages. (3) There was considerable geographic subdivision in the ancient human population, which could be an important reason why many studies failed to detect population expansion. (4) The long-term effective population size of humans is between 12,000 and 15,000. And (5) a non-African specific variant was found at a frequency of 35% in non-Africans, an estimate supported by the genotyping of additional 80 non-African and 106 African X chromosomes. This variant could have arisen in Eurasia more than 140,000 years ago, predating the emergence of modern humans. Moreover, this haplotype and all other haplotypes coalesced to the most recent common ancestor of the sample, which was estimated to be older than 490,000 years. Therefore, this region may have a long history in Eurasia.
Collapse
Affiliation(s)
- Ning Yu
- Department of Ecology and Evolution, University of Chicago, 1101 East 57th Street, Chicago, IL 60637, USA
| | | | | |
Collapse
|
79
|
Zhao Z, Boerwinkle E. Neighboring-nucleotide effects on single nucleotide polymorphisms: a study of 2.6 million polymorphisms across the human genome. Genome Res 2002; 12:1679-86. [PMID: 12421754 PMCID: PMC187558 DOI: 10.1101/gr.287302] [Citation(s) in RCA: 103] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
We investigated substitution patterns and neighboring-nucleotide effects for 2,576,903 single nucleotide polymorphisms (SNPs) publicly available through the National Center for Biotechnology Information (NCBI). The proportions of substitutions were A/G, 32.77%; C/T, 32.81%; A/C, 8.98%; G/T, 9.06%; A/T, 7.46%; and C/G, 8.92%. The two nucleotides immediately neighboring the variable site showed major deviation from genome-wide and chromosome-specific expectations, although lesser biases extended as far as 200 bp. On the 5' side, the biases for A, C, G, and T were 1.43%, 4.91%, -1.70%, and -4.62%, respectively. These biases were -4.44%, -1.59%, 5.05%, and 0.99%, respectively, on the 3' side. The neighboring-nucleotide patterns for transitions were dominated by the hypermutability effects of CpG dinucleotides. Transitions were more common than transversions, and the probability of a transversion increased with increasing A + T content at the two adjacent sites. Neighboring-nucleotide biases were not consistent among chromosomes, with Chromosomes 19 and 22 standing out as different from the others. These data provide genome-wide information about the effects of neighboring nucleotides on mutational and evolutionary processes giving rise to contemporary patterns of nucleotide occurrence surrounding SNPs.
Collapse
Affiliation(s)
- Zhongming Zhao
- Human Genetics Center and Institute of Molecular Medicine, University of Texas Health Science Center at Houston, Houston, Texas 77030, USA
| | | |
Collapse
|
80
|
Abstract
Theoretical and empirical work indicates that patterns of neutral polymorphism can be affected by linked, selected mutations. Under background selection, deleterious mutations removed from a population by purifying selection cause a reduction in linked neutral diversity. Under genetic hitchhiking, the rise in frequency and fixation of beneficial mutations also reduces the level of linked neutral polymorphism. Here we review the evidence that levels of neutral polymorphism in humans are affected by selection at linked sites. We then discuss four approaches for distinguishing between background selection and genetic hitchhiking based on (i) the relationship between polymorphism level and recombination rate for neutral loci with high mutation rates, (ii) relative levels of variation on the X chromosome and the autosomes, (iii) the frequency distribution of neutral polymorphisms, and (iv) population-specific patterns of genetic variation. Although the evidence for selection at linked sites in humans is clear, current methods and data do not allow us to clearly assess the relative importance of background selection and genetic hitchhiking in humans. These results contrast with those obtained for Drosophila, where the signals of positive selection are stronger.
Collapse
Affiliation(s)
- Bret A Payseur
- Department of Ecology and Evolutionary Biology, Biosciences West Building, University of Arizona, Tucson, AZ 85721, USA.
| | | |
Collapse
|
81
|
de Groot NG, Otting N, Doxiadis GGM, Balla-Jhagjhoorsingh SS, Heeney JL, van Rood JJ, Gagneux P, Bontrop RE. Evidence for an ancient selective sweep in the MHC class I gene repertoire of chimpanzees. Proc Natl Acad Sci U S A 2002; 99:11748-53. [PMID: 12186979 PMCID: PMC129340 DOI: 10.1073/pnas.182420799] [Citation(s) in RCA: 111] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/16/2002] [Indexed: 11/18/2022] Open
Abstract
MHC class I molecules play an essential role in the immune defense against intracellular infections. The hallmark of the MHC is its extensive degree of polymorphism at the population level. However, the present comparison of MHC class I gene intron variation revealed that chimpanzees have experienced a severe repertoire reduction at the orthologues of the HLA-A, -B, and -C loci. The loss of variability predates the (sub)speciation of chimpanzees and did not effect other known gene systems. Therefore the selective sweep in the MHC class I gene may have resulted from a widespread viral infection. Based on the present results and the fact that chimpanzees have a natural resistance to the development of AIDS, we hypothesize that the selective sweep was caused by the chimpanzee-derived simian immunodeficiency virus (SIVcpz), the closest relative of HIV-1, or a closely related retrovirus. Hence, the contemporary chimpanzee populations represent the offspring of AIDS-resistant animals, the survivors of a HIV-like pandemic that took place in the distant past.
Collapse
Affiliation(s)
- Natasja G de Groot
- Department of Immunobiology, Biomedical Primate Research Centre, P.O. Box 3306, 2280 GH, Rijswijk, The Netherlands
| | | | | | | | | | | | | | | |
Collapse
|
82
|
Wooding SP, Watkins WS, Bamshad MJ, Dunn DM, Weiss RB, Jorde LB. DNA sequence variation in a 3.7-kb noncoding sequence 5' of the CYP1A2 gene: implications for human population history and natural selection. Am J Hum Genet 2002; 71:528-42. [PMID: 12181774 PMCID: PMC379190 DOI: 10.1086/342260] [Citation(s) in RCA: 58] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2002] [Accepted: 06/10/2002] [Indexed: 11/04/2022] Open
Abstract
CYP1A2 is a cytochrome P450 gene that is involved in human physiological responses to a variety of drugs and toxins. To investigate the role of population history and natural selection in shaping genetic diversity in CYP1A2, we sequenced a 3.7-kb region 5' from CYP1A2 in a diverse collection of 113 individuals from three major continental regions of the Old World (Africa, Asia, and Europe). We also examined sequences in the 90-member National Institutes of Health DNA Polymorphism Discovery Resource (PDR). Eighteen single-nucleotide polymorphisms (SNPs) were found. Most of the high-frequency SNPs found in the Old World sample were also found in the PDR sample. However, six SNPs were detected in the Old World sample but not in the PDR sample, and two SNPs found in the PDR sample were not found in the Old World sample. Most pairs of SNPs were in complete linkage disequilibrium with one another, and there was no indication of a decline of disequilibrium with physical distance in this region. The average +/- SD nucleotide diversity in the Old World sample was 0.00043+/-0.00026. The African population had the highest level of nucleotide diversity and the lowest level of linkage disequilibrium. Two distinct haplotype clusters with broadly overlapping geographical distributions were present. Of the 17 haplotypes found in the Old World sample, 12 were found in the African sample, 8 were found in Indians, 5 were found in non-Indian Asians, and 5 were found in Europeans. Haplotypes found outside Africa were mostly a subset of those found within Africa. These patterns are all consistent with an African origin of modern humans. Seven SNPs were singletons, and the site-frequency spectrum showed a significant departure from neutral expectations, suggesting population expansion and/or natural selection. Comparison with outgroup species showed that four derived SNPs have achieved high (>0.90) frequencies in human populations, a trend consistent with the action of positive natural selection. These patterns have a number of implications for disease-association studies in CYP1A2 and other genes.
Collapse
Affiliation(s)
- S. P. Wooding
- Departments of Human Genetics and Pediatrics, University of Utah, Salt Lake City
| | - W. S. Watkins
- Departments of Human Genetics and Pediatrics, University of Utah, Salt Lake City
| | - M. J. Bamshad
- Departments of Human Genetics and Pediatrics, University of Utah, Salt Lake City
| | - D. M. Dunn
- Departments of Human Genetics and Pediatrics, University of Utah, Salt Lake City
| | - R. B. Weiss
- Departments of Human Genetics and Pediatrics, University of Utah, Salt Lake City
| | - L. B. Jorde
- Departments of Human Genetics and Pediatrics, University of Utah, Salt Lake City
| |
Collapse
|
83
|
Laporte V, Charlesworth B. Effective Population Size and Population Subdivision in Demographically Structured Populations. Genetics 2002; 162:501-19. [PMID: 12242257 PMCID: PMC1462266 DOI: 10.1093/genetics/162.1.501] [Citation(s) in RCA: 109] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
AbstractA fast-timescale approximation is applied to the coalescent process in a single population, which is demographically structured by sex and/or age. This provides a general expression for the probability that a pair of alleles sampled from the population coalesce in the previous time interval. The effective population size is defined as the reciprocal of twice the product of generation time and the coalescence probability. Biologically explicit formulas for effective population size with discrete generations and separate sexes are derived for a variety of different modes of inheritance. The method is also applied to a nuclear gene in a population of partially self-fertilizing hermaphrodites. The effects of population subdivision on a demographically structured population are analyzed, using a matrix of net rates of movement of genes between different local populations. This involves weighting the migration probabilities of individuals of a given age/sex class by the contribution of this class to the leading left eigenvector of the matrix describing the movements of genes between age/sex classes. The effects of sex-specific migration and nonrandom distributions of offspring number on levels of genetic variability and among-population differentiation are described for different modes of inheritance in an island model. Data on DNA sequence variability in human and plant populations are discussed in the light of the results.
Collapse
Affiliation(s)
- Valérie Laporte
- Institute of Cell, Animal and Population Biology, University of Edinburgh, Edinburgh EH9 3JT, United Kingdom.
| | | |
Collapse
|
84
|
Abstract
The HFE locus encodes an HLA class-I-type protein important in iron regulation and segregates replacement mutations that give rise to the most common form of genetic hemochromatosis. The high frequency of one disease-associated mutation, C282Y, and the nature of this disease have led some to suggest a selective advantage for this mutation. To investigate the context in which this mutation arose and gain a better understanding of HFE genetic variation, we surveyed nucleotide variability in 11.2 kb encompassing the HFE locus and experimentally determined haplotypes. We fully resequenced 60 chromosomes of African, Asian, or European ancestry as well as one chimpanzee, revealing 41 variable sites and a nucleotide diversity of 0.08%. This indicates that linkage to the HLA region has not substantially increased the level of HFE variation. Although several haplotypes are shared between populations, one haplotype predominates in Asia but is nearly absent elsewhere, causing higher than average genetic differentiation among the three major populations. Our samples show evidence of intragenic recombination, so the scarcity of recombination events within the C282Y allele class is consistent with selection increasing the frequency of a young allele. Otherwise, the pattern of variability in this region does not clearly indicate the action of positive selection at this or linked loci.
Collapse
|
85
|
Yu N, Chen FC, Ota S, Jorde LB, Pamilo P, Patthy L, Ramsay M, Jenkins T, Shyue SK, Li WH. Larger genetic differences within africans than between Africans and Eurasians. Genetics 2002; 161:269-74. [PMID: 12019240 PMCID: PMC1462113 DOI: 10.1093/genetics/161.1.269] [Citation(s) in RCA: 111] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
The worldwide pattern of single nucleotide polymorphism (SNP) variation is of great interest to human geneticists, population geneticists, and evolutionists, but remains incompletely understood. We studied the pattern in noncoding regions, because they are less affected by natural selection than are coding regions. Thus, it can reflect better the history of human evolution and can serve as a baseline for understanding the maintenance of SNPs in human populations. We sequenced 50 noncoding DNA segments each approximately 500 bp long in 10 Africans, 10 Europeans, and 10 Asians. An analysis of the data suggests that the sampling scheme is adequate for our purpose. The average nucleotide diversity (pi) for the 50 segments is only 0.061% +/- 0.010% among Asians and 0.064% +/- 0.011% among Europeans but almost twice as high (0.115% +/- 0.016%) among Africans. The African diversity estimate is even higher than that between Africans and Eurasians (0.096% +/- 0.012%). From available data for noncoding autosomal regions (total length = 47,038 bp) and X-linked regions (47,421 bp), we estimated the pi-values for autosomal regions to be 0.105, 0.070, 0.069, and 0.097% for Africans, Asians, Europeans, and between Africans and Eurasians, and the corresponding values for X-linked regions to be 0.088, 0.042, 0.053, and 0.082%. Thus, Africans differ from one another slightly more than from Eurasians, and the genetic diversity in Eurasians is largely a subset of that in Africans, supporting the out of Africa model of human evolution. Clearly, one must specify the geographic origins of the individuals sampled when studying pi or SNP density.
Collapse
Affiliation(s)
- Ning Yu
- Department of Ecology and Evolution, University of Chicago, Chicago, Illinois 60637, USA
| | | | | | | | | | | | | | | | | | | |
Collapse
|
86
|
Betrán E, Wang W, Jin L, Long M. Evolution of the phosphoglycerate mutase processed gene in human and chimpanzee revealing the origin of a new primate gene. Mol Biol Evol 2002; 19:654-63. [PMID: 11961099 DOI: 10.1093/oxfordjournals.molbev.a004124] [Citation(s) in RCA: 61] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
Processed genes are created by retroposition from messenger RNA of expressed genes. The estimated amount of processed copies of genes in the human genome is 10,000-14,000. Some of these might be pseudogenes with the expected pattern for nonfunctional sequences, but some others might be an important source of new genes. We have studied the evolution of a Phosphoglycerate mutase processed gene (PGAM3) described in humans and believed to be a pseudogene. We sequenced PGAM3 in chimpanzee and macaque and obtained polymorphism data for human coding region. We found evidence that PGAM3 likely produces a functional protein, as an example of addressing functionality for human processed pseudogenes. First, the open reading frame was intact despite many deletions that occurred in the 3' untranslated region. Second, it appears that the gene is expressed. Finally, interspecies and intraspecies variation for PGAM3 was not consistent with the neutral model proposed for pseudogenes, suggesting that a new functional primate gene has originated. Amino acid divergence was significantly higher than synonymous divergence in PGAM3 lineage, supporting positive selection acting in this gene. This role of selection was further supported by the excess of rare alleles in a population genetic analysis. PGAM3 is located in a region of very low recombination; therefore, it is conceivable that the rapid fixation events in this newly arising gene may have contributed to a selective sweep of variation in the region.
Collapse
Affiliation(s)
- Esther Betrán
- Department of Ecology and Evolution, The University of Chicago, 1101 East 57th Street, Chicago, IL 60637, USA
| | | | | | | |
Collapse
|
87
|
Abstract
The question of the mode of origin of modern humans (Homo sapiens) has dominated palaeoanthropological debate over the last decade. This review discusses the main models proposed to explain modern human origins, and examines relevant fossil evidence from Eurasia, Africa and Australasia. Archaeological and genetic data are also discussed, as well as problems with the concept of 'modernity' itself. It is concluded that a recent African origin can be supported for H. sapiens, morphologically, behaviourally and genetically, but that more evidence will be needed, both from Africa and elsewhere, before an absolute African origin for our species and its behavioural characteristics can be established and explained.
Collapse
Affiliation(s)
- Chris Stringer
- Department of Palaeontology, The Natural History Museum, London SW7 5BD, UK.
| |
Collapse
|
88
|
|
89
|
Doris PA. Hypertension genetics, single nucleotide polymorphisms, and the common disease:common variant hypothesis. Hypertension 2002; 39:323-31. [PMID: 11882567 DOI: 10.1161/hy0202.104087] [Citation(s) in RCA: 116] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]
Abstract
The investigation of heritable susceptibility to disease is an effort to associate disease phenotype with underlying genotype. Such genotype:phenotype associations have been demonstrated for a large number of monogenetic disorders. The usual strategy has been to use linkage mapping in affected families to identify chromosomal loci from which candidate genes and genotypes can be tested for association with disease. This strategy has not been similarly successful for common heritable disease susceptibilities including hypertension that involve multiple genes and gene-environment interactions. Development of extensive collections of single nucleotide polymorphisms (SNPs) raises the possibility that these SNPs can be used as markers in genome-wide association mapping studies to identify hypertension susceptibility loci. In this approach, large numbers of markers are typed in cases and controls with the expectation that markers interrogating SNPs that are involved in inheritance of disease susceptibility will emerge through their association with this trait in the affected population. Essential hypertension is a common disorder. The term "common" has 2 implications: first, that the disease is prevalent; and, second, that it is widespread. Such frequency and distribution characteristics could arise if the susceptibility alleles for hypertension were prevalent in the founding population of contemporary human beings and became distributed with human global dispersal. This common disease:common variant concept is attractive because it suggests that the genetic heterogeneity underlying hypertension susceptibility could be relatively small. It also allows the possibility that nonrandom association of alleles (linkage disequilibrium, LD) can be used to reduce the number of SNP markers required to identify disease susceptibility alleles because a single marker can act as a surrogate for variation flanking it. The influence of a number of important factors on the detectability of hypertension susceptibility alleles by SNP mapping approaches is not yet fully defined. These factors include the locus and allelic diversity of hypertension, the weaker relationship (compared with Mendelian traits) between genotype and phenotype, the accuracy of high throughput genotyping techniques, the extensive role of nongenetic factors, and the extent and heterogeneous nature of LD across the genome.
Collapse
Affiliation(s)
- Peter A Doris
- Center for Human Genetics, Institute of Molecular Medicine, University of Texas at Houston, 77030, USA.
| |
Collapse
|
90
|
Abstract
When and where did modern humans evolve? How did our ancestors spread over the world? Traditionally, answers to questions such as these have been sought in historical, archaeological, and fossil records. However, increasingly genetic data provide information about the evolution of our species. In this review, we focus on the comparison of the variation in the human gene pool to that of our closest evolutionary relatives, the great apes, because this provides a relevant perspective on human genetical evolution. For instance, comparisons to the great apes show that humans are unique in having little genetic variation as well as little genetic structure in their gene pool. Furthermore, genetic data indicate that humans, but not the great apes, have experienced a period of dramatic growth in their early history.
Collapse
Affiliation(s)
- H Kaessmann
- Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany
| | | |
Collapse
|
91
|
Lessios HA, Garrido MJ, Kessing BD. Demographic history of Diadema antillarum, a keystone herbivore on Caribbean reefs. Proc Biol Sci 2001; 268:2347-53. [PMID: 11703875 PMCID: PMC1088886 DOI: 10.1098/rspb.2001.1806] [Citation(s) in RCA: 63] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
The sea urchin Diadema antillarum was the most important herbivore on Caribbean reefs until 1983, when mass mortality reduced its populations by more than 97%. Knowledge of its past demography is essential to reconstruct reef ecology as it was before human impact, which has been implicated as having caused high pre-mortality Diadema abundance. To determine the history of its population size, we sequenced the ATPase 6 and 8 region of mitochondrial DNA from populations in the Caribbean and in the eastern Atlantic (which was not affected by the mass mortality), as well as from the eastern Pacific D. mexicanum. The Caribbean population harbours an order of magnitude more molecular diversity than those of the eastern Pacific or the eastern Atlantic and, despite the recent mass mortality, its DNA sequences bear the genetic signature of a previous population expansion. By estimating mutation rates from divergence between D. antillarum and D. mexicanum, that were separated at a known time by the Isthmus of Panama, and by using estimates of effective population size derived from mismatch distributions and a maximum likelihood coalescence algorithm, we date the expansion as having occurred no more recently than 100 000 years before the present. Thus, Diadema was abundant in the Caribbean long before humans could have affected ecological processes; the genetic data contain no evidence of a recent, anthropogenically caused, population increase.
Collapse
Affiliation(s)
- H A Lessios
- Smithsonian Tropical Research Institute, Box 2072, Balboa, Panama.
| | | | | |
Collapse
|
92
|
Jensen-Seaman MI, Deinard AS, Kidd KK. Modern African ape populations as genetic and demographic models of the last common ancestor of humans, chimpanzees, and gorillas. J Hered 2001; 92:475-80. [PMID: 11948214 DOI: 10.1093/jhered/92.6.475] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
In order to fully understand human evolutionary history through the use of molecular data, it is essential to include our closest relatives as a comparison. We provide here estimates of nucleotide diversity and effective population size of modern African ape species using data from several independent noncoding nuclear loci, and use these estimates to make predictions about the nature of the ancestral population that eventually gave rise to the living species of African apes, including humans. Chimpanzees, bonobos, and gorillas possess two to three times more nucleotide diversity than modern humans. We hypothesize that the last common ancestor (LCA) of these species had an effective population size more similar to modern apes than modern humans. In addition, estimated dates for the divergence of the Homo, Pan, and Gorilla lineages suggest that the LCA may have had stronger geographic structuring to its mtDNA than its nuclear DNA, perhaps indicative of strong female philopatry or a dispersal system analogous to gorillas, where females disperse only short distances from their natal group. Synthesizing different classes of data, and the inferences drawn from them, allows us to predict some of the genetic and demographic properties of the LCA of humans, chimpanzees, and gorillas.
Collapse
Affiliation(s)
- M I Jensen-Seaman
- Department of Anthropology, Yale University, P.O. Box 208277, New Haven, CT 06520-8277, USA
| | | | | |
Collapse
|
93
|
Turner G, Barbulescu M, Su M, Jensen-Seaman MI, Kidd KK, Lenz J. Insertional polymorphisms of full-length endogenous retroviruses in humans. Curr Biol 2001; 11:1531-5. [PMID: 11591322 DOI: 10.1016/s0960-9822(01)00455-9] [Citation(s) in RCA: 252] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Human endogenous retrovirus K (HERV-K) is distinctive among the retroviruses in the human genome in that many HERV-K proviruses were inserted into the human germline after the human and chimpanzee lineages evolutionarily diverged [1, 2]. However, all full-length endogenous retroviruses described to date in humans are sufficiently old that all humans examined were homozygous for their presence [1]. Moreover, none are intact; all have lethal mutations [1, 3, 4]. Here, we describe the first endogenous retroviruses in humans for which both the full-length provirus and the preintegration site alleles are shown to be present in the human population today. One provirus, called HERV-K113, was present in about 30% of tested individuals, while a second, called HERV-K115, was found in about 15%. HERV-K113 has full-length open reading frames (ORFs) for all viral proteins and lacks any nonsynonymous substitutions in amino acid motifs that are well conserved among retroviruses. This is the first such endogenous retrovirus identified in humans. These findings indicate that HERV-K remained capable of reinfecting humans through very recent evolutionary times and that HERV-K113 is an excellent candidate for an endogenous retrovirus that is capable of reinfecting humans today.
Collapse
Affiliation(s)
- G Turner
- Department of Molecular Genetics, Albert Einstein College of Medicine, 1300 Morris Park Avenue, Bronx, NY 10461, USA
| | | | | | | | | | | |
Collapse
|
94
|
Abstract
Levels of heterozygosity for single nucleotide polymorphisms vary by more than one order of magnitude in different regions of the human genome. Regional differences in the rate of recombination explain a substantial fraction of the variation in levels of nucleotide polymorphism, consistent with the widespread action of natural selection at the molecular level.
Collapse
Affiliation(s)
- M W Nachman
- Dept of Ecology and Evolutionary Biology, University of Arizona, Tuscon 85721, USA.
| |
Collapse
|
95
|
Makova KD, Ramsay M, Jenkins T, Li WH. Human DNA sequence variation in a 6.6-kb region containing the melanocortin 1 receptor promoter. Genetics 2001; 158:1253-68. [PMID: 11454772 PMCID: PMC1461732 DOI: 10.1093/genetics/158.3.1253] [Citation(s) in RCA: 44] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
An approximately 6.6-kb region located upstream from the melanocortin 1 receptor (MC1R) gene and containing its promoter was sequenced in 54 humans (18 Africans, 18 Asians, and 18 Europeans) and in one chimpanzee, gorilla, and orangutan. Seventy-six polymorphic sites were found among the human sequences and the average nucleotide diversity (pi) was 0.141%, one of the highest among all studies of nuclear sequence variation in humans. Opposite to the pattern observed in the MC1R coding region, in the present region pi is highest in Africans (0.136%) compared to Asians (0.116%) and Europeans (0.122%). The distributions of pi, theta, and Fu and Li's F-statistic are nonuniform along the sequence and among continents. The pattern of genetic variation is consistent with a population expansion in Africans. We also suggest a possible phase of population size reduction in non-Africans and purifying selection acting in the middle subregion and parts of the 5' subregion in Africans. We hypothesize diversifying selection acting on some sites in the 5' and 3' subregions or in the MC1R coding region in Asians and Europeans, though we cannot reject the possibility of relaxation of functional constraints in the MC1R gene in Asians and Europeans. The mutation rate in the sequenced region is 1.65 x 10(-9) per site per year. The age of the most recent common ancestor for this region is similar to that for the other long noncoding regions studied to date, providing evidence for ancient gene genealogies. Our population screening and phylogenetic footprinting suggest potentially important sites for the MC1R promoter function.
Collapse
Affiliation(s)
- K D Makova
- Department of Ecology and Evolution, University of Chicago, Chicago, Illinois 60637, USA
| | | | | | | |
Collapse
|
96
|
Wooding S. Do human and JC virus genes show evidence of host-parasite codemography? INFECTION, GENETICS AND EVOLUTION : JOURNAL OF MOLECULAR EPIDEMIOLOGY AND EVOLUTIONARY GENETICS IN INFECTIOUS DISEASES 2001; 1:3-12. [PMID: 12798045 DOI: 10.1016/s1567-1348(01)00002-8] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
Information about similarities and differences in the demographic history of host and parasite populations is potentially useful for making inferences about a variety of evolutionary processes. However, it is difficult to observe the historical demographic properties of natural populations directly. Here, the extent of demographic similarity in a host and its parasite was examined indirectly by inferring long-term population history from patterns of genetic variation. Nucleotide sequence diversity in human and JC virus (JCV) DNA is consistent with a long-term demographic connection between the two species: both show evidence of large-scale population expansion. However, genetic data also suggest that the two species have different patterns of population substructuring. These similarities and differences have implications for adaptive evolution in JCV that are not evident when the two species are considered separately.
Collapse
Affiliation(s)
- S Wooding
- Department of Anthropology, University of Utah, 270 South 1400 East, Salt Lake City, UT 84112-0060, USA.
| |
Collapse
|
97
|
Ke Y, Su B, Song X, Lu D, Chen L, Li H, Qi C, Marzuki S, Deka R, Underhill P, Xiao C, Shriver M, Lell J, Wallace D, Wells RS, Seielstad M, Oefner P, Zhu D, Jin J, Huang W, Chakraborty R, Chen Z, Jin L. African Origin of Modern Humans in East Asia: A Tale of 12,000 Y Chromosomes. Science 2001; 292:1151-3. [PMID: 11349147 DOI: 10.1126/science.1060011] [Citation(s) in RCA: 220] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
Abstract
To test the hypotheses of modern human origin in East Asia, we sampled 12,127 male individuals from 163 populations and typed for three Y chromosome biallelic markers (YAP, M89, and M130). All the individuals carried a mutation at one of the three sites. These three mutations (YAP+, M89T, and M130T) coalesce to another mutation (M168T), which originated in Africa about 35,000 to 89,000 years ago. Therefore, the data do not support even a minimal in situ hominid contribution in the origin of anatomically modern humans in East Asia.
Collapse
Affiliation(s)
- Y Ke
- State Key Laboratory of Genetic Engineering, Institute of Genetics, School of Life Sciences, Fudan University, 220 Handan Road, Shanghai, China
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
98
|
Abstract
Levels and patterns of human DNA sequence variation vary widely among loci. However, some of this variation may be due to the different populations used in different studies. So far, few studies of diverse human populations have compared different genetic loci for the same samples of populations and individuals. Here, we present new polymorphism data from intron 4 of the Factor IX gene (FIX) sequenced in diverse Old World populations. An explicit comparison is made with another X-linked gene, PDHA1, for which the sampling of individuals was very similar. Despite having a similar amount of divergence from chimpanzees, as do other nuclear genes, FIX has comparatively much less DNA sequence variation among humans. Nucleotide diversity at FIX is the lowest among the existing non-Y chromosome nuclear gene datasets and is less than 10% of the diversity found at PDHA1. Estimates of effective population size based on FIX are 8,558, about half of the value obtained for PDHA1, and the time to the most recent common ancestry among human FIX gene copies (282,000 years) is one of the most recent estimates reported for human genes. Analyses presented here suggest a history for the FIX region that includes recent positive directional selection, or background, selection. The general conclusion emerging is that very large variations can exist between the histories of similar genomic regions, even when sampling differences are minimized.
Collapse
Affiliation(s)
- E E Harris
- Universidade de São Paulo, Cidade Universitaria, Departamento de Biologia, Instituto de Biociências, Rua do Matão-Travessa 14 number 321, CEP 05508-900, São Paulo, Brazil.
| | | |
Collapse
|
99
|
Watkins WS, Ricker CE, Bamshad MJ, Carroll ML, Nguyen SV, Batzer MA, Harpending HC, Rogers AR, Jorde LB. Patterns of ancestral human diversity: an analysis of Alu-insertion and restriction-site polymorphisms. Am J Hum Genet 2001; 68:738-52. [PMID: 11179020 PMCID: PMC1274485 DOI: 10.1086/318793] [Citation(s) in RCA: 112] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2000] [Accepted: 01/17/2001] [Indexed: 11/04/2022] Open
Abstract
We have analyzed 35 widely distributed, polymorphic Alu loci in 715 individuals from 31 world populations. The average frequency of Alu insertions (the derived state) is lowest in Africa (.42) but is higher and similar in India (.55), Europe (.56), and Asia (.57). A comparison with 30 restriction-site polymorphisms (RSPs) for which the ancestral state has been determined shows that the frequency of derived RSP alleles is also lower in Africa (.35) than it is in Asia (.45) and in Europe (.46). Neighbor-joining networks based on Alu insertions or RSPs are rooted in Africa and show African populations as separate from other populations, with high statistical support. Correlations between genetic distances based on Alu and nuclear RSPs, short tandem-repeat polymorphisms, and mtDNA, in the same individuals, are high and significant. For the 35 loci, Alu gene diversity and the diversity attributable to population subdivision is highest in Africa but is lower and similar in Europe and Asia. The distribution of ancestral alleles is consistent with an origin of early modern human populations in sub-Saharan Africa, the isolation and preservation of ancestral alleles within Africa, and an expansion out of Africa into Eurasia. This expansion is characterized by increasing frequencies of Alu inserts and by derived RSP alleles with reduced genetic diversity in non-African populations.
Collapse
Affiliation(s)
- W S Watkins
- Department of Human Genetics, University of Utah Health Sciences Center, Salt Lake City, UT, 84112, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
100
|
Chen FC, Li WH. Genomic divergences between humans and other hominoids and the effective population size of the common ancestor of humans and chimpanzees. Am J Hum Genet 2001; 68:444-56. [PMID: 11170892 PMCID: PMC1235277 DOI: 10.1086/318206] [Citation(s) in RCA: 473] [Impact Index Per Article: 20.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2000] [Accepted: 12/08/2000] [Indexed: 11/03/2022] Open
Abstract
To study the genomic divergences among hominoids and to estimate the effective population size of the common ancestor of humans and chimpanzees, we selected 53 autosomal intergenic nonrepetitive DNA segments from the human genome and sequenced them in a human, a chimpanzee, a gorilla, and an orangutan. The average sequence divergence was only 1.24% +/- 0.07% for the human-chimpanzee pair, 1.62% +/- 0.08% for the human-gorilla pair, and 1.63% +/- 0.08% for the chimpanzee-gorilla pair. These estimates, which were confirmed by additional data from GenBank, are substantially lower than previous ones, which included repetitive sequences and might have been based on less-accurate sequence data. The average sequence divergences between orangutans and humans, chimpanzees, and gorillas were 3.08% +/- 0.11%, 3.12% +/- 0.11%, and 3.09% +/- 0.11%, respectively, which also are substantially lower than previous estimates. The sequence divergences in other regions between hominoids were estimated from extensive data in GenBank and the literature, and Alus showed the highest divergence, followed in order by Y-linked noncoding regions, pseudogenes, autosomal intergenic regions, X-linked noncoding regions, synonymous sites, introns, and nonsynonymous sites. The neighbor-joining tree derived from the concatenated sequence of the 53 segments--24,234 bp in length--supports the Homo-Pan clade with a 100% bootstrap value. However, when each segment is analyzed separately, 22 of the 53 segments (approximately 42%) give a tree that is incongruent with the species tree, suggesting a large effective population size (N(e)) of the common ancestor of Homo and Pan. Indeed, a parsimony analysis of the 53 segments and 37 protein-coding genes leads to an estimate of N(e) = 52,000 to 96,000. As this estimate is 5 to 9 times larger than the long-term effective population size of humans (approximately 10,000) estimated from various genetic polymorphism data, the human lineage apparently had experienced a large reduction in effective population size after its separation from the chimpanzee lineage. Our analysis assumes a molecular clock, which is in fact supported by the sequence data used. Taking the orangutan speciation date as 12 to 16 million years ago, we obtain an estimate of 4.6 to 6.2 million years for the Homo-Pan divergence and an estimate of 6.2 to 8.4 million years for the gorilla speciation date, suggesting that the gorilla lineage branched off 1.6 to 2.2 million years earlier than did the human-chimpanzee divergence.
Collapse
Affiliation(s)
- Feng-Chi Chen
- Department of Life Science, National Tsing Hua University, Taiwan, and Department of Ecology and Evolution, University of Chicago, Chicago
| | - Wen-Hsiung Li
- Department of Life Science, National Tsing Hua University, Taiwan, and Department of Ecology and Evolution, University of Chicago, Chicago
| |
Collapse
|