51
|
Primate CpG islands are maintained by heterogeneous evolutionary regimes involving minimal selection. Cell 2011; 145:773-86. [PMID: 21620139 DOI: 10.1016/j.cell.2011.04.024] [Citation(s) in RCA: 109] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2010] [Revised: 03/28/2011] [Accepted: 04/26/2011] [Indexed: 12/13/2022]
Abstract
Mammalian CpG islands are key epigenomic elements that were first characterized experimentally as genomic fractions with low levels of DNA methylation. Currently, CpG islands are defined based on their genomic sequences alone. Here, we develop evolutionary models to show that several distinct evolutionary processes generate and maintain CpG islands. One central evolutionary regime resulting in enriched CpG content is driven by low levels of DNA methylation and consequentially low rates of CpG deamination. Another major force forming CpG islands is biased gene conversion that stabilizes constitutively methylated CpG islands by balancing rapid deamination with CpG fixation. Importantly, evolutionary analysis and population genetics data suggest that selection for high CpG content is not a significant factor contributing to conservation of CpGs in differentially methylated regions. The heterogeneous, but not selective, origins of CpG islands have direct implications for the understanding of DNA methylation patterns in healthy and diseased cells.
Collapse
|
52
|
Nabholz B, Künstner A, Wang R, Jarvis ED, Ellegren H. Dynamic evolution of base composition: causes and consequences in avian phylogenomics. Mol Biol Evol 2011; 28:2197-210. [PMID: 21393604 PMCID: PMC3144382 DOI: 10.1093/molbev/msr047] [Citation(s) in RCA: 67] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
Resolving the phylogenetic relationships among birds is a classical problem in systematics, and this is particularly so when it comes to understanding the relationships among Neoaves. Previous phylogenetic inference of birds has been limited to mitochondrial genomes or a few nuclear genes. Here, we apply deep brain transcriptome sequencing of nine bird species (several passerines, hummingbirds, dove, parrot, and emu), using next-generation sequencing technology to understand features of transcriptome evolution in birds and how this affects phylogenetic inference, and combine with data from two bird species using first generation technology. The phylogenomic data matrix comprises 1,995 genes and a total of 0.77 Mb of exonic sequence. First, we find an unexpected heterogeneity in the evolution of base composition among avian lineages. There is a pronounced increase in guanine + cytosine (GC) content in the third codon position in several independent lineages, with the strongest effect seen in passerines. Second, we evaluate the effect of GC content variation on phylogenetic reconstruction. We find important inconsistencies between the topologies obtained with or without taking GC variation into account, each supporting different conclusions of past studies and also influencing hypotheses on the evolution of the trait of vocal learning. Third, we demonstrate a link between GC content evolution and recombination rate and, focusing on the zebra finch lineage, find that recombination seems to drive GC content. Although we cannot reveal the causal relationships, this observation is consistent with the model of GC-biased gene conversion. Finally, we use this unparalleled amount of avian sequence data to study the rate of molecular evolution, calibrated by fossil evidence and augmented with data from alligator transcriptome sequencing. There is a 2- to 3-fold variation in substitution rate among lineages with passerines being the most rapidly evolving and ratites the slowest. This study illustrates the potential of next-generation sequencing for phylogenomic studies but also the pitfalls when using genome-wide data with heterogeneous base composition.
Collapse
Affiliation(s)
- Benoit Nabholz
- Department of Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden
| | - Axel Künstner
- Department of Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden
| | - Rui Wang
- Department of Neurobiology, Howard Hughes Medical Institute, Duke University Medical Center, Durham
| | - Erich D. Jarvis
- Department of Neurobiology, Howard Hughes Medical Institute, Duke University Medical Center, Durham
| | - Hans Ellegren
- Department of Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden
| |
Collapse
|
53
|
Katzman S, Capra JA, Haussler D, Pollard KS. Ongoing GC-biased evolution is widespread in the human genome and enriched near recombination hot spots. Genome Biol Evol 2011; 3:614-26. [PMID: 21697099 PMCID: PMC3157837 DOI: 10.1093/gbe/evr058] [Citation(s) in RCA: 43] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023] Open
Abstract
Fast evolving regions of many metazoan genomes show a bias toward substitutions that change weak (A,T) into strong (G,C) base pairs. Single-nucleotide polymorphisms (SNPs) do not share this pattern, suggesting that it results from biased fixation rather than biased mutation. Supporting this hypothesis, analyses of polymorphism in specific regions of the human genome have identified a positive correlation between weak to strong (W→S) SNPs and derived allele frequency (DAF), suggesting that SNPs become increasingly GC biased over time, especially in regions of high recombination. Using polymorphism data generated by the 1000 Genomes Project from 179 individuals from 4 human populations, we evaluated the extent and distribution of ongoing GC-biased evolution in the human genome. We quantified GC fixation bias by comparing the DAFs of W→S mutations and S→W mutations using a Mann-Whitney U test. Genome-wide, W→S SNPs have significantly higher DAFs than S→W SNPs. This pattern is widespread across the human genome but varies in magnitude along the chromosomes. We found extreme GC-biased evolution in neighborhoods of recombination hot spots, a significant correlation between GC bias and recombination rate, and an inverse correlation between GC bias and chromosome arm length. These findings demonstrate the presence of ongoing fixation bias favoring G and C alleles throughout the human genome and suggest that the bias is caused by a recombination-associated process, such as GC-biased gene conversion.
Collapse
Affiliation(s)
- Sol Katzman
- Center for Biomolecular Science and Engineering, University of California, Santa Cruz, USA
| | | | | | | |
Collapse
|
54
|
Capra JA, Pollard KS. Substitution patterns are GC-biased in divergent sequences across the metazoans. Genome Biol Evol 2011; 3:516-27. [PMID: 21670083 PMCID: PMC3138425 DOI: 10.1093/gbe/evr051] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
The fastest-evolving regions in the human and chimpanzee genomes show a remarkable excess of weak (A,T) to strong (G,C) nucleotide substitutions since divergence from their common ancestor. We investigated the phylogenetic extent and possible causes of this weak to strong (W→S) bias in divergent sequences (BDS) using recently sequenced genomes and recombination maps from eight trios of eukaryotic species. To quantify evidence for BDS, we inferred substitution histories using an efficient maximum likelihood approach with a context-dependent evolutionary model. We then annotated all lineage-specific substitutions in terms of W→S bias and density on the chromosomes. Finally, we used the inferred substitutions to calculate a BDS score—a log odds ratio between substitution type and density—and assessed its statistical significance with Fisher's exact test. Applying this approach, we found significant BDS in the coding and noncoding sequence of human, mouse, dog, stickleback, fruit fly, and worm. We also observed a significant lack of W→S BDS in chicken and yeast. The BDS score varies between species and across the chromosomes within each species. It is most strongly correlated with different genomic features in different species, but a strong correlation with recombination rates is found in several species. Our results demonstrate that a W→S substitution bias in fast-evolving sequences is a widespread phenomenon. The patterns of BDS observed suggest that a recombination-associated process, such as GC-biased gene conversion, is involved in the production of the bias in many species, but the strength of the BDS likely depends on many factors, including genome stability, variability in recombination rate over time and across the genome, the frequency of meiosis, and the amount of outcrossing in each species.
Collapse
Affiliation(s)
- John A. Capra
- Gladstone Institutes, University of California, San Francisco
| | - Katherine S. Pollard
- Gladstone Institutes, University of California, San Francisco
- Division of Biostatistics & Institute for Human Genetics, University of California, San Francisco
- Corresponding author: E-mail:
| |
Collapse
|
55
|
Burch LH, Yang Y, Sterling JF, Roberts SA, Chao FG, Xu H, Zhang L, Walsh J, Resnick MA, Mieczkowski PA, Gordenin DA. Damage-induced localized hypermutability. Cell Cycle 2011; 10:1073-85. [PMID: 21406975 DOI: 10.4161/cc.10.7.15319] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Abstract
Genome instability continuously presents perils of cancer, genetic disease and death of a cell or an organism. At the same time, it provides for genome plasticity that is essential for development and evolution. We address here the genome instability confined to a small fraction of DNA adjacent to free DNA ends at uncapped telomeres and double-strand breaks. We found that budding yeast cells can tolerate nearly 20 kilobase regions of subtelomeric single-strand DNA that contain multiple UV-damaged nucleotides. During restoration to the double-strand state, multiple mutations are generated by error-prone translesion synthesis. Genome-wide sequencing demonstrated that multiple regions of damage-induced localized hypermutability can be tolerated, which leads to the simultaneous appearance of multiple mutation clusters in the genomes of UV- irradiated cells. High multiplicity and density of mutations suggest that this novel form of genome instability may play significant roles in generating new alleles for evolutionary selection as well as in the incidence of cancer and genetic disease.
Collapse
Affiliation(s)
- Lauranell H Burch
- National Institute of Environmental Health Sciences, Research Triangle Park, NC USA
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
56
|
Ratnakumar A, Mousset S, Glémin S, Berglund J, Galtier N, Duret L, Webster MT. Detecting positive selection within genomes: the problem of biased gene conversion. Philos Trans R Soc Lond B Biol Sci 2010; 365:2571-80. [PMID: 20643747 DOI: 10.1098/rstb.2010.0007] [Citation(s) in RCA: 102] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023] Open
Abstract
The identification of loci influenced by positive selection is a major goal of evolutionary genetics. A popular approach is to perform scans of alignments on a genome-wide scale in order to find regions evolving at accelerated rates on a particular branch of a phylogenetic tree. However, positive selection is not the only process that can lead to accelerated evolution. Notably, GC-biased gene conversion (gBGC) is a recombination-associated process that results in the biased fixation of G and C nucleotides. This process can potentially generate bursts of nucleotide substitutions within hotspots of meiotic recombination. Here, we analyse the results of a scan for positive selection on genes on branches across the primate phylogeny. We show that genes identified as targets of positive selection have a significant tendency to exhibit the genomic signature of gBGC. Using a maximum-likelihood framework, we estimate that more than 20 per cent of cases of significantly elevated non-synonymous to synonymous substitution rates ratio (d(N)/d(S)), particularly in shorter branches, could be due to gBGC. We demonstrate that in some cases, gBGC can lead to very high d(N)/d(S) (more than 2). Our results indicate that gBGC significantly affects the evolution of coding sequences in primates, often leading to patterns of evolution that can be mistaken for positive selection.
Collapse
Affiliation(s)
- Abhirami Ratnakumar
- Department of Medical Biochemistry and Microbiology, Uppsala University, Box 582, 751 23 Uppsala, Sweden
| | | | | | | | | | | | | |
Collapse
|
57
|
Fumagalli M, Cagliani R, Riva S, Pozzoli U, Biasin M, Piacentini L, Comi GP, Bresolin N, Clerici M, Sironi M. Population genetics of IFIH1: ancient population structure, local selection, and implications for susceptibility to type 1 diabetes. Mol Biol Evol 2010; 27:2555-66. [PMID: 20538742 DOI: 10.1093/molbev/msq141] [Citation(s) in RCA: 43] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
The human interferon induced with helicase C domain 1 (IFIH1) gene encodes a sensor of double-strand RNA involved in innate immunity against viruses, indicating that this gene is a likely target of virus-driven selective pressure. Notably, IFIH1 also plays a role in autoimmunity, as common and rare polymorphisms in this gene have been associated with type 1 diabetes (T1D). We analyzed the evolutionary history of IFIH1 in human populations. Results herein suggest that two major IFIH1 haplotype clades originated from ancestral population structure (or balancing selection) in the African continent and that local selective pressures have acted on the gene. Specifically, directional selection in Europe and Asia resulted in the spread of a common IFIH1 haplotype carrying a derived His460 allele. This variant changes a highly conserved arginine residue in the helicase domain, possibly conferring altered specificity in viral recognition. An alternative common haplotype has swept to high frequency in South Americans as a result of recent positive selection. Previous studies suggested that a portion of risk alleles for autoimmune diseases could have been maintained in humans as they conferred a selective advantage against infections. This is not the case for IFIH1, as population genetic differentiation and haplotype analyses indicated that the T1D susceptibility alleles behaved as neutral or nearly neutral polymorphisms. Our findings suggest that variants in IFIH1 confer different susceptibility to diverse viral infections and provide insight into the relationship between adaptation to past infection and predisposition to autoimmunity in modern populations.
Collapse
Affiliation(s)
- Matteo Fumagalli
- Bioinformatic Lab, Scientific Institute Istituto di Ricovero e Cura a Carattere Scientifico E. Medea, Bosisio Parini, Lecco, Italy
| | | | | | | | | | | | | | | | | | | |
Collapse
|
58
|
Katzman S, Kern AD, Pollard KS, Salama SR, Haussler D. GC-biased evolution near human accelerated regions. PLoS Genet 2010; 6:e1000960. [PMID: 20502635 PMCID: PMC2873926 DOI: 10.1371/journal.pgen.1000960] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2009] [Accepted: 04/20/2010] [Indexed: 12/30/2022] Open
Abstract
Regions of the genome that have been the target of positive selection specifically along the human lineage are of special importance in human biology. We used high throughput sequencing combined with methods to enrich human genomic samples for particular targets to obtain the sequence of 22 chromosomal samples at high depth in 40 kb neighborhoods of 49 previously identified 100–400 bp elements that show evidence for human accelerated evolution. In addition to selection, the pattern of nucleotide substitutions in several of these elements suggested an historical bias favoring the conversion of weak (A or T) alleles into strong (G or C) alleles. Here we found strong evidence in the derived allele frequency spectra of many of these 40 kb regions for ongoing weak-to-strong fixation bias. Comparison of the nucleotide composition at polymorphic loci to the composition at sites of fixed substitutions additionally reveals the signature of historical weak-to-strong fixation bias in a subset of these regions. Most of the regions with evidence for historical bias do not also have signatures of ongoing bias, suggesting that the evolutionary forces generating weak-to-strong bias are not constant over time. To investigate the role of selection in shaping these regions, we analyzed the spatial pattern of polymorphism in our samples. We found no significant evidence for selective sweeps, possibly because the signal of such sweeps has decayed beyond the power of our tests to detect them. Together, these results do not rule out functional roles for the observed changes in these regions—indeed there is good evidence that the first two are functional elements in humans—but they suggest that a fixation process (such as biased gene conversion) that is biased at the nucleotide level, but is otherwise selectively neutral, could be an important evolutionary force at play in them, both historically and at present. The search for functional regions in the human genome, beyond the protein-coding portion, often relies on signals of conservation across species. The Human Accelerated Regions (HARs) are strongly conserved elements, ranging in size from 100–400 bp, that show an unexpected number of human-specific changes. This pattern suggests that HARs may be functional elements that have significantly changed during human evolution. To analyze the evolutionary forces that led these changes, we studied 40 kb neighborhoods of the top 49 HARs. We took advantage of recently developed DNA sequencing technology, coupled with methods to isolate genomic DNA for our target regions only, to determine the genotypes in 22 chromosomal samples. This polymorphism data showed no significant evidence for adaptive selective sweeps in HAR regions. By contrast, we found strong evidence for a nucleotide bias in the fixation of mutations from A or T to G or C basepairs. Our work reveals that this bias in the HAR neighborhoods is not just an historic phenomenon, but is ongoing in the present day human population. This finding adds credence to the possibility that non-selective forces, such as biased gene conversion, could have contributed to the evolution of several of these regions.
Collapse
Affiliation(s)
- Sol Katzman
- Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, California, United States of America
| | - Andrew D. Kern
- Department of Biological Sciences, Dartmouth College, Hanover, New Hampshire, United States of America
| | - Katherine S. Pollard
- Gladstone Institutes, University of California San Francisco, San Francisco, California, United States of America
| | - Sofie R. Salama
- Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, California, United States of America
- Center for Biomolecular Science and Engineering, University of California Santa Cruz, Santa Cruz, California, United States of America
- Howard Hughes Medical Institute, University of California Santa Cruz, Santa Cruz, California, United States of America
| | - David Haussler
- Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, California, United States of America
- Center for Biomolecular Science and Engineering, University of California Santa Cruz, Santa Cruz, California, United States of America
- Howard Hughes Medical Institute, University of California Santa Cruz, Santa Cruz, California, United States of America
- * E-mail:
| |
Collapse
|
59
|
Aleshin A, Zhi D. Recombination-associated sequence homogenization of neighboring Alu elements: signature of nonallelic gene conversion. Mol Biol Evol 2010; 27:2300-11. [PMID: 20453015 DOI: 10.1093/molbev/msq116] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
Recently, researchers have begun to recognize that, in order to establish neutral models for disease association and evolutionary genomics studies, it is crucial to have a clear understanding of the genomic impact of nonallelic gene conversion. Drawing on previous successes in characterizing this phenomenon over protein-coding gene families, we undertook a computational analysis of neighboring Alu sequences in the genome scale. For this purpose, we developed adjusted comutation rate (aCMR), a novel statistical method measuring the excess number of identical point mutations shared by adjacent Alu sequences, vis-à-vis random pairs. Using aCMR, we uncovered a remarkable genome-wide sequence homogenization of neighboring Alus, with the strongest signal observed in the pseudoautosomal regions of the X and Y chromosomes. The magnitude of sequence homogenization between Alu pairs is greater with shorter interlocus distance, higher sequence identity, and parallel orientation. Moreover, shared substitutions show a strong directionality toward GC nucleotides, with multiple substitutions tending to cluster within the Alu sequence. Taken together, these observed recombination-associated sequence homogenization patterns are best explained by frequent ubiquitous gene conversion events between neighboring Alus. We believe that these observations help to illuminate the nature and impact of the enigmatic phenomenon of gene conversion.
Collapse
Affiliation(s)
- Alexey Aleshin
- Department of Medicine, Division of Hematology, Oncology, David Geffen School of Medicine, University of California, Los Angeles, USA
| | | |
Collapse
|
60
|
Navarro-Costa P, Gonçalves J, Plancha CE. The AZFc region of the Y chromosome: at the crossroads between genetic diversity and male infertility. Hum Reprod Update 2010; 16:525-42. [PMID: 20304777 PMCID: PMC2918367 DOI: 10.1093/humupd/dmq005] [Citation(s) in RCA: 90] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
BACKGROUND The three azoospermia factor (AZF) regions of the Y chromosome represent genomic niches for spermatogenesis genes. Yet, the most distal region, AZFc, is a major generator of large-scale variation in the human genome. Determining to what extent this variability affects spermatogenesis is a highly contentious topic in human reproduction. METHODS In this review, an extensive characterization of the molecular mechanisms responsible for AZFc genotypical variation is undertaken. Such data are complemented with the assessment of the clinical consequences for male fertility imputable to the different AZFc variants. For this, a critical re-evaluation of 23 association studies was performed in order to extract unifying conclusions by curtailing methodological heterogeneities. RESULTS Intrachromosomal homologous recombination mechanisms, either crossover or non-crossover based, are the main drivers for AZFc genetic diversity. In particular, rearrangements affecting gene dosage are the most likely to introduce phenotypical disruptions in the spermatogenic profile. In the specific cases of partial AZFc deletions, both the actual existence and the severity of the spermatogenic defect are dependent on the evolutionary background of the Y chromosome. CONCLUSIONS AZFc is one of the most genetically dynamic regions in the human genome. This property may serve as counter against the genetic degeneracy associated with the lack of a meiotic partner. However, such strategy comes at a price: some rearrangements represent a risk factor or a de-facto causative agent of spermatogenic disruption. Interestingly, this precarious balance is modulated, among other yet unknown factors, by the evolutionary history of the Y chromosome.
Collapse
Affiliation(s)
- Paulo Navarro-Costa
- Instituto de Medicina Molecular, Faculdade de Medicina de Lisboa, Lisboa, Portugal.
| | | | | |
Collapse
|
61
|
Wong AK, Ruhe AL, Dumont BL, Robertson KR, Guerrero G, Shull SM, Ziegle JS, Millon LV, Broman KW, Payseur BA, Neff MW. A comprehensive linkage map of the dog genome. Genetics 2010; 184:595-605. [PMID: 19966068 PMCID: PMC2828735 DOI: 10.1534/genetics.109.106831] [Citation(s) in RCA: 71] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2009] [Accepted: 11/30/2009] [Indexed: 12/15/2022] Open
Abstract
We have leveraged the reference sequence of a boxer to construct the first complete linkage map for the domestic dog. The new map improves access to the dog's unique biology, from human disease counterparts to fascinating evolutionary adaptations. The map was constructed with approximately 3000 microsatellite markers developed from the reference sequence. Familial resources afforded 450 mostly phase-known meioses for map assembly. The genotype data supported a framework map with approximately 1500 loci. An additional approximately 1500 markers served as map validators, contributing modestly to estimates of recombination rate but supporting the framework content. Data from approximately 22,000 SNPs informing on a subset of meioses supported map integrity. The sex-averaged map extended 21 M and revealed marked region- and sex-specific differences in recombination rate. The map will enable empiric coverage estimates and multipoint linkage analysis. Knowledge of the variation in recombination rate will also inform on genomewide patterns of linkage disequilibrium (LD), and thus benefit association, selective sweep, and phylogenetic mapping approaches. The computational and wet-bench strategies can be applied to the reference genome of any nonmodel organism to assemble a de novo linkage map.
Collapse
Affiliation(s)
- Aaron K. Wong
- Veterinary Genetics Laboratory, School of Veterinary Medicine, University of California, Davis, California 95616, Applied Biosystems, Foster City, California 94404, Department of Biostatistics and Medical Informatics and Laboratory of Genetics, University of Wisconsin, Madison, Wisconsin 53706
| | - Alison L. Ruhe
- Veterinary Genetics Laboratory, School of Veterinary Medicine, University of California, Davis, California 95616, Applied Biosystems, Foster City, California 94404, Department of Biostatistics and Medical Informatics and Laboratory of Genetics, University of Wisconsin, Madison, Wisconsin 53706
| | - Beth L. Dumont
- Veterinary Genetics Laboratory, School of Veterinary Medicine, University of California, Davis, California 95616, Applied Biosystems, Foster City, California 94404, Department of Biostatistics and Medical Informatics and Laboratory of Genetics, University of Wisconsin, Madison, Wisconsin 53706
| | - Kathryn R. Robertson
- Veterinary Genetics Laboratory, School of Veterinary Medicine, University of California, Davis, California 95616, Applied Biosystems, Foster City, California 94404, Department of Biostatistics and Medical Informatics and Laboratory of Genetics, University of Wisconsin, Madison, Wisconsin 53706
| | - Giovanna Guerrero
- Veterinary Genetics Laboratory, School of Veterinary Medicine, University of California, Davis, California 95616, Applied Biosystems, Foster City, California 94404, Department of Biostatistics and Medical Informatics and Laboratory of Genetics, University of Wisconsin, Madison, Wisconsin 53706
| | - Sheila M. Shull
- Veterinary Genetics Laboratory, School of Veterinary Medicine, University of California, Davis, California 95616, Applied Biosystems, Foster City, California 94404, Department of Biostatistics and Medical Informatics and Laboratory of Genetics, University of Wisconsin, Madison, Wisconsin 53706
| | - Janet S. Ziegle
- Veterinary Genetics Laboratory, School of Veterinary Medicine, University of California, Davis, California 95616, Applied Biosystems, Foster City, California 94404, Department of Biostatistics and Medical Informatics and Laboratory of Genetics, University of Wisconsin, Madison, Wisconsin 53706
| | - Lee V. Millon
- Veterinary Genetics Laboratory, School of Veterinary Medicine, University of California, Davis, California 95616, Applied Biosystems, Foster City, California 94404, Department of Biostatistics and Medical Informatics and Laboratory of Genetics, University of Wisconsin, Madison, Wisconsin 53706
| | - Karl W. Broman
- Veterinary Genetics Laboratory, School of Veterinary Medicine, University of California, Davis, California 95616, Applied Biosystems, Foster City, California 94404, Department of Biostatistics and Medical Informatics and Laboratory of Genetics, University of Wisconsin, Madison, Wisconsin 53706
| | - Bret A. Payseur
- Veterinary Genetics Laboratory, School of Veterinary Medicine, University of California, Davis, California 95616, Applied Biosystems, Foster City, California 94404, Department of Biostatistics and Medical Informatics and Laboratory of Genetics, University of Wisconsin, Madison, Wisconsin 53706
| | - Mark W. Neff
- Veterinary Genetics Laboratory, School of Veterinary Medicine, University of California, Davis, California 95616, Applied Biosystems, Foster City, California 94404, Department of Biostatistics and Medical Informatics and Laboratory of Genetics, University of Wisconsin, Madison, Wisconsin 53706
| |
Collapse
|
62
|
Sholtis SJ, Noonan JP. Gene regulation and the origins of human biological uniqueness. Trends Genet 2010; 26:110-8. [PMID: 20106546 DOI: 10.1016/j.tig.2009.12.009] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2009] [Revised: 12/23/2009] [Accepted: 12/23/2009] [Indexed: 02/01/2023]
Abstract
What makes us human? It is likely that changes in gene expression and regulation, in addition to those in protein-coding genes, drove the evolution of uniquely human biological traits. In this review, we discuss how efforts to annotate regulatory functions in the human genome are being combined with maps of human-specific sequence acceleration to identify cis-regulatory elements with human-specific activity. Although the evolutionary interpretation of these events is a subject of considerable debate, the technical and analytical means are now at hand to identify the set of evolutionary genetic events that shaped our species.
Collapse
Affiliation(s)
- Samuel J Sholtis
- Department of Genetics, Yale University School of Medicine, New Haven, CT 06520, USA
| | | |
Collapse
|
63
|
Noonan JP. Regulatory DNAs and the evolution of human development. Curr Opin Genet Dev 2009; 19:557-64. [PMID: 19913404 DOI: 10.1016/j.gde.2009.10.002] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2009] [Revised: 10/06/2009] [Accepted: 10/07/2009] [Indexed: 10/20/2022]
Abstract
Changes in gene regulation have long been thought to underlie biological differences between humans and other primates. Recent advances have facilitated the study of human-specific sequence changes in regulatory DNAs in the context of human development. Comparative genomic analyses coupled with genome-wide in vivo developmental enhancer screens have identified thousands of known and likely regulatory elements in the genome. These have provided the substrate for statistical and experimental identification of regulatory sequences with human-specific developmental activities. On the basis of these early results, the outlines of an integrated strategy have emerged that combines gene expression atlases of human development, in vivo reverse genetic studies of regulatory DNAs, and maps of human-specific sequence change to reveal the genetic basis of unique human biology.
Collapse
Affiliation(s)
- James P Noonan
- Department of Genetics and Kavli Institute for Neuroscience, Yale University School of Medicine, New Haven, CT 06510, USA.
| |
Collapse
|
64
|
Pollard KS, Hubisz MJ, Rosenbloom KR, Siepel A. Detection of nonneutral substitution rates on mammalian phylogenies. Genome Res 2009; 20:110-21. [PMID: 19858363 DOI: 10.1101/gr.097857.109] [Citation(s) in RCA: 1515] [Impact Index Per Article: 101.0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Methods for detecting nucleotide substitution rates that are faster or slower than expected under neutral drift are widely used to identify candidate functional elements in genomic sequences. However, most existing methods consider either reductions (conservation) or increases (acceleration) in rate but not both, or assume that selection acts uniformly across the branches of a phylogeny. Here we examine the more general problem of detecting departures from the neutral rate of substitution in either direction, possibly in a clade-specific manner. We consider four statistical, phylogenetic tests for addressing this problem: a likelihood ratio test, a score test, a test based on exact distributions of numbers of substitutions, and the genomic evolutionary rate profiling (GERP) test. All four tests have been implemented in a freely available program called phyloP. Based on extensive simulation experiments, these tests are remarkably similar in statistical power. With 36 mammalian species, they all appear to be capable of fairly good sensitivity with low false-positive rates in detecting strong selection at individual nucleotides, moderate selection in 3-bp elements, and weaker or clade-specific selection in longer elements. By applying phyloP to mammalian multiple alignments from the ENCODE project, we shed light on patterns of conservation/acceleration in known and predicted functional elements, approximate fractions of sites subject to constraint, and differences in clade-specific selection in the primate and glires clades. We also describe new "Conservation" tracks in the UCSC Genome Browser that display both phyloP and phastCons scores for genome-wide alignments of 44 vertebrate species.
Collapse
Affiliation(s)
- Katherine S Pollard
- Gladstone Institutes, University of California, San Francisco, San Francisco, California 94158, USA.
| | | | | | | |
Collapse
|
65
|
Monoallelic expression and tissue specificity are associated with high crossover rates. Trends Genet 2009; 25:519-22. [PMID: 19850368 DOI: 10.1016/j.tig.2009.10.001] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2009] [Revised: 10/02/2009] [Accepted: 10/05/2009] [Indexed: 11/20/2022]
Abstract
What determines the recombination rate of a gene? Following the observation that, in humans, imprinted genes have unusually high recombination levels, we ask whether increased recombination is seen for other monoallelically expressed genes and, more generally, how transcriptional properties relate to recombination. We find that monoallelically expressed genes do have high crossover rates and discover a striking negative correlation between within-gene crossover rate and expression breadth. We hypothesise that these findings are possibly symptomatic of a more general, adverse relationship between recombination and transcription in the human genome.
Collapse
|
66
|
Duret L, Galtier N. Biased gene conversion and the evolution of mammalian genomic landscapes. Annu Rev Genomics Hum Genet 2009; 10:285-311. [PMID: 19630562 DOI: 10.1146/annurev-genom-082908-150001] [Citation(s) in RCA: 468] [Impact Index Per Article: 31.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Recombination is typically thought of as a symmetrical process resulting in large-scale reciprocal genetic exchanges between homologous chromosomes. Recombination events, however, are also accompanied by short-scale, unidirectional exchanges known as gene conversion in the neighborhood of the initiating double-strand break. A large body of evidence suggests that gene conversion is GC-biased in many eukaryotes, including mammals and human. AT/GC heterozygotes produce more GC- than AT-gametes, thus conferring a population advantage to GC-alleles in high-recombining regions. This apparently unimportant feature of our molecular machinery has major evolutionary consequences. Structurally, GC-biased gene conversion explains the spatial distribution of GC-content in mammalian genomes-the so-called isochore structure. Functionally, GC-biased gene conversion promotes the segregation and fixation of deleterious AT --> GC mutations, thus increasing our genomic mutation load. Here we review the recent evidence for a GC-biased gene conversion process in mammals, and its consequences for genomic landscapes, molecular evolution, and human functional genomics.
Collapse
Affiliation(s)
- Laurent Duret
- Université de Lyon 1, CNRS, UMR5558, Laboratoire de Biométrie et Biologie Evolutive, F-69622, Villeurbanne, France.
| | | |
Collapse
|
67
|
Pink CJ, Swaminathan SK, Dunham I, Rogers J, Ward A, Hurst LD. Evidence that replication-associated mutation alone does not explain between-chromosome differences in substitution rates. Genome Biol Evol 2009; 1:13-22. [PMID: 20333173 PMCID: PMC2817397 DOI: 10.1093/gbe/evp001] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/05/2009] [Indexed: 12/12/2022] Open
Abstract
Since Haldane first noticed an excess of paternally derived mutations, it has been considered that most mutations derive from errors during germ line replication. Miyata et al. (1987) proposed that differences in the rate of neutral evolution on X, Y, and autosome can be employed to measure the extent of this male bias. This commonly applied method assumes replication to be the sole source of between-chromosome variation in substitution rates. We propose a simple test of this assumption: If true, estimates of the male bias should be independent of which two chromosomal classes are compared. Prior evidence from rodents suggested that this might not be true, but conclusions were limited by a lack of rat Y-linked sequence. We therefore sequenced two rat Y-linked bacterial artificial chromosomes and determined evolutionary rate by comparison with mouse. For estimation of rates we consider both introns and synonymous rates. Surprisingly, for both data sets the prediction of congruent estimates of alpha is strongly rejected. Indeed, some comparisons suggest a female bias with autosomes evolving faster than Y-linked sequence. We conclude that the method of Miyata et al. (1987) has the potential to provide incorrect estimates. Correcting the method requires understanding of the other causes of substitution that might differ between chromosomal classes. One possible cause is recombination-associated substitution bias for which we find some evidence. We note that if, as some suggest, this association is dominantly owing to male recombination, the high estimates of alpha seen in birds is to be expected as Z chromosomes recombine in males.
Collapse
Affiliation(s)
- Catherine J Pink
- Department of Biology and Biochemistry, University of Bath, Bath, Somerset, United Kingdom
| | | | | | | | | | | |
Collapse
|
68
|
Jeffreys AJ, Neumann R. The rise and fall of a human recombination hot spot. Nat Genet 2009; 41:625-9. [PMID: 19349985 PMCID: PMC2678279 DOI: 10.1038/ng.346] [Citation(s) in RCA: 48] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2008] [Accepted: 01/16/2009] [Indexed: 12/26/2022]
Abstract
Human meiotic crossovers mainly cluster into narrow hot spots that profoundly influence patterns of haplotype diversity and that may also affect genome instability and sequence evolution. Hot spots also seem to be ephemeral, but processes of hot-spot activation and their subsequent evolutionary dynamics remain unknown. We now analyze the life cycle of a recombination hot spot. Sperm typing revealed a polymorphic hot spot that was activated in cis by a single base change, providing evidence for a primary sequence determinant necessary, though not sufficient, to activate recombination. This activating mutation occurred roughly 70,000 y ago and has persisted to the present, most likely fortuitously through genetic drift despite its systematic elimination by biased gene conversion. Nonetheless, this self-destructive conversion will eventually lead to hot-spot extinction. These findings define a subclass of highly transient hot spots and highlight the importance of understanding hot-spot turnover and how it influences haplotype diversity.
Collapse
|
69
|
Abstract
RNA is not only a messenger operating between DNA and protein. Transcription of essentially the entire eukaryotic genome generates a myriad of non-protein-coding RNA species that show complex overlapping patterns of expression and regulation. Although long noncoding RNAs (lncRNAs) are among the least well-understood of these transcript species, they cannot all be dismissed as merely transcriptional "noise." Here, we review the evolution of lncRNAs and their roles in transcriptional regulation, epigenetic gene regulation, and disease.
Collapse
Affiliation(s)
- Chris P Ponting
- MRC Functional Genomics Unit, University of Oxford, Department of Physiology, Anatomy and Genetics, South Parks Road, Oxford, OX1 3QX, UK.
| | | | | |
Collapse
|
70
|
Abstract
Genes that have experienced accelerated evolutionary rates on the human lineage during recent evolution are candidates for involvement in human-specific adaptations. To determine the forces that cause increased evolutionary rates in certain genes, we analyzed alignments of 10,238 human genes to their orthologues in chimpanzee and macaque. Using a likelihood ratio test, we identified protein-coding sequences with an accelerated rate of base substitutions along the human lineage. Exons evolving at a fast rate in humans have a significant tendency to contain clusters of AT-to-GC (weak-to-strong) biased substitutions. This pattern is also observed in noncoding sequence flanking rapidly evolving exons. Accelerated exons occur in regions with elevated male recombination rates and exhibit an excess of nonsynonymous substitutions relative to the genomic average. We next analyzed genes with significantly elevated ratios of nonsynonymous to synonymous rates of base substitution (dN/dS) along the human lineage, and those with an excess of amino acid replacement substitutions relative to human polymorphism. These genes also show evidence of clusters of weak-to-strong biased substitutions. These findings indicate that a recombination-associated process, such as biased gene conversion (BGC), is driving fixation of GC alleles in the human genome. This process can lead to accelerated evolution in coding sequences and excess amino acid replacement substitutions, thereby generating significant results for tests of positive selection. Regions of the human genome that appear to evolve rapidly may have been under strong positive selection and could contain the genetic changes responsible for the uniqueness of our species. However, neutral (nonadaptive) evolutionary processes can give rise to signals that can be mistaken as signs of selection. In this article, we identify coding sequences that have undergone accelerated rates of change in humans, affecting the divergence of the proteins they encode. By analyzing patterns of molecular evolution in these genes and their distribution in the genome, we show that many protein-coding changes in the fastest-changing genes are not a result of selection operating on the genes, but instead result from biased fixation of AT-to-GC mutations. Our findings are consistent with a model of recombination-driven biased gene conversion. This leads to the provocative hypothesis that many of the genetic changes leading to human-specific characters may have been prompted by fixation of deleterious mutations. Natural selection is commonly believed to be the main engine of functional genetic change, but a separate neutral evolutionary process linked to recombination may have contributed significantly to the divergence of human proteins.
Collapse
|
71
|
Duret L, Galtier N. Comment on "Human-specific gain of function in a developmental enhancer". Science 2009; 323:714; author reply 714. [PMID: 19197042 DOI: 10.1126/science.1165848] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
Abstract
Prabhakar et al. (Reports, 5 September 2008, p. 1346) argued that the conserved noncoding sequence HACNS1 has undergone positive selection and contributed to human adaptation. However, the pattern of substitution in HACNS1 is more consistent with the neutral process of biased gene conversion (BGC). The reported human-specific gain of function is likely due to the accumulation of deleterious mutations driven by BGC, not positive selection.
Collapse
Affiliation(s)
- Laurent Duret
- Université de Lyon, Université Lyon 1, CNRS, UMR5558, Laboratoire de Biométrie et Biologie Evolutive, F-69622, Villeurbanne, France.
| | | |
Collapse
|
72
|
Prabhakar S, Visel A, Akiyama JA, Shoukry M, Lewis KD, Holt A, Plajzer-Frick I, Morrison H, FitzPatrick DR, Afzal V, Pennacchio LA, Rubin EM, Noonan JP. Response to Comment on "Human-Specific Gain of Function in a Developmental Enhancer". Science 2009. [DOI: 10.1126/science.1166571] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
|
73
|
Freudenberg J, Wang M, Yang Y, Li W. Partial correlation analysis indicates causal relationships between GC-content, exon density and recombination rate in the human genome. BMC Bioinformatics 2009; 10 Suppl 1:S66. [PMID: 19208170 PMCID: PMC2648766 DOI: 10.1186/1471-2105-10-s1-s66] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Several features are known to correlate with the GC-content in the human genome, including recombination rate, gene density and distance to telomere. However, by testing for pairwise correlation only, it is impossible to distinguish direct associations from indirect ones and to distinguish between causes and effects. RESULTS We use partial correlations to construct partially directed graphs for the following four variables: GC-content, recombination rate, exon density and distance-to-telomere. Recombination rate and exon density are unconditionally uncorrelated, but become inversely correlated by conditioning on GC-content. This pattern indicates a model where recombination rate and exon density are two independent causes of GC-content variation. CONCLUSION Causal inference and graphical models are useful methods to understand genome evolution and the mechanisms of isochore evolution in the human genome.
Collapse
Affiliation(s)
- Jan Freudenberg
- The Robert S Boas Center for Genomics and Human GeneticsFeinstein Institute for Medical Research, North Shore LIJ Health System, Manhasset, NY 11030, USA.
| | | | | | | |
Collapse
|
74
|
|
75
|
Galtier N, Duret L, Glémin S, Ranwez V. GC-biased gene conversion promotes the fixation of deleterious amino acid changes in primates. Trends Genet 2009; 25:1-5. [DOI: 10.1016/j.tig.2008.10.011] [Citation(s) in RCA: 153] [Impact Index Per Article: 10.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2008] [Revised: 10/17/2008] [Accepted: 10/24/2008] [Indexed: 01/22/2023]
|
76
|
Hellmann I, Mang Y, Gu Z, Li P, de la Vega FM, Clark AG, Nielsen R. Population genetic analysis of shotgun assemblies of genomic sequences from multiple individuals. Genes Dev 2008; 18:1020-9. [PMID: 18411405 PMCID: PMC2493391 DOI: 10.1101/gr.074187.107] [Citation(s) in RCA: 79] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2007] [Accepted: 04/07/2008] [Indexed: 01/25/2023]
Abstract
We introduce a simple, broadly applicable method for obtaining estimates of nucleotide diversity from genomic shotgun sequencing data. The method takes into account the special nature of these data: random sampling of genomic segments from one or more individuals and a relatively high error rate for individual reads. Applying this method to data from the Celera human genome sequencing and SNP discovery project, we obtain estimates of nucleotide diversity in windows spanning the human genome and show that the diversity to divergence ratio is reduced in regions of low recombination. Furthermore, we show that the elevated diversity in telomeric regions is mainly due to elevated mutation rates and not due to decreased levels of background selection. However, we find indications that telomeres as well as centromeres experience greater impact from natural selection than intrachromosomal regions. Finally, we identify a number of genomic regions with increased or reduced diversity compared with the local level of human-chimpanzee divergence and the local recombination rate.
Collapse
Affiliation(s)
- Ines Hellmann
- Departments of Integrative Biology and Statistics, University of California, Berkeley, California 94720, USA.
| | | | | | | | | | | | | |
Collapse
|
77
|
Duret L, Arndt PF. The impact of recombination on nucleotide substitutions in the human genome. PLoS Genet 2008; 4:e1000071. [PMID: 18464896 PMCID: PMC2346554 DOI: 10.1371/journal.pgen.1000071] [Citation(s) in RCA: 254] [Impact Index Per Article: 15.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2007] [Accepted: 04/11/2008] [Indexed: 01/19/2023] Open
Abstract
Unraveling the evolutionary forces responsible for variations of neutral substitution patterns among taxa or along genomes is a major issue for detecting selection within sequences. Mammalian genomes show large-scale regional variations of GC-content (the isochores), but the substitution processes at the origin of this structure are poorly understood. We analyzed the pattern of neutral substitutions in 1 Gb of primate non-coding regions. We show that the GC-content toward which sequences are evolving is strongly negatively correlated to the distance to telomeres and positively correlated to the rate of crossovers (R2 = 47%). This demonstrates that recombination has a major impact on substitution patterns in human, driving the evolution of GC-content. The evolution of GC-content correlates much more strongly with male than with female crossover rate, which rules out selectionist models for the evolution of isochores. This effect of recombination is most probably a consequence of the neutral process of biased gene conversion (BGC) occurring within recombination hotspots. We show that the predictions of this model fit very well with the observed substitution patterns in the human genome. This model notably explains the positive correlation between substitution rate and recombination rate. Theoretical calculations indicate that variations in population size or density in recombination hotspots can have a very strong impact on the evolution of base composition. Furthermore, recombination hotspots can create strong substitution hotspots. This molecular drive affects both coding and non-coding regions. We therefore conclude that along with mutation, selection and drift, BGC is one of the major factors driving genome evolution. Our results also shed light on variations in the rate of crossover relative to non-crossover events, along chromosomes and according to sex, and also on the conservation of hotspot density between human and chimp. Mammalian genomes show a very strong heterogeneity of base composition along chromosomes (the so-called isochores). The functional significance of these peculiar genomic landscapes is highly debated: do isochores confer some selective advantage, or are they simply the by-product of neutral evolutionary processes? To resolve this issue, we analyzed the pattern of substitution in the human genome by comparison with chimpanzee and macaque. We show that the evolution of base composition (GC-content) is essentially determined by the rate of recombination. This effect appears to be much stronger in male than in female germline, which rules out selective explanations for the evolution of isochores. We show that this impact of recombination is most probably a consequence of the process of biased gene conversion (BGC). This neutral process mimics the action of selection and can induce strong substitution hotspots within recombination hotspots, sometimes leading to the fixation of deleterious mutations. BGC appears to be one of the major factors driving genome evolution. It is therefore essential to take this process into account if we want to be able to interpret genome sequences.
Collapse
Affiliation(s)
- Laurent Duret
- Laboratoire de Biométrie et Biologie Evolutive, Université de Lyon, Université Lyon 1, CNRS, UMR 5558, Villeurbanne, France
- * E-mail: (LD); (PFA)
| | - Peter F. Arndt
- Department for Computational Molecular Biology, Max Planck Institute for Molecular Genetics, Berlin, Germany
- * E-mail: (LD); (PFA)
| |
Collapse
|
78
|
Kehrer-Sawatzki H, Cooper DN. Molecular mechanisms of chromosomal rearrangement during primate evolution. Chromosome Res 2008; 16:41-56. [PMID: 18293104 DOI: 10.1007/s10577-007-1207-1] [Citation(s) in RCA: 55] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
Abstract
Breakpoint analysis of the large chromosomal rearrangements which have occurred during primate evolution promises to yield new insights into the underlying mechanisms of mutagenesis. Comparison of these evolutionary breakpoints with those that are disease-associated in humans, and which occur during either meiotic or mitotic cell division, should help to identify basic mechanistic similarities as well as differences. It has recently become clear that segmental duplications (SDs) have had a very significant impact on genome plasticity during primate evolution. In comparisons of the human and chimpanzee genomes, SDs have been found in flanking regions of 70-80% of inversions and approximately 40% of deletions/duplications. A strong spatial association between primate-specific breakpoints and SDs has also become evident from comparisons of human with other mammalian genomes. The lineage-specific hyperexpansion of certain SDs observed in the genomes of human, chimpanzee, gorilla and gibbon is indicative of the intrinsic instability of some SDs in primates. However, since many primate-specific breakpoints map to regions lacking SDs, but containing interspersed high-copy repetitive sequence elements such as SINEs, LINEs, LTRs, alpha-satellites and (AT)( n ) repeats, we may infer that a range of different molecular mechanisms have probably been involved in promoting chromosomal breakage during the evolution of primate genomes.
Collapse
|
79
|
Tyekucheva S, Makova KD, Karro JE, Hardison RC, Miller W, Chiaromonte F. Human-macaque comparisons illuminate variation in neutral substitution rates. Genome Biol 2008; 9:R76. [PMID: 18447906 PMCID: PMC2643947 DOI: 10.1186/gb-2008-9-4-r76] [Citation(s) in RCA: 51] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2008] [Revised: 04/04/2008] [Accepted: 04/30/2008] [Indexed: 11/10/2022] Open
Abstract
The evolutionary distance between human and macaque is particularly attractive for investigating neutral substitution rates, which were calculated as a function of a number of genomic parameters. Background The evolutionary distance between human and macaque is particularly attractive for investigating local variation in neutral substitution rates, because substitutions can be inferred more reliably than in comparisons with rodents and are less influenced by the effects of current and ancient diversity than in comparisons with closer primates. Here we investigate the human-macaque neutral substitution rate as a function of a number of genomic parameters. Results Using regression analyses we find that male mutation bias, male (but not female) recombination rate, distance to telomeres and substitution rates computed from orthologous regions in mouse-rat and dog-cow comparisons are prominent predictors of the neutral rate. Additionally, we demonstrate that the previously observed biphasic relationship between neutral rate and GC content can be accounted for by properly combining rates at CpG and non-CpG sites. Finally, we find the neutral rate to be negatively correlated with the densities of several classes of computationally predicted functional elements, and less so with the densities of certain classes of experimentally verified functional elements. Conclusion Our results suggest that while female recombination may be mainly responsible for driving evolution in GC content, male recombination may be mutagenic, and that other mutagenic mechanisms acting near telomeres, and mechanisms whose effects are shared across mammalian genomes, play significant roles. We also have evidence that the nonlinear increase in rates at high GC levels may be largely due to hyper-mutability of CpG dinucleotides. Finally, our results suggest that the performance of conservation-based prediction methods can be improved by accounting for neutral rates.
Collapse
Affiliation(s)
- Svitlana Tyekucheva
- Center for Comparative Genomics and Bioinformatics, The Pennsylvania State University, University Park, PA 16802, USA
| | | | | | | | | | | |
Collapse
|