201
|
Chen CL, Perasso R, Qu LH, Amar L. Exploration of pairing constraints identifies a 9 base-pair core within box C/D snoRNA-rRNA duplexes. J Mol Biol 2007; 369:771-83. [PMID: 17459411 DOI: 10.1016/j.jmb.2007.03.052] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2006] [Revised: 03/16/2007] [Accepted: 03/19/2007] [Indexed: 10/23/2022]
Abstract
2'-O-ribose methylation of eukaryotic ribosomal RNAs is guided by RNA duplexes consisting of rRNA and box C/D small nucleolar (sno)RNA sequences, the methylated sites invariably mapping five positions apart from the D box. Here we have analyzed the RNA duplex pairing constraints by investigating the features of 415 duplexes from the fungus, plant and animal kingdoms, and the evolution of those duplexes within the 124 sets they group into. The D-box upstream 1st and >or=15th positions consist of Watson-Crick base-pairs, G:U base-pairs and mismatched bases with ratios close to random assortments; these positions display single base differences in >60% of the RNA duplex sets. The D-box upstream 2nd to 11th positions have >90% Watson-Crick base-pairs; they display single base mutations with a U-shaped distribution of lower values of 0% and 1.6% at the methylated site 5th and 4th positions, and double compensatory mutations leading to new Watson-Crick base-pairs with an inverted U-shaped distribution of higher values at the 8th to 11th positions. Half of the single mutations at the 3rd to 11th positions resulted in G:U base-pairing, mainly through A-->G mutations in the rRNA strands and C-->T mutations in the snoRNA strands. Double compensatory mutations at the 3rd to 11th positions are extremely frequent, representing 36% of all mutations; they frequently arose from an A-->G mutation in the rRNA strands followed by a T-->C mutation in the snoRNA strands. Differences in the mutational pathways through which the rRNA and snoRNA strand evolved must be related to differences in the rRNA and snoRNA copy number and gene organization. Altogether these data identify the D-box upstream 3rd to 11th positions as box C/D snoRNA-rRNA duplex cores. The impact of the pairing constraints on the evolution of the 9 base-pair RNA duplex cores is discussed.
Collapse
Affiliation(s)
- Chun-Long Chen
- Key Laboratory of Gene Engineering of the Ministry of Education, State Key Laboratory for Biocontrol, Zhongshan University, Guangzhou, 510275, People's Republic of China
| | | | | | | |
Collapse
|
202
|
Cutter AD, Charlesworth B. Selection intensity on preferred codons correlates with overall codon usage bias in Caenorhabditis remanei. Curr Biol 2006; 16:2053-7. [PMID: 17055986 DOI: 10.1016/j.cub.2006.08.067] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2006] [Revised: 08/23/2006] [Accepted: 08/24/2006] [Indexed: 11/16/2022]
Abstract
Adaptive codon usage provides evidence of natural selection in one of its most subtle forms: a fitness benefit of one synonymous codon relative to another. Codon usage bias is evident in the coding sequences of a broad array of taxa, reflecting selection for translational efficiency and/or accuracy as well as mutational biases. Here, we quantify the magnitude of selection acting on alternative codons in genes of the nematode Caenorhabditis remanei, an outcrossing relative of the model organism C. elegans, by fitting the expected mutation-selection-drift equilibrium frequency distribution of preferred and unpreferred codon variants to the empirical distribution. This method estimates the intensity of selection on synonymous codons in genes with high codon bias as N(e)s = 0.17, a value significantly greater than zero. In addition, we demonstrate for the first time that estimates of ongoing selection on codon usage among genes, inferred from nucleotide polymorphism data, correlate strongly with long-term patterns of codon usage bias, as measured by the frequency of optimal codons in a gene. From the pattern of polymorphisms in introns, we also infer that these findings do not result from the operation of biased gene conversion toward G or C nucleotides. We therefore conclude that coincident patterns of current and ancient selection are responsible for shaping biased codon usage in the C. remanei genome.
Collapse
Affiliation(s)
- Asher D Cutter
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh EH9 3JT, United Kingdom.
| | | |
Collapse
|
203
|
Wright SI, Iorgovan G, Misra S, Mokhtari M. Neutral evolution of synonymous base composition in the Brassicaceae. J Mol Evol 2006; 64:136-41. [PMID: 17160366 DOI: 10.1007/s00239-005-0274-1] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2005] [Accepted: 10/03/2006] [Indexed: 10/23/2022]
Abstract
The GC content of synonymous sites is elevated in genes from both Brassica oleraceae and Arabidopsis lyrata compared with Arabidopsis thaliana. However, this shift in base composition is independent of gene expression level, and there is no evidence for a similar difference in the frequency of codons preferred by translational selection. The results suggest that composition evolution is caused by a change in mutation bias or biased gene conversion, rather than by a reduction in the efficacy of natural selection in selfing Arabidopsis.
Collapse
|
204
|
Chen JF, Lu F, Chen SS, Tao SH. Significant positive correlation between the recombination rate and GC content in the human pseudoautosomal region. Genome 2006; 49:413-9. [PMID: 16767166 DOI: 10.1139/g05-124] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
This paper establishes that recombination drives the evolution of GC content in a significant way. Because the human P-arm pseudoautosomal region (PAR1) has been shown to have a high recombination rate, at least 20-fold more frequent than the genomic average of approximately 1 cM/Mb, this region provides an ideal system to study the role of recombination in the evolution of base composition. Nine non-coding regions of PAR1 are analyzed in this study. We have observed a highly significant positive correlation between the recombination rate and GC content (rho = 0.837, p < or = 0.005). Five regions that lie in the distal part of PAR1 are shown to be significantly higher than genomic average divergence. By comparing the intra- and inter-specific AT->GC -GC->AT ratios, we have detected no fixation bias toward GC alleles except for L254915, which has excessive AT-->GC changes in the human lineage. Thus, we conclude that the high GC content of the PAR1 genes better fits the biased gene conversion (BGC) model.
Collapse
Affiliation(s)
- Jin-Feng Chen
- Institute of Bioinformatics, Northwest Agriculture and Forest University, Yangling, Shaanxi, China
| | | | | | | |
Collapse
|
205
|
Bartolomé C, Charlesworth B. Evolution of amino-acid sequences and codon usage on the Drosophila miranda neo-sex chromosomes. Genetics 2006; 174:2033-44. [PMID: 17028318 PMCID: PMC1698622 DOI: 10.1534/genetics.106.064113] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
We have studied patterns of DNA sequence variation and evolution for 22 genes located on the neo-X and neo-Y chromosomes of Drosophila miranda. As found previously, nucleotide site diversity is greatly reduced on the neo-Y chromosome, with a severely distorted frequency spectrum. There is also an accelerated rate of amino-acid sequence evolution on the neo-Y chromosome. Comparisons of nonsynonymous and silent variation and divergence suggest that amino-acid sequences on the neo-X chromosome are subject to purifying selection, whereas this is much weaker on the neo-Y. The same applies to synonymous variants affecting codon usage. There is also an indication of a recent relaxation of selection on synonymous mutations for genes on other chromosomes. Genes that are weakly expressed on the neo-Y chromosome appear to have a faster rate of accumulation of both nonsynonymous and unpreferred synonymous mutations than genes with high levels of expression, although the rate of accumulation when both types of mutation are pooled is higher for the neo-Y chromosome than for the neo-X chromosome even for highly expressed genes.
Collapse
Affiliation(s)
- Carolina Bartolomé
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh EH9 3JT, United Kingdom.
| | | |
Collapse
|
206
|
Cutter AD, Baird SE, Charlesworth D. High nucleotide polymorphism and rapid decay of linkage disequilibrium in wild populations of Caenorhabditis remanei. Genetics 2006; 174:901-13. [PMID: 16951062 PMCID: PMC1602088 DOI: 10.1534/genetics.106.061879] [Citation(s) in RCA: 99] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
The common ancestor of the self-fertilizing nematodes Caenorhabditis elegans and C. briggsae must have reproduced by obligate outcrossing, like most species in this genus. However, we have only a limited understanding about how genetic variation is patterned in such male-female (gonochoristic) Caenorhabditis species. Here, we report results from surveying nucleotide variation of six nuclear loci in a broad geographic sample of wild isolates of the gonochoristic C. remanei. We find high levels of diversity in this species, with silent-site diversity averaging 4.7%, implying an effective population size close to 1 million. Additionally, the pattern of polymorphisms reveals little evidence for population structure or deviation from neutral expectations, suggesting that the sampled C. remanei populations approximate panmixis and demographic equilibrium. Combined with the observation that linkage disequilibrium between pairs of polymorphic sites decays rapidly with distance, this suggests that C. remanei will provide an excellent system for identifying the genetic targets of natural selection from deviant patterns of polymorphism and linkage disequilibrium. The patterns revealed in this obligately outcrossing species may provide a useful model of the evolutionary circumstances in C. elegans' gonochoristic progenitor. This will be especially important if self-fertilization evolved recently in C. elegans history, because most of the evolutionary time separating C. elegans from its known relatives would have occurred in a state of obligate outcrossing.
Collapse
Affiliation(s)
- Asher D Cutter
- Institute of Evolutionary Biology, University of Edinburgh, United Kingdom.
| | | | | |
Collapse
|
207
|
Abstract
Natural genetic transformation can facilitate gene transfer in many genera of bacteria and requires the presence of extracellular DNA. Although cell lysis can contribute to this extracellular DNA pool, several studies have suggested that the secretion of DNA from living bacteria may also provide genetic material for transformation. This paper reviews the evidence for specific secretion of DNA from intact bacteria into the extracellular environment and examines this behaviour from a population-genetics perspective. A mathematical model demonstrates that the joint action of DNA secretion and transformation creates a novel type of gene-level natural selection. This model demonstrates that gene-level selection could explain the existence of DNA secretion mechanisms that provide no benefit to individual cells or populations of bacteria. Additionally, the model predicts that any trait affecting DNA secretion will experience selection at the gene level in a transforming population. This analysis confirms that the secretion of DNA from intact bacterial cells is fully explicable with evolutionary theory, and reveals a novel mechanism for bacterial evolution.
Collapse
Affiliation(s)
- Jeremy A Draghi
- Department of Ecology and Evolutionary Biology, Yale University, New Haven, CT 06520, USA
| | - Paul E Turner
- Department of Ecology and Evolutionary Biology, Yale University, New Haven, CT 06520, USA
| |
Collapse
|
208
|
Cutter AD, Wasmuth JD, Blaxter ML. The evolution of biased codon and amino acid usage in nematode genomes. Mol Biol Evol 2006; 23:2303-15. [PMID: 16936139 DOI: 10.1093/molbev/msl097] [Citation(s) in RCA: 72] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
Despite the degeneracy of the genetic code, whereby different codons encode the same amino acid, alternative codons and amino acids are utilized nonrandomly within and between genomes. Such biases in codon and amino acid usage have been demonstrated extensively in prokaryote genomes and likely reflect a balance between the action of mutation, selection, and genetic drift. Here, we quantify the effects of selection and mutation drift as causes of codon and amino acid-usage bias in a large collection of nematode partial genomes from 37 species spanning approximately 700 Myr of evolution, as inferred from expressed sequence tag (EST) measures of gene expression and from base composition variation. Average G + C content at silent sites among these taxa ranges from 10% to 63%, and EST counts range more than 100-fold, underlying marked differences between the identities of major codons and optimal codons for a given species as well as influencing patterns of amino acid abundance among taxa. Few species in our sample demonstrate a dominant role of selection in shaping intragenomic codon-usage biases, and these are principally free living rather than parasitic nematodes. This suggests that deviations in effective population size among species, with small effective sizes among parasites, are partly responsible for species differences in the extent to which selection shapes patterns of codon usage. Nevertheless, a consensus set of optimal codons emerges that is common to most taxa, indicating that, with some notable exceptions, selection for translational efficiency and accuracy favors similar sets of codons regardless of the major codon-usage trends defined by base compositional properties of individual nematode genomes.
Collapse
Affiliation(s)
- Asher D Cutter
- Institute of Evolutionary Biology, University of Edinburgh, Edinburgh, United Kingdom.
| | | | | |
Collapse
|
209
|
Duret L, Eyre-Walker A, Galtier N. A new perspective on isochore evolution. Gene 2006; 385:71-4. [PMID: 16971063 DOI: 10.1016/j.gene.2006.04.030] [Citation(s) in RCA: 79] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2006] [Accepted: 04/07/2006] [Indexed: 11/30/2022]
Abstract
The genomes of mammals and birds show dramatic variation in base composition over large scales, the so called isochore structure of the genome. The origin of isochores is still controversial: various neutral and selectionist models have been proposed--and criticized--since the discovery of isochores in the 1970s. The availability of complete mammalian genomes has yielded new opportunities for addressing this question. In particular, it was recently proposed that (i) the isochore structure is declining in many mammalian groups, and that (ii) GC-content is influenced by local recombination rate, possibly via the mechanism of GC-biased gene conversion. In this article we review the existing support for these two hypotheses, and discuss how they can be combined to provide a new perspective on isochore evolution.
Collapse
Affiliation(s)
- Laurent Duret
- CNRS UMR 5558, BBE, Université C. Bernard Lyon 1, France
| | | | | |
Collapse
|
210
|
Abstract
Recombination and mutation have traditionally been regarded as independent evolutionary processes: the latter generates variation, which the former reshuffles. Recent studies, however, have suggested that allelic recombination influences the underlying mutation rate, as high mutation rates are inferred in regions of high recombination. Furthermore, recombination between duplicated sequences introduces structural variation into the human genome and facilitates the formation of clustered gene families. Comparisons of wholegenome sequences reveal the expansion of gene family clusters to be an important mode of genome evolution. The negative aspect of this genomic dynamism is the contribution of these rearrangements to genetic diseases.
Collapse
Affiliation(s)
- Matthew Hurles
- Wellcome Trust Sanger Institute, Genome Campus, Cambridge, CB10 1SA, UK.
| |
Collapse
|
211
|
Zhang F, Peterson T. Gene conversion between direct noncoding repeats promotes genetic and phenotypic diversity at a regulatory locus of Zea mays (L.). Genetics 2006; 174:753-62. [PMID: 16816430 PMCID: PMC1602084 DOI: 10.1534/genetics.105.053942] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
While evolution of coding sequences has been intensively studied, diversification of noncoding regulatory regions remains poorly understood. In this study, we investigated the molecular evolution of an enhancer region located 5 kb upstream of the transcription start site of the maize pericarp color1 (p1) gene. The p1 gene encodes an R2R3 Myb-like transcription factor that regulates the flavonoid biosynthetic pathway in maize floral organs. Distinct p1 alleles exhibit organ-specific expression patterns on kernel pericarp and cob glumes. A cob glume-specific regulatory region has been identified in the distal enhancer. Further characterization of 6 single-copy p1 alleles, including P1-rr (red pericarp/red cob) and P1-rw (red pericarp and white cob), reveals 3 distinct enhancer types. Sequence variations in the enhancer are correlated with the p1 gene expression patterns in cob glume. Structural comparisons and phylogenetic analyses suggest that evolution of the enhancer region is likely driven by gene conversion between long direct noncoding repeats (approximately 6 kb in length). Given that tandem and segmental duplications are common in both animal and plant genomes, our studies suggest that recombination between noncoding duplicated sequences could play an important role in creating genetic and phenotypic variations.
Collapse
Affiliation(s)
- Feng Zhang
- Department of Genetics, Development and Cell Biology and Department of Agronomy, Iowa State University, Ames, Iowa 50011, USA
| | | |
Collapse
|
212
|
Khelifi A, Meunier J, Duret L, Mouchiroud D. GC content evolution of the human and mouse genomes: insights from the study of processed pseudogenes in regions of different recombination rates. J Mol Evol 2006; 62:745-52. [PMID: 16752212 DOI: 10.1007/s00239-005-0186-0] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2005] [Accepted: 02/02/2006] [Indexed: 01/27/2023]
Abstract
Processed pseudogenes are generated by reverse transcription of a functional gene. They are generally nonfunctional after their insertion and, as a consequence, are no longer subjected to the selective constraints associated with functional genes. Because of this property they can be used as neutral markers in molecular evolution. In this work, we investigated the relationship between the evolution of GC content in recently inserted processed pseudogenes and the local recombination pattern in two mammalian genomes (human and mouse). We confirmed, using original markers, that recombination drives GC content in the human genome and we demonstrated that this is also true for the mouse genome despite lower recombination rates. Finally, we discussed the consequences on isochores evolution and the contrast between the human and the mouse pattern.
Collapse
Affiliation(s)
- Adel Khelifi
- Laboratoire de Biométrie et Biologie Evolutive, UMR CNRS 5558, Université Claude Bernard-Lyon 1, 16 rue Raphael Dubois, 69622 Villeurbanne Cedex, France.
| | | | | | | |
Collapse
|
213
|
Comeron JM. Weak selection and recent mutational changes influence polymorphic synonymous mutations in humans. Proc Natl Acad Sci U S A 2006; 103:6940-5. [PMID: 16632609 PMCID: PMC1458998 DOI: 10.1073/pnas.0510638103] [Citation(s) in RCA: 34] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Recent large-scale genomic and evolutionary studies have revealed the small but detectable signature of weak selection on synonymous mutations during mammalian evolution, likely acting at the level of translational efficacy (i.e., translational selection). To investigate whether weak selection, and translational selection in particular, plays any role in shaping the fate of synonymous mutations that are present today in human populations, we studied genetic variation at the polymorphic level and patterns of evolution in the human lineage after human-chimpanzee separation. We find evidence that neutral mechanisms are influencing the frequency of polymorphic mutations in humans. Our results suggest a recent increase in mutational tendencies toward AT, observed in all isochores, that is responsible for AT mutations segregating at lower frequencies than GC mutations. In all, however, changes in mutational tendencies and other neutral scenarios are not sufficient to explain a difference between synonymous and noncoding mutations or a difference between synonymous mutations potentially advantageous or deleterious under a translational selection model. Furthermore, several estimates of selection intensity on synonymous mutations all suggest a detectable influence of weak selection acting at the level of translational selection. Thus, random genetic drift, recent changes in mutational tendencies, and weak selection influence the fate of synonymous mutations that are present today as polymorphisms. All of these features, neutral and selective, should be taken into account in evolutionary analyses that often assume constancy of mutational tendencies and complete neutrality of synonymous mutations.
Collapse
Affiliation(s)
- Josep M Comeron
- Department of Biological Sciences, University of Iowa, 212 Biology Building, Iowa City, IA 52242, USA.
| |
Collapse
|
214
|
Khakhlova O, Bock R. Elimination of deleterious mutations in plastid genomes by gene conversion. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2006; 46:85-94. [PMID: 16553897 DOI: 10.1111/j.1365-313x.2006.02673.x] [Citation(s) in RCA: 221] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/07/2023]
Abstract
Asexual reproduction is believed to be detrimental, mainly because of the accumulation of deleterious mutations over time, a hypothesis known as Muller's ratchet. In seed plants, most asexually reproducing genetic systems are polyploid, with apomictic species (plants forming seeds without fertilization) as well as plastids and mitochondria providing prominent examples. Whether or not polyploidy helps asexual genetic systems to escape Muller's ratchet is unknown. Gene conversion, particularly when slightly biased, represents a potential mechanism that could allow asexual genetic systems to reduce their mutation load in a genome copy number-dependent manner. However, direct experimental evidence for the operation of gene conversion between genome molecules to correct mutations is largely lacking. Here we describe an experimental system based on transgenic tobacco chloroplasts that allows us to analyze gene conversion events in higher plant plastid genomes. We provide evidence for gene conversion acting as a highly efficient mechanism by which the polyploid plastid genetic system can correct deleterious mutations and make one good genome out of two bad ones. Our finding that gene conversion can be biased may provide a molecular link between asexual reproduction, high genome copy numbers and low mutation rates.
Collapse
Affiliation(s)
- Olga Khakhlova
- Max-Planck-Institut für Molekulare Pflanzenphysiologie, Am Mühlenberg 1, D-14476 Potsdam-Golm, Germany
| | | |
Collapse
|
215
|
Webster MT, Axelsson E, Ellegren H. Strong Regional Biases in Nucleotide Substitution in the Chicken Genome. Mol Biol Evol 2006; 23:1203-16. [PMID: 16551647 DOI: 10.1093/molbev/msk008] [Citation(s) in RCA: 87] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Interspersed repeats have emerged as a valuable tool for studying neutral patterns of molecular evolution. Here we analyze variation in the rate and pattern of nucleotide substitution across all autosomes in the chicken genome by comparing the present-day CR1 repeat sequences with their ancestral copies and reconstructing nucleotide substitutions with a maximum likelihood model. The results shed light on the origin and evolution of large-scale heterogeneity in GC content found in the genomes of birds and mammals--the isochore structure. In contrast to mammals, where GC content is becoming homogenized, heterogeneity in GC content is being reinforced in the chicken genome. This is also supported by patterns of substitution inferred from alignments of introns in chicken, turkey, and quail. Analysis of individual substitution frequencies is consistent with the biased gene conversion (BGC) model of isochore evolution, and it is likely that patterns of evolution in the chicken genome closely resemble those in the ancestral amniote genome, when it is inferred that isochores originated. Microchromosomes and distal regions of macrochromosomes are found to have elevated substitution rates and a more GC-biased pattern of nucleotide substitution. This can largely be accounted for by a strong correlation between GC content and the rate and pattern of substitution. The results suggest that an interaction between increased mutability at CpG motifs and fixation biases due to BGC could explain increased levels of divergence in GC-rich regions.
Collapse
Affiliation(s)
- Matthew T Webster
- Department of Evolution, Genomics and Systematics, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden.
| | | | | |
Collapse
|
216
|
Abstract
Sequencing of the dog genome allows an investigation of the location-dependent evolutionary processes that occurred since the common ancestor of primates and carnivores, approximately 95 million years ago. We investigated variations in G+C nucleotide fraction and synonymous nucleotide substitution rates (Ks) across dog and human genomes. Our results show that dog genes located either in subtelomeric and pericentromeric regions, or in short synteny blocks, possess significantly elevated G+C fraction and Ks values. Human subtelomeric, but not pericentromeric, genes also exhibit these elevations. We then examined 1.048 Gb of human sequence that is likely not to have been located near a primate telomere at any time since the common ancestor of dog and human. We observed that regions of highest G+C or Ks ("hotspots"; median sizes of 0.5 or 1.3 Mb, respectively) within this sequence were preferentially segregated to dog subtelomeres and pericentromeres during the rearrangements that eventually gave rise to the extant canine karyotype. Our data cannot be accounted for solely on the basis of gradually elevating G+C fractions in subtelomeric regions as a consequence of biased gene conversion. Rather, we propose that high G+C sequences are found preferentially within dog subtelomeres as a direct consequence of chromosomal fission occurring more frequently within regions elevated in G+C.
Collapse
Affiliation(s)
- Caleb Webber
- MRC Functional Genetics Unit, Department of Human Anatomy and Genetics, University of Oxford, Oxford OX1 3QX, United Kingdom.
| | | |
Collapse
|
217
|
Joly E. Various hypotheses on MHC evolution suggested by the concerted evolution of CD94L and MHC class Ia molecules. Biol Direct 2006; 1:3. [PMID: 16542034 PMCID: PMC1397804 DOI: 10.1186/1745-6150-1-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2006] [Accepted: 01/31/2006] [Indexed: 01/05/2023] Open
Abstract
Background In the accompanying paper by Virginie Rouillon and myself, our demonstration that homogenisation by gene conversion occurs readily among MHC class I genes was made possible because of the exceptional conservation of the CD94L locus between divergent species of separate taxa, suggesting that the molecules of this family are endowed with very important and well preserved biological functions. These results lead me to elaborate various hypotheses on several aspects of MHC evolution. Hypotheses In a first part, I propose a highly hypothetical scenario of MHC evolution that could explain how modern day CD94L molecules can have so many diverse and well preserved biological functions. Next, I propose that MHC class I molecules evolve more rapidly and exuberantly than class II molecules because the former are subjected to more direct selective pressures, in particular from viruses. Third, I suggest that concerted evolution, by increasing inter-genic homogeneity would in turn favour further inter-allelic and inter-loci exchanges, hence resulting in a more evolvable MHC. As a fourth and last point, I propose that the high GC content of sequences coding for classical class I molecules could be a consequence of biased gene conversion. Testing of these various hypotheses should occur naturally over the coming years, with the ever increasing availability of more sequences related to MHC class I genes from various organisms. Ultimately, a better understanding of how MHC molecules evolve may help to decipher where and how our adaptive immune system arose, and keeps evolving in the face of the permanent challenge of infectious organisms. Reviewers This article was reviewed by Stephan Beck, Lutz Walter and Pierre Pontarotti.
Collapse
Affiliation(s)
- Etienne Joly
- Equipe de Neuro-Immuno-Génétique Moléculaire, IPBS, UMR CNRS 5089, 205 route de Narbonne, 31077 Toulouse Cedex, France.
| |
Collapse
|
218
|
Bussell JJ, Pearson NM, Kanda R, Filatov DA, Lahn BT. Human polymorphism and human-chimpanzee divergence in pseudoautosomal region correlate with local recombination rate. Gene 2005; 368:94-100. [PMID: 16356662 DOI: 10.1016/j.gene.2005.10.020] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2005] [Revised: 10/12/2005] [Accepted: 10/17/2005] [Indexed: 11/21/2022]
Abstract
Previous studies have shown widespread correlation between nucleotide polymorphism and recombination rate, but the cause of this correlation is unresolved. One explanation is that recombination is associated with point mutations, potentially through mutagenic effects of meiotic crossover. This hypothesis predicts that regions of frequent recombination should show both elevated nucleotide diversity within a species and increased nucleotide divergence between species. Here we tested this hypothesis by studying the human short-arm pseudoautosomal region (PAR1), which recombines between X and Y chromosomes in men at a rate approximately 20 times the genome average. We sequenced dispersed intronic loci within PAR1 in a panel of humans and in the chimpanzee and directly measured sequence variation and recombination rate from these data. In line with previous reports, we saw a correlation between human polymorphism level and local recombination rate. Moreover, we also found a highly significant correlation between human-chimpanzee divergence and recombination rate. These results are consistent with the hypothesis that recombination is associated with point mutations, possibly because recombination is mutagenic.
Collapse
Affiliation(s)
- Jennifer J Bussell
- Howard Hughes Medical Institute, Department of Human Genetics, University of Chicago, 920 East 58th Street, Chicago, IL 60637, USA
| | | | | | | | | |
Collapse
|
219
|
Fearnhead P, Smith NGC. A novel method with improved power to detect recombination hotspots from polymorphism data reveals multiple hotspots in human genes. Am J Hum Genet 2005; 77:781-94. [PMID: 16252238 PMCID: PMC1271387 DOI: 10.1086/497579] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2005] [Accepted: 08/25/2005] [Indexed: 11/03/2022] Open
Abstract
We introduce a new method for detection of recombination hotspots from population genetic data. This method is based on (a) defining an (approximate) penalized likelihood for how recombination rate varies with physical position and (b) maximizing this penalized likelihood over possible sets of recombination hotspots. Simulation results suggest that this is a more powerful method for detection of hotspots than are existing methods. We apply the method to data from 89 genes sequenced in African American and European American populations. We find many genes with multiple hotspots, and some hotspots show evidence of being population-specific. Our results suggest that hotspots are randomly positioned within genes and could be as frequent as one per 30 kb.
Collapse
Affiliation(s)
- Paul Fearnhead
- Department of Mathematics and Statistics, Lancaster University, Lancaster, United Kingdom.
| | | |
Collapse
|
220
|
Abstract
The study of base composition evolution in Drosophila has been achieved mostly through the analysis of coding sequences. Third codon position GC content, however, is influenced by both neutral forces (e.g., mutation bias) and natural selection for codon usage optimization. In this article, large data sets of noncoding DNA sequence polymorphism in D. melanogaster and D. simulans were gathered from public databases to try to disentangle these two factors-noncoding sequences are not affected by selection for codon usage. Allele frequency analyses revealed an asymmetric pattern of AT vs. GC noncoding polymorphisms: AT --> GC mutations are less numerous, and tend to segregate at a higher frequency, than GC --> AT ones, especially at GC-rich loci. This is indicative of nonstationary evolution of base composition and/or of GC-biased allele transmission. Fitting population genetics models to the allele frequency spectra confirmed this result and favored the hypothesis of a biased transmission. These results, together with previous reports, suggest that GC-biased gene conversion has influenced base composition evolution in Drosophila and explain the correlation between intron and exon GC content.
Collapse
Affiliation(s)
- Nicolas Galtier
- UMR 5171, "Génome, Populations, Interactions, Adaptation," CNRS, Université Montpellier 2, IFREMER, 34095 Montpellier, France.
| | | | | |
Collapse
|
221
|
Sharma A, Chavali S, Mahajan A, Tabassum R, Banerjee V, Tandon N, Bharadwaj D. Genetic Association, Post-translational Modification, and Protein-Protein Interactions in Type 2 Diabetes Mellitus. Mol Cell Proteomics 2005; 4:1029-37. [PMID: 15886397 DOI: 10.1074/mcp.m500024-mcp200] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Type 2 diabetes mellitus is a complex disorder with a strong genetic component. Inherited complex disease susceptibility in humans is most commonly associated with single nucleotide polymorphisms. The mechanisms by which this occurs are still poorly understood. Here we focus on analyzing the effect of a set of disease-causing missense variations of the monogenetic form of Type 2 diabetes mellitus and a set of disease-associated nonsynonymous variations in comparison with that of nonsynonymous variations without any experimental evidence for association with any disease. Analysis of different properties such as evolutionary conservation status, solvent accessibility, secondary structure, etc. suggests that disease-causing variations are associated with extreme changes in the value of the parameters relating to evolutionary conservation and/or protein stability. Disease-associated variations are rather moderately conserved and have a milder effect on protein function and stability. The majority of the genes harboring these variations are clustered in or near the insulin signaling network. Most of these variations are identified as potential sites for post-translational modifications; certain predictions have already reported experimental evidence. Overall our results indicate that Type 2 diabetes mellitus may result from a large number of single nucleotide polymorphisms that impair modular domain function and post-translational modifications involved in signaling. Our emphasis is more on conserved corresponding residues than the variation alone. We believe that the approach of considering a stretch of peptide sequence involving a polymorphism would be a better method of defining the role of the polymorphism in the manifestation of this disease. Because most of the variations associated with the disease are rare, we hypothesize that this disease is a "mosaic model" of interaction between a large number of rare alleles and a small number of common alleles along with the environment, which is little contrary to the existing common disease common variant model.
Collapse
Affiliation(s)
- Amitabh Sharma
- Functional Genomics Unit, Institute of Genomics and Integrative Biology, Council of Scientific and Industrial Research (CSIR), Delhi 110 007
| | | | | | | | | | | | | |
Collapse
|
222
|
Balakirev ES, Chechetkin VR, Lobzin VV, Ayala FJ. Entropy and GC Content in the beta-esterase gene cluster of the Drosophila melanogaster subgroup. Mol Biol Evol 2005; 22:2063-72. [PMID: 15972847 DOI: 10.1093/molbev/msi197] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
We perform spectral entropy and GC content analyses in the beta-esterase gene cluster, including the Est-6 gene and the psiEst-6 putative pseudogene, in seven species of the Drosophila melanogaster species subgroup. psiEst-6 combines features of functional and nonfunctional genes. The spectral entropies show distinctly lower structural ordering for psiEst-6 than for Est-6 in all species studied. Our observations agree with previous results for D. melanogaster and provide additional support to our hypothesis that after the duplication event Est-6 retained the esterase-coding function and its role during copulation, while psiEst-6 lost that function but now operates in conjunction with Est-6 as an intergene. Entropy accumulation is not a completely random process for either gene. Structural entropy is nucleotide dependent. The relative normalized deviations for structural entropy are higher for G than for C nucleotides. The entropy values are similar for Est-6 and psiEst-6 in the case of A and T but are lower for Est-6 in the case of G and C. The GC content in synonymous positions is uniformly higher in Est-6 than in psiEst-6, which agrees with the reduced GC content generally observed in pseudogenes and nonfunctional sequences. The observed differences in entropy and GC content reflect an evolutionary shift associated with the process of pseudogenization and subsequent functional divergence of psiEst-6 and Est-6 after the duplication event.
Collapse
Affiliation(s)
- Evgeniy S Balakirev
- Department of Ecology and Evolutionary Biology, University of California, Irvine, CA, USA
| | | | | | | |
Collapse
|
223
|
Qin H, Wu WB, Comeron JM, Kreitman M, Li WH. Intragenic spatial patterns of codon usage bias in prokaryotic and eukaryotic genomes. Genetics 2005; 168:2245-60. [PMID: 15611189 PMCID: PMC1448744 DOI: 10.1534/genetics.104.030866] [Citation(s) in RCA: 65] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
To study the roles of translational accuracy, translational efficiency, and the Hill-Robertson effect in codon usage bias, we studied the intragenic spatial distribution of synonymous codon usage bias in four prokaryotic (Escherichia coli, Bacillus subtilis, Sulfolobus tokodaii, and Thermotoga maritima) and two eukaryotic (Saccharomyces cerevisiae and Drosophila melanogaster) genomes. We generated supersequences at each codon position across genes in a genome and computed the overall bias at each codon position. By quantitatively evaluating the trend of spatial patterns using isotonic regression, we show that in yeast and prokaryotic genomes, codon usage bias increases along translational direction, which is consistent with purifying selection against nonsense errors. Fruit fly genes show a nearly symmetric M-shaped spatial pattern of codon usage bias, with less bias in the middle and both ends. The low codon usage bias in the middle region is best explained by interference (the Hill-Robertson effect) between selections at different codon positions. In both yeast and fruit fly, spatial patterns of codon usage bias are characteristically different from patterns of GC-content variations. Effect of expression level on the strength of codon usage bias is more conspicuous than its effect on the shape of the spatial distribution.
Collapse
Affiliation(s)
- Hong Qin
- Department of Ecology and Evolution, University of Chicago, Chicago, Illinois 60637, USA
| | | | | | | | | |
Collapse
|
224
|
Benovoy D, Morris RT, Morin A, Drouin G. Ectopic Gene Conversions Increase the G + C Content of Duplicated Yeast and Arabidopsis Genes. Mol Biol Evol 2005; 22:1865-8. [PMID: 15917495 DOI: 10.1093/molbev/msi176] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
Allelic recombination has previously been shown to increase the GC-content of the sequences of a wide variety of eukaryotic species. Ectopic recombination between clustered tandemly repeated genes has also been shown to increase their GC-content. Here we show that gene conversions between the dispersed genes found in the duplicated regions of the yeast and Arabidopsis genomes also increase their GC-content when these genes are more than 88% similar.
Collapse
Affiliation(s)
- David Benovoy
- Département de biologie, Université d'Ottawa, Ottawa, Ontario, Canada
| | | | | | | |
Collapse
|
225
|
Webster MT, Smith NGC, Hultin-Rosenberg L, Arndt PF, Ellegren H. Male-driven biased gene conversion governs the evolution of base composition in human alu repeats. Mol Biol Evol 2005; 22:1468-74. [PMID: 15772377 DOI: 10.1093/molbev/msi136] [Citation(s) in RCA: 61] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023] Open
Abstract
Regional biases in substitution pattern are likely to be responsible for the large-scale variation in base composition observed in vertebrate genomes. However, the evolutionary forces responsible for these biases are still not clearly defined. In order to study the processes of mutation and fixation across the entire human genome, we analyzed patterns of substitution in Alu repeats since their insertion. We also studied patterns of human polymorphism within the repeats. There is a highly significant effect of recombination rate on the pattern of substitution, whereas no such effect is seen on the pattern of polymorphism. These results suggest that regional biases in substitution are caused by biased gene conversion, a process that increases the probability of fixation of mutations that increase GC content. Furthermore, the strongest correlate of substitution patterns is found to be male recombination rates rather than female or sex-averaged recombination rates. This indicates that in addition to sexual dimorphism in recombination rates, the sexes also differ in the relative rates of crossover and gene conversion.
Collapse
Affiliation(s)
- Matthew T Webster
- Department of Evolution, Genomics and Systematics, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden.
| | | | | | | | | |
Collapse
|
226
|
Taguchi Y, Koide T, Shiroishi T, Yagi T. Molecular Evolution of Cadherin-Related Neuronal Receptor/Protocadherin α (CNR/Pcdhα) Gene Cluster in Mus musculus Subspecies. Mol Biol Evol 2005; 22:1433-43. [PMID: 15758202 DOI: 10.1093/molbev/msi130] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023] Open
Abstract
The mouse cadherin-related neuronal receptor/protocadherin (CNR/Pcdh) gene clusters are located on chromosome 18. We sequenced single-nucleotide polymorphisms (SNPs) of the CNR/Pcdh(alpha)-coding region among 12 wild-derived and four laboratory strains; these included the four major subspecies groups of Mus musculus: domesticus, musculus, castaneus, and bactrianus. We detected 883 coding SNPs (cSNPs) in the CNR/Pcdh(alpha) variable exons and three in the constant exons. Among all the cSNPs, 586 synonymous (silent) and 297 nonsynonymous (amino acid exchanged) substitutions were found; therefore, the K(a)/K(s) ratio (nonsynonymous substitutions per synonymous substitution) was 0.51. The synonymous cSNPs were relatively concentrated in the first and fifth extracellular cadherin domain-encoding regions (ECs) of CNR/Pcdh(alpha). These regions have high nucleotide homology among the CNR/Pcdh(alpha) paralogs, suggesting that gene conversion events in synonymous and homologous regions of the CNR/Pcdh(alpha) cluster are related to the generation of cSNPs. A phylogenetic analysis revealed gene conversion events in the EC1 and EC5 regions. Assuming that the common sequences between rat and mouse are ancestral, the GC content of the third codon position has increased in the EC1 and EC5 regions, although biased substitutions from GC to AT were detected in all the codon positions. In addition, nonsynonymous substitutions were extremely high (11 of 13, K(a)/K(s) ratio 5.5) in the laboratory mouse strains. The artificial environment of laboratory mice may allow positive selection for nonsynonymous amino acid variations in CNR/Pcdh(alpha) during inbreeding. In this study, we analyzed the direction of cSNP generation, and concluded that subspecies-specific nucleotide substitutions and region-restricted gene conversion events may have contributed to the generation of genetic variations in the CNR/Pcdh genes within and between species.
Collapse
Affiliation(s)
- Yusuke Taguchi
- KOKORO Biology Group, Laboratories for Integrated Biology, Graduate School of Frontier Biosciences, Osaka University, Suita, Japan
| | | | | | | |
Collapse
|
227
|
Huang SW, Friedman R, Yu N, Yu A, Li WH. How strong is the mutagenicity of recombination in mammals? Mol Biol Evol 2005; 22:426-31. [PMID: 15496551 DOI: 10.1093/molbev/msi025] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
It is commonly believed that a high recombination rate such as that in a pseudoautosomal region (PAR) greatly increases the mutation rate because a 170-fold increase was estimated for the mouse PAR region. However, sequencing PAR and non-PAR introns of the Fxy gene in four Mus taxa, we found an increase of only twofold to fivefold. Furthermore, analyses of sequence data from human and orangutan PAR and X-linked regions and from autosomal regions showed a weak effect of recombination on mutation rate (a slope of less than 0.2% per cM/Mb), although a much stronger effect on GC content (1% to 2% per cM/Mb). Because typical recombination rates in mammals are much lower than those in PARs, the mutagenicity of recombination is weak or, at best, moderate, although its effect on GC% is much stronger. In addition, contrary to a previous study, we found no Fxy duplicate in Mus spretus.
Collapse
Affiliation(s)
- Shiao-Wei Huang
- Department of Ecology and Evolution, University of Chicago, Chicago, USA
| | | | | | | | | |
Collapse
|
228
|
Anuradha S, Muniyappa K. Molecular aspects of meiotic chromosome synapsis and recombination. PROGRESS IN NUCLEIC ACID RESEARCH AND MOLECULAR BIOLOGY 2005; 79:49-132. [PMID: 16096027 DOI: 10.1016/s0079-6603(04)79002-9] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Affiliation(s)
- S Anuradha
- Department of Biochemistry, Indian Institute of Science, Bangalore 560012, India
| | | |
Collapse
|
229
|
Abstract
The gene duplication rate in the yeast genome is estimated without assuming the molecular clock model to be approximately 0.01 to 0.06 per gene per billion years; this rate is two orders of magnitude lower than a previous estimate based on the molecular clock model. This difference is explained by extensive concerted evolution via gene conversion between duplicated genes, which violates the assumption of the molecular clock in the analyses of duplicated genes. The average length of the period of concerted evolution and the gene conversion rate are estimated to be approximately 25 million years and approximately 28 times the mutation rate, respectively.
Collapse
Affiliation(s)
- Li-Zhi Gao
- Human Genetics Center, School of Public Health, University of Texas Health Science Center, 1200 Hermann Pressler, Houston, TX 77030, USA
| | | |
Collapse
|
230
|
Sequence and comparative analysis of the chicken genome provide unique perspectives on vertebrate evolution. Nature 2004; 432:695-716. [PMID: 15592404 DOI: 10.1038/nature03154] [Citation(s) in RCA: 1958] [Impact Index Per Article: 97.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2004] [Accepted: 11/01/2004] [Indexed: 12/28/2022]
Abstract
We present here a draft genome sequence of the red jungle fowl, Gallus gallus. Because the chicken is a modern descendant of the dinosaurs and the first non-mammalian amniote to have its genome sequenced, the draft sequence of its genome--composed of approximately one billion base pairs of sequence and an estimated 20,000-23,000 genes--provides a new perspective on vertebrate genome evolution, while also improving the annotation of mammalian genomes. For example, the evolutionary distance between chicken and human provides high specificity in detecting functional elements, both non-coding and coding. Notably, many conserved non-coding sequences are far from genes and cannot be assigned to defined functional classes. In coding regions the evolutionary dynamics of protein domains and orthologous groups illustrate processes that distinguish the lineages leading to birds and mammals. The distinctive properties of avian microchromosomes, together with the inferred patterns of conserved synteny, provide additional insights into vertebrate chromosome architecture.
Collapse
|
231
|
Bartolomé C, Maside X, Yi S, Grant AL, Charlesworth B. Patterns of selection on synonymous and nonsynonymous variants in Drosophila miranda. Genetics 2004; 169:1495-507. [PMID: 15545653 PMCID: PMC1449532 DOI: 10.1534/genetics.104.033068] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
We have investigated patterns of within-species polymorphism and between-species divergence for synonymous and nonsynonymous variants at a set of autosomal and X-linked loci of Drosophila miranda. D. pseudoobscura and D. affinis were used for the between-species comparisons. The results suggest the action of purifying selection on nonsynonymous, polymorphic variants. Among synonymous polymorphisms, there is a significant excess of synonymous mutations from preferred to unpreferred codons and of GC to AT mutations. There was no excess of GC to AT mutations among polymorphisms at noncoding sites. This suggests that selection is acting to maintain the use of preferred codons. Indirect evidence suggests that biased gene conversion in favor of GC base pairs may also be operating. The joint intensity of selection and biased gene conversion, in terms of the product of effective population size and the sum of the selection and conversion coefficients, was estimated to be approximately 0.65.
Collapse
Affiliation(s)
- Carolina Bartolomé
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, UK.
| | | | | | | | | |
Collapse
|
232
|
Nebert DW, Vesell ES. Advances in pharmacogenomics and individualized drug therapy: exciting challenges that lie ahead. Eur J Pharmacol 2004; 500:267-80. [PMID: 15464039 DOI: 10.1016/j.ejphar.2004.07.031] [Citation(s) in RCA: 61] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/01/2004] [Indexed: 12/16/2022]
Abstract
Between the 1930s and 1990s, several dozen predominantly monogenic, high-penetrance disorders involving pharmacogenetics were described, fueling the crusade that gene-drug interactions are quite simple. Then, in 1990, the Human Genome Project was established; in 1995, the term pharmacogenomics was introduced; finally, the complexities of determining an unequivocal phenotype, as well as an unequivocal genotype, have recently become apparent. Since 1965, more than 1000 reviews on this topic have painted an overly optimistic picture-suggesting that the advent of individualized drug therapy used by the practicing physician is fast approaching. For many reasons listed here, however, we emphasize that these high expectations must be tempered. We now realize that the nucleotide sequence of the genome represents only a starting point from which we must proceed to a more difficult stage: knowledge of the function encoded and how this affects the phenotype. To achieve individualized drug therapy, a high level of accuracy and precision is required of any clinical test proposed in human patients. Finally, we suggest that metabonomics, perhaps in combination with proteomics, might complement genomics in eventually helping us to achieve individualized drug therapy.
Collapse
Affiliation(s)
- Daniel W Nebert
- Division of Human Genetics, Department of Pediatrics and Molecular Developmental Biology, University of Cincinnati Medical Center, P.O. Box 670056, Cincinnati OH 45267-0056, USA.
| | | |
Collapse
|
233
|
Zhang J, Li F, Li J, Zhang MQ, Zhang X. Evidence and characteristics of putative human α recombination hotspots. Hum Mol Genet 2004; 13:2823-8. [PMID: 15385449 DOI: 10.1093/hmg/ddh310] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Understanding recombination rate variation is very important for studying genome diversity and evolution, and for investigation of phenotypic association and genetic diseases. Recombination hotspots have been observed in many species and are well studied in yeast. Recent study demonstrated that recombination hotspots are also a ubiquitous feature of the human genome. But the nature of human hotspots remains largely unknown. We have developed and validated a novel computational method for testing the existence of hotspots as well as for localizing them with either unphased or phased genotyping data. To study the characteristics of hotspots within or close to genes, we scanned for unusually high levels of recombination using the European population samples in the SeattleSNPs database, and found evidence for the existence of human alpha hotspots similar to those of yeast. This type of hotspots, found at promoter regions, accounts for about half of the total detected and appears to depend on some specific transcription factor binding sites (such as CGCCCCCGC). These characteristics can explain the observed weak correlation between hotspots and GC-content, and their variation may contribute to the diversity of hotspot distribution among different individuals and species. These long-sought putative human alpha recombination hotspots should deserve further experimental investigations.
Collapse
Affiliation(s)
- Jing Zhang
- MOE Key laboratory of Bioinformatics, Department of Automation, Tsinghua University, Bejing, China
| | | | | | | | | |
Collapse
|
234
|
|
235
|
Galtier N, Bonhomme F, Moulia C, Belkhir K, Caminade P, Desmarais E, Duquesne JJ, Orth A, Dod B, Boursot P. Mouse biodiversity in the genomic era. Cytogenet Genome Res 2004; 105:385-94. [PMID: 15237226 DOI: 10.1159/000078211] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2003] [Accepted: 12/08/2003] [Indexed: 11/19/2022] Open
Abstract
Comparative genomics has developed by comparison of distantly related genomes, for which the link between the reported evolutionary changes and species development/physiology/ecology is not obvious. It is argued that the mouse (genus Mus) is an optimal model for microevolutionary genomics in vertebrates. This is because the mouse genome sequence, physical and genetic map have been completed, because mouse genetics, morpho-anatomy, pathology, behavior and ecology are well-studied, and because the Mus genus is a diverse, well- documented taxon, allowing comparative studies at the level of individual, population, subspecies, and species. The potential of the interaction between mouse genome and mouse biodiversity is illustrated by recent studies of speciation in the house mouse Mus musculus, and studies about the evolution of isochores, the peculiar pattern of GC-content variation across mammalian genomes.
Collapse
Affiliation(s)
- N Galtier
- UMR 5171-Génome, Populations, Interactions, Adaptation, Université Montpellier 2, Montpellier, France.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
236
|
Abstract
Recent advances in the large-scale sequencing of mammalian genomes have provided a means to study divergence in not only genic sequences but also in the non-coding bulk of DNA. There is evidence of significant variation in the levels of divergence between presumably neutral regions, pointing at an underlying variation in the rate of mutation across the genome. Apparently, such variation occurs on different scales, including sequence context effects (the influence of neighboring nucleotides on the rate of mutation at individual sites), variation within chromosomes (on the scales of kilobases as well as megabases), and between chromosomes (among autosomes as well as between autosomes and sex chromosomes). An important aspect for further research in this area is to study whether there is an ultimate evolutionary explanation for mutation rate variation within mammalian genomes.
Collapse
Affiliation(s)
- Hans Ellegren
- Department of Evolutionary Biology, Uppsala University, Norbyvägen 18D, SE-752 36 Uppsala, Sweden.
| | | | | |
Collapse
|
237
|
Marais G, Charlesworth B, Wright SI. Recombination and base composition: the case of the highly self-fertilizing plant Arabidopsis thaliana. Genome Biol 2004; 5:R45. [PMID: 15239830 PMCID: PMC463295 DOI: 10.1186/gb-2004-5-7-r45] [Citation(s) in RCA: 68] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2004] [Revised: 04/26/2004] [Accepted: 04/30/2004] [Indexed: 11/24/2022] Open
Abstract
The effects of recombination and self-fertilization on base composition were investigated both theoretically and experimentally in the Arabidopsis genome. Levels of inbreeding modulate the effect of recombination on base composition. Background Rates of recombination can vary among genomic regions in eukaryotes, and this is believed to have major effects on their genome organization in terms of base composition, DNA repeat density, intron size, evolutionary rates and gene order. In highly self-fertilizing species such as Arabidopsis thaliana, however, heterozygosity is expected to be strongly reduced and recombination will be much less effective, so that its influence on genome organization should be greatly reduced. Results Here we investigated theoretically the joint effects of recombination and self-fertilization on base composition, and tested the predictions with genomic data from the complete A. thaliana genome. We show that, in this species, both codon-usage bias and GC content do not correlate with the local rates of crossing over, in agreement with our theoretical results. Conclusions We conclude that levels of inbreeding modulate the effect of recombination on base composition, and possibly other genomic features (for example, transposable element dynamics). We argue that inbreeding should be considered when interpreting patterns of molecular evolution.
Collapse
Affiliation(s)
- G Marais
- Institute of Cell, Animal and Population Biology, University of Edinburgh, EH9 3JT Edinburgh, UK
| | - B Charlesworth
- Institute of Cell, Animal and Population Biology, University of Edinburgh, EH9 3JT Edinburgh, UK
| | - S I Wright
- Institute of Cell, Animal and Population Biology, University of Edinburgh, EH9 3JT Edinburgh, UK
- Current address: Department of Biology, York University, 4700 Keele St, Toronto, Ontario M3J 1P3, Canada
| |
Collapse
|
238
|
Jensen-Seaman MI, Furey TS, Payseur BA, Lu Y, Roskin KM, Chen CF, Thomas MA, Haussler D, Jacob HJ. Comparative recombination rates in the rat, mouse, and human genomes. Genome Res 2004; 14:528-38. [PMID: 15059993 PMCID: PMC383296 DOI: 10.1101/gr.1970304] [Citation(s) in RCA: 359] [Impact Index Per Article: 18.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2003] [Accepted: 02/09/2004] [Indexed: 11/24/2022]
Abstract
Levels of recombination vary among species, among chromosomes within species, and among regions within chromosomes in mammals. This heterogeneity may affect levels of diversity, efficiency of selection, and genome composition, as well as have practical consequences for the genetic mapping of traits. We compared the genetic maps to the genome sequence assemblies of rat, mouse, and human to estimate local recombination rates across these genomes. Humans have greater overall levels of recombination, as well as greater variance. In rat and mouse, the size of the chromosome and proximity to telomere have less effect on local recombination rate than in human. At the chromosome level, rat and mouse X chromosomes have the lowest recombination rates, whereas human chromosome X does not show the same pattern. In all species, local recombination rate is significantly correlated with several sequence variables, including GC%, CpG density, repetitive elements, and the neutral mutation rate, with some pronounced differences between species. Recombination rate in one species is not strongly correlated with the rate in another, when comparing homologous syntenic blocks of the genome. This comparative approach provides additional insight into the causes and consequences of genomic heterogeneity in recombination.
Collapse
Affiliation(s)
- Michael I Jensen-Seaman
- Human and Molecular Genetics Center, Medical College of Wisconsin, Milwaukee, Wisconsin 53226, USA.
| | | | | | | | | | | | | | | | | |
Collapse
|
239
|
Noonan JP, Grimwood J, Schmutz J, Dickson M, Myers RM. Gene conversion and the evolution of protocadherin gene cluster diversity. Genome Res 2004; 14:354-66. [PMID: 14993203 PMCID: PMC353213 DOI: 10.1101/gr.2133704] [Citation(s) in RCA: 94] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
The synaptic cell adhesion molecules encoded by the protocadherin gene cluster are hypothesized to provide a molecular code involved in the generation of synaptic complexity in the developing brain. Variation in copy number and sequence content of protocadherin cluster genes among vertebrate species could reflect adaptive differences in protocadherin function. We have completed an analysis of zebrafish protocadherin cluster genes. Zebrafish have two unlinked protocadherin clusters, DrPcdh1 and DrPcdh2. Like mammalian protocadherin clusters, DrPcdh1 has both alpha and gamma variable and constant region exons. A consensus protocadherin promoter motif sequence identified in mammals is also conserved in zebrafish. Few orthologous relationships, however, are apparent between zebrafish and mammalian protocadherin proteins. Here we show that protocadherin cluster genes in human, mouse, rat, and zebrafish are subject to striking gene conversion events. These events are restricted to regions of the coding sequence, particularly the coding sequences of ectodomain 6 and the cytoplasmic domain. Diversity among paralogs is restricted to particular ectodomains that are excluded from conversion events. Conversion events are also strongly correlated with an increase in third-position GC content. We propose that the combination of lineage-specific duplication, restricted gene conversion, and adaptive variation in diversified ectodomains drives vertebrate protocadherin cluster evolution.
Collapse
Affiliation(s)
- James P Noonan
- Department of Genetics, Stanford University School of Medicine, Stanford, California 94305-5120, USA
| | | | | | | | | |
Collapse
|
240
|
Abstract
Synonymous codons are not used at random, significantly influencing the base composition of the genome. The selection-mutation-drift model proposes that this bias reflects natural selection in favor of a subset of preferred codons. Previous estimates in Drosophila of the intensity of selective forces involved seem too large to be reconciled with theoretical predictions of the level of codon bias. This probably results from confounding effects of the demographic histories of the species concerned. We have studied three species of the virilis group of Drosophila, which are more likely to satisfy the assumptions of the evolutionary models. We analyzed the patterns of polymorphism and divergence in a sample of 18 genes and applied a new method for estimating the intensity of selection on synonymous mutations based on the frequencies of unpreferred mutations among polymorphic sites. This yielded estimates of selection intensities (N(e)s) of the order of 0.65, which is more compatible with the observed levels of codon bias. Our results support the action of both selection and mutational bias on codon usage bias and suggest that codon usage and genome base composition in the D. americana lineage are in approximate equilibrium. Biased gene conversion may also contribute to the observed patterns.
Collapse
Affiliation(s)
- Xulio Maside
- Institute of Cell, Animal and Population Biology, University of Edinburgh, Kings Buildings, West Mains Road, EH9 3JT Edinburgh, UK.
| | | | | |
Collapse
|
241
|
Abstract
Unraveling the evolutionary forces responsible for variations of neutral substitution patterns among taxa or along genomes is a major issue in the identification of functional sequence features. Mammalian genomes show large-scale regional variations of GC-content (the isochores), but the substitution processes at the origin of this structure are poorly understood. We have analyzed the pattern of neutral substitutions in 14.3 Mb of primate noncoding regions. We show that the GC-content toward which sequences are evolving is strongly correlated (r(2) = 0.61, P </= 2 10(-16)) with the rate of crossovers (notably in females). This demonstrates that recombination drives the evolution of base composition in human (probably via the process of biased gene conversion). The present substitution patterns are very different from what they had been in the past, resulting in a major modification of the isochore structure of our genome. This non-equilibrium situation suggests that changes of recombination rates occur relatively frequently during evolution, possibly as a consequence of karyotype rearrangements. These results have important implications for understanding the spatial and temporal variations of substitution processes in a broad range of sexual organisms, and for detecting the hallmarks of natural selection in DNA sequences.
Collapse
|
242
|
Andolfatto P, Wall JD. Linkage Disequilibrium Patterns Across a Recombination Gradient in African Drosophila melanogaster. Genetics 2003; 165:1289-305. [PMID: 14668383 PMCID: PMC1462864 DOI: 10.1093/genetics/165.3.1289] [Citation(s) in RCA: 51] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Abstract
Previous multilocus surveys of nucleotide polymorphism have documented a genome-wide excess of intralocus linkage disequilibrium (LD) in Drosophila melanogaster and D. simulans relative to expectations based on estimated mutation and recombination rates and observed levels of diversity. These studies examined patterns of variation from predominantly non-African populations that are thought to have recently expanded their ranges from central Africa. Here, we analyze polymorphism data from a Zimbabwean population of D. melanogaster, which is likely to be closer to the standard population model assumptions of a large population with constant size. Unlike previous studies, we find that levels of LD are roughly compatible with expectations based on estimated rates of crossing over. Further, a detailed examination of genes in different recombination environments suggests that markers near the telomere of the X chromosome show considerably less linkage disequilibrium than predicted by rates of crossing over, suggesting appreciable levels of exchange due to gene conversion. Assuming that these populations are near mutation-drift equilibrium, our results are most consistent with a model that posits heterogeneity in levels of exchange due to gene conversion across the X chromosome, with gene conversion being a minor determinant of LD levels in regions of high crossing over. Alternatively, if levels of exchange due to gene conversion are not negligible in regions of high crossing over, our results suggest a marked departure from mutation-drift equilibrium (i.e., toward an excess of LD) in this Zimbabwean population. Our results also have implications for the dynamics of weakly selected mutations in regions of reduced crossing over.
Collapse
Affiliation(s)
- Peter Andolfatto
- Department of Zoology, University of Toronto, Toronto, Ontario M5S 3G5, Canada.
| | | |
Collapse
|
243
|
Rao BJ, Sengupta K. Human Y-chromosome: a hall of mirrors. J Biosci 2003; 28:533-4. [PMID: 14517355 DOI: 10.1007/bf02703327] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]
Affiliation(s)
- B J Rao
- Department of Biological Sciences, Tata Institute of Fundamental Research, Homi Bhabha Road,Mumbai 400 005, India.
| | | |
Collapse
|
244
|
Abstract
As the efficacy of natural selection is expected to be a function of population size, in humans it is usually presumed that selection is a weak force and hence that gene characteristics are mostly determined by stochastic forces. In contrast, in species with large population sizes, selection is expected to be a much more effective force. Evidence for this has come from examining how genic parameters vary with expression level, which appears to determine many of a gene's features, such as codon bias, amino acid composition, and size. However, not until now has it been possible to examine whether human genes show the signature of selection mediated by expression level. Here, then, to investigate this issue, we gathered expression data for >10,000 human genes from public data sets obtained by different technologies (SAGE and high-density oligonucleotide chip arrays) and compared them with gene parameters. We find that, even after controlling for regional effects, highly expressed genes code for smaller proteins, have less intronic DNA, and higher codon and amino acid biases. We conclude that, contrary to the usual supposition, human genes show signatures consistent with selection mediated by expression level.
Collapse
Affiliation(s)
- Araxi O Urrutia
- Department of Biology and Biochemistry, University of Bath, Bath BA2 7AY, UK
| | | |
Collapse
|
245
|
Affiliation(s)
- Gabriel Marais
- Institute of Cell, Animal and Population Biology, University of Edinburgh, Ashworth Laboratories, Edinburgh EH9 3JT, UK.
| | | |
Collapse
|