51
|
Nasrullah I, Butt AM, Tahir S, Idrees M, Tong Y. Genomic analysis of codon usage shows influence of mutation pressure, natural selection, and host features on Marburg virus evolution. BMC Evol Biol 2015; 15:174. [PMID: 26306510 PMCID: PMC4550055 DOI: 10.1186/s12862-015-0456-4] [Citation(s) in RCA: 81] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2015] [Accepted: 08/17/2015] [Indexed: 02/07/2023] Open
Abstract
BACKGROUND The Marburg virus (MARV) has a negative-sense single-stranded RNA genome, belongs to the family Filoviridae, and is responsible for several outbreaks of highly fatal hemorrhagic fever. Codon usage patterns of viruses reflect a series of evolutionary changes that enable viruses to shape their survival rates and fitness toward the external environment and, most importantly, their hosts. To understand the evolution of MARV at the codon level, we report a comprehensive analysis of synonymous codon usage patterns in MARV genomes. Multiple codon analysis approaches and statistical methods were performed to determine overall codon usage patterns, biases in codon usage, and influence of various factors, including mutation pressure, natural selection, and its two hosts, Homo sapiens and Rousettus aegyptiacus. RESULTS Nucleotide composition and relative synonymous codon usage (RSCU) analysis revealed that MARV shows mutation bias and prefers U- and A-ended codons to code amino acids. Effective number of codons analysis indicated that overall codon usage among MARV genomes is slightly biased. The Parity Rule 2 plot analysis showed that GC and AU nucleotides were not used proportionally which accounts for the presence of natural selection. Codon usage patterns of MARV were also found to be influenced by its hosts. This indicates that MARV have evolved codon usage patterns that are specific to both of its hosts. Moreover, selection pressure from R. aegyptiacus on the MARV RSCU patterns was found to be dominant compared with that from H. sapiens. Overall, mutation pressure was found to be the most important and dominant force that shapes codon usage patterns in MARV. CONCLUSIONS To our knowledge, this is the first detailed codon usage analysis of MARV and extends our understanding of the mechanisms that contribute to codon usage and evolution of MARV.
Collapse
Affiliation(s)
- Izza Nasrullah
- Department of Biochemistry, Faculty of Biological Sciences, Quaid-i-Azam University, Islamabad, 45320, Pakistan.
| | - Azeem M Butt
- Centre of Excellence in Molecular Biology (CEMB), University of the Punjab, Lahore, 53700, Pakistan.
| | - Shifa Tahir
- INRA, UMR85 Physiologie de la Reproduction et des Comportements, Nouzilly, F-37380, France. .,CNRS, UMR7247, F-37380, Nouzilly, France. .,Université François Rabelais de Tours, Tours, F-37380, France.
| | - Muhammad Idrees
- Centre of Excellence in Molecular Biology (CEMB), University of the Punjab, Lahore, 53700, Pakistan.
| | - Yigang Tong
- State Key Laboratory of Pathogen and Biosecurity, Beijing Institute of Microbiology and Epidemiology, Beijing, 100071, People's Republic of China.
| |
Collapse
|
52
|
Baeza M, Alcaíno J, Barahona S, Sepúlveda D, Cifuentes V. Codon usage and codon context bias in Xanthophyllomyces dendrorhous. BMC Genomics 2015; 16:293. [PMID: 25887493 PMCID: PMC4404019 DOI: 10.1186/s12864-015-1493-5] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2014] [Accepted: 03/27/2015] [Indexed: 01/11/2023] Open
Abstract
Background Synonymous codons are used differentially in organisms from the three domains of life, a phenomenon referred to as codon usage bias. In addition, codon pair bias, particularly in the 3’ codon context, has also been described in several organisms and is associated with the accuracy and rate of translation. An improved understanding of both types of bias is important for the optimization of heterologous protein expression, particularly in biotechnologically important organisms, such as the yeast Xanthophyllomyces dendrorhous, a promising bioresource for the carotenoid astaxanthin. Using genomic and transcriptomic data, the codon usage and codon context biases of X. dendrorhous open reading frames (ORFs) were analyzed to determine their expression levels, GC% and sequence lengths. X. dendrorhous totiviral ORFs were also included in these analyses. Results A total of 1,695 X. dendrorhous ORFs were identified through comparison with sequences in multiple databases, and the intron-exon structures of these sequences were determined. Although there were important expression variations among the ORFs under the studied conditions (different phases of growth and available carbon sources), most of these sequences were highly expressed under at least one of the analyzed conditions. Independent of the culture conditions, the highly expressed genes showed a strong bias in both codon usage and the 3’ context, with a minor association with the GC% and no relationship to the sequence length. The codon usage and codon-pair bias of the totiviral ORFs were highly variable with no similarities to the host ORFs. Conclusions There is a direct relation between the level of gene expression and codon usage and 3′ context bias in X. dendrorhous, which is more evident for ORFs that are expressed at the highest levels under the studied conditions. However, there is no direct relation between the totiviral ORF biases and the host ORFs. Electronic supplementary material The online version of this article (doi:10.1186/s12864-015-1493-5) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Marcelo Baeza
- Departamento de Ciencias Ecológicas, Facultad de Ciencias, Universidad de Chile, Las Palmeras 3425, Casilla 653, Santiago, Chile.
| | - Jennifer Alcaíno
- Departamento de Ciencias Ecológicas, Facultad de Ciencias, Universidad de Chile, Las Palmeras 3425, Casilla 653, Santiago, Chile.
| | - Salvador Barahona
- Departamento de Ciencias Ecológicas, Facultad de Ciencias, Universidad de Chile, Las Palmeras 3425, Casilla 653, Santiago, Chile.
| | - Dionisia Sepúlveda
- Departamento de Ciencias Ecológicas, Facultad de Ciencias, Universidad de Chile, Las Palmeras 3425, Casilla 653, Santiago, Chile.
| | - Víctor Cifuentes
- Departamento de Ciencias Ecológicas, Facultad de Ciencias, Universidad de Chile, Las Palmeras 3425, Casilla 653, Santiago, Chile.
| |
Collapse
|
53
|
Shin SH, Choi SS. Lengths of coding and noncoding regions of a gene correlate with gene essentiality and rates of evolution. Genes Genomics 2015. [DOI: 10.1007/s13258-015-0265-6] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
|
54
|
Yang X, Luo X, Cai X. Analysis of codon usage pattern in Taenia saginata based on a transcriptome dataset. Parasit Vectors 2014; 7:527. [PMID: 25440955 PMCID: PMC4268816 DOI: 10.1186/s13071-014-0527-1] [Citation(s) in RCA: 74] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2014] [Accepted: 11/06/2014] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Codon usage bias is an important evolutionary feature in a genome and has been widely documented in many genomes. Analysis of codon usage bias has significance for mRNA translation, design of transgenes, new gene discovery, and studies of molecular biology and evolution, etc. However, the information about synonymous codon usage pattern of T. saginata genome remains unclear. T. saginata is a food-borne zoonotic cestode which infects approximataely 50 million humans worldwide, and causes significant health problems to the host and considerable socio-economic losses as a consequence. In this study, synonymous codon usage in T. saginata were examined. METHODS Total RNA was isolated from T. saginata cysticerci and 91,487 unigenes were generated using Illumina sequencing technology. After filtering, the final sequence collection containing 11,399 CDSs was used for our analysis. RESULTS Neutrality analysis showed that the T. saginata had a wide GC3 distribution and a significant correlation was observed between GC12 and GC3. NC-plot showed most of genes on or close to the expected curve, but only a few points with low-ENC values were below it, suggesting that mutational bias plays a major role in shaping codon usage. The Parity Rule 2 plot (PR2) analysis showed that GC and AT were not used proportionally. We also identified twenty-three optimal codons in the T. saginata genome, all of which were ended with a G or C residue. These results suggest that mutational and selection forces are probably driving factors of codon usage bias in T. saginata genome. Meanwhile, other factors such as protein length, gene expression, GC content of genes, the hydropathicity of each protein also influence codon usage. CONCLUSIONS Here, we systematically analyzed the codon usage pattern and identified factors shaping in codon usage bias in T. saginata. Currently, no complete nuclear genome is available for codon usage analysis at the genome level in T. saginata. This is the first report to investigate codon biology in T. sagninata. Such information does not only bring about a new perspective for understanding the mechanisms of biased usage of synonymous codons but also provide useful clues for molecular genetic engineering and evolutionary studies.
Collapse
Affiliation(s)
- Xing Yang
- State Key Laboratory of Veterinary Etiological Biology, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, 730046, PR China. .,College of Veterinary Medicine, Jilin University, Changchun, 130000, PR China.
| | - Xuenong Luo
- State Key Laboratory of Veterinary Etiological Biology, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, 730046, PR China.
| | - Xuepeng Cai
- State Key Laboratory of Veterinary Etiological Biology, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou, 730046, PR China. .,College of Veterinary Medicine, Jilin University, Changchun, 130000, PR China.
| |
Collapse
|
55
|
Evidence for stabilizing selection on codon usage in chromosomal rearrangements of Drosophila pseudoobscura. G3-GENES GENOMES GENETICS 2014; 4:2433-49. [PMID: 25326424 PMCID: PMC4267939 DOI: 10.1534/g3.114.014860] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
There has been a renewed interest in investigating the role of stabilizing selection acting on genome-wide traits such as codon usage bias. Codon bias, when synonymous codons are used at unequal frequencies, occurs in a wide variety of taxa. Standard evolutionary models explain the maintenance of codon bias through a balance of genetic drift, mutation and weak purifying selection. The efficacy of selection is expected to be reduced in regions of suppressed recombination. Contrary to observations in Drosophila melanogaster, some recent studies have failed to detect a relationship between the recombination rate, intensity of selection acting at synonymous sites, and the magnitude of codon bias as predicted under these standard models. Here, we examined codon bias in 2798 protein coding loci on the third chromosome of D. pseudoobscura using whole-genome sequences of 47 individuals, representing five common third chromosome gene arrangements. Fine-scale recombination maps were constructed using more than 1 million segregating sites. As expected, recombination was demonstrated to be significantly suppressed between chromosome arrangements, allowing for a direct examination of the relationship between recombination, selection, and codon bias. As with other Drosophila species, we observe a strong mutational bias away from the most frequently used codons. We find the rate of synonymous and nonsynonymous polymorphism is variable between different amino acids. However, we do not observe a reduction in codon bias or the strength of selection in regions of suppressed recombination as expected. Instead, we find that the interaction between weak stabilizing selection and mutational bias likely plays a role in shaping the composition of synonymous codons across the third chromosome in D. pseudoobscura.
Collapse
|
56
|
Zeng J, Yi SV. Specific modifications of histone tails, but not DNA methylation, mirror the temporal variation of mammalian recombination hotspots. Genome Biol Evol 2014; 6:2918-29. [PMID: 25326136 PMCID: PMC4224356 DOI: 10.1093/gbe/evu230] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023] Open
Abstract
Recombination clusters nonuniformly across mammalian genomes at discrete genomic loci referred to as recombination hotspots. Despite their ubiquitous presence, individual hotspots rapidly lose their activities, and the molecular and evolutionary mechanisms underlying such frequent hotspot turnovers (the so-called “recombination hotspot paradox”) remain unresolved. Even though some sequence motifs are significantly associated with hotspots, multiple lines of evidence indicate that factors other than underlying sequences, such as epigenetic modifications, may affect the evolution of recombination hotspots. Thus, identifying epigenetic factors that covary with recombination at fine-scale is a promising step for this important research area. It was previously reported that recombination rates correlate with indirect measures of DNA methylation in the human genome. Here, we analyze experimentally determined DNA methylation and histone modification of human sperms, and show that the correlation between DNA methylation and recombination in long-range windows does not hold with respect to the spatial and temporal variation of recombination at hotspots. On the other hand, two histone modifications (H3K4me3 and H3K27me3) overlap extensively with recombination hotspots. Similar trends were observed in mice. These results indicate that specific histone modifications rather than DNA methylation are associated with the rapid evolution of recombination hotspots. Furthermore, many human recombination hotspots occupy “bivalent” chromatin regions that harbor both active (H3K4me3) and repressive (H3K27me3) marks. This may explain why human recombination hotspots tend to occur in nongenic regions, in contrast to yeast and Arabidopsis hotspots that are characterized by generally active chromatins. Our results highlight the dynamic epigenetic underpinnings of recombination hotspot evolution.
Collapse
Affiliation(s)
- Jia Zeng
- School of Biology, Georgia Institute of Technology, Atlanta, Georgia Tech
| | - Soojin V Yi
- School of Biology, Georgia Institute of Technology, Atlanta, Georgia Tech
| |
Collapse
|
57
|
Williamson RJ, Josephs EB, Platts AE, Hazzouri KM, Haudry A, Blanchette M, Wright SI. Evidence for widespread positive and negative selection in coding and conserved noncoding regions of Capsella grandiflora. PLoS Genet 2014; 10:e1004622. [PMID: 25255320 PMCID: PMC4178662 DOI: 10.1371/journal.pgen.1004622] [Citation(s) in RCA: 104] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2014] [Accepted: 07/21/2014] [Indexed: 12/30/2022] Open
Abstract
The extent that both positive and negative selection vary across different portions of plant genomes remains poorly understood. Here, we sequence whole genomes of 13 Capsella grandiflora individuals and quantify the amount of selection across the genome. Using an estimate of the distribution of fitness effects, we show that selection is strong in coding regions, but weak in most noncoding regions, with the exception of 5' and 3' untranslated regions (UTRs). However, estimates of selection on noncoding regions conserved across the Brassicaceae family show strong signals of selection. Additionally, we see reductions in neutral diversity around functional substitutions in both coding and conserved noncoding regions, indicating recent selective sweeps at these sites. Finally, using expression data from leaf tissue we show that genes that are more highly expressed experience stronger negative selection but comparable levels of positive selection to lowly expressed genes. Overall, we observe widespread positive and negative selection in coding and regulatory regions, but our results also suggest that both positive and negative selection on plant noncoding sequence are considerably rarer than in animal genomes.
Collapse
Affiliation(s)
- Robert J. Williamson
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, Ontario, Canada
| | - Emily B. Josephs
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, Ontario, Canada
- * E-mail:
| | - Adrian E. Platts
- Centre for Bioinformatics, McGill University, Montreal, Quebec, Canada
- School for Computer Science, McGill University, Montreal, Quebec, Canada
| | - Khaled M. Hazzouri
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, Ontario, Canada
- Center for Genomics and Systems Biology, New York University Abu Dhabi, Abu Dhabi, United Arab Emirates
| | - Annabelle Haudry
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, Ontario, Canada
- Université Lyon 1, Centre National de la Recherche Scientifique (CNRS), Unité Mixte de Recherche (UMR) 5558, Laboratoire de Biométrie et Biologie Evolutive, Villeurbanne, France
| | - Mathieu Blanchette
- Centre for Bioinformatics, McGill University, Montreal, Quebec, Canada
- School for Computer Science, McGill University, Montreal, Quebec, Canada
| | - Stephen I. Wright
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, Ontario, Canada
- Centre for the Analysis of Genome Evolution and Function, University of Toronto, Toronto, Ontario, Canada
| |
Collapse
|
58
|
Kessler MD, Dean MD. Effective population size does not predict codon usage bias in mammals. Ecol Evol 2014; 4:3887-900. [PMID: 25505518 PMCID: PMC4242573 DOI: 10.1002/ece3.1249] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2014] [Revised: 08/04/2014] [Accepted: 08/07/2014] [Indexed: 12/20/2022] Open
Abstract
Synonymous codons are not used at equal frequency throughout the genome, a phenomenon termed codon usage bias (CUB). It is often assumed that interspecific variation in the intensity of CUB is related to species differences in effective population sizes (Ne), with selection on CUB operating less efficiently in species with small Ne. Here, we specifically ask whether variation in Ne predicts differences in CUB in mammals and report two main findings. First, across 41 mammalian genomes, CUB was not correlated with two indirect proxies of Ne (body mass and generation time), even though there was statistically significant evidence of selection shaping CUB across all species. Interestingly, autosomal genes showed higher codon usage bias compared to X-linked genes, and high-recombination genes showed higher codon usage bias compared to low recombination genes, suggesting intraspecific variation in Ne predicts variation in CUB. Second, across six mammalian species with genetic estimates of Ne (human, chimpanzee, rabbit, and three mouse species: Mus musculus, M. domesticus, and M. castaneus), Ne and CUB were weakly and inconsistently correlated. At least in mammals, interspecific divergence in Ne does not strongly predict variation in CUB. One hypothesis is that each species responds to a unique distribution of selection coefficients, confounding any straightforward link between Ne and CUB.
Collapse
Affiliation(s)
- Michael D Kessler
- Molecular and Computational Biology, University of Southern California 1050 Childs Way, Los Angeles, California, 90089
| | - Matthew D Dean
- Molecular and Computational Biology, University of Southern California 1050 Childs Way, Los Angeles, California, 90089
| |
Collapse
|
59
|
Šnábel V, Utsuki D, Kato T, Sunaga F, Ooi HK, Gambetta B, Taira K. Molecular identification of Heterakis spumosa obtained from brown rats (Rattus norvegicus) in Japan and its infectivity in experimental mice. Parasitol Res 2014; 113:3449-55. [PMID: 24997621 DOI: 10.1007/s00436-014-4014-6] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2014] [Accepted: 06/26/2014] [Indexed: 10/25/2022]
Abstract
Heterakis spumosa is a nematode of invasive rodents, mainly affiliated with Rattus spp. of Asian origin. Despite the ecological importance and cosmopolitan distribution, little information is available on the genetic characteristics and infectivity to experimental animals of this roundworm. Heterakis isolates obtained from naturally infected brown rats caught in 2007 in the city of Sagamihara, east central Honshu, Japan, and maintained by laboratory passages were subjected to mitochondrial sequence analysis and experimental infection in mice. Sequencing of the cox1 gene revealed that nucleotides of H. spumosa and previously examined Heterakis isolonche isolates from gallinaceous birds in Japan differed by 11.2-12.2% that conforms to the range expected for interspecific differences. The two H. spumosa isolates differed by a single 138T/C non-synonymous substitution in the 393-bp mt sequence. In a dendrogram, the H. spumosa samples formed a subcluster with members of the nematode superfamily Heterakoidea, H. isolonche and Ascaridia galli. In an experimental infection study, ICR, AKR, B10.BR and C57BL/6 mice strains were inoculated with 200 H. spumosa eggs/head and necropsied at 14 and 90 days post-inoculation (DPI) when the number of worms was recorded. Eggs were initially detected in faeces from 32-35 DPI in ICR, AKR and B10.BR mice and the highest mean number of eggs per gram of faeces (EPG) was 4,800 at 38 DPI, 2,200 at 58 DPI and 800 at 44 and 72 DPI in ICR, AKR and B10.BR mice, respectively. No eggs were observed in faeces of the C57BL/6 mouse strain during the experiment. A similar number of juvenile worms were isolated from all mouse strains at 14 DPI, whereas no adult worms were detected in C57BL/6 mice at 90 DPI.
Collapse
Affiliation(s)
- Viliam Šnábel
- Institute of Parasitology, Slovak Academy of Sciences, 040 01, Košice, Slovakia,
| | | | | | | | | | | | | |
Collapse
|
60
|
Evidence that natural selection on codon usage in Drosophila pseudoobscura varies across codons. G3-GENES GENOMES GENETICS 2014; 4:681-92. [PMID: 24531731 PMCID: PMC4059240 DOI: 10.1534/g3.114.010488] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
Like other species of Drosophila, Drosophila pseudoobscura has a distinct bias toward the usage of C- and G-ending codons. Previous studies have indicated that this bias is due, at least in part, to natural selection. Codon bias clearly differs among amino acids (and other codon classes) in Drosophila, which may reflect differences in the intensity of selection on codon usage. Ongoing natural selection on synonymous codon usage should be reflected in the shapes of the site frequency spectra of derived states at polymorphic positions. Specifically, regardless of other demographic effects on the spectrum, it should be shifted toward higher values for changes from less-preferred to more-preferred codons, and toward lower values for the converse. If the intensity of natural selection is increased, shifts in the site frequency spectra should be more pronounced. A total of 33,729 synonymous polymorphic sites on Chromosome 2 in D. pseudoobscura were analyzed. Shifts in the site frequency spectra are consistent with differential intensity of natural selection on codon usage, with stronger shifts associated with higher codon bias. The shifts, in general, are greater for polymorphic synonymous sites than for polymorphic intron sites, also consistent with natural selection. However, unlike observations in D. melanogaster, codon bias is not reduced in areas of low recombination in D. pseudoobscura; the site frequency spectrum signal for selection on codon usage remains strong in these regions. However, diversity is reduced, as expected. It is possible that estimates of low recombination reflect a recent change in recombination rate.
Collapse
|
61
|
Hübner S, Rashkovetsky E, Kim YB, Oh JH, Michalak K, Weiner D, Korol AB, Nevo E, Michalak P. Genome differentiation of Drosophila melanogaster from a microclimate contrast in Evolution Canyon, Israel. Proc Natl Acad Sci U S A 2013; 110:21059-64. [PMID: 24324170 PMCID: PMC3876225 DOI: 10.1073/pnas.1321533111] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023] Open
Abstract
The opposite slopes of "Evolution Canyon" in Israel have served as a natural model system of adaptation to a microclimate contrast. Long-term studies of Drosophila melanogaster populations inhabiting the canyon have exhibited significant interslope divergence in thermal and drought stress resistance, candidate genes, mobile elements, habitat choice, mating discrimination, and wing-shape variation, all despite close physical proximity of the contrasting habitats, as well as substantial interslope migration. To examine patterns of genetic differentiation at the genome-wide level, we used high coverage sequencing of the flies' genomes. A total of 572 genes were significantly different in allele frequency between the slopes, 106 out of which were associated with 74 significantly overrepresented gene ontology (GO) terms, particularly so with response to stimulus and developmental and reproductive processes, thus corroborating previous observations of interslope divergence in stress response, life history, and mating functions. There were at least 37 chromosomal "islands" of interslope divergence and low sequence polymorphism, plausible signatures of selective sweeps, more abundant in flies derived from one (north-facing) of the slopes. Positive correlation between local recombination rate and the level of nucleotide polymorphism was also found.
Collapse
Affiliation(s)
- Sariel Hübner
- Institute of Evolution, Haifa University, Haifa 31905, Israel
- Department of Botany, University of British Columbia, Vancouver, BC, Canada V6T 1Z4
| | | | - Young Bun Kim
- Virginia Bioinformatics Institute, Virginia Polytechnic Institute and State University, Blacksburg, VA 24061; and
| | - Jung Hun Oh
- Department of Medical Physics, Memorial Sloan-Kettering Cancer Center, New York, NY 10065
| | - Katarzyna Michalak
- Virginia Bioinformatics Institute, Virginia Polytechnic Institute and State University, Blacksburg, VA 24061; and
| | - Dmitry Weiner
- Institute of Evolution, Haifa University, Haifa 31905, Israel
| | | | - Eviatar Nevo
- Institute of Evolution, Haifa University, Haifa 31905, Israel
| | - Pawel Michalak
- Virginia Bioinformatics Institute, Virginia Polytechnic Institute and State University, Blacksburg, VA 24061; and
| |
Collapse
|
62
|
Synonymous codon usage in TTSuV2: analysis and comparison with TTSuV1. PLoS One 2013; 8:e81469. [PMID: 24303050 PMCID: PMC3841265 DOI: 10.1371/journal.pone.0081469] [Citation(s) in RCA: 41] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2013] [Accepted: 10/12/2013] [Indexed: 11/19/2022] Open
Abstract
Two species of the DNA virus Torque teno sus virus (TTSuV), TTSuV1 and TTSuV2, have become widely distributed in pig-farming countries in recent years. In this study, we performed a comprehensive analysis of synonymous codon usage bias in 41 available TTSuV2 coding sequences (CDS), and compared the codon usage patterns of TTSuV2 and TTSuV1. TTSuV codon usage patterns were found to be phylogenetically conserved. Values for the effective number of codons (ENC) indicated that the overall extent of codon usage bias in both TTSuV2 and TTSuV1 was not significant, the most frequently occurring codons had an A or C at the third codon position. Correspondence analysis (COA) was performed and TTSuV2 and TTSuV1 sequences were located in different quadrants of the first two major axes. A plot of the ENC revealed that compositional constraint was the major factor determining the codon usage bias for TTSuV2. In addition, hierarchical cluster analysis of 41 TTSuV2 isolates based on relative synonymous codon usage (RSCU) values suggested that there was no association between geographic distribution and codon bias of TTSuV2 sequences. Finally, the comparison of RSCU for TTSuV2, TTSuV1 and the corresponding host sequence indicated that the codon usage pattern of TTSuV2 was similar to that of TTSuV1. However the similarity was low for each virus and its host. These conclusions provide important insight into the synonymous codon usage pattern of TTSuV2, as well as better understangding of the molecular evolution of TTSuV2 genomes.
Collapse
|
63
|
Morgan CC, Mc Cartney AM, Donoghue MTA, Loughran NB, Spillane C, Teeling EC, O'Connell MJ. Molecular adaptation of telomere associated genes in mammals. BMC Evol Biol 2013; 13:251. [PMID: 24237966 PMCID: PMC3833184 DOI: 10.1186/1471-2148-13-251] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2013] [Accepted: 11/06/2013] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Placental mammals display a huge range of life history traits, including size, longevity, metabolic rate and germ line generation time. Although a number of general trends have been proposed between these traits, there are exceptions that warrant further investigation. Species such as naked mole rat, human and certain bat species all exhibit extreme longevity with respect to body size. It has long been established that telomeres and telomere maintenance have a clear role in ageing but it has not yet been established whether there is evidence for adaptation in telomere maintenance proteins that could account for increased longevity in these species. RESULTS Here we carry out a molecular investigation of selective pressure variation, specifically focusing on telomere associated genes across placental mammals. In general we observe a large number of instances of positive selection acting on telomere genes. Although these signatures of selection overall are not significantly correlated with either longevity or body size we do identify positive selection in the microbat species Myotis lucifugus in functionally important regions of the telomere maintenance genes DKC1 and TERT, and in naked mole rat in the DNA repair gene BRCA1. CONCLUSION These results demonstrate the multifarious selective pressures acting across the mammal phylogeny driving lineage-specific adaptations of telomere associated genes. Our results show that regardless of the longevity of a species, these proteins have evolved under positive selection thereby removing increased longevity as the single selective force driving this rapid rate of evolution. However, evidence of molecular adaptations specific to naked mole rat and Myotis lucifugus highlight functionally significant regions in genes that may alter the way in which telomeres are regulated and maintained in these longer-lived species.
Collapse
|
64
|
Robinson MC, Stone EA, Singh ND. Population genomic analysis reveals no evidence for GC-biased gene conversion in Drosophila melanogaster. Mol Biol Evol 2013; 31:425-33. [PMID: 24214536 DOI: 10.1093/molbev/mst220] [Citation(s) in RCA: 38] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
Gene conversion is the nonreciprocal exchange of genetic material between homologous chromosomes. Multiple lines of evidence from a variety of taxa strongly suggest that gene conversion events are biased toward GC-bearing alleles. However, in Drosophila, the data have largely been indirect and unclear, with some studies supporting the predictions of a GC-biased gene conversion model and other data showing contradictory findings. Here, we test whether gene conversion events are GC-biased in Drosophila melanogaster using whole-genome polymorphism and divergence data. Our results provide no support for GC-biased gene conversion and thus suggest that this process is unlikely to significantly contribute to patterns of polymorphism and divergence in this system.
Collapse
Affiliation(s)
- Matthew C Robinson
- Department of Biological Sciences, Program in Genetics, North Carolina State University
| | | | | |
Collapse
|
65
|
A comparison of synonymous codon usage bias patterns in DNA and RNA virus genomes: quantifying the relative importance of mutational pressure and natural selection. BIOMED RESEARCH INTERNATIONAL 2013; 2013:406342. [PMID: 24199191 PMCID: PMC3808105 DOI: 10.1155/2013/406342] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/15/2013] [Revised: 06/30/2013] [Accepted: 08/04/2013] [Indexed: 11/17/2022]
Abstract
Codon usage bias patterns have been broadly explored for many viruses. However, the relative importance of mutation pressure and natural selection is still under debate. In the present study, I tried to resolve controversial issues on determining the principal factors of codon usage patterns for DNA and RNA viruses, respectively, by examining over 38000 ORFs. By utilizing variation partitioning technique, the results showed that 27% and 21% of total variation could be attributed to mutational pressure, while 5% and 6% of total variation could be explained by natural selection for DNA and RNA viruses, respectively, in codon usage patterns. Furthermore, the combined effect of mutational pressure and natural selection on influencing codon usage patterns of viruses is substantial (explaining 10% and 8% of total variation of codon usage patterns). With respect to GC variation, GC content is always negatively and significantly correlated with aromaticity. Interestingly, the signs for the significant correlations between GC, gene lengths, and hydrophobicity are completely opposite between DNA and RNA viruses, being positive for DNA viruses while being negative for RNA viruses. At last, GC12 versus G3s plot suggests that natural selection is more important than mutational pressure on influencing the GC content in the first and second codon positions.
Collapse
|
66
|
Genomic signatures of selection at linked sites: unifying the disparity among species. Nat Rev Genet 2013; 14:262-74. [PMID: 23478346 DOI: 10.1038/nrg3425] [Citation(s) in RCA: 315] [Impact Index Per Article: 28.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Population genetics theory supplies powerful predictions about how natural selection interacts with genetic linkage to sculpt the genomic landscape of nucleotide polymorphism. Both the spread of beneficial mutations and the removal of deleterious mutations act to depress polymorphism levels, especially in low-recombination regions. However, empiricists have documented extreme disparities among species. Here we characterize the dominant features that could drive differences in linked selection among species--including roles for selective sweeps being 'hard' or 'soft'--and the concealing effects of demography and confounding genomic variables. We advocate targeted studies of closely related species to unify our understanding of how selection and linkage interact to shape genome evolution.
Collapse
|
67
|
Choi SS, Hannenhalli S. Three independent determinants of protein evolutionary rate. J Mol Evol 2013; 76:98-111. [PMID: 23400388 DOI: 10.1007/s00239-013-9543-6] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2012] [Accepted: 01/16/2013] [Indexed: 12/15/2022]
Abstract
One of the most widely accepted ideas related to the evolutionary rates of proteins is that functionally important residues or regions evolve slower than other regions, a reasonable outcome of which should be a slower evolutionary rate of the proteins with a higher density of functionally important sites. Oddly, the role of functional importance, mainly measured by essentiality, in determining evolutionary rate has been challenged in recent studies. Several variables other than protein essentiality, such as expression level, gene compactness, protein-protein interactions, etc., have been suggested to affect protein evolutionary rate. In the present review, we try to refine the concept of functional importance of a gene, and consider three factors-functional importance, expression level, and gene compactness, as independent determinants of evolutionary rate of a protein, based not only on their known correlation with evolutionary rate but also on a reasonable mechanistic model. We suggest a framework based on these mechanistic models to correctly interpret the correlations between evolutionary rates and the various variables as well as the interrelationships among the variables.
Collapse
Affiliation(s)
- Sun Shim Choi
- Department of Medical Biotechnology, College of Biomedical Science, and Institute of Bioscience & Biotechnology, Kangwon National University, Chuncheon, South Korea.
| | | |
Collapse
|
68
|
McGaugh SE, Heil CSS, Manzano-Winkler B, Loewe L, Goldstein S, Himmel TL, Noor MAF. Recombination modulates how selection affects linked sites in Drosophila. PLoS Biol 2012; 10:e1001422. [PMID: 23152720 PMCID: PMC3496668 DOI: 10.1371/journal.pbio.1001422] [Citation(s) in RCA: 78] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2012] [Accepted: 10/05/2012] [Indexed: 11/18/2022] Open
Abstract
Recombination rate in Drosophila species shapes the impact of selection in the genome and is positively correlated with nucleotide diversity. One of the most influential observations in molecular evolution has been a strong association between local recombination rate and nucleotide polymorphisms across the genome. This is interpreted as evidence for ubiquitous natural selection. The alternative explanation, that recombination is mutagenic, has been rejected by the absence of a similar association between local recombination rate and nucleotide divergence between species. However, many recent studies show that recombination rates are often very different even in closely related species, questioning whether an association between recombination rate and divergence between species has been tested satisfactorily. To circumvent this problem, we directly surveyed recombination across approximately 43% of the D. pseudoobscura physical genome in two separate recombination maps and 31% of the D. miranda physical genome, and we identified both global and local differences in recombination rate between these two closely related species. Using only regions with conserved recombination rates between and within species and accounting for multiple covariates, our data support the conclusion that recombination is positively related to diversity because recombination modulates Hill–Robertson effects in the genome and not because recombination is predominately mutagenic. Finally, we find evidence for dips in diversity around nonsynonymous substitutions. We infer that at least some of this reduction in diversity resulted from selective sweeps and examine these dips in the context of recombination rate. Individuals within a species differ in the DNA sequences of their genes. This sequence variation affects how well individuals survive or reproduce and is transmitted to their offspring. Genes near each other on individual chromosomes tend to be passed to offspring together—neighboring genes are unlikely to be separated by exchanges of genetic material derived from different parents during meiotic recombination. When genes are inherited together, however, the evolutionary forces acting on one gene can interfere with variation at its neighbors. Thus, variation at multiple genes can be lost if natural selection acts on one gene in close proximity. Recombination can prevent or reduce this loss of variation, but previous tests of this phenomenon failed to account for recombination rate differences between species. In this study, we show that some parts of the genome differ in recombination rate between two species of fruit fly, Drosophila pseudoobscura and D. miranda. Avoiding an assumption made in previous studies, we then examine sequence variation within and between fly species in those parts of the genome that have conserved recombination rates. Based on the results, we conclude that recombination indeed preserves variation within species that would otherwise have been eliminated by natural selection.
Collapse
Affiliation(s)
- Suzanne E McGaugh
- Biology Department, Duke University, Durham, North Carolina, United States of America.
| | | | | | | | | | | | | |
Collapse
|
69
|
Inferences of demography and selection in an African population of Drosophila melanogaster. Genetics 2012; 193:215-28. [PMID: 23105013 DOI: 10.1534/genetics.112.145318] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
It remains a central problem in population genetics to infer the past action of natural selection, and these inferences pose a challenge because demographic events will also substantially affect patterns of polymorphism and divergence. Thus it is imperative to explicitly model the underlying demographic history of the population whenever making inferences about natural selection. In light of the considerable interest in adaptation in African populations of Drosophila melanogaster, which are considered ancestral to the species, we generated a large polymorphism data set representing 2.1 Mb from each of 20 individuals from a Ugandan population of D. melanogaster. In contrast to previous inferences of a simple population expansion in eastern Africa, our demographic modeling of this ancestral population reveals a strong signature of a population bottleneck followed by population expansion, which has significant implications for future demographic modeling of derived populations of this species. Taking this more complex underlying demographic history into account, we also estimate a mean X-linked region-wide rate of adaptation of 6 × 10(-11)/site/generation and a mean selection coefficient of beneficial mutations of 0.0009. These inferences regarding the rate and strength of selection are largely consistent with most other estimates from D. melanogaster and indicate a relatively high rate of adaptation driven by weakly beneficial mutations.
Collapse
|
70
|
Comeron JM, Ratnappan R, Bailin S. The many landscapes of recombination in Drosophila melanogaster. PLoS Genet 2012; 8:e1002905. [PMID: 23071443 PMCID: PMC3469467 DOI: 10.1371/journal.pgen.1002905] [Citation(s) in RCA: 332] [Impact Index Per Article: 27.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2012] [Accepted: 07/02/2012] [Indexed: 01/06/2023] Open
Abstract
Recombination is a fundamental biological process with profound evolutionary implications. Theory predicts that recombination increases the effectiveness of selection in natural populations. Yet, direct tests of this prediction have been restricted to qualitative trends due to the lack of detailed characterization of recombination rate variation across genomes and within species. The use of imprecise recombination rates can also skew population genetic analyses designed to assess the presence and mode of selection across genomes. Here we report the first integrated high-resolution description of genomic and population variation in recombination, which also distinguishes between the two outcomes of meiotic recombination: crossing over (CO) and gene conversion (GC). We characterized the products of 5,860 female meioses in Drosophila melanogaster by genotyping a total of 139 million informative SNPs and mapped 106,964 recombination events at a resolution down to 2 kilobases. This approach allowed us to generate whole-genome CO and GC maps as well as a detailed description of variation in recombination among individuals of this species. We describe many levels of variation in recombination rates. At a large-scale (100 kb), CO rates exhibit extreme and highly punctuated variation along chromosomes, with hot and coldspots. We also show extensive intra-specific variation in CO landscapes that is associated with hotspots at low frequency in our sample. GC rates are more uniformly distributed across the genome than CO rates and detectable in regions with reduced or absent CO. At a local scale, recombination events are associated with numerous sequence motifs and tend to occur within transcript regions, thus suggesting that chromatin accessibility favors double-strand breaks. All these non-independent layers of variation in recombination across genomes and among individuals need to be taken into account in order to obtain relevant estimates of recombination rates, and should be included in a new generation of population genetic models of the interaction between selection and linkage.
Collapse
Affiliation(s)
- Josep M Comeron
- Department of Biology, University of Iowa, Iowa City, Iowa, USA.
| | | | | |
Collapse
|
71
|
Zhou T, Lu ZH, Sun X. The Correlation between Recombination Rate and Codon Bias in Yeast Mainly Results from Mutational Bias Associated with Recombination Rather than Hill-Robertson Interference. CONFERENCE PROCEEDINGS : ... ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. ANNUAL CONFERENCE 2012; 2005:4787-90. [PMID: 17281312 DOI: 10.1109/iembs.2005.1615542] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
Codon usage has been reported to be correlated with local recombination rate, which can be explained by two proposed models. In the present study, correspondence analysis was used to investigate the major trends in codon usage variation among S. cerevisiae genes. It was found that the first principle source of codon usage variation in yeast is due to the variance of expressional levels, which is consistent with the previous translational selection model. Moreover, recombination rate is also correlated with the codon pattern, which might be a byproduct of mutational bias associated with recombination rather than the consequence of Hill-Robertson interference. A recent study has analysed the genome sequence, but reached opposite conclusions: the positive correlation between recombination rate and codon bias in yeast mainly results from Hill-Robertson interference. In light of this conflicting result, we have discussed the possible reason and found that the previous analysis was undermined by mistaken assumptions that weak selection acting at expression level led to the correlation between recombination and codon bias.
Collapse
Affiliation(s)
- T Zhou
- Key Laboratory of Molecular and Biomolecular Electronics of the Ministry of Education, Southeast University, Nanjing 210096, China
| | | | | |
Collapse
|
72
|
Behura SK, Severson DW. Codon usage bias: causative factors, quantification methods and genome-wide patterns: with emphasis on insect genomes. Biol Rev Camb Philos Soc 2012; 88:49-61. [PMID: 22889422 DOI: 10.1111/j.1469-185x.2012.00242.x] [Citation(s) in RCA: 124] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]
Abstract
Codon usage bias refers to the phenomenon where specific codons are used more often than other synonymous codons during translation of genes, the extent of which varies within and among species. Molecular evolutionary investigations suggest that codon bias is manifested as a result of balance between mutational and translational selection of such genes and that this phenomenon is widespread across species and may contribute to genome evolution in a significant manner. With the advent of whole-genome sequencing of numerous species, both prokaryotes and eukaryotes, genome-wide patterns of codon bias are emerging in different organisms. Various factors such as expression level, GC content, recombination rates, RNA stability, codon position, gene length and others (including environmental stress and population size) can influence codon usage bias within and among species. Moreover, there has been a continuous quest towards developing new concepts and tools to measure the extent of codon usage bias of genes. In this review, we outline the fundamental concepts of evolution of the genetic code, discuss various factors that may influence biased usage of synonymous codons and then outline different principles and methods of measurement of codon usage bias. Finally, we discuss selected studies performed using whole-genome sequences of different insect species to show how codon bias patterns vary within and among genomes. We conclude with generalized remarks on specific emerging aspects of codon bias studies and highlight the recent explosion of genome-sequencing efforts on arthropods (such as twelve Drosophila species, species of ants, honeybee, Nasonia and Anopheles mosquitoes as well as the recent launch of a genome-sequencing project involving 5000 insects and other arthropods) that may help us to understand better the evolution of codon bias and its biological significance.
Collapse
Affiliation(s)
- Susanta K Behura
- Department of Biological Sciences, Eck Institute for Global Health, University of Notre Dame, Notre Dame, IN 46556, USA.
| | | |
Collapse
|
73
|
Abstract
A major current molecular evolution challenge is to link comparative genomic patterns to species' biology and ecology. Breeding systems are pivotal because they affect many population genetic processes, and thus genome evolution. We review theoretical predictions and empirical evidence about molecular evolutionary processes under three distinct breeding systems-outcrossing, selfing, and asexuality. Breeding systems may have a profound impact on genome evolution, including molecular evolutionary rates, base composition, genomic conflict, and possibly genome size. However, while asexual species essentially conform to theoretical predictions, the situation is less simple in selfing species. We discuss the possible reasons to potentially explain this paradox. In reverse, comparative and population genomic data and approaches help revisiting old questions on the long-term evolution of breeding systems.
Collapse
|
74
|
The evolution of novelty in conserved gene families. INTERNATIONAL JOURNAL OF EVOLUTIONARY BIOLOGY 2012; 2012:490894. [PMID: 22779028 PMCID: PMC3388334 DOI: 10.1155/2012/490894] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/16/2012] [Accepted: 04/23/2012] [Indexed: 12/05/2022]
Abstract
One of the major aims of contemporary evolutionary biology is the understanding of the current pattern of biological diversity. This involves, first, the description of character distribution at various nodes of the phylogenetic tree of life and, second, the functional explanation of such changes. The analysis of character distribution is a powerful tool at both the morphological and molecular levels. Recent high-throughput sequencing approaches provide new opportunities to study the genetic architecture of organisms at the genome-wide level. In eukaryotes, one overarching finding is the absence of simple correlations of gene count and biological complexity. Instead, the domain architecture of proteins is becoming a central focus for large-scale evolutionary innovations. Here, we review examples of the evolution of novelty in conserved gene families in insects and nematodes. We highlight how in the absence of whole-genome duplications molecular novelty can arise, how members of gene families have diversified at distinct mechanistic levels, and how gene expression can be maintained in the context of multiple innovations in regulatory mechanisms.
Collapse
|
75
|
Pessia E, Popa A, Mousset S, Rezvoy C, Duret L, Marais GAB. Evidence for widespread GC-biased gene conversion in eukaryotes. Genome Biol Evol 2012; 4:675-82. [PMID: 22628461 PMCID: PMC5635611 DOI: 10.1093/gbe/evs052] [Citation(s) in RCA: 109] [Impact Index Per Article: 9.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023] Open
Abstract
GC-biased gene conversion (gBGC) is a process that tends to increase the GC content of recombining DNA over evolutionary time and is thought to explain the evolution of GC content in mammals and yeasts. Evidence for gBGC outside these two groups is growing but is still limited. Here, we analyzed 36 completely sequenced genomes representing four of the five major groups in eukaryotes (Unikonts, Excavates, Chromalveolates and Plantae). gBGC was investigated by directly comparing GC content and recombination rates in species where recombination data are available, that is, half of them. To study all species of our dataset, we used chromosome size as a proxy for recombination rate and compared it with GC content. Among the 17 species showing a significant relationship between GC content and chromosome size, 15 are consistent with the predictions of the gBGC model. Importantly, the species showing a pattern consistent with gBGC are found in all the four major groups of eukaryotes studied, which suggests that gBGC may be widespread in eukaryotes.
Collapse
Affiliation(s)
- Eugénie Pessia
- Université Lyon 1, Centre National de la Recherche Scientifique, UMR5558, Laboratoire de Biométrie et Biologie évolutive, Villeurbanne, Cedex, France
| | | | | | | | | | | |
Collapse
|
76
|
Paape T, Zhou P, Branca A, Briskine R, Young N, Tiffin P. Fine-scale population recombination rates, hotspots, and correlates of recombination in the Medicago truncatula genome. Genome Biol Evol 2012; 4:726-37. [PMID: 22554552 PMCID: PMC3381680 DOI: 10.1093/gbe/evs046] [Citation(s) in RCA: 56] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023] Open
Abstract
Recombination rates vary across the genome and in many species show significant relationships with several genomic features, including distance to the centromere, gene density, and GC content. Studies of fine-scale recombination rates have also revealed that in several species, there are recombination hotspots, that is, short regions with recombination rates 10–100 greater than those in surrounding regions. In this study, we analyzed whole-genome resequence data from 26 accessions of the model legume Medicago truncatula to gain insight into the genomic features that are related to high- and low-recombination rates and recombination hotspots at 1 kb scales. We found that high-recombination regions (1-kb windows among those in the highest 5% of the distribution) on all three chromosomes were significantly closer to the centromere, had higher gene density, and lower GC content than low-recombination windows. High-recombination windows are also significantly overrepresented among some gene functional categories—most strongly NB–ARC and LRR genes, both of which are important in plant defense against pathogens. Similar to high-recombination windows, recombination hotspots (1-kb windows with significantly higher recombination than the surrounding region) are significantly nearer to the centromere than nonhotspot windows. By contrast, we detected no difference in gene density or GC content between hotspot and nonhotspot windows. Using linear model wavelet analysis to examine the relationship between recombination and genomic features across multiple spatial scales, we find a significant negative correlation with distance to the centromere across scales up to 512 kb, whereas gene density and GC content show significantly positive and negative correlations, respectively, only up to 64 kb. Correlations between recombination and genomic features, particularly gene density and polymorphism, suggest that they are scale dependent and need to be assessed at scales relevant to the evolution of those features.
Collapse
Affiliation(s)
- Timothy Paape
- Department of Plant Biology, University of Minnesota, MN, USA.
| | | | | | | | | | | |
Collapse
|
77
|
Popa A, Samollow P, Gautier C, Mouchiroud D. The sex-specific impact of meiotic recombination on nucleotide composition. Genome Biol Evol 2012; 4:412-22. [PMID: 22417915 PMCID: PMC3318449 DOI: 10.1093/gbe/evs023] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
Meiotic recombination is an important evolutionary force shaping the nucleotide landscape of genomes. For most vertebrates, the frequency of recombination varies slightly or considerably between the sexes (heterochiasmy). In humans, male, rather than female, recombination rate has been found to be more highly correlated with the guanine and cytosine (GC) content across the genome. In the present study, we review the results in human and extend the examination of the evolutionary impact of heterochiasmy beyond primates to include four additional eutherian mammals (mouse, dog, pig, and sheep), a metatherian mammal (opossum), and a bird (chicken). Specifically, we compared sex-specific recombination rates (RRs) with nucleotide substitution patterns evaluated in transposable elements. Our results, based on a comparative approach, reveal a great diversity in the relationship between heterochiasmy and nucleotide composition. We find that the stronger male impact on this relationship is a conserved feature of human, mouse, dog, and sheep. In contrast, variation in genomic GC content in pig and opossum is more strongly correlated with female, rather than male, RR. Moreover, we show that the sex-differential impact of recombination is mainly driven by the chromosomal localization of recombination events. Independent of sex, the higher the RR in a genomic region and the longer this recombination activity is conserved in time, the stronger the bias in nucleotide substitution pattern, through such mechanisms as biased gene conversion. Over time, this bias will increase the local GC content of the region.
Collapse
|
78
|
Lee Y, Seifert SN, Nieman CC, McAbee RD, Goodell P, Fryxell RT, Lanzaro GC, Cornel AJ. High degree of single nucleotide polymorphisms in California Culex pipiens (Diptera: Culicidae) sensu lato. JOURNAL OF MEDICAL ENTOMOLOGY 2012; 49:299-306. [PMID: 22493847 PMCID: PMC3553656 DOI: 10.1603/me11108] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/14/2023]
Abstract
Resolution of systematic relationships among members of the Culex pipiens (L.) complex has important implications for public health as well as for studies on the evolution of sibling species. Currently held views contend that in California considerable genetic introgression occurs between Cx. pipiens and Cx. quinquefasciatus Say, and as such, these taxa behave as if they are a single species. Development of high throughput SNP genotyping tools for the analysis of Cx. pipiens complex population structure is therefore desirable. As a first step toward this goal, we sequenced 12 gene fragments from specimens collected in Marin and Fresno counties. On average, we found a higher single nucleotide polymorphism (SNP) density than any other mosquito species reported thus far. Coding regions contained significantly higher GC content (median 54.7%) than noncoding regions (42.4%; Wilcoxon rank sum test, P = 5.29 x 10(-5)). Differences in SNP allele frequencies observed between mosquitoes from Marin and Fresno counties indicated significant genetic divergence and suggest that SNP markers will be useful for future detailed population genetic studies of this group. The high density of SNPs highlights the difficulty in identifying species within the complex and may be associated with the large degree of phenotypic variation observed in this group of mosquitoes.
Collapse
Affiliation(s)
- Yoosook Lee
- School of Veterinary Medicine, Department of Pathology, Microbiology and Immunology, University of California-Davis, Davis, CA 95616, USA.
| | | | | | | | | | | | | | | |
Collapse
|
79
|
|
80
|
Webster MT, Hurst LD. Direct and indirect consequences of meiotic recombination: implications for genome evolution. Trends Genet 2011; 28:101-9. [PMID: 22154475 DOI: 10.1016/j.tig.2011.11.002] [Citation(s) in RCA: 83] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2011] [Revised: 11/08/2011] [Accepted: 11/09/2011] [Indexed: 12/23/2022]
Abstract
There is considerable variation within eukaryotic genomes in the local rate of crossing over. Why is this and what effect does it have on genome evolution? On the genome scale, it is known that by shuffling alleles, recombination increases the efficacy of selection. By contrast, the extent to which differences in the recombination rate modulate the efficacy of selection between genomic regions is unclear. Recombination also has direct consequences on the origin and fate of mutations: biased gene conversion and other forms of meiotic drive promote the fixation of mutations in a similar way to selection, and recombination itself may be mutagenic. Consideration of both the direct and indirect effects of recombination is necessary to understand why its rate is so variable and for correct interpretation of patterns of genome evolution.
Collapse
Affiliation(s)
- Matthew T Webster
- Science for Life Laboratory, Department of Medical Biochemistry and Microbiology, Uppsala University, Uppsala, Sweden.
| | | |
Collapse
|
81
|
Smukowski CS, Noor MAF. Recombination rate variation in closely related species. Heredity (Edinb) 2011; 107:496-508. [PMID: 21673743 PMCID: PMC3242630 DOI: 10.1038/hdy.2011.44] [Citation(s) in RCA: 139] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2010] [Revised: 03/21/2011] [Accepted: 04/27/2011] [Indexed: 11/09/2022] Open
Abstract
Despite their importance to successful meiosis and various evolutionary processes, meiotic recombination rates sometimes vary within species or between closely related species. For example, humans and chimpanzees share virtually no recombination hotspot locations in the surveyed portion of the genomes. However, conservation of recombination rates between closely related species has also been documented, raising an apparent contradiction. Here, we evaluate how and why conflicting patterns of recombination rate conservation and divergence may be observed, with particular emphasis on features that affect recombination, and the scale and method with which recombination is surveyed. Additionally, we review recent studies identifying features influencing fine-scale and broad-scale recombination patterns and informing how quickly recombination rates evolve, how changes in recombination impact selection and evolution in natural populations, and more broadly, which forces influence genome evolution.
Collapse
Affiliation(s)
- C S Smukowski
- Department of Biology, Duke University, Durham, NC 27708, USA.
| | | |
Collapse
|
82
|
Xu C, Cai X, Chen Q, Zhou H, Cai Y, Ben A. Factors affecting synonymous codon usage bias in chloroplast genome of oncidium gower ramsey. Evol Bioinform Online 2011; 7:271-8. [PMID: 22253533 PMCID: PMC3255522 DOI: 10.4137/ebo.s8092] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/04/2022] Open
Abstract
Oncidium Gower Ramsey is a fascinating and important ornamental flower in floral industry. In this research, the complete nucleotide sequence of the chloroplast genome in Oncidium Gower Ramsey was studied, then analyzed using Codonw software. Correspondence analysis and method of effective number of codon as Nc-plot were conducted to analyze synonymous codon usage. According to the corresponding analysis, codon bias in the chloroplast genome of Oncidium Gower Ramsey is related to their gene length, mutation bias, gene hydropathy level of each protein, gene function and selection or gene expression only subtly affect codon usage. This study will provide insights into the molecular evolution study and high-level transgene expression.
Collapse
Affiliation(s)
- Chen Xu
- School of Biochemical and Environmental Engineering, Nanjing Xiaozhuang University, Nanjing 211171, Jiangsu, China
| | | | | | | | | | | |
Collapse
|
83
|
Rao Y, Wu G, Wang Z, Chai X, Nie Q, Zhang X. Mutation bias is the driving force of codon usage in the Gallus gallus genome. DNA Res 2011; 18:499-512. [PMID: 22039174 PMCID: PMC3223081 DOI: 10.1093/dnares/dsr035] [Citation(s) in RCA: 68] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022] Open
Abstract
Synonymous codons are used with different frequencies both among species and among genes within the same genome and are controlled by neutral processes (such as mutation and drift) as well as by selection. Up to now, a systematic examination of the codon usage for the chicken genome has not been performed. Here, we carried out a whole genome analysis of the chicken genome by the use of the relative synonymous codon usage (RSCU) method and identified 11 putative optimal codons, all of them ending with uracil (U), which is significantly departing from the pattern observed in other eukaryotes. Optimal codons in the chicken genome are most likely the ones corresponding to highly expressed transfer RNA (tRNAs) or tRNA gene copy numbers in the cell. Codon bias, measured as the frequency of optimal codons (Fop), is negatively correlated with the G + C content, recombination rate, but positively correlated with gene expression, protein length, gene length and intron length. The positive correlation between codon bias and protein, gene and intron length is quite different from other multi-cellular organism, as this trend has been only found in unicellular organisms. Our data displayed that regional G + C content explains a large proportion of the variance of codon bias in chicken. Stepwise selection model analyses indicate that G + C content of coding sequence is the most important factor for codon bias. It appears that variation in the G + C content of CDSs accounts for over 60% of the variation of codon bias. This study suggests that both mutation bias and selection contribute to codon bias. However, mutation bias is the driving force of the codon usage in the Gallus gallus genome. Our data also provide evidence that the negative correlation between codon bias and recombination rates in G. gallus is determined mostly by recombination-dependent mutational patterns.
Collapse
Affiliation(s)
- Yousheng Rao
- Department of Biological Technology, Jiangxi Educational Institute, Nanchang, China.
| | | | | | | | | | | |
Collapse
|
84
|
Conceição IC, Long AD, Gruber JD, Beldade P. Genomic sequence around butterfly wing development genes: annotation and comparative analysis. PLoS One 2011; 6:e23778. [PMID: 21909358 PMCID: PMC3166123 DOI: 10.1371/journal.pone.0023778] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2011] [Accepted: 07/27/2011] [Indexed: 12/22/2022] Open
Abstract
BACKGROUND Analysis of genomic sequence allows characterization of genome content and organization, and access beyond gene-coding regions for identification of functional elements. BAC libraries, where relatively large genomic regions are made readily available, are especially useful for species without a fully sequenced genome and can increase genomic coverage of phylogenetic and biological diversity. For example, no butterfly genome is yet available despite the unique genetic and biological properties of this group, such as diversified wing color patterns. The evolution and development of these patterns is being studied in a few target species, including Bicyclus anynana, where a whole-genome BAC library allows targeted access to large genomic regions. METHODOLOGY/PRINCIPAL FINDINGS We characterize ∼1.3 Mb of genomic sequence around 11 selected genes expressed in B. anynana developing wings. Extensive manual curation of in silico predictions, also making use of a large dataset of expressed genes for this species, identified repetitive elements and protein coding sequence, and highlighted an expansion of Alcohol dehydrogenase genes. Comparative analysis with orthologous regions of the lepidopteran reference genome allowed assessment of conservation of fine-scale synteny (with detection of new inversions and translocations) and of DNA sequence (with detection of high levels of conservation of non-coding regions around some, but not all, developmental genes). CONCLUSIONS The general properties and organization of the available B. anynana genomic sequence are similar to the lepidopteran reference, despite the more than 140 MY divergence. Our results lay the groundwork for further studies of new interesting findings in relation to both coding and non-coding sequence: 1) the Alcohol dehydrogenase expansion with higher similarity between the five tandemly-repeated B. anynana paralogs than with the corresponding B. mori orthologs, and 2) the high conservation of non-coding sequence around the genes wingless and Ecdysone receptor, both involved in multiple developmental processes including wing pattern formation.
Collapse
MESH Headings
- Alcohol Dehydrogenase/genetics
- Animals
- Base Composition/genetics
- Base Sequence
- Bombyx/genetics
- Butterflies/genetics
- Butterflies/growth & development
- Chromosomes, Artificial, Bacterial/genetics
- Computational Biology
- Conserved Sequence/genetics
- DNA Transposable Elements/genetics
- DNA, Intergenic/genetics
- Databases, Genetic
- Expressed Sequence Tags
- Gene Order/genetics
- Genes, Developmental/genetics
- Genes, Insect/genetics
- MicroRNAs/genetics
- Molecular Sequence Annotation
- Molecular Sequence Data
- Open Reading Frames/genetics
- Phylogeny
- Repetitive Sequences, Nucleic Acid/genetics
- Reproducibility of Results
- Sequence Homology, Nucleic Acid
- Synteny/genetics
- Wings, Animal/growth & development
- Wings, Animal/metabolism
Collapse
Affiliation(s)
| | - Anthony D. Long
- University of California Irvine, Irvine, California, United States of America
| | - Jonathan D. Gruber
- University of California Irvine, Irvine, California, United States of America
| | - Patrícia Beldade
- Instituto Gulbenkian de Ciência, Oeiras, Portugal
- Institute of Biology, Leiden University, Leiden, The Netherlands
| |
Collapse
|
85
|
Castillo-Ramírez S, Harris SR, Holden MTG, He M, Parkhill J, Bentley SD, Feil EJ. The impact of recombination on dN/dS within recently emerged bacterial clones. PLoS Pathog 2011; 7:e1002129. [PMID: 21779170 PMCID: PMC3136474 DOI: 10.1371/journal.ppat.1002129] [Citation(s) in RCA: 70] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2010] [Accepted: 05/04/2011] [Indexed: 12/23/2022] Open
Abstract
The development of next-generation sequencing platforms is set to reveal an unprecedented level of detail on short-term molecular evolutionary processes in bacteria. Here we re-analyse genome-wide single nucleotide polymorphism (SNP) datasets for recently emerged clones of methicillin resistant Staphylococcus aureus (MRSA) and Clostridium difficile. We note a highly significant enrichment of synonymous SNPs in those genes which have been affected by recombination, i.e. those genes on mobile elements designated “non-core” (in the case of S. aureus), or those core genes which have been affected by homologous replacements (S. aureus and C. difficile). This observation suggests that the previously documented decrease in dN/dS over time in bacteria applies not only to genomes of differing levels of divergence overall, but also to horizontally acquired genes of differing levels of divergence within a single genome. We also consider the role of increased drift acting on recently emerged, highly specialised clones, and the impact of recombination on selection at linked sites. This work has implications for a wide range of genomic analyses. As bacteria diversify, many of the nucleotide changes that emerge will render the cell slightly less competitive, and these mutations will tend to be removed by natural selection. However, this purging process does not happen instantaneously, and this delay allows deleterious mutations to survive in the population long enough to be sampled. Genomes at the very initial stages of diversification therefore exhibit a relatively high proportion of slightly deleterious mutations and, as most of these are non-synonymous mutations, this is manifest as a high dN/dS ratio. However, the effective population size will also impact on this ratio, as will selection operating on neighbouring SNPs. Here we examine the distribution of synonymous and non-synonymous SNPs within recently emerged clones of two important nosocomial pathogens, methicillin resistant Staphylococcus aureus (MRSA) and Clostridium difficile. In both species, we note a much higher proportion of synonymous changes in those single nucleotide polymorphisms (SNPs) likely to have emerged through recombination compared to de novo mutations. We argue that this effect is explained by the very recent emergence of the mutational SNPs combined with a reduction in the efficiency of selection due to niche specialisation.
Collapse
Affiliation(s)
| | - Simon R. Harris
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom
| | - Matthew T. G. Holden
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom
| | - Miao He
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom
| | - Julian Parkhill
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom
| | - Stephen D. Bentley
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom
| | - Edward J. Feil
- Department of Biology and Biochemistry, University of Bath, Claverton Down, Bath, United Kingdom
- * E-mail:
| |
Collapse
|
86
|
Capra JA, Pollard KS. Substitution patterns are GC-biased in divergent sequences across the metazoans. Genome Biol Evol 2011; 3:516-27. [PMID: 21670083 PMCID: PMC3138425 DOI: 10.1093/gbe/evr051] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
The fastest-evolving regions in the human and chimpanzee genomes show a remarkable excess of weak (A,T) to strong (G,C) nucleotide substitutions since divergence from their common ancestor. We investigated the phylogenetic extent and possible causes of this weak to strong (W→S) bias in divergent sequences (BDS) using recently sequenced genomes and recombination maps from eight trios of eukaryotic species. To quantify evidence for BDS, we inferred substitution histories using an efficient maximum likelihood approach with a context-dependent evolutionary model. We then annotated all lineage-specific substitutions in terms of W→S bias and density on the chromosomes. Finally, we used the inferred substitutions to calculate a BDS score—a log odds ratio between substitution type and density—and assessed its statistical significance with Fisher's exact test. Applying this approach, we found significant BDS in the coding and noncoding sequence of human, mouse, dog, stickleback, fruit fly, and worm. We also observed a significant lack of W→S BDS in chicken and yeast. The BDS score varies between species and across the chromosomes within each species. It is most strongly correlated with different genomic features in different species, but a strong correlation with recombination rates is found in several species. Our results demonstrate that a W→S substitution bias in fast-evolving sequences is a widespread phenomenon. The patterns of BDS observed suggest that a recombination-associated process, such as GC-biased gene conversion, is involved in the production of the bias in many species, but the strength of the BDS likely depends on many factors, including genome stability, variability in recombination rate over time and across the genome, the frequency of meiosis, and the amount of outcrossing in each species.
Collapse
Affiliation(s)
- John A. Capra
- Gladstone Institutes, University of California, San Francisco
| | - Katherine S. Pollard
- Gladstone Institutes, University of California, San Francisco
- Division of Biostatistics & Institute for Human Genetics, University of California, San Francisco
- Corresponding author: E-mail:
| |
Collapse
|
87
|
Cheung SC, Liu LZ, Lan LL, Liu QQ, Sun SS, Chan JC, Tong PC. Glucose lowering effect of transgenic human insulin-like growth factor-I from rice: in vitro and in vivo studies. BMC Biotechnol 2011; 11:37. [PMID: 21486461 PMCID: PMC3098155 DOI: 10.1186/1472-6750-11-37] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2010] [Accepted: 04/12/2011] [Indexed: 12/13/2022] Open
Abstract
Background Human insulin-like growth factor-I (hIGF-I) is a growth factor which is highly resemble to insulin. It is essential for cell proliferation and has been proposed for treatment of various endocrine-associated diseases including growth hormone insensitivity syndrome and diabetes mellitus. In the present study, an efficient plant expression system was developed to produce biologically active recombinant hIGF-I (rhIGF-I) in transgenic rice grains. Results The plant-codon-optimized hIGF-I was introduced into rice via Agrobacterium-mediated transformation. To enhance the stability and yield of rhIGF-I, the endoplasmic reticulum-retention signal and glutelin signal peptide were used to deliver rhIGF-I to endoplasmic reticulum for stable accumulation. We found that only glutelin signal peptide could lead to successful expression of hIGF-I and one gram of hIGF-I rice grain possessed the maximum activity level equivalent to 3.2 micro molar of commercial rhIGF-I. In vitro functional analysis showed that the rice-derived rhIGF-I was effective in inducing membrane ruffling and glucose uptake on rat skeletal muscle cells. Oral meal test with rice-containing rhIGF-I acutely reduced blood glucose levels in streptozotocin-induced and Zucker diabetic rats, whereas it had no effect in normal rats. Conclusion Our findings provided an alternative expression system to produce large quantities of biologically active rhIGF-I. The provision of large quantity of recombinant proteins will promote further research on the therapeutic potential of rhIGF-I.
Collapse
Affiliation(s)
- Stanley Ck Cheung
- Department of Medicine and Therapeutics, The Chinese University of Hong Kong, Prince of Wales Hospital, Shatin, Hong Kong
| | | | | | | | | | | | | |
Collapse
|
88
|
Stoletzki N. The surprising negative correlation of gene length and optimal codon use--disentangling translational selection from GC-biased gene conversion in yeast. BMC Evol Biol 2011; 11:93. [PMID: 21481245 PMCID: PMC3096941 DOI: 10.1186/1471-2148-11-93] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2010] [Accepted: 04/11/2011] [Indexed: 02/06/2023] Open
Abstract
Background Surprisingly, in several multi-cellular eukaryotes optimal codon use correlates negatively with gene length. This contrasts with the expectation under selection for translational accuracy. While suggested explanations focus on variation in strength and efficiency of translational selection, it has rarely been noticed that the negative correlation is reported only in organisms whose optimal codons are biased towards codons that end with G or C (-GC). This raises the question whether forces that affect base composition - such as GC-biased gene conversion - contribute to the negative correlation between optimal codon use and gene length. Results Yeast is a good organism to study this as equal numbers of optimal codons end in -GC and -AT and one may hence compare frequencies of optimal GC- with optimal AT-ending codons to disentangle the forces. Results of this study demonstrate in yeast frequencies of GC-ending (optimal AND non-optimal) codons decrease with gene length and increase with recombination. A decrease of GC-ending codons along genes contributes to the negative correlation with gene length. Correlations with recombination and gene expression differentiate between GC-ending and optimal codons, and also substitution patterns support effects of GC-biased gene conversion. Conclusion While the general effect of GC-biased gene conversion is well known, the negative correlation of optimal codon use with gene length has not been considered in this context before. Initiation of gene conversion events in promoter regions and the presence of a gene conversion gradient most likely explain the observed decrease of GC-ending codons with gene length and gene position.
Collapse
Affiliation(s)
- Nina Stoletzki
- Ludwig-Maximilan Universität, Biocenter, Grosshadernerstr, 2, D-82152 Planegg-Martinsried, Germany.
| |
Collapse
|
89
|
Degeneration in codon usage within the region of suppressed recombination in the mating-type chromosomes of Neurospora tetrasperma. EUKARYOTIC CELL 2011; 10:594-603. [PMID: 21335530 DOI: 10.1128/ec.00284-10] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
The origin and early evolution of sex chromosomes are currently poorly understood. The Neurospora tetrasperma mating-type (mat) chromosomes have recently emerged as a model system for the study of early sex chromosome evolution, since they contain a young (<6 million years ago [Mya]), large (>6.6-Mb) region of suppressed recombination. Here we examined preferred-codon usage in 290 genes (121,831 codon positions) in order to test for early signs of genomic degeneration in N. tetrasperma mat chromosomes. We report several key findings about codon usage in the region of recombination suppression, including the following: (i) this region has been subjected to marked and largely independent degeneration among gene alleles; (ii) the level of degeneration is magnified over longer periods of recombination suppression; and (iii) both mat a and mat A chromosomes have been subjected to deterioration. The frequency of shifts from preferred codons to nonpreferred codons is greater for shorter genes than for longer genes, suggesting that short genes play an especially significant role in early sex chromosome evolution. Furthermore, we show that these degenerative changes in codon usage are best explained by altered selection efficiency in the recombinationally suppressed region. These findings demonstrate that the fungus N. tetrasperma provides an effective system for the study of degenerative genomic changes in young regions of recombination suppression in sex-regulating chromosomes.
Collapse
|
90
|
Marsolier-Kergoat MC. A simple model for the influence of meiotic conversion tracts on GC content. PLoS One 2011; 6:e16109. [PMID: 21249197 PMCID: PMC3020949 DOI: 10.1371/journal.pone.0016109] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2010] [Accepted: 12/10/2010] [Indexed: 11/19/2022] Open
Abstract
A strong correlation between GC content and recombination rate is observed in many eukaryotes, which is thought to be due to conversion events linked to the repair of meiotic double-strand breaks. In several organisms, the length of conversion tracts has been shown to decrease exponentially with increasing distance from the sites of meiotic double-strand breaks. I show here that this behavior leads to a simple analytical model for the evolution and the equilibrium state of the GC content of sequences devoid of meiotic double-strand break sites. In the yeast Saccharomyces cerevisiae, meiotic double-strand breaks are practically excluded from protein-coding sequences. A good fit was observed between the predictions of the model and the variations of the average GC content of the third codon position (GC3) of S. cerevisiae genes. Moreover, recombination parameters that can be extracted by fitting the data to the model coincide with experimentally determined values. These results thus indicate that meiotic recombination plays an important part in determining the fluctuations of GC content in yeast coding sequences. The model also accounted for the different patterns of GC variations observed in the genes of Candida species that exhibit a variety of sexual lifestyles, and hence a wide range of meiotic recombination rates. Finally, the variations of the average GC3 content of human and chicken coding sequences could also be fitted by the model. These results suggest the existence of a widespread pattern of GC variation in eukaryotic genes due to meiotic recombination, which would imply the generality of two features of meiotic recombination: its association with GC-biased gene conversion and the quasi-exclusion of meiotic double-strand breaks from coding sequences. Moreover, the model points out to specific constraints on protein fragments encoded by exon terminal sequences, which are the most affected by the GC bias.
Collapse
|
91
|
|
92
|
Genetic and evolutionary correlates of fine-scale recombination rate variation in Drosophila persimilis. J Mol Evol 2010; 71:332-45. [PMID: 20890595 DOI: 10.1007/s00239-010-9388-1] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2010] [Accepted: 09/08/2010] [Indexed: 12/31/2022]
Abstract
Recombination is fundamental to meiosis in many species and generates variation on which natural selection can act, yet fine-scale linkage maps are cumbersome to construct. We generated a fine-scale map of recombination rates across two major chromosomes in Drosophila persimilis using 181 SNP markers spanning two of five major chromosome arms. Using this map, we report significant fine-scale heterogeneity of local recombination rates. However, we also observed "recombinational neighborhoods," where adjacent intervals had similar recombination rates after excluding regions near the centromere and telomere. We further found significant positive associations of fine-scale recombination rate with repetitive element abundance and a 13-bp sequence motif known to associate with human recombination rates. We noted strong crossover interference extending 5-7 Mb from the initial crossover event. Further, we observed that fine-scale recombination rates in D. persimilis are strongly correlated with those obtained from a comparable study of its sister species, D. pseudoobscura. We documented a significant relationship between recombination rates and intron nucleotide sequence diversity within species, but no relationship between recombination rate and intron divergence between species. These results are consistent with selection models (hitchhiking and background selection) rather than mutagenic recombination models for explaining the relationship of recombination with nucleotide diversity within species. Finally, we found significant correlations between recombination rate and GC content, supporting both GC-biased gene conversion (BGC) models and selection-driven codon bias models. Overall, this genome-enabled map of fine-scale recombination rates allowed us to confirm findings of broader-scale studies and identify multiple novel features that merit further investigation.
Collapse
|
93
|
Williford A, Comeron JM. Local effects of limited recombination: historical perspective and consequences for population estimates of adaptive evolution. J Hered 2010; 101 Suppl 1:S127-34. [PMID: 20421321 DOI: 10.1093/jhered/esq012] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Recent years have witnessed the integration of theoretical advances in population genetics with large-scale analyses of complete genomes, with a growing number of studies suggesting pervasive natural selection that includes frequent deleterious as well as adaptive mutations. In finite populations, however, mutations under selection alter the fate of genetically linked mutations (the so-called Hill-Robertson effect). Here we review the evolutionary consequences of selection at linked sites (linked selection) focusing on its effects on nearby nucleotides in genomic regions with nonreduced recombination. We argue that these local effects of linkage may account for differences in selection intensity among genes. We also show that even high levels of recombination are unlikely to remove all effects of linked selection, causing a reduction in the polymorphism to divergence ratio (r(pd)) at neutral sites. Because a number of methods employed to estimate the magnitude and frequency of adaptive mutations take reduced r(pd) as evidence of positive selection, ignoring local linkage effects may lead to misleading estimates of the proportion of adaptive substitutions and estimates of positive selection. These biases are caused by employing methods that do not account for local variation in the relative effective population size (N(e)) caused by linked selection.
Collapse
Affiliation(s)
- Anna Williford
- Department of Biology, University of Iowa, Iowa, IA 52242, USA
| | | |
Collapse
|
94
|
Harrison RJ, Charlesworth B. Biased Gene Conversion Affects Patterns of Codon Usage and Amino Acid Usage in the Saccharomyces sensu stricto Group of Yeasts. Mol Biol Evol 2010; 28:117-29. [DOI: 10.1093/molbev/msq191] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
|
95
|
Evolutionary analysis of the CACTA DNA-transposon Caspar across wheat species using sequence comparison and in situ hybridization. Mol Genet Genomics 2010; 284:11-23. [DOI: 10.1007/s00438-010-0544-5] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2009] [Accepted: 05/04/2010] [Indexed: 01/17/2023]
|
96
|
Li S, Shih CH, Kohn MH. Functional and evolutionary correlates of gene constellations in the Drosophila melanogaster genome that deviate from the stereotypical gene architecture. BMC Genomics 2010; 11:322. [PMID: 20497561 PMCID: PMC2891614 DOI: 10.1186/1471-2164-11-322] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2009] [Accepted: 05/24/2010] [Indexed: 01/19/2023] Open
Abstract
Background The biological dimensions of genes are manifold. These include genomic properties, (e.g., X/autosomal linkage, recombination) and functional properties (e.g., expression level, tissue specificity). Multiple properties, each generally of subtle influence individually, may affect the evolution of genes or merely be (auto-)correlates. Results of multidimensional analyses may reveal the relative importance of these properties on the evolution of genes, and therefore help evaluate whether these properties should be considered during analyses. While numerous properties are now considered during studies, most work still assumes the stereotypical solitary gene as commonly depicted in textbooks. Here, we investigate the Drosophila melanogaster genome to determine whether deviations from the stereotypical gene architecture correlate with other properties of genes. Results Deviations from the stereotypical gene architecture were classified as the following gene constellations: Overlapping genes were defined as those that overlap in the 5-prime, exonic, or intronic regions. Chromatin co-clustering genes were defined as genes that co-clustered within 20 kb of transcriptional territories. If this scheme is applied the stereotypical gene emerges as a rare occurrence (7.5%), slightly varied schemes yielded between ~1%-50%. Moreover, when following our scheme, paired-overlapping genes and chromatin co-clustering genes accounted for 50.1 and 42.4% of the genes analyzed, respectively. Gene constellation was a correlate of a number of functional and evolutionary properties of genes, but its statistical effect was ~1-2 orders of magnitude lower than the effects of recombination, chromosome linkage and protein function. Analysis of datasets on male reproductive proteins showed these were biased in their representation of gene constellations and evolutionary rate Ka/Ks estimates, but these biases did not overwhelm the biologically meaningful observation of high evolutionary rates of male reproductive genes. Conclusion Given the rarity of the solitary stereotypical gene, and the abundance of gene constellations that deviate from it, the presence of gene constellations, while once thought to be exceptional in large Eukaryote genomes, might have broader relevance to the understanding and study of the genome. However, according to our definition, while gene constellations can be significant correlates of functional properties of genes, they generally are weak correlates of the evolution of genes. Thus, the need for their consideration would depend on the context of studies.
Collapse
Affiliation(s)
- Shuwei Li
- Department of Ecology and Evolutionary Biology, Rice University, 6100 Main Street, MS 170, Houston, Texas 77005, USA
| | | | | |
Collapse
|
97
|
Fiston-Lavier AS, Singh ND, Lipatov M, Petrov DA. Drosophila melanogaster recombination rate calculator. Gene 2010; 463:18-20. [PMID: 20452408 DOI: 10.1016/j.gene.2010.04.015] [Citation(s) in RCA: 104] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2010] [Revised: 04/26/2010] [Accepted: 04/28/2010] [Indexed: 10/19/2022]
Abstract
Recombination rate is a key evolutionary parameter that determines the degree to which sites are linked. Estimating recombination rates is thus of crucial importance for population genetic and molecular evolutionary studies. We present here a user-friendly web-based tool that can be used to retrieve recombination rate estimates for single and/or multiple loci in the Drosophila melanogaster genome given a user-defined choice of the genome release. We used the Marey map approach that is based on comparing the genetic and physical maps to infer recombination rates along the major chromosomes of the D.melanogaster genome. Our implementation of this approach is based on building third-order polynomials which are used to interpolate recombination rates at all points on the chromosome except for telomeric and centromeric regions in which such polynomials are known to provide particularly poor estimation.
Collapse
|
98
|
Surprising fitness consequences of GC-biased gene conversion: I. Mutation load and inbreeding depression. Genetics 2010; 185:939-59. [PMID: 20421602 DOI: 10.1534/genetics.110.116368] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
GC-biased gene conversion (gBGC) is a recombination-associated process mimicking selection in favor of G and C alleles. It is increasingly recognized as a widespread force in shaping the genomic nucleotide landscape. In recombination hotspots, gBGC can lead to bursts of fixation of GC nucleotides and to accelerated nucleotide substitution rates. It was recently shown that these episodes of strong gBGC could give spurious signatures of adaptation and/or relaxed selection. There is also evidence that gBGC could drive the fixation of deleterious amino acid mutations in some primate genes. This raises the question of the potential fitness effects of gBGC. While gBGC has been metaphorically termed the "Achilles' heel" of our genome, we do not know whether interference between gBGC and selection merely has practical consequences for the analysis of sequence data or whether it has broader fundamental implications for individuals and populations. I developed a population genetics model to predict the consequences of gBGC on the mutation load and inbreeding depression. I also used estimates available for humans to quantitatively evaluate the fitness impact of gBGC. Surprising features emerged from this model: (i) Contrary to classical mutation load models, gBGC generates a fixation load independent of population size and could contribute to a significant part of the load; (ii) gBGC can maintain recessive deleterious mutations for a long time at intermediate frequency, in a similar way to overdominance, and these mutations generate high inbreeding depression, even if they are slightly deleterious; (iii) since mating systems affect both the selection efficacy and gBGC intensity, gBGC challenges classical predictions concerning the interaction between mating systems and deleterious mutations, and gBGC could constitute an additional cost of outcrossing; and (iv) if mutations are biased toward A and T alleles, very low gBGC levels can reduce the load. A robust prediction is that the gBGC level minimizing the load depends only on the mutational bias and population size. These surprising results suggest that gBGC may have nonnegligible fitness consequences and could play a significant role in the evolution of genetic systems. They also shed light on the evolution of gBGC itself.
Collapse
|
99
|
Zeng K, Charlesworth B. Studying patterns of recent evolution at synonymous sites and intronic sites in Drosophila melanogaster. J Mol Evol 2009; 70:116-28. [PMID: 20041239 DOI: 10.1007/s00239-009-9314-6] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2009] [Accepted: 12/07/2009] [Indexed: 10/20/2022]
Abstract
Most previous studies of the evolution of codon usage bias (CUB) and intronic GC content (iGC) in Drosophila melanogaster were based on between-species comparisons, reflecting long-term evolutionary events. However, a complete picture of the evolution of CUB and iGC cannot be drawn without knowledge of their more recent evolutionary history. Here, we used a polymorphism dataset collected from Zimbabwe to study patterns of the recent evolution of CUB and iGC. Analyzing coding and intronic data jointly with a model which can simultaneously estimate selection, mutational, and demographic parameters, we have found that: (1) natural selection is probably acting on synonymous codons; (2) a constant population size model seems to be sufficient to explain most of the observed synonymous polymorphism patterns; (3) GC is favored over AT in introns. In agreement with the long-term evolutionary patterns, ongoing selection acting on X-linked synonymous codons is stronger than that acting on autosomal codons. The selective differences between preferred and unpreferred codons tend to be greater than the differences between GC and AT in introns, suggesting that natural selection, not just biased gene conversion, may have influenced the evolution of CUB. Interestingly, evidence for non-equilibrium evolution comes exclusively from the intronic data. However, three different models, an equilibrium model with two classes of selected sites and two non-equilibrium models with changes in either population size or mutational parameters, fit the intronic data equally well. These results show that using inadequate selection (or demographic) models can result in incorrect estimates of demographic (or selection) parameters.
Collapse
Affiliation(s)
- Kai Zeng
- Ashworth Laboratories, School of Biological Sciences, Institute of Evolutionary Biology, University of Edinburgh, King's Buildings, West Mains Road, Edinburgh EH9 3JT, UK.
| | | |
Collapse
|
100
|
Patterns of DNA-sequence divergence between Drosophila miranda and D. pseudoobscura. J Mol Evol 2009; 69:601-11. [PMID: 19859648 DOI: 10.1007/s00239-009-9298-2] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2009] [Accepted: 10/07/2009] [Indexed: 12/22/2022]
Abstract
Contrary to the classical view, a large amount of non-coding DNA seems to be selectively constrained in Drosophila and other species. Here, using Drosophila miranda BAC sequences and the Drosophila pseudoobscura genome sequence, we aligned coding and non-coding sequences between D. pseudoobscura and D. miranda, and investigated their patterns of evolution. We found two patterns that have previously been observed in comparisons between Drosophila melanogaster and its relatives. First, there is a negative correlation between intron divergence and intron length, suggesting that longer non-coding sequences may contain more regulatory elements than shorter sequences. Our other main finding is a negative correlation between the rate of non-synonymous substitutions (d(N)) and codon usage bias (F(op)), showing that fast-evolving genes have a lower codon usage bias, consistent with strong positive selection interfering with weak selection for codon usage.
Collapse
|