1. Sundar Panja A. The systematic codon usage bias has an important effect on genetic adaption in native species. Gene 2024; 926:148627. [PMID: 38823656 DOI: 10.1016/j.gene.2024.148627] [Received: 02/06/2024] [Revised: 05/06/2024] [Accepted: 05/29/2024] [Indexed: 06/03/2024]
Abstract
Random mutations increase genetic variety, and natural selection enhances adaptation over generations. Codon usage biases (CUB) provide clues about the genome adaptation mechanisms of native and extremophile species. A large number of genes (coding DNA sequences, CDS) from nine classes of endangered and native species, including extremophiles and mesophiles, were used to compute CUB. Codon usage patterns of endangered and extremophile lineages differ from those of native species. Polymorphic usage of nucleotides with codon burial suggests parallelism of native species within relatively confined taxonomic groups. Using the deviation pattern of CUB between endangered and native species, I present a calculation parameter to estimate the extinction risk of endangered species. Species diversity and extinction risk are both positively associated with the propensity for random mutation in CDS. Codon bias appears to be strongly selected and to govern the adaptive evolution of native species.
Affiliation(s)
- Anindya Sundar Panja
- Department of Biotechnology, Molecular Informatics Laboratory, Oriental Institute of Science and Technology, Vidyasagar University, Midnapore, West Bengal 721102, India.
2. Kaj I, Mugal CF, Müller-Widmann R. A Wright-Fisher graph model and the impact of directional selection on genetic variation. Theor Popul Biol 2024; 159:13-24. [PMID: 39019334 DOI: 10.1016/j.tpb.2024.07.004] [Received: 02/17/2023] [Revised: 07/06/2024] [Accepted: 07/12/2024] [Indexed: 07/19/2024]
Abstract
We introduce a multi-allele Wright-Fisher model with mutation and selection such that allele frequencies at a single locus are traced by the path of a hybrid jump-diffusion process. The state space of the process is given by the vertices and edges of a topological graph, i.e. edges are unit intervals. Vertices represent monomorphic population states and positions on the edges mark the biallelic proportions of ancestral and derived alleles during polymorphic segments. In this setting, mutations can only occur at monomorphic loci. We derive the stationary distribution in mutation-selection-drift equilibrium and obtain the expected allele frequency spectrum under large population size scaling. For the extended model with multiple independent loci we derive rigorous upper bounds for a wide class of associated measures of genetic variation. Within this framework we present mathematically precise arguments to conclude that the presence of directional selection reduces the magnitude of genetic variation, as constrained by the bounds for neutral evolution.
Affiliation(s)
- Ingemar Kaj
- Department of Mathematics, Uppsala University, Uppsala, Sweden.
- Carina F Mugal
- Department of Ecology and Genetics, Uppsala University, Uppsala, Sweden; Laboratory of Biometry and Evolutionary Biology, University of Lyon 1, UMR CNRS 5558, Villeurbanne, France
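Illustrative sketch (not from the paper): the model in the abstract above generalises the classical Wright-Fisher process. As a point of reference only, the Python snippet below simulates the textbook haploid, biallelic version with directional selection and no mutation. The function name `wright_fisher`, its parameters, and the fitness scheme (relative fitness 1 + s for the derived allele) are assumptions chosen for illustration; the snippet does not reproduce the authors' graph-based jump-diffusion construction.

```python
import random


def wright_fisher(p0, n_copies, s, generations, seed=0):
    """Simulate the frequency of a derived allele under drift and selection.

    p0          -- initial frequency of the derived allele (0..1)
    n_copies    -- number of gene copies in the population (assumed constant)
    s           -- selection coefficient of the derived allele (s > -1)
    generations -- number of discrete generations to simulate
    """
    rng = random.Random(seed)
    p = p0
    trajectory = [p]
    for _ in range(generations):
        # Selection step: reweight the derived allele by its fitness 1 + s.
        p_sel = p * (1 + s) / (p * (1 + s) + (1 - p))
        # Drift step: binomial sampling of n_copies gene copies.
        count = sum(1 for _ in range(n_copies) if rng.random() < p_sel)
        p = count / n_copies
        trajectory.append(p)
    return trajectory


if __name__ == "__main__":
    traj = wright_fisher(p0=0.5, n_copies=100, s=0.05, generations=50, seed=1)
    print(traj[-1])
```

With s = 0 the trajectory reflects drift alone. Because this sketch omits mutation, a frequency of 0 or 1 is absorbing, which is the monomorphic state the abstract describes as a graph vertex.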
3. Weibel CA, Wheeler AL, James JE, Willis SM, McShea H, Masel J. The protein domains of vertebrate species in which selection is more effective have greater intrinsic structural disorder. eLife 2024; 12:RP87335. [PMID: 39239703 PMCID: PMC11379457 DOI: 10.7554/elife.87335] [Indexed: 09/07/2024]
Abstract
The nearly neutral theory of molecular evolution posits variation among species in the effectiveness of selection. In an idealized model, the census population size determines both this minimum magnitude of the selection coefficient required for deleterious variants to be reliably purged, and the amount of neutral diversity. Empirically, an 'effective population size' is often estimated from the amount of putatively neutral genetic diversity and is assumed to also capture a species' effectiveness of selection. A potentially more direct measure of the effectiveness of selection is the degree to which selection maintains preferred codons. However, past metrics that compare codon bias across species are confounded by among-species variation in %GC content and/or amino acid composition. Here, we propose a new Codon Adaptation Index of Species (CAIS), based on Kullback-Leibler divergence, that corrects for both confounders. We demonstrate the use of CAIS correlations, as well as the Effective Number of Codons, to show that the protein domains of more highly adapted vertebrate species evolve higher intrinsic structural disorder.
Affiliation(s)
- Catherine A Weibel
- Department of Mathematics, University of Arizona, Tucson, United States
- Department of Physics, University of Arizona, Tucson, United States
- Andrew L Wheeler
- Genetics Graduate Interdisciplinary Program, University of Arizona, Tucson, United States
- Jennifer E James
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, United States
- Sara M Willis
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, United States
- Hanon McShea
- Department of Earth System Science, Stanford University, Stanford, United States
- Joanna Masel
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, United States
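Illustrative sketch (not the published CAIS): the abstract above says only that CAIS is based on Kullback-Leibler divergence and corrects for %GC and amino acid composition. The Python snippet below shows the underlying quantity alone: the KL divergence between observed synonymous codon frequencies and the frequencies expected from a given GC fraction. The helper names `expected_from_gc` and `kl_divergence`, the restriction to the four alanine codons, and the assumption that expected frequencies depend only on the third-position base are simplifications made here for illustration.

```python
import math


def expected_from_gc(codons, gc_fraction):
    """Expected frequencies if only the third-position base composition mattered.

    Assumes G and C each occur with probability gc_fraction / 2 and
    A and T each with probability (1 - gc_fraction) / 2, then renormalises
    over the supplied synonymous codons.
    """
    weights = {}
    for codon in codons:
        third = codon[2]
        if third in "GC":
            weights[codon] = gc_fraction / 2
        else:
            weights[codon] = (1 - gc_fraction) / 2
    total = sum(weights.values())
    return {codon: w / total for codon, w in weights.items()}


def kl_divergence(observed, expected):
    """Kullback-Leibler divergence D(observed || expected), in bits."""
    total = 0.0
    for codon, p in observed.items():
        if p > 0:  # terms with p == 0 contribute nothing
            total += p * math.log2(p / expected[codon])
    return total


if __name__ == "__main__":
    alanine = ["GCT", "GCC", "GCA", "GCG"]
    expected = expected_from_gc(alanine, gc_fraction=0.5)
    observed = {"GCT": 0.5, "GCC": 0.5, "GCA": 0.0, "GCG": 0.0}
    print(kl_divergence(observed, expected))
```

A divergence of zero means codon usage matches what base composition alone would predict; larger values indicate stronger departure from that expectation.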
4. Qiu Y, Kang YM, Korfmann C, Pouyet F, Eckford A, Palazzo AF. The GC-content at the 5' ends of human protein-coding genes is undergoing mutational decay. Genome Biol 2024; 25:219. [PMID: 39138526 PMCID: PMC11323403 DOI: 10.1186/s13059-024-03364-x] [Received: 05/28/2024] [Accepted: 07/31/2024] [Indexed: 08/15/2024]
Abstract
BACKGROUND: In vertebrates, most protein-coding genes have a peak of GC-content near their 5' transcriptional start site (TSS). This feature promotes both the efficient nuclear export and translation of mRNAs. Despite the importance of GC-content for RNA metabolism, its general features, origin, and maintenance remain mysterious. We investigate the evolutionary forces shaping GC-content at the TSS of genes through both comparative genomic analysis of nucleotide substitution rates between different species and examination of human de novo mutations. RESULTS: Our data suggest that GC-peaks at TSSs were present in the last common ancestor of amniotes, and likely that of vertebrates. We observe that in apes and rodents, where recombination is directed away from TSSs by PRDM9, GC-content at the 5' end of protein-coding genes is currently undergoing mutational decay. In canids, which lack PRDM9 and perform recombination at TSSs, GC-content at the 5' end of protein-coding genes is increasing. We show that these patterns extend into the 5' end of the open reading frame, thus impacting synonymous codon position choices. CONCLUSIONS: Our results indicate that the dynamics of this GC-peak in amniotes are largely shaped by historic patterns of recombination. Since decay of GC-content towards the mutation rate equilibrium is the default state for non-functional DNA, the observed decrease in GC-content at TSSs in apes and rodents indicates that the GC-peak is not being maintained by selection on most protein-coding genes in those species.
Affiliation(s)
- Yi Qiu
- Department of Biochemistry, University of Toronto, Toronto, Ontario, M5G1M1, Canada
- Yoon Mo Kang
- Department of Biochemistry, University of Toronto, Toronto, Ontario, M5G1M1, Canada
- Christopher Korfmann
- Department of Electrical Engineering and Computer Science, York University, Toronto, Ontario, M3J1P3, Canada
- Fanny Pouyet
- Laboratoire Interdisciplinaire des Sciences du Numérique, Université Paris-Saclay, 91190, Gif-sur-Yvette, France
- Andrew Eckford
- Department of Electrical Engineering and Computer Science, York University, Toronto, Ontario, M3J1P3, Canada
- Alexander F Palazzo
- Department of Biochemistry, University of Toronto, Toronto, Ontario, M5G1M1, Canada.
5. McShea H, Weibel C, Wehbi S, Goodman P, James JE, Wheeler AL, Masel J. The effectiveness of selection in a species affects the direction of amino acid frequency evolution. bioRxiv 2024:2023.02.01.526552. [PMID: 38948853 PMCID: PMC11212923 DOI: 10.1101/2023.02.01.526552] [Indexed: 07/02/2024]
Abstract
Nearly neutral theory predicts that species with higher effective population size (Ne) are better able to purge slightly deleterious mutations. We compare evolution in high-Ne vs. low-Ne vertebrates to reveal which amino acid frequencies are subject to subtle selective preferences. We take three complementary approaches, two measuring flux and one measuring outcomes. First, we fit non-stationary substitution models of amino acid flux using maximum likelihood, comparing the high-Ne clade of rodents and lagomorphs to its low-Ne sister clade of primates and colugos. Second, we compare evolutionary outcomes across a wider range of vertebrates, via correlations between amino acid frequencies and Ne. Third, we dissect the details of flux in human, chimpanzee, mouse, and rat, as scored by parsimony - this also enables comparison to a historical paper. All three methods agree on which amino acids are preferred under more effective selection. Preferred amino acids tend to be smaller, less costly to synthesize, and to promote intrinsic structural disorder. Parsimony-induced bias in the historical study produces an apparent reduction in structural disorder, perhaps driven by slightly deleterious substitutions. Within highly exchangeable pairs of amino acids, arginine is strongly preferred over lysine, and valine over isoleucine, consistent with more effective selection preferring a marginally larger free energy of folding. These two preferences match differences between thermophiles and mesophilic relatives. These results reveal the biophysical consequences of mutation-selection-drift balance, and demonstrate the utility of nearly neutral theory for understanding protein evolution.
Affiliation(s)
- Hanon McShea
- Department of Earth System Science, Stanford University
- Catherine Weibel
- Department of Ecology & Evolutionary Biology, University of Arizona
- Department of Applied Physics, Stanford University
- Sawsan Wehbi
- Graduate Interdisciplinary Program in Genetics, University of Arizona
- Jennifer E James
- Department of Ecology & Evolutionary Biology, University of Arizona
- Department of Ecology and Genetics, Uppsala University
- Andrew L Wheeler
- Graduate Interdisciplinary Program in Genetics, University of Arizona
- Joanna Masel
- Department of Ecology & Evolutionary Biology, University of Arizona
6. Joseph J, Prentout D, Laverré A, Tricou T, Duret L. High prevalence of PRDM9-independent recombination hotspots in placental mammals. Proc Natl Acad Sci U S A 2024; 121:e2401973121. [PMID: 38809707 PMCID: PMC11161765 DOI: 10.1073/pnas.2401973121] [Received: 02/08/2024] [Accepted: 04/26/2024] [Indexed: 05/31/2024]
Abstract
In many mammals, recombination events are concentrated in hotspots directed by a sequence-specific DNA-binding protein named PRDM9. Intriguingly, PRDM9 has been lost several times in vertebrates, and notably among mammals, it has been pseudogenized in the ancestor of canids. In the absence of PRDM9, recombination hotspots tend to occur in promoter-like features such as CpG islands. It has thus been proposed that one role of PRDM9 could be to direct recombination away from PRDM9-independent hotspots. However, the ability of PRDM9 to direct recombination hotspots has been assessed in only a handful of species, and a clear picture of how much recombination occurs outside of PRDM9-directed hotspots in mammals is still lacking. In this study, we derived an estimator of past recombination activity based on signatures of GC-biased gene conversion in substitution patterns. We quantified recombination activity in PRDM9-independent hotspots in 52 species of boreoeutherian mammals. We observe a wide range of recombination rates at these loci: several species (such as mice, humans, some felids, or cetaceans) show a deficit of recombination, while a majority of mammals display a clear peak of recombination. Our results demonstrate that PRDM9-directed and PRDM9-independent hotspots can coexist in mammals and that their coexistence appears to be the rule rather than the exception. Additionally, we show that the location of PRDM9-independent hotspots is relatively more stable than that of PRDM9-directed hotspots, but that PRDM9-independent hotspots nevertheless evolve slowly in concert with DNA hypomethylation.
Affiliation(s)
- Julien Joseph
- Laboratoire de Biométrie et Biologie Evolutive, Université Lyon 1, CNRS, UMR 5558, Villeurbanne 69100, France
- Djivan Prentout
- Department of Biological Sciences, Columbia University, New York, NY 10027
- Alexandre Laverré
- Department of Ecology and Evolution, University of Lausanne, Lausanne CH-1015, Switzerland
- Swiss Institute of Bioinformatics, Lausanne CH-1015, Switzerland
- Théo Tricou
- Laboratoire de Biométrie et Biologie Evolutive, Université Lyon 1, CNRS, UMR 5558, Villeurbanne 69100, France
- Laurent Duret
- Laboratoire de Biométrie et Biologie Evolutive, Université Lyon 1, CNRS, UMR 5558, Villeurbanne 69100, France
7. Joseph J. Increased Positive Selection in Highly Recombining Genes Does not Necessarily Reflect an Evolutionary Advantage of Recombination. Mol Biol Evol 2024; 41:msae107. [PMID: 38829800 PMCID: PMC11173204 DOI: 10.1093/molbev/msae107] [Received: 01/30/2024] [Revised: 04/08/2024] [Accepted: 05/28/2024] [Indexed: 06/05/2024]
Abstract
It is commonly thought that the long-term advantage of meiotic recombination is to dissipate genetic linkage, allowing natural selection to act independently on different loci. It is thus theoretically expected that genes with higher recombination rates evolve under more effective selection. On the other hand, recombination is often associated with GC-biased gene conversion (gBGC), which theoretically interferes with selection by promoting the fixation of deleterious GC alleles. To test these predictions, several studies assessed whether selection was more effective in highly recombining genes (due to dissipation of genetic linkage) or less effective (due to gBGC), assuming a fixed distribution of fitness effects (DFE) for all genes. In this study, I directly derive the DFE from a gene's evolutionary history (shaped by mutation, selection, drift, and gBGC) under empirical fitness landscapes. I show that genes that have experienced high levels of gBGC are less fit and thus have more opportunities for beneficial mutations. Only a small decrease in the genome-wide intensity of gBGC leads to the fixation of these beneficial mutations, particularly in highly recombining genes. This results in increased positive selection in highly recombining genes that is not caused by more effective selection. Additionally, I show that the death of a recombination hotspot can lead to a higher dN/dS than its birth, but with substitution patterns biased towards AT, and only at selected positions. This shows that controlling for a substitution bias towards GC is therefore not sufficient to rule out the contribution of gBGC to signatures of accelerated evolution. Finally, although gBGC does not affect the fixation probability of GC-conservative mutations, I show that by altering the DFE, gBGC can also significantly affect nonsynonymous GC-conservative substitution patterns.
Affiliation(s)
- Julien Joseph
- Laboratoire de Biométrie et Biologie Evolutive, Université Lyon 1, CNRS, UMR 5558, Villeurbanne, France
8. Zavala B, Dineen L, Fisher KJ, Opulente DA, Harrison MC, Wolters JF, Shen XX, Zhou X, Groenewald M, Hittinger CT, Rokas A, LaBella AL. Genomic factors shaping codon usage across the Saccharomycotina subphylum. bioRxiv 2024:2024.05.23.595506. [PMID: 38826271 PMCID: PMC11142207 DOI: 10.1101/2024.05.23.595506] [Indexed: 06/04/2024]
Abstract
Codon usage bias, or the unequal use of synonymous codons, is observed across genes, genomes, and between species. The biased use of synonymous codons has been implicated in many cellular functions, such as translation dynamics and transcript stability, but can also be shaped by neutral forces. The Saccharomycotina, the fungal subphylum containing the yeasts Saccharomyces cerevisiae and Candida albicans, has been a model system for studying codon usage. We characterized codon usage across 1,154 strains from 1,051 species to gain insight into the biases, molecular mechanisms, evolution, and genomic features contributing to codon usage patterns across the subphylum. We found evidence of a general preference for A/T-ending codons and correlations between codon usage bias, GC content, and tRNA-ome size. Codon usage bias is also distinct between the 12 orders within the subphylum to such a degree that yeasts can be classified into orders with an accuracy greater than 90% using a machine learning algorithm trained on codon usage. We also characterized the degree to which codon usage bias is impacted by translational selection. Interestingly, the degree of translational selection was influenced by a combination of genome features and assembly metrics that included the number of coding sequences, BUSCO count, and genome length. Our analysis also revealed an extreme bias in codon usage in the Saccharomycodales associated with a lack of predicted arginine tRNAs. The order contains 24 species, and 23 are computationally predicted to lack tRNAs that decode CGN codons, leaving only the AGN codons to encode arginine. Analysis of Saccharomycodales gene expression, tRNA sequences, and codon evolution suggests that extreme avoidance of the CGN codons is associated with a decline in arginine tRNA function.
Codon usage bias within the Saccharomycotina is generally consistent with previous investigations in fungi, which show a role for both genomic features and GC bias in shaping codon usage. However, we find cases of extreme codon usage preference and avoidance along yeast lineages, suggesting additional forces may be shaping the evolution of specific codons.
9. Wang X, Zhao W, Cui S, Su B, Huang Y, Chen H. Characterization of the Mitogenome of the Genus Dendrocerus Ratzeburg (Hymenoptera: Megaspilidae) with the Specific Designed Primers. Animals (Basel) 2024; 14:1454. [PMID: 38791671 PMCID: PMC11117285 DOI: 10.3390/ani14101454] [Received: 04/11/2024] [Revised: 05/05/2024] [Accepted: 05/09/2024] [Indexed: 05/26/2024]
Abstract
In Hymenoptera, the monophyly of Evaniomorpha has been the focus of debate among different scholars. In this study, we sequenced two mitochondrial genomes of Dendrocerus (Hymenoptera: Megaspilidae) to analyze the mitochondrial genomic features of Dendrocerus and provide new molecular data for phylogenetic studies of Evaniomorpha. The mitogenome sizes of D. bellus and D. anisodontus were 15,445 bp and 15,373 bp, respectively, with the trnG of D. bellus missing. The nucleotide composition was significantly biased toward adenine and thymine, with A + T contents of 81.2% (D. bellus) and 82.4% (D. anisodontus). Using Ceraphron sp. (Ceraphronidae) as reference, the Ka/Ks values of NAD4L and NAD6 in D. anisodontus were both greater than one, indicating that non-synonymous mutations are favored by Darwinian selection, which is rare in other hymenopteran species. Compared with the Ceraphron sp. gene order, nine operations were identified in D. anisodontus, including four reversals, four TDRLs (tandem duplication random losses) and one transposition, or four reversals and five TDRLs. Phylogenetic analysis of 40 mitochondrial genomes showed that Evaniomorpha was not a monophyletic group, which was also supported by the PBD values. Ceraphronoidea is a monophyletic group and is a sister to Aulacidae + Gasteruptiidae. Based on the conserved region of the newly sequenced mitochondrial genomes, a pair of specific primers MegaF/MegaR was designed for sequencing the COX1 genes in Megaspilidae and a 60% rate of success was achieved in the genus Dendrocerus.
Affiliation(s)
- Xu Wang
- Anhui Provincial Key Laboratory of the Conservation and Exploitation of Biological Resources, College of Life Sciences, Anhui Normal University, Wuhu 241000, China
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing 100000, China
- Wenjing Zhao
- Anhui Provincial Key Laboratory of the Conservation and Exploitation of Biological Resources, College of Life Sciences, Anhui Normal University, Wuhu 241000, China
- Shanshan Cui
- Anhui Provincial Key Laboratory of the Conservation and Exploitation of Biological Resources, College of Life Sciences, Anhui Normal University, Wuhu 241000, China
- Baoshan Su
- Collaborative Innovation Center of Recovery and Reconstruction of Degraded Ecosystem in Wanjiang Basin Co-Founded by Anhui Province and Ministry of Education, School of Ecology and Environment, Anhui Normal University, Wuhu 241000, China
- Yixin Huang
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing 100000, China
- Collaborative Innovation Center of Recovery and Reconstruction of Degraded Ecosystem in Wanjiang Basin Co-Founded by Anhui Province and Ministry of Education, School of Ecology and Environment, Anhui Normal University, Wuhu 241000, China
- Huayan Chen
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization, Chinese Academy of Sciences, Guangzhou 510650, China
- State Key Laboratory of Plant Diversity and Specialty Crops, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou 510650, China
- South China National Botanical Garden, Guangzhou 510650, China
10. Kotari I, Kosiol C, Borges R. The Patterns of Codon Usage between Chordates and Arthropods are Different but Co-evolving with Mutational Biases. Mol Biol Evol 2024; 41:msae080. [PMID: 38667829 PMCID: PMC11108087 DOI: 10.1093/molbev/msae080] [Received: 05/05/2023] [Revised: 03/22/2024] [Accepted: 04/15/2024] [Indexed: 05/22/2024]
Abstract
Different frequencies amongst codons that encode the same amino acid (i.e. synonymous codons) have been observed in multiple species. Studies focused on uncovering the forces that drive such codon usage showed that a combined effect of mutational biases and translational selection works to produce different frequencies of synonymous codons. However, only few have been able to measure and distinguish between these forces that may leave similar traces on the coding regions. Here, we have developed a codon model that allows the disentangling of mutation, selection on amino acids and synonymous codons, and GC-biased gene conversion (gBGC) which we employed on an extensive dataset of 415 chordates and 191 arthropods. We found that chordates need 15 more synonymous codon categories than arthropods to explain the empirical codon frequencies, which suggests that the extent of codon usage can vary greatly between animal phyla. Moreover, methylation at CpG sites seems to partially explain these patterns of codon usage in chordates but not in arthropods. Despite the differences between the two phyla, our findings demonstrate that in both, GC-rich codons are disfavored when mutations are GC-biased, and the opposite is true when mutations are AT-biased. This indicates that selection on the genomic coding regions might act primarily to stabilize its GC/AT content on a genome-wide level. Our study shows that the degree of synonymous codon usage varies considerably among animals, but is likely governed by a common underlying dynamic.
Affiliation(s)
- Ioanna Kotari
- Institut für Populationsgenetik, University of Veterinary Medicine, Veterinärplatz 1, Vienna 1210, Austria
- Vienna Graduate School of Population Genetics, Vienna, Austria
- Carolin Kosiol
- Centre for Biological Diversity, School of Biology, University of St Andrews, Fife KY16 9TH, UK
- Rui Borges
- Institut für Populationsgenetik, University of Veterinary Medicine, Veterinärplatz 1, Vienna 1210, Austria
11. Aktürk Dizman Y. Analysis of codon usage bias of exonuclease genes in invertebrate iridescent viruses. Virology 2024; 593:110030. [PMID: 38402641 DOI: 10.1016/j.virol.2024.110030] [Received: 10/10/2023] [Revised: 02/04/2024] [Accepted: 02/13/2024] [Indexed: 02/27/2024]
Abstract
Invertebrate iridescent viruses (IIVs) are double-stranded DNA viruses that belong to the Iridoviridae family. IIVs cause diseases that vary in severity from subclinical to lethal in invertebrate hosts. Codon usage bias (CUB) analysis is a versatile method for comprehending the genetic and evolutionary aspects of species. In this study, we analyzed the CUB in the exonuclease genes of 10 invertebrate iridescent viruses by calculating and comparing the nucleotide contents, effective number of codons (ENC), codon adaptation index (CAI), relative synonymous codon usage (RSCU), and others. The results revealed that IIV exonuclease genes are rich in A/T. The ENC analysis displayed a low codon usage bias in IIV exonuclease genes. ENC-plot, neutrality plot, and parity rule 2 plot demonstrated that besides mutational pressure, other factors like natural selection, dinucleotide content, and aromaticity also contributed to CUB. The findings could enhance our understanding of the evolution of IIV exonuclease genes.
Affiliation(s)
- Yeşim Aktürk Dizman
- Department of Biology, Faculty of Arts and Sciences, Recep Tayyip Erdogan University, 53100, Rize, Türkiye.
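Illustrative sketch (not from the paper): of the indices named in the abstract above, relative synonymous codon usage (RSCU) is the simplest to compute. It is the observed count of a codon divided by the mean count of all synonymous codons for the same amino acid. The Python snippet below applies that standard definition. The function name `rscu` and the deliberately tiny two-amino-acid codon table `SYNONYMS` are assumptions for illustration, so a real analysis would substitute the full genetic code.

```python
from collections import Counter

# Hypothetical mini codon table: only two amino acids, for illustration.
SYNONYMS = {
    "Phe": ["TTT", "TTC"],
    "Ala": ["GCT", "GCC", "GCA", "GCG"],
}


def rscu(cds, synonyms=SYNONYMS):
    """Return {codon: RSCU} for every codon family present in the sequence."""
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    counts = Counter(cds[i:i + 3] for i in range(0, usable, 3))
    result = {}
    for family in synonyms.values():
        total = sum(counts[codon] for codon in family)
        if total == 0:
            continue  # amino acid absent: RSCU is undefined, so skip it
        mean = total / len(family)
        for codon in family:
            result[codon] = counts[codon] / mean
    return result


if __name__ == "__main__":
    values = rscu("TTTTTTTTCGCTGCTGCC")
    for codon, value in sorted(values.items()):
        print(codon, round(value, 3))
```

An RSCU of 1 means the codon is used as often as expected under equal usage of its synonyms; values above 1 mark over-represented codons and values below 1 under-represented ones.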
12. Weibel CA, Wheeler AL, James JE, Willis SM, McShea H, Masel J. The protein domains of vertebrate species in which selection is more effective have greater intrinsic structural disorder. bioRxiv 2024:2023.03.02.530449. [PMID: 38712167 PMCID: PMC11071303 DOI: 10.1101/2023.03.02.530449] [Indexed: 05/08/2024]
Abstract
The nearly neutral theory of molecular evolution posits variation among species in the effectiveness of selection. In an idealized model, the census population size determines both this minimum magnitude of the selection coefficient required for deleterious variants to be reliably purged, and the amount of neutral diversity. Empirically, an "effective population size" is often estimated from the amount of putatively neutral genetic diversity and is assumed to also capture a species' effectiveness of selection. A potentially more direct measure of the effectiveness of selection is the degree to which selection maintains preferred codons. However, past metrics that compare codon bias across species are confounded by among-species variation in %GC content and/or amino acid composition. Here we propose a new Codon Adaptation Index of Species (CAIS), based on Kullback-Leibler divergence, that corrects for both confounders. We demonstrate the use of CAIS correlations, as well as the Effective Number of Codons, to show that the protein domains of more highly adapted vertebrate species evolve higher intrinsic structural disorder.
Affiliation(s)
- Catherine A. Weibel
- Department of Mathematics, University of Arizona, Tucson, Arizona 85721, USA
- Department of Physics, University of Arizona, Tucson, Arizona 85721, USA
- present address: Department of Applied Physics, Stanford University, California, USA
- Andrew L. Wheeler
- Genetics Graduate Interdisciplinary Program, University of Arizona, Tucson, Arizona 85721, USA
- Jennifer E. James
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona 85721, USA
- present address: Department of Ecology and Genetics, Evolutionary Biology Center, Uppsala University, Sweden
- Sara M. Willis
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona 85721, USA
- present address: University Information Technology Services, University of Arizona, Tucson, Arizona 85721, USA
- Hanon McShea
- Department of Earth System Science, Stanford University
- Joanna Masel
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona 85721, USA
13. Kyriacou RG, Mulhair PO, Holland PWH. GC Content Across Insect Genomes: Phylogenetic Patterns, Causes and Consequences. J Mol Evol 2024; 92:138-152. [PMID: 38491221 PMCID: PMC10978632 DOI: 10.1007/s00239-024-10160-5] [Received: 09/13/2023] [Accepted: 02/06/2024] [Indexed: 03/18/2024]
Abstract
The proportions of A:T and G:C nucleotide pairs are often unequal and can vary greatly between animal species and along chromosomes. The causes and consequences of this variation are incompletely understood. The recent release of high-quality genome sequences from the Darwin Tree of Life and other large-scale genome projects provides an opportunity for GC heterogeneity to be compared across a large number of insect species. Here we analyse GC content along chromosomes, and within protein-coding genes and codons, of 150 insect species from four holometabolous orders: Coleoptera, Diptera, Hymenoptera, and Lepidoptera. We find that protein-coding sequences have higher GC content than the genome average, and that Lepidoptera generally have higher GC content than the other three insect orders examined. GC content is higher in small chromosomes in most Lepidoptera species, but this pattern is less consistent in other orders. GC content also increases towards subtelomeric regions within protein-coding genes in Diptera, Coleoptera and Lepidoptera. Two species of Diptera, Bombylius major and B. discolor, have very atypical genomes with ubiquitous increase in AT content, especially at third codon positions. Despite dramatic AT-biased codon usage, we find no evidence that this has driven divergent protein evolution. We argue that the GC landscape of Lepidoptera, Diptera and Coleoptera genomes is influenced by GC-biased gene conversion, strongest in Lepidoptera, with some outlier taxa affected drastically by counteracting processes.
Affiliation(s)
- Riccardo G Kyriacou
  - Department of Biology, University of Oxford, 11a Mansfield Road, Oxford, OX1 3SZ, UK
- Peter O Mulhair
  - Department of Biology, University of Oxford, 11a Mansfield Road, Oxford, OX1 3SZ, UK
- Peter W H Holland
  - Department of Biology, University of Oxford, 11a Mansfield Road, Oxford, OX1 3SZ, UK
|
14
|
Galtier N. Half a Century of Controversy: The Neutralist/Selectionist Debate in Molecular Evolution. Genome Biol Evol 2024; 16:evae003. [PMID: 38311843 PMCID: PMC10839204 DOI: 10.1093/gbe/evae003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Accepted: 01/01/2024] [Indexed: 02/06/2024] Open
Abstract
The neutral and nearly neutral theories, introduced more than 50 yr ago, have raised and still raise passionate discussion regarding the forces governing molecular evolution and their relative importance. The debate, initially focused on the amount of within-species polymorphism and constancy of the substitution rate, has spread, matured, and now underlies a wide range of topics and questions. The neutralist/selectionist controversy has structured the field and influences the way molecular evolutionary scientists conceive their research.
Affiliation(s)
- Nicolas Galtier
  - ISEM, CNRS, IRD, Université de Montpellier, Montpellier, France
|
15
|
Bourret J, Borvető F, Bravo IG. Subfunctionalisation of paralogous genes and evolution of differential codon usage preferences: The showcase of polypyrimidine tract binding proteins. J Evol Biol 2023; 36:1375-1392. [PMID: 37667674 DOI: 10.1111/jeb.14212] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 03/10/2023] [Revised: 07/11/2023] [Accepted: 07/12/2023] [Indexed: 09/06/2023]
Abstract
Gene paralogs are copies of an ancestral gene that appear after gene or full genome duplication. When two sister gene copies are maintained in the genome, redundancy may release certain evolutionary pressures, allowing one of them to access novel functions. Here, we focus on the evolutionary history of the three polypyrimidine tract binding protein genes (PTBP) and the concurrent evolution of their differential codon usage preferences (CUPrefs) in vertebrate species. PTBP1-3 show high identity at the amino acid level (up to 80%) but display strongly different nucleotide composition, divergent CUPrefs and, in humans and in many other vertebrates, distinct tissue-specific expression levels. Our phylogenetic inference results show that the duplication events leading to the three extant PTBP1-3 lineages predate the basal diversification within vertebrates, and genomic context analysis illustrates that local synteny has been well preserved over time for the three paralogs. We identify a distinct evolutionary pattern towards GC3-enriching substitutions in PTBP1, concurrent with enrichment in frequently used codons and with a tissue-wide expression. In contrast, PTBP2s are enriched in AT-ending, rare codons, and display tissue-restricted expression. As a result of this substitution trend, CUPrefs sharply differ between mammalian PTBP1s and the rest of PTBPs. Genomic context analysis suggests that GC3-rich nucleotide composition in PTBP1s is driven by local substitution processes, while the evidence in this direction is thinner for PTBP2-3. An actual lack of co-variation between the observed GC composition of PTBP2-3 and that of the surrounding non-coding genomic environment would raise questions about the origin of CUPrefs, warranting further research on a putative tissue-specific translational selection. Finally, we report an intriguing trend in the use of the UUG-Leu codon, which matches the trends of AT-ending codons.
Our results are compatible with a scenario in which a combination of directional mutation-selection processes would have differentially shaped CUPrefs of PTBPs in vertebrates: the observed GC-enrichment of PTBP1 in placental mammals may be linked to genomic location and to the strong and broad tissue-expression, while AT-enrichment of PTBP2 and PTBP3 would be associated with rare CUPrefs and thus, possibly to specialized spatio-temporal expression. Our interpretation is coherent with a gene subfunctionalisation process by differential expression regulation associated with the evolution of specific CUPrefs.
Affiliation(s)
- Jérôme Bourret
  - Laboratoire MIVEGEC (CNRS IRD Univ Montpellier), Centre National de la Recherche Scientifique (CNRS), Montpellier, France
- Fanni Borvető
  - Laboratoire MIVEGEC (CNRS IRD Univ Montpellier), Centre National de la Recherche Scientifique (CNRS), Montpellier, France
- Ignacio G Bravo
  - Laboratoire MIVEGEC (CNRS IRD Univ Montpellier), Centre National de la Recherche Scientifique (CNRS), Montpellier, France
|
16
|
Li M, Wang J, Dai R, Smagghe G, Wang X, You S. Comparative analysis of codon usage patterns and phylogenetic implications of five mitochondrial genomes of the genus Japanagallia Ishihara, 1955 (Hemiptera, Cicadellidae, Megophthalminae). PeerJ 2023; 11:e16058. [PMID: 37780390 PMCID: PMC10538298 DOI: 10.7717/peerj.16058] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/09/2023] [Accepted: 08/17/2023] [Indexed: 10/03/2023] Open
Abstract
Japanagallia is a genus of Cicadomorpha in the leafhopper family, a group of plant piercing-sucking insects, and its species are difficult to distinguish by morphological characteristics. So far, only one complete mitochondrial genome has been reported for the genus Japanagallia. Therefore, in order to better understand this group, we assembled and annotated the complete mitochondrial genomes of five Japanagallia species, and analyzed their codon usage patterns. Nucleotide composition analysis showed that AT content was higher than GC content, and the protein-coding sequences preferred to end with A/T at the third codon position. Relative synonymous codon usage analysis revealed that most over-represented codons end with A or T. Parity plot analysis revealed that the codon usage bias of mitochondrial genes was influenced by both natural selection and mutation pressure. In the neutrality plot, the slopes of regression lines were < 0.5, suggesting that natural selection was playing a major role while mutation pressure was of minor importance. The effective number of codons showed that the codon usage bias between genes and genomes was low. Correspondence analysis revealed that the codon usage pattern differed among 13 protein-coding genes. Phylogenetic analyses based on three datasets using two methods (maximum likelihood and Bayesian inference) recovered the monophyly of Megophthalminae with high support values (bootstrap support values (BS) = 100, Bayesian posterior probability (PP) = 1). In the obtained topology, the seven Japanagallia species were clustered into a monophyletic group and formed a sister group with Durgade. In conclusion, our study can provide a reference for future research on the evolution, identification and phylogenetic relationships of Japanagallia species.
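As a rough illustration of the neutrality-plot analysis mentioned in this abstract, the sketch below regresses GC12 (the mean GC content at first and second codon positions) on GC3 across a set of coding sequences. The function names and the toy sequences are hypothetical and are not taken from the paper; this is the conventional reading of such plots, in which a slope near 1 is taken to indicate that mutation pressure dominates and a slope near 0 that selection constrains base composition.

```python
def gc12_gc3(cds):
    """Return (GC12, GC3) for one in-frame coding sequence."""
    cds = cds.upper()
    # Split into complete codons, ignoring any trailing partial codon.
    codons = [cds[i:i + 3] for i in range(0, len(cds) - len(cds) % 3, 3)]
    n = len(codons)
    gc1 = sum(c[0] in "GC" for c in codons) / n
    gc2 = sum(c[1] in "GC" for c in codons) / n
    gc3 = sum(c[2] in "GC" for c in codons) / n
    return (gc1 + gc2) / 2, gc3


def neutrality_slope(genes):
    """Least-squares slope of GC12 (y) regressed on GC3 (x) across genes."""
    points = [gc12_gc3(g) for g in genes]
    ys = [p[0] for p in points]  # GC12
    xs = [p[1] for p in points]  # GC3
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("GC3 does not vary across genes; slope is undefined")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx


# Toy input (hypothetical sequences, for illustration only).
toy_genes = ["ATGAAAGCA", "ATGAAGGCC", "ATGGCGGCG"]
print(neutrality_slope(toy_genes))
```

With real data, each element of `genes` would be one protein-coding gene, and the slope would be reported together with the regression's significance rather than on its own.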
Affiliation(s)
- Min Li
  - Institute of Entomology, Guizhou University, The Provincial Key Laboratory for Agricultural Pest Management Mountainous Region, Guiyang, Guizhou, China
- Jiajia Wang
  - College of Biology and Food Engineering, Chuzhou University, Chuzhou, Anhui, China
- Renhuai Dai
  - Institute of Entomology, Guizhou University, The Provincial Key Laboratory for Agricultural Pest Management Mountainous Region, Guiyang, Guizhou, China
- Guy Smagghe
  - Institute of Entomology, Guizhou University, The Provincial Key Laboratory for Agricultural Pest Management Mountainous Region, Guiyang, Guizhou, China
  - Cellular and Molecular Life Sciences, Department of Biology, Brussels, Belgium
  - Laboratory of Agrozoology, Dep. of Crop Protection, Ghent University, Ghent, Belgium
- Xianyi Wang
  - Engineering Research Center of Medical Biotechnology, School of Biology and Engineering, Guizhou Medical University, Guiyang, Guizhou, China
- Siying You
  - Institute of Entomology, Guizhou University, The Provincial Key Laboratory for Agricultural Pest Management Mountainous Region, Guiyang, Guizhou, China
|
17
|
Johnson MM, Hockenberry AJ, McGuffie MJ, Vieira LC, Wilke CO. Growth-dependent Gene Expression Variation Influences the Strength of Codon Usage Biases. Mol Biol Evol 2023; 40:msad189. [PMID: 37619989 PMCID: PMC10482319 DOI: 10.1093/molbev/msad189] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/13/2023] [Accepted: 08/11/2023] [Indexed: 08/26/2023] Open
Abstract
The most highly expressed genes in microbial genomes tend to use a limited set of synonymous codons, often referred to as "preferred codons." The existence of preferred codons is commonly attributed to selection pressures on various aspects of protein translation including accuracy and/or speed. However, gene expression is condition-dependent and even within single-celled organisms transcript and protein abundances can vary depending on a variety of environmental and other factors. Here, we show that growth rate-dependent expression variation is an important constraint that significantly influences the evolution of gene sequences. Using large-scale transcriptomic and proteomic data sets in Escherichia coli and Saccharomyces cerevisiae, we confirm that codon usage biases are strongly associated with gene expression but highlight that this relationship is most pronounced when gene expression measurements are taken during rapid growth conditions. Specifically, genes whose relative expression increases during periods of rapid growth have stronger codon usage biases than comparably expressed genes whose expression decreases during rapid growth conditions. These findings highlight that gene expression measured in any particular condition tells only part of the story regarding the forces shaping the evolution of microbial gene sequences. More generally, our results imply that microbial physiology during rapid growth is critical for explaining long-term translational constraints.
Affiliation(s)
- Mackenzie M Johnson
  - Department of Integrative Biology, The University of Texas at Austin, Austin, TX, USA
- Adam J Hockenberry
  - Department of Integrative Biology, The University of Texas at Austin, Austin, TX, USA
- Matthew J McGuffie
  - Department of Molecular Biosciences, Center for Systems and Synthetic Biology, The University of Texas at Austin, Austin, TX, USA
- Luiz Carlos Vieira
  - Department of Integrative Biology, The University of Texas at Austin, Austin, TX, USA
- Claus O Wilke
  - Department of Integrative Biology, The University of Texas at Austin, Austin, TX, USA
|
18
|
Mujawar A, Phadte P, Palkina KA, Markina NM, Mohammad A, Thakur BL, Sarkisyan KS, Balakireva AV, Ray P, Yamplosky I, De A. Triple Reporter Assay: A Non-Overlapping Luciferase Assay for the Measurement of Complex Macromolecular Regulation in Cancer Cells Using a New Mushroom Luciferase-Luciferin Pair. Sensors (Basel) 2023; 23:7313. [PMID: 37687774 PMCID: PMC10490530 DOI: 10.3390/s23177313] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/27/2023] [Revised: 08/14/2023] [Accepted: 08/18/2023] [Indexed: 09/10/2023]
Abstract
This study demonstrates the development of a humanized luciferase imaging reporter based on a recently discovered mushroom luciferase (Luz) from Neonothopanus nambi. In vitro and in vivo assessments showed that human-codon-optimized Luz (hLuz) has significantly higher activity than native Luz in various cancer cell types. The potential of hLuz in non-invasive bioluminescence imaging was demonstrated using subcutaneous human tumor xenografts and an orthotopic lung xenograft in immunocompromised mice. The Luz enzyme and its unique 3OH-hispidin substrate were found not to cross-react with commonly used luciferase reporters such as Firefly (FLuc2), Renilla (RLuc), or nano-luciferase (NLuc). Based on this feature, a non-overlapping, multiplex luciferase assay using hLuz was envisioned to overcome the limitations of the dual reporter assay. Multiplex reporter functionality was demonstrated by designing a new sensor construct to measure NF-κB transcriptional activity using hLuz, which was used in conjunction with two available constructs, p53-NLuc and PIK3CA promoter-FLuc2. By expressing these constructs in the A2780 cell line, we unveiled a complex macromolecular regulation of high relevance in ovarian cancer. The assays performed elucidated the direct regulatory action of p53 or NF-κB on the PIK3CA promoter. However, only the multiplexed assessment revealed further complexities, as stabilized p53 expression attenuates NF-κB transcriptional activity and thereby indirectly influences its regulation of the PIK3CA gene. Thus, this study suggests the importance of live cell multiplexed measurement of gene regulatory function using more than two luciferases to address more realistic situations in disease biology.
Affiliation(s)
- Aaiyas Mujawar
  - Molecular Functional Imaging Laboratory, Advanced Centre for Treatment, Research and Education in Cancer, Navi Mumbai 410210, India
  - Faculty of Life Science, Homi Bhabha National Institute, Mumbai 400094, India
- Pratham Phadte
  - Faculty of Life Science, Homi Bhabha National Institute, Mumbai 400094, India
  - Imaging Cell Signalling and Therapeutics Laboratory, Advanced Centre for Treatment, Research and Education in Cancer, Navi Mumbai 410210, India
- Ksenia A. Palkina
  - Institute of Bioorganic Chemistry (IBCh), Russian Academy of Sciences, Moscow 119991, Russia
  - Planta LLC, Bolshoi Boulevard, 42 Street 1, Moscow 121205, Russia
- Nadezhda M. Markina
  - Institute of Bioorganic Chemistry (IBCh), Russian Academy of Sciences, Moscow 119991, Russia
  - Planta LLC, Bolshoi Boulevard, 42 Street 1, Moscow 121205, Russia
- Ameena Mohammad
  - Molecular Functional Imaging Laboratory, Advanced Centre for Treatment, Research and Education in Cancer, Navi Mumbai 410210, India
- Bhushan L. Thakur
  - Faculty of Life Science, Homi Bhabha National Institute, Mumbai 400094, India
  - Imaging Cell Signalling and Therapeutics Laboratory, Advanced Centre for Treatment, Research and Education in Cancer, Navi Mumbai 410210, India
- Karen S. Sarkisyan
  - Institute of Bioorganic Chemistry (IBCh), Russian Academy of Sciences, Moscow 119991, Russia
  - Synthetic Biology Group, MRC London Institute of Medical Sciences, London W12 0NN, UK
- Anastasia V. Balakireva
  - Institute of Bioorganic Chemistry (IBCh), Russian Academy of Sciences, Moscow 119991, Russia
  - Planta LLC, Bolshoi Boulevard, 42 Street 1, Moscow 121205, Russia
- Pritha Ray
  - Faculty of Life Science, Homi Bhabha National Institute, Mumbai 400094, India
  - Imaging Cell Signalling and Therapeutics Laboratory, Advanced Centre for Treatment, Research and Education in Cancer, Navi Mumbai 410210, India
- Ilia Yamplosky
  - Institute of Bioorganic Chemistry (IBCh), Russian Academy of Sciences, Moscow 119991, Russia
- Abhijit De
  - Molecular Functional Imaging Laboratory, Advanced Centre for Treatment, Research and Education in Cancer, Navi Mumbai 410210, India
  - Faculty of Life Science, Homi Bhabha National Institute, Mumbai 400094, India
|
19
|
Näsvall K, Boman J, Talla V, Backström N. Base Composition, Codon Usage, and Patterns of Gene Sequence Evolution in Butterflies. Genome Biol Evol 2023; 15:evad150. [PMID: 37565492 PMCID: PMC10462419 DOI: 10.1093/gbe/evad150] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/22/2022] [Revised: 07/17/2023] [Accepted: 08/08/2023] [Indexed: 08/12/2023] Open
Abstract
Coding sequence evolution is influenced by both natural selection and neutral evolutionary forces. In many species, the effects of mutation bias, codon usage, and GC-biased gene conversion (gBGC) on gene sequence evolution have not been detailed. Quantification of how these forces shape substitution patterns is therefore necessary to understand the strength and direction of natural selection. Here, we used comparative genomics to investigate the association between base composition and codon usage bias on gene sequence evolution in butterflies and moths (Lepidoptera), including an in-depth analysis of underlying patterns and processes in one species, Leptidea sinapis. The data revealed significant G/C to A/T substitution bias at third codon position with some variation in the strength among different butterfly lineages. However, the substitution bias was lower than expected from previously estimated mutation rate ratios, partly due to the influence of gBGC. We found that A/T-ending codons were overrepresented in most species, but there was a positive association between the magnitude of codon usage bias and GC-content in third codon positions. In addition, the tRNA-gene population in L. sinapis showed higher GC-content at third codon positions compared to coding sequences in general and less overrepresentation of A/T-ending codons. There was an inverse relationship between synonymous substitutions and codon usage bias indicating selection on synonymous sites. We conclude that the evolutionary rate in Lepidoptera is affected by a complex interaction between underlying G/C -> A/T mutation bias and partly counteracting fixation biases, predominantly conferred by overall purifying selection, gBGC, and selection on codon usage.
Affiliation(s)
- Karin Näsvall
  - Evolutionary Biology Program, Department of Ecology and Genetics (IEG), Uppsala University, Uppsala, Sweden
- Jesper Boman
  - Evolutionary Biology Program, Department of Ecology and Genetics (IEG), Uppsala University, Uppsala, Sweden
- Venkat Talla
  - Evolutionary Biology Program, Department of Ecology and Genetics (IEG), Uppsala University, Uppsala, Sweden
- Niclas Backström
  - Evolutionary Biology Program, Department of Ecology and Genetics (IEG), Uppsala University, Uppsala, Sweden
|
20
|
Fu Y, Liang F, Li C, Warren A, Shin MK, Li L. Codon Usage Bias Analysis in Macronuclear Genomes of Ciliated Protozoa. Microorganisms 2023; 11:1833. [PMID: 37513005 PMCID: PMC10384029 DOI: 10.3390/microorganisms11071833] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 06/02/2023] [Revised: 07/12/2023] [Accepted: 07/13/2023] [Indexed: 07/30/2023] Open
Abstract
Ciliated protozoa (ciliates) are unicellular eukaryotes, several of which are important model organisms for molecular biology research. Analyses of codon usage bias (CUB) of the macronuclear (MAC) genome of ciliates can promote a better understanding of the genetic mode and evolutionary history of these organisms and help optimize codons to improve gene editing efficiency in model ciliates. In this study, the following indices were calculated: the guanine-cytosine (GC) content, the frequency of the nucleotides at the third position of codons (T3, C3, A3, G3), the effective number of codons (ENc), GC content at the 3rd position of synonymous codons (GC3s), and the relative synonymous codon usage (RSCU). Parity rule 2 plot analysis, neutrality plot analysis, ENc plot analysis, and correlation analysis were employed to explore the main influencing factors of CUB. The results showed that the GC content in the MAC genomes of each of 21 ciliate species, the genomes of which were relatively complete, was lower than 50%, and the base compositions of GC and GC3s were markedly distinct. Synonymous codon analysis revealed that the codons in most of the 21 ciliates ended with A or T and four codons were the general putative optimal codons. Collectively, our results indicated that most of the ciliates investigated preferred AT-ending codons and that codon usage bias was affected by gene mutation and natural selection.
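Several of the indices listed in this abstract are simple counts over codons. The sketch below shows one way GC content, GC content at third codon positions, and RSCU might be computed for a single coding sequence. It is an illustration only: the function names are hypothetical, `gc3` counts all third positions rather than only those of synonymous codons (GC3s), and the standard genetic code is assumed, whereas many ciliates use variant nuclear codes that reassign stop codons.

```python
from collections import Counter
from itertools import product

BASES = "TCAG"
# Standard genetic code (NCBI translation table 1), codons in TCAG order.
# Assumption: many ciliates use variant codes, which would need another table.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def split_codons(cds):
    """Split an in-frame sequence into complete codons (RNA or DNA input)."""
    cds = cds.upper().replace("U", "T")
    return [cds[i:i + 3] for i in range(0, len(cds) - len(cds) % 3, 3)]


def gc_content(seq):
    """Fraction of G and C bases in the sequence."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


def gc3(cds):
    """Fraction of codons whose third base is G or C."""
    codons = split_codons(cds)
    return sum(c[2] in "GC" for c in codons) / len(codons)


def rscu(cds):
    """RSCU: observed codon count divided by the count expected if all
    synonymous codons for that amino acid were used equally."""
    counts = Counter(c for c in split_codons(cds) if c in CODON_TABLE)
    families = {}
    for codon, aa in CODON_TABLE.items():
        if aa != "*":
            families.setdefault(aa, []).append(codon)
    values = {}
    for aa, codons in families.items():
        total = sum(counts[c] for c in codons)
        # Skip amino acids that are absent or have only one codon (Met, Trp).
        if total == 0 or len(codons) == 1:
            continue
        for c in codons:
            values[c] = counts[c] * len(codons) / total
    return values


# Toy sequence (hypothetical): Met-Ala-Ala-Ala-Lys-Lys-stop.
example = "ATGGCTGCTGCAAAAAAGTAA"
print(gc_content(example), gc3(example), rscu(example)["GCT"])
```

An RSCU value above 1 marks a codon used more often than expected under equal synonymous usage, and a value below 1 marks one used less often.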
Affiliation(s)
- Yu Fu
  - Laboratory of Marine Protozoan Biodiversity and Evolution, Marine College, Shandong University, Weihai 264209, China
- Fasheng Liang
  - Laboratory of Marine Protozoan Biodiversity and Evolution, Marine College, Shandong University, Weihai 264209, China
- Congjun Li
  - Laboratory of Marine Protozoan Biodiversity and Evolution, Marine College, Shandong University, Weihai 264209, China
- Alan Warren
  - Department of Life Sciences, Natural History Museum, London SW7 5BD, UK
- Mann Kyoon Shin
  - Department of Biology, University of Ulsan, Ulsan 44610, Republic of Korea
- Lifang Li
  - Laboratory of Marine Protozoan Biodiversity and Evolution, Marine College, Shandong University, Weihai 264209, China
|
21
|
Johnson MM, Hockenberry AJ, McGuffie MJ, Vieira LC, Wilke CO. Growth-dependent gene expression variation influences the strength of codon usage biases. bioRxiv 2023:2023.03.14.532645. [PMID: 36993177 PMCID: PMC10055066 DOI: 10.1101/2023.03.14.532645] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 04/29/2023]
Abstract
The most highly expressed genes in microbial genomes tend to use a limited set of synonymous codons, often referred to as "preferred codons." The existence of preferred codons is commonly attributed to selection pressures on various aspects of protein translation including accuracy and/or speed. However, gene expression is condition-dependent and even within single-celled organisms transcript and protein abundances can vary depending on a variety of environmental and other factors. Here, we show that growth rate-dependent expression variation is an important constraint that significantly influences the evolution of gene sequences. Using large-scale transcriptomic and proteomic data sets in Escherichia coli and Saccharomyces cerevisiae, we confirm that codon usage biases are strongly associated with gene expression but highlight that this relationship is most pronounced when gene expression measurements are taken during rapid growth conditions. Specifically, genes whose relative expression increases during periods of rapid growth have stronger codon usage biases than comparably expressed genes whose expression decreases during rapid growth conditions. These findings highlight that gene expression measured in any particular condition tells only part of the story regarding the forces shaping the evolution of microbial gene sequences. More generally, our results imply that microbial physiology during rapid growth is critical for explaining long-term translational constraints.
Affiliation(s)
- Mackenzie M Johnson
  - Department of Integrative Biology, The University of Texas at Austin, Austin, TX, United States of America
- Adam J Hockenberry
  - Department of Integrative Biology, The University of Texas at Austin, Austin, TX, United States of America
- Matthew J McGuffie
  - Department of Molecular Biosciences, Center for Systems and Synthetic Biology, The University of Texas at Austin, Austin, TX, United States of America
- Luiz Carlos Vieira
  - Department of Integrative Biology, The University of Texas at Austin, Austin, TX, United States of America
- Claus O Wilke
  - Department of Integrative Biology, The University of Texas at Austin, Austin, TX, United States of America
|
22
|
The mitochondrial genomes of big-eared bats, Macrotus waterhousii and Macrotus californicus (Chiroptera: Phyllostomidae: Macrotinae). Gene 2023; 863:147295. [PMID: 36804001 DOI: 10.1016/j.gene.2023.147295] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/06/2022] [Revised: 01/25/2023] [Accepted: 02/15/2023] [Indexed: 02/21/2023]
Abstract
In the species-rich family Phyllostomidae, the genus Macrotus ('big eared' bats) contains only two species: Macrotus waterhousii, distributed in western, central, and southern Mexico, Guatemala and some Caribbean Islands, and Macrotus californicus, distributed in the southwestern USA, and in the Baja California peninsula and the state of Sonora in Mexico. In this study, we sequenced and assembled the mitochondrial genome of Macrotus waterhousii and characterized in detail this genome and that of the congeneric M. californicus. Then, we examined the phylogenetic position of Macrotus in the family Phyllostomidae based on protein coding genes (PCGs). The AT-rich mitochondrial genomes of M. waterhousii and M. californicus are 16,792 and 16,691 bp long, respectively, and each encodes 13 PCGs, 22 tRNA genes, 2 rRNA genes, and a putative non-coding control region (CR) 1,336 and 1,232 bp long, respectively. Mitochondrial synteny in Macrotus is identical to that reported before for all other cofamilial species. In the two studied species, all tRNAs exhibit a 'typical' cloverleaf secondary structure with the exception of trnS1, which lacks the D arm. A selective pressure analysis demonstrated that all PCGs are under purifying selection. The CRs of the two species feature three domains previously reported in other mammals, including bats: extended terminal associated sequences (ETAS), central (CD), and conserved sequence block (CSB). A phylogenetic analysis based on the 13 mitochondrial PCGs demonstrated that Macrotus is monophyletic and the subfamily Macrotinae is a sister group of all remaining phyllostomids in our analysis, except Micronycterinae. The assembly and detailed analysis of these mitochondrial genomes represent a further step toward improving the understanding of phylogenetic relationships within the species-rich family Phyllostomidae.
|
23
|
Xu M, Gu Z, Huang J, Guo B, Jiang L, Xu K, Ye Y, Li J. The Complete Mitochondrial Genome of Mytilisepta virgata (Mollusca: Bivalvia), Novel Gene Rearrangements, and the Phylogenetic Relationships of Mytilidae. Genes (Basel) 2023; 14:910. [PMID: 37107667 PMCID: PMC10137486 DOI: 10.3390/genes14040910] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 02/06/2023] [Revised: 04/10/2023] [Accepted: 04/10/2023] [Indexed: 04/29/2023] Open
Abstract
The circular mitochondrial genome of Mytilisepta virgata spans 14,713 bp and contains 13 protein-coding genes (PCGs), 2 ribosomal RNA genes, and 22 transfer RNA genes. Analysis of the 13 PCGs reveals that the mitochondrial gene arrangement of Mytilisepta is relatively conserved at the genus level. The location of the atp8 gene in Mytilisepta keenae differs from that of other species. However, compared with the putative molluscan ancestral gene order, M. virgata exhibits a high level of rearrangement. We constructed phylogenetic trees based on 12 concatenated PCGs from Mytilidae. As a result, we found that M. virgata is in the same clade as other Mytilisepta spp. The estimated divergence times revealed that M. virgata and M. keenae diverged around the early Paleogene period, although the oldest Mytilisepta fossil was from the late or upper Eocene period. Our results provide robust statistical evidence for a sister-group relationship within Mytilida. The findings not only confirm previous results, but also provide valuable insights into the evolutionary history of Mytilidae.
Affiliation(s)
- Minhui Xu
  - National Engineering Research Center for Marine Aquaculture, Zhejiang Ocean University, Zhoushan 316022, China
- Zhongqi Gu
  - Shengsi Marine Science and Technology Institute, Shengsi, Zhoushan 202450, China
- Ji Huang
  - Shengsi Marine Science and Technology Institute, Shengsi, Zhoushan 202450, China
- Baoying Guo
  - National Engineering Research Center for Marine Aquaculture, Zhejiang Ocean University, Zhoushan 316022, China
- Lihua Jiang
  - National Engineering Research Center for Marine Aquaculture, Zhejiang Ocean University, Zhoushan 316022, China
- Kaida Xu
  - Key Laboratory of Sustainable Utilization of Technology Research for Fisheries Resources of Zhejiang Province, Scientific Observing and Experimental Station of Fishery Resources for Key Fishing Grounds, Ministry of Agriculture and Rural Affairs of China, Zhejiang Marine Fisheries Research Institute, Zhoushan 316021, China
- Yingying Ye
  - National Engineering Research Center for Marine Aquaculture, Zhejiang Ocean University, Zhoushan 316022, China
- Jiji Li
  - National Engineering Research Center for Marine Aquaculture, Zhejiang Ocean University, Zhoushan 316022, China
|
24
|
Heames B, Buchel F, Aubel M, Tretyachenko V, Loginov D, Novák P, Lange A, Bornberg-Bauer E, Hlouchová K. Experimental characterization of de novo proteins and their unevolved random-sequence counterparts. Nat Ecol Evol 2023; 7:570-580. [PMID: 37024625 PMCID: PMC10089919 DOI: 10.1038/s41559-023-02010-2] [Citation(s) in RCA: 14] [Impact Index Per Article: 14.0] [Received: 01/14/2022] [Accepted: 02/10/2023] [Indexed: 04/08/2023]
Abstract
De novo gene emergence provides a route for new proteins to be formed from previously non-coding DNA. Proteins born in this way are considered random sequences and typically assumed to lack defined structure. While it remains unclear how likely a de novo protein is to assume a soluble and stable tertiary structure, intersecting evidence from random sequence and de novo-designed proteins suggests that native-like biophysical properties are abundant in sequence space. Taking putative de novo proteins identified in human and fly, we experimentally characterize a library of these sequences to assess their solubility and structure propensity. We compare this library to a set of synthetic random proteins with no evolutionary history. Bioinformatic prediction suggests that de novo proteins may have remarkably similar distributions of biophysical properties to unevolved random sequences of a given length and amino acid composition. However, upon expression in vitro, de novo proteins exhibit moderately higher solubility which is further induced by the DnaK chaperone system. We suggest that while synthetic random sequences are a useful proxy for de novo proteins in terms of structure propensity, de novo proteins may be better integrated in the cellular system than random expectation, given their higher solubility.
Affiliation(s)
- Brennen Heames
  - Institute for Evolution and Biodiversity, University of Münster, Münster, Germany
- Filip Buchel
  - Department of Cell Biology, Charles University, BIOCEV, Prague, Czech Republic
  - Department of Biochemistry, Charles University, Prague, Czech Republic
- Margaux Aubel
  - Institute for Evolution and Biodiversity, University of Münster, Münster, Germany
- Dmitry Loginov
  - Institute of Microbiology, Czech Academy of Sciences, Prague, Czech Republic
- Petr Novák
  - Institute of Microbiology, Czech Academy of Sciences, Prague, Czech Republic
- Andreas Lange
  - Institute for Evolution and Biodiversity, University of Münster, Münster, Germany
- Erich Bornberg-Bauer
  - Institute for Evolution and Biodiversity, University of Münster, Münster, Germany
  - Department of Protein Evolution, MPI for Developmental Biology, Tübingen, Germany
- Klára Hlouchová
  - Department of Cell Biology, Charles University, BIOCEV, Prague, Czech Republic
  - Institute of Organic Chemistry and Biochemistry, Czech Academy of Sciences, Prague, Czech Republic
|
25
|
Khandia R, Pandey MK, Rzhepakovsky IV, Khan AA, Alexiou A. Synonymous Codon Variant Analysis for Autophagic Genes Dysregulated in Neurodegeneration. Mol Neurobiol 2023; 60:2252-2267. [PMID: 36637744 DOI: 10.1007/s12035-022-03081-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 07/03/2022] [Accepted: 09/27/2022] [Indexed: 01/14/2023]
Abstract
Neurodegenerative disorders are often a culmination of the accumulation of abnormally folded proteins and defective organelles. Autophagy is a process that removes these defective proteins, organelles, and harmful substances from the body, and it works to maintain homeostasis. If autophagic removal of defective proteins is interfered with, neuronal health is affected. Some autophagic genes are specifically found to be associated with neurodegenerative phenotypes. Non-functional or mutated gene copies, or gene copies having silent mutations (often termed synonymous variants), might explain this. However, these synonymous variants, which code for identical proteins, can differ in translation rate, stability, and gene expression profile. Hence, it would be interesting to study the pattern of synonymous variant usage. In the study, synonymous variant usage in various transcripts of the autophagic genes ATG5, ATG7, ATG8A, ATG16, and ATG17/FIP200, reported to cause neurodegeneration if dysregulated, is studied. These genes were analyzed for their synonymous variant usage; nucleotide composition; any possible nucleotide skew in a gene; physical properties of the autophagic proteins, including GRAVY and AROMA; hydropathicity; instability index; frequency of acidic, basic, and neutral amino acids; and gene expression level. The study will help understand various evolutionary forces acting on these genes and the possible augmentation of a gene if showing unusual behavior.
Affiliation(s)
- Rekha Khandia
  - Department of Biochemistry and Genetics, Barkatullah University, Bhopal, 462026, India
- Megha Katare Pandey
  - Department of Translational Medicine, All India Institute of Medical Sciences, Bhopal, 462020, India
- Azmat Ali Khan
  - Pharmaceutical Biotechnology Laboratory, Department of Pharmaceutical Chemistry, College of Pharmacy, King Saud University, Riyadh, 11451, Saudi Arabia
- Athanasios Alexiou
  - Novel Global Community Educational Foundation, Hebersham, Australia
  - AFNP Med, Wien, Austria
|
26
|
Picard MAL, Leblay F, Cassan C, Willemsen A, Daron J, Bauffe F, Decourcelle M, Demange A, Bravo IG. Transcriptomic, proteomic, and functional consequences of codon usage bias in human cells during heterologous gene expression. Protein Sci 2023; 32:e4576. [PMID: 36692287 PMCID: PMC9926478 DOI: 10.1002/pro.4576] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 11/17/2022] [Revised: 01/12/2023] [Accepted: 01/14/2023] [Indexed: 01/25/2023]
Abstract
Differences in codon frequency between genomes, genes, or positions along a gene, modulate transcription and translation efficiency, leading to phenotypic and functional differences. Here, we present a multiscale analysis of the effects of synonymous codon recoding during heterologous gene expression in human cells, quantifying the phenotypic consequences of codon usage bias at different molecular and cellular levels, with an emphasis on translation elongation. Six synonymous versions of an antibiotic resistance gene were generated, fused to a fluorescent reporter, and independently expressed in HEK293 cells. Multiscale phenotype was analyzed by means of quantitative transcriptome and proteome assessment, as proxies for gene expression; cellular fluorescence, as a proxy for single-cell level expression; and real-time cell proliferation in absence or presence of antibiotic, as a proxy for the cell fitness. We show that differences in codon usage bias strongly impact the molecular and cellular phenotype: (i) they result in large differences in mRNA levels and protein levels, leading to differences of over 15 times in translation efficiency; (ii) they introduce unpredicted splicing events; (iii) they lead to reproducible phenotypic heterogeneity; and (iv) they lead to a trade-off between the benefit of antibiotic resistance and the burden of heterologous expression. In human cells in culture, codon usage bias modulates gene expression by modifying mRNA availability and suitability for translation, leading to differences in protein levels and eventually eliciting functional phenotypic changes.
Affiliation(s)
- Marion A. L. Picard
  - French National Center for Scientific Research, Laboratory MIVEGEC (CNRS, IRD, University of Montpellier), Montpellier, France
- Fiona Leblay
  - French National Center for Scientific Research, Laboratory MIVEGEC (CNRS, IRD, University of Montpellier), Montpellier, France
- Cécile Cassan
  - French National Center for Scientific Research, Laboratory MIVEGEC (CNRS, IRD, University of Montpellier), Montpellier, France
- Anouk Willemsen
  - French National Center for Scientific Research, Laboratory MIVEGEC (CNRS, IRD, University of Montpellier), Montpellier, France
- Josquin Daron
  - French National Center for Scientific Research, Laboratory MIVEGEC (CNRS, IRD, University of Montpellier), Montpellier, France
- Frédérique Bauffe
  - French National Center for Scientific Research, Laboratory MIVEGEC (CNRS, IRD, University of Montpellier), Montpellier, France
- Mathilde Decourcelle
  - BioCampus Montpellier (University of Montpellier, CNRS, INSERM), Montpellier, France
- Antonin Demange
  - French National Center for Scientific Research, Laboratory MIVEGEC (CNRS, IRD, University of Montpellier), Montpellier, France
- Ignacio G. Bravo
  - French National Center for Scientific Research, Laboratory MIVEGEC (CNRS, IRD, University of Montpellier), Montpellier, France
|
27
|
Benisty H, Hernandez-Alias X, Weber M, Anglada-Girotto M, Mantica F, Radusky L, Senger G, Calvet F, Weghorn D, Irimia M, Schaefer MH, Serrano L. Genes enriched in A/T-ending codons are co-regulated and conserved across mammals. Cell Syst 2023; 14:312-323.e3. [PMID: 36889307 DOI: 10.1016/j.cels.2023.02.002] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Received: 01/25/2022] [Revised: 07/11/2022] [Accepted: 02/09/2023] [Indexed: 03/09/2023]
Abstract
Codon usage influences gene expression distinctly depending on the cell context. Yet, the importance of codon bias in the simultaneous turnover of specific groups of protein-coding genes remains to be investigated. Here, we find that genes enriched in A/T-ending codons are expressed more coordinately in general and across tissues and development than those enriched in G/C-ending codons. tRNA abundance measurements indicate that this coordination is linked to the expression changes of tRNA isoacceptors reading A/T-ending codons. Genes with similar codon composition are more likely to be part of the same protein complex, especially for genes with A/T-ending codons. The codon preferences of genes with A/T-ending codons are conserved among mammals and other vertebrates. We suggest that this orchestration contributes to tissue-specific and ontogenetic-specific expression, which can facilitate, for instance, timely protein complex formation.
Affiliation(s)
- Hannah Benisty
  - Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, Barcelona 08003, Spain
- Xavier Hernandez-Alias
  - Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, Barcelona 08003, Spain
- Marc Weber
  - Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, Barcelona 08003, Spain
- Miquel Anglada-Girotto
  - Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, Barcelona 08003, Spain
- Federica Mantica
  - Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, Barcelona 08003, Spain
- Leandro Radusky
  - Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, Barcelona 08003, Spain
- Gökçe Senger
  - Department of Experimental Oncology, European Institute of Oncology (IEO) IRCCS, Via Adamello 16, Milan 20139, Italy
- Ferriol Calvet
  - Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, Barcelona 08003, Spain
- Donate Weghorn
  - Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, Barcelona 08003, Spain
- Manuel Irimia
  - Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, Barcelona 08003, Spain; ICREA, Pg. Lluis Companys 23, Barcelona 08010, Spain
- Martin H Schaefer
  - Department of Experimental Oncology, European Institute of Oncology (IEO) IRCCS, Via Adamello 16, Milan 20139, Italy
- Luis Serrano
  - Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, Barcelona 08003, Spain; Universitat Pompeu Fabra (UPF), Barcelona 08003, Spain; ICREA, Pg. Lluis Companys 23, Barcelona 08010, Spain
|
28
|
De Novo Assembly and Characterization of the Transcriptome of an Omnivorous Camel Cricket (Tachycines meditationis). Int J Mol Sci 2023; 24:ijms24044005. [PMID: 36835417 PMCID: PMC9966759 DOI: 10.3390/ijms24044005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/02/2023] [Revised: 01/12/2023] [Accepted: 01/16/2023] [Indexed: 02/18/2023]
Abstract
Tachycines meditationis (Orthoptera: Rhaphidophoridae: Tachycines) is a widely distributed insect in eastern Asia. This species is common in urban environments, and its unique omnivorous diet may contribute to its success in various habitats. However, molecular studies on the species are scarce. Here, we obtained the first transcriptome sequence of T. meditationis and performed preliminary analyses to test whether the evolution of coding sequences fits the expectations based on the species' ecology. We retrieved 476,495 effective transcripts and annotated 46,593 coding sequences (CDS). We analysed the codon usage and found that directional mutation pressure was the leading cause of codon usage bias in this species. This genome-wide relaxed codon usage pattern in T. meditationis is surprising, given the potentially large population size of this species. Moreover, despite the omnivorous diet, the chemosensory genes of this species do not exhibit codon usage deviating significantly from the genome-level pattern. They also do not seem to experience more gene family expansion than other cave cricket species do. A thorough search for rapidly evolved genes using the dN/dS value showed that genes associated with substance synthesis and metabolic pathways, such as retinol metabolism, aminoacyl-tRNA biosynthesis, and fatty acid metabolism, underwent species-specific positive selection. While some results seem to contradict the species ecology, our transcriptome assembly provides a valuable molecular resource for future studies on camel cricket evolution and molecular genetics for feeding ecology in insects, in general.
|
29
|
Xie DF, Xie C, Ren T, Song BN, Zhou SD, He XJ. Plastid phylogenomic insights into relationships, divergence, and evolution of Apiales. PLANTA 2022; 256:117. [PMID: 36376499 DOI: 10.1007/s00425-022-04031-w] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 08/20/2022] [Accepted: 11/06/2022] [Indexed: 06/16/2023]
Abstract
Members of Apiales are monophyletic and radiated in the Late Cretaceous. Fruit morphologies are critical for Apiales evolution, and negative selection and mutation pressure play important roles in environmental adaptation. Apiales include many food, spice, medicinal, and ornamental plants, but their phylogenetic relationships, origin and divergence, and adaptive evolution remain poorly understood. Here, we reconstructed the Apiales phylogeny based on 72 plastid genes from the plastid genomes of 280 species representing six of the seven families of this order. Highly supported phylogenetic relationships were detected, which revealed that each family of Apiales is monophyletic and confirmed that Pennantiaceae is a member of Apiales. The genera Centella and Dickinsia are members of Apiaceae, and the genus Hydrocotyle, previously classified into Apiaceae, is confirmed to belong to Araliaceae. In addition, coalescent phylogenetic analysis and gene tree clustering revealed ten genes that can be used for distinguishing species among families of Apiales. Molecular dating suggested that the Apiales originated during the mid-Cretaceous (109.51 Ma), with the radiation of the families occurring in the Late Cretaceous. Apiaceae species exhibit higher differentiation compared to other families. Ancestral trait reconstruction suggested that fruit morphological evolution may be related to shifts in plant types (herbaceous or woody), which in turn are related to the distribution areas and species numbers. Codon bias and positive selection analyses suggest that negative selection and mutation pressure may play important roles in environmental adaptation of Apiales members. Our results improve the phylogenetic framework of Apiales and provide insights into the origin, divergence, and adaptive evolution of this order and its members.
Affiliation(s)
- Deng-Feng Xie
  - Key Laboratory of Bio-Resources and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, 610065, Sichuan, People's Republic of China
- Chuan Xie
  - Sichuan Academy of Forestry, Chengdu, 610081, Sichuan, People's Republic of China
- Ting Ren
  - Key Laboratory of Bio-Resources and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, 610065, Sichuan, People's Republic of China
- Bo-Ni Song
  - Key Laboratory of Bio-Resources and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, 610065, Sichuan, People's Republic of China
- Song-Dong Zhou
  - Key Laboratory of Bio-Resources and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, 610065, Sichuan, People's Republic of China
- Xing-Jin He
  - Key Laboratory of Bio-Resources and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, 610065, Sichuan, People's Republic of China
|
30
|
Yang J, Chu Q, Meng G, Kong W. The complete chloroplast genome sequences of three Broussonetia species and comparative analysis within the Moraceae. PeerJ 2022; 10:e14293. [PMID: 36340196 PMCID: PMC9632464 DOI: 10.7717/peerj.14293] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/18/2022] [Accepted: 10/03/2022] [Indexed: 01/22/2023]
Abstract
Background: Species of Broussonetia (family Moraceae) are commonly used to make textiles and high-grade paper. The distribution of Broussonetia papyrifera L. is considered to be related to the spread and location of humans. The complete chloroplast (cp) genomes of B. papyrifera, Broussonetia kazinoki Sieb., and Broussonetia kaempferi Sieb. were analyzed to better understand the status and evolutionary biology of the genus Broussonetia. Methods: The cp genomes were assembled and characterized using SOAPdenovo2 and DOGMA. Phylogenetic and molecular dating analyses were performed using the concatenated nucleotide sequences of 35 species in the Moraceae family and were based on 66 protein-coding genes (PCGs). An analysis of the sequence divergence (pi) of each PCG among the 35 cp genomes was conducted using DnaSP v6. Codon usage indices were calculated using the CodonW program. Results: All three cp genomes had the typical land plant quadripartite structure, ranging in size from 160,239 bp to 160,841 bp. The ribosomal protein L22 gene (RPL22) was either incomplete or missing in all three Broussonetia species. Phylogenetic analysis revealed two clades. Clade 1 included Morus and Artocarpus, whereas clade 2 included the other seven genera. Malaisia scandens Lour. was clustered within the genus Broussonetia. The differentiation of Broussonetia was estimated to have taken place 26 million years ago. The PCGs' pi values ranged from 0.0005 to 0.0419, indicating small differences within the Moraceae family. The distribution of most of the genes in the effective number of codons plot (ENc-plot) fell on or near the trend line; the slopes of the trend line of neutrality plots were within the range of 0.0363-0.171. These results will facilitate the identification, taxonomy, and utilization of the Broussonetia species and further the evolutionary studies of the Moraceae family.
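Illustrative note on the two plots named in this abstract. The sketch below is not the CodonW workflow used by the authors; it is a minimal Python illustration under stated assumptions. The ENc-plot trend line is usually taken to be Wright's expected curve, ENc = 2 + s + 29 / (s^2 + (1 - s)^2), with s the GC content at third codon positions, and a neutrality plot regresses GC12 on GC3, where a slope near 0 is commonly read as selection dominating over mutation pressure. The function names `gc_by_position`, `expected_enc`, and `neutrality_slope` are hypothetical, and, as a simplification, all codons are counted (Met, Trp, and stop codons are not excluded as they would be for GC3s).

```python
def gc_by_position(cds):
    """Return (GC12, GC3) for one in-frame coding sequence.

    Simplification (assumption): every codon is counted, including
    Met, Trp and stop codons.
    """
    cds = cds.upper()
    codons = [cds[i:i + 3] for i in range(0, len(cds) - len(cds) % 3, 3)]
    if not codons:
        raise ValueError("sequence is shorter than one codon")

    def gc_fraction(position):
        # Fraction of codons carrying G or C at the given codon position.
        return sum(codon[position] in "GC" for codon in codons) / len(codons)

    gc12 = (gc_fraction(0) + gc_fraction(1)) / 2
    return gc12, gc_fraction(2)


def expected_enc(gc3):
    """Wright's expected ENc when only GC3 composition shapes codon usage."""
    return 2 + gc3 + 29 / (gc3 ** 2 + (1 - gc3) ** 2)


def neutrality_slope(points):
    """Least-squares slope of GC12 (y) on GC3 (x) for (gc3, gc12) pairs."""
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in points)
    if sxx == 0:
        raise ValueError("GC3 values are all identical; slope is undefined")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    return sxy / sxx
```

In use, one would compute `gc_by_position` for each protein-coding gene, pass the resulting (GC3, GC12) pairs to `neutrality_slope`, and compare each gene's observed ENc with `expected_enc` at its GC3 value.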
Affiliation(s)
- Jinhong Yang
  - Shaanxi Key Laboratory of Sericulture, Ankang University, Ankang, China
- Qu Chu
  - Shaanxi Key Laboratory of Sericulture, Ankang University, Ankang, China
- Gang Meng
  - Shaanxi Key Laboratory of Sericulture, Ankang University, Ankang, China
- Weiqing Kong
  - Shaanxi Key Laboratory of Sericulture, Ankang University, Ankang, China
|
31
|
Huang YX, Xing ZP, Zhang H, Xu ZB, Tao LL, Hu HY, Kitching IJ, Wang X. Characterization of the Complete Mitochondrial Genome of Eight Diurnal Hawkmoths (Lepidoptera: Sphingidae): New Insights into the Origin and Evolution of Diurnalism in Sphingids. INSECTS 2022; 13:887. [PMID: 36292835 PMCID: PMC9604448 DOI: 10.3390/insects13100887] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 08/04/2022] [Revised: 09/20/2022] [Accepted: 09/27/2022] [Indexed: 06/16/2023]
Abstract
In this study, the mitochondrial genomes of 22 species from three subfamilies in the Sphingidae were sequenced, assembled, and annotated. Eight diurnal hawkmoths were included, of which six were newly sequenced (Hemaris radians, Macroglossum bombylans, M. fritzei, M. pyrrhosticta, Neogurelca himachala, and Sataspes xylocoparis) and two were previously published (Cephonodes hylas and Macroglossum stellatarum). The mitochondrial genomes of these eight diurnal hawkmoths were comparatively analyzed in terms of sequence length, nucleotide composition, relative synonymous codon usage, non-synonymous/synonymous substitution ratio, gene spacing, and repeat sequences. The mitogenomes of the eight species, ranging in length from 15,201 to 15,461 bp, encode the complete set of 37 genes usually found in animal mitogenomes. The base composition of the mitochondrial genomes showed A+T bias. The most commonly used codons were UUA (Leu), AUU (Ile), UUU (Phe), AUA (Met), and AAU (Asn), whereas GCG (Ala) and CCG (Pro) were rarely used. A phylogenetic tree of Sphingidae was constructed based on both maximum likelihood and Bayesian methods. We verified the monophyly of the four current subfamilies of Sphingidae, all of which had high support. In addition, we performed divergence time estimation and ancestral character reconstruction analyses. Diurnal behavior in hawkmoths originated 29.19 million years ago (Mya). It may have been influenced by the combination of herbaceous flourishing, which occurred 26-28 Mya, the uplift of the Tibetan Plateau, and the large-scale evolution of bats in the Oligocene to Pre-Miocene. Moreover, diurnalism in hawkmoths had multiple independent origins in Sphingidae.
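Illustrative note on relative synonymous codon usage (RSCU), one of the measures listed in this abstract. RSCU for a codon is its observed count divided by the count expected if all synonymous codons for that amino acid were used equally. The Python sketch below is a minimal illustration, not the authors' pipeline: the function name `rscu` and the table `TOY_CODE` are hypothetical, and `TOY_CODE` deliberately covers only the Leu and Phe codon families rather than the full invertebrate mitochondrial code.

```python
from collections import Counter, defaultdict

# Toy codon table (assumption): only two amino acid families are listed.
TOY_CODE = {
    "UUA": "Leu", "UUG": "Leu", "CUU": "Leu",
    "CUC": "Leu", "CUA": "Leu", "CUG": "Leu",
    "UUU": "Phe", "UUC": "Phe",
}


def rscu(cds, codon_to_aa):
    """Return {codon: RSCU} for an in-frame coding sequence.

    RSCU = observed count * (number of synonymous codons) / (family total).
    Codons absent from codon_to_aa are ignored; amino acids that never
    occur in the sequence are skipped.
    """
    cds = cds.upper().replace("T", "U")
    usable = len(cds) - len(cds) % 3
    counts = Counter(cds[i:i + 3] for i in range(0, usable, 3))

    # Group codons into synonymous families.
    families = defaultdict(list)
    for codon, amino_acid in codon_to_aa.items():
        families[amino_acid].append(codon)

    result = {}
    for codons in families.values():
        total = sum(counts[codon] for codon in codons)
        if total == 0:
            continue
        for codon in codons:
            result[codon] = counts[codon] * len(codons) / total
    return result


example = rscu("TTATTATTATTGTTTTTT", TOY_CODE)
```

Values above 1 mark codons used more often than expected under equal usage, which is how a preference such as UUA (Leu) would show up in an A+T-biased mitogenome.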
Affiliation(s)
- Yi-Xin Huang
  - Collaborative Innovation Center of Recovery and Reconstruction of Degraded Ecosystem in Wanjiang Basin Co-Founded by Anhui Province and Ministry of Education, School of Ecology and Environment, Anhui Normal University, Wuhu 241000, China
  - Key Laboratory of the Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, No. 1 Beichen West Road, Chaoyang District, Beijing 100101, China
- Zhi-Ping Xing
  - Collaborative Innovation Center of Recovery and Reconstruction of Degraded Ecosystem in Wanjiang Basin Co-Founded by Anhui Province and Ministry of Education, School of Ecology and Environment, Anhui Normal University, Wuhu 241000, China
- Hao Zhang
  - Anhui Provincial Key Laboratory of the Conservation and Exploitation of Biological Resources, College of Life Sciences, Anhui Normal University, Wuhu 241000, China
- Zhen-Bang Xu
  - Institute of Resource Plants, Yunnan University, Kunming 650500, China
- Li-Long Tao
  - Anhui Provincial Key Laboratory of the Conservation and Exploitation of Biological Resources, College of Life Sciences, Anhui Normal University, Wuhu 241000, China
- Hao-Yuan Hu
  - Collaborative Innovation Center of Recovery and Reconstruction of Degraded Ecosystem in Wanjiang Basin Co-Founded by Anhui Province and Ministry of Education, School of Ecology and Environment, Anhui Normal University, Wuhu 241000, China
- Xu Wang
  - Key Laboratory of the Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, No. 1 Beichen West Road, Chaoyang District, Beijing 100101, China
  - Anhui Provincial Key Laboratory of the Conservation and Exploitation of Biological Resources, College of Life Sciences, Anhui Normal University, Wuhu 241000, China
|
32
|
Miller JB, Meurs TE, Hodgman MW, Song B, Miller KN, Ebbert MTW, Kauwe JSK, Ridge PG. The Ramp Atlas: facilitating tissue and cell-specific ramp sequence analyses through an intuitive web interface. NAR Genom Bioinform 2022; 4:lqac039. [PMID: 35664804 PMCID: PMC9155233 DOI: 10.1093/nargab/lqac039] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/12/2021] [Revised: 03/01/2022] [Accepted: 05/24/2022] [Indexed: 11/14/2022]
Abstract
Ramp sequences occur when the average translational efficiency of codons near the 5′ end of highly expressed genes is significantly lower than the rest of the gene sequence, which counterintuitively increases translational efficiency by decreasing downstream ribosomal collisions. Here, we show that the relative codon adaptiveness within different tissues changes the existence of a ramp sequence without altering the underlying genetic code. We present the first comprehensive analysis of tissue and cell type-specific ramp sequences and report 3108 genes with ramp sequences that change between tissues and cell types, which corresponds with increased gene expression within those tissues and cells. The Ramp Atlas (https://ramps.byu.edu/) allows researchers to query precomputed ramp sequences in 18 388 genes across 62 tissues and 66 cell types and calculate tissue-specific ramp sequences from user-uploaded FASTA files through an intuitive web interface. We used The Ramp Atlas to identify seven SARS-CoV-2 genes and seven human SARS-CoV-2 entry factor genes with tissue-specific ramp sequences that may help explain viral proliferation within those tissues. We anticipate that The Ramp Atlas will facilitate personalized and creative tissue-specific ramp sequence analyses for both human and viral genes that will increase our ability to utilize this often-overlooked regulatory region.
Affiliation(s)
- Justin B Miller
  - Sanders-Brown Center on Aging, University of Kentucky, Lexington, KY 40504, USA
- Taylor E Meurs
  - Department of Biology, Brigham Young University, Provo, UT 84602, USA
- Matthew W Hodgman
  - Sanders-Brown Center on Aging, University of Kentucky, Lexington, KY 40504, USA
- Benjamin Song
  - Department of Biology, Brigham Young University, Provo, UT 84602, USA
- Kyle N Miller
  - Department of Computer Science, Utah Valley University, Orem, UT 84058, USA
- Mark T W Ebbert
  - Sanders-Brown Center on Aging, University of Kentucky, Lexington, KY 40504, USA
- John S K Kauwe
  - Department of Biology, Brigham Young University, Provo, UT 84602, USA
- Perry G Ridge
  - Department of Biology, Brigham Young University, Provo, UT 84602, USA
|
33
|
Cope AL, Shah P. Intragenomic variation in non-adaptive nucleotide biases causes underestimation of selection on synonymous codon usage. PLoS Genet 2022; 18:e1010256. [PMID: 35714134 PMCID: PMC9246145 DOI: 10.1371/journal.pgen.1010256] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/03/2021] [Revised: 06/30/2022] [Accepted: 05/13/2022] [Indexed: 11/20/2022]
Abstract
Patterns of non-uniform usage of synonymous codons vary across genes in an organism and between species across all domains of life. This codon usage bias (CUB) is due to a combination of non-adaptive (e.g. mutation biases) and adaptive (e.g. natural selection for translation efficiency/accuracy) evolutionary forces. Most models quantify the effects of mutation bias and selection on CUB assuming uniform mutational and other non-adaptive forces across the genome. However, non-adaptive nucleotide biases can vary within a genome due to processes such as biased gene conversion (BGC), potentially obfuscating signals of selection on codon usage. Moreover, genome-wide estimates of non-adaptive nucleotide biases are lacking for non-model organisms. We combine an unsupervised learning method with a population genetics model of synonymous coding sequence evolution to assess the impact of intragenomic variation in non-adaptive nucleotide bias on quantification of natural selection on synonymous codon usage across 49 Saccharomycotina yeasts. We find that in the absence of a priori information, unsupervised learning can be used to identify genes evolving under different non-adaptive nucleotide biases. We find that the impact of intragenomic variation in non-adaptive nucleotide bias varies widely, even among closely-related species. We show that the overall strength and direction of translational selection can be underestimated by failing to account for intragenomic variation in non-adaptive nucleotide biases. Interestingly, genes falling into clusters identified by machine learning are also physically clustered across chromosomes. Our results indicate the need for more nuanced models of sequence evolution that systematically incorporate the effects of variable non-adaptive nucleotide biases on codon frequencies.
Affiliation(s)
- Alexander L. Cope
  - Department of Genetics, Rutgers University, Piscataway, New Jersey, United States of America
  - Human Genetics Institute of New Jersey, Rutgers University, Piscataway, New Jersey, United States of America
  - Robert Wood Johnson Medical School, Rutgers University, Piscataway, New Jersey, United States of America
- Premal Shah
  - Department of Genetics, Rutgers University, Piscataway, New Jersey, United States of America
  - Human Genetics Institute of New Jersey, Rutgers University, Piscataway, New Jersey, United States of America
|
34
|
Ho AT, Hurst LD. Unusual mammalian usage of TGA stop codons reveals that sequence conservation need not imply purifying selection. PLoS Biol 2022; 20:e3001588. [PMID: 35550630 PMCID: PMC9129041 DOI: 10.1371/journal.pbio.3001588] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Received: 02/17/2022] [Revised: 05/24/2022] [Accepted: 04/20/2022] [Indexed: 11/18/2022]
Abstract
The assumption that conservation of sequence implies the action of purifying selection is central to diverse methodologies to infer functional importance. GC-biased gene conversion (gBGC), a meiotic mismatch repair bias strongly favouring GC over AT, can in principle mimic the action of selection, this being thought to be especially important in mammals. As mutation is GC→AT biased, to demonstrate that gBGC does indeed cause false signals requires evidence that an AT-rich residue is selectively optimal compared to its more GC-rich allele, while showing also that the GC-rich alternative is conserved. We propose that mammalian stop codon evolution provides a robust test case. Although in most taxa TAA is the optimal stop codon, TGA is both abundant and conserved in mammalian genomes. We show that this mammalian exceptionalism is well explained by gBGC mimicking purifying selection and that TAA is the selectively optimal codon. Supportive of gBGC, we observe (i) TGA usage trends are consistent at the focal stop codon and elsewhere (in UTR sequences); (ii) that higher TGA usage and higher TAA→TGA substitution rates are predicted by a high recombination rate; and (iii) across species the difference in TAA↔TGA substitution rates between GC-rich and GC-poor genes is largest in genomes that possess higher between-gene GC variation. TAA optimality is supported both by enrichment in highly expressed genes and trends associated with effective population size. High TGA usage and high TAA→TGA rates in mammals are thus consistent with gBGC’s predicted ability to “drive” deleterious mutations and support the hypothesis that sequence conservation need not be indicative of purifying selection. A general trend for GC-rich trinucleotides to reside at frequencies far above their mutational equilibrium in high recombining domains supports the generality of these results.
Affiliation(s)
- Alexander Thomas Ho
  - Milner Centre for Evolution, University of Bath, Bath, United Kingdom
|
35
|
Muyle AM, Seymour DK, Lv Y, Huettel B, Gaut BS. Gene-body methylation in plants: mechanisms, functions and important implications for understanding evolutionary processes. Genome Biol Evol 2022; 14:6550137. [PMID: 35298639 PMCID: PMC8995044 DOI: 10.1093/gbe/evac038] [Citation(s) in RCA: 41] [Impact Index Per Article: 20.5] [Accepted: 03/11/2022] [Indexed: 11/13/2022]
Abstract
Gene body methylation (gbM) is an epigenetic mark where gene exons are methylated in the CG context only, as opposed to CHG and CHH contexts (where H stands for A, C, or T). CG methylation is transmitted transgenerationally in plants, opening the possibility that gbM may be shaped by adaptation. This presupposes, however, that gbM has a function that affects phenotype, which has been a topic of debate in the literature. Here, we review our current knowledge of gbM in plants. We start by presenting the well-elucidated mechanisms of plant gbM establishment and maintenance. We then review more controversial topics: the evolution of gbM and the potential selective pressures that act on it. Finally, we discuss the potential functions of gbM that may affect organismal phenotypes: gene expression stabilization and upregulation, inhibition of aberrant transcription (reverse and internal), prevention of aberrant intron retention, and protection against TE insertions. To bolster the review of these topics, we include novel analyses to assess the effect of gbM on transcripts. Overall, a growing body of literature finds that gbM correlates with levels and patterns of gene expression. It is not clear, however, if this is a causal relationship. Altogether, functional work suggests that the effects of gbM, if any, must be relatively small, but there is nonetheless evidence that it is shaped by natural selection. We conclude by discussing the potential adaptive character of gbM and its implications for an updated view of the mechanisms of adaptation in plants.
Affiliation(s)
- Yuanda Lv
- Provincial Key Laboratory of Agrobiology, Institute of Crop Germplasm and Biotechnology, Jiangsu Academy of Agricultural Sciences, Nanjing, China
- Bruno Huettel
- Max Planck Genome Centre Cologne, Max Planck Institute for Plant Breeding, Cologne, Germany
|
36
|
Abstract
Significance: The dynamics of deleterious variation under contrasting demographic scenarios remain poorly understood in spite of their relevance in evolutionary and conservation terms. Here we apply a genomic approach to study differences in the burden of deleterious alleles between the endangered Iberian lynx (Lynx pardinus) and the widespread Eurasian lynx (Lynx lynx). Our analysis unveils a significantly lower deleterious burden in the former species that should be ascribed to genetic purging, that is, to the increased opportunities of selection against recessive homozygotes due to the inbreeding caused by its smaller population size, as illustrated by our analytical predictions. This research provides theoretical and empirical evidence on the evolutionary relevance of genetic purging under certain demographic conditions.
|
37
|
Fields PD, McTaggart S, Reisser CMO, Haag C, Palmer WH, Little TJ, Ebert D, Obbard DJ. Population-genomic analysis identifies a low rate of global adaptive fixation in the proteins of the cyclical parthenogen Daphnia magna. Mol Biol Evol 2022; 39:6542319. [PMID: 35244177 PMCID: PMC8963301 DOI: 10.1093/molbev/msac048] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Indexed: 11/26/2022] Open
Abstract
Daphnia are well-established ecological and evolutionary models, and the interaction between D. magna and its microparasites is widely considered a paragon of the host-parasite coevolutionary process. Like other well-studied arthropods such as Drosophila melanogaster and Anopheles gambiae, D. magna is a small, widespread, and abundant species that is therefore expected to display a large long-term population size and high rates of adaptive protein evolution. However, unlike these other species, D. magna is cyclically asexual and lives in a highly structured environment (ponds and lakes) with moderate levels of dispersal, both of which are predicted to impact upon long-term effective population size and adaptive protein evolution. To investigate patterns of adaptive protein fixation, we produced the complete coding genomes of 36 D. magna clones sampled from across the European range (Western Palaearctic), along with draft sequences for the close relatives D. similis and D. lumholtzi, used as outgroups. We analyzed genome-wide patterns of adaptive fixation, with a particular focus on genes that have an a priori expectation of high rates, such as those likely to mediate immune responses, RNA interference against viruses and transposable elements, and those with a strongly male-biased expression pattern. We find that, as expected, D. magna displays high levels of diversity and that this is highly structured among populations. However, compared with Drosophila, we find that D. magna proteins appear to have a high proportion of weakly deleterious variants and do not show evidence of pervasive adaptive fixation across its entire range. This is true of the genome as a whole, and also of putative ‘arms race’ genes that often show elevated levels of adaptive substitution in other species. 
In addition to the likely impact of extensive, and previously documented, local adaptation, we speculate that these findings may reflect reduced efficacy of selection associated with cyclical asexual reproduction.
Affiliation(s)
- Peter D Fields
- University of Basel, Department of Environmental Sciences, Zoology, Vesalgasse 1, Basel, CH-4051, Switzerland
- Seanna McTaggart
- Institute of Evolutionary Biology; School of Biological Sciences University of Edinburgh, Edinburgh, EH9 3JT, United Kingdom
- Céline M O Reisser
- Centre d'Ecologie Fonctionnelle et Evolutive CEFE UMR 5175, Univ Montpellier, CNRS, EPHE, IRD, Univ Paul Valéry Montpellier 3, campus CNRS, 1919, route de Mende, 34293 Montpellier Cedex 5, France; MARBEC, Univ Montpellier, CNRS, IFREMER, IRD, Montpellier, France
- Christoph Haag
- Centre d'Ecologie Fonctionnelle et Evolutive CEFE UMR 5175, Univ Montpellier, CNRS, EPHE, IRD, Univ Paul Valéry Montpellier 3, campus CNRS, 1919, route de Mende, 34293 Montpellier Cedex 5, France
- William H Palmer
- Institute of Evolutionary Biology; School of Biological Sciences University of Edinburgh, Edinburgh, EH9 3JT, United Kingdom
- Tom J Little
- Institute of Evolutionary Biology; School of Biological Sciences University of Edinburgh, Edinburgh, EH9 3JT, United Kingdom
- Dieter Ebert
- University of Basel, Department of Environmental Sciences, Zoology, Vesalgasse 1, Basel, CH-4051, Switzerland
- Darren J Obbard
- Institute of Evolutionary Biology; School of Biological Sciences University of Edinburgh, Edinburgh, EH9 3JT, United Kingdom
|
38
|
Abdoli R, Mazumder TH, Nematollahian S, Zanjani RS, Mesbah RA, Uddin A. Gaining insights into the compositional constraints and molecular phylogeny of five silkworms mitochondrial genome. Int J Biol Macromol 2022; 206:543-552. [PMID: 35245576 DOI: 10.1016/j.ijbiomac.2022.02.135] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 10/29/2021] [Revised: 12/08/2021] [Accepted: 02/22/2022] [Indexed: 11/28/2022]
Abstract
This study was performed to identify codon usage bias (CUB), genetic similarity and phylogenetic analysis of complete mitochondrial genomes along with separate sequences of 13 protein coding genes per genome from five types of silkworm including Bombyx mori, Bombyx mandarina, Samia cynthia ricini, Antheraea pernyi and Antheraea assama. Nucleotide composition analysis suggested that AT content was higher than GC content and t-test analysis revealed a significant difference (p < 0.01) between AT and GC content. Relative synonymous CUB analysis revealed that most over-represented codons end with A or T. Parity plot analysis revealed that both natural selection and mutation pressure influenced CUB of mitochondrial genes, while neutrality plot analysis suggested that the role of natural selection was greater than that of mutation pressure. The effective number of codons (ENC) revealed that CUB was low among genes and genomes. In phylogenetic analysis of complete mitochondrial genomes, B. mori fell in the same cluster as Bombyx mandarina and showed the most similarity (96.7%). In terms of protein coding genes, COX1, COX2 and COX3 showed the most obvious differences. In conclusion, comparative analysis of mitochondrial genomes could be used to identify differences in gene organization, accurate phylogenetic analysis and clustering of different types of silkworms.
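The relative synonymous codon usage (RSCU) measure mentioned in this abstract can be illustrated with a minimal sketch. It is not the authors' pipeline: the function name `rscu` is hypothetical, and the standard nuclear genetic code is assumed purely for illustration, whereas a mitochondrial analysis would need the appropriate mitochondrial code table. RSCU for a codon is its observed count divided by the count expected if all synonymous codons for that amino acid were used equally.

```python
from collections import Counter, defaultdict
from itertools import product

# Standard genetic code (assumption: used here only for illustration;
# mitochondrial genomes use a different translation table).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def rscu(cds):
    """Return {codon: RSCU} for every amino acid observed in the coding sequence."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3          # ignore a trailing partial codon
    codons = [cds[i:i + 3] for i in range(0, usable, 3)]
    counts = Counter(c for c in codons if c in CODON_TABLE)

    # Group codons into synonymous families by encoded amino acid.
    families = defaultdict(list)
    for codon, aa in CODON_TABLE.items():
        families[aa].append(codon)

    values = {}
    for aa, family in families.items():
        total = sum(counts[c] for c in family)
        if aa == "*" or total == 0:           # skip stop codons and unseen amino acids
            continue
        for c in family:
            # observed count / (family total / family size)
            values[c] = counts[c] * len(family) / total
    return values


if __name__ == "__main__":
    # Four alanine codons: GCT x3 and GCA x1.
    print(rscu("GCTGCTGCTGCA"))
```

Values above 1 mark codons used more often than expected under equal usage, which is the sense in which the abstract calls A- or T-ending codons over-represented.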
Affiliation(s)
- Ramin Abdoli
- Iran Silk Research Center, Agricultural Research, Education and Extension Organization (AREEO), Tehran, Iran.
- Shahla Nematollahian
- Iran Silk Research Center, Agricultural Research, Education and Extension Organization (AREEO), Tehran, Iran
- Reza Sourati Zanjani
- Iran Silk Research Center, Agricultural Research, Education and Extension Organization (AREEO), Tehran, Iran
- Rahim Abdollahi Mesbah
- Iran Silk Research Center, Agricultural Research, Education and Extension Organization (AREEO), Tehran, Iran
- Arif Uddin
- Department of Zoology, Moinul Hoque Choudhury Memorial Science College, Algapur, Hailakandi 788150, Assam, India.
|
39
|
Abstract
Non-random usage of synonymous codons, known as “codon bias”, has been described in many organisms, from bacteria to Drosophila, but little is known about it in phytoplankton. This phenomenon is thought to be driven by selection for translational efficiency. As the efficacy of selection is proportional to the effective population size, species with large population sizes, such as phytoplankton, are expected to have strong codon bias. To test this, we measured codon bias in 215 strains from Haptophyta, Chlorophyta, Ochrophyta (except diatoms that were studied previously), Dinophyta, Cryptophyta, Ciliophora, unicellular Rhodophyta and Chlorarachniophyta. Codon bias is modest in most groups, despite the astronomically large population sizes of marine phytoplankton. The strength of the codon bias, measured with the effective number of codons, is the strongest in Haptophyta and the weakest in Chlorarachniophyta. The optimal codons are GC-ending in most cases, but several shifts to AT-ending codons were observed (mainly in Ochrophyta and Ciliophora). As it takes a long time to reach a new equilibrium after such shifts, species having AT-ending codons show a lower frequency of optimal codons compared to other species. Genetic diversity, calculated for species with more than three strains sequenced, is modest, indicating that the effective population sizes are many orders of magnitude lower than the astronomically large census population sizes, which helps to explain the modest codon bias in marine phytoplankton. This study represents the first comparative analysis of codon bias across multiple major phytoplankton groups.
|
40
|
Wang F, Tekle YI. Variation of natural selection in the Amoebozoa reveals heterogeneity across the phylogeny and adaptive evolution in diverse lineages. Front Ecol Evol 2022; 10:851816. [PMID: 36874909 PMCID: PMC9980437 DOI: 10.3389/fevo.2022.851816] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/13/2022] Open
Abstract
The evolution and diversity of the supergroup Amoebozoa is complex and poorly understood. The supergroup encompasses predominantly amoeboid lineages characterized by extreme diversity in phenotype, behavior and genetics. The study of natural selection, a driving force of diversification, within and among species of Amoebozoa will play a crucial role in understanding the evolution of the supergroup. In this study, we searched for traces of natural selection based on a set of highly conserved protein-coding genes in a phylogenetic framework from a broad sampling of amoebozoans. Using these genes, we estimated substitution rates and inferred patterns of selective pressure in lineages and sites with various models. We also examined the effect of selective pressure on codon usage bias and potential correlations with observed biological traits and habitat. Results showed large heterogeneity of selection across lineages of Amoebozoa, indicating potential species-specific optimization of adaptation to their diverse ecological environment. Overall, lineages in Tubulinea had undergone stronger purifying selection with higher average substitution rates compared to Discosea and Evosea. Evidence of adaptive evolution was observed in some representative lineages and in a gene (Rpl7a) within Evosea, suggesting potential innovation and beneficial mutations in these lineages. Our results revealed that members of the fast-evolving lineages, Entamoeba and Cutosea, all underwent strong purifying selection but had distinct patterns of codon usage bias. For the first time, this study revealed an overall pattern of natural selection across the phylogeny of Amoebozoa and provided significant implications on their distinctive evolutionary processes.
Affiliation(s)
- Fang Wang
- Department of Biology, Spelman College, Atlanta, GA, United States
- Yonas I Tekle
- Department of Biology, Spelman College, Atlanta, GA, United States
|
41
|
Li Y, Wang R, Wang H, Pu F, Feng X, Jin L, Ma Z, Ma XX. Codon Usage Bias in Autophagy-Related Gene 13 in Eukaryotes: Uncovering the Genetic Divergence by the Interplay Between Nucleotides and Codon Usages. Front Cell Infect Microbiol 2021; 11:771010. [PMID: 34804999 PMCID: PMC8602353 DOI: 10.3389/fcimb.2021.771010] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Received: 09/05/2021] [Accepted: 10/12/2021] [Indexed: 12/15/2022] Open
Abstract
Synonymous codon usage bias is a universal characteristic of genomes across various organisms. Autophagy-related gene 13 (atg13) is one essential gene for autophagy initiation, yet the evolutionary trends of the atg13 gene in nucleotide and synonymous codon usage remain unexplored. According to phylogenetic analyses of the atg13 gene of 226 eukaryotic organisms at the nucleotide and amino acid levels, it is clear that their nucleotide usages exhibit more genetic information than their amino acid usages. Specifically, the overall nucleotide usage bias quantified by information entropy showed that the usage biases at the first and second codon positions were stronger than those at the third position of the atg13 genes. Furthermore, the bias level of nucleotide ‘G’ usage is highest, while that of nucleotide ‘C’ usage is lowest in the atg13 genes. In addition, genetic features represented by synonymous codon usage exhibit, to some extent, a species-specific pattern in the evolution of the atg13 genes. Interestingly, the codon usages of atg13 genes in the ancestor animals (Latimeria chalumnae, Petromyzon marinus, and Rhinatrema bivittatum) are strongly influenced by mutation pressure from nucleotide composition constraint. However, the distributions of nucleotide composition at different codon positions in the atg13 gene show that natural selection still dominates atg13 codon usages during organisms’ evolution.
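One common way to quantify nucleotide usage bias with information entropy is to compute the Shannon entropy of base frequencies at each codon position. The sketch below assumes that definition, which may differ from the exact formula used in the study, and the function name `position_entropy` is hypothetical. Lower entropy means a less even, more biased use of the four nucleotides at that position.

```python
import math
from collections import Counter


def position_entropy(cds):
    """Shannon entropy (bits) of nucleotide usage at codon positions 1, 2 and 3."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3              # ignore a trailing partial codon
    entropies = []
    for pos in range(3):
        # Collect every base that sits at this codon position.
        counts = Counter(cds[i] for i in range(pos, usable, 3))
        total = sum(counts.values())
        h = -sum((n / total) * math.log2(n / total) for n in counts.values())
        entropies.append(h)
    return entropies


if __name__ == "__main__":
    # Codons: ATG ACG AAG ATT
    print(position_entropy("ATGACGAAGATT"))
```

With four possible bases the maximum is 2 bits (all bases equally frequent), so a position with entropy near 0 is dominated by a single nucleotide.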
Affiliation(s)
- Yicong Li
- Biomedical Research Center, Northwest Minzu University, Lanzhou, China
- Rui Wang
- Viterbi School of Engineering, University of Southern California, Los Angeles, CA, United States
- Huihui Wang
- Biomedical Research Center, Northwest Minzu University, Lanzhou, China
- Feiyang Pu
- Biomedical Research Center, Northwest Minzu University, Lanzhou, China
- Xili Feng
- Biomedical Research Center, Northwest Minzu University, Lanzhou, China
- Li Jin
- Biomedical Research Center, Northwest Minzu University, Lanzhou, China
- Zhongren Ma
- Biomedical Research Center, Northwest Minzu University, Lanzhou, China
- Xiao-Xia Ma
- Biomedical Research Center, Northwest Minzu University, Lanzhou, China
|
42
|
Wang P, Mao Y, Su Y, Wang J. Comparative analysis of transcriptomic data shows the effects of multiple evolutionary selection processes on codon usage in Marsupenaeus japonicus and Marsupenaeus pulchricaudatus. BMC Genomics 2021; 22:781. [PMID: 34717552 PMCID: PMC8557549 DOI: 10.1186/s12864-021-08106-y] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Received: 09/27/2020] [Accepted: 10/19/2021] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND: Kuruma shrimp, a major commercial shrimp species in the world, has two cryptic or sibling species, Marsupenaeus japonicus and Marsupenaeus pulchricaudatus. Codon usage analysis would contribute to our understanding of the genetic and evolutionary characteristics of the two Marsupenaeus species. In this study, we analyzed codon usage and related indices using coding sequences (CDSs) from RNA-seq data. RESULTS: Using CodonW 1.4.2 software, we performed the codon bias analysis of transcriptomes obtained from hepatopancreas tissues, which indicated weak codon bias. Almost all parameters had similar correlations for both species. The gene expression level (FPKM) was negatively correlated with A/T3s. We determined 12 and 14 optimal codons for M. japonicus and M. pulchricaudatus, respectively, and all optimal codons have a C/G-ending. The two Marsupenaeus species had different usage frequencies of codon pairs, which contributed to further analysis of transcriptional differences between them. Orthologous genes that underwent positive selection (ω > 1) had a higher correlation coefficient than those that experienced purifying selection (ω < 1). Parity Rule 2 (PR2) and effective number of codons (ENc) plot analysis showed that the codon usage patterns of both species were influenced by both mutations and selection. Moreover, the average observed ENc value was lower than the expected value for both species, suggesting that factors other than GC may play roles in these phenomena. The results of multispecies clustering based on codon preference were consistent with traditional classification. CONCLUSIONS: This study provides a relatively comprehensive understanding of the correlations among codon usage bias, gene expression, and selection pressures of CDSs for M. japonicus and M. pulchricaudatus. The genetic evolution was driven by mutations and selection pressure. Moreover, the results provide new insights into the specificities and evolutionary characteristics of the two Marsupenaeus species.
Affiliation(s)
- Panpan Wang
- Jiangsu Key Laboratory of Marine Bioresources and Environment/ Jiangsu Key Laboratory of Marine Biotechnology, Jiangsu Ocean University, Lianyungang, 222005, China
- Co-Innovation Center of Jiangsu Marine Bio-Industry Technology, Jiangsu Ocean University, Lianyungang, 222005, China
- The Jiangsu Provincial Infrastructure for Conservation and Utilization of Agricultural Germplasm, Nanjing, 210014, China
- State Key Laboratory of Marine Environmental Science, College of Ocean and Earth Sciences, Xiamen University, Xiamen, 361102, Fujian, China
- Yong Mao
- State Key Laboratory of Marine Environmental Science, College of Ocean and Earth Sciences, Xiamen University, Xiamen, 361102, Fujian, China.
- Fujian Key Laboratory of Genetics and Breeding of Marine Organisms, Xiamen University, Xiamen, 361102, China.
- Yongquan Su
- State Key Laboratory of Marine Environmental Science, College of Ocean and Earth Sciences, Xiamen University, Xiamen, 361102, Fujian, China
- Jun Wang
- State Key Laboratory of Marine Environmental Science, College of Ocean and Earth Sciences, Xiamen University, Xiamen, 361102, Fujian, China
|
43
|
Daron J, Bravo IG. Variability in Codon Usage in Coronaviruses Is Mainly Driven by Mutational Bias and Selective Constraints on CpG Dinucleotide. Viruses 2021; 13:v13091800. [PMID: 34578381 PMCID: PMC8473333 DOI: 10.3390/v13091800] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 07/26/2021] [Revised: 08/30/2021] [Accepted: 08/31/2021] [Indexed: 12/18/2022] Open
Abstract
The Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is the third human-emerged virus of the 21st century from the Coronaviridae family, causing the ongoing coronavirus disease 2019 (COVID-19) pandemic. Due to the high zoonotic potential of coronaviruses, it is critical to unravel their evolutionary history of host species breadth, host-switch potential, adaptation and emergence, to identify viruses posing a pandemic risk in humans. We present here a comprehensive analysis of the composition and codon usage bias of the 82 Orthocoronavirinae members, infecting 47 different avian and mammalian hosts. Our results clearly establish that synonymous codon usage varies widely among viruses, is only weakly dependent on their primary host, and is dominated by mutational bias towards AU-enrichment and by CpG avoidance. Indeed, variation in GC3 explains around 34%, while variation in CpG frequency explains around 14% of total variation in codon usage bias. Further insight on the mutational equilibrium within Orthocoronavirinae revealed that most coronavirus genomes are close to their neutral equilibrium, the exception being the three recently infecting human coronaviruses, which lie further away from the mutational equilibrium than their endemic human coronavirus counterparts. Finally, our results suggest that, while replicating in humans, SARS-CoV-2 is slowly becoming AU-richer, likely until attaining a new mutational equilibrium.
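The two composition statistics this abstract relies on, GC3 and CpG frequency, can be sketched as follows. This is an illustrative sketch rather than the authors' method: the function names are hypothetical, sequences are assumed to be written in the DNA alphabet (T rather than U), GC3 is taken over all third codon positions, and CpG depletion is expressed as the usual observed/expected ratio.

```python
def gc3(cds):
    """Fraction of G or C at third codon positions of a coding sequence."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3      # ignore a trailing partial codon
    third = cds[2:usable:3]               # every third base, starting at index 2
    if not third:
        return 0.0
    return sum(base in "GC" for base in third) / len(third)


def cpg_obs_exp(seq):
    """Observed/expected CpG ratio: f(CG) / (f(C) * f(G))."""
    seq = seq.upper()
    n = len(seq)
    c, g = seq.count("C"), seq.count("G")
    if n < 2 or c == 0 or g == 0:
        return 0.0
    # Count CG dinucleotides over all n - 1 overlapping positions.
    cg = sum(1 for i in range(n - 1) if seq[i:i + 2] == "CG")
    return (cg / (n - 1)) / ((c / n) * (g / n))


if __name__ == "__main__":
    print(gc3("ATGGCCAAATTC"))      # third positions are G, C, A, C
    print(cpg_obs_exp("ACGTACGT"))
```

A ratio well below 1 indicates CpG avoidance, and a low GC3 indicates A/U-ending codon enrichment; these are the two trends the abstract reports as dominating coronavirus codon usage.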
Affiliation(s)
- Josquin Daron
- Laboratoire MIVEGEC (CNRS, IRD, Université de Montpellier), 34394 Montpellier, France
- Ignacio G. Bravo
- Laboratoire MIVEGEC (CNRS, IRD, Université de Montpellier), 34394 Montpellier, France
- Center for Research on the Ecology and Evolution of Diseases (CREES), 34394 Montpellier, France
|
44
|
Iriarte A, Lamolle G, Musto H. Codon Usage Bias: An Endless Tale. J Mol Evol 2021; 89:589-593. [PMID: 34383106 DOI: 10.1007/s00239-021-10027-z] [Citation(s) in RCA: 33] [Impact Index Per Article: 11.0] [Received: 06/17/2021] [Accepted: 08/06/2021] [Indexed: 11/28/2022]
Abstract
Since the genetic code is degenerate, several codons are translated to the same amino acid. Although these triplets were historically considered to be "synonymous" and therefore expected to be used at rather equal frequencies in all genomes, we now know that this is not the case. Indeed, since several coding sequences were obtained in the late '70s and early '80s of the last century, coming from either the same or different species, it was evident that (a) each genome, taken globally, displayed different codon usage patterns, which means that different genomes display a particular global codon usage table when all genes are considered together, and (b) there is a strong intragenomic diversity: in other words, within a given species the codon usage pattern can (and usually does) differ greatly among genes in the same genome. These different patterns were attributed to two main factors: first, the mutational bias characteristic of each genome, which determines that GC-poor species display a general bias towards A/T codons while the reverse is true for GC-rich species. Second, the differences in codon usage among genes from the same species are due to natural selection acting at the level of translation, in such a way that highly expressed genes tend to use codons that match the most abundant isoacceptor tRNAs. Thus, these genes are translated at the highest rate, which in turn helps to avoid the limiting factor in translation, namely the number of available ribosomes per cell. Although these explanations are still valid, new factors are almost constantly postulated to affect codon usage. In this mini review, we shall try to summarize them.
Affiliation(s)
- Andrés Iriarte
- Laboratorio de Genómica Evolutiva, Depto. de Biología Celular y Molecular, Facultad de Ciencias, Universidad de la República, 11400, Montevideo, Uruguay; Laboratorio de Biología Computacional, Depto. de Desarrollo Biotecnológico, Instituto de Higiene, Facultad de Medicina, Universidad de la República, 11600, Montevideo, Uruguay
- Guillermo Lamolle
- Laboratorio de Genómica Evolutiva, Depto. de Biología Celular y Molecular, Facultad de Ciencias, Universidad de la República, 11400, Montevideo, Uruguay
- Héctor Musto
- Laboratorio de Genómica Evolutiva, Depto. de Biología Celular y Molecular, Facultad de Ciencias, Universidad de la República, 11400, Montevideo, Uruguay.
|
45
|
Literman R, Schwartz R. Genome-Scale Profiling Reveals Noncoding Loci Carry Higher Proportions of Concordant Data. Mol Biol Evol 2021; 38:2306-2318. [PMID: 33528497 PMCID: PMC8136493 DOI: 10.1093/molbev/msab026] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Indexed: 12/11/2022] Open
Abstract
Many evolutionary relationships remain controversial despite whole-genome sequencing data. These controversies arise, in part, due to challenges associated with accurately modeling the complex phylogenetic signal coming from genomic regions experiencing distinct evolutionary forces. Here, we examine how different regions of the genome support or contradict well-established relationships among three mammal groups using millions of orthologous parsimony-informative biallelic sites (PIBS) distributed across primate, rodent, and Pecora genomes. We compared PIBS concordance percentages among locus types (e.g. coding sequences (CDS), introns, intergenic regions), and contrasted PIBS utility over evolutionary timescales. Sites derived from noncoding sequences provided more data and proportionally more concordant sites compared with those from CDS in all clades. CDS PIBS were also predominant drivers of tree incongruence in two cases of topological conflict. PIBS derived from most locus types provided surprisingly consistent support for splitting events spread across the timescales we examined, although we find evidence that CDS and intronic PIBS may, respectively and to a limited degree, inform disproportionately about older and younger splits. In this era of accessible whole-genome sequence data, these results: 1) suggest benefits to more intentionally focusing on noncoding loci as robust data for tree inference and 2) reinforce the importance of accurate modeling, especially when using CDS data.
Affiliation(s)
- Robert Literman
- Department of Biological Sciences, University of Rhode Island, South Kingstown, RI, USA; Center for Food Safety and Applied Nutrition, Office of Regulatory Science, U.S. Food and Drug Administration, College Park, MD, USA
- Rachel Schwartz
- Department of Biological Sciences, University of Rhode Island, South Kingstown, RI, USA
|
46
|
Weak selection on synonymous codons substantially inflates dN/dS estimates in bacteria. Proc Natl Acad Sci U S A 2021; 118:2023575118. [PMID: 33972434 DOI: 10.1073/pnas.2023575118] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Indexed: 01/21/2023] Open
Abstract
Synonymous codon substitutions are not always selectively neutral as revealed by several types of analyses, including studies of codon usage patterns among genes. We analyzed codon usage in 13 bacterial genomes sampled from across a large order of bacteria, Enterobacterales, and identified presumptively neutral and selected classes of synonymous substitutions. To estimate substitution rates, given a neutral/selected classification of synonymous substitutions, we developed a flexible dN/dS substitution model that allows multiple classes of synonymous substitutions. Under this multiclass synonymous substitution (MSS) model, the denominator of dN/dS includes only the strictly neutral class of synonymous substitutions. On average, the value of dN/dS under the MSS model was 80% of that under the standard codon model in which all synonymous substitutions are assumed to be neutral. The indication is that conventional dN/dS analyses overestimate these values and thus overestimate the frequency of positive diversifying selection and underestimate the strength of purifying selection. To quantify the strength of selection necessary to explain this reduction, we developed a model of selected compensatory codon substitutions. The reduction in synonymous substitution rate, and thus the contribution that selection makes to codon bias variation among genes, can be adequately explained by very weak selection, with a mean product of population size and selection coefficient, [Formula: see text].
|
47
|
Mordstein C, Cano L, Morales AC, Young B, Ho AT, Rice AM, Liss M, Hurst LD, Kudla G. Transcription, mRNA export and immune evasion shape the codon usage of viruses. Genome Biol Evol 2021; 13:6275682. [PMID: 33988683 PMCID: PMC8410142 DOI: 10.1093/gbe/evab106] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Accepted: 05/10/2021] [Indexed: 12/15/2022] Open
Abstract
The nucleotide composition, dinucleotide composition, and codon usage of many viruses differ from those of their hosts. These differences arise because viruses are subject to unique mutation and selection pressures that do not apply to host genomes; however, the molecular mechanisms that underlie these evolutionary forces are unclear. Here, we analysed the patterns of codon usage in 1,520 vertebrate-infecting viruses, focusing on parameters known to be under selection and associated with gene regulation. We find that GC content, dinucleotide content, and splicing and m6A modification-related sequence motifs are associated with the type of genetic material (DNA or RNA), strandedness, and replication compartment of viruses. In an experimental follow-up, we find that the effects of GC content on gene expression depend on whether the genetic material is delivered to the cell as DNA or mRNA, whether it is transcribed by endogenous or exogenous RNA polymerase, and whether transcription takes place in the nucleus or cytoplasm. Our results suggest that viral codon usage cannot be explained by a simple adaptation to the codon usage of the host; instead, it reflects the combination of multiple selective and mutational pressures, including the need for efficient transcription, export, and immune evasion.
Affiliation(s)
- Christine Mordstein
- MRC Human Genetics Unit, Institute for Genetics and Molecular Medicine, The University of Edinburgh, Edinburgh, UK; The Milner Centre for Evolution, Department of Biology and Biochemistry, University of Bath, Bath, BA2 7AY, UK
- Laura Cano
- MRC Human Genetics Unit, Institute for Genetics and Molecular Medicine, The University of Edinburgh, Edinburgh, UK
- Atahualpa Castillo Morales
- The Milner Centre for Evolution, Department of Biology and Biochemistry, University of Bath, Bath, BA2 7AY, UK
- Bethan Young
- MRC Human Genetics Unit, Institute for Genetics and Molecular Medicine, The University of Edinburgh, Edinburgh, UK; The Milner Centre for Evolution, Department of Biology and Biochemistry, University of Bath, Bath, BA2 7AY, UK
- Alexander T Ho
- The Milner Centre for Evolution, Department of Biology and Biochemistry, University of Bath, Bath, BA2 7AY, UK
- Alan M Rice
- The Milner Centre for Evolution, Department of Biology and Biochemistry, University of Bath, Bath, BA2 7AY, UK
- Michael Liss
- Thermo Fisher Scientific, GENEART GmbH, Regensburg, Germany
- Laurence D Hurst
- The Milner Centre for Evolution, Department of Biology and Biochemistry, University of Bath, Bath, BA2 7AY, UK
- Grzegorz Kudla
- MRC Human Genetics Unit, Institute for Genetics and Molecular Medicine, The University of Edinburgh, Edinburgh, UK
|
48
|
Boman J, Mugal CF, Backström N. The Effects of GC-Biased Gene Conversion on Patterns of Genetic Diversity among and across Butterfly Genomes. Genome Biol Evol 2021; 13:evab064. [PMID: 33760095 PMCID: PMC8175052 DOI: 10.1093/gbe/evab064] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Accepted: 03/22/2021] [Indexed: 12/28/2022]
Abstract
Recombination reshuffles the alleles of a population through crossover and gene conversion. These mechanisms have considerable consequences for the evolution and maintenance of genetic diversity. Crossover, for example, can increase genetic diversity by breaking the linkage between selected and nearby neutral variants. Bias in favor of G or C alleles during gene conversion may instead promote the fixation of one allele over the other, thus decreasing diversity. Mutation bias from G or C to A and T opposes GC-biased gene conversion (gBGC). Less recognized is that these two processes may, when balanced, promote genetic diversity. Here, we investigate how gBGC and mutation bias shape genetic diversity patterns in wood white butterflies (Leptidea sp.). This constitutes the first in-depth investigation of gBGC in butterflies. Using 60 resequenced genomes from six populations of three species, we find substantial variation in the strength of gBGC across lineages. When modeling the balance of gBGC and mutation bias and comparing analytical results with empirical data, we reject gBGC as the main determinant of genetic diversity in these butterfly species. As alternatives, we consider linked selection and GC content. We find evidence that high values of both reduce diversity. We also show that the joint effects of gBGC and mutation bias can give rise to a diversity pattern that resembles the signature of linked selection. Consequently, gBGC should be considered when interpreting the effects of linked selection on levels of genetic diversity.
Affiliation(s)
- Jesper Boman
- Evolutionary Biology Program, Department of Ecology and Genetics (IEG), Uppsala University, Sweden
- Carina F Mugal
- Evolutionary Biology Program, Department of Ecology and Genetics (IEG), Uppsala University, Sweden
- Niclas Backström
- Evolutionary Biology Program, Department of Ecology and Genetics (IEG), Uppsala University, Sweden
|
49
|
Callens M, Pradier L, Finnegan M, Rose C, Bedhomme S. Read between the lines: Diversity of non-translational selection pressures on local codon usage. Genome Biol Evol 2021; 13:6263832. [PMID: 33944930 PMCID: PMC8410138 DOI: 10.1093/gbe/evab097] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Accepted: 04/28/2021] [Indexed: 12/14/2022]
Abstract
Protein-coding genes can contain specific motifs within their nucleotide sequence that function as a signal for various biological pathways. The presence of such sequence motifs within a gene can have beneficial or detrimental effects on the phenotype and fitness of an organism, and this can lead to the enrichment or avoidance of this sequence motif. The degeneracy of the genetic code allows for the existence of alternative synonymous sequences that exclude or include these motifs, while keeping the encoded amino acid sequence intact. This implies that locally, there can be a selective pressure for preferentially using a codon over its synonymous alternative in order to avoid or enrich a specific sequence motif. This selective pressure could, in addition to mutation, drift and selection for translation efficiency and accuracy, contribute to shaping the codon usage bias. In this review, we discuss patterns of avoidance of (or enrichment for) the various biological signals contained in specific nucleotide sequence motifs: transcription and translation initiation and termination signals, mRNA maturation signals, and antiviral immune system targets. Experimental data on the phenotypic or fitness effects of synonymous mutations in these sequence motifs confirm that they can be targets of local selection pressures on codon usage. We also formulate the hypothesis that transposable elements could have a similar impact on codon usage through their preferred integration sequences. Overall, selection on codon usage appears to be a combination of a global selection pressure imposed by the translation machinery, and a patchwork of local selection pressures related to biological signals contained in specific sequence motifs.
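The point about degeneracy in this abstract can be made concrete with a small sketch: for a short coding sequence, enumerate every synonymous recoding and keep those that lack a given motif. This is an illustrative toy and not a method from the review. The function names are hypothetical, the standard genetic code is assumed, and the example motif `AATAAA` (the canonical polyadenylation signal) and the 12-nucleotide input are chosen only for demonstration. Exhaustive enumeration grows exponentially with length, so it is suitable only for short windows.

```python
from itertools import product

# Standard genetic code, with codons ordered by base in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TO_AA = {
    a + b + c: aa
    for (a, b, c), aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Map each amino acid (or stop, "*") to all codons that encode it.
SYNONYMS: dict[str, list[str]] = {}
for codon, aa in CODON_TO_AA.items():
    SYNONYMS.setdefault(aa, []).append(codon)


def split_codons(seq: str) -> list[str]:
    """Split an in-frame DNA coding sequence into codons."""
    seq = seq.upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return [seq[i:i + 3] for i in range(0, len(seq), 3)]


def translate(seq: str) -> str:
    """Translate an in-frame DNA coding sequence into one-letter amino acids."""
    return "".join(CODON_TO_AA[codon] for codon in split_codons(seq))


def motif_free_synonyms(seq: str, motif: str) -> list[str]:
    """All synonymous recodings of seq that do not contain motif (hypothetical helper)."""
    choices = [SYNONYMS[CODON_TO_AA[codon]] for codon in split_codons(seq)]
    variants = ("".join(combo) for combo in product(*choices))
    return [v for v in variants if motif.upper() not in v]


if __name__ == "__main__":
    cds = "ATGAATAAAGGC"  # encodes M-N-K-G and contains AATAAA
    alternatives = motif_free_synonyms(cds, "AATAAA")
    print(translate(cds), len(alternatives))
    print(alternatives[:3])
```

Every returned sequence encodes the same peptide as the input, so a local selective pressure against the motif can act on codon choice alone without changing the protein.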
Affiliation(s)
- Martijn Callens
- Centre d'Ecologie Fonctionnelle et Evolutive, CNRS, Université de Montpellier, Université Paul Valéry Montpellier 3, Ecole Pratique des Hautes Etudes, Institut de Recherche pour le Développement, 34000 Montpellier, France
- Léa Pradier
- Centre d'Ecologie Fonctionnelle et Evolutive, CNRS, Université de Montpellier, Université Paul Valéry Montpellier 3, Ecole Pratique des Hautes Etudes, Institut de Recherche pour le Développement, 34000 Montpellier, France
- Michael Finnegan
- Centre d'Ecologie Fonctionnelle et Evolutive, CNRS, Université de Montpellier, Université Paul Valéry Montpellier 3, Ecole Pratique des Hautes Etudes, Institut de Recherche pour le Développement, 34000 Montpellier, France
- Caroline Rose
- Centre d'Ecologie Fonctionnelle et Evolutive, CNRS, Université de Montpellier, Université Paul Valéry Montpellier 3, Ecole Pratique des Hautes Etudes, Institut de Recherche pour le Développement, 34000 Montpellier, France
- Stéphanie Bedhomme
- Centre d'Ecologie Fonctionnelle et Evolutive, CNRS, Université de Montpellier, Université Paul Valéry Montpellier 3, Ecole Pratique des Hautes Etudes, Institut de Recherche pour le Développement, 34000 Montpellier, France
|
50
|
Muyle A, Ross-Ibarra J, Seymour DK, Gaut BS. Gene body methylation is under selection in Arabidopsis thaliana. Genetics 2021; 218:6237897. [PMID: 33871638 PMCID: PMC8225343 DOI: 10.1093/genetics/iyab061] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Received: 03/09/2021] [Accepted: 04/07/2021] [Indexed: 11/28/2022]
Abstract
In plants, mammals and insects, some genes are methylated in the CG dinucleotide context, a phenomenon called gene body methylation (gbM). It has been controversial whether this phenomenon has any functional role. Here, we took advantage of the availability of 876 leaf methylomes in Arabidopsis thaliana to characterize the population frequency of methylation at the gene level and to estimate the site-frequency spectrum of allelic states. Using a population genetics model specifically designed for epigenetic data, we found that genes with ancestral gbM are under significant selection to remain methylated. Conversely, ancestrally unmethylated genes were under selection to remain unmethylated. Repeating the analyses at the level of individual cytosines confirmed these results. Estimated selection coefficients were small, on the order of 4N_e s = 1.4, which is similar to the magnitude of selection acting on codon usage. We also estimated that A. thaliana is losing gbM threefold more rapidly than gaining it, which could be due to a recent reduction in the efficacy of selection after a switch to selfing. Finally, we investigated the potential function of gbM through its link with gene expression. Across genes with polymorphic methylation states, the expression of gene body methylated alleles was consistently and significantly higher than that of unmethylated alleles. Although it is difficult to disentangle genetic from epigenetic effects, our work suggests that gbM has a small but measurable effect on fitness, perhaps due to its association with a phenotype such as gene expression.
Affiliation(s)
- Aline Muyle
- Ecology and Evolutionary Biology, University of California, Irvine, Irvine, CA 92697-2525, USA
- Jeffrey Ross-Ibarra
- Evolution and Ecology, Center for Population Biology and Genome Center, University of California, Davis, Davis, CA 95616, USA
- Danelle K Seymour
- Botany & Plant Sciences, University of California, Riverside, Riverside, CA 92521, USA
- Brandon S Gaut
- Ecology and Evolutionary Biology, University of California, Irvine, Irvine, CA 92697-2525, USA
|