1
Roberts M, Josephs EB. Weaker selection on genes with treatment-specific expression consistent with a limit on plasticity evolution in Arabidopsis thaliana. Genetics 2023; 224:iyad074. [PMID: 37094602 PMCID: PMC10484170 DOI: 10.1093/genetics/iyad074] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/06/2023] [Revised: 03/06/2023] [Accepted: 04/07/2023] [Indexed: 04/26/2023]
Abstract
Differential gene expression between environments often underlies phenotypic plasticity. However, environment-specific expression patterns are hypothesized to relax selection on genes, and thus limit plasticity evolution. We collated over 27 terabases of RNA-sequencing data on Arabidopsis thaliana from over 300 peer-reviewed studies and 200 treatment conditions to investigate this hypothesis. Consistent with relaxed selection, genes with more treatment-specific expression have higher levels of nucleotide diversity and divergence at nonsynonymous sites but lack stronger signals of positive selection. This result persisted even after controlling for expression level, gene length, GC content, the tissue specificity of expression, and technical variation between studies. Overall, our investigation supports the existence of a hypothesized trade-off between the environment specificity of a gene's expression and the strength of selection on said gene in A. thaliana. Future studies should leverage multiple genome-scale datasets to tease apart the contributions of many variables in limiting plasticity evolution.
Affiliation(s)
- Miles Roberts
  - Genetics and Genome Sciences Program, Michigan State University, East Lansing, MI 48824, USA
- Emily B Josephs
  - Department of Plant Biology, Michigan State University, East Lansing, MI 48824, USA
  - Ecology, Evolution, and Behavior Program, Michigan State University, East Lansing, MI 48824, USA
2
Lin C, Zhang L, Zhang X, Wang X, Wang C, Zhang Y, Wang J, Li X, Song Z. Spatiotemporal and Transcriptional Characterization on Tanshinone Initial Synthesis in Salvia miltiorrhiza Roots. Int J Mol Sci 2022; 23:ijms232113607. [PMID: 36362395 PMCID: PMC9655840 DOI: 10.3390/ijms232113607] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 09/23/2022] [Revised: 10/29/2022] [Accepted: 11/01/2022] [Indexed: 11/11/2022]
Abstract
Tanshinones are the bioactive constituents of Danshen (Salvia miltiorrhiza Bunge), which is used in Traditional Chinese Medicine to treat cardiovascular and other diseases; they are synthesized and accumulate in the root periderm of S. miltiorrhiza. However, the initial stage of tanshinone synthesis, and the root structure and gene expression characteristics associated with it, have not been reported. The present study aims to provide new insights into how these bioactive compounds begin to be synthesized by characterizing possible differences in their biosynthesis and accumulation during early root development from both spatial and temporal aspects. The morphological characteristics and tanshinone content of S. miltiorrhiza roots were investigated in detail by monitoring seedlings within 65 days after germination (DAG). ONT transcriptome sequencing was applied to investigate gene expression patterns. The periderm of the S. miltiorrhiza storage taproot began to synthesize tanshinone at about 30 DAG. Three critical stages of tanshinone synthesis were preliminarily determined: preparation, initial synthesis, and continuous rapid synthesis. The difference between taproots in the first two stages was the smallest, and the differentially expressed genes (DEGs) were mainly enriched in terpene synthesis. Most genes involved in tanshinone synthesis were upregulated during the gradual formation of the red taproot. Plant hormone signal transduction and ABC transport pathways were widely involved in S. miltiorrhiza taproot development. Five candidate genes that may participate in or regulate tanshinone synthesis were screened according to their co-expression pattern. Moreover, photosynthetic ferredoxin (FD), cytochrome P450 reductase (CPR), and CCAAT binding transcription factor (CBF) were predicted to interact directly with the known downstream essential enzyme genes. These results provide a necessary basis for analyzing the initial synthesis and regulation mechanism of tanshinones.
Affiliation(s)
- Caicai Lin
  - Agronomy College, Shandong Agricultural University, Tai’an 271018, China
- Lin Zhang
  - Agronomy College, Shandong Agricultural University, Tai’an 271018, China
- Xia Zhang
  - Agronomy College, Shandong Agricultural University, Tai’an 271018, China
- Xin Wang
  - Agronomy College, Shandong Agricultural University, Tai’an 271018, China
- Chaoyang Wang
  - Agronomy College, Shandong Agricultural University, Tai’an 271018, China
- Yufeng Zhang
  - Agronomy College, Shandong Agricultural University, Tai’an 271018, China
- Jianhua Wang
  - Agronomy College, Shandong Agricultural University, Tai’an 271018, China
  - State Key Laboratory of Crop Biology, Shandong Agricultural University, Tai’an 271018, China
- Xingfeng Li
  - Agronomy College, Shandong Agricultural University, Tai’an 271018, China
  - State Key Laboratory of Crop Biology, Shandong Agricultural University, Tai’an 271018, China
  - Correspondence: (X.L.); (Z.S.)
- Zhenqiao Song
  - Agronomy College, Shandong Agricultural University, Tai’an 271018, China
  - State Key Laboratory of Crop Biology, Shandong Agricultural University, Tai’an 271018, China
  - Correspondence: (X.L.); (Z.S.)
3
Luzuriaga-Neira A, Subramanian K, Alvarez-Ponce D. Functional compensation of mouse duplicates by their paralogs expressed in the same tissues. Genome Biol Evol 2022; 14:evac126. [PMID: 35945673 PMCID: PMC9387915 DOI: 10.1093/gbe/evac126] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 07/22/2022] [Accepted: 07/30/2022] [Indexed: 11/14/2022]
Abstract
Analyses in a number of organisms have shown that duplicated genes are less likely to be essential than singletons. This implies that genes can often compensate for the loss of their paralogs. However, it is unclear why the loss of some duplicates can be compensated by their paralogs, whereas the loss of other duplicates cannot. Surprisingly, initial analyses in mice did not detect differences in the essentiality of duplicates and singletons. Only subsequent analyses, using larger gene knockout datasets and controlling for a number of confounding factors, did detect significant differences. Previous studies have not taken into account the tissues in which duplicates are expressed. We hypothesized that in complex organisms, in order for a gene's loss to be compensated by one or more of its paralogs, such paralogs need to be expressed in at least the same set of tissues as the lost gene. To test our hypothesis, we classified mouse duplicates into two categories based on the expression patterns of their paralogs: "compensable duplicates" (those with paralogs expressed in all the tissues in which the gene is expressed) and "non-compensable duplicates" (those whose paralogs are not expressed in all the tissues where the gene is expressed). In agreement with our hypothesis, the essentiality of non-compensable duplicates is similar to that of singletons, whereas compensable duplicates exhibit a substantially lower essentiality. Our results imply that duplicates can often compensate for the loss of their paralogs, but only if they are expressed in the same tissues. Indeed, the compensation ability is more dependent on expression patterns than on protein sequence similarity. The existence of these two kinds of duplicates with different essentialities, which has been overlooked by prior studies, may have hindered the detection of differences between singletons and duplicates.
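The two-category classification described in this abstract can be sketched in a few lines of Python. This is only an illustrative sketch, not the authors' pipeline: the function name `classify_duplicate`, the toy gene and tissue names, and the choice to pool all paralogs (so that coverage by the union of their tissues counts) are assumptions made here, since the abstract does not specify these details.

```python
from typing import Dict, Set


def classify_duplicate(gene: str,
                       expression: Dict[str, Set[str]],
                       paralogs: Dict[str, Set[str]]) -> str:
    """Label a gene as 'singleton', 'compensable' or 'non-compensable'."""
    own_tissues = expression.get(gene, set())
    partner_ids = paralogs.get(gene, set())
    if not partner_ids:
        return "singleton"
    # Assumption: paralogs are pooled, so a tissue counts as covered
    # if at least one paralog is expressed there.
    covered: Set[str] = set()
    for partner in partner_ids:
        covered |= expression.get(partner, set())
    # Compensable only if every tissue of the gene is covered.
    return "compensable" if own_tissues <= covered else "non-compensable"


# Hypothetical toy data: tissues in which each gene is expressed.
expression = {
    "A": {"liver", "brain"},
    "B": {"liver", "brain", "heart"},
    "D": {"kidney"},
}
# Hypothetical paralog relationships (D has none).
paralogs = {"A": {"B"}, "B": {"A"}}

labels = {g: classify_duplicate(g, expression, paralogs) for g in expression}
```

In this toy example, gene A is compensable because B is expressed in both of A's tissues, whereas B is non-compensable because A is not expressed in heart. Note that a gene with no recorded expression would be labeled compensable under this sketch, which a real analysis would probably want to filter out beforehand.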
4
Ding DW, Sun X. Relating Translation Efficiency to Protein Networks Provides Evolutionary Insights in Shewanella and Its Implications for Extracellular Electron Transfer. IEEE/ACM Transactions on Computational Biology and Bioinformatics 2022; 19:605-613. [PMID: 32750850 DOI: 10.1109/tcbb.2020.2996295] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/11/2023]
Abstract
Shewanella species are well known for their extracellular electron transfer (EET) capacity, by which these microorganisms transfer electrons from the intracellular environment to the extracellular space to reduce insoluble extracellular electron acceptors. Using time-stamped paired protein-mRNA data, we investigate the impact of differential translation on the EET process of Shewanella oneidensis MR-1. First, we used a soft clustering method to identify proteins that are differentially translated when O2 levels are switched from high to low; 629 up-regulated and 767 down-regulated translated proteins are considered to reflect the change from an inactivated to an activated EET process. Then, we showed that the degrees of connectivity of differentially translated proteins were significantly larger than those of non-differentially translated proteins, suggesting that these differentially translated proteins are more important in the protein networks. After that, we connected these differentially translated proteins to construct differentially translated sub-networks, and used centralization analysis of these sub-networks to discuss the most important proteins involved in the EET process. Furthermore, we also studied the differentially translated operonic genes. Taken together, this work searches for the key proteins that potentially activate the EET process from a translational efficiency viewpoint.
5
Huang R, Xie X, Chen A, Li F, Tian E, Chao Z. The chloroplast genomes of four Bupleurum (Apiaceae) species endemic to Southwestern China, a diversity center of the genus, as well as their evolutionary implications and phylogenetic inferences. BMC Genomics 2021; 22:714. [PMID: 34600494 PMCID: PMC8487540 DOI: 10.1186/s12864-021-08008-z] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Received: 10/25/2020] [Accepted: 09/13/2021] [Indexed: 11/28/2022]
Abstract
Background: As one of the largest genera in Apiaceae, Bupleurum L. is well known for its high medicinal value. The genus has frequently attracted the attention of evolutionary biologists and taxonomists for its distinctive characteristics within the Apiaceae family. Although some chloroplast genome data are now available, the changes in chloroplast genome structure and the selective pressure in the genus are not fully understood. In addition, few of the species are endemic to Southwest China, a distribution and diversity center of Chinese Bupleurum. Endemic species are key components of biodiversity and ecosystems, and investigating the chloroplast genome features of endemic Bupleurum species will help to develop a better understanding of the evolutionary process and phylogeny of the genus. In this study, we analyzed the whole chloroplast genome sequences of 4 Bupleurum species endemic to Southwest China in comparison with published data for 17 Bupleurum species to determine the evolutionary characteristics of the genus and the phylogenetic relationships of Asian Bupleurum. Results: The complete chloroplast genome sequences of the 4 endemic Bupleurum species are 155,025 bp to 155,323 bp in length, including an SSC and an LSC region separated by a pair of IRs. Comparative analysis revealed identical chloroplast gene content across the 21 Bupleurum species, with a total of 114 unique genes (30 tRNA genes, 4 rRNA genes and 80 protein-coding genes). The chloroplast genomes of the 21 Bupleurum species showed no rearrangements and a high sequence identity (96.4–99.2%). They also shared a similar tendency of SDRs and SSRs, but differed in number (59–83). In spite of their high conservation, they contained some mutational hotspots, which can potentially be exploited as high-resolution DNA barcodes for species discrimination. Selective pressure analysis showed that four genes were under positive selection. Phylogenetic analysis revealed that the 21 Bupleurum species formed two major clades, which are likely to correspond to their geographical distribution. Conclusions: The chloroplast genome data of the four endemic Bupleurum species provide important insights into the characteristics and evolution of chloroplast genomes of this genus, and into the phylogeny of Bupleurum. Supplementary Information: The online version contains supplementary material available at 10.1186/s12864-021-08008-z.
Affiliation(s)
- Rong Huang
  - Department of Pharmacy, Zhujiang Hospital, Southern Medical University, Guangzhou, 510282, China
- Xuena Xie
  - Department of Pharmacy, Zhujiang Hospital, Southern Medical University, Guangzhou, 510282, China
- Aimin Chen
  - Department of Pharmacy, Zhujiang Hospital, Southern Medical University, Guangzhou, 510282, China
- Fang Li
  - Department of Pharmacy, Zhujiang Hospital, Southern Medical University, Guangzhou, 510282, China
- Enwei Tian
  - Department of Pharmacy, Zhujiang Hospital, Southern Medical University, Guangzhou, 510282, China
- Zhi Chao
  - Department of Pharmacy, Zhujiang Hospital, Southern Medical University, Guangzhou, 510282, China
  - Faculty of Medicinal Plants and Pharmacognosy, School of Traditional Chinese Medicine, Southern Medical University, Guangzhou, 510515, China
  - Guangdong Provincial Key Laboratory of Chinese Medicine Pharmaceutics, Guangzhou, 510515, China
6
Yan Y, Li Z, Li Y, Wu Z, Yang R. Correlated Evolution of Large DNA Fragments in the 3D Genome of Arabidopsis thaliana. Mol Biol Evol 2021; 37:1621-1636. [PMID: 32044988 DOI: 10.1093/molbev/msaa031] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/13/2022]
Abstract
In eukaryotes, the three-dimensional (3D) conformation of the genome is far from random, and this nonrandom chromatin organization is strongly correlated with gene expression and protein function, which are two critical determinants of the selective constraints and evolutionary rates of genes. However, whether genes and other elements that are located close to each other in the 3D genome evolve in a coordinated way has not been investigated in any organism. To address this question, we constructed chromatin interaction networks (CINs) in Arabidopsis thaliana based on high-throughput chromosome conformation capture data and demonstrated that adjacent large DNA fragments in the CIN indeed exhibit more similar levels of polymorphism and evolutionary rates than random fragment pairs. Using simulations that account for the linear distance between fragments, we proved that the 3D chromosomal organization plays a role in the observed correlated evolution. Spatially interacting fragments also exhibit more similar mutation rates and functional constraints in both coding and noncoding regions than the random expectations, indicating that the correlated evolution between 3D neighbors is a result of combined evolutionary forces. A collection of 39 genomic and epigenomic features can explain much of the variance in genetic diversity and evolutionary rates across the genome. Moreover, features that have a greater effect on the evolution of regional sequences tend to show higher similarity between neighboring fragments in the CIN, suggesting a pivotal role of epigenetic modifications and chromatin organization in determining the correlated evolution of large DNA fragments in the 3D genome.
Affiliation(s)
- Yubin Yan
  - College of Life Sciences, Northwest A&F University, Yangling, Shaanxi, China
- Zhaohong Li
  - College of Life Sciences, Northwest A&F University, Yangling, Shaanxi, China
- Ye Li
  - College of Life Sciences, Northwest A&F University, Yangling, Shaanxi, China
- Zefeng Wu
  - College of Life Sciences, Northwest A&F University, Yangling, Shaanxi, China
- Ruolin Yang
  - College of Life Sciences, Northwest A&F University, Yangling, Shaanxi, China
7
Alvarez-Ponce D, Aguilar-Rodríguez J, Fares MA. Molecular Chaperones Accelerate the Evolution of Their Protein Clients in Yeast. Genome Biol Evol 2020; 11:2360-2375. [PMID: 31297528 PMCID: PMC6735891 DOI: 10.1093/gbe/evz147] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Accepted: 07/05/2019] [Indexed: 12/23/2022]
Abstract
Protein stability is a major constraint on protein evolution. Molecular chaperones, also known as heat-shock proteins, can relax this constraint and promote protein evolution by diminishing the deleterious effect of mutations on protein stability and folding. This effect, however, has only been established for a few chaperones. Here, we use a comprehensive chaperone–protein interaction network to study the effect of all yeast chaperones on the evolution of their protein substrates, that is, their clients. In particular, we analyze how yeast chaperones affect the evolutionary rates of their clients at two very different evolutionary time scales. We first study the effect of chaperone-mediated folding on protein evolution over the evolutionary divergence of Saccharomyces cerevisiae and S. paradoxus. We then test whether yeast chaperones have left a similar signature on the patterns of standing genetic variation found in modern wild and domesticated strains of S. cerevisiae. We find that genes encoding chaperone clients have diverged faster than genes encoding non-client proteins when controlling for their number of protein–protein interactions. We also find that genes encoding client proteins have accumulated more intraspecific genetic diversity than those encoding non-client proteins. In a number of multivariate analyses, controlling for other well-known factors that affect protein evolution, we find that chaperone dependence explains the largest fraction of the observed variance in the rate of evolution at both evolutionary time scales. Chaperones affecting rates of protein evolution mostly belong to two major chaperone families: Hsp70s and Hsp90s. Our analyses show that protein chaperones, by virtue of their ability to buffer destabilizing mutations and their role in modulating protein genotype–phenotype maps, have a considerable accelerating effect on protein evolution.
Affiliation(s)
- David Alvarez-Ponce
  - Biology Department, University of Nevada, Reno
  - Instituto de Biología Molecular y Celular de Plantas, CSIC-UPV, Valencia, Spain
- José Aguilar-Rodríguez
  - Department of Biology, Stanford University, CA
  - Department of Chemical and Systems Biology, Stanford University School of Medicine, CA
- Mario A Fares
  - Instituto de Biología Molecular y Celular de Plantas, CSIC-UPV, Valencia, Spain
  - Smurfit Institute of Genetics, University of Dublin, Trinity College Dublin, Ireland
8
Defoort J, Van de Peer Y, Carretero-Paulet L. The Evolution of Gene Duplicates in Angiosperms and the Impact of Protein-Protein Interactions and the Mechanism of Duplication. Genome Biol Evol 2020; 11:2292-2305. [PMID: 31364708 PMCID: PMC6735927 DOI: 10.1093/gbe/evz156] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Accepted: 07/10/2019] [Indexed: 01/17/2023]
Abstract
Gene duplicates, generated through either whole genome duplication (WGD) or small-scale duplication (SSD), are prominent in angiosperms and are believed to play an important role in adaptation and in generating evolutionary novelty. Previous studies reported contrasting evolutionary and functional dynamics of duplicate genes depending on the mechanism of origin, a behavior that is hypothesized to stem from constraints to maintain the relative dosage balance between the genes concerned and their interaction context. However, the mechanisms ultimately influencing loss and retention of gene duplicates over evolutionary time are not yet fully elucidated. Here, by using a robust classification of gene duplicates in Arabidopsis thaliana, Solanum lycopersicum, and Zea mays, large RNAseq expression compendia and an extensive protein-protein interaction (PPI) network from Arabidopsis, we investigated the impact of PPIs on the differential evolutionary and functional fate of WGD and SSD duplicates. In all three species, retained WGD duplicates show stronger constraints to diverge at the sequence and expression level than SSD ones, a pattern that is also observed for shared PPI partners between Arabidopsis duplicates. PPIs are preferentially distributed among WGD duplicates and specific functional categories. Furthermore, duplicates with PPIs tend to be under stronger constraints to evolve than their counterparts without PPIs regardless of their mechanism of origin. Our results support dosage balance constraint as a specific property of genes involved in biological interactions, including physical PPIs, and suggest that additional factors may be differently influencing the evolution of genes following duplication, depending on the species, time, and mechanism of origin.
Affiliation(s)
- Jonas Defoort
  - Department of Plant Biotechnology and Bioinformatics, Ghent University, Belgium
  - VIB Center for Plant Systems Biology, Ghent, Belgium
  - Bioinformatics Institute Ghent, Ghent University, Belgium
- Yves Van de Peer
  - Department of Plant Biotechnology and Bioinformatics, Ghent University, Belgium
  - VIB Center for Plant Systems Biology, Ghent, Belgium
  - Bioinformatics Institute Ghent, Ghent University, Belgium
  - Department of Biochemistry, Genetics and Microbiology, University of Pretoria, South Africa
- Lorenzo Carretero-Paulet
  - Department of Plant Biotechnology and Bioinformatics, Ghent University, Belgium
  - VIB Center for Plant Systems Biology, Ghent, Belgium
  - Bioinformatics Institute Ghent, Ghent University, Belgium
9
Mustafin ZS, Zamyatin VI, Konstantinov DK, Doroshkov AV, Lashin SA, Afonnikov DA. Phylostratigraphic Analysis Shows the Earliest Origination of the Abiotic Stress Associated Genes in A. thaliana. Genes (Basel) 2019; 10:genes10120963. [PMID: 31766757 PMCID: PMC6947294 DOI: 10.3390/genes10120963] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Received: 08/28/2019] [Revised: 11/16/2019] [Accepted: 11/18/2019] [Indexed: 12/27/2022]
Abstract
Plants constantly contend with stress factors such as high or low temperature, drought, soil salinity, and flooding. Plants have evolved a set of stress response mechanisms, which involve physiological and biochemical changes that result in adaptive or morphological changes. At the molecular level, the stress response in plants is carried out by genetic networks, which also change in the course of evolution. Studying network structure and evolution may highlight mechanisms of plant adaptation to adverse conditions and of the response to stress, and may help in the discovery and functional characterization of stress-related genes. We analyzed Arabidopsis thaliana genes associated with several types of abiotic stress (heat, cold, water-related, light, osmotic, salt, and oxidative) at the network level using a phylostratigraphic approach. Our results show that a substantial fraction of genes associated with various types of abiotic stress is of ancient origin and evolves under strong purifying selection. The interaction networks of genes associated with stress response have a modular structure, with a regulatory component being one of the largest for five of the seven stress types. We demonstrated a positive relationship between the number of interactions of a gene in the stress gene network and its age. Moreover, genes of the same age tend to be connected in stress gene networks. We also demonstrated that old stress-related genes usually participate in the response to various types of stress and are involved in numerous biological processes unrelated to stress. Our results demonstrate that the stress response genes represent an ancient and fundamental molecular system in plants.
Affiliation(s)
- Zakhar S. Mustafin
  - The Institute of Cytology and Genetics of the Siberian Branch of the Russian Academy of Sciences (IC & G SB RAS), 630090 Novosibirsk, Russia
  - Kurchatov Genomics Center, Institute of Cytology and Genetics, SB RAS, 630090 Novosibirsk, Russia
- Vladimir I. Zamyatin
  - The Institute of Cytology and Genetics of the Siberian Branch of the Russian Academy of Sciences (IC & G SB RAS), 630090 Novosibirsk, Russia
  - Kurchatov Genomics Center, Institute of Cytology and Genetics, SB RAS, 630090 Novosibirsk, Russia
  - Faculty of Natural Sciences, Novosibirsk State University (NSU), 630090 Novosibirsk, Russia
- Dmitrii K. Konstantinov
  - The Institute of Cytology and Genetics of the Siberian Branch of the Russian Academy of Sciences (IC & G SB RAS), 630090 Novosibirsk, Russia
  - Faculty of Natural Sciences, Novosibirsk State University (NSU), 630090 Novosibirsk, Russia
- Aleksej V. Doroshkov
  - The Institute of Cytology and Genetics of the Siberian Branch of the Russian Academy of Sciences (IC & G SB RAS), 630090 Novosibirsk, Russia
  - Faculty of Natural Sciences, Novosibirsk State University (NSU), 630090 Novosibirsk, Russia
- Sergey A. Lashin
  - The Institute of Cytology and Genetics of the Siberian Branch of the Russian Academy of Sciences (IC & G SB RAS), 630090 Novosibirsk, Russia
  - Kurchatov Genomics Center, Institute of Cytology and Genetics, SB RAS, 630090 Novosibirsk, Russia
  - Faculty of Natural Sciences, Novosibirsk State University (NSU), 630090 Novosibirsk, Russia
  - Correspondence: (S.A.L.); (D.A.A.); Tel.: +7-383-363-49-63 (D.A.A.)
- Dmitry A. Afonnikov
  - The Institute of Cytology and Genetics of the Siberian Branch of the Russian Academy of Sciences (IC & G SB RAS), 630090 Novosibirsk, Russia
  - Kurchatov Genomics Center, Institute of Cytology and Genetics, SB RAS, 630090 Novosibirsk, Russia
  - Faculty of Natural Sciences, Novosibirsk State University (NSU), 630090 Novosibirsk, Russia
  - Correspondence: (S.A.L.); (D.A.A.); Tel.: +7-383-363-49-63 (D.A.A.)
10
Vizán-Rico HI, Mayer C, Petersen M, McKenna DD, Zhou X, Gómez-Zurita J. Patterns and Constraints in the Evolution of Sperm Individualization Genes in Insects, with an Emphasis on Beetles. Genes (Basel) 2019; 10:E776. [PMID: 31590243 PMCID: PMC6826512 DOI: 10.3390/genes10100776] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/24/2019] [Revised: 09/20/2019] [Accepted: 10/01/2019] [Indexed: 11/17/2022]
Abstract
Gene expression profiles can change dramatically between sexes and sex bias may contribute specific macroevolutionary dynamics for sex-biased genes. However, these dynamics are poorly understood at large evolutionary scales due to the paucity of studies that have assessed orthology and functional homology for sex-biased genes and the pleiotropic effects possibly constraining their evolutionary potential. Here, we explore the correlation of sex-biased expression with macroevolutionary processes that are associated with sex-biased genes, including duplications and accelerated evolutionary rates. Specifically, we examined these traits in a group of 44 genes that orchestrate sperm individualization during spermatogenesis, with both unbiased and sex-biased expression. We studied these genes in the broad evolutionary framework of the Insecta, with a particular focus on beetles (order Coleoptera). We studied data mined from 119 insect genomes, including 6 beetle models, and from 19 additional beetle transcriptomes. For the subset of physically and/or genetically interacting proteins, we also analyzed how their network structure may condition the mode of gene evolution. The collection of genes was highly heterogeneous in duplication status, evolutionary rates, and rate stability, but there was statistical evidence for sex bias correlated with faster evolutionary rates, consistent with theoretical predictions. Faster rates were also correlated with clocklike (insect amino acids) and non-clocklike (beetle nucleotides) substitution patterns in these genes. Statistical associations (higher rates for central nodes) or lack thereof (centrality of duplicated genes) were in contrast to some current evolutionary hypotheses, highlighting the need for more research on these topics.
Affiliation(s)
- Helena I. Vizán-Rico
  - Animal Biodiversity and Evolution, Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), 08003 Barcelona, Spain
- Christoph Mayer
  - Center for Molecular Biodiversity Research, Zoological Research Museum Alexander Koenig, 53113 Bonn, Germany
- Malte Petersen
  - Center for Molecular Biodiversity Research, Zoological Research Museum Alexander Koenig, 53113 Bonn, Germany
- Duane D. McKenna
  - Center for Biodiversity Research, Department of Biological Sciences, University of Memphis, Memphis, TN 38152, USA
- Xin Zhou
  - Department of Entomology, College of Plant Protection, China Agricultural University, Beijing 100193, China
- Jesús Gómez-Zurita
  - Animal Biodiversity and Evolution, Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), 08003 Barcelona, Spain
11
Dependency Between Protein-Protein Interactions and Protein Variability and Evolutionary Rates in Vertebrates: Observed Relationships and Stochastic Modeling. J Mol Evol 2019; 87:184-198. [PMID: 31302723 PMCID: PMC6658588 DOI: 10.1007/s00239-019-09899-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Received: 03/05/2019] [Accepted: 07/01/2019] [Indexed: 01/23/2023]
Abstract
Recent developments in sequencing and growth of bioinformatics resources provide us with vast depositories of protein network and single nucleotide polymorphism data. It allows us to re-examine, on a larger and more comprehensive scale, the relationship between protein–protein interactions and protein variability and evolutionary rates. This relationship has remained far from unambiguously resolved for quite a long time, reflecting shifting analysis approaches in the literature, and growing data availability. In this study, we utilized several public genomic databases to investigate this relationship in human, mouse, pig, chicken, and zebrafish. We observed strong non-linear relationship patterns (tending towards convex decreasing function shapes) between protein variability and the density of corresponding protein–protein interactions across all five species. To investigate further, we carried out stochastic simulations, modeling the interplay between protein connectivity and variability. Our results indicate that a simple negative linear correlation model, often suggested (or tacitly assumed) in the literature, as either a null or an alternative hypothesis, is not a good fit with the observed data. After considering different (but still relatively simple, and not overfitting) simulation models, we found that a convex decreasing protein variability–connectivity function (specifically, exponential decay) led to a much better fit with the real data. We conclude that simple correlation models might be inadequate for describing protein variability–connectivity interplay in vertebrates; they often tend towards false negatives (showing no more than marginal linear or rank correlation where there are in fact strong non-random patterns).
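The contrast this abstract draws between a negative linear model and a convex decreasing (exponential decay) model can be illustrated with a small curve-fitting comparison. The sketch below is not the study's stochastic simulation: it uses made-up noise-free data, a hypothetical helper `fit_line`, and a log-linear least-squares fit as a simple stand-in for fitting exponential decay.

```python
import math


def fit_line(xs, ys):
    """Ordinary least squares for y = a + b*x; returns (a, b)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    b = sxy / sxx
    return mean_y - b * mean_x, b


def sse(ys, preds):
    """Sum of squared errors between observations and predictions."""
    return sum((y - p) ** 2 for y, p in zip(ys, preds))


# Hypothetical data: variability decays exponentially with connectivity.
xs = [0, 1, 2, 3, 4, 5, 6, 7, 8]
ys = [10.0 * math.exp(-0.6 * x) for x in xs]

# Model 1: straight line fitted directly to the data.
a, b = fit_line(xs, ys)
sse_linear = sse(ys, [a + b * x for x in xs])

# Model 2: exponential decay, fitted as a line on log(y).
log_a, k = fit_line(xs, [math.log(y) for y in ys])
sse_exp = sse(ys, [math.exp(log_a + k * x) for x in xs])
```

On data of this shape the straight line still has a negative slope, but its residual error is much larger than that of the exponential model, which is the kind of mismatch the abstract attributes to simple linear correlation models.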
|
12
|
Panchy NL, Azodi CB, Winship EF, O'Malley RC, Shiu SH. Expression and regulatory asymmetry of retained Arabidopsis thaliana transcription factor genes derived from whole genome duplication. BMC Evol Biol 2019; 19:77. [PMID: 30866803 PMCID: PMC6416927 DOI: 10.1186/s12862-019-1398-z] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Received: 06/21/2018] [Accepted: 02/22/2019] [Indexed: 12/19/2022] Open
Abstract
Background: Transcription factors (TFs) play a key role in regulating plant development and response to environmental stimuli. While most genes revert to single copy after a whole genome duplication (WGD) event, transcription factors are retained at a significantly higher rate. Little is known about how TF duplicates have diverged in their expression and regulation, the answer to which may contribute to a better understanding of the elevated retention rate among TFs. Results: Here we assessed what features may explain differences in the retention of TF duplicates and other genes using Arabidopsis thaliana as a model. We integrated 34 expression, sequence, and conservation features to build a linear model for predicting the extent of duplicate retention following WGD events among TFs and 19 groups of genes with other functions. We found that TFs were the least well predicted, indicating that the features of TF duplicates deviate substantially from those of duplicate genes in other functional groups. Consistent with this, the evolution of TF expression patterns and cis-regulatory sites favors the partitioning of ancestral states among the resulting duplicates: one “ancestral” TF duplicate retains most ancestral expression and cis-regulatory sites, while the “non-ancestral” duplicate is enriched for novel regulatory sites. By modeling the retention of ancestral expression and cis-regulatory states in duplicate pairs using a system of differential equations, we found that TF duplicate pairs in a partitioned state are preferentially maintained. Conclusions: These TF duplicates with asymmetrically partitioned ancestral states are likely maintained because one copy retains ancestral functions while the other, at least in some cases, acquires novel cis-regulatory sites that may be important for novel, adaptive traits.
Affiliation(s)
- Nicholas L Panchy
- Genetics Program, Michigan State University, East Lansing, MI, 48824, USA.,Present address: NIMBioS, University of Tennessee, Claxton Bldg. 1122 Volunteer Blvd., Suite 106, Knoxville, TN, 37996-3410, USA
| | - Christina B Azodi
- Department of Plant Biology, Michigan State University, East Lansing, MI, 48824, USA
| | - Eamon F Winship
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI, 48824, USA.,Present address: MYcroarray, 5692 Plymouth Rd, Ann Arbor, MI, 48105, USA
| | | | - Shin-Han Shiu
- Genetics Program, Michigan State University, East Lansing, MI, 48824, USA.,Department of Plant Biology, Michigan State University, East Lansing, MI, 48824, USA.,Department of Computational Mathematics, Science, and Engineering, Michigan State University, East Lansing, MI, 48824, USA.,Plant Biology Laboratories, Michigan State University, 612 Wilson Road, Room 166, East Lansing, MI, 48824-1312, USA
|
13
|
Lipinska AP, Serrano-Serrano ML, Cormier A, Peters AF, Kogame K, Cock JM, Coelho SM. Rapid turnover of life-cycle-related genes in the brown algae. Genome Biol 2019; 20:35. [PMID: 30764885 PMCID: PMC6374913 DOI: 10.1186/s13059-019-1630-6] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Received: 02/16/2018] [Accepted: 01/16/2019] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND: Sexual life cycles in eukaryotes involve a cyclic alternation between haploid and diploid phases. While most animals possess a diploid life cycle, many plants and algae alternate between multicellular haploid (gametophyte) and diploid (sporophyte) generations. In many algae, gametophytes and sporophytes are independent and free-living and may present dramatic phenotypic differences. The same shared genome can therefore be subject to different, even conflicting, selection pressures during each of the life cycle generations. Here, we analyze the nature and extent of genome-wide, generation-biased gene expression in four species of brown algae with contrasting levels of dimorphism between life cycle generations. RESULTS: We show that the proportion of the transcriptome that is generation-specific is broadly associated with the level of phenotypic dimorphism between the life cycle stages. Importantly, our data reveal a remarkably high turnover rate for life-cycle-related gene sets across the brown algae and highlight the importance not only of co-option of regulatory programs from one generation to the other but also of a role for newly emerged, lineage-specific gene expression patterns in the evolution of the gametophyte and sporophyte developmental programs in this major eukaryotic group. Moreover, we show that generation-biased genes display distinct evolutionary modes, with gametophyte-biased genes evolving rapidly at the coding sequence level whereas sporophyte-biased genes tend to exhibit changes in their patterns of expression. CONCLUSION: Our analysis uncovers the characteristics, expression patterns, and evolution of generation-biased genes and underlines the selective forces that shape this previously underappreciated source of phenotypic diversity.
Affiliation(s)
- Agnieszka P Lipinska
- Sorbonne Université, UPMC Univ Paris 06, CNRS, Algal Genetics Group, Integrative Biology of Marine Models, Station Biologique de Roscoff, CS 90074, F-29688, Roscoff, France
| | | | - Alexandre Cormier
- Laboratoire Ecologie et Biologie des Interactions, Equipe Ecologie Evolution Symbiose, Université de Poitiers, UMR CNRS 7267, Poitiers, France
| | | | - Kazuhiro Kogame
- Department of Biological Sciences, Faculty of Sciences, Hokkaido University, Sapporo, 060-0810, Japan
| | - J Mark Cock
- Sorbonne Université, UPMC Univ Paris 06, CNRS, Algal Genetics Group, Integrative Biology of Marine Models, Station Biologique de Roscoff, CS 90074, F-29688, Roscoff, France
| | - Susana M Coelho
- Sorbonne Université, UPMC Univ Paris 06, CNRS, Algal Genetics Group, Integrative Biology of Marine Models, Station Biologique de Roscoff, CS 90074, F-29688, Roscoff, France.
|
14
|
Gao X, Zhang X, Meng H, Li J, Zhang D, Liu C. Comparative chloroplast genomes of Paris Sect. Marmorata: insights into repeat regions and evolutionary implications. BMC Genomics 2018; 19:878. [PMID: 30598104 PMCID: PMC6311911 DOI: 10.1186/s12864-018-5281-x] [Citation(s) in RCA: 34] [Impact Index Per Article: 5.7] [Indexed: 11/23/2022] Open
Abstract
Background: Species of Paris Sect. Marmorata are valuable medicinal plants that synthesize steroidal saponins with effective pharmacological properties. However, the wild resources of these species are threatened by overexploitation before molecular genetic studies have uncovered their genomes and evolutionary significance. Thus, the availability of complete chloroplast genome sequences of Sect. Marmorata is crucial to understanding the plastome evolution of this section and to facilitating future population genetics studies. Here, we determined chloroplast genomes of Sect. Marmorata and conducted a whole chloroplast genome comparison. Results: This study presented detailed sequences and structural variations of chloroplast genomes of Sect. Marmorata. Over 40 large repeats and approximately 130 simple sequence repeats as well as a group of genomic hotspots were detected. Inverted repeat contraction in this section was inferred by comparing the chloroplast genomes with that of P. verticillata. Additionally, almost all the plastid protein coding genes were found to prefer ending with A/U. Mutation bias and selection pressure predominantly shaped the codon bias of most genes. Most of the genes underwent purifying selection, whereas photosynthetic genes experienced relatively relaxed purifying selection. Conclusions: Repeat sequences and hotspot regions can be scanned to detect intraspecific and interspecific variability, and selected to infer the phylogenetic relationships of Sect. Marmorata and other species in subgenus Daiswa. Mutation and natural selection were the main forces driving the codon bias pattern of most plastid protein coding genes. Therefore, this study enhances the understanding of the evolution of Sect. Marmorata from the chloroplast genome, and provides genomic insights for genetic analyses of Sect. Marmorata.
Affiliation(s)
- Xiaoyang Gao
- CAS Key Laboratory of Tropical Plant Resources and Sustainable Use, Xishuangbanna Tropical Botanical Garden, Chinese Academy of Science, Menglun, 666303, Yunnan, China
| | - Xuan Zhang
- CAS Key Laboratory of Tropical Plant Resources and Sustainable Use, Xishuangbanna Tropical Botanical Garden, Chinese Academy of Science, Menglun, 666303, Yunnan, China.,University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Honghu Meng
- Center for Integrative Conservation, Xishuangbanna Tropical Botanical Garden, Chinese Academy of Sciences, Kunming, 650223, Yunnan, China
| | - Jing Li
- CAS Key Laboratory of Tropical Plant Resources and Sustainable Use, Xishuangbanna Tropical Botanical Garden, Chinese Academy of Science, Menglun, 666303, Yunnan, China.,University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Di Zhang
- CAS Key Laboratory of Tropical Plant Resources and Sustainable Use, Xishuangbanna Tropical Botanical Garden, Chinese Academy of Science, Menglun, 666303, Yunnan, China.,University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Changning Liu
- CAS Key Laboratory of Tropical Plant Resources and Sustainable Use, Xishuangbanna Tropical Botanical Garden, Chinese Academy of Science, Menglun, 666303, Yunnan, China.
|
15
|
Aguilar-Rodríguez J, Wagner A. Metabolic Determinants of Enzyme Evolution in a Genome-Scale Bacterial Metabolic Network. Genome Biol Evol 2018; 10:3076-3088. [PMID: 30351420 PMCID: PMC6257574 DOI: 10.1093/gbe/evy234] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Accepted: 10/22/2018] [Indexed: 11/12/2022] Open
Abstract
Different genes and proteins evolve at very different rates. To identify the factors that explain these differences is an important aspect of research in molecular evolution. One such factor is the role a protein plays in a large molecular network. Here, we analyze the evolutionary rates of enzyme-coding genes in the genome-scale metabolic network of Escherichia coli to find the evolutionary constraints imposed by the structure and function of this complex metabolic system. Central and highly connected enzymes appear to evolve more slowly than less connected enzymes, but we find that they do so as a by-product of their high abundance, and not because of their position in the metabolic network. In contrast, enzymes catalyzing reactions with high metabolic flux (high substrate-to-product conversion rates) evolve slowly even after we account for their abundance. Moreover, enzymes catalyzing reactions that are difficult to by-pass through alternative pathways, such that they are essential in many different genetic backgrounds, also evolve more slowly. Our analyses show that an enzyme's role in the function of a metabolic network affects its evolution more than its place in the network's structure. They highlight the value of a system-level perspective for studies of molecular evolution.
Affiliation(s)
- José Aguilar-Rodríguez
- Department of Evolutionary Biology and Environmental Studies, University of Zurich, Zurich, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
- Department of Biology, Stanford University, Stanford, CA and Department of Chemical and Systems Biology, Stanford University School of Medicine, Stanford, CA
| | - Andreas Wagner
- Department of Evolutionary Biology and Environmental Studies, University of Zurich, Zurich, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
- The Santa Fe Institute, Santa Fe, New Mexico
|
16
|
Payne BL, Alvarez-Ponce D. Higher Rates of Protein Evolution in the Self-Fertilizing Plant Arabidopsis thaliana than in the Out-Crossers Arabidopsis lyrata and Arabidopsis halleri. Genome Biol Evol 2018; 10:895-900. [PMID: 29608724 PMCID: PMC5865523 DOI: 10.1093/gbe/evy053] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Accepted: 03/05/2018] [Indexed: 11/13/2022] Open
Abstract
The common transition from out-crossing to self-fertilization in plants decreases effective population size. This is expected to result in a reduced efficacy of natural selection and in increased rates of protein evolution in selfing plants compared with their outcrossing congeners. Prior analyses, based on a very limited number of genes, detected no differences between the rates of protein evolution in the selfing Arabidopsis thaliana compared with the out-crosser Arabidopsis lyrata. Here, we reevaluate this trend using the complete genomes of A. thaliana, A. lyrata, Arabidopsis halleri, and the outgroups Capsella rubella and Thellungiella parvula. Our analyses indicate slightly but measurably higher nonsynonymous divergences (dN), synonymous divergences (dS) and dN/dS ratios in A. thaliana compared with the other Arabidopsis species, indicating that purifying selection is indeed less efficacious in A. thaliana.
|
17
|
Evolutionary Perspectives of Genotype-Phenotype Factors in Leishmania Metabolism. J Mol Evol 2018; 86:443-456. [PMID: 30022295 DOI: 10.1007/s00239-018-9857-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/13/2018] [Accepted: 07/13/2018] [Indexed: 10/28/2022]
Abstract
The sandfly midgut and the human macrophage phagolysosome provide antagonistic metabolic niches for the endoparasite Leishmania to survive and populate. Although these environments fluctuate across developmental stages, the relative changes in both these environments across parasite generations might remain gradual. Such environmental restrictions might endow parasite metabolism with a choice of specific genotypic and phenotypic factors that can constrain enzyme evolution for successful adaptation to the host. With respect to the available cellular information for Leishmania species, for the first time, we measure the relative contribution of eight inter-correlated predictors related to codon usage, GC content, gene expression, gene length, multi-functionality, and flux-coupling potential of an enzyme on the evolutionary rates of singleton metabolic genes and further compare their effects across three Leishmania species. Our analysis reveals that codon adaptation, multi-functionality, and flux-coupling potential of an enzyme are independent contributors of enzyme evolutionary rates, which can together explain a large variation in enzyme evolutionary rates across species. We also hypothesize that a species-specific occurrence of duplicated genes in novel subcellular locations can create new flux routes through certain singleton flux-coupled enzymes, thereby constraining their evolution. A cross-species comparison revealed both common and species-specific genes whose evolutionary divergence was constrained by multiple independent factors. Out of these, previously known pharmacological targets and virulence factors in Leishmania were identified, suggesting their evolutionary reasons for being important survival factors to the parasite. All these results provide a fundamental understanding of the factors underlying adaptive strategies of the parasite, which can be further targeted.
|
18
|
Alvarez-Ponce D, Feyertag F, Chakraborty S. Position Matters: Network Centrality Considerably Impacts Rates of Protein Evolution in the Human Protein-Protein Interaction Network. Genome Biol Evol 2018; 9:1742-1756. [PMID: 28854629 PMCID: PMC5570066 DOI: 10.1093/gbe/evx117] [Citation(s) in RCA: 30] [Impact Index Per Article: 5.0] [Accepted: 07/01/2017] [Indexed: 02/06/2023] Open
Abstract
The proteins of any organism evolve at disparate rates. A long list of factors affecting rates of protein evolution has been identified. However, the relative importance of each factor in determining rates of protein evolution remains unresolved. The prevailing view is that evolutionary rates are dominantly determined by gene expression, and that other factors such as network centrality have only a marginal effect, if any. However, this view is largely based on analyses in yeasts, and accurately measuring the importance of the determinants of rates of protein evolution is complicated by the fact that the different factors are often correlated with each other, and by the relatively poor quality of available functional genomics data sets. Here, we use correlation, partial correlation and principal component regression analyses to measure the contributions of several factors to the variability of the rates of evolution of human proteins. For this purpose, we analyzed the entire human protein–protein interaction data set and the human signal transduction network, a network data set of exceptionally high quality, obtained by manual curation, which is expected to be virtually free from false positives. In contrast with the prevailing view, we observe that network centrality (measured as the number of physical and nonphysical interactions, betweenness, and closeness) has a considerable impact on rates of protein evolution. Surprisingly, the impact of centrality on rates of protein evolution seems to be comparable, or even superior according to some analyses, to that of gene expression. Our observations seem to be independent of potentially confounding factors and of the limitations (biases and errors) of interactomic data sets.
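The partial correlation analysis mentioned in this abstract can be sketched as follows. This is only an illustration under stated assumptions: the data are simulated, the variable names (`expression`, `centrality`, `rate`) and effect sizes are hypothetical, and the standard first-order partial correlation formula on Pearson coefficients is used here, which may differ from the exact procedure in the paper.

```python
import math
import random

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

def partial_corr(x, y, z):
    """First-order partial correlation of x and y, controlling for z."""
    r_xy, r_xz, r_yz = pearson(x, y), pearson(x, z), pearson(y, z)
    return (r_xy - r_xz * r_yz) / math.sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))

# Simulated proteins (assumption): centrality is correlated with expression,
# and the evolutionary rate decreases with both.
random.seed(1)
n = 2000
expression = [random.gauss(0, 1) for _ in range(n)]
centrality = [e + random.gauss(0, 1) for e in expression]
rate = [-e - c + random.gauss(0, 1) for e, c in zip(expression, centrality)]

raw = pearson(centrality, rate)
partial = partial_corr(centrality, rate, expression)
print(f"raw r = {raw:.3f}, partial r (controlling for expression) = {partial:.3f}")
```

In this toy setup the partial correlation is weaker than the raw correlation but remains clearly negative, showing how the statistic separates a factor's own association from the part shared with a correlated covariate.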
|
19
|
Hanada K, Tezuka A, Nozawa M, Suzuki Y, Sugano S, Nagano AJ, Ito M, Morinaga SI. Functional divergence of duplicate genes several million years after gene duplication in Arabidopsis. DNA Res 2018; 25:4898128. [PMID: 29481587 PMCID: PMC6014284 DOI: 10.1093/dnares/dsy005] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Received: 07/25/2017] [Accepted: 02/02/2018] [Indexed: 12/02/2022] Open
Abstract
Lineage-specific duplicated genes likely contribute to the phenotypic divergence in closely related species. However, neither the frequency of duplication events nor the degree of selection pressure immediately after gene duplication is clear in the speciation process. Here, using Illumina DNA-sequencing reads from Arabidopsis halleri, which has multiple closely related species with high-quality genome assemblies (A. thaliana and A. lyrata), we succeeded in generating orthologous gene groups in Brassicaceae. The duplication frequency of retained genes in the Arabidopsis lineage was ∼10 times higher than the duplication frequency inferred by comparative genomics of Arabidopsis, poplar, rice and moss (Physcomitrella patens). The difference in duplication frequencies can be explained by a rapid decay of anciently duplicated genes. To examine the degree of selection pressure on genes duplicated in either the A. halleri-lyrata or the A. halleri lineage, we examined positive and purifying selection in the A. halleri-lyrata and A. halleri lineages through the ratios of nonsynonymous to synonymous substitution rates (KA/KS). Duplicate genes tended to have a higher proportion of positive selection compared with non-duplicated genes. Interestingly, we found that functional divergence of duplicated genes was accelerated several million years after gene duplication compared with immediately after gene duplication.
Affiliation(s)
- Kousuke Hanada
- Department of Bioscience and Bioinformatics, Frontier Research Academy for Young Researchers, Kyusyu Institute of Technology, Iizuka, Fukuoka 820-8502, Japan
- RIKEN Center for Sustainable Resource Science, RIKEN, Yokohama, Kanagawa 230-0045, Japan
- CREST, Japan Science and Technology Agency, Kawaguchi, Saitama 332-0012, Japan
| | - Ayumi Tezuka
- Department of Bioscience and Bioinformatics, Frontier Research Academy for Young Researchers, Kyusyu Institute of Technology, Iizuka, Fukuoka 820-8502, Japan
| | - Masafumi Nozawa
- Center for Information Biology, National Institute of Genetics, Mishima, Shizuoka 411-8540, Japan
- Department of Genetics, SOKENDAI, Mishima, Shizuoka 411-8540, Japan
- Department of Biological Sciences, Tokyo Metropolitan University, Hachioji, Tokyo 192-0397, Japan
| | - Yutaka Suzuki
- Graduate School of Frontier Science, The University of Tokyo, Kashiwa, Chiba 277-8562, Japan
| | - Sumio Sugano
- Graduate School of Frontier Science, The University of Tokyo, Kashiwa, Chiba 277-8562, Japan
| | - Atsushi J Nagano
- CREST, Japan Science and Technology Agency, Kawaguchi, Saitama 332-0012, Japan
- Center of Ecological Research, Kyoto University, Hirano, Otsu, Shiga 520-2113, Japan
| | - Motomi Ito
- Graduate School of Arts and Sciences, The University of Tokyo, Tokyo 153-8902, Japan
| | - Shin-Ichi Morinaga
- CREST, Japan Science and Technology Agency, Kawaguchi, Saitama 332-0012, Japan
- Graduate School of Arts and Sciences, The University of Tokyo, Tokyo 153-8902, Japan
- College of Bioresource Sciences, Nihon University, Fujisawa, Kanagawa 252-0880, Japan
|
20
|
Böndel KB, Nosenko T, Stephan W. Signatures of natural selection in abiotic stress-responsive genes of Solanum chilense. ROYAL SOCIETY OPEN SCIENCE 2018; 5:171198. [PMID: 29410831 PMCID: PMC5792908 DOI: 10.1098/rsos.171198] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Received: 08/21/2017] [Accepted: 12/04/2017] [Indexed: 06/01/2023]
Abstract
Environmental conditions are strong selective forces, which may influence adaptation and speciation. The wild tomato species Solanum chilense, native to South America, is exposed to a range of abiotic stress factors. To identify signatures of natural selection and local adaptation, we analysed 16 genes involved in the abiotic stress response and compared the results to a set of reference genes in 23 populations across the entire species range. The abiotic stress-responsive genes are characterized by elevated nonsynonymous nucleotide diversity and divergence. We detected signatures of positive selection in several abiotic stress-responsive genes on both the population and species levels. Local adaptation to abiotic stresses is particularly apparent at the boundary of the species distribution in populations from coastal low-altitude and mountainous high-altitude regions.
|
21
|
Banerjee S, Feyertag F, Alvarez-Ponce D. Intrinsic protein disorder reduces small-scale gene duplicability. DNA Res 2017; 24:435-444. [PMID: 28430886 PMCID: PMC5737077 DOI: 10.1093/dnares/dsx015] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Received: 12/08/2016] [Accepted: 03/28/2017] [Indexed: 01/23/2023] Open
Abstract
Whereas the rate of gene duplication is relatively high, only certain duplications survive the filter of natural selection and can contribute to genome evolution. However, the reasons why certain genes can be retained after duplication whereas others cannot remain largely unknown. Many proteins contain intrinsically disordered regions (IDRs), whose structures fluctuate between alternative conformational states. Due to their high flexibility, IDRs often enable protein–protein interactions and are the target of post-translational modifications. Intrinsically disordered proteins (IDPs) have characteristics that might either stimulate or hamper the retention of their encoding genes after duplication. On the one hand, IDRs may enable functional diversification, thus promoting duplicate retention. On the other hand, increased IDP availability is expected to result in deleterious unspecific interactions. Here, we interrogate the proteomes of human, Drosophila melanogaster, Caenorhabditis elegans, Saccharomyces cerevisiae, Arabidopsis thaliana and Escherichia coli, in order to ascertain the impact of protein intrinsic disorder on gene duplicability. We show that, in general, proteins encoded by duplicated genes tend to be less disordered than those encoded by singletons. The only exception is proteins encoded by ohnologs, which tend to be more disordered than those encoded by singletons or genes resulting from small-scale duplications. Our results indicate that duplication of genes encoding IDPs outside the context of whole-genome duplication (WGD) is often deleterious, but that IDRs facilitate retention of duplicates in the context of WGD. We discuss the potential evolutionary implications of our results.
Affiliation(s)
- Sanghita Banerjee
- Department of Biology, University of Nevada, Reno, NV 89557, USA.,Machine Intelligence Unit, Indian Statistical Institute, Kolkata 700108, India
| | - Felix Feyertag
- Department of Biology, University of Nevada, Reno, NV 89557, USA
|
22
|
Connectivity in gene coexpression networks negatively correlates with rates of molecular evolution in flowering plants. PLoS One 2017; 12:e0182289. [PMID: 28759647 PMCID: PMC5536297 DOI: 10.1371/journal.pone.0182289] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Received: 04/20/2017] [Accepted: 07/14/2017] [Indexed: 12/22/2022] Open
Abstract
Gene coexpression networks are a useful tool for summarizing transcriptomic data and providing insight into patterns of gene regulation in a variety of species. Though there has been considerable interest in studying the evolution of network topology across species, less attention has been paid to the relationship between network position and patterns of molecular evolution. Here, we generated coexpression networks from publicly available expression data for seven flowering plant taxa (Arabidopsis thaliana, Glycine max, Oryza sativa, Populus spp., Solanum lycopersicum, Vitis spp., and Zea mays) to investigate the relationship between network position and rates of molecular evolution. We found a significant negative correlation between network connectivity and rates of molecular evolution, with more highly connected (i.e., “hub”) genes having significantly lower nonsynonymous substitution rates and dN/dS ratios compared to less highly connected (i.e., “peripheral”) genes across the taxa surveyed. These findings suggest that more centrally located hub genes are, on average, subject to higher levels of evolutionary constraint than are genes located on the periphery of gene coexpression networks. The consistency of this result across disparate taxa suggests that it holds for flowering plants in general, as opposed to being a species-specific phenomenon.
|
23
|
Feyertag F, Berninsone PM, Alvarez-Ponce D. Secreted Proteins Defy the Expression Level-Evolutionary Rate Anticorrelation. Mol Biol Evol 2017; 34:692-706. [PMID: 28007979 DOI: 10.1093/molbev/msw268] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Indexed: 12/15/2022] Open
Abstract
The rates of evolution of the proteins of any organism vary across orders of magnitude. A primary factor influencing rates of protein evolution is expression. A strong negative correlation between expression levels and evolutionary rates (the so-called E-R anticorrelation) has been observed in virtually all studied organisms. This effect is currently attributed to the abundance-dependent fitness costs of misfolding and unspecific protein-protein interactions, among other factors. Secreted proteins are folded in the endoplasmic reticulum, a compartment where chaperones, folding catalysts, and stringent quality control mechanisms promote their correct folding and may reduce the fitness costs of misfolding. In addition, confinement of secreted proteins to the extracellular space may reduce misinteractions and their deleterious effects. We hypothesize that each of these factors (the secretory pathway quality control and extracellular location) may reduce the strength of the E-R anticorrelation. Indeed, here we show that among human proteins that are secreted to the extracellular space, rates of evolution do not correlate with protein abundances. This trend is robust to controlling for several potentially confounding factors and is also observed when analyzing protein abundance data for 6 human tissues. In addition, analysis of mRNA abundance data for 32 human tissues shows that the E-R correlation is always less negative, and sometimes nonsignificant, in secreted proteins. Similar observations were made in Caenorhabditis elegans and in Escherichia coli, and to a lesser extent in Drosophila melanogaster, Saccharomyces cerevisiae and Arabidopsis thaliana. Our observations contribute to understanding the causes of the E-R anticorrelation.
Affiliation(s)
- Felix Feyertag
- Department of Biology, University of Nevada, Reno, Reno, NV
|
24
|
Alvarez-Ponce D, Sabater-Muñoz B, Toft C, Ruiz-González MX, Fares MA. Essentiality Is a Strong Determinant of Protein Rates of Evolution during Mutation Accumulation Experiments in Escherichia coli. Genome Biol Evol 2016; 8:2914-2927. [PMID: 27566759 PMCID: PMC5630975 DOI: 10.1093/gbe/evw205] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Indexed: 12/16/2022] Open
Abstract
The Neutral Theory of Molecular Evolution is considered the most powerful theory to understand the evolutionary behavior of proteins. One of the main predictions of this theory is that essential proteins should evolve slower than dispensable ones owing to increased selective constraints. Comparison of genomes of different species, however, has revealed only small differences between the rates of evolution of essential and nonessential proteins. In some analyses, these differences vanish once confounding factors are controlled for, whereas in other cases essentiality seems to have an independent, albeit small, effect. It has been argued that comparing relatively distant genomes may entail a number of limitations. For instance, many of the genes that are dispensable in controlled lab conditions may be essential in some of the conditions faced in nature. Moreover, essentiality can change during evolution, and rates of protein evolution are simultaneously shaped by a variety of factors, whose individual effects are difficult to isolate. Here, we conducted two parallel mutation accumulation experiments in Escherichia coli, during 5,500–5,750 generations, and compared the genomes at different points of the experiments. Our approach (a short-term experiment, under highly controlled conditions) enabled us to overcome many of the limitations of previous studies. We observed that essential proteins evolved substantially slower than nonessential ones during our experiments. Strikingly, rates of protein evolution were only moderately affected by expression level and protein length.
Affiliation(s)
| | - Beatriz Sabater-Muñoz
- Instituto de Biología Molecular y Celular de Plantas (CSIC-UPV), Valencia, Spain; Department of Genetics, Smurfit Institute of Genetics, University of Dublin, Trinity College Dublin, Dublin, Ireland
| | - Christina Toft
- Department of Genetics, University of Valencia, Valencia, Spain; Departamento de Biotecnología, Instituto de Agroquímica y Tecnología de los Alimentos (CSIC), Valencia, Spain
| | - Mario X Ruiz-González
- Instituto de Biología Molecular y Celular de Plantas (CSIC-UPV), Valencia, Spain; Current Address: Secretaría de Educación Superior, Ciencia, Tecnología e Innovación, Proyecto Prometeo; Departamento de Ciencias Biológicas, Universidad Técnica Particular de Loja, Loja, Ecuador
| | - Mario A Fares
- Instituto de Biología Molecular y Celular de Plantas (CSIC-UPV), Valencia, Spain; Department of Genetics, Smurfit Institute of Genetics, University of Dublin, Trinity College Dublin, Dublin, Ireland
|
25
|
Panchy N, Lehti-Shiu M, Shiu SH. Evolution of Gene Duplication in Plants. Plant Physiology 2016; 171:2294-316. [PMID: 27288366 PMCID: PMC4972278 DOI: 10.1104/pp.16.00523] [Citation(s) in RCA: 760] [Impact Index Per Article: 95.0] [Received: 04/02/2016] [Accepted: 05/17/2016] [Indexed: 05/18/2023]
Abstract
Ancient duplication events and a high rate of retention of extant pairs of duplicate genes have contributed to an abundance of duplicate genes in plant genomes. These duplicates have contributed to the evolution of novel functions, such as the production of floral structures, induction of disease resistance, and adaptation to stress. Additionally, recent whole-genome duplications that have occurred in the lineages of several domesticated crop species, including wheat (Triticum aestivum), cotton (Gossypium hirsutum), and soybean (Glycine max), have contributed to important agronomic traits, such as grain quality, fruit shape, and flowering time. Therefore, understanding the mechanisms and impacts of gene duplication will be important to future studies of plants in general and of agronomically important crops in particular. In this review, we survey the current knowledge about gene duplication, including gene duplication mechanisms, the potential fates of duplicate genes, models explaining duplicate gene retention, the properties that distinguish duplicate from singleton genes, and the evolutionary impact of gene duplication.
Affiliation(s)
- Nicholas Panchy
- Genetics Program (N.P., S.-H.S.) and Department of Plant Biology (M.L.-S., S.-H.S.), Michigan State University, East Lansing, Michigan 48824
| | - Melissa Lehti-Shiu
- Genetics Program (N.P., S.-H.S.) and Department of Plant Biology (M.L.-S., S.-H.S.), Michigan State University, East Lansing, Michigan 48824
| | - Shin-Han Shiu
- Genetics Program (N.P., S.-H.S.) and Department of Plant Biology (M.L.-S., S.-H.S.), Michigan State University, East Lansing, Michigan 48824
|
26
|
Gossmann TI, Saleh D, Schmid MW, Spence MA, Schmid KJ. Transcriptomes of Plant Gametophytes Have a Higher Proportion of Rapidly Evolving and Young Genes than Sporophytes. Mol Biol Evol 2016; 33:1669-78. [PMID: 26956888 PMCID: PMC4915351 DOI: 10.1093/molbev/msw044] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.4] [Indexed: 12/14/2022] Open
Abstract
Reproductive traits in plants tend to evolve rapidly due to various causes that include plant-pollinator coevolution and pollen competition, but the genomic basis of reproductive trait evolution is still largely unknown. To characterize evolutionary patterns of genome wide gene expression in reproductive tissues in the gametophyte and to compare them to developmental stages of the sporophyte, we analyzed evolutionary conservation and genetic diversity of protein-coding genes using microarray-based transcriptome data from three plant species, Arabidopsis thaliana, rice (Oryza sativa), and soybean (Glycine max). In all three species a significant shift in gene expression occurs during gametogenesis in which genes of younger evolutionary age and higher genetic diversity contribute significantly more to the transcriptome than in other stages. We refer to this phenomenon as "evolutionary bulge" during plant reproductive development because it differentiates the gametophyte from the sporophyte. We show that multiple, not mutually exclusive, causes may explain the bulge pattern, most prominently reduced tissue complexity of the gametophyte, a varying extent of selection on reproductive traits during gametogenesis as well as differences between male and female tissues. This highlights the importance of plant reproduction for understanding evolutionary forces determining the relationship of genomic and phenotypic variation in plants.
Affiliation(s)
- Toni I Gossmann
- Institute of Plant Breeding, Seed Science and Population Genetics, University of Hohenheim, Stuttgart, Germany; Department of Animal and Plant Sciences, University of Sheffield, Sheffield, United Kingdom
| | - Dounia Saleh
- Institute of Plant Breeding, Seed Science and Population Genetics, University of Hohenheim, Stuttgart, Germany
| | - Marc W Schmid
- Institute for Plant Biology and Zurich-Basel Plant Science Center, University of Zurich, Zurich, Switzerland
| | - Michael A Spence
- Department of Animal and Plant Sciences, University of Sheffield, Sheffield, United Kingdom
| | - Karl J Schmid
- Institute of Plant Breeding, Seed Science and Population Genetics, University of Hohenheim, Stuttgart, Germany
|
27
|
Positive Selection and Centrality in the Yeast and Fly Protein-Protein Interaction Networks. BioMed Research International 2016; 2016:4658506. [PMID: 27119079 PMCID: PMC4826914 DOI: 10.1155/2016/4658506] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Received: 12/29/2015] [Accepted: 03/07/2016] [Indexed: 01/28/2023]
Abstract
Proteins within a molecular network are expected to be subject to different selective pressures depending on their relative hierarchical positions. However, it is not obvious what genes within a network should be more likely to evolve under positive selection. On one hand, only mutations at genes with a relatively high degree of control over adaptive phenotypes (such as those encoding highly connected proteins) are expected to be “seen” by natural selection. On the other hand, a high degree of pleiotropy at these genes is expected to hinder adaptation. Previous analyses of the human protein-protein interaction network have shown that genes under long-term, recurrent positive selection (as inferred from interspecific comparisons) tend to act at the periphery of the network. It is unknown, however, whether these trends apply to other organisms. Here, we show that long-term positive selection has preferentially targeted the periphery of the yeast interactome. Conversely, in flies, genes under positive selection encode significantly more connected and central proteins. These observations are not due to covariation of genes' adaptability and centrality with confounding factors. Therefore, the distribution of proteins encoded by genes under recurrent positive selection across protein-protein interaction networks varies from one species to another.
|
28
|
Roque E, Fares MA, Yenush L, Rochina MC, Wen J, Mysore KS, Gómez-Mena C, Beltrán JP, Cañas LA. Evolution by gene duplication of Medicago truncatula PISTILLATA-like transcription factors. Journal of Experimental Botany 2016; 67:1805-1817. [PMID: 26773809 PMCID: PMC4783364 DOI: 10.1093/jxb/erv571] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Indexed: 05/29/2023]
Abstract
PISTILLATA (PI) is a member of the B-function MADS-box gene family, which controls the identity of both petals and stamens in Arabidopsis thaliana. In Medicago truncatula (Mt), there are two PI-like paralogs, known as MtPI and MtNGL9. These genes differ in their expression patterns, but it is not known whether their functions have also diverged. Describing the evolution of certain duplicated genes, such as transcription factors, remains a challenge owing to the complex expression patterns and functional divergence between the gene copies. Here, we report a number of functional studies, including analyses of gene expression, protein-protein interactions, and reverse genetic approaches designed to demonstrate the respective contributions of each M. truncatula PI-like paralog to the B-function in this species. Also, we have integrated molecular evolution approaches to determine the mode of evolution of Mt PI-like genes after duplication. Our results demonstrate that MtPI functions as a master regulator of B-function in M. truncatula, maintaining the overall ancestral function, while MtNGL9 does not seem to have a role in this regard, suggesting that the pseudogenization could be the functional evolutionary fate for this gene. However, we provide evidence that purifying selection is the primary evolutionary force acting on this paralog, pinpointing the conservation of its biochemical function and, alternatively, the acquisition of a new role for this gene.
Affiliation(s)
- Edelín Roque
- Instituto de Biología Molecular y Celular de Plantas Consejo Superior de Investigaciones Científicas & Universidad Politécnica de Valencia (CSIC-UPV), Ciudad Politécnica de la Innovación, Edf. 8E, C/ Ingeniero Fausto Elio s/n, E-46011 Valencia, Spain
| | - Mario A Fares
- Instituto de Biología Molecular y Celular de Plantas Consejo Superior de Investigaciones Científicas & Universidad Politécnica de Valencia (CSIC-UPV), Ciudad Politécnica de la Innovación, Edf. 8E, C/ Ingeniero Fausto Elio s/n, E-46011 Valencia, Spain
| | - Lynne Yenush
- Instituto de Biología Molecular y Celular de Plantas Consejo Superior de Investigaciones Científicas & Universidad Politécnica de Valencia (CSIC-UPV), Ciudad Politécnica de la Innovación, Edf. 8E, C/ Ingeniero Fausto Elio s/n, E-46011 Valencia, Spain
| | - Mari Cruz Rochina
- Instituto de Biología Molecular y Celular de Plantas Consejo Superior de Investigaciones Científicas & Universidad Politécnica de Valencia (CSIC-UPV), Ciudad Politécnica de la Innovación, Edf. 8E, C/ Ingeniero Fausto Elio s/n, E-46011 Valencia, Spain
| | - Jiangqi Wen
- Plant Biology Division, The Samuel Roberts Noble Foundation, 2510 Sam Noble Parkway, Ardmore, OK 73401, USA
| | - Kirankumar S Mysore
- Plant Biology Division, The Samuel Roberts Noble Foundation, 2510 Sam Noble Parkway, Ardmore, OK 73401, USA
| | - Concepción Gómez-Mena
- Instituto de Biología Molecular y Celular de Plantas Consejo Superior de Investigaciones Científicas & Universidad Politécnica de Valencia (CSIC-UPV), Ciudad Politécnica de la Innovación, Edf. 8E, C/ Ingeniero Fausto Elio s/n, E-46011 Valencia, Spain
| | - José Pío Beltrán
- Instituto de Biología Molecular y Celular de Plantas Consejo Superior de Investigaciones Científicas & Universidad Politécnica de Valencia (CSIC-UPV), Ciudad Politécnica de la Innovación, Edf. 8E, C/ Ingeniero Fausto Elio s/n, E-46011 Valencia, Spain
| | - Luis A Cañas
- Instituto de Biología Molecular y Celular de Plantas Consejo Superior de Investigaciones Científicas & Universidad Politécnica de Valencia (CSIC-UPV), Ciudad Politécnica de la Innovación, Edf. 8E, C/ Ingeniero Fausto Elio s/n, E-46011 Valencia, Spain
|
29
|
Li Z, Defoort J, Tasdighian S, Maere S, Van de Peer Y, De Smet R. Gene Duplicability of Core Genes Is Highly Consistent across All Angiosperms. The Plant Cell 2016; 28:326-44. [PMID: 26744215 PMCID: PMC4790876 DOI: 10.1105/tpc.15.00877] [Citation(s) in RCA: 136] [Impact Index Per Article: 17.0] [Received: 10/13/2015] [Accepted: 01/04/2016] [Indexed: 05/02/2023]
Abstract
Gene duplication is an important mechanism for adding to genomic novelty. Hence, which genes undergo duplication and are preserved following duplication is an important question. It has been observed that gene duplicability, or the ability of genes to be retained following duplication, is a nonrandom process, with certain genes being more amenable to survive duplication events than others. Primarily, gene essentiality and the type of duplication (small-scale versus large-scale) have been shown in different species to influence the (long-term) survival of novel genes. However, an overarching view of "gene duplicability" is lacking, mainly due to the fact that previous studies usually focused on individual species and did not account for the influence of genomic context and the time of duplication. Here, we present a large-scale study in which we investigated duplicate retention for 9178 gene families shared between 37 flowering plant species, referred to as angiosperm core gene families. For most gene families, we observe a strikingly consistent pattern of gene duplicability across species, with gene families being either primarily single-copy or multicopy in all species. An intermediate class contains gene families that are often retained in duplicate for periods extending to tens of millions of years after whole-genome duplication, but ultimately appear to be largely restored to singleton status, suggesting that these genes may be dosage balance sensitive. The distinction between single-copy and multicopy gene families is reflected in their functional annotation, with single-copy genes being mainly involved in the maintenance of genome stability and organelle function and multicopy genes in signaling, transport, and metabolism. The intermediate class was overrepresented in regulatory genes, further suggesting that these represent putative dosage-balance-sensitive genes.
Affiliation(s)
- Zhen Li
- Department of Plant Systems Biology, VIB, B-9052 Ghent, Belgium; Department of Plant Biotechnology and Bioinformatics, Ghent University, B-9052 Ghent, Belgium; Bioinformatics Institute Ghent, Ghent University, B-9052 Ghent, Belgium
| | - Jonas Defoort
- Department of Plant Systems Biology, VIB, B-9052 Ghent, Belgium; Department of Plant Biotechnology and Bioinformatics, Ghent University, B-9052 Ghent, Belgium; Bioinformatics Institute Ghent, Ghent University, B-9052 Ghent, Belgium
| | - Setareh Tasdighian
- Department of Plant Systems Biology, VIB, B-9052 Ghent, Belgium; Department of Plant Biotechnology and Bioinformatics, Ghent University, B-9052 Ghent, Belgium; Bioinformatics Institute Ghent, Ghent University, B-9052 Ghent, Belgium
| | - Steven Maere
- Department of Plant Systems Biology, VIB, B-9052 Ghent, Belgium; Department of Plant Biotechnology and Bioinformatics, Ghent University, B-9052 Ghent, Belgium; Bioinformatics Institute Ghent, Ghent University, B-9052 Ghent, Belgium
| | - Yves Van de Peer
- Department of Plant Systems Biology, VIB, B-9052 Ghent, Belgium; Department of Plant Biotechnology and Bioinformatics, Ghent University, B-9052 Ghent, Belgium; Bioinformatics Institute Ghent, Ghent University, B-9052 Ghent, Belgium; Genomics Research Institute, University of Pretoria, Pretoria 0028, South Africa
| | - Riet De Smet
- Department of Plant Systems Biology, VIB, B-9052 Ghent, Belgium; Department of Plant Biotechnology and Bioinformatics, Ghent University, B-9052 Ghent, Belgium; Bioinformatics Institute Ghent, Ghent University, B-9052 Ghent, Belgium
|
30
|
Arenas M. Trends in substitution models of molecular evolution. Front Genet 2015; 6:319. [PMID: 26579193 PMCID: PMC4620419 DOI: 10.3389/fgene.2015.00319] [Citation(s) in RCA: 78] [Impact Index Per Article: 8.7] [Received: 07/27/2015] [Accepted: 10/09/2015] [Indexed: 11/13/2022] Open
Abstract
Substitution models of evolution describe the process of genetic variation through fixed mutations and constitute the basis of the evolutionary analysis at the molecular level. Almost 40 years after the development of first substitution models, highly sophisticated, and data-specific substitution models continue emerging with the aim of better mimicking real evolutionary processes. Here I describe current trends in substitution models of DNA, codon and amino acid sequence evolution, including advantages and pitfalls of the most popular models. The perspective concludes that despite the large number of currently available substitution models, further research is required for more realistic modeling, especially for DNA coding and amino acid data. Additionally, the development of more accurate complex models should be coupled with new implementations and improvements of methods and frameworks for substitution model selection and downstream evolutionary analysis.
Affiliation(s)
- Miguel Arenas
- Institute of Molecular Pathology and Immunology of the University of Porto, Porto, Portugal
|
31
|
Nazareno AG, Carlsen M, Lohmann LG. Complete Chloroplast Genome of Tanaecium tetragonolobum: The First Bignoniaceae Plastome. PLoS One 2015; 10:e0129930. [PMID: 26103589 PMCID: PMC4478014 DOI: 10.1371/journal.pone.0129930] [Citation(s) in RCA: 58] [Impact Index Per Article: 6.4] [Received: 03/06/2015] [Accepted: 05/13/2015] [Indexed: 12/13/2022] Open
Abstract
Bignoniaceae is a Pantropical plant family that is especially abundant in the Neotropics. Members of the Bignoniaceae are diverse in many ecosystems and represent key components of the Tropical flora. Despite the ecological importance of the Bignoniaceae and all the efforts to reconstruct the phylogeny of this group, whole chloroplast genome information has not yet been reported for any members of the family. Here, we report the complete chloroplast genome sequence of Tanaecium tetragonolobum (Jacq.) L.G. Lohmann, which was reconstructed using de novo and referenced-based assembly of single-end reads generated by shotgun sequencing of total genomic DNA in an Illumina platform. The gene order and organization of the chloroplast genome of T. tetragonolobum exhibits the general structure of flowering plants, and is similar to other Lamiales chloroplast genomes. The chloroplast genome of T. tetragonolobum is a circular molecule of 153,776 base pairs (bp) with a quadripartite structure containing two single copy regions, a large single copy region (LSC, 84,612 bp) and a small single copy region (SSC, 17,586 bp) separated by inverted repeat regions (IRs, 25,789 bp). In addition, the chloroplast genome of T. tetragonolobum has 38.3% GC content and includes 121 genes, of which 86 are protein-coding, 31 are transfer RNA, and four are ribosomal RNA. The chloroplast genome of T. tetragonolobum presents a total of 47 tandem repeats and 347 simple sequence repeats (SSRs) with mononucleotides being the most common and di-, tri-, tetra-, and hexanucleotides occurring with less frequency. The results obtained here were compared to other chloroplast genomes of Lamiales available to date, providing new insight into the evolution of chloroplast genomes within Lamiales. Overall, the evolutionary rates of genes in Lamiales are lineage-, locus-, and region-specific, indicating that the evolutionary pattern of nucleotide substitution in chloroplast genomes of flowering plants is complex. 
The discovery of tandem repeats within T. tetragonolobum and the presence of divergent regions between chloroplast genomes of Lamiales provides the basis for the development of markers at various taxonomic levels. The newly developed markers have the potential to greatly improve the resolution of molecular phylogenies.
Affiliation(s)
- Alison Gonçalves Nazareno
- Universidade de São Paulo, Instituto de Biociências, Departamento de Botânica, São Paulo, São Paulo, Brazil
| | - Monica Carlsen
- University of Missouri-St. Louis, Biology Department, St. Louis, Missouri, United States of America
| | - Lúcia Garcez Lohmann
- Universidade de São Paulo, Instituto de Biociências, Departamento de Botânica, São Paulo, São Paulo, Brazil
|
32
|
Luisi P, Alvarez-Ponce D, Pybus M, Fares MA, Bertranpetit J, Laayouni H. Recent positive selection has acted on genes encoding proteins with more interactions within the whole human interactome. Genome Biol Evol 2015; 7:1141-54. [PMID: 25840415 PMCID: PMC4419801 DOI: 10.1093/gbe/evv055] [Citation(s) in RCA: 50] [Impact Index Per Article: 5.6] [Indexed: 12/12/2022] Open
Abstract
Genes vary in their likelihood to undergo adaptive evolution. The genomic factors that determine adaptability, however, remain poorly understood. Genes function in the context of molecular networks, with some occupying more important positions than others and thus being likely to be under stronger selective pressures. However, how positive selection distributes across the different parts of molecular networks is still not fully understood. Here, we inferred positive selection using comparative genomics and population genetics approaches through the comparison of 10 mammalian and 270 human genomes, respectively. In agreement with previous results, we found that genes with lower network centralities are more likely to evolve under positive selection (as inferred from divergence data). Surprisingly, polymorphism data yield results in the opposite direction than divergence data: Genes with higher centralities are more likely to have been targeted by recent positive selection during recent human evolution. Our results indicate that the relationship between centrality and the impact of adaptive evolution highly depends on the mode of positive selection and/or the evolutionary time-scale.
Affiliation(s)
- Pierre Luisi
- Institute of Evolutionary Biology, Universitat Pompeu Fabra-CSIC, CEXS-UPF-PRBB, Barcelona, Catalonia, Spain
| | - David Alvarez-Ponce
- Integrative Systems Biology Group, Instituto de Biología Molecular y Celular de Plantas, Consejo Superior de Investigaciones Científicas (CSIC)-Universidad Politécnica de Valencia (UPV), Spain; Biology Department, University of Nevada, Reno; Institute of Evolutionary Biology, Universitat Pompeu Fabra-CSIC, CEXS-UPF-PRBB, Barcelona, Catalonia, Spain
| | - Marc Pybus
- Institute of Evolutionary Biology, Universitat Pompeu Fabra-CSIC, CEXS-UPF-PRBB, Barcelona, Catalonia, Spain
| | - Mario A Fares
- Integrative Systems Biology Group, Instituto de Biología Molecular y Celular de Plantas, Consejo Superior de Investigaciones Científicas (CSIC)-Universidad Politécnica de Valencia (UPV), Spain; Smurfit Institute of Genetics, University of Dublin, Trinity College, Ireland
| | - Jaume Bertranpetit
- Institute of Evolutionary Biology, Universitat Pompeu Fabra-CSIC, CEXS-UPF-PRBB, Barcelona, Catalonia, Spain
| | - Hafid Laayouni
- Institute of Evolutionary Biology, Universitat Pompeu Fabra-CSIC, CEXS-UPF-PRBB, Barcelona, Catalonia, Spain; Departament de Genètica i de Microbiologia, Grup de Biologia Evolutiva (GBE), Universitat Autònoma de Barcelona, Bellaterra, Spain
|
33
|
Fares MA. Experimental Evolution and Next Generation Sequencing Illuminate the Evolutionary Trajectories of Microbes. Advances in the Understanding of Biological Sciences Using Next Generation Sequencing (NGS) Approaches 2015:101-113. [DOI: 10.1007/978-3-319-17157-9_7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Indexed: 09/02/2023]
|
34
|
Guo Z, Jiang W, Lages N, Borcherds W, Wang D. Relationship between gene duplicability and diversifiability in the topology of biochemical networks. BMC Genomics 2014; 15:577. [PMID: 25005725 PMCID: PMC4129122 DOI: 10.1186/1471-2164-15-577] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Received: 01/29/2014] [Accepted: 06/26/2014] [Indexed: 01/21/2023] Open
Abstract
Background: Selective gene duplicability, the extensive expansion of a small number of gene families, is universal. Quantitatively, the number of genes $P(K)$ with $K$ duplicates in a genome decreases precipitously as $K$ increases, and often follows a power law ($P(K) \propto K^{-\alpha}$). Functional diversification, either neo- or sub-functionalization, is a major evolution route for duplicate genes. Results: Using three lines of genomic datasets, we studied the relationship between gene duplicability and diversifiability in the topology of biochemical networks. First, we explored scenario where two pathways in the biochemical networks antagonize each other. Synthetic knockout of respective genes for the two pathways rescues the phenotypic defects of each individual knockout. We identified duplicate gene pairs with sufficient divergences that represent this antagonism relationship in the yeast S. cerevisiae. Such pairs overwhelmingly belong to large gene families, thus tend to have high duplicability. Second, we used distances between proteins of duplicate genes in the protein interaction network as a metric of their diversification. The higher a gene’s duplicate count, the further the proteins of this gene and its duplicates drift away from one another in the networks, which is especially true for genetically antagonizing duplicate genes. Third, we computed a sequence-homology-based clustering coefficient to quantify sequence diversifiability among duplicate genes – the lower the coefficient, the more the sequences have diverged. Duplicate count (K) of a gene is negatively correlated to the clustering coefficient of its duplicates, suggesting that gene duplicability is related to the extent of sequence divergence within the duplicate gene family. Conclusion: Thus, a positive correlation exists between gene diversifiability and duplicability in the context of biochemical networks – an improvement of our understanding of gene duplicability.
Affiliation(s)
| | - Degeng Wang
- Greehey Children's Cancer Research Institute, University of Texas Health Science Center at San Antonio, 8403 Floyd Curl Drive, San Antonio, TX 78229-3900, USA.
|
35
|
Swanson EM, Snell-Rood EC. A Molecular Signaling Approach to Linking Intraspecific Variation and Macro-evolutionary Patterns. Integr Comp Biol 2014; 54:805-21. [DOI: 10.1093/icb/icu057] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Indexed: 02/06/2023] Open
|
36
|
Chakraborty S, Ghosh TC. Evolutionary rate heterogeneity of core and attachment proteins in yeast protein complexes. Genome Biol Evol 2013; 5:1366-75. [PMID: 23814130 PMCID: PMC3730348 DOI: 10.1093/gbe/evt096] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Indexed: 12/24/2022] Open
Abstract
In general, proteins do not work alone; they form macromolecular complexes to play fundamental roles in diverse cellular functions. On the basis of their iterative clustering procedure and frequency of occurrence in the macromolecular complexes, the protein subunits have been categorized as core and attachment. Core protein subunits are the main functional elements, whereas attachment proteins act as modifiers or activators in protein complexes. In this article, using the current data set of yeast protein complexes, we found that core proteins are evolving at a faster rate than attachment proteins in spite of their functional importance. Interestingly, our investigation revealed that attachment proteins are present in a higher number of macromolecular complexes than core proteins. We also observed that the protein complex number (defined as the number of protein complexes in which a protein subunit belongs) has a stronger influence on gene/protein essentiality than multifunctionality. Finally, our results suggest that the observed differences in the rates of protein evolution between core and attachment proteins are due to differences in protein complex number and expression level. Moreover, we conclude that proteins which are present in higher numbers of macromolecular complexes enhance their overall expression level by increasing their transcription rate as well as translation rate, and thus the protein complex number imposes a strong selection pressure on the evolution of yeast proteome.
|
37
|
Schumacher J, Rosenkranz D, Herlyn H. Mating systems and protein-protein interactions determine evolutionary rates of primate sperm proteins. Proc Biol Sci 2013; 281:20132607. [PMID: 24307672 PMCID: PMC3866406 DOI: 10.1098/rspb.2013.2607] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Indexed: 11/25/2022] Open
Abstract
To assess the relative impact of functional constraint and post-mating sexual selection on sequence evolution of reproductive proteins, we examined 169 primate sperm proteins. In order to recognize potential genome-wide trends, we additionally analysed a sample of altogether 318 non-reproductive (brain and postsynaptic) proteins. Based on cDNAs of eight primate species (Anthropoidea), we observed that pre-mating sperm proteins engaged in sperm composition and assembly show significantly lower incidence of site-specific positive selection and overall lower non-synonymous to synonymous substitution rates (dN/dS) across sites as compared with post-mating sperm proteins involved in capacitation, hyperactivation, acrosome reaction and fertilization. Moreover, database screening revealed overall more intracellular protein interaction partners in pre-mating than in post-mating sperm proteins. Finally, post-mating sperm proteins evolved at significantly higher evolutionary rates than pre-mating sperm and non-reproductive proteins on the branches to multi-male breeding species, while no such increase was observed on the branches to unimale and monogamous species. We conclude that less protein–protein interactions of post-mating sperm proteins account for lowered functional constraint, allowing for stronger impact of post-mating sexual selection, while the opposite holds true for pre-mating sperm proteins. This pattern is particularly strong in multi-male breeding species showing high female promiscuity.
Affiliation(s)
- Julia Schumacher
- Institute of Anthropology, University of Mainz, Anselm-Franz-von-Bentzel-Weg 7, 55099 Mainz, Germany
|
38
|
Davila-Velderrain J, Servin-Marquez A, Alvarez-Buylla ER. Molecular evolution constraints in the floral organ specification gene regulatory network module across 18 angiosperm genomes. Mol Biol Evol 2013; 31:560-73. [PMID: 24273325 DOI: 10.1093/molbev/mst223] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.3] [Indexed: 12/16/2022] Open
Abstract
The gene regulatory network of floral organ cell fate specification of Arabidopsis thaliana is a robust developmental regulatory module. Although such finding was proposed to explain the overall conservation of floral organ types and organization among angiosperms, it has not been confirmed that the network components are conserved at the molecular level among flowering plants. Using the genomic data that have accumulated, we address the conservation of the genes involved in this network and the forces that have shaped its evolution during the divergence of angiosperms. We recovered the network gene homologs for 18 species of flowering plants spanning nine families. We found that all the genes are highly conserved with no evidence of positive selection. We studied the sequence conservation features of the genes in the context of their known biological function and the strength of the purifying selection acting upon them in relation to their placement within the network. Our results suggest an association between protein length and sequence conservation, evolutionary rates, and functional category. On the other hand, we found no significant correlation between the strength of purifying selection and gene placement. Our results confirm that the studied robust developmental regulatory module has been subjected to strong functional constraints. However, unlike previous studies, our results do not support the notion that network topology plays a major role in constraining evolutionary rates. We speculate that the dynamical functional role of genes within the network and not just its connectivity could play an important role in constraining evolution.
|
39
|
Colombo M, Laayouni H, Invergo BM, Bertranpetit J, Montanucci L. Metabolic flux is a determinant of the evolutionary rates of enzyme-encoding genes. Evolution 2013; 68:605-13. [PMID: 24102646 DOI: 10.1111/evo.12262] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Received: 02/27/2013] [Accepted: 08/15/2013] [Indexed: 01/25/2023]
Abstract
Relationships between evolutionary rates and gene properties on a genomic, functional, pathway, or system level are being explored to unravel the principles of the evolutionary process. In particular, functional network properties have been analyzed to recognize the constraints they may impose on the evolutionary fate of genes. Here we took as a case study the core metabolic network in human erythrocytes and we analyzed the relationship between the evolutionary rates of its genes and the metabolic flux distribution throughout it. We found that metabolic flux correlates with the ratio of nonsynonymous to synonymous substitution rates. Genes encoding enzymes that carry high fluxes have been more constrained in their evolution, while purifying selection is more relaxed in genes encoding enzymes carrying low metabolic fluxes. These results demonstrate the importance of considering the dynamical functioning of gene networks when assessing the action of selection on system-level properties.
Affiliation(s)
- Martino Colombo
- Institute of Evolutionary Biology (CSIC- Pompeu Fabra University), CEXS-UPF-PRBB, Dr. Aiguader 88, 08003 Barcelona, Catalonia, Spain
|