1
|
Pollet L, Xia Y. Structure-guided Evolutionary Analysis of Interactome Network Rewiring at Single Residue Resolution in Yeasts. J Mol Biol 2024; 436:168641. [PMID: 38844045 DOI: 10.1016/j.jmb.2024.168641] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2024] [Revised: 04/30/2024] [Accepted: 06/01/2024] [Indexed: 06/16/2024]
Abstract
Protein-protein interactions (PPIs) are known to rewire extensively during evolution leading to lineage-specific and species-specific changes in molecular processes. However, the detailed molecular evolutionary mechanisms underlying interactome network rewiring are not well-understood. Here, we combine high-confidence PPI data, high-resolution three-dimensional structures of protein complexes, and homology-based structural annotation transfer to construct structurally-resolved interactome networks for the two yeasts S. cerevisiae and S. pombe. We then classify PPIs according to whether they are preserved or different between the two yeast species and compare site-specific evolutionary rates of interfacial versus non-interfacial residues for these different categories of PPIs. We find that residues in PPI interfaces evolve significantly more slowly than non-interfacial residues when using lineage-specific measures of evolutionary rate, but not when using non-lineage-specific measures. Furthermore, both lineage-specific and non-lineage-specific evolutionary rate measures can distinguish interfacial residues from non-interfacial residues for preserved PPIs between the two yeasts, but only the lineage-specific measure is appropriate for rewired PPIs. Finally, both lineage-specific and non-lineage-specific evolutionary rate measures are appropriate for elucidating structural determinants of protein evolution for residues outside of PPI interfaces. Overall, our results demonstrate that unlike tertiary structures of single proteins, PPIs and PPI interfaces can be highly volatile in their evolution, thus requiring the use of lineage-specific measures when studying their evolution. These results yield insight into the evolutionary design principles of PPIs and the mechanisms by which interactions are preserved or rewired between species, improving our understanding of the molecular evolution of PPIs and PPI interfaces at the residue level.
Collapse
Affiliation(s)
- Léah Pollet
- Department of Bioengineering, Faculty of Engineering, McGill University, Montreal, QC, Canada
| | - Yu Xia
- Department of Bioengineering, Faculty of Engineering, McGill University, Montreal, QC, Canada.
| |
Collapse
|
2
|
Shi T, Gao Z, Chen J, Van de Peer Y. Dosage sensitivity shapes balanced expression and gene longevity of homoeologs after whole-genome duplications in angiosperms. THE PLANT CELL 2024:koae227. [PMID: 39121058 PMCID: PMC7616505 DOI: 10.1093/plcell/koae227] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/09/2024] [Revised: 07/23/2024] [Accepted: 08/07/2024] [Indexed: 08/11/2024]
Abstract
Following whole-genome duplication (WGD), duplicate gene pairs (homoeologs) can evolve varying degrees of expression divergence. However, the determinants influencing these relative expression level differences (RFPKM) between homoeologs remain elusive. Here, we analyzed the RFPKM between homoeologs in three angiosperms, Nymphaea colorata, Nelumbo nucifera, and Acorus tatarinowii, all having undergone a single WGD since the origin of angiosperms. Our results show significant positive correlations in RFPKM of homoeologs among tissues within the same species, and among orthologs across these three species, indicating convergent expression balance/bias between homoeologous gene copies following independent WGDs. We linked RFPKM between homoeologs to gene attributes associated with dosage balance constraints, such as protein-protein interactions, lethal-phenotype scores in Arabidopsis (Arabidopsis thaliana) orthologs, domain numbers, and expression breadth. Notably, homoeologs with lower RFPKM often had more interactions and higher lethal-phenotype scores, indicating selective pressures favoring balanced expression. Also, homoeologs with lower RFPKM were more likely to be retained after WGDs in angiosperms. Within Nelumbo, greater RFPKM between homoeologs correlated with increased cis- and trans-regulatory differentiation between species, highlighting the ongoing escalation of gene expression divergence. We further found that expression degeneration in one copy of homoeologs is inclined towards nonfunctionalization. Our research highlights the importance of balanced expression, shaped by dosage balance constraints, in the evolutionary retention of homoeologs in plants.
Collapse
Affiliation(s)
- Tao Shi
- Aquatic Plant Research Center, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan 430074, China
| | - Zhiyan Gao
- Aquatic Plant Research Center, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan 430074, China
| | - Jinming Chen
- Aquatic Plant Research Center, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan 430074, China
| | - Yves Van de Peer
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium
- Centre for Plant Systems Biology, VIB, Ghent, Belgium
- Department of Biochemistry, Genetics and Microbiology, University of Pretoria, 0028 Pretoria, South Africa
- College of Horticulture, Academy for Advanced Interdisciplinary Studies, Nanjing Agricultural University, 210095 Nanjing, China
| |
Collapse
|
3
|
Hayashi K, Fernie AR. A neat wheat trick to hide genes from selection. TRENDS IN PLANT SCIENCE 2024; 29:837-838. [PMID: 38538389 DOI: 10.1016/j.tplants.2024.03.014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/31/2024] [Revised: 03/14/2024] [Accepted: 03/14/2024] [Indexed: 08/10/2024]
Abstract
A recent study by Li et al. demonstrated that the removal of like heterochromatin protein 1 (LHP1) in common wheat causes developmental drawbacks, yet confers resistance to stripe rust infection. Due to its role in regulating diversified defense genes, LHP1 was suggested to be an epigenetic gatekeeper potentially promoting adaptive evolution in allopolyploid wheat.
Collapse
Affiliation(s)
- Koki Hayashi
- Max-Planck-Institute of Molecular Plant Physiology, Am Mühlenberg 1, 14476 Potsdam-Golm, Germany
| | - Alisdair R Fernie
- Max-Planck-Institute of Molecular Plant Physiology, Am Mühlenberg 1, 14476 Potsdam-Golm, Germany.
| |
Collapse
|
4
|
Jiang D, Kejiou N, Qiu Y, Palazzo AF, Pennell M. Genetic and selective constraints on the optimization of gene product diversity. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.07.17.603951. [PMID: 39091777 PMCID: PMC11291005 DOI: 10.1101/2024.07.17.603951] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/04/2024]
Abstract
RNA and protein expressed from the same gene can have diverse isoforms due to various post-transcriptional and post-translational modifications. For the vast majority of alternative isoforms, It is unknown whether they are adaptive or simply biological noise. As we cannot experimentally probe the function of each isoform, we can ask whether the distribution of isoforms across genes and across species is consistent with expectations from different evolutionary processes. However, there is currently no theoretical framework that can generate such predictions. To address this, we developed a mathematical model where isoform abundances are determined collectively by cis-acting loci, trans-acting factors, gene expression levels, and isoform decay rates to predict isoform abundance distributions across species and genes in the face of mutation, genetic drift, and selection. We found that factors beyond selection, such as effective population size and the number of cis-acting loci, significantly influence evolutionary outcomes. Notably, suboptimal phenotypes are more likely to evolve when the population is small and/or when the number of cis-loci is large. We also explored scenarios where modification processes have both beneficial and detrimental effects, revealing a non-monotonic relationship between effective population size and optimization, demonstrating how opposing selection pressures on cis- and trans-acting loci can constrain the optimization of gene product diversity. As a demonstration of the power of our theory, we compared the expected distribution of A-to-I RNA editing levels in coleoids and found this to be largely consistent with non-adaptive explanations.
Collapse
Affiliation(s)
- Daohan Jiang
- Department of Quantitative and Computational Biology, University of Southern California, USA
| | - Nevraj Kejiou
- Department of Biochemistry, University of Toronto, Canada
| | - Yi Qiu
- Department of Biochemistry, University of Toronto, Canada
| | | | - Matt Pennell
- Department of Quantitative and Computational Biology, University of Southern California, USA
- Department of Biological Sciences, University of Southern California, USA
| |
Collapse
|
5
|
Yuan B, He G, Dong W. The first complete mitochondrial genome of the genus Laelaps with novel gene arrangement reveals extensive rearrangement and phylogenetics in the superfamily Dermanyssoidea. EXPERIMENTAL & APPLIED ACAROLOGY 2024:10.1007/s10493-024-00943-2. [PMID: 39017744 DOI: 10.1007/s10493-024-00943-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/05/2023] [Accepted: 06/26/2024] [Indexed: 07/18/2024]
Abstract
We collected 56 specimens of Laelaps chini from the endemic Hengduan Mountain rat species (Eothenomys miletus) and obtained the first complete mitochondrial genome of L. chini by next-generation sequencing (NGS). The L. chini mitogenome is 16,507 bp in size and contains 37 genes and a control region of 2380 bp in length. The L. chini mitogenome has a high AT content and a compact arrangement with four overlapping regions ranging from 1 to 2 bp and 16 spacer regions ranging from 1 to 48 bp. We analyzed 13 protein-coding genes of L. chini mitogenome and found that protein-coding genes in the L. chini mitogenome preferred codons ending in A/U and codon usage pattern was mainly influenced by natural selection. Cox1 has the slowest evolution rate and cox3 has the fastest evolution rate. We combined the mitochondrial genome of eight species of gamasid mites in the superfamily Dermanyssoidea from Genbank and the L. chini mitochondrial genome to analyze its rearrangement patterns and breakpoint numbers. We found that the L. chini mitogenome showed a novel arrangement pattern and nine species of gamasid mites in the superfamily Dermanyssoidea, which have been sequenced complete mitochondrial genomes to date, all showed different degrees of rearrangement. Laelaps chini, Echinolaelaps echidninus and Echinolaelaps fukinenensis were closely related species based on genetic distance and phylogenetic analyses. Notably they are clustered with Varroa destructor of the family Varroidae, suggesting that the family Varroidae is more closely related to the family Laelapidae, but more data are needed to test whether Varroa can be classified under the family Laelapidae. The L. chini mitogenome is the first complete mitochondrial genome for the genus Laelaps, and contributes to further exploration of the mitochondrial gene rearrangements and phylogeny for the superfamily Dermanyssoidea.
Collapse
Affiliation(s)
- Bili Yuan
- Yunnan Provincial Key Laboratory for Zoonosis Control and Prevention, Institute of Pathogens and Vectors, Dali University, Dali, 671000, Yunnan, China
| | - Gangxian He
- Yunnan Provincial Key Laboratory for Zoonosis Control and Prevention, Institute of Pathogens and Vectors, Dali University, Dali, 671000, Yunnan, China
| | - Wenge Dong
- Yunnan Provincial Key Laboratory for Zoonosis Control and Prevention, Institute of Pathogens and Vectors, Dali University, Dali, 671000, Yunnan, China.
| |
Collapse
|
6
|
Okude G, Yamasaki YY, Toyoda A, Mori S, Kitano J. Genome-wide analysis of histone modifications can contribute to the identification of candidate cis-regulatory regions in the threespine stickleback fish. BMC Genomics 2024; 25:685. [PMID: 38992624 PMCID: PMC11241946 DOI: 10.1186/s12864-024-10602-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2024] [Accepted: 07/08/2024] [Indexed: 07/13/2024] Open
Abstract
BACKGROUND Cis-regulatory mutations often underlie phenotypic evolution. However, because identifying the locations of promoters and enhancers in non-coding regions is challenging, we have fewer examples of identified causative cis-regulatory mutations that underlie naturally occurring phenotypic variations than of causative amino acid-altering mutations. Because cis-regulatory elements have epigenetic marks of specific histone modifications, we can detect cis-regulatory elements by mapping and analyzing them. Here, we investigated histone modifications and chromatin accessibility with cleavage under targets and tagmentation (CUT&Tag) and assay for transposase-accessible chromatin-sequencing (ATAC-seq). RESULTS Using the threespine stickleback (Gasterosteus aculeatus) as a model, we confirmed that the genes for which nearby regions showed active marks, such as H3K4me1, H3K4me3, and high chromatin accessibility, were highly expressed. In contrast, the expression levels of genes for which nearby regions showed repressive marks, such as H3K27me3, were reduced, suggesting that our chromatin analysis protocols overall worked well. Genomic regions with peaks of histone modifications showed higher nucleotide diversity within and between populations. By comparing gene expression in the gills of the marine and stream ecotypes, we identified several insertions and deletions (indels) with transposable element fragments in the candidate cis-regulatory regions. CONCLUSIONS Thus, mapping and analyzing histone modifications can help identify cis-regulatory elements and accelerate the identification of causative mutations in the non-coding regions underlying naturally occurring phenotypic variations.
Collapse
Affiliation(s)
- Genta Okude
- Ecological Genetics Laboratory, National Institute of Genetics, Yata 1111, Mishima, Shizuoka, 411-8540, Japan.
| | - Yo Y Yamasaki
- Ecological Genetics Laboratory, National Institute of Genetics, Yata 1111, Mishima, Shizuoka, 411-8540, Japan
| | - Atsushi Toyoda
- Comparative Genetics Laboratory, National Institute of Genetics, Mishima, Shizuoka, Japan
| | - Seiichi Mori
- Faculty of Economics, Gifu-Kyoritsu University, Ogaki, Gifu, Japan
| | - Jun Kitano
- Ecological Genetics Laboratory, National Institute of Genetics, Yata 1111, Mishima, Shizuoka, 411-8540, Japan.
| |
Collapse
|
7
|
Chen P, Zhang J. The loci of environmental adaptation in a model eukaryote. Nat Commun 2024; 15:5672. [PMID: 38971805 PMCID: PMC11227561 DOI: 10.1038/s41467-024-50002-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2023] [Accepted: 06/25/2024] [Indexed: 07/08/2024] Open
Abstract
While the underlying genetic changes have been uncovered in some cases of adaptive evolution, the lack of a systematic study prevents a general understanding of the genomic basis of adaptation. For example, it is unclear whether protein-coding or noncoding mutations are more important to adaptive evolution and whether adaptations to different environments are brought by genetic changes distributed in diverse genes and biological processes or concentrated in a core set. We here perform laboratory evolution of 3360 Saccharomyces cerevisiae populations in 252 environments of varying levels of stress. We find the yeast adaptations to be primarily fueled by large-effect coding mutations overrepresented in a relatively small gene set, despite prevalent antagonistic pleiotropy across environments. Populations generally adapt faster in more stressful environments, partly because of greater benefits of the same mutations in more stressful environments. These and other findings from this model eukaryote help unravel the genomic principles of environmental adaptation.
Collapse
Affiliation(s)
- Piaopiao Chen
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, Michigan, 48109, USA
- College of Life Sciences, Zhejiang University, Hangzhou, 310058, China
| | - Jianzhi Zhang
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, Michigan, 48109, USA.
| |
Collapse
|
8
|
Papachristos K, Sayadi A, Arnqvist G. Comparative Genomic Analysis of the Pattern of Evolution of Male and Female Reproductive Proteins in Seed Beetles. Genome Biol Evol 2024; 16:evae143. [PMID: 38941482 PMCID: PMC11251426 DOI: 10.1093/gbe/evae143] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2023] [Revised: 06/10/2024] [Accepted: 06/26/2024] [Indexed: 06/30/2024] Open
Abstract
Male seminal fluid proteins often show signs of positive selection and divergent evolution, believed to reflect male-female coevolution. Yet, our understanding of the predicted concerted evolution of seminal fluid proteins and female reproductive proteins is limited. We sequenced, assembled, and annotated the genome of two species of seed beetles allowing a comparative analysis of four closely related species of these herbivorous insects. We compare the general pattern of evolution in genes encoding seminal fluid proteins and female reproductive proteins with those in digestive protein genes and well-conserved reference genes. We found that female reproductive proteins showed an overall ratio of nonsynonymous to synonymous substitutions (ω) similar to that of conserved genes, while seminal fluid proteins and digestive proteins exhibited higher overall ω values. Further, seminal fluid proteins and digestive proteins showed a higher proportion of sites putatively under positive selection, and explicit tests showed no difference in relaxed selection between protein types. Evolutionary rate covariation analyses showed that evolutionary rates among seminal fluid proteins were on average more closely correlated with those in female reproductive proteins than with either digestive or conserved genes. Gene expression showed the expected negative covariation with ω values, except for male-biased genes where this negative relationship was reversed. In conclusion, seminal fluid proteins showed relatively rapid evolution and signs of positive selection. In contrast, female reproductive proteins evolved at a lower rate under selective constraints, on par with genes known to be well conserved. Although our findings provide support for concerted evolution of seminal fluid proteins and female reproductive proteins, they also suggest that these two classes of proteins evolve under partly distinct selective regimes.
Collapse
Affiliation(s)
| | - Ahmed Sayadi
- Rheumatology, Department of Medical Sciences, Uppsala University, Uppsala, Sweden
| | - Göran Arnqvist
- Animal Ecology, Department of Ecology and Genetics, Uppsala University, Uppsala, Sweden
| |
Collapse
|
9
|
Potera K, Tomala K. Using yeasts for the studies of nonfunctional factors in protein evolution. Yeast 2024. [PMID: 38895906 DOI: 10.1002/yea.3970] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2024] [Revised: 05/08/2024] [Accepted: 06/06/2024] [Indexed: 06/21/2024] Open
Abstract
The evolution of protein sequence is driven not only by factors directly related to protein function and shape but also by nonfunctional factors. Such factors in protein evolution might be categorized as those connected to energetic costs, synthesis efficiency, and avoidance of misfolding and toxicity. A common approach to studying them is correlational analysis contrasting them with some characteristics of the protein, like amino acid composition, but these features are interdependent. To avoid possible bias, empirical studies are needed, and not enough work has been done to date. In this review, we describe the role of nonfunctional factors in protein evolution and present an experimental approach using yeast as a suitable model organism. The focus of the proposed approach is on the potential negative impact on the fitness of mutations that change protein properties not related to function and the frequency of mutations that change these properties. Experimental results of testing the misfolding avoidance hypothesis as an explanation for why highly expressed proteins evolve slowly are inconsistent with correlational research results. Therefore, more efforts should be made to empirically test the effects of nonfunctional factors in protein evolution and to contrast these results with the results of the correlational analysis approach.
Collapse
Affiliation(s)
- Katarzyna Potera
- Faculty of Biology, Institute of Environmental Sciences, Jagiellonian University, Krakow, Poland
- Doctoral School of Exact and Natural Sciences, Jagiellonian University, Krakow, Poland
| | - Katarzyna Tomala
- Faculty of Biology, Institute of Environmental Sciences, Jagiellonian University, Krakow, Poland
| |
Collapse
|
10
|
Tomasch J, Kopejtka K, Shivaramu S, Mujakić I, Koblížek M. On the evolution of chromosomal regions with high gene strand bias in bacteria. mBio 2024; 15:e0060224. [PMID: 38752745 PMCID: PMC11237797 DOI: 10.1128/mbio.00602-24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2024] [Accepted: 04/17/2024] [Indexed: 06/13/2024] Open
Abstract
On circular bacterial chromosomes, the majority of genes are coded on the leading strand. This gene strand bias (GSB) ranges from up to 85% in some Bacillota to a little more than 50% in other phyla. The factors determining the extent of the strand bias remain to be found. Here, we report that species in the phylum Gemmatimonadota share a unique chromosome architecture, distinct from neighboring phyla: in a conserved 600-kb region around the terminus of replication, almost all genes were located on the leading strands, while on the remaining part of the chromosome, the strand preference was more balanced. The high strand bias (HSB) region harbors the rRNA clusters, core, and highly expressed genes. Selective pressure for reduction of collisions with DNA replication to minimize detrimental mutations can explain the conservation of essential genes in this region. Repetitive and mobile elements are underrepresented, suggesting reduced recombination frequency by structural isolation from other parts of the chromosome. We propose that the HSB region forms a distinct chromosomal domain. Gemmatimonadota chromosomes evolved mainly by expansion through horizontal gene transfer and duplications outside of the ancient high strand bias region. In support of our hypothesis, we could further identify two Spiroplasma strains on a similar evolutionary path.IMPORTANCEOn bacterial chromosomes, a preferred location of genes on the leading strand has evolved to reduce conflicts between replication and transcription. Despite a vast body of research, the question why bacteria show large differences in their gene strand bias is still not solved. The discovery of "hybrid" chromosomes in different phyla, including Gemmatimonadota, in which a conserved high strand bias is found exclusively in a region at ter, points toward a role of nucleoid structure, additional to replication, in the evolution of strand preferences. A fine-grained structural analysis of the ever-increasing number of available bacterial genomes could help to better understand the forces that shape the sequential and spatial organization of the cell's information content.
Collapse
Affiliation(s)
- Jürgen Tomasch
- Laboratory of Anoxygenic Phototrophs, Institute of Microbiology of the Czech Academy of Sciences, Třeboň, Czechia
| | - Karel Kopejtka
- Laboratory of Anoxygenic Phototrophs, Institute of Microbiology of the Czech Academy of Sciences, Třeboň, Czechia
| | - Sahana Shivaramu
- Laboratory of Anoxygenic Phototrophs, Institute of Microbiology of the Czech Academy of Sciences, Třeboň, Czechia
| | - Izabela Mujakić
- Laboratory of Anoxygenic Phototrophs, Institute of Microbiology of the Czech Academy of Sciences, Třeboň, Czechia
| | - Michal Koblížek
- Laboratory of Anoxygenic Phototrophs, Institute of Microbiology of the Czech Academy of Sciences, Třeboň, Czechia
| |
Collapse
|
11
|
Akeju OJ, Cope AL. Re-examining Correlations Between Synonymous Codon Usage and Protein Bond Angles in Escherichia coli. Genome Biol Evol 2024; 16:evae080. [PMID: 38619010 PMCID: PMC11077309 DOI: 10.1093/gbe/evae080] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2023] [Revised: 04/05/2024] [Accepted: 04/10/2024] [Indexed: 04/16/2024] Open
Abstract
Rosenberg AA, Marx A, Bronstein AM (Codon-specific Ramachandran plots show amino acid backbone conformation depends on identity of the translated codon. Nat Commun. 2022:13:2815) recently found a surprising correlation between synonymous codon usage and the dihedral bond angles of the resulting amino acid. However, their analysis did not account for the strongest known correlate of codon usage: gene expression. We re-examined the relationship between bond angles and codon usage by applying the approach of Rosenberg et al. to simulated protein-coding sequences that (i) have random codon usage, (ii) codon usage determined by mutation biases, and (iii) maintain the general relationship between codon usage and gene expression via the assumption of selection-mutation-drift equilibrium. We observed correlations between dihedral bond angle and codon usage when codon usage is entirely random, indicating possible conflation of noise with differences in bond angle distributions between synonymous codons. More relevant to the general analysis of codon usage patterns, we found surprisingly good agreement between the analysis of the real sequences and the analysis of sequences simulated assuming selection-mutation-drift equilibrium, with 91% of significant synonymous codon pairs detected in the former were also detected in the latter. We believe the correlation between codon usage and dihedral bond angles resulted from the variation in codon usage across genes due to the interplay between mutation bias, natural selection for translation efficiency, and gene expression, further underscoring these factors must be controlled for when looking for novel patterns related to codon usage.
Collapse
Affiliation(s)
| | - Alexander L Cope
- Department of Genetics, Rutgers University, Piscataway, New Jersey, USA
- Human Genetics Institute of New Jersey, Rutgers University, Piscataway, New Jersey, USA
- Robert Wood Johnson Medical School, Rutgers University, Piscataway, New Jersey, USA
| |
Collapse
|
12
|
Ames M, Hamernik A, Behling W, Douches DS, Halterman DA, Bethke PC. A survey of the Sli gene in wild and cultivated potato. PLANT DIRECT 2024; 8:e589. [PMID: 38766508 PMCID: PMC11099725 DOI: 10.1002/pld3.589] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/29/2023] [Revised: 02/27/2024] [Accepted: 04/18/2024] [Indexed: 05/22/2024]
Abstract
Inbred-hybrid breeding of diploid potatoes necessitates breeding lines that are self-compatible. One way of incorporating self-compatibility into incompatible cultivated potato (Solanum tuberosum) germplasm is to introduce the S-locus inhibitor gene (Sli), which functions as a dominant inhibitor of gametophytic self-incompatibility. To learn more about Sli diversity and function in wild species relatives of cultivated potato, we obtained Sli gene sequences that extended from the 5'UTR to the 3'UTR from 133 individuals from 22 wild species relatives of potato and eight diverse cultivated potato clones. DNA sequence alignment and phylogenetic trees based on genomic and protein sequences show that there are two highly conserved groups of Sli sequences. DNA sequences in one group contain the 533 bp insertion upstream of the start codon identified previously in self-compatible potato. The second group lacks the insertion. Three diploid and four polyploid individuals of wild species collected from geographically disjointed localities contained Sli with the 533 bp insertion. For most of the wild species clones examined, however, Sli did not have the insertion. Phylogenetic analysis indicated that Sli sequences with the insertion, in wild species and in cultivated clones, trace back to a single origin. Some diploid wild potatoes that have Sli with the insertion were self-incompatible and some wild potatoes that lack the insertion were self-compatible. Although there is evidence of positive selection for some codon positions in Sli, there is no evidence of diversifying selection at the gene level. In silico analysis of Sli protein structure did not support the hypothesis that amino acid changes from wild-type (no insertion) to insertion-type account for changes in protein function. Our study demonstrated that genetic factors besides the Sli gene must be important for conditioning a switch in the mating system from self-incompatible to self-compatible in wild potatoes.
Collapse
Affiliation(s)
- Mercedes Ames
- US Department of Agriculture, Agricultural Research Service, Vegetable Crops Research Unit, Department of HorticultureUniversity of WisconsinMadisonWisconsinUSA
| | - Andy Hamernik
- US Department of Agriculture, Agricultural Research Service, Vegetable Crops Research Unit, Department of HorticultureUniversity of WisconsinMadisonWisconsinUSA
| | - William Behling
- Department of Plant, Soil and Microbial SciencesMichigan State UniversityEast LansingMichiganUSA
| | - David S. Douches
- Department of Plant, Soil and Microbial SciencesMichigan State UniversityEast LansingMichiganUSA
| | - Dennis A. Halterman
- US Department of Agriculture, Agricultural Research Service, Vegetable Crops Research Unit, Department of HorticultureUniversity of WisconsinMadisonWisconsinUSA
| | - Paul C. Bethke
- US Department of Agriculture, Agricultural Research Service, Vegetable Crops Research Unit, Department of HorticultureUniversity of WisconsinMadisonWisconsinUSA
| |
Collapse
|
13
|
Ali F. Patterns of Change in Nucleotide Diversity Over Gene Length. Genome Biol Evol 2024; 16:evae078. [PMID: 38608148 PMCID: PMC11040516 DOI: 10.1093/gbe/evae078] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2023] [Revised: 03/26/2024] [Accepted: 04/03/2024] [Indexed: 04/14/2024] Open
Abstract
Nucleotide diversity at a site is influenced by the relative strengths of neutral and selective population genetic processes. Therefore, attempts to estimate Effective population size based on the diversity of synonymous sites demand a better understanding of their selective constraints. The nucleotide diversity of a gene was previously found to correlate with its length. In this work, I measure nucleotide diversity at synonymous sites and uncover a pattern of low diversity towards the translation initiation site of a gene. The degree of reduction in diversity at the translation initiation site and the length of this region of reduced diversity can be quantified as "Effect Size" and "Effect Length" respectively, using parameters of an asymptotic regression model. Estimates of Effect Length across bacteria covaried with recombination rates as well as with a multitude of translation-associated traits such as the avoidance of mRNA secondary structure around translation initiation site, the number of rRNAs, and relative codon usage of ribosomal genes. Evolutionary simulations under purifying selection reproduce the observed patterns and diversity-length correlation and highlight that selective constraints on the 5'-region of a gene may be more extensive than previously believed. These results have implications for the estimation of effective population size, and relative mutation rates, and for genome scans of genes under positive selection based on "silent-site" diversity.
Collapse
Affiliation(s)
- Farhan Ali
- Biodesign Center for Mechanisms of Evolution, Arizona State University, Tempe, AZ 85281, USA
| |
Collapse
|
14
|
Teyssonniere EM, Shichino Y, Mito M, Friedrich A, Iwasaki S, Schacherer J. Translation variation across genetic backgrounds reveals a post-transcriptional buffering signature in yeast. Nucleic Acids Res 2024; 52:2434-2445. [PMID: 38261993 PMCID: PMC10954453 DOI: 10.1093/nar/gkae030] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2023] [Revised: 12/21/2023] [Accepted: 01/11/2024] [Indexed: 01/25/2024] Open
Abstract
Gene expression is known to vary among individuals, and this variability can impact the phenotypic diversity observed in natural populations. While the transcriptome and proteome have been extensively studied, little is known about the translation process itself. Here, we therefore performed ribosome and transcriptomic profiling on a genetically and ecologically diverse set of natural isolates of the Saccharomyces cerevisiae yeast. Interestingly, we found that the Euclidean distances between each profile and the expression fold changes in each pairwise isolate comparison were higher at the transcriptomic level. This observation clearly indicates that the transcriptional variation observed in the different isolates is buffered through a phenomenon known as post-transcriptional buffering at the translation level. Furthermore, this phenomenon seemed to have a specific signature by preferentially affecting essential genes as well as genes involved in complex-forming proteins, and low transcribed genes. We also explored the translation of the S. cerevisiae pangenome and found that the accessory genes related to introgression events displayed similar transcription and translation levels as the core genome. By contrast, genes acquired through horizontal gene transfer events tended to be less efficiently translated. Together, our results highlight both the extent and signature of the post-transcriptional buffering.
Collapse
Affiliation(s)
| | - Yuichi Shichino
- RNA Systems Biochemistry Laboratory, RIKEN Cluster for Pioneering Research, Wako, Saitama 351-0198, Japan
| | - Mari Mito
- RNA Systems Biochemistry Laboratory, RIKEN Cluster for Pioneering Research, Wako, Saitama 351-0198, Japan
| | - Anne Friedrich
- Université de Strasbourg, CNRS, GMGM UMR, 7156 Strasbourg, France
| | - Shintaro Iwasaki
- RNA Systems Biochemistry Laboratory, RIKEN Cluster for Pioneering Research, Wako, Saitama 351-0198, Japan
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Kashiwa, Chiba 277-8561, Japan
| | - Joseph Schacherer
- Université de Strasbourg, CNRS, GMGM UMR, 7156 Strasbourg, France
- Institut Universitaire de France (IUF), Paris, France
| |
Collapse
|
15
|
Liao BY, Weng MP, Chang TY, Chang AYF, Ching YH, Wu CH. Degeneration of the Olfactory System in a Murid Rodent that Evolved Diurnalism. Mol Biol Evol 2024; 41:msae037. [PMID: 38376543 PMCID: PMC10906987 DOI: 10.1093/molbev/msae037] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2022] [Revised: 01/03/2024] [Accepted: 02/13/2024] [Indexed: 02/21/2024] Open
Abstract
In mammalian research, it has been debated what can initiate an evolutionary tradeoff between different senses, and the phenomenon of sensory tradeoff in rodents, the most abundant mammalian clade, is not evident. The Nile rat (Arvicanthis niloticus), a murid rodent, recently adapted to a diurnal niche through an evolutionary acquisition of daylight vision with enhanced visual acuity. As such, this model provides an opportunity for a cross-species investigation where comparative morphological and multi-omic analyses of the Nile rat are made with its closely related nocturnal species, e.g. the mouse (Mus musculus) and the rat (Rattus norvegicus). Thus, morphological examinations were performed, and evolutionary reductions in relative sizes of turbinal bone surfaces, the cribriform plate, and the olfactory bulb were discovered in Nile rats. Subsequently, we compared multiple murid genomes, and profiled olfactory epithelium transcriptomes of mice and Nile rats at various ages with RNA sequencing. The results further demonstrate that, in comparison with mouse olfactory receptor (OR) genes, Nile rat OR genes have experienced less frequent gain, more frequent loss, and more frequent expression reduction during their evolution. Furthermore, functional degeneration of coding sequences in the Nile rat lineage was found in OR genes, yet not in other genes. Taken together, these results suggest that acquisition of improved vision in the Nile rat has been accompanied by degeneration of both olfaction-related anatomical structures and OR gene repertoires, consistent with the hypothesis of an olfaction-vision tradeoff initiated by the switch from a nocturnal to a diurnal lifestyle in mammals.
Collapse
Affiliation(s)
- Ben-Yang Liao
- Institute of Population Health Sciences, National Health Research Institutes, Taiwan, Republic of China
| | - Meng-Pin Weng
- Institute of Population Health Sciences, National Health Research Institutes, Taiwan, Republic of China
| | - Ting-Yan Chang
- Institute of Population Health Sciences, National Health Research Institutes, Taiwan, Republic of China
| | - Andrew Ying-Fei Chang
- Institute of Population Health Sciences, National Health Research Institutes, Taiwan, Republic of China
| | - Yung-Hao Ching
- Department of Molecular Biology and Human Genetics, Tzu Chi University, Taiwan, Republic of China
| | - Chia-Hwa Wu
- Laboratory Animal Center, National Health Research Institutes, Taiwan, Republic of China
| |
Collapse
|
16
|
Balogun EJ, Ness RW. The Effects of De Novo Mutation on Gene Expression and the Consequences for Fitness in Chlamydomonas reinhardtii. Mol Biol Evol 2024; 41:msae035. [PMID: 38366781 PMCID: PMC10910851 DOI: 10.1093/molbev/msae035] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2023] [Revised: 02/01/2024] [Accepted: 02/13/2024] [Indexed: 02/18/2024] Open
Abstract
Mutation is the ultimate source of genetic variation, the bedrock of evolution. Yet, predicting the consequences of new mutations remains a challenge in biology. Gene expression provides a potential link between a genotype and its phenotype. But the variation in gene expression created by de novo mutation and the fitness consequences of mutational changes to expression remain relatively unexplored. Here, we investigate the effects of >2,600 de novo mutations on gene expression across the transcriptome of 28 mutation accumulation lines derived from 2 independent wild-type genotypes of the green algae Chlamydomonas reinhardtii. We observed that the amount of genetic variance in gene expression created by mutation (Vm) was similar to the variance that mutation generates in typical polygenic phenotypic traits and approximately 15-fold the variance seen in the limited species where Vm in gene expression has been estimated. Despite the clear effect of mutation on expression, we did not observe a simple additive effect of mutation on expression change, with no linear correlation between the total expression change and mutation count of individual MA lines. We therefore inferred the distribution of expression effects of new mutations to connect the number of mutations to the number of differentially expressed genes (DEGs). Our inferred DEE is highly L-shaped with 95% of mutations causing 0-1 DEG while the remaining 5% are spread over a long tail of large effect mutations that cause multiple genes to change expression. The distribution is consistent with many cis-acting mutation targets that affect the expression of only 1 gene and a large target of trans-acting targets that have the potential to affect tens or hundreds of genes. Further evidence for cis-acting mutations can be seen in the overabundance of mutations in or near differentially expressed genes. Supporting evidence for trans-acting mutations comes from a 15:1 ratio of DEGs to mutations and the clusters of DEGs in the co-expression network, indicative of shared regulatory architecture. Lastly, we show that there is a negative correlation with the extent of expression divergence from the ancestor and fitness, providing direct evidence of the deleterious effects of perturbing gene expression.
Collapse
Affiliation(s)
- Eniolaye J Balogun
- Department of Biology, William G. Davis Building, University of Toronto, Mississauga L5L-1C6, Canada
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto M5S-3B2, Canada
| | - Rob W Ness
- Department of Biology, William G. Davis Building, University of Toronto, Mississauga L5L-1C6, Canada
| |
Collapse
|
17
|
Waneka G, Pate B, Monroe JG, Sloan DB. Investigating low frequency somatic mutations in Arabidopsis with Duplex Sequencing. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.01.31.578196. [PMID: 38352550 PMCID: PMC10862904 DOI: 10.1101/2024.01.31.578196] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/22/2024]
Abstract
Mutations are the source of novel genetic diversity but can also lead to disease and maladaptation. The conventional view is that mutations occur randomly with respect to their environment-specific fitness consequences. However, intragenomic mutation rates can vary dramatically due to transcription coupled repair and based on local epigenomic modifications, which are non-uniformly distributed across genomes. One sequence feature associated with decreased mutation is higher expression level, which can vary depending on environmental cues. To understand whether the association between expression level and mutation rate creates a systematic relationship with environment-specific fitness effects, we perturbed expression through a heat treatment in Arabidopsis thaliana. We quantified gene expression to identify differentially expressed genes, which we then targeted for mutation detection using Duplex Sequencing. This approach provided a highly accurate measurement of the frequency of rare somatic mutations in vegetative plant tissues, which has been a recent source of uncertainty in plant mutation research. We included mutant lines lacking mismatch repair (MMR) and base excision repair (BER) capabilities to understand how repair mechanisms may drive biased mutation accumulation. We found wild type (WT) and BER mutant mutation frequencies to be very low (mean variant frequency 1.8×10-8 and 2.6×10-8, respectively), while MMR mutant frequencies were significantly elevated (1.13×10-6). These results show that somatic variant frequencies are extremely low in WT plants, indicating that larger datasets will be needed to address the fundamental evolutionary question as to whether environmental change leads to gene-specific changes in mutation rate.
Collapse
Affiliation(s)
- Gus Waneka
- Department of Biology, Colorado State University, Fort Collins, Colorado, USA
| | - Braden Pate
- Department of Biology, Colorado State University, Fort Collins, Colorado, USA
| | - J Grey Monroe
- Department of Plant Sciences, University of California, Davis, Davis, CA USA
| | - Daniel B Sloan
- Department of Biology, Colorado State University, Fort Collins, Colorado, USA
| |
Collapse
|
18
|
de Jong MJ, van Oosterhout C, Hoelzel AR, Janke A. Moderating the neutralist-selectionist debate: exactly which propositions are we debating, and which arguments are valid? Biol Rev Camb Philos Soc 2024; 99:23-55. [PMID: 37621151 DOI: 10.1111/brv.13010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2022] [Revised: 08/04/2023] [Accepted: 08/07/2023] [Indexed: 08/26/2023]
Abstract
Half a century after its foundation, the neutral theory of molecular evolution continues to attract controversy. The debate has been hampered by the coexistence of different interpretations of the core proposition of the neutral theory, the 'neutral mutation-random drift' hypothesis. In this review, we trace the origins of these ambiguities and suggest potential solutions. We highlight the difference between the original, the revised and the nearly neutral hypothesis, and re-emphasise that none of them equates to the null hypothesis of strict neutrality. We distinguish the neutral hypothesis of protein evolution, the main focus of the ongoing debate, from the neutral hypotheses of genomic and functional DNA evolution, which for many species are generally accepted. We advocate a further distinction between a narrow and an extended neutral hypothesis (of which the latter posits that random non-conservative amino acid substitutions can cause non-ecological phenotypic divergence), and we discuss the implications for evolutionary biology beyond the domain of molecular evolution. We furthermore point out that the debate has widened from its initial focus on point mutations, and also concerns the fitness effects of large-scale mutations, which can alter the dosage of genes and regulatory sequences. We evaluate the validity of neutralist and selectionist arguments and find that the tested predictions, apart from being sensitive to violation of underlying assumptions, are often derived from the null hypothesis of strict neutrality, or equally consistent with the opposing selectionist hypothesis, except when assuming molecular panselectionism. Our review aims to facilitate a constructive neutralist-selectionist debate, and thereby to contribute to answering a key question of evolutionary biology: what proportions of amino acid and nucleotide substitutions and polymorphisms are adaptive?
Collapse
Affiliation(s)
- Menno J de Jong
- Senckenberg Biodiversity and Climate Research Institute (SBiK-F), Georg-Voigt-Strasse 14-16, Frankfurt am Main, 60325, Germany
| | - Cock van Oosterhout
- Centre for Ecology, Evolution and Conservation, University of East Anglia, Norwich Research Park, Norwich, NR4 7TJ, UK
| | - A Rus Hoelzel
- Department of Biosciences, Durham University, South Road, Durham, DH1 3LE, UK
| | - Axel Janke
- Senckenberg Biodiversity and Climate Research Institute (SBiK-F), Georg-Voigt-Strasse 14-16, Frankfurt am Main, 60325, Germany
- Institute for Ecology, Evolution and Diversity, Goethe University, Max-von-Laue-Strasse 9, Frankfurt am Main, 60438, Germany
- LOEWE-Centre for Translational Biodiversity Genomics (TBG), Senckenberg Nature Research Society, Georg-Voigt-Straße 14-16, Frankfurt am Main, 60325, Germany
| |
Collapse
|
19
|
Couce A, Limdi A, Magnan M, Owen SV, Herren CM, Lenski RE, Tenaillon O, Baym M. Changing fitness effects of mutations through long-term bacterial evolution. Science 2024; 383:eadd1417. [PMID: 38271521 DOI: 10.1126/science.add1417] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2022] [Accepted: 12/12/2023] [Indexed: 01/27/2024]
Abstract
The distribution of fitness effects of new mutations shapes evolution, but it is challenging to observe how it changes as organisms adapt. Using Escherichia coli lineages spanning 50,000 generations of evolution, we quantify the fitness effects of insertion mutations in every gene. Macroscopically, the fraction of deleterious mutations changed little over time whereas the beneficial tail declined sharply, approaching an exponential distribution. Microscopically, changes in individual gene essentiality and deleterious effects often occurred in parallel; altered essentiality is only partly explained by structural variation. The identity and effect sizes of beneficial mutations changed rapidly over time, but many targets of selection remained predictable because of the importance of loss-of-function mutations. Taken together, these results reveal the dynamic-but statistically predictable-nature of mutational fitness effects.
Collapse
Affiliation(s)
- Alejandro Couce
- Université Paris Cité and Université Sorbonne Paris Nord, Inserm, IAME, F-75018 Paris, France
- Department of Life Sciences, Imperial College London, London SW7 2AZ, UK
- Centro de Biotecnología y Genómica de Plantas, Universidad Politécnica de Madrid (UPM), 28223 Madrid, Spain
| | - Anurag Limdi
- Department of Biomedical Informatics, and Laboratory of Systems Pharmacology, Harvard Medical School, Boston, MA 02115, USA
| | - Melanie Magnan
- Université Paris Cité and Université Sorbonne Paris Nord, Inserm, IAME, F-75018 Paris, France
| | - Siân V Owen
- Department of Biomedical Informatics, and Laboratory of Systems Pharmacology, Harvard Medical School, Boston, MA 02115, USA
| | - Cristina M Herren
- Department of Biomedical Informatics, and Laboratory of Systems Pharmacology, Harvard Medical School, Boston, MA 02115, USA
- Department of Marine and Environmental Sciences, Northeastern University, Boston, MA 02115, USA
| | - Richard E Lenski
- Department of Microbiology and Molecular Genetics, Michigan State University, East Lansing, MI 48824, USA
- Program in Ecology, Evolution, and Behavior, Michigan State University, East Lansing, MI 48824, USA
| | - Olivier Tenaillon
- Université Paris Cité and Université Sorbonne Paris Nord, Inserm, IAME, F-75018 Paris, France
- Université Paris Cité, Inserm, Institut Cochin, F-75014 Paris, France
| | - Michael Baym
- Department of Biomedical Informatics, and Laboratory of Systems Pharmacology, Harvard Medical School, Boston, MA 02115, USA
| |
Collapse
|
20
|
Li Z, Zhang Y, Ding CH, Chen Y, Wang H, Zhang J, Ying S, Wang M, Zhang R, Liu J, Xie Y, Tang T, Diao H, Ye L, Zhuang Y, Teng W, Zhang B, Huang L, Tong Y, Zhang W, Li G, Benhamed M, Dong Z, Gou JY, Zhang Y. LHP1-mediated epigenetic buffering of subgenome diversity and defense responses confers genome plasticity and adaptability in allopolyploid wheat. Nat Commun 2023; 14:7538. [PMID: 37985755 PMCID: PMC10661560 DOI: 10.1038/s41467-023-43178-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2023] [Accepted: 10/25/2023] [Indexed: 11/22/2023] Open
Abstract
Polyploidization is a major driver of genome diversification and environmental adaptation. However, the merger of different genomes may result in genomic conflicts, raising a major question regarding how genetic diversity is interpreted and regulated to enable environmental plasticity. By analyzing the genome-wide binding of 191 trans-factors in allopolyploid wheat, we identified like heterochromatin protein 1 (LHP1) as a master regulator of subgenome-diversified genes. Transcriptomic and epigenomic analyses of LHP1 mutants reveal its role in buffering the expression of subgenome-diversified defense genes by controlling H3K27me3 homeostasis. Stripe rust infection releases latent subgenomic variations by eliminating H3K27me3-related repression. The simultaneous inactivation of LHP1 homoeologs by CRISPR-Cas9 confers robust stripe rust resistance in wheat seedlings. The conditional repression of subgenome-diversified defenses ensures developmental plasticity to external changes, while also promoting neutral-to-non-neutral selection transitions and adaptive evolution. These findings establish an LHP1-mediated buffering system at the intersection of genotypes, environments, and phenotypes in polyploid wheat. Manipulating the epigenetic buffering capacity offers a tool to harness cryptic subgenomic variations for crop improvement.
Collapse
Affiliation(s)
- Zijuan Li
- State Key Laboratory of Genetic Engineering, Collaborative Innovation Center of Genetics and Development, Department of Biochemistry, Institute of Plant Biology, School of Life Sciences, Fudan University, 200438, Shanghai, China
- National Key Laboratory of Plant Molecular Genetics, CAS Center for Excellence in Molecular Plant Sciences, Shanghai Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, 300 Fenglin Road, 200032, Shanghai, China
- University of the Chinese Academy of Sciences, 100049, Beijing, China
| | - Yuyun Zhang
- State Key Laboratory of Genetic Engineering, Collaborative Innovation Center of Genetics and Development, Department of Biochemistry, Institute of Plant Biology, School of Life Sciences, Fudan University, 200438, Shanghai, China
- National Key Laboratory of Plant Molecular Genetics, CAS Center for Excellence in Molecular Plant Sciences, Shanghai Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, 300 Fenglin Road, 200032, Shanghai, China
- University of the Chinese Academy of Sciences, 100049, Beijing, China
| | - Ci-Hang Ding
- State Key Laboratory of Genetic Engineering, Collaborative Innovation Center of Genetics and Development, Department of Biochemistry, Institute of Plant Biology, School of Life Sciences, Fudan University, 200438, Shanghai, China
- Frontiers Science Center for Molecular Design Breeding, China Agricultural University, Beijing, China
| | - Yan Chen
- Guangdong Provincial Key Laboratory of Plant Adaptation and Molecular Design, Guangzhou Key Laboratory of Crop Gene Editing, Innovative Center of Molecular Genetics and Evolution, School of Life Sciences, Guangzhou University, 510006, Guangzhou, China
| | - Haoyu Wang
- State Key Laboratory of Genetic Engineering, Collaborative Innovation Center of Genetics and Development, Department of Biochemistry, Institute of Plant Biology, School of Life Sciences, Fudan University, 200438, Shanghai, China
- National Key Laboratory of Plant Molecular Genetics, CAS Center for Excellence in Molecular Plant Sciences, Shanghai Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, 300 Fenglin Road, 200032, Shanghai, China
- Henan University, School of Life Science, 457000, Kaifeng, Henan, China
| | - Jinyu Zhang
- State Key Laboratory of Genetic Engineering, Collaborative Innovation Center of Genetics and Development, Department of Biochemistry, Institute of Plant Biology, School of Life Sciences, Fudan University, 200438, Shanghai, China
- National Key Laboratory of Plant Molecular Genetics, CAS Center for Excellence in Molecular Plant Sciences, Shanghai Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, 300 Fenglin Road, 200032, Shanghai, China
- University of the Chinese Academy of Sciences, 100049, Beijing, China
| | - Songbei Ying
- State Key Laboratory of Genetic Engineering, Collaborative Innovation Center of Genetics and Development, Department of Biochemistry, Institute of Plant Biology, School of Life Sciences, Fudan University, 200438, Shanghai, China
| | - Meiyue Wang
- State Key Laboratory of Genetic Engineering, Collaborative Innovation Center of Genetics and Development, Department of Biochemistry, Institute of Plant Biology, School of Life Sciences, Fudan University, 200438, Shanghai, China
| | - Rongzhi Zhang
- Crop Research Institute, Shandong Academy of Agricultural Sciences, Jinan, China
- Ministry of Agriculture, Key Laboratory of Wheat Biology and Genetic Improvement on North Yellow and Huai River Valley, Jinan, China
- National Engineering Research Center for Wheat and Maize, Jinan, Shandong, China
| | - Jinyi Liu
- State Key Laboratory of Genetic Engineering, Collaborative Innovation Center of Genetics and Development, Department of Biochemistry, Institute of Plant Biology, School of Life Sciences, Fudan University, 200438, Shanghai, China
- National Key Laboratory of Plant Molecular Genetics, CAS Center for Excellence in Molecular Plant Sciences, Shanghai Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, 300 Fenglin Road, 200032, Shanghai, China
- University of the Chinese Academy of Sciences, 100049, Beijing, China
| | - Yilin Xie
- State Key Laboratory of Genetic Engineering, Collaborative Innovation Center of Genetics and Development, Department of Biochemistry, Institute of Plant Biology, School of Life Sciences, Fudan University, 200438, Shanghai, China
- National Key Laboratory of Plant Molecular Genetics, CAS Center for Excellence in Molecular Plant Sciences, Shanghai Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, 300 Fenglin Road, 200032, Shanghai, China
- University of the Chinese Academy of Sciences, 100049, Beijing, China
| | - Tengfei Tang
- State Key Laboratory of Genetic Engineering, Collaborative Innovation Center of Genetics and Development, Department of Biochemistry, Institute of Plant Biology, School of Life Sciences, Fudan University, 200438, Shanghai, China
- National Key Laboratory of Plant Molecular Genetics, CAS Center for Excellence in Molecular Plant Sciences, Shanghai Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, 300 Fenglin Road, 200032, Shanghai, China
- Henan University, School of Life Science, 457000, Kaifeng, Henan, China
| | - Huishan Diao
- State Key Laboratory of Genetic Engineering, Collaborative Innovation Center of Genetics and Development, Department of Biochemistry, Institute of Plant Biology, School of Life Sciences, Fudan University, 200438, Shanghai, China
| | - Luhuan Ye
- National Key Laboratory of Plant Molecular Genetics, CAS Center for Excellence in Molecular Plant Sciences, Shanghai Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, 300 Fenglin Road, 200032, Shanghai, China
| | - Yili Zhuang
- National Key Laboratory of Plant Molecular Genetics, CAS Center for Excellence in Molecular Plant Sciences, Shanghai Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, 300 Fenglin Road, 200032, Shanghai, China
| | - Wan Teng
- University of the Chinese Academy of Sciences, 100049, Beijing, China
- The State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, the Innovative Academy of Seed Design, Chinese Academy of Sciences, 100101, Beijing, China
| | - Bo Zhang
- Northwest Institute of Plateau Biology, Chinese Academy of Sciences, 810008, Xining, China
| | - Lin Huang
- State Key Laboratory of Crop Gene Exploration and Utilization in Southwest China, Sichuan Agricultural University, 611130, Wenjiang, Chengdu, China
| | - Yiping Tong
- University of the Chinese Academy of Sciences, 100049, Beijing, China
- The State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, the Innovative Academy of Seed Design, Chinese Academy of Sciences, 100101, Beijing, China
| | - Wenli Zhang
- State Key Laboratory of Crop Genetics & Germplasm Enhancement and Utilization, Jiangsu Collaborative Innovation Center for Modern Crop Production, Nanjing Agricultural University, No.1 Weigang, 210095, Nanjing, Jiangsu, China
| | - Genying Li
- Crop Research Institute, Shandong Academy of Agricultural Sciences, Jinan, China
- Ministry of Agriculture, Key Laboratory of Wheat Biology and Genetic Improvement on North Yellow and Huai River Valley, Jinan, China
- National Engineering Research Center for Wheat and Maize, Jinan, Shandong, China
| | - Moussa Benhamed
- Université Paris Cité, Institute of Plant Sciences Paris-Saclay (IPS2), F-75006, Paris, France.
- Université Paris-Saclay, CNRS, INRAE, Univ Evry, Institute of Plant Sciences Paris-Saclay (IPS2), 91405, Orsay, France.
| | - Zhicheng Dong
- Guangdong Provincial Key Laboratory of Plant Adaptation and Molecular Design, Guangzhou Key Laboratory of Crop Gene Editing, Innovative Center of Molecular Genetics and Evolution, School of Life Sciences, Guangzhou University, 510006, Guangzhou, China.
| | - Jin-Ying Gou
- Frontiers Science Center for Molecular Design Breeding, China Agricultural University, Beijing, China.
| | - Yijing Zhang
- State Key Laboratory of Genetic Engineering, Collaborative Innovation Center of Genetics and Development, Department of Biochemistry, Institute of Plant Biology, School of Life Sciences, Fudan University, 200438, Shanghai, China.
| |
Collapse
|
21
|
Bai Y, Wan X, Lei M, Wang L, Chen T. Research advances in mechanisms of arsenic hyperaccumulation of Pteris vittata: Perspectives from plant physiology, molecular biology, and phylogeny. JOURNAL OF HAZARDOUS MATERIALS 2023; 460:132463. [PMID: 37690196 DOI: 10.1016/j.jhazmat.2023.132463] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/19/2023] [Revised: 08/24/2023] [Accepted: 08/31/2023] [Indexed: 09/12/2023]
Abstract
Pteris vittata, as the firstly discovered arsenic (As) hyperaccumulator, has great application value in As-contaminated soil remediation. Currently, the genes involved in As hyperaccumulation in P. vittata have been mined continuously, while they have not been used in practice to enhance phytoremediation efficiency. Aiming to better assist the practice of phytoremediation, this review collects 130 studies to clarify the progress in research into the As hyperaccumulation process in P. vittata from multiple perspectives. Antioxidant defense, rhizosphere activities, vacuolar sequestration, and As efflux are important physiological activities involved in As hyperaccumulation in P. vittata. Among related 19 genes, PHT, TIP, ACR3, ACR2 and HAC family genes play essential roles in arsenate (AsⅤ) transport, arsenite (AsⅢ) transport, vacuole sequestration of AsⅢ, and the reduction of AsⅤ to AsⅢ, respectively. Gene ontology enrichment analysis indicated it is necessary to further explore genes that can bind to related ions, with transport activity, or with function of transmembrane transport. Phylogeny analysis results implied ACR2, HAC and ACR3 family genes with rapid evolutionary rate may be the decisive factors for P. vittata as an As hyperaccumulator. A deeper understanding of the As hyperaccumulation network and key gene components could provide useful tools for further bio-engineered phytoremediation.
Collapse
Affiliation(s)
- Yang Bai
- Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing 100101, China; Sino-Danish College, University of Chinese Academy of Sciences, Beijing 100049, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Xiaoming Wan
- Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China.
| | - Mei Lei
- Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Lingqing Wang
- Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Tongbin Chen
- Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China
| |
Collapse
|
22
|
Man J, Harrington TA, Lally K, Bartlett ME. Asymmetric Evolution of Protein Domains in the Leucine-Rich Repeat Receptor-Like Kinase Family of Plant Signaling Proteins. Mol Biol Evol 2023; 40:msad220. [PMID: 37787619 PMCID: PMC10588794 DOI: 10.1093/molbev/msad220] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2023] [Revised: 08/29/2023] [Accepted: 09/26/2023] [Indexed: 10/04/2023] Open
Abstract
The coding sequences of developmental genes are expected to be deeply conserved, with cis-regulatory change driving the modulation of gene function. In contrast, proteins with roles in defense are expected to evolve rapidly, in molecular arms races with pathogens. However, some gene families include both developmental and defense genes. In these families, does the tempo and mode of evolution differ between genes with divergent functions, despite shared ancestry and structure? The leucine-rich repeat receptor-like kinase (LRR-RLKs) protein family includes members with roles in plant development and defense, thus providing an ideal system for answering this question. LRR-RLKs are receptors that traverse plasma membranes. LRR domains bind extracellular ligands; RLK domains initiate intracellular signaling cascades in response to ligand binding. In LRR-RLKs with roles in defense, LRR domains evolve faster than RLK domains. To determine whether this asymmetry extends to LRR-RLKs that function primarily in development, we assessed evolutionary rates and tested for selection acting on 11 subfamilies of LRR-RLKs, using deeply sampled protein trees. To assess functional evolution, we performed heterologous complementation assays in Arabidopsis thaliana (Arabidopsis). We found that the LRR domains of all tested LRR-RLK proteins evolved faster than their cognate RLK domains. All tested subfamilies of LRR-RLKs had strikingly similar patterns of molecular evolution, despite divergent functions. Heterologous transformation experiments revealed that multiple mechanisms likely contribute to the evolution of LRR-RLK function, including escape from adaptive conflict. Our results indicate specific and distinct evolutionary pressures acting on LRR versus RLK domains, despite diverse organismal roles for LRR-RLK proteins.
Collapse
Affiliation(s)
- Jarrett Man
- Department of Biology, University of Massachusetts Amherst, Amherst, MA 01002, USA
| | - T A Harrington
- Department of Biology, University of Massachusetts Amherst, Amherst, MA 01002, USA
| | - Kyra Lally
- Department of Biology, University of Massachusetts Amherst, Amherst, MA 01002, USA
| | - Madelaine E Bartlett
- Department of Biology, University of Massachusetts Amherst, Amherst, MA 01002, USA
| |
Collapse
|
23
|
Petak C, Frati L, Brennan RS, Pespeni MH. Whole-Genome Sequencing Reveals That Regulatory and Low Pleiotropy Variants Underlie Local Adaptation to Environmental Variability in Purple Sea Urchins. Am Nat 2023; 202:571-586. [PMID: 37792925 DOI: 10.1086/726013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/06/2023]
Abstract
AbstractOrganisms experience environments that vary across both space and time. Such environmental heterogeneity shapes standing genetic variation and may influence species' capacity to adapt to rapid environmental change. However, we know little about the kind of genetic variation that is involved in local adaptation to environmental variability. To address this gap, we sequenced the whole genomes of 140 purple sea urchins (Strongylocentrotus purpuratus) from seven populations that vary in their degree of pH variability. Despite no evidence of global population structure, we found a suite of single-nucleotide polymorphisms (SNPs) tightly correlated with local pH variability (outlier SNPs), which were overrepresented in regions putatively involved in gene regulation (long noncoding RNA and enhancers), supporting the idea that variation in regulatory regions is important for local adaptation to variability. In addition, outliers in genes were found to be (i) enriched for biomineralization and ion homeostasis functions related to low pH response, (ii) less central to the protein-protein interaction network, and (iii) underrepresented among genes highly expressed during early development. Taken together, these results suggest that loci that underlie local adaptation to pH variability in purple sea urchins fall in regions with potentially low pleiotropic effects (based on analyses involving regulatory regions, network centrality, and expression time) involved in low pH response (based on functional enrichment).
Collapse
|
24
|
Holtz MA, Racicot R, Preininger D, Stuckert AMM, Mangiamele LA. Genome assembly of the foot-flagging frog, Staurois parvus: a resource for understanding mechanisms of behavior. G3 (BETHESDA, MD.) 2023; 13:jkad193. [PMID: 37625789 PMCID: PMC10542557 DOI: 10.1093/g3journal/jkad193] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/22/2023] [Revised: 03/22/2023] [Accepted: 08/13/2023] [Indexed: 08/27/2023]
Abstract
Elaborate and skilled movements of the body have been selected in a variety of species as courtship and rivalry signals. One roadblock in studying these behaviors has been a lack of resources for understanding how they evolved at the genetic level. The Bornean rock frog (Staurois parvus) is an ideal species in which to address this issue. Males wave their hindlimbs in a "foot-flagging" display when competing for mates. The evolution of foot flagging in S. parvus and other species is accompanied by increases in the expression of the androgen receptor gene within its neuromuscular system, but it remains unclear what genetic or transcriptional changes are associated with this behavioral phenotype. We have now assembled the genome of S. parvus, resulting in 3.98 Gbp of 22,402 contigs with an N50 of 611,229 bp. The genome will be a resource for finding genes related to the physiology underlying foot flagging and to adaptations of the neuromuscular system. As a first application of the genome, we also began work in comparative genomics and differential gene expression analysis. We show that the androgen receptor is diverged from other anuran species, and we identify unique expression patterns of genes in the spinal cord and leg muscle that are important for axial patterning, cell specification and morphology, or muscle contraction. This genome will continue to be an important tool for future -omics studies to understand the evolution of elaborate signaling behaviors in this and potentially related species.
Collapse
Affiliation(s)
- Mika A Holtz
- Department of Biological Sciences, Smith College, Northampton, MA 01053, USA
| | - Riccardo Racicot
- Department of Biological Sciences, Smith College, Northampton, MA 01053, USA
| | - Doris Preininger
- Vienna Zoo, 1130 Vienna, Austria
- Department of Evolutionary Biology, University of Vienna, 1030 Vienna, Austria
| | - Adam M M Stuckert
- Department of Biology & Biochemistry, University of Houston, Houston, TX 77204, USA
| | - Lisa A Mangiamele
- Department of Biological Sciences, Smith College, Northampton, MA 01053, USA
| |
Collapse
|
25
|
Smith SK, Frazel PW, Khodadadi-Jamayran A, Zappile P, Marier C, Okhovat M, Brown S, Long MA, Heguy A, Phelps SM. De novo assembly and annotation of the singing mouse genome. BMC Genomics 2023; 24:569. [PMID: 37749493 PMCID: PMC10521431 DOI: 10.1186/s12864-023-09678-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2023] [Accepted: 09/14/2023] [Indexed: 09/27/2023] Open
Abstract
BACKGROUND Developing genomic resources for a diverse range of species is an important step towards understanding the mechanisms underlying complex traits. Specifically, organisms that exhibit unique and accessible phenotypes-of-interest allow researchers to address questions that may be ill-suited to traditional model organisms. We sequenced the genome and transcriptome of Alston's singing mouse (Scotinomys teguina), an emerging model for social cognition and vocal communication. In addition to producing advertisement songs used for mate attraction and male-male competition, these rodents are diurnal, live at high-altitudes, and are obligate insectivores, providing opportunities to explore diverse physiological, ecological, and evolutionary questions. RESULTS Using PromethION, Illumina, and PacBio sequencing, we produced an annotated genome and transcriptome, which were validated using gene expression and functional enrichment analyses. To assess the usefulness of our assemblies, we performed single nuclei sequencing on cells of the orofacial motor cortex, a brain region implicated in song coordination, identifying 12 cell types. CONCLUSIONS These resources will provide the opportunity to identify the molecular basis of complex traits in singing mice as well as to contribute data that can be used for large-scale comparative analyses.
Collapse
Affiliation(s)
- Samantha K Smith
- Department of Integrative Biology, University of Texas at Austin, Austin, TX, 78712, USA.
| | - Paul W Frazel
- Department of Neuroscience and Physiology, New York University Grossman School of Medicine, New York, NY, 10016, USA
| | - Alireza Khodadadi-Jamayran
- Applied Bioinformatics Laboratory, New York University Grossman School of Medicine, New York, NY, 10016, USA
| | - Paul Zappile
- Genome Technology Center, New York University Grossman School of Medicine, New York, NY, 10016, USA
| | - Christian Marier
- Genome Technology Center, New York University Grossman School of Medicine, New York, NY, 10016, USA
| | - Mariam Okhovat
- Department of Integrative Biology, University of Texas at Austin, Austin, TX, 78712, USA
- Present Address: Oregon Health & Science University, Portland, OR, USA
| | - Stuart Brown
- NYU Center for Health Informatics and Bioinformatics, New York University Grossman School of Medicine, New York, NY, 10016, USA
- Present Address: Exxon Mobil Corporate, Houston, TX, USA
| | - Michael A Long
- Department of Neuroscience and Physiology, New York University Grossman School of Medicine, New York, NY, 10016, USA
| | - Adriana Heguy
- Genome Technology Center, New York University Grossman School of Medicine, New York, NY, 10016, USA
| | - Steven M Phelps
- Department of Integrative Biology, University of Texas at Austin, Austin, TX, 78712, USA
| |
Collapse
|
26
|
Yu G, Liu Y, Li Z, Deng S, Wu Z, Zhang X, Chen W, Yang J, Chen X, Yang JR. Genome-wide probing of eukaryotic nascent RNA structure elucidates cotranscriptional folding and its antimutagenic effect. Nat Commun 2023; 14:5853. [PMID: 37730811 PMCID: PMC10511511 DOI: 10.1038/s41467-023-41550-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2022] [Accepted: 09/08/2023] [Indexed: 09/22/2023] Open
Abstract
The transcriptional intermediates of RNAs fold into secondary structures with multiple regulatory roles, yet the details of such cotranscriptional RNA folding are largely unresolved in eukaryotes. Here, we present eSPET-seq (Structural Probing of Elongating Transcripts in eukaryotes), a method to assess the cotranscriptional RNA folding in Saccharomyces cerevisiae. Our study reveals pervasive structural transitions during cotranscriptional folding and overall structural similarities between nascent and mature RNAs. Furthermore, a combined analysis with genome-wide R-loop and mutation rate approximations provides quantitative evidence for the antimutator effect of nascent RNA folding through competitive inhibition of the R-loops, known to facilitate transcription-associated mutagenesis. Taken together, we present an experimental evaluation of cotranscriptional folding in eukaryotes and demonstrate the antimutator effect of nascent RNA folding. These results suggest genome-wide coupling between the processing and transmission of genetic information through RNA folding.
Collapse
Affiliation(s)
- Gongwang Yu
- Advanced Medical Technology Center, The First Affiliated Hospital, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, 510080, China
- Department of Genetics and Biomedical Informatics, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, 510080, China
| | - Yao Liu
- Advanced Medical Technology Center, The First Affiliated Hospital, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, 510080, China
- Department of Genetics and Biomedical Informatics, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, 510080, China
| | - Zizhang Li
- Department of Genetics and Biomedical Informatics, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, 510080, China
| | - Shuyun Deng
- Department of Genetics and Biomedical Informatics, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, 510080, China
| | - Zhuoxing Wu
- Department of Genetics and Biomedical Informatics, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, 510080, China
| | - Xiaoyu Zhang
- Advanced Medical Technology Center, The First Affiliated Hospital, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, 510080, China
- Department of Genetics and Biomedical Informatics, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, 510080, China
| | - Wenbo Chen
- Department of Genetics and Biomedical Informatics, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, 510080, China
| | - Junnan Yang
- Department of Genetics and Biomedical Informatics, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, 510080, China
| | - Xiaoshu Chen
- Advanced Medical Technology Center, The First Affiliated Hospital, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, 510080, China
- Department of Genetics and Biomedical Informatics, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, 510080, China
- Key Laboratory of Tropical Disease Control, Ministry of Education, Sun Yat-sen University, Guangzhou, 510080, China
| | - Jian-Rong Yang
- Advanced Medical Technology Center, The First Affiliated Hospital, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, 510080, China.
- Department of Genetics and Biomedical Informatics, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, 510080, China.
- Key Laboratory of Tropical Disease Control, Ministry of Education, Sun Yat-sen University, Guangzhou, 510080, China.
| |
Collapse
|
27
|
Liska O, Boross G, Rocabert C, Szappanos B, Tengölics R, Papp B. Principles of metabolome conservation in animals. Proc Natl Acad Sci U S A 2023; 120:e2302147120. [PMID: 37603743 PMCID: PMC10468614 DOI: 10.1073/pnas.2302147120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2023] [Accepted: 07/16/2023] [Indexed: 08/23/2023] Open
Abstract
Metabolite levels shape cellular physiology and disease susceptibility, yet the general principles governing metabolome evolution are largely unknown. Here, we introduce a measure of conservation of individual metabolite levels among related species. By analyzing multispecies tissue metabolome datasets in phylogenetically diverse mammals and fruit flies, we show that conservation varies extensively across metabolites. Three major functional properties, metabolite abundance, essentiality, and association with human diseases predict conservation, highlighting a striking parallel between the evolutionary forces driving metabolome and protein sequence conservation. Metabolic network simulations recapitulated these general patterns and revealed that abundant metabolites are highly conserved due to their strong coupling to key metabolic fluxes in the network. Finally, we show that biomarkers of metabolic diseases can be distinguished from other metabolites simply based on evolutionary conservation, without requiring any prior clinical knowledge. Overall, this study uncovers simple rules that govern metabolic evolution in animals and implies that most tissue metabolome differences between species are permitted, rather than favored by natural selection. More broadly, our work paves the way toward using evolutionary information to identify biomarkers, as well as to detect pathogenic metabolome alterations in individual patients.
Collapse
Affiliation(s)
- Orsolya Liska
- Hungarian Centre of Excellence for Molecular Medicine - Biological Research Centre Metabolic Systems Biology Lab, 6728Szeged, Hungary
- National Laboratory of Biotechnology, Synthetic and System Biology Unit, Institute of Biochemistry, Biological Research Centre, Eötvös Loránd Research Network, 6726Szeged, Hungary
- Doctoral School of Biology, University of Szeged, 6726Szeged, Hungary
| | - Gábor Boross
- National Laboratory of Biotechnology, Synthetic and System Biology Unit, Institute of Biochemistry, Biological Research Centre, Eötvös Loránd Research Network, 6726Szeged, Hungary
- Department of Biology, Stanford University, Stanford, City of Palo Alto, CA94305-5020
| | - Charles Rocabert
- National Laboratory of Biotechnology, Synthetic and System Biology Unit, Institute of Biochemistry, Biological Research Centre, Eötvös Loránd Research Network, 6726Szeged, Hungary
- Inria, 78150Rocquencourt, 69100Villeurbanne, France
- Organismal and Evolutionary Biology Research Programme, University of Helsinki, 00014Helsinki, Finland
- Institute for Computational Cell Biology, Heinrich-Heine Universität, 40225Düsseldorf, Germany
| | - Balázs Szappanos
- Hungarian Centre of Excellence for Molecular Medicine - Biological Research Centre Metabolic Systems Biology Lab, 6728Szeged, Hungary
- National Laboratory of Biotechnology, Synthetic and System Biology Unit, Institute of Biochemistry, Biological Research Centre, Eötvös Loránd Research Network, 6726Szeged, Hungary
- Department of Biotechnology, University of Szeged, 6726Szeged, Hungary
| | - Roland Tengölics
- Hungarian Centre of Excellence for Molecular Medicine - Biological Research Centre Metabolic Systems Biology Lab, 6728Szeged, Hungary
- National Laboratory of Biotechnology, Synthetic and System Biology Unit, Institute of Biochemistry, Biological Research Centre, Eötvös Loránd Research Network, 6726Szeged, Hungary
- Metabolomics Lab, Core facilities, Biological Research Centre, Eötvös Loránd Research Network, 6726Szeged, Hungary
| | - Balázs Papp
- Hungarian Centre of Excellence for Molecular Medicine - Biological Research Centre Metabolic Systems Biology Lab, 6728Szeged, Hungary
- National Laboratory of Biotechnology, Synthetic and System Biology Unit, Institute of Biochemistry, Biological Research Centre, Eötvös Loránd Research Network, 6726Szeged, Hungary
- National Laboratory for Health Security, Biological Research Centre, Eötvös Loránd Research Network, 6726Szeged, Hungary
| |
Collapse
|
28
|
Cui L, Cheng H, Yang Z, Xia C, Zhang L, Kong X. Comparative Analysis Reveals Different Evolutionary Fates and Biological Functions in Wheat Duplicated Genes ( Triticum aestivum L.). PLANTS (BASEL, SWITZERLAND) 2023; 12:3021. [PMID: 37687268 PMCID: PMC10489728 DOI: 10.3390/plants12173021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/31/2023] [Revised: 08/20/2023] [Accepted: 08/21/2023] [Indexed: 09/10/2023]
Abstract
Wheat (Triticum aestivum L.) is a staple food crop that provides 20% of total human calorie consumption. Gene duplication has been considered to play an important role in evolution by providing new genetic resources. However, the evolutionary fates and biological functions of the duplicated genes in wheat remain to be elucidated. In this study, the resulting data showed that the duplicated genes evolved faster with shorter gene lengths, higher codon usage bias, lower expression levels, and higher tissue specificity when compared to non-duplicated genes. Our analysis further revealed functions of duplicated genes in various biological processes with significant enrichment to environmental stresses. In addition, duplicated genes derived from dispersed, proximal, tandem, transposed, and whole-genome duplication differed in abundance, evolutionary rate, gene compactness, expression pattern, and genetic diversity. Tandem and proximal duplicates experienced stronger selective pressure and showed a more compact gene structure with diverse expression profiles than other duplication modes. Moreover, genes derived from different duplication modes showed an asymmetrical evolutionary pattern for wheat A, B, and D subgenomes. Several candidate duplication hotspots associated with wheat domestication or polyploidization were characterized as potential targets for wheat molecular breeding. Our comprehensive analysis revealed the evolutionary trajectory of duplicated genes and laid the foundation for future functional studies on wheat.
Collapse
Affiliation(s)
- Licao Cui
- Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing 100081, China; (L.C.); (H.C.); (Z.Y.); (C.X.); (L.Z.)
- College of Bioscience and Engineering, Jiangxi Agricultural University, Nanchang 330045, China
| | - Hao Cheng
- Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing 100081, China; (L.C.); (H.C.); (Z.Y.); (C.X.); (L.Z.)
- State Key Laboratory of Crop Stress Biology for Arid Areas, College of Life Sciences, Northwest A&F University, Yangling 712100, China
| | - Zhe Yang
- Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing 100081, China; (L.C.); (H.C.); (Z.Y.); (C.X.); (L.Z.)
| | - Chuan Xia
- Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing 100081, China; (L.C.); (H.C.); (Z.Y.); (C.X.); (L.Z.)
| | - Lichao Zhang
- Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing 100081, China; (L.C.); (H.C.); (Z.Y.); (C.X.); (L.Z.)
| | - Xiuying Kong
- Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing 100081, China; (L.C.); (H.C.); (Z.Y.); (C.X.); (L.Z.)
| |
Collapse
|
29
|
Pál C, Papp B. How selection shapes the short- and long-term dynamics of molecular evolution. Proc Natl Acad Sci U S A 2023; 120:e2311012120. [PMID: 37531373 PMCID: PMC10433269 DOI: 10.1073/pnas.2311012120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/04/2023] Open
Affiliation(s)
- Csaba Pál
- Synthetic and System Biology Unit, Biological Research Centre, National Laboratory of Biotechnology, Eötvös Loránd Research Network, SzegedHU-6726, Hungary
| | - Balázs Papp
- Synthetic and System Biology Unit, Biological Research Centre, National Laboratory of Biotechnology, Eötvös Loránd Research Network, SzegedHU-6726, Hungary
- Hungarian Centre of Excellence for Molecular Medicine - Biological Research Centre Metabolic Systems Biology Research Group, SzegedHU-6726, Hungary
- National Laboratory for Health Security, Biological Research Centre, Eötvös Loránd Research Network, SzegedHU-6726, Hungary
| |
Collapse
|
30
|
Cervantes S, Kesälahti R, Kumpula TA, Mattila TM, Helanterä H, Pyhäjärvi T. Strong Purifying Selection in Haploid Tissue-Specific Genes of Scots Pine Supports the Masking Theory. Mol Biol Evol 2023; 40:msad183. [PMID: 37565532 PMCID: PMC10457172 DOI: 10.1093/molbev/msad183] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2023] [Revised: 06/16/2023] [Accepted: 08/10/2023] [Indexed: 08/12/2023] Open
Abstract
The masking theory states that genes expressed in a haploid stage will be under more efficient selection. In contrast, selection will be less efficient in genes expressed in a diploid stage, where the fitness effects of recessive deleterious or beneficial mutations can be hidden from selection in heterozygous form. This difference can influence several evolutionary processes such as the maintenance of genetic variation, adaptation rate, and genetic load. Masking theory expectations have been confirmed in single-cell haploid and diploid organisms. However, in multicellular organisms, such as plants, the effects of haploid selection are not clear-cut. In plants, the great majority of studies indicating haploid selection have been carried out using male haploid tissues in angiosperms. Hence, evidence in these systems is confounded with the effects of sexual selection and intraspecific competition. Evidence from other plant groups is scarce, and results show no support for the masking theory. Here, we have used a gymnosperm Scots pine megagametophyte, a maternally derived seed haploid tissue, and four diploid tissues to test the strength of purifying selection on a set of genes with tissue-specific expression. By using targeted resequencing data of those genes, we obtained estimates of genetic diversity, the site frequency spectrum of 0-fold and 4-fold sites, and inferred the distribution of fitness effects of new mutations in haploid and diploid tissue-specific genes. Our results show that purifying selection is stronger for tissue-specific genes expressed in the haploid megagametophyte tissue and that this signal of strong selection is not an artifact driven by high expression levels.
Collapse
Affiliation(s)
- Sandra Cervantes
- Department of Ecology and Genetics, University of Oulu, Oulu, Finland
- Biocenter Oulu, University of Oulu, Oulu, Finland
| | - Robert Kesälahti
- Department of Ecology and Genetics, University of Oulu, Oulu, Finland
| | - Timo A Kumpula
- Biocenter Oulu, University of Oulu, Oulu, Finland
- Laboratory of Cancer Genetics and Tumor Biology, Research Unit of Translational Medicine, University of Oulu, Oulu, Finland
| | - Tiina M Mattila
- Human Evolution, Department of Organismal Biology, Uppsala University, Uppsala, Sweden
| | - Heikki Helanterä
- Department of Ecology and Genetics, University of Oulu, Oulu, Finland
| | - Tanja Pyhäjärvi
- Department of Forest Sciences, University of Helsinki, Helsinki, Finland
| |
Collapse
|
31
|
Vigué L, Tenaillon O. Predicting the effect of mutations to investigate recent events of selection across 60,472 Escherichia coli strains. Proc Natl Acad Sci U S A 2023; 120:e2304177120. [PMID: 37487088 PMCID: PMC10401003 DOI: 10.1073/pnas.2304177120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2023] [Accepted: 05/25/2023] [Indexed: 07/26/2023] Open
Abstract
Microbial genomics studies focusing on the dynamics of selection have often used a small number of distant genomes. As a result, they could only analyze mutations that had become fixed during the divergence between species. However, thousands of genomes of some species are now available in public databases, thanks to high-throughput sequencing. These data provide a more complete picture of the polymorphisms segregating within a species, offering a unique insight into the processes that shape the recent evolution of a species. In this study, we present GLASS (Gene-Level Amino-acid Score Shift), a selection test that is based on the predicted effects of amino acid changes. By comparing the distribution of effects of mutations observed in a gene to the expectation in the absence of selection, GLASS can quantify the intensity of selection. We applied GLASS to a dataset of 60,472 Escherichia coli strains and used this to reexamine the longstanding debate about the role of essentiality versus expression level in the rate of protein evolution. We found that selection has contrasting short-term and long-term dynamics, with essential genes being subject to strong purifying selection in the short term, while expression level determines the rate of gene evolution in the long term. GLASS also found an overrepresentation of inactivating mutations in specific transcription factors, such as efflux pump repressors, which is consistent with selection for antibiotic resistance. These gene-inactivating polymorphisms do not reach fixation, suggesting another contrast between short-term fitness gains and long-term counterselection.
Collapse
Affiliation(s)
- Lucile Vigué
- Université Paris Cité and Université Sorbonne Paris Nord, Inserm, Infection, Antimicrobials, Modelling, Evolution, F-75018Paris, France
| | - Olivier Tenaillon
- Université Paris Cité and Université Sorbonne Paris Nord, Inserm, Infection, Antimicrobials, Modelling, Evolution, F-75018Paris, France
| |
Collapse
|
32
|
Luzuriaga-Neira AR, Ritchie AM, Payne BL, Carrillo-Parramon O, Liberles DA, Alvarez-Ponce D. Highly Abundant Proteins Are Highly Thermostable. Genome Biol Evol 2023; 15:evad112. [PMID: 37399326 DOI: 10.1093/gbe/evad112] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/08/2023] [Indexed: 07/05/2023] Open
Abstract
Highly abundant proteins tend to evolve slowly (a trend called E-R anticorrelation), and a number of hypotheses have been proposed to explain this phenomenon. The misfolding avoidance hypothesis attributes the E-R anticorrelation to the abundance-dependent toxic effects of protein misfolding. To avoid these toxic effects, protein sequences (particularly those of highly expressed proteins) would be under selection to fold properly. One prediction of the misfolding avoidance hypothesis is that highly abundant proteins should exhibit high thermostability (i.e., a highly negative free energy of folding, ΔG). Thus far, only a handful of analyses have tested for a relationship between protein abundance and thermostability, producing contradictory results. These analyses have been limited by 1) the scarcity of ΔG data, 2) the fact that these data have been obtained by different laboratories and under different experimental conditions, 3) the problems associated with using proteins' melting energy (Tm) as a proxy for ΔG, and 4) the difficulty of controlling for potentially confounding variables. Here, we use computational methods to compare the free energy of folding of pairs of human-mouse orthologous proteins with different expression levels. Even though the effect size is limited, the most highly expressed ortholog is often the one with a more negative ΔG of folding, indicating that highly expressed proteins are often more thermostable.
Collapse
Affiliation(s)
| | - Andrew M Ritchie
- Department of Biology and Center for Computational Genetics and Genomics, Temple University, Philadelphia, Pennsylvania, USA
| | | | | | - David A Liberles
- Department of Biology and Center for Computational Genetics and Genomics, Temple University, Philadelphia, Pennsylvania, USA
| | | |
Collapse
|
33
|
Roberts M, Josephs EB. Weaker selection on genes with treatment-specific expression consistent with a limit on plasticity evolution in Arabidopsis thaliana. Genetics 2023; 224:iyad074. [PMID: 37094602 PMCID: PMC10484170 DOI: 10.1093/genetics/iyad074] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2023] [Revised: 03/06/2023] [Accepted: 04/07/2023] [Indexed: 04/26/2023] Open
Abstract
Differential gene expression between environments often underlies phenotypic plasticity. However, environment-specific expression patterns are hypothesized to relax selection on genes, and thus limit plasticity evolution. We collated over 27 terabases of RNA-sequencing data on Arabidopsis thaliana from over 300 peer-reviewed studies and 200 treatment conditions to investigate this hypothesis. Consistent with relaxed selection, genes with more treatment-specific expression have higher levels of nucleotide diversity and divergence at nonsynonymous sites but lack stronger signals of positive selection. This result persisted even after controlling for expression level, gene length, GC content, the tissue specificity of expression, and technical variation between studies. Overall, our investigation supports the existence of a hypothesized trade-off between the environment specificity of a gene's expression and the strength of selection on said gene in A. thaliana. Future studies should leverage multiple genome-scale datasets to tease apart the contributions of many variables in limiting plasticity evolution.
Collapse
Affiliation(s)
- Miles Roberts
- Genetics and Genome Sciences Program, Michigan State University, East Lansing, MI 48824, USA
| | - Emily B Josephs
- Department of Plant Biology, Michigan State University, East Lansing, MI 48824, USA
- Ecology, Evolution, and Behavior Program, Michigan State University, East Lansing, MI 48824, USA
| |
Collapse
|
34
|
Hara Y, Kuraku S. The impact of local genomic properties on the evolutionary fate of genes. eLife 2023; 12:82290. [PMID: 37223962 DOI: 10.7554/elife.82290] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2022] [Accepted: 04/25/2023] [Indexed: 05/25/2023] Open
Abstract
Functionally indispensable genes are likely to be retained and otherwise to be lost during evolution. This evolutionary fate of a gene can also be affected by factors independent of gene dispensability, including the mutability of genomic positions, but such features have not been examined well. To uncover the genomic features associated with gene loss, we investigated the characteristics of genomic regions where genes have been independently lost in multiple lineages. With a comprehensive scan of gene phylogenies of vertebrates with a careful inspection of evolutionary gene losses, we identified 813 human genes whose orthologs were lost in multiple mammalian lineages: designated 'elusive genes.' These elusive genes were located in genomic regions with rapid nucleotide substitution, high GC content, and high gene density. A comparison of the orthologous regions of such elusive genes across vertebrates revealed that these features had been established before the radiation of the extant vertebrates approximately 500 million years ago. The association of human elusive genes with transcriptomic and epigenomic characteristics illuminated that the genomic regions containing such genes were subject to repressive transcriptional regulation. Thus, the heterogeneous genomic features driving gene fates toward loss have been in place and may sometimes have relaxed the functional indispensability of such genes. This study sheds light on the complex interplay between gene function and local genomic properties in shaping gene evolution that has persisted since the vertebrate ancestor.
Collapse
Affiliation(s)
- Yuichiro Hara
- Research Center for Genome & Medical Sciences, Tokyo Metropolitan Institute of Medical Science, Tokyo, Japan
| | - Shigehiro Kuraku
- Molecular Life History Laboratory, Department of Genomics and Evolutionary Biology, National Institute of Genetics, Mishima, Japan
- Department of Genetics, Sokendai (Graduate University for Advanced Studies), Mishima, Japan
- RIKEN Center for Biosystems Dynamics Research, Kobe, Japan
| |
Collapse
|
35
|
Wang D, Yang D, Yang L, Diao L, Zhang Y, Li Y, Wang H, Ren J, Cheng L, Tan Q, Zhang R, Han X, Zhang X, Wang B, Li D, Chen M, Hermjakob H, Li Y, LaBaer J, Zhou Z, Yu X. Human Autoantigen Atlas: Searching for the Hallmarks of Autoantigens. J Proteome Res 2023. [PMID: 37183442 DOI: 10.1021/acs.jproteome.2c00799] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/16/2023]
Abstract
Understanding autoimmunity to endogenous proteins is crucial in diagnosing and treating autoimmune diseases. In this work, we developed a user-friendly AAgAtlas portal (http://biokb.ncpsb.org.cn/aagatlas_portal/index.php#), which can be used to search for 8045 non-redundant autoantigens (AAgs) and 47 post-translationally modified AAgs against 1073 human diseases that are prioritized by a credential score developed by multisource evidence. Using AAgAtlas, the immunogenic properties of human AAgs was systematically elucidated according to their genetic, biophysical, cytological, expression profile, and evolutionary characteristics. The results indicated that human AAgs are evolutionally conserved in protein sequence and enriched in three hydrophilic and polar amino acid residues (K, D, and E) that are located at the protein surface. AAgs are enriched in proteins that are involved in nucleic acid binding, transferase, and the cytoskeleton. Genome, transcriptome, and proteome analyses further indicated that AAb production is associated with gene variance and abnormal protein expression related to the pathological activities of different tumors. Collectively, our data outlines the hallmarks of human AAgs that facilitate the understanding of humoral autoimmunity and the identification of biomarkers of human diseases.
Collapse
Affiliation(s)
- Dan Wang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences-Beijing (PHOENIX Center), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Dong Yang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences-Beijing (PHOENIX Center), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Liuhui Yang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences-Beijing (PHOENIX Center), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Lihong Diao
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences-Beijing (PHOENIX Center), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Yuqi Zhang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences-Beijing (PHOENIX Center), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Yang Li
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences-Beijing (PHOENIX Center), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Hongye Wang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences-Beijing (PHOENIX Center), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Jing Ren
- Department of Clinical Laboratory, Peking Union Medical College Hospital, Peking Union Medical College and Chinese Academy of Medical Sciences, Beijing 100730, China
| | - Linlin Cheng
- Department of Clinical Laboratory, Peking Union Medical College Hospital, Peking Union Medical College and Chinese Academy of Medical Sciences, Beijing 100730, China
| | - Qiaoyun Tan
- Cancer Center, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan 430022, China
| | - Ran Zhang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences-Beijing (PHOENIX Center), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Xiaohong Han
- Clinical Pharmacology Research Center, Peking Union Medical College Hospital, State Key Laboratory of Complex Severe and Rare Diseases, NMPA Key Laboratory for Clinical Research and Evaluation of Drug, Beijing Key Laboratory of Clinical PK & PD Investigation for Innovative Drugs, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing 100730, China
| | - Xiaohan Zhang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences-Beijing (PHOENIX Center), Beijing Institute of Lifeomics, Beijing 102206, China
- College of Medicine and Integrated Medicine, Nanjing University of Chinese Medicine, Nanjing 210023, China
| | - Bingwei Wang
- College of Medicine and Integrated Medicine, Nanjing University of Chinese Medicine, Nanjing 210023, China
| | - Dong Li
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences-Beijing (PHOENIX Center), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Meng Chen
- National Cancer Center/National Clinical Research Center for Cancer/Cancer Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing 100021, China
| | - Henning Hermjakob
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Yongzhe Li
- Department of Clinical Laboratory, Peking Union Medical College Hospital, Peking Union Medical College and Chinese Academy of Medical Sciences, Beijing 100730, China
| | - Joshua LaBaer
- The Virginia G. Piper Center for Personalized Diagnostics, Biodesign Institute, Arizona State University, Tempe, Arizona 85287, United States
| | - Zhou Zhou
- Department of Laboratory Medicine, National Center for Cardiovascular Diseases and Fuwai Hospital, Peking Union Medical College and Chinese Academy of Medical Sciences, Beijing 100037, China
| | - Xiaobo Yu
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences-Beijing (PHOENIX Center), Beijing Institute of Lifeomics, Beijing 102206, China
| |
Collapse
|
36
|
Sloan DB, DeTar RA, Warren JM. Aminoacyl-tRNA Synthetase Evolution within the Dynamic Tripartite Translation System of Plant Cells. Genome Biol Evol 2023; 15:evad050. [PMID: 36951086 PMCID: PMC10098043 DOI: 10.1093/gbe/evad050] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2022] [Revised: 03/07/2023] [Accepted: 03/09/2023] [Indexed: 03/24/2023] Open
Abstract
Eukaryotes maintain separate protein translation systems for nuclear and organellar genes, including distinct sets of tRNAs and aminoacyl-tRNA synthetases (aaRSs). In animals, mitochondrial-targeted aaRSs are expressed at lower levels and are less conserved in sequence than cytosolic aaRSs involved in translation of nuclear mRNAs, likely reflecting lower translational demands in mitochondria. In plants, translation is further complicated by the presence of plastids, which share most aaRSs with mitochondria. In addition, plant mitochondrial tRNA pools have a dynamic history of gene loss and functional replacement by tRNAs from other compartments. To investigate the consequences of these distinctive features of translation in plants, we analyzed sequence evolution in angiosperm aaRSs. In contrast to previously studied eukaryotic systems, we found that plant organellar and cytosolic aaRSs exhibit only a small difference in expression levels, and organellar aaRSs are slightly more conserved than cytosolic aaRSs. We hypothesize that these patterns result from high translational demands associated with photosynthesis in mature chloroplasts. We also investigated aaRS evolution in Sileneae, an angiosperm lineage with extensive mitochondrial tRNA replacement and aaRS retargeting. We predicted positive selection for changes in aaRS sequence resulting from these recent changes in subcellular localization and tRNA substrates but found little evidence for accelerated sequence divergence. Overall, the complex tripartite translation system in plant cells appears to have imposed more constraints on the long-term evolutionary rates of organellar aaRSs compared with other eukaryotic lineages, and plant aaRS protein sequences appear largely robust to more recent perturbations in subcellular localization and tRNA interactions.
Collapse
Affiliation(s)
- Daniel B Sloan
- Department of Biology, Colorado State University, Fort Collins
| | - Rachael A DeTar
- Department of Biology, Colorado State University, Fort Collins
| | - Jessica M Warren
- Center for Mechanisms of Evolution, Biodesign Institute and School of Life Sciences, Arizona State University, Tempe
| |
Collapse
|
37
|
Mehlhoff JD, Ostermeier M. Genes Vary Greatly in Their Propensity for Collateral Fitness Effects of Mutations. Mol Biol Evol 2023; 40:7043719. [PMID: 36798991 PMCID: PMC9999109 DOI: 10.1093/molbev/msad038] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2022] [Revised: 01/18/2023] [Accepted: 02/13/2023] [Indexed: 02/18/2023] Open
Abstract
Mutations can have deleterious fitness effects when they decrease protein specific activity or decrease active protein abundance. Mutations will also be deleterious when they cause misfolding or misinteractions that are toxic to the cell (i.e., independent of whether the mutations affect specific activity and abundance). The extent to which protein evolution is shaped by these and other collateral fitness effects is unclear in part because little is known of their frequency and magnitude. Using deep mutational scanning (DMS), we previously found at least 42% of missense mutations in the TEM-1 β-lactamase antibiotic resistance gene cause deleterious collateral fitness effects. Here, we used DMS to comprehensively determine the collateral fitness effects of missense mutations in three genes encoding the antibiotic resistance proteins New Delhi metallo-β-lactamase (NDM-1), chloramphenicol acetyltransferase I (CAT-I), and 2″-aminoglycoside nucleotidyltransferase (AadB). AadB (20%), CAT-I (0.9%), and NDM-1 (0.2%) were less susceptible to deleterious collateral fitness effects than TEM-1 (42%) indicating that genes have different propensities for these effects. As was observed with TEM-1, all the studied deleterious aadB mutants increased aggregation. However, aggregation did not correlate with collateral fitness effects for many of the deleterious mutants of CAT-I and NDM-1. Select deleterious mutants caused unexpected phenotypes to emerge. The introduction of internal start codons in CAT-1 caused loss of the episome and a mutation in aadB made its cognate antibiotic essential for growth. Our study illustrates how the complexity of the cell provides a rich environment for collateral fitness effects and new phenotypes to emerge.
Collapse
Affiliation(s)
- Jacob D Mehlhoff
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD
| | - Marc Ostermeier
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD
| |
Collapse
|
38
|
Syed NA, Bhatti A, John P. Molecular dynamics simulations and bioinformatics' analysis of deleterious missense single nucleotide polymorphisms in Glyoxalase-1 gene. J Biomol Struct Dyn 2023; 41:13707-13717. [PMID: 36812296 DOI: 10.1080/07391102.2023.2181654] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2022] [Accepted: 02/12/2023] [Indexed: 02/24/2023]
Abstract
Glyoxalase-1 (Glo-1) is a key member of the Glyoxalase system, the primary line of defense against dicarbonyl stress which, in tandem, with reduced levels of expression or activity of Glyoxalase-1 enzyme, has been implicated in various human diseases like type 2 diabetes mellitus (T2DM) and its vascular complications. The association of Glo-1 single nucleotide polymorphisms with genetic susceptibility to T2DM and its vascular complications is yet to be explored. Therefore, in this study, we have employed a computational approach to identify the most damaging missense or nonsynonymous SNPs (nsSNPs) in Glo-1 gene. Initially, we characterized missense SNPs that are damaging to the structural and functional integrity of Glo-1 using various bioinformatic tools. These tools included SIFT, PolyPhen-2, SNAP, PANTHER, PROVEAN, PhD-SNP, SNPs&GO, I-Mutant, MUpro and MutPred2. One of these missense SNPs (rs1038747749; corresponding to amino acid change Arginine to Glutamine at position 38) was found to be highly conserved in evolution and is an important part of the enzyme's active site, glutathione binding site, as well as the dimeric interface based on the results obtained from ConSurf and NCBI Conserved Domain Search tools. Project HOPE reported that this mutation replaces a positively charged polar amino acid (Arginine) with a small, neutrally charged amino acid (Glutamine). Comparative modelling of wildtype and mutant (R38Q) Glo-1 proteins was performed in the run up to molecular dynamics simulation analysis which showed that rs1038747749 adversely impacts Glo-1 protein's stability, rigidity, compactness, hydrogen bonds/interactions as demonstrated by the results of various parameters computed during the analysis.Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Nida Ali Syed
- Department of Healthcare Biotechnology, Atta-ur-Rahman School of Applied Biosciences (ASAB), National University of Sciences and Technology (NUST), Islamabad, Pakistan
| | - Attya Bhatti
- Department of Healthcare Biotechnology, Atta-ur-Rahman School of Applied Biosciences (ASAB), National University of Sciences and Technology (NUST), Islamabad, Pakistan
| | - Peter John
- Department of Healthcare Biotechnology, Atta-ur-Rahman School of Applied Biosciences (ASAB), National University of Sciences and Technology (NUST), Islamabad, Pakistan
| |
Collapse
|
39
|
Nelson CW, Mirabello L. Human papillomavirus genomics: Understanding carcinogenicity. Tumour Virus Res 2023; 15:200258. [PMID: 36812987 PMCID: PMC10063409 DOI: 10.1016/j.tvr.2023.200258] [Citation(s) in RCA: 21] [Impact Index Per Article: 21.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2022] [Revised: 02/01/2023] [Accepted: 02/17/2023] [Indexed: 02/22/2023] Open
Abstract
Human papillomavirus (HPV) causes virtually all cervical cancers and many cancers at other anatomical sites in both men and women. However, only 12 of 448 known HPV types are currently classified as carcinogens, and even the most carcinogenic type - HPV16 - only rarely leads to cancer. HPV is therefore necessary but insufficient for cervical cancer, with other contributing factors including host and viral genetics. Over the last decade, HPV whole genome sequencing has established that even fine-scale within-type HPV variation influences precancer/cancer risks, and that these risks vary by histology and host race/ethnicity. In this review, we place these findings in the context of the HPV life cycle and evolution at various levels of viral diversity: between-type, within-type, and within-host. We also discuss key concepts necessary for interpreting HPV genomic data, including features of the viral genome; events leading to carcinogenesis; the role of APOBEC3 in HPV infection and evolution; and methodologies that use deep (high-coverage) sequencing to characterize within-host variation, as opposed to relying on a single representative (consensus) sequence. Given the continued high burden of HPV-associated cancers, understanding HPV carcinogenicity remains important for better understanding, preventing, and treating cancers attributable to infection.
Collapse
Affiliation(s)
- Chase W Nelson
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Rockville, MD, 20850, USA; Institute for Comparative Genomics, American Museum of Natural History, New York, NY, 10024, USA.
| | - Lisa Mirabello
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Rockville, MD, 20850, USA.
| |
Collapse
|
40
|
Nithya C, Kiran M, Nagarajaram HA. Dissection of hubs and bottlenecks in a protein-protein interaction network. Comput Biol Chem 2023; 102:107802. [PMID: 36603332 DOI: 10.1016/j.compbiolchem.2022.107802] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2022] [Revised: 11/20/2022] [Accepted: 12/08/2022] [Indexed: 12/23/2022]
Abstract
Analysis of degree centrality in conjunction with betweenness centrality of proteins in a human protein-protein interaction network revealed three categories of centrally important proteins: a) proteins with high degree and betweenness (hub-bottlenecks denoted as MX), b) proteins with high betweenness and low degree (non-hub-bottlenecks/pure bottlenecks denoted as PB) and c) proteins with high degree and low betweenness (hub-non-bottlenecks/pure hubs denoted as PH). When subjected to a detailed statistical analysis of their molecular-level properties, the proteins belonging to each of these categories were found to be associated with distinct canonical molecular properties, i.e., "molecular markers". The MX proteins are a) conformationally versatile, mainly comprising of essential proteins, b) the targets for interactions by the proteins of viral and bacterial pathogens, c) evolutionally constrained, involved in multiple pathways, enriched with disease genes and d) involved in the functions such as protein stabilization, phosphorylation, and mRNA slicing processes. PB proteins are a) enriched with extracellular and cancer-related proteins, b) enriched with the approved drug targets and c) involved in cell-cell signaling processes. Finally, PH are a) structurally versatile, b) enriched with essential proteins primarily involved in housekeeping processes (transcription and replication). The fact that the proteins belonging to these three categories form three distinct sets in terms of their molecular properties reveals the existence of trichotomy among hubs and bottlenecks, and this knowledge is of paramount importance while prioritizing protein targets for further studies such as drug design and disease association studies based on their network centrality values.
Collapse
Affiliation(s)
- Chandramohan Nithya
- Department of Biotechnology and Bioinformatics, School of Life Sciences, University of Hyderabad, Hyderabad, Telangana 500046, India
| | - Manjari Kiran
- Department of Systems and Computational Biology, School of Life Sciences, University of Hyderabad, Hyderabad, Telangana 500046, India
| | | |
Collapse
|
41
|
Zhang J. What Has Genomics Taught An Evolutionary Biologist? GENOMICS, PROTEOMICS & BIOINFORMATICS 2023; 21:1-12. [PMID: 36720382 PMCID: PMC10373158 DOI: 10.1016/j.gpb.2023.01.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/07/2022] [Revised: 01/06/2023] [Accepted: 01/19/2023] [Indexed: 01/30/2023]
Abstract
Genomics, an interdisciplinary field of biology on the structure, function, and evolution of genomes, has revolutionized many subdisciplines of life sciences, including my field of evolutionary biology, by supplying huge data, bringing high-throughput technologies, and offering a new approach to biology. In this review, I describe what I have learned from genomics and highlight the fundamental knowledge and mechanistic insights gained. I focus on three broad topics that are central to evolutionary biology and beyond-variation, interaction, and selection-and use primarily my own research and study subjects as examples. In the next decade or two, I expect that the most important contributions of genomics to evolutionary biology will be to provide genome sequences of nearly all known species on Earth, facilitate high-throughput phenotyping of natural variants and systematically constructed mutants for mapping genotype-phenotype-fitness landscapes, and assist the determination of causality in evolutionary processes using experimental evolution.
Collapse
Affiliation(s)
- Jianzhi Zhang
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI 48109, USA.
| |
Collapse
|
42
|
Sloan DB, Warren JM, Williams AM, Kuster SA, Forsythe ES. Incompatibility and Interchangeability in Molecular Evolution. Genome Biol Evol 2023; 15:evac184. [PMID: 36583227 PMCID: PMC9839398 DOI: 10.1093/gbe/evac184] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2022] [Revised: 12/20/2022] [Accepted: 12/22/2022] [Indexed: 12/31/2022] Open
Abstract
There is remarkable variation in the rate at which genetic incompatibilities in molecular interactions accumulate. In some cases, minor changes-even single-nucleotide substitutions-create major incompatibilities when hybridization forces new variants to function in a novel genetic background from an isolated population. In other cases, genes or even entire functional pathways can be horizontally transferred between anciently divergent evolutionary lineages that span the tree of life with little evidence of incompatibilities. In this review, we explore whether there are general principles that can explain why certain genes are prone to incompatibilities while others maintain interchangeability. We summarize evidence pointing to four genetic features that may contribute to greater resistance to functional replacement: (1) function in multisubunit enzyme complexes and protein-protein interactions, (2) sensitivity to changes in gene dosage, (3) rapid rate of sequence evolution, and (4) overall importance to cell viability, which creates sensitivity to small perturbations in molecular function. We discuss the relative levels of support for these different hypotheses and lay out future directions that may help explain the striking contrasts in patterns of incompatibility and interchangeability throughout the history of molecular evolution.
Collapse
Affiliation(s)
- Daniel B Sloan
- Department of Biology, Colorado State University, Fort Collins, Colorado
| | - Jessica M Warren
- Center for Mechanisms of Evolution, Biodesign Institute and School of Life Sciences, Arizona State University, Tempe, Arizona
| | - Alissa M Williams
- Department of Biological Sciences, Vanderbilt University, Nashville, Tennessee
| | - Shady A Kuster
- Department of Biology, Colorado State University, Fort Collins, Colorado
| | - Evan S Forsythe
- Department of Biology, Colorado State University, Fort Collins, Colorado
| |
Collapse
|
43
|
Tao W, Li R, Li T, Li Z, Li Y, Cui L. The evolutionary patterns, expression profiles, and genetic diversity of expanded genes in barley. FRONTIERS IN PLANT SCIENCE 2023; 14:1168124. [PMID: 37180392 PMCID: PMC10171312 DOI: 10.3389/fpls.2023.1168124] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/17/2023] [Accepted: 03/28/2023] [Indexed: 05/16/2023]
Abstract
Gene duplication resulting from whole-genome duplication (WGD), small-scale duplication (SSD), or unequal hybridization plays an important role in the expansion of gene families. Gene family expansion can also mediate species formation and adaptive evolution. Barley (Hordeum vulgare) is the world's fourth largest cereal crop, and it contains valuable genetic resources due to its ability to tolerate various types of environmental stress. In this study, 27,438 orthogroups in the genomes of seven Poaceae were identified, and 214 of them were significantly expanded in barley. The evolutionary rates, gene properties, expression profiles, and nucleotide diversity between expanded and non-expanded genes were compared. Expanded genes evolved more rapidly and experienced lower negative selection. Expanded genes, including their exons and introns, were shorter, they had fewer exons, their GC content was lower, and their first exons were longer compared with non-expanded genes. Codon usage bias was also lower for expanded genes than for non-expanded genes; the expression levels of expanded genes were lower than those of non-expanded genes, and the expression of expanded genes showed higher tissue specificity than that of non-expanded genes. Several stress-response-related genes/gene families were identified, and these genes could be used to breed barley plants with greater resistance to environmental stress. Overall, our analysis revealed evolutionary, structural, and functional differences between expanded and non-expanded genes in barley. Additional studies are needed to clarify the functions of the candidate genes identified in our study and evaluate their utility for breeding barley plants with greater stress resistance.
Collapse
Affiliation(s)
- Wenjing Tao
- College of Bioscience and Engineering, Jiangxi Agricultural University, Nanchang, Jiangxi, China
- State Key Laboratory of Cellular Stress Biology, School of Life Sciences, Faculty of Medicine and Life Sciences, Xiamen University, Xiamen, Fujian, China
| | - Ruiying Li
- College of Bioscience and Engineering, Jiangxi Agricultural University, Nanchang, Jiangxi, China
| | - Tingting Li
- College of Bioscience and Engineering, Jiangxi Agricultural University, Nanchang, Jiangxi, China
| | - Zhimin Li
- College of Bioscience and Engineering, Jiangxi Agricultural University, Nanchang, Jiangxi, China
| | - Yihan Li
- College of Bioscience and Engineering, Jiangxi Agricultural University, Nanchang, Jiangxi, China
- *Correspondence: Yihan Li, ; Licao Cui,
| | - Licao Cui
- College of Bioscience and Engineering, Jiangxi Agricultural University, Nanchang, Jiangxi, China
- *Correspondence: Yihan Li, ; Licao Cui,
| |
Collapse
|
44
|
Hao J, Liang Y, Ping J, Li J, Shi W, Su Y, Wang T. Chloroplast gene expression level is negatively correlated with evolutionary rates and selective pressure while positively with codon usage bias in Ophioglossum vulgatum L. BMC PLANT BIOLOGY 2022; 22:580. [PMID: 36510137 PMCID: PMC9746204 DOI: 10.1186/s12870-022-03960-8] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/06/2022] [Accepted: 11/24/2022] [Indexed: 05/02/2023]
Abstract
BACKGROUND Characterization of the key factors determining gene expression level has been of significant interest. Previous studies on the relationship among evolutionary rates, codon usage bias, and expression level mostly focused on either nuclear genes or unicellular/multicellular organisms but few in chloroplast (cp) genes. Ophioglossum vulgatum is a unique fern and has important scientific and medicinal values. In this study, we sequenced its cp genome and transcriptome to estimate the evolutionary rates (dN and dS), selective pressure (dN/dS), gene expression level, codon usage bias, and their correlations. RESULTS The correlation coefficients between dN, dS, and dN/dS, and Transcripts Per Million (TPM) average values were -0.278 (P = 0.027 < 0.05), -0.331 (P = 0.008 < 0.05), and -0.311 (P = 0.013 < 0.05), respectively. The codon adaptation index (CAI) and tRNA adaptation index (tAI) were significantly positively correlated with TPM average values (P < 0.05). CONCLUSIONS Our results indicated that when the gene expression level was higher, the evolutionary rates and selective pressure were lower, but the codon usage bias was stronger. We provided evidence from cp gene data which supported the E-R (E stands for gene expression level and R stands for evolutionary rate) anti-correlation.
Collapse
Affiliation(s)
- Jing Hao
- College of Life Sciences, South China Agricultural University, Guangzhou, 510642, China
| | - Yingyi Liang
- College of Life Sciences, South China Agricultural University, Guangzhou, 510642, China
| | - Jingyao Ping
- College of Life Sciences, South China Agricultural University, Guangzhou, 510642, China
| | - Jinye Li
- College of Life Sciences, South China Agricultural University, Guangzhou, 510642, China
| | - Wanxin Shi
- College of Life Sciences, South China Agricultural University, Guangzhou, 510642, China
| | - Yingjuan Su
- School of Life Sciences, Sun Yat-sen University, Guangzhou, 510275, China.
- Research Institute of Sun Yat-sen University in Shenzhen, Shenzhen, 518057, China.
| | - Ting Wang
- College of Life Sciences, South China Agricultural University, Guangzhou, 510642, China.
| |
Collapse
|
45
|
Zhang J, Xu C. Gene product diversity: adaptive or not? Trends Genet 2022; 38:1112-1122. [PMID: 35641344 PMCID: PMC9560964 DOI: 10.1016/j.tig.2022.05.002] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2022] [Revised: 04/30/2022] [Accepted: 05/03/2022] [Indexed: 01/24/2023]
Abstract
One gene does not equal one RNA or protein. The genomic revolution has revealed numerous different RNA and protein molecules that can be produced from one gene, such as circular RNAs generated by back-splicing, proteins with residues mismatching the genomic encoding because of RNA editing, and proteins extended in the C terminus via stop codon readthrough in translation. Are these diverse products results of exquisite gene regulations or imprecise biological processes? While there are cases where the gene product diversity appears beneficial, genome-scale patterns suggest that much of this diversity arises from nonadaptive, molecular errors. This finding has important implications for studying the functions of diverse gene products and for understanding the fundamental properties and evolution of cellular life.
Collapse
Affiliation(s)
- Jianzhi Zhang
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI 48109, USA.
| | - Chuan Xu
- Bio-X Institutes, Key Laboratory for the Genetics of Developmental and Neuropsychiatric Disorders, Ministry of Education, Shanghai Jiao Tong University, Shanghai 200240, China
| |
Collapse
|
46
|
Moldovan MA, Gaydukova SA. Unusual Dependence between Gene Expression and Negative Selection in Euplotes. Mol Biol 2022. [DOI: 10.1134/s0026893323010090] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
|
47
|
Shibai A, Kotani H, Sakata N, Furusawa C, Tsuru S. Purifying selection enduringly acts on the sequence evolution of highly expressed proteins in Escherichia coli. G3 GENES|GENOMES|GENETICS 2022; 12:6694045. [PMID: 36073932 PMCID: PMC9635659 DOI: 10.1093/g3journal/jkac235] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/04/2022] [Accepted: 08/27/2022] [Indexed: 11/17/2022]
Abstract
The evolutionary speed of a protein sequence is constrained by its expression level, with highly expressed proteins evolving relatively slowly. This negative correlation between expression levels and evolutionary rates (known as the E–R anticorrelation) has already been widely observed in past macroevolution between species from bacteria to animals. However, it remains unclear whether this seemingly general law also governs recent evolution, including past and de novo, within a species. However, the advent of genomic sequencing and high-throughput phenotyping, particularly for bacteria, has revealed fundamental gaps between the 2 evolutionary processes and has provided empirical data opposing the possible underlying mechanisms which are widely believed. These conflicts raise questions about the generalization of the E–R anticorrelation and the relevance of plausible mechanisms. To explore the ubiquitous impact of expression levels on molecular evolution and test the relevance of the possible underlying mechanisms, we analyzed the genome sequences of 99 strains of Escherichia coli for evolution within species in nature. We also analyzed genomic mutations accumulated under laboratory conditions as a model of de novo evolution within species. Here, we show that E–R anticorrelation is significant in both past and de novo evolution within species in E. coli. Our data also confirmed ongoing purifying selection on highly expressed genes. Ongoing selection included codon-level purifying selection, supporting the relevance of the underlying mechanisms. However, the impact of codon-level purifying selection on the constraints in evolution within species might be smaller than previously expected from evolution between species.
Collapse
Affiliation(s)
- Atsushi Shibai
- Center for Biosystems Dynamics Research (BDR), RIKEN , Osaka 565-0874, Japan
| | - Hazuki Kotani
- Center for Biosystems Dynamics Research (BDR), RIKEN , Osaka 565-0874, Japan
| | - Natsue Sakata
- Center for Biosystems Dynamics Research (BDR), RIKEN , Osaka 565-0874, Japan
| | - Chikara Furusawa
- Center for Biosystems Dynamics Research (BDR), RIKEN , Osaka 565-0874, Japan
- Universal Biology Institute, School of Science, The University of Tokyo , Tokyo 113-0033, Japan
| | - Saburo Tsuru
- Universal Biology Institute, School of Science, The University of Tokyo , Tokyo 113-0033, Japan
| |
Collapse
|
48
|
Ramstein GP, Buckler ES. Prediction of evolutionary constraint by genomic annotations improves functional prioritization of genomic variants in maize. Genome Biol 2022; 23:183. [PMID: 36050782 PMCID: PMC9438327 DOI: 10.1186/s13059-022-02747-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2022] [Accepted: 08/15/2022] [Indexed: 11/10/2022] Open
Abstract
Background Crop improvement through cross-population genomic prediction and genome editing requires identification of causal variants at high resolution, within fewer than hundreds of base pairs. Most genetic mapping studies have generally lacked such resolution. In contrast, evolutionary approaches can detect genetic effects at high resolution, but they are limited by shifting selection, missing data, and low depth of multiple-sequence alignments. Here we use genomic annotations to accurately predict nucleotide conservation across angiosperms, as a proxy for fitness effect of mutations. Results Using only sequence analysis, we annotate nonsynonymous mutations in 25,824 maize gene models, with information from bioinformatics and deep learning. Our predictions are validated by experimental information: within-species conservation, chromatin accessibility, and gene expression. According to gene ontology and pathway enrichment analyses, predicted nucleotide conservation points to genes in central carbon metabolism. Importantly, it improves genomic prediction for fitness-related traits such as grain yield, in elite maize panels, by stringent prioritization of fewer than 1% of single-site variants. Conclusions Our results suggest that predicting nucleotide conservation across angiosperms may effectively prioritize sites most likely to impact fitness-related traits in crops, without being limited by shifting selection, missing data, and low depth of multiple-sequence alignments. Our approach—Prediction of mutation Impact by Calibrated Nucleotide Conservation (PICNC)—could be useful to select polymorphisms for accurate genomic prediction, and candidate mutations for efficient base editing. The trained PICNC models and predicted nucleotide conservation at protein-coding SNPs in maize are publicly available in CyVerse (10.25739/hybz-2957). Supplementary Information The online version contains supplementary material available at 10.1186/s13059-022-02747-2.
Collapse
Affiliation(s)
- Guillaume P Ramstein
- Center for Quantitative Genetics and Genomics, Aarhus University, 8000, Aarhus, Denmark. .,Institute for Genomic Diversity, Cornell University, Ithaca, NY, 14853, USA.
| | - Edward S Buckler
- Institute for Genomic Diversity, Cornell University, Ithaca, NY, 14853, USA.,USDA-ARS, Ithaca, NY, 14853, USA
| |
Collapse
|
49
|
Karamycheva S, Wolf YI, Persi E, Koonin EV, Makarova KS. Analysis of lineage-specific protein family variability in prokaryotes combined with evolutionary reconstructions. Biol Direct 2022; 17:22. [PMID: 36042479 PMCID: PMC9425974 DOI: 10.1186/s13062-022-00337-7] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2022] [Accepted: 08/13/2022] [Indexed: 12/24/2022] Open
Abstract
Background Evolutionary rate is a key characteristic of gene families that is linked to the functional importance of the respective genes as well as specific biological functions of the proteins they encode. Accurate estimation of evolutionary rates is a challenging task that requires precise phylogenetic analysis. Here we present an easy to estimate protein family level measure of sequence variability based on alignment column homogeneity in multiple alignments of protein sequences from Clade-Specific Clusters of Orthologous Genes (csCOGs). Results We report genome-wide estimates of variability for 8 diverse groups of bacteria and archaea and investigate the connection between variability and various genomic and biological features. The variability estimates are based on homogeneity distributions across amino acid sequence alignments and can be obtained for multiple groups of genomes at minimal computational expense. About half of the variance in variability values can be explained by the analyzed features, with the greatest contribution coming from the extent of gene paralogy in the given csCOG. The correlation between variability and paralogy appears to originate, primarily, not from gene duplication, but from acquisition of distant paralogs and xenologs, introducing sequence variants that are more divergent than those that could have evolved in situ during the lifetime of the given group of organisms. Both high-variability and low-variability csCOGs were identified in all functional categories, but as expected, proteins encoded by integrated mobile elements as well as proteins involved in defense functions and cell motility are, on average, more variable than proteins with housekeeping functions. Additionally, using linear discriminant analysis, we found that variability and fraction of genomes carrying a given gene are the two variables that provide the best prediction of gene essentiality as compared to the results of transposon mutagenesis in Sulfolobus islandicus. Conclusions Variability, a measure of sequence diversity within an alignment relative to the overall diversity within a group of organisms, offers a convenient proxy for evolutionary rate estimates and is informative with respect to prediction of functional properties of proteins. In particular, variability is a strong predictor of gene essentiality for the respective organisms and indicative of sub- or neofunctionalization of paralogs. Supplementary Information The online version contains supplementary material available at 10.1186/s13062-022-00337-7.
Collapse
Affiliation(s)
- Svetlana Karamycheva
- National Center for Biotechnology Information, National Library of Medicine, Bethesda, MD, 20894, USA
| | - Yuri I Wolf
- National Center for Biotechnology Information, National Library of Medicine, Bethesda, MD, 20894, USA
| | - Erez Persi
- National Center for Biotechnology Information, National Library of Medicine, Bethesda, MD, 20894, USA
| | - Eugene V Koonin
- National Center for Biotechnology Information, National Library of Medicine, Bethesda, MD, 20894, USA
| | - Kira S Makarova
- National Center for Biotechnology Information, National Library of Medicine, Bethesda, MD, 20894, USA.
| |
Collapse
|
50
|
Low protein expression enhances phenotypic evolvability by intensifying selection on folding stability. Nat Ecol Evol 2022; 6:1155-1164. [PMID: 35798838 PMCID: PMC7613228 DOI: 10.1038/s41559-022-01797-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2021] [Accepted: 05/19/2022] [Indexed: 01/09/2023]
Abstract
Protein abundance affects the evolution of protein genotypes, but we do not know how it affects the evolution of protein phenotypes. Here we investigate the role of protein abundance in the evolvability of green fluorescent protein (GFP) towards the novel phenotype of cyan fluorescence. We evolve GFP in E. coli through multiple cycles of mutation and selection and show that low GFP expression facilitates the evolution of cyan fluorescence. A computational model whose predictions we test experimentally helps explain why: lowly expressed proteins are under stronger selection for proper folding, which facilitates their evolvability on short evolutionary time scales. The reason is that high fluorescence can be achieved by either few proteins that fold well or by many proteins that fold less well. In other words, we observe a synergy between a protein's scarcity and its stability. Because many proteins meet the essential requirements for this scarcity-stability synergy, it may be a widespread mechanism by which low expression helps proteins evolve new phenotypes and functions.
Collapse
|