1
|
Arakaza A, Liu X, Zhu J, Zou L. Assessment of serum levels and placental bed tissue expression of IGF-1, bFGF, and PLGF in patients with placenta previa complicated with placenta accreta spectrum disorders. J Matern Fetal Neonatal Med 2024; 37:2305264. [PMID: 38247274 DOI: 10.1080/14767058.2024.2305264] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2023] [Accepted: 01/07/2024] [Indexed: 01/23/2024]
Abstract
OBJECTIVE This study aims to detect the serum levels of IGF-1, bFGF, and PLGF and their expressions in placental bed tissues of patients with placenta previa complicated with PAS disorders. METHODS This case and control study included 40 multiparous pregnant women with complete placenta previa between 34 weeks and 38 weeks of gestation and they were divided into two groups: 25 patients with PAS (case group) and 15 patients without PAS (control group). The venous blood samples were collected 2 h before the cesarean section, and the placental bed tissues were taken intraoperatively at the placental implantation site and then were histologically examined to evaluate the gravity of the myometrial invasion of the placenta. According to FIGO PAS increasing grading, the 25 patients were also divided into three groups: PAS grade I group, PAS grade II group, and PAS grade III group. The concentrations of IGF-1, bFGF, and PLGF in serum were measured using ELISA, and the mean ratio of the relative mRNA expression of each biomarker in placental bed tissues was calculated using qRT-PCR. The staining intensity and the positive cells were quantitatively measured and expressed as means by using Image J software for IHC analysis. RESULTS IGF-1 had low serum levels and high placental bed expression in placenta previa patients with PAS disorders compared to those without PAS (all p < 0.0001). PLGF had high serum levels (p = 0.0200) and high placental bed expression (p < 0.0001) in placenta previa patients with PAS disorders compared to those without PAS. IGF-1 serum levels decreased up to PAS grade II (means were 24.3 ± 4.03, 21.98 ± 3.29, and 22.03 ± 7.31, respectively for PAS grade I, PAS grade II, PAS grade III groups, p = 0.0006). PLGF serum levels increased up to PAS grade II (means were 12.96 ± 2.74, 14.97 ± 2.56, and 14.89 ± 2.14, respectively for the three groups, p = 0.0392). However, IGF-1 and PLGF mRNA placental bed expression increased up to PAS grade III. The relative expression of mRNA means for the three groups was 3.194 ± 1.40, 3.509 ± 0.63, and 3.872 ± 0.70, respectively for IGF-1; and 2.784 ± 1.14, 2.810 ± 0.71, and 2.869 ± 0.48, respectively for PLGF (all p < 0.0001). Their IHC (immunohistochemical) staining also had increasing trends, but p > 0.05. bFGF was not significantly expressed in placenta previa with PAS disorders in most of the analysis sections (p > 0.05). CONCLUSIONS Low serum levels and high expression in placental bed tissues of IGF-1, or high serum levels and high expression in placental bed tissues of PLGF, may differentiate placenta previa patients with FIGO PAS grade I and PAS grade II from those without PAS disorders. However, they could not significantly predict the degree of placental invasiveness in FIGO PAS grades II and III.
Collapse
Affiliation(s)
- Arcade Arakaza
- Department of Obstetrics and Gynecology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
| | - Xiaoxia Liu
- Department of Obstetrics and Gynecology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
| | - Jianwen Zhu
- Department of Obstetrics and Gynecology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
| | - Li Zou
- Department of Obstetrics and Gynecology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
| |
Collapse
|
2
|
Ali F. Patterns of Change in Nucleotide Diversity Over Gene Length. Genome Biol Evol 2024; 16:evae078. [PMID: 38608148 PMCID: PMC11040516 DOI: 10.1093/gbe/evae078] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2023] [Revised: 03/26/2024] [Accepted: 04/03/2024] [Indexed: 04/14/2024] Open
Abstract
Nucleotide diversity at a site is influenced by the relative strengths of neutral and selective population genetic processes. Therefore, attempts to estimate Effective population size based on the diversity of synonymous sites demand a better understanding of their selective constraints. The nucleotide diversity of a gene was previously found to correlate with its length. In this work, I measure nucleotide diversity at synonymous sites and uncover a pattern of low diversity towards the translation initiation site of a gene. The degree of reduction in diversity at the translation initiation site and the length of this region of reduced diversity can be quantified as "Effect Size" and "Effect Length" respectively, using parameters of an asymptotic regression model. Estimates of Effect Length across bacteria covaried with recombination rates as well as with a multitude of translation-associated traits such as the avoidance of mRNA secondary structure around translation initiation site, the number of rRNAs, and relative codon usage of ribosomal genes. Evolutionary simulations under purifying selection reproduce the observed patterns and diversity-length correlation and highlight that selective constraints on the 5'-region of a gene may be more extensive than previously believed. These results have implications for the estimation of effective population size, and relative mutation rates, and for genome scans of genes under positive selection based on "silent-site" diversity.
Collapse
Affiliation(s)
- Farhan Ali
- Biodesign Center for Mechanisms of Evolution, Arizona State University, Tempe, AZ 85281, USA
| |
Collapse
|
3
|
Sandler G, Agrawal AF, Wright SI. Population Genomics of the Facultatively Sexual Liverwort Marchantia polymorpha. Genome Biol Evol 2023; 15:evad196. [PMID: 37883717 PMCID: PMC10667032 DOI: 10.1093/gbe/evad196] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2022] [Revised: 10/15/2023] [Accepted: 10/18/2023] [Indexed: 10/28/2023] Open
Abstract
The population genomics of facultatively sexual organisms are understudied compared with their abundance across the tree of life. We explore patterns of genetic diversity in two subspecies of the facultatively sexual liverwort Marchantia polymorpha using samples from across Southern Ontario, Canada. Despite the ease with which M. polymorpha should be able to propagate asexually, we find no evidence of strictly clonal descent among our samples and little to no signal of isolation by distance. Patterns of identity-by-descent tract sharing further showed evidence of recent recombination and close relatedness between geographically distant isolates, suggesting long distance gene flow and at least a modest frequency of sexual reproduction. However, the M. polymorpha genome contains overall very low levels of nucleotide diversity and signs of inefficient selection evidenced by a relatively high fraction of segregating deleterious variants. We interpret these patterns as possible evidence of the action of linked selection and a small effective population size due to past generations of asexual propagation. Overall, the M. polymorpha genome harbors signals of a complex history of both sexual and asexual reproduction.
Collapse
Affiliation(s)
- George Sandler
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, Ontario, Canada
| | - Aneil F Agrawal
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, Ontario, Canada
- Center for Analysis of Genome Evolution and Function, University of Toronto, Toronto, Ontario, Canada
| | - Stephen I Wright
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, Ontario, Canada
- Center for Analysis of Genome Evolution and Function, University of Toronto, Toronto, Ontario, Canada
| |
Collapse
|
4
|
Yu G, Liu Y, Li Z, Deng S, Wu Z, Zhang X, Chen W, Yang J, Chen X, Yang JR. Genome-wide probing of eukaryotic nascent RNA structure elucidates cotranscriptional folding and its antimutagenic effect. Nat Commun 2023; 14:5853. [PMID: 37730811 PMCID: PMC10511511 DOI: 10.1038/s41467-023-41550-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2022] [Accepted: 09/08/2023] [Indexed: 09/22/2023] Open
Abstract
The transcriptional intermediates of RNAs fold into secondary structures with multiple regulatory roles, yet the details of such cotranscriptional RNA folding are largely unresolved in eukaryotes. Here, we present eSPET-seq (Structural Probing of Elongating Transcripts in eukaryotes), a method to assess the cotranscriptional RNA folding in Saccharomyces cerevisiae. Our study reveals pervasive structural transitions during cotranscriptional folding and overall structural similarities between nascent and mature RNAs. Furthermore, a combined analysis with genome-wide R-loop and mutation rate approximations provides quantitative evidence for the antimutator effect of nascent RNA folding through competitive inhibition of the R-loops, known to facilitate transcription-associated mutagenesis. Taken together, we present an experimental evaluation of cotranscriptional folding in eukaryotes and demonstrate the antimutator effect of nascent RNA folding. These results suggest genome-wide coupling between the processing and transmission of genetic information through RNA folding.
Collapse
Affiliation(s)
- Gongwang Yu
- Advanced Medical Technology Center, The First Affiliated Hospital, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, 510080, China
- Department of Genetics and Biomedical Informatics, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, 510080, China
| | - Yao Liu
- Advanced Medical Technology Center, The First Affiliated Hospital, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, 510080, China
- Department of Genetics and Biomedical Informatics, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, 510080, China
| | - Zizhang Li
- Department of Genetics and Biomedical Informatics, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, 510080, China
| | - Shuyun Deng
- Department of Genetics and Biomedical Informatics, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, 510080, China
| | - Zhuoxing Wu
- Department of Genetics and Biomedical Informatics, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, 510080, China
| | - Xiaoyu Zhang
- Advanced Medical Technology Center, The First Affiliated Hospital, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, 510080, China
- Department of Genetics and Biomedical Informatics, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, 510080, China
| | - Wenbo Chen
- Department of Genetics and Biomedical Informatics, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, 510080, China
| | - Junnan Yang
- Department of Genetics and Biomedical Informatics, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, 510080, China
| | - Xiaoshu Chen
- Advanced Medical Technology Center, The First Affiliated Hospital, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, 510080, China
- Department of Genetics and Biomedical Informatics, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, 510080, China
- Key Laboratory of Tropical Disease Control, Ministry of Education, Sun Yat-sen University, Guangzhou, 510080, China
| | - Jian-Rong Yang
- Advanced Medical Technology Center, The First Affiliated Hospital, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, 510080, China.
- Department of Genetics and Biomedical Informatics, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, 510080, China.
- Key Laboratory of Tropical Disease Control, Ministry of Education, Sun Yat-sen University, Guangzhou, 510080, China.
| |
Collapse
|
5
|
Johnson MM, Hockenberry AJ, McGuffie MJ, Vieira LC, Wilke CO. Growth-dependent Gene Expression Variation Influences the Strength of Codon Usage Biases. Mol Biol Evol 2023; 40:msad189. [PMID: 37619989 PMCID: PMC10482319 DOI: 10.1093/molbev/msad189] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2023] [Accepted: 08/11/2023] [Indexed: 08/26/2023] Open
Abstract
The most highly expressed genes in microbial genomes tend to use a limited set of synonymous codons, often referred to as "preferred codons." The existence of preferred codons is commonly attributed to selection pressures on various aspects of protein translation including accuracy and/or speed. However, gene expression is condition-dependent and even within single-celled organisms transcript and protein abundances can vary depending on a variety of environmental and other factors. Here, we show that growth rate-dependent expression variation is an important constraint that significantly influences the evolution of gene sequences. Using large-scale transcriptomic and proteomic data sets in Escherichia coli and Saccharomyces cerevisiae, we confirm that codon usage biases are strongly associated with gene expression but highlight that this relationship is most pronounced when gene expression measurements are taken during rapid growth conditions. Specifically, genes whose relative expression increases during periods of rapid growth have stronger codon usage biases than comparably expressed genes whose expression decreases during rapid growth conditions. These findings highlight that gene expression measured in any particular condition tells only part of the story regarding the forces shaping the evolution of microbial gene sequences. More generally, our results imply that microbial physiology during rapid growth is critical for explaining long-term translational constraints.
Collapse
Affiliation(s)
- Mackenzie M Johnson
- Department of Integrative Biology, The University of Texas at Austin, Austin, TX, USA
| | - Adam J Hockenberry
- Department of Integrative Biology, The University of Texas at Austin, Austin, TX, USA
| | - Matthew J McGuffie
- Department of Molecular Biosciences, Center for Systems and Synthetic Biology, The University of Texas at Austin, Austin, TX, USA
| | - Luiz Carlos Vieira
- Department of Integrative Biology, The University of Texas at Austin, Austin, TX, USA
| | - Claus O Wilke
- Department of Integrative Biology, The University of Texas at Austin, Austin, TX, USA
| |
Collapse
|
6
|
Wienecke AN, Barry ML, Pollard DA. Natural variation in codon bias and mRNA folding strength interact synergistically to modify protein expression in Saccharomyces cerevisiae. Genetics 2023; 224:iyad113. [PMID: 37310925 PMCID: PMC10411576 DOI: 10.1093/genetics/iyad113] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2023] [Revised: 04/10/2023] [Accepted: 05/15/2023] [Indexed: 06/15/2023] Open
Abstract
Codon bias and mRNA folding strength (mF) are hypothesized molecular mechanisms by which polymorphisms in genes modify protein expression. Natural patterns of codon bias and mF across genes as well as effects of altering codon bias and mF suggest that the influence of these 2 mechanisms may vary depending on the specific location of polymorphisms within a transcript. Despite the central role codon bias and mF may play in natural trait variation within populations, systematic studies of how polymorphic codon bias and mF relate to protein expression variation are lacking. To address this need, we analyzed genomic, transcriptomic, and proteomic data for 22 Saccharomyces cerevisiae isolates, estimated protein accumulation for each allele of 1,620 genes as the log of protein molecules per RNA molecule (logPPR), and built linear mixed-effects models associating allelic variation in codon bias and mF with allelic variation in logPPR. We found that codon bias and mF interact synergistically in a positive association with logPPR, and this interaction explains almost all the effects of codon bias and mF. We examined how the locations of polymorphisms within transcripts influence their effects and found that codon bias primarily acts through polymorphisms in domain-encoding and 3' coding sequences, while mF acts most significantly through coding sequences with weaker effects from untranslated regions. Our results present the most comprehensive characterization to date of how polymorphisms in transcripts influence protein expression.
Collapse
Affiliation(s)
- Anastacia N Wienecke
- Biology Department, Western Washington University, Bellingham, WA 98225, USA
- Department of Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
- Curriculum in Bioinformatics and Computational Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Margaret L Barry
- Biology Department, Western Washington University, Bellingham, WA 98225, USA
| | - Daniel A Pollard
- Biology Department, Western Washington University, Bellingham, WA 98225, USA
| |
Collapse
|
7
|
Ali F. Patterns of change in nucleotide diversity over gene length. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.07.13.548940. [PMID: 37503020 PMCID: PMC10369989 DOI: 10.1101/2023.07.13.548940] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/29/2023]
Abstract
Nucleotide diversity at a site is influenced by the relative strengths of neutral and selective population genetic processes. Therefore, attempts to identify sites under positive selection require an understanding of the expected diversity in its absence. The nucleotide diversity of a gene was previously found to correlate with its length. In this work, I measure nucleotide diversity at synonymous sites and uncover a pattern of low diversity towards the translation initiation site (TIS) of a gene. The degree of reduction in diversity at the TIS and the length of this region of reduced diversity can be quantified as "Effect Size" and "Effect Length" respectively, using parameters of an asymptotic regression model. Estimates of Effect Length across bacteria covaried with recombination rates as well as with a multitude of fast-growth adaptations such as the avoidance of mRNA secondary structure around TIS, the number of rRNAs, and relative codon usage of ribosomal genes. Thus, the dependence of nucleotide diversity on gene length is governed by a combination of selective and non-selective processes. These results have implications for the estimation of effective population size and relative mutation rates based on "silent-site" diversity, and for pN/pS-based prediction of genes under selection.
Collapse
Affiliation(s)
- Farhan Ali
- Biodesign Institute, Arizona State University, Tempe, Arizona
| |
Collapse
|
8
|
Johnson MM, Hockenberry AJ, McGuffie MJ, Vieira LC, Wilke CO. Growth-dependent gene expression variation influences the strength of codon usage biases. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.03.14.532645. [PMID: 36993177 PMCID: PMC10055066 DOI: 10.1101/2023.03.14.532645] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 04/29/2023]
Abstract
The most highly expressed genes in microbial genomes tend to use a limited set of synonymous codons, often referred to as "preferred codons." The existence of preferred codons is commonly attributed to selection pressures on various aspects of protein translation including accuracy and/or speed. However, gene expression is condition-dependent and even within single-celled organisms transcript and protein abundances can vary depending on a variety of environmental and other factors. Here, we show that growth rate-dependent expression variation is an important constraint that significantly influences the evolution of gene sequences. Using large-scale transcriptomic and proteomic data sets in Escherichia coli and Saccharomyces cerevisiae, we confirm that codon usage biases are strongly associated with gene expression but highlight that this relationship is most pronounced when gene expression measurements are taken during rapid growth conditions. Specifically, genes whose relative expression increases during periods of rapid growth have stronger codon usage biases than comparably expressed genes whose expression decreases during rapid growth conditions. These findings highlight that gene expression measured in any particular condition tells only part of the story regarding the forces shaping the evolution of microbial gene sequences. More generally, our results imply that microbial physiology during rapid growth is critical for explaining long-term translational constraints.
Collapse
Affiliation(s)
- Mackenzie M Johnson
- Department of Integrative Biology, The University of Texas at Austin, Austin, TX, United States of America
| | - Adam J Hockenberry
- Department of Integrative Biology, The University of Texas at Austin, Austin, TX, United States of America
| | - Matthew J McGuffie
- Department of Molecular Biosciences, Center for Systems and Synthetic Biology, The University of Texas at Austin, Austin, TX, United States of America
| | - Luiz Carlos Vieira
- Department of Integrative Biology, The University of Texas at Austin, Austin, TX, United States of America
| | - Claus O Wilke
- Department of Integrative Biology, The University of Texas at Austin, Austin, TX, United States of America
| |
Collapse
|
9
|
Wang D, Yang D, Yang L, Diao L, Zhang Y, Li Y, Wang H, Ren J, Cheng L, Tan Q, Zhang R, Han X, Zhang X, Wang B, Li D, Chen M, Hermjakob H, Li Y, LaBaer J, Zhou Z, Yu X. Human Autoantigen Atlas: Searching for the Hallmarks of Autoantigens. J Proteome Res 2023. [PMID: 37183442 DOI: 10.1021/acs.jproteome.2c00799] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/16/2023]
Abstract
Understanding autoimmunity to endogenous proteins is crucial in diagnosing and treating autoimmune diseases. In this work, we developed a user-friendly AAgAtlas portal (http://biokb.ncpsb.org.cn/aagatlas_portal/index.php#), which can be used to search for 8045 non-redundant autoantigens (AAgs) and 47 post-translationally modified AAgs against 1073 human diseases that are prioritized by a credential score developed by multisource evidence. Using AAgAtlas, the immunogenic properties of human AAgs was systematically elucidated according to their genetic, biophysical, cytological, expression profile, and evolutionary characteristics. The results indicated that human AAgs are evolutionally conserved in protein sequence and enriched in three hydrophilic and polar amino acid residues (K, D, and E) that are located at the protein surface. AAgs are enriched in proteins that are involved in nucleic acid binding, transferase, and the cytoskeleton. Genome, transcriptome, and proteome analyses further indicated that AAb production is associated with gene variance and abnormal protein expression related to the pathological activities of different tumors. Collectively, our data outlines the hallmarks of human AAgs that facilitate the understanding of humoral autoimmunity and the identification of biomarkers of human diseases.
Collapse
Affiliation(s)
- Dan Wang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences-Beijing (PHOENIX Center), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Dong Yang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences-Beijing (PHOENIX Center), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Liuhui Yang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences-Beijing (PHOENIX Center), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Lihong Diao
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences-Beijing (PHOENIX Center), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Yuqi Zhang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences-Beijing (PHOENIX Center), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Yang Li
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences-Beijing (PHOENIX Center), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Hongye Wang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences-Beijing (PHOENIX Center), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Jing Ren
- Department of Clinical Laboratory, Peking Union Medical College Hospital, Peking Union Medical College and Chinese Academy of Medical Sciences, Beijing 100730, China
| | - Linlin Cheng
- Department of Clinical Laboratory, Peking Union Medical College Hospital, Peking Union Medical College and Chinese Academy of Medical Sciences, Beijing 100730, China
| | - Qiaoyun Tan
- Cancer Center, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan 430022, China
| | - Ran Zhang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences-Beijing (PHOENIX Center), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Xiaohong Han
- Clinical Pharmacology Research Center, Peking Union Medical College Hospital, State Key Laboratory of Complex Severe and Rare Diseases, NMPA Key Laboratory for Clinical Research and Evaluation of Drug, Beijing Key Laboratory of Clinical PK & PD Investigation for Innovative Drugs, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing 100730, China
| | - Xiaohan Zhang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences-Beijing (PHOENIX Center), Beijing Institute of Lifeomics, Beijing 102206, China
- College of Medicine and Integrated Medicine, Nanjing University of Chinese Medicine, Nanjing 210023, China
| | - Bingwei Wang
- College of Medicine and Integrated Medicine, Nanjing University of Chinese Medicine, Nanjing 210023, China
| | - Dong Li
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences-Beijing (PHOENIX Center), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Meng Chen
- National Cancer Center/National Clinical Research Center for Cancer/Cancer Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing 100021, China
| | - Henning Hermjakob
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Yongzhe Li
- Department of Clinical Laboratory, Peking Union Medical College Hospital, Peking Union Medical College and Chinese Academy of Medical Sciences, Beijing 100730, China
| | - Joshua LaBaer
- The Virginia G. Piper Center for Personalized Diagnostics, Biodesign Institute, Arizona State University, Tempe, Arizona 85287, United States
| | - Zhou Zhou
- Department of Laboratory Medicine, National Center for Cardiovascular Diseases and Fuwai Hospital, Peking Union Medical College and Chinese Academy of Medical Sciences, Beijing 100037, China
| | - Xiaobo Yu
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences-Beijing (PHOENIX Center), Beijing Institute of Lifeomics, Beijing 102206, China
| |
Collapse
|
10
|
Hao J, Liang Y, Ping J, Li J, Shi W, Su Y, Wang T. Chloroplast gene expression level is negatively correlated with evolutionary rates and selective pressure while positively with codon usage bias in Ophioglossum vulgatum L. BMC PLANT BIOLOGY 2022; 22:580. [PMID: 36510137 PMCID: PMC9746204 DOI: 10.1186/s12870-022-03960-8] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/06/2022] [Accepted: 11/24/2022] [Indexed: 05/02/2023]
Abstract
BACKGROUND Characterization of the key factors determining gene expression level has been of significant interest. Previous studies on the relationship among evolutionary rates, codon usage bias, and expression level mostly focused on either nuclear genes or unicellular/multicellular organisms but few in chloroplast (cp) genes. Ophioglossum vulgatum is a unique fern and has important scientific and medicinal values. In this study, we sequenced its cp genome and transcriptome to estimate the evolutionary rates (dN and dS), selective pressure (dN/dS), gene expression level, codon usage bias, and their correlations. RESULTS The correlation coefficients between dN, dS, and dN/dS, and Transcripts Per Million (TPM) average values were -0.278 (P = 0.027 < 0.05), -0.331 (P = 0.008 < 0.05), and -0.311 (P = 0.013 < 0.05), respectively. The codon adaptation index (CAI) and tRNA adaptation index (tAI) were significantly positively correlated with TPM average values (P < 0.05). CONCLUSIONS Our results indicated that when the gene expression level was higher, the evolutionary rates and selective pressure were lower, but the codon usage bias was stronger. We provided evidence from cp gene data which supported the E-R (E stands for gene expression level and R stands for evolutionary rate) anti-correlation.
Collapse
Affiliation(s)
- Jing Hao
- College of Life Sciences, South China Agricultural University, Guangzhou, 510642, China
| | - Yingyi Liang
- College of Life Sciences, South China Agricultural University, Guangzhou, 510642, China
| | - Jingyao Ping
- College of Life Sciences, South China Agricultural University, Guangzhou, 510642, China
| | - Jinye Li
- College of Life Sciences, South China Agricultural University, Guangzhou, 510642, China
| | - Wanxin Shi
- College of Life Sciences, South China Agricultural University, Guangzhou, 510642, China
| | - Yingjuan Su
- School of Life Sciences, Sun Yat-sen University, Guangzhou, 510275, China.
- Research Institute of Sun Yat-sen University in Shenzhen, Shenzhen, 518057, China.
| | - Ting Wang
- College of Life Sciences, South China Agricultural University, Guangzhou, 510642, China.
| |
Collapse
|
11
|
Bédard C, Cisneros AF, Jordan D, Landry CR. Correlation between protein abundance and sequence conservation: what do recent experiments say? Curr Opin Genet Dev 2022; 77:101984. [PMID: 36162152 DOI: 10.1016/j.gde.2022.101984] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2022] [Revised: 08/23/2022] [Accepted: 08/26/2022] [Indexed: 01/27/2023]
Abstract
Cells evolve in a space of parameter values set by physical and chemical forces. These constraints create associations among cellular properties. A particularly strong association is the negative correlation between the rate of evolution of proteins and their abundance in the cell. Highly expressed proteins evolve slower than lowly expressed ones. Multiple hypotheses have been put forward to explain this relationship, including, for instance, the requirement for higher mRNA stability, misfolding avoidance, and misinteraction avoidance for highly expressed proteins. Here, we review some of these hypotheses, their predictions, and how they are supported to finally discuss recent experiments that have been performed to test these predictions.
Collapse
Affiliation(s)
- Camille Bédard
- Département de Biologie, Faculté des Sciences et de Génie, Université Laval, G1V 0A6, Canada; Institut de Biologie Intégrative et des Systèmes, Université Laval, G1V 0A6, Canada; PROTEO, Le regroupement québécois de recherche sur la fonction, l'ingénierie et les applications des protéines, Université Laval, G1V 0A6, Canada; Centre de Recherche sur les Données Massives, Université Laval, G1V 0A6, Canada. https://twitter.com/@CamilleBed17
| | - Angel F Cisneros
- Institut de Biologie Intégrative et des Systèmes, Université Laval, G1V 0A6, Canada; PROTEO, Le regroupement québécois de recherche sur la fonction, l'ingénierie et les applications des protéines, Université Laval, G1V 0A6, Canada; Centre de Recherche sur les Données Massives, Université Laval, G1V 0A6, Canada; Département de Biochimie, de Microbiologie et de Bio-informatique, Faculté des Sciences et de Génie, Université Laval, G1V 0A6, Canada. https://twitter.com/@AngelFCC119
| | - David Jordan
- Institut de Biologie Intégrative et des Systèmes, Université Laval, G1V 0A6, Canada; PROTEO, Le regroupement québécois de recherche sur la fonction, l'ingénierie et les applications des protéines, Université Laval, G1V 0A6, Canada; Centre de Recherche sur les Données Massives, Université Laval, G1V 0A6, Canada; Département de Biochimie, de Microbiologie et de Bio-informatique, Faculté des Sciences et de Génie, Université Laval, G1V 0A6, Canada. https://twitter.com/@DavidJordan1997
| | - Christian R Landry
- Département de Biologie, Faculté des Sciences et de Génie, Université Laval, G1V 0A6, Canada; Institut de Biologie Intégrative et des Systèmes, Université Laval, G1V 0A6, Canada; PROTEO, Le regroupement québécois de recherche sur la fonction, l'ingénierie et les applications des protéines, Université Laval, G1V 0A6, Canada; Centre de Recherche sur les Données Massives, Université Laval, G1V 0A6, Canada; Département de Biochimie, de Microbiologie et de Bio-informatique, Faculté des Sciences et de Génie, Université Laval, G1V 0A6, Canada.
| |
Collapse
|
12
|
Shibai A, Kotani H, Sakata N, Furusawa C, Tsuru S. Purifying selection enduringly acts on the sequence evolution of highly expressed proteins in Escherichia coli. G3 GENES|GENOMES|GENETICS 2022; 12:6694045. [PMID: 36073932 PMCID: PMC9635659 DOI: 10.1093/g3journal/jkac235] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/04/2022] [Accepted: 08/27/2022] [Indexed: 11/17/2022]
Abstract
The evolutionary speed of a protein sequence is constrained by its expression level, with highly expressed proteins evolving relatively slowly. This negative correlation between expression levels and evolutionary rates (known as the E–R anticorrelation) has already been widely observed in past macroevolution between species from bacteria to animals. However, it remains unclear whether this seemingly general law also governs recent evolution, including past and de novo, within a species. However, the advent of genomic sequencing and high-throughput phenotyping, particularly for bacteria, has revealed fundamental gaps between the 2 evolutionary processes and has provided empirical data opposing the possible underlying mechanisms which are widely believed. These conflicts raise questions about the generalization of the E–R anticorrelation and the relevance of plausible mechanisms. To explore the ubiquitous impact of expression levels on molecular evolution and test the relevance of the possible underlying mechanisms, we analyzed the genome sequences of 99 strains of Escherichia coli for evolution within species in nature. We also analyzed genomic mutations accumulated under laboratory conditions as a model of de novo evolution within species. Here, we show that E–R anticorrelation is significant in both past and de novo evolution within species in E. coli. Our data also confirmed ongoing purifying selection on highly expressed genes. Ongoing selection included codon-level purifying selection, supporting the relevance of the underlying mechanisms. However, the impact of codon-level purifying selection on the constraints in evolution within species might be smaller than previously expected from evolution between species.
Collapse
Affiliation(s)
- Atsushi Shibai
- Center for Biosystems Dynamics Research (BDR), RIKEN , Osaka 565-0874, Japan
| | - Hazuki Kotani
- Center for Biosystems Dynamics Research (BDR), RIKEN , Osaka 565-0874, Japan
| | - Natsue Sakata
- Center for Biosystems Dynamics Research (BDR), RIKEN , Osaka 565-0874, Japan
| | - Chikara Furusawa
- Center for Biosystems Dynamics Research (BDR), RIKEN , Osaka 565-0874, Japan
- Universal Biology Institute, School of Science, The University of Tokyo , Tokyo 113-0033, Japan
| | - Saburo Tsuru
- Universal Biology Institute, School of Science, The University of Tokyo , Tokyo 113-0033, Japan
| |
Collapse
|
13
|
Ramstein GP, Buckler ES. Prediction of evolutionary constraint by genomic annotations improves functional prioritization of genomic variants in maize. Genome Biol 2022; 23:183. [PMID: 36050782 PMCID: PMC9438327 DOI: 10.1186/s13059-022-02747-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2022] [Accepted: 08/15/2022] [Indexed: 11/10/2022] Open
Abstract
Background Crop improvement through cross-population genomic prediction and genome editing requires identification of causal variants at high resolution, within fewer than hundreds of base pairs. Most genetic mapping studies have generally lacked such resolution. In contrast, evolutionary approaches can detect genetic effects at high resolution, but they are limited by shifting selection, missing data, and low depth of multiple-sequence alignments. Here we use genomic annotations to accurately predict nucleotide conservation across angiosperms, as a proxy for fitness effect of mutations. Results Using only sequence analysis, we annotate nonsynonymous mutations in 25,824 maize gene models, with information from bioinformatics and deep learning. Our predictions are validated by experimental information: within-species conservation, chromatin accessibility, and gene expression. According to gene ontology and pathway enrichment analyses, predicted nucleotide conservation points to genes in central carbon metabolism. Importantly, it improves genomic prediction for fitness-related traits such as grain yield, in elite maize panels, by stringent prioritization of fewer than 1% of single-site variants. Conclusions Our results suggest that predicting nucleotide conservation across angiosperms may effectively prioritize sites most likely to impact fitness-related traits in crops, without being limited by shifting selection, missing data, and low depth of multiple-sequence alignments. Our approach—Prediction of mutation Impact by Calibrated Nucleotide Conservation (PICNC)—could be useful to select polymorphisms for accurate genomic prediction, and candidate mutations for efficient base editing. The trained PICNC models and predicted nucleotide conservation at protein-coding SNPs in maize are publicly available in CyVerse (10.25739/hybz-2957). Supplementary Information The online version contains supplementary material available at 10.1186/s13059-022-02747-2.
Collapse
Affiliation(s)
- Guillaume P Ramstein
- Center for Quantitative Genetics and Genomics, Aarhus University, 8000, Aarhus, Denmark. .,Institute for Genomic Diversity, Cornell University, Ithaca, NY, 14853, USA.
| | - Edward S Buckler
- Institute for Genomic Diversity, Cornell University, Ithaca, NY, 14853, USA.,USDA-ARS, Ithaca, NY, 14853, USA
| |
Collapse
|
14
|
Abstract
A study of the plant Arabidopsis thaliana detected lower mutation rates in genomic regions where mutations are more likely to be deleterious, challenging the principle that mutagenesis is blind to its consequence. To examine the generality of this finding, we analyze large mutational data from baker's yeast and humans. The yeast data do not exhibit this trend, whereas the human data show an opposite trend that disappears upon the control of potential confounders. We find that the Arabidopsis study identified substantially more mutations than reported in the original data-generating studies and expected from Arabidopsis' mutation rate. These extra mutations are enriched in polynucleotide tracts and have relatively low sequencing qualities so are likely sequencing errors. Furthermore, the polynucleotide “mutations” can produce the purported mutational trend in Arabidopsis. Together, our results do not support lower mutagenesis of genomic regions of stronger selective constraints in the plant, fungal, and animal models examined.
Collapse
Affiliation(s)
- Haoxuan Liu
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, Michigan 48109, USA.,Evolutionary and Organismal Biology Research Center, School of Medicine, Zhejiang University, Hangzhou, 310000, China
| | - Jianzhi Zhang
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, Michigan 48109, USA
| |
Collapse
|
15
|
Sun M, Zhang J. Preferred synonymous codons are translated more accurately: Proteomic evidence, among-species variation, and mechanistic basis. SCIENCE ADVANCES 2022; 8:eabl9812. [PMID: 35857447 PMCID: PMC9258949 DOI: 10.1126/sciadv.abl9812] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/17/2021] [Accepted: 05/20/2022] [Indexed: 06/15/2023]
Abstract
A commonly stated cause of unequal uses of synonymous codons is their differential translational accuracies. A key prediction of this long-standing translational accuracy hypothesis (TAH) of codon usage bias is higher translational accuracies of more frequently used synonymous codons, which, however, has had no direct evidence beyond case studies. Analyzing proteomic data from Escherichia coli, we present direct, global evidence for this prediction. The experimentally measured codon-specific translational accuracies validate a sequence-based proxy; this proxy provides support for the TAH from the vast majority of over 1000 taxa surveyed in all domains of life. We find that the relative translational accuracies of synonymous codons vary substantially among taxa and are strongly correlated with the amounts of cognate transfer RNAs (tRNAs) relative to those of near-cognate tRNAs. These and other observations suggest a model in which selections for translational efficiency and accuracy drive codon usage bias and its coevolution with the tRNA pool.
Collapse
|
16
|
Synonymous mutations in representative yeast genes are mostly strongly non-neutral. Nature 2022; 606:725-731. [PMID: 35676473 PMCID: PMC9650438 DOI: 10.1038/s41586-022-04823-w] [Citation(s) in RCA: 136] [Impact Index Per Article: 68.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2021] [Accepted: 04/28/2022] [Indexed: 01/12/2023]
Abstract
Synonymous mutations in protein-coding genes do not alter protein sequences and are thus generally presumed to be neutral or nearly neutral1-5. Here, to experimentally verify this presumption, we constructed 8,341 yeast mutants each carrying a synonymous, nonsynonymous or nonsense mutation in one of 21 endogenous genes with diverse functions and expression levels and measured their fitness relative to the wild type in a rich medium. Three-quarters of synonymous mutations resulted in a significant reduction in fitness, and the distribution of fitness effects was overall similar-albeit nonidentical-between synonymous and nonsynonymous mutations. Both synonymous and nonsynonymous mutations frequently disturbed the level of mRNA expression of the mutated gene, and the extent of the disturbance partially predicted the fitness effect. Investigations in additional environments revealed greater across-environment fitness variations for nonsynonymous mutants than for synonymous mutants despite their similar fitness distributions in each environment, suggesting that a smaller proportion of nonsynonymous mutants than synonymous mutants are always non-deleterious in a changing environment to permit fixation, potentially explaining the common observation of substantially lower nonsynonymous than synonymous substitution rates. The strong non-neutrality of most synonymous mutations, if it holds true for other genes and in other organisms, would require re-examination of numerous biological conclusions about mutation, selection, effective population size, divergence time and disease mechanisms that rely on the assumption that synoymous mutations are neutral.
Collapse
|
17
|
Sen P, Aziz R, Deka RC, Feil EJ, Ray SK, Satapathy SS. Stem Region of tRNA Genes Favors Transition Substitution Towards Keto Bases in Bacteria. J Mol Evol 2022; 90:114-123. [PMID: 35084523 DOI: 10.1007/s00239-021-10045-x] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2021] [Accepted: 12/21/2021] [Indexed: 11/25/2022]
Abstract
Transversion and transition mutations have variable effects on the stability of RNA secondary structure considering that the former destabilizes the double helix geometry to a greater extent by introducing purine:purine (R:R) or pyrimidine:pyrimidine (Y:Y) base pairs. Therefore, transversion frequency is likely to be lower than that of transition in the secondary structure regions of RNA genes. Here, we performed an analysis of transition and transversion frequencies in tRNA genes defined well with secondary structure and compared with the intergenic regions in five bacterial species namely Escherichia coli, Klebsiella pneumoniae, Salmonella enterica, Staphylococcus aureus and Streptococcus pneumoniae using a large genome sequence data set. In general, the transversion frequency was observed to be lower than that of transition in both tRNA genes and intergenic regions. The transition to transversion ratio was observed to be greater in tRNA genes than that in the intergenic regions in all the five bacteria that we studied. Interestingly, the intraspecies base substitution analysis in tRNA genes revealed that non-compensatory substitutions were more frequent than compensatory substitutions in the stem region. Further, transition to transversion ratio in the loop region was observed to be significantly lesser than that among the non-compensatory substitutions in the stem region. This indicated that the transversion is more deleterious than transition in the stem regions. In addition, substitutions from amino bases (A/C) to keto bases (G/T) were also observed to be more than the reverse substitutions in the stem region. Substitution from amino bases to keto bases are likely to facilitate the stable G:U pairing unlike the reverse substitution that facilitates the unstable A:C pairing in the stem region of tRNA. This work provides additional support that the secondary structure of tRNA molecule is what drives the different substitutions in its gene sequence.
Collapse
Affiliation(s)
- Piyali Sen
- Department of Computer Science and Engineering, Tezpur University, Napaam, Tezpur, Assam, 784028, India
| | - Ruksana Aziz
- Molecular Biology and Biotechnology, Tezpur University, Napaam, Tezpur, Assam, 784028, India
| | - Ramesh C Deka
- Chemical Sciences, Tezpur University, Napaam, Tezpur, Assam, 784028, India
- Center for Multidisciplinary Research, Tezpur University, Napaam, Tezpur, Assam, 784028, India
| | - Edward J Feil
- Department of Biology and Biochemistry, The Milner Centre for Evolution, University of Bath, Bath, BA2 7AY, UK.
| | - Suvendra Kumar Ray
- Molecular Biology and Biotechnology, Tezpur University, Napaam, Tezpur, Assam, 784028, India.
- Center for Multidisciplinary Research, Tezpur University, Napaam, Tezpur, Assam, 784028, India.
| | - Siddhartha Sankar Satapathy
- Department of Computer Science and Engineering, Tezpur University, Napaam, Tezpur, Assam, 784028, India.
- Center for Multidisciplinary Research, Tezpur University, Napaam, Tezpur, Assam, 784028, India.
| |
Collapse
|
18
|
Expression level is a major modifier of the fitness landscape of a protein coding gene. Nat Ecol Evol 2021; 6:103-115. [PMID: 34795386 DOI: 10.1038/s41559-021-01578-x] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2021] [Accepted: 10/01/2021] [Indexed: 11/09/2022]
Abstract
The phenotypic consequence of a genetic mutation depends on many factors including the expression level of a gene. However, a comprehensive quantification of this expression effect is still lacking, as is a further general mechanistic understanding of the effect. Here, we measured the fitness effect of almost all (>97.5%) single-nucleotide mutations in GFP, an exogenous gene with no physiological function, and URA3, a conditionally essential gene. Both genes were driven by two promoters whose expression levels differed by around tenfold. The resulting fitness landscapes revealed that the fitness effects of at least 42% of all single-nucleotide mutations within the genes were expression dependent. Although only a small fraction of variation in fitness effects among different mutations can be explained by biophysical properties of the protein and messenger RNA of the gene, our analyses revealed that the avoidance of stochastic molecular errors generally underlies the expression dependency of mutational effects and suggested protein misfolding as the most important type of molecular error among those examined. Our results therefore directly explained the slower evolution of highly expressed genes and highlighted cytotoxicity due to stochastic molecular errors as a non-negligible component for understanding the phenotypic consequence of mutations.
Collapse
|
19
|
Sarkar C, Alvarez-Ponce D. Extracellular domains of transmembrane proteins defy the expression level-evolutionary rate anticorrelation. Genome Biol Evol 2021; 14:6402012. [PMID: 34665250 PMCID: PMC8755491 DOI: 10.1093/gbe/evab235] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/14/2021] [Indexed: 11/13/2022] Open
Abstract
Highly expressed proteins tend to evolve slowly, a trend known as the expression level-rate of evolution (E-R) anticorrelation. Whereas the reasons for this anticorrelation remain unclear, the most influential hypotheses attribute it to highly expressed proteins being subjected to strong selective pressures to avoid misfolding and/or misinteraction. In accordance with these hypotheses, work in our laboratory has recently shown that extracellular (secreted) proteins lack an E-R anticorrelation (or exhibit a weaker than usual E-R anticorrelation). Extracellular proteins are folded inside the endoplasmic reticulum, where enhanced quality control of folding mechanisms exist, and function in the extracellular space, where misinteraction is unlikely to occur or to produce deleterious effects. Transmembrane proteins contain both intracellular domains (which are folded and function in the cytosol) and extracellular domains (which complete their folding in the endoplasmic reticulum and function in the extracellular space). We thus hypothesized that the extracellular domains of transmembrane proteins should exhibit a weaker E-R anticorrelation than their intracellular domains. Our analyses of human, Saccharomyces and Arabidopsis transmembrane proteins allowed us to confirm our hypothesis. Our results are in agreement with models attributing the E-R anticorrelation to the deleterious effects of misfolding and/or misinteraction.
Collapse
Affiliation(s)
- Chandra Sarkar
- Department of Biology, University of Nevada, Reno, NV, USA
| | | |
Collapse
|
20
|
Sirovy KA, Johnson KM, Casas SM, La Peyre JF, Kelly MW. Lack of genotype-by-environment interaction suggests limited potential for evolutionary changes in plasticity in the eastern oyster, Crassostrea virginica. Mol Ecol 2021; 30:5721-5734. [PMID: 34462983 DOI: 10.1111/mec.16156] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2020] [Revised: 08/18/2021] [Accepted: 08/23/2021] [Indexed: 12/13/2022]
Abstract
Eastern oysters in the northern Gulf of Mexico are facing rapid environmental changes and can respond to this change via plasticity or evolution. Plasticity can act as an immediate buffer against environmental change, but this buffering could impact the organism's ability to evolve in subsequent generations. While plasticity and evolution are not mutually exclusive, the relative contribution and interaction between them remains unclear. In this study, we investigate the roles of plastic and evolved responses to environmental variation and Perkinsus marinus infection in Crassostrea virginica by using a common garden experiment with 80 oysters from six families outplanted at two field sites naturally differing in salinity. We use growth data, P. marinus infection intensities, 3' RNA sequencing (TagSeq) and low-coverage whole-genome sequencing to identify the effect of genotype, environment and genotype-by-environment interaction on the oyster's response to site. As one of first studies to characterize the joint effects of genotype and environment on transcriptomic and morphological profiles in a natural setting, we demonstrate that C. virginica has a highly plastic response to environment and that this response is parallel among genotypes. We also find that genes responding to genotype have distinct and opposing profiles compared to genes responding to environment with regard to expression levels, Ka/Ks ratios and nucleotide diversity. Our findings suggest that C. virginica may be able to buffer the immediate impacts of future environmental changes by altering gene expression and physiology, but the lack of genetic variation in plasticity suggests limited capacity for evolved responses.
Collapse
Affiliation(s)
- Kyle A Sirovy
- Department of Biological Sciences, Louisiana State University, Baton Rouge, Louisiana, USA
| | - Kevin M Johnson
- Department of Biological Sciences, Louisiana State University, Baton Rouge, Louisiana, USA
| | - Sandra M Casas
- School of Animal Sciences, Louisiana State University Agricultural Center, Baton Rouge, Louisiana, USA
| | - Jerome F La Peyre
- School of Animal Sciences, Louisiana State University Agricultural Center, Baton Rouge, Louisiana, USA
| | - Morgan W Kelly
- Department of Biological Sciences, Louisiana State University, Baton Rouge, Louisiana, USA
| |
Collapse
|
21
|
Variables Influencing Differences in Sequence Conservation in the Fission Yeast Schizosaccharomyces pombe. J Mol Evol 2021; 89:601-610. [PMID: 34436628 PMCID: PMC8599406 DOI: 10.1007/s00239-021-10028-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2021] [Accepted: 08/17/2021] [Indexed: 11/17/2022]
Abstract
Which variables determine the constraints on gene sequence evolution is one of the most central questions in molecular evolution. In the fission yeast Schizosaccharomyces pombe, an important model organism, the variables influencing the rate of sequence evolution have yet to be determined. Previous studies in other single celled organisms have generally found gene expression levels to be most significant, with numerous other variables such as gene length and functional importance identified as having a smaller impact. Using publicly available data, we used partial least squares regression, principal components regression, and partial correlations to determine the variables most strongly associated with sequence evolution constraints. We identify centrality in the protein–protein interactions network, amino acid composition, and cellular location as the most important determinants of sequence conservation. However, each factor only explains a small amount of variance, and there are numerous variables having a significant or heterogeneous influence. Our models explain more than half of the variance in dN, raising the possibility that future refined models could quantify the role of stochastics in evolutionary rate variation.
Collapse
|
22
|
Coding variants in the PCNT and CEP295 genes contribute to breast cancer risk in Chinese women. Pathol Res Pract 2021; 225:153581. [PMID: 34418690 DOI: 10.1016/j.prp.2021.153581] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/19/2021] [Revised: 08/02/2021] [Accepted: 08/02/2021] [Indexed: 11/20/2022]
Abstract
BACKGROUND Centrioles play pivotal roles in the assembly of centrosomes, their dysfunction is associated with multiple inherited diseases or cancers. To date, few studies have focused on the associations between coding single nucleotide polymorphisms (SNPs) in the centriole duplication cycle genes and the risk of breast cancer in Chinese women. METHODS Twenty-one SNPs were selected from the coding regions of 10 critical centriole genes. The associations between the selected SNPs and breast cancer susceptibility were assessed in a case-control study of Chinese women, which included 1032 cases and 1063 controls. Potential biological functions in the influence of protein stability and the profile of expression quantitative trait loci (eQTL) of the identified SNPs were further evaluated using in silico databases. RESULTS Multivariate logistic regression analyses revealed that a missense SNP rs7279204 in PCNT was significantly associated with an increased risk of breast cancer (additive model: adjusted OR=1.19, 95% CI: 1.02-1.38), while a missense SNP rs77922978 in CEP295 was significantly associated with a decreased risk of breast cancer (additive model: adjusted OR=0.74, 95% CI: 0.56-0.97). Stratification analyses suggested that rs7279204 and rs77922978 exhibited different effects among later first live birth, ER-negative and PR-negative women (P<0.05). Moreover, rs77922978 showed significant differences for ER and PR status strata (heterogeneity test P=0.028, P=0.046). In addition, bioinformatic analyses indicated that the two variants may possess potential functions of reducing the protein stability of their host genes. Further eQTL analysis showed that the rs7279204 was not only correlated with the expression of its host gene PCNT, but also correlated with the expression of its nearby genes, implying its potential roles in regulation of some cancer susceptibility genes. CONCLUSIONS The SNPs rs7279204 and rs77922978 within the coding region of the PCNT and CEP295 genes may contribute to the susceptibility of breast cancer in Han Chinese population.
Collapse
|
23
|
Zou Z, Zhang J. Are Nonsynonymous Transversions Generally More Deleterious than Nonsynonymous Transitions? Mol Biol Evol 2021; 38:181-191. [PMID: 32805043 PMCID: PMC7783172 DOI: 10.1093/molbev/msaa200] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
It has been suggested that, due to the structure of the genetic code, nonsynonymous transitions are less likely than transversions to cause radical changes in amino acid physicochemical properties so are on average less deleterious. This view was supported by some but not all mutagenesis experiments. Because laboratory measures of fitness effects have limited sensitivities and relative frequencies of different mutations in mutagenesis studies may not match those in nature, we here revisit this issue using comparative genomics. We extend the standard codon model of sequence evolution by adding the parameter η that quantifies the ratio of the fixation probability of transitional nonsynonymous mutations to that of transversional nonsynonymous mutations. We then estimate η from the concatenated alignment of all protein-coding DNA sequences of two closely related genomes. Surprisingly, η ranges from 0.13 to 2.0 across 90 species pairs sampled from the tree of life, with 51 incidences of η < 1 and 30 incidences of η >1 that are statistically significant. Hence, whether nonsynonymous transversions are overall more deleterious than nonsynonymous transitions is species-dependent. Because the corresponding groups of amino acid replacements differ between nonsynonymous transitions and transversions, η is influenced by the relative exchangeabilities of amino acid pairs. Indeed, an extensive search reveals that the large variation in η is primarily explainable by the recently reported among-species disparity in amino acid exchangeabilities. These findings demonstrate that genome-wide nucleotide substitution patterns in coding sequences have species-specific features and are more variable among evolutionary lineages than are currently thought.
Collapse
Affiliation(s)
- Zhengting Zou
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI
| | - Jianzhi Zhang
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI
- Corresponding author: E-mail: .Associate editor: Jeffrey Townsend
| |
Collapse
|
24
|
Razban RM, Dasmeh P, Serohijos AWR, Shakhnovich EI. Avoidance of protein unfolding constrains protein stability in long-term evolution. Biophys J 2021; 120:2413-2424. [PMID: 33932438 PMCID: PMC8390877 DOI: 10.1016/j.bpj.2021.03.042] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2020] [Revised: 02/24/2021] [Accepted: 03/17/2021] [Indexed: 11/28/2022] Open
Abstract
Every amino acid residue can influence a protein's overall stability, making stability highly susceptible to change throughout evolution. We consider the distribution of protein stabilities evolutionarily permittable under two previously reported protein fitness functions: flux dynamics and misfolding avoidance. We develop an evolutionary dynamics theory and find that it agrees better with an extensive protein stability data set for dihydrofolate reductase orthologs under the misfolding avoidance fitness function rather than the flux dynamics fitness function. Further investigation with ribonuclease H data demonstrates that not any misfolded state is avoided; rather, it is only the unfolded state. At the end, we discuss how our work pertains to the universal protein abundance-evolutionary rate correlation seen across organisms' proteomes. We derive a closed-form expression relating protein abundance to evolutionary rate that captures Escherichia coli, Saccharomyces cerevisiae, and Homo sapiens experimental trends without fitted parameters.
Collapse
Affiliation(s)
- Rostam M Razban
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, Massachusetts
| | - Pouria Dasmeh
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, Massachusetts; Departement de Biochimie, Université de Montréal, Montreal, Quebec, Canada
| | | | - Eugene I Shakhnovich
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, Massachusetts.
| |
Collapse
|
25
|
Maddamsetti R. Universal Constraints on Protein Evolution in the Long-Term Evolution Experiment with Escherichia coli. Genome Biol Evol 2021; 13:evab070. [PMID: 33856016 PMCID: PMC8233687 DOI: 10.1093/gbe/evab070] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/30/2021] [Indexed: 12/18/2022] Open
Abstract
Although it is well known that abundant proteins evolve slowly across the tree of life, there is little consensus for why this is true. Here, I report that abundant proteins evolve slowly in the hypermutator populations of Lenski's long-term evolution experiment with Escherichia coli (LTEE). Specifically, the density of all observed mutations per gene, as measured in metagenomic time series covering 60,000 generations of the LTEE, significantly anticorrelates with mRNA abundance, protein abundance, and degree of protein-protein interaction. The same pattern holds for nonsynonymous mutation density. However, synonymous mutation density, measured across the LTEE hypermutator populations, positively correlates with protein abundance. These results show that universal constraints on protein evolution are visible in data spanning three decades of experimental evolution. Therefore, it should be possible to design experiments to answer why abundant proteins evolve slowly.
Collapse
Affiliation(s)
- Rohan Maddamsetti
- Department of Biomedical Engineering, Duke University, Durham, North Carolina, USA
| |
Collapse
|
26
|
Dubreuil B, Levy ED. Abundance Imparts Evolutionary Constraints of Similar Magnitude on the Buried, Surface, and Disordered Regions of Proteins. Front Mol Biosci 2021; 8:626729. [PMID: 33996892 PMCID: PMC8119896 DOI: 10.3389/fmolb.2021.626729] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2020] [Accepted: 03/29/2021] [Indexed: 12/02/2022] Open
Abstract
An understanding of the forces shaping protein conservation is key, both for the fundamental knowledge it represents and to allow for optimal use of evolutionary information in practical applications. Sequence conservation is typically examined at one of two levels. The first is a residue-level, where intra-protein differences are analyzed and the second is a protein-level, where inter-protein differences are studied. At a residue level, we know that solvent-accessibility is a prime determinant of conservation. By inverting this logic, we inferred that disordered regions are slightly more solvent-accessible on average than the most exposed surface residues in domains. By integrating abundance information with evolutionary data within and across proteins, we confirmed a previously reported strong surface-core association in the evolution of structured regions, but we found a comparatively weak association between disordered and structured regions. The facts that disordered and structured regions experience different structural constraints and evolve independently provide a unique setup to examine an outstanding question: why is a protein’s abundance the main determinant of its sequence conservation? Indeed, any structural or biophysical property linked to the abundance-conservation relationship should increase the relative conservation of regions concerned with that property (e.g., disordered residues with mis-interactions, domain residues with misfolding). Surprisingly, however, we found the conservation of disordered and structured regions to increase in equal proportion with abundance. This observation implies that either abundance-related constraints are structure-independent, or multiple constraints apply to different regions and perfectly balance each other.
Collapse
Affiliation(s)
- Benjamin Dubreuil
- Department of Structural Biology, Weizmann Institute of Science, Rehovot, Israel
| | - Emmanuel D Levy
- Department of Structural Biology, Weizmann Institute of Science, Rehovot, Israel
| |
Collapse
|
27
|
Specificity of RNA Folding and Its Association with Evolutionarily Adaptive mRNA Secondary Structures. GENOMICS PROTEOMICS & BIOINFORMATICS 2021; 19:882-900. [PMID: 33607297 PMCID: PMC9403030 DOI: 10.1016/j.gpb.2019.11.013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/17/2018] [Revised: 08/03/2019] [Accepted: 11/08/2019] [Indexed: 11/23/2022]
Abstract
The secondary structure is a fundamental feature of both noncoding and messenger RNAs. However, our understanding of the secondary structure of mRNA, especially that of the coding regions, remains elusive, likely due to translation and the lack of RNA-binding proteins that sustain the consensus structure, such as those that bind to noncoding RNA. Indeed, mRNA has recently been found to adopt diverse alternative structures, the overall functional significance of which remains untested. We hereby approached this problem by estimating the folding specificity, i.e., the probability that a fragment of RNA folds back to the same partner once refolded. We showed that the folding specificity of mRNA is lower than that of noncoding RNA and exhibits moderate evolutionary conservation. Notably, we found that specific rather than alternative folding is likely evolutionarily adaptive since specific folding is frequently associated with functionally important genes or sites within a gene. Additional analysis in combination with ribosome density suggests the ability to modulate ribosome movement as one potential functional advantage provided by specific folding. Our findings revealed a novel facet of the RNA structurome with important functional and evolutionary implications and indicated a potential method for distinguishing the mRNA secondary structures maintained by natural selection from molecular noise.
Collapse
|
28
|
Burgarella C, Berger A, Glémin S, David J, Terrier N, Deu M, Pot D. The Road to Sorghum Domestication: Evidence From Nucleotide Diversity and Gene Expression Patterns. FRONTIERS IN PLANT SCIENCE 2021; 12:666075. [PMID: 34527004 PMCID: PMC8435843 DOI: 10.3389/fpls.2021.666075] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/09/2021] [Accepted: 07/20/2021] [Indexed: 05/17/2023]
Abstract
Native African cereals (sorghum, millets) ensure food security to millions of low-income people from low fertility and drought-prone regions of Africa and Asia. In spite of their agronomic importance, the genetic bases of their phenotype and adaptations are still not well-understood. Here we focus on Sorghum bicolor, which is the fifth cereal worldwide for grain production and constitutes the staple food for around 500 million people. We leverage transcriptomic resources to address the adaptive consequences of the domestication process. Gene expression and nucleotide variability were analyzed in 11 domesticated and nine wild accessions. We documented a downregulation of expression and a reduction of diversity both in nucleotide polymorphism (30%) and gene expression levels (18%) in domesticated sorghum. These findings at the genome-wide level support the occurrence of a global reduction of diversity during the domestication process, although several genes also showed patterns consistent with the action of selection. Nine hundred and forty-nine genes were significantly differentially expressed between wild and domesticated gene pools. Their functional annotation points to metabolic pathways most likely contributing to the sorghum domestication syndrome, such as photosynthesis and auxin metabolism. Coexpression network analyzes revealed 21 clusters of genes sharing similar expression patterns. Four clusters (totaling 2,449 genes) were significantly enriched in differentially expressed genes between the wild and domesticated pools and two were also enriched in domestication and improvement genes previously identified in sorghum. These findings reinforce the evidence that the combined and intricated effects of the domestication and improvement processes do not only affect the behaviors of a few genes but led to a large rewiring of the transcriptome. Overall, these analyzes pave the way toward the identification of key domestication genes valuable for genetic resources characterization and breeding purposes.
Collapse
Affiliation(s)
- Concetta Burgarella
- CIRAD, UMR AGAP Institut, Montpellier, France
- AGAP Institut, Univ F-34398 Montpellier, CIRAD, INRAE, Institut Agro, Montpellier, France
- *Correspondence: Concetta Burgarella
| | - Angélique Berger
- CIRAD, UMR AGAP Institut, Montpellier, France
- AGAP Institut, Univ F-34398 Montpellier, CIRAD, INRAE, Institut Agro, Montpellier, France
| | - Sylvain Glémin
- CNRS, Univ. Rennes, ECOBIO – UMR 6553, Rennes, France
- Department of Ecology and Evolution, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden
| | - Jacques David
- AGAP Institut, Univ F-34398 Montpellier, CIRAD, INRAE, Institut Agro, Montpellier, France
| | - Nancy Terrier
- AGAP Institut, Univ F-34398 Montpellier, CIRAD, INRAE, Institut Agro, Montpellier, France
| | - Monique Deu
- CIRAD, UMR AGAP Institut, Montpellier, France
- AGAP Institut, Univ F-34398 Montpellier, CIRAD, INRAE, Institut Agro, Montpellier, France
| | - David Pot
- CIRAD, UMR AGAP Institut, Montpellier, France
- AGAP Institut, Univ F-34398 Montpellier, CIRAD, INRAE, Institut Agro, Montpellier, France
- David Pot
| |
Collapse
|
29
|
Alvarez-Ponce D. Richard Dickerson, Molecular Clocks, and Rates of Protein Evolution. J Mol Evol 2020; 89:122-126. [PMID: 33205299 DOI: 10.1007/s00239-020-09973-x] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2020] [Accepted: 11/07/2020] [Indexed: 12/29/2022]
Abstract
Proteins approximately behave as molecular clocks, accumulating amino acid replacements at a more or less constant rate. Nonetheless, each protein displays a characteristic rate of evolution: whereas some proteins remain largely unaltered over large periods of time, others can rapidly accumulate amino acid replacements. An article by Richard Dickerson, published in the first issue of the Journal of Molecular Evolution (J Mol Evol 1:26-45, 1971), described the first analysis in which the rates of evolution of many proteins were compared, and the differences were interpreted in the light of their function. When comparing the sequences of fibrinopeptides, hemoglobin, and cytochrome c of different species, he observed a linear relationship between the number of amino acid replacements and divergence time. Remarkably, fibrinopeptides had evolved fast, cytochrome c had evolved slowly, and hemoglobin exhibited an intermediate rate of evolution. As the Journal of Molecular Evolution celebrates its 50th anniversary, I highlight this landmark article and reflect on its impact on the field of Molecular Evolution.
Collapse
Affiliation(s)
- David Alvarez-Ponce
- Department of Biology, University of Nevada, Reno, 1664 N. Virginia Street, Reno, NV, 89557, USA.
| |
Collapse
|
30
|
Victor MP, Acharya D, Chakraborty S, Ghosh TC. The combined influence of codon composition and tRNA copy number regulates translational efficiency by influencing synonymous nucleotide substitution. Gene 2020; 745:144640. [PMID: 32247037 DOI: 10.1016/j.gene.2020.144640] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2020] [Revised: 03/29/2020] [Accepted: 03/31/2020] [Indexed: 01/26/2023]
Abstract
Codon usage bias is an important genomic phenomenon, where highly expressed genes use optimal codons for smoother translation with high yield, facilitated by the cognate tRNAs. Here, we presented the tRNA co-adaptation index (co-AI) by correlating tRNA gene copy number and codon composition in Saccharomyces cerevisiae. We observed that this co-AI is positively correlated with protein abundance and translation rate. Considering nucleotide substitutions, co-AI influences synonymous substitutions more than gene expression and protein abundance, the most important determinants of evolutionary rate. Co-AI correlates positively with mRNA secondary structure stability and mRNA half-life, which may lead to protein accumulation under high co-AI. However, the highly expressed proteins encoded by high co-AI genes are assisted by molecular chaperones to attain their proper functional conformation and prevent accumulation.
Collapse
Affiliation(s)
- Manish P Victor
- Division of Bioinformatics, Bose Institute, Kolkata, West Bengal, India
| | - Debarun Acharya
- Department of Microbiology, Bose Institute, Kolkata, West Bengal, India.
| | | | - Tapash C Ghosh
- Division of Bioinformatics, Bose Institute, Kolkata, West Bengal, India; Department of Microbiology, Raiganj University, Raiganj, West Bengal, India.
| |
Collapse
|
31
|
Singh R, Sophiarani Y. A report on DNA sequence determinants in gene expression. Bioinformation 2020; 16:422-431. [PMID: 32831525 PMCID: PMC7434957 DOI: 10.6026/97320630016422] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2020] [Accepted: 04/24/2020] [Indexed: 11/26/2022] Open
Abstract
The biased usage of nucleotides in coding sequence and its correlation with gene expression has been observed in several studies. A complex set of interactions between genes and other components of the expression system determine the amount of proteins produced from coding sequences. It is known that the elongation rate of polypeptide chain is affected by both codon usage bias and specific amino acid compositional constraints. Therefore, it is of interest to review local DNA-sequence elements and other positional as well as combinatorial constraints that play significant role in gene expression.
Collapse
Affiliation(s)
- Ravail Singh
- Indian Institute of Integrative Medicine, CSIR, Canal Road, Jammu-180001
| | | |
Collapse
|
32
|
Zhou J, Ren H, Hu M, Zhou J, Li B, Kong N, Zhang Q, Jin Y, Liang L, Yue J. Characterization of Burkholderia cepacia Complex Core Genome and the Underlying Recombination and Positive Selection. Front Genet 2020; 11:506. [PMID: 32528528 PMCID: PMC7253759 DOI: 10.3389/fgene.2020.00506] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2019] [Accepted: 04/24/2020] [Indexed: 11/13/2022] Open
Abstract
Recombination and positive selection are two key factors that play a vital role in pathogenic microorganisms’ population adaptation and diversification. The Burkholderia cepacia complex (Bcc) represents bacterial species with high similarity, which can cause severe infections among cases suffering from the chronic granulomatous disorder and cystic fibrosis (CF). At present, no genome-wide study has been carried out focusing on investigating the core genome of Bcc associated with the two evolutionary forces. The general characteristics of the core genome of Bcc species remain scarce as well. In this study, we explored the core orthologous genes of 116 Bcc strains using comparative genomic analysis and studied the two adaptive evolutionary forces: recombination and positive selection. We estimated 1005 orthogroups consisting entirely of single copy genes. These single copy orthologous genes in some Cluster of Orthologous Groups (COG) categories showed significant differences in the comparison of several evolutionary properties, and the encoding proteins were relatively simple and compact. Our findings showed that 5.8% of the core orthologous genes strongly supported recombination; in the meantime, 1.1% supported positive selection. We found that genes involved in protein synthesis as well as material transport and metabolism are favored by selection pressure. More importantly, homologous recombination contributed more genetic variation to a large number of genes and largely maintained the genetic cohesion in Bcc. This high level of recombination between Bcc species blurs their taxonomic boundaries, which leads Bcc species to be difficult or impossible to distinguish phenotypically and genotypically.
Collapse
Affiliation(s)
- Jianglin Zhou
- State Key Laboratory of Pathogen and Biosecurity, Beijing Institute of Biotechnology, Beijing, China
| | - Hongguang Ren
- State Key Laboratory of Pathogen and Biosecurity, Beijing Institute of Biotechnology, Beijing, China
| | - Mingda Hu
- State Key Laboratory of Pathogen and Biosecurity, Beijing Institute of Biotechnology, Beijing, China
| | - Jing Zhou
- State Key Laboratory of Pathogen and Biosecurity, Beijing Institute of Biotechnology, Beijing, China
| | - Beiping Li
- State Key Laboratory of Pathogen and Biosecurity, Beijing Institute of Biotechnology, Beijing, China
| | - Na Kong
- State Key Laboratory of Pathogen and Biosecurity, Beijing Institute of Biotechnology, Beijing, China.,Institutes of Physical Science and Information Technology, Anhui University, Hefei, China
| | - Qi Zhang
- State Key Laboratory of Pathogen and Biosecurity, Beijing Institute of Biotechnology, Beijing, China
| | - Yuan Jin
- State Key Laboratory of Pathogen and Biosecurity, Beijing Institute of Biotechnology, Beijing, China
| | - Long Liang
- State Key Laboratory of Pathogen and Biosecurity, Beijing Institute of Biotechnology, Beijing, China
| | - Junjie Yue
- State Key Laboratory of Pathogen and Biosecurity, Beijing Institute of Biotechnology, Beijing, China
| |
Collapse
|
33
|
Peeri M, Tuller T. High-resolution modeling of the selection on local mRNA folding strength in coding sequences across the tree of life. Genome Biol 2020; 21:63. [PMID: 32151272 PMCID: PMC7063772 DOI: 10.1186/s13059-020-01971-y] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2019] [Accepted: 02/22/2020] [Indexed: 02/06/2023] Open
Abstract
BACKGROUND mRNA can form local secondary structure within the protein-coding sequence, and the strength of this structure is thought to influence gene expression regulation. Previous studies suggest that secondary structure strength may be maintained under selection, but the details of this phenomenon are not well understood. RESULTS We perform a comprehensive study of the selection on local mRNA folding strengths considering variation between species across the tree of life. We show for the first time that local folding strength selection tends to follow a conserved characteristic profile in most phyla, with selection for weak folding at the two ends of the coding region and for strong folding elsewhere in the coding sequence, with an additional peak of selection for strong folding located downstream of the start codon. The strength of this pattern varies between species and organism groups, and we highlight contradicting cases. To better understand the underlying evolutionary process, we show that selection strengths in the different regions are strongly correlated, and report four factors which have a clear predictive effect on local mRNA folding selection within the coding sequence in different species. CONCLUSIONS The correlations observed between selection for local secondary structure strength in the different regions and with the four genomic and environmental factors suggest that they are shaped by the same evolutionary process throughout the coding sequence, and might be maintained under direct selection related to optimization of gene expression and specifically translation regulation.
Collapse
Affiliation(s)
- Michael Peeri
- Department of Biomedical Engineering, Tel-Aviv University, Tel-Aviv, Israel
| | - Tamir Tuller
- Department of Biomedical Engineering, Tel-Aviv University, Tel-Aviv, Israel.
- Sagol School of Neuroscience, Tel-Aviv University, Tel-Aviv, Israel.
| |
Collapse
|
34
|
Mikhailov KV, Efeykin BD, Panchin AY, Knorre DA, Logacheva MD, Penin AA, Muntyan MS, Nikitin MA, Popova OV, Zanegina ON, Vyssokikh MY, Spiridonov SE, Aleoshin VV, Panchin YV. Coding palindromes in mitochondrial genes of Nematomorpha. Nucleic Acids Res 2020; 47:6858-6870. [PMID: 31194871 PMCID: PMC6649704 DOI: 10.1093/nar/gkz517] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2019] [Revised: 05/29/2019] [Accepted: 06/01/2019] [Indexed: 12/11/2022] Open
Abstract
Inverted repeats are common DNA elements, but they rarely overlap with protein-coding sequences due to the ensuing conflict with the structure and function of the encoded protein. We discovered numerous perfect inverted repeats of considerable length (up to 284 bp) embedded within the protein-coding genes in mitochondrial genomes of four Nematomorpha species. Strikingly, both arms of the inverted repeats encode conserved regions of the amino acid sequence. We confirmed enzymatic activity of the respiratory complex I encoded by inverted repeat-containing genes. The nucleotide composition of inverted repeats suggests strong selection at the amino acid level in these regions. We conclude that the inverted repeat-containing genes are transcribed and translated into functional proteins. The survey of available mitochondrial genomes reveals that several other organisms possess similar albeit shorter embedded repeats. Mitochondrial genomes of Nematomorpha demonstrate an extraordinary evolutionary compromise where protein function and stringent secondary structure elements within the coding regions are preserved simultaneously.
Collapse
Affiliation(s)
- Kirill V Mikhailov
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Leninskiye Gory 1-40, Moscow 119991, Russian Federation.,Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow 127994, Russian Federation
| | - Boris D Efeykin
- Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow 127994, Russian Federation.,Severtsov Institute of Ecology and Evolution, Moscow 119071, Russian Federation
| | - Alexander Y Panchin
- Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow 127994, Russian Federation
| | - Dmitry A Knorre
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Leninskiye Gory 1-40, Moscow 119991, Russian Federation.,Institute of Molecular Medicine, Sechenov First Moscow State Medical University, Moscow 119991, Russian Federation
| | - Maria D Logacheva
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Leninskiye Gory 1-40, Moscow 119991, Russian Federation.,Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow 127994, Russian Federation.,Center for Data-Intensive Biomedicine and Biotechnology, Skolkovo Institute of Science and Technology, Moscow 143028, Russian Federation
| | - Aleksey A Penin
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Leninskiye Gory 1-40, Moscow 119991, Russian Federation.,Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow 127994, Russian Federation
| | - Maria S Muntyan
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Leninskiye Gory 1-40, Moscow 119991, Russian Federation
| | - Mikhail A Nikitin
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Leninskiye Gory 1-40, Moscow 119991, Russian Federation.,Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow 127994, Russian Federation
| | - Olga V Popova
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Leninskiye Gory 1-40, Moscow 119991, Russian Federation
| | - Olga N Zanegina
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Leninskiye Gory 1-40, Moscow 119991, Russian Federation
| | - Mikhail Y Vyssokikh
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Leninskiye Gory 1-40, Moscow 119991, Russian Federation
| | - Sergei E Spiridonov
- Severtsov Institute of Ecology and Evolution, Moscow 119071, Russian Federation
| | - Vladimir V Aleoshin
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Leninskiye Gory 1-40, Moscow 119991, Russian Federation.,Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow 127994, Russian Federation
| | - Yuri V Panchin
- Belozersky Institute of Physico-Chemical Biology, Lomonosov Moscow State University, Leninskiye Gory 1-40, Moscow 119991, Russian Federation.,Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow 127994, Russian Federation
| |
Collapse
|
35
|
Victor MP, Acharya D, Begum T, Ghosh TC. The optimization of mRNA expression level by its intrinsic properties—Insights from codon usage pattern and structural stability of mRNA. Genomics 2019; 111:1292-1297. [DOI: 10.1016/j.ygeno.2018.08.009] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2018] [Revised: 08/14/2018] [Accepted: 08/24/2018] [Indexed: 11/17/2022]
|
36
|
Razban RM. Protein Melting Temperature Cannot Fully Assess Whether Protein Folding Free Energy Underlies the Universal Abundance-Evolutionary Rate Correlation Seen in Proteins. Mol Biol Evol 2019; 36:1955-1963. [PMID: 31093676 PMCID: PMC6736436 DOI: 10.1093/molbev/msz119] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
The protein misfolding avoidance hypothesis explains the universal negative correlation between protein abundance and sequence evolutionary rate across the proteome by identifying protein folding free energy (ΔG) as the confounding variable. Abundant proteins resist toxic misfolding events by being more stable, and more stable proteins evolve slower because their mutations are more destabilizing. Direct supporting evidence consists only of computer simulations. A study taking advantage of a recent experimental breakthrough in measuring protein stability proteome-wide through melting temperature (Tm) (Leuenberger et al. 2017), found weak misfolding avoidance hypothesis support for the Escherichia coli proteome, and no support for the Saccharomyces cerevisiae, Homo sapiens, and Thermus thermophilus proteomes (Plata and Vitkup 2018). I find that the nontrivial relationship between Tm and ΔG and inaccuracy in Tm measurements by Leuenberger et al. 2017 can be responsible for not observing strong positive abundance-Tm and strong negative Tm-evolutionary rate correlations.
Collapse
Affiliation(s)
- Rostam M Razban
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, MA
| |
Collapse
|
37
|
Mirihana Arachchilage G, Hetti Arachchilage M, Venkataraman A, Piontkivska H, Basu S. Stable G-quadruplex enabling sequences are selected against by the context-dependent codon bias. Gene 2019; 696:149-161. [PMID: 30753890 DOI: 10.1016/j.gene.2019.02.006] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2018] [Revised: 01/14/2019] [Accepted: 02/05/2019] [Indexed: 12/22/2022]
Abstract
The distributions of secondary structural elements appear to differ between coding regions (CDS) of mRNAs compared to the untranslated regions (UTRs), presumably as a mechanism to fine-tune gene expression, including efficiency of translation. However, a systematic and comprehensive analysis of secondary structure avoidance because of potential bias in codon usage is difficult as some of the common secondary structures, such as, hairpins can be formed by numerous sequence combinations. Using G-quadruplex (GQ) as the model secondary structure we studied the impact of codon bias on GQs within the CDS. Because GQs can be predicted using specific consensus sequence motifs, they provide an excellent platform for investigation of the selectivity of such putative structures at the codon level. Using a bioinformatics approach, we calculated the frequencies of putative GQs within the CDS of a variety of species. Our results suggest that the most stable GQs appear to be significantly underrepresented within the CDS, through the use of specific synonymous codon combinations. Furthermore, we identified many peptide sequence motifs in which silent mutations can potentially alter translation via stable GQ formation. This work not only provides a comprehensive analysis on how stable secondary structures appear to be avoided within the CDS of mRNA, but also broadens the current understanding of synonymous codon usage as they relate to the structure-function relationship of RNA.
Collapse
Affiliation(s)
| | | | - Aparna Venkataraman
- Department of Chemistry and Biochemistry, Kent State University, Kent, OH 44242, United States of America
| | - Helen Piontkivska
- Department of Biological Sciences, Kent State University, Kent, OH 44242, United States of America
| | - Soumitra Basu
- Department of Chemistry and Biochemistry, Kent State University, Kent, OH 44242, United States of America.
| |
Collapse
|
38
|
Marek A, Tomala K. The Contribution of Purifying Selection, Linkage, and Mutation Bias to the Negative Correlation between Gene Expression and Polymorphism Density in Yeast Populations. Genome Biol Evol 2018; 10:2986-2996. [PMID: 30321329 PMCID: PMC6250307 DOI: 10.1093/gbe/evy225] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/10/2018] [Indexed: 11/13/2022] Open
Abstract
The negative correlation between the rate of protein evolution and expression level of a gene has been recognized as a universal law of the evolutionary biology (Koonin 2011). In our study, we apply a population-based approach to systematically investigate the relative importance of unequal mutation rate, linkage, and selection in the origin of the expression-polymorphism anticorrelation. We analyzed the DNA sequence of protein coding genes of 24 Saccharomyces cerevisiae and 58 Schizosaccharomyces pombe strains. We found that highly expressed genes had a substantially decreased number of polymorphic sites when compared with genes transcribed less extensively. This expression-dependent reduction was especially strong in the nonsynonymous sites, although it was also present in the synonymous sites and untranslated regions, both up and down of a gene. Most importantly, no such trend was found in introns. We used these observations, as well as analyses of site frequency spectra and data from mutation accumulation experiments, to show that the purifying selection acting on nonsynonymous sites was the main, but not exclusive, factor impeding molecular evolution within the coding sequences of highly expressed genes. Linkage could not fully explain the observed pattern of polymorphism within the untranslated regions and synonymous sites, although the contribution of selection acting directly on synonymous variants was extremely small. Finally, we found that the impact of mutational bias was rather negligible.
Collapse
Affiliation(s)
- Agnieszka Marek
- Institute of Environmental Sciences, Jagiellonian University, Krakow, Poland
| | - Katarzyna Tomala
- Institute of Environmental Sciences, Jagiellonian University, Krakow, Poland
| |
Collapse
|
39
|
Song K, Li L, Zhang G. Relationship Among Intron Length, Gene Expression, and Nucleotide Diversity in the Pacific Oyster Crassostrea gigas. MARINE BIOTECHNOLOGY (NEW YORK, N.Y.) 2018; 20:676-684. [PMID: 29967965 DOI: 10.1007/s10126-018-9838-0] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/19/2018] [Accepted: 06/05/2018] [Indexed: 06/08/2023]
Abstract
Crassostrea gigas is a model mollusk, but its genetic features have not been studied comprehensively. In this study, we used whole-genome resequencing data to identify and characterize nucleotide diversity and population recombination rate in a diverse collection of 21 C. gigas samples. Our analyses revealed that C. gigas harbors both extremely high genetic diversity and recombination rates across the whole genome as compared with those of the other taxa. The noncoding regions, introns, intergenic spacers, and untranslated regions (UTRs) showed a lower level diversity than the synonymous sites. The larger introns tended to have lower diversity. Moreover, we found a negative association of the non-synonymous diversity with gene expression, which suggested that purifying selection played an important role in shaping genetic diversity. The nucleotide diversity at the 100- and 50-kb levels was positively correlated with population recombination rates, which was expected if the diversity was shaped by purifying selection or hitchhiking of advantageous mutants. Our work gives a general picture of the oyster's polymorphism pattern and its association with recombination rates.
Collapse
Affiliation(s)
- Kai Song
- Key Laboratory of Experimental Marine Biology, Institute of Oceanology, Chinese Academy of Sciences, Qingdao, 266071, Shandong, China
- Laboratory for Marine Biology and Biotechnology, Qingdao National Laboratory for Marine Science and Technology, Qingdao, 266071, Shandong, China
- National & Local Joint Engineering Laboratory of Ecological Mariculture, Qingdao, 266071, Shandong, China
| | - Li Li
- Key Laboratory of Experimental Marine Biology, Institute of Oceanology, Chinese Academy of Sciences, Qingdao, 266071, Shandong, China.
- National & Local Joint Engineering Laboratory of Ecological Mariculture, Qingdao, 266071, Shandong, China.
- Laboratory for Marine Fisheries and Aquaculture, Qingdao National Laboratory for Marine Science and Technology, Qingdao, 266071, Shandong, China.
- Institute of Oceanology, Chinese Academy of Sciences, 7th Nanhai Rd., Qingdao, China.
| | - Guofan Zhang
- Key Laboratory of Experimental Marine Biology, Institute of Oceanology, Chinese Academy of Sciences, Qingdao, 266071, Shandong, China.
- Laboratory for Marine Biology and Biotechnology, Qingdao National Laboratory for Marine Science and Technology, Qingdao, 266071, Shandong, China.
- National & Local Joint Engineering Laboratory of Ecological Mariculture, Qingdao, 266071, Shandong, China.
- Institute of Oceanology, Chinese Academy of Sciences, 7th Nanhai Rd., Qingdao, China.
| |
Collapse
|
40
|
Zhang H, Dou S, He F, Luo J, Wei L, Lu J. Genome-wide maps of ribosomal occupancy provide insights into adaptive evolution and regulatory roles of uORFs during Drosophila development. PLoS Biol 2018; 16:e2003903. [PMID: 30028832 PMCID: PMC6070289 DOI: 10.1371/journal.pbio.2003903] [Citation(s) in RCA: 50] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2017] [Revised: 08/01/2018] [Accepted: 07/03/2018] [Indexed: 11/19/2022] Open
Abstract
Upstream open reading frames (uORFs) play important roles in regulating the main coding DNA sequences (CDSs) via translational repression. Despite their prevalence in the genomes, uORFs are overall discriminated against by natural selection. However, it remains unclear why in the genomes there are so many uORFs more conserved than expected under the assumption of neutral evolution. Here, we generated genome-wide maps of translational efficiency (TE) at the codon level throughout the life cycle of Drosophila melanogaster. We identified 35,735 uORFs that were expressed, and 32,224 (90.2%) of them showed evidence of ribosome occupancy during Drosophila development. The ribosome occupancy of uORFs is determined by genomic features, such as optimized sequence contexts around their start codons, a shorter distance to CDSs, and higher coding potentials. Our population genomic analysis suggests the segregating mutations that create or disrupt uORFs are overall deleterious in D. melanogaster. However, we found for the first time that many (68.3% of) newly fixed uORFs that are associated with ribosomes in D. melanogaster are driven by positive Darwinian selection. Our findings also suggest that uORFs play a vital role in controlling the translational program in Drosophila. Moreover, we found that many uORFs are transcribed or translated in a developmental stage-, sex-, or tissue-specific manner, suggesting that selective transcription or translation of uORFs could potentially modulate the TE of the downstream CDSs during Drosophila development. Upstream open reading frames (uORFs) in the 5′ untranslated regions (UTRs) of messenger RNAs can potentially inhibit translation of the downstream regions that encode proteins by sequestering protein-making machinery the ribosome. Moreover, mutations that destroy existing uORFs or create new ones are known to cause human disease. Although mutations that create new uORFs are generally deleterious and are selected against, many uORFs are evolutionarily conserved across eukaryotic species. To resolve this dilemma, we used extensive mRNA-Seq and ribosome profiling to generate high-resolution genome-wide maps of ribosome occupancy and translational efficiency (TE) during the life cycle of the fruit fly D. melanogaster. This allowed us to identify the sequence features of uORFs that influence their ability to associate with ribosomes. We demonstrate for the first time that the majority of the newly fixed uORFs in D. melanogaster, especially the translated ones, are under positive Darwinian selection. We also show that uORFs exert widespread repressive effects on the translation of the downstream protein-coding region. We find that many uORFs are transcribed or translated in a developmental stage-, sex-, or tissue-specific manner. Our results suggest that during Drosophila development, changes in the TE of uORFs, as well as the inclusion/exclusion of uORFs, are frequently exploited to inversely influence the translation of the downstream protein-coding regions. Our study provides novel insights into the molecular mechanisms and functional consequences of uORF-mediated regulation.
Collapse
Affiliation(s)
- Hong Zhang
- State Key Laboratory of Protein and Plant Gene Research, Center for Bioinformatics, School of Life Sciences, Peking University, Beijing, China
| | - Shengqian Dou
- State Key Laboratory of Protein and Plant Gene Research, Center for Bioinformatics, School of Life Sciences, Peking University, Beijing, China
| | - Feng He
- State Key Laboratory of Protein and Plant Gene Research, Center for Bioinformatics, School of Life Sciences, Peking University, Beijing, China
- Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, China
| | - Junjie Luo
- State Key Laboratory of Protein and Plant Gene Research, Center for Bioinformatics, School of Life Sciences, Peking University, Beijing, China
| | - Liping Wei
- State Key Laboratory of Protein and Plant Gene Research, Center for Bioinformatics, School of Life Sciences, Peking University, Beijing, China
| | - Jian Lu
- State Key Laboratory of Protein and Plant Gene Research, Center for Bioinformatics, School of Life Sciences, Peking University, Beijing, China
- Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, China
- * E-mail:
| |
Collapse
|
41
|
Du MZ, Liu S, Zeng Z, Alemayehu LA, Wei W, Guo FB. Amino acid compositions contribute to the proteins' evolution under the influence of their abundances and genomic GC content. Sci Rep 2018; 8:7382. [PMID: 29743515 PMCID: PMC5943316 DOI: 10.1038/s41598-018-25364-1] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2017] [Accepted: 04/16/2018] [Indexed: 12/23/2022] Open
Abstract
Inconsistent results on the association between evolutionary rates and amino acid composition of proteins have been reported in eukaryotes. However, there are few studies of how amino acid composition can influence evolutionary rates in bacteria. Thus, we constructed linear regression models between composition frequencies of amino acids and evolutionary rates for bacteria. Compositions of all amino acids can on average explain 21.5% of the variation in evolutionary rates among 273 investigated bacterial organisms. In five model organisms, amino acid composition contributes more to variation in evolutionary rates than protein abundance, and frequency of optimal codons. The contribution of individual amino acid composition to evolutionary rate varies among organisms. The closer the GC-content of genome to its maximum or minimum, the better the correlation between the amino acid content and the evolutionary rate of proteins would appear in that genome. The types of amino acids that significantly contribute to evolutionary rates can be grouped into GC-rich and AT-rich amino acids. Besides, the amino acid with high composition also contributes more to evolutionary rates than amino acid with low composition in proteome. In summary, amino acid composition significantly contributes to the rate of evolution in bacterial organisms and this in turn is impacted by GC-content.
Collapse
Affiliation(s)
- Meng-Ze Du
- School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu, China
| | - Shuo Liu
- School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu, China
| | - Zhi Zeng
- School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu, China
| | - Labena Abraham Alemayehu
- School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu, China
| | - Wen Wei
- School of Life Sciences, Chongqing University, Chongqing, China.
| | - Feng-Biao Guo
- School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu, China. .,Centre for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China. .,Key Laboratory for Neuroinformation of the Ministry of Education, University of Electronic Science and Technology of China, Chengdu, China.
| |
Collapse
|
42
|
Hockenberry AJ, Pah AR, Jewett MC, Amaral LAN. Leveraging genome-wide datasets to quantify the functional role of the anti-Shine-Dalgarno sequence in regulating translation efficiency. Open Biol 2017; 7:rsob.160239. [PMID: 28100663 PMCID: PMC5303271 DOI: 10.1098/rsob.160239] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2016] [Accepted: 12/15/2016] [Indexed: 11/18/2022] Open
Abstract
Studies dating back to the 1970s established that sequence complementarity between the anti-Shine–Dalgarno (aSD) sequence on prokaryotic ribosomes and the 5′ untranslated region of mRNAs helps to facilitate translation initiation. The optimal location of aSD sequence binding relative to the start codon, the full extents of the aSD sequence and the functional form of the relationship between aSD sequence complementarity and translation efficiency have not been fully resolved. Here, we investigate these relationships by leveraging the sequence diversity of endogenous genes and recently available genome-wide estimates of translation efficiency. We show that—after accounting for predicted mRNA structure—aSD sequence complementarity increases the translation of endogenous mRNAs by roughly 50%. Further, we observe that this relationship is nonlinear, with translation efficiency maximized for mRNAs with intermediate levels of aSD sequence complementarity. The mechanistic insights that we observe are highly robust: we find nearly identical results in multiple datasets spanning three distantly related bacteria. Further, we verify our main conclusions by re-analysing a controlled experimental dataset.
Collapse
Affiliation(s)
- Adam J Hockenberry
- Interdisciplinary Program in Biological Sciences, Northwestern University, Evanston, IL 60208, USA.,Department of Chemical and Biological Engineering, Northwestern University, Evanston, IL 60208, USA
| | - Adam R Pah
- Northwestern Institute on Complex Systems, Northwestern University, Evanston, IL 60208, USA.,Kellogg School of Management, Northwestern University, Evanston, IL 60208, USA
| | - Michael C Jewett
- Interdisciplinary Program in Biological Sciences, Northwestern University, Evanston, IL 60208, USA .,Department of Chemical and Biological Engineering, Northwestern University, Evanston, IL 60208, USA.,Chemistry of Life Processes Institute, Northwestern University, Evanston, IL 60208, USA
| | - Luís A N Amaral
- Department of Chemical and Biological Engineering, Northwestern University, Evanston, IL 60208, USA .,Northwestern Institute on Complex Systems, Northwestern University, Evanston, IL 60208, USA.,Department of Physics and Astronomy, Northwestern University, Evanston, IL 60208, USA
| |
Collapse
|
43
|
Song H, Gao H, Liu J, Tian P, Nan Z. Comprehensive analysis of correlations among codon usage bias, gene expression, and substitution rate in Arachis duranensis and Arachis ipaënsis orthologs. Sci Rep 2017; 7:14853. [PMID: 29093502 PMCID: PMC5665869 DOI: 10.1038/s41598-017-13981-1] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2017] [Accepted: 10/04/2017] [Indexed: 11/22/2022] Open
Abstract
The relationship between evolutionary rates and gene expression in model plant orthologs is well documented. However, little is known about the relationships between gene expression and evolutionary trends in Arachis orthologs. We identified 7,435 one-to-one orthologs, including 925 single-copy and 6,510 multiple-copy sequences in Arachis duranensis and Arachis ipaënsis. Codon usage was stronger for shorter polypeptides, which were encoded by codons with higher GC contents. Highly expressed coding sequences had higher codon usage bias, GC content, and expression breadth. Additionally, expression breadth was positively correlated with polypeptide length, but there was no correlation between gene expression and polypeptide length. Inferred selective pressure was also negatively correlated with both gene expression and expression breadth in all one-to-one orthologs, while positively but non-significantly correlated with gene expression in sequences with signatures of positive selection. Gene expression levels and expression breadth were significantly higher for single-copy genes than for multiple-copy genes. Similarly, the gene expression and expression breadth in sequences with signatures of purifying selection were higher than those of sequences with positive selective signatures. These results indicated that gene expression differed between single-copy and multiple-copy genes as well as sequences with signatures of positive and purifying selection.
Collapse
Affiliation(s)
- Hui Song
- State Key Laboratory of Grassland Agro-ecosystems, College of Pastoral Agriculture Science and Technology, Lanzhou University, Lanzhou, 730000, China.
| | - Hongjuan Gao
- State Key Laboratory of Grassland Agro-ecosystems, College of Pastoral Agriculture Science and Technology, Lanzhou University, Lanzhou, 730000, China
| | - Jing Liu
- State Key Laboratory of Grassland Agro-ecosystems, College of Pastoral Agriculture Science and Technology, Lanzhou University, Lanzhou, 730000, China
| | - Pei Tian
- State Key Laboratory of Grassland Agro-ecosystems, College of Pastoral Agriculture Science and Technology, Lanzhou University, Lanzhou, 730000, China
| | - Zhibiao Nan
- State Key Laboratory of Grassland Agro-ecosystems, College of Pastoral Agriculture Science and Technology, Lanzhou University, Lanzhou, 730000, China.
| |
Collapse
|
44
|
Nucleotides upstream of the Kozak sequence strongly influence gene expression in the yeast S. cerevisiae. J Biol Eng 2017; 11:25. [PMID: 28835771 PMCID: PMC5563945 DOI: 10.1186/s13036-017-0068-1] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2017] [Accepted: 06/13/2017] [Indexed: 11/25/2022] Open
Abstract
Background In the yeast Saccharomyces cerevisiae, as in every eukaryotic organism, the mRNA 5′-untranslated region (UTR) is important for translation initiation. However, the patterns and mechanisms that determine the efficiency with which ribozomes bind mRNA, the elongation of ribosomes through the 5′-UTR, and the formation of a stable translation initiation complex are not clear. Genes that are highly expressed in S. cerevisiae seem to prefer a 5′-UTR rich in adenine and poor in guanine, particularly in the Kozak sequence, which occupies roughly the first six nucleotides upstream of the START codon. Results We measured the fluorescence produced by 58 synthetic versions of the S. cerevisiae minimal CYC1 promoter (pCYC1min), each containing a different 5′-UTR. First, we replaced with adenine the last 15 nucleotides of the original pCYC1min 5′-UTR—a theoretically optimal configuration for high gene expression. Next, we carried out single and multiple point mutations on it. Protein synthesis was highly affected by both single and multiple point mutations upstream of the Kozak sequence. RNAfold simulations revealed that significant changes in the mRNA secondary structures occur by mutating more than three adenines into guanines between positions −15 and −9. Furthermore, the effect of point mutations turned out to be strongly context-dependent, indicating that adenines placed just upstream of the START codon do not per se guarantee an increase in gene expression, as previously suggested. Conclusions New synthetic eukaryotic promoters, which differ for their translation initiation rate, can be built by acting on the nucleotides upstream of the Kozak sequence. Translation efficiency could, potentially, be influenced by another portion of the 5′-UTR further upstream of the START codon. A deeper understanding of the role of the 5′-UTR in gene expression would improve criteria for choosing and using promoters inside yeast synthetic gene circuits. Electronic supplementary material The online version of this article (doi:10.1186/s13036-017-0068-1) contains supplementary material, which is available to authorized users.
Collapse
|
45
|
Feyertag F, Berninsone PM, Alvarez-Ponce D. Secreted Proteins Defy the Expression Level-Evolutionary Rate Anticorrelation. Mol Biol Evol 2017; 34:692-706. [PMID: 28007979 DOI: 10.1093/molbev/msw268] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
The rates of evolution of the proteins of any organism vary across orders of magnitude. A primary factor influencing rates of protein evolution is expression. A strong negative correlation between expression levels and evolutionary rates (the so-called E-R anticorrelation) has been observed in virtually all studied organisms. This effect is currently attributed to the abundance-dependent fitness costs of misfolding and unspecific protein-protein interactions, among other factors. Secreted proteins are folded in the endoplasmic reticulum, a compartment where chaperones, folding catalysts, and stringent quality control mechanisms promote their correct folding and may reduce the fitness costs of misfolding. In addition, confinement of secreted proteins to the extracellular space may reduce misinteractions and their deleterious effects. We hypothesize that each of these factors (the secretory pathway quality control and extracellular location) may reduce the strength of the E-R anticorrelation. Indeed, here we show that among human proteins that are secreted to the extracellular space, rates of evolution do not correlate with protein abundances. This trend is robust to controlling for several potentially confounding factors and is also observed when analyzing protein abundance data for 6 human tissues. In addition, analysis of mRNA abundance data for 32 human tissues shows that the E-R correlation is always less negative, and sometimes nonsignificant, in secreted proteins. Similar observations were made in Caenorhabditis elegans and in Escherichia coli, and to a lesser extent in Drosophila melanogaster, Saccharomyces cerevisiae and Arabidopsis thaliana. Our observations contribute to understand the causes of the E-R anticorrelation.
Collapse
Affiliation(s)
- Felix Feyertag
- Department of Biology, University of Nevada, Reno, Reno, NV
| | | | | |
Collapse
|
46
|
Lynch M, Ackerman MS, Gout JF, Long H, Sung W, Thomas WK, Foster PL. Genetic drift, selection and the evolution of the mutation rate. Nat Rev Genet 2017; 17:704-714. [PMID: 27739533 DOI: 10.1038/nrg.2016.104] [Citation(s) in RCA: 438] [Impact Index Per Article: 62.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023]
Abstract
As one of the few cellular traits that can be quantified across the tree of life, DNA-replication fidelity provides an excellent platform for understanding fundamental evolutionary processes. Furthermore, because mutation is the ultimate source of all genetic variation, clarifying why mutation rates vary is crucial for understanding all areas of biology. A potentially revealing hypothesis for mutation-rate evolution is that natural selection primarily operates to improve replication fidelity, with the ultimate limits to what can be achieved set by the power of random genetic drift. This drift-barrier hypothesis is consistent with comparative measures of mutation rates, provides a simple explanation for the existence of error-prone polymerases and yields a formal counter-argument to the view that selection fine-tunes gene-specific mutation rates.
Collapse
Affiliation(s)
- Michael Lynch
- Department of Biology, Indiana University, Bloomington, Indiana 47401, USA
| | - Matthew S Ackerman
- Department of Biology, Indiana University, Bloomington, Indiana 47401, USA
| | - Jean-Francois Gout
- Department of Biology, Indiana University, Bloomington, Indiana 47401, USA
| | - Hongan Long
- Department of Biology, Indiana University, Bloomington, Indiana 47401, USA
| | - Way Sung
- Department of Biology, Indiana University, Bloomington, Indiana 47401, USA
| | - W Kelley Thomas
- Department of Molecular, Cellular, and Biomedical Sciences, University of New Hampshire, Durham, New Hampshire 03824, USA
| | - Patricia L Foster
- Department of Biology, Indiana University, Bloomington, Indiana 47401, USA
| |
Collapse
|
47
|
Josephs EB, Wright SI, Stinchcombe JR, Schoen DJ. The Relationship between Selection, Network Connectivity, and Regulatory Variation within a Population of Capsella grandiflora. Genome Biol Evol 2017; 9:1099-1109. [PMID: 28402527 PMCID: PMC5408089 DOI: 10.1093/gbe/evx068] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/07/2017] [Indexed: 12/12/2022] Open
Abstract
Interactions between genes can have important consequences for how selection shapes sequence variation at these genes. Specifically, genes that have pleiotropic effects by affecting the expression level of many other genes may be under stronger selective constraint. We used coexpression networks to measure connectivity between genes and investigated the relationship between gene connectivity and selection in a natural population of the plant Capsella grandiflora. We observed that network connectivity was negatively correlated with genetic divergence due to stronger negative selection on highly-connected genes even when controlling for variation in gene expression level. However, the presence of local regulatory variation for a gene's expression level was also associated with reduced negative selection and lower gene connectivity. While it is difficult to disentangle the causal relationships between these factors, our results show that both connectivity and local regulatory variation are important factors for explaining variation in selection between genes.
Collapse
Affiliation(s)
- Emily B. Josephs
- Department of Evolution and Ecology, University of California, Davis
| | - Stephen I. Wright
- Department of Ecology and Evolutionary Biology, University of Toronto, Ontario, Canada
| | - John R. Stinchcombe
- Department of Ecology and Evolutionary Biology, University of Toronto, Ontario, Canada
| | - Daniel J. Schoen
- Department of Biology, McGill University, Stewart Biology Building, Montreal, Quebec, Canada
| |
Collapse
|
48
|
Duan Y, Dou S, Luo S, Zhang H, Lu J. Adaptation of A-to-I RNA editing in Drosophila. PLoS Genet 2017; 13:e1006648. [PMID: 28282384 PMCID: PMC5365144 DOI: 10.1371/journal.pgen.1006648] [Citation(s) in RCA: 47] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2016] [Revised: 03/24/2017] [Accepted: 02/21/2017] [Indexed: 01/14/2023] Open
Abstract
Adenosine-to-inosine (A-to-I) editing is hypothesized to facilitate adaptive evolution by expanding proteomic diversity through an epigenetic approach. However, it is challenging to provide evidences to support this hypothesis at the whole editome level. In this study, we systematically characterized 2,114 A-to-I RNA editing sites in female and male brains of D. melanogaster, and nearly half of these sites had events evolutionarily conserved across Drosophila species. We detected strong signatures of positive selection on the nonsynonymous editing sites in Drosophila brains, and the beneficial editing sites were significantly enriched in genes related to chemical and electrical neurotransmission. The signal of adaptation was even more pronounced for the editing sites located in X chromosome or for those commonly observed across Drosophila species. We identified a set of gene candidates (termed "PSEB" genes) that had nonsynonymous editing events favored by natural selection. We presented evidence that editing preferentially increased mutation sequence space of evolutionarily conserved genes, which supported the adaptive evolution hypothesis of editing. We found prevalent nonsynonymous editing sites that were favored by natural selection in female and male adults from five strains of D. melanogaster. We showed that temperature played a more important role than gender effect in shaping the editing levels, although the effect of temperature is relatively weaker compared to that of species effect. We also explored the relevant factors that shape the selective patterns of the global editomes. Altogether we demonstrated that abundant nonsynonymous editing sites in Drosophila brains were adaptive and maintained by natural selection during evolution. Our results shed new light on the evolutionary principles and functional consequences of RNA editing.
Collapse
Affiliation(s)
- Yuange Duan
- State Key Laboratory of Protein and Plant Gene Research, Center for Bioinformatics, School of Life Sciences & Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, China
- Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, China
| | - Shengqian Dou
- State Key Laboratory of Protein and Plant Gene Research, Center for Bioinformatics, School of Life Sciences & Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, China
| | - Shiqi Luo
- State Key Laboratory of Protein and Plant Gene Research, Center for Bioinformatics, School of Life Sciences & Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, China
| | - Hong Zhang
- State Key Laboratory of Protein and Plant Gene Research, Center for Bioinformatics, School of Life Sciences & Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, China
| | - Jian Lu
- State Key Laboratory of Protein and Plant Gene Research, Center for Bioinformatics, School of Life Sciences & Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, China
| |
Collapse
|
49
|
Redondo RAF, de Vladar HP, Włodarski T, Bollback JP. Evolutionary interplay between structure, energy and epistasis in the coat protein of the ϕX174 phage family. J R Soc Interface 2017; 14:20160139. [PMID: 28053111 PMCID: PMC5310724 DOI: 10.1098/rsif.2016.0139] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2016] [Accepted: 11/29/2016] [Indexed: 01/01/2023] Open
Abstract
Viral capsids are structurally constrained by interactions among the amino acids (AAs) of their constituent proteins. Therefore, epistasis is expected to evolve among physically interacting sites and to influence the rates of substitution. To study the evolution of epistasis, we focused on the major structural protein of the ϕX174 phage family by first reconstructing the ancestral protein sequences of 18 species using a Bayesian statistical framework. The inferred ancestral reconstruction differed at eight AAs, for a total of 256 possible ancestral haplotypes. For each ancestral haplotype and the extant species, we estimated, in silico, the distribution of free energies and epistasis of the capsid structure. We found that free energy has not significantly increased but epistasis has. We decomposed epistasis up to fifth order and found that higher-order epistasis sometimes compensates pairwise interactions making the free energy seem additive. The dN/dS ratio is low, suggesting strong purifying selection, and that structure is under stabilizing selection. We synthesized phages carrying ancestral haplotypes of the coat protein gene and measured their fitness experimentally. Our findings indicate that stabilizing mutations can have higher fitness, and that fitness optima do not necessarily coincide with energy minima.
Collapse
Affiliation(s)
| | - Harold P de Vladar
- IST Austria, Am Campus 1, 3400 Klosterneuburg, Austria
- Center for the Conceptual Foundations of Science, Parmenides Foundation, 82049 Pullach, Germany
| | - Tomasz Włodarski
- Department of Structural and Molecular Biology, University College London, London WC1E 6BT, UK
| | | |
Collapse
|
50
|
Alvarez-Ponce D, Sabater-Muñoz B, Toft C, Ruiz-González MX, Fares MA. Essentiality Is a Strong Determinant of Protein Rates of Evolution during Mutation Accumulation Experiments in Escherichia coli. Genome Biol Evol 2016; 8:2914-2927. [PMID: 27566759 PMCID: PMC5630975 DOI: 10.1093/gbe/evw205] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
The Neutral Theory of Molecular Evolution is considered the most powerful theory to understand the evolutionary behavior of proteins. One of the main predictions of this theory is that essential proteins should evolve slower than dispensable ones owing to increased selective constraints. Comparison of genomes of different species, however, has revealed only small differences between the rates of evolution of essential and nonessential proteins. In some analyses, these differences vanish once confounding factors are controlled for, whereas in other cases essentiality seems to have an independent, albeit small, effect. It has been argued that comparing relatively distant genomes may entail a number of limitations. For instance, many of the genes that are dispensable in controlled lab conditions may be essential in some of the conditions faced in nature. Moreover, essentiality can change during evolution, and rates of protein evolution are simultaneously shaped by a variety of factors, whose individual effects are difficult to isolate. Here, we conducted two parallel mutation accumulation experiments in Escherichia coli, during 5,500–5,750 generations, and compared the genomes at different points of the experiments. Our approach (a short-term experiment, under highly controlled conditions) enabled us to overcome many of the limitations of previous studies. We observed that essential proteins evolved substantially slower than nonessential ones during our experiments. Strikingly, rates of protein evolution were only moderately affected by expression level and protein length.
Collapse
Affiliation(s)
| | - Beatriz Sabater-Muñoz
- Instituto de Biología Molecular y Celular de Plantas (CSIC-UPV), Valencia, Spain Department of Genetics, Smurfit Institute of Genetics, University of Dublin, Trinity College Dublin, Dublin, Ireland
| | - Christina Toft
- Department of Genetics, University of Valencia, Valencia, Spain Departamento de Biotecnología, Instituto de Agroquímica y Tecnología de los Alimentos (CSIC), Valencia, Spain
| | - Mario X Ruiz-González
- Instituto de Biología Molecular y Celular de Plantas (CSIC-UPV), Valencia, Spain Current Address: Secretaría de Educación Superior, Ciencia, Tecnología e Innovación, Proyecto Prometeo; Departamento de Ciencias Biológicas, Universidad Tócnica Particular de Loja, Loja, Ecuador
| | - Mario A Fares
- Instituto de Biología Molecular y Celular de Plantas (CSIC-UPV), Valencia, Spain Department of Genetics, Smurfit Institute of Genetics, University of Dublin, Trinity College Dublin, Dublin, Ireland
| |
Collapse
|