1
|
Akeju OJ, Cope AL. Re-examining Correlations Between Synonymous Codon Usage and Protein Bond Angles in Escherichia coli. Genome Biol Evol 2024; 16:evae080. [PMID: 38619010 PMCID: PMC11077309 DOI: 10.1093/gbe/evae080] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2023] [Revised: 04/05/2024] [Accepted: 04/10/2024] [Indexed: 04/16/2024] Open
Abstract
Rosenberg AA, Marx A, Bronstein AM (Codon-specific Ramachandran plots show amino acid backbone conformation depends on identity of the translated codon. Nat Commun. 2022:13:2815) recently found a surprising correlation between synonymous codon usage and the dihedral bond angles of the resulting amino acid. However, their analysis did not account for the strongest known correlate of codon usage: gene expression. We re-examined the relationship between bond angles and codon usage by applying the approach of Rosenberg et al. to simulated protein-coding sequences that (i) have random codon usage, (ii) codon usage determined by mutation biases, and (iii) maintain the general relationship between codon usage and gene expression via the assumption of selection-mutation-drift equilibrium. We observed correlations between dihedral bond angle and codon usage when codon usage is entirely random, indicating possible conflation of noise with differences in bond angle distributions between synonymous codons. More relevant to the general analysis of codon usage patterns, we found surprisingly good agreement between the analysis of the real sequences and the analysis of sequences simulated assuming selection-mutation-drift equilibrium, with 91% of significant synonymous codon pairs detected in the former were also detected in the latter. We believe the correlation between codon usage and dihedral bond angles resulted from the variation in codon usage across genes due to the interplay between mutation bias, natural selection for translation efficiency, and gene expression, further underscoring these factors must be controlled for when looking for novel patterns related to codon usage.
Collapse
Affiliation(s)
| | - Alexander L Cope
- Department of Genetics, Rutgers University, Piscataway, New Jersey, USA
- Human Genetics Institute of New Jersey, Rutgers University, Piscataway, New Jersey, USA
- Robert Wood Johnson Medical School, Rutgers University, Piscataway, New Jersey, USA
| |
Collapse
|
2
|
Wang SH, Zhao Y, Wang CC, Chu F, Miao LY, Zhang L, Zhuo L, Chen X. RFEM: A framework for essential microRNA identification in mice based on rotation forest and multiple feature fusion. Comput Biol Med 2024; 171:108177. [PMID: 38422957 DOI: 10.1016/j.compbiomed.2024.108177] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2023] [Revised: 01/21/2024] [Accepted: 02/18/2024] [Indexed: 03/02/2024]
Abstract
With the increasing number of microRNAs (miRNAs), identifying essential miRNAs has become an important task that needs to be solved urgently. However, there are few computational methods for essential miRNA identification. Here, we proposed a novel framework called Rotation Forest for Essential MicroRNA identification (RFEM) to predict the essentiality of miRNAs in mice. We first constructed 1,264 miRNA features of all miRNA samples by fusing 38 miRNA features obtained from the PESM paper and 1,226 miRNA functional features calculated based on miRNA-target gene interactions. Then, we employed 182 training samples with 1,264 features to train the rotation forest model, which was applied to compute the essentiality scores of the candidate samples. The main innovations of RFEM were as follows: 1) miRNA functional features were introduced to enrich the diversity of miRNA features; 2) the rotation forest model used decision tree as the base classifier and could increase the difference among base classifiers through feature transformation to achieve better ensemble results. Experimental results show that RFEM significantly outperformed two previous models with the AUC (AUPR) of 0.942 (0.944) in three comparison experiments under 5-fold cross validation, which proved the model's reliable performance. Moreover, ablation study was further conducted to demonstrate the effectiveness of the novel miRNA functional features. Additionally, in the case studies of assessing the essentiality of unlabeled miRNAs, experimental literature confirmed that 7 of the top 10 predicted miRNAs have crucial biological functions in mice. Therefore, RFEM would be a reliable tool for identifying essential miRNAs.
Collapse
Affiliation(s)
- Shu-Hao Wang
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, 221116, China; Artificial Intelligence Research Institute, China University of Mining and Technology, Xuzhou, 221116, China
| | - Yan Zhao
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, 221116, China
| | - Chun-Chun Wang
- School of Science, Jiangnan University, Wuxi, 214122, China
| | - Fei Chu
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, 221116, China; Artificial Intelligence Research Institute, China University of Mining and Technology, Xuzhou, 221116, China
| | - Lian-Ying Miao
- School of Mathematics, China University of Mining and Technology, Xuzhou, 221116, China
| | - Li Zhang
- School of Information and Control Engineering, China University of Mining and Technology, Xuzhou, 221116, China
| | - Linlin Zhuo
- School of Data Science and Artificial Intelligence, Wenzhou University of Technology, Wenzhou, 325000, China.
| | - Xing Chen
- School of Science, Jiangnan University, Wuxi, 214122, China.
| |
Collapse
|
3
|
Kundu S, Kang HE, Kim AR, Lee SR, Kim EB, Amin MHF, Andriyono S, Kim HW, Kang K. Mitogenomic Characterization and Phylogenetic Placement of African Hind, Cephalopholis taeniops: Shedding Light on the Evolution of Groupers (Serranidae: Epinephelinae). Int J Mol Sci 2024; 25:1822. [PMID: 38339100 PMCID: PMC10855530 DOI: 10.3390/ijms25031822] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2023] [Revised: 01/23/2024] [Accepted: 01/31/2024] [Indexed: 02/12/2024] Open
Abstract
The global exploration of evolutionary trends in groupers, based on mitogenomes, is currently underway. This research extensively investigates the structure of and variations in Cephalopholis species mitogenomes, along with their phylogenetic relationships, focusing specifically on Cephalopholis taeniops from the Eastern Atlantic Ocean. The generated mitogenome spans 16,572 base pairs and exhibits a gene order analogous to that of the ancestral teleost's, featuring 13 protein-coding genes (PCGs), two ribosomal RNA genes (rRNAs), 22 transfer RNA genes (tRNAs), and an AT-rich control region. The mitogenome of C. taeniops displays an AT bias (54.99%), aligning with related species. The majority of PCGs in the mitogenome initiate with the start codon ATG, with the exceptions being COI (GTG) and atp6 (TTG). The relative synonymous codon usage analysis revealed the maximum abundance of leucine, proline, serine, and threonine. The nonsynonymous/synonymous ratios were <1, which indicates a strong negative selection among all PCGs of the Cephalopholis species. In C. taeniops, the prevalent transfer RNAs display conventional cloverleaf secondary structures, except for tRNA-serine (GCT), which lacks a dihydrouracil (DHU) stem. A comparative examination of conserved domains and sequence blocks across various Cephalopholis species indicates noteworthy variations in length and nucleotide diversity. Maximum likelihood, neighbor-joining, and Bayesian phylogenetic analyses, employing the concatenated PCGs and a combination of PCGs + rRNAs, distinctly separate all Cephalopholis species, including C. taeniops. Overall, these findings deepen our understanding of evolutionary relationships among serranid groupers, emphasizing the significance of structural considerations in mitogenomic analyses.
Collapse
Affiliation(s)
- Shantanu Kundu
- Institute of Fisheries Science, Pukyong National University, Busan 48513, Republic of Korea;
| | - Hye-Eun Kang
- Institute of Marine Life Science, Pukyong National University, Busan 48513, Republic of Korea;
| | - Ah Ran Kim
- Marine Integrated Biomedical Technology Center, National Key Research Institutes in Universities, Pukyong National University, Busan 48513, Republic of Korea; (A.R.K.); (S.R.L.)
| | - Soo Rin Lee
- Marine Integrated Biomedical Technology Center, National Key Research Institutes in Universities, Pukyong National University, Busan 48513, Republic of Korea; (A.R.K.); (S.R.L.)
| | - Eun-Bi Kim
- Ocean Georesources Research Department, Korea Institute of Ocean Science and Technology, Busan 49111, Republic of Korea;
| | - Muhammad Hilman Fu’adil Amin
- Advance Tropical Biodiversity, Genomics, and Conservation Research Group, Department of Biology, Faculty of Science and Technology, Airlangga University, Surabaya 60115, Indonesia;
| | - Sapto Andriyono
- Department of Marine, Faculty of Fisheries and Marine, Airlangga University, Surabaya 60115, Indonesia
| | - Hyun-Woo Kim
- Marine Integrated Biomedical Technology Center, National Key Research Institutes in Universities, Pukyong National University, Busan 48513, Republic of Korea; (A.R.K.); (S.R.L.)
- Department of Marine Biology, Pukyong National University, Busan 48513, Republic of Korea
| | - Kyoungmi Kang
- International Graduate Program of Fisheries Science, Pukyong National University, Busan 48513, Republic of Korea
| |
Collapse
|
4
|
Malyutina A, Tang J, Amiryousefi A. Resolving network clusters disparity based on dissimilarity measurements with nonmetric analysis of variance. iScience 2023; 26:108354. [PMID: 38026214 PMCID: PMC10663764 DOI: 10.1016/j.isci.2023.108354] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2022] [Revised: 06/22/2023] [Accepted: 10/24/2023] [Indexed: 12/01/2023] Open
Abstract
Classic ANOVA (cA) tests the explanatory power of a partitioning on a set of objects. More fit for clusters proximity analysis, nonparametric ANOVA (npA) extends to a case where instead of the object values themselves, their mutual distances are available. However, extending the cA applicability, the metric conditions in npA are limiting. Based on the central limit theorem (CLT), here we introduce nonmetric ANOVA (nmA) that by relaxing the metric properties between objects, allows an ANOVA-like statistical testing of a network clusters disparity. We present a parametric test statistic which under the null hypothesis of no differences between the competing clusters means, follows an exact F-distribution. We apply our method on three diverse biological examples, discuss its parallel performance, and note the specific use of each method tailored by the inherent data properties. The R code is provided at github.com/AmiryousefiLab/nmANOVA.
Collapse
Affiliation(s)
- Alina Malyutina
- Research Program in Systems Oncology, Faculty of Medicine, University of Helsinki, 00014 Helsinki, Finland
| | - Jing Tang
- Research Program in Systems Oncology, Faculty of Medicine, University of Helsinki, 00014 Helsinki, Finland
| | - Ali Amiryousefi
- Research Program in Systems Oncology, Faculty of Medicine, University of Helsinki, 00014 Helsinki, Finland
- Laboratory of Systems Pharmacology, Harvard Medical School, 200 Longwood Avenue, Boston, MA 02115, USA
| |
Collapse
|
5
|
Chen X, Fang D, Xu Y, Duan K, Yoshida S, Yang S, Sahu SK, Fu H, Guang X, Liu M, Wu C, Liu Y, Mu W, Chen Y, Fan Y, Wang F, Peng S, Shi D, Wang Y, Yu R, Zhang W, Bai Y, Liu ZJ, Yan Q, Liu X, Xu X, Yang H, Wu J, Graham SW, Liu H. Balanophora genomes display massively convergent evolution with other extreme holoparasites and provide novel insights into parasite-host interactions. NATURE PLANTS 2023; 9:1627-1642. [PMID: 37735254 DOI: 10.1038/s41477-023-01517-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/29/2021] [Accepted: 08/18/2023] [Indexed: 09/23/2023]
Abstract
Parasitic plants have evolved to be subtly or severely dependent on host plants to complete their life cycle. To provide new insights into the biology of parasitic plants in general, we assembled genomes for members of the sandalwood order Santalales, including a stem hemiparasite (Scurrula) and two highly modified root holoparasites (Balanophora) that possess chimaeric host-parasite tubers. Comprehensive genome comparisons reveal that hemiparasitic Scurrula has experienced a relatively minor degree of gene loss compared with autotrophic plants, consistent with its moderate degree of parasitism. Nonetheless, patterns of gene loss appear to be substantially divergent across distantly related lineages of hemiparasites. In contrast, Balanophora has experienced substantial gene loss for the same sets of genes as an independently evolved holoparasite lineage, the endoparasitic Sapria (Malpighiales), and the two holoparasite lineages experienced convergent contraction of large gene families through loss of paralogues. This unprecedented convergence supports the idea that despite their extreme and strikingly divergent life histories and morphology, the evolution of these and other holoparasitic lineages can be shaped by highly predictable modes of genome reduction. We observe substantial evidence of relaxed selection in retained genes for both hemi- and holoparasitic species. Transcriptome data also document unusual and novel interactions between Balanophora and host plants at the host-parasite tuber interface tissues, with evidence of mRNA exchange, substantial and active hormone exchange and immune responses in parasite and host.
Collapse
Affiliation(s)
- Xiaoli Chen
- State Key Laboratory of Agricultural Genomics, BGI Research, Shenzhen, China
| | - Dongming Fang
- State Key Laboratory of Agricultural Genomics, BGI Research, Shenzhen, China
| | - Yuxing Xu
- Department of Economic Plants and Biotechnology, Yunnan Key Laboratory for Wild Plant Resources, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, China
| | - Kunyu Duan
- BGI College and Henan Institute of Medical and Pharmaceutical Sciences, Zhengzhou University, Zhengzhou, China
| | - Satoko Yoshida
- Graduate School of Science and Technology, Nara Institute of Science and Technology, Ikoma, Japan
| | - Shuai Yang
- State Key Laboratory of Agricultural Genomics, BGI Research, Shenzhen, China
| | - Sunil Kumar Sahu
- State Key Laboratory of Agricultural Genomics, BGI Research, Shenzhen, China
| | - Hui Fu
- BGI College and Henan Institute of Medical and Pharmaceutical Sciences, Zhengzhou University, Zhengzhou, China
| | - Xuanmin Guang
- State Key Laboratory of Agricultural Genomics, BGI Research, Shenzhen, China
| | - Min Liu
- State Key Laboratory of Agricultural Genomics, BGI Research, Shenzhen, China
| | - Chenyu Wu
- State Key Laboratory of Agricultural Genomics, BGI Research, Shenzhen, China
| | - Yang Liu
- State Key Laboratory of Agricultural Genomics, BGI Research, Shenzhen, China
- Key Laboratory of Southern Subtropical Plant Diversity, Fairy Lake Botanical Garden, Shenzhen and Chinese Academy of Sciences, Shenzhen, China
| | - Weixue Mu
- State Key Laboratory of Agricultural Genomics, BGI Research, Shenzhen, China
| | - Yewen Chen
- State Key Laboratory of Agricultural Genomics, BGI Research, Shenzhen, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, China
| | - Yannan Fan
- State Key Laboratory of Agricultural Genomics, BGI Research, Shenzhen, China
| | - Fang Wang
- State Key Laboratory of Agricultural Genomics, BGI Research, Shenzhen, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, China
| | - Shufeng Peng
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, China
- China National GeneBank, BGI-Shenzhen, Shenzhen, China
| | - Dishen Shi
- BGI College and Henan Institute of Medical and Pharmaceutical Sciences, Zhengzhou University, Zhengzhou, China
| | - Yayu Wang
- State Key Laboratory of Agricultural Genomics, BGI Research, Shenzhen, China
| | - Runxian Yu
- State Key Laboratory of Biocontrol and Guangdong Provincial Key Laboratory of Plant Resources, School of Life Sciences, Sun Yat-sen University, Guangzhou, China
| | - Wen Zhang
- China National GeneBank, BGI-Shenzhen, Shenzhen, China
| | - Yuqing Bai
- Administrative Office of Wutong Mountain National Park, Shenzhen, China
| | - Zhong-Jian Liu
- Key Laboratory of National Forestry and Grassland Administration for Orchid Conservation and Utilization at College of Landscape Architecture and Art, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Qiaoshun Yan
- Ailaoshan Station for Subtropical Forest Ecosystem Studies, Chinese Academy of Sciences, Jingdong, China
| | - Xin Liu
- State Key Laboratory of Agricultural Genomics, BGI Research, Shenzhen, China
| | - Xun Xu
- State Key Laboratory of Agricultural Genomics, BGI Research, Shenzhen, China
- Guangdong Provincial Key Laboratory of Genome Read and Write, Shenzhen, China
| | - Huanming Yang
- State Key Laboratory of Agricultural Genomics, BGI Research, Shenzhen, China
- Key Laboratory of Southern Subtropical Plant Diversity, Fairy Lake Botanical Garden, Shenzhen and Chinese Academy of Sciences, Shenzhen, China
- James D. Watson Institute of Genome Sciences, Hangzhou, China
| | - Jianqiang Wu
- Department of Economic Plants and Biotechnology, Yunnan Key Laboratory for Wild Plant Resources, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, China
| | - Sean W Graham
- Department of Botany, University of British Columbia, Vancouver, BC, Canada.
- Biodiversity Research Centre, University of British Columbia, Vancouver, BC, Canada.
| | - Huan Liu
- State Key Laboratory of Agricultural Genomics, BGI Research, Shenzhen, China.
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, China.
| |
Collapse
|
6
|
Vigué L, Tenaillon O. Predicting the effect of mutations to investigate recent events of selection across 60,472 Escherichia coli strains. Proc Natl Acad Sci U S A 2023; 120:e2304177120. [PMID: 37487088 PMCID: PMC10401003 DOI: 10.1073/pnas.2304177120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2023] [Accepted: 05/25/2023] [Indexed: 07/26/2023] Open
Abstract
Microbial genomics studies focusing on the dynamics of selection have often used a small number of distant genomes. As a result, they could only analyze mutations that had become fixed during the divergence between species. However, thousands of genomes of some species are now available in public databases, thanks to high-throughput sequencing. These data provide a more complete picture of the polymorphisms segregating within a species, offering a unique insight into the processes that shape the recent evolution of a species. In this study, we present GLASS (Gene-Level Amino-acid Score Shift), a selection test that is based on the predicted effects of amino acid changes. By comparing the distribution of effects of mutations observed in a gene to the expectation in the absence of selection, GLASS can quantify the intensity of selection. We applied GLASS to a dataset of 60,472 Escherichia coli strains and used this to reexamine the longstanding debate about the role of essentiality versus expression level in the rate of protein evolution. We found that selection has contrasting short-term and long-term dynamics, with essential genes being subject to strong purifying selection in the short term, while expression level determines the rate of gene evolution in the long term. GLASS also found an overrepresentation of inactivating mutations in specific transcription factors, such as efflux pump repressors, which is consistent with selection for antibiotic resistance. These gene-inactivating polymorphisms do not reach fixation, suggesting another contrast between short-term fitness gains and long-term counterselection.
Collapse
Affiliation(s)
- Lucile Vigué
- Université Paris Cité and Université Sorbonne Paris Nord, Inserm, Infection, Antimicrobials, Modelling, Evolution, F-75018Paris, France
| | - Olivier Tenaillon
- Université Paris Cité and Université Sorbonne Paris Nord, Inserm, Infection, Antimicrobials, Modelling, Evolution, F-75018Paris, France
| |
Collapse
|
7
|
Hara Y, Kuraku S. The impact of local genomic properties on the evolutionary fate of genes. eLife 2023; 12:82290. [PMID: 37223962 DOI: 10.7554/elife.82290] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2022] [Accepted: 04/25/2023] [Indexed: 05/25/2023] Open
Abstract
Functionally indispensable genes are likely to be retained and otherwise to be lost during evolution. This evolutionary fate of a gene can also be affected by factors independent of gene dispensability, including the mutability of genomic positions, but such features have not been examined well. To uncover the genomic features associated with gene loss, we investigated the characteristics of genomic regions where genes have been independently lost in multiple lineages. With a comprehensive scan of gene phylogenies of vertebrates with a careful inspection of evolutionary gene losses, we identified 813 human genes whose orthologs were lost in multiple mammalian lineages: designated 'elusive genes.' These elusive genes were located in genomic regions with rapid nucleotide substitution, high GC content, and high gene density. A comparison of the orthologous regions of such elusive genes across vertebrates revealed that these features had been established before the radiation of the extant vertebrates approximately 500 million years ago. The association of human elusive genes with transcriptomic and epigenomic characteristics illuminated that the genomic regions containing such genes were subject to repressive transcriptional regulation. Thus, the heterogeneous genomic features driving gene fates toward loss have been in place and may sometimes have relaxed the functional indispensability of such genes. This study sheds light on the complex interplay between gene function and local genomic properties in shaping gene evolution that has persisted since the vertebrate ancestor.
Collapse
Affiliation(s)
- Yuichiro Hara
- Research Center for Genome & Medical Sciences, Tokyo Metropolitan Institute of Medical Science, Tokyo, Japan
| | - Shigehiro Kuraku
- Molecular Life History Laboratory, Department of Genomics and Evolutionary Biology, National Institute of Genetics, Mishima, Japan
- Department of Genetics, Sokendai (Graduate University for Advanced Studies), Mishima, Japan
- RIKEN Center for Biosystems Dynamics Research, Kobe, Japan
| |
Collapse
|
8
|
Glazenburg MM, Laan L. Complexity and self-organization in the evolution of cell polarization. J Cell Sci 2023; 136:jcs259639. [PMID: 36691920 DOI: 10.1242/jcs.259639] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023] Open
Abstract
Cellular life exhibits order and complexity, which typically increase over the course of evolution. Cell polarization is a well-studied example of an ordering process that breaks the internal symmetry of a cell by establishing a preferential axis. Like many cellular processes, polarization is driven by self-organization, meaning that the macroscopic pattern emerges as a consequence of microscopic molecular interactions at the biophysical level. However, the role of self-organization in the evolution of complex protein networks remains obscure. In this Review, we provide an overview of the evolution of polarization as a self-organizing process, focusing on the model species Saccharomyces cerevisiae and its fungal relatives. Moreover, we use this model system to discuss how self-organization might relate to evolutionary change, offering a shift in perspective on evolution at the microscopic scale.
Collapse
Affiliation(s)
- Marieke M Glazenburg
- Department of Bionanoscience, Kavli Institute of Nanoscience, Faculty of Applied Sciences, Delft University of Technology, 2629 HZ Delft, The Netherlands
| | - Liedewij Laan
- Department of Bionanoscience, Kavli Institute of Nanoscience, Faculty of Applied Sciences, Delft University of Technology, 2629 HZ Delft, The Netherlands
| |
Collapse
|
9
|
Sun Y, Su Y, Hussain A, Xiong L, Li C, Zhang J, Meng Z, Dong Z, Yu G. Complete genome sequence of the Pogostemon cablin bacterial wilt pathogen Ralstonia solanacearum strain SY1. Genes Genomics 2023; 45:123-134. [PMID: 35670995 PMCID: PMC9171469 DOI: 10.1007/s13258-022-01270-9] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2022] [Accepted: 05/09/2022] [Indexed: 01/07/2023]
Abstract
BACKGROUND Ralstonia solanacearum causes bacterial wilt of Pogostemon cablin which is an important aromatic herb and also the main materials of COVID-19 therapeutic traditional drugs. However, we are lacking the information on the genomic sequences of R. solanacearum isolated from P. cablin. OBJECTIVE The acquisition and analysis of this whole-genome sequence of the P. cablin bacterial wilt pathogen. METHODS An R. solanacearum strain, named SY1, was isolated from infected P. cablin plants, and the complete genome sequence was sequenced and analyzed. RESULTS The SY1 strain contains a 3.70-Mb chromosome and a 2.18-Mb megaplasmid, with GC contents of 67.57% and 67.41%, respectively. A total of 3308 predicted genes were located on the chromosome and 1657 genes were located in the megaplasmid. SY1 strain has 273 unique genes compared with five representative R. solanacearum strains, and these genes were enriched in the plant-pathogen interaction pathway. SY1 possessed a higher syntenic relationship with phylotype I strains, and the arsenal of type III effectors predicted in SY1 were also more closely related to those of phylotype I strains. SY1 contained 14 and 5 genomic islands in its chromosome and megaplasmid, respectively, and two prophage sequences in its chromosome. In addition, 215 and 130 genes were annotated as carbohydrate-active enzymes and antibiotic resistance genes, respectively. CONCLUSION This is the first genome-scale assembly and annotation for R. solanacearum which isolated from infected P. cablin plants. The arsenal of virulence and antibiotic resistance may as the determinants in SY1 for infection of P. cablin plants.
Collapse
Affiliation(s)
- Yunhao Sun
- Innovative Institute for Plant Health, Zhongkai University of Agriculture and Engineering, Guangzhou, 510225, People's Republic of China
- College of Agriculture and Biology, Zhongkai University of Agriculture and Engineering, Guangzhou, China
| | - Yutong Su
- Innovative Institute for Plant Health, Zhongkai University of Agriculture and Engineering, Guangzhou, 510225, People's Republic of China
- College of Agriculture and Biology, Zhongkai University of Agriculture and Engineering, Guangzhou, China
| | - Ansar Hussain
- Innovative Institute for Plant Health, Zhongkai University of Agriculture and Engineering, Guangzhou, 510225, People's Republic of China
- College of Agriculture and Biology, Zhongkai University of Agriculture and Engineering, Guangzhou, China
| | - Lina Xiong
- School of Life Sciences, Sun Yat-Sen University, Guangzhou, China
| | - Chunji Li
- Innovative Institute for Plant Health, Zhongkai University of Agriculture and Engineering, Guangzhou, 510225, People's Republic of China
- College of Agriculture and Biology, Zhongkai University of Agriculture and Engineering, Guangzhou, China
| | - Jie Zhang
- Innovative Institute for Plant Health, Zhongkai University of Agriculture and Engineering, Guangzhou, 510225, People's Republic of China
- College of Resources and Environment, Zhongkai University of Agriculture and Engineering, Guangzhou, China
| | - Zhen Meng
- Innovative Institute for Plant Health, Zhongkai University of Agriculture and Engineering, Guangzhou, 510225, People's Republic of China
- College of Agriculture and Biology, Zhongkai University of Agriculture and Engineering, Guangzhou, China
| | - Zhangyong Dong
- Innovative Institute for Plant Health, Zhongkai University of Agriculture and Engineering, Guangzhou, 510225, People's Republic of China.
- College of Agriculture and Biology, Zhongkai University of Agriculture and Engineering, Guangzhou, China.
| | - Guohui Yu
- Innovative Institute for Plant Health, Zhongkai University of Agriculture and Engineering, Guangzhou, 510225, People's Republic of China.
- College of Agriculture and Biology, Zhongkai University of Agriculture and Engineering, Guangzhou, China.
| |
Collapse
|
10
|
Xiao L, Fan D, Qi H, Cong Y, Du Z. Defect-buffering cellular plasticity increases robustness of metazoan embryogenesis. Cell Syst 2022; 13:615-630.e9. [PMID: 35882226 DOI: 10.1016/j.cels.2022.07.001] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2021] [Revised: 04/14/2022] [Accepted: 06/30/2022] [Indexed: 01/26/2023]
Abstract
Developmental processes are intrinsically robust so as to preserve a normal-like state in response to genetic and environmental fluctuations. However, the robustness and potential phenotypic plasticity of individual developing cells under genetic perturbations remain to be systematically evaluated. Using large-scale gene perturbation, live imaging, lineage tracing, and single-cell phenomics, we quantified the phenotypic landscape of C. elegans embryogenesis in >2,000 embryos following individual knockdown of over 750 conserved genes. We observed that cellular genetic systems are not sufficiently robust to single-gene perturbations across all cells; rather, gene knockdowns frequently induced cellular defects. Dynamic phenotypic analyses revealed many cellular defects to be transient, with cells exhibiting phenotypic plasticity that serves to alleviate, correct, and accommodate the defects. Moreover, potential developmentally related cell modules may buffer the phenotypic effects of individual cell position changes. Our findings reveal non-negligible contributions of cellular plasticity and multicellularity as compensatory strategies to increase developmental robustness.
Collapse
Affiliation(s)
- Long Xiao
- State Key Laboratory of Molecular Developmental Biology, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Duchangjiang Fan
- State Key Laboratory of Molecular Developmental Biology, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Huan Qi
- State Key Laboratory of Molecular Developmental Biology, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing 100101, China
| | - Yulin Cong
- State Key Laboratory of Molecular Developmental Biology, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Zhuo Du
- State Key Laboratory of Molecular Developmental Biology, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China.
| |
Collapse
|
11
|
Haines AS, Kendrew SG, Crowhurst N, Stephens ER, Connolly J, Hothersall J, Miller CE, Collis AJ, Huckle BD, Thomas CM. High quality genome annotation and expression visualisation of a mupirocin-producing bacterium. PLoS One 2022; 17:e0268072. [PMID: 35511780 PMCID: PMC9070926 DOI: 10.1371/journal.pone.0268072] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2021] [Accepted: 04/21/2022] [Indexed: 11/19/2022] Open
Abstract
Pseudomonas strain NCIMB10586, in the P. fluorescens subgroup, produces the polyketide antibiotic mupirocin, and has potential as a host for industrial production of a range of valuable products. To underpin further studies on its genetics and physiology, we have used a combination of standard and atypical approaches to achieve a quality of the genome sequence and annotation, above current standards for automated pathways. Assembly of Illumina reads to a PacBio genome sequence created a retrospectively hybrid assembly, identifying and fixing 415 sequencing errors which would otherwise affect almost 5% of annotated coding regions. Our annotation pipeline combined automation based on related well-annotated genomes and stringent, partially manual, tests for functional features. The strain was close to P. synxantha and P. libaniensis and was found to be highly similar to a strain being developed as a weed-pest control agent in Canada. Since mupirocin is a secondary metabolite whose production is switched on late in exponential phase, we carried out RNAseq analysis over an 18 h growth period and have developed a method to normalise RNAseq samples as a group, rather than pair-wise. To review such data we have developed an easily interpreted way to present the expression profiles across a region, or the whole genome at a glance. At the 2-hour granularity of our time-course, the mupirocin cluster increases in expression as an essentially uniform bloc, although the mupirocin resistance gene stands out as being expressed at all the time points.
Collapse
Affiliation(s)
- Anthony S. Haines
- School of Biosciences, University of Birmingham, Edgbaston, Birmingham, United Kingdom
| | - Steve G. Kendrew
- Manufacturing Science and Technology, GlaxoSmithKline, Worthing, West Sussex, United Kingdom
| | - Nicola Crowhurst
- Manufacturing Science and Technology, GlaxoSmithKline, Worthing, West Sussex, United Kingdom
| | - Elton R. Stephens
- School of Biosciences, University of Birmingham, Edgbaston, Birmingham, United Kingdom
| | - Jack Connolly
- School of Biosciences, University of Birmingham, Edgbaston, Birmingham, United Kingdom
| | - Joanne Hothersall
- School of Biosciences, University of Birmingham, Edgbaston, Birmingham, United Kingdom
| | - Claire E. Miller
- School of Biosciences, University of Birmingham, Edgbaston, Birmingham, United Kingdom
| | - Andrew J. Collis
- Manufacturing Science and Technology, GlaxoSmithKline, Worthing, West Sussex, United Kingdom
| | - Benjamin D. Huckle
- Manufacturing Science and Technology, GlaxoSmithKline, Worthing, West Sussex, United Kingdom
| | - Christopher M. Thomas
- School of Biosciences, University of Birmingham, Edgbaston, Birmingham, United Kingdom
| |
Collapse
|
12
|
Ghadie MA, Xia Y. Are transient protein-protein interactions more dispensable? PLoS Comput Biol 2022; 18:e1010013. [PMID: 35404956 PMCID: PMC9000134 DOI: 10.1371/journal.pcbi.1010013] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2021] [Accepted: 03/11/2022] [Indexed: 12/12/2022] Open
Abstract
Protein-protein interactions (PPIs) are key drivers of cell function and evolution. While it is widely assumed that most permanent PPIs are important for cellular function, it remains unclear whether transient PPIs are equally important. Here, we estimate and compare dispensable content among transient PPIs and permanent PPIs in human. Starting with a human reference interactome mapped by experiments, we construct a human structural interactome by building three-dimensional structural models for PPIs, and then distinguish transient PPIs from permanent PPIs using several structural and biophysical properties. We map common mutations from healthy individuals and disease-causing mutations onto the structural interactome, and perform structure-based calculations of the probabilities for common mutations (assumed to be neutral) and disease mutations (assumed to be mildly deleterious) to disrupt transient PPIs and permanent PPIs. Using Bayes' theorem we estimate that a similarly small fraction (<~20%) of both transient and permanent PPIs are completely dispensable, i.e., effectively neutral upon disruption. Hence, transient and permanent interactions are subject to similarly strong selective constraints in the human interactome.
Collapse
Affiliation(s)
| | - Yu Xia
- Department of Bioengineering, McGill University, Montreal, Canada
| |
Collapse
|
13
|
Tan X, Dai X, Chen T, Wu Y, Yang D, Zheng Y, Chen H, Wan X, Yang Y. Complete Genome Sequence Analysis of Ralstonia solanacearum Strain PeaFJ1 Provides Insights Into Its Strong Virulence in Peanut Plants. Front Microbiol 2022; 13:830900. [PMID: 35273586 PMCID: PMC8904134 DOI: 10.3389/fmicb.2022.830900] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2021] [Accepted: 01/12/2022] [Indexed: 11/22/2022] Open
Abstract
The bacterial wilt of peanut (Arachis hypogaea L.) caused by Ralstonia solanacearum is a devastating soil-borne disease that seriously restricted the world peanut production. However, the molecular mechanism of R. solanacearum–peanut interaction remains largely unknown. We found that R. solanacearum HA4-1 and PeaFJ1 isolated from peanut plants showed different pathogenicity by inoculating more than 110 cultivated peanuts. Phylogenetic tree analysis demonstrated that HA4-1 and PeaFJ1 both belonged to phylotype I and sequevar 14M, which indicates a high degree of genomic homology between them. Genomic sequencing and comparative genomic analysis of PeaFJ1 revealed 153 strain-specific genes compared with HA4-1. The PeaFJ1 strain-specific genes consisted of diverse virulence-related genes including LysR-type transcriptional regulators, two-component system-related genes, and genes contributing to motility and adhesion. In addition, the repertoire of the type III effectors of PeaFJ1 was bioinformatically compared with that of HA4-1 to find the candidate effectors responsible for their different virulences. There are 79 effectors in the PeaFJ1 genome, only 4 of which are different effectors compared with HA4-1, including RipS4, RipBB, RipBS, and RS_T3E_Hyp6. Based on the virulence profiles of the two strains against peanuts, we speculated that RipS4 and RipBB are candidate virulence effectors in PeaFJ1 while RipBS and RS_T3E_Hyp6 are avirulence effectors in HA4-1. In general, our research greatly reduced the scope of virulence-related genes and made it easier to find out the candidates that caused the difference in pathogenicity between the two strains. These results will help to reveal the molecular mechanism of peanut–R. solanacearum interaction and develop targeted control strategies in the future.
Collapse
Affiliation(s)
- Xiaodan Tan
- Guangzhou Key Laboratory for Research and Development of Crop Germplasm Resources, Zhongkai University of Agriculture and Engineering, Guangzhou, China
| | - Xiaoqiu Dai
- Guangzhou Key Laboratory for Research and Development of Crop Germplasm Resources, Zhongkai University of Agriculture and Engineering, Guangzhou, China
| | - Ting Chen
- Guangzhou Key Laboratory for Research and Development of Crop Germplasm Resources, Zhongkai University of Agriculture and Engineering, Guangzhou, China
| | - Yushuang Wu
- Guangzhou Key Laboratory for Research and Development of Crop Germplasm Resources, Zhongkai University of Agriculture and Engineering, Guangzhou, China
| | - Dong Yang
- Guangzhou Key Laboratory for Research and Development of Crop Germplasm Resources, Zhongkai University of Agriculture and Engineering, Guangzhou, China
| | - Yixiong Zheng
- Guangzhou Key Laboratory for Research and Development of Crop Germplasm Resources, Zhongkai University of Agriculture and Engineering, Guangzhou, China
| | - Huilan Chen
- Key Laboratory of Horticultural Plant Biology (HZAU), Ministry of Education, Key Laboratory of Potato Biology and Biotechnology (HZAU), Ministry of Agriculture and Rural Affairs, Huazhong Agricultural University, Wuhan, China
| | - Xiaorong Wan
- Guangzhou Key Laboratory for Research and Development of Crop Germplasm Resources, Zhongkai University of Agriculture and Engineering, Guangzhou, China
| | - Yong Yang
- Guangzhou Key Laboratory for Research and Development of Crop Germplasm Resources, Zhongkai University of Agriculture and Engineering, Guangzhou, China
| |
Collapse
|
14
|
Zhong M, Sun Y, Zhang X, Liang H, Xiong L, Han Q. Complete genome sequence of the kiwifruit bacterial canker pathogen Pseudomonas savastanoi strain MHT1. BMC Microbiol 2022; 22:44. [PMID: 35120460 PMCID: PMC8815115 DOI: 10.1186/s12866-022-02459-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2021] [Accepted: 01/25/2022] [Indexed: 11/21/2022] Open
Abstract
Background Pseudomonas savastanoi is an important plant pathogen that infects and causes symptoms in a variety of economically important crops, causing considerable loss of yield and quality. Because there has been no research reported to date on bacterial canker of kiwifruit (Actinidia chinensis) plants caused by P. savastanoi and, in particular, no in-depth studies of the complete genome sequence or pathogenic mechanism, long-lasting and environmentally friendly control measures against this pathogen in kiwifruit are lacking. This study therefore has both theoretical value and practical significance. Results We report the complete genome sequence of P. savastanoi strain MHT1, which was first reported as the pathogen causing bacterial canker in kiwifruit plants. The genome consists of a 6.00-Mb chromosome with 58.5% GC content and 5008 predicted genes. Comparative genome analysis of four sequenced genomes of representative P. savastanoi strains revealed that 230 genes are unique to the MHT1 strain and that these genes are enriched in antibiotic metabolic processes and metabolic pathways, which may be associated with the drug resistance and host range observed in this strain. MHT1 showed high syntenic relationships with different P. savastanoi strains. Furthermore, MHT1 has eight conserved effectors that are highly homologous to effectors from P. syringae, Pseudomonas amygdali, and Ralstonia solanacearum strains. The MHT1 genome contains six genomic islands and two prophage sequences. In addition, 380 genes were annotated as antibiotic resistance genes and another 734 as encoding carbohydrate-active enzymes. Conclusion The whole-genome sequence of this kiwifruit bacterial canker pathogen extends our knowledge of the P. savastanoi genome, sets the stage for further studies of the interaction between kiwifruit and P. savastanoi, and provides an important theoretical foundation for the prevention and control of bacterial canker. Supplementary Information The online version contains supplementary material available at 10.1186/s12866-022-02459-4.
Collapse
Affiliation(s)
- Mingzhao Zhong
- Innovative Institute for Plant Health, Zhongkai University of Agriculture and Engineering, Guangzhou, China.,College of Resources and Environment, Zhongkai University of Agriculture and Engineering, Guangzhou, China
| | - Yunhao Sun
- Innovative Institute for Plant Health, Zhongkai University of Agriculture and Engineering, Guangzhou, China.,College of Agriculture and Biology, Zhongkai University of Agriculture and Engineering, Guangzhou, China
| | - Xianzhi Zhang
- College of Agriculture and Biology, Zhongkai University of Agriculture and Engineering, Guangzhou, China
| | - Hong Liang
- College of Agriculture and Biology, Zhongkai University of Agriculture and Engineering, Guangzhou, China.,Guangdong Zhongkai Technology Development Co., Ltd, Huizhou, China
| | - Lina Xiong
- School of Life Sciences, Sun Yat-sen University, Guangzhou, China
| | - Qunxin Han
- Innovative Institute for Plant Health, Zhongkai University of Agriculture and Engineering, Guangzhou, China. .,College of Agriculture and Biology, Zhongkai University of Agriculture and Engineering, Guangzhou, China. .,, Present Address: Guangzhou, People's Republic of China.
| |
Collapse
|
15
|
Complete mitogenome of the endangered and endemic Nicobar treeshrew (Tupaia nicobarica) and comparison with other Scandentians. Sci Rep 2022; 12:877. [PMID: 35042947 PMCID: PMC8766473 DOI: 10.1038/s41598-022-04907-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2021] [Accepted: 12/29/2021] [Indexed: 11/09/2022] Open
Abstract
The Nicobar treeshrew (Tupaia nicobarica) is an endangered small mammal endemic to the Nicobar Island of the Andaman Sea, India regarded as an alternative experimental animal model in biomedical research. The present study aimed to assemble the first mitochondrial genome of T. nicobarica to elucidate its phylogenetic position with respect to other Scandentians. The structure and variation of the novel mitochondrial genome were analyzed and compared with other Scandentians. The complete mitogenome (17,164 bp) encodes 37 genes, including 13 protein-coding genes (PCGs), 22 transfer RNA (tRNAs), two ribosomal RNA (rRNAs), and one control region (CR). Most of the genes were encoded on majority strand, except nad6 and eight tRNAs. The nonsynonymous/synonymous ratio in all PCGs indicates strong negative selection among all Tupaiidae species. The comparative study of CRs revealed the occurrence of tandem repeats (CGTACA) found in T. nicobarica. The phylogenetic analyses (Maximum Likelihood and Bayesian Inference) showed distinct clustering of T. nicobarica with high branch supports and depict a substantial divergence time (12–19 MYA) from the ancestor lineage of Tupaiidae. The 16S rRNA dataset corroborates the taxonomic rank of two subspecies of T. nicobarica from the Great and Little Nicobar Islands. In the future, whole nuclear genome sequencing is necessary to further improve our understanding of evolutionary relationships among treeshrews, and will have implications for biomedical research.
Collapse
|
16
|
Vedelek B, Kovács Á, Boros IM. Evolutionary mode for the functional preservation of fast-evolving Drosophila telomere capping proteins. Open Biol 2021; 11:210261. [PMID: 34784790 PMCID: PMC8596017 DOI: 10.1098/rsob.210261] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023] Open
Abstract
DNA end protection is fundamental for the long-term preservation of the genome. In vertebrates the Shelterin protein complex protects telomeric DNA ends, thereby contributing to the maintenance of genome integrity. In the Drosophila genus, this function is thought to be performed by the Terminin complex, an assembly of fast-evolving subunits. Considering that DNA end protection is fundamental for successful genome replication, the accelerated evolution of Terminin subunits is counterintuitive, as conservation is supposed to maintain the assembly and concerted function of the interacting partners. This problem extends over Drosophila telomere biology and provides insight into the evolution of protein assemblies. In order to learn more about the mechanistic details of this phenomenon we have investigated the intra- and interspecies assemblies of Verrocchio and Modigliani, two Terminin subunits using in vitro assays. Based on our results and on homology-based three-dimensional models for Ver and Moi, we conclude that both proteins contain Ob-fold and contribute to the ssDNA binding of the Terminin complex. We propose that the preservation of Ver function is achieved by conservation of specific amino acids responsible for folding or localized in interacting surfaces. We also provide here the first evidence on Moi DNA binding.
Collapse
Affiliation(s)
- Balázs Vedelek
- Department of Biochemistry and Molecular Biology, University of Szeged, Szeged, Hungary,Institute of Biochemistry, Biological Research Centre, Szeged, Hungary
| | - Ákos Kovács
- Department of Biochemistry and Molecular Biology, University of Szeged, Szeged, Hungary
| | - Imre M. Boros
- Department of Biochemistry and Molecular Biology, University of Szeged, Szeged, Hungary,Institute of Biochemistry, Biological Research Centre, Szeged, Hungary
| |
Collapse
|
17
|
Campos TL, Korhonen PK, Hofmann A, Gasser RB, Young ND. Harnessing model organism genomics to underpin the machine learning-based prediction of essential genes in eukaryotes - Biotechnological implications. Biotechnol Adv 2021; 54:107822. [PMID: 34461202 DOI: 10.1016/j.biotechadv.2021.107822] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2021] [Revised: 08/17/2021] [Accepted: 08/24/2021] [Indexed: 12/17/2022]
Abstract
The availability of high-quality genomes and advances in functional genomics have enabled large-scale studies of essential genes in model eukaryotes, including the 'elegant worm' (Caenorhabditis elegans; Nematoda) and the 'vinegar fly' (Drosophila melanogaster; Arthropoda). However, this is not the case for other, much less-studied organisms, such as socioeconomically important parasites, for which functional genomic platforms usually do not exist. Thus, there is a need to develop innovative techniques or approaches for the prediction, identification and investigation of essential genes. A key approach that could enable the prediction of such genes is machine learning (ML). Here, we undertake an historical review of experimental and computational approaches employed for the characterisation of essential genes in eukaryotes, with a particular focus on model ecdysozoans (C. elegans and D. melanogaster), and discuss the possible applicability of ML-approaches to organisms such as socioeconomically important parasites. We highlight some recent results showing that high-performance ML, combined with feature engineering, allows a reliable prediction of essential genes from extensive, publicly available 'omic data sets, with major potential to prioritise such genes (with statistical confidence) for subsequent functional genomic validation. These findings could 'open the door' to fundamental and applied research areas. Evidence of some commonality in the essential gene-complement between these two organisms indicates that an ML-engineering approach could find broader applicability to ecdysozoans such as parasitic nematodes or arthropods, provided that suitably large and informative data sets become/are available for proper feature engineering, and for the robust training and validation of algorithms. This area warrants detailed exploration to, for example, facilitate the identification and characterisation of essential molecules as novel targets for drugs and vaccines against parasitic diseases. This focus is particularly important, given the substantial impact that such diseases have worldwide, and the current challenges associated with their prevention and control and with drug resistance in parasite populations.
Collapse
Affiliation(s)
- Tulio L Campos
- Department of Veterinary Biosciences, Melbourne Veterinary School, The University of Melbourne, Parkville, Victoria 3010, Australia; Bioinformatics Core Facility, Instituto Aggeu Magalhães, Fundação Oswaldo Cruz (IAM-Fiocruz), Recife, Pernambuco, Brazil
| | - Pasi K Korhonen
- Department of Veterinary Biosciences, Melbourne Veterinary School, The University of Melbourne, Parkville, Victoria 3010, Australia
| | - Andreas Hofmann
- Department of Veterinary Biosciences, Melbourne Veterinary School, The University of Melbourne, Parkville, Victoria 3010, Australia
| | - Robin B Gasser
- Department of Veterinary Biosciences, Melbourne Veterinary School, The University of Melbourne, Parkville, Victoria 3010, Australia.
| | - Neil D Young
- Department of Veterinary Biosciences, Melbourne Veterinary School, The University of Melbourne, Parkville, Victoria 3010, Australia.
| |
Collapse
|
18
|
Comparative Genomics of Eight Fusarium graminearum Strains with Contrasting Aggressiveness Reveals an Expanded Open Pangenome and Extended Effector Content Signatures. Int J Mol Sci 2021; 22:ijms22126257. [PMID: 34200775 PMCID: PMC8230406 DOI: 10.3390/ijms22126257] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2021] [Revised: 06/02/2021] [Accepted: 06/07/2021] [Indexed: 01/25/2023] Open
Abstract
Fusarium graminearum, the primary cause of Fusarium head blight (FHB) in small-grain cereals, demonstrates remarkably variable levels of aggressiveness in its host, producing different infection dynamics and contrasted symptom severity. While the secreted proteins, including effectors, are thought to be one of the essential components of aggressiveness, our knowledge of the intra-species genomic diversity of F. graminearum is still limited. In this work, we sequenced eight European F. graminearum strains of contrasting aggressiveness to characterize their respective genome structure, their gene content and to delineate their specificities. By combining the available sequences of 12 other F. graminearum strains, we outlined a reference pangenome that expands the repertoire of the known genes in the reference PH-1 genome by 32%, including nearly 21,000 non-redundant sequences and gathering a common base of 9250 conserved core-genes. More than 1000 genes with high non-synonymous mutation rates may be under diverse selection, especially regarding the trichothecene biosynthesis gene cluster. About 900 secreted protein clusters (SPCs) have been described. Mostly localized in the fast sub-genome of F. graminearum supposed to evolve rapidly to promote adaptation and rapid responses to the host's infection, these SPCs gather a range of putative proteinaceous effectors systematically found in the core secretome, with the chloroplast and the plant nucleus as the main predicted targets in the host cell. This work describes new knowledge on the intra-species diversity in F. graminearum and emphasizes putative determinants of aggressiveness, providing a wealth of new candidate genes potentially involved in the Fusarium head blight disease.
Collapse
|
19
|
Moran J, Finlay D, Tikhonov M. Improve it or lose it: Evolvability cost of competition for expression. Phys Rev E 2021; 103:062402. [PMID: 34271680 DOI: 10.1103/physreve.103.062402] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2021] [Accepted: 05/24/2021] [Indexed: 11/07/2022]
Abstract
Expression level is known to be a strong determinant of a protein's rate of evolution. But the converse can also be true: evolutionary dynamics can affect expression levels of proteins. Having implications in both directions fosters the possibility of an "improve it or lose it" feedback loop, where higher expressed systems are more likely to improve and be expressed even higher, while those that are expressed less are eventually lost to drift. Using a minimal model to study this in the context of a changing environment, we demonstrate that one unexpected consequence of such a feedback loop is that a slow switch to a new environment can allow genotypes to reach higher fitness sooner than a direct exposure to it.
Collapse
Affiliation(s)
- Jacob Moran
- Department of Physics, Washington University in St. Louis, St. Louis, Missouri 63130, USA
| | - Devon Finlay
- Department of Physics, Washington University in St. Louis, St. Louis, Missouri 63130, USA
| | - Mikhail Tikhonov
- Department of Physics, Washington University in St. Louis, St. Louis, Missouri 63130, USA
| |
Collapse
|
20
|
Dubreuil B, Levy ED. Abundance Imparts Evolutionary Constraints of Similar Magnitude on the Buried, Surface, and Disordered Regions of Proteins. Front Mol Biosci 2021; 8:626729. [PMID: 33996892 PMCID: PMC8119896 DOI: 10.3389/fmolb.2021.626729] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2020] [Accepted: 03/29/2021] [Indexed: 12/02/2022] Open
Abstract
An understanding of the forces shaping protein conservation is key, both for the fundamental knowledge it represents and to allow for optimal use of evolutionary information in practical applications. Sequence conservation is typically examined at one of two levels. The first is a residue-level, where intra-protein differences are analyzed and the second is a protein-level, where inter-protein differences are studied. At a residue level, we know that solvent-accessibility is a prime determinant of conservation. By inverting this logic, we inferred that disordered regions are slightly more solvent-accessible on average than the most exposed surface residues in domains. By integrating abundance information with evolutionary data within and across proteins, we confirmed a previously reported strong surface-core association in the evolution of structured regions, but we found a comparatively weak association between disordered and structured regions. The facts that disordered and structured regions experience different structural constraints and evolve independently provide a unique setup to examine an outstanding question: why is a protein’s abundance the main determinant of its sequence conservation? Indeed, any structural or biophysical property linked to the abundance-conservation relationship should increase the relative conservation of regions concerned with that property (e.g., disordered residues with mis-interactions, domain residues with misfolding). Surprisingly, however, we found the conservation of disordered and structured regions to increase in equal proportion with abundance. This observation implies that either abundance-related constraints are structure-independent, or multiple constraints apply to different regions and perfectly balance each other.
Collapse
Affiliation(s)
- Benjamin Dubreuil
- Department of Structural Biology, Weizmann Institute of Science, Rehovot, Israel
| | - Emmanuel D Levy
- Department of Structural Biology, Weizmann Institute of Science, Rehovot, Israel
| |
Collapse
|
21
|
Li C, Cheng P, Zheng L, Li Y, Chen Y, Wen S, Yu G. Comparative genomics analysis of two banana Fusarium wilt biocontrol endophytes Bacillus subtilis R31 and TR21 provides insights into their differences on phytobeneficial trait. Genomics 2021; 113:900-909. [PMID: 33592313 DOI: 10.1016/j.ygeno.2021.02.006] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2020] [Revised: 11/26/2020] [Accepted: 02/08/2021] [Indexed: 01/27/2023]
Abstract
Fusarium wilt of banana is considered one of the most destructive plant diseases. Bacillus subtilis R31 and TR21, isolated from Dendrobium sp. leaves, exhibit different phytobeneficial effects on banana Fusarium wilt bio-controlling. Here, we performed genome sequencing and comparative genomics analysis of R31 and TR21 to enhance our understanding of the different phytobeneficial traits. These results revealed that the strain-specific genes of R31 involved in sporulation, quorum sensing, and antibiotic synthesis allow R31 to present a better capacity of sporulation, rhizosphere adaptation, and quorum sensing than TR21. Selective pressure analysis indicated that the glycosylase and endo-alpha-(1- > 5)-L-arabinanase genes were strong positive selected, which may contribute to the TR21 to colonize well in banana's vascular bundles. Altogether, our findings presented here should advance further agricultural application of R31 and TR21 as two promising resources of plant growth promotion and biological control via genetic engineering.
Collapse
Affiliation(s)
- Chunji Li
- Innovative Institute for Plant Health, Zhongkai University of Agriculture and Engineering, Guangzhou 510225, People's Republic of China; College of Agriculture and Biology, Zhongkai University of Agriculture and Engineering, Guangzhou 510225, People's Republic of China.
| | - Ping Cheng
- Innovative Institute for Plant Health, Zhongkai University of Agriculture and Engineering, Guangzhou 510225, People's Republic of China; College of Agriculture and Biology, Zhongkai University of Agriculture and Engineering, Guangzhou 510225, People's Republic of China.
| | - Li Zheng
- Innovative Institute for Plant Health, Zhongkai University of Agriculture and Engineering, Guangzhou 510225, People's Republic of China; College of Agriculture and Biology, Zhongkai University of Agriculture and Engineering, Guangzhou 510225, People's Republic of China
| | - Yongjian Li
- Zhuhai Modern Agriculture Development Center, Zhuhai 519075, People's Republic of China
| | - Yanhong Chen
- Zhuhai Modern Agriculture Development Center, Zhuhai 519075, People's Republic of China
| | - Shuheng Wen
- Guangdong Geolong Biotechnology Co., Ltd., Zhuhai 519050, People's Republic of China
| | - Guohui Yu
- Innovative Institute for Plant Health, Zhongkai University of Agriculture and Engineering, Guangzhou 510225, People's Republic of China; College of Agriculture and Biology, Zhongkai University of Agriculture and Engineering, Guangzhou 510225, People's Republic of China.
| |
Collapse
|
22
|
Usmanova DR, Plata G, Vitkup D. The Relationship between the Misfolding Avoidance Hypothesis and Protein Evolutionary Rates in the Light of Empirical Evidence. Genome Biol Evol 2021; 13:6081017. [PMID: 33432359 PMCID: PMC7874998 DOI: 10.1093/gbe/evab006] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/07/2021] [Indexed: 12/14/2022] Open
Abstract
For more than a decade, the misfolding avoidance hypothesis (MAH) and related theories have dominated evolutionary discussions aimed at explaining the variance of the molecular clock across cellular proteins. In this study, we use various experimental data to further investigate the consistency of the MAH predictions with empirical evidence. We also critically discuss experimental results that motivated the MAH development and that are often viewed as evidence of its major contribution to the variability of protein evolutionary rates. We demonstrate, in Escherichia coli and Homo sapiens, the lack of a substantial negative correlation between protein evolutionary rates and Gibbs free energies of unfolding, a direct measure of protein stability. We then analyze multiple new genome-scale data sets characterizing protein aggregation and interaction propensities, the properties that are likely optimized in evolution to alleviate deleterious effects associated with toxic protein misfolding and misinteractions. Our results demonstrate that the propensity of proteins to aggregate, the fraction of charged amino acids, and protein stickiness do correlate with protein abundances. Nevertheless, across multiple organisms and various data sets we do not observe substantial correlations between proteins’ aggregation- and stability-related properties and evolutionary rates. Therefore, diverse empirical data support the conclusion that the MAH and similar hypotheses do not play a major role in mediating a strong negative correlation between protein expression and the molecular clock, and thus in explaining the variability of evolutionary rates across cellular proteins.
Collapse
Affiliation(s)
- Dinara R Usmanova
- Department of Systems Biology, Columbia University, New York, NY, USA
| | - Germán Plata
- Department of Systems Biology, Columbia University, New York, NY, USA.,Elanco Animal Health, Greenfield, IN, USA
| | - Dennis Vitkup
- Department of Systems Biology, Columbia University, New York, NY, USA.,Department of Biomedical Informatics, Columbia University, New York, NY, USA
| |
Collapse
|
23
|
Cheng YH, Liu CFJ, Yu YH, Jhou YT, Fujishima M, Tsai IJ, Leu JY. Genome plasticity in Paramecium bursaria revealed by population genomics. BMC Biol 2020; 18:180. [PMID: 33250052 PMCID: PMC7702705 DOI: 10.1186/s12915-020-00912-2] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2019] [Accepted: 10/29/2020] [Indexed: 11/25/2022] Open
Abstract
Background Ciliates are an ancient and diverse eukaryotic group found in various environments. A unique feature of ciliates is their nuclear dimorphism, by which two types of nuclei, the diploid germline micronucleus (MIC) and polyploidy somatic macronucleus (MAC), are present in the same cytoplasm and serve different functions. During each sexual cycle, ciliates develop a new macronucleus in which newly fused genomes are extensively rearranged to generate functional minichromosomes. Interestingly, each ciliate species seems to have its way of processing genomes, providing a diversity of resources for studying genome plasticity and its regulation. Here, we sequenced and analyzed the macronuclear genome of different strains of Paramecium bursaria, a highly divergent species of the genus Paramecium which can stably establish endosymbioses with green algae. Results We assembled a high-quality macronuclear genome of P. bursaria and further refined genome annotation by comparing population genomic data. We identified several species-specific expansions in protein families and gene lineages that are potentially associated with endosymbiosis. Moreover, we observed an intensive chromosome breakage pattern that occurred during or shortly after sexual reproduction and contributed to highly variable gene dosage throughout the genome. However, patterns of copy number variation were highly correlated among genetically divergent strains, suggesting that copy number is adjusted by some regulatory mechanisms or natural selection. Further analysis showed that genes with low copy number variation among populations tended to function in basic cellular pathways, whereas highly variable genes were enriched in environmental response pathways. Conclusions We report programmed DNA rearrangements in the P. bursaria macronuclear genome that allow cells to adjust gene copy number globally according to individual gene functions. Our results suggest that large-scale gene copy number variation may represent an ancient mechanism for cells to adapt to different environments. Supplementary information The online version contains supplementary material available at 10.1186/s12915-020-00912-2.
Collapse
Affiliation(s)
- Yu-Hsuan Cheng
- Genome and Systems Biology Degree Program, Academia Sinica and National Taiwan University, Taipei, 106, Taiwan.,Institute of Molecular Biology, Academia Sinica, 128 Sec. 2, Academia Road, Nankang, Taipei, 115, Taiwan
| | - Chien-Fu Jeff Liu
- Institute of Molecular Biology, Academia Sinica, 128 Sec. 2, Academia Road, Nankang, Taipei, 115, Taiwan
| | - Yen-Hsin Yu
- Institute of Molecular Biology, Academia Sinica, 128 Sec. 2, Academia Road, Nankang, Taipei, 115, Taiwan
| | - Yu-Ting Jhou
- Institute of Molecular Biology, Academia Sinica, 128 Sec. 2, Academia Road, Nankang, Taipei, 115, Taiwan
| | - Masahiro Fujishima
- Graduate School of Sciences and Technology for Innovation, Yamaguchi University, Yamaguchi, 753-8512, Japan
| | - Isheng Jason Tsai
- Genome and Systems Biology Degree Program, Academia Sinica and National Taiwan University, Taipei, 106, Taiwan.,Biodiversity Research Center, Academia Sinica, Taipei, 115, Taiwan
| | - Jun-Yi Leu
- Genome and Systems Biology Degree Program, Academia Sinica and National Taiwan University, Taipei, 106, Taiwan. .,Institute of Molecular Biology, Academia Sinica, 128 Sec. 2, Academia Road, Nankang, Taipei, 115, Taiwan.
| |
Collapse
|
24
|
Evolutionary History of Alzheimer Disease-Causing Protein Family Presenilins with Pathological Implications. J Mol Evol 2020; 88:674-688. [DOI: 10.1007/s00239-020-09966-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2019] [Accepted: 09/22/2020] [Indexed: 12/14/2022]
|
25
|
Jasti N, Sebagh D, Riaz M, Wang X, Koripella B, Palanisamy V, Mohammad N, Chen Q, Friedrich M. Towards reconstructing the dipteran demise of an ancient essential gene: E3 ubiquitin ligase Murine double minute. Dev Genes Evol 2020; 230:279-294. [PMID: 32623522 DOI: 10.1007/s00427-020-00663-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2020] [Accepted: 06/21/2020] [Indexed: 01/09/2023]
Abstract
Genome studies have uncovered many examples of essential gene loss, raising the question of how ancient genes transition from essentiality to dispensability. We explored this process for the deeply conserved E3 ubiquitin ligase Murine double minute (Mdm), which is lacking in Drosophila despite the conservation of its main regulatory target, the cellular stress response gene p53. Conducting gene expression and knockdown experiments in the red flour beetle Tribolium castaneum, we found evidence that Mdm has remained essential in insects where it is present. Using bioinformatics approaches, we confirm the absence of the Mdm gene family in Drosophila, mapping its loss to the stem lineage of schizophoran Diptera and Pipunculidae (big-headed flies), about 95-85 million years ago. Intriguingly, this gene loss event was preceded by the de novo origin of the gene Companion of reaper (Corp), a novel p53 regulatory factor that is characterized by functional similarities to vertebrate Mdm2 despite lacking E3 ubiquitin ligase protein domains. Speaking against a 1:1 compensatory gene gain/loss scenario, however, we found that hoverflies (Syrphidae) and pointed-wing flies (Lonchopteridae) possess both Mdm and Corp. This implies that the two p53 regulators have been coexisting for ~ 150 million years in select dipteran clades and for at least 50 million years in the lineage to Schizophora and Pipunculidae. Given these extensive time spans of Mdm/Corp coexistence, we speculate that the loss of Mdm in the lineage to Drosophila involved further acquisitions of compensatory gene activities besides the emergence of Corp. Combined with the previously noted reduction of an ancestral P53 contact domain in the Mdm homologs of crustaceans and insects, we conclude that the loss of the ancient Mdm gene family in flies was the outcome of incremental functional regression over long macroevolutionary time scales.
Collapse
Affiliation(s)
- Naveen Jasti
- Department of Biological Sciences, Wayne State University, 5047 Gullen Mall, Detroit, MI, 48202, USA.,Institute for Protein Design, Washington University, 1959 NE Pacific Street, Seattle, WA, 98195, USA
| | - Dylan Sebagh
- Department of Biological Sciences, Wayne State University, 5047 Gullen Mall, Detroit, MI, 48202, USA
| | - Mohammed Riaz
- Department of Biological Sciences, Wayne State University, 5047 Gullen Mall, Detroit, MI, 48202, USA
| | - Xin Wang
- Department of Biological Sciences, Wayne State University, 5047 Gullen Mall, Detroit, MI, 48202, USA
| | - Bharat Koripella
- Department of Biological Sciences, Wayne State University, 5047 Gullen Mall, Detroit, MI, 48202, USA
| | - Vasanth Palanisamy
- Department of Biological Sciences, Wayne State University, 5047 Gullen Mall, Detroit, MI, 48202, USA
| | - Nabeel Mohammad
- Department of Biological Sciences, Wayne State University, 5047 Gullen Mall, Detroit, MI, 48202, USA
| | - Qing Chen
- Department of Biological Sciences, Wayne State University, 5047 Gullen Mall, Detroit, MI, 48202, USA
| | - Markus Friedrich
- Department of Biological Sciences, Wayne State University, 5047 Gullen Mall, Detroit, MI, 48202, USA. .,Department of Anatomy and Cell Biology, Wayne State University, School of Medicine, 540 East Canfield Avenue, Detroit, MI, 48201, USA.
| |
Collapse
|
26
|
Singh A, Lakhanpaul S. Detection, characterization and evolutionary aspects of S54LP of SP (SAP54 Like Protein of Sesame Phyllody): a phytoplasma effector molecule associated with phyllody development in sesame ( Sesamum indicum L.). PHYSIOLOGY AND MOLECULAR BIOLOGY OF PLANTS : AN INTERNATIONAL JOURNAL OF FUNCTIONAL PLANT BIOLOGY 2020; 26:445-458. [PMID: 32205922 PMCID: PMC7078397 DOI: 10.1007/s12298-020-00764-8] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/29/2019] [Revised: 12/24/2019] [Accepted: 01/14/2020] [Indexed: 05/05/2023]
Abstract
SAP54, an effector protein secreted by phytoplasmas has been reported to induce phyllody. S54LP of SP (SAP54 Like Protein of Sesame Phyllody), a SAP54 ortholog from phyllody and witches' broom affected sesame (Sesamum indicum L.) was amplified, cloned and sequenced. Comparative sequence and phylogenetic analysis of diverse phytoplasma strains was carried out to delineate the evolution of S54LP of SP. The degree of polymorphism across SAP54 orthologs and the evolutionary forces acting on this effector protein were ascertained. Site-specific selection across SAP54 orthologs was estimated using Fixed Effects Likelihood (FEL) approach. Nonsynonymous substitutions were detected in the SAP54 orthologs' sequences from phytoplasmas belonging to same (sub) group. Phylogenetic analysis based on S54LP of SP grouped phytoplasmas belonging to same 16SrDNA (sub) groups into different clusters. Analysis of selection forces acting on SAP54 orthologs from nine different phytoplasma (sub)groups, affecting plant species belonging to twelve different families across ten countries showed the orthologs to be under purifying (negative) selection. One amino acid residue was found to be under pervasive diversifying (positive) selection and a total of three amino acid sites were found to be under pervasive purifying (negative) selection. The location of these amino acids in the signal peptide and mature protein was studied with an aim to understand their role in protein-protein interaction. Asparagine residues (at positions 68 and 84) were found to be under pervasive purifying selection suggesting their functional importance in the effector protein. Our study suggests lack of coevolution between SAP54 and 16SrDNA. Signal peptide appears to evolve at a rate slightly higher than the mature protein. Overall, SAP54 and its orthologs are evolving under purifying selection confirming their functional importance in phytoplasma virulence.
Collapse
Affiliation(s)
- Amrita Singh
- Department of Botany, University of Delhi, Delhi, 110007 India
| | | |
Collapse
|
27
|
Li CJ, Zhao D, Li BX, Zhang N, Yan JY, Zou HT. Whole genome sequencing and comparative genomic analysis of oleaginous red yeast Sporobolomyces pararoseus NGR identifies candidate genes for biotechnological potential and ballistospores-shooting. BMC Genomics 2020; 21:181. [PMID: 32093624 PMCID: PMC7041287 DOI: 10.1186/s12864-020-6593-1] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2019] [Accepted: 02/19/2020] [Indexed: 11/12/2022] Open
Abstract
BACKGROUND Sporobolomyces pararoseus is regarded as an oleaginous red yeast, which synthesizes numerous valuable compounds with wide industrial usages. This species hold biotechnological interests in biodiesel, food and cosmetics industries. Moreover, the ballistospores-shooting promotes the colonizing of S. pararoseus in most terrestrial and marine ecosystems. However, very little is known about the basic genomic features of S. pararoseus. To assess the biotechnological potential and ballistospores-shooting mechanism of S. pararoseus on genome-scale, the whole genome sequencing was performed by next-generation sequencing technology. RESULTS Here, we used Illumina Hiseq platform to firstly assemble S. pararoseus genome into 20.9 Mb containing 54 scaffolds and 5963 predicted genes with a N50 length of 2,038,020 bp and GC content of 47.59%. Genome completeness (BUSCO alignment: 95.4%) and RNA-seq analysis (expressed genes: 98.68%) indicated the high-quality features of the current genome. Through the annotation information of the genome, we screened many key genes involved in carotenoids, lipids, carbohydrate metabolism and signal transduction pathways. A phylogenetic assessment suggested that the evolutionary trajectory of the order Sporidiobolales species was evolved from genus Sporobolomyces to Rhodotorula through the mediator Rhodosporidiobolus. Compared to the lacking ballistospores Rhodotorula toruloides and Saccharomyces cerevisiae, we found genes enriched for spore germination and sugar metabolism. These genes might be responsible for the ballistospores-shooting in S. pararoseus NGR. CONCLUSION These results greatly advance our understanding of S. pararoseus NGR in biotechnological potential and ballistospores-shooting, which help further research of genetic manipulation, metabolic engineering as well as its evolutionary direction.
Collapse
Affiliation(s)
- Chun-Ji Li
- College of Land and Environment, Shenyang Agricultural University, Shenyang, 110866, People's Republic of China
- College of Agriculture and Biology, Zhongkai University of Agriculture and Engineering, Guangzhou, 510225, People's Republic of China
| | - Die Zhao
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, 430070, People's Republic of China
| | - Bing-Xue Li
- College of Land and Environment, Shenyang Agricultural University, Shenyang, 110866, People's Republic of China.
| | - Ning Zhang
- College of Biological Science and Technology, Shenyang Agricultural University, Shenyang, 110866, People's Republic of China
| | - Jian-Yu Yan
- College of Land and Environment, Shenyang Agricultural University, Shenyang, 110866, People's Republic of China
| | - Hong-Tao Zou
- College of Land and Environment, Shenyang Agricultural University, Shenyang, 110866, People's Republic of China
| |
Collapse
|
28
|
Fumasoni M, Murray AW. The evolutionary plasticity of chromosome metabolism allows adaptation to constitutive DNA replication stress. eLife 2020; 9:e51963. [PMID: 32043971 PMCID: PMC7069727 DOI: 10.7554/elife.51963] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2019] [Accepted: 02/11/2020] [Indexed: 01/22/2023] Open
Abstract
Many biological features are conserved and thus considered to be resistant to evolutionary change. While rapid genetic adaptation following the removal of conserved genes has been observed, we often lack a mechanistic understanding of how adaptation happens. We used the budding yeast, Saccharomyces cerevisiae, to investigate the evolutionary plasticity of chromosome metabolism, a network of evolutionary conserved modules. We experimentally evolved cells constitutively experiencing DNA replication stress caused by the absence of Ctf4, a protein that coordinates the enzymatic activities at replication forks. Parallel populations adapted to replication stress, over 1000 generations, by acquiring multiple, concerted mutations. These mutations altered conserved features of two chromosome metabolism modules, DNA replication and sister chromatid cohesion, and inactivated a third, the DNA damage checkpoint. The selected mutations define a functionally reproducible evolutionary trajectory. We suggest that the evolutionary plasticity of chromosome metabolism has implications for genome evolution in natural populations and cancer.
Collapse
Affiliation(s)
- Marco Fumasoni
- Department of Molecular and Cellular Biology, Harvard UniversityCambridgeUnited States
| | - Andrew W Murray
- Department of Molecular and Cellular Biology, Harvard UniversityCambridgeUnited States
| |
Collapse
|
29
|
Rutter MT, Murren CJ, Callahan HS, Bisner AM, Leebens-Mack J, Wolyniak MJ, Strand AE. Distributed phenomics with the unPAK project reveals the effects of mutations. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2019; 100:199-211. [PMID: 31155775 DOI: 10.1111/tpj.14427] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/14/2019] [Revised: 05/01/2019] [Accepted: 05/10/2019] [Indexed: 06/09/2023]
Abstract
Determining how genes are associated with traits in plants and other organisms is a major challenge in modern biology. The unPAK project - undergraduates phenotyping Arabidopsis knockouts - has generated phenotype data for thousands of non-lethal insertion mutation lines within a single Arabidopsis thaliana genomic background. The focal phenotypes examined by unPAK are complex macroscopic fitness-related traits, which have ecological, evolutionary and agricultural importance. These phenotypes are placed in the context of the wild-type and also natural accessions (phytometers), and standardized for environmental differences between assays. Data from the unPAK project are used to describe broad patterns in the phenotypic consequences of insertion mutation, and to identify individual mutant lines with distinct phenotypes as candidates for further study. Inclusion of undergraduate researchers is at the core of unPAK activities, and an important broader impact of the project is providing students an opportunity to obtain research experience.
Collapse
Affiliation(s)
- Matthew T Rutter
- Department of Biology, College of Charleston, 66 George Street, Charleston, SC, 29424, USA
| | - Courtney J Murren
- Department of Biology, College of Charleston, 66 George Street, Charleston, SC, 29424, USA
| | - Hilary S Callahan
- Department of Biology, Barnard College, 3009 Broadway, New York, NY, 10027, USA
| | - April M Bisner
- Department of Biology, College of Charleston, 66 George Street, Charleston, SC, 29424, USA
| | - Jim Leebens-Mack
- Department of Plant Biology, University of Georgia, 120 Carlton St, Athens, GA, 30602, USA
| | | | - Allan E Strand
- Department of Biology, College of Charleston, 66 George Street, Charleston, SC, 29424, USA
| |
Collapse
|
30
|
Kundu S, Kumar V, Tyagi K, Chakraborty R, Chandra K. The first complete mitochondrial genome of the Indian Tent Turtle, Pangshura tentoria (Testudines: Geoemydidae): Characterization and comparative analysis. Ecol Evol 2019; 9:10854-10868. [PMID: 31624586 PMCID: PMC6787814 DOI: 10.1002/ece3.5606] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2019] [Revised: 06/24/2019] [Accepted: 08/07/2019] [Indexed: 11/09/2022] Open
Abstract
The characterization of a complete mitogenome is widely used in genomics studies for systematics and evolutionary research. However, the sequences and structural motifs contained within the mitogenome of Testudines taxa have rarely been examined. The present study decodes the first complete mitochondrial genome of the Indian Tent Turtle, Pangshura tentoria (16,657 bp) by using next-generation sequencing. This denovo assembly encodes 37 genes: 13 protein-coding genes (PCGs), 22 transfer RNA (tRNAs), two ribosomal RNA, and one control region (CR). Most of the genes were encoded on majority strand, except for one PCG (NADH dehydrogenase subunit 6) and eight tRNAs. Most of the PCGs were started with an ATG initiation codon, except for Cytochrome oxidase subunit 1 with "GTG" and NADH dehydrogenase subunit 5 with "ATA." The termination codons, "TAA" and "AGA" were observed in two subunits of NADH dehydrogenase gene. The relative synonymous codon usage analysis revealed the maximum abundance of alanine, isoleucine, leucine, and threonine. The nonsynonymous/synonymous ratios were <1 in all PCGs, which indicates strong negative selection among all Geoemydid species. The study also found the typical cloverleaf secondary structure in most of the tRNA genes, except for serine with the lack of the conventional DHU arm. The comparative study of Geoemydid mitogenomes revealed the occurrence of tandem repeats was frequent in the 3' end of CR. Further, two copies of a unique tandem repeat "TTCTCTTT" were identified in P. tentoria. The Bayesian and maximum-likelihood phylogenetic trees using concatenation of 13 PCGs revealed the close relationships of P. tentoria with Batagur trivittata in the studied dataset. All the Geoemydid species showed distinct clustering with high bootstrap support congruent with previous evolutionary hypotheses. We suggest that the generations of more mitogenomes of Geoemydid species are required, to improve our understanding of their in-depth phylogenetic and evolutionary relationships.
Collapse
Affiliation(s)
- Shantanu Kundu
- Centre for DNA TaxonomyMolecular Systematics DivisionZoological Survey of IndiaKolkataIndia
| | - Vikas Kumar
- Centre for DNA TaxonomyMolecular Systematics DivisionZoological Survey of IndiaKolkataIndia
| | - Kaomud Tyagi
- Centre for DNA TaxonomyMolecular Systematics DivisionZoological Survey of IndiaKolkataIndia
| | - Rajasree Chakraborty
- Centre for DNA TaxonomyMolecular Systematics DivisionZoological Survey of IndiaKolkataIndia
| | - Kailash Chandra
- Centre for DNA TaxonomyMolecular Systematics DivisionZoological Survey of IndiaKolkataIndia
| |
Collapse
|
31
|
Jain A, Perisa D, Fliedner F, von Haeseler A, Ebersberger I. The Evolutionary Traceability of a Protein. Genome Biol Evol 2019; 11:531-545. [PMID: 30649284 PMCID: PMC6394115 DOI: 10.1093/gbe/evz008] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/11/2019] [Indexed: 12/12/2022] Open
Abstract
Orthologs document the evolution of genes and metabolic capacities encoded in extant and ancient genomes. However, the similarity between orthologs decays with time, and ultimately it becomes insufficient to infer common ancestry. This leaves ancient gene set reconstructions incomplete and distorted to an unknown extent. Here we introduce the “evolutionary traceability” as a measure that quantifies, for each protein, the evolutionary distance beyond which the sensitivity of the ortholog search becomes limiting. Using yeast, we show that genes that were thought to date back to the last universal common ancestor are of high traceability. Their functions mostly involve catalysis, ion transport, and ribonucleoprotein complex assembly. In turn, the fraction of yeast genes whose traceability is not sufficient to infer their presence in last universal common ancestor is enriched for regulatory functions. Computing the traceabilities of genes that have been experimentally characterized as being essential for a self-replicating cell reveals that many of the genes that lack orthologs outside bacteria have low traceability. This leaves open whether their orthologs in the eukaryotic and archaeal domains have been overlooked. Looking at the example of REC8, a protein essential for chromosome cohesion, we demonstrate how a traceability-informed adjustment of the search sensitivity identifies hitherto missed orthologs in the fast-evolving microsporidia. Taken together, the evolutionary traceability helps to differentiate between true absence and nondetection of orthologs, and thus improves our understanding about the evolutionary conservation of functional protein networks. “protTrace,” a software tool for computing evolutionary traceability, is freely available at https://github.com/BIONF/protTrace.git; last accessed February 10, 2019.
Collapse
Affiliation(s)
- Arpit Jain
- Applied Bioinformatics Group, Institute of Cell Biology & Neuroscience, Goethe University, Frankfurt, Germany
| | - Dominik Perisa
- Applied Bioinformatics Group, Institute of Cell Biology & Neuroscience, Goethe University, Frankfurt, Germany
| | - Fabian Fliedner
- Applied Bioinformatics Group, Institute of Cell Biology & Neuroscience, Goethe University, Frankfurt, Germany
| | - Arndt von Haeseler
- Center for Integrative Bioinformatics Vienna, Max F. Perutz Laboratories, University of Vienna, Medical University Vienna, Austria.,Bioinformatics and Computational Biology, Faculty of Computer Science, University of Vienna, Austria
| | - Ingo Ebersberger
- Applied Bioinformatics Group, Institute of Cell Biology & Neuroscience, Goethe University, Frankfurt, Germany.,Senckenberg Biodiversity and Climate Research Center (BiK-F), Frankfurt, Germany.,LOEWE Centre for Translational Biodiversity Genomics (LOEWE-TBG), Frankfurt, Germany
| |
Collapse
|
32
|
Lipinska AP, Serrano-Serrano ML, Cormier A, Peters AF, Kogame K, Cock JM, Coelho SM. Rapid turnover of life-cycle-related genes in the brown algae. Genome Biol 2019; 20:35. [PMID: 30764885 PMCID: PMC6374913 DOI: 10.1186/s13059-019-1630-6] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2018] [Accepted: 01/16/2019] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Sexual life cycles in eukaryotes involve a cyclic alternation between haploid and diploid phases. While most animals possess a diploid life cycle, many plants and algae alternate between multicellular haploid (gametophyte) and diploid (sporophyte) generations. In many algae, gametophytes and sporophytes are independent and free-living and may present dramatic phenotypic differences. The same shared genome can therefore be subject to different, even conflicting, selection pressures during each of the life cycle generations. Here, we analyze the nature and extent of genome-wide, generation-biased gene expression in four species of brown algae with contrasting levels of dimorphism between life cycle generations. RESULTS We show that the proportion of the transcriptome that is generation-specific is broadly associated with the level of phenotypic dimorphism between the life cycle stages. Importantly, our data reveals a remarkably high turnover rate for life-cycle-related gene sets across the brown algae and highlights the importance not only of co-option of regulatory programs from one generation to the other but also of a role for newly emerged, lineage-specific gene expression patterns in the evolution of the gametophyte and sporophyte developmental programs in this major eukaryotic group. Moreover, we show that generation-biased genes display distinct evolutionary modes, with gametophyte-biased genes evolving rapidly at the coding sequence level whereas sporophyte-biased genes tend to exhibit changes in their patterns of expression. CONCLUSION Our analysis uncovers the characteristics, expression patterns, and evolution of generation-biased genes and underlines the selective forces that shape this previously underappreciated source of phenotypic diversity.
Collapse
Affiliation(s)
- Agnieszka P Lipinska
- Sorbonne Université, UPMC Univ Paris 06, CNRS, Algal Genetics Group, Integrative Biology of Marine Models, Station Biologique de Roscoff, CS 90074, F-29688, Roscoff, France
| | | | - Alexandre Cormier
- Laboratoire Ecologie et Biologie des Interactions, Equipe Ecologie Evolution Symbiose, Université de Poitiers, UMR CNRS 7267, Poitiers, France
| | | | - Kazuhiro Kogame
- Department of Biological Sciences, Faculty of Sciences, Hokkaido University, Sapporo, 060-0810, Japan
| | - J Mark Cock
- Sorbonne Université, UPMC Univ Paris 06, CNRS, Algal Genetics Group, Integrative Biology of Marine Models, Station Biologique de Roscoff, CS 90074, F-29688, Roscoff, France
| | - Susana M Coelho
- Sorbonne Université, UPMC Univ Paris 06, CNRS, Algal Genetics Group, Integrative Biology of Marine Models, Station Biologique de Roscoff, CS 90074, F-29688, Roscoff, France.
| |
Collapse
|
33
|
Kundu S, Kumar V, Tyagi K, Chakraborty R, Singha D, Rahaman I, Pakrashi A, Chandra K. Complete mitochondrial genome of Black Soft-shell Turtle (Nilssonia nigricans) and comparative analysis with other Trionychidae. Sci Rep 2018; 8:17378. [PMID: 30478342 PMCID: PMC6255766 DOI: 10.1038/s41598-018-35822-5] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2018] [Accepted: 11/09/2018] [Indexed: 11/08/2022] Open
Abstract
The characterization of mitochondrial genome has been evidenced as an efficient field of study for phylogenetic and evolutionary analysis in vertebrates including turtles. The aim of this study was to distinguish the structure and variability of the Trionychidae species mitogenomes through comparative analysis. The complete mitogenome (16796 bp) of an endangered freshwater turtle, Nilssonia nigricans was sequenced and annotated. The mitogenome encoded for 37 genes and a major non-coding control region (CR). The mitogenome was A + T biased (62.16%) and included six overlapping and 19 intergenic spacer regions. The Relative synonymous codon usage (RSCU) value was consistent among all the Trionychidae species; with the exception of significant reduction of Serine (TCG) frequency in N. nigricans, N. formosa, and R. swinhoei. In N. nigricans, most of the transfer RNAs (tRNAs) were folded into classic clover-leaf secondary structures with Watson-Crick base pairing except for trnS1 (GCT). The comparative analysis revealed that most of the tRNAs were structurally different, except for trnE (TTC), trnQ (TTG), and trnM (CAT). The structural features of tRNAs resulted ≥ 10 mismatched or wobble base pairings in 12 tRNAs, which reflects the nucleotide composition in both H- and L-strands. The mitogenome of N. nigricans also revealed two unique tandem repeats (ATTAT)8, and (TATTA)20 in the CR. Further, the conserved motif 5'-GACATA-3' and stable stem-loop structure was detected in the CRs of all Trionychidae species, which play an significant role in regulating transcription and replication in the mitochondrial genome. Further, the comparative analysis of Ka/Ks indicated negative selection in most of the protein coding genes (PCGs). The constructed Maximum Likelihood (ML) phylogeny using all PCGs showed clustering of N. nigricans with N. formosa. The resulting phylogeny illustrated the similar topology as described previously and consistent with the taxonomic classification. However, more sampling from different taxonomic groups of Testudines and studies on their mitogenomics are desirable for better understanding of the phylogenetic and evolutionary relationships.
Collapse
Affiliation(s)
- Shantanu Kundu
- Centre for DNA Taxonomy, Molecular Systematics Division, Zoological Survey of India, M-Block, New Alipore, Kolkata, 700 053, West Bengal, India
| | - Vikas Kumar
- Centre for DNA Taxonomy, Molecular Systematics Division, Zoological Survey of India, M-Block, New Alipore, Kolkata, 700 053, West Bengal, India.
| | - Kaomud Tyagi
- Centre for DNA Taxonomy, Molecular Systematics Division, Zoological Survey of India, M-Block, New Alipore, Kolkata, 700 053, West Bengal, India
| | - Rajasree Chakraborty
- Centre for DNA Taxonomy, Molecular Systematics Division, Zoological Survey of India, M-Block, New Alipore, Kolkata, 700 053, West Bengal, India
| | - Devkant Singha
- Centre for DNA Taxonomy, Molecular Systematics Division, Zoological Survey of India, M-Block, New Alipore, Kolkata, 700 053, West Bengal, India
| | - Iftikar Rahaman
- Centre for DNA Taxonomy, Molecular Systematics Division, Zoological Survey of India, M-Block, New Alipore, Kolkata, 700 053, West Bengal, India
| | - Avas Pakrashi
- Centre for DNA Taxonomy, Molecular Systematics Division, Zoological Survey of India, M-Block, New Alipore, Kolkata, 700 053, West Bengal, India
| | - Kailash Chandra
- Centre for DNA Taxonomy, Molecular Systematics Division, Zoological Survey of India, M-Block, New Alipore, Kolkata, 700 053, West Bengal, India
| |
Collapse
|
34
|
Duan C, Huan Q, Chen X, Wu S, Carey LB, He X, Qian W. Reduced intrinsic DNA curvature leads to increased mutation rate. Genome Biol 2018; 19:132. [PMID: 30217230 PMCID: PMC6138893 DOI: 10.1186/s13059-018-1525-y] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2018] [Accepted: 09/05/2018] [Indexed: 01/24/2023] Open
Abstract
BACKGROUND Mutation rates vary across the genome. Many trans factors that influence mutation rates have been identified, as have specific sequence motifs at the 1-7-bp scale, but cis elements remain poorly characterized. The lack of understanding regarding why different sequences have different mutation rates hampers our ability to identify positive selection in evolution and to identify driver mutations in tumorigenesis. RESULTS Here, we use a combination of synthetic genes and sequences of thousands of isolated yeast colonies to show that intrinsic DNA curvature is a major cis determinant of mutation rate. Mutation rate negatively correlates with DNA curvature within genes, and a 10% decrease in curvature results in a 70% increase in mutation rate. Consistently, both yeast and humans accumulate mutations in regions with small curvature. We further show that this effect is due to differences in the intrinsic mutation rate, likely due to differences in mutagen sensitivity and not due to differences in the local activity of DNA repair. CONCLUSIONS Our study establishes a framework for understanding the cis properties of DNA sequence in modulating the local mutation rate and identifies a novel causal source of non-uniform mutation rates across the genome.
Collapse
Affiliation(s)
- Chaorui Duan
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, 100101, China.,Key Laboratory of Genetic Network Biology, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, 100101, China.,University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Qing Huan
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, 100101, China.,Key Laboratory of Genetic Network Biology, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, 100101, China
| | - Xiaoshu Chen
- Human Genome Research Institute and Department of Medical Genetics, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, 510080, China
| | - Shaohuan Wu
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, 100101, China.,Key Laboratory of Genetic Network Biology, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, 100101, China.,University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Lucas B Carey
- Department of Experimental and Health Sciences, Universitat Pompeu Fabra, 08003, Barcelona, Spain
| | - Xionglei He
- State Key Laboratory of Biocontrol, School of Life Sciences, Sun Yat-sen University, Guangzhou, 510275, China
| | - Wenfeng Qian
- State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, 100101, China. .,Key Laboratory of Genetic Network Biology, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, 100101, China. .,University of Chinese Academy of Sciences, Beijing, 100049, China.
| |
Collapse
|
35
|
VanderSluis B, Costanzo M, Billmann M, Ward HN, Myers CL, Andrews BJ, Boone C. Integrating genetic and protein-protein interaction networks maps a functional wiring diagram of a cell. Curr Opin Microbiol 2018; 45:170-179. [PMID: 30059827 DOI: 10.1016/j.mib.2018.06.004] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2018] [Revised: 06/26/2018] [Accepted: 06/27/2018] [Indexed: 01/01/2023]
Abstract
Systematic experimental approaches have led to construction of comprehensive genetic and protein-protein interaction networks for the budding yeast, Saccharomyces cerevisiae. Genetic interactions capture functional relationships between genes using phenotypic readouts, while protein-protein interactions identify physical connections between gene products. These complementary, and largely non-overlapping, networks provide a global view of the functional architecture of a cell, revealing general organizing principles, many of which appear to be evolutionarily conserved. Here, we focus on insights derived from the integration of large-scale genetic and protein-protein interaction networks, highlighting principles that apply to both unicellular and more complex systems, including human cells. Network integration reveals fundamental connections involving key functional modules of eukaryotic cells, defining a core network of cellular function, which could be elaborated to explore cell-type specificity in metazoans.
Collapse
Affiliation(s)
- Benjamin VanderSluis
- Department of Computer Science and Engineering, University of Minnesota-Twin Cities, 200 Union Street, Minneapolis, MN 55455, USA
| | - Michael Costanzo
- The Donnelly Centre, University of Toronto, 160 College Street, Toronto, Ontario M5S 3E1, Canada
| | - Maximilian Billmann
- Department of Computer Science and Engineering, University of Minnesota-Twin Cities, 200 Union Street, Minneapolis, MN 55455, USA
| | - Henry N Ward
- Department of Computer Science and Engineering, University of Minnesota-Twin Cities, 200 Union Street, Minneapolis, MN 55455, USA
| | - Chad L Myers
- Department of Computer Science and Engineering, University of Minnesota-Twin Cities, 200 Union Street, Minneapolis, MN 55455, USA.
| | - Brenda J Andrews
- The Donnelly Centre, University of Toronto, 160 College Street, Toronto, Ontario M5S 3E1, Canada; Department of Molecular Genetics, University of Toronto, 160 College Street, Toronto, Ontario M5S 3E1, Canada.
| | - Charles Boone
- The Donnelly Centre, University of Toronto, 160 College Street, Toronto, Ontario M5S 3E1, Canada; Department of Molecular Genetics, University of Toronto, 160 College Street, Toronto, Ontario M5S 3E1, Canada; RIKEN Center for Sustainable Resource Science, Wako, Saitama, Japan
| |
Collapse
|
36
|
Subramanian S. Effect of genetic drift on determinants of protein evolution. Biol Lett 2018; 14:rsbl.2018.0167. [PMID: 30021860 DOI: 10.1098/rsbl.2018.0167] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2018] [Accepted: 06/20/2018] [Indexed: 11/12/2022] Open
Abstract
A number of previous studies reported that gene expression, tissue specificity, gene essentiality and the number of protein-protein interactions influence the rate of protein evolution. Here we investigated the influence of effective population size (Ne) on these determinants of protein evolution. For this purpose, we compared the ratio of non-synonymous-to-synonymous diversities (πN/πS) estimated for protein-coding genes of Mus musculus castaneus and Mus musculus musculus: populations with high and low Ne respectively. Our results revealed that the difference between πN/πS estimated for genes with high and low expression levels was significantly smaller for M. m. musculus compared to that observed for M. m. castaneus The difference between the πN/πS of broadly expressed and tissue specific genes was much higher for M. m. castaneus compared to that of M. m. musculus. Furthermore, the difference between the πN/πS computed for essential and non-essential genes was much smaller for M. m. musculus than M. m. castaneus A similar pattern was observed for genes involved in many protein-protein interactions versus those involved in one. These results suggest that the effects of the determinants on protein evolution were much reduced for the population with small Ne due to increased genetic drift.
Collapse
Affiliation(s)
- Sankar Subramanian
- GeneCology Research Centre, The University of the Sunshine Coast, 90 Sippy Downs Drive, Sippy Downs, Queensland 4556, Australia
| |
Collapse
|
37
|
Diepeveen ET, Gehrmann T, Pourquié V, Abeel T, Laan L. Patterns of Conservation and Diversification in the Fungal Polarization Network. Genome Biol Evol 2018; 10:1765-1782. [PMID: 29931311 PMCID: PMC6054225 DOI: 10.1093/gbe/evy121] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/18/2018] [Indexed: 12/12/2022] Open
Abstract
The combined actions of proteins in networks underlie all fundamental cellular functions. Deeper insights into the dynamics of network composition across species and their functional consequences are crucial to fully understand protein network evolution. Large-scale comparative studies with high phylogenetic resolution are now feasible through the recent rise in available genomic data sets of both model and nonmodel species. Here, we focus on the polarity network, which is universally essential for cell proliferation and studied in great detail in the model organism, Saccharomyces cerevisiae. We examine 42 proteins, directly related to cell polarization, across 298 fungal strains/species to determine the composition of the network and patterns of conservation and diversification. We observe strong protein conservation for a group of 23 core proteins: >95% of all examined strains/species possess at least 14 of these core proteins, albeit in varying compositions, and non of the individual core proteins is 100% conserved. We find high levels of variation in prevalence and sequence identity in the remaining 19 proteins, resulting in distinct lineage-specific compositions of the network in the majority of strains/species. We show that the observed diversification in network composition correlates with lineage, lifestyle, and genetic distance. Yeast, filamentous and basal unicellular fungi, form distinctive groups based on these analyses, with substantial differences to their polarization network. Our study shows that the fungal polarization network is highly dynamic, even between closely related species, and that functional conservation appears to be achieved by varying the specific components of the fungal polarization repertoire.
Collapse
Affiliation(s)
- Eveline T Diepeveen
- Department of Bionanoscience, Faculty of Applied Sciences, Kavli Institute of NanoScience, Delft University of Technology, The Netherlands
| | - Thies Gehrmann
- Delft Bioinformatics Lab, Faculty of Electrical Engineering, Mathematics and Computer Science, Intelligent Systems, Delft University of Technology, The Netherlands
- Department of Molecular Epidemiology, Leiden Computational Biology Center, Leiden University Medical Centre, The Netherlands
| | - Valérie Pourquié
- Department of Bionanoscience, Faculty of Applied Sciences, Kavli Institute of NanoScience, Delft University of Technology, The Netherlands
- Delft Bioinformatics Lab, Faculty of Electrical Engineering, Mathematics and Computer Science, Intelligent Systems, Delft University of Technology, The Netherlands
| | - Thomas Abeel
- Delft Bioinformatics Lab, Faculty of Electrical Engineering, Mathematics and Computer Science, Intelligent Systems, Delft University of Technology, The Netherlands
- Genome Sequencing and Analysis Program, Broad Institute of MIT and Harvard, 7 Cambridge Center, Cambridge, Massachusetts
| | - Liedewij Laan
- Department of Bionanoscience, Faculty of Applied Sciences, Kavli Institute of NanoScience, Delft University of Technology, The Netherlands
| |
Collapse
|
38
|
Galpert D, Fernández A, Herrera F, Antunes A, Molina-Ruiz R, Agüero-Chapin G. Surveying alignment-free features for Ortholog detection in related yeast proteomes by using supervised big data classifiers. BMC Bioinformatics 2018; 19:166. [PMID: 29724166 PMCID: PMC5934817 DOI: 10.1186/s12859-018-2148-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2017] [Accepted: 04/04/2018] [Indexed: 12/24/2022] Open
Abstract
BACKGROUND The development of new ortholog detection algorithms and the improvement of existing ones are of major importance in functional genomics. We have previously introduced a successful supervised pairwise ortholog classification approach implemented in a big data platform that considered several pairwise protein features and the low ortholog pair ratios found between two annotated proteomes (Galpert, D et al., BioMed Research International, 2015). The supervised models were built and tested using a Saccharomycete yeast benchmark dataset proposed by Salichos and Rokas (2011). Despite several pairwise protein features being combined in a supervised big data approach; they all, to some extent were alignment-based features and the proposed algorithms were evaluated on a unique test set. Here, we aim to evaluate the impact of alignment-free features on the performance of supervised models implemented in the Spark big data platform for pairwise ortholog detection in several related yeast proteomes. RESULTS The Spark Random Forest and Decision Trees with oversampling and undersampling techniques, and built with only alignment-based similarity measures or combined with several alignment-free pairwise protein features showed the highest classification performance for ortholog detection in three yeast proteome pairs. Although such supervised approaches outperformed traditional methods, there were no significant differences between the exclusive use of alignment-based similarity measures and their combination with alignment-free features, even within the twilight zone of the studied proteomes. Just when alignment-based and alignment-free features were combined in Spark Decision Trees with imbalance management, a higher success rate (98.71%) within the twilight zone could be achieved for a yeast proteome pair that underwent a whole genome duplication. The feature selection study showed that alignment-based features were top-ranked for the best classifiers while the runners-up were alignment-free features related to amino acid composition. CONCLUSIONS The incorporation of alignment-free features in supervised big data models did not significantly improve ortholog detection in yeast proteomes regarding the classification qualities achieved with just alignment-based similarity measures. However, the similarity of their classification performance to that of traditional ortholog detection methods encourages the evaluation of other alignment-free protein pair descriptors in future research.
Collapse
Affiliation(s)
- Deborah Galpert
- Departamento de Ciencia de la Computación, Universidad Central ¨Marta Abreu¨ de Las Villas (UCLV), 54830, Santa Clara, Cuba
| | - Alberto Fernández
- Department of Computer Science and Artificial Intelligence, Research Center on Information and Communications Technology (CITIC-UGR), University of Granada, 18071, Granada, Spain
| | - Francisco Herrera
- Department of Computer Science and Artificial Intelligence, Research Center on Information and Communications Technology (CITIC-UGR), University of Granada, 18071, Granada, Spain
| | - Agostinho Antunes
- CIIMAR/CIMAR, Centro Interdisciplinar de Investigação Marinha e Ambiental, Universidade do Porto, Terminal de Cruzeiros do Porto de Leixões, Av. General Norton de Matos s/n 4450-208 Matosinhos, Porto, Portugal.,Departamento de Biologia, Faculdade de Ciências, Universidade do Porto, Rua do Campo Alegre, 4169-007, Porto, Portugal
| | - Reinaldo Molina-Ruiz
- Centro de Bioactivos Químicos (CBQ), Universidad Central ¨Marta Abreu¨ de Las Villas (UCLV), 54830, Santa Clara, Cuba
| | - Guillermin Agüero-Chapin
- CIIMAR/CIMAR, Centro Interdisciplinar de Investigação Marinha e Ambiental, Universidade do Porto, Terminal de Cruzeiros do Porto de Leixões, Av. General Norton de Matos s/n 4450-208 Matosinhos, Porto, Portugal. .,Departamento de Biologia, Faculdade de Ciências, Universidade do Porto, Rua do Campo Alegre, 4169-007, Porto, Portugal. .,Centro de Bioactivos Químicos (CBQ), Universidad Central ¨Marta Abreu¨ de Las Villas (UCLV), 54830, Santa Clara, Cuba.
| |
Collapse
|
39
|
Alvarez-Ponce D, Feyertag F, Chakraborty S. Position Matters: Network Centrality Considerably Impacts Rates of Protein Evolution in the Human Protein-Protein Interaction Network. Genome Biol Evol 2018; 9:1742-1756. [PMID: 28854629 PMCID: PMC5570066 DOI: 10.1093/gbe/evx117] [Citation(s) in RCA: 30] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/01/2017] [Indexed: 02/06/2023] Open
Abstract
The proteins of any organism evolve at disparate rates. A long list of factors affecting rates of protein evolution have been identified. However, the relative importance of each factor in determining rates of protein evolution remains unresolved. The prevailing view is that evolutionary rates are dominantly determined by gene expression, and that other factors such as network centrality have only a marginal effect, if any. However, this view is largely based on analyses in yeasts, and accurately measuring the importance of the determinants of rates of protein evolution is complicated by the fact that the different factors are often correlated with each other, and by the relatively poor quality of available functional genomics data sets. Here, we use correlation, partial correlation and principal component regression analyses to measure the contributions of several factors to the variability of the rates of evolution of human proteins. For this purpose, we analyzed the entire human protein–protein interaction data set and the human signal transduction network—a network data set of exceptionally high quality, obtained by manual curation, which is expected to be virtually free from false positives. In contrast with the prevailing view, we observe that network centrality (measured as the number of physical and nonphysical interactions, betweenness, and closeness) has a considerable impact on rates of protein evolution. Surprisingly, the impact of centrality on rates of protein evolution seems to be comparable, or even superior according to some analyses, to that of gene expression. Our observations seem to be independent of potentially confounding factors and from the limitations (biases and errors) of interactomic data sets.
Collapse
|
40
|
Sigg MA, Menchen T, Lee C, Johnson J, Jungnickel MK, Choksi SP, Garcia G, Busengdal H, Dougherty GW, Pennekamp P, Werner C, Rentzsch F, Florman HM, Krogan N, Wallingford JB, Omran H, Reiter JF. Evolutionary Proteomics Uncovers Ancient Associations of Cilia with Signaling Pathways. Dev Cell 2018; 43:744-762.e11. [PMID: 29257953 DOI: 10.1016/j.devcel.2017.11.014] [Citation(s) in RCA: 64] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2016] [Revised: 09/18/2017] [Accepted: 11/17/2017] [Indexed: 12/19/2022]
Abstract
Cilia are organelles specialized for movement and signaling. To infer when during evolution signaling pathways became associated with cilia, we characterized the proteomes of cilia from sea urchins, sea anemones, and choanoflagellates. We identified 437 high-confidence ciliary candidate proteins conserved in mammals and discovered that Hedgehog and G-protein-coupled receptor pathways were linked to cilia before the origin of bilateria and transient receptor potential (TRP) channels before the origin of animals. We demonstrated that candidates not previously implicated in ciliary biology localized to cilia and further investigated ENKUR, a TRP channel-interacting protein identified in the cilia of all three organisms. ENKUR localizes to motile cilia and is required for patterning the left-right axis in vertebrates. Moreover, mutation of ENKUR causes situs inversus in humans. Thus, proteomic profiling of cilia from diverse eukaryotes defines a conserved ciliary proteome, reveals ancient connections to signaling, and uncovers a ciliary protein that underlies development and human disease.
Collapse
Affiliation(s)
- Monika Abedin Sigg
- Department of Biochemistry and Biophysics, Cardiovascular Research Institute, University of California, San Francisco, CA 94158, USA
| | - Tabea Menchen
- Department of General Pediatrics, University Children's Hospital Muenster, Muenster 48149, Germany
| | - Chanjae Lee
- Department of Molecular Biosciences, Center for Systems and Synthetic Biology and Institute for Cellular and Molecular Biology, University of Texas at Austin, Austin, TX 78712, USA
| | - Jeffery Johnson
- Gladstone Institute of Cardiovascular Disease and Gladstone Institute of Virology and Immunology, San Francisco, CA 94158, USA
| | - Melissa K Jungnickel
- Department of Cell and Developmental Biology, University of Massachusetts Medical School, Worcester, MA 01655, USA
| | - Semil P Choksi
- Department of Biochemistry and Biophysics, Cardiovascular Research Institute, University of California, San Francisco, CA 94158, USA
| | - Galo Garcia
- Department of Biochemistry and Biophysics, Cardiovascular Research Institute, University of California, San Francisco, CA 94158, USA
| | - Henriette Busengdal
- Sars International Centre for Marine Molecular Biology, University of Bergen, Bergen 5008, Norway
| | - Gerard W Dougherty
- Department of General Pediatrics, University Children's Hospital Muenster, Muenster 48149, Germany
| | - Petra Pennekamp
- Department of General Pediatrics, University Children's Hospital Muenster, Muenster 48149, Germany
| | - Claudius Werner
- Department of General Pediatrics, University Children's Hospital Muenster, Muenster 48149, Germany
| | - Fabian Rentzsch
- Sars International Centre for Marine Molecular Biology, University of Bergen, Bergen 5008, Norway
| | - Harvey M Florman
- Department of Cell and Developmental Biology, University of Massachusetts Medical School, Worcester, MA 01655, USA
| | - Nevan Krogan
- Gladstone Institute of Cardiovascular Disease and Gladstone Institute of Virology and Immunology, San Francisco, CA 94158, USA; Department of Cellular and Molecular Pharmacology, University of California, San Francisco, CA 94158, USA
| | - John B Wallingford
- Department of Molecular Biosciences, Center for Systems and Synthetic Biology and Institute for Cellular and Molecular Biology, University of Texas at Austin, Austin, TX 78712, USA
| | - Heymut Omran
- Department of General Pediatrics, University Children's Hospital Muenster, Muenster 48149, Germany
| | - Jeremy F Reiter
- Department of Biochemistry and Biophysics, Cardiovascular Research Institute, University of California, San Francisco, CA 94158, USA.
| |
Collapse
|
41
|
Barroso GV, Puzovic N, Dutheil JY. The Evolution of Gene-Specific Transcriptional Noise Is Driven by Selection at the Pathway Level. Genetics 2018; 208:173-189. [PMID: 29097405 PMCID: PMC5753856 DOI: 10.1534/genetics.117.300467] [Citation(s) in RCA: 37] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2017] [Accepted: 10/13/2017] [Indexed: 11/18/2022] Open
Abstract
Biochemical reactions within individual cells result from the interactions of molecules, typically in small numbers. Consequently, the inherent stochasticity of binding and diffusion processes generates noise along the cascade that leads to the synthesis of a protein from its encoding gene. As a result, isogenic cell populations display phenotypic variability even in homogeneous environments. The extent and consequences of this stochastic gene expression have only recently been assessed on a genome-wide scale, owing, in particular, to the advent of single-cell transcriptomics. However, the evolutionary forces shaping this stochasticity have yet to be unraveled. Here, we take advantage of two recently published data sets for the single-cell transcriptome of the domestic mouse Mus musculus to characterize the effect of natural selection on gene-specific transcriptional stochasticity. We show that noise levels in the mRNA distributions (also known as transcriptional noise) significantly correlate with three-dimensional nuclear domain organization, evolutionary constraints on the encoded protein, and gene age. However, the position of the encoded protein in a biological pathway is the main factor that explains observed levels of transcriptional noise, in agreement with models of noise propagation within gene networks. Because transcriptional noise is under widespread selection, we argue that it constitutes an important component of the phenotype and that variance of expression is a potential target of adaptation. Stochastic gene expression should therefore be considered together with the mean expression level in functional and evolutionary studies of gene expression.
Collapse
Affiliation(s)
- Gustavo Valadares Barroso
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Biology, 24306 Plön, Germany
| | - Natasa Puzovic
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Biology, 24306 Plön, Germany
| | - Julien Y Dutheil
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Biology, 24306 Plön, Germany
- Unité mixte de recherche 5554, Institut des Sciences de l'Évolution, Université de Montpellier, 34095, France
| |
Collapse
|
42
|
A High-Resolution Genome-Wide CRISPR/Cas9 Viability Screen Reveals Structural Features and Contextual Diversity of the Human Cell-Essential Proteome. Mol Cell Biol 2017; 38:MCB.00302-17. [PMID: 29038160 DOI: 10.1128/mcb.00302-17] [Citation(s) in RCA: 45] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2017] [Accepted: 09/11/2017] [Indexed: 11/20/2022] Open
Abstract
To interrogate genes essential for cell growth, proliferation and survival in human cells, we carried out a genome-wide clustered regularly interspaced short palindromic repeat (CRISPR)/Cas9 screen in a B-cell lymphoma line using a custom extended-knockout (EKO) library of 278,754 single-guide RNAs (sgRNAs) that targeted 19,084 RefSeq genes, 20,852 alternatively spliced exons, and 3,872 hypothetical genes. A new statistical analysis tool called robust analytics and normalization for knockout screens (RANKS) identified 2,280 essential genes, 234 of which were unique. Individual essential genes were validated experimentally and linked to ribosome biogenesis and stress responses. Essential genes exhibited a bimodal distribution across 10 different cell lines, consistent with a continuous variation in essentiality as a function of cell type. Genes essential in more lines had more severe fitness defects and encoded the evolutionarily conserved structural cores of protein complexes, whereas genes essential in fewer lines formed context-specific modules and encoded subunits at the periphery of essential complexes. The essentiality of individual protein residues across the proteome correlated with evolutionary conservation, structural burial, modular domains, and protein interaction interfaces. Many alternatively spliced exons in essential genes were dispensable and were enriched for disordered regions. Fitness defects were observed for 44 newly evolved hypothetical reading frames. These results illuminate the contextual nature and evolution of essential gene functions in human cells.
Collapse
|
43
|
Begum T, Ghosh TC, Basak S. Systematic Analyses and Prediction of Human Drug Side Effect Associated Proteins from the Perspective of Protein Evolution. Genome Biol Evol 2017; 9:337-350. [PMID: 28391292 PMCID: PMC5499873 DOI: 10.1093/gbe/evw301] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/16/2017] [Indexed: 12/20/2022] Open
Abstract
Identification of various factors involved in adverse drug reactions in target proteins to develop therapeutic drugs with minimal/no side effect is very important. In this context, we have performed a comparative evolutionary rate analyses between the genes exhibiting drug side-effect(s) (SET) and genes showing no side effect (NSET) with an aim to increase the prediction accuracy of SET/NSET proteins using evolutionary rate determinants. We found that SET proteins are more conserved than the NSET proteins. The rates of evolution between SET and NSET protein primarily depend upon their noncomplex (protein complex association number = 0) forming nature, phylogenetic age, multifunctionality, membrane localization, and transmembrane helix content irrespective of their essentiality, total druggability (total number of drugs/target), m-RNA expression level, and tissue expression breadth. We also introduced two novel terms—killer druggability (number of drugs with killing side effect(s)/target), essential druggability (number of drugs targeting essential proteins/target) to explain the evolutionary rate variation between SET and NSET proteins. Interestingly, we noticed that SET proteins are younger than NSET proteins and multifunctional younger SET proteins are candidates of acquiring killing side effects. We provide evidence that higher killer druggability, multifunctionality, and transmembrane helices support the conservation of SET proteins over NSET proteins in spite of their recent origin. By employing all these entities, our Support Vector Machine model predicts human SET/NSET proteins to a high degree of accuracy (∼86%).
Collapse
Affiliation(s)
- Tina Begum
- Bioinformatics Centre, Tripura University, Suryamaninagar, Tripura, India
| | | | - Surajit Basak
- Bioinformatics Centre, Tripura University, Suryamaninagar, Tripura, India.,Department of Molecular Biology & Bioinformatics, Tripura University, Suryamaninagar, Tripura, India
| |
Collapse
|
44
|
Abstract
Gene essentiality is a founding concept of genetics with important implications in both fundamental and applied research. Multiple screens have been performed over the years in bacteria, yeasts, animals and more recently in human cells to identify essential genes. A mounting body of evidence suggests that gene essentiality, rather than being a static and binary property, is both context dependent and evolvable in all kingdoms of life. This concept of a non-absolute nature of gene essentiality changes our fundamental understanding of essential biological processes and could directly affect future treatment strategies for cancer and infectious diseases.
Collapse
|
45
|
Wang W, Zhang X, Deng F, Yuan R, Shen F. Genome-wide characterization and expression analyses of superoxide dismutase (SOD) genes in Gossypium hirsutum. BMC Genomics 2017; 18:376. [PMID: 28499417 PMCID: PMC5429560 DOI: 10.1186/s12864-017-3768-5] [Citation(s) in RCA: 55] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2016] [Accepted: 05/07/2017] [Indexed: 11/30/2022] Open
Abstract
BACKGROUND Superoxide dismutases (SODs) are a key antioxidant enzyme family, which have been implicated in protecting plants against the toxic effects of reactive oxygen species. Despite current studies have shown that the gene family are involved in plant growth and developmental processes and biotic and abiotic stress responses, little is known about its functional role in upland cotton. RESULTS In the present study, we comprehensively analyzed the characteristics of the SOD gene family in upland cotton (Gossypium hirsutum). Based on their conserved motifs, 18 GhSOD genes were identified and phylogenetically classified into five subgroups which corroborated their classifications based on gene-structure patterns and subcellular localizations. The GhSOD sequences were distributed at different densities across 12 of the 26 chromosomes. The conserved domains, gene family evolution cis-acting elements of promoter regions and miRNA-mediated posttranscriptional regulation were predicted and analyzed. In addition, the expression pattern of 18 GhSOD genes were tested in different tissues/organs and developmental stages, and different abiotic stresses and abscisic acid, which indicated that the SOD gene family possessed temporal and spatial specificity expression specificity and may play important roles in reactive oxygen species scavenging caused by various stresses in upland cotton. CONCLUSIONS This study describes the first genome-wide analysis of the upland cotton SOD gene family, and the results will help establish a foundation for the further cloning and functional verification of the GhSOD gene family during stress responses, leading to crop improvement.
Collapse
Affiliation(s)
- Wei Wang
- State Key Laboratory of Crop Biology, College of Agronomy, Shandong Agricultural University, Tai’an, 271018 Shandong People’s Republic of China
| | - Xiaopei Zhang
- State Key Laboratory of Crop Biology, College of Agronomy, Shandong Agricultural University, Tai’an, 271018 Shandong People’s Republic of China
| | - Fenni Deng
- State Key Laboratory of Crop Biology, College of Agronomy, Shandong Agricultural University, Tai’an, 271018 Shandong People’s Republic of China
| | - Rui Yuan
- State Key Laboratory of Crop Biology, College of Agronomy, Shandong Agricultural University, Tai’an, 271018 Shandong People’s Republic of China
| | - Fafu Shen
- State Key Laboratory of Crop Biology, College of Agronomy, Shandong Agricultural University, Tai’an, 271018 Shandong People’s Republic of China
| |
Collapse
|
46
|
Ascencio D, Ochoa S, Delaye L, DeLuna A. Increased rates of protein evolution and asymmetric deceleration after the whole-genome duplication in yeasts. BMC Evol Biol 2017; 17:40. [PMID: 28166720 PMCID: PMC5294719 DOI: 10.1186/s12862-017-0895-1] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2016] [Accepted: 01/26/2017] [Indexed: 11/25/2022] Open
Abstract
Background Whole-genome duplication (WGD) events have shaped the genomes of eukaryotic organisms. Relaxed selection after duplication along with inherent functional constraints are thought to determine the fate of the paralogs and, ultimately, the evolution of gene function. Here, we investigated the rate of protein evolution (as measured by dN/dS ratios) before and after the WGD in the hemiascomycete yeasts, and the way in which changes in such rates relate to molecular and biological function. Results For most groups of orthologous genes (81%) we observed a change in the rates of evolution after genome duplication. Genes with atypically-low dN/dS ratio before the WGD were prone to increase their rates of evolution after duplication. Importantly, the paralogs were often different in their rates of evolution after the WGD (50% cases), however, this was more consistent with an asymmetric deceleration in the protein-evolution rates, rather than an asymmetric increase of the initial rates. Functional-category analysis showed that regulatory proteins such as protein kinases and transcription factors were enriched in genes that increase their rates of evolution after the WGD. While changes in the rate of protein-sequence evolution were associated to protein abundance, content of disordered regions, and contribution to fitness, these features were an attribute of specific functional classes. Conclusions Our results indicate that strong purifying selection in ancestral pre-duplication sequences is a strong predictor of increased rates after the duplication in yeasts and that asymmetry in evolution rate is established during the deceleration phase. In addition, changes in the rates at which paralogous sequences evolve before and after WGD are different for specific protein functions; increased rates of protein evolution after duplication occur preferentially in specific protein functions. Electronic supplementary material The online version of this article (doi:10.1186/s12862-017-0895-1) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Diana Ascencio
- Unidad de Genómica Avanzada (Langebio), Centro de Investigación y de Estudios Avanzados del IPN, Irapuato, Guanajuato, Mexico
| | - Soledad Ochoa
- Unidad de Genómica Avanzada (Langebio), Centro de Investigación y de Estudios Avanzados del IPN, Irapuato, Guanajuato, Mexico
| | - Luis Delaye
- Departamento de Ingeniería Genética, Centro de Investigación y de Estudios Avanzados del IPN, Irapuato, Guanajuato, Mexico
| | - Alexander DeLuna
- Unidad de Genómica Avanzada (Langebio), Centro de Investigación y de Estudios Avanzados del IPN, Irapuato, Guanajuato, Mexico.
| |
Collapse
|
47
|
Jacobs C, Lambourne L, Xia Y, Segrè D. Upon Accounting for the Impact of Isoenzyme Loss, Gene Deletion Costs Anticorrelate with Their Evolutionary Rates. PLoS One 2017; 12:e0170164. [PMID: 28107392 PMCID: PMC5249160 DOI: 10.1371/journal.pone.0170164] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2016] [Accepted: 12/30/2016] [Indexed: 12/19/2022] Open
Abstract
System-level metabolic network models enable the computation of growth and metabolic phenotypes from an organism's genome. In particular, flux balance approaches have been used to estimate the contribution of individual metabolic genes to organismal fitness, offering the opportunity to test whether such contributions carry information about the evolutionary pressure on the corresponding genes. Previous failure to identify the expected negative correlation between such computed gene-loss cost and sequence-derived evolutionary rates in Saccharomyces cerevisiae has been ascribed to a real biological gap between a gene's fitness contribution to an organism "here and now" and the same gene's historical importance as evidenced by its accumulated mutations over millions of years of evolution. Here we show that this negative correlation does exist, and can be exposed by revisiting a broadly employed assumption of flux balance models. In particular, we introduce a new metric that we call "function-loss cost", which estimates the cost of a gene loss event as the total potential functional impairment caused by that loss. This new metric displays significant negative correlation with evolutionary rate, across several thousand minimal environments. We demonstrate that the improvement gained using function-loss cost over gene-loss cost is explained by replacing the base assumption that isoenzymes provide unlimited capacity for backup with the assumption that isoenzymes are completely non-redundant. We further show that this change of the assumption regarding isoenzymes increases the recall of epistatic interactions predicted by the flux balance model at the cost of a reduction in the precision of the predictions. In addition to suggesting that the gene-to-reaction mapping in genome-scale flux balance models should be used with caution, our analysis provides new evidence that evolutionary gene importance captures much more than strict essentiality.
Collapse
Affiliation(s)
- Christopher Jacobs
- Bioinformatics Program, Boston University, Boston, Massachusetts, United States of America
| | - Luke Lambourne
- Bioinformatics Program, Boston University, Boston, Massachusetts, United States of America
- Department of Bioengineering, Faculty of Engineering, McGill University, Montreal, Quebec, Canada
| | - Yu Xia
- Bioinformatics Program, Boston University, Boston, Massachusetts, United States of America
- Department of Bioengineering, Faculty of Engineering, McGill University, Montreal, Quebec, Canada
| | - Daniel Segrè
- Bioinformatics Program, Boston University, Boston, Massachusetts, United States of America
- Department of Biology, Boston University, Boston, Massachusetts, United States of America
- Department of Biomedical Engineering, Boston University, Boston, Massachusetts, United States of America
| |
Collapse
|
48
|
Mutation, Duplication, and More in the Evolution of Venomous Animals and Their Toxins. EVOLUTION OF VENOMOUS ANIMALS AND THEIR TOXINS 2017. [DOI: 10.1007/978-94-007-6458-3_5] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
|
49
|
Lateralization of gene expression in the honeybee brain during olfactory learning. Sci Rep 2016; 6:34727. [PMID: 27703214 PMCID: PMC5050455 DOI: 10.1038/srep34727] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2015] [Accepted: 09/20/2016] [Indexed: 12/17/2022] Open
Abstract
In the last decade, it has been demonstrated that brain functional asymmetry occurs not only in vertebrates but also in invertebrates. However, the mechanisms underlying functional asymmetry remain unclear. In the present study, we trained honeybees of the same parentage and age, on the proboscis extension reflex (PER) paradigm with only one antenna in use. The comparisons of gene expression between the left and right hemispheres were carried out using high throughput sequencing. Our research revealed that gene expression in the honeybee brain is also asymmetric, with more genes having higher expression in the right hemisphere than the left hemisphere. Our studies show that during olfactory learning, the left hemisphere is more responsible for long term memory and the right hemisphere is more responsible for the learning and short term memory.
Collapse
|
50
|
Alvarez-Ponce D, Sabater-Muñoz B, Toft C, Ruiz-González MX, Fares MA. Essentiality Is a Strong Determinant of Protein Rates of Evolution during Mutation Accumulation Experiments in Escherichia coli. Genome Biol Evol 2016; 8:2914-2927. [PMID: 27566759 PMCID: PMC5630975 DOI: 10.1093/gbe/evw205] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
Abstract
The Neutral Theory of Molecular Evolution is considered the most powerful theory to understand the evolutionary behavior of proteins. One of the main predictions of this theory is that essential proteins should evolve slower than dispensable ones owing to increased selective constraints. Comparison of genomes of different species, however, has revealed only small differences between the rates of evolution of essential and nonessential proteins. In some analyses, these differences vanish once confounding factors are controlled for, whereas in other cases essentiality seems to have an independent, albeit small, effect. It has been argued that comparing relatively distant genomes may entail a number of limitations. For instance, many of the genes that are dispensable in controlled lab conditions may be essential in some of the conditions faced in nature. Moreover, essentiality can change during evolution, and rates of protein evolution are simultaneously shaped by a variety of factors, whose individual effects are difficult to isolate. Here, we conducted two parallel mutation accumulation experiments in Escherichia coli, during 5,500–5,750 generations, and compared the genomes at different points of the experiments. Our approach (a short-term experiment, under highly controlled conditions) enabled us to overcome many of the limitations of previous studies. We observed that essential proteins evolved substantially slower than nonessential ones during our experiments. Strikingly, rates of protein evolution were only moderately affected by expression level and protein length.
Collapse
Affiliation(s)
| | - Beatriz Sabater-Muñoz
- Instituto de Biología Molecular y Celular de Plantas (CSIC-UPV), Valencia, Spain Department of Genetics, Smurfit Institute of Genetics, University of Dublin, Trinity College Dublin, Dublin, Ireland
| | - Christina Toft
- Department of Genetics, University of Valencia, Valencia, Spain Departamento de Biotecnología, Instituto de Agroquímica y Tecnología de los Alimentos (CSIC), Valencia, Spain
| | - Mario X Ruiz-González
- Instituto de Biología Molecular y Celular de Plantas (CSIC-UPV), Valencia, Spain Current Address: Secretaría de Educación Superior, Ciencia, Tecnología e Innovación, Proyecto Prometeo; Departamento de Ciencias Biológicas, Universidad Tócnica Particular de Loja, Loja, Ecuador
| | - Mario A Fares
- Instituto de Biología Molecular y Celular de Plantas (CSIC-UPV), Valencia, Spain Department of Genetics, Smurfit Institute of Genetics, University of Dublin, Trinity College Dublin, Dublin, Ireland
| |
Collapse
|