1
|
Hao T, Zhang M, Song Z, Gou Y, Wang B, Sun J. Reconstruction of Eriocheir sinensis Protein-Protein Interaction Network Based on DGO-SVM Method. Curr Issues Mol Biol 2024; 46:7353-7372. [PMID: 39057077 PMCID: PMC11276262 DOI: 10.3390/cimb46070436] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2024] [Revised: 06/25/2024] [Accepted: 07/10/2024] [Indexed: 07/28/2024] Open
Abstract
Eriocheir sinensis is an economically important aquatic animal. Its regulatory mechanisms underlying many biological processes are still vague due to the lack of systematic analysis tools. The protein-protein interaction network (PIN) is an important tool for the systematic analysis of regulatory mechanisms. In this work, a novel machine learning method, DGO-SVM, was applied to predict the protein-protein interaction (PPI) in E. sinensis, and its PIN was reconstructed. With the domain, biological process, molecular functions and subcellular locations of proteins as the features, DGO-SVM showed excellent performance in Bombyx mori, humans and five aquatic crustaceans, with 92-96% accuracy. With DGO-SVM, the PIN of E. sinensis was reconstructed, containing 14,703 proteins and 7,243,597 interactions, in which 35,604 interactions were associated with 566 novel proteins mainly involved in the response to exogenous stimuli, cellular macromolecular metabolism and regulation. The DGO-SVM demonstrated that the biological process, molecular functions and subcellular locations of proteins are significant factors for the precise prediction of PPIs. We reconstructed the largest PIN for E. sinensis, which provides a systematic tool for the regulatory mechanism analysis. Furthermore, the novel-protein-related PPIs in the PIN may provide important clues for the mechanism analysis of the underlying specific physiological processes in E. sinensis.
Collapse
Affiliation(s)
| | | | | | | | - Bin Wang
- Tianjin Key Laboratory of Animal and Plant Resistance, College of Life Sciences, Tianjin Normal University, Tianjin 300387, China; (T.H.); (M.Z.); (Z.S.); (Y.G.)
| | - Jinsheng Sun
- Tianjin Key Laboratory of Animal and Plant Resistance, College of Life Sciences, Tianjin Normal University, Tianjin 300387, China; (T.H.); (M.Z.); (Z.S.); (Y.G.)
| |
Collapse
|
2
|
Armingol E, Ghaddar A, Joshi CJ, Baghdassarian H, Shamie I, Chan J, Her HL, Berhanu S, Dar A, Rodriguez-Armstrong F, Yang O, O’Rourke EJ, Lewis NE. Inferring a spatial code of cell-cell interactions across a whole animal body. PLoS Comput Biol 2022; 18:e1010715. [PMID: 36395331 PMCID: PMC9714814 DOI: 10.1371/journal.pcbi.1010715] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2022] [Revised: 12/01/2022] [Accepted: 11/07/2022] [Indexed: 11/18/2022] Open
Abstract
Cell-cell interactions shape cellular function and ultimately organismal phenotype. Interacting cells can sense their mutual distance using combinations of ligand-receptor pairs, suggesting the existence of a spatial code, i.e., signals encoding spatial properties of cellular organization. However, this code driving and sustaining the spatial organization of cells remains to be elucidated. Here we present a computational framework to infer the spatial code underlying cell-cell interactions from the transcriptomes of the cell types across the whole body of a multicellular organism. As core of this framework, we introduce our tool cell2cell, which uses the coexpression of ligand-receptor pairs to compute the potential for intercellular interactions, and we test it across the Caenorhabditis elegans' body. Leveraging a 3D atlas of C. elegans' cells, we also implement a genetic algorithm to identify the ligand-receptor pairs most informative of the spatial organization of cells across the whole body. Validating the spatial code extracted with this strategy, the resulting intercellular distances are negatively correlated with the inferred cell-cell interactions. Furthermore, for selected cell-cell and ligand-receptor pairs, we experimentally confirm the communicatory behavior inferred with cell2cell and the genetic algorithm. Thus, our framework helps identify a code that predicts the spatial organization of cells across a whole-animal body.
Collapse
Affiliation(s)
- Erick Armingol
- Bioinformatics and Systems Biology Graduate Program, University of California, San Diego, La Jolla, California, United States of America
- Department of Pediatrics, University of California, San Diego, La Jolla, California, United States of America
| | - Abbas Ghaddar
- Department of Biology, University of Virginia, Charlottesville, Virginia, United States of America
| | - Chintan J. Joshi
- Department of Pediatrics, University of California, San Diego, La Jolla, California, United States of America
| | - Hratch Baghdassarian
- Bioinformatics and Systems Biology Graduate Program, University of California, San Diego, La Jolla, California, United States of America
- Department of Pediatrics, University of California, San Diego, La Jolla, California, United States of America
| | - Isaac Shamie
- Bioinformatics and Systems Biology Graduate Program, University of California, San Diego, La Jolla, California, United States of America
- Department of Pediatrics, University of California, San Diego, La Jolla, California, United States of America
| | - Jason Chan
- Poway High School, Poway, California, United States of America
| | - Hsuan-Lin Her
- Bioinformatics and Systems Biology Graduate Program, University of California, San Diego, La Jolla, California, United States of America
| | - Samuel Berhanu
- Department of Biology, University of Virginia, Charlottesville, Virginia, United States of America
| | - Anushka Dar
- Department of Biology, University of Virginia, Charlottesville, Virginia, United States of America
| | | | - Olivia Yang
- Department of Biology, University of Virginia, Charlottesville, Virginia, United States of America
| | - Eyleen J. O’Rourke
- Department of Biology, University of Virginia, Charlottesville, Virginia, United States of America
- Department of Cell Biology, School of Medicine of University of Virginia, Charlottesville, Virginia, United States of America
| | - Nathan E. Lewis
- Department of Pediatrics, University of California, San Diego, La Jolla, California, United States of America
- Department of Bioengineering, University of California, San Diego, La Jolla, California, United States of America
| |
Collapse
|
3
|
OUP accepted manuscript. Brief Funct Genomics 2022; 21:243-269. [DOI: 10.1093/bfgp/elac007] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2021] [Revised: 03/17/2022] [Accepted: 03/18/2022] [Indexed: 11/14/2022] Open
|
4
|
Chang AYF, Liao BY. Reduced Translational Efficiency of Eukaryotic Genes after Duplication Events. Mol Biol Evol 2021; 37:1452-1461. [PMID: 31904835 DOI: 10.1093/molbev/msz309] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
Control of gene expression has been found to be predominantly determined at the level of protein translation. However, to date, reduced expression from duplicated genes in eukaryotes for dosage maintenance has only been linked to transcriptional control involving epigenetic mechanisms. Here, we hypothesize that dosage maintenance following gene duplication also involves regulation at the protein level. To test this hypothesis, we compared transcriptome and proteome data of yeast models, Saccharomyces cerevisiae and Schizosaccharomyces pombe, and worm models, Caenorhabditis elegans and Caenorhabditis briggsae, to investigate lineage-specifically duplicated genes. Duplicated genes in both eukaryotic models exhibited a reduced protein-to-mRNA abundance ratio. Moreover, dosage sensitive genes, represented by genes encoding protein complex subunits, reduced their protein-to-mRNA abundance ratios more significantly than the other genes after duplication events. An analysis of ribosome profiling (Ribo-Seq) data further showed that reduced translational efficiency was more prominent for dosage sensitive genes than for the other genes. Meanwhile, no difference in protein degradation rate was associated with duplication events. Translationally repressed duplicated genes were also more likely to be inhibited at the level of transcription. Taken together, these results suggest that translation-mediated dosage control is partially contributed by natural selection and it enhances transcriptional control in maintaining gene dosage after gene duplication events during eukaryotic genome evolution.
Collapse
Affiliation(s)
- Andrew Ying-Fei Chang
- Institute of Population Health Sciences, National Health Research Institutes, Zhunan, Taiwan, Republic of China
| | - Ben-Yang Liao
- Institute of Population Health Sciences, National Health Research Institutes, Zhunan, Taiwan, Republic of China
| |
Collapse
|
5
|
Han Y, Cheng L, Sun W. Analysis of Protein-Protein Interaction Networks through Computational Approaches. Protein Pept Lett 2020; 27:265-278. [PMID: 31692419 DOI: 10.2174/0929866526666191105142034] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2019] [Revised: 05/08/2019] [Accepted: 09/26/2019] [Indexed: 01/02/2023]
Abstract
The interactions among proteins and genes are extremely important for cellular functions. Molecular interactions at protein or gene levels can be used to construct interaction networks in which the interacting species are categorized based on direct interactions or functional similarities. Compared with the limited experimental techniques, various computational tools make it possible to analyze, filter, and combine the interaction data to get comprehensive information about the biological pathways. By the efficient way of integrating experimental findings in discovering PPIs and computational techniques for prediction, the researchers have been able to gain many valuable data on PPIs, including some advanced databases. Moreover, many useful tools and visualization programs enable the researchers to establish, annotate, and analyze biological networks. We here review and list the computational methods, databases, and tools for protein-protein interaction prediction.
Collapse
Affiliation(s)
- Ying Han
- Cardiovascular Department, The Fourth Affiliated Hospital of Harbin Medical University, Harbin, China
| | - Liang Cheng
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Weiju Sun
- Cardiovascular Department, The First Affiliated Hospital of Harbin Medical University, Harbin, China
| |
Collapse
|
6
|
Guan G, Fang M, Wong MK, Ho VWS, An X, Tang C, Huang X, Zhao Z. Multilevel regulation of muscle-specific transcription factor hlh-1 during Caenorhabditis elegans embryogenesis. Dev Genes Evol 2020; 230:265-278. [PMID: 32556563 PMCID: PMC7371654 DOI: 10.1007/s00427-020-00662-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2020] [Accepted: 05/31/2020] [Indexed: 11/29/2022]
Abstract
hlh-1 is a myogenic transcription factor required for body-wall muscle specification during embryogenesis in Caenorhabditis elegans. Despite its well-known role in muscle specification, comprehensive regulatory control upstream of hlh-1 remains poorly defined. Here, we first established a statistical reference for the spatiotemporal expression of hlh-1 at single-cell resolution up to the second last round of divisions for most of the cell lineages (from 4- to 350-cell stage) using 13 wild-type embryos. We next generated lineal expression of hlh-1 after RNA interference (RNAi) perturbation of 65 genes, which were selected based on their degree of conservation, mutant phenotypes, and known roles in development. We then compared the expression profiles between wild-type and RNAi embryos by clustering according to their lineal expression patterns using mean-shift and density-based clustering algorithms, which not only confirmed the roles of existing genes but also uncovered the potential functions of novel genes in muscle specification at multiple levels, including cellular, lineal, and embryonic levels. By combining the public data on protein-protein interactions, protein-DNA interactions, and genetic interactions with our RNAi data, we inferred regulatory pathways upstream of hlh-1 that function globally or locally. This work not only revealed diverse and multilevel regulatory mechanisms coordinating muscle differentiation during C. elegans embryogenesis but also laid a foundation for further characterizing the regulatory pathways controlling muscle specification at the cellular, lineal (local), or embryonic (global) level.
Collapse
Affiliation(s)
- Guoye Guan
- Center for Quantitative Biology, Peking University, Beijing, 100871, China
| | - Meichen Fang
- School of Life Sciences, Peking University, Beijing, 100871, China
| | - Ming-Kin Wong
- Department of Biology, Hong Kong Baptist University, Hong Kong, 999077, China
| | - Vincy Wing Sze Ho
- Department of Biology, Hong Kong Baptist University, Hong Kong, 999077, China
| | - Xiaomeng An
- Department of Biology, Hong Kong Baptist University, Hong Kong, 999077, China
| | - Chao Tang
- Center for Quantitative Biology, Peking University, Beijing, 100871, China
- Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, 100871, China
- School of Physics, Peking University, Beijing, 100871, China
| | - Xiaotai Huang
- School of Computer Science and Technology, Xidian University, Xi'an, 710126, Shaanxi, China.
| | - Zhongying Zhao
- Department of Biology, Hong Kong Baptist University, Hong Kong, 999077, China.
- State Key Laboratory of Environmental and Biological Analysis, Hong Kong Baptist University, Hong Kong, 999077, China.
| |
Collapse
|
7
|
Byshkin M, Stivala A, Mira A, Robins G, Lomi A. Fast Maximum Likelihood Estimation via Equilibrium Expectation for Large Network Data. Sci Rep 2018; 8:11509. [PMID: 30065311 PMCID: PMC6068132 DOI: 10.1038/s41598-018-29725-8] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2017] [Accepted: 07/17/2018] [Indexed: 01/24/2023] Open
Abstract
A major line of contemporary research on complex networks is based on the development of statistical models that specify the local motifs associated with macro-structural properties observed in actual networks. This statistical approach becomes increasingly problematic as network size increases. In the context of current research on efficient estimation of models for large network data sets, we propose a fast algorithm for maximum likelihood estimation (MLE) that affords a significant increase in the size of networks amenable to direct empirical analysis. The algorithm we propose in this paper relies on properties of Markov chains at equilibrium, and for this reason it is called equilibrium expectation (EE). We demonstrate the performance of the EE algorithm in the context of exponential random graph models (ERGMs) a family of statistical models commonly used in empirical research based on network data observed at a single period in time. Thus far, the lack of efficient computational strategies has limited the empirical scope of ERGMs to relatively small networks with a few thousand nodes. The approach we propose allows a dramatic increase in the size of networks that may be analyzed using ERGMs. This is illustrated in an analysis of several biological networks and one social network with 104,103 nodes.
Collapse
Affiliation(s)
- Maksym Byshkin
- Institute of Computational Science, Università della Svizzera italiana, Lugano, 6900, Switzerland
| | - Alex Stivala
- Institute of Computational Science, Università della Svizzera italiana, Lugano, 6900, Switzerland
- Centre for Transformative Innovation, Swinburne University of Technology, Hawthorn Victoria, 3122, Australia
| | - Antonietta Mira
- Institute of Computational Science, Università della Svizzera italiana, Lugano, 6900, Switzerland
- Dipartimento di Scienza e Alta Tecnologia, Università dell'Insubria, Como, 22100, Italy
| | - Garry Robins
- Centre for Transformative Innovation, Swinburne University of Technology, Hawthorn Victoria, 3122, Australia
- School of Psychological Sciences, University of Melbourne, Parkville Victoria, 3010, Australia
| | - Alessandro Lomi
- Institute of Computational Science, Università della Svizzera italiana, Lugano, 6900, Switzerland.
- School of Psychological Sciences, University of Melbourne, Parkville Victoria, 3010, Australia.
| |
Collapse
|
8
|
Chang AYF, Liao BY. Recruitment of histone modifications to assist mRNA dosage maintenance after degeneration of cytosine DNA methylation during animal evolution. Genome Res 2017; 27:1513-1524. [PMID: 28720579 PMCID: PMC5580711 DOI: 10.1101/gr.221739.117] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2017] [Accepted: 07/05/2017] [Indexed: 12/24/2022]
Abstract
Following gene duplication, mRNA expression of the duplicated gene is reduced to maintain mRNA dosage. In mammals, this process is achieved with increased cytosine DNA methylation of the promoters of duplicated genes to suppress transcriptional initiation. However, not all animal species possess a full apparatus for cytosine DNA methylation. For such species, such as the roundworm (Caenorhabditis elegans, "worm" hereafter) or fruit fly (Drosophila melanogaster, "fly" hereafter), it is unclear how reduced expression of duplicated genes has been achieved evolutionarily. Here, we hypothesize that in the absence of a classical cytosine DNA methylation pathway, histone modifications play an increasing role in maintaining mRNA dosage following gene duplication. We initially verified that reduced gene expression of duplicated genes had occurred in the worm, fly, and mouse (Mus musculus). Next, several histone marks, with the capacity to control mRNA abundance in the models studied, were examined. In the worm and fly, but not in the mouse, multiple histone modifications were found to assist mRNA dosage maintenance following gene duplication events and the possible involvement of adenine DNA methylation in this process was excluded. Furthermore, the histone marks and acting regions that mediated the reduction in duplicated gene expression were found to be largely organism specific. Thus, it appears that many of the histone marks that maintain mRNA dosage were independently recruited during the evolution of worms and flies to compensate for the loss of cytosine DNA methylation machinery from their genomes.
Collapse
Affiliation(s)
- Andrew Ying-Fei Chang
- Institute of Population Health Sciences, National Health Research Institutes, Zhunan, Miaoli County 350, Taiwan, Republic of China
| | - Ben-Yang Liao
- Institute of Population Health Sciences, National Health Research Institutes, Zhunan, Miaoli County 350, Taiwan, Republic of China
| |
Collapse
|
9
|
Chang JW, Zhou YQ, Ul Qamar MT, Chen LL, Ding YD. Prediction of Protein-Protein Interactions by Evidence Combining Methods. Int J Mol Sci 2016; 17:ijms17111946. [PMID: 27879651 PMCID: PMC5133940 DOI: 10.3390/ijms17111946] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2016] [Revised: 11/15/2016] [Accepted: 11/15/2016] [Indexed: 12/27/2022] Open
Abstract
Most cellular functions involve proteins' features based on their physical interactions with other partner proteins. Sketching a map of protein-protein interactions (PPIs) is therefore an important inception step towards understanding the basics of cell functions. Several experimental techniques operating in vivo or in vitro have made significant contributions to screening a large number of protein interaction partners, especially high-throughput experimental methods. However, computational approaches for PPI predication supported by rapid accumulation of data generated from experimental techniques, 3D structure definitions, and genome sequencing have boosted the map sketching of PPIs. In this review, we shed light on in silico PPI prediction methods that integrate evidence from multiple sources, including evolutionary relationship, function annotation, sequence/structure features, network topology and text mining. These methods are developed for integration of multi-dimensional evidence, for designing the strategies to predict novel interactions, and for making the results consistent with the increase of prediction coverage and accuracy.
Collapse
Affiliation(s)
- Ji-Wei Chang
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan 430070, China.
- College of Informatics, Huazhong Agricultural University, Wuhan 430070, China.
| | - Yan-Qing Zhou
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan 430070, China.
- College of Informatics, Huazhong Agricultural University, Wuhan 430070, China.
| | - Muhammad Tahir Ul Qamar
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan 430070, China.
- College of Informatics, Huazhong Agricultural University, Wuhan 430070, China.
| | - Ling-Ling Chen
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan 430070, China.
- College of Informatics, Huazhong Agricultural University, Wuhan 430070, China.
| | - Yu-Duan Ding
- National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan 430070, China.
- College of Informatics, Huazhong Agricultural University, Wuhan 430070, China.
| |
Collapse
|