1
|
Khan MRUZ, Trivedi V. Molecular modelling, docking and network analysis of phytochemicals from Haritaki churna: role of protein cross-talks for their action. J Biomol Struct Dyn 2024; 42:4297-4312. [PMID: 37288779 DOI: 10.1080/07391102.2023.2220036] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2023] [Accepted: 05/26/2023] [Indexed: 06/09/2023]
Abstract
Phytochemicals are bioactive agents present in medicinal plants with therapeutic values. Phytochemicals isolated from plants target multiple cellular processes. In the current work, we have used fractionation techniques to identify 13 bioactive polyphenols in ayurvedic medicine Haritaki Churna. Employing the advanced spectroscopic and fractionation, structure of bioactive polyphenols was determined. Blasting the phytochemical structure allow us to identify a total of 469 protein targets from Drug bank and Binding DB. Phytochemicals with their protein targets from Drug bank was used to create a phytochemical-protein network comprising of 394 nodes and 1023 edges. It highlights the extensive cross-talk between protein target corresponding to different phytochemicals. Analysis of protein targets from Binding data bank gives a network comprised of 143 nodes and 275 edges. Taking the data together from Drug bank and binding data, seven most prominent drug targets (HSP90AA1, c-Src kinase, EGFR, Akt1, EGFR, AR, and ESR-α) were found to be target of the phytochemicals. Molecular modelling and docking experiment indicate that phytochemicals are fitting nicely into active site of the target proteins. The binding energy of the phytochemicals were better than the inhibitors of these protein targets. The strength and stability of the protein ligand complexes were further confirmed using molecular dynamic simulation studies. Further, the ADMET profiles of phytochemicals extracted from HCAE suggests that they can be potential drug targets. The phytochemical cross-talk was further proven by choosing c-Src as a model. HCAE down regulated c-Src and its downstream protein targets such as Akt1, cyclin D1 and vimentin. Hence, network analysis followed by molecular docking, molecular dynamics simulation and in-vitro studies clearly highlight the role of protein network and subsequent selection of drug candidate based on network pharmacology.Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Md Rafi Uz Zama Khan
- Malaria Research Group, Department of Biosciences and Bioengineering, Indian Institute of Technology-Guwahati, Guwahati, Assam, India
| | - Vishal Trivedi
- Malaria Research Group, Department of Biosciences and Bioengineering, Indian Institute of Technology-Guwahati, Guwahati, Assam, India
| |
Collapse
|
2
|
Kasimanickam R, Kasimanickam V. MicroRNAs in the Pathogenesis of Preeclampsia-A Case-Control In Silico Analysis. Curr Issues Mol Biol 2024; 46:3438-3459. [PMID: 38666946 PMCID: PMC11048894 DOI: 10.3390/cimb46040216] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/29/2024] [Revised: 04/03/2024] [Accepted: 04/04/2024] [Indexed: 04/28/2024] Open
Abstract
Preeclampsia (PE) occurs in 5% to 7% of all pregnancies, and the PE that results from abnormal placentation acts as a primary cause of maternal and neonatal morbidity and mortality. The objective of this secondary analysis was to elucidate the pathogenesis of PE by probing protein-protein interactions from in silico analysis of transcriptomes between PE and normal placenta from Gene Expression Omnibus (GSE149812). The pathogenesis of PE is apparently determined by associations of miRNA molecules and their target genes and the degree of changes in their expressions with irregularities in the functions of hemostasis, vascular systems, and inflammatory processes at the fetal-maternal interface. These irregularities ultimately lead to impaired placental growth and hypoxic injuries, generally manifesting as placental insufficiency. These differentially expressed miRNAs or genes in placental tissue and/or in blood can serve as novel diagnostic and therapeutic biomarkers.
Collapse
Affiliation(s)
- Ramanathan Kasimanickam
- Department of Veterinary Clinical Sciences, College of Veterinary Medicine, Washington State University, Pullman, WA 99164, USA
| | - Vanmathy Kasimanickam
- Center for Reproductive Biology, College of Veterinary Medicine, Washington State University, Pullman, WA 99164, USA;
| |
Collapse
|
3
|
Ahn S, Datta S. Differential network connectivity analysis for microbiome data adjusted for clinical covariates using jackknife pseudo-values. BMC Bioinformatics 2024; 25:117. [PMID: 38500042 PMCID: PMC10946111 DOI: 10.1186/s12859-024-05689-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2023] [Accepted: 02/02/2024] [Indexed: 03/20/2024] Open
Abstract
BACKGROUND A recent breakthrough in differential network (DN) analysis of microbiome data has been realized with the advent of next-generation sequencing technologies. The DN analysis disentangles the microbial co-abundance among taxa by comparing the network properties between two or more graphs under different biological conditions. However, the existing methods to the DN analysis for microbiome data do not adjust for other clinical differences between subjects. RESULTS We propose a Statistical Approach via Pseudo-value Information and Estimation for Differential Network Analysis (SOHPIE-DNA) that incorporates additional covariates such as continuous age and categorical BMI. SOHPIE-DNA is a regression technique adopting jackknife pseudo-values that can be implemented readily for the analysis. We demonstrate through simulations that SOHPIE-DNA consistently reaches higher recall and F1-score, while maintaining similar precision and accuracy to existing methods (NetCoMi and MDiNE). Lastly, we apply SOHPIE-DNA on two real datasets from the American Gut Project and the Diet Exchange Study to showcase the utility. The analysis of the Diet Exchange Study is to showcase that SOHPIE-DNA can also be used to incorporate the temporal change of connectivity of taxa with the inclusion of additional covariates. As a result, our method has found taxa that are related to the prevention of intestinal inflammation and severity of fatigue in advanced metastatic cancer patients. CONCLUSION SOHPIE-DNA is the first attempt of introducing the regression framework for the DN analysis in microbiome data. This enables the prediction of characteristics of a connectivity of a network with the presence of additional covariate information in the regression. The R package with a vignette of our methodology is available through the CRAN repository ( https://CRAN.R-project.org/package=SOHPIE ), named SOHPIE (pronounced as Sofie). The source code and user manual can be found at https://github.com/sjahnn/SOHPIE-DNA .
Collapse
Affiliation(s)
- Seungjun Ahn
- Department of Biostatistics, University of Florida, Gainesville, FL, USA
- Department of Population Health Science and Policy, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Tisch Cancer Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Somnath Datta
- Department of Biostatistics, University of Florida, Gainesville, FL, USA.
| |
Collapse
|
4
|
Xu M, Abdullah NA, Md Sabri AQ. A method to improve the prediction performance of cancer-gene association by screening negative training samples through gene network data. Comput Biol Chem 2024; 108:107997. [PMID: 38154318 DOI: 10.1016/j.compbiolchem.2023.107997] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2023] [Revised: 11/03/2023] [Accepted: 12/03/2023] [Indexed: 12/30/2023]
Abstract
This work focuses on data sampling in cancer-gene association prediction. Currently, researchers are using machine learning methods to predict genes that are more likely to produce cancer-causing mutations. To improve the performance of machine learning models, methods have been proposed, one of which is to improve the quality of the training data. Existing methods focus mainly on positive data, i.e. cancer driver genes, for screening selection. This paper proposes a low-cancer-related gene screening method based on gene network and graph theory algorithms to improve the negative samples selection. Genetic data with low cancer correlation is used as negative training samples. After experimental verification, using the negative samples screened by this method to train the cancer gene classification model can improve prediction performance. The biggest advantage of this method is that it can be easily combined with other methods that focus on enhancing the quality of positive training samples. It has been demonstrated that significant improvement is achieved by combining this method with three state-of-the-arts cancer gene prediction methods.
Collapse
Affiliation(s)
- Mingzhe Xu
- Faculty of Computer Science & Information Technology, Universiti Malaya, Kuala Lumpur, 50603 Malaysia; School of Energy and Intelligence Engineering, Henan University of Animal Husbandry and Economy, #6 North Longzihu Rd, Zhengzhou 450000, China.
| | - Nor Aniza Abdullah
- Faculty of Computer Science & Information Technology, Universiti Malaya, Kuala Lumpur, 50603 Malaysia.
| | - Aznul Qalid Md Sabri
- Faculty of Computer Science & Information Technology, Universiti Malaya, Kuala Lumpur, 50603 Malaysia.
| |
Collapse
|
5
|
Ramos-Medina MJ, Echeverría-Garcés G, Kyriakidis NC, León Cáceres Á, Ortiz-Prado E, Bautista J, Pérez-Meza ÁA, Abad-Sojos A, Nieto-Jaramillo K, Espinoza-Ferrao S, Ocaña-Paredes B, López-Cortés A. CardiOmics signatures reveal therapeutically actionable targets and drugs for cardiovascular diseases. Heliyon 2024; 10:e23682. [PMID: 38187312 PMCID: PMC10770621 DOI: 10.1016/j.heliyon.2023.e23682] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2023] [Revised: 11/27/2023] [Accepted: 12/09/2023] [Indexed: 01/09/2024] Open
Abstract
Cardiovascular diseases are the leading cause of death worldwide, with heart failure being a complex condition that affects millions of individuals. Single-nucleus RNA sequencing has recently emerged as a powerful tool for unraveling the molecular mechanisms behind cardiovascular diseases. This cutting-edge technology enables the identification of molecular signatures, intracellular networks, and spatial relationships among cardiac cells, including cardiomyocytes, mast cells, lymphocytes, macrophages, lymphatic endothelial cells, endocardial cells, endothelial cells, epicardial cells, adipocytes, fibroblasts, neuronal cells, pericytes, and vascular smooth muscle cells. Despite these advancements, the discovery of essential therapeutic targets and drugs for precision cardiology remains a challenge. To bridge this gap, we conducted comprehensive in silico analyses of single-nucleus RNA sequencing data, functional enrichment, protein interactome network, and identification of the shortest pathways to physiological phenotypes. This integrated multi-omics analysis generated CardiOmics signatures, which allowed us to pinpoint three therapeutically actionable targets (ADRA1A1, PPARG, and ROCK2) and 15 effective drugs, including adrenergic receptor agonists, adrenergic receptor antagonists, norepinephrine precursors, PPAR receptor agonists, and Rho-associated kinase inhibitors, involved in late-stage cardiovascular disease clinical trials.
Collapse
Affiliation(s)
- María José Ramos-Medina
- German Cancer Research Center (DKFZ), Faculty of Biosciences, Heidelberg University, Heidelberg, Germany
| | - Gabriela Echeverría-Garcés
- Centro de Referencia Nacional de Genómica, Secuenciación y Bioinformática, Instituto Nacional de Investigación en Salud Pública “Leopoldo Izquieta Pérez”, Quito, Ecuador
- Latin American Network for the Implementation and Validation of Clinical Pharmacogenomics Guidelines (RELIVAF-CYTED), Santiago, Chile
| | - Nikolaos C. Kyriakidis
- Cancer Research Group (CRG), Faculty of Medicine, Universidad de Las Américas, Quito, Ecuador
| | - Ángela León Cáceres
- Heidelberg Institute of Global Health, Faculty of Medicine, University of Heidelberg, Heidelberg, Germany
- Instituto de Salud Pública, Facultad de Medicina, Pontificia Universidad Católica del Ecuador, Quito, Ecuador
| | - Esteban Ortiz-Prado
- One Health Research Group, Faculty of Medicine, Universidad de Las Américas, Quito, Ecuador
| | - Jhommara Bautista
- Cancer Research Group (CRG), Faculty of Medicine, Universidad de Las Américas, Quito, Ecuador
| | - Álvaro A. Pérez-Meza
- Escuela de Medicina, Colegio de Ciencias de La Salud COCSA, Universidad San Francisco de Quito USFQ, Quito, Ecuador
| | | | - Karol Nieto-Jaramillo
- School of Biological Sciences and Engineering, Yachay Tech University, Urcuqui, Ecuador
| | | | - Belén Ocaña-Paredes
- Cancer Research Group (CRG), Faculty of Medicine, Universidad de Las Américas, Quito, Ecuador
| | - Andrés López-Cortés
- Cancer Research Group (CRG), Faculty of Medicine, Universidad de Las Américas, Quito, Ecuador
| |
Collapse
|
6
|
Giordano M, Falbo E, Maddalena L, Piccirillo M, Granata I. Untangling the Context-Specificity of Essential Genes by Means of Machine Learning: A Constructive Experience. Biomolecules 2023; 14:18. [PMID: 38254618 PMCID: PMC10813179 DOI: 10.3390/biom14010018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2023] [Revised: 11/29/2023] [Accepted: 12/20/2023] [Indexed: 01/24/2024] Open
Abstract
Gene essentiality is a genetic concept crucial for a comprehensive understanding of life and evolution. In the last decade, many essential genes (EGs) have been determined using different experimental and computational approaches, and this information has been used to reduce the genomes of model organisms. A growing amount of evidence highlights that essentiality is a property that depends on the context. Because of their importance in vital biological processes, recognising context-specific EGs (csEGs) could help for identifying new potential pharmacological targets and to improve precision therapeutics. Since most of the computational procedures proposed to identify and predict EGs neglect their context-specificity, we focused on this aspect, providing a theoretical and experimental overview of the literature, data and computational methods dedicated to recognising csEGs. To this end, we adapted existing computational methods to exploit a specific context (the kidney tissue) and experimented with four different prediction methods using the labels provided by four different identification approaches. The considerations derived from the analysis of the obtained results, confirmed and validated also by further experiments for a different tissue context, provide the reader with guidance on exploiting existing tools for achieving csEGs identification and prediction.
Collapse
Affiliation(s)
- Maurizio Giordano
- Institute for High-Performance Computing and Networking (ICAR), National Research Council (CNR), V. Pietro Castellino 111, 80131 Naples, Italy; (E.F.); (L.M.); (M.P.); (I.G.)
| | | | | | | | | |
Collapse
|
7
|
Gross J, Knipper M, Mazurek B. Candidate Key Proteins in Tinnitus: A Bioinformatic Study of Synaptic Transmission in Spiral Ganglion Neurons. Cell Mol Neurobiol 2023; 43:4189-4207. [PMID: 37736859 PMCID: PMC10661836 DOI: 10.1007/s10571-023-01405-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2023] [Accepted: 08/24/2023] [Indexed: 09/23/2023]
Abstract
To study key proteins associated with changes in synaptic transmission in the spiral ganglion in tinnitus, we build three gene lists from the GeneCard database: 1. Perception of sound (PoS), 2. Acoustic stimulation (AcouStim), and 3. Tinnitus (Tin). Enrichment analysis by the DAVID database resulted in similar Gene Ontology (GO) terms for cellular components in all gene lists, reflecting synaptic structures known to be involved in auditory processing. The STRING protein-protein interaction (PPI) network and the Cytoscape data analyzer were used to identify the top two high-degree proteins (HDPs) and their high-score interaction proteins (HSIPs) identified by the combined score (CS) of the corresponding edges. The top two protein pairs (key proteins) for the PoS are BDNF-GDNF and OTOF-CACNA1D and for the AcouStim process BDNF-NTRK2 and TH-CALB1. The Tin process showed BDNF and NGF as HDPs, with high-score interactions with NTRK1 and NGFR at a comparable level. Compared to the PoS and AcouStim process, the number of HSIPs of key proteins (CS > 90. percentile) increases strongly in Tin. In the PoS and AcouStim networks, BDNF receptor signaling is the dominant pathway, and in the Tin network, the NGF-signaling pathway is of similar importance. Key proteins and their HSIPs are good indicators of biological processes and of signaling pathways characteristic for the normal hearing on the one hand and tinnitus on the other.
Collapse
Affiliation(s)
- Johann Gross
- Tinnitus Center, Charité-Universitätsmedizin Berlin, Berlin, Germany.
- Leibniz Society of Science Berlin, Berlin, Germany.
| | - Marlies Knipper
- Department of Otolaryngology, Head and Neck Surgery, Tübingen Hearing Research Center (THRC), Molecular Physiology of Hearing, University of Tübingen, Tübingen, Germany
- Leibniz Society of Science Berlin, Berlin, Germany
| | - Birgit Mazurek
- Tinnitus Center, Charité-Universitätsmedizin Berlin, Berlin, Germany
| |
Collapse
|
8
|
Marino A, Sinaimeri B, Tronci E, Calamoneri T. STARGATE-X: a Python package for statistical analysis on the REACTOME network. J Integr Bioinform 2023; 20:jib-2022-0029. [PMID: 37732505 PMCID: PMC10757075 DOI: 10.1515/jib-2022-0029] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2022] [Accepted: 01/24/2023] [Indexed: 09/22/2023] Open
Abstract
Many important aspects of biological knowledge at the molecular level can be represented by pathways. Through their analysis, we gain mechanistic insights and interpret lists of interesting genes from experiments (usually omics and functional genomic experiments). As a result, pathways play a central role in the development of bioinformatics methods and tools for computing predictions from known molecular-level mechanisms. Qualitative as well as quantitative knowledge about pathways can be effectively represented through biochemical networks linking the biochemical reactions and the compounds (e.g., proteins) occurring in the considered pathways. So, repositories providing biochemical networks for known pathways play a central role in bioinformatics and in systems biology. Here we focus on Reactome, a free, comprehensive, and widely used repository for biochemical networks and pathways. In this paper, we: (1) introduce a tool StARGate-X (STatistical Analysis of the Reactome multi-GrAph Through nEtworkX) to carry out an automated analysis of the connectivity properties of Reactome biochemical reaction network and of its biological hierarchy (i.e., cell compartments, namely, the closed parts within the cytosol, usually surrounded by a membrane); the code is freely available at https://github.com/marinoandrea/stargate-x; (2) show the effectiveness of our tool by providing an analysis of the Reactome network, in terms of centrality measures, with respect to in- and out-degree. As an example of usage of StARGate-X, we provide a detailed automated analysis of the Reactome network, in terms of centrality measures. We focus both on the subgraphs induced by single compartments and on the graph whose nodes are the strongly connected components. To the best of our knowledge, this is the first freely available tool that enables automatic analysis of the large biochemical network within Reactome through easy-to-use APIs (Application Programming Interfaces).
Collapse
Affiliation(s)
- Andrea Marino
- Computer Science Department, Sapienza University of Rome, Rome, Italy
| | | | - Enrico Tronci
- Computer Science Department, Sapienza University of Rome, Rome, Italy
| | | |
Collapse
|
9
|
Saberi F, Dehghan Z, Noori E, Zali H. Identification of Renal Transplantation Rejection Biomarkers in Blood Using the Systems Biology Approach. IRANIAN BIOMEDICAL JOURNAL 2023; 27:375-87. [PMID: 38224029 PMCID: PMC10826908 DOI: 10.52547/ibj.3871] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Received: 12/03/2022] [Accepted: 08/19/2023] [Indexed: 01/16/2024]
Abstract
Background Renal transplantation plays an essential role in the quality of life of patients with end-stage renal disease. At least 12% of the renal patients receiving transplantations show graft rejection. One of the methods used to diagnose renal transplantation rejection is renal allograft biopsy. This procedure is associated with some risks such as bleeding and arteriovenous fistula formation. In this study, we applied a bioinformatics approach to identify serum markers for graft rejection in patients receiving a renal transplantation. Methods Transcriptomic data were first retrieved from the blood of renal transplantation rejection patients using the GEO database. The data were then used to construct the protein-protein interaction and gene regulatory networks using Cytoscape software. Next, network analysis was performed to identify hub-bottlenecks, and key blood markers involved in renal graft rejection. Lastly, the gene ontology and functional pathways related to hub-bottlenecks were detected using PANTHER and DAVID servers. Results In PPIN and GRN, SYNCRIP, SQSTM1, GRAMD1A, FAM104A, ND2, TPGS2, ZNF652, RORA, and MALAT1 were the identified critical genes. In GRN, miR-155, miR17, miR146b, miR-200 family, and GATA2 were the factors that regulated critical genes. The MAPK, neurotrophin, and TNF signaling pathways, IL-17, and human cytomegalovirus infection, human papillomavirus infection, and shigellosis were identified as significant pathways involved in graft rejection. Concusion The above-mentioned genes can be used as diagnostic and therapeutic serum markers of transplantation rejection in renal patients. The newly predicted biomarkers and pathways require further studies.
Collapse
Affiliation(s)
- Fatemeh Saberi
- Student Research Committee, Department of Medical Biotechnology, School of Advanced Technologies in Medicine, Shahid Beheshti University of Medical Sciences, Tehran, Iran
- Cellular and Molecular Biology Research Center, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Zeinab Dehghan
- Department of Comparative Biomedical Sciences, School of Advanced Medical Sciences and Technologies, Shiraz University of Medical Sciences, Shiraz, Iran
| | - Effat Noori
- Student Research Committee, Department of Medical Biotechnology, School of Advanced Technologies in Medicine, Shahid Beheshti University of Medical Sciences, Tehran, Iran
- Cellular and Molecular Biology Research Center, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Hakimeh Zali
- Department of Tissue Engineering and Applied Cell Sciences, School of Advanced Technologies in Medicine, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| |
Collapse
|
10
|
Hemedan AA, Schneider R, Ostaszewski M. Applications of Boolean modeling to study the dynamics of a complex disease and therapeutics responses. FRONTIERS IN BIOINFORMATICS 2023; 3:1189723. [PMID: 37325771 PMCID: PMC10267406 DOI: 10.3389/fbinf.2023.1189723] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2023] [Accepted: 05/18/2023] [Indexed: 06/17/2023] Open
Abstract
Computational modeling has emerged as a critical tool in investigating the complex molecular processes involved in biological systems and diseases. In this study, we apply Boolean modeling to uncover the molecular mechanisms underlying Parkinson's disease (PD), one of the most prevalent neurodegenerative disorders. Our approach is based on the PD-map, a comprehensive molecular interaction diagram that captures the key mechanisms involved in the initiation and progression of PD. Using Boolean modeling, we aim to gain a deeper understanding of the disease dynamics, identify potential drug targets, and simulate the response to treatments. Our analysis demonstrates the effectiveness of this approach in uncovering the intricacies of PD. Our results confirm existing knowledge about the disease and provide valuable insights into the underlying mechanisms, ultimately suggesting potential targets for therapeutic intervention. Moreover, our approach allows us to parametrize the models based on omics data for further disease stratification. Our study highlights the value of computational modeling in advancing our understanding of complex biological systems and diseases, emphasizing the importance of continued research in this field. Furthermore, our findings have potential implications for the development of novel therapies for PD, which is a pressing public health concern. Overall, this study represents a significant step forward in the application of computational modeling to the investigation of neurodegenerative diseases, and underscores the power of interdisciplinary approaches in tackling challenging biomedical problems.
Collapse
|
11
|
Toffali L, D'Ulivo B, Giagulli C, Montresor A, Zenaro E, Delledonne M, Rossato M, Iadarola B, Sbarbati A, Bernardi P, Angelini G, Rossi B, Lopez N, Linke WA, Unger A, Di Silvestre D, Benazzi L, De Palma A, Motta S, Constantin G, Mauri P, Laudanna C. An isoform of the giant protein titin is a master regulator of human T lymphocyte trafficking. Cell Rep 2023; 42:112516. [PMID: 37204926 DOI: 10.1016/j.celrep.2023.112516] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2022] [Revised: 03/24/2023] [Accepted: 05/01/2023] [Indexed: 05/21/2023] Open
Abstract
Response to multiple microenvironmental cues and resilience to mechanical stress are essential features of trafficking leukocytes. Here, we describe unexpected role of titin (TTN), the largest protein encoded by the human genome, in the regulation of mechanisms of lymphocyte trafficking. Human T and B lymphocytes express five TTN isoforms, exhibiting cell-specific expression, distinct localization to plasma membrane microdomains, and different distribution to cytosolic versus nuclear compartments. In T lymphocytes, the LTTN1 isoform governs the morphogenesis of plasma membrane microvilli independently of ERM protein phosphorylation status, thus allowing selectin-mediated capturing and rolling adhesions. Likewise, LTTN1 controls chemokine-triggered integrin activation. Accordingly, LTTN1 mediates rho and rap small GTPases activation, but not actin polymerization. In contrast, chemotaxis is facilitated by LTTN1 degradation. Finally, LTTN1 controls resilience to passive cell deformation and ensures T lymphocyte survival in the blood stream. LTTN1 is, thus, a critical and versatile housekeeping regulator of T lymphocyte trafficking.
Collapse
Affiliation(s)
- Lara Toffali
- Department of Medicine, Division of General Pathology, Laboratory of Cell Trafficking and Signal Transduction, University of Verona; 37134 Verona, Veneto, Italy
| | - Beatrice D'Ulivo
- Department of Medicine, Division of General Pathology, Laboratory of Cell Trafficking and Signal Transduction, University of Verona; 37134 Verona, Veneto, Italy
| | - Cinzia Giagulli
- Department of Molecular and Translational Medicine, University of Brescia; 25123 Brescia, Lombardia, Italy
| | - Alessio Montresor
- Department of Medicine, Division of General Pathology, Laboratory of Cell Trafficking and Signal Transduction, University of Verona; 37134 Verona, Veneto, Italy; The Center for Biomedical Computing (CBMC), University of Verona; 37134 Verona, Veneto, Italy
| | - Elena Zenaro
- Department of Medicine, Division of General Pathology, Laboratory of Cell Trafficking and Signal Transduction, University of Verona; 37134 Verona, Veneto, Italy
| | - Massimo Delledonne
- Department of Biotechnology, University of Verona; 37134 Verona, Veneto, Italy
| | - Marzia Rossato
- Department of Biotechnology, University of Verona; 37134 Verona, Veneto, Italy
| | - Barbara Iadarola
- Department of Biotechnology, University of Verona; 37134 Verona, Veneto, Italy
| | - Andrea Sbarbati
- Department of Neurosciences, Biomedicine and Movement Sciences, University of Verona; 37134 Verona, Veneto, Italy
| | - Paolo Bernardi
- Department of Neurosciences, Biomedicine and Movement Sciences, University of Verona; 37134 Verona, Veneto, Italy
| | - Gabriele Angelini
- Department of Medicine, Division of General Pathology, Laboratory of Cell Trafficking and Signal Transduction, University of Verona; 37134 Verona, Veneto, Italy
| | - Barbara Rossi
- Department of Medicine, Division of General Pathology, Laboratory of Cell Trafficking and Signal Transduction, University of Verona; 37134 Verona, Veneto, Italy
| | - Nicola Lopez
- Department of Medicine, Division of General Pathology, Laboratory of Cell Trafficking and Signal Transduction, University of Verona; 37134 Verona, Veneto, Italy
| | - Wolfgang A Linke
- Institute of Physiology II, University of Muenster, and Heart Center, University Medicine; 37075 Göttingen, Germany
| | - Andreas Unger
- Institute of Physiology II, University of Muenster, and Heart Center, University Medicine; 37075 Göttingen, Germany
| | - Dario Di Silvestre
- Institute of Biomedical Technologies (ITB) CNR; 20090 Milan, Lombardia, Italy
| | - Louise Benazzi
- Institute of Biomedical Technologies (ITB) CNR; 20090 Milan, Lombardia, Italy
| | - Antonella De Palma
- Institute of Biomedical Technologies (ITB) CNR; 20090 Milan, Lombardia, Italy
| | - Sara Motta
- Institute of Biomedical Technologies (ITB) CNR; 20090 Milan, Lombardia, Italy
| | - Gabriela Constantin
- Department of Medicine, Division of General Pathology, Laboratory of Cell Trafficking and Signal Transduction, University of Verona; 37134 Verona, Veneto, Italy; The Center for Biomedical Computing (CBMC), University of Verona; 37134 Verona, Veneto, Italy
| | - Pierluigi Mauri
- Institute of Biomedical Technologies (ITB) CNR; 20090 Milan, Lombardia, Italy
| | - Carlo Laudanna
- Department of Medicine, Division of General Pathology, Laboratory of Cell Trafficking and Signal Transduction, University of Verona; 37134 Verona, Veneto, Italy; The Center for Biomedical Computing (CBMC), University of Verona; 37134 Verona, Veneto, Italy.
| |
Collapse
|
12
|
Weiskittel TM, Cao A, Meng-Lin K, Lehmann Z, Feng B, Correia C, Zhang C, Wisniewski P, Zhu S, Yong Ung C, Li H. Network Biology-Inspired Machine Learning Features Predict Cancer Gene Targets and Reveal Target Coordinating Mechanisms. Pharmaceuticals (Basel) 2023; 16:752. [PMID: 37242535 PMCID: PMC10223789 DOI: 10.3390/ph16050752] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2023] [Revised: 05/08/2023] [Accepted: 05/11/2023] [Indexed: 05/28/2023] Open
Abstract
Anticipating and understanding cancers' need for specific gene activities is key for novel therapeutic development. Here we utilized DepMap, a cancer gene dependency screen, to demonstrate that machine learning combined with network biology can produce robust algorithms that both predict what genes a cancer is dependent on and what network features coordinate such gene dependencies. Using network topology and biological annotations, we constructed four groups of novel engineered machine learning features that produced high accuracies when predicting binary gene dependencies. We found that in all examined cancer types, F1 scores were greater than 0.90, and model accuracy remained robust under multiple hyperparameter tests. We then deconstructed these models to identify tumor type-specific coordinators of gene dependency and identified that in certain cancers, such as thyroid and kidney, tumors' dependencies are highly predicted by gene connectivity. In contrast, other histologies relied on pathway-based features such as lung, where gene dependencies were highly predictive by associations with cell death pathway genes. In sum, we show that biologically informed network features can be a valuable and robust addition to predictive pharmacology models while simultaneously providing mechanistic insights.
Collapse
Affiliation(s)
- Taylor M. Weiskittel
- Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic College of Medicine and Science, Rochester, MN 55905, USA; (T.M.W.)
- Mayo Clinic Alix School of Medicine, Mayo Clinic College of Medicine and Science, Rochester, MN 55905, USA
| | - Andrew Cao
- Department of Computer Science, Duke University, Durham, NC 27708, USA
| | - Kevin Meng-Lin
- Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic College of Medicine and Science, Rochester, MN 55905, USA; (T.M.W.)
| | - Zachary Lehmann
- Department of Chemistry, Biochemistry and Physics, South Dakota State University, Brookings, SD 57006, USA
| | - Benjamin Feng
- Department of Molecular Cell and Developmental Biology, University of California, Los Angeles, CA 90095, USA
| | - Cristina Correia
- Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic College of Medicine and Science, Rochester, MN 55905, USA; (T.M.W.)
| | - Cheng Zhang
- Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic College of Medicine and Science, Rochester, MN 55905, USA; (T.M.W.)
| | - Philip Wisniewski
- Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic College of Medicine and Science, Rochester, MN 55905, USA; (T.M.W.)
| | - Shizhen Zhu
- Department of Biochemistry and Molecular Biology, Mayo Clinic College of Medicine and Science, Rochester, MN 55905, USA
| | - Choong Yong Ung
- Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic College of Medicine and Science, Rochester, MN 55905, USA; (T.M.W.)
| | - Hu Li
- Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic College of Medicine and Science, Rochester, MN 55905, USA; (T.M.W.)
| |
Collapse
|
13
|
Dai S, Liu S, Zhou C, Yu F, Zhu G, Zhang W, Deng H, Burlingame A, Yu W, Wang T, Li N. Capturing the hierarchically assorted modules of protein-protein interactions in the organized nucleome. MOLECULAR PLANT 2023; 16:930-961. [PMID: 36960533 DOI: 10.1016/j.molp.2023.03.013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/14/2022] [Revised: 02/16/2023] [Accepted: 03/21/2023] [Indexed: 05/04/2023]
Abstract
Nuclear proteins are major constituents and key regulators of nucleome topological organization and manipulators of nuclear events. To decipher the global connectivity of nuclear proteins and the hierarchically organized modules of their interactions, we conducted two rounds of cross-linking mass spectrometry (XL-MS) analysis, one of which followed a quantitative double chemical cross-linking mass spectrometry (in vivoqXL-MS) workflow, and identified 24,140 unique crosslinks in total from the nuclei of soybean seedlings. This in vivo quantitative interactomics enabled the identification of 5340 crosslinks that can be converted into 1297 nuclear protein-protein interactions (PPIs), 1220 (94%) of which were non-confirmative (or novel) nuclear PPIs compared with those in repositories. There were 250 and 26 novel interactors of histones and the nucleolar box C/D small nucleolar ribonucleoprotein complex, respectively. Modulomic analysis of orthologous Arabidopsis PPIs produced 27 and 24 master nuclear PPI modules (NPIMs) that contain the condensate-forming protein(s) and the intrinsically disordered region-containing proteins, respectively. These NPIMs successfully captured previously reported nuclear protein complexes and nuclear bodies in the nucleus. Surprisingly, these NPIMs were hierarchically assorted into four higher-order communities in a nucleomic graph, including genome and nucleolus communities. This combinatorial pipeline of 4C quantitative interactomics and PPI network modularization revealed 17 ethylene-specific module variants that participate in a broad range of nuclear events. The pipeline was able to capture both nuclear protein complexes and nuclear bodies, construct the topological architectures of PPI modules and module variants in the nucleome, and probably map the protein compositions of biomolecular condensates.
Collapse
Affiliation(s)
- Shuaijian Dai
- Department of Chemical and Biological Engineering, The Hong Kong University of Science and Technology, Hong Kong, China
| | - Shichang Liu
- Division of Life Science, The Hong Kong University of Science and Technology, Hong Kong SAR, China
| | - Chen Zhou
- Department of Electronic and Computer Engineering, The Hong Kong University of Science and Technology, Hong Kong SAR, China
| | - Fengchao Yu
- Department of Electronic and Computer Engineering, The Hong Kong University of Science and Technology, Hong Kong SAR, China
| | - Guang Zhu
- Division of Life Science, The Hong Kong University of Science and Technology, Hong Kong SAR, China
| | - Wenhao Zhang
- Tsinghua-Peking Joint Centre for Life Sciences, Centre for Structural Biology, School of Life Sciences and School of Medicine, Tsinghua University, Beijing 100084, China
| | - Haiteng Deng
- Tsinghua-Peking Joint Centre for Life Sciences, Centre for Structural Biology, School of Life Sciences and School of Medicine, Tsinghua University, Beijing 100084, China
| | - Al Burlingame
- Department of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco, CA, USA
| | - Weichuan Yu
- The HKUST Shenzhen-Hong Kong Collaborative Innovation Research Institute, Futian, Shenzhen, Guangdong 518057, China; Department of Electronic and Computer Engineering, The Hong Kong University of Science and Technology, Hong Kong SAR, China.
| | - Tingliang Wang
- Tsinghua-Peking Joint Centre for Life Sciences, Centre for Structural Biology, School of Life Sciences and School of Medicine, Tsinghua University, Beijing 100084, China.
| | - Ning Li
- Division of Life Science, The Hong Kong University of Science and Technology, Hong Kong SAR, China; Department of Chemical and Biological Engineering, The Hong Kong University of Science and Technology, Hong Kong, China; The HKUST Shenzhen-Hong Kong Collaborative Innovation Research Institute, Futian, Shenzhen, Guangdong 518057, China.
| |
Collapse
|
14
|
Das M, Selvakumar K, Alphonse PJA. Analyzing and Comparing Omicron Lineage Variants Protein–Protein Interaction Network Using Centrality Measure. SN COMPUTER SCIENCE 2023; 4:299. [PMID: 37016628 PMCID: PMC10062270 DOI: 10.1007/s42979-023-01685-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/30/2022] [Accepted: 01/10/2023] [Indexed: 03/31/2023]
Abstract
The Worldwide spread of the Omicron lineage variants has now been confirmed. It is crucial to understand the process of cellular life and to discover new drugs need to identify the important proteins in a protein interaction network (PPIN). PPINs are often represented by graphs in bioinformatics, which describe cell processes. There are some proteins that have significant influences on these tissues, and which play a crucial role in regulating them. The discovery of new drugs is aided by the study of significant proteins. These significant proteins can be found by reducing the graph and using graph analysis. Studies examining protein interactions in the Omicron lineage (B.1.1.529) and its variants (BA.5, BA.4, BA.3, BA.2, BA.1.1, BA.1) are not yet available. Studying Omicron has been intended to find a significant protein. 68 nodes represent 68 proteins and 52 edges represent the relationship among the protein in the network. A few centrality measures are computed namely page rank centrality (PRC), degree centrality (DC), closeness centrality (CC), and betweenness centrality (BC) together with node degree and Local clustering coefficient (LCC). We also discover 18 network clusters using Markov clustering. 8 significant proteins (candidate gene of Omicron lineage variants) were detected among the 68 proteins, including AHSG, KCNK1, KCNQ1, MAPT, NR1H4, PSMC2, PTPN11 and, UBE21 which scored the highest among the Omicron proteins. It is found that in the variant of Omicron protein-protein interaction networks, the MAPT protein's impact is the most significant.
Collapse
|
15
|
Sadeghi M, Karimi MR, Karimi AH, Ghorbanpour Farshbaf N, Barzegar A, Schmitz U. Network-Based and Machine-Learning Approaches Identify Diagnostic and Prognostic Models for EMT-Type Gastric Tumors. Genes (Basel) 2023; 14:genes14030750. [PMID: 36981021 PMCID: PMC10048224 DOI: 10.3390/genes14030750] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2023] [Revised: 03/10/2023] [Accepted: 03/14/2023] [Indexed: 03/30/2023] Open
Abstract
The microsatellite stable/epithelial-mesenchymal transition (MSS/EMT) subtype of gastric cancer represents a highly aggressive class of tumors associated with low rates of survival and considerably high probabilities of recurrence. In the era of precision medicine, the accurate and prompt diagnosis of tumors of this subtype is of vital importance. In this study, we used Weighted Gene Co-expression Network Analysis (WGCNA) to identify a differentially expressed co-expression module of mRNAs in EMT-type gastric tumors. Using network analysis and linear discriminant analysis, we identified mRNA motifs and microRNA-based models with strong prognostic and diagnostic relevance: three models comprised of (i) the microRNAs miR-199a-5p and miR-141-3p, (ii) EVC/EVC2/GLI3, and (iii) PDE2A/GUCY1A1/GUCY1B1 gene expression profiles distinguish EMT-type tumors from other gastric tumors with high accuracy (Area Under the Receiver Operating Characteristic Curve (AUC) = 0.995, AUC = 0.9742, and AUC = 0.9717; respectively). Additionally, the DMD/ITGA1/CAV1 motif was identified as the top motif with consistent relevance to prognosis (hazard ratio > 3). Molecular functions of the members of the identified models highlight the central roles of MAPK, Hh, and cGMP/cAMP signaling in the pathology of the EMT subtype of gastric cancer and underscore their potential utility in precision therapeutic approaches.
Collapse
Affiliation(s)
- Mehdi Sadeghi
- Department of Cell & Molecular Biology, Semnan University, Semnan 3513119111, Iran
| | - Mohammad Reza Karimi
- Department of Cell & Molecular Biology, Semnan University, Semnan 3513119111, Iran
| | - Amir Hossein Karimi
- Department of Cell & Molecular Biology, Semnan University, Semnan 3513119111, Iran
| | | | - Abolfazl Barzegar
- Department of Biology, Faculty of Natural Science, University of Tabriz, Tabriz 5166616471, Iran
| | - Ulf Schmitz
- Department of Molecular & Cell Biology, James Cook University, Townsville, QLD 4811, Australia
- Centre for Tropical Bioinformatics and Molecular Biology, Australian Institute of Tropical Health and Medicine, James Cook University, Cairns, QLD 4878, Australia
| |
Collapse
|
16
|
Wang X, Wang H, Yin G, Zhang YD. Network-based drug repurposing for the treatment of COVID-19 patients in different clinical stages. Heliyon 2023; 9:e14059. [PMID: 36855680 PMCID: PMC9951095 DOI: 10.1016/j.heliyon.2023.e14059] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2022] [Revised: 02/13/2023] [Accepted: 02/20/2023] [Indexed: 03/02/2023] Open
Abstract
In the severe acute respiratory coronavirus disease 2019 (COVID-19) pandemic, there is an urgent need to develop effective treatments. Through a network-based drug repurposing approach, several effective drug candidates are identified for treating COVID-19 patients in different clinical stages. The proposed approach takes advantage of computational prediction methods by integrating publicly available clinical transcriptome and experimental data. We identify 51 drugs that regulate proteins interacted with SARS-CoV-2 protein through biological pathways against COVID-19, some of which have been experimented in clinical trials. Among the repurposed drug candidates, lovastatin leads to differential gene expression in clinical transcriptome for mild COVID-19 patients, and estradiol cypionate mainly regulates hormone-related biological functions to treat severe COVID-19 patients. Multi-target mechanisms of drug candidates are also explored. Erlotinib targets the viral protein interacted with cytokine and cytokine receptors to affect SARS-CoV-2 attachment and invasion. Lovastatin and testosterone block the angiotensin system to suppress the SARS-CoV-2 infection. In summary, our study has identified effective drug candidates against COVID-19 for patients in different clinical stages and provides comprehensive understanding of potential drug mechanisms.
Collapse
Affiliation(s)
- Xin Wang
- Department of Statistics and Actuarial Science, The University of Hong Kong, Hong Kong SAR, China
| | - Han Wang
- Department of Statistics and Actuarial Science, The University of Hong Kong, Hong Kong SAR, China
| | - Guosheng Yin
- Department of Statistics and Actuarial Science, The University of Hong Kong, Hong Kong SAR, China.,Department of Mathematics, Imperial College London, London, The United Kingdom
| | - Yan Dora Zhang
- Department of Statistics and Actuarial Science, The University of Hong Kong, Hong Kong SAR, China.,Centre for PanorOmic Sciences, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China
| |
Collapse
|
17
|
Fasano M, Alberio T. Neurodegenerative disorders: From clinicopathology convergence to systems biology divergence. HANDBOOK OF CLINICAL NEUROLOGY 2023; 192:73-86. [PMID: 36796949 DOI: 10.1016/b978-0-323-85538-9.00007-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/16/2023]
Abstract
Neurodegenerative diseases are multifactorial. This means that several genetic, epigenetic, and environmental factors contribute to their emergence. Therefore, for the future management of these highly prevalent diseases, it is necessary to change perspective. If a holistic viewpoint is assumed, the phenotype (the clinicopathological convergence) emerges from the perturbation of a complex system of functional interactions among proteins (systems biology divergence). The systems biology top-down approach starts with the unbiased collection of sets of data generated through one or more -omics techniques and has the aim to identify the networks and the components that participate in the generation of a phenotype (disease), often without any available a priori knowledge. The principle behind the top-down method is that the molecular components that respond similarly to experimental perturbations are somehow functionally related. This allows the study of complex and relatively poorly characterized diseases without requiring extensive knowledge of the processes under investigation. In this chapter, the use of a global approach will be applied to the comprehension of neurodegeneration, with a particular focus on the two most prevalent ones, Alzheimer's and Parkinson's diseases. The final purpose is to distinguish disease subtypes (even with similar clinical manifestations) to launch a future of precision medicine for patients with these disorders.
Collapse
Affiliation(s)
- Mauro Fasano
- Department of Science and High Technology, University of Insubria, Busto Arsizio and Como, Italy; Center of Neuroscience, University of Insubria, Busto Arsizio and Como, Italy.
| | - Tiziana Alberio
- Department of Science and High Technology, University of Insubria, Busto Arsizio and Como, Italy; Center of Neuroscience, University of Insubria, Busto Arsizio and Como, Italy
| |
Collapse
|
18
|
Centrality measures in fuzzy social networks. INFORM SYST 2023. [DOI: 10.1016/j.is.2023.102179] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]
|
19
|
Identifying Tumor-Associated Genes from Bilayer Networks of DNA Methylation Sites and RNAs. LIFE (BASEL, SWITZERLAND) 2022; 13:life13010076. [PMID: 36676027 PMCID: PMC9861397 DOI: 10.3390/life13010076] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/10/2022] [Revised: 12/21/2022] [Accepted: 12/21/2022] [Indexed: 12/29/2022]
Abstract
Network theory has attracted much attention from the biological community because of its high efficacy in identifying tumor-associated genes. However, most researchers have focused on single networks of single omics, which have less predictive power. With the available multiomics data, multilayer networks can now be used in molecular research. In this study, we achieved this with the construction of a bilayer network of DNA methylation sites and RNAs. We applied the network model to five types of tumor data to identify key genes associated with tumors. Compared with the single network, the proposed bilayer network resulted in more tumor-associated DNA methylation sites and genes, which we verified with prognostic and KEGG enrichment analyses.
Collapse
|
20
|
Ayala-Ruano S, Marrero-Ponce Y, Aguilera-Mendoza L, Pérez N, Agüero-Chapin G, Antunes A, Aguilar AC. Network Science and Group Fusion Similarity-Based Searching to Explore the Chemical Space of Antiparasitic Peptides. ACS OMEGA 2022; 7:46012-46036. [PMID: 36570318 PMCID: PMC9773354 DOI: 10.1021/acsomega.2c03398] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/31/2022] [Accepted: 11/21/2022] [Indexed: 05/13/2023]
Abstract
Antimicrobial peptides (AMPs) have appeared as promising compounds to treat a wide range of diseases. Their clinical potentialities reside in the wide range of mechanisms they can use for both killing microbes and modulating immune responses. However, the hugeness of the AMPs' chemical space (AMPCS), represented by more than 1065 unique sequences, has represented a big challenge for the discovery of new promising therapeutic peptides and for the identification of common structural motifs. Here, we introduce network science and a similarity searching approach to discover new promising AMPs, specifically antiparasitic peptides (APPs). We exploited the network-based representation of APPs' chemical space (APPCS) to retrieve valuable information by using three network types: chemical space (CSN), half-space proximal (HSPN), and metadata (METN). Some centrality measures were applied to identify in each network the most important and nonredundant peptides. Then, these central peptides were considered as queries (Qs) in group fusion similarity-based searches against a comprehensive collection of known AMPs, stored in the graph database StarPepDB, to propose new potential APPs. The performance of the resulting multiquery similarity-based search models (mQSSMs) was evaluated in five benchmarking data sets of APP/non-APPs. The predictions performed by the best mQSSM showed a strong-to-very-strong performance since their external Matthews correlation coefficient (MCC) values ranged from 0.834 to 0.965. Outstanding MCC values (>0.85) were attained by the mQSSM with 219 Qs from both networks CSN and HSPN with 0.5 as similarity threshold in external data sets. Then, the performance of our best mQSSM was compared with the APPs prediction servers AMPDiscover and AMPFun. The proposed model showed its relevance by outperforming state-of-the-art machine learning models to predict APPs. After applying the best mQSSM and additional filters on the non-APP space from StarPepDB, 95 AMPs were repurposed as potential APP hits. Due to the high sequence diversity of these peptides, different computational approaches were applied to identify relevant motifs for searching and designing new APPs. Lastly, we identified 11 promising APP lead candidates by using our best mQSSMs together with diversity-based network analyses, and 24 web servers for activity/toxicity and drug-like properties. These results support that network-based similarity searches can be an effective and reliable strategy to identify APPs. The proposed models and pipeline are freely available through the StarPep toolbox software at http://mobiosd-hub.com/starpep.
Collapse
Affiliation(s)
- Sebastián Ayala-Ruano
- Grupo
de Medicina Molecular y Traslacional (MeM&T), Escuela de Medicina,
Colegio de Ciencias de la Salud (COCSA), Universidad San Francisco de Quito, Av. Interoceánica Km 12 1/2 y Av. Florencia, Quito 17-1200-841, Ecuador
- Colegio
de Ciencias e Ingenierías “El Politécnico”, Universidad San Francisco de Quito (USFQ), Quito 170901, Ecuador
| | - Yovani Marrero-Ponce
- Grupo
de Medicina Molecular y Traslacional (MeM&T), Escuela de Medicina,
Colegio de Ciencias de la Salud (COCSA), Universidad San Francisco de Quito, Av. Interoceánica Km 12 1/2 y Av. Florencia, Quito 17-1200-841, Ecuador
- Computer-Aided
Molecular “Biosilico” Discovery and Bioinformatics Research
International Network (CAMD-BIR IN), Cumbayá, Quito 170901, Ecuador
- Universidad
San Francisco de Quito (USFQ), Instituto
de Simulación Computacional (ISC-USFQ), Diego de Robles y vía Interoceánica, Quito 170157, Pichincha, Ecuador
- Departamento
de Ciencias de la Computación, Centro
de Investigación Científica y de Educación Superior
de Ensenada (CICESE), Baja California 22860, Mexico
- or . Phone: +593-2-297-1700 (ext. 4021). http://www.uv.es/yoma/ or http://ymponce.googlepages.com/home
| | - Longendri Aguilera-Mendoza
- Departamento
de Ciencias de la Computación, Centro
de Investigación Científica y de Educación Superior
de Ensenada (CICESE), Baja California 22860, Mexico
| | - Noel Pérez
- Colegio
de Ciencias e Ingenierías “El Politécnico”, Universidad San Francisco de Quito (USFQ), Quito 170901, Ecuador
| | - Guillermin Agüero-Chapin
- CIIMAR/CIMAR,
Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Terminal de Cruzeiros do Porto de Leixões, Av. General Norton
de Matos s/n, 4450-208 Porto, Portugal
- Department
of Biology, Faculty of Sciences, University
of Porto, Rua do Campo
Alegre, 4169-007 Porto, Portugal
| | - Agostinho Antunes
- CIIMAR/CIMAR,
Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Terminal de Cruzeiros do Porto de Leixões, Av. General Norton
de Matos s/n, 4450-208 Porto, Portugal
- Department
of Biology, Faculty of Sciences, University
of Porto, Rua do Campo
Alegre, 4169-007 Porto, Portugal
| | - Ana Cristina Aguilar
- Grupo
de Medicina Molecular y Traslacional (MeM&T), Escuela de Medicina,
Colegio de Ciencias de la Salud (COCSA), Universidad San Francisco de Quito, Av. Interoceánica Km 12 1/2 y Av. Florencia, Quito 17-1200-841, Ecuador
| |
Collapse
|
21
|
Macho Rendón J, Rebollido-Ríos R, Torrent Burgas M. HPIPred: Host-pathogen interactome prediction with phenotypic scoring. Comput Struct Biotechnol J 2022; 20:6534-6542. [PMID: 36514317 PMCID: PMC9718936 DOI: 10.1016/j.csbj.2022.11.026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2022] [Revised: 11/09/2022] [Accepted: 11/10/2022] [Indexed: 11/22/2022] Open
Abstract
Protein-protein interactions (PPIs) are involved in most cellular processes. Unfortunately, current knowledge of host-pathogen interactomes is still very limited. Experimental methods used to detect PPIs have several limitations, including increasing complexity and economic cost in large-scale screenings. Hence, computational methods are commonly used to support experimental data, although they generally suffer from high false-positive rates. To address this issue, we have created HPIPred, a host-pathogen PPI prediction tool based on numerical encoding of physicochemical properties. Unlike other available methods, HPIPred integrates phenotypic data to prioritize biologically meaningful results. We used HPIPred to screen the entire Homo sapiens and Pseudomonas aeruginosa PAO1 proteomes to generate a host-pathogen interactome with 763 interactions displaying a highly connected network topology. Our predictive model can be used to prioritize protein-protein interactions as potential targets for antibacterial drug development. Available at: https://github.com/SysBioUAB/hpi_predictor.
Collapse
|
22
|
Di Rocco L, Ferraro Petrillo U, Rombo SE. DIAMIN: a software library for the distributed analysis of large-scale molecular interaction networks. BMC Bioinformatics 2022; 23:474. [DOI: 10.1186/s12859-022-05026-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2022] [Accepted: 10/29/2022] [Indexed: 11/13/2022] Open
Abstract
Abstract
Background
Huge amounts of molecular interaction data are continuously produced and stored in public databases. Although many bioinformatics tools have been proposed in the literature for their analysis, based on their modeling through different types of biological networks, several problems still remain unsolved when the problem turns on a large scale.
Results
We propose , that is, a high-level software library to facilitate the development of applications for the efficient analysis of large-scale molecular interaction networks. relies on distributed computing, and it is implemented in Java upon the framework Apache Spark. It delivers a set of functionalities implementing different tasks on an abstract representation of very large graphs, providing a built-in support for methods and algorithms commonly used to analyze these networks. has been tested on data retrieved from two of the most used molecular interactions databases, resulting to be highly efficient and scalable. As shown by different provided examples, can be exploited by users without any distributed programming experience, in order to perform various types of data analysis, and to implement new algorithms based on its primitives.
Conclusions
The proposed has been proved to be successful in allowing users to solve specific biological problems that can be modeled relying on biological networks, by using its functionalities. The software is freely available and this will hopefully allow its rapid diffusion through the scientific community, to solve both specific data analysis and more complex tasks.
Collapse
|
23
|
Dimitrova A, Sferra G, Scippa GS, Trupiano D. Network-Based Analysis to Identify Hub Genes Involved in Spatial Root Response to Mechanical Constrains. Cells 2022; 11:cells11193121. [PMID: 36231084 PMCID: PMC9564363 DOI: 10.3390/cells11193121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2022] [Revised: 09/29/2022] [Accepted: 09/30/2022] [Indexed: 11/16/2022] Open
Abstract
Previous studies report that the asymmetric response, observed along the main poplar woody bent root axis, was strongly related to both the type of mechanical forces (compression or tension) and the intensity of force displacement. Despite a large number of targets that have been proposed to trigger this asymmetry, an understanding of the comprehensive and synergistic effect of the antistress spatially related pathways is still lacking. Recent progress in the bioinformatics area has the potential to fill these gaps through the use of in silico studies, able to investigate biological functions and pathway overlaps, and to identify promising targets in plant responses. Presently, for the first time, a comprehensive network-based analysis of proteomic signatures was used to identify functions and pivotal genes involved in the coordinated signalling pathways and molecular activities that asymmetrically modulate the response of different bent poplar root sectors and sides. To accomplish this aim, 66 candidate proteins, differentially represented across the poplar bent root sides and sectors, were grouped according to their abundance profile patterns and mapped, together with their first neighbours, on a high-confidence set of interactions from STRING to compose specific cluster-related subnetworks (I–VI). Successively, all subnetworks were explored by a functional gene set enrichment analysis to identify enriched gene ontology terms. Subnetworks were then analysed to identify the genes that are strongly interconnected with other genes (hub gene) and, thus, those that have a pivotal role in the bent root asymmetric response. The analysis revealed novel information regarding the response coordination, communication, and potential signalling pathways asymmetrically activated along the main root axis, delegated mainly to Ca2+ (for new lateral root formation) and ROS (for gravitropic response and lignin accumulation) signatures. Furthermore, some of the data indicate that the concave side of the bent sector, where the mechanical forces are most intense, communicates to the other (neighbour and distant) sectors, inducing spatially related strategies to ensure water uptake and accompanying cell modification. This information could be critical for understanding how plants maintain and improve their structural integrity—whenever and wherever it is necessary—in natural mechanical stress conditions.
Collapse
|
24
|
Yue R, Dutta A. Computational systems biology in disease modeling and control, review and perspectives. NPJ Syst Biol Appl 2022; 8:37. [PMID: 36192551 PMCID: PMC9528884 DOI: 10.1038/s41540-022-00247-4] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2022] [Accepted: 09/05/2022] [Indexed: 02/02/2023] Open
Abstract
Omics-based approaches have become increasingly influential in identifying disease mechanisms and drug responses. Considering that diseases and drug responses are co-expressed and regulated in the relevant omics data interactions, the traditional way of grabbing omics data from single isolated layers cannot always obtain valuable inference. Also, drugs have adverse effects that may impair patients, and launching new medicines for diseases is costly. To resolve the above difficulties, systems biology is applied to predict potential molecular interactions by integrating omics data from genomic, proteomic, transcriptional, and metabolic layers. Combined with known drug reactions, the resulting models improve medicines' therapeutical performance by re-purposing the existing drugs and combining drug molecules without off-target effects. Based on the identified computational models, drug administration control laws are designed to balance toxicity and efficacy. This review introduces biomedical applications and analyses of interactions among gene, protein and drug molecules for modeling disease mechanisms and drug responses. The therapeutical performance can be improved by combining the predictive and computational models with drug administration designed by control laws. The challenges are also discussed for its clinical uses in this work.
Collapse
Affiliation(s)
- Rongting Yue
- Department of Electrical and Computer Engineering, University of Connecticut, 371 Fairfield Way, Storrs, CT, 06269, USA.
| | - Abhishek Dutta
- Department of Electrical and Computer Engineering, University of Connecticut, 371 Fairfield Way, Storrs, CT, 06269, USA
| |
Collapse
|
25
|
Lecca P, Ihekwaba-Ndibe AEC. Dynamic Modelling of DNA Repair Pathway at the Molecular Level: A New Perspective. Front Mol Biosci 2022; 9:878148. [PMID: 36177351 PMCID: PMC9513183 DOI: 10.3389/fmolb.2022.878148] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2022] [Accepted: 06/22/2022] [Indexed: 11/30/2022] Open
Abstract
DNA is the genetic repository for all living organisms, and it is subject to constant changes caused by chemical and physical factors. Any change, if not repaired, erodes the genetic information and causes mutations and diseases. To ensure overall survival, robust DNA repair mechanisms and damage-bypass mechanisms have evolved to ensure that the DNA is constantly protected against potentially deleterious damage while maintaining its integrity. Not surprisingly, defects in DNA repair genes affect metabolic processes, and this can be seen in some types of cancer, where DNA repair pathways are disrupted and deregulated, resulting in genome instability. Mathematically modelling the complex network of genes and processes that make up the DNA repair network will not only provide insight into how cells recognise and react to mutations, but it may also reveal whether or not genes involved in the repair process can be controlled. Due to the complexity of this network and the need for a mathematical model and software platform to simulate different investigation scenarios, there must be an automatic way to convert this network into a mathematical model. In this paper, we present a topological analysis of one of the networks in DNA repair, specifically homologous recombination repair (HR). We propose a method for the automatic construction of a system of rate equations to describe network dynamics and present results of a numerical simulation of the model and model sensitivity analysis to the parameters. In the past, dynamic modelling and sensitivity analysis have been used to study the evolution of tumours in response to drugs in cancer medicine. However, automatic generation of a mathematical model and the study of its sensitivity to parameter have not been applied to research on the DNA repair network so far. Therefore, we present this application as an approach for medical research against cancer, since it could give insight into a possible approach with which central nodes of the networks and repair genes could be identified and controlled with the ultimate goal of aiding cancer therapy to fight the onset of cancer and its progression.
Collapse
Affiliation(s)
- Paola Lecca
- Faculty of Computer Science, Free University of Bozen-Bolzano, Bolzano, Italy
- *Correspondence: Paola Lecca, ; Adaoha E. C. Ihekwaba-Ndibe,
| | - Adaoha E. C. Ihekwaba-Ndibe
- Faculty of Health and Life Sciences, Coventry University, Coventry, United Kingdom
- *Correspondence: Paola Lecca, ; Adaoha E. C. Ihekwaba-Ndibe,
| |
Collapse
|
26
|
Hassan SH, Sferra G, Simiele M, Scippa GS, Morabito D, Trupiano D. Root and shoot biology of Arabidopsis halleri dissected by WGCNA: an insight into the organ pivotal pathways and genes of an hyperaccumulator. Funct Integr Genomics 2022; 22:1159-1172. [PMID: 36094581 DOI: 10.1007/s10142-022-00897-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2022] [Revised: 08/15/2022] [Accepted: 08/31/2022] [Indexed: 11/27/2022]
Abstract
Arabidopsis halleri is a hyperaccumulating pseudo-metallophyte and an emerging model to explore molecular basis of metal tolerance and hyperaccumulation. In this regard, understanding of interacting genes can be a crucial aspect as these interactions regulate several biological functions at molecular level in response to multiple signals. In this current study, we applied a weighted gene co-expression network analysis (WGCNA) on root and shoot RNA-seq data of A. halleri to predict the related scale-free organ specific co-expression networks, for the first time. A total of 19,653 genes of root and 18,081 genes of shoot were grouped into 14 modules and subjected to GO and KEGG enrichment analysis. "Photosynthesis" and "photosynthesis-antenna proteins" were identified as the most enriched and common pathway to both root and shoot. Whereas "glucosinolate biosynthesis," "autophagy," and "SNARE interactions in vesicular transport" were specific to root, and "circadian rhythm" was found to be enriched only in shoot. Later, hub and bottleneck genes were identified in each module by using cytoHubba plugin based on Cytoscape and scoring the relevance of each gene to the topology of the network. The modules with the most significant differential expression pattern across control and treatment (Cd-Zn treatment) were selected and their hub and bottleneck genes were screened to validate their possible involvement in heavy metal stress. Moreover, we combined the analysis of co-expression modules together with protein-protein interactions (PPIs), confirming some genes as potential candidates in plant heavy metal stress and as biomarkers. The results from this analysis shed the light on the pivotal functions to the hyperaccumulative trait of A. halleri, giving perspective to new paths for future research on this species.
Collapse
Affiliation(s)
- Sayyeda Hira Hassan
- Department of Biosciences and Territory, University of Molise, 86090, Pesche, Italy
| | - Gabriella Sferra
- Department of Biosciences and Territory, University of Molise, 86090, Pesche, Italy.
| | - Melissa Simiele
- Department of Biosciences and Territory, University of Molise, 86090, Pesche, Italy
| | | | - Domenico Morabito
- Laboratoire de Biologie des Ligneux et des Grandes Cultures (LBLGC-EA1207), Université d'Orléans, 45067, Orléans CEDEX 2, France
| | - Dalila Trupiano
- Department of Biosciences and Territory, University of Molise, 86090, Pesche, Italy
| |
Collapse
|
27
|
Robin V, Bodein A, Scott-Boyer MP, Leclercq M, Périn O, Droit A. Overview of methods for characterization and visualization of a protein–protein interaction network in a multi-omics integration context. Front Mol Biosci 2022; 9:962799. [PMID: 36158572 PMCID: PMC9494275 DOI: 10.3389/fmolb.2022.962799] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2022] [Accepted: 08/16/2022] [Indexed: 11/26/2022] Open
Abstract
At the heart of the cellular machinery through the regulation of cellular functions, protein–protein interactions (PPIs) have a significant role. PPIs can be analyzed with network approaches. Construction of a PPI network requires prediction of the interactions. All PPIs form a network. Different biases such as lack of data, recurrence of information, and false interactions make the network unstable. Integrated strategies allow solving these different challenges. These approaches have shown encouraging results for the understanding of molecular mechanisms, drug action mechanisms, and identification of target genes. In order to give more importance to an interaction, it is evaluated by different confidence scores. These scores allow the filtration of the network and thus facilitate the representation of the network, essential steps to the identification and understanding of molecular mechanisms. In this review, we will discuss the main computational methods for predicting PPI, including ones confirming an interaction as well as the integration of PPIs into a network, and we will discuss visualization of these complex data.
Collapse
Affiliation(s)
- Vivian Robin
- Molecular Medicine Department, CHU de Québec Research Center, Université Laval, Québec, QC, Canada
| | - Antoine Bodein
- Molecular Medicine Department, CHU de Québec Research Center, Université Laval, Québec, QC, Canada
| | - Marie-Pier Scott-Boyer
- Molecular Medicine Department, CHU de Québec Research Center, Université Laval, Québec, QC, Canada
| | - Mickaël Leclercq
- Molecular Medicine Department, CHU de Québec Research Center, Université Laval, Québec, QC, Canada
| | - Olivier Périn
- Digital Sciences Department, L'Oréal Advanced Research, Aulnay-sous-bois, France
| | - Arnaud Droit
- Molecular Medicine Department, CHU de Québec Research Center, Université Laval, Québec, QC, Canada
- *Correspondence: Arnaud Droit,
| |
Collapse
|
28
|
Dinarvand M, Koch FC, Al Mouiee D, Vuong K, Vijayan A, Tanzim AF, Azad AKM, Penesyan A, Castaño-Rodríguez N, Vafaee F. dRNASb: a systems biology approach to decipher dynamics of host-pathogen interactions using temporal dual RNA-seq data. Microb Genom 2022; 8. [PMID: 36136078 DOI: 10.1099/mgen.0.000862] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Infection triggers a dynamic cascade of reciprocal events between host and pathogen wherein the host activates complex mechanisms to recognise and kill pathogens while the pathogen often adjusts its virulence and fitness to avoid eradication by the host. The interaction between the pathogen and the host results in large-scale changes in gene expression in both organisms. Dual RNA-seq, the simultaneous detection of host and pathogen transcripts, has become a leading approach to unravelling complex molecular interactions between the host and the pathogen and is particularly informative for intracellular organisms. The amount of in vitro and in vivo dual RNA-seq data is rapidly growing, which demands computational pipelines to effectively analyse such data. In particular, holistic, systems-level, and temporal analyses of dual RNA-seq data are essential to enable further insights into the host-pathogen transcriptional dynamics and potential interactions. Here, we developed an integrative network-driven bioinformatics pipeline, dRNASb, a systems biology-based computational pipeline to analyse temporal transcriptional clusters, incorporate molecular interaction networks (e.g. protein-protein interactions), identify topologically and functionally key transcripts in host and pathogen, and associate host and pathogen temporal transcriptome to decipher potential between-species interactions. The pipeline is applicable to various dual RNA-seq data from different species and experimental conditions. As a case study, we applied dRNASb to analyse temporal dual RNA-seq data of Salmonella-infected human cells, which enabled us to uncover genes contributing to the infection process and their potential functions and to identify putative associations between host and pathogen genes during infection. Overall, dRNASb has the potential to identify key genes involved in bacterial growth or host defence mechanisms for future uses as therapeutic targets.
Collapse
Affiliation(s)
- Mojdeh Dinarvand
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, NSW, Australia
| | - Forrest C Koch
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, NSW, Australia
| | - Daniel Al Mouiee
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, NSW, Australia
- Ingham Institute for Applied Medical Research, Liverpool, NSW, Australia
- UNSW Data Science Hub, University of New South Wales, Sydney, NSW, Australia
| | - Kaylee Vuong
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, NSW, Australia
| | - Abhishek Vijayan
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, NSW, Australia
| | - Afia Fariha Tanzim
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, NSW, Australia
| | - A K M Azad
- ProCan®, Children's Medical Research Institute, Faculty of Medicine and Health, The University of Sydney, Westmead, NSW, Australia
| | - Anahit Penesyan
- School of Natural Sciences, Faculty of Science and Engineering, Macquarie University, Sydney, NSW, Australia
| | - Natalia Castaño-Rodríguez
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, NSW, Australia
| | - Fatemeh Vafaee
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, NSW, Australia
- UNSW Data Science Hub, University of New South Wales, Sydney, NSW, Australia
| |
Collapse
|
29
|
Mansoor M, Nauman M, Rehman HU, Omar M. Gene Ontology Capsule GAN: an improved architecture for protein function prediction. PeerJ Comput Sci 2022; 8:e1014. [PMID: 36092003 PMCID: PMC9454774 DOI: 10.7717/peerj-cs.1014] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2022] [Accepted: 05/31/2022] [Indexed: 06/15/2023]
Abstract
Proteins are the core of all functions pertaining to living things. They consist of an extended amino acid chain folding into a three-dimensional shape that dictates their behavior. Currently, convolutional neural networks (CNNs) have been pivotal in predicting protein functions based on protein sequences. While it is a technology crucial to the niche, the computation cost and translational invariance associated with CNN make it impossible to detect spatial hierarchies between complex and simpler objects. Therefore, this research utilizes capsule networks to capture spatial information as opposed to CNNs. Since capsule networks focus on hierarchical links, they have a lot of potential for solving structural biology challenges. In comparison to the standard CNNs, our results exhibit an improvement in accuracy. Gene Ontology Capsule GAN (GOCAPGAN) achieved an F1 score of 82.6%, a precision score of 90.4% and recall score of 76.1%.
Collapse
|
30
|
Kim YJ, Kim K, Lee H, Jeon J, Lee J, Yoon J. The Protein-Protein Interaction Network of Hereditary Parkinsonism Genes Is a Hierarchical Scale-Free Network. Yonsei Med J 2022; 63:724-734. [PMID: 35914754 PMCID: PMC9344267 DOI: 10.3349/ymj.2022.63.8.724] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/15/2022] [Revised: 04/19/2022] [Accepted: 05/02/2022] [Indexed: 11/27/2022] Open
Abstract
PURPOSE Hereditary parkinsonism genes consist of causative genes of familial Parkinson's disease (PD) with a locus symbol prefix (PARK genes) and hereditary atypical parkinsonian disorders that present atypical features and limited responsiveness to levodopa (non-PARK genes). Although studies have shown that hereditary parkinsonism genes are related to idiopathic PD at the phenotypic, gene expression, and genomic levels, no study has systematically investigated connectivity among the proteins encoded by these genes at the protein-protein interaction (PPI) level. MATERIALS AND METHODS Topological measurements and physical interaction enrichment were performed to assess PPI networks constructed using some or all the proteins encoded by hereditary parkinsonism genes (n=96), which were curated using the Online Mendelian Inheritance in Man database and literature. RESULTS Non-PARK and PARK genes were involved in common functional modules related to autophagy, mitochondrial or lysosomal organization, catecholamine metabolic process, chemical synapse transmission, response to oxidative stress, neuronal apoptosis, regulation of cellular protein catabolic process, and vesicle-mediated transport in synapse. The hereditary parkinsonism proteins formed a single large network comprising 51 nodes, 83 edges, and three PPI pairs. The probability of degree distribution followed a power-law scaling behavior, with a degree exponent of 1.24 and a correlation coefficient of 0.92. LRRK2 was identified as a hub gene with the highest degree of betweenness centrality; its physical interaction enrichment score was 1.28, which was highly significant. CONCLUSION Both PARK and non-PARK genes show high connectivity at the PPI and biological functional levels.
Collapse
Affiliation(s)
- Yun Joong Kim
- Department of Neurology, Yonsei University College of Medicine, Seoul, Korea
- Department of Neurology, Yongin Severance Hospital, Yonsei University Health System, Yongin, Korea.
| | - Kiyong Kim
- Department of Electronic Engineering, Kyonggi University, Suwon, Korea.
| | - Heonwoo Lee
- Department of Computer Engineering, Hallym University, Chuncheon, Korea
| | - Junbeom Jeon
- Department of Computer Engineering, Hallym University, Chuncheon, Korea
| | - Jinwoo Lee
- Department of Computer Engineering, Hallym University, Chuncheon, Korea
| | - Jeehee Yoon
- Department of Computer Engineering, Hallym University, Chuncheon, Korea
| |
Collapse
|
31
|
Selvan GT, Gollapalli P, Shetty P, Kumari NS. Exploring key molecular signatures of immune responses and pathways associated with tuberculosis in comorbid diabetes mellitus: a systems biology approach. BENI-SUEF UNIVERSITY JOURNAL OF BASIC AND APPLIED SCIENCES 2022. [DOI: 10.1186/s43088-022-00257-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
Abstract
Background
Comorbid type 2 diabetes mellitus (T2DM) increases the risk for tuberculosis (TB) and its associated complications, although the pathological connections between T2DM and TB are unknown. The current research aims to identify shared molecular gene signatures and pathways that affirm the epidemiological association of T2DM and TB and afford clues on mechanistic basis of their association through integrative systems biology and bioinformatics approaches. Earlier research has found specific molecular markers linked to T2DM and TB, but, despite their importance, only offered a limited understanding of the genesis of this comorbidity. Our investigation used a network medicine method to find possible T2DM-TB molecular mediators.
Results
Functional annotation clustering, interaction networks, network cluster analysis, and network topology were part of our systematic investigation of T2DM-TB linked with 1603 differentially expressed genes (DEGs). The functional enrichment and gene interaction network analysis emphasized the importance of cytokine/chemokine signalling, T cell receptor signalling route, NF-kappa B signalling pathway and Jak-STAT signalling system. Furthermore, network analysis revealed significant DEGs such as ITGAM and STAT1, which may be necessary for T2DM-TB immune responses. Furthermore, these two genes are modulators in clusters C4 and C5, abundant in cytokine/chemokine signalling and Jak-STAT signalling pathways.
Conclusions
Our analyses highlight the role of ITGAM and STAT1 in T2DM-TB-associated pathways and advances our knowledge of the genetic processes driving this comorbidity.
Collapse
|
32
|
Panditrao G, Bhowmick R, Meena C, Sarkar RR. Emerging landscape of molecular interaction networks: Opportunities, challenges and prospects. J Biosci 2022. [PMID: 36210749 PMCID: PMC9018971 DOI: 10.1007/s12038-022-00253-y] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Network biology finds application in interpreting molecular interaction networks and providing insightful inferences using graph theoretical analysis of biological systems. The integration of computational bio-modelling approaches with different hybrid network-based techniques provides additional information about the behaviour of complex systems. With increasing advances in high-throughput technologies in biological research, attempts have been made to incorporate this information into network structures, which has led to a continuous update of network biology approaches over time. The newly minted centrality measures accommodate the details of omics data and regulatory network structure information. The unification of graph network properties with classical mathematical and computational modelling approaches and technologically advanced approaches like machine-learning- and artificial intelligence-based algorithms leverages the potential application of these techniques. These computational advances prove beneficial and serve various applications such as essential gene prediction, identification of drug–disease interaction and gene prioritization. Hence, in this review, we have provided a comprehensive overview of the emerging landscape of molecular interaction networks using graph theoretical approaches. With the aim to provide information on the wide range of applications of network biology approaches in understanding the interaction and regulation of genes, proteins, enzymes and metabolites at different molecular levels, we have reviewed the methods that utilize network topological properties, emerging hybrid network-based approaches and applications that integrate machine learning techniques to analyse molecular interaction networks. Further, we have discussed the applications of these approaches in biomedical research with a note on future prospects.
Collapse
Affiliation(s)
- Gauri Panditrao
- Chemical Engineering and Process Development Division, CSIR-National Chemical Laboratory, Pune, 411008 India
| | - Rupa Bhowmick
- Chemical Engineering and Process Development Division, CSIR-National Chemical Laboratory, Pune, 411008 India
- Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, 201002 India
| | - Chandrakala Meena
- Chemical Engineering and Process Development Division, CSIR-National Chemical Laboratory, Pune, 411008 India
| | - Ram Rup Sarkar
- Chemical Engineering and Process Development Division, CSIR-National Chemical Laboratory, Pune, 411008 India
- Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, 201002 India
| |
Collapse
|
33
|
Khojasteh H, Khanteymoori A, Olyaee MH. Comparing protein-protein interaction networks of SARS-CoV-2 and (H1N1) influenza using topological features. Sci Rep 2022; 12:5867. [PMID: 35393450 PMCID: PMC8988119 DOI: 10.1038/s41598-022-08574-6] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2021] [Accepted: 03/03/2022] [Indexed: 01/04/2023] Open
Abstract
SARS-CoV-2 pandemic first emerged in late 2019 in China. It has since infected more than 298 million individuals and caused over 5 million deaths globally. The identification of essential proteins in a protein–protein interaction network (PPIN) is not only crucial in understanding the process of cellular life but also useful in drug discovery. There are many centrality measures to detect influential nodes in complex networks. Since SARS-CoV-2 and (H1N1) influenza PPINs pose 553 common human proteins. Analyzing influential proteins and comparing these networks together can be an effective step in helping biologists for drug-target prediction. We used 21 centrality measures on SARS-CoV-2 and (H1N1) influenza PPINs to identify essential proteins. We applied principal component analysis and unsupervised machine learning methods to reveal the most informative measures. Appealingly, some measures had a high level of contribution in comparison to others in both PPINs, namely Decay, Residual closeness, Markov, Degree, closeness (Latora), Barycenter, Closeness (Freeman), and Lin centralities. We also investigated some graph theory-based properties like the power law, exponential distribution, and robustness. Both PPINs tended to properties of scale-free networks that expose their nature of heterogeneity. Dimensionality reduction and unsupervised learning methods were so effective to uncover appropriate centrality measures.
Collapse
Affiliation(s)
- Hakimeh Khojasteh
- Department of Computer Engineering, University of Zanjan, Zanjan, Iran
| | | | - Mohammad Hossein Olyaee
- Department of Computer Engineering, Engineering Faculty, University of Gonabad, Zanjan, Gonabad, Iran
| |
Collapse
|
34
|
Tang Y, Li X, Yuan Y, Zhang H, Zou Y, Xu Z, Xu Q, Song J, Deng C, Wang Q. Network pharmacology-based predictions of active components and pharmacological mechanisms of Artemisia annua L. for the treatment of the novel Corona virus disease 2019 (COVID-19). BMC Complement Med Ther 2022; 22:56. [PMID: 35241045 PMCID: PMC8893058 DOI: 10.1186/s12906-022-03523-2] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2021] [Accepted: 01/28/2022] [Indexed: 12/11/2022] Open
Abstract
Background Novel Corona Virus Disease 2019 (COVID-19) is closely associated with cytokines storms. The Chinese medicinal herb Artemisia annua L. (A. annua) has been traditionally used to control many inflammatory diseases, such as malaria and rheumatoid arthritis. We performed network analysis and employed molecular docking and network analysis to elucidate active components or targets and the underlying mechanisms of A. annua for the treatment of COVID-19. Methods Active components of A. annua were identified through the TCMSP database according to their oral bioavailability (OB) and drug-likeness (DL). Moreover, target genes associated with COVID-19 were mined from GeneCards, OMIM, and TTD. A compound-target (C-T) network was constructed to predict the relationship of active components with the targets. A Compound-disease-target (C-D-T) network has been built to reveal the direct therapeutic target for COVID-19. Molecular docking, molecular dynamics simulation studies (MD), and MM-GBSA binding free energy calculations were used to the closest molecules and targets between A. annua and COVID-19. Results In our network, GO, and KEGG analysis indicated that A. annua acted in response to COVID-19 by regulating inflammatory response, proliferation, differentiation, and apoptosis. The molecular docking results manifested excellent results to verify the binding capacity between the hub components and hub targets in COVID-19. MD and MM-GBSA data showed quercetin to be the more effective candidate against the virus by target MAPK1, and kaempferol to be the other more effective candidate against the virus by target TP53. We identified A. annua’s potentially active compounds and targets associated with them that act against COVID-19. Conclusions These findings suggest that A. annua may prevent and inhibit the inflammatory processes related to COVID-19.
Collapse
Affiliation(s)
- Yexiao Tang
- Artemisinin Research Center, Guangzhou University of Chinese Medicine, Guangzhou, 510405, China
| | - Xiaobo Li
- Artemisinin Research Center, Guangzhou University of Chinese Medicine, Guangzhou, 510405, China.,Sci-tech Industrial Park, Guangzhou University of Chinese Medicine, Guangzhou, 510445, China
| | - Yueming Yuan
- Artemisinin Research Center, Guangzhou University of Chinese Medicine, Guangzhou, 510405, China.,Sci-tech Industrial Park, Guangzhou University of Chinese Medicine, Guangzhou, 510445, China
| | - Hongying Zhang
- Artemisinin Research Center, Guangzhou University of Chinese Medicine, Guangzhou, 510405, China.,Sci-tech Industrial Park, Guangzhou University of Chinese Medicine, Guangzhou, 510445, China
| | - Yuanyuan Zou
- Artemisinin Research Center, Guangzhou University of Chinese Medicine, Guangzhou, 510405, China
| | - Zhiyong Xu
- Sci-tech Industrial Park, Guangzhou University of Chinese Medicine, Guangzhou, 510445, China
| | - Qin Xu
- Artemisinin Research Center, Guangzhou University of Chinese Medicine, Guangzhou, 510405, China
| | - Jianping Song
- Artemisinin Research Center, Guangzhou University of Chinese Medicine, Guangzhou, 510405, China
| | - Changsheng Deng
- Artemisinin Research Center, Guangzhou University of Chinese Medicine, Guangzhou, 510405, China
| | - Qi Wang
- Artemisinin Research Center, Guangzhou University of Chinese Medicine, Guangzhou, 510405, China. .,Guangzhou Chest Hospital, Guangzhou, 510095, China.
| |
Collapse
|
35
|
Identifying Potential New Gene Expression-Based Biomarkers in the Peripheral Blood Mononuclear Cells of Hepatitis B-Related Hepatocellular Carcinoma. Can J Gastroenterol Hepatol 2022; 2022:9541600. [PMID: 35265561 PMCID: PMC8901362 DOI: 10.1155/2022/9541600] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/24/2021] [Revised: 12/13/2021] [Accepted: 01/22/2022] [Indexed: 12/24/2022] Open
Abstract
OBJECTIVE The analysis of the gene expression of peripheral blood mononuclear cells (PBMCs) is important to clarify the pathogenesis of hepatocellular carcinoma (HCC) and the detection of suitable biomarkers. The purpose of this investigation was to use RNA-sequencing to screen the appropriate differentially expressed genes (DEGs) in the PBMCs for the HCC. METHODS The comprehensive transcriptome of extracted RNA of PBMC (n = 20) from patients with chronic hepatitis B (CHB), liver cirrhosis, and early stage of HCC (5 samples per group) was carried out using RNA-sequencing. All raw RNA-sequencing data analyses were performed using conventional RNA-sequencing analysis tools. Next, gene ontology (GO) analyses were carried out to elucidate the biological processes of DEGs. Finally, relative transcript abundance of selected DEGs was verified using qRT-PCR on additional validation groups. RESULTS Specifically, 13, 1262, and 1450 DEGs were identified for CHB, liver cirrhosis, and HCC, when compared with the healthy controls. GO enrichment analysis indicated that HCC is closely related to the immune response. Seven DEGs (TYMP, TYROBP, CD14, TGFBI, LILRA2, GNLY, and GZMB) were common to HCC, cirrhosis, and CHB when compared to healthy controls. The data revealed that the expressions of these 7 DEGs were consistent with those from the RNA-sequencing results. Also, the expressions of 7 representative genes that had higher sensitivity were obtained by receiver operating characteristic analysis, which indicated their important diagnostic accuracy for HBV-HCC. CONCLUSION This study provides us with new horizons into the biological process and potential prospective clinical diagnosis and prognosis of HCC in the near future.
Collapse
|
36
|
Freund AJ, Giabbanelli PJ. An Experimental Study on the Scalability of Recent Node Centrality Metrics in Sparse Complex Networks. Front Big Data 2022; 5:797584. [PMID: 35252851 PMCID: PMC8889076 DOI: 10.3389/fdata.2022.797584] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2021] [Accepted: 01/21/2022] [Indexed: 11/20/2022] Open
Abstract
Node centrality measures are among the most commonly used analytical techniques for networks. They have long helped analysts to identify “important” nodes that hold power in a social context, where damages could have dire consequences for transportation applications, or who should be a focus for prevention in epidemiology. Given the ubiquity of network data, new measures have been proposed, occasionally motivated by emerging applications or by the ability to interpolate existing measures. Before analysts use these measures and interpret results, the fundamental question is: are these measures likely to complete within the time window allotted to the analysis? In this paper, we comprehensively examine how the time necessary to run 18 new measures (introduced from 2005 to 2020) scales as a function of the number of nodes in the network. Our focus is on giving analysts a simple and practical estimate for sparse networks. As the time consumption depends on the properties in the network, we nuance our analysis by considering whether the network is scale-free, small-world, or random. Our results identify that several metrics run in the order of O(nlogn) and could scale to large networks, whereas others can require O(n2) or O(n3) and may become prime targets in future works for approximation algorithms or distributed implementations.
Collapse
|
37
|
Alam A, Abubaker Bagabir H, Sultan A, Siddiqui MF, Imam N, Alkhanani MF, Alsulimani A, Haque S, Ishrat R. An Integrative Network Approach to Identify Common Genes for the Therapeutics in Tuberculosis and Its Overlapping Non-Communicable Diseases. Front Pharmacol 2022; 12:770762. [PMID: 35153741 PMCID: PMC8829040 DOI: 10.3389/fphar.2021.770762] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2021] [Accepted: 12/27/2021] [Indexed: 12/15/2022] Open
Abstract
Tuberculosis (TB) is the leading cause of death from a single infectious agent. The estimated total global TB deaths in 2019 were 1.4 million. The decline in TB incidence rate is very slow, while the burden of noncommunicable diseases (NCDs) is exponentially increasing in low- and middle-income countries, where the prevention and treatment of TB disease remains a great burden, and there is enough empirical evidence (scientific evidence) to justify a greater research emphasis on the syndemic interaction between TB and NCDs. The current study was proposed to build a disease-gene network based on overlapping TB with NCDs (overlapping means genes involved in TB and other/s NCDs), such as Parkinson’s disease, cardiovascular disease, diabetes mellitus, rheumatoid arthritis, and lung cancer. We compared the TB-associated genes with genes of its overlapping NCDs to determine the gene-disease relationship. Next, we constructed the gene interaction network of disease-genes by integrating curated and experimentally validated interactions in humans and find the 13 highly clustered modules in the network, which contains a total of 86 hub genes that are commonly associated with TB and its overlapping NCDs, which are largely involved in the Inflammatory response, cellular response to cytokine stimulus, response to cytokine, cytokine-mediated signaling pathway, defense response, response to stress and immune system process. Moreover, the identified hub genes and their respective drugs were exploited to build a bipartite network that assists in deciphering the drug-target interaction, highlighting the influential roles of these drugs on apparently unrelated targets and pathways. Targeting these hub proteins by using drugs combination or drug repurposing approaches will improve the clinical conditions in comorbidity, enhance the potency of a few drugs, and give a synergistic effect with better outcomes. Thus, understanding the Mycobacterium tuberculosis (Mtb) infection and associated NCDs is a high priority to contain its short and long-term effects on human health. Our network-based analysis opens a new horizon for more personalized treatment, drug-repurposing opportunities, investigates new targets, multidrug treatment, and can uncover several side effects of unrelated drugs for TB and its overlapping NCDs.
Collapse
Affiliation(s)
- Aftab Alam
- Centre for Interdisciplinary Research in Basic Sciences, Jamia Millia Islamia, New Delhi, India
| | - Hala Abubaker Bagabir
- Department of Physiology, Faculty of Medicine, King Abdulaziz University, Rabigh, Saudi Arabia
| | - Armiya Sultan
- Department of Biosciences, Jamia Millia Islamia, New Delhi, India
| | | | - Nikhat Imam
- Department of Mathematics, Institute of Computer Science and Information Technology, Magadh University, Bodh Gaya, India
| | - Mustfa F Alkhanani
- Emergency Service Department, College of Applied Sciences, AlMaarefa University, Riyadh, Saudi Arabia
| | - Ahmad Alsulimani
- Medical Laboratory Technology Department, College of Applied Medical Sciences, Jazan University, Jazan, Saudi Arabia
| | - Shafiul Haque
- Research and Scientific Studies Unit, College of Nursing and Allied Health Sciences, Jazan University, Jazan, Saudi Arabia
| | - Romana Ishrat
- Centre for Interdisciplinary Research in Basic Sciences, Jamia Millia Islamia, New Delhi, India
| |
Collapse
|
38
|
Zhuang Y, Xing F, Ghosh D, Banaei-Kashani F, Bowler RP, Kechris K. An Augmented High-Dimensional Graphical Lasso Method to Incorporate Prior Biological Knowledge for Global Network Learning. Front Genet 2022; 12:760299. [PMID: 35154240 PMCID: PMC8829118 DOI: 10.3389/fgene.2021.760299] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2021] [Accepted: 11/08/2021] [Indexed: 01/21/2023] Open
Abstract
Biological networks are often inferred through Gaussian graphical models (GGMs) using gene or protein expression data only. GGMs identify conditional dependence by estimating a precision matrix between genes or proteins. However, conventional GGM approaches often ignore prior knowledge about protein-protein interactions (PPI). Recently, several groups have extended GGM to weighted graphical Lasso (wGlasso) and network-based gene set analysis (Netgsa) and have demonstrated the advantages of incorporating PPI information. However, these methods are either computationally intractable for large-scale data, or disregard weights in the PPI networks. To address these shortcomings, we extended the Netgsa approach and developed an augmented high-dimensional graphical Lasso (AhGlasso) method to incorporate edge weights in known PPI with omics data for global network learning. This new method outperforms weighted graphical Lasso-based algorithms with respect to computational time in simulated large-scale data settings while achieving better or comparable prediction accuracy of node connections. The total runtime of AhGlasso is approximately five times faster than weighted Glasso methods when the graph size ranges from 1,000 to 3,000 with a fixed sample size (n = 300). The runtime difference between AhGlasso and weighted Glasso increases when the graph size increases. Using proteomic data from a study on chronic obstructive pulmonary disease, we demonstrate that AhGlasso improves protein network inference compared to the Netgsa approach by incorporating PPI information.
Collapse
Affiliation(s)
- Yonghua Zhuang
- Department of Biostatistics and Informatics, University of Colorado Anschutz Medical Campus, Aurora, CO, United States,*Correspondence: Yonghua Zhuang, ; Katerina Kechris,
| | - Fuyong Xing
- Department of Biostatistics and Informatics, University of Colorado Anschutz Medical Campus, Aurora, CO, United States
| | - Debashis Ghosh
- Department of Biostatistics and Informatics, University of Colorado Anschutz Medical Campus, Aurora, CO, United States
| | - Farnoush Banaei-Kashani
- Department of Computer Science and Engineering, University of Colorado Denver, Denver, CO, United States
| | | | - Katerina Kechris
- Department of Biostatistics and Informatics, University of Colorado Anschutz Medical Campus, Aurora, CO, United States,*Correspondence: Yonghua Zhuang, ; Katerina Kechris,
| |
Collapse
|
39
|
Ahmed MM, Tazyeen S, Haque S, Alsulimani A, Ali R, Sajad M, Alam A, Ali S, Bagabir HA, Bagabir RA, Ishrat R. Network-Based Approach and IVI Methodologies, a Combined Data Investigation Identified Probable Key Genes in Cardiovascular Disease and Chronic Kidney Disease. Front Cardiovasc Med 2022; 8:755321. [PMID: 35071341 PMCID: PMC8767007 DOI: 10.3389/fcvm.2021.755321] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2021] [Accepted: 11/17/2021] [Indexed: 01/28/2023] Open
Abstract
In fact, the risk of dying from CVD is significant when compared to the risk of developing end-stage renal disease (ESRD). Moreover, patients with severe CKD are often excluded from randomized controlled trials, making evidence-based therapy of comorbidities like CVD complicated. Thus, the goal of this study was to use an integrated bioinformatics approach to not only uncover Differentially Expressed Genes (DEGs), their associated functions, and pathways but also give a glimpse of how these two conditions are related at the molecular level. We started with GEO2R/R program (version 3.6.3, 64 bit) to get DEGs by comparing gene expression microarray data from CVD and CKD. Thereafter, the online STRING version 11.1 program was used to look for any correlations between all these common and/or overlapping DEGs, and the results were visualized using Cytoscape (version 3.8.0). Further, we used MCODE, a cytoscape plugin, and identified a total of 15 modules/clusters of the primary network. Interestingly, 10 of these modules contained our genes of interest (key genes). Out of these 10 modules that consist of 19 key genes (11 downregulated and 8 up-regulated), Module 1 (RPL13, RPLP0, RPS24, and RPS2) and module 5 (MYC, COX7B, and SOCS3) had the highest number of these genes. Then we used ClueGO to add a layer of GO terms with pathways to get a functionally ordered network. Finally, to identify the most influential nodes, we employed a novel technique called Integrated Value of Influence (IVI) by combining the network's most critical topological attributes. This method suggests that the nodes with many connections (calculated by hubness score) and high spreading potential (the spreader nodes are intended to have the most impact on the information flow in the network) are the most influential or essential nodes in a network. Thus, based on IVI values, hubness score, and spreading score, top 20 nodes were extracted, in which RPS27A non-seed gene and RPS2, a seed gene, came out to be the important node in the network.
Collapse
Affiliation(s)
- Mohd Murshad Ahmed
- Centre for Interdisciplinary Research in Basic Sciences, Jamia Millia Islamia, New Delhi, India
| | - Safia Tazyeen
- Centre for Interdisciplinary Research in Basic Sciences, Jamia Millia Islamia, New Delhi, India
| | - Shafiul Haque
- Research and Scientific Unit, College of Nursing and Allied Health Science, Jazan University, Jazan, Saudi Arabia
| | - Ahmad Alsulimani
- Department of Medical Laboratory Technology, College of Applied Medical Sciences, Jazan University, Jazan, Saudi Arbia
| | - Rafat Ali
- Department of Bioscience, Jamia Millia Islamia, New Delhi, India
| | - Mohd Sajad
- Centre for Interdisciplinary Research in Basic Sciences, Jamia Millia Islamia, New Delhi, India
| | - Aftab Alam
- Centre for Interdisciplinary Research in Basic Sciences, Jamia Millia Islamia, New Delhi, India
| | - Shahnawaz Ali
- Centre for Stem Cell & Regenerative Medicine, KING' College London, Guy's Hospital, London, United Kingdom
| | - Hala Abubaker Bagabir
- Department of Medical Physiology, Faculty of Medicine, King Abdulaziz University, Rabigh, Saudi Arabia
| | - Rania Abubaker Bagabir
- Department of Hematology and Immunology, College of Medicine, Umm-Al-Qura University, Mecca, Saudi Arabia
| | - Romana Ishrat
- Centre for Interdisciplinary Research in Basic Sciences, Jamia Millia Islamia, New Delhi, India,*Correspondence: Romana Ishrat ; orcid.org/0000-0001-9744-9047
| |
Collapse
|
40
|
Maschietto F, Gheeraert A, Piazzi A, Batista VS, Rivalta I. Distinct allosteric pathways in imidazole glycerol phosphate synthase from yeast and bacteria. Biophys J 2022; 121:119-130. [PMID: 34864045 PMCID: PMC8758406 DOI: 10.1016/j.bpj.2021.11.2888] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2021] [Revised: 09/17/2021] [Accepted: 11/29/2021] [Indexed: 01/07/2023] Open
Abstract
Understanding the relationship between protein structures and their function is still an open question that becomes very challenging when allostery plays an important functional role. Allosteric proteins, in fact, exploit different ranges of motions (from sidechain local fluctuations to long-range collective motions) to effectively couple distant binding sites, and of particular interest is whether allosteric proteins of the same families with similar functions and structures also necessarily share the same allosteric mechanisms. Here, we compared the early dynamics initiating the allosteric communication of a prototypical allosteric enzyme from two different organisms, i.e., the imidazole glycerol phosphate synthase (IGPS) enzymes from the thermophilic bacteria and the yeast, working at high and room temperatures, respectively. By combining molecular dynamics simulations and network models derived from graph theory, we found rather distinct early allosteric dynamics in the IGPS from the two organisms, involving significatively different allosteric pathways in terms of both local and collective motions. Given the successful prediction of key allosteric residues in the bacterial IGPS, whose mutation disrupts its allosteric communication, the outcome of this study paves the way for future experimental studies on the yeast IGPS that could foster therapeutic applications by exploiting the control of IGPS enzyme allostery.
Collapse
Affiliation(s)
| | - Aria Gheeraert
- Université de Lyon, CNRS, Institut de Chimie de Lyon, École Normale Supérieure de Lyon, Lyon Cedex 07, France
| | - Andrea Piazzi
- Dipartimento di Chimica Industriale “Toso Montanari”, Alma Mater Studiorum, Università di Bologna, Bologna, Italia
| | - Victor S. Batista
- Department of Chemistry, Yale University, New Haven, Connecticut,Corresponding author
| | - Ivan Rivalta
- Université de Lyon, CNRS, Institut de Chimie de Lyon, École Normale Supérieure de Lyon, Lyon Cedex 07, France,Dipartimento di Chimica Industriale “Toso Montanari”, Alma Mater Studiorum, Università di Bologna, Bologna, Italia,Corresponding author
| |
Collapse
|
41
|
Liu Y, Liang H, Zou Q, He Z. Significance-Based Essential Protein Discovery. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022; 19:633-642. [PMID: 32750873 DOI: 10.1109/tcbb.2020.3004364] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
The identification of essential proteins is an important problem in bioinformatics. During the past decades, many centrality measures and algorithms have been proposed to address this issue. However, existing methods still deserve the following drawbacks: (1) the lack of a context-free and readily interpretable quantification of their centrality values; (2) the difficulty of specifying a proper threshold for their centrality values; (3) the incapability of controlling the quality of reported essential proteins in a statistically sound manner. To overcome the limitations of existing solutions, we tackle the essential protein discovery problem from a significance testing perspective. More precisely, the essential protein discovery problem is formulated as a multiple hypothesis testing problem, where the null hypothesis is that each protein is not an essential protein. To quantify the statistical significance of each protein, we present a p-value calculation method in which both the degree and the local clustering coefficient are used as the test statistic and the Erdös-Rényi model is employed as the random graph model. After calculating the p-value for each protein, the false discovery rate is used as the error rate in the multiple testing correction procedure. Our significance-based essential protein discovery method is named as SigEP, which is tested on both simulated networks and real PPI networks. The experimental results show that our method is able to achieve better performance than those competing algorithms.
Collapse
|
42
|
Ramana CV, Das B. Profiling transcription factor sub-networks in type I interferon signaling and in response to SARS-CoV-2 infection. COMPUTATIONAL AND MATHEMATICAL BIOPHYSICS 2021. [DOI: 10.1515/cmb-2020-0128] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
Abstract
Type I interferons (IFN α/β) play a central role in innate immunity to respiratory viruses, including coronaviruses. In this study, transcription factor profiling in the transcriptome was used to gain novel insights into the role of inducible transcription factors in response to type I interferon signaling in immune cells and in lung epithelial cells after SARS-CoV-2 infection. Modeling the interferon-inducible transcription factor mRNA data in terms of distinct sub-networks based on biological functions such as antiviral response, immune modulation, and cell growth revealed enrichment of specific transcription factors in mouse and human immune cells. Interrogation of multiple microarray datasets revealed that SARS-CoV-2 induced high levels of IFN-beta and interferon-inducible transcription factor mRNA in human lung epithelial cells. Transcription factor mRNA of the three sub-networks were differentially regulated in human lung epithelial cell lines after SARS-CoV-2 infection and in COVID-19 patients. A subset of type I interferon-inducible transcription factors and inflammatory mediators were specifically enriched in the lungs and neutrophils of Covid-19 patients. The emerging complex picture of type I IFN transcriptional regulation consists of a rapid transcriptional switch mediated by the Jak-Stat cascade and a graded output of the inducible transcription factor activation that enables temporal regulation of gene expression.
Collapse
Affiliation(s)
- Chilakamarti V. Ramana
- Department of Medicine, Dartmouth-Hitchcock Medical Center, Lebanon , NH 03766, USA ; Department of Stem Cell and Infectious Diseases , KaviKrishna Laboratory , Guwahati Biotech Park, Indian Institute of Technology , Guwahati , India ; Thoreau Laboratory for Global Health , University of Massachusetts , Lowell, MA 01854, USA
| | - Bikul Das
- Department of Stem Cell and Infectious Diseases , KaviKrishna Laboratory, Guwahati Biotech Park, Indian Institute of Technology , Guwahati , India ; Thoreau Laboratory for Global Health , University of Massachusetts , Lowell, MA 01854, USA
| |
Collapse
|
43
|
Gupta R, Kumar P. CREB1 K292 and HINFP K330 as Putative Common Therapeutic Targets in Alzheimer's and Parkinson's Disease. ACS OMEGA 2021; 6:35780-35798. [PMID: 34984308 PMCID: PMC8717564 DOI: 10.1021/acsomega.1c05827] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/18/2021] [Accepted: 12/07/2021] [Indexed: 05/16/2023]
Abstract
Integration of omics data and deciphering the mechanism of a biological regulatory network could be a promising approach to reveal the molecular mechanism involved in the progression of complex diseases, including Alzheimer's and Parkinson's. Despite having an overlapping mechanism in the etiology of Alzheimer's disease (AD) and Parkinson's disease (PD), the exact mechanism and signaling molecules behind them are still unknown. Further, the acetylation mechanism and histone deacetylase (HDAC) enzymes provide a positive direction toward studying the shared phenomenon between AD and PD pathogenesis. For instance, increased expression of HDACs causes a decrease in protein acetylation status, resulting in decreased cognitive and memory function. Herein, we employed an integrative approach to analyze the transcriptomics data that established a potential relationship between AD and PD. Data preprocessing and analysis of four publicly available microarray datasets revealed 10 HUB proteins, namely, CDC42, CD44, FGFR1, MYO5A, NUMA1, TUBB4B, ARHGEF9, USP5, INPP5D, and NUP93, that may be involved in the shared mechanism of AD and PD pathogenesis. Further, we identified the relationship between the HUB proteins and transcription factors that could be involved in the overlapping mechanism of AD and PD. CREB1 and HINFP were the crucial regulatory transcription factors that were involved in the AD and PD crosstalk. Further, lysine acetylation sites and HDAC enzyme prediction revealed the involvement of 15 and 27 potential lysine residues of CREB1 and HINFP, respectively. Our results highlighted the importance of HDAC1(K292) and HDAC6(K330) association with CREB1 and HINFP, respectively, in the AD and PD crosstalk. However, different datasets with a large number of samples and wet lab experimentation are required to validate and pinpoint the exact role of CREB1 and HINFP in the AD and PD crosstalk. It is also possible that the different datasets may or may not affect the results due to analysis parameters. In conclusion, our study potentially highlighted the crucial proteins, transcription factors, biological pathways, lysine residues, and HDAC enzymes shared between AD and PD at the molecular level. The findings can be used to study molecular studies to identify the possible relationship in the AD-PD crosstalk.
Collapse
Affiliation(s)
- Rohan Gupta
- Molecular Neuroscience and
Functional Genomics Laboratory, Department of Biotechnology, Delhi Technological University (Formerly DCE), Delhi 110042, India
| | - Pravir Kumar
- Molecular Neuroscience and
Functional Genomics Laboratory, Department of Biotechnology, Delhi Technological University (Formerly DCE), Delhi 110042, India
| |
Collapse
|
44
|
Anjos WF, Lanes GC, Azevedo VA, Santos AR. GENPPI: standalone software for creating protein interaction networks from genomes. BMC Bioinformatics 2021; 22:596. [PMID: 34915867 PMCID: PMC8680239 DOI: 10.1186/s12859-021-04501-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2021] [Accepted: 11/30/2021] [Indexed: 11/30/2022] Open
Abstract
BackGround Bacterial genomes are being deposited into online databases at an increasing rate. Genome annotation represents one of the first efforts to understand organisms and their diseases. Some evolutionary relationships capable of being annotated only from genomes are conserved gene neighbourhoods (CNs), phylogenetic profiles (PPs), and gene fusions. At present, there is no standalone software that enables networks of interactions among proteins to be created using these three evolutionary characteristics with efficient and effective results. Results We developed GENPPI software for the ab initio prediction of interaction networks using predicted proteins from a genome. In our case study, we employed 50 genomes of the genus Corynebacterium. Based on the PP relationship, GENPPI differentiated genomes between the ovis and equi biovars of the species Corynebacterium pseudotuberculosis and created groups among the other species analysed. If we inspected only the CN relationship, we could not entirely separate biovars, only species. Our software GENPPI was determined to be efficient because, for example, it creates interaction networks from the central genomes of 50 species/lineages with an average size of 2200 genes in less than 40 min on a conventional computer. Moreover, the interaction networks that our software creates reflect correct evolutionary relationships between species, which we confirmed with average nucleotide identity analyses. Additionally, this software enables the user to define how he or she intends to explore the PP and CN characteristics through various parameters, enabling the creation of customized interaction networks. For instance, users can set parameters regarding the genus, metagenome, or pangenome. In addition to the parameterization of GENPPI, it is also the user’s choice regarding which set of genomes they are going to study. Conclusions GENPPI can help fill the gap concerning the considerable number of novel genomes assembled monthly and our ability to process interaction networks considering the noncore genes for all completed genome versions. With GENPPI, a user dictates how many and how evolutionarily correlated the genomes answer a scientific query.
Collapse
Affiliation(s)
- William F Anjos
- Department of Computer Science, Federal University of Uberlândia, Uberlândia, Brazil
| | - Gabriel C Lanes
- Biology Institute, Federal University of Uberlândia, Uberlândia, Brazil
| | - Vasco A Azevedo
- Department of Genetics, Federal University of Minas Gerais, Belo Horizonte, Brazil
| | - Anderson R Santos
- Department of Computer Science, Federal University of Uberlândia, Uberlândia, Brazil.
| |
Collapse
|
45
|
Liu Y, Chen W, He Z. Essential Protein Recognition via Community Significance. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2021; 18:2788-2794. [PMID: 34347602 DOI: 10.1109/tcbb.2021.3102018] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Essential protein plays a vital role in understanding the cellular life. With the advance in high-throughput technologies, a number of protein-protein interaction (PPI) networks have been constructed such that essential proteins can be identified from a system biology perspective. Although a series of network-based essential protein discovery methods have been proposed, these existing methods still have some drawbacks. Recently, it has been shown that the significance-based method SigEP is promising on overcoming the defects that are inherent in currently available essential protein identification methods. However, the SigEP method is developed under the unrealistic Erdös-Rényi (E-R) model and its time complexity is very high. Hence, we propose a new significance-based essential protein recognition method named EPCS in which the essential protein discovery problem is formulated as a community significance testing problem. Experimental results on four PPI networks show that EPCS performs better than nine state-of-the-art essential protein identification methods and the only significance-based essential protein identification method SigEP.
Collapse
|
46
|
Naseri A, Sharghi M, Hasheminejad SMH. Enhancing gene regulatory networks inference through hub-based data integration. Comput Biol Chem 2021; 95:107589. [PMID: 34673384 DOI: 10.1016/j.compbiolchem.2021.107589] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2021] [Revised: 08/11/2021] [Accepted: 10/04/2021] [Indexed: 12/09/2022]
Abstract
One of the main research topics in computational biology is Gene Regulatory Network (GRN) reconstruction that refers to inferring the relationships between genes involved in regulating cell conditions in response to internal or external stimuli. To this end, most computational methods use only transcriptional gene expression data to reconstruct gene regulatory networks, but recent studies suggest that gene expression data must be integrated with other types of data to obtain more accurate models predicting real relationships between genes. In this study, a diffusion-based method is enhanced to integrate biological data of network types besides structural prior knowledge. The Random Walk with Restart algorithm (RWR) with an emphasis on hub nodes is executed separately on each network, and then jointly optimizes low-dimensional feature vectors for network nodes by diffusion component analysis. Next, these feature vectors are used to infer gene regulatory networks. Fourteen centrality measures are studied for the detection of hub nodes to be used in the RWR algorithm, and the best centrality measure having the greatest effect on the improvement of gene network inference is selected. A case study for the Saccharomyces cerevisiae and E. coli networks shows that using the proposed features in comparison with gene expression data alone results in 0.02-0.08 units improvement in Area Under Receiver Characteristic Operator (AUROC) criteria across different gene regulatory network inference methods. Furthermore, the proposed method was applied to the esophageal cancer data to infer its gene regulatory network. The proposed framework substantially improves accuracy and scalability of GRN inference. The fused features and the best centrality measure detected can be used to provide functional insights about genes or proteins in various biological applications. Moreover, it can be served as a general framework for network data and structural data integration and analysis problems in various scientific disciplines including biology.
Collapse
Affiliation(s)
- Atefeh Naseri
- Department of Computer Engineering, Alzahra University, Tehran, Iran.
| | - Mehran Sharghi
- Department of Computer Engineering, Alzahra University, Tehran, Iran.
| | | |
Collapse
|
47
|
Singh N, Bhatnagar S. Machine Learning for Prediction of Drug Targets in Microbe Associated Cardiovascular Diseases by Incorporating Host-pathogen Interaction Network Parameters. Mol Inform 2021; 41:e2100115. [PMID: 34676983 DOI: 10.1002/minf.202100115] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2021] [Accepted: 10/01/2021] [Indexed: 12/20/2022]
Abstract
Host-pathogen interactions play a crucial role in invasion, infection, and induction of immune response in humans. In this work, four machine learning algorithms, namely Logistic regression, K-nearest neighbor, Support Vector Machine, and Random Forest were implemented for the classification of drug targets. The algorithms were trained using 3400 hosts and 3800 pathogen drug and non-drug target proteins as learning instances. For each protein, 68 pathogen and 73 host features were computed that included sequence, structure, biological and host-pathogen network centrality characteristics. The Random Forest classifier model achieved the best accuracy after 10-fold cross-validation. 99 % accuracy was achieved with a ROC-AUC score of 0.99±0.01 for both pathogen and host training sets. The Eigenvector Centrality of host-pathogen interactions and host-host interactions was the top feature in performing classification of pathogen and host targets respectively. Other features important for classification were the presence of catalytic and binding sites, low instability/aliphatic index, and cellular location. The Random Forest classifier was then used for prediction of drug targets involved in Microbe Associated Cardiovascular Diseases. 331 host and 743 pathogen proteins were predicted as drug targets by the random forest model and can be validated experimentally for therapeutic intervention in Microbe Associated Cardiovascular Diseases.
Collapse
Affiliation(s)
- Nirupma Singh
- Department of Biotechnology, Netaji Subhas Institute of Technology, Dwarka, New Delhi, 110078, India
| | - Sonika Bhatnagar
- Department of Biotechnology, Netaji Subhas Institute of Technology, Dwarka, New Delhi, 110078, India.,Computational and Structural Biology Laboratory, Department of Biological Sciences and Engineering, Netaji Subhas University of Technology Dwarka, New Delhi, 110078, India
| |
Collapse
|
48
|
Berahmand K, Nasiri E, Pir Mohammadiani R, Li Y. Spectral clustering on protein-protein interaction networks via constructing affinity matrix using attributed graph embedding. Comput Biol Med 2021; 138:104933. [PMID: 34655897 DOI: 10.1016/j.compbiomed.2021.104933] [Citation(s) in RCA: 32] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2021] [Revised: 09/20/2021] [Accepted: 10/07/2021] [Indexed: 02/02/2023]
Abstract
The identification of protein complexes in protein-protein interaction networks is the most fundamental and essential problem for revealing the underlying mechanism of biological processes. However, most existing protein complexes identification methods only consider a network's topology structures, and in doing so, these methods miss the advantage of using nodes' feature information. In protein-protein interaction, both topological structure and node features are essential ingredients for protein complexes. The spectral clustering method utilizes the eigenvalues of the affinity matrix of the data to map to a low-dimensional space. It has attracted much attention in recent years as one of the most efficient algorithms in the subcategory of dimensionality reduction. In this paper, a new version of spectral clustering, named text-associated DeepWalk-Spectral Clustering (TADW-SC), is proposed for attributed networks in which the identified protein complexes have structural cohesiveness and attribute homogeneity. Since the performance of spectral clustering heavily depends on the effectiveness of the affinity matrix, our proposed method will use the text-associated DeepWalk (TADW) to calculate the embedding vectors of proteins. In the following, the affinity matrix will be computed by utilizing the cosine similarity between the two low dimensional vectors, which will be considerable to improve the accuracy of the affinity matrix. Experimental results show that our method performs unexpectedly well in comparison to existing state-of-the-art methods in both real protein network datasets and synthetic networks.
Collapse
Affiliation(s)
- Kamal Berahmand
- School of Computer Sciences, Science and Engineering Faculty, Queensland University of Technology (QUT), Brisbane, Australia.
| | - Elahe Nasiri
- Department of Information Technology and Communications, Azarbaijan Shahid Madani University, Tabriz, Iran.
| | | | - Yuefeng Li
- School of Computer Sciences, Science and Engineering Faculty, Queensland University of Technology (QUT), Brisbane, Australia.
| |
Collapse
|
49
|
Li Y, Xia Y, Zhu H, Luu E, Huang G, Sun Y, Sun K, Markx S, Leong KW, Xu B, Fu BM. Investigation of Neurodevelopmental Deficits of 22 q11.2 Deletion Syndrome with a Patient-iPSC-Derived Blood-Brain Barrier Model. Cells 2021; 10:cells10102576. [PMID: 34685556 PMCID: PMC8534009 DOI: 10.3390/cells10102576] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2021] [Revised: 09/24/2021] [Accepted: 09/26/2021] [Indexed: 12/13/2022] Open
Abstract
The blood–brain barrier (BBB) is important in the normal functioning of the central nervous system. An altered BBB has been described in various neuropsychiatric disorders, including schizophrenia. However, the cellular and molecular mechanisms of such alterations remain unclear. Here, we investigate if BBB integrity is compromised in 22q11.2 deletion syndrome (also called DiGeorge syndrome), which is one of the validated genetic risk factors for schizophrenia. We utilized a set of human brain microvascular endothelial cells (HBMECs) derived from the induced pluripotent stem cell (iPSC) lines of patients with 22q11.2-deletion-syndrome-associated schizophrenia. We found that the solute permeability of the BBB formed from patient HBMECs increases by ~1.3–1.4-fold, while the trans-endothelial electrical resistance decreases to ~62% of the control values. Correspondingly, tight junction proteins and the endothelial glycocalyx that determine the integrity of the BBB are significantly disrupted. A transcriptome study also suggests that the transcriptional network related to the cell–cell junctions in the compromised BBB is substantially altered. An enrichment analysis further suggests that the genes within the altered gene expression network also contribute to neurodevelopmental disorders. Our findings suggest that neurovascular coupling can be targeted in developing novel therapeutical strategies for the treatment of 22q11.2 deletion syndrome.
Collapse
Affiliation(s)
- Yunfei Li
- Department of Biomedical Engineering, The City College of the City University of New York, New York, NY 10031, USA; (Y.L.); (Y.X.); (E.L.); (G.H.)
| | - Yifan Xia
- Department of Biomedical Engineering, The City College of the City University of New York, New York, NY 10031, USA; (Y.L.); (Y.X.); (E.L.); (G.H.)
| | - Huixiang Zhu
- Department of Psychiatry, Columbia University, New York, NY 10032, USA; (H.Z.); (Y.S.); (K.S.); (S.M.)
| | - Eric Luu
- Department of Biomedical Engineering, The City College of the City University of New York, New York, NY 10031, USA; (Y.L.); (Y.X.); (E.L.); (G.H.)
| | - Guangyao Huang
- Department of Biomedical Engineering, The City College of the City University of New York, New York, NY 10031, USA; (Y.L.); (Y.X.); (E.L.); (G.H.)
| | - Yan Sun
- Department of Psychiatry, Columbia University, New York, NY 10032, USA; (H.Z.); (Y.S.); (K.S.); (S.M.)
| | - Kevin Sun
- Department of Psychiatry, Columbia University, New York, NY 10032, USA; (H.Z.); (Y.S.); (K.S.); (S.M.)
| | - Sander Markx
- Department of Psychiatry, Columbia University, New York, NY 10032, USA; (H.Z.); (Y.S.); (K.S.); (S.M.)
| | - Kam W. Leong
- Department of Biomedical Engineering, Columbia University, New York, NY 10027, USA;
| | - Bin Xu
- Department of Psychiatry, Columbia University, New York, NY 10032, USA; (H.Z.); (Y.S.); (K.S.); (S.M.)
- Correspondence: (B.X.); (B.M.F.); Tel.: +1-212-650-7531 (B.M.F.)
| | - Bingmei M. Fu
- Department of Biomedical Engineering, The City College of the City University of New York, New York, NY 10031, USA; (Y.L.); (Y.X.); (E.L.); (G.H.)
- Correspondence: (B.X.); (B.M.F.); Tel.: +1-212-650-7531 (B.M.F.)
| |
Collapse
|
50
|
Liu G, Liu B, Li A, Wang X, Yu J, Zhou X. Identifying Protein Complexes With Clear Module Structure Using Pairwise Constraints in Protein Interaction Networks. Front Genet 2021; 12:664786. [PMID: 34512712 PMCID: PMC8430217 DOI: 10.3389/fgene.2021.664786] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2021] [Accepted: 06/23/2021] [Indexed: 02/02/2023] Open
Abstract
The protein-protein interaction (PPI) networks can be regarded as powerful platforms to elucidate the principle and mechanism of cellular organization. Uncovering protein complexes from PPI networks will lead to a better understanding of the science of biological function in cellular systems. In recent decades, numerous computational algorithms have been developed to identify protein complexes. However, the majority of them primarily concern the topological structure of PPI networks and lack of the consideration for the native organized structure among protein complexes. The PPI networks generated by high-throughput technology include a fraction of false protein interactions which make it difficult to identify protein complexes efficiently. To tackle these challenges, we propose a novel semi-supervised protein complex detection model based on non-negative matrix tri-factorization, which not only considers topological structure of a PPI network but also makes full use of available high quality known protein pairs with must-link constraints. We propose non-overlapping (NSSNMTF) and overlapping (OSSNMTF) protein complex detection algorithms to identify the significant protein complexes with clear module structures from PPI networks. In addition, the proposed two protein complex detection algorithms outperform a diverse range of state-of-the-art protein complex identification algorithms on both synthetic networks and human related PPI networks.
Collapse
Affiliation(s)
- Guangming Liu
- School of Computer Science & Engineering, Xi'an University of Technology, Xi'an, China
| | - Bo Liu
- Hebei Key Laboratory of Agricultural Big Data, College of Information Science and Technology, Hebei Agricultural University, Baoding, China
| | - Aimin Li
- School of Computer Science & Engineering, Xi'an University of Technology, Xi'an, China
| | - Xiaofan Wang
- School of Computer Science & Engineering, Xi'an University of Technology, Xi'an, China
| | - Jian Yu
- Beijing Key Lab of Traffic Data Analysis and Mining, School of Computer and Information Technology, Beijing Jiaotong University, Beijing, China
| | - Xuezhong Zhou
- Beijing Key Lab of Traffic Data Analysis and Mining, School of Computer and Information Technology, Beijing Jiaotong University, Beijing, China
| |
Collapse
|