1
|
Funari A, Fiscon G, Paci P. Network medicine and systems pharmacology approaches to predicting adverse drug effects. Br J Pharmacol 2024. [PMID: 39262113 DOI: 10.1111/bph.17330] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2024] [Revised: 07/30/2024] [Accepted: 08/02/2024] [Indexed: 09/13/2024] Open
Abstract
Identifying and understanding the relationships between drug intake and adverse effects that can occur due to inadvertent molecular interactions between drugs and targets is a difficult task, especially considering the numerous variables that can influence the onset of such events. The ability to predict these side effects in advance would help physicians develop strategies to avoid or counteract them. In this article, we review the main computational methods for predicting side effects caused by drug molecules, highlighting their performance, limitations and application cases. Furthermore, we provide an overall view of resources, such as databases and tools, useful for building side effect prediction analyses.
Collapse
Affiliation(s)
- Alessio Funari
- Department of Translational and Precision Medicine, Sapienza University of Rome, Rome, Italy
- Institute for Systems Analysis and Computer Science "Antonio Ruberti", National Research Council, Rome, Italy
| | - Giulia Fiscon
- Institute for Systems Analysis and Computer Science "Antonio Ruberti", National Research Council, Rome, Italy
- Department of Computer, Control and Management Engineering "Antonio Ruberti", Sapienza University of Rome, Rome, Italy
| | - Paola Paci
- Institute for Systems Analysis and Computer Science "Antonio Ruberti", National Research Council, Rome, Italy
- Department of Computer, Control and Management Engineering "Antonio Ruberti", Sapienza University of Rome, Rome, Italy
| |
Collapse
|
2
|
Yang W, Zou S, Gao H, Wang L, Ni W. A Novel Method for Targeted Identification of Essential Proteins by Integrating Chemical Reaction Optimization and Naive Bayes Model. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2024; 21:1274-1286. [PMID: 38536675 DOI: 10.1109/tcbb.2024.3382392] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/10/2024]
Abstract
Targeted identification of essential proteins is of great significance for species identification, drug manufacturing, and disease treatment. It is a challenge to analyze the binding mechanism between essential proteins and improve the identification speed while ensuring the accuracy of the identification. This paper proposes a novel method called EPCRO for identifying essential proteins, which incorporates the chemical reaction optimization (CRO) algorithm and the naive Bayes model to effectively detect essential proteins. In EPCRO, the naive Bayes model is employed to analyze the homogeneity between proteins. In order to improve the identification rate and speed of essential proteins, the protein homogeneity rate is integrated into the CRO algorithm to balance between local and global searches. EPCRO is experimentally compared with 17 existing methods (including, DC, SC, IC, EC, LAC, NC, PeC, WDC, EPD-RW, RWHN, TEGS, CFMM, BSPM, AFSO-EP, CVIM, RWEP, and EPPSO-DC) based on biological datasets. The results show that EPCRO is superior to the above methods in identification accuracy and speed.
Collapse
|
3
|
Datta C, Das P, Dutta S, Prasad T, Banerjee A, Gehlot S, Ghosal A, Dhabal S, Biswas P, De D, Chaudhuri S, Bhattacharjee A. AMPK activation reduces cancer cell aggressiveness via inhibition of monoamine oxidase A (MAO-A) expression/activity. Life Sci 2024; 352:122857. [PMID: 38914305 DOI: 10.1016/j.lfs.2024.122857] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2024] [Revised: 06/14/2024] [Accepted: 06/16/2024] [Indexed: 06/26/2024]
Abstract
AIM AMPK can be considered as an important target molecule for cancer for its unique ability to directly recognize cellular energy status. The main aim of this study is to explore the role of different AMPK activators in managing cancer cell aggressiveness and to understand the mechanistic details behind the process. MAIN METHODS First, we explored the AMPK expression pattern and its significance in different subtypes of lung cancer by accessing the TCGA data sets for LUNG, LUAD and LUSC patients and then established the correlation between AMPK expression pattern and overall survival of lung cancer patients using Kaplan-Meire plot. We further carried out several cell-based assays by employing different wet lab techniques including RT-PCR, Western Blot, proliferation, migration and invasion assays to fulfil the aim of the study. KEY FINDINGS SIGNIFICANCE: This study identifies the importance of AMPK activators as a repurposing agent for combating lung and colon cancer cell aggressiveness. It also suggests SRT-1720 as a potent repurposing agent for cancer treatment especially in NSCLC patients where a point mutation is present in LKB1.
Collapse
Affiliation(s)
- Chandreyee Datta
- Department of Biotechnology, National Institute of Technology, Durgapur, Mahatma Gandhi Avenue, 713209 Burdwan, West Bengal, India
| | - Payel Das
- Department of Biotechnology, National Institute of Technology, Durgapur, Mahatma Gandhi Avenue, 713209 Burdwan, West Bengal, India
| | - Subhajit Dutta
- Department of Biotechnology, National Institute of Technology, Durgapur, Mahatma Gandhi Avenue, 713209 Burdwan, West Bengal, India
| | - Tuhina Prasad
- Department of Biotechnology, National Institute of Technology, Durgapur, Mahatma Gandhi Avenue, 713209 Burdwan, West Bengal, India
| | - Abhineet Banerjee
- Department of Biotechnology, National Institute of Technology, Durgapur, Mahatma Gandhi Avenue, 713209 Burdwan, West Bengal, India
| | - Sameep Gehlot
- Department of Biotechnology, National Institute of Technology, Durgapur, Mahatma Gandhi Avenue, 713209 Burdwan, West Bengal, India
| | - Arpa Ghosal
- Department of Biotechnology, National Institute of Technology, Durgapur, Mahatma Gandhi Avenue, 713209 Burdwan, West Bengal, India
| | - Sukhamoy Dhabal
- Department of Biotechnology, National Institute of Technology, Durgapur, Mahatma Gandhi Avenue, 713209 Burdwan, West Bengal, India
| | - Pritam Biswas
- Department of Biotechnology, National Institute of Technology, Durgapur, Mahatma Gandhi Avenue, 713209 Burdwan, West Bengal, India
| | - Debojyoti De
- Department of Biotechnology, National Institute of Technology, Durgapur, Mahatma Gandhi Avenue, 713209 Burdwan, West Bengal, India
| | - Surabhi Chaudhuri
- Department of Biotechnology, National Institute of Technology, Durgapur, Mahatma Gandhi Avenue, 713209 Burdwan, West Bengal, India
| | - Ashish Bhattacharjee
- Department of Biotechnology, National Institute of Technology, Durgapur, Mahatma Gandhi Avenue, 713209 Burdwan, West Bengal, India.
| |
Collapse
|
4
|
Pollet L, Xia Y. Structure-guided Evolutionary Analysis of Interactome Network Rewiring at Single Residue Resolution in Yeasts. J Mol Biol 2024; 436:168641. [PMID: 38844045 DOI: 10.1016/j.jmb.2024.168641] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2024] [Revised: 04/30/2024] [Accepted: 06/01/2024] [Indexed: 06/16/2024]
Abstract
Protein-protein interactions (PPIs) are known to rewire extensively during evolution leading to lineage-specific and species-specific changes in molecular processes. However, the detailed molecular evolutionary mechanisms underlying interactome network rewiring are not well-understood. Here, we combine high-confidence PPI data, high-resolution three-dimensional structures of protein complexes, and homology-based structural annotation transfer to construct structurally-resolved interactome networks for the two yeasts S. cerevisiae and S. pombe. We then classify PPIs according to whether they are preserved or different between the two yeast species and compare site-specific evolutionary rates of interfacial versus non-interfacial residues for these different categories of PPIs. We find that residues in PPI interfaces evolve significantly more slowly than non-interfacial residues when using lineage-specific measures of evolutionary rate, but not when using non-lineage-specific measures. Furthermore, both lineage-specific and non-lineage-specific evolutionary rate measures can distinguish interfacial residues from non-interfacial residues for preserved PPIs between the two yeasts, but only the lineage-specific measure is appropriate for rewired PPIs. Finally, both lineage-specific and non-lineage-specific evolutionary rate measures are appropriate for elucidating structural determinants of protein evolution for residues outside of PPI interfaces. Overall, our results demonstrate that unlike tertiary structures of single proteins, PPIs and PPI interfaces can be highly volatile in their evolution, thus requiring the use of lineage-specific measures when studying their evolution. These results yield insight into the evolutionary design principles of PPIs and the mechanisms by which interactions are preserved or rewired between species, improving our understanding of the molecular evolution of PPIs and PPI interfaces at the residue level.
Collapse
Affiliation(s)
- Léah Pollet
- Department of Bioengineering, Faculty of Engineering, McGill University, Montreal, QC, Canada
| | - Yu Xia
- Department of Bioengineering, Faculty of Engineering, McGill University, Montreal, QC, Canada.
| |
Collapse
|
5
|
Pirayeshfard L, Luo S, Githaka JM, Saini A, Touret N, Goping IS, Julien O. Comparing the BAD Protein Interactomes in 2D and 3D Cell Culture Using Proximity Labeling. J Proteome Res 2024; 23:3433-3443. [PMID: 38959414 PMCID: PMC11302415 DOI: 10.1021/acs.jproteome.4c00111] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2024] [Revised: 06/20/2024] [Accepted: 06/25/2024] [Indexed: 07/05/2024]
Abstract
Protein-protein interaction studies using proximity labeling techniques, such as biotin ligase-based BioID, have become integral in understanding cellular processes. Most studies utilize conventional 2D cell culture systems, potentially missing important differences in protein behavior found in 3D tissues. In this study, we investigated the protein-protein interactions of a protein, Bcl-2 Agonist of cell death (BAD), and compared conventional 2D culture conditions to a 3D system, wherein cells were embedded within a 3D extracellular matrix (ECM) mimic. Using BAD fused to the engineered biotin ligase miniTurbo (BirA*), we identified both overlapping and distinct BAD interactomes under 2D and 3D conditions. The known BAD binding proteins 14-3-3 isoforms and Bcl-XL interacted with BAD in both 2D and 3D. Of the 131 BAD-interactors identified, 56% were specific to 2D, 14% were specific to 3D, and 30% were common to both conditions. Interaction network analysis demonstrated differential associations between 2D and 3D interactomes, emphasizing the impact of the culture conditions on protein interactions. The 2D-3D overlap interactome encapsulated the apoptotic program, which is a well-known role of BAD. The 3D unique pathways were enriched in ECM signaling, suggestive of hitherto unknown functions for BAD. Thus, exploring protein-protein interactions in 3D provides novel clues into cell behavior. This exciting approach has the potential to bridge the knowledge gap between tractable 2D cell culture and organoid-like 3D systems.
Collapse
Affiliation(s)
- Leila Pirayeshfard
- Department
of Biochemistry, University of Alberta, Edmonton, AB T6G 2H7, Canada
| | - Shu Luo
- Department
of Biochemistry, University of Alberta, Edmonton, AB T6G 2H7, Canada
| | | | - Arashdeep Saini
- Department
of Biochemistry, University of Alberta, Edmonton, AB T6G 2H7, Canada
| | - Nicolas Touret
- Department
of Biochemistry, University of Alberta, Edmonton, AB T6G 2H7, Canada
| | - Ing Swie Goping
- Department
of Biochemistry, University of Alberta, Edmonton, AB T6G 2H7, Canada
- Department
of Oncology, University of Alberta, Edmonton, AB T6G 2H7, Canada
| | - Olivier Julien
- Department
of Biochemistry, University of Alberta, Edmonton, AB T6G 2H7, Canada
| |
Collapse
|
6
|
Li J, Wang B, Ma X. Non-Coding RNAs Extended Omnigenic Module of Cancers. ENTROPY (BASEL, SWITZERLAND) 2024; 26:640. [PMID: 39202109 PMCID: PMC11353529 DOI: 10.3390/e26080640] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/13/2024] [Revised: 07/24/2024] [Accepted: 07/25/2024] [Indexed: 09/03/2024]
Abstract
The emergence of cancers involves numerous coding and non-coding genes. Understanding the contribution of non-coding RNAs (ncRNAs) to the cancer neighborhood is crucial for interpreting the interaction between molecular markers of cancer. However, there is a lack of systematic studies on the involvement of ncRNAs in the cancer neighborhood. In this paper, we construct an interaction network which encompasses multiple genes. We focus on the fundamental topological indicator, namely connectivity, and evaluate its performance when applied to cancer-affected genes using statistical indices. Our findings reveal that ncRNAs significantly enhance the connectivity of affected genes and mediate the inclusion of more genes in the cancer module. To further explore the role of ncRNAs in the network, we propose a connectivity-based method which leverages the bridging function of ncRNAs across cancer-affected genes and reveals the non-coding RNAs extended omnigenic module (NeOModule). Topologically, this module promotes the formation of cancer patterns involving ncRNAs. Biologically, it is enriched with cancer pathways and treatment targets, providing valuable insights into disease relationships.
Collapse
Affiliation(s)
| | - Bingbo Wang
- School of Computer Science and Technology, Xidian University, Xi’an 710119, China; (J.L.); (X.M.)
| | | |
Collapse
|
7
|
Su C, Hou Y, Xu J, Xu Z, Zhou M, Ke A, Li H, Xu J, Brendel M, Maasch JRMA, Bai Z, Zhang H, Zhu Y, Cincotta MC, Shi X, Henchcliffe C, Leverenz JB, Cummings J, Okun MS, Bian J, Cheng F, Wang F. Identification of Parkinson's disease PACE subtypes and repurposing treatments through integrative analyses of multimodal data. NPJ Digit Med 2024; 7:184. [PMID: 38982243 PMCID: PMC11233682 DOI: 10.1038/s41746-024-01175-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2023] [Accepted: 06/21/2024] [Indexed: 07/11/2024] Open
Abstract
Parkinson's disease (PD) is a serious neurodegenerative disorder marked by significant clinical and progression heterogeneity. This study aimed at addressing heterogeneity of PD through integrative analysis of various data modalities. We analyzed clinical progression data (≥5 years) of individuals with de novo PD using machine learning and deep learning, to characterize individuals' phenotypic progression trajectories for PD subtyping. We discovered three pace subtypes of PD exhibiting distinct progression patterns: the Inching Pace subtype (PD-I) with mild baseline severity and mild progression speed; the Moderate Pace subtype (PD-M) with mild baseline severity but advancing at a moderate progression rate; and the Rapid Pace subtype (PD-R) with the most rapid symptom progression rate. We found cerebrospinal fluid P-tau/α-synuclein ratio and atrophy in certain brain regions as potential markers of these subtypes. Analyses of genetic and transcriptomic profiles with network-based approaches identified molecular modules associated with each subtype. For instance, the PD-R-specific module suggested STAT3, FYN, BECN1, APOA1, NEDD4, and GATA2 as potential driver genes of PD-R. It also suggested neuroinflammation, oxidative stress, metabolism, PI3K/AKT, and angiogenesis pathways as potential drivers for rapid PD progression (i.e., PD-R). Moreover, we identified repurposable drug candidates by targeting these subtype-specific molecular modules using network-based approach and cell line drug-gene signature data. We further estimated their treatment effects using two large-scale real-world patient databases; the real-world evidence we gained highlighted the potential of metformin in ameliorating PD progression. In conclusion, this work helps better understand clinical and pathophysiological complexity of PD progression and accelerate precision medicine.
Collapse
Grants
- R21 AG083003 NIA NIH HHS
- R01 AG082118 NIA NIH HHS
- R56 AG074001 NIA NIH HHS
- R01AG076448 Foundation for the National Institutes of Health (Foundation for the National Institutes of Health, Inc.)
- RF1AG072449 Foundation for the National Institutes of Health (Foundation for the National Institutes of Health, Inc.)
- MJFF-023081 Michael J. Fox Foundation for Parkinson's Research (Michael J. Fox Foundation)
- R01AG080991 Foundation for the National Institutes of Health (Foundation for the National Institutes of Health, Inc.)
- P30 AG072959 NIA NIH HHS
- 3R01AG066707-01S1 Foundation for the National Institutes of Health (Foundation for the National Institutes of Health, Inc.)
- R21AG083003 Foundation for the National Institutes of Health (Foundation for the National Institutes of Health, Inc.)
- R01AG066707 Foundation for the National Institutes of Health (Foundation for the National Institutes of Health, Inc.)
- R35 AG071476 NIA NIH HHS
- RF1 AG082211 NIA NIH HHS
- R56AG074001 Foundation for the National Institutes of Health (Foundation for the National Institutes of Health, Inc.)
- R01AG082118 Foundation for the National Institutes of Health (Foundation for the National Institutes of Health, Inc.)
- R25 AG083721 NIA NIH HHS
- RF1AG082211 Foundation for the National Institutes of Health (Foundation for the National Institutes of Health, Inc.)
- U01 NS093334 NINDS NIH HHS
- AG083721-01 Foundation for the National Institutes of Health (Foundation for the National Institutes of Health, Inc.)
- RF1NS133812 Foundation for the National Institutes of Health (Foundation for the National Institutes of Health, Inc.)
- P20GM109025 Foundation for the National Institutes of Health (Foundation for the National Institutes of Health, Inc.)
- RF1 NS133812 NINDS NIH HHS
- R35AG71476 Foundation for the National Institutes of Health (Foundation for the National Institutes of Health, Inc.)
- U01 AG073323 NIA NIH HHS
- R01 AG066707 NIA NIH HHS
- R01AG053798 Foundation for the National Institutes of Health (Foundation for the National Institutes of Health, Inc.)
- R01AG076234 Foundation for the National Institutes of Health (Foundation for the National Institutes of Health, Inc.)
- R01 AG076448 NIA NIH HHS
- R01 AG080991 NIA NIH HHS
- R01 AG076234 NIA NIH HHS
- U01NS093334 Foundation for the National Institutes of Health (Foundation for the National Institutes of Health, Inc.)
- P20 GM109025 NIGMS NIH HHS
- P30AG072959 Foundation for the National Institutes of Health (Foundation for the National Institutes of Health, Inc.)
- RF1 AG072449 NIA NIH HHS
- R01 AG053798 NIA NIH HHS
- 3R01AG066707-02S1 Foundation for the National Institutes of Health (Foundation for the National Institutes of Health, Inc.)
- U01AG073323 Foundation for the National Institutes of Health (Foundation for the National Institutes of Health, Inc.)
- ALZDISCOVERY-1051936 Alzheimer's Association
Collapse
Affiliation(s)
- Chang Su
- Department of Population Health Sciences, Weill Cornell Medicine, Cornell University, New York, NY, USA
- Institute of Artificial Intelligence for Digital Health, Weill Cornell Medicine, Cornell University, New York, NY, USA
| | - Yu Hou
- Department of Surgery, University of Minnesota, Minneapolis, MN, USA
| | - Jielin Xu
- Genomic Medicine Institute, Lerner Research Institute, Cleveland Clinic, Cleveland, OH, USA
| | - Zhenxing Xu
- Department of Population Health Sciences, Weill Cornell Medicine, Cornell University, New York, NY, USA
- Institute of Artificial Intelligence for Digital Health, Weill Cornell Medicine, Cornell University, New York, NY, USA
| | - Manqi Zhou
- Institute of Artificial Intelligence for Digital Health, Weill Cornell Medicine, Cornell University, New York, NY, USA
- Department of Computational Biology, Cornell University, Ithaca, NY, USA
| | - Alison Ke
- Institute of Artificial Intelligence for Digital Health, Weill Cornell Medicine, Cornell University, New York, NY, USA
- Department of Computational Biology, Cornell University, Ithaca, NY, USA
| | - Haoyang Li
- Department of Population Health Sciences, Weill Cornell Medicine, Cornell University, New York, NY, USA
- Institute of Artificial Intelligence for Digital Health, Weill Cornell Medicine, Cornell University, New York, NY, USA
| | - Jie Xu
- Department of Health Outcomes and Biomedical Informatics, University of Florida, Gainesville, FL, USA
| | - Matthew Brendel
- Institute for Computational Biomedicine, Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY, USA
| | - Jacqueline R M A Maasch
- Institute of Artificial Intelligence for Digital Health, Weill Cornell Medicine, Cornell University, New York, NY, USA
- Department of Computer Science, Cornell Tech, Cornell University, New York, NY, USA
| | - Zilong Bai
- Department of Population Health Sciences, Weill Cornell Medicine, Cornell University, New York, NY, USA
- Institute of Artificial Intelligence for Digital Health, Weill Cornell Medicine, Cornell University, New York, NY, USA
| | - Haotan Zhang
- Department of Physiology and Biophysics, Weill Cornell Medicine, New York, NY, USA
| | - Yingying Zhu
- Department of Computer Science, University of Texas at Arlington, Arlington, TX, USA
| | - Molly C Cincotta
- Lewis Katz School of Medicine, Temple University, Philadelphia, PA, USA
| | - Xinghua Shi
- Department of Computer and Information Sciences, Temple University, Philadelphia, PA, USA
| | - Claire Henchcliffe
- Department of Neurology, University of California Irvine, Irvine, CA, USA
| | - James B Leverenz
- Lou Ruvo Center for Brain Health, Neurological Institute, Cleveland Clinic, Cleveland, OH, USA
| | - Jeffrey Cummings
- Chambers-Grundy Center for Transformative Neuroscience, Pam Quirk Brain Health and Biomarker Laboratory, Department of Brain Health, School of Integrated Health Sciences, University of Nevada Las Vegas, Las Vegas, NV, USA
| | - Michael S Okun
- Department of Neurology, Fixel Institute for Neurological Diseases, University of Florida, Gainesville, FL, USA
| | - Jiang Bian
- Department of Health Outcomes and Biomedical Informatics, University of Florida, Gainesville, FL, USA
| | - Feixiong Cheng
- Genomic Medicine Institute, Lerner Research Institute, Cleveland Clinic, Cleveland, OH, USA
- Department of Molecular Medicine, Cleveland Clinic Lerner College of Medicine, Case Western Reserve University, Cleveland, OH, USA
| | - Fei Wang
- Department of Population Health Sciences, Weill Cornell Medicine, Cornell University, New York, NY, USA.
- Institute of Artificial Intelligence for Digital Health, Weill Cornell Medicine, Cornell University, New York, NY, USA.
| |
Collapse
|
8
|
Kellman BP, Mariethoz J, Zhang Y, Shaul S, Alteri M, Sandoval D, Jeffris M, Armingol E, Bao B, Lisacek F, Bojar D, Lewis NE. Decoding glycosylation potential from protein structure across human glycoproteins with a multi-view recurrent neural network. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.05.15.594334. [PMID: 38798633 PMCID: PMC11118808 DOI: 10.1101/2024.05.15.594334] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/29/2024]
Abstract
Glycosylation is described as a non-templated biosynthesis. Yet, the template-free premise is antithetical to the observation that different N-glycans are consistently placed at specific sites. It has been proposed that glycosite-proximal protein structures could constrain glycosylation and explain the observed microheterogeneity. Using site-specific glycosylation data, we trained a hybrid neural network to parse glycosites (recurrent neural network) and match them to feasible N-glycosylation events (graph neural network). From glycosite-flanking sequences, the algorithm predicts most human N-glycosylation events documented in the GlyConnect database and proposed structures corresponding to observed monosaccharide composition of the glycans at these sites. The algorithm also recapitulated glycosylation in Enhanced Aromatic Sequons, SARS-CoV-2 spike, and IgG3 variants, thus demonstrating the ability of the algorithm to predict both glycan structure and abundance. Thus, protein structure constrains glycosylation, and the neural network enables predictive in silico glycosylation of uncharacterized or novel protein sequences and genetic variants.
Collapse
Affiliation(s)
- Benjamin P. Kellman
- Department of Pediatrics, University of California, San Diego, La Jolla, CA 92093, USA
- Department of Bioengineering, University of California, San Diego, La Jolla, CA 92093, USA
- Bioinformatics and Systems Biology Graduate Program, University of California, San Diego, La Jolla, CA 92093, USA
- Augment Biologics, La Jolla, CA 92092
- Ragon Institute of MGH, MIT, and Harvard, Cambridge, MA, USA
| | - Julien Mariethoz
- Proteome Informatics Group, Swiss Institute of Bioinformatics, CH-1227 Geneva, Switzerland
| | - Yujie Zhang
- Department of Pediatrics, University of California, San Diego, La Jolla, CA 92093, USA
| | - Sigal Shaul
- Department of Pediatrics, University of California, San Diego, La Jolla, CA 92093, USA
- Department of Bioengineering, University of California, San Diego, La Jolla, CA 92093, USA
| | - Mia Alteri
- Department of Pediatrics, University of California, San Diego, La Jolla, CA 92093, USA
- Department of Bioengineering, University of California, San Diego, La Jolla, CA 92093, USA
| | - Daniel Sandoval
- Department of Cellular and Molecular Medicine, University of California, San Diego, La Jolla, CA 92093, USA
| | - Mia Jeffris
- Department of Pediatrics, University of California, San Diego, La Jolla, CA 92093, USA
- Department of Bioengineering, University of California, San Diego, La Jolla, CA 92093, USA
| | - Erick Armingol
- Department of Pediatrics, University of California, San Diego, La Jolla, CA 92093, USA
- Department of Bioengineering, University of California, San Diego, La Jolla, CA 92093, USA
- Bioinformatics and Systems Biology Graduate Program, University of California, San Diego, La Jolla, CA 92093, USA
| | - Bokan Bao
- Department of Pediatrics, University of California, San Diego, La Jolla, CA 92093, USA
- Bioinformatics and Systems Biology Graduate Program, University of California, San Diego, La Jolla, CA 92093, USA
| | - Frederique Lisacek
- Proteome Informatics Group, Swiss Institute of Bioinformatics, CH-1227 Geneva, Switzerland
- Computer Science Department & Section of Biology, University of Geneva, route de Drize 7, CH-1227, Geneva, Switzerland
| | - Daniel Bojar
- Wallenberg Centre for Molecular and Translational Medicine, University of Gothenburg, Gothenburg 41390, Sweden
- Department of Chemistry and Molecular Biology, University of Gothenburg, Gothenburg 41390, Sweden
| | - Nathan E. Lewis
- Department of Pediatrics, University of California, San Diego, La Jolla, CA 92093, USA
- Department of Bioengineering, University of California, San Diego, La Jolla, CA 92093, USA
- Bioinformatics and Systems Biology Graduate Program, University of California, San Diego, La Jolla, CA 92093, USA
- Ragon Institute of MGH, MIT, and Harvard, Cambridge, MA, USA
| |
Collapse
|
9
|
Stochaj U. Yeast profilin mutants inhibit classical nuclear import and alter the balance between actin and tubulin levels. Biochem Cell Biol 2024; 102:206-212. [PMID: 38048555 DOI: 10.1139/bcb-2023-0223] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/06/2023] Open
Abstract
Profilin is a small protein that controls actin polymerization in yeast and higher eukaryotes. In addition, profilin has emerged as a multifunctional protein that contributes to other processes in multicellular organisms. This study focuses on profilin (Pfy1) in the budding yeast Saccharomyces cerevisiae. The primary sequences of yeast Pfy1 and its metazoan orthologs diverge vastly. However, structural elements of profilin are conserved among different species. To date, the full spectrum of Pfy1 functions has yet to be defined. The current work explores the possible involvement of yeast profilin in nuclear protein import. To this end, a panel of well-characterized yeast profilin mutants was evaluated. The experiments demonstrate that yeast profilin (i) regulates nuclear protein import, (ii) determines the subcellular localization of essential nuclear transport factors, and (iii) controls the relative abundance of actin and tubulin. Together, these results define yeast profilin as a moonlighting protein that engages in multiple essential cellular activities.
Collapse
Affiliation(s)
- Ursula Stochaj
- Department of Physiology, McGill University, Montreal, QC H3G 1Y6, Canada
- Quantitative Life Sciences Program, McGill University, Montreal, QC H3G 1Y6, Canada
| |
Collapse
|
10
|
Kodiha M, Azad N, Chu S, Crampton N, Stochaj U. Oxidative stress and signaling through EGFR and PKA pathways converge on the nuclear transport factor RanBP1. Eur J Cell Biol 2024; 103:151376. [PMID: 38011756 DOI: 10.1016/j.ejcb.2023.151376] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2023] [Revised: 11/01/2023] [Accepted: 11/17/2023] [Indexed: 11/29/2023] Open
Abstract
Nuclear protein trafficking requires the soluble transport factor RanBP1. The subcellular distribution of RanBP1 is dynamic, as the protein shuttles between the nucleus and cytoplasm. To date, the signaling pathways regulating RanBP1 subcellular localization are poorly understood. During interphase, RanBP1 resides mostly in the cytoplasm. We show here that oxidative stress concentrates RanBP1 in the nucleus, and our study defines the underlying mechanisms. Specifically, RanBP1's cysteine residues are not essential for its oxidant-induced relocation. Furthermore, our pharmacological approaches uncover that signaling mediated by epidermal growth factor receptor (EGFR) and protein kinase A (PKA) control RanBP1 localization during stress. In particular, pharmacological inhibitors of EGFR or PKA diminish the oxidant-dependent relocation of RanBP1. Mutant analysis identified serine 60 and tyrosine 103 as regulators of RanBP1 nuclear accumulation during oxidant exposure. Taken together, our results define RanBP1 as a target of oxidative stress and a downstream effector of EGFR and PKA signaling routes. This positions RanBP1 at the intersection of important cellular signaling circuits.
Collapse
Affiliation(s)
- Mohamed Kodiha
- Department of Physiology McGill University, Montreal H3G 1Y6, Canada
| | - Nabila Azad
- Department of Physiology McGill University, Montreal H3G 1Y6, Canada
| | - Siwei Chu
- Department of Physiology McGill University, Montreal H3G 1Y6, Canada
| | - Noah Crampton
- Department of Physiology McGill University, Montreal H3G 1Y6, Canada
| | - Ursula Stochaj
- Department of Physiology McGill University, Montreal H3G 1Y6, Canada.
| |
Collapse
|
11
|
Meyniel-Schicklin L, Amaudrut J, Mallinjoud P, Guillier F, Mangeot PE, Lines L, Aublin-Gex A, Scholtes C, Punginelli C, Joly S, Vasseur F, Manet E, Gruffat H, Henry T, Halitim F, Paparin JL, Machin P, Darteil R, Sampson D, Mikaelian I, Lane L, Navratil V, Golinelli-Cohen MP, Terzi F, André P, Lotteau V, Vonderscher J, Meldrum EC, de Chassey B. Viruses traverse the human proteome through peptide interfaces that can be biomimetically leveraged for drug discovery. Proc Natl Acad Sci U S A 2024; 121:e2308776121. [PMID: 38252831 PMCID: PMC10835127 DOI: 10.1073/pnas.2308776121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2023] [Accepted: 12/06/2023] [Indexed: 01/24/2024] Open
Abstract
We present a drug design strategy based on structural knowledge of protein-protein interfaces selected through virus-host coevolution and translated into highly potential small molecules. This approach is grounded on Vinland, the most comprehensive atlas of virus-human protein-protein interactions with annotation of interacting domains. From this inspiration, we identified small viral protein domains responsible for interaction with human proteins. These peptides form a library of new chemical entities used to screen for replication modulators of several pathogens. As a proof of concept, a peptide from a KSHV protein, identified as an inhibitor of influenza virus replication, was translated into a small molecule series with low nanomolar antiviral activity. By targeting the NEET proteins, these molecules turn out to be of therapeutic interest in a nonalcoholic steatohepatitis mouse model with kidney lesions. This study provides a biomimetic framework to design original chemistries targeting cellular proteins, with indications going far beyond infectious diseases.
Collapse
Affiliation(s)
| | | | | | | | - Philippe E. Mangeot
- Centre International de Recherche en Infectiologie, University Lyon, Inserm, U1111, Université Claude Bernard Lyon 1, CNRS, UMR5308, Ecole Normale Supérieure de Lyon, Lyon69007, France
| | | | - Anne Aublin-Gex
- Centre International de Recherche en Infectiologie, University Lyon, Inserm, U1111, Université Claude Bernard Lyon 1, CNRS, UMR5308, Ecole Normale Supérieure de Lyon, Lyon69007, France
| | - Caroline Scholtes
- Centre International de Recherche en Infectiologie, University Lyon, Inserm, U1111, Université Claude Bernard Lyon 1, CNRS, UMR5308, Ecole Normale Supérieure de Lyon, Lyon69007, France
| | - Claire Punginelli
- Centre International de Recherche en Infectiologie, University Lyon, Inserm, U1111, Université Claude Bernard Lyon 1, CNRS, UMR5308, Ecole Normale Supérieure de Lyon, Lyon69007, France
| | | | - Florence Vasseur
- Université de Paris, INSERM U1151, CNRS UMR 8253, Institut Necker Enfants Malades, Département “Croissance et Signalisation”, Paris75015, France
| | - Evelyne Manet
- Centre International de Recherche en Infectiologie, University Lyon, Inserm, U1111, Université Claude Bernard Lyon 1, CNRS, UMR5308, Ecole Normale Supérieure de Lyon, Lyon69007, France
| | - Henri Gruffat
- Centre International de Recherche en Infectiologie, University Lyon, Inserm, U1111, Université Claude Bernard Lyon 1, CNRS, UMR5308, Ecole Normale Supérieure de Lyon, Lyon69007, France
| | - Thomas Henry
- Centre International de Recherche en Infectiologie, University Lyon, Inserm, U1111, Université Claude Bernard Lyon 1, CNRS, UMR5308, Ecole Normale Supérieure de Lyon, Lyon69007, France
| | | | | | | | | | | | - Ivan Mikaelian
- Université de Lyon, Université Claude Bernard Lyon 1, INSERM 1052, CNRS 5286, Centre Léon Bérard, Centre de recherche en cancérologie de Lyon, Lyon69373, France
| | - Lydie Lane
- Computer and Laboratory Investigation of Proteins of Human Origin Group, Swiss Institute of Bioinformatics, Lausanne1015, Switzerland
| | - Vincent Navratil
- Pôle Rhône-Alpes de bioinformatique, Rhône-Alpes Bioinformatics Center, Université Lyon 1, Villeurbanne69622, France
- European Virus Bio-informatiques Center, Jena07743, Germany
- Institut Français de Bioinformatique, IFB-core, UMS 3601, Évry91057, France
| | - Marie-Pierre Golinelli-Cohen
- Université Paris-Saclay, CNRS, Institut de Chimie des Substances Naturelles, Unité Propre de Recherche 2301, Gif-sur-Yvette91198, France
| | - Fabiola Terzi
- Université de Paris, INSERM U1151, CNRS UMR 8253, Institut Necker Enfants Malades, Département “Croissance et Signalisation”, Paris75015, France
| | - Patrice André
- Centre International de Recherche en Infectiologie, University Lyon, Inserm, U1111, Université Claude Bernard Lyon 1, CNRS, UMR5308, Ecole Normale Supérieure de Lyon, Lyon69007, France
| | - Vincent Lotteau
- Centre International de Recherche en Infectiologie, University Lyon, Inserm, U1111, Université Claude Bernard Lyon 1, CNRS, UMR5308, Ecole Normale Supérieure de Lyon, Lyon69007, France
| | | | | | | |
Collapse
|
12
|
Zhao H, Liu G, Cao X. A seed expansion-based method to identify essential proteins by integrating protein-protein interaction sub-networks and multiple biological characteristics. BMC Bioinformatics 2023; 24:452. [PMID: 38036960 PMCID: PMC10688502 DOI: 10.1186/s12859-023-05583-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2023] [Accepted: 11/24/2023] [Indexed: 12/02/2023] Open
Abstract
BACKGROUND The identification of essential proteins is of great significance in biology and pathology. However, protein-protein interaction (PPI) data obtained through high-throughput technology include a high number of false positives. To overcome this limitation, numerous computational algorithms based on biological characteristics and topological features have been proposed to identify essential proteins. RESULTS In this paper, we propose a novel method named SESN for identifying essential proteins. It is a seed expansion method based on PPI sub-networks and multiple biological characteristics. Firstly, SESN utilizes gene expression data to construct PPI sub-networks. Secondly, seed expansion is performed simultaneously in each sub-network, and the expansion process is based on the topological features of predicted essential proteins. Thirdly, the error correction mechanism is based on multiple biological characteristics and the entire PPI network. Finally, SESN analyzes the impact of each biological characteristic, including protein complex, gene expression data, GO annotations, and subcellular localization, and adopts the biological data with the best experimental results. The output of SESN is a set of predicted essential proteins. CONCLUSIONS The analysis of each component of SESN indicates the effectiveness of all components. We conduct comparison experiments using three datasets from two species, and the experimental results demonstrate that SESN achieves superior performance compared to other methods.
Collapse
Affiliation(s)
- He Zhao
- College of Computer Science and Technology, Jilin University, Changchun, China
- Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun, China
| | - Guixia Liu
- College of Computer Science and Technology, Jilin University, Changchun, China.
- Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun, China.
| | - Xintian Cao
- College of Computer Science and Technology, Jilin University, Changchun, China
- Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun, China
| |
Collapse
|
13
|
Kim MJ, Kulkarni V, Goode MA, Sivesind TE. Exploring the interactions of antihistamine with retinoic acid receptor beta (RARB) by molecular dynamics simulations and genome-wide meta-analysis. J Mol Graph Model 2023; 124:108539. [PMID: 37331258 PMCID: PMC10529808 DOI: 10.1016/j.jmgm.2023.108539] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2023] [Revised: 06/03/2023] [Accepted: 06/05/2023] [Indexed: 06/20/2023]
Abstract
Kaposi sarcoma (KS) is one of the most common AIDS-related malignant neoplasms, which can leave lesions on the skin among HIV patients. These lesions can be treated with 9-cis-retinoic acid (9-cis-RA), an endogenous ligand of retinoic acid receptors that has been FDA-approved for treatment of KS. However, topical application of 9-cis-RA can induce several unpleasant side effects, like headache, hyperlipidemia, and nausea. Hence, alternative therapeutics with less side effects are desirable. There are case reports associating over-the-counter antihistamine usage with regression of KS. Antihistamines competitively bind to H1 receptor and block the action of histamine, best known for being released in response to allergens. Furthermore, there are already dozens of antihistamines that are FDA-approved with less side effects than 9-cis-RA. This led our team to conduct a series of in-silico assays to determine whether antihistamines can activate retinoic acid receptors. First, we utilized high-throughput virtual screening and molecular dynamics simulations to model high-affinity interactions between antihistamines and retinoic acid receptor beta (RARβ). We then performed systems genetics analysis to identify a genetic association between H1 receptor itself and molecular pathways involved in KS. Together, these findings advocate for exploration of antihistamines against KS, starting with our two promising hit compounds, bepotastine and hydroxyzine, for experimental validation study in the future.
Collapse
Affiliation(s)
- Minjae J Kim
- University of Tennessee Health Sciences Center School of Medicine, Memphis, TN, USA.
| | | | - Micah A Goode
- University of Tennessee Health Sciences Center School of Medicine, Memphis, TN, USA.
| | - Torunn E Sivesind
- Department of Dermatology, University of Colorado Anschutz Medical Campus, Aurora, CO, USA.
| |
Collapse
|
14
|
Chu S, Xie X, Payan C, Stochaj U. Valosin containing protein (VCP): initiator, modifier, and potential drug target for neurodegenerative diseases. Mol Neurodegener 2023; 18:52. [PMID: 37545006 PMCID: PMC10405438 DOI: 10.1186/s13024-023-00639-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2023] [Accepted: 06/27/2023] [Indexed: 08/08/2023] Open
Abstract
The AAA+ ATPase valosin containing protein (VCP) is essential for cell and organ homeostasis, especially in cells of the nervous system. As part of a large network, VCP collaborates with many cofactors to ensure proteostasis under normal, stress, and disease conditions. A large number of mutations have revealed the importance of VCP for human health. In particular, VCP facilitates the dismantling of protein aggregates and the removal of dysfunctional organelles. These are critical events to prevent malfunction of the brain and other parts of the nervous system. In line with this idea, VCP mutants are linked to the onset and progression of neurodegeneration and other diseases. The intricate molecular mechanisms that connect VCP mutations to distinct brain pathologies continue to be uncovered. Emerging evidence supports the model that VCP controls cellular functions on multiple levels and in a cell type specific fashion. Accordingly, VCP mutants derail cellular homeostasis through several mechanisms that can instigate disease. Our review focuses on the association between VCP malfunction and neurodegeneration. We discuss the latest insights in the field, emphasize open questions, and speculate on the potential of VCP as a drug target for some of the most devastating forms of neurodegeneration.
Collapse
Affiliation(s)
- Siwei Chu
- Department of Physiology, McGill University, Montreal, HG3 1Y6, Canada
| | - Xinyi Xie
- Department of Physiology, McGill University, Montreal, HG3 1Y6, Canada
| | - Carla Payan
- Department of Physiology, McGill University, Montreal, HG3 1Y6, Canada
| | - Ursula Stochaj
- Department of Physiology, McGill University, Montreal, HG3 1Y6, Canada.
- Quantitative Life Sciences Program, McGill University, Montreal, Canada.
| |
Collapse
|
15
|
Kouchi Z, Kojima M. A Structural Network Analysis of Neuronal ArhGAP21/23 Interactors by Computational Modeling. ACS OMEGA 2023; 8:19249-19264. [PMID: 37305272 PMCID: PMC10249030 DOI: 10.1021/acsomega.2c08054] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/19/2022] [Accepted: 05/05/2023] [Indexed: 06/13/2023]
Abstract
RhoGTPase-activating proteins (RhoGAPs) play multiple roles in neuronal development; however, details of their substrate recognition system remain elusive. ArhGAP21 and ArhGAP23 are RhoGAPs that contain N-terminal PDZ and pleckstrin homology domains. In the present study, the RhoGAP domain of these ArhGAPs was computationally modeled by template-based methods and the AlphaFold2 software program, and their intrinsic RhoGTPase recognition mechanism was analyzed from the domain structures using the protein docking programs HADDOCK and HDOCK. ArhGAP21 was predicted to preferentially catalyze Cdc42, RhoA, RhoB, RhoC, and RhoG and to downregulate RhoD and Tc10 activities. Regarding ArhGAP23, RhoA and Cdc42 were deduced to be its substrates, whereas RhoD downregulation was predicted to be less efficient. The PDZ domains of ArhGAP21/23 possess the FTLRXXXVY sequence, and similar globular folding consists of antiparalleled β-sheets and two α-helices that are conserved with PDZ domains of MAST-family proteins. A peptide docking analysis revealed the specific interaction of the ArhGAP23 PDZ domain with the PTEN C-terminus. The pleckstrin homology domain structure of ArhGAP23 was also predicted, and the functional selectivity for the interactors regulated by the folding and disordered domains in ArhGAP21 and ArhGAP23 was examined by an in silico analysis. An interaction analysis of these RhoGAPs revealed the existence of mammalian ArhGAP21/23-specific type I and type III Arf- and RhoGTPase-regulated signaling. Multiple recognition systems of RhoGTPase substrates and selective Arf-dependent localization of ArhGAP21/23 may form the basis of the functional core signaling necessary for synaptic homeostasis and axon/dendritic transport regulated by RhoGAP localization and activities.
Collapse
Affiliation(s)
- Zen Kouchi
- Department
of Genetics, Institute for Developmental
Research, Aichi Developmental Disability Center, 713-8 Kamiya-cho, Kasugai-city 480-0392 Aichi, Japan
| | - Masaki Kojima
- Laboratory
of Bioinformatics, School of Life Sciences, Tokyo University of Pharmacy and Life Sciences, 1432-1 Horinouchi, Hachioji 192-0392, Japan
| |
Collapse
|
16
|
Huang T, Lin KH, Machado-Vieira R, Soares JC, Jiang X, Kim Y. Explainable drug side effect prediction via biologically informed graph neural network. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.05.26.23290615. [PMID: 37333107 PMCID: PMC10275013 DOI: 10.1101/2023.05.26.23290615] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/20/2023]
Abstract
Early detection of potential side effects (SE) is a critical and challenging task for drug discovery and patient care. In-vitro or in-vivo approach to detect potential SEs is not scalable for many drug candidates during the preclinical stage. Recent advances in explainable machine learning may facilitate detecting potential SEs of new drugs before market release and elucidating the critical mechanism of biological actions. Here, we leverage multi-modal interactions among molecules to develop a biologically informed graph-based SE prediction model, called HHAN-DSI. HHAN-DSI predicted frequent and even uncommon SEs of the unseen drug with higher or comparable accuracy against benchmark methods. When applying HHAN-DSI to the central nervous system, the organs with the largest number of SEs, the model revealed diverse psychiatric medications' previously unknown but probable SEs, together with the potential mechanisms of actions through a network of genes, biological functions, drugs, and SEs.
Collapse
Affiliation(s)
- Tongtong Huang
- School of Biomedical Informatics, UTHealth, Houston, TX, United States
| | - Ko-Hong Lin
- School of Biomedical Informatics, UTHealth, Houston, TX, United States
| | - Rodrigo Machado-Vieira
- Department of Psychiatry and Behavioral Sciences, McGovern Medical School, UTHealth, Houston, TX, United States
| | - Jair C Soares
- Department of Psychiatry and Behavioral Sciences, McGovern Medical School, UTHealth, Houston, TX, United States
| | - Xiaoqian Jiang
- School of Biomedical Informatics, UTHealth, Houston, TX, United States
| | - Yejin Kim
- School of Biomedical Informatics, UTHealth, Houston, TX, United States
| |
Collapse
|
17
|
Ye J, Li A, Zheng H, Yang B, Lu Y. Machine Learning Advances in Predicting Peptide/Protein-Protein Interactions Based on Sequence Information for Lead Peptides Discovery. Adv Biol (Weinh) 2023; 7:e2200232. [PMID: 36775876 DOI: 10.1002/adbi.202200232] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2022] [Revised: 12/30/2022] [Indexed: 02/14/2023]
Abstract
Peptides have shown increasing advantages and significant clinical value in drug discovery and development. With the development of high-throughput technologies and artificial intelligence (AI), machine learning (ML) methods for discovering new lead peptides have been expanded and incorporated into rational drug design. Predictions of peptide-protein interactions (PepPIs) and protein-protein interactions (PPIs) are both opportunities and challenges in computational biology, which will help to better understand the mechanisms of disease and provide the impetus for the discovery of lead peptides. This paper comprehensively reviews computational models for PepPI and PPI predictions. It begins with an introduction of various databases of peptide ligands and target proteins. Then it discusses data formats and feature representations for proteins and peptides. Furthermore, classical ML methods and emerging deep learning (DL) methods that can be used to train prediction models of PepPI and PPI are classified into four categories, and their advantages and disadvantages are analyzed. To assess the relative performance of different models, different validation protocols and evaluation indexes are discussed. The goal of this review is to help researchers quickly get started to develop computational frameworks using these integrated resources and eventually promote the discovery of lead peptides.
Collapse
Affiliation(s)
- Jiahao Ye
- School of Medicine, Shanghai University, Shanghai, 200444, China
| | - An Li
- Department of Critical Care Medicine, Shanghai Tenth People's Hospital, School of Medicine, Tongji University, Shanghai, 200072, China
- Department of Biochemical Pharmacy, School of Pharmacy, Second Military Medical University, Shanghai, 200433, China
| | - Hao Zheng
- School of Medicine, Shanghai University, Shanghai, 200444, China
| | - Banghua Yang
- School of Medicine, Shanghai University, Shanghai, 200444, China
| | - Yiming Lu
- School of Medicine, Shanghai University, Shanghai, 200444, China
- Department of Critical Care Medicine, Shanghai Tenth People's Hospital, School of Medicine, Tongji University, Shanghai, 200072, China
- Department of Biochemical Pharmacy, School of Pharmacy, Second Military Medical University, Shanghai, 200433, China
| |
Collapse
|
18
|
Zuñiga-Hernandez J, Meneses C, Bastias M, Allende ML, Glavic A. Drosophila DAxud1 Has a Repressive Transcription Activity on Hsp70 and Other Heat Shock Genes. Int J Mol Sci 2023; 24:ijms24087485. [PMID: 37108646 PMCID: PMC10138878 DOI: 10.3390/ijms24087485] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2023] [Revised: 04/11/2023] [Accepted: 04/12/2023] [Indexed: 04/29/2023] Open
Abstract
Drosophila melanogaster DAxud1 is a transcription factor that belongs to the Cysteine Serine Rich Nuclear Protein (CSRNP) family, conserved in metazoans, with a transcriptional transactivation activity. According to previous studies, this protein promotes apoptosis and Wnt signaling-mediated neural crest differentiation in vertebrates. However, no analysis has been conducted to determine what other genes it might control, especially in connection with cell survival and apoptosis. To partly answer this question, this work analyzes the role of Drosophila DAxud1 using Targeted-DamID-seq (TaDa-seq), which allows whole genome screening to determine in which regions it is most frequently found. This analysis confirmed the presence of DAxud1 in groups of pro-apoptotic and Wnt pathway genes, as previously described; furthermore, stress resistance genes that coding heat shock protein (HSP) family genes were found as hsp70, hsp67, and hsp26. The enrichment of DAxud1 also identified a DNA-binding motif (AYATACATAYATA) that is frequently found in the promoters of these genes. Surprisingly, the following analyses demonstrated that DAxud1 exerts a repressive role on these genes, which are necessary for cell survival. This is coupled with the pro-apoptotic and cell cycle arrest roles of DAxud1, in which repression of hsp70 complements the maintenance of tissue homeostasis through cell survival modulation.
Collapse
Affiliation(s)
- Jorge Zuñiga-Hernandez
- Millennium Institute Center for Genome Regulation (CGR), Department of Biology, Faculty of Sciences, University of Chile, Santiago 7800003, Chile
| | - Claudio Meneses
- Millennium Institute Center for Genome Regulation (CGR), Department of Biology, Faculty of Sciences, University of Chile, Santiago 7800003, Chile
- Millennium Nucleus Development of Super Adaptable Plants (MN-SAP), Santiago 8331150, Chile
- Departamento de Genética Molecular y Microbiología, Facultad de Ciencias Biológicas, Pontificia Universidad Católica de Chile, Santiago 8331150, Chile
| | - Macarena Bastias
- Centro de Biotecnología vegetal, Facultad de Ciencias de la Vida, Universidad Andrés Bello, Santiago 8370035, Chile
| | - Miguel L Allende
- Millennium Institute Center for Genome Regulation (CGR), Department of Biology, Faculty of Sciences, University of Chile, Santiago 7800003, Chile
| | - Alvaro Glavic
- Millennium Institute Center for Genome Regulation (CGR), Department of Biology, Faculty of Sciences, University of Chile, Santiago 7800003, Chile
| |
Collapse
|
19
|
Altuntas V. Diffusion Alignment Coefficient (DAC): A Novel Similarity Metric for Protein-Protein Interaction Network. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2023; 20:894-903. [PMID: 35737632 DOI: 10.1109/tcbb.2022.3185406] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
Interaction networks can be used to predict the functions of unknown proteins using known interactions and proteins with known functions. Many graph theory or diffusion-based methods have been proposed, using the assumption that the topological properties of a protein in a network are related to its biological function. Here we seek to improve function prediction by finding more similar neighbors with a new diffusion-based alignment technique to overcome the topological information loss of the node. In this study, we introduce the Diffusion Alignment Coefficient (DAC) algorithm, which combines diffusion, longest common subsequence, and longest common substring techniques to measure the similarity of two nodes in protein interaction networks. As a proof of concept, our experiments, conducted on a real PPI networks S.cerevisiae and Homo Sapiens, demonstrated that our method obtained better results than competitors for MIPS and MSigDB Collections hallmark gene set functional categories. This is the first study to develop a measure of node function similarity using alignment to consider the positions of nodes in protein-protein interaction networks. According to the experimental results, the use of spatial information belonging to the nodes in the network has a positive effect on the detection of more functionally similar neighboring nodes.
Collapse
|
20
|
Identification of Cancer Driver Genes by Integrating Multiomics Data with Graph Neural Networks. Metabolites 2023; 13:metabo13030339. [PMID: 36984779 PMCID: PMC10052551 DOI: 10.3390/metabo13030339] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2023] [Revised: 02/20/2023] [Accepted: 02/22/2023] [Indexed: 03/02/2023] Open
Abstract
Cancer is a heterogeneous disease that is driven by the accumulation of both genetic and nongenetic alterations, so integrating multiomics data and extracting effective information from them is expected to be an effective way to predict cancer driver genes. In this paper, we first generate comprehensive instructive features for each gene from genomic, epigenomic, transcriptomic levels together with protein–protein interaction (PPI)-networks-derived attributes and then propose a novel semisupervised deep graph learning framework GGraphSAGE to predict cancer driver genes according to the impact of the alterations on a biological system. When applied to eight tumor types, experimental results suggest that GGraphSAGE outperforms several state-of-the-art computational methods for driver genes identification. Moreover, it broadens our current understanding of cancer driver genes from multiomics level and identifies driver genes specific to the tumor type rather than pan-cancer. We expect GGraphSAGE to open new avenues in precision medicine and even further predict drivers for other complex diseases.
Collapse
|
21
|
Das P, Mazumder DH. An extensive survey on the use of supervised machine learning techniques in the past two decades for prediction of drug side effects. Artif Intell Rev 2023; 56:1-28. [PMID: 36819660 PMCID: PMC9930028 DOI: 10.1007/s10462-023-10413-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/01/2023] [Indexed: 02/19/2023]
Abstract
Approved drugs for sale must be effective and safe, implying that the drug's advantages outweigh its known harmful side effects. Side effects (SE) of drugs are one of the common reasons for drug failure that may halt the whole drug discovery pipeline. The side effects might vary from minor concerns like a runny nose to potentially life-threatening issues like liver damage, heart attack, and death. Therefore, predicting the side effects of the drug is vital in drug development, discovery, and design. Supervised machine learning-based side effects prediction task has recently received much attention since it reduces time, chemical waste, design complexity, risk of failure, and cost. The advancement of supervised learning approaches for predicting side effects have emerged as essential computational tools. Supervised machine learning technique provides early information on drug side effects to develop an effective drug based on drug properties. Still, there are several challenges to predicting drug side effects. Thus, a near-exhaustive survey is carried out in this paper on the use of supervised machine learning approaches employed in drug side effects prediction tasks in the past two decades. In addition, this paper also summarized the drug descriptor required for the side effects prediction task, commonly utilized drug properties sources, computational models, and their performances. Finally, the research gap, open problems, and challenges for the further supervised learning-based side effects prediction task have been discussed.
Collapse
Affiliation(s)
- Pranab Das
- Department of Computer Science and Engineering, National Institute of Technology Nagaland, Chumukedima, Dimapur, Nagaland 797103 India
| | - Dilwar Hussain Mazumder
- Department of Computer Science and Engineering, National Institute of Technology Nagaland, Chumukedima, Dimapur, Nagaland 797103 India
| |
Collapse
|
22
|
Xu J, Xu J, Meng Y, Lu C, Cai L, Zeng X, Nussinov R, Cheng F. Graph embedding and Gaussian mixture variational autoencoder network for end-to-end analysis of single-cell RNA sequencing data. CELL REPORTS METHODS 2023; 3:100382. [PMID: 36814845 PMCID: PMC9939381 DOI: 10.1016/j.crmeth.2022.100382] [Citation(s) in RCA: 23] [Impact Index Per Article: 23.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/26/2022] [Revised: 10/31/2022] [Accepted: 12/08/2022] [Indexed: 05/25/2023]
Abstract
Single-cell RNA sequencing (scRNA-seq) is a revolutionary technology to determine the precise gene expression of individual cells and identify cell heterogeneity and subpopulations. However, technical limitations of scRNA-seq lead to heterogeneous and sparse data. Here, we present autoCell, a deep-learning approach for scRNA-seq dropout imputation and feature extraction. autoCell is a variational autoencoding network that combines graph embedding and a probabilistic depth Gaussian mixture model to infer the distribution of high-dimensional, sparse scRNA-seq data. We validate autoCell on simulated datasets and biologically relevant scRNA-seq. We show that interpolation of autoCell improves the performance of existing tools in identifying cell developmental trajectories of human preimplantation embryos. We identify disease-associated astrocytes (DAAs) and reconstruct DAA-specific molecular networks and ligand-receptor interactions involved in cell-cell communications using Alzheimer's disease as a prototypical example. autoCell provides a toolbox for end-to-end analysis of scRNA-seq data, including visualization, clustering, imputation, and disease-specific gene network identification.
Collapse
Affiliation(s)
- Junlin Xu
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, Hunan 410082, China
| | - Jielin Xu
- Genomic Medicine Institute, Lerner Research Institute, Cleveland Clinic, Cleveland, OH 44195, USA
| | - Yajie Meng
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, Hunan 410082, China
| | - Changcheng Lu
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, Hunan 410082, China
| | - Lijun Cai
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, Hunan 410082, China
| | - Xiangxiang Zeng
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, Hunan 410082, China
| | - Ruth Nussinov
- Computational Structural Biology Section, Basic Science Program, Frederick National Laboratory for Cancer Research, National Cancer Institute at Frederick, Frederick, MD 21702, USA
- Department of Human Molecular Genetics and Biochemistry, Sackler School of Medicine, Tel Aviv University, Tel Aviv 69978, Israel
| | - Feixiong Cheng
- Genomic Medicine Institute, Lerner Research Institute, Cleveland Clinic, Cleveland, OH 44195, USA
- Department of Molecular Medicine, Cleveland Clinic Lerner College of Medicine, Case Western Reserve University, Cleveland, OH 44195, USA
- Case Comprehensive Cancer Center, Case Western Reserve University School of Medicine, Cleveland, OH 44106, USA
| |
Collapse
|
23
|
A binary interaction map between turnip mosaic virus and Arabidopsis thaliana proteomes. Commun Biol 2023; 6:28. [PMID: 36631662 PMCID: PMC9834402 DOI: 10.1038/s42003-023-04427-8] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2022] [Accepted: 01/05/2023] [Indexed: 01/13/2023] Open
Abstract
Viruses are obligate intracellular parasites that have co-evolved with their hosts to establish an intricate network of protein-protein interactions. Here, we followed a high-throughput yeast two-hybrid screening to identify 378 novel protein-protein interactions between turnip mosaic virus (TuMV) and its natural host Arabidopsis thaliana. We identified the RNA-dependent RNA polymerase NIb as the viral protein with the largest number of contacts, including key salicylic acid-dependent transcription regulators. We verified a subset of 25 interactions in planta by bimolecular fluorescence complementation assays. We then constructed and analyzed a network comprising 399 TuMV-A. thaliana interactions together with intravirus and intrahost connections. In particular, we found that the host proteins targeted by TuMV are enriched in different aspects of plant responses to infections, are more connected and have an increased capacity to spread information throughout the cell proteome, display higher expression levels, and have been subject to stronger purifying selection than expected by chance. The proviral or antiviral role of ten host proteins was validated by characterizing the infection dynamics in the corresponding mutant plants, supporting a proviral role for the transcriptional regulator TGA1. Comparison with similar studies with animal viruses, highlights shared fundamental features in their mode of action.
Collapse
|
24
|
Zeng TS, Yang DS, Kelvin AA, Kelvin DJ. Host Transcriptome Analysis of Ferret Tissues Following Henipavirus Infection. Methods Mol Biol 2023; 2682:281-299. [PMID: 37610589 DOI: 10.1007/978-1-0716-3283-3_20] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/24/2023]
Abstract
Ferrets are commonly used as experimental models of infection for a variety of viruses due to their susceptibility to human respiratory viruses and the close resemblance of pathological outcomes found in human infections. Even though ferret-specific reagents are limited, the use of ferrets as a preclinical experimental model of infection has gained considerable interest since the publication of the ferret transcriptome and draft ferret genome. These advances have made it feasible to easily perform whole-genome gene expression analysis in the ferret infection model. Here, we describe methods for genome-wide gene expression analysis using RNA sequence (RNAseq) data obtained from the lung and brain tissues obtained from experimental infections of Hendra (HeV) and Nipah (NiV) viruses in ferrets. We provide detailed methods for RNAseq and representative data for host gene expression profiles of the lung tissues that show early activation of interferon pathways and later activation of inflammation-related pathways.
Collapse
Affiliation(s)
- Tian S Zeng
- Department of Microbiology and Immunology, Dalhousie University, Halifax, Nova Scotia, Canada
| | - D S Yang
- Department of Microbiology and Immunology, Dalhousie University, Halifax, Nova Scotia, Canada
| | - A A Kelvin
- Department of Microbiology and Immunology, Dalhousie University, Halifax, Nova Scotia, Canada
| | - David J Kelvin
- Department of Microbiology and Immunology, Dalhousie University, Halifax, Nova Scotia, Canada.
| |
Collapse
|
25
|
Chu S, Moujaber O, Lemay S, Stochaj U. Multiple pathways promote microtubule stabilization in senescent intestinal epithelial cells. NPJ AGING 2022; 8:16. [PMID: 36526654 PMCID: PMC9758230 DOI: 10.1038/s41514-022-00097-8] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/10/2020] [Accepted: 11/25/2022] [Indexed: 12/23/2022]
Abstract
Intestinal epithelial cells are critical for gastrointestinal homeostasis. However, their function declines during aging. The aging-related loss of organ performance is largely driven by the increase in senescent cells. To date, the hallmarks and molecular mechanisms related to cellular senescence are not fully understood. Microtubules control epithelial functions, and we identified microtubule stabilization as a phenotypic marker of senescent intestinal epithelial cells. The senescence inducer determined the pathway to microtubule stabilization. Specifically, enhanced microtubule stability was associated with α-tubulin hyperacetylation or increased abundance of the microtubule-binding protein tau. We show further that overexpression of MAPT, which encodes tau, augmented microtubule stability in intestinal epithelial cells. Notably, pharmacological microtubule stabilization was sufficient to induce cellular senescence. Taken together, this study provides new insights into the molecular mechanisms that control epithelial cell homeostasis. Our results support the concept that microtubule stability serves as a critical cue to trigger intestinal epithelial cell senescence.
Collapse
Affiliation(s)
- Siwei Chu
- grid.14709.3b0000 0004 1936 8649Department of Physiology, McGill University, Montreal, Quebec H3G 1Y6 Canada
| | - Ossama Moujaber
- grid.14709.3b0000 0004 1936 8649Department of Physiology, McGill University, Montreal, Quebec H3G 1Y6 Canada
| | - Serge Lemay
- grid.63984.300000 0000 9064 4811Department of Medicine, Division of Nephrology, McGill University Health Centre, Montreal, QC Canada
| | - Ursula Stochaj
- grid.14709.3b0000 0004 1936 8649Department of Physiology, McGill University, Montreal, Quebec H3G 1Y6 Canada
| |
Collapse
|
26
|
Robust and accurate prediction of self-interacting proteins from protein sequence information by exploiting weighted sparse representation based classifier. BMC Bioinformatics 2022; 23:518. [PMID: 36457083 PMCID: PMC9713954 DOI: 10.1186/s12859-022-04880-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2022] [Accepted: 08/03/2022] [Indexed: 12/04/2022] Open
Abstract
BACKGROUND Self-interacting proteins (SIPs), two or more copies of the protein that can interact with each other expressed by one gene, play a central role in the regulation of most living cells and cellular functions. Although numerous SIPs data can be provided by using high-throughput experimental techniques, there are still several shortcomings such as in time-consuming, costly, inefficient, and inherently high in false-positive rates, for the experimental identification of SIPs even nowadays. Therefore, it is more and more significant how to develop efficient and accurate automatic approaches as a supplement of experimental methods for assisting and accelerating the study of predicting SIPs from protein sequence information. RESULTS In this paper, we present a novel framework, termed GLCM-WSRC (gray level co-occurrence matrix-weighted sparse representation based classification), for predicting SIPs automatically based on protein evolutionary information from protein primary sequences. More specifically, we firstly convert the protein sequence into Position Specific Scoring Matrix (PSSM) containing protein sequence evolutionary information, exploiting the Position Specific Iterated BLAST (PSI-BLAST) tool. Secondly, using an efficient feature extraction approach, i.e., GLCM, we extract abstract salient and invariant feature vectors from the PSSM, and then perform a pre-processing operation, the adaptive synthetic (ADASYN) technique, to balance the SIPs dataset to generate new feature vectors for classification. Finally, we employ an efficient and reliable WSRC model to identify SIPs according to the known information of self-interacting and non-interacting proteins. CONCLUSIONS Extensive experimental results show that the proposed approach exhibits high prediction performance with 98.10% accuracy on the yeast dataset, and 91.51% accuracy on the human dataset, which further reveals that the proposed model could be a useful tool for large-scale self-interacting protein prediction and other bioinformatics tasks detection in the future.
Collapse
|
27
|
Jiang Y, Shi C, Tian S, Zhi F, Shen X, Shang D, Tian J. Comprehensive molecular characterization of hypertension-related genes in cancer. CARDIO-ONCOLOGY 2022; 8:10. [PMID: 35513851 PMCID: PMC9069779 DOI: 10.1186/s40959-022-00136-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/09/2022] [Accepted: 04/29/2022] [Indexed: 11/14/2022]
Abstract
Background During cancer treatment, patients have a significantly higher risk of developing cardiovascular complications such as hypertension. In this study, we investigated the internal relationships between hypertension and different types of cancer. Methods First, we comprehensively characterized the involvement of 10 hypertension-related genes across 33 types of cancer. The somatic copy number alteration (CNA) and single nucleotide variant (SNV) of each gene were identified for each type of cancer. Then, the expression patterns of hypertension-related genes were analyzed across 14 types of cancer. The hypertension-related genes were aberrantly expressed in different types of cancer, and some were associated with the overall survival of patients or the cancer stage. Subsequently, the interactions between hypertension-related genes and clinically actionable genes (CAGs) were identified by analyzing the co-expressions and protein–protein interactions. Results We found that certain hypertension-related genes were correlated with CAGs. Next, the pathways associated with hypertension-related genes were identified. The positively correlated pathways included epithelial to mesenchymal transition, hormone androgen receptor, and receptor tyrosine kinase, and the negatively correlated pathways included apoptosis, cell cycle, and DNA damage response. Finally, the correlations between hypertension-related genes and drug sensitivity were evaluated for different drugs and different types of cancer. The hypertension-related genes were all positively or negatively correlated with the resistance of cancer to the majority of anti-cancer drugs. These results highlight the importance of hypertension-related genes in cancer. Conclusions This study provides an approach to characterize the relationship between hypertension-related genes and cancers in the post-genomic era. Supplementary Information The online version contains supplementary material available at 10.1186/s40959-022-00136-z.
Collapse
|
28
|
Xu J, Mao C, Hou Y, Luo Y, Binder JL, Zhou Y, Bekris LM, Shin J, Hu M, Wang F, Eng C, Oprea TI, Flanagan ME, Pieper AA, Cummings J, Leverenz JB, Cheng F. Interpretable deep learning translation of GWAS and multi-omics findings to identify pathobiology and drug repurposing in Alzheimer's disease. Cell Rep 2022; 41:111717. [PMID: 36450252 PMCID: PMC9837836 DOI: 10.1016/j.celrep.2022.111717] [Citation(s) in RCA: 24] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2022] [Revised: 09/01/2022] [Accepted: 11/02/2022] [Indexed: 12/03/2022] Open
Abstract
Translating human genetic findings (genome-wide association studies [GWAS]) to pathobiology and therapeutic discovery remains a major challenge for Alzheimer's disease (AD). We present a network topology-based deep learning framework to identify disease-associated genes (NETTAG). We leverage non-coding GWAS loci effects on quantitative trait loci, enhancers and CpG islands, promoter regions, open chromatin, and promoter flanking regions under the protein-protein interactome. Via NETTAG, we identified 156 AD-risk genes enriched in druggable targets. Combining network-based prediction and retrospective case-control observations with 10 million individuals, we identified that usage of four drugs (ibuprofen, gemfibrozil, cholecalciferol, and ceftriaxone) is associated with reduced likelihood of AD incidence. Gemfibrozil (an approved lipid regulator) is significantly associated with 43% reduced risk of AD compared with simvastatin using an active-comparator design (95% confidence interval 0.51-0.63, p < 0.0001). In summary, NETTAG offers a deep learning methodology that utilizes GWAS and multi-genomic findings to identify pathobiology and drug repurposing in AD.
Collapse
Affiliation(s)
- Jielin Xu
- Genomic Medicine Institute, Lerner Research Institute, Cleveland Clinic, Cleveland, OH 44195, USA
| | - Chengsheng Mao
- Division of Health and Biomedical Informatics, Department of Preventive Medicine, Northwestern University, Chicago, IL 60611, USA
| | - Yuan Hou
- Genomic Medicine Institute, Lerner Research Institute, Cleveland Clinic, Cleveland, OH 44195, USA
| | - Yuan Luo
- Division of Health and Biomedical Informatics, Department of Preventive Medicine, Northwestern University, Chicago, IL 60611, USA
| | - Jessica L Binder
- Department of Internal Medicine, University of New Mexico School of Medicine, Albuquerque, NM 87131, USA
| | - Yadi Zhou
- Genomic Medicine Institute, Lerner Research Institute, Cleveland Clinic, Cleveland, OH 44195, USA
| | - Lynn M Bekris
- Genomic Medicine Institute, Lerner Research Institute, Cleveland Clinic, Cleveland, OH 44195, USA; Department of Molecular Medicine, Cleveland Clinic Lerner College of Medicine, Case Western Reserve University, Cleveland, OH 44195, USA
| | - Jiyoung Shin
- Division of Health and Biomedical Informatics, Department of Preventive Medicine, Northwestern University, Chicago, IL 60611, USA
| | - Ming Hu
- Department of Quantitative Health Sciences, Lerner Research Institute, Cleveland Clinic, Cleveland, OH 44106, USA
| | - Fei Wang
- Department of Population Health Sciences, Weill Cornell Medical College, Cornell University, New York, NY 10065, USA
| | - Charis Eng
- Genomic Medicine Institute, Lerner Research Institute, Cleveland Clinic, Cleveland, OH 44195, USA; Department of Molecular Medicine, Cleveland Clinic Lerner College of Medicine, Case Western Reserve University, Cleveland, OH 44195, USA
| | - Tudor I Oprea
- Department of Internal Medicine, University of New Mexico School of Medicine, Albuquerque, NM 87131, USA
| | - Margaret E Flanagan
- Department of Pathology and Mesulam Center for Cognitive Neurology and Alzheimer's Disease, Feinberg School of Medicine, Chicago, IL 60611, USA
| | - Andrew A Pieper
- Harrington Discovery Institute, University Hospitals Cleveland Medical Center, Cleveland, OH 44106, USA; Department of Psychiatry, Case Western Reserve University, Cleveland, OH 44106, USA; Geriatric Psychiatry, GRECC, Louis Stokes Cleveland VA Medical Center, Cleveland, OH 44106, USA; Institute for Transformative Molecular Medicine, School of Medicine, Case Western Reserve University, Cleveland 44106, OH, USA; Department of Neuroscience, Case Western Reserve University, School of Medicine, Cleveland, OH 44106, USA
| | - Jeffrey Cummings
- Chambers-Grundy Center for Transformative Neuroscience, Department of Brain Health, School of Integrated Health Sciences, University of Nevada Las Vegas, Las Vegas, NV 89154, USA
| | - James B Leverenz
- Department of Molecular Medicine, Cleveland Clinic Lerner College of Medicine, Case Western Reserve University, Cleveland, OH 44195, USA; Lou Ruvo Center for Brain Health, Neurological Institute, Cleveland Clinic, Cleveland, OH 44195, USA
| | - Feixiong Cheng
- Genomic Medicine Institute, Lerner Research Institute, Cleveland Clinic, Cleveland, OH 44195, USA; Department of Molecular Medicine, Cleveland Clinic Lerner College of Medicine, Case Western Reserve University, Cleveland, OH 44195, USA; Case Comprehensive Cancer Center, Case Western Reserve University School of Medicine, Cleveland, OH 44106, USA.
| |
Collapse
|
29
|
Abdullah-Zawawi MR, Govender N, Harun S, Muhammad NAN, Zainal Z, Mohamed-Hussein ZA. Multi-Omics Approaches and Resources for Systems-Level Gene Function Prediction in the Plant Kingdom. PLANTS (BASEL, SWITZERLAND) 2022; 11:2614. [PMID: 36235479 PMCID: PMC9573505 DOI: 10.3390/plants11192614] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/29/2022] [Revised: 09/05/2022] [Accepted: 09/13/2022] [Indexed: 06/16/2023]
Abstract
In higher plants, the complexity of a system and the components within and among species are rapidly dissected by omics technologies. Multi-omics datasets are integrated to infer and enable a comprehensive understanding of the life processes of organisms of interest. Further, growing open-source datasets coupled with the emergence of high-performance computing and development of computational tools for biological sciences have assisted in silico functional prediction of unknown genes, proteins and metabolites, otherwise known as uncharacterized. The systems biology approach includes data collection and filtration, system modelling, experimentation and the establishment of new hypotheses for experimental validation. Informatics technologies add meaningful sense to the output generated by complex bioinformatics algorithms, which are now freely available in a user-friendly graphical user interface. These resources accentuate gene function prediction at a relatively minimal cost and effort. Herein, we present a comprehensive view of relevant approaches available for system-level gene function prediction in the plant kingdom. Together, the most recent applications and sought-after principles for gene mining are discussed to benefit the plant research community. A realistic tabulation of plant genomic resources is included for a less laborious and accurate candidate gene discovery in basic plant research and improvement strategies.
Collapse
Affiliation(s)
- Muhammad-Redha Abdullah-Zawawi
- UKM Medical Molecular Biology Institute (UMBI), Universiti Kebangsaan Malaysia, Kuala Lumpur 56000, Malaysia
- Institute of System Biology (INBIOSIS), Universiti Kebangsaan Malaysia (UKM), Bangi 43600, Malaysia
| | - Nisha Govender
- Institute of System Biology (INBIOSIS), Universiti Kebangsaan Malaysia (UKM), Bangi 43600, Malaysia
| | - Sarahani Harun
- Institute of System Biology (INBIOSIS), Universiti Kebangsaan Malaysia (UKM), Bangi 43600, Malaysia
| | - Nor Azlan Nor Muhammad
- Institute of System Biology (INBIOSIS), Universiti Kebangsaan Malaysia (UKM), Bangi 43600, Malaysia
| | - Zamri Zainal
- Institute of System Biology (INBIOSIS), Universiti Kebangsaan Malaysia (UKM), Bangi 43600, Malaysia
- Faculty of Science and Technology, Universiti Kebangsaan Malaysia (UKM), Bangi 43600, Malaysia
| | - Zeti-Azura Mohamed-Hussein
- Institute of System Biology (INBIOSIS), Universiti Kebangsaan Malaysia (UKM), Bangi 43600, Malaysia
- Faculty of Science and Technology, Universiti Kebangsaan Malaysia (UKM), Bangi 43600, Malaysia
| |
Collapse
|
30
|
Li F, Sun Z, Liu JX, Shang J, Dai L, Liu X, Li Y. NESM: a network embedding method for tumor stratification by integrating multi-omics data. G3 GENES|GENOMES|GENETICS 2022; 12:6705238. [PMID: 36124952 PMCID: PMC9635646 DOI: 10.1093/g3journal/jkac243] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/28/2022] [Accepted: 08/30/2022] [Indexed: 11/23/2022]
Abstract
Tumor stratification plays an important role in cancer diagnosis and individualized treatment. Recent developments in high-throughput sequencing technologies have produced huge amounts of multi-omics data, making it possible to stratify cancer types using multiple molecular datasets. We introduce a Network Embedding method for tumor Stratification by integrating Multi-omics data. Network Embedding method for tumor Stratification by integrating Multi-omics pregroup the samples, integrate the gene features and somatic mutation corresponding to cancer types within each group to construct patient features, and then integrate all groups to obtain comprehensive patient information. The gene features contain network topology information, because it is extracted by integrating deoxyribonucleic acid methylation, messenger ribonucleic acid expression data, and protein–protein interactions through network embedding method. On the one hand, a supervised learning method Light Gradient Boosting Machine is used to classify cancer types based on patient features. When compared with other 3 methods, Network Embedding method for tumor Stratification by integrating Multi-omics has the highest AUC in most cancer types. The average AUC for stratifying cancer types is 0.91, indicating that the patient features extracted by Network Embedding method for tumor Stratification by integrating Multi-omics are effective for tumor stratification. On the other hand, an unsupervised clustering algorithm Density-Based Spatial Clustering of Applications with Noise is utilized to divide single cancer subtypes. The vast majority of the subtypes identified by Network Embedding method for tumor Stratification by integrating Multi-omics are significantly associated with patient survival.
Collapse
Affiliation(s)
- Feng Li
- School of Computer Science, Qufu Normal University , Rizhao 276826, China
| | - Zhensheng Sun
- School of Computer Science, Qufu Normal University , Rizhao 276826, China
| | - Jin-Xing Liu
- School of Computer Science, Qufu Normal University , Rizhao 276826, China
| | - Junliang Shang
- School of Computer Science, Qufu Normal University , Rizhao 276826, China
| | - Lingyun Dai
- School of Computer Science, Qufu Normal University , Rizhao 276826, China
| | - Xikui Liu
- Department of Electrical Engineering and Information Technology, Shandong University of Science and Technology , Jinan, Shandong 250031, China
| | - Yan Li
- Department of Electrical Engineering and Information Technology, Shandong University of Science and Technology , Jinan, Shandong 250031, China
| |
Collapse
|
31
|
Yeh SJ, Chen BS. Systems Medicine Design based on Systems Biology Approaches and Deep Neural Network for Gastric Cancer. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022; 19:3019-3031. [PMID: 34232888 DOI: 10.1109/tcbb.2021.3095369] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Gastric cancer (GC) is the third leading cause of cancer death in the world. It is associated with the stimulation of microenvironment, aberrant epigenetic modification, and chronic inflammation. However, few researches discuss the GC molecular progression mechanisms from the perspective of the system level. In this study, we proposed a systems medicine design procedure to identify essential biomarkers and find corresponding drugs for GC. At first, we did big database mining to construct candidate protein-protein interaction network (PPIN) and candidate gene regulation network (GRN). Second, by leveraging the next-generation sequencing (NGS) data, we performed system modeling and applied system identification and model selection to obtain real genome-wide genetic and epigenetic networks (GWGENs). To make the real GWGENs easy to analyze, the principal network projection method was used to extract the core signaling pathways denoted by KEGG pathways. Subsequently, based on the identified biomarkers, we trained a deep neural network of drug-target interaction (DeepDTI) with supervised learning and filtered our candidate drugs considering drug regulation ability and drug sensitivity. With the proposed systematic strategy, we not only shed the light on the progression of GC but also suggested potential multiple-molecule drugs efficiently.
Collapse
|
32
|
Wei X, Yang J, Li S, Li B, Chen M, Lu Y, Wu X, Cheng Z, Zhang X, Chen Z, Wang C, Wang E, Zheng R, Xu X, Shang H. TAIGET: A small-molecule target identification and annotation web server. Front Pharmacol 2022; 13:898519. [PMID: 36105222 PMCID: PMC9465370 DOI: 10.3389/fphar.2022.898519] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2022] [Accepted: 07/19/2022] [Indexed: 11/28/2022] Open
Abstract
Background: Accurate target identification of small molecules and downstream target annotation are important in pharmaceutical research and drug development. Methods: We present TAIGET, a friendly and easy to operate graphical web interface, which consists of a docking module based on AutoDock Vina and LeDock, a target screen module based on a Bayesian–Gaussian mixture model (BGMM), and a target annotation module derived from >14,000 cancer-related literature works. Results: TAIGET produces binding poses by selecting ≤5 proteins at a time from the UniProt ID-PDB network and submitting ≤3 ligands at a time with the SMILES format. Once the identification process of binding poses is complete, TAIGET then screens potential targets based on the BGMM. In addition, three medical experts and 10 medical students curated associations among drugs, genes, gene regulation, cancer outcome phenotype, 2,170 cancer cell types, and 73 cancer types from the PubMed literature, with the aim to construct a target annotation module. A target-related PPI network can be visualized by an interactive interface. Conclusion: This online tool significantly lowers the entry barrier of virtual identification of targets for users who are not experts in the technical aspects of virtual drug discovery. The web server is available free of charge at http://www.taiget.cn/.
Collapse
Affiliation(s)
- Xuxu Wei
- Key Laboratory of Occupational Hazard Identification and Control, Wuhan University of Science and Technology, Wuhan, China
- Key Laboratory of Chinese Internal Medicine of MOE, Dongzhimen Hospital, Beijing University of Chinese Medicine, Beijing, China
| | - Jiarui Yang
- School of Computer Science and Engineering, Central South University, Changsha, China
| | - Simin Li
- Key Laboratory of Occupational Hazard Identification and Control, Wuhan University of Science and Technology, Wuhan, China
| | - Boyuan Li
- Key Laboratory of Occupational Hazard Identification and Control, Wuhan University of Science and Technology, Wuhan, China
| | - Mengzhen Chen
- Key Laboratory of Occupational Hazard Identification and Control, Wuhan University of Science and Technology, Wuhan, China
| | - Yukang Lu
- Key Laboratory of Occupational Hazard Identification and Control, Wuhan University of Science and Technology, Wuhan, China
| | - Xiang Wu
- Key Laboratory of Occupational Hazard Identification and Control, Wuhan University of Science and Technology, Wuhan, China
| | - Zeyu Cheng
- Key Laboratory of Occupational Hazard Identification and Control, Wuhan University of Science and Technology, Wuhan, China
| | - Xiaoyu Zhang
- Key Laboratory of Chinese Internal Medicine of MOE, Dongzhimen Hospital, Beijing University of Chinese Medicine, Beijing, China
| | - Zhao Chen
- Key Laboratory of Chinese Internal Medicine of MOE, Dongzhimen Hospital, Beijing University of Chinese Medicine, Beijing, China
| | - Chunxia Wang
- Key Laboratory of Chinese Internal Medicine of MOE, Dongzhimen Hospital, Beijing University of Chinese Medicine, Beijing, China
| | - Edwin Wang
- Cumming School of Medicine, University of Calgary, Calgary, AB, Canada
| | - Ruiqing Zheng
- School of Computer Science and Engineering, Central South University, Changsha, China
- *Correspondence: Ruiqing Zheng, ; Xue Xu, ; Hongcai Shang,
| | - Xue Xu
- Key Laboratory of Occupational Hazard Identification and Control, Wuhan University of Science and Technology, Wuhan, China
- *Correspondence: Ruiqing Zheng, ; Xue Xu, ; Hongcai Shang,
| | - Hongcai Shang
- Key Laboratory of Chinese Internal Medicine of MOE, Dongzhimen Hospital, Beijing University of Chinese Medicine, Beijing, China
- *Correspondence: Ruiqing Zheng, ; Xue Xu, ; Hongcai Shang,
| |
Collapse
|
33
|
Gonçalves E, Poulos RC, Cai Z, Barthorpe S, Manda SS, Lucas N, Beck A, Bucio-Noble D, Dausmann M, Hall C, Hecker M, Koh J, Lightfoot H, Mahboob S, Mali I, Morris J, Richardson L, Seneviratne AJ, Shepherd R, Sykes E, Thomas F, Valentini S, Williams SG, Wu Y, Xavier D, MacKenzie KL, Hains PG, Tully B, Robinson PJ, Zhong Q, Garnett MJ, Reddel RR. Pan-cancer proteomic map of 949 human cell lines. Cancer Cell 2022; 40:835-849.e8. [PMID: 35839778 PMCID: PMC9387775 DOI: 10.1016/j.ccell.2022.06.010] [Citation(s) in RCA: 57] [Impact Index Per Article: 28.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/14/2021] [Revised: 03/29/2022] [Accepted: 06/21/2022] [Indexed: 12/12/2022]
Abstract
The proteome provides unique insights into disease biology beyond the genome and transcriptome. A lack of large proteomic datasets has restricted the identification of new cancer biomarkers. Here, proteomes of 949 cancer cell lines across 28 tissue types are analyzed by mass spectrometry. Deploying a workflow to quantify 8,498 proteins, these data capture evidence of cell-type and post-transcriptional modifications. Integrating multi-omics, drug response, and CRISPR-Cas9 gene essentiality screens with a deep learning-based pipeline reveals thousands of protein biomarkers of cancer vulnerabilities that are not significant at the transcript level. The power of the proteome to predict drug response is very similar to that of the transcriptome. Further, random downsampling to only 1,500 proteins has limited impact on predictive power, consistent with protein networks being highly connected and co-regulated. This pan-cancer proteomic map (ProCan-DepMapSanger) is a comprehensive resource available at https://cellmodelpassports.sanger.ac.uk.
Collapse
Affiliation(s)
- Emanuel Gonçalves
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge CB10 1SA, UK; Instituto Superior Técnico (IST), Universidade de Lisboa, 1049-001 Lisboa, Portugal; INESC-ID, 1000-029 Lisboa, Portugal
| | - Rebecca C Poulos
- ProCan®, Children's Medical Research Institute, Faculty of Medicine and Health, The University of Sydney, Westmead, NSW, Australia
| | - Zhaoxiang Cai
- ProCan®, Children's Medical Research Institute, Faculty of Medicine and Health, The University of Sydney, Westmead, NSW, Australia
| | - Syd Barthorpe
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge CB10 1SA, UK
| | - Srikanth S Manda
- ProCan®, Children's Medical Research Institute, Faculty of Medicine and Health, The University of Sydney, Westmead, NSW, Australia
| | - Natasha Lucas
- ProCan®, Children's Medical Research Institute, Faculty of Medicine and Health, The University of Sydney, Westmead, NSW, Australia
| | - Alexandra Beck
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge CB10 1SA, UK
| | - Daniel Bucio-Noble
- ProCan®, Children's Medical Research Institute, Faculty of Medicine and Health, The University of Sydney, Westmead, NSW, Australia
| | - Michael Dausmann
- ProCan®, Children's Medical Research Institute, Faculty of Medicine and Health, The University of Sydney, Westmead, NSW, Australia
| | - Caitlin Hall
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge CB10 1SA, UK
| | - Michael Hecker
- ProCan®, Children's Medical Research Institute, Faculty of Medicine and Health, The University of Sydney, Westmead, NSW, Australia
| | - Jennifer Koh
- ProCan®, Children's Medical Research Institute, Faculty of Medicine and Health, The University of Sydney, Westmead, NSW, Australia
| | - Howard Lightfoot
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge CB10 1SA, UK
| | - Sadia Mahboob
- ProCan®, Children's Medical Research Institute, Faculty of Medicine and Health, The University of Sydney, Westmead, NSW, Australia
| | - Iman Mali
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge CB10 1SA, UK
| | - James Morris
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge CB10 1SA, UK
| | - Laura Richardson
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge CB10 1SA, UK
| | - Akila J Seneviratne
- ProCan®, Children's Medical Research Institute, Faculty of Medicine and Health, The University of Sydney, Westmead, NSW, Australia
| | - Rebecca Shepherd
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge CB10 1SA, UK
| | - Erin Sykes
- ProCan®, Children's Medical Research Institute, Faculty of Medicine and Health, The University of Sydney, Westmead, NSW, Australia
| | - Frances Thomas
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge CB10 1SA, UK
| | - Sara Valentini
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge CB10 1SA, UK
| | - Steven G Williams
- ProCan®, Children's Medical Research Institute, Faculty of Medicine and Health, The University of Sydney, Westmead, NSW, Australia
| | - Yangxiu Wu
- ProCan®, Children's Medical Research Institute, Faculty of Medicine and Health, The University of Sydney, Westmead, NSW, Australia
| | - Dylan Xavier
- ProCan®, Children's Medical Research Institute, Faculty of Medicine and Health, The University of Sydney, Westmead, NSW, Australia
| | - Karen L MacKenzie
- ProCan®, Children's Medical Research Institute, Faculty of Medicine and Health, The University of Sydney, Westmead, NSW, Australia
| | - Peter G Hains
- ProCan®, Children's Medical Research Institute, Faculty of Medicine and Health, The University of Sydney, Westmead, NSW, Australia
| | - Brett Tully
- ProCan®, Children's Medical Research Institute, Faculty of Medicine and Health, The University of Sydney, Westmead, NSW, Australia
| | - Phillip J Robinson
- ProCan®, Children's Medical Research Institute, Faculty of Medicine and Health, The University of Sydney, Westmead, NSW, Australia.
| | - Qing Zhong
- ProCan®, Children's Medical Research Institute, Faculty of Medicine and Health, The University of Sydney, Westmead, NSW, Australia.
| | - Mathew J Garnett
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge CB10 1SA, UK.
| | - Roger R Reddel
- ProCan®, Children's Medical Research Institute, Faculty of Medicine and Health, The University of Sydney, Westmead, NSW, Australia.
| |
Collapse
|
34
|
Shang L, Zhang Y, Liu Y, Jin C, Yuan Y, Tian C, Ni M, Bo X, Zhang L, Li D, He F, Wang J. A Yeast BiFC-seq Method for Genome-wide Interactome Mapping. GENOMICS, PROTEOMICS & BIOINFORMATICS 2022; 20:795-807. [PMID: 34314873 PMCID: PMC9880813 DOI: 10.1016/j.gpb.2021.02.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/11/2020] [Revised: 12/14/2020] [Accepted: 03/10/2021] [Indexed: 01/31/2023]
Abstract
Genome-wide physical protein-protein interaction (PPI) mapping remains a major challenge for current technologies. Here, we reported a high-efficiency BiFC-seq method, yeast-enhanced green fluorescent protein-based bimolecular fluorescence complementation (yEGFP-BiFC) coupled with next-generation DNA sequencing, for interactome mapping. We first applied yEGFP-BiFC method to systematically investigate an intraviral network of the Ebola virus. Two-thirds (9/14) of known interactions of EBOV were recaptured, and five novel interactions were discovered. Next, we used the BiFC-seq method to map the interactome of the tumor protein p53. We identified 97 interactors of p53, more than three-quarters of which were novel. Furthermore, in a more complex background, we screened potential interactors by pooling two BiFC libraries together and revealed a network of 229 interactions among 205 proteins. These results show that BiFC-seq is a highly sensitive, rapid, and economical method for genome-wide interactome mapping.
Collapse
Affiliation(s)
- Limin Shang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Yuehui Zhang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Yuchen Liu
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Chaozhi Jin
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Yanzhi Yuan
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Chunyan Tian
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Ming Ni
- Department of Biotechnology, Beijing Institute of Radiation Medicine, Beijing 100850, China
| | - Xiaochen Bo
- Department of Biotechnology, Beijing Institute of Radiation Medicine, Beijing 100850, China
| | - Li Zhang
- Department of Rehabilitation Medicine, Nan Lou; Key Laboratory of Wound Repair and Regeneration of PLA, College of Life Sciences, Chinese PLA General Hospital, Beijing 100853, China
| | - Dong Li
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Fuchu He
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing 102206, China.
| | - Jian Wang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing 102206, China; School of Basic Medical Sciences, Anhui Medical University, Hefei 230032, China.
| |
Collapse
|
35
|
Pollet L, Lambourne L, Xia Y. Structural Determinants of Yeast Protein-Protein Interaction Interface Evolution at the Residue Level. J Mol Biol 2022; 434:167750. [PMID: 35850298 DOI: 10.1016/j.jmb.2022.167750] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2021] [Revised: 06/09/2022] [Accepted: 07/12/2022] [Indexed: 12/01/2022]
Abstract
Interfaces of contact between proteins play important roles in determining the proper structure and function of protein-protein interactions (PPIs). Therefore, to fully understand PPIs, we need to better understand the evolutionary design principles of PPI interfaces. Previous studies have uncovered that interfacial sites are more evolutionarily conserved than other surface protein sites. Yet, little is known about the nature and relative importance of evolutionary constraints in PPI interfaces. Here, we explore constraints imposed by the structure of the microenvironment surrounding interfacial residues on residue evolutionary rate using a large dataset of over 700 structural models of baker's yeast PPIs. We find that interfacial residues are, on average, systematically more conserved than all other residues with a similar degree of total burial as measured by relative solvent accessibility (RSA). Besides, we find that RSA of the residue when the PPI is formed is a better predictor of interfacial residue evolutionary rate than RSA in the monomer state. Furthermore, we investigate four structure-based measures of residue interfacial involvement, including change in RSA upon binding (ΔRSA), number of residue-residue contacts across the interface, and distance from the center or the periphery of the interface. Integrated modeling for evolutionary rate prediction in interfaces shows that ΔRSA plays a dominant role among the four measures of interfacial involvement, with minor, but independent contributions from other measures. These results yield insight into the evolutionary design of interfaces, improving our understanding of the role that structure plays in the molecular evolution of PPIs at the residue level.
Collapse
Affiliation(s)
- Léah Pollet
- Department of Bioengineering, Faculty of Engineering, McGill University, Montreal, QC, Canada
| | - Luke Lambourne
- Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, Boston, MA, USA; Department of Genetics, Blavatnik Institute, Harvard Medical School, Boston, MA, USA; Department of Cancer Biology, Dana-Farber Cancer Institute, Boston, MA, USA.
| | - Yu Xia
- Department of Bioengineering, Faculty of Engineering, McGill University, Montreal, QC, Canada.
| |
Collapse
|
36
|
Liu Q, Yao E, Liu C, Zhou X, Li Y, Xu M. M2GCN: multi-modal graph convolutional network for modeling polypharmacy side effects. APPL INTELL 2022. [DOI: 10.1007/s10489-022-03839-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
|
37
|
Gupta C, Xu J, Jin T, Khullar S, Liu X, Alatkar S, Cheng F, Wang D. Single-cell network biology characterizes cell type gene regulation for drug repurposing and phenotype prediction in Alzheimer's disease. PLoS Comput Biol 2022; 18:e1010287. [PMID: 35849618 PMCID: PMC9333448 DOI: 10.1371/journal.pcbi.1010287] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2022] [Revised: 07/28/2022] [Accepted: 06/07/2022] [Indexed: 12/03/2022] Open
Abstract
Dysregulation of gene expression in Alzheimer's disease (AD) remains elusive, especially at the cell type level. Gene regulatory network, a key molecular mechanism linking transcription factors (TFs) and regulatory elements to govern gene expression, can change across cell types in the human brain and thus serve as a model for studying gene dysregulation in AD. However, AD-induced regulatory changes across brain cell types remains uncharted. To address this, we integrated single-cell multi-omics datasets to predict the gene regulatory networks of four major cell types, excitatory and inhibitory neurons, microglia and oligodendrocytes, in control and AD brains. Importantly, we analyzed and compared the structural and topological features of networks across cell types and examined changes in AD. Our analysis shows that hub TFs are largely common across cell types and AD-related changes are relatively more prominent in some cell types (e.g., microglia). The regulatory logics of enriched network motifs (e.g., feed-forward loops) further uncover cell type-specific TF-TF cooperativities in gene regulation. The cell type networks are also highly modular and several network modules with cell-type-specific expression changes in AD pathology are enriched with AD-risk genes. The further disease-module-drug association analysis suggests cell-type candidate drugs and their potential target genes. Finally, our network-based machine learning analysis systematically prioritized cell type risk genes likely involved in AD. Our strategy is validated using an independent dataset which showed that top ranked genes can predict clinical phenotypes (e.g., cognitive impairment) of AD with reasonable accuracy. Overall, this single-cell network biology analysis provides a comprehensive map linking genes, regulatory networks, cell types and drug targets and reveals cell-type gene dysregulation in AD.
Collapse
Affiliation(s)
- Chirag Gupta
- Waisman Center, University of Wisconsin-Madison, Madison, Wisconsin, United States of America
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, Wisconsin, United States of America
| | - Jielin Xu
- Genomic Medicine Institute, Lerner Research Institute, Cleveland Clinic, Cleveland, Ohio, United States of America
| | - Ting Jin
- Waisman Center, University of Wisconsin-Madison, Madison, Wisconsin, United States of America
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, Wisconsin, United States of America
| | - Saniya Khullar
- Waisman Center, University of Wisconsin-Madison, Madison, Wisconsin, United States of America
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, Wisconsin, United States of America
| | - Xiaoyu Liu
- Department of Statistics, University of Wisconsin-Madison, Madison, Wisconsin, United States of America
| | - Sayali Alatkar
- Waisman Center, University of Wisconsin-Madison, Madison, Wisconsin, United States of America
- Department of Computer Sciences, University of Wisconsin-Madison, Madison, Wisconsin, United States of America
| | - Feixiong Cheng
- Genomic Medicine Institute, Lerner Research Institute, Cleveland Clinic, Cleveland, Ohio, United States of America
- Department of Molecular Medicine, Cleveland Clinic Lerner College of Medicine, Case Western Reserve University, Cleveland, Ohio, United States of America
- Case Comprehensive Cancer Center, Case Western Reserve University School of Medicine, Cleveland, Ohio, United States of America
| | - Daifeng Wang
- Waisman Center, University of Wisconsin-Madison, Madison, Wisconsin, United States of America
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, Wisconsin, United States of America
- Department of Computer Sciences, University of Wisconsin-Madison, Madison, Wisconsin, United States of America
| |
Collapse
|
38
|
Wang Y, Wang LL, Wong L, Li Y, Wang L, You ZH. SIPGCN: A Novel Deep Learning Model for Predicting Self-Interacting Proteins from Sequence Information Using Graph Convolutional Networks. Biomedicines 2022; 10:biomedicines10071543. [PMID: 35884848 PMCID: PMC9313220 DOI: 10.3390/biomedicines10071543] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2022] [Revised: 06/24/2022] [Accepted: 06/24/2022] [Indexed: 11/16/2022] Open
Abstract
Protein is the basic organic substance that constitutes the cell and is the material condition for the life activity and the guarantee of the biological function activity. Elucidating the interactions and functions of proteins is a central task in exploring the mysteries of life. As an important protein interaction, self-interacting protein (SIP) has a critical role. The fast growth of high-throughput experimental techniques among biomolecules has led to a massive influx of available SIP data. How to conduct scientific research using the massive amount of SIP data has become a new challenge that is being faced in related research fields such as biology and medicine. In this work, we design an SIP prediction method SIPGCN using a deep learning graph convolutional network (GCN) based on protein sequences. First, protein sequences are characterized using a position-specific scoring matrix, which is able to describe the biological evolutionary message, then their hidden features are extracted by the deep learning method GCN, and, finally, the random forest is utilized to predict whether there are interrelationships between proteins. In the cross-validation experiment, SIPGCN achieved 93.65% accuracy and 99.64% specificity in the human data set. SIPGCN achieved 90.69% and 99.08% of these two indicators in the yeast data set, respectively. Compared with other feature models and previous methods, SIPGCN showed excellent results. These outcomes suggest that SIPGCN may be a suitable instrument for predicting SIP and may be a reliable candidate for future wet experiments.
Collapse
Affiliation(s)
- Ying Wang
- College of Information Science and Engineering, Zaozhuang University, Zaozhuang 277160, China;
| | - Lin-Lin Wang
- College of Information Science and Engineering, Zaozhuang University, Zaozhuang 277160, China;
- Correspondence: (L.-L.W.); (L.W.)
| | - Leon Wong
- Big Data and Intelligent Computing Research Center, Guangxi Academy of Sciences, Nanning 530007, China; (L.W.); (Z.-H.Y.)
| | - Yang Li
- School of Computer Science and Information Engineering, Hefei University of Technology, Hefei 230601, China;
| | - Lei Wang
- College of Information Science and Engineering, Zaozhuang University, Zaozhuang 277160, China;
- Big Data and Intelligent Computing Research Center, Guangxi Academy of Sciences, Nanning 530007, China; (L.W.); (Z.-H.Y.)
- Correspondence: (L.-L.W.); (L.W.)
| | - Zhu-Hong You
- Big Data and Intelligent Computing Research Center, Guangxi Academy of Sciences, Nanning 530007, China; (L.W.); (Z.-H.Y.)
- School of Computer Science, Northwestern Polytechnical University, Xi’an 710129, China
| |
Collapse
|
39
|
Yeh SJ, Yeh TY, Chen BS. Systems Drug Discovery for Diffuse Large B Cell Lymphoma Based on Pathogenic Molecular Mechanism via Big Data Mining and Deep Learning Method. Int J Mol Sci 2022; 23:ijms23126732. [PMID: 35743172 PMCID: PMC9224183 DOI: 10.3390/ijms23126732] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2022] [Revised: 06/10/2022] [Accepted: 06/15/2022] [Indexed: 02/01/2023] Open
Abstract
Diffuse large B cell lymphoma (DLBCL) is an aggressive heterogeneous disease. The most common subtypes of DLBCL include germinal center b-cell (GCB) type and activated b-cell (ABC) type. To learn more about the pathogenesis of two DLBCL subtypes (i.e., DLBCL ABC and DLBCL GCB), we firstly construct a candidate genome-wide genetic and epigenetic network (GWGEN) by big database mining. With the help of two DLBCL subtypes’ genome-wide microarray data, we identify their real GWGENs via system identification and model order selection approaches. Afterword, the core GWGENs of two DLBCL subtypes could be extracted from real GWGENs by principal network projection (PNP) method. By comparing core signaling pathways and investigating pathogenic mechanisms, we are able to identify pathogenic biomarkers as drug targets for DLBCL ABC and DLBCL GCD, respectively. Furthermore, we do drug discovery considering drug-target interaction ability, drug regulation ability, and drug toxicity. Among them, a deep neural network (DNN)-based drug-target interaction (DTI) model is trained in advance to predict potential drug candidates holding higher probability to interact with identified biomarkers. Consequently, two drug combinations are proposed to alleviate DLBCL ABC and DLBCL GCB, respectively.
Collapse
|
40
|
Repurposing Histaminergic Drugs in Multiple Sclerosis. Int J Mol Sci 2022; 23:ijms23116347. [PMID: 35683024 PMCID: PMC9181091 DOI: 10.3390/ijms23116347] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2022] [Revised: 05/31/2022] [Accepted: 06/03/2022] [Indexed: 11/16/2022] Open
Abstract
Multiple sclerosis is an autoimmune disease with a strong neuroinflammatory component that contributes to severe demyelination, neurodegeneration and lesions formation in white and grey matter of the spinal cord and brain. Increasing attention is being paid to the signaling of the biogenic amine histamine in the context of several pathological conditions. In multiple sclerosis, histamine regulates the differentiation of oligodendrocyte precursors, reduces demyelination, and improves the remyelination process. However, the concomitant activation of histamine H1–H4 receptors can sustain either damaging or favorable effects, depending on the specifically activated receptor subtype/s, the timing of receptor engagement, and the central versus peripheral target district. Conventional drug development has failed so far to identify curative drugs for multiple sclerosis, thus causing a severe delay in therapeutic options available to patients. In this perspective, drug repurposing offers an exciting and complementary alternative for rapidly approving some medicines already approved for other indications. In the present work, we have adopted a new network-medicine-based algorithm for drug repurposing called SAveRUNNER, for quantifying the interplay between multiple sclerosis-associated genes and drug targets in the human interactome. We have identified new histamine drug-disease associations and predicted off-label novel use of the histaminergic drugs amodiaquine, rupatadine, and diphenhydramine among others, for multiple sclerosis. Our work suggests that selected histamine-related molecules might get to the root causes of multiple sclerosis and emerge as new potential therapeutic strategies for the disease.
Collapse
|
41
|
Zhao Y, Yu Y, Wang H, Li Y, Deng Y, Jiang G, Luo Y. Machine Learning in Causal Inference: Application in Pharmacovigilance. Drug Saf 2022; 45:459-476. [PMID: 35579811 PMCID: PMC9114053 DOI: 10.1007/s40264-022-01155-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/09/2022] [Indexed: 01/28/2023]
Abstract
Monitoring adverse drug events or pharmacovigilance has been promoted by the World Health Organization to assure the safety of medicines through a timely and reliable information exchange regarding drug safety issues. We aim to discuss the application of machine learning methods as well as causal inference paradigms in pharmacovigilance. We first reviewed data sources for pharmacovigilance. Then, we examined traditional causal inference paradigms, their applications in pharmacovigilance, and how machine learning methods and causal inference paradigms were integrated to enhance the performance of traditional causal inference paradigms. Finally, we summarized issues with currently mainstream correlation-based machine learning models and how the machine learning community has tried to address these issues by incorporating causal inference paradigms. Our literature search revealed that most existing data sources and tasks for pharmacovigilance were not designed for causal inference. Additionally, pharmacovigilance was lagging in adopting machine learning-causal inference integrated models. We highlight several currently trending directions or gaps to integrate causal inference with machine learning in pharmacovigilance research. Finally, our literature search revealed that the adoption of causal paradigms can mitigate known issues with machine learning models. We foresee that the pharmacovigilance domain can benefit from the progress in the machine learning field.
Collapse
Affiliation(s)
- Yiqing Zhao
- Department of Preventive Medicine, Feinberg School of Medicine, Northwestern University, 750 N Lake Shore Drive, Room 11-189, Chicago, IL, 60611, USA
| | - Yue Yu
- Department of Artificial Intelligence and Informatics, Mayo Clinic, Rochester, MN, 55902, USA
| | - Hanyin Wang
- Department of Preventive Medicine, Feinberg School of Medicine, Northwestern University, 750 N Lake Shore Drive, Room 11-189, Chicago, IL, 60611, USA
| | - Yikuan Li
- Department of Preventive Medicine, Feinberg School of Medicine, Northwestern University, 750 N Lake Shore Drive, Room 11-189, Chicago, IL, 60611, USA
| | - Yu Deng
- Department of Preventive Medicine, Feinberg School of Medicine, Northwestern University, 750 N Lake Shore Drive, Room 11-189, Chicago, IL, 60611, USA
| | - Guoqian Jiang
- Department of Artificial Intelligence and Informatics, Mayo Clinic, Rochester, MN, 55902, USA
| | - Yuan Luo
- Department of Preventive Medicine, Feinberg School of Medicine, Northwestern University, 750 N Lake Shore Drive, Room 11-189, Chicago, IL, 60611, USA.
| |
Collapse
|
42
|
Paci P, Fiscon G, Conte F, Wang RS, Handy DE, Farina L, Loscalzo J. Comprehensive network medicine-based drug repositioning via integration of therapeutic efficacy and side effects. NPJ Syst Biol Appl 2022; 8:12. [PMID: 35443763 PMCID: PMC9021283 DOI: 10.1038/s41540-022-00221-0] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2021] [Accepted: 03/19/2022] [Indexed: 12/28/2022] Open
Abstract
Despite advances in modern medicine that led to improvements in cardiovascular outcomes, cardiovascular disease (CVD) remains the leading cause of mortality and morbidity globally. Thus, there is an urgent need for new approaches to improve CVD drug treatments. As the development time and cost of drug discovery to clinical application are excessive, alternate strategies for drug development are warranted. Among these are included computational approaches based on omics data for drug repositioning, which have attracted increasing attention. In this work, we developed an adjusted similarity measure implemented by the algorithm SAveRUNNER to reposition drugs for cardiovascular diseases while, at the same time, considering the side effects of drug candidates. We analyzed nine cardiovascular disorders and two side effects. We formulated both disease disorders and side effects as network modules in the human interactome, and considered those drug candidates that are proximal to disease modules but far from side-effects modules as ideal. Our method provides a list of drug candidates for cardiovascular diseases that are unlikely to produce common, adverse side-effects. This approach incorporating side effects is applicable to other diseases, as well.
Collapse
Affiliation(s)
- Paola Paci
- Department of Computer, Control and Management Engineering, Sapienza University of Rome, Rome, Italy. .,Institute for Systems Analysis and Computer Science "Antonio Ruberti", National Research Council, Rome, Italy.
| | - Giulia Fiscon
- Department of Computer, Control and Management Engineering, Sapienza University of Rome, Rome, Italy.,Institute for Systems Analysis and Computer Science "Antonio Ruberti", National Research Council, Rome, Italy
| | - Federica Conte
- Institute for Systems Analysis and Computer Science "Antonio Ruberti", National Research Council, Rome, Italy
| | - Rui-Sheng Wang
- Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, 02115, USA
| | - Diane E Handy
- Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, 02115, USA
| | - Lorenzo Farina
- Department of Computer, Control and Management Engineering, Sapienza University of Rome, Rome, Italy
| | - Joseph Loscalzo
- Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, 02115, USA
| |
Collapse
|
43
|
Kouchi Z, Kojima M. Function of SYDE C2-RhoGAP family as signaling hubs for neuronal development deduced by computational analysis. Sci Rep 2022; 12:4325. [PMID: 35279680 PMCID: PMC8918327 DOI: 10.1038/s41598-022-08147-7] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2021] [Accepted: 03/02/2022] [Indexed: 11/21/2022] Open
Abstract
Recent investigations of neurological developmental disorders have revealed the Rho-family modulators such as Syde and its interactors as the candidate genes. Although the mammalian Syde proteins are reported to possess GTPase-accelerating activity for RhoA-family proteins, diverse species-specific substrate selectivities and binding partners have been described, presumably based on their evolutionary variance in the molecular organization. A comprehensive in silico analysis of Syde family proteins was performed to elucidate their molecular functions and neurodevelopmental networks. Predicted structural modeling of the RhoGAP domain may account for the molecular constraints to substrate specificity among Rho-family proteins. Deducing conserved binding motifs can extend the Syde interaction network and highlight diverse but Syde isoform-specific signaling pathways in neuronal homeostasis, differentiation, and synaptic plasticity from novel aspects of post-translational modification and proteolysis.
Collapse
|
44
|
Wang F, Ding Y, Lei X, Liao B, Wu FX. Identifying Gene Signatures for Cancer Drug Repositioning Based on Sample Clustering. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022; 19:953-965. [PMID: 32845842 DOI: 10.1109/tcbb.2020.3019781] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Drug repositioning is an important approach for drug discovery. Computational drug repositioning approaches typically use a gene signature to represent a particular disease and connect the gene signature with drug perturbation profiles. Although disease samples, especially from cancer, may be heterogeneous, most existing methods consider them as a homogeneous set to identify differentially expressed genes (DEGs)for further determining a gene signature. As a result, some genes that should be in a gene signature may be averaged off. In this study, we propose a new framework to identify gene signatures for cancer drug repositioning based on sample clustering (GS4CDRSC). GS4CDRSC first groups samples into several clusters based on their gene expression profiles. Second, an existing method is applied to the samples in each cluster for generating a list of DEGs. Then a weighting approach is used to identify an intergrated gene signature from all the lists of DEGs. The integrated gene signature is used to connect with drug perturbation profiles in the Connectivity Map (CMap)database to generate a list of drug candidates. GS4CDRSC has been tested with several cancer datasets and existing methods. The computational results show that GS4CDRSC outperforms those methods without the sample clustering and weighting approaches in terms of both number and rate of predicted known drugs for specific cancers.
Collapse
|
45
|
Yeh SJ, Chung YC, Chen BS. Investigating the Role of Obesity in Prostate Cancer and Identifying Biomarkers for Drug Discovery: Systems Biology and Deep Learning Approaches. MOLECULES (BASEL, SWITZERLAND) 2022; 27:molecules27030900. [PMID: 35164166 PMCID: PMC8840188 DOI: 10.3390/molecules27030900] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/31/2021] [Revised: 01/22/2022] [Accepted: 01/26/2022] [Indexed: 12/21/2022]
Abstract
Prostate cancer (PCa) is the second most frequently diagnosed cancer for men and is viewed as the fifth leading cause of death worldwide. The body mass index (BMI) is taken as a vital criterion to elucidate the association between obesity and PCa. In this study, systematic methods are employed to investigate how obesity influences the noncutaneous malignancies of PCa. By comparing the core signaling pathways of lean and obese patients with PCa, we are able to investigate the relationships between obesity and pathogenic mechanisms and identify significant biomarkers as drug targets for drug discovery. Regarding drug design specifications, we take drug–target interaction, drug regulation ability, and drug toxicity into account. One deep neural network (DNN)-based drug–target interaction (DTI) model is trained in advance for predicting drug candidates based on the identified biomarkers. In terms of the application of the DNN-based DTI model and the consideration of drug design specifications, we suggest two potential multiple-molecule drugs to prevent PCa (covering lean and obese PCa) and obesity-specific PCa, respectively. The proposed multiple-molecule drugs (apigenin, digoxin, and orlistat) not only help to prevent PCa, suppressing malignant metastasis, but also result in lower production of fatty acids and cholesterol, especially for obesity-specific PCa.
Collapse
|
46
|
Chen S, Liu Y, Zhang Y, Wierbowski SD, Lipkin SM, Wei X, Yu H. A full-proteome, interaction-specific characterization of mutational hotspots across human cancers. Genome Res 2022; 32:135-149. [PMID: 34963661 PMCID: PMC8744679 DOI: 10.1101/gr.275437.121] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2021] [Accepted: 11/22/2021] [Indexed: 11/24/2022]
Abstract
Rapid accumulation of cancer genomic data has led to the identification of an increasing number of mutational hotspots with uncharacterized significance. Here we present a biologically informed computational framework that characterizes the functional relevance of all 1107 published mutational hotspots identified in approximately 25,000 tumor samples across 41 cancer types in the context of a human 3D interactome network, in which the interface of each interaction is mapped at residue resolution. Hotspots reside in network hub proteins and are enriched on protein interaction interfaces, suggesting that alteration of specific protein-protein interactions is critical for the oncogenicity of many hotspot mutations. Our framework enables, for the first time, systematic identification of specific protein interactions affected by hotspot mutations at the full proteome scale. Furthermore, by constructing a hotspot-affected network that connects all hotspot-affected interactions throughout the whole-human interactome, we uncover genome-wide relationships among hotspots and implicate novel cancer proteins that do not harbor hotspot mutations themselves. Moreover, applying our network-based framework to specific cancer types identifies clinically significant hotspots that can be used for prognosis and therapy targets. Overall, we show that our framework bridges the gap between the statistical significance of mutational hotspots and their biological and clinical significance in human cancers.
Collapse
Affiliation(s)
- Siwei Chen
- Department of Computational Biology, Cornell University, Ithaca, New York 14853, USA
- Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, New York 14853, USA
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York 14853, USA
| | - Yuan Liu
- Department of Computational Biology, Cornell University, Ithaca, New York 14853, USA
- Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, New York 14853, USA
| | - Yingying Zhang
- Department of Computational Biology, Cornell University, Ithaca, New York 14853, USA
- Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, New York 14853, USA
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York 14853, USA
| | - Shayne D Wierbowski
- Department of Computational Biology, Cornell University, Ithaca, New York 14853, USA
- Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, New York 14853, USA
| | - Steven M Lipkin
- Department of Medicine, Weill Cornell Medicine, New York, New York 10021, USA
| | - Xiaomu Wei
- Department of Computational Biology, Cornell University, Ithaca, New York 14853, USA
- Department of Medicine, Weill Cornell Medicine, New York, New York 10021, USA
| | - Haiyuan Yu
- Department of Computational Biology, Cornell University, Ithaca, New York 14853, USA
- Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, New York 14853, USA
| |
Collapse
|
47
|
Cheng Y, Hall TR, Xu X, Yung I, Souza D, Zheng J, Schiele F, Hoffmann M, Mbow ML, Garnett JP, Li J. Targeting uPA-uPAR interaction to improve intestinal epithelial barrier integrity in inflammatory bowel disease. EBioMedicine 2021; 75:103758. [PMID: 34933179 PMCID: PMC8688562 DOI: 10.1016/j.ebiom.2021.103758] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2021] [Revised: 11/23/2021] [Accepted: 12/01/2021] [Indexed: 12/26/2022] Open
Abstract
Background Loss of intestinal epithelial barrier integrity is a critical component of Inflammatory Bowel Disease (IBD) pathogenesis. Co-expression regulation of ligand-receptor pairs in IBD mucosa has not been systematically studied. Targeting ligand-receptor pairs which are induced in IBD mucosa and function in intestinal epithelial barrier integrity may provide novel therapeutics for IBD. Methods We performed transcriptomic meta-analysis on public IBD datasets combined with cell surface protein-protein-interaction (PPI) databases. We explored primary human/mouse intestinal organoids and Caco-2 cells for expression and function studies of uPA-uPAR (prime hits from the meta-analysis). Epithelial barrier integrity was measured by Trans-Epithelial Electrical Resistance (TEER), FITC-Dextran permeability and tight junction assessment. Genetic (CRISPR, siRNA and KO mice) and pharmacological (small molecules, neutralizing antibody and peptide inhibitors) approaches were applied. Mice deficient of uPAR were studied using the Dextran Sulfate Sodium (DSS)-induced colitis model. Findings The IBD ligand-receptor meta-analysis led to the discovery of a coordinated upregulation of uPA and uPAR in IBD mucosa. Both genes were significantly upregulated during epithelial barrier breakdown in primary intestinal organoids and decreased during barrier formation. Genetic inhibition of uPAR or uPA, or pharmacologically blocking uPA-uPAR interaction protects against cytokine-induced barrier breakdown. Deficiency of uPAR in epithelial cells leads to enhanced EGF/EGFR signalling, a known regulator of epithelial homeostasis and repair. Mice deficient of uPAR display improved intestinal barrier function in vitro and during DSS-induced colitis in vivo. Interpretation Our findings suggest that blocking uPA-uPAR interaction via pharmacological agents protects the epithelial barrier from inflammation-induced damage, indicating a potential therapeutic target for IBD. Funding The study was funded by Boehringer Ingelheim.
Collapse
Affiliation(s)
- Yang Cheng
- Immunology and Respiratory Diseases Research, Boehringer Ingelheim Pharmaceuticals, Ridgefield, CT, USA
| | - Tyler R Hall
- Immunology and Respiratory Diseases Research, Boehringer Ingelheim Pharmaceuticals, Ridgefield, CT, USA
| | - Xiao Xu
- Computational Biology Group, Discovery Research, Boehringer Ingelheim Pharmaceuticals, Ridgefield, CT, USA
| | - Ivy Yung
- Immunology and Respiratory Diseases Research, Boehringer Ingelheim Pharmaceuticals, Ridgefield, CT, USA
| | - Donald Souza
- Immunology and Respiratory Diseases Research, Boehringer Ingelheim Pharmaceuticals, Ridgefield, CT, USA
| | - Jie Zheng
- Immunology and Respiratory Diseases Research, Boehringer Ingelheim Pharmaceuticals, Ridgefield, CT, USA
| | - Felix Schiele
- Biotherapeutics Discovery, Boehringer Ingelheim Pharma GmbH & Co. KG, Biberach an der Riß, Germany
| | - Matthias Hoffmann
- Medicinal Chemistry, Boehringer Ingelheim Pharma GmbH & Co. KG, Biberach an der Riß, Germany
| | - M Lamine Mbow
- Immunology and Respiratory Diseases Research, Boehringer Ingelheim Pharmaceuticals, Ridgefield, CT, USA
| | - James P Garnett
- Immunology and Respiratory Diseases Research, Boehringer Ingelheim Pharma GmbH & Co. KG, Biberach an der Riß, Germany
| | - Jun Li
- Immunology and Respiratory Diseases Research, Boehringer Ingelheim Pharmaceuticals, Ridgefield, CT, USA.
| |
Collapse
|
48
|
Bodein A, Scott-Boyer MP, Perin O, Lê Cao KA, Droit A. Interpretation of network-based integration from multi-omics longitudinal data. Nucleic Acids Res 2021; 50:e27. [PMID: 34883510 PMCID: PMC8934642 DOI: 10.1093/nar/gkab1200] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2021] [Revised: 10/19/2021] [Accepted: 11/22/2021] [Indexed: 12/26/2022] Open
Abstract
Multi-omics integration is key to fully understand complex biological processes in an holistic manner. Furthermore, multi-omics combined with new longitudinal experimental design can unreveal dynamic relationships between omics layers and identify key players or interactions in system development or complex phenotypes. However, integration methods have to address various experimental designs and do not guarantee interpretable biological results. The new challenge of multi-omics integration is to solve interpretation and unlock the hidden knowledge within the multi-omics data. In this paper, we go beyond integration and propose a generic approach to face the interpretation problem. From multi-omics longitudinal data, this approach builds and explores hybrid multi-omics networks composed of both inferred and known relationships within and between omics layers. With smart node labelling and propagation analysis, this approach predicts regulation mechanisms and multi-omics functional modules. We applied the method on 3 case studies with various multi-omics designs and identified new multi-layer interactions involved in key biological functions that could not be revealed with single omics analysis. Moreover, we highlighted interplay in the kinetics that could help identify novel biological mechanisms. This method is available as an R package netOmics to readily suit any application.
Collapse
Affiliation(s)
- Antoine Bodein
- Molecular Medicine Department, CHU de Québec Research Center, Université Laval, Québec, QC, Canada
| | - Marie-Pier Scott-Boyer
- Molecular Medicine Department, CHU de Québec Research Center, Université Laval, Québec, QC, Canada
| | - Olivier Perin
- Digital Sciences Department, L'Oréal Advanced Research, Aulnay-sous-bois, France
| | - Kim-Anh Lê Cao
- Melbourne Integrative Genomics, School of Mathematics and Statistics, University of Melbourne, Melbourne, VIC, Australia
| | - Arnaud Droit
- Molecular Medicine Department, CHU de Québec Research Center, Université Laval, Québec, QC, Canada
| |
Collapse
|
49
|
Farooq QUA, Shaukat Z, Aiman S, Li CH. Protein-protein interactions: Methods, databases, and applications in virus-host study. World J Virol 2021; 10:288-300. [PMID: 34909403 PMCID: PMC8641042 DOI: 10.5501/wjv.v10.i6.288] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/15/2021] [Revised: 04/19/2021] [Accepted: 07/30/2021] [Indexed: 02/06/2023] Open
Abstract
Almost all the cellular processes in a living system are controlled by proteins: They regulate gene expression, catalyze chemical reactions, transport small molecules across membranes, and transmit signal across membranes. Even, a viral infection is often initiated through virus-host protein interactions. Protein-protein interactions (PPIs) are the physical contacts between two or more proteins and they represent complex biological functions. Nowadays, PPIs have been used to construct PPI networks to study complex pathways for revealing the functions of unknown proteins. Scientists have used PPIs to find the molecular basis of certain diseases and also some potential drug targets. In this review, we will discuss how PPI networks are essential to understand the molecular basis of virus-host relationships and several databases which are dedicated to virus-host interaction studies. Here, we present a short but comprehensive review on PPIs, including the experimental and computational methods of finding PPIs, the databases dedicated to virus-host PPIs, and the associated various applications in protein interaction networks of some lethal viruses with their hosts.
Collapse
Affiliation(s)
- Qurat ul Ain Farooq
- Faculty of Environmental and Life Sciences, Beijing University of Technology, Beijing 100124, China
| | - Zeeshan Shaukat
- Faculty of Information Technology, Beijing University of Technology, Beijing 100124, China
| | - Sara Aiman
- Faculty of Environmental and Life Sciences, Beijing University of Technology, Beijing 100124, China
| | - Chun-Hua Li
- Faculty of Environmental and Life Sciences, Beijing University of Technology, Beijing 100124, China
| |
Collapse
|
50
|
Venkatraman DL, Pulimamidi D, Shukla HG, Hegde SR. Tumor relevant protein functional interactions identified using bipartite graph analyses. Sci Rep 2021; 11:21530. [PMID: 34728699 PMCID: PMC8563864 DOI: 10.1038/s41598-021-00879-2] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2020] [Accepted: 09/30/2021] [Indexed: 12/02/2022] Open
Abstract
An increased surge of -omics data for the diseases such as cancer allows for deriving insights into the affiliated protein interactions. We used bipartite network principles to build protein functional associations of the differentially regulated genes in 18 cancer types. This approach allowed us to combine expression data to functional associations in many cancers simultaneously. Further, graph centrality measures suggested the importance of upregulated genes such as BIRC5, UBE2C, BUB1B, KIF20A and PTH1R in cancer. Pathway analysis of the high centrality network nodes suggested the importance of the upregulation of cell cycle and replication associated proteins in cancer. Some of the downregulated high centrality proteins include actins, myosins and ATPase subunits. Among the transcription factors, mini-chromosome maintenance proteins (MCMs) and E2F family proteins appeared prominently in regulating many differentially regulated genes. The projected unipartite networks of the up and downregulated genes were comprised of 37,411 and 41,756 interactions, respectively. The conclusions obtained by collating these interactions revealed pan-cancer as well as subtype specific protein complexes and clusters. Therefore, we demonstrate that incorporating expression data from multiple cancers into bipartite graphs validates existing cancer associated mechanisms as well as directs to novel interactions and pathways.
Collapse
Affiliation(s)
| | - Deepshika Pulimamidi
- Institute of Bioinformatics and Applied Biotechnology (IBAB), Bengaluru, 560 100, India
| | - Harsh G Shukla
- Institute of Bioinformatics and Applied Biotechnology (IBAB), Bengaluru, 560 100, India
| | - Shubhada R Hegde
- Institute of Bioinformatics and Applied Biotechnology (IBAB), Bengaluru, 560 100, India.
| |
Collapse
|