1
|
Chen J, Zhang H, Qiu M, Hu J, Lin L, Mai L, Huang G, Chen X, Li X, Qin X, Zhao H. Honokiol in the treatment of triple-negative breast cancer: a network pharmacology approach and experimental validation. Biochem Biophys Res Commun 2025; 771:152008. [PMID: 40398092 DOI: 10.1016/j.bbrc.2025.152008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2025] [Revised: 05/12/2025] [Accepted: 05/12/2025] [Indexed: 05/23/2025]
Abstract
Triple-negative breast cancer (TNBC) is a rare and highly metastatic form of cancer. Honokiol (HNK), a biphenolic compound, has been utilized in TNBC treatment, though its specific targets remain unclear. This study aimed to elucidate the effects of HNK on TNBC by combining network pharmacology predictions and experimental validation to uncover its mechanisms. MDA-MB 231 and MDA-MB 468 cells were pre-treated with varying doses of HNK for 24 h. Cell viability, proliferation, and apoptosis were assessed using CCK8 and FACS assays, whereas a wound healing assay was used to evaluate cell migration. A tubule formation assay was used to assess blood vessel formation in HUVECs. Additionally, in vivo activity was confirmed using a zebrafish xenograft model. Network pharmacology and molecular docking predicted active ingredients, key targets, and potential mechanisms of HNK against TNBC. Results indicated that HNK induces apoptosis in MDA-MB 231 and MDA-MB 468 cells and inhibits their migration and proliferation. Furthermore, HNK suppressed blood vessel formation. Zebrafish xenograft experiments validated HNK's inhibitory effect on TNBC cells in vivo. Network pharmacology identified 36 potential HNK targets against TNBC, including HSP90AA1, AKT1, EGFR, ERBB2, HSP90AB1, PGR, MDM2, HDAC1, NR3C1, and MAPK14. Key signaling pathways such as PI3K-Akt, MAPK, Rap1, Ras, and FoxO were implicated in HNK's anti-TNBC mechanism. Molecular docking demonstrated spontaneous interactions between HNK and the targeted proteins. In conclusion, HNK may reduce angiogenesis by blocking the EGFR and HSP90AB1 pathways thereby decreasing proliferation and increasing apoptosis in TNBC cells.
Collapse
Affiliation(s)
- Jing Chen
- Medical Research Institute, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, 510080, PR China; Medical Research Institute, Guangdong Provincial Key Laboratory of South China Structural Heart Disease, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, 510080, PR China
| | - Haipeng Zhang
- Department of Blood Transfusion, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, 510080, PR China
| | - Min Qiu
- Department of Cardiovascular Surgery, Guangdong Cardiovascular Institute, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, 510080, PR China
| | - Jiemei Hu
- Department of Gynecology, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, 510080, PR China
| | - Lu Lin
- Department of Pharmacy, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, 510080, PR China
| | - Liping Mai
- Medical Research Institute, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, 510080, PR China
| | - Guiping Huang
- Medical Research Institute, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, 510080, PR China
| | - Xiuyun Chen
- Medical Research Institute, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, 510080, PR China
| | - Xiaohong Li
- Medical Research Institute, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, 510080, PR China
| | - Xianyu Qin
- Department of Thoracic Surgery, Thoracic Cancer Center, The Sixth Affiliated Hospital, Sun Yat-sen University, Guangzhou, 510655, PR China.
| | - Haishan Zhao
- Medical Research Institute, Guangdong Provincial People's Hospital (Guangdong Academy of Medical Sciences), Southern Medical University, Guangzhou, 510080, PR China.
| |
Collapse
|
2
|
Müller-Dott S, Jaehnig EJ, Munchic KP, Jiang W, Yaron-Barir TM, Savage SR, Garrido-Rodriguez M, Johnson JL, Lussana A, Petsalaki E, Lei JT, Dugourd A, Krug K, Cantley LC, Mani DR, Zhang B, Saez-Rodriguez J. Comprehensive evaluation of phosphoproteomic-based kinase activity inference. Nat Commun 2025; 16:4771. [PMID: 40404650 PMCID: PMC12098709 DOI: 10.1038/s41467-025-59779-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2024] [Accepted: 05/05/2025] [Indexed: 05/24/2025] Open
Abstract
Kinases regulate cellular processes and are essential for understanding cellular function and disease. To investigate the regulatory state of a kinase, numerous methods have been developed to infer kinase activities from phosphoproteomics data using kinase-substrate libraries. However, few phosphorylation sites can be attributed to an upstream kinase in these libraries, limiting the scope of kinase activity inference. Moreover, inferred activities vary across methods, necessitating evaluation for accurate interpretation. Here, we present benchmarKIN, an R package enabling comprehensive evaluation of kinase activity inference methods. Alongside classical perturbation experiments, benchmarKIN introduces a tumor-based benchmarking approach utilizing multi-omics data to identify highly active or inactive kinases. We used benchmarKIN to evaluate kinase-substrate libraries, inference algorithms and the potential of adding predicted kinase-substrate interactions to overcome the coverage limitations. Our evaluation shows most computational methods perform similarly, but the choice of library impacts the inferred activities with a combination of manually curated libraries demonstrating superior performance in recapitulating kinase activities. Additionally, in the tumor-based evaluation, adding predicted targets from NetworKIN further boosts the performance. We then demonstrate how kinase activity inference aids characterize kinase inhibitor responses in cell lines. Overall, benchmarKIN helps researchers to select reliable methods for identifying deregulated kinases.
Collapse
Affiliation(s)
- Sophia Müller-Dott
- Heidelberg University, Faculty of Medicine, and Heidelberg University Hospital, Institute for Computational Biomedicine, Bioquant, Heidelberg, Germany
| | - Eric J Jaehnig
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX, USA
| | | | - Wen Jiang
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX, USA
| | - Tomer M Yaron-Barir
- Meyer Cancer Center, Weill Cornell Medicine, New York, NY, USA
- Englander Institute for Precision Medicine, Institute for Computational Biomedicine, Weill Cornell Medicine, New York, NY, USA
- Columbia University Vagelos College of Physicians and Surgeons, New York, NY, USA
| | - Sara R Savage
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX, USA
| | - Martin Garrido-Rodriguez
- Heidelberg University, Faculty of Medicine, and Heidelberg University Hospital, Institute for Computational Biomedicine, Bioquant, Heidelberg, Germany
- Molecular Systems Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany
| | - Jared L Johnson
- Department of Cell Biology, Harvard Medical School, Boston, MA, USA
- Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA
| | - Alessandro Lussana
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridgeshire, UK
| | - Evangelia Petsalaki
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridgeshire, UK
| | - Jonathan T Lei
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX, USA
| | - Aurelien Dugourd
- Heidelberg University, Faculty of Medicine, and Heidelberg University Hospital, Institute for Computational Biomedicine, Bioquant, Heidelberg, Germany
| | - Karsten Krug
- The Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Lewis C Cantley
- Department of Cell Biology, Harvard Medical School, Boston, MA, USA
- Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA
| | - D R Mani
- The Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Bing Zhang
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX, USA.
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA.
| | - Julio Saez-Rodriguez
- Heidelberg University, Faculty of Medicine, and Heidelberg University Hospital, Institute for Computational Biomedicine, Bioquant, Heidelberg, Germany.
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridgeshire, UK.
| |
Collapse
|
3
|
Ribeiro PH, Cutigi JF, Ramos RH, Ferreira CDOL, Evangelista AF, Simao ADS. Exploring the Influence of Gene Networks on Driver Gene Classification. J Comput Biol 2025. [PMID: 40356528 DOI: 10.1089/cmb.2025.0043] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/15/2025] Open
Abstract
Cancer is a complex disease caused by mutations in the genome of cells. Genetic mutations can be divided into driver mutations, which are significant for the initiation and progression of cancer, and passenger mutations, which have a neutral effect. In recent years, computational methods have been developed to identify driver genes. Some of these methods use data from gene networks to classify the genes. However, the impact of different gene networks on the performance of these methods remains unexplored. This article aims to analyze the influence of genetic networks in driver gene classification. We analyzed driver gene classification methods that use gene networks as input data, using different cancer mutation datasets and distinct gene networks. Computational methods show significant variation in their results when different gene networks are employed. The results highlight the need to carefully interpret driver gene classification and emphasize the importance of using different gene networks. These findings underline the necessity of developing more robust computational approaches that account for network variability, ensuring greater reliability in driver gene identification and its applications in cancer research.
Collapse
|
4
|
Timofeeva AM, Aulova KS, Nevinsky GA. Modeling Alzheimer's Disease: A Review of Gene-Modified and Induced Animal Models, Complex Cell Culture Models, and Computational Modeling. Brain Sci 2025; 15:486. [PMID: 40426657 PMCID: PMC12109626 DOI: 10.3390/brainsci15050486] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2025] [Revised: 04/30/2025] [Accepted: 05/03/2025] [Indexed: 05/29/2025] Open
Abstract
Alzheimer's disease, a complex neurodegenerative disease, is characterized by the pathological aggregation of insoluble amyloid β and hyperphosphorylated tau. Multiple models of this disease have been employed to investigate the etiology, pathogenesis, and multifactorial aspects of Alzheimer's disease and facilitate therapeutic development. Mammals, especially mice, are the most common models for studying the pathogenesis of this disease in vivo. To date, the scientific literature has documented more than 280 mouse models exhibiting diverse aspects of Alzheimer's disease pathogenesis. Other mammalian species, including rats, pigs, and primates, have also been utilized as models. Selected aspects of Alzheimer's disease have also been modeled in simpler model organisms, such as Drosophila melanogaster, Caenorhabditis elegans, and Danio rerio. It is possible to model Alzheimer's disease not only by creating genetically modified animal lines but also by inducing symptoms of this neurodegenerative disease. This review discusses the main methods of creating induced models, with a particular focus on modeling Alzheimer's disease on cell cultures. Induced pluripotent stem cell (iPSC) technology has facilitated novel investigations into the mechanistic underpinnings of diverse diseases, including Alzheimer's. Progress in culturing brain tissue allows for more personalized studies on how drugs affect the brain. Recent years have witnessed substantial advancements in intricate cellular system development, including spheroids, three-dimensional scaffolds, and microfluidic cultures. Microfluidic technologies have emerged as cutting-edge tools for studying intercellular interactions, the tissue microenvironment, and the role of the blood-brain barrier (BBB). Modern biology is experiencing a significant paradigm shift towards utilizing big data and omics technologies. Computational modeling represents a powerful methodology for researching a wide array of human diseases, including Alzheimer's. Bioinformatic methodologies facilitate the analysis of extensive datasets generated via high-throughput experimentation. It is imperative to underscore the significance of integrating diverse modeling techniques in elucidating pathogenic mechanisms in their entirety.
Collapse
Affiliation(s)
- Anna M. Timofeeva
- SB RAS Institute of Chemical Biology and Fundamental Medicine, Novosibirsk 630090, Russia
| | | | | |
Collapse
|
5
|
Ghasemi MR, Fateh ST, Ben-Mahmoud A, Gupta V, Stühn LG, Lesca G, Chatron N, Platzer K, Edery P, Sadeghi H, Isidor B, Cogné B, Schulz HL, Krauspe-Stübecke I, Periyasamy R, Nampoothiri S, Mirfakhraie R, Alijanpour S, Syrbe S, Pfeifer U, Spranger S, Grundmann-Hauser K, Haack TB, Papadopoulou MT, da Silva Gonçalves T, Panagiotakaki E, Arzimanoglou A, Tonekaboni SH, Rossi M, Korenke GC, Lacassie Y, Jang MH, Layman LC, Miryounesi M, Kim HG. Novel Digital Anomalies, Hippocampal Atrophy, and Mutations Expand the Genotypic and Phenotypic Spectra of CNKSR2 in the Houge Type of X-Linked Syndromic Intellectual Development Disorder (MRXSHG). Am J Med Genet A 2025; 197:e63963. [PMID: 39707601 DOI: 10.1002/ajmg.a.63963] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2024] [Revised: 09/25/2024] [Accepted: 11/21/2024] [Indexed: 12/23/2024]
Abstract
The Houge type of X-linked syndromic intellectual developmental disorder (MRXSHG) encompasses a spectrum of neurodevelopmental disorders characterized by intellectual disability (ID), language/speech delay, attention issues, and epilepsy. These conditions arise from hemizygous or heterozygous deletions, along with point mutations, affecting CNKSR2, a gene located at Xp22.12. CNKSR2, also known as CNK2 or MAGUIN, functions as a synaptic scaffolding molecule within the neuronal postsynaptic density (PSD) of the central nervous system. It acts as a link connecting postsynaptic structural proteins, such as PSD95 and S-SCAM, by employing multiple functional domains crucial for synaptic signaling and protein-protein interactions. Predominantly expressed in dendrites, CNKSR2 is vital for dendritic spine morphogenesis in hippocampal neurons. Its loss-of-function variants result in reduced PSD size and impaired hippocampal development, affecting processes including neuronal proliferation, migration, and synaptogenesis. We present 15 patients including three from the MENA (Middle East and North Africa), a region with no documented mutations in CNKSR2. Each individual displays unique clinical presentations that encompass developmental delay, ID, language/speech delay, epilepsy, and autism. Genetic analyses revealed 14 distinct variants in CNKSR2, comprising five nonsense, three frameshift, two splice, and four missense variants, of which 13 are novel. The ACMG guidelines unanimously interpreted these 14 variants in 15 individuals as pathogenic, highlighting the detrimental impact of these CNKSR2 genetic alterations and confirming the molecular diagnosis of MRXSHG. Importantly, variants Ser767Phe and Ala827Pro may lead to proteasomal degradation or reduced PSD size, contributing to the neurodevelopmental phenotype. Furthermore, these two amino acids, along with another two affected by four missense variants, exhibit complete conservation in nine vertebrate species, illuminating their crucial role in the gene's functionality. Our study revealed unique new digital and brain phenotype, including pointed fingertips (fetal pads of fingertips), syndactyly, tapering fingers, and hippocampal atrophy. These novel clinical features in MRXSHG, combined with 13 novel variants, expand the phenotypic and genotypic spectra of MRXSHG associated with CNKSR2 mutations.
Collapse
Affiliation(s)
- Mohammad-Reza Ghasemi
- Department of Medical Genetics, Faculty of Medicine, Shahid Beheshti University of Medical Sciences, Tehran, Iran
- Center for Comprehensive Genetic Services, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Sahand Tehrani Fateh
- Center for Comprehensive Genetic Services, Shahid Beheshti University of Medical Sciences, Tehran, Iran
- Faculty of Medicine, Tehran University of Medical Sciences, Tehran, Iran
| | - Afif Ben-Mahmoud
- Neurological Disorders Research Center, Qatar Biomedical Research Institute, Hamad Bin Khalifa University, Doha, Qatar
| | - Vijay Gupta
- Neurological Disorders Research Center, Qatar Biomedical Research Institute, Hamad Bin Khalifa University, Doha, Qatar
| | - Lara G Stühn
- Institute of Medical Genetics and Applied Genomics, University of Tuebingen, Tuebingen, Germany
| | - Gaetan Lesca
- Department of Medical Genetics, Member of the ERN EpiCARE, University Hospitals of Lyon (HCL), Lyon, France, Lyon, France
- University Claude Bernard Lyon 1, Lyon, France
| | - Nicolas Chatron
- Department of Medical Genetics, Member of the ERN EpiCARE, University Hospitals of Lyon (HCL), Lyon, France, Lyon, France
- University Claude Bernard Lyon 1, Lyon, France
| | - Konrad Platzer
- Institute of Human Genetics, University of Leipzig Medical Center, Leipzig, Germany
| | - Patrick Edery
- Department of Medical Genetics, Member of the ERN EpiCARE, University Hospitals of Lyon (HCL), Lyon, France, Lyon, France
- GENDEV Team, INSERM U1028, CNRS UMR5292, Lyon Neuroscience Research Centre, Lyon, France
| | - Hossein Sadeghi
- Genomic Research Center, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Bertrand Isidor
- Service de Génétique Médicale, CHU Nantes, Nantes Cedex 1, France
- Université de Nantes, CNRS, INSERM, l'institut du thorax, Nantes, France
| | - Benjamin Cogné
- Service de Génétique Médicale, CHU Nantes, Nantes Cedex 1, France
- Université de Nantes, CNRS, INSERM, l'institut du thorax, Nantes, France
| | | | - Ilona Krauspe-Stübecke
- Bethlehem Health Center Department of Pediatrics and Adolescent Medicine 5, Stolberg, Germany
| | - Radhakrishnan Periyasamy
- Department of Medical Genetics, Kasturba Medical College, Manipal, Manipal Academy of Higher Education, Manipal, India
| | - Sheela Nampoothiri
- Department of Pediatric Genetics, Amrita Institute of Medical Sciences & Research Centre, Cochin, India
| | - Reza Mirfakhraie
- Department of Medical Genetics, Faculty of Medicine, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Sahar Alijanpour
- Department of Medical Genetics, Faculty of Medicine, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Steffen Syrbe
- Division for Pediatric Epileptology, Heidelberg University Hospital, Heidelberg, Germany
| | - Ulrich Pfeifer
- Division of Child Neurology and Metabolic Medicine, Center for Child and Adolescent Medicine, Heidelberg University Hospital, Heidelberg, Germany
| | | | - Kathrin Grundmann-Hauser
- Institute of Medical Genetics and Applied Genomics, University of Tuebingen, Tuebingen, Germany
- Centre for Rare Diseases, University of Tuebingen, Tuebingen, Germany
| | - Tobias B Haack
- Institute of Medical Genetics and Applied Genomics, University of Tuebingen, Tuebingen, Germany
- Centre for Rare Diseases, University of Tuebingen, Tuebingen, Germany
| | - Maria T Papadopoulou
- Department of Paediatric Clinical Epileptology, Sleep Disorders and Functional Neurology, University Hospitals of Lyon (HCL), Member of the European Reference Network (ERN) EpiCARE, France
| | - Tayrine da Silva Gonçalves
- Department of Paediatric Clinical Epileptology, Sleep Disorders and Functional Neurology, University Hospitals of Lyon (HCL), Member of the European Reference Network (ERN) EpiCARE, France
| | - Eleni Panagiotakaki
- Department of Paediatric Clinical Epileptology, Sleep Disorders and Functional Neurology, University Hospitals of Lyon (HCL), Member of the European Reference Network (ERN) EpiCARE, France
| | - Alexis Arzimanoglou
- Department of Paediatric Clinical Epileptology, Sleep Disorders and Functional Neurology, University Hospitals of Lyon (HCL), Member of the European Reference Network (ERN) EpiCARE, France
- Sant Joan De Déu Children's Hospital, Member of the ERN EpiCARE, University of Barcelona, Institut de Recerca Sant Joan de Déu, Spain
| | - Seyed Hassan Tonekaboni
- Pediatric Neurology Excellence Center, Pediatric Neurology Department, Mofid Children Hospital, Faculty of Medicine, Shahid Beheshti University of Medical Sciences (SBMU), Tehran, Iran
| | - Massimiliano Rossi
- GENDEV Team, INSERM U1028, CNRS UMR5292, Lyon Neuroscience Research Centre, Lyon, France
- Department of Genetics, Lyon University Hospitals, Lyon, France
| | - G Christoph Korenke
- Department of Neuropediatrics, University Children's Hospital, Klinikum Oldenburg, Oldenburg, Germany
| | - Yves Lacassie
- Division of Genetics, Department of Pediatrics, Louisiana State University Health Science Center and Children's Hospital, New Orleans, Louisiana, USA
| | - Mi-Hyeon Jang
- Department of Neurosurgery, Robert Wood Johnson Medical School, Rutgers University, the State University of New Jersey, Piscataway, New Jersey, USA
| | - Lawrence C Layman
- Section of Reproductive Endocrinology, Infertility & Genetics, Department of Obstetrics & Gynecology, Augusta University, Augusta, Georgia, USA
- Department of Neuroscience and Regenerative Medicine, Augusta University, Augusta, Georgia, USA
| | - Mohammad Miryounesi
- Department of Medical Genetics, Faculty of Medicine, Shahid Beheshti University of Medical Sciences, Tehran, Iran
- Center for Comprehensive Genetic Services, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Hyung-Goo Kim
- Neurological Disorders Research Center, Qatar Biomedical Research Institute, Hamad Bin Khalifa University, Doha, Qatar
- Department of Neurosurgery, Robert Wood Johnson Medical School, Rutgers University, the State University of New Jersey, Piscataway, New Jersey, USA
| |
Collapse
|
6
|
Yu D, Yang X, Shang Y, Yuan S, Liu Y, Liu Y. Drug-target interaction prediction based on metapaths and simplified neighbor aggregation. Methods 2025; 240:154-164. [PMID: 40288620 DOI: 10.1016/j.ymeth.2025.04.012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2025] [Revised: 04/02/2025] [Accepted: 04/20/2025] [Indexed: 04/29/2025] Open
Abstract
Drug-target interaction (DTI) prediction is critical in drug repositioning and discovery. In current metapath-based prediction methods, attention mechanisms are often used to differentiate the importance of various neighbors, enhancing the model's expressiveness. However, in biological networks with small-scale imbalanced data, attention mechanisms are prone to interference from noise and missing data, leading to instability in weight learning, reduced efficiency, and an increased risk of overfitting. To address these issues, we propose the use of average aggregation to mitigate noise, simplify model complexity, and improve stability. Specifically, we introduce a simplified mean aggregation method for DTI prediction. This approach uses average aggregation, effectively reducing noise interference, lowering model complexity, and preventing overfitting, making it especially suitable for current biological networks. Extensive testing on three heterogeneous biological datasets shows that SNADTI outperforms 12 leading methods across two evaluation metrics, significantly reducing training time and validating its effectiveness in DTI prediction. Complexity analysis reveals that our method offers a substantial computational speed advantage over other methods on the same dataset, highlighting its enhanced efficiency. Experimental results demonstrate that SNADTI excels in prediction accuracy, stability, and reproducibility, confirming its practicality and effectiveness in DTI prediction.
Collapse
Affiliation(s)
- Di Yu
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, 410082, Hunan, China
| | - Xinyu Yang
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, 410082, Hunan, China
| | - Yifan Shang
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, 410082, Hunan, China; Department of Biomedical Engineering, The Chinese University of Hong Kong, 999077, Hong Kong, China.
| | - Sisi Yuan
- Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, Charlotte, 28223, NC, USA
| | - Yuansheng Liu
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, 410082, Hunan, China
| | - Yiping Liu
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, 410082, Hunan, China
| |
Collapse
|
7
|
Li Z, Huang J, Liu X, Xu P, Shen X, Pan C, Zhang W, Liu W, Han H. KRN-DTI: Towards accurate drug-target interaction prediction with Kolmogorov-Arnold and residual networks. Methods 2025; 240:137-144. [PMID: 40287076 DOI: 10.1016/j.ymeth.2025.04.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2025] [Revised: 04/03/2025] [Accepted: 04/15/2025] [Indexed: 04/29/2025] Open
Abstract
Predicting drug-target interactions (DTIs) accurately is essential in the field of drug discovery. Recently, artificial intelligence (AI) technologies, especially graph convolutional networks (GCNs), have been developed to tackle this challenge. However, as the number of GCN layers increases, models may lose critical information due to excessive smoothing. Moreover, these methods often lack interpretability and are dependent on specific datasets, which limits their generalizability. Consequently, this study introduces a novel method, KRN-DTI, which employs interpretable GCN technology to predict DTIs based on a drug-target heterogeneous network. The method uses GCN technology to identify potential DTIs by leveraging known interactions and dynamically adjusting the weights, thereby enhancing the model's interpretability. Additionally, residual connection technology is employed to integrate GNN outputs, mitigating the over-smoothing issue. Furthermore, the model's interpretability is enhanced by adaptively adjusting weights using Kolmogorov-Arnold Networks (KAN) and attention mechanisms. Experimental results show that KRN-DTI outperforms several advanced computational methods on the benchmark dataset. Case studies further highlight the effectiveness of KRN-DTI in predicting potential DTIs, showcasing its potential for real-world applications in drug discovery. Our code and data are publicly accessible at: https://github.com/lizhen5000/KRN-DTI.git.
Collapse
Affiliation(s)
- Zhen Li
- Institute of Computational Science and Technology, Guangzhou University, Guangzhou, Guangdong, 510006, China; School of Computer Science of Information Technology, Qiannan Normal University for Nationalities, Duyun, Guizhou, 558000, China
| | - Juyuan Huang
- Department of Gynecology, Zhongnan Hospital of Wuhan University, Wuhan, Hubei, 430071, China
| | - Xinxin Liu
- School of Data Science and Artificial Intelligence, Wenzhou University of Technology, Wenzhou, Zhejiang 325035, China.
| | - Peng Xu
- Institute of Computational Science and Technology, Guangzhou University, Guangzhou, Guangdong, 510006, China.
| | - Xinwen Shen
- School of Data Science and Artificial Intelligence, Wenzhou University of Technology, Wenzhou, Zhejiang 325035, China
| | - Chu Pan
- College of Computer Science and Electronic Engineering, Hunan University, Changsha, Hunan, 410000, China
| | - Wei Zhang
- Department of Gynecology, Zhongnan Hospital of Wuhan University, Wuhan, Hubei, 430071, China
| | - Wenbin Liu
- Institute of Computational Science and Technology, Guangzhou University, Guangzhou, Guangdong, 510006, China.
| | - Henry Han
- School of engineering and computer science, Baylor University, Waco, TX, 76798, USA.
| |
Collapse
|
8
|
Wu G, Liang Y, Xi Q, Zuo Y. New Insights and Implications of Cell-Cell Interactions in Developmental Biology. Int J Mol Sci 2025; 26:3997. [PMID: 40362237 PMCID: PMC12072105 DOI: 10.3390/ijms26093997] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2025] [Revised: 04/17/2025] [Accepted: 04/22/2025] [Indexed: 05/15/2025] Open
Abstract
The dynamic and meticulously regulated networks established the foundation for embryonic development, where the intercellular interactions and signal transduction assumed a pivotal role. In recent years, high-throughput technologies such as single-cell RNA sequencing (scRNA-seq) and spatial transcriptomics (ST) have advanced dramatically, empowering the systematic dissection of cell-to-cell regulatory networks. The emergence of comprehensive databases and analytical frameworks has further provided unprecedented insights into embryonic development and cell-cell interactions (CCIs). This paper reviewed the exponential increased CCIs works related to developmental biology from 2008 to 2023, comprehensively collected and categorized 93 analytical tools and 39 databases, and demonstrated its practical utility through illustrative case studies. In parallel, the article critically scrutinized the persistent challenges within this field, such as the intricacies of spatial localization and transmembrane state validation at single-cell resolution, and underscored the interpretative limitations inherent in current analytical frameworks. The development of CCIs' analysis tools with harmonizing multi-omics data and the construction of cross-species dynamically updated CCIs databases will be the main direction of future research. Future investigations into CCIs are poised to expeditiously drive the application and clinical translation within developmental biology, unlocking novel dimensions for exploration and progress.
Collapse
Affiliation(s)
| | | | | | - Yongchun Zuo
- State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, Institutes of Biomedical Sciences, School of Life Sciences, Inner Mongolia University, Hohhot 010021, China; (G.W.); (Y.L.); (Q.X.)
| |
Collapse
|
9
|
Lu D, Zheng Y, Yi X, Hao J, Zeng X, Han L, Li Z, Jiao S, Jiang B, Ai J, Peng J. Identifying potential risk genes for clear cell renal cell carcinoma with deep reinforcement learning. Nat Commun 2025; 16:3591. [PMID: 40234405 PMCID: PMC12000451 DOI: 10.1038/s41467-025-58439-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2024] [Accepted: 03/18/2025] [Indexed: 04/17/2025] Open
Abstract
Clear cell renal cell carcinoma (ccRCC) is the most prevalent type of renal cell carcinoma. However, our understanding of ccRCC risk genes remains limited. This gap in knowledge poses challenges to the effective diagnosis and treatment of ccRCC. To address this problem, we propose a deep reinforcement learning-based computational approach named RL-GenRisk to identify ccRCC risk genes. Distinct from traditional supervised models, RL-GenRisk frames the identification of ccRCC risk genes as a Markov Decision Process, combining the graph convolutional network and Deep Q-Network for risk gene identification. Moreover, a well-designed data-driven reward is proposed for mitigating the limitation of scant known risk genes. The evaluation demonstrates that RL-GenRisk outperforms existing methods in ccRCC risk gene identification. Additionally, RL-GenRisk identifies eight potential ccRCC risk genes. We successfully validated epidermal growth factor receptor (EGFR) and piccolo presynaptic cytomatrix protein (PCLO), corroborated through independent datasets and biological experimentation. This approach may also be used for other diseases in the future.
Collapse
Affiliation(s)
- Dazhi Lu
- AI for Science Interdisciplinary Research Center, School of Computer Science, Northwestern Polytechnical University, Xi'an, China
| | - Yan Zheng
- College of Intelligence and Computing, Tianjin University, Tianjin, China
| | - Xianyanling Yi
- Department of Urology, Institute of Urology, West China Hospital, Sichuan University, Chengdu, China
| | - Jianye Hao
- College of Intelligence and Computing, Tianjin University, Tianjin, China.
| | - Xi Zeng
- AI for Science Interdisciplinary Research Center, School of Computer Science, Northwestern Polytechnical University, Xi'an, China
| | - Lu Han
- AI for Science Interdisciplinary Research Center, School of Computer Science, Northwestern Polytechnical University, Xi'an, China
| | - Zhigang Li
- College of Intelligence and Computing, Tianjin University, Tianjin, China
| | - Shaoqing Jiao
- School of Software, Northwestern Polytechnical University, Xi'an, China
| | - Bei Jiang
- Tianjin Second People's Hospital, Tianjin, China
| | - Jianzhong Ai
- Department of Urology, Institute of Urology, West China Hospital, Sichuan University, Chengdu, China.
| | - Jiajie Peng
- AI for Science Interdisciplinary Research Center, School of Computer Science, Northwestern Polytechnical University, Xi'an, China.
- Key Laboratory of Big Data Storage and Management, Northwestern Polytechnical University, Ministry of Industry and Information Technology, Xi'an, China.
| |
Collapse
|
10
|
Gopalakrishnan AP, Shivamurthy PB, Ahmed M, Ummar S, Ramesh P, Thomas SD, Mahin A, Nisar M, Soman S, Subbannayya Y, Raju R. Positional distribution and conservation of major phosphorylated sites in the human kinome. Front Mol Biosci 2025; 12:1557835. [PMID: 40270594 PMCID: PMC12015135 DOI: 10.3389/fmolb.2025.1557835] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2025] [Accepted: 03/27/2025] [Indexed: 04/25/2025] Open
Abstract
The human protein kinome is a group of over 500 therapeutically relevant kinases. Exemplified by over 10,000 phosphorylated sites reported in global phosphoproteomes, kinases are also highly regulated by phosphorylation. Currently, 1008 phosphorylated sites in 273 kinases are associated with their regulation of activation/inhibition, and a few in 30 kinases are associated with altered activity. Phosphorylated sites in 196 kinases are related to other molecular functions such as localization and protein interactions. Over 8,000 phosphorylated sites, including all those in 517 kinases are unassigned to any functions. This imposes a significant bias and challenge for the effective analysis of global phosphoproteomics datasets. Hence, we derived a set of stably and frequently detected phosphorylated sites (representative phosphorylated sites) across diverse experimental conditions annotated in the PhosphoSitePlus database and presumed them to be relevant to the human kinase regulatory network. Analysis of these representative phosphorylated sites led to the classification of 449 kinases into four distinct categories (kinases with phosphorylated sites apportioned (PaKD) and enigmatic (PeKD), and those with predominantly within kinase domain (PiKD) and outside kinase domain (PoKD)). Knowledge-based functional analysis and sequence conservation across the family/subfamily identified phosphorylated sites unique to specific kinases that could contribute to their unique functions. This classification of representative kinase phosphorylated sites enhance our understanding of prioritized validation and provides a novel framework for targeted phosphorylated site enrichment approaches. Phosphorylated sites in kinases associated with dysregulation in diseases were frequently located outside the kinase domain, and suggesting their regulatory roles and opportunities for phosphorylated site-directed therapeutic approaches.
Collapse
Affiliation(s)
- Athira Perunelly Gopalakrishnan
- Centre for Integrative Omics Data Science (CIODS), Yenepoya (Deemed to be University), Mangalore, India
- Center for Systems Biology and Molecular Medicine (CSBMM), Yenepoya Research Centre, Yenepoya (Deemed to be University), Mangalore, India
| | | | - Mukhtar Ahmed
- Department of Zoology, College of Science, King Saud University, Riyadh, Saudi Arabia
| | - Samseera Ummar
- Centre for Integrative Omics Data Science (CIODS), Yenepoya (Deemed to be University), Mangalore, India
| | - Poornima Ramesh
- Centre for Integrative Omics Data Science (CIODS), Yenepoya (Deemed to be University), Mangalore, India
| | - Sonet Daniel Thomas
- Centre for Integrative Omics Data Science (CIODS), Yenepoya (Deemed to be University), Mangalore, India
- Center for Systems Biology and Molecular Medicine (CSBMM), Yenepoya Research Centre, Yenepoya (Deemed to be University), Mangalore, India
| | - Althaf Mahin
- Centre for Integrative Omics Data Science (CIODS), Yenepoya (Deemed to be University), Mangalore, India
- Center for Systems Biology and Molecular Medicine (CSBMM), Yenepoya Research Centre, Yenepoya (Deemed to be University), Mangalore, India
| | - Mahammad Nisar
- Centre for Integrative Omics Data Science (CIODS), Yenepoya (Deemed to be University), Mangalore, India
| | - Sowmya Soman
- Centre for Integrative Omics Data Science (CIODS), Yenepoya (Deemed to be University), Mangalore, India
| | - Yashwanth Subbannayya
- School of Biosciences, Faculty of Health and Medical Sciences, University of Surrey, Guildford, United Kingdom
| | - Rajesh Raju
- Centre for Integrative Omics Data Science (CIODS), Yenepoya (Deemed to be University), Mangalore, India
- Center for Systems Biology and Molecular Medicine (CSBMM), Yenepoya Research Centre, Yenepoya (Deemed to be University), Mangalore, India
| |
Collapse
|
11
|
Karunakaran KB, Jain S, Widera D, Cottrell GS. Spatial and functional profiles distinguish target sets of Parkinson's disease and antipsychotic drugs with different clinical effects. Transl Psychiatry 2025; 15:124. [PMID: 40185727 PMCID: PMC11971416 DOI: 10.1038/s41398-025-03351-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/22/2024] [Revised: 03/07/2025] [Accepted: 03/26/2025] [Indexed: 04/07/2025] Open
Abstract
Several studies have examined the genetic factors shared between Parkinson's disease (PD) and schizophrenia (SZ), but the biological themes underlying their clinical relationships remain less explored. We employed systematic transcriptomic and network analyses to examine the genes targeted by two sets of antipsychotic drugs (APDs) - first-generation APDs inducing Parkinsonism and second-generation APDs typically effective against psychotic symptoms in PD - and two sets of PD drugs, one at risk of psychosis and the other with a lower risk of psychosis. Although global brain expression patterns did not effectively differentiate between the targets of the two sets of APDs, they did differentiate the targets of the two PD drug sets. However, both APD and PD target sets showed differences in mean expression levels in specific brain regions. Moreover, they showed significant enrichment for genes highly expressed in distinct adult and prenatal brain structures relative to the overall distribution of such genes among all brain-expressed genes. Specific neurotransmitter systems, either individually or in combinations, appeared to underlie the clinically informed drug categories, indicating their differential roles in inducing or not inducing PD and psychosis. Additionally, the target sets formed distinct network modules representing different biological mechanisms and exhibited differential proximity to putative PD and SZ risk genes in the human interactome. In summary, our study identified specific spatial and functional features that distinguish the target sets of PD and antipsychotic drugs with different clinical effects.
Collapse
Affiliation(s)
- Kalyani B Karunakaran
- School of Pharmacy, University of Reading, Reading, UK.
- Department of Psychiatry, National Institute of Mental Health and Neurosciences (NIMHANS), Bangalore, India.
| | - Sanjeev Jain
- Department of Psychiatry, National Institute of Mental Health and Neurosciences (NIMHANS), Bangalore, India
| | - Darius Widera
- School of Pharmacy, University of Reading, Reading, UK
| | | |
Collapse
|
12
|
Nogueira-Rodríguez A, Glez-Peña D, Vieira CP, Vieira J, López-Fernández H. Towards a more accurate and reliable evaluation of machine learning protein-protein interaction prediction model performance in the presence of unavoidable dataset biases. J Integr Bioinform 2025:jib-2024-0054. [PMID: 40165676 DOI: 10.1515/jib-2024-0054] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2024] [Accepted: 02/26/2025] [Indexed: 04/02/2025] Open
Abstract
The characterization of protein-protein interactions (PPIs) is fundamental to understand cellular functions. Although machine learning methods in this task have historically reported prediction accuracies up to 95 %, including those only using raw protein sequences, it has been highlighted that this could be overestimated due to the use of random splits and metrics that do not take into account potential biases in the datasets. Here, we propose a per-protein utility metric, pp_MCC, able to show a drop in the performance in both random and unseen-protein splits scenarios. We tested ML models based on sequence embeddings. The pp_MCC metric evidences a reduced performance even in a random split, reaching levels similar to those shown by the raw MCC metric computed over an unseen protein split, and drops even further when the pp_MCC is used in an unseen protein split scenario. Thus, the metric is able to give a more realistic performance estimation while allowing to use random splits, which could be interesting for more protein-centric studies. Given the low adjusted performance obtained, there seems to be room for improvement when using only primary sequence information, suggesting the need of inclusion of complementary protein data, accompanied with the use of the pp_MCC metric.
Collapse
Affiliation(s)
- Alba Nogueira-Rodríguez
- Instituto de Investigação e Inovação em Saúde (i3S), Universidade do Porto, Rua Alfredo Allen 208, 4200-135 Porto, Portugal
- SING Research Group, Galicia Sur Health Research Institute (IIS Galicia Sur), SERGAS-UVIGO, 36213 Vigo, Spain
| | - Daniel Glez-Peña
- SING Research Group, Galicia Sur Health Research Institute (IIS Galicia Sur), SERGAS-UVIGO, 36213 Vigo, Spain
| | - Cristina P Vieira
- Instituto de Investigação e Inovação em Saúde (i3S), Universidade do Porto, Rua Alfredo Allen 208, 4200-135 Porto, Portugal
- Instituto de Biologia Molecular e Celular (IBMC), Rua Alfredo Allen, 208, 4200-135 Porto, Portugal
| | - Jorge Vieira
- Instituto de Investigação e Inovação em Saúde (i3S), Universidade do Porto, Rua Alfredo Allen 208, 4200-135 Porto, Portugal
- Instituto de Biologia Molecular e Celular (IBMC), Rua Alfredo Allen, 208, 4200-135 Porto, Portugal
| | - Hugo López-Fernández
- SING Research Group, Galicia Sur Health Research Institute (IIS Galicia Sur), SERGAS-UVIGO, 36213 Vigo, Spain
| |
Collapse
|
13
|
Lubaba F, George M, Ahmed M, John L, Goplakrishnan AP, Shivamurthy PB, Varghese S, Pahal P, Nisar M, Ramesh P, Madar IH, Raju R. Theranostic Target NSUN2, a C(5)-Methyltransferase, Phospho-Regulatory Network Uncovered with Systematic Assembly of 805 Datasets. OMICS : A JOURNAL OF INTEGRATIVE BIOLOGY 2025; 29:164-177. [PMID: 40126188 DOI: 10.1089/omi.2025.0025] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/25/2025]
Abstract
The RNA cytosine C(5)-methyltransferase NSUN2 is involved in RNA modification and regulates gene expression and genomic stability. Beyond multiple sequence/copy number variations, NSUN2 displays altered phosphoprotein expression in various cancers and developmental disorders, thereby making it a prime molecular target of relevance to both therapeutics and diagnostics, that is, theranostics. Despite its key role in kinase-regulated pathways and broader biological processes, the phospho-regulatory network of NSUN2 remains largely unexplored. We report here a systematic assembly of 805 phosphoproteomics datasets from the literature, among which 239 datasets showed differential regulation of NSUN2 phosphopeptides and 40 ensembled phosphosites in NSUN2. Significantly, the phosphorylation sites Ser456, Ser743, and Ser751 represented NSUN2 in ∼50% of these datasets. This is notable given that the functional roles of these phosphosites have been previously underappreciated and underrepresented in the scientific literature. Therefore, we implemented a codetection/coregulation approach based on the phosphosites in other proteins that are codifferentially regulated with phosphopeptides of NSUN2. This approach led to our identification of 55 interactors, 4 potential kinases, and 7 other methylases whose phosphopeptides were codifferentially regulated with NSUN2 phosphopeptides. To the best of our knowledge, this study provides the first phosphosite-centric regulatory network model of NSUN2 to employ theranostic strategies for targeting NSUN2 in cancers and other disorders.
Collapse
Affiliation(s)
- Fathimathul Lubaba
- Centre for Integrative Omics Data Science (CIODS), Yenepoya (Deemed to be University), Mangalore, India
| | - Mejo George
- Centre for Integrative Omics Data Science (CIODS), Yenepoya (Deemed to be University), Mangalore, India
| | - Mukhtar Ahmed
- Department of Zoology, College of Science, King Saud University, Riyadh, Saudi Arabia
| | - Levin John
- Centre for Integrative Omics Data Science (CIODS), Yenepoya (Deemed to be University), Mangalore, India
| | | | | | - Susmi Varghese
- Centre for Integrative Omics Data Science (CIODS), Yenepoya (Deemed to be University), Mangalore, India
| | - Priyanka Pahal
- Centre for Integrative Omics Data Science (CIODS), Yenepoya (Deemed to be University), Mangalore, India
| | - Mahammad Nisar
- Centre for Integrative Omics Data Science (CIODS), Yenepoya (Deemed to be University), Mangalore, India
| | - Poornima Ramesh
- Centre for Integrative Omics Data Science (CIODS), Yenepoya (Deemed to be University), Mangalore, India
| | - Inamul Hasan Madar
- Centre for Integrative Omics Data Science (CIODS), Yenepoya (Deemed to be University), Mangalore, India
| | - Rajesh Raju
- Centre for Integrative Omics Data Science (CIODS), Yenepoya (Deemed to be University), Mangalore, India
| |
Collapse
|
14
|
Mahin A, Gopalakrishnan AP, Ahmed M, Nisar M, John L, Shivamurthy PB, Ummar S, Varghese S, Modi PK, Pai VR, Prasad TSK, Raju R. Orchestrating Intracellular Calcium Signaling Cascades by Phosphosite-Centric Regulatory Network: A Comprehensive Analysis on Kinases CAMKK1 and CAMKK2. OMICS : A JOURNAL OF INTEGRATIVE BIOLOGY 2025; 29:139-153. [PMID: 40079160 DOI: 10.1089/omi.2024.0196] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/14/2025]
Abstract
Intracellular calcium signaling is a cornerstone in cell biology and a key molecular target for human health and disease. Calcium/calmodulin dependent protein kinase kinases, CAMKK1 and CAMKK2 are serine/threonine kinases that contribute to the regulation of intracellular calcium signals in response to diverse stimuli. CAMKK1 generally has stable dynamics, whereas CAMKK2 dysregulation triggers oncogenicity and neurological disorders. To differentiate the phosphosignaling hierarchy associated with predominant phosphosites of CAMKK1 and CAMKK2, we assembled and analyzed the global cellular phosphoproteome datasets. We found that predominant phosphosites in CAMKK1 and CAMKK2 are located outside the kinase domain, and their phosphomotifs are highly homologous. Further, we employed a coregulation analysis approach to these predominant phosphosites, to infer the co-occurrence patterns of phosphorylations within CAMKKs and the coregulation patterns of other protein phosphosites with CAMKK sites. We report herein that independent phosphorylations at CAMKK2 S100 and S511 increase their enzymatic activity in the presence of calcium/calmodulin. In addition, the study unveils kinase-substrate associations such as RPS6KB1 as a novel high-confidence upstream kinase of both CAMKK1 S74 and CAMKK2 S100. Further, CAMKK2 was identified as a primary orchestrator in mediating intracellular calcium signaling cascades compared to CAMKK1 based on coregulation patterns of phosphosites from proteins involved in the calcium signaling pathway. These molecular details shed promising insights into the pathophysiology of several diseases such as cancers and psychiatric disorders associated with kinase activity dysregulations of CAMKK2 and further open the avenue for novel PTM-directed therapeutic strategies to regulate CAMKK2.
Collapse
Affiliation(s)
- Althaf Mahin
- Centre for Integrative Omics Data Science (CIODS), Yenepoya (Deemed to be University), Mangalore, India
- Center for Systems Biology and Molecular Medicine (CSBMM) [an ICMR-Collaborating Centre of Excellence (ICMR-CCoE 2024)], Yenepoya Research Centre, Yenepoya (Deemed to be University), Mangalore, India
| | - Athira Perunelly Gopalakrishnan
- Centre for Integrative Omics Data Science (CIODS), Yenepoya (Deemed to be University), Mangalore, India
- Center for Systems Biology and Molecular Medicine (CSBMM) [an ICMR-Collaborating Centre of Excellence (ICMR-CCoE 2024)], Yenepoya Research Centre, Yenepoya (Deemed to be University), Mangalore, India
| | - Mukhtar Ahmed
- Department of Zoology, College of Science, King Saud University, Riyadh, Kingdom of Saudi Arabia
| | - Mahammed Nisar
- Centre for Integrative Omics Data Science (CIODS), Yenepoya (Deemed to be University), Mangalore, India
| | - Levin John
- Centre for Integrative Omics Data Science (CIODS), Yenepoya (Deemed to be University), Mangalore, India
| | | | - Samseera Ummar
- Centre for Integrative Omics Data Science (CIODS), Yenepoya (Deemed to be University), Mangalore, India
| | - Susmi Varghese
- Centre for Integrative Omics Data Science (CIODS), Yenepoya (Deemed to be University), Mangalore, India
| | - Prashant Kumar Modi
- Center for Systems Biology and Molecular Medicine (CSBMM) [an ICMR-Collaborating Centre of Excellence (ICMR-CCoE 2024)], Yenepoya Research Centre, Yenepoya (Deemed to be University), Mangalore, India
| | - Vinitha Ramanath Pai
- Department of Biochemistry, Yenepoya Medical College, Yenepoya (Deemed to be University), Mangaluru, India
| | - Thottethodi Subrahmanya Keshava Prasad
- Center for Systems Biology and Molecular Medicine (CSBMM) [an ICMR-Collaborating Centre of Excellence (ICMR-CCoE 2024)], Yenepoya Research Centre, Yenepoya (Deemed to be University), Mangalore, India
| | - Rajesh Raju
- Centre for Integrative Omics Data Science (CIODS), Yenepoya (Deemed to be University), Mangalore, India
- Center for Systems Biology and Molecular Medicine (CSBMM) [an ICMR-Collaborating Centre of Excellence (ICMR-CCoE 2024)], Yenepoya Research Centre, Yenepoya (Deemed to be University), Mangalore, India
| |
Collapse
|
15
|
Dey L, Chakraborty S. Supervised learning approaches for predicting Ebola-Human Protein-Protein interactions. Gene 2025; 942:149228. [PMID: 39828063 DOI: 10.1016/j.gene.2025.149228] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2024] [Revised: 12/04/2024] [Accepted: 01/07/2025] [Indexed: 01/22/2025]
Abstract
The goal of this research work is to predict protein-protein interactions (PPIs) between the Ebola virus and the host who is at risk of infection. Since there are very limited databases available on the Ebola virus; we have prepared a comprehensive database of all the PPIs between the Ebola virus and human proteins (EbolaInt). Our work focuses on the finding of some new protein-protein interactions between humans and the Ebola virus using some state- of-the-arts machine learning techniques. However, it is basically a two-class problem with a positive interacting dataset and a negative non-interacting dataset. These datasets contain various sequence-based human protein features such as structure of amino acid and conjoint triad and domain-related features. In this research, we have briefly discussed and used some well-known supervised learning approaches to predict PPIs between human proteins and Ebola virus proteins, including K-nearest neighbours (KNN), random forest (RF), support vector machine (SVM), and deep feed-forward multi-layer perceptron (DMLP) etc. We have validated our prediction results using gene ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis. Our goal with this prediction is to compare all other models' accuracy, precision, recall, and f1-score for predicting these PPIs. In the result section, DMLP is giving the highest accuracy along with the prediction of 2655 potential human target proteins.
Collapse
Affiliation(s)
- Lopamudra Dey
- Department of Biomedical and Clinical Sciences, Linköping University, Sweden; Department of Computer Science & Engineering, Meghnad Saha Institute of Technology, Kolkata, India
| | - Sanjay Chakraborty
- Department of Computer and Information Science (IDA), REAL, AIICS, Linköping University, Sweden; Department of Computer Science & Engineering, Techno International New Town, Kolkata, India.
| |
Collapse
|
16
|
Göktepe YE. Protein-protein interaction prediction using enhanced features with spaced conjoint triad and amino acid pairwise distance. PeerJ Comput Sci 2025; 11:e2748. [PMID: 40134873 PMCID: PMC11935777 DOI: 10.7717/peerj-cs.2748] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2024] [Accepted: 02/14/2025] [Indexed: 03/27/2025]
Abstract
Protein-protein interactions (PPIs) are pivotal in cellular processes, influencing a wide range of functions, from metabolism to immune responses. Despite the advancements in experimental techniques for PPI detection, their inherent limitations, such as high false-positive rates and significant resource demands, necessitate the development of computational approaches. This study presents a novel computational model named MFPIC (Multi-Feature Protein Interaction Classifier) for predicting PPIs, integrating enhanced sequence-based features, including a novel spaced conjoint triad (SCT) and amino acid pairwise distance (AAPD), with existing methods such as position-specific scoring matrices (PSSM) and AAindex-based features. The SCT captures complex sequence motifs by considering non-adjacent amino acid interactions, while AAPD provides critical spatial information about amino acid residues within protein sequences. The proposed model was evaluated across three benchmark datasets-Saccharomyces cerevisiae, Helicobacter pylori, and human proteins-demonstrating superior performance in comparison to state-of-the-art models. The results underscore the efficacy of integrating diverse and complementary features, achieving significant improvements in predictive accuracy, with the model achieving 95.90%, 99.33%, and 90.95% accuracy on the Saccharomyces cerevisiae, Helicobacter pylori, and human dataset, respectively. This approach not only enhances our understanding of PPI mechanisms but also offers valuable insights for the development of targeted therapeutic strategies.
Collapse
|
17
|
Thapa K, Kinali M, Pei S, Luna A, Babur Ö. Strategies to include prior knowledge in omics analysis with deep neural networks. PATTERNS (NEW YORK, N.Y.) 2025; 6:101203. [PMID: 40182174 PMCID: PMC11963003 DOI: 10.1016/j.patter.2025.101203] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 04/05/2025]
Abstract
High-throughput molecular profiling technologies have revolutionized molecular biology research in the past decades. One important use of molecular data is to make predictions of phenotypes and other features of the organisms using machine learning algorithms. Deep learning models have become increasingly popular for this task due to their ability to learn complex non-linear patterns. Applying deep learning to molecular profiles, however, is challenging due to the very high dimensionality of the data and relatively small sample sizes, causing models to overfit. A solution is to incorporate biological prior knowledge to guide the learning algorithm for processing the functionally related input together. This helps regularize the models and improve their generalizability and interpretability. Here, we describe three major strategies proposed to use prior knowledge in deep learning models to make predictions based on molecular profiles. We review the related deep learning architectures, including the major ideas in relatively new graph neural networks.
Collapse
Affiliation(s)
- Kisan Thapa
- Computer Science Department, University of Massachusetts Boston, 100 Morrissey Boulevard, Boston, MA 02125, USA
| | - Meric Kinali
- Computer Science Department, University of Massachusetts Boston, 100 Morrissey Boulevard, Boston, MA 02125, USA
| | - Shichao Pei
- Computer Science Department, University of Massachusetts Boston, 100 Morrissey Boulevard, Boston, MA 02125, USA
| | - Augustin Luna
- Developmental Therapeutics Branch, Center for Cancer Research, National Cancer Institute, NIH, 9000 Rockville Pike, Bathesda, MD 20892, USA
- Computational Biology Branch, National Library of Medicine, NIH, 9000 Rockville Pike, Bathesda, MD 20892, USA
| | - Özgün Babur
- Computer Science Department, University of Massachusetts Boston, 100 Morrissey Boulevard, Boston, MA 02125, USA
| |
Collapse
|
18
|
Hyeon DY, Nam D, Shin HJ, Jeong J, Jung E, Cho SY, Shin DH, Ku JL, Baek HJ, Yoo CW, Hong EK, Lim MC, Lee SJ, Bae YK, Kim JK, Bae J, Choi W, Kim SJ, Back S, Kang C, Madar IH, Kim H, Kim S, Kim DK, Kang J, Park GW, Park KS, Shin Y, Kim SS, Jung K, Hwang D, Lee SW, Kim JY. Proteogenomic characterization of molecular and cellular targets for treatment-resistant subtypes in locally advanced cervical cancers. Mol Cancer 2025; 24:77. [PMID: 40087745 PMCID: PMC11908047 DOI: 10.1186/s12943-025-02256-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2024] [Accepted: 02/01/2025] [Indexed: 03/17/2025] Open
Abstract
We report proteogenomic analysis of locally advanced cervical cancer (LACC). Exome-seq data revealed predominant alterations of keratinization-TP53 regulation and O-glycosylation-TP53 regulation axes in squamous and adeno-LACC, respectively, compared to in early-stage cervical cancer. Integrated clustering of mRNA, protein, and phosphorylation data identified six subtypes (Sub1-6) of LACC among which Sub3, 5, and 6 showed the treatment-resistant nature with poor local recurrence-free survival. Elevated immune and extracellular matrix (ECM) activation mediated by activated stroma (PDGFD and CXCL1high fibroblasts) characterized the immune-hot Sub3 enriched with MUC5AChigh epithelial cells (ECs). Increased epithelial-mesenchymal-transition (EMT) and ECM remodeling characterized the immune-cold squamous Sub5 enriched with PGK1 and CXCL10high ECs. We further demonstrated that CIC mutations could trigger EMT activation by upregulating ETV4, and the elevation of the immune checkpoint PVR and neutrophil-like myeloid-derived suppressive cells (FCN1 and FCGR3Bhigh macrophages) could cause suppression of T-cell activation in Sub5. Increased O-linked glycosylation of mucin characterized adeno-LACC Sub6 enriched with MUC5AChigh ECs. These results provide a battery of somatic mutations, cellular pathways, and cellular players that can be used to predict treatment-resistant LACC subtypes and can serve as potential therapeutic targets for these LACC subtypes.
Collapse
Affiliation(s)
- Do Young Hyeon
- School of Biological Sciences, Seoul National University, Seoul, 08826, Republic of Korea
| | - Dowoon Nam
- Department of Chemistry and Center for Proteogenome Research, Korea University, Seoul, 02843, Republic of Korea
| | - Hye-Jin Shin
- Research Institute and Hospital, National Cancer Center, Goyang, 10408, Republic of Korea
| | - Juhee Jeong
- Department of Anatomy and Cell Biology and Department of Biomedical Sciences, Seoul National University College of Medicine, Seoul, 03080, Republic of Korea
| | - Eunsoo Jung
- School of Biological Sciences, Seoul National University, Seoul, 08826, Republic of Korea
| | - Soo Young Cho
- Research Institute and Hospital, National Cancer Center, Goyang, 10408, Republic of Korea
| | - Dong Hoon Shin
- Research Institute and Hospital, National Cancer Center, Goyang, 10408, Republic of Korea
| | - Ja-Lok Ku
- Korean Cell Line Bank, Laboratory of Cell Biology, Cancer Research Institute, Seoul National University College of Medicine, Seoul, Korea
| | - Hye Jung Baek
- Research Institute and Hospital, National Cancer Center, Goyang, 10408, Republic of Korea
| | - Chong Woo Yoo
- Research Institute and Hospital, National Cancer Center, Goyang, 10408, Republic of Korea
| | - Eun-Kyung Hong
- Research Institute and Hospital, National Cancer Center, Goyang, 10408, Republic of Korea
| | - Myong Cheol Lim
- Research Institute and Hospital, National Cancer Center, Goyang, 10408, Republic of Korea
| | - Sang-Jin Lee
- Research Institute and Hospital, National Cancer Center, Goyang, 10408, Republic of Korea
| | - Young-Ki Bae
- Research Institute and Hospital, National Cancer Center, Goyang, 10408, Republic of Korea
| | - Jong Kwang Kim
- Research Institute and Hospital, National Cancer Center, Goyang, 10408, Republic of Korea
| | - Jingi Bae
- Department of Chemistry and Center for Proteogenome Research, Korea University, Seoul, 02843, Republic of Korea
| | - Wonyoung Choi
- Research Institute and Hospital, National Cancer Center, Goyang, 10408, Republic of Korea
| | - Su-Jin Kim
- Department of Chemistry and Center for Proteogenome Research, Korea University, Seoul, 02843, Republic of Korea
| | - Seunghoon Back
- Department of Chemistry and Center for Proteogenome Research, Korea University, Seoul, 02843, Republic of Korea
| | - Chaewon Kang
- Department of Chemistry and Center for Proteogenome Research, Korea University, Seoul, 02843, Republic of Korea
| | - Inamul Hasan Madar
- Department of Chemistry and Center for Proteogenome Research, Korea University, Seoul, 02843, Republic of Korea
| | - Hokeun Kim
- Department of Chemistry and Center for Proteogenome Research, Korea University, Seoul, 02843, Republic of Korea
| | - Suhwan Kim
- Department of Chemistry and Center for Proteogenome Research, Korea University, Seoul, 02843, Republic of Korea
| | - Duk Ki Kim
- Department of Anatomy and Cell Biology and Department of Biomedical Sciences, Seoul National University College of Medicine, Seoul, 03080, Republic of Korea
| | - Jihyung Kang
- Department of Anatomy and Cell Biology and Department of Biomedical Sciences, Seoul National University College of Medicine, Seoul, 03080, Republic of Korea
| | - Geon Woo Park
- Department of Anatomy and Cell Biology and Department of Biomedical Sciences, Seoul National University College of Medicine, Seoul, 03080, Republic of Korea
| | - Ki Seok Park
- Department of Anatomy and Cell Biology and Department of Biomedical Sciences, Seoul National University College of Medicine, Seoul, 03080, Republic of Korea
| | - Yourae Shin
- School of Biological Sciences, Seoul National University, Seoul, 08826, Republic of Korea
| | - Sang Soo Kim
- Research Institute and Hospital, National Cancer Center, Goyang, 10408, Republic of Korea.
| | - Keehoon Jung
- Department of Anatomy and Cell Biology and Department of Biomedical Sciences, Seoul National University College of Medicine, Seoul, 03080, Republic of Korea.
- Institute of Allergy and Clinical Immunology, Seoul National University Medical Research Center, Seoul, 03080, Republic of Korea.
| | - Daehee Hwang
- School of Biological Sciences, Seoul National University, Seoul, 08826, Republic of Korea.
| | - Sang-Won Lee
- Department of Chemistry and Center for Proteogenome Research, Korea University, Seoul, 02843, Republic of Korea.
| | - Joo-Young Kim
- Research Institute and Hospital, National Cancer Center, Goyang, 10408, Republic of Korea.
| |
Collapse
|
19
|
Parthaje S, Janardhanan M, Paul P, Karunakaran KB, Deb AP, Shankarappa B, Pal PK, Mahadevan A, Jain S, Viswanath B, Purushottam M. CAG Repeat Instability and Region-Specific Gene Expression Changes in the SCA12 Brain. CEREBELLUM (LONDON, ENGLAND) 2025; 24:60. [PMID: 40075006 DOI: 10.1007/s12311-025-01808-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Accepted: 02/16/2025] [Indexed: 03/14/2025]
Abstract
Spinocerebellar ataxia type 12 (SCA12), an autosomal dominant cerebellar ataxia, caused by an expansion of (CAG)n in the 5' of the PPP2R2B gene on chr5q32, is common in India. The illness often manifests late in life, with diverse neurological and psychiatric symptoms, suggesting involvement of different brain regions. Prominent neuronal loss and atrophy of the cerebellum have been noted earlier. In Huntington's disease (HD), somatic instability associated with the size of the expanded CAG allele in HTT varies across regions of the brain, and influences the nature and severity of symptoms. We estimated CAG repeat size, methylation and gene expression in the PPP2R2B gene across regions in brain tissue from a person with SCA12. We also studied the regional expression of DNA repair pathway and cell cycle genes. Somatic mosaicism, manifested as CAG repeat instability, is detected across brain regions. The cerebellum showed the least somatic instability, and this was coupled with increased methylation, and lower expression, of the PPP2R2B gene. Interestingly, increased expression of DNA maintenance pathway related genes, which might partly explain the lowered DNA instability, was also observed. There was also decreased expression of cell cycle modulators, which could initiate apoptosis, and thus account for neuronal cell death seen in the brain sections. We suggest that drugs that improve DNA repeat stability, could thus be explored as a treatment option for SCA12.
Collapse
Affiliation(s)
- Shreevidya Parthaje
- Molecular Genetics Laboratory, Department of Psychiatry, National Institute of Mental Health and Neurosciences (NIMHANS), Bangalore, India
| | - Meghana Janardhanan
- Molecular Genetics Laboratory, Department of Psychiatry, National Institute of Mental Health and Neurosciences (NIMHANS), Bangalore, India
- Department of Medical Neuroscience, Dalhousie University, Halifax, Canada
- Institute of Psychiatric Phenomics and Genomics, University Hospital of Munich, Munich, Germany
| | - Pradip Paul
- Molecular Genetics Laboratory, Department of Psychiatry, National Institute of Mental Health and Neurosciences (NIMHANS), Bangalore, India
| | - Kalyani B Karunakaran
- Molecular Genetics Laboratory, Department of Psychiatry, National Institute of Mental Health and Neurosciences (NIMHANS), Bangalore, India
| | - Ashim Paul Deb
- Molecular Genetics Laboratory, Department of Psychiatry, National Institute of Mental Health and Neurosciences (NIMHANS), Bangalore, India
| | - Bhagyalakshmi Shankarappa
- Molecular Genetics Laboratory, Department of Psychiatry, National Institute of Mental Health and Neurosciences (NIMHANS), Bangalore, India
| | - Pramod Kumar Pal
- Department of Neurology, National Institute of Mental Health and Neurosciences (NIMHANS), Bangalore, India
| | - Anita Mahadevan
- Department of Neuropathology, National Institute of Mental Health and Neurosciences (NIMHANS), Bangalore, India
| | - Sanjeev Jain
- Molecular Genetics Laboratory, Department of Psychiatry, National Institute of Mental Health and Neurosciences (NIMHANS), Bangalore, India
| | - Biju Viswanath
- Molecular Genetics Laboratory, Department of Psychiatry, National Institute of Mental Health and Neurosciences (NIMHANS), Bangalore, India.
| | - Meera Purushottam
- Molecular Genetics Laboratory, Department of Psychiatry, National Institute of Mental Health and Neurosciences (NIMHANS), Bangalore, India.
| |
Collapse
|
20
|
Cheng H, Liang Z, Wu Y, Hu J, Cao B, Liu Z, Liu B, Cheng H, Liu ZX. Inferring kinase-phosphosite regulation from phosphoproteome-enriched cancer multi-omics datasets. Brief Bioinform 2025; 26:bbaf143. [PMID: 40194556 PMCID: PMC11975364 DOI: 10.1093/bib/bbaf143] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2024] [Revised: 02/03/2025] [Accepted: 03/14/2025] [Indexed: 04/09/2025] Open
Abstract
Phosphorylation in eukaryotic cells plays a key role in regulating cell signaling and disease progression. Despite the ability to detect thousands of phosphosites in a single experiment using high-throughput technologies, the kinases responsible for regulating these sites are largely unidentified. To solve this, we collected the quantitative data at the transcriptional, protein, and phosphorylation levels of 10 159 samples from 23 tumor datasets and 15 adjacent normal tissue datasets. Our analysis aimed to uncover the potential impact and linkage of kinase-phosphosite (KPS) pairs through experimental evidence in publications and prediction tools commonly used. We discovered that both experimentally validated and tool-predicted KPS pairs were enriched in groups where there is a significant correlation between kinase expression/phosphorylation level and the phosphorylation level of phosphosite. This suggested that a quantitative correlation could infer the KPS interconnections. Furthermore, the Spearman's correlation coefficient for these pairs were notably higher in tumor samples, indicating that these regulatory interactions are particularly pronounced in tumors. Consequently, building on the KPS correlations of different datasets as predictive features, we have developed an innovative approach that employed an oversampling method combined with and XGBoost algorithm (SMOTE-XGBoost) to predict potential kinase-specific phosphorylation sites in proteins. Moreover, the computed correlations and predictions of kinase-phosphosite interconnections were integrated into the eKPI database (https://ekpi.omicsbio.info/). In summary, our study could provide helpful information and facilitate further research on the regulatory relationship between kinases and phosphosites.
Collapse
Affiliation(s)
- Haoyang Cheng
- State Key Laboratory of Oncology in South China, Guangdong Provincial Clinical Research Center for Cancer, Sun Yat-sen University Cancer Center, 651 Dongfeng East Road, Guangzhou 510060, China
- Department of Computer Science, The University of Hong Kong, Pokfulam Road, Hong Kong Special Administrative Region 999077, China
| | - Zhuoran Liang
- State Key Laboratory of Oncology in South China, Guangdong Provincial Clinical Research Center for Cancer, Sun Yat-sen University Cancer Center, 651 Dongfeng East Road, Guangzhou 510060, China
| | - Yijin Wu
- State Key Laboratory of Oncology in South China, Guangdong Provincial Clinical Research Center for Cancer, Sun Yat-sen University Cancer Center, 651 Dongfeng East Road, Guangzhou 510060, China
| | - Jiamin Hu
- State Key Laboratory of Oncology in South China, Guangdong Provincial Clinical Research Center for Cancer, Sun Yat-sen University Cancer Center, 651 Dongfeng East Road, Guangzhou 510060, China
| | - Bijin Cao
- State Key Laboratory of Oncology in South China, Guangdong Provincial Clinical Research Center for Cancer, Sun Yat-sen University Cancer Center, 651 Dongfeng East Road, Guangzhou 510060, China
- School of Life Sciences, Zhengzhou University, 100 Science Avenue, Zhengzhou 450001, China
| | - Zekun Liu
- State Key Laboratory of Oncology in South China, Guangdong Provincial Clinical Research Center for Cancer, Sun Yat-sen University Cancer Center, 651 Dongfeng East Road, Guangzhou 510060, China
| | - Bo Liu
- State Key Laboratory of Oncology in South China, Guangdong Provincial Clinical Research Center for Cancer, Sun Yat-sen University Cancer Center, 651 Dongfeng East Road, Guangzhou 510060, China
- School of Life Sciences, Zhengzhou University, 100 Science Avenue, Zhengzhou 450001, China
| | - Han Cheng
- School of Life Sciences, Zhengzhou University, 100 Science Avenue, Zhengzhou 450001, China
| | - Ze-Xian Liu
- State Key Laboratory of Oncology in South China, Guangdong Provincial Clinical Research Center for Cancer, Sun Yat-sen University Cancer Center, 651 Dongfeng East Road, Guangzhou 510060, China
| |
Collapse
|
21
|
Li J, Lu X, Jiang K, Tang D, Ning B, Sun F. TARSL: Triple-Attention Cross-Network Representation Learning to Predict Synthetic Lethality for Anti-Cancer Drug Discovery. IEEE J Biomed Health Inform 2025; 29:1680-1691. [PMID: 37603479 DOI: 10.1109/jbhi.2023.3306768] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/23/2023]
Abstract
Cancer is a multifaceted disease that results from co-mutations of multi biological molecules. A promising strategy for cancer therapy involves in exploiting the phenomenon of Synthetic Lethality (SL) by targeting the SL partner of cancer gene. Since traditional methods for SL prediction suffer from high-cost, time-consuming and off-targets effects, computational approaches have been efficient complementary to these methods. Most of existing approaches treat SL associations as independent of other biological interaction networks, and fail to consider other information from various biological networks. Despite some approaches have integrated different networks to capture multi-modal features of genes for SL prediction, these methods implicitly assume that all sources and levels of information contribute equally to the SL associations. As such, a comprehensive and flexible framework for learning gene cross-network representations for SL prediction is still lacking. In this work, we present a novel Triple-Attention cross-network Representation learning for SL prediction (TARSL) by capturing molecular features from heterogeneous sources. We employ three-level attention modules to consider the different contribution of multi-level information. In particular, feature-level attention can capture the correlations between molecular feature and network link, node-level attention can differentiate the importance of various neighbors, and network-level attention can concentrate on important network and reduce the effects of irrelated networks. We perform comprehensive experiments on human SL datasets and these results have proven that our model is consistently superior to baseline methods and predicted SL associations could aid in designing anti-cancer drugs.
Collapse
|
22
|
Kiouri DP, Batsis GC, Mavromoustakos T, Giuliani A, Chasapis CT. Structure-Based Modeling of the Gut Bacteria-Host Interactome Through Statistical Analysis of Domain-Domain Associations Using Machine Learning. BIOTECH 2025; 14:13. [PMID: 40227324 PMCID: PMC11940256 DOI: 10.3390/biotech14010013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2025] [Revised: 02/16/2025] [Accepted: 02/21/2025] [Indexed: 04/15/2025] Open
Abstract
The gut microbiome, a complex ecosystem of microorganisms, plays a pivotal role in human health and disease. The gut microbiome's influence extends beyond the digestive system to various organs, and its imbalance is linked to a wide range of diseases, including cancer and neurodevelopmental, inflammatory, metabolic, cardiovascular, autoimmune, and psychiatric diseases. Despite its significance, the interactions between gut bacteria and human proteins remain understudied, with less than 20,000 experimentally validated protein interactions between the host and any bacteria species. This study addresses this knowledge gap by predicting a protein-protein interaction network between gut bacterial and human proteins. Using statistical associations between Pfam domains, a comprehensive dataset of over one million experimentally validated pan-bacterial-human protein interactions, as well as inter- and intra-species protein interactions from various organisms, were used for the development of a machine learning-based prediction method to uncover key regulatory molecules in this dynamic system. This study's findings contribute to the understanding of the intricate gut microbiome-host relationship and pave the way for future experimental validation and therapeutic strategies targeting the gut microbiome interplay.
Collapse
Affiliation(s)
- Despoina P. Kiouri
- Institute of Chemical Biology, National Hellenic Research Foundation, 11635 Athens, Greece; (D.P.K.); (G.C.B.)
- Laboratory of Organic Chemistry, Department of Chemistry, National and Kapodistrian University of Athens, 15772 Athens, Greece;
| | - Georgios C. Batsis
- Institute of Chemical Biology, National Hellenic Research Foundation, 11635 Athens, Greece; (D.P.K.); (G.C.B.)
| | - Thomas Mavromoustakos
- Laboratory of Organic Chemistry, Department of Chemistry, National and Kapodistrian University of Athens, 15772 Athens, Greece;
| | - Alessandro Giuliani
- Environment and Health Department, Istituto Superiore di Sanità, 00161 Rome, Italy;
| | - Christos T. Chasapis
- Institute of Chemical Biology, National Hellenic Research Foundation, 11635 Athens, Greece; (D.P.K.); (G.C.B.)
| |
Collapse
|
23
|
Kiouri DP, Batsis GC, Chasapis CT. Structure-Based Deep Learning Framework for Modeling Human-Gut Bacterial Protein Interactions. Proteomes 2025; 13:10. [PMID: 39982320 PMCID: PMC11843979 DOI: 10.3390/proteomes13010010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2024] [Revised: 02/09/2025] [Accepted: 02/11/2025] [Indexed: 02/22/2025] Open
Abstract
Background: The interaction network between the human host proteins and the proteins of the gut bacteria is essential for the establishment of human health, and its dysregulation directly contributes to disease development. Despite its great importance, experimental data on protein-protein interactions (PPIs) between these species are sparse due to experimental limitations. Methods: This study presents a deep learning-based framework for predicting PPIs between human and gut bacterial proteins using structural data. The framework leverages graph-based protein representations and variational autoencoders (VAEs) to extract structural embeddings from protein graphs, which are then fused through a Bi-directional Cross-Attention module to predict interactions. The model addresses common challenges in PPI datasets, such as class imbalance, using focal loss to emphasize harder-to-classify samples. Results: The results demonstrated that this framework exhibits robust performance, with high precision and recall across validation and test datasets, underscoring its generalizability. By incorporating proteoforms in the analysis, the model accounts for the structural complexity within proteomes, making predictions biologically relevant. Conclusions: These findings offer a scalable tool for investigating the interactions between the host and the gut microbiota, potentially yielding new treatment targets and diagnostics for disorders linked to the microbiome.
Collapse
Affiliation(s)
- Despoina P. Kiouri
- Institute of Chemical Biology, National Hellenic Research Foundation, 11635 Athens, Greece; (D.P.K.); (G.C.B.)
- Laboratory of Organic Chemistry, Department of Chemistry, National and Kapodistrian University of Athens, 15772 Athens, Greece
| | - Georgios C. Batsis
- Institute of Chemical Biology, National Hellenic Research Foundation, 11635 Athens, Greece; (D.P.K.); (G.C.B.)
| | - Christos T. Chasapis
- Institute of Chemical Biology, National Hellenic Research Foundation, 11635 Athens, Greece; (D.P.K.); (G.C.B.)
| |
Collapse
|
24
|
Bachhar S, Kumar S, Dutta B, Das S. Emerging horizons of AI in pharmaceutical research. ADVANCES IN PHARMACOLOGY (SAN DIEGO, CALIF.) 2025; 103:325-348. [PMID: 40175048 DOI: 10.1016/bs.apha.2025.01.016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/04/2025]
Abstract
Artificial Intelligence (AI) has revolutionized drug discovery by enhancing data collection, integration, and predictive modeling across various critical stages. It aggregates vast biological and chemical data, including genomic information, protein structures, and chemical interactions with biological targets. Machine learning techniques and QSAR models are applied by AI to predict compound behaviors and predict potential drug candidates. Docking simulations predict drug-protein interactions, while virtual screening eliminates large chemical databases through efficient sifting. Similarly, AI supports de novo drug design by generating novel molecules, optimized against a particular biological target, using generative models such as generative adversarial network (GAN), always finding lead compounds with the most desirable pharmacological properties. AI used in clinical trials improves efficiency by pinpointing responsive patient cohorts leveraging genetic profiles and biomarkers and maintaining propriety such as dataset diversity and compliance with regulations. This chapter aimed to summarize and analyze the mechanism of AI to accelerate drug discovery by streamlining different processes that enable informed decisions and bring potential life-saving therapies to market faster, amounting to a breakthrough in pharmaceutical research and development.
Collapse
Affiliation(s)
- Sourav Bachhar
- Department of Electronics and Communication Engineering, Kalyani Government Engineering College, Nadia, West Bengal, India; The Institute of Science Culture and Social Studies, Belgharia, Kolkata, West Bengal, India
| | - Suryasarathi Kumar
- The Institute of Science Culture and Social Studies, Belgharia, Kolkata, West Bengal, India; School of Biological Sciences & Technology, Department of Applied Biology, Maulana Abul Kalam Azad University of Technology, Haringhata, West Bengal, India
| | - Basudeb Dutta
- Institute for Integrated Cell-Material Sciences, Kyoto University, Yoshida Ushinomiya-cho, Sakyo-ku, Kyoto, Japan; Department of Chemical Sciences, Indian Institute of Science Education and Research Kolkata, Mohanpur, West Bengal, India; Department of Chemistry, School of Applied Sciences, Kalinga Institute of Industrial Technology (KIIT) Deemed to be University, Bhubaneswar, Odisha, India
| | - Somnath Das
- Department of Chemistry, School of Applied Sciences, Kalinga Institute of Industrial Technology (KIIT) Deemed to be University, Bhubaneswar, Odisha, India.
| |
Collapse
|
25
|
He C, Zhao Z, Wang X, Zheng H, Duan L, Zuo J. Exploring drug-target interaction prediction on cold-start scenarios via meta-learning-based graph transformer. Methods 2025; 234:10-20. [PMID: 39550022 DOI: 10.1016/j.ymeth.2024.11.010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2024] [Revised: 11/07/2024] [Accepted: 11/12/2024] [Indexed: 11/18/2024] Open
Abstract
Predicting drug-target interaction (DTI) is of great importance for drug discovery and development. With the rapid development of biological and chemical technologies, computational methods for DTI prediction are becoming a promising approach. However, there are few solutions to the cold-start problem in DTI prediction scenarios, as these methods rely on existing interaction information to support their modeling. Consequently, they are unable to effectively predict DTIs for new drugs or targets with limited interaction data in the existing work. To this end, we propose a graph transformer method based on meta-learning named MGDTI (short for Meta-learning-based Graph Transformer for Drug-Target Interaction prediction) to fill this gap. Technically, we employ drug-drug similarity and target-target similarity as additional information to mitigate the scarcity of interactions. Besides, we trained MGDTI via meta-learning to be adaptive to cold-start tasks. Moreover, we employed graph transformer to prevent over-smoothing by capturing long-range dependencies. Extensive results on the benchmark dataset demonstrate that MGDTI is effective on DTI prediction under cold-start scenarios.
Collapse
Affiliation(s)
- Chengxin He
- School of Computer Science, Sichuan University, Chengdu 610065, China; College of Biomedical Engineering, Sichuan University, Chengdu 610065, China
| | - Zhenjiang Zhao
- School of Computer Science, Sichuan University, Chengdu 610065, China
| | - Xinye Wang
- School of Computer Science, Sichuan University, Chengdu 610065, China
| | - Huiru Zheng
- School of Computing, Ulster University, Belfast BT15 1ED, Northern Ireland, UK
| | - Lei Duan
- School of Computer Science, Sichuan University, Chengdu 610065, China
| | - Jie Zuo
- School of Computer Science, Sichuan University, Chengdu 610065, China.
| |
Collapse
|
26
|
Sun L, Yin Z, Lu L. ISLRWR: A network diffusion algorithm for drug-target interactions prediction. PLoS One 2025; 20:e0302281. [PMID: 39883675 PMCID: PMC11781719 DOI: 10.1371/journal.pone.0302281] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2023] [Accepted: 04/01/2024] [Indexed: 02/01/2025] Open
Abstract
Machine learning techniques and computer-aided methods are now widely used in the pre-discovery tasks of drug discovery, effectively improving the efficiency of drug development and reducing the workload and cost. In this study, we used multi-source heterogeneous network information to build a network model, learn the network topology through multiple network diffusion algorithms, and obtain compressed low-dimensional feature vectors for predicting drug-target interactions (DTIs). We applied the metropolis-hasting random walk (MHRW) algorithm to improve the performance of the random walk with restart (RWR) algorithm, forming the basis by which the self-loop probability of the current node is removed. Additionally, the propagation efficiency of the MHRW was improved using the improved metropolis-hasting random walk (IMRWR) algorithm, facilitating network deep sampling. Finally, we proposed a correction of the transfer probability of the entire network after increasing the self-loop rate of isolated nodes to form the ISLRWR algorithm. Notably, the ISLRWR algorithm improved the area under the receiver operating characteristic curve (AUROC) by 7.53 and 5.72%, and the area under the precision-recall curve (AUPRC) by 5.95 and 4.19% compared to the RWR and MHRW algorithms, respectively, in predicting DTIs performance. Moreover, after excluding the interference of homologous proteins (popular drugs or targets may lead to inflated prediction results), the ISLRWR algorithm still showed a significant performance improvement.
Collapse
Affiliation(s)
- Lu Sun
- School of Mathematics, Physics and Statistics, Institute for Frontier Medical Technology, Center of Intelligent Computing and Applied Statistics, Shanghai University of Engineering Science, Shanghai, China
| | - Zhixiang Yin
- School of Mathematics, Physics and Statistics, Institute for Frontier Medical Technology, Center of Intelligent Computing and Applied Statistics, Shanghai University of Engineering Science, Shanghai, China
| | - Lin Lu
- Shanghai Xinhao Information Technology Co., Ltd., Shanghai, China
| |
Collapse
|
27
|
Zhang B, Quan L, Zhang Z, Cao L, Chen Q, Peng L, Wang J, Jiang Y, Nie L, Li G, Wu T, Lyu Q. MVCL-DTI: Predicting Drug-Target Interactions Using a Multiview Contrastive Learning Model on a Heterogeneous Graph. J Chem Inf Model 2025; 65:1009-1026. [PMID: 39812134 DOI: 10.1021/acs.jcim.4c02073] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2025]
Abstract
Accurate prediction of drug-target interactions (DTIs) is pivotal for accelerating the processes of drug discovery and drug repurposing. MVCL-DTI, a novel model leveraging heterogeneous graphs for predicting DTIs, tackles the challenge of synthesizing information from varied biological subnetworks. It integrates neighbor view, meta-path view, and diffusion view to capture semantic features and employs an attention-based contrastive learning approach, along with a multiview attention-weighted fusion module, to effectively integrate and adaptively weight the information from the different views. Tested under various conditions on benchmark data sets, including varying positive-to-negative sample ratios, conducting hard negative sampling experiments, and masking known DTIs with different ratios, as well as redundant DTIs with various similarity metrics, MVCL-DTI exhibits strong robust generalization. The model is then employed to predict novel DTIs, with a particular focus on COVID-19-related drugs, highlighting its practical applicability. Ultimately, through features visualization and computational properties analysis, we've pinpointed critical elements, including Gene Ontology and substituent nodes, along with a proper initialization strategy, underscoring their vital role in DTI prediction tasks.
Collapse
Affiliation(s)
- Bei Zhang
- School of Computer Science and Technology, Soochow University, Jiangsu 215006, China
- China Mobile (Suzhou) Software Technology Company Limited, Suzhou 215163, China
| | - Lijun Quan
- School of Computer Science and Technology, Soochow University, Jiangsu 215006, China
- Collaborative Innovation Center of Novel Software Technology and Industrialization, Jiangsu 210000, China
| | - Zhijun Zhang
- School of Computer Science and Technology, Soochow University, Jiangsu 215006, China
| | - Lexin Cao
- School of Computer Science and Technology, Soochow University, Jiangsu 215006, China
| | - Qiufeng Chen
- School of Computer Science and Technology, Soochow University, Jiangsu 215006, China
| | - Liangchen Peng
- School of Computer Science and Technology, Soochow University, Jiangsu 215006, China
| | - Junkai Wang
- School of Computer Science and Technology, Soochow University, Jiangsu 215006, China
| | - Yelu Jiang
- School of Computer Science and Technology, Soochow University, Jiangsu 215006, China
| | - Liangpeng Nie
- School of Computer Science and Technology, Soochow University, Jiangsu 215006, China
| | - Geng Li
- School of Computer Science and Technology, Soochow University, Jiangsu 215006, China
| | - Tingfang Wu
- School of Computer Science and Technology, Soochow University, Jiangsu 215006, China
- Collaborative Innovation Center of Novel Software Technology and Industrialization, Jiangsu 210000, China
| | - Qiang Lyu
- School of Computer Science and Technology, Soochow University, Jiangsu 215006, China
- Collaborative Innovation Center of Novel Software Technology and Industrialization, Jiangsu 210000, China
| |
Collapse
|
28
|
Kiouri DP, Batsis GC, Chasapis CT. Structure-Based Approaches for Protein-Protein Interaction Prediction Using Machine Learning and Deep Learning. Biomolecules 2025; 15:141. [PMID: 39858535 PMCID: PMC11763140 DOI: 10.3390/biom15010141] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2024] [Revised: 01/11/2025] [Accepted: 01/14/2025] [Indexed: 01/27/2025] Open
Abstract
Protein-Protein Interaction (PPI) prediction plays a pivotal role in understanding cellular processes and uncovering molecular mechanisms underlying health and disease. Structure-based PPI prediction has emerged as a robust alternative to sequence-based methods, offering greater biological accuracy by integrating three-dimensional spatial and biochemical features. This work summarizes the recent advances in computational approaches leveraging protein structure information for PPI prediction, focusing on machine learning (ML) and deep learning (DL) techniques. These methods not only improve predictive accuracy but also provide insights into functional sites, such as binding and catalytic residues. However, challenges such as limited high-resolution structural data and the need for effective negative sampling persist. Through the integration of experimental and computational tools, structure-based prediction paves the way for comprehensive proteomic network analysis, holding promise for advancements in drug discovery, biomarker identification, and personalized medicine. Future directions include enhancing scalability and dataset reliability to expand these approaches across diverse proteomes.
Collapse
Affiliation(s)
- Despoina P. Kiouri
- Institute of Chemical Biology, National Hellenic Research Foundation, 11635 Athens, Greece; (D.P.K.); (G.C.B.)
- Laboratory of Organic Chemistry, Department of Chemistry, National and Kapodistrian University of Athens, 15772 Athens, Greece
| | - Georgios C. Batsis
- Institute of Chemical Biology, National Hellenic Research Foundation, 11635 Athens, Greece; (D.P.K.); (G.C.B.)
| | - Christos T. Chasapis
- Institute of Chemical Biology, National Hellenic Research Foundation, 11635 Athens, Greece; (D.P.K.); (G.C.B.)
| |
Collapse
|
29
|
Yang G, Liu Y, Wen S, Chen W, Zhu X, Wang Y. DTI-MHAPR: optimized drug-target interaction prediction via PCA-enhanced features and heterogeneous graph attention networks. BMC Bioinformatics 2025; 26:11. [PMID: 39800678 PMCID: PMC11726937 DOI: 10.1186/s12859-024-06021-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2024] [Accepted: 12/20/2024] [Indexed: 01/16/2025] Open
Abstract
Drug-target interactions (DTIs) are pivotal in drug discovery and development, and their accurate identification can significantly expedite the process. Numerous DTI prediction methods have emerged, yet many fail to fully harness the feature information of drugs and targets or address the issue of feature redundancy. We aim to refine DTI prediction accuracy by eliminating redundant features and capitalizing on the node topological structure to enhance feature extraction. To achieve this, we introduce a PCA-augmented multi-layer heterogeneous graph-based network that concentrates on key features throughout the encoding-decoding phase. Our approach initiates with the construction of a heterogeneous graph from various similarity metrics, which is then encoded via a graph neural network. We concatenate and integrate the resultant representation vectors to merge multi-level information. Subsequently, principal component analysis is applied to distill the most informative features, with the random forest algorithm employed for the final decoding of the integrated data. Our method outperforms six baseline models in terms of accuracy, as demonstrated by extensive experimentation. Comprehensive ablation studies, visualization of results, and in-depth case analyses further validate our framework's efficacy and interpretability, providing a novel tool for drug discovery that integrates multimodal features.
Collapse
Affiliation(s)
- Guang Yang
- School of Information and Artificial Intelligence, Anhui Agricultural University, Changjiang West Road, Hefei, 230036, Anhui, China
| | - Yinbo Liu
- School of Information and Artificial Intelligence, Anhui Agricultural University, Changjiang West Road, Hefei, 230036, Anhui, China
| | - Sijian Wen
- School of Information and Artificial Intelligence, Anhui Agricultural University, Changjiang West Road, Hefei, 230036, Anhui, China
| | - Wenxi Chen
- School of Information and Artificial Intelligence, Anhui Agricultural University, Changjiang West Road, Hefei, 230036, Anhui, China
| | - Xiaolei Zhu
- School of Information and Artificial Intelligence, Anhui Agricultural University, Changjiang West Road, Hefei, 230036, Anhui, China
| | - Yongmei Wang
- School of Information and Artificial Intelligence, Anhui Agricultural University, Changjiang West Road, Hefei, 230036, Anhui, China.
| |
Collapse
|
30
|
Rawal O, Turhan B, Peradejordi IF, Chandrasekar S, Kalayci S, Gnjatic S, Johnson J, Bouhaddou M, Gümüş ZH. PhosNetVis: A web-based tool for fast kinase-substrate enrichment analysis and interactive 2D/3D network visualizations of phosphoproteomics data. PATTERNS (NEW YORK, N.Y.) 2025; 6:101148. [PMID: 39896259 PMCID: PMC11783894 DOI: 10.1016/j.patter.2024.101148] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/20/2024] [Revised: 11/12/2024] [Accepted: 12/11/2024] [Indexed: 02/04/2025]
Abstract
Protein phosphorylation involves the reversible modification of a protein (substrate) residue by another protein (kinase). Liquid chromatography-mass spectrometry studies are rapidly generating massive protein phosphorylation datasets across multiple conditions. Researchers then must infer kinases responsible for changes in phosphosites of each substrate. However, tools that infer kinase-substrate interactions (KSIs) are not optimized to interactively explore the resulting large and complex networks, significant phosphosites, and states. There is thus an unmet need for a tool that facilitates user-friendly analysis, interactive exploration, visualization, and communication of phosphoproteomics datasets. We present PhosNetVis, a web-based tool for researchers of all computational skill levels to easily infer, generate, and interactively explore KSI networks in 2D or 3D by streamlining phosphoproteomics data analysis steps within a single tool. PhostNetVis lowers barriers for researchers by rapidly generating high-quality visualizations to gain biological insights from their phosphoproteomics datasets. It is available at https://gumuslab.github.io/PhosNetVis/.
Collapse
Affiliation(s)
- Osho Rawal
- Department of Genetics and Genomics, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Berk Turhan
- Department of Genetics and Genomics, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Faculty of Engineering and Natural Sciences, Sabanci University, 34956 Istanbul, Türkiye
| | - Irene Font Peradejordi
- Department of Genetics and Genomics, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Cornell Tech, Cornell University, New York, NY 10044, USA
| | - Shreya Chandrasekar
- Department of Genetics and Genomics, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Cornell Tech, Cornell University, New York, NY 10044, USA
| | - Selim Kalayci
- Department of Genetics and Genomics, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Sacha Gnjatic
- Marc and Jennifer Lipschultz Precision Immunology Institute, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Department of Immunology and Immunotherapy, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Jeffrey Johnson
- Department of Microbiology, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Mehdi Bouhaddou
- Department of Microbiology, Immunology, and Molecular Genetics, University of California, Los Angeles, Los Angeles, CA 90095, USA
- Institute for Quantitative and Computational Biosciences, University of California, Los Angeles, Los Angeles, CA 90095, USA
| | - Zeynep H. Gümüş
- Department of Genetics and Genomics, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Marc and Jennifer Lipschultz Precision Immunology Institute, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| |
Collapse
|
31
|
Xu Y, Zhang Y, Song K, Liu J, Zhao R, Zhang X, Pei L, Li M, Chen Z, Zhang C, Wang P, Li F. ScDrugAct: a comprehensive database to dissect tumor microenvironment cell heterogeneity contributing to drug action and resistance across human cancers. Nucleic Acids Res 2025; 53:D1536-D1546. [PMID: 39526387 PMCID: PMC11701732 DOI: 10.1093/nar/gkae994] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2024] [Revised: 09/27/2024] [Accepted: 10/15/2024] [Indexed: 11/16/2024] Open
Abstract
The transcriptional heterogeneity of tumor microenvironment (TME) cells is a crucial factor driving the diversity of cellular response to drug treatment and resistance. Therefore, characterizing the cells associated with drug treatment and resistance will help us understand therapeutic mechanisms, discover new therapeutic targets and facilitate precision medicine. Here, we describe a database, scDrugAct (http://bio-bigdata.hrbmu.edu.cn/scDrugAct/), which aims to establish connections among drugs, genes and cells and dissect the impact of TME cellular heterogeneity on drug action and resistance at single-cell resolution. ScDrugAct is curated with drug-cell connections between 3838 223 cells across 34 cancer types and 13 857 drugs and identifies 17 274 drug perturbation/resistance-related genes and 276 559 associations between >10 000 drugs and 53 cell types. ScDrugAct also provides multiple flexible tools to retrieve and analyze connections among drugs, genes and cells; the distribution and developmental trajectories of drug-associated cells within the TME; functional features affecting the heterogeneity of cellular responses to drug perturbation and drug resistance; the cell-specific drug-related gene network; and drug-drug similarities. ScDrugAct serves as an important resource for investigating the impact of the cellular heterogeneity of the TME on drug therapies and can help researchers understand the mechanisms of action and resistance of drugs, as well as discover therapeutic targets.
Collapse
Affiliation(s)
- Yanjun Xu
- College of Bioinformatics Science and Technology, Harbin Medical University, 157 Baojian Road, Harbin 150081, China
| | - Yifang Zhang
- College of Bioinformatics Science and Technology, Harbin Medical University, 157 Baojian Road, Harbin 150081, China
| | - Kaiyue Song
- College of Bioinformatics Science and Technology, Harbin Medical University, 157 Baojian Road, Harbin 150081, China
| | - Jiaqi Liu
- College of Bioinformatics Science and Technology, Harbin Medical University, 157 Baojian Road, Harbin 150081, China
| | - Rui Zhao
- College of Bioinformatics Science and Technology, Harbin Medical University, 157 Baojian Road, Harbin 150081, China
| | - Xiaomeng Zhang
- College of Bioinformatics Science and Technology, Harbin Medical University, 157 Baojian Road, Harbin 150081, China
| | - Liying Pei
- College of Bioinformatics Science and Technology, Harbin Medical University, 157 Baojian Road, Harbin 150081, China
| | - Mengyue Li
- College of Bioinformatics Science and Technology, Harbin Medical University, 157 Baojian Road, Harbin 150081, China
| | - Zhe Chen
- College of Bioinformatics Science and Technology, Harbin Medical University, 157 Baojian Road, Harbin 150081, China
| | - Chunlong Zhang
- College of Bioinformatics Science and Technology, Harbin Medical University, 157 Baojian Road, Harbin 150081, China
| | - Peng Wang
- College of Bioinformatics Science and Technology, Harbin Medical University, 157 Baojian Road, Harbin 150081, China
| | - Feng Li
- College of Bioinformatics Science and Technology, Harbin Medical University, 157 Baojian Road, Harbin 150081, China
| |
Collapse
|
32
|
Zou H, Li S, Guo J, Wen L, Lv C, Leng F, Chen Z, Zeng M, Xu J, Li Y, Li X. Pan-cancer analysis reveals age-associated genetic alterations in protein domains. Am J Hum Genet 2025; 112:44-58. [PMID: 39708814 PMCID: PMC11739924 DOI: 10.1016/j.ajhg.2024.11.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2024] [Revised: 11/26/2024] [Accepted: 11/26/2024] [Indexed: 12/23/2024] Open
Abstract
Cancer incidence and mortality differ among individuals of different ages, but the functional consequences of genetic alterations remain largely unknown. We systematically characterized genetic alterations within protein domains stratified by affected individual's age and showed that the mutational effects on domains varied with age. We further identified potential age-associated driver genes with hotspots across 33 cancers. The candidate drivers involved numerous cancer-related genes that participate in various oncogenic pathways and play central roles in human protein-protein interaction (PPI) networks. We found widespread age biases in protein domains and identified the associations between hotspots and age. Age-stratified PPI networks perturbed by hotspots were constructed to illustrate the function of mutations enriched in domains. We found that hotspots in young adults were associated with premature senescence. In summary, we provided a catalog of age-associated hotspots and their perturbed networks, which may facilitate precision diagnostics and treatments for cancer.
Collapse
Affiliation(s)
- Haozhe Zou
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, Heilongjiang 150081, China
| | - Si Li
- School of Interdisciplinary Medicine and Engineering, Harbin Medical University, Harbin 150081, China
| | - Jiyu Guo
- School of Interdisciplinary Medicine and Engineering, Harbin Medical University, Harbin 150081, China
| | - Luan Wen
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, Heilongjiang 150081, China
| | - Chongwen Lv
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, Heilongjiang 150081, China
| | - Feng Leng
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, Heilongjiang 150081, China
| | - Zefeng Chen
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, Heilongjiang 150081, China
| | - Mengqian Zeng
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, Heilongjiang 150081, China
| | - Juan Xu
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, Heilongjiang 150081, China
| | - Yongsheng Li
- School of Interdisciplinary Medicine and Engineering, Harbin Medical University, Harbin 150081, China.
| | - Xia Li
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, Heilongjiang 150081, China.
| |
Collapse
|
33
|
Xie Y, Wang X, Wang P, Bi X. A pseudo-label supervised graph fusion attention network for drug–target interaction prediction. EXPERT SYSTEMS WITH APPLICATIONS 2025; 259:125264. [DOI: 10.1016/j.eswa.2024.125264] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/03/2025]
|
34
|
Wright SN, Colton S, Schaffer LV, Pillich RT, Churas C, Pratt D, Ideker T. State of the interactomes: an evaluation of molecular networks for generating biological insights. Mol Syst Biol 2025; 21:1-29. [PMID: 39653848 PMCID: PMC11697402 DOI: 10.1038/s44320-024-00077-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2024] [Revised: 11/07/2024] [Accepted: 11/11/2024] [Indexed: 12/18/2024] Open
Abstract
Advancements in genomic and proteomic technologies have powered the creation of large gene and protein networks ("interactomes") for understanding biological systems. However, the proliferation of interactomes complicates the selection of networks for specific applications. Here, we present a comprehensive evaluation of 45 current human interactomes, encompassing protein-protein interactions as well as gene regulatory, signaling, colocalization, and genetic interaction networks. Our analysis shows that large composite networks such as HumanNet, STRING, and FunCoup are most effective for identifying disease genes, while smaller networks such as DIP, Reactome, and SIGNOR demonstrate stronger performance in interaction prediction. Our study provides a benchmark for interactomes across diverse biological applications and clarifies factors that influence network performance. Furthermore, our evaluation pipeline paves the way for continued assessment of emerging and updated interaction networks in the future.
Collapse
Affiliation(s)
- Sarah N Wright
- Department of Medicine, University of California San Diego, La Jolla, CA, 92093, USA
| | - Scott Colton
- Department of Medicine, University of California San Diego, La Jolla, CA, 92093, USA
| | - Leah V Schaffer
- Department of Medicine, University of California San Diego, La Jolla, CA, 92093, USA
| | - Rudolf T Pillich
- Department of Medicine, University of California San Diego, La Jolla, CA, 92093, USA
| | - Christopher Churas
- Department of Medicine, University of California San Diego, La Jolla, CA, 92093, USA
| | - Dexter Pratt
- Department of Medicine, University of California San Diego, La Jolla, CA, 92093, USA
| | - Trey Ideker
- Department of Medicine, University of California San Diego, La Jolla, CA, 92093, USA.
| |
Collapse
|
35
|
Huang P, Gao W, Fu C, Wang M, Li Y, Chu B, He A, Li Y, Deng X, Zhang Y, Kong Q, Yuan J, Wang H, Shi Y, Gao D, Qin R, Hunter T, Tian R. Clinical functional proteomics of intercellular signalling in pancreatic cancer. Nature 2025; 637:726-735. [PMID: 39537929 DOI: 10.1038/s41586-024-08225-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2023] [Accepted: 10/15/2024] [Indexed: 11/16/2024]
Abstract
Pancreatic ductal adenocarcinoma (PDAC) has an atypical, highly stromal tumour microenvironment (TME) that profoundly contributes to its poor prognosis1. Here, to better understand the intercellular signalling between cancer and stromal cells directly in PDAC tumours, we developed a multidimensional proteomic strategy called TMEPro. We applied TMEPro to profile the glycosylated secreted and plasma membrane proteome of 100 human pancreatic tissue samples to a great depth, define cell type origins and identify potential paracrine cross-talk, especially that mediated through tyrosine phosphorylation. Temporal dynamics during pancreatic tumour progression were investigated in a genetically engineered PDAC mouse model. Functionally, we revealed reciprocal signalling between stromal cells and cancer cells mediated by the stromal PDGFR-PTPN11-FOS signalling axis. Furthermore, we examined the generic shedding mechanism of plasma membrane proteins in PDAC tumours and revealed that matrix-metalloprotease-mediated shedding of the AXL receptor tyrosine kinase ectodomain provides an additional dimension of intercellular signalling regulation in the PDAC TME. Importantly, the level of shed AXL has a potential correlation with lymph node metastasis, and inhibition of AXL shedding and its kinase activity showed a substantial synergistic effect in inhibiting cancer cell growth. In summary, we provide TMEPro, a generically applicable clinical functional proteomic strategy, and a comprehensive resource for better understanding the PDAC TME and facilitating the discovery of new diagnostic and therapeutic targets.
Collapse
Affiliation(s)
- Peiwu Huang
- State Key Laboratory of Medical Proteomics and Shenzhen Key Laboratory of Functional Proteomics, Department of Chemistry and Research Center for Chemical Biology and Omics Analysis, School of Science and Guangming Advanced Research Institute, Southern University of Science and Technology, Shenzhen, China
| | - Weina Gao
- State Key Laboratory of Medical Proteomics and Shenzhen Key Laboratory of Functional Proteomics, Department of Chemistry and Research Center for Chemical Biology and Omics Analysis, School of Science and Guangming Advanced Research Institute, Southern University of Science and Technology, Shenzhen, China
| | - Changying Fu
- State Key Laboratory of Medical Proteomics and Shenzhen Key Laboratory of Functional Proteomics, Department of Chemistry and Research Center for Chemical Biology and Omics Analysis, School of Science and Guangming Advanced Research Institute, Southern University of Science and Technology, Shenzhen, China
| | - Min Wang
- Department of Biliary-Pancreatic Surgery, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
| | - Yunguang Li
- Key Laboratory of Multi-Cell Systems, Center for Excellence in Molecular Cell Science, Shanghai Institute of Biochemistry and Cell Biology, Chinese Academy of Sciences, Shanghai, China
| | - Bizhu Chu
- State Key Laboratory of Medical Proteomics and Shenzhen Key Laboratory of Functional Proteomics, Department of Chemistry and Research Center for Chemical Biology and Omics Analysis, School of Science and Guangming Advanced Research Institute, Southern University of Science and Technology, Shenzhen, China
| | - An He
- State Key Laboratory of Medical Proteomics and Shenzhen Key Laboratory of Functional Proteomics, Department of Chemistry and Research Center for Chemical Biology and Omics Analysis, School of Science and Guangming Advanced Research Institute, Southern University of Science and Technology, Shenzhen, China
| | - Yuan Li
- State Key Laboratory of Medical Proteomics and Shenzhen Key Laboratory of Functional Proteomics, Department of Chemistry and Research Center for Chemical Biology and Omics Analysis, School of Science and Guangming Advanced Research Institute, Southern University of Science and Technology, Shenzhen, China
| | - Xiaomei Deng
- State Key Laboratory of Medical Proteomics and Shenzhen Key Laboratory of Functional Proteomics, Department of Chemistry and Research Center for Chemical Biology and Omics Analysis, School of Science and Guangming Advanced Research Institute, Southern University of Science and Technology, Shenzhen, China
| | - Yehan Zhang
- Key Laboratory of Multi-Cell Systems, Center for Excellence in Molecular Cell Science, Shanghai Institute of Biochemistry and Cell Biology, Chinese Academy of Sciences, Shanghai, China
| | - Qian Kong
- State Key Laboratory of Medical Proteomics and Shenzhen Key Laboratory of Functional Proteomics, Department of Chemistry and Research Center for Chemical Biology and Omics Analysis, School of Science and Guangming Advanced Research Institute, Southern University of Science and Technology, Shenzhen, China
| | - Jingxiong Yuan
- Department of Biliary-Pancreatic Surgery, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
| | - Hebin Wang
- Department of Biliary-Pancreatic Surgery, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
| | - Yu Shi
- Molecular and Cell Biology Laboratory, Salk Institute for Biological Studies, La Jolla, CA, USA.
- Bristol Myers Squibb, San Diego, CA, USA.
| | - Dong Gao
- Key Laboratory of Multi-Cell Systems, Center for Excellence in Molecular Cell Science, Shanghai Institute of Biochemistry and Cell Biology, Chinese Academy of Sciences, Shanghai, China.
- Institute of Cancer Research, Shenzhen Bay Laboratory, Shenzhen, China.
| | - Renyi Qin
- Department of Biliary-Pancreatic Surgery, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China.
| | - Tony Hunter
- Molecular and Cell Biology Laboratory, Salk Institute for Biological Studies, La Jolla, CA, USA
| | - Ruijun Tian
- State Key Laboratory of Medical Proteomics and Shenzhen Key Laboratory of Functional Proteomics, Department of Chemistry and Research Center for Chemical Biology and Omics Analysis, School of Science and Guangming Advanced Research Institute, Southern University of Science and Technology, Shenzhen, China.
| |
Collapse
|
36
|
Rao R, Gulfishan M, Kim MS, Kashyap MK. Deciphering Cancer Complexity: Integrative Proteogenomics and Proteomics Approaches for Biomarker Discovery. Methods Mol Biol 2025; 2859:211-237. [PMID: 39436604 DOI: 10.1007/978-1-0716-4152-1_12] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2024]
Abstract
Proteomics has revolutionized the field of cancer biology because the use of a large number of in vivo (SILAC), in vitro (iTRAQ, ICAT, TMT, stable-isotope Dimethyl, and 18O) labeling techniques or label-free methods (spectral counting or peak intensities) coupled with mass spectrometry enables us to profile and identify dysregulated proteins in diseases such as cancer. These proteome and genome studies have led to many challenges, such as the lack of consistency or correlation between copy numbers, RNA, and protein-level data. This review covers solely mass spectrometry-based approaches used for cancer biomarker discovery. It also touches on the emerging role of oncoproteogenomics or proteogenomics in cancer biomarker discovery and how this new area is attracting the integration of genomics and proteomics areas to address some of the important questions to help impinge on the biology and pathophysiology of different malignancies to make these mass spectrometry-based studies more realistic and relevant to clinical settings.
Collapse
Affiliation(s)
- Rashmi Rao
- School of Life and Allied Health Sciences, Glocal University, Saharanpur, UP, India
| | - Mohd Gulfishan
- School of Life and Allied Health Sciences, Glocal University, Saharanpur, UP, India
| | - Min-Sik Kim
- Department of New Biology, Daegu Gyeongbuk Institute of Science and Technology (DGIST), Daegu-42988, Republic of Korea
| | - Manoj Kumar Kashyap
- Amity Stem Cell Institute (ASCI), Amity Medical School (AMS), Amity University Haryana, Panchgaon (Manesar), Gurugram, Haryana, India.
| |
Collapse
|
37
|
Rawal O, Turhan B, Peradejordi IF, Chandrasekar S, Kalayci S, Gnjatic S, Johnson J, Bouhaddou M, Gümüş ZH. PhosNetVis: A web-based tool for fast kinase-substrate enrichment analysis and interactive 2D/3D network visualizations of phosphoproteomics data. ARXIV 2024:arXiv:2402.05016v4. [PMID: 39010877 PMCID: PMC11247916] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 07/17/2024]
Abstract
Protein phosphorylation involves the reversible modification of a protein (substrate) residue by another protein (kinase). Liquid chromatography-mass spectrometry studies are rapidly generating massive protein phosphorylation datasets across multiple conditions. Researchers then must infer kinases responsible for changes in phosphosites of each substrate. However, tools that infer kinase-substrate interactions (KSIs) are not optimized to interactively explore the resulting large and complex networks, significant phosphosites, and states. There is thus an unmet need for a tool that facilitates user-friendly analysis, interactive exploration, visualization, and communication of phosphoproteomics datasets. We present PhosNetVis, a web-based tool for researchers of all computational skill levels to easily infer, generate and interactively explore KSI networks in 2D or 3D by streamlining phosphoproteomics data analysis steps within a single tool. PhostNetVis lowers barriers for researchers in rapidly generating high-quality visualizations to gain biological insights from their phosphoproteomics datasets. It is available at: https://gumuslab.github.io/PhosNetVis/.
Collapse
Affiliation(s)
- Osho Rawal
- Department of Genetics and Genomics, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- These authors contributed equally
| | - Berk Turhan
- Department of Genetics and Genomics, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Faculty of Engineering and Natural Sciences, Sabanci University, Istanbul 34956, Türkiye
- These authors contributed equally
| | - Irene Font Peradejordi
- Department of Genetics and Genomics, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Cornell Tech, Cornell University, New York, NY 10044, USA
| | - Shreya Chandrasekar
- Department of Genetics and Genomics, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Cornell Tech, Cornell University, New York, NY 10044, USA
| | - Selim Kalayci
- Department of Genetics and Genomics, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Sacha Gnjatic
- Marc and Jennifer Lipschultz Precision Immunology Institute, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Department of Immunology and Immunotherapy, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Jeffrey Johnson
- Department of Microbiology, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Mehdi Bouhaddou
- Department of Microbiology, Immunology, and Molecular Genetics, University of California, Los Angeles; Los Angeles, CA 90095, USA
- Institute for Quantitative and Computational Biosciences, University of California, Los Angeles; Los Angeles, CA 90095, USA
| | - Zeynep H. Gümüş
- Department of Genetics and Genomics, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Marc and Jennifer Lipschultz Precision Immunology Institute, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Lead contact
| |
Collapse
|
38
|
Yin H, Duo H, Li S, Qin D, Xie L, Xiao Y, Sun J, Tao J, Zhang X, Li Y, Zou Y, Yang Q, Yang X, Hao Y, Li B. Unlocking biological insights from differentially expressed genes: Concepts, methods, and future perspectives. J Adv Res 2024:S2090-1232(24)00560-5. [PMID: 39647635 DOI: 10.1016/j.jare.2024.12.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2024] [Revised: 10/12/2024] [Accepted: 12/03/2024] [Indexed: 12/10/2024] Open
Abstract
BACKGROUND Identifying differentially expressed genes (DEGs) is a core task of transcriptome analysis, as DEGs can reveal the molecular mechanisms underlying biological processes. However, interpreting the biological significance of large DEG lists is challenging. Currently, gene ontology, pathway enrichment and protein-protein interaction analysis are common strategies employed by biologists. Additionally, emerging analytical strategies/approaches (such as network module analysis, knowledge graph, drug repurposing, cell marker discovery, trajectory analysis, and cell communication analysis) have been proposed. Despite these advances, comprehensive guidelines for systematically and thoroughly mining the biological information within DEGs remain lacking. AIM OF REVIEW This review aims to provide an overview of essential concepts and methodologies for the biological interpretation of DEGs, enhancing the contextual understanding. It also addresses the current limitations and future perspectives of these approaches, highlighting their broad applications in deciphering the molecular mechanism of complex diseases and phenotypes. To assist users in extracting insights from extensive datasets, especially various DEG lists, we developed DEGMiner (https://www.ciblab.net/DEGMiner/), which integrates over 300 easily accessible databases and tools. KEY SCIENTIFIC CONCEPTS OF REVIEW This review offers strong support and guidance for exploring DEGs, and also will accelerate the discovery of hidden biological insights within genomes.
Collapse
Affiliation(s)
- Huachun Yin
- College of Life Sciences, Chongqing Normal University, Chongqing 401331, PR China; Department of Neurosurgery, Xinqiao Hospital, The Army Medical University, Chongqing 400037, PR China; Department of Neurobiology, Chongqing Key Laboratory of Neurobiology, The Army Medical University, Chongqing 400038, PR China
| | - Hongrui Duo
- College of Life Sciences, Chongqing Normal University, Chongqing 401331, PR China
| | - Song Li
- Department of Neurosurgery, Xinqiao Hospital, The Army Medical University, Chongqing 400037, PR China
| | - Dan Qin
- Department of Biology, College of Science, Northeastern University, Boston, MA 02115, USA
| | - Lingling Xie
- College of Life Sciences, Chongqing Normal University, Chongqing 401331, PR China
| | - Yingxue Xiao
- College of Life Sciences, Chongqing Normal University, Chongqing 401331, PR China
| | - Jing Sun
- College of Life Sciences, Chongqing Normal University, Chongqing 401331, PR China
| | - Jingxin Tao
- College of Life Sciences, Chongqing Normal University, Chongqing 401331, PR China
| | - Xiaoxi Zhang
- College of Life Sciences, Chongqing Normal University, Chongqing 401331, PR China
| | - Yinghong Li
- Chongqing Key Laboratory of Big Data for Bio Intelligence, Chongqing University of Posts and Telecommunications, Chongqing 400065, PR China
| | - Yue Zou
- College of Life Sciences, Chongqing Normal University, Chongqing 401331, PR China
| | - Qingxia Yang
- Zhejiang Provincial Key Laboratory of Precision Diagnosis and Therapy for Major Gynecological Diseases, Women's Hospital, Zhejiang University School of Medicine, Hangzhou 310058, PR China
| | - Xian Yang
- College of Life Sciences, Chongqing Normal University, Chongqing 401331, PR China
| | - Youjin Hao
- College of Life Sciences, Chongqing Normal University, Chongqing 401331, PR China.
| | - Bo Li
- College of Life Sciences, Chongqing Normal University, Chongqing 401331, PR China.
| |
Collapse
|
39
|
Sluzala ZB, Shan Y, Elghazi L, Cárdenas EL, Hamati A, Garner AL, Fort PE. Novel mTORC2/HSPB4 Interaction: Role and Regulation of HSPB4 T148 Phosphorylation. Cells 2024; 13:2000. [PMID: 39682748 PMCID: PMC11640050 DOI: 10.3390/cells13232000] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2024] [Revised: 11/23/2024] [Accepted: 11/27/2024] [Indexed: 12/18/2024] Open
Abstract
HSPB4 and HSPB5 (α-crystallins) have shown increasing promise as neuroprotective agents, demonstrating several anti-apoptotic and protective roles in disorders such as multiple sclerosis and diabetic retinopathy. HSPs are highly regulated by post-translational modification, including deamidation, glycosylation, and phosphorylation. Among them, T148 phosphorylation has been shown to regulate the structural and functional characteristics of HSPB4 and underlie, in part, its neuroprotective capacity. We recently demonstrated that this phosphorylation is reduced in retinal tissues from patients with diabetic retinopathy, raising the question of its regulation during diseases. The kinase(s) responsible for regulating this phosphorylation, however, have yet to be identified. To this end, we employed a multi-tier strategy utilizing in vitro kinome profiling, bioinformatics, and chemoproteomics to predict and discover the kinases capable of phosphorylating T148. Several kinases were identified as being capable of specifically phosphorylating T148 in vitro, and further analysis highlighted mTORC2 as a particularly strong candidate. Altogether, our data demonstrate that the HSPB4-mTORC2 interaction is multi-faceted. Our data support the role of mTORC2 as a specific kinase phosphorylating HSPB4 at T148, but also provide evidence that the HSPB4 chaperone function further strengthens the interaction. This study addresses a critical gap in our understanding of the regulatory underpinnings of T148 phosphorylation-mediated neuroprotection.
Collapse
Affiliation(s)
- Zachary B. Sluzala
- Department of Ophthalmology & Visual Sciences, The University of Michigan, Ann Arbor, MI 48109, USA; (Z.B.S.); (Y.S.); (L.E.); (A.H.)
| | - Yang Shan
- Department of Ophthalmology & Visual Sciences, The University of Michigan, Ann Arbor, MI 48109, USA; (Z.B.S.); (Y.S.); (L.E.); (A.H.)
| | - Lynda Elghazi
- Department of Ophthalmology & Visual Sciences, The University of Michigan, Ann Arbor, MI 48109, USA; (Z.B.S.); (Y.S.); (L.E.); (A.H.)
| | - Emilio L. Cárdenas
- Interdepartmental Program in Medicinal Chemistry, The University of Michigan, Ann Arbor, MI 48109, USA; (E.L.C.); (A.L.G.)
| | - Angelina Hamati
- Department of Ophthalmology & Visual Sciences, The University of Michigan, Ann Arbor, MI 48109, USA; (Z.B.S.); (Y.S.); (L.E.); (A.H.)
| | - Amanda L. Garner
- Interdepartmental Program in Medicinal Chemistry, The University of Michigan, Ann Arbor, MI 48109, USA; (E.L.C.); (A.L.G.)
| | - Patrice E. Fort
- Department of Ophthalmology & Visual Sciences, The University of Michigan, Ann Arbor, MI 48109, USA; (Z.B.S.); (Y.S.); (L.E.); (A.H.)
- Department of Molecular & Integrative Physiology, The University of Michigan, Ann Arbor, MI 48109, USA
| |
Collapse
|
40
|
Nayar G, Altman RB. Heterogeneous network approaches to protein pathway prediction. Comput Struct Biotechnol J 2024; 23:2727-2739. [PMID: 39035835 PMCID: PMC11260399 DOI: 10.1016/j.csbj.2024.06.022] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2024] [Revised: 06/17/2024] [Accepted: 06/18/2024] [Indexed: 07/23/2024] Open
Abstract
Understanding protein-protein interactions (PPIs) and the pathways they comprise is essential for comprehending cellular functions and their links to specific phenotypes. Despite the prevalence of molecular data generated by high-throughput sequencing technologies, a significant gap remains in translating this data into functional information regarding the series of interactions that underlie phenotypic differences. In this review, we present an in-depth analysis of heterogeneous network methodologies for modeling protein pathways, highlighting the critical role of integrating multifaceted biological data. It outlines the process of constructing these networks, from data representation to machine learning-driven predictions and evaluations. The work underscores the potential of heterogeneous networks in capturing the complexity of proteomic interactions, thereby offering enhanced accuracy in pathway prediction. This approach not only deepens our understanding of cellular processes but also opens up new possibilities in disease treatment and drug discovery by leveraging the predictive power of comprehensive proteomic data analysis.
Collapse
Affiliation(s)
- Gowri Nayar
- Department of Biomedical Data Science, Stanford University, United States
| | - Russ B. Altman
- Department of Biomedical Data Science, Stanford University, United States
- Department of Genetics, Stanford University, United States
- Department of Medicine, Stanford University, United States
- Department of Bioengineering, Stanford University, United States
| |
Collapse
|
41
|
Cheng X, Meng X, Chen R, Song Z, Li S, Wei S, Lv H, Zhang S, Tang H, Jiang Y, Zhang R. The molecular subtypes of autoimmune diseases. Comput Struct Biotechnol J 2024; 23:1348-1363. [PMID: 38596313 PMCID: PMC11001648 DOI: 10.1016/j.csbj.2024.03.026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2023] [Revised: 03/27/2024] [Accepted: 03/27/2024] [Indexed: 04/11/2024] Open
Abstract
Autoimmune diseases (ADs) are characterized by their complexity and a wide range of clinical differences. Despite patients presenting with similar symptoms and disease patterns, their reactions to treatments may vary. The current approach of personalized medicine, which relies on molecular data, is seen as an effective method to address the variability in these diseases. This review examined the pathologic classification of ADs, such as multiple sclerosis and lupus nephritis, over time. Acknowledging the limitations inherent in pathologic classification, the focus shifted to molecular classification to achieve a deeper insight into disease heterogeneity. The study outlined the established methods and findings from the molecular classification of ADs, categorizing systemic lupus erythematosus (SLE) into four subtypes, inflammatory bowel disease (IBD) into two, rheumatoid arthritis (RA) into three, and multiple sclerosis (MS) into a single subtype. It was observed that the high inflammation subtype of IBD, the RA inflammation subtype, and the MS "inflammation & EGF" subtype share similarities. These subtypes all display a consistent pattern of inflammation that is primarily driven by the activation of the JAK-STAT pathway, with the effective drugs being those that target this signaling pathway. Additionally, by identifying markers that are uniquely associated with the various subtypes within the same disease, the study was able to describe the differences between subtypes in detail. The findings are expected to contribute to the development of personalized treatment plans for patients and establish a strong basis for tailored approaches to treating autoimmune diseases.
Collapse
Affiliation(s)
| | | | | | - Zerun Song
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Shuai Li
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Siyu Wei
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Hongchao Lv
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Shuhao Zhang
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Hao Tang
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Yongshuai Jiang
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Ruijie Zhang
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| |
Collapse
|
42
|
Zheng G, Wu D, Wei X, Xu D, Mao T, Yan D, Han W, Shang X, Chen Z, Qiu J, Tang K, Cao Z, Qiu T. PbsNRs: predict the potential binders and scaffolds for nuclear receptors. Brief Bioinform 2024; 26:bbae710. [PMID: 39798999 PMCID: PMC11724720 DOI: 10.1093/bib/bbae710] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2024] [Revised: 12/09/2024] [Accepted: 12/26/2024] [Indexed: 01/15/2025] Open
Abstract
Nuclear receptors (NRs) are a class of essential proteins that regulate the expression of specific genes and are associated with multiple diseases. In silico methods for prescreening potential NR binders with predictive binding ability are highly desired for NR-related drug development but are rarely reported. Here, we present the PbsNRs (Predicting binders and scaffolds for Nuclear Receptors), a user-friendly web server designed to predict the potential NR binders and scaffolds through proteochemometric modeling. The utility of PbsNRs was systemically evaluated using both chemical compounds and natural products. Results indicated that PbsNRs achieved a good prediction performance for chemical compounds on internal (ROC-AUC = 0.906, where ROC is Receiver-Operating Characteristic curve and AUC is the Area Under the Curve) and external (ROC-AUC = 0.783) datasets, outperforming both compound-ligand interaction tools and NR-specific predictors. PbsNRs also successfully identified bioactive chemical scaffolds for NRs by screening massive natural products. Moreover, the predicted bioactive and inactive natural products for NR2B1 were experimentally validated using biosensors. PbsNRs not only aids in screening potential therapeutic NR binders but also helps discover the essential molecular scaffold and guide the drug discovery for multiple NR-related diseases. The PbsNRs web server is available at http://pbsnrs.badd-cao.net.
Collapse
Affiliation(s)
- Genhui Zheng
- Institute of Clinical Science, Zhongshan Hospital, Shanghai Medical College, Shanghai Institute of Infectious Disease and Biosecurity, Intelligent Medicine Institute, School of Life Sciences, Fudan University, No. 180 Fenglin Road, Shanghai 200032, China
- Oden Institute for Computational Engineering and Sciences (ICES), University of Texas at Austin, No. 201 East 24th Street, Austin 78712, TX, United States
| | - Dingfeng Wu
- National Center, Children’s Hospital, Zhejiang University School of Medicine, National Clinical Research Center for Child Health, No. 3333 Binsheng Road, Hangzhou 310052, China
| | - Xiuxia Wei
- School of Health Science and Engineering, University of Shanghai for Science and Technology, No. 516 Jungong Road, Yangpu District, Shanghai 200093, China
| | - Dongpo Xu
- School of Health Science and Engineering, University of Shanghai for Science and Technology, No. 516 Jungong Road, Yangpu District, Shanghai 200093, China
| | - Tiantian Mao
- School of Life Sciences and Technology, Tongji University, No. 1239 Siping Road, Shanghai 200092, China
| | - Deyu Yan
- School of Life Sciences and Technology, Tongji University, No. 1239 Siping Road, Shanghai 200092, China
| | - Wenhao Han
- School of Life Sciences and Technology, Tongji University, No. 1239 Siping Road, Shanghai 200092, China
| | - Xiaoxiao Shang
- Institute of Clinical Science, Zhongshan Hospital, Shanghai Medical College, Shanghai Institute of Infectious Disease and Biosecurity, Intelligent Medicine Institute, School of Life Sciences, Fudan University, No. 180 Fenglin Road, Shanghai 200032, China
- Department of Mathematics and Statistics, McGill University, 805 Sherbrooke Street West, Montreal H3A 0B9, Quebec, Canada
| | - Zikun Chen
- School of Life Sciences and Technology, Tongji University, No. 1239 Siping Road, Shanghai 200092, China
| | - Jingxuan Qiu
- School of Health Science and Engineering, University of Shanghai for Science and Technology, No. 516 Jungong Road, Yangpu District, Shanghai 200093, China
| | - Kailin Tang
- School of Life Sciences and Technology, Tongji University, No. 1239 Siping Road, Shanghai 200092, China
| | - Zhiwei Cao
- Institute of Clinical Science, Zhongshan Hospital, Shanghai Medical College, Shanghai Institute of Infectious Disease and Biosecurity, Intelligent Medicine Institute, School of Life Sciences, Fudan University, No. 180 Fenglin Road, Shanghai 200032, China
| | - Tianyi Qiu
- Institute of Clinical Science, Zhongshan Hospital, Shanghai Medical College, Shanghai Institute of Infectious Disease and Biosecurity, Intelligent Medicine Institute, School of Life Sciences, Fudan University, No. 180 Fenglin Road, Shanghai 200032, China
| |
Collapse
|
43
|
Wiśniewski J, Więcek K, Ali H, Pyrc K, Kula-Păcurar A, Wagner M, Chen HC. Distinguishable topology of the task-evoked functional genome networks in HIV-1 reservoirs. iScience 2024; 27:111222. [PMID: 39559761 PMCID: PMC11570469 DOI: 10.1016/j.isci.2024.111222] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2024] [Revised: 10/07/2024] [Accepted: 10/18/2024] [Indexed: 11/20/2024] Open
Abstract
HIV-1 reservoirs display a heterogeneous nature, lodging both intact and defective proviruses. To deepen our understanding of such heterogeneous HIV-1 reservoirs and their functional implications, we integrated basic concepts of graph theory to characterize the composition of HIV-1 reservoirs. Our analysis revealed noticeable topological properties in networks, featuring immunologic signatures enriched by genes harboring intact and defective proviruses, when comparing antiretroviral therapy (ART)-treated HIV-1-infected individuals and elite controllers. The key variable, the rich factor, played a pivotal role in classifying distinct topological properties in networks. The host gene expression strengthened the accuracy of classification between elite controllers and ART-treated patients. Markov chain modeling for the simulation of different graph networks demonstrated the presence of an intrinsic barrier between elite controllers and non-elite controllers. Overall, our work provides a prime example of leveraging genomic approaches alongside mathematical tools to unravel the complexities of HIV-1 reservoirs.
Collapse
Affiliation(s)
- Janusz Wiśniewski
- Quantitative Virology Research Group, Population Diagnostics Center, Łukasiewicz Research Network – PORT Polish Center for Technology Development, Stabłowicka 147, 54-066 Wrocław, Poland
| | - Kamil Więcek
- Quantitative Virology Research Group, Population Diagnostics Center, Łukasiewicz Research Network – PORT Polish Center for Technology Development, Stabłowicka 147, 54-066 Wrocław, Poland
| | - Haider Ali
- Molecular Virology Group, Małopolska Centre of Biotechnology, Jagiellonian University, Gronostajowa 7A str, 30-387 Kraków, Poland
- Doctoral School of Exact and Natural Sciences, Jagiellonian University, Łojasiewicza 11, 30-348 Kraków, Poland
| | - Krzysztof Pyrc
- Virogenetics Laboratory of Virology, Małopolska Centre of Biotechnology, Jagiellonian University, Gronostajowa 7A str, 30-387 Kraków, Poland
| | - Anna Kula-Păcurar
- Molecular Virology Group, Małopolska Centre of Biotechnology, Jagiellonian University, Gronostajowa 7A str, 30-387 Kraków, Poland
| | - Marek Wagner
- Innate Immunity Research Group, Life Sciences and Biotechnology Center, Łukasiewicz Research Network – PORT Polish Center for Technology Development, Stabłowicka 147, 54-066 Wrocław, Poland
| | - Heng-Chang Chen
- Quantitative Virology Research Group, Population Diagnostics Center, Łukasiewicz Research Network – PORT Polish Center for Technology Development, Stabłowicka 147, 54-066 Wrocław, Poland
| |
Collapse
|
44
|
Lin X, Chang X, Zhang Y, Gao Z, Chi X. Automatic construction of Petri net models for computational simulations of molecular interaction network. NPJ Syst Biol Appl 2024; 10:131. [PMID: 39521772 PMCID: PMC11550427 DOI: 10.1038/s41540-024-00464-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2024] [Accepted: 10/30/2024] [Indexed: 11/16/2024] Open
Abstract
Petri nets are commonly applied in modeling biological systems. However, construction of a Petri net model for complex biological systems is often time consuming, and requires expertise in the research area, limiting their application. To address this challenge, we developed GINtoSPN, an R package that automates the conversion of multi-omics molecular interaction network extracted from the Global Integrative Network (GIN) into Petri nets in GraphML format. These GraphML files can be directly used for Signaling Petri Net (SPN) simulation. To demonstrate the utility of this tool, we built a Petri net model for neurofibromatosis type I. Simulation of NF1 gene knockout, compared to normal skin fibroblast cells, revealed persistent accumulation of Ras-GTPs as expected. Additionally, we identified several other genes substantially affected by the loss of NF1's function, exhibiting individual-specific variability. These results highlight the effectiveness of GINtoSPN in streamlining the modeling and simulation of complex biological systems.
Collapse
Affiliation(s)
- Xuefei Lin
- Department of Dermatology and Venereal Disease, Xuan Wu Hospital, Beijing, China
| | - Xiao Chang
- Department of Dermatology and Venereal Disease, Xuan Wu Hospital, Beijing, China
| | - Yizheng Zhang
- China National Center for Bioinformation, Beijing, China
- CAS Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Zhanyu Gao
- China National Center for Bioinformation, Beijing, China
- CAS Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
- HKU Li Ka Shing Faculty of Medicine, Hong Kong, China
| | - Xu Chi
- China National Center for Bioinformation, Beijing, China.
- CAS Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, China.
| |
Collapse
|
45
|
E U, T M, A V G, D P. A comprehensive survey of drug-target interaction analysis in allopathy and siddha medicine. Artif Intell Med 2024; 157:102986. [PMID: 39326289 DOI: 10.1016/j.artmed.2024.102986] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2023] [Revised: 08/13/2024] [Accepted: 09/18/2024] [Indexed: 09/28/2024]
Abstract
Effective drug delivery is the cornerstone of modern healthcare, ensuring therapeutic compounds reach their intended targets efficiently. This paper explores the potential of personalized and holistic healthcare, driven by the synergy between traditional and allopathic medicine systems, with a specific focus on the vast reservoir of medicinal compounds found in plants rooted in the historical legacy of traditional medicine. Motivated by the desire to unlock the therapeutic potential of medicinal plants and bridge the gap between traditional and allopathic medicine, this survey delves into in-silico computational approaches for studying Drug-Target Interactions (DTI) within the contexts of allopathy and siddha medicine. The contributions of this survey are multifaceted: it offers a comprehensive overview of in-silico methods for DTI analysis in both systems, identifies common challenges in DTI studies, provides insights into future directions to advance DTI analysis, and includes a comparative analysis of DTI in allopathy and siddha medicine. The findings of this survey highlight the pivotal role of in-silico computational approaches in advancing drug research and development in both allopathy and siddha medicine, emphasizing the importance of integrating these methods to drive the future of personalized healthcare.
Collapse
Affiliation(s)
- Uma E
- Department of Information Science and Technology, College of Engineering Guindy, Chennai, India.
| | - Mala T
- Department of Information Science and Technology, College of Engineering Guindy, Chennai, India
| | - Geetha A V
- Department of Information Science and Technology, College of Engineering Guindy, Chennai, India
| | - Priyanka D
- Department of Information Science and Technology, College of Engineering Guindy, Chennai, India
| |
Collapse
|
46
|
Liu B, Tsoumakas G. Integrating Similarities via Local Interaction Consistency and Optimizing Area Under the Curve Measures via Matrix Factorization for Drug-Target Interaction Prediction. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2024; 21:2212-2225. [PMID: 39226198 DOI: 10.1109/tcbb.2024.3453499] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/05/2024]
Abstract
In drug discovery, identifying drug-target interactions (DTIs) via experimental approaches is a tedious and expensive procedure. Computational methods efficiently predict DTIs and recommend a small part of potential interacting pairs for further experimental confirmation, accelerating the drug discovery process. Although fusing heterogeneous drug and target similarities can improve the prediction ability, the existing similarity combination methods ignore the interaction consistency for neighbour entities. Furthermore, area under the precision-recall curve (AUPR) and area under the receiver operating characteristic curve (AUC) are two widely used evaluation metrics in DTI prediction. However, the two metrics are seldom considered as losses within existing DTI prediction methods. We propose a local interaction consistency (LIC) aware similarity integration method to fuse vital information from diverse views for DTI prediction models. Furthermore, we propose two matrix factorization (MF) methods that optimize AUPR and AUC using convex surrogate losses respectively, and then develop an ensemble MF approach that takes advantage of the two area under the curve metrics by combining the two single metric based MF models. Experimental results under different prediction settings show that the proposed methods outperform various competitors in terms of the metric(s) they optimize and are reliable in discovering potential new DTIs.
Collapse
|
47
|
Zhang M, Hong Y, Shen L, Xu S, Xu Y, Zhang X, Liu J, Liu X. A heterogeneous graph neural network with automatic discovery of effective metapaths for drug–target interaction prediction. FUTURE GENERATION COMPUTER SYSTEMS 2024; 160:283-294. [DOI: 10.1016/j.future.2024.05.054] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/03/2025]
|
48
|
Xu B, Chen J, Wang Y, Fu Q, Lu Y. Reinforced Metapath Optimization in Heterogeneous Information Networks for Drug-Target Interaction Prediction. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2024; 21:2315-2329. [PMID: 39316496 DOI: 10.1109/tcbb.2024.3467135] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/26/2024]
Abstract
Graph neural networks offer an effective avenue for predicting drug-target interactions. In this domain, researchers have found that constructing heterogeneous information networks based on metapaths using diverse biological datasets enhances prediction performance. However, the performance of such methods is closely tied to the selection of metapaths and the compatibility between metapath subgraphs and graph neural networks. Most existing approaches still rely on fixed strategies for selecting metapaths and often fail to fully exploit node information along the metapaths, limiting the improvement in model performance. This paper introduces a novel method for predicting drug-target interactions by optimizing metapaths in heterogeneous information networks. On one hand, the method formulates the metapath optimization problem as a Markov decision process, using the enhancement of downstream network performance as a reward signal. Through iterative training of a reinforcement learning agent, a high-quality set of metapaths is learned. On the other hand, to fully leverage node information along the metapaths, the paper constructs subgraphs based on nodes along the metapaths. Different depths of subgraphs are processed using different graph convolutional neural network. The proposed method is validated using standard heterogeneous biological benchmark datasets. Experimental results on standard datasets show significant advantages over traditional methods.
Collapse
|
49
|
Ayalvari S, Kaedi M, Sehhati M. A modified multiple-criteria decision-making approach based on a protein-protein interaction network to diagnose latent tuberculosis. BMC Med Inform Decis Mak 2024; 24:319. [PMID: 39478591 PMCID: PMC11523813 DOI: 10.1186/s12911-024-02668-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2024] [Accepted: 09/05/2024] [Indexed: 11/02/2024] Open
Abstract
BACKGROUND DNA microarrays provide informative data for transcriptional profiling and identifying gene expression signatures to help prevent progression of latent tuberculosis infection (LTBI) to active disease. However, constructing a prognostic model for distinguishing LTBI from active tuberculosis (ATB) is very challenging due to the noisy nature of data and lack of a generally stable analysis approach. METHODS In the present study, we proposed an accurate predictive model with the help of data fusion at the decision level. In this regard, results of filter feature selection and wrapper feature selection techniques were combined with multiple-criteria decision-making (MCDM) methods to select 10 genes from six microarray datasets that can be the most discriminative genes for diagnosing tuberculosis cases. As the main contribution of this study, the final ranking function was constructed by combining protein-protein interaction (PPI) network with an MCDM method (called Decision-making Trial and Evaluation Laboratory or DEMATEL) to improve the feature ranking approach. RESULTS By applying data fusion at the decision level on the 10 introduced genes in terms of fusion of classifiers of random forests (RF) and k-nearest neighbors (KNN) regarding Yager's theory, the proposed algorithm reached a sensitivity of 0.97, specificity of 0.90, and accuracy of 0.95. Finally, with the help of cumulative clustering, the genes involved in the diagnosis of latent and activated tuberculosis have been introduced. CONCLUSIONS The combination of MCDM methods and PPI networks can significantly improve the diagnosis different states of tuberculosis. CLINICAL TRIAL NUMBER Not applicable.
Collapse
Affiliation(s)
- Somayeh Ayalvari
- Faculty of Computer Engineering, University of Isfahan, Isfahan, Iran
| | - Marjan Kaedi
- Faculty of Computer Engineering, University of Isfahan, Isfahan, Iran.
| | - Mohammadreza Sehhati
- Department of Biomedical Engineering, School of Advanced Medical Technology, Isfahan University of Medical Sciences, Isfahan, Iran
| |
Collapse
|
50
|
Xiong D, Qiu Y, Zhao J, Zhou Y, Lee D, Gupta S, Torres M, Lu W, Liang S, Kang JJ, Eng C, Loscalzo J, Cheng F, Yu H. A structurally informed human protein-protein interactome reveals proteome-wide perturbations caused by disease mutations. Nat Biotechnol 2024:10.1038/s41587-024-02428-4. [PMID: 39448882 DOI: 10.1038/s41587-024-02428-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2024] [Accepted: 09/11/2024] [Indexed: 10/26/2024]
Abstract
To assist the translation of genetic findings to disease pathobiology and therapeutics discovery, we present an ensemble deep learning framework, termed PIONEER (Protein-protein InteractiOn iNtErfacE pRediction), that predicts protein-binding partner-specific interfaces for all known protein interactions in humans and seven other common model organisms to generate comprehensive structurally informed protein interactomes. We demonstrate that PIONEER outperforms existing state-of-the-art methods and experimentally validate its predictions. We show that disease-associated mutations are enriched in PIONEER-predicted protein-protein interfaces and explore their impact on disease prognosis and drug responses. We identify 586 significant protein-protein interactions (PPIs) enriched with PIONEER-predicted interface somatic mutations (termed oncoPPIs) from analysis of approximately 11,000 whole exomes across 33 cancer types and show significant associations of oncoPPIs with patient survival and drug responses. PIONEER, implemented as both a web server platform and a software package, identifies functional consequences of disease-associated alleles and offers a deep learning tool for precision medicine at multiscale interactome network levels.
Collapse
Grants
- R01GM124559 U.S. Department of Health & Human Services | NIH | National Institute of General Medical Sciences (NIGMS)
- R01GM125639 U.S. Department of Health & Human Services | NIH | National Institute of General Medical Sciences (NIGMS)
- R01GM130885 U.S. Department of Health & Human Services | NIH | National Institute of General Medical Sciences (NIGMS)
- RM1GM139738 U.S. Department of Health & Human Services | NIH | National Institute of General Medical Sciences (NIGMS)
- R01DK115398 U.S. Department of Health & Human Services | NIH | National Institute of Diabetes and Digestive and Kidney Diseases (National Institute of Diabetes & Digestive & Kidney Diseases)
- U01HG007691 U.S. Department of Health & Human Services | NIH | National Human Genome Research Institute (NHGRI)
- R01HL155107 U.S. Department of Health & Human Services | NIH | National Heart, Lung, and Blood Institute (NHLBI)
- R01HL155096 U.S. Department of Health & Human Services | NIH | National Heart, Lung, and Blood Institute (NHLBI)
- R01HL166137 U.S. Department of Health & Human Services | NIH | National Heart, Lung, and Blood Institute (NHLBI)
- U54HL119145 U.S. Department of Health & Human Services | NIH | National Heart, Lung, and Blood Institute (NHLBI)
- AHA957729 American Heart Association (American Heart Association, Inc.)
- 24MERIT1185447 American Heart Association (American Heart Association, Inc.)
- R01AG084250 U.S. Department of Health & Human Services | NIH | National Institute on Aging (U.S. National Institute on Aging)
- R56AG074001 U.S. Department of Health & Human Services | NIH | National Institute on Aging (U.S. National Institute on Aging)
- U01AG073323 U.S. Department of Health & Human Services | NIH | National Institute on Aging (U.S. National Institute on Aging)
- R01AG066707 U.S. Department of Health & Human Services | NIH | National Institute on Aging (U.S. National Institute on Aging)
- R01AG076448 U.S. Department of Health & Human Services | NIH | National Institute on Aging (U.S. National Institute on Aging)
- R01AG082118 U.S. Department of Health & Human Services | NIH | National Institute on Aging (U.S. National Institute on Aging)
- RF1AG082211 U.S. Department of Health & Human Services | NIH | National Institute on Aging (U.S. National Institute on Aging)
- R21AG083003 U.S. Department of Health & Human Services | NIH | National Institute on Aging (U.S. National Institute on Aging)
- RF1NS133812 U.S. Department of Health & Human Services | NIH | National Institute of Neurological Disorders and Stroke (NINDS)
Collapse
Affiliation(s)
- Dapeng Xiong
- Department of Computational Biology, Cornell University, Ithaca, NY, USA
- Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, NY, USA
- Center for Innovative Proteomics, Cornell University, Ithaca, NY, USA
| | - Yunguang Qiu
- Cleveland Clinic Genome Center, Lerner Research Institute, Cleveland Clinic, Cleveland, OH, USA
- Genomic Medicine Institute, Lerner Research Institute, Cleveland Clinic, Cleveland, OH, USA
| | - Junfei Zhao
- Department of Systems Biology, Herbert Irving Comprehensive Center, Columbia University, New York, NY, USA
| | - Yadi Zhou
- Cleveland Clinic Genome Center, Lerner Research Institute, Cleveland Clinic, Cleveland, OH, USA
- Genomic Medicine Institute, Lerner Research Institute, Cleveland Clinic, Cleveland, OH, USA
| | - Dongjin Lee
- Department of Computational Biology, Cornell University, Ithaca, NY, USA
- Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, NY, USA
| | - Shobhita Gupta
- Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, NY, USA
- Center for Innovative Proteomics, Cornell University, Ithaca, NY, USA
- Biophysics Program, Cornell University, Ithaca, NY, USA
| | - Mateo Torres
- Department of Computational Biology, Cornell University, Ithaca, NY, USA
- Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, NY, USA
- Center for Innovative Proteomics, Cornell University, Ithaca, NY, USA
| | - Weiqiang Lu
- Shanghai Key Laboratory of Regulatory Biology, Institute of Biomedical Sciences and School of Life Sciences, East China Normal University, Shanghai, China
| | - Siqi Liang
- Department of Computational Biology, Cornell University, Ithaca, NY, USA
- Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, NY, USA
| | - Jin Joo Kang
- Department of Computational Biology, Cornell University, Ithaca, NY, USA
- Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, NY, USA
- Center for Innovative Proteomics, Cornell University, Ithaca, NY, USA
| | - Charis Eng
- Cleveland Clinic Genome Center, Lerner Research Institute, Cleveland Clinic, Cleveland, OH, USA
- Department of Molecular Medicine, Cleveland Clinic Lerner College of Medicine, Case Western Reserve University, Cleveland, OH, USA
- Case Comprehensive Cancer Center, Case Western Reserve University School of Medicine, Cleveland, OH, USA
| | - Joseph Loscalzo
- Channing Division of Network Medicine, Division of Cardiovascular Medicine, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
| | - Feixiong Cheng
- Cleveland Clinic Genome Center, Lerner Research Institute, Cleveland Clinic, Cleveland, OH, USA.
- Genomic Medicine Institute, Lerner Research Institute, Cleveland Clinic, Cleveland, OH, USA.
- Department of Molecular Medicine, Cleveland Clinic Lerner College of Medicine, Case Western Reserve University, Cleveland, OH, USA.
- Case Comprehensive Cancer Center, Case Western Reserve University School of Medicine, Cleveland, OH, USA.
| | - Haiyuan Yu
- Department of Computational Biology, Cornell University, Ithaca, NY, USA.
- Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, NY, USA.
- Center for Innovative Proteomics, Cornell University, Ithaca, NY, USA.
| |
Collapse
|