1
|
Anti-HIV Potential of Beesioside I Derivatives as Maturation Inhibitors: Synthesis, 3D-QSAR, Molecular Docking and Molecular Dynamics Simulations. Int J Mol Sci 2023; 24:ijms24021430. [PMID: 36674943 PMCID: PMC9867151 DOI: 10.3390/ijms24021430] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2022] [Revised: 01/04/2023] [Accepted: 01/09/2023] [Indexed: 01/13/2023] Open
Abstract
HIV-1 maturation is the final step in the retroviral lifecycle that is regulated by the proteolytic cleavage of the Gag precursor protein. As a first-in-class HIV-1 maturation inhibitor (MI), bevirimat blocks virion maturation by disrupting capsid-spacer peptide 1 (CA-SP1) cleavage, which acts as the target of MIs. Previous alterations of beesioside I (1) produced (20S,24S)-15ꞵ,16ꞵ-diacetoxy-18,24; 20,24-diepoxy-9,19-cyclolanostane-3ꞵ,25-diol 3-O-3′,3′-dimethylsuccinate (3, DSC), showing similar anti-HIV potency compared to bevirimat. To ascertain the binding modes of this derivative, further modification of compound 1 was conducted. Three-dimensional quantitative structure−activity relationship (3D-QSAR) analysis combined with docking simulations and molecular dynamics (MD) were conducted. Five new derivatives were synthesized, among which compound 3b showed significant activity against HIV-1NL4-3 with an EC50 value of 0.28 µM. The developed 3D-QSAR model resulted in great predictive ability with training set (r2 = 0.99, q2 = 0.55). Molecular docking studies were complementary to the 3D-QSAR analysis, showing that DSC was differently bound to CA-SP1 with higher affinity than that of bevirimat. MD studies revealed that the complex of the ligand and the protein was stable, with root mean square deviation (RMSD) values <2.5 Å. The above results provided valuable insights into the potential of DSC as a prototype to develop new antiviral agents.
Collapse
|
2
|
Tsuchiya Y, Yamamori Y, Tomii K. Protein-protein interaction prediction methods: from docking-based to AI-based approaches. Biophys Rev 2022; 14:1341-1348. [PMID: 36570321 PMCID: PMC9759050 DOI: 10.1007/s12551-022-01032-7] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2022] [Accepted: 11/30/2022] [Indexed: 12/23/2022] Open
Abstract
Protein-protein interactions (PPIs), such as protein-protein inhibitor, antibody-antigen complex, and supercomplexes play diverse and important roles in cells. Recent advances in structural analysis methods, including cryo-EM, for the determination of protein complex structures are remarkable. Nevertheless, much room remains for improvement and utilization of computational methods to predict PPIs because of the large number and great diversity of unresolved complex structures. This review introduces a wide array of computational methods, including our own, for estimating PPIs including antibody-antigen interactions, offering both historical and forward-looking perspectives.
Collapse
Affiliation(s)
- Yuko Tsuchiya
- grid.208504.b0000 0001 2230 7538Artificial Intelligence Research Center (AIRC), National Institute of Advanced Industrial Science and Technology (AIST), 2-4-7 Aomi, Koto-Ku, Tokyo, 135-0064 Japan
| | - Yu Yamamori
- grid.208504.b0000 0001 2230 7538Artificial Intelligence Research Center (AIRC), National Institute of Advanced Industrial Science and Technology (AIST), 2-4-7 Aomi, Koto-Ku, Tokyo, 135-0064 Japan
| | - Kentaro Tomii
- grid.208504.b0000 0001 2230 7538Artificial Intelligence Research Center (AIRC), National Institute of Advanced Industrial Science and Technology (AIST), 2-4-7 Aomi, Koto-Ku, Tokyo, 135-0064 Japan
| |
Collapse
|
3
|
Li R, Lee JY, Yang JM, Akutsu T. Densest subgraph-based methods for protein-protein interaction hot spot prediction. BMC Bioinformatics 2022; 23:451. [PMID: 36316653 PMCID: PMC9623998 DOI: 10.1186/s12859-022-04996-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2022] [Accepted: 10/19/2022] [Indexed: 11/29/2022] Open
Abstract
Background Hot spots play an important role in protein binding analysis. The residue interaction network is a key point in hot spot prediction, and several graph theory-based methods have been proposed to detect hot spots. Although the existing methods can yield some interesting residues by network analysis, low recall has limited their abilities in finding more potential hot spots. Result In this study, we develop three graph theory-based methods to predict hot spots from only a single residue interaction network. We detect the important residues by finding subgraphs with high densities, i.e., high average degrees. Generally, a high degree implies a high binding possibility between protein chains, and thus a subgraph with high density usually relates to binding sites that have a high rate of hot spots. By evaluating the results on 67 complexes from the SKEMPI database, our methods clearly outperform existing graph theory-based methods on recall and F-score. In particular, our main method, Min-SDS, has an average recall of over 0.665 and an f2-score of over 0.364, while the recall and f2-score of the existing methods are less than 0.400 and 0.224, respectively. Conclusion The Min-SDS method performs best among all tested methods on the hot spot prediction problem, and all three of our methods provide useful approaches for analyzing bionetworks. In addition, the densest subgraph-based methods predict hot spots with only one residue interaction network, which is constructed from spatial atomic coordinate data to mitigate the shortage of data from wet-lab experiments.
Collapse
Affiliation(s)
- Ruiming Li
- grid.258799.80000 0004 0372 2033Bioinformatics Center, Institute for Chemical Research, Kyoto University, Uji, 611-0011 Kyoto Japan
| | - Jung-Yu Lee
- grid.260539.b0000 0001 2059 7017Institute of Bioinformatics and Systems Biology, National Yang Ming Chiao Tung University, Hsinchu, 300 Taiwan
| | - Jinn-Moon Yang
- grid.260539.b0000 0001 2059 7017Institute of Bioinformatics and Systems Biology, National Yang Ming Chiao Tung University, Hsinchu, 300 Taiwan ,grid.260539.b0000 0001 2059 7017Department of Biological Science and Technology, National Yang Ming Chiao Tung University, Hsinchu, 300 Taiwan
| | - Tatsuya Akutsu
- grid.258799.80000 0004 0372 2033Bioinformatics Center, Institute for Chemical Research, Kyoto University, Uji, 611-0011 Kyoto Japan
| |
Collapse
|
4
|
Kumari K, Sharma PK, Aggarwal Y, Singh RP. Secretome analysis of an environmental isolate Enterobacter sp. S-33 identifies proteins related to pathogenicity. Arch Microbiol 2022; 204:662. [PMID: 36198868 DOI: 10.1007/s00203-022-03277-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2022] [Revised: 09/16/2022] [Accepted: 09/28/2022] [Indexed: 11/29/2022]
Abstract
Enterobacter species are responsible for causing infections of the lower respiratory tract, urinary tract, meninges, etc. Proteins secreted by these species may act as determinants of host-pathogen interaction and play a role in virulence. Among the secreted proteins, the Type VI secretion system (T6SS) acts as a molecular nanomachine to deliver many effector proteins directly into prey cells in a contact-dependent manner. The secreted proteins may provide an idea for the interaction of bacteria to their environment and an understanding of the role of these proteins for their role in bacterial physiology and behaviour. Therefore, aim of this study was to characterize the secreted proteins in the culture supernatant by a T6SS bacterium Enterobacter sp. S-33 using nano-LC-MS/MS tool. Using a combined mass spectrometry and bioinformatics approach, we identified a total of 736 proteins in the secretome. Bioinformatics analysis predicting subcellular localization identified 110 of the secreted proteins possessed signal sequences. By gene ontology analysis, more than 80 proteins of the secretome were classified into biological or molecular functions. More than 20 percent of secretome proteins were virulence proteins including T6SS proteins, proteins involved in adherence and fimbriae formation, molecular chaperones, outer membrane proteins, serine proteases, antimicrobial, biofilm, exotoxins, etc. In summary, the results of the present study of the S-33 secretome provide a basis for understanding the possible pathogenic mechanisms and future investigation by detailed experimental approach will provide a confirmation of secreted virulence proteins in the exact role of virulence using the in vivo model.
Collapse
Affiliation(s)
- Kiran Kumari
- Department of Bioengineering and Biotechnology, BIT Mesra, Ranchi, Jharkhand, 835215, India
| | - Parva Kumar Sharma
- Department of Plant Sciences and Landscape Architecture, University of Maryland, College Park, MD-20742, USA
| | - Yogender Aggarwal
- Department of Bioengineering and Biotechnology, BIT Mesra, Ranchi, Jharkhand, 835215, India
| | - Rajnish Prakash Singh
- Department of Bioengineering and Biotechnology, BIT Mesra, Ranchi, Jharkhand, 835215, India.
| |
Collapse
|
5
|
Ghadie MA, Xia Y. Are transient protein-protein interactions more dispensable? PLoS Comput Biol 2022; 18:e1010013. [PMID: 35404956 PMCID: PMC9000134 DOI: 10.1371/journal.pcbi.1010013] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2021] [Accepted: 03/11/2022] [Indexed: 12/12/2022] Open
Abstract
Protein-protein interactions (PPIs) are key drivers of cell function and evolution. While it is widely assumed that most permanent PPIs are important for cellular function, it remains unclear whether transient PPIs are equally important. Here, we estimate and compare dispensable content among transient PPIs and permanent PPIs in human. Starting with a human reference interactome mapped by experiments, we construct a human structural interactome by building three-dimensional structural models for PPIs, and then distinguish transient PPIs from permanent PPIs using several structural and biophysical properties. We map common mutations from healthy individuals and disease-causing mutations onto the structural interactome, and perform structure-based calculations of the probabilities for common mutations (assumed to be neutral) and disease mutations (assumed to be mildly deleterious) to disrupt transient PPIs and permanent PPIs. Using Bayes' theorem we estimate that a similarly small fraction (<~20%) of both transient and permanent PPIs are completely dispensable, i.e., effectively neutral upon disruption. Hence, transient and permanent interactions are subject to similarly strong selective constraints in the human interactome.
Collapse
Affiliation(s)
| | - Yu Xia
- Department of Bioengineering, McGill University, Montreal, Canada
| |
Collapse
|
6
|
The properties of human disease mutations at protein interfaces. PLoS Comput Biol 2022; 18:e1009858. [PMID: 35120134 PMCID: PMC8849535 DOI: 10.1371/journal.pcbi.1009858] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2021] [Revised: 02/16/2022] [Accepted: 01/24/2022] [Indexed: 12/27/2022] Open
Abstract
The assembly of proteins into complexes and their interactions with other biomolecules are often vital for their biological function. While it is known that mutations at protein interfaces have a high potential to be damaging and cause human genetic disease, there has been relatively little consideration for how this varies between different types of interfaces. Here we investigate the properties of human pathogenic and putatively benign missense variants at homomeric (isologous and heterologous), heteromeric, DNA, RNA and other ligand interfaces, and at different regions in proteins with respect to those interfaces. We find that different types of interfaces vary greatly in their propensity to be associated with pathogenic mutations, with homomeric heterologous and DNA interfaces being particularly enriched in disease. We also find that residues that do not directly participate in an interface, but are close in three-dimensional space, show a significant disease enrichment. Finally, we observe that mutations at different types of interfaces tend to have distinct property changes when undergoing amino acid substitutions associated with disease, and that this is linked to substantial variability in their identification by computational variant effect predictors. Nearly all proteins interact with other molecules as part of their biological function. For example, proteins can interact with other copies of the same type of protein, with different proteins, with DNA, or with small ligand molecules. Many mutations at protein interfaces, the regions of proteins that interact with other molecules, are known to cause human genetic disease. In this study, we first investigate how different types of protein interfaces have different tendencies to be associated with disease. We also show that the closer a mutation is to an interface, the more likely it is to cause disease. Finally, we study how mutations at different types of interfaces tend to be associated with different changes in amino acid properties, which appears to influence our ability to computationally predict the effects of mutations. Ultimately, we hope that consideration of protein interface properties will eventually improve our ability to identify new disease-causing mutations.
Collapse
|
7
|
Ovek D, Abali Z, Zeylan ME, Keskin O, Gursoy A, Tuncbag N. Artificial intelligence based methods for hot spot prediction. Curr Opin Struct Biol 2021; 72:209-218. [PMID: 34954608 DOI: 10.1016/j.sbi.2021.11.003] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2021] [Revised: 10/07/2021] [Accepted: 11/08/2021] [Indexed: 11/29/2022]
Abstract
Proteins interact through their interfaces to fulfill essential functions in the cell. They bind to their partners in a highly specific manner and form complexes that have a profound effect on understanding the biological pathways they are involved in. Any abnormal interactions may cause diseases. Therefore, the identification of small molecules which modulate protein interactions through their interfaces has high therapeutic potential. However, discovering such molecules is challenging. Most protein-protein binding affinity is attributed to a small set of amino acids found in protein interfaces known as hot spots. Recent studies demonstrate that drug-like small molecules specifically may bind to hot spots. Therefore, hot spot prediction is crucial. As experimental data accumulates, artificial intelligence begins to be used for computational hot spot prediction. First, we review machine learning and deep learning for computational hot spot prediction and then explain the significance of hot spots toward drug design.
Collapse
Affiliation(s)
- Damla Ovek
- College of Engineering, Koc University, 34450 Istanbul, Turkey
| | - Zeynep Abali
- College of Engineering, Koc University, 34450 Istanbul, Turkey
| | | | - Ozlem Keskin
- College of Engineering, Koc University, 34450 Istanbul, Turkey.
| | - Attila Gursoy
- College of Engineering, Koc University, 34450 Istanbul, Turkey.
| | - Nurcan Tuncbag
- College of Engineering, Koc University, 34450 Istanbul, Turkey; School of Medicine, Koc University, 34450 Istanbul, Turkey.
| |
Collapse
|
8
|
Wilson CJ, Chang M, Karttunen M, Choy WY. KEAP1 Cancer Mutants: A Large-Scale Molecular Dynamics Study of Protein Stability. Int J Mol Sci 2021; 22:5408. [PMID: 34065616 PMCID: PMC8161161 DOI: 10.3390/ijms22105408] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2021] [Revised: 05/11/2021] [Accepted: 05/13/2021] [Indexed: 12/30/2022] Open
Abstract
We have performed 280 μs of unbiased molecular dynamics (MD) simulations to investigate the effects of 12 different cancer mutations on Kelch-like ECH-associated protein 1 (KEAP1) (G333C, G350S, G364C, G379D, R413L, R415G, A427V, G430C, R470C, R470H, R470S and G476R), one of the frequently mutated proteins in lung cancer. The aim was to provide structural insight into the effects of these mutants, including a new class of ANCHOR (additionally NRF2-complexed hypomorph) mutant variants. Our work provides additional insight into the structural dynamics of mutants that could not be analyzed experimentally, painting a more complete picture of their mutagenic effects. Notably, blade-wise analysis of the Kelch domain points to stability as a possible target of cancer in KEAP1. Interestingly, structural analysis of the R470C ANCHOR mutant, the most prevalent missense mutation in KEAP1, revealed no significant change in structural stability or NRF2 binding site dynamics, possibly indicating an covalent modification as this mutant's mode of action.
Collapse
Affiliation(s)
- Carter J. Wilson
- Department of Biochemistry, The University of Western Ontario, 1151 Richmond Street, London, ON N6A 5C1, Canada; (C.J.W.); (M.C.)
- Department of Applied Mathematics, The University of Western Ontario, 1151 Richmond Street, London, ON N6A 5B7, Canada
| | - Megan Chang
- Department of Biochemistry, The University of Western Ontario, 1151 Richmond Street, London, ON N6A 5C1, Canada; (C.J.W.); (M.C.)
| | - Mikko Karttunen
- Department of Applied Mathematics, The University of Western Ontario, 1151 Richmond Street, London, ON N6A 5B7, Canada
- Department of Chemistry, The University of Western Ontario, 1151 Richmond Street, London, ON N6A 3K7, Canada
- Centre for Advanced Materials and Biomaterials Research, The University of Western Ontario, 1151 Richmond Street, London, ON N6A 5B7, Canada
| | - Wing-Yiu Choy
- Department of Biochemistry, The University of Western Ontario, 1151 Richmond Street, London, ON N6A 5C1, Canada; (C.J.W.); (M.C.)
| |
Collapse
|
9
|
Ponzoni L, Peñaherrera DA, Oltvai ZN, Bahar I. Rhapsody: predicting the pathogenicity of human missense variants. Bioinformatics 2020; 36:3084-3092. [PMID: 32101277 PMCID: PMC7214033 DOI: 10.1093/bioinformatics/btaa127] [Citation(s) in RCA: 48] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2019] [Revised: 12/27/2019] [Accepted: 02/21/2020] [Indexed: 12/22/2022] Open
Abstract
MOTIVATION The biological effects of human missense variants have been studied experimentally for decades but predicting their effects in clinical molecular diagnostics remains challenging. Available computational tools are usually based on the analysis of sequence conservation and structural properties of the mutant protein. We recently introduced a new machine learning method that demonstrated for the first time the significance of protein dynamics in determining the pathogenicity of missense variants. RESULTS Here, we present a new interface (Rhapsody) that enables fully automated assessment of pathogenicity, incorporating both sequence coevolution data and structure- and dynamics-based features. Benchmarked against a dataset of about 20 000 annotated variants, the methodology is shown to outperform well-established and/or advanced prediction tools. We illustrate the utility of Rhapsody by in silico saturation mutagenesis studies of human H-Ras, phosphatase and tensin homolog and thiopurine S-methyltransferase. AVAILABILITY AND IMPLEMENTATION The new tool is available both as an online webserver at http://rhapsody.csb.pitt.edu and as an open-source Python package (GitHub repository: https://github.com/prody/rhapsody; PyPI package installation: pip install prody-rhapsody). Links to additional resources, tutorials and package documentation are provided in the 'Python package' section of the website. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Luca Ponzoni
- Department of Computational and Systems Biology, University of Pittsburgh, Pittsburgh, PA 15260, USA
| | - Daniel A Peñaherrera
- Department of Computational and Systems Biology, University of Pittsburgh, Pittsburgh, PA 15260, USA
| | - Zoltán N Oltvai
- Department of Computational and Systems Biology, University of Pittsburgh, Pittsburgh, PA 15260, USA.,Department of Pathology, University of Pittsburgh, Pittsburgh, PA 15261, USA.,Department of Laboratory Medicine and Pathology, University of Minnesota, Minneapolis, MN 55455, USA
| | - Ivet Bahar
- Department of Computational and Systems Biology, University of Pittsburgh, Pittsburgh, PA 15260, USA
| |
Collapse
|
10
|
Zhang N, Lu H, Chen Y, Zhu Z, Yang Q, Wang S, Li M. PremPRI: Predicting the Effects of Missense Mutations on Protein-RNA Interactions. Int J Mol Sci 2020; 21:ijms21155560. [PMID: 32756481 PMCID: PMC7432928 DOI: 10.3390/ijms21155560] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2020] [Revised: 07/28/2020] [Accepted: 07/30/2020] [Indexed: 12/23/2022] Open
Abstract
Protein–RNA interactions are crucial for many cellular processes, such as protein synthesis and regulation of gene expression. Missense mutations that alter protein–RNA interaction may contribute to the pathogenesis of many diseases. Here, we introduce a new computational method PremPRI, which predicts the effects of single mutations occurring in RNA binding proteins on the protein–RNA interactions by calculating the binding affinity changes quantitatively. The multiple linear regression scoring function of PremPRI is composed of three sequence- and eight structure-based features, and is parameterized on 248 mutations from 50 protein–RNA complexes. Our model shows a good agreement between calculated and experimental values of binding affinity changes with a Pearson correlation coefficient of 0.72 and the corresponding root-mean-square error of 0.76 kcal·mol−1, outperforming three other available methods. PremPRI can be used for finding functionally important variants, understanding the molecular mechanisms, and designing new protein–RNA interaction inhibitors.
Collapse
|
11
|
Insights into changes in binding affinity caused by disease mutations in protein-protein complexes. Comput Biol Med 2020; 123:103829. [DOI: 10.1016/j.compbiomed.2020.103829] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2020] [Revised: 05/20/2020] [Accepted: 05/20/2020] [Indexed: 01/11/2023]
|
12
|
Zhang N, Chen Y, Lu H, Zhao F, Alvarez RV, Goncearenco A, Panchenko AR, Li M. MutaBind2: Predicting the Impacts of Single and Multiple Mutations on Protein-Protein Interactions. iScience 2020; 23:100939. [PMID: 32169820 PMCID: PMC7068639 DOI: 10.1016/j.isci.2020.100939] [Citation(s) in RCA: 80] [Impact Index Per Article: 20.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2019] [Revised: 11/21/2019] [Accepted: 02/20/2020] [Indexed: 01/17/2023] Open
Abstract
Missense mutations may affect proteostasis by destabilizing or over-stabilizing protein complexes and changing the pathway flux. Predicting the effects of stabilizing mutations on protein-protein interactions is notoriously difficult because existing experimental sets are skewed toward mutations reducing protein-protein binding affinity and many computational methods fail to correctly evaluate their effects. To address this issue, we developed a method MutaBind2, which estimates the impacts of single as well as multiple mutations on protein-protein interactions. MutaBind2 employs only seven features, and the most important of them describe interactions of proteins with the solvent, evolutionary conservation of the site, and thermodynamic stability of the complex and each monomer. This approach shows a distinct improvement especially in evaluating the effects of mutations increasing binding affinity. MutaBind2 can be used for finding disease driver mutations, designing stable protein complexes, and discovering new protein-protein interaction inhibitors.
Collapse
Affiliation(s)
- Ning Zhang
- Center for Systems Biology, Department of Bioinformatics, School of Biology and Basic Medical Sciences, Soochow University, Suzhou 215123, China
| | - Yuting Chen
- Center for Systems Biology, Department of Bioinformatics, School of Biology and Basic Medical Sciences, Soochow University, Suzhou 215123, China
| | - Haoyu Lu
- Center for Systems Biology, Department of Bioinformatics, School of Biology and Basic Medical Sciences, Soochow University, Suzhou 215123, China
| | - Feiyang Zhao
- Center for Systems Biology, Department of Bioinformatics, School of Biology and Basic Medical Sciences, Soochow University, Suzhou 215123, China
| | - Roberto Vera Alvarez
- National Center for Biotechnology Information, National Institutes of Health, Bethesda, MD 20894, USA
| | - Alexander Goncearenco
- National Center for Biotechnology Information, National Institutes of Health, Bethesda, MD 20894, USA
| | - Anna R Panchenko
- National Center for Biotechnology Information, National Institutes of Health, Bethesda, MD 20894, USA.
| | - Minghui Li
- Center for Systems Biology, Department of Bioinformatics, School of Biology and Basic Medical Sciences, Soochow University, Suzhou 215123, China.
| |
Collapse
|
13
|
Dincer C, Kaya T, Keskin O, Gursoy A, Tuncbag N. 3D spatial organization and network-guided comparison of mutation profiles in Glioblastoma reveals similarities across patients. PLoS Comput Biol 2019; 15:e1006789. [PMID: 31527881 PMCID: PMC6782092 DOI: 10.1371/journal.pcbi.1006789] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2019] [Revised: 10/08/2019] [Accepted: 07/31/2019] [Indexed: 02/06/2023] Open
Abstract
Glioblastoma multiforme (GBM) is the most aggressive type of brain tumor. Molecular heterogeneity is a hallmark of GBM tumors that is a barrier in developing treatment strategies. In this study, we used the nonsynonymous mutations of GBM tumors deposited in The Cancer Genome Atlas (TCGA) and applied a systems level approach based on biophysical characteristics of mutations and their organization in patient-specific subnetworks to reduce inter-patient heterogeneity and to gain potential clinically relevant insights. Approximately 10% of the mutations are located in "patches" which are defined as the set of residues spatially in close proximity that are mutated across multiple patients. Grouping mutations as 3D patches reduces the heterogeneity across patients. There are multiple patches that are relatively small in oncogenes, whereas there are a small number of very large patches in tumor suppressors. Additionally, different patches in the same protein are often located at different domains that can mediate different functions. We stratified the patients into five groups based on their potentially affected pathways that are revealed from the patient-specific subnetworks. These subnetworks were constructed by integrating mutation profiles of the patients with the interactome data. Network-guided clustering showed significant association between the groups and patient survival (P-value = 0.0408). Also, each group carries a set of signature 3D mutation patches that affect predominant pathways. We integrated drug sensitivity data of GBM cell lines with the mutation patches and the patient groups to analyze the possible therapeutic outcome of these patches. We found that Pazopanib might be effective in Group 3 by targeting CSF1R. Additionally, inhibiting ATM that is a mediator of PTEN phosphorylation may be ineffective in Group 2. We believe that from mutations to networks and eventually to clinical and therapeutic data, this study provides a novel perspective in the network-guided precision medicine.
Collapse
Affiliation(s)
- Cansu Dincer
- Department of Health Informatics, Graduate School of Informatics, METU, Ankara, Turkey
| | - Tugba Kaya
- Department of Health Informatics, Graduate School of Informatics, METU, Ankara, Turkey
| | - Ozlem Keskin
- Department of Chemical and Biological Engineering, Koc University, Istanbul, Turkey
- Research Center for Translational Medicine (KUTTAM), Koc University, Istanbul, Turkey
| | - Attila Gursoy
- Research Center for Translational Medicine (KUTTAM), Koc University, Istanbul, Turkey
- Department of Computer Engineering, Koc University, Istanbul, Turkey
| | - Nurcan Tuncbag
- Department of Health Informatics, Graduate School of Informatics, METU, Ankara, Turkey
- Cancer Systems Biology Laboratory (CanSyL-METU), Ankara, Turkey
- * E-mail:
| |
Collapse
|
14
|
Deng L, Sui Y, Zhang J. XGBPRH: Prediction of Binding Hot Spots at Protein⁻RNA Interfaces Utilizing Extreme Gradient Boosting. Genes (Basel) 2019; 10:genes10030242. [PMID: 30901953 PMCID: PMC6471955 DOI: 10.3390/genes10030242] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2019] [Revised: 03/14/2019] [Accepted: 03/15/2019] [Indexed: 01/24/2023] Open
Abstract
Hot spot residues at protein⁻RNA complexes are vitally important for investigating the underlying molecular recognition mechanism. Accurately identifying protein⁻RNA binding hot spots is critical for drug designing and protein engineering. Although some progress has been made by utilizing various available features and a series of machine learning approaches, these methods are still in the infant stage. In this paper, we present a new computational method named XGBPRH, which is based on an eXtreme Gradient Boosting (XGBoost) algorithm and can effectively predict hot spot residues in protein⁻RNA interfaces utilizing an optimal set of properties. Firstly, we download 47 protein⁻RNA complexes and calculate a total of 156 sequence, structure, exposure, and network features. Next, we adopt a two-step feature selection algorithm to extract a combination of 6 optimal features from the combination of these 156 features. Compared with the state-of-the-art approaches, XGBPRH achieves better performances with an area under the ROC curve (AUC) score of 0.817 and an F1-score of 0.802 on the independent test set. Meanwhile, we also apply XGBPRH to two case studies. The results demonstrate that the method can effectively identify novel energy hotspots.
Collapse
Affiliation(s)
- Lei Deng
- School of Computer Science and Engineering, Central South University, Changsha 410075, China.
| | - Yuanchao Sui
- School of Computer Science and Engineering, Central South University, Changsha 410075, China.
| | - Jingpu Zhang
- School of Computer and Data Science, Henan University of Urban Construction, Pingdingshan 467000, China.
| |
Collapse
|