1
|
Zheng F, Jiang X, Wen Y, Yang Y, Li M. Systematic investigation of machine learning on limited data: A study on predicting protein-protein binding strength. Comput Struct Biotechnol J 2024; 23:460-472. [PMID: 38235359 PMCID: PMC10792694 DOI: 10.1016/j.csbj.2023.12.018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2023] [Revised: 12/14/2023] [Accepted: 12/16/2023] [Indexed: 01/19/2024] Open
Abstract
The application of machine learning techniques in biological research, especially when dealing with limited data availability, poses significant challenges. In this study, we leveraged advancements in method development for predicting protein-protein binding strength to conduct a systematic investigation into the application of machine learning on limited data. The binding strength, quantitatively measured as binding affinity, is vital for understanding the processes of recognition, association, and dysfunction that occur within protein complexes. By incorporating transfer learning, integrating domain knowledge, and employing both deep learning and traditional machine learning algorithms, we mitigated the impact of data limitations and made significant advancements in predicting protein-protein binding affinity. In particular, we developed over 20 models, ultimately selecting three representative best-performing ones that belong to distinct categories. The first model is structure-based, consisting of a random forest regression and thirteen handcrafted features. The second model is sequence-based, employing an architecture that combines transferred embedding features with a multilayer perceptron. Finally, we created an ensemble model by averaging the predictions of the two aforementioned models. The comparison with other predictors on three independent datasets confirms the significant improvements achieved by our models in predicting protein-protein binding affinity. The programs for running these three models are available at https://github.com/minghuilab/BindPPI.
Collapse
Affiliation(s)
- Feifan Zheng
- MOE Key Laboratory of Geriatric Diseases and Immunology, School of Biology and Basic Medical Sciences, Suzhou Medical College of Soochow University, Suzhou, Jiangsu Province 215123, China
| | - Xin Jiang
- MOE Key Laboratory of Geriatric Diseases and Immunology, School of Biology and Basic Medical Sciences, Suzhou Medical College of Soochow University, Suzhou, Jiangsu Province 215123, China
| | - Yuhao Wen
- MOE Key Laboratory of Geriatric Diseases and Immunology, School of Biology and Basic Medical Sciences, Suzhou Medical College of Soochow University, Suzhou, Jiangsu Province 215123, China
| | - Yan Yang
- MOE Key Laboratory of Geriatric Diseases and Immunology, School of Biology and Basic Medical Sciences, Suzhou Medical College of Soochow University, Suzhou, Jiangsu Province 215123, China
| | - Minghui Li
- MOE Key Laboratory of Geriatric Diseases and Immunology, School of Biology and Basic Medical Sciences, Suzhou Medical College of Soochow University, Suzhou, Jiangsu Province 215123, China
| |
Collapse
|
2
|
Nebli S, Rebai A, Ayadi I. Screening clusters of charged residues in plants' mitochondrial proteins and biological significance. Mitochondrion 2024; 78:101938. [PMID: 39013535 DOI: 10.1016/j.mito.2024.101938] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2024] [Revised: 07/09/2024] [Accepted: 07/12/2024] [Indexed: 07/18/2024]
Abstract
Protein function is dependent on charge interactions and charge biased regions, which are involved in a wide range of cellular and biochemical processes. We report the development of a new algorithm implemented in Python and its use to identify charge clusters CC (NegativeCC: NCC, PositiveCC: PCC and MixedCC: MCC) and compare their presence in mitochondrial proteins of plant groups. To characterize the resulting CC, statistical, structural and functional analyses were conducted. The screening of 105 399 protein sequences showed that 2.6 %, 0.48 % and 0.03 % of the proteins contain NCC, PCC and MCC, respectively. Mitochondrial proteins encoded by the nuclear genome of green algae have the biggest proportion of both PCC (1.6 %) and MCC (0.4 %) and mitochondrial proteins coded by the nuclear genome of other plants group have the highest portion of NCC (7.5 %). The mapping of the identified CC showed that they are mainly located in the terminal regions of the protein. Annotation showed that proteins with CC are classified as binding proteins, are included in the transmembrane transport processes, and are mainly located in the membrane. The CC scanning revealed the presence of 2373 and 784 sites and 192 and 149 motif profiles within NCC and PCC, respectively. The investigation of CC within pentatricopeptide repeat-containing proteins revealed that they are involved in correct and specific RNA editing. CC were proven to play a key role in providing insightful structural and functional information of complex protein assemblies which could be useful in biotechnological applications.
Collapse
Affiliation(s)
- Syrine Nebli
- Laboratory of Molecular and Cellular Screening Processes, Centre of Biotechnology of Sfax, University of Sfax, P. O. Box 1177, 3018 Sfax, Tunisia.
| | - Ahmed Rebai
- Laboratory of Molecular and Cellular Screening Processes, Centre of Biotechnology of Sfax, University of Sfax, P. O. Box 1177, 3018 Sfax, Tunisia.
| | - Imen Ayadi
- Laboratory of Molecular and Cellular Screening Processes, Centre of Biotechnology of Sfax, University of Sfax, P. O. Box 1177, 3018 Sfax, Tunisia.
| |
Collapse
|
3
|
Sun X, Wu Z, Su J, Li C. A deep attention model for wide-genome protein-peptide binding affinity prediction at a sequence level. Int J Biol Macromol 2024; 276:133811. [PMID: 38996881 DOI: 10.1016/j.ijbiomac.2024.133811] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2024] [Revised: 07/09/2024] [Accepted: 07/09/2024] [Indexed: 07/14/2024]
Abstract
Peptides are pivotal in numerous biological activities by engaging in up to 40 % of protein-protein interactions in many cellular processes. Due to their exceptional specificity and effectiveness, peptides have emerged as promising candidates for drug design. However, accurately predicting protein-peptide binding affinity remains a challenging. Aiming at the problem, we develop a prediction model PepPAP based on convolutional neural network and multi-head attention, which relies solely on sequence features. These features include physicochemical properties, intrinsic disorder, sequence encoding, and especially interface propensity which is extracted from 16,689 non-redundant protein-peptide complexes. Notably, the adopted regression stratification cross-validation scheme proposed in our previous work is beneficial to improve the prediction for the cases with extreme binding affinity values. On three benchmark test datasets: T100, a series of peptides targeting to PDZ domain and CXCR4, PepPAP shows excellent performance, outperforming the existing methods and demonstrating its good generalization ability. Furthermore, PepPAP has good results in binary interaction prediction, and the analysis of the feature space distribution visualization highlights PepPAP's effectiveness. To the best of our knowledge, PepPAP is the first sequence-based deep attention model for wide-genome protein-peptide binding affinity prediction, and holds the potential to offer valuable insights for the peptide-based drug design.
Collapse
Affiliation(s)
- Xiaohan Sun
- College of Chemistry and Life Science, Beijing University of Technology, Beijing 100124, China
| | - Zhixiang Wu
- College of Chemistry and Life Science, Beijing University of Technology, Beijing 100124, China
| | - Jingjie Su
- College of Chemistry and Life Science, Beijing University of Technology, Beijing 100124, China
| | - Chunhua Li
- College of Chemistry and Life Science, Beijing University of Technology, Beijing 100124, China.
| |
Collapse
|
4
|
Chen H, Revennaugh B, Fu H, Ivanov AA. AVERON notebook to discover actionable cancer vulnerabilities enabled by neomorph protein-protein interactions. iScience 2024; 27:110035. [PMID: 38883827 PMCID: PMC11179073 DOI: 10.1016/j.isci.2024.110035] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2024] [Revised: 04/30/2024] [Accepted: 05/16/2024] [Indexed: 06/18/2024] Open
Abstract
Genomic alterations, such as missense mutations, often lead to the activation of oncogenic pathways and cell transformation by rewiring protein-protein interaction (PPI) networks. Understanding how mutant-directed neomorph PPIs (neoPPIs) drive cancer is vital to developing new personalized clinical strategies. However, the experimental interrogation of neoPPI functions in patients with cancer is highly challenging. To address this challenge, we developed a computational platform, termed AVERON for discovering actionable vulnerabilities enabled by rewired oncogenic networks. AVERON enables rapid systematic profiling of the clinical significance of neomorph PPIs across different cancer types, informing molecular mechanisms of neoPPI-driven tumorigenesis, and revealing therapeutically actionable neoPPI-regulated genes. We demonstrated the application of the AVERON platform by evaluating the biological functions and clinical significance of 130 neomorph interactions, experimentally determined for oncogenic BRAFV600E. The AVERON application to broad sets of mutant-directed PPIs may inform new testable biological models and clinical strategies in cancer.
Collapse
Affiliation(s)
- Hongyue Chen
- Department of Pharmacology and Chemical Biology, Emory University School of Medicine, Emory University, Atlanta, GA, USA
| | - Brian Revennaugh
- Department of Pharmacology and Chemical Biology, Emory University School of Medicine, Emory University, Atlanta, GA, USA
| | - Haian Fu
- Department of Pharmacology and Chemical Biology, Emory University School of Medicine, Emory University, Atlanta, GA, USA
- Emory Chemical Biology Discovery Center, Emory University School of Medicine, Emory University, Atlanta, GA, USA
- Winship Cancer Institute, Emory University, Atlanta, GA, USA
- Department of Hematology, Medical Oncology Emory University, Atlanta, GA, USA
| | - Andrey A Ivanov
- Department of Pharmacology and Chemical Biology, Emory University School of Medicine, Emory University, Atlanta, GA, USA
- Emory Chemical Biology Discovery Center, Emory University School of Medicine, Emory University, Atlanta, GA, USA
- Winship Cancer Institute, Emory University, Atlanta, GA, USA
| |
Collapse
|
5
|
Velez-Arce A, Huang K, Li MM, Lin X, Gao W, Fu T, Kellis M, Pentelute BL, Zitnik M. TDC-2: Multimodal Foundation for Therapeutic Science. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.06.12.598655. [PMID: 38948789 PMCID: PMC11212894 DOI: 10.1101/2024.06.12.598655] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/02/2024]
Abstract
Therapeutics Data Commons (tdcommons.ai) is an open science initiative with unified datasets, AI models, and benchmarks to support research across therapeutic modalities and drug discovery and development stages. The Commons 2.0 (TDC-2) is a comprehensive overhaul of Therapeutic Data Commons to catalyze research in multimodal models for drug discovery by unifying single-cell biology of diseases, biochemistry of molecules, and effects of drugs through multimodal datasets, AI-powered API endpoints, new multimodal tasks and model frameworks, and comprehensive benchmarks. TDC-2 introduces over 1,000 multimodal datasets spanning approximately 85 million cells, pre-calculated embeddings from 5 state-of-the-art single-cell models, and a biomedical knowledge graph. TDC-2 drastically expands the coverage of ML tasks across therapeutic pipelines and 10+ new modalities, spanning but not limited to single-cell gene expression data, clinical trial data, peptide sequence data, peptidomimetics protein-peptide interaction data regarding newly discovered ligands derived from AS-MS spectroscopy, novel 3D structural data for proteins, and cell-type-specific protein-protein interaction networks at single-cell resolution. TDC-2 introduces multimodal data access under an API-first design using the model-view-controller paradigm. TDC-2 introduces 7 novel ML tasks with fine-grained biological contexts: contextualized drug-target identification, single-cell chemical/genetic perturbation response prediction, protein-peptide binding affinity prediction task, and clinical trial outcome prediction task, which introduce antigen-processing-pathway-specific, cell-type-specific, peptide-specific, and patient-specific biological contexts. TDC-2 also releases benchmarks evaluating 15+ state-of-the-art models across 5+ new learning tasks evaluating models on diverse biological contexts and sampling approaches. Among these, TDC-2 provides the first benchmark for context-specific learning. TDC-2, to our knowledge, is also the first to introduce a protein-peptide binding interaction benchmark.
Collapse
|
6
|
Biswas G, Mukherjee D, Basu S. Combining Complementarity and Binding Energetics in the Assessment of Protein Interactions: EnCPdock-A Practical Manual. J Comput Biol 2024. [PMID: 38885081 DOI: 10.1089/cmb.2024.0554] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/20/2024] Open
Abstract
The combined effect of shape and electrostatic complementarities (Sc, EC) at the interface of the interacting protein partners (PPI) serves as the physical basis for such associations and is a strong determinant of their binding energetics. EnCPdock (https://www.scinetmol.in/EnCPdock/) presents a comprehensive web platform for the direct conjoint comparative analyses of complementarity and binding energetics in PPIs. It elegantly interlinks the dual nature of local (Sc) and nonlocal complementarity (EC) in PPIs using the complementarity plot. It further derives an AI-based ΔGbinding with a prediction accuracy comparable to the state of the art. This book chapter presents a practical manual to conceptualize and implement EnCPdock with its various features and functionalities, collectively having the potential to serve as a valuable protein engineering tool in the design of novel protein interfaces.
Collapse
Affiliation(s)
- Gargi Biswas
- Department of Chemical and Structural Biology, Weizmann Institute of Science, Rehovot, Israel
| | | | - Sankar Basu
- Department of Microbiology, Asutosh College, University of Calcutta, Kolkata, India
| |
Collapse
|
7
|
Liu JX, Zhang X, Huang YQ, Hao GF, Yang GF. Multi-level bioinformatics resources support drug target discovery of protein-protein interactions. Drug Discov Today 2024; 29:103979. [PMID: 38608830 DOI: 10.1016/j.drudis.2024.103979] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2024] [Revised: 03/14/2024] [Accepted: 04/05/2024] [Indexed: 04/14/2024]
Abstract
Drug discovery often begins with a new target. Protein-protein interactions (PPIs) are crucial to multitudinous cellular processes and offer a promising avenue for drug-target discovery. PPIs are characterized by multi-level complexity: at the protein level, interaction networks can be used to identify potential targets, whereas at the residue level, the details of the interactions of individual PPIs can be used to examine a target's druggability. Much great progress has been made in target discovery through multi-level PPI-related computational approaches, but these resources have not been fully discussed. Here, we systematically survey bioinformatics tools for identifying and assessing potential drug targets, examining their characteristics, limitations and applications. This work will aid the integration of the broader protein-to-network context with the analysis of detailed binding mechanisms to support the discovery of drug targets.
Collapse
Affiliation(s)
- Jia-Xin Liu
- National Key Laboratory of Green Pesticide, Key Laboratory of Pesticide & Chemical Biology, Ministry of Education, International Joint Research Center for Intelligent Biosensor Technology and Health, Central China Normal University, Wuhan 430079, PR China
| | - Xiao Zhang
- State Key Laboratory of Green Pesticide, Key Laboratory of Green Pesticide and Agricultural Bioengineering, Ministry of Education, Center for R&D of Fine Chemicals, Guizhou University, Guiyang 550025, PR China
| | - Yuan-Qin Huang
- State Key Laboratory of Green Pesticide, Key Laboratory of Green Pesticide and Agricultural Bioengineering, Ministry of Education, Center for R&D of Fine Chemicals, Guizhou University, Guiyang 550025, PR China
| | - Ge-Fei Hao
- National Key Laboratory of Green Pesticide, Key Laboratory of Pesticide & Chemical Biology, Ministry of Education, International Joint Research Center for Intelligent Biosensor Technology and Health, Central China Normal University, Wuhan 430079, PR China; State Key Laboratory of Green Pesticide, Key Laboratory of Green Pesticide and Agricultural Bioengineering, Ministry of Education, Center for R&D of Fine Chemicals, Guizhou University, Guiyang 550025, PR China.
| | - Guang-Fu Yang
- National Key Laboratory of Green Pesticide, Key Laboratory of Pesticide & Chemical Biology, Ministry of Education, International Joint Research Center for Intelligent Biosensor Technology and Health, Central China Normal University, Wuhan 430079, PR China.
| |
Collapse
|
8
|
Grassmann G, Miotto M, Desantis F, Di Rienzo L, Tartaglia GG, Pastore A, Ruocco G, Monti M, Milanetti E. Computational Approaches to Predict Protein-Protein Interactions in Crowded Cellular Environments. Chem Rev 2024; 124:3932-3977. [PMID: 38535831 PMCID: PMC11009965 DOI: 10.1021/acs.chemrev.3c00550] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2023] [Revised: 02/20/2024] [Accepted: 02/21/2024] [Indexed: 04/11/2024]
Abstract
Investigating protein-protein interactions is crucial for understanding cellular biological processes because proteins often function within molecular complexes rather than in isolation. While experimental and computational methods have provided valuable insights into these interactions, they often overlook a critical factor: the crowded cellular environment. This environment significantly impacts protein behavior, including structural stability, diffusion, and ultimately the nature of binding. In this review, we discuss theoretical and computational approaches that allow the modeling of biological systems to guide and complement experiments and can thus significantly advance the investigation, and possibly the predictions, of protein-protein interactions in the crowded environment of cell cytoplasm. We explore topics such as statistical mechanics for lattice simulations, hydrodynamic interactions, diffusion processes in high-viscosity environments, and several methods based on molecular dynamics simulations. By synergistically leveraging methods from biophysics and computational biology, we review the state of the art of computational methods to study the impact of molecular crowding on protein-protein interactions and discuss its potential revolutionizing effects on the characterization of the human interactome.
Collapse
Affiliation(s)
- Greta Grassmann
- Department
of Biochemical Sciences “Alessandro Rossi Fanelli”, Sapienza University of Rome, Rome 00185, Italy
- Center
for Life Nano & Neuro Science, Istituto
Italiano di Tecnologia, Rome 00161, Italy
| | - Mattia Miotto
- Center
for Life Nano & Neuro Science, Istituto
Italiano di Tecnologia, Rome 00161, Italy
| | - Fausta Desantis
- Center
for Life Nano & Neuro Science, Istituto
Italiano di Tecnologia, Rome 00161, Italy
- The
Open University Affiliated Research Centre at Istituto Italiano di
Tecnologia, Genoa 16163, Italy
| | - Lorenzo Di Rienzo
- Center
for Life Nano & Neuro Science, Istituto
Italiano di Tecnologia, Rome 00161, Italy
| | - Gian Gaetano Tartaglia
- Center
for Life Nano & Neuro Science, Istituto
Italiano di Tecnologia, Rome 00161, Italy
- Department
of Neuroscience and Brain Technologies, Istituto Italiano di Tecnologia, Genoa 16163, Italy
- Center
for Human Technologies, Genoa 16152, Italy
| | - Annalisa Pastore
- Experiment
Division, European Synchrotron Radiation
Facility, Grenoble 38043, France
| | - Giancarlo Ruocco
- Center
for Life Nano & Neuro Science, Istituto
Italiano di Tecnologia, Rome 00161, Italy
- Department
of Physics, Sapienza University, Rome 00185, Italy
| | - Michele Monti
- RNA
System Biology Lab, Department of Neuroscience and Brain Technologies, Istituto Italiano di Tecnologia, Genoa 16163, Italy
| | - Edoardo Milanetti
- Center
for Life Nano & Neuro Science, Istituto
Italiano di Tecnologia, Rome 00161, Italy
- Department
of Physics, Sapienza University, Rome 00185, Italy
| |
Collapse
|
9
|
Ridha F, Gromiha MM. MPA-Pred: A machine learning approach for predicting the binding affinity of membrane protein-protein complexes. Proteins 2024; 92:499-508. [PMID: 37949651 DOI: 10.1002/prot.26633] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2023] [Revised: 10/05/2023] [Accepted: 10/25/2023] [Indexed: 11/12/2023]
Abstract
Membrane protein-protein interactions are essential for several functions including cell signaling, ion transport, and enzymatic activity. These interactions are mainly dictated by their binding affinities. Although several methods are available for predicting the binding affinity of protein-protein complexes, there exists no specific method for membrane protein-protein complexes. In this work, we collected the experimental binding affinity data for a set of 114 membrane protein-protein complexes and derived several structure and sequence-based features. Our analysis on the relationship between binding affinity and the features revealed that the important factors mainly depend on the type of membrane protein and the functional class of the protein. Specifically, aromatic and charged residues at the interface, and aromatic-aromatic and electrostatic interactions are found to be important to understand the binding affinity. Further, we developed a method, MPA-Pred, for predicting the binding affinity of membrane protein-protein complexes using a machine learning approach. It showed an average correlation and mean absolute error of 0.83 and 0.91 kcal/mol, respectively, using the jack-knife test on a set of 114 complexes. We have also developed a web server and it is available at https://web.iitm.ac.in/bioinfo2/MPA-Pred/. This method can be used for predicting the affinity of membrane protein-protein complexes at a large scale and aid to improve drug design strategies.
Collapse
Affiliation(s)
- Fathima Ridha
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai, India
| | - M Michael Gromiha
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai, India
- Department of Computer Science, National University of Singapore, Singapore, Singapore
| |
Collapse
|
10
|
Day M, Tetik B, Parlak M, Almeida-Hernández Y, Räschle M, Kaschani F, Siegert H, Marko A, Sanchez-Garcia E, Kaiser M, Barker IA, Pearl LH, Oliver AW, Boos D. TopBP1 utilises a bipartite GINS binding mode to support genome replication. Nat Commun 2024; 15:1797. [PMID: 38413589 PMCID: PMC10899662 DOI: 10.1038/s41467-024-45946-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2023] [Accepted: 02/07/2024] [Indexed: 02/29/2024] Open
Abstract
Activation of the replicative Mcm2-7 helicase by loading GINS and Cdc45 is crucial for replication origin firing, and as such for faithful genetic inheritance. Our biochemical and structural studies demonstrate that the helicase activator GINS interacts with TopBP1 through two separate binding surfaces, the first involving a stretch of highly conserved amino acids in the TopBP1-GINI region, the second a surface on TopBP1-BRCT4. The two surfaces bind to opposite ends of the A domain of the GINS subunit Psf1. Mutation analysis reveals that either surface is individually able to support TopBP1-GINS interaction, albeit with reduced affinity. Consistently, either surface is sufficient for replication origin firing in Xenopus egg extracts and becomes essential in the absence of the other. The TopBP1-GINS interaction appears sterically incompatible with simultaneous binding of DNA polymerase epsilon (Polε) to GINS when bound to Mcm2-7-Cdc45, although TopBP1-BRCT4 and the Polε subunit PolE2 show only partial competitivity in binding to Psf1. Our TopBP1-GINS model improves the understanding of the recently characterised metazoan pre-loading complex. It further predicts the coordination of three molecular origin firing processes, DNA polymerase epsilon arrival, TopBP1 ejection and GINS integration into Mcm2-7-Cdc45.
Collapse
Affiliation(s)
- Matthew Day
- School of Biological and Behavioural Sciences, Blizard Institute, Queen Mary University of London, London, E1 2AT, UK.
- Cancer Research UK DNA Repair Enzymes Group, Genome Damage and Stability Centre, School of Life Sciences, University of Sussex, Falmer, Brighton, BN1 9RQ, UK.
| | - Bilal Tetik
- Molecular Genetics II, Center of Medical Biotechnology, University of Duisburg-Essen, Universitätsstraße 2-5, 45141, Essen, Germany
| | - Milena Parlak
- Molecular Genetics II, Center of Medical Biotechnology, University of Duisburg-Essen, Universitätsstraße 2-5, 45141, Essen, Germany
| | - Yasser Almeida-Hernández
- Computational Bioengineering, Fakultät Bio- und Chemieingenieurwesen, Technical University Dortmund, Emil-Figge Str. 66, 44227, Dortmund, Germany
- Computational Biochemistry, Center of Medical Biotechnology, University of Duisburg-Essen, Universitätsstraße 2-5, 45141, Essen, Germany
| | - Markus Räschle
- Molecular Genetics, Technical University Kaiserslautern, Paul-Ehrlich Straße 24, 67663, Kaiserslautern, Germany
| | - Farnusch Kaschani
- Analytics Core Facility Essen, Center of Medical Biotechnology, University of Duisburg-Essen, Universitätsstraße 2-5, 45141, Essen, Germany
- Chemical Biology, Center of Medical Biotechnology, University Duisburg-Essen, Fakultät Biologie, Essen, Germany
| | - Heike Siegert
- Molecular Genetics II, Center of Medical Biotechnology, University of Duisburg-Essen, Universitätsstraße 2-5, 45141, Essen, Germany
| | - Anika Marko
- Molecular Genetics II, Center of Medical Biotechnology, University of Duisburg-Essen, Universitätsstraße 2-5, 45141, Essen, Germany
| | - Elsa Sanchez-Garcia
- Computational Bioengineering, Fakultät Bio- und Chemieingenieurwesen, Technical University Dortmund, Emil-Figge Str. 66, 44227, Dortmund, Germany
- Computational Biochemistry, Center of Medical Biotechnology, University of Duisburg-Essen, Universitätsstraße 2-5, 45141, Essen, Germany
| | - Markus Kaiser
- Analytics Core Facility Essen, Center of Medical Biotechnology, University of Duisburg-Essen, Universitätsstraße 2-5, 45141, Essen, Germany
- Chemical Biology, Center of Medical Biotechnology, University Duisburg-Essen, Fakultät Biologie, Essen, Germany
| | - Isabel A Barker
- Cancer Research UK DNA Repair Enzymes Group, Genome Damage and Stability Centre, School of Life Sciences, University of Sussex, Falmer, Brighton, BN1 9RQ, UK
| | - Laurence H Pearl
- Cancer Research UK DNA Repair Enzymes Group, Genome Damage and Stability Centre, School of Life Sciences, University of Sussex, Falmer, Brighton, BN1 9RQ, UK.
- Division of Structural Biology, Institute of Cancer Research, Chester Beatty Laboratories, 237 Fulham Road, London, SW1E 6BT, UK.
| | - Antony W Oliver
- Cancer Research UK DNA Repair Enzymes Group, Genome Damage and Stability Centre, School of Life Sciences, University of Sussex, Falmer, Brighton, BN1 9RQ, UK.
| | - Dominik Boos
- Molecular Genetics II, Center of Medical Biotechnology, University of Duisburg-Essen, Universitätsstraße 2-5, 45141, Essen, Germany.
| |
Collapse
|
11
|
Pan X, Li Y, Huang P, Staecker H, He M. Extracellular vesicles for developing targeted hearing loss therapy. J Control Release 2024; 366:460-478. [PMID: 38182057 DOI: 10.1016/j.jconrel.2023.12.050] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2023] [Revised: 12/19/2023] [Accepted: 12/28/2023] [Indexed: 01/07/2024]
Abstract
Substantial efforts have been made for local administration of small molecules or biologics in treating hearing loss diseases caused by either trauma, genetic mutations, or drug ototoxicity. Recently, extracellular vesicles (EVs) naturally secreted from cells have drawn increasing attention on attenuating hearing impairment from both preclinical studies and clinical studies. Highly emerging field utilizing diverse bioengineering technologies for developing EVs as the bioderived therapeutic materials, along with artificial intelligence (AI)-based targeting toolkits, shed the light on the unique properties of EVs specific to inner ear delivery. This review will illuminate such exciting research field from fundamentals of hearing protective functions of EVs to biotechnology advancement and potential clinical translation of functionalized EVs. Specifically, the advancements in assessing targeting ligands using AI algorithms are systematically discussed. The overall translational potential of EVs is reviewed in the context of auditory sensing system for developing next generation gene therapy.
Collapse
Affiliation(s)
- Xiaoshu Pan
- Department of Pharmaceutics, College of Pharmacy, University of Florida, Gainesville, Florida 32610, United States
| | - Yanjun Li
- Department of Medicinal Chemistry, Center for Natural Products, Drug Discovery and Development, University of Florida, Gainesville, Florida 32610, United States
| | - Peixin Huang
- Department of Otolaryngology, Head and Neck Surgery, University of Kansas School of Medicine, Kansas City, Kansas 66160, United States
| | - Hinrich Staecker
- Department of Otolaryngology, Head and Neck Surgery, University of Kansas School of Medicine, Kansas City, Kansas 66160, United States.
| | - Mei He
- Department of Pharmaceutics, College of Pharmacy, University of Florida, Gainesville, Florida 32610, United States.
| |
Collapse
|
12
|
Aarthy M, Pandiyan GN, Paramasivan R, Kumar A, Gupta B. Identification and prioritisation of potential vaccine candidates using subtractive proteomics and designing of a multi-epitope vaccine against Wuchereria bancrofti. Sci Rep 2024; 14:1970. [PMID: 38263422 PMCID: PMC10806236 DOI: 10.1038/s41598-024-52457-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2023] [Accepted: 01/18/2024] [Indexed: 01/25/2024] Open
Abstract
This study employed subtractive proteomics and immunoinformatics to analyze the Wuchereria bancrofti proteome and identify potential therapeutic targets, with a focus on designing a vaccine against the parasite species. A comprehensive bioinformatics analysis of the parasite's proteome identified 51 probable therapeutic targets, among which "Kunitz/bovine pancreatic trypsin inhibitor domain-containing protein" was identified as the most promising vaccine candidate. The candidate protein was used to design a multi-epitope vaccine, incorporating B-cell and T-cell epitopes identified through various tools. The vaccine construct underwent extensive analysis of its antigenic, physical, and chemical features, including the determination of secondary and tertiary structures. Docking and molecular dynamics simulations were performed with HLA alleles, Toll-like receptor 4 (TLR4), and TLR3 to assess its potential to elicit the human immune response. Immune simulation analysis confirmed the predicted vaccine's strong binding affinity with immunoglobulins, indicating its potential efficacy in generating an immune response. However, experimental validation and testing of this multi-epitope vaccine construct would be needed to assess its potential against W. bancrofti and even for a broader range of lymphatic filarial infections given the similarities between W. bancrofti and Brugia.
Collapse
Affiliation(s)
- Murali Aarthy
- ICMR-Vector Control Research Centre (VCRC), Field Station, Madurai, Tamil Nadu, 625002, India
| | - G Navaneetha Pandiyan
- ICMR-Vector Control Research Centre (VCRC), Field Station, Madurai, Tamil Nadu, 625002, India
| | - R Paramasivan
- ICMR-Vector Control Research Centre (VCRC), Field Station, Madurai, Tamil Nadu, 625002, India
| | - Ashwani Kumar
- ICMR-Vector Control Research Centre (VCRC), Puducherry, India
- Saveetha Institute of Medical and Technical Sciences (SIMATS), Saveetha University, Tandhalam, Chennai, Tamil Nadu, 602105, India
| | - Bhavna Gupta
- ICMR-Vector Control Research Centre (VCRC), Field Station, Madurai, Tamil Nadu, 625002, India.
| |
Collapse
|
13
|
Yi C, Taylor ML, Ziebarth J, Wang Y. Predictive Models and Impact of Interfacial Contacts and Amino Acids on Protein-Protein Binding Affinity. ACS OMEGA 2024; 9:3454-3468. [PMID: 38284090 PMCID: PMC10809705 DOI: 10.1021/acsomega.3c06996] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/13/2023] [Revised: 12/11/2023] [Accepted: 12/14/2023] [Indexed: 01/30/2024]
Abstract
Protein-protein interactions (PPIs) play a central role in nearly all cellular processes. The strength of the binding in a PPI is characterized by the binding affinity (BA) and is a key factor in controlling protein-protein complex formation and defining the structure-function relationship. Despite advancements in understanding protein-protein binding, much remains unknown about the interfacial region and its association with BA. New models are needed to predict BA with improved accuracy for therapeutic design. Here, we use machine learning approaches to examine how well different types of interfacial contacts can be used to predict experimentally determined BA and to reveal the impact of the specific amino acids at the binding interface on BA. We create a series of multivariate linear regression models incorporating different contact features at both residue and atomic levels and examine how different methods of identifying and characterizing these properties impact the performance of these models. Particularly, we introduce a new and simple approach to predict BA based on the quantities of specific amino acids at the protein-protein interface. We found that the numbers of specific amino acids at the protein-protein interface were correlated with BA. We show that the interfacial numbers of amino acids can be used to produce models with consistently good performance across different data sets, indicating the importance of the identities of interfacial amino acids in underlying BA. When trained on a diverse set of complexes from two benchmark data sets, the best performing BA model was generated with an explicit linear equation involving six amino acids. Tyrosine, in particular, was identified as the key amino acid in controlling BA, as it had the strongest correlation with BA and was consistently identified as the most important amino acid in feature importance studies. Glycine and serine were identified as the next two most important amino acids in predicting BA. The results from this study further our understanding of PPIs and can be used to make improved predictions of BA, giving them implications for drug design and screening in the pharmaceutical industry.
Collapse
Affiliation(s)
- Carey
Huang Yi
- Department of Chemistry, The University of Memphis, Memphis, Tennessee 38152, United States
| | - Mitchell Lee Taylor
- Department of Chemistry, The University of Memphis, Memphis, Tennessee 38152, United States
| | - Jesse Ziebarth
- Department of Chemistry, The University of Memphis, Memphis, Tennessee 38152, United States
| | - Yongmei Wang
- Department of Chemistry, The University of Memphis, Memphis, Tennessee 38152, United States
| |
Collapse
|
14
|
de Raffele D, Ilie IM. Unlocking novel therapies: cyclic peptide design for amyloidogenic targets through synergies of experiments, simulations, and machine learning. Chem Commun (Camb) 2024; 60:632-645. [PMID: 38131333 DOI: 10.1039/d3cc04630c] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2023]
Abstract
Existing therapies for neurodegenerative diseases like Parkinson's and Alzheimer's address only their symptoms and do not prevent disease onset. Common therapeutic agents, such as small molecules and antibodies struggle with insufficient selectivity, stability and bioavailability, leading to poor performance in clinical trials. Peptide-based therapeutics are emerging as promising candidates, with successful applications for cardiovascular diseases and cancers due to their high bioavailability, good efficacy and specificity. In particular, cyclic peptides have a long in vivo stability, while maintaining a robust antibody-like binding affinity. However, the de novo design of cyclic peptides is challenging due to the lack of long-lived druggable pockets of the target polypeptide, absence of exhaustive conformational distributions of the target and/or the binder, unknown binding site, methodological limitations, associated constraints (failed trials, time, money) and the vast combinatorial sequence space. Hence, efficient alignment and cooperation between disciplines, and synergies between experiments and simulations complemented by popular techniques like machine-learning can significantly speed up the therapeutic cyclic-peptide development for neurodegenerative diseases. We review the latest advancements in cyclic peptide design against amyloidogenic targets from a computational perspective in light of recent advancements and potential of machine learning to optimize the design process. We discuss the difficulties encountered when designing novel peptide-based inhibitors and we propose new strategies incorporating experiments, simulations and machine learning to design cyclic peptides to inhibit the toxic propagation of amyloidogenic polypeptides. Importantly, these strategies extend beyond the mere design of cyclic peptides and serve as template for the de novo generation of (bio)materials with programmable properties.
Collapse
Affiliation(s)
- Daria de Raffele
- University of Amsterdam, van 't Hoff Institute for Molecular Sciences, Science Park 904, P.O. Box 94157, 1090 GD Amsterdam, The Netherlands.
- Amsterdam Center for Multiscale Modeling (ACMM), University of Amsterdam, P.O. Box 94157, 1090 GD Amsterdam, The Netherlands
| | - Ioana M Ilie
- University of Amsterdam, van 't Hoff Institute for Molecular Sciences, Science Park 904, P.O. Box 94157, 1090 GD Amsterdam, The Netherlands.
- Amsterdam Center for Multiscale Modeling (ACMM), University of Amsterdam, P.O. Box 94157, 1090 GD Amsterdam, The Netherlands
| |
Collapse
|
15
|
Michalik I, Kuder KJ. Machine Learning Methods in Protein-Protein Docking. Methods Mol Biol 2024; 2780:107-126. [PMID: 38987466 DOI: 10.1007/978-1-0716-3985-6_7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/12/2024]
Abstract
An exponential increase in the number of publications that address artificial intelligence (AI) usage in life sciences has been noticed in recent years, while new modeling techniques are constantly being reported. The potential of these methods is vast-from understanding fundamental cellular processes to discovering new drugs and breakthrough therapies. Computational studies of protein-protein interactions, crucial for understanding the operation of biological systems, are no exception in this field. However, despite the rapid development of technology and the progress in developing new approaches, many aspects remain challenging to solve, such as predicting conformational changes in proteins, or more "trivial" issues as high-quality data in huge quantities.Therefore, this chapter focuses on a short introduction to various AI approaches to study protein-protein interactions, followed by a description of the most up-to-date algorithms and programs used for this purpose. Yet, given the considerable pace of development in this hot area of computational science, at the time you read this chapter, the development of the algorithms described, or the emergence of new (and better) ones should come as no surprise.
Collapse
Affiliation(s)
- Ilona Michalik
- Department of Technology and Biotechnology of Drugs, Faculty of Pharmacy, Jagiellonian University Medical College, Kraków, Poland
| | - Kamil J Kuder
- Department of Technology and Biotechnology of Drugs, Faculty of Pharmacy, Jagiellonian University Medical College, Kraków, Poland.
| |
Collapse
|
16
|
Jarończyk M. Software for Predicting Binding Free Energy of Protein-Protein Complexes and Their Mutants. Methods Mol Biol 2024; 2780:139-147. [PMID: 38987468 DOI: 10.1007/978-1-0716-3985-6_9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/12/2024]
Abstract
Protein-protein binding affinity prediction is important for understanding complex biochemical pathways and to uncover protein interaction networks. Quantitative estimation of the binding affinity changes caused by mutations can provide critical information for protein function annotation and genetic disease diagnoses. The binding free energies of protein-protein complexes can be predicted using several computational tools. This chapter is a summary of software developed for the prediction of binding free energies for protein-protein complexes and their mutants.
Collapse
|
17
|
Shityakov S, Kravtsov V, Skorb EV, Nosonovsky M. Ergodicity Breaking and Self-Destruction of Cancer Cells by Induced Genome Chaos. ENTROPY (BASEL, SWITZERLAND) 2023; 26:37. [PMID: 38248163 PMCID: PMC10814486 DOI: 10.3390/e26010037] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/21/2023] [Revised: 12/25/2023] [Accepted: 12/28/2023] [Indexed: 01/23/2024]
Abstract
During the progression of some cancer cells, the degree of genome instability may increase, leading to genome chaos in populations of malignant cells. While normally chaos is associated with ergodicity, i.e., the state when the time averages of relevant parameters are equal to their phase space averages, the situation with cancer propagation is more complex. Chromothripsis, a catastrophic massive genomic rearrangement, is observed in many types of cancer, leading to increased mutation rates. We present an entropic model of genome chaos and ergodicity and experimental evidence that increasing the degree of chaos beyond the non-ergodic threshold may lead to the self-destruction of some tumor cells. We study time and population averages of chromothripsis frequency in cloned rhabdomyosarcomas from rat stem cells. Clones with frequency above 10% result in cell apoptosis, possibly due to mutations in the BCL2 gene. Potentially, this can be used for suppressing cancer cells by shifting them into a non-ergodic proliferation regime.
Collapse
Affiliation(s)
- Sergey Shityakov
- Infochemistry Scientific Center (ISC), ITMO University, 9 Lomonosova St., 191002 St. Petersburg, Russia;
| | - Viacheslav Kravtsov
- Infochemistry Scientific Center (ISC), ITMO University, 9 Lomonosova St., 191002 St. Petersburg, Russia;
| | - Ekaterina V. Skorb
- Infochemistry Scientific Center (ISC), ITMO University, 9 Lomonosova St., 191002 St. Petersburg, Russia;
| | - Michael Nosonovsky
- Infochemistry Scientific Center (ISC), ITMO University, 9 Lomonosova St., 191002 St. Petersburg, Russia;
- College of Engineering and Applied Science, University of Wisconsin-Milwaukee, Milwaukee, WI 53211, USA
| |
Collapse
|
18
|
Nikam R, Yugandhar K, Gromiha MM. Deep learning-based method for predicting and classifying the binding affinity of protein-protein complexes. BIOCHIMICA ET BIOPHYSICA ACTA. PROTEINS AND PROTEOMICS 2023; 1871:140948. [PMID: 37567456 DOI: 10.1016/j.bbapap.2023.140948] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/02/2023] [Revised: 08/05/2023] [Accepted: 08/08/2023] [Indexed: 08/13/2023]
Abstract
Protein-protein interactions (PPIs) play a critical role in various biological processes. Accurately estimating the binding affinity of PPIs is essential for understanding the underlying molecular recognition mechanisms. In this study, we employed a deep learning approach to predict the binding affinity (ΔG) of protein-protein complexes. To this end, we compiled a dataset of 903 protein-protein complexes, each with its corresponding experimental binding affinity, which belong to six functional classes. We extracted 8 to 20 non-redundant features from the sequence information as well as the predicted three-dimensional structures using feature selection methods for each protein functional class. Our method showed an overall mean absolute error of 1.05 kcal/mol and a correlation of 0.79 between experimental and predicted ΔG values. Additionally, we evaluated our model for discriminating high and low affinity protein-protein complexes and it achieved an accuracy of 87% with an F1 score of 0.86 using 10-fold cross-validation on the selected features. Our approach presents an efficient tool for studying PPIs and provides crucial insights into the underlying mechanisms of the molecular recognition process. The web server can be freely accessed at https://web.iitm.ac.in/bioinfo2/DeepPPAPred/index.html.
Collapse
Affiliation(s)
- Rahul Nikam
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai 600036, Tamil Nadu, India
| | - Kumar Yugandhar
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai 600036, Tamil Nadu, India; Department of Computational Biology, Cornell University, New York, USA
| | - M Michael Gromiha
- Department of Biotechnology, Bhupat and Jyoti Mehta School of Biosciences, Indian Institute of Technology Madras, Chennai 600036, Tamil Nadu, India; Department of Computer Science, Tokyo Institute of Technology, Yokohama, Japan; Department of Computer Science, National University of Singapore, Singapore.
| |
Collapse
|
19
|
Pacini MF, Perdomini A, Bulfoni Balbi C, Dinatale B, Herrera FE, Perez AR, Marcipar I. The high identity of the Trypanosoma cruzi Group-I of trans-sialidases points them as promising vaccine immunogens. Proteins 2023; 91:1444-1460. [PMID: 37323089 DOI: 10.1002/prot.26537] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2022] [Revised: 05/24/2023] [Accepted: 05/30/2023] [Indexed: 06/17/2023]
Abstract
Trans-sialidase (TS) superfamily of proteins comprises eight subgroups, being the proteins of Group-I (TS-GI) promising immunogens in vaccine approaches against Trypanosoma cruzi. Strikingly, TS-GI antigenic variability among parasite lineages and their influence on vaccine development has not been previously analyzed. Here, a search in GenBank detects 49 TS-GI indexed sequences, whereas the main infecting human different parasite discrete typing units (DTU) are represented. In silico comparison among these sequences indicate that they share an identity above 92%. Moreover, the antigenic regions (T-cell and B-cell epitopes) are conserved in most sequences or present amino acid substitutions that scarcely may alter the antigenicity. Additionally, since the generic term TS is usually used to refer to different immunogens of this broad family, a further in silico analysis of the TS-GI-derived fragments tested in preclinical vaccines was done to determine the coverage and identity among them, showing that overall amino acid identity of vaccine immunogens is high, but the segment coverage varies widely. Accordingly, strong H-2K, H-2I, and B-cell epitopes are dissimilarly represented among vaccine TS-derived fragments depending on the extension of the TG-GI sequence used. Moreover, bioinformatic analysis detected a set of 150 T-cell strong epitopes among the DTU-indexed sequences that strongly bind human HLA-I supertypes. In all currently reported experimental vaccines based on TS-GI fragments, mapping these 150 epitopes showed that they are moderately represented. However, despite vaccine epitopes do not present all the substitutions observed in the DTUs, these regions of the proteins are equally recognized by the same HLAs. Interestingly, the predictions regarding global and South American population coverage estimated in these 150 epitopes are similar to the estimations in experimental vaccines when the complete sequence of TS-GI is used as an antigen. In silico prediction also shows that a number of these MHC-I restricted T-cell strong epitopes could be also cross-recognized by HLA-I supertypes and H-2Kb or H-2Kd backgrounds, indicating that these mice may be used to improve and facilitate the development of new TS-based vaccines and suggesting an immunogenic and protective potential in humans. Further molecular docking analyses were performed to strengthen these results. Taken together, different strategies that would cover more or eventually fully of these T-cell and also B-cell epitopes to reach a high level of coverage are considered.
Collapse
Affiliation(s)
- Maria Florencia Pacini
- Laboratorio de Estudios en Enfermedad de Chagas, Instituto de Inmunología Clínica y Experimental de Rosario (IDICER-CONICET), Rosario, Argentina
| | - Adrián Perdomini
- Laboratorio de Tecnología Inmunológica, Facultad de Bioquímica y Ciencias Biológicas, Universidad Nacional del Litoral, Santa Fe, Argentina
| | - Camila Bulfoni Balbi
- Laboratorio de Estudios en Enfermedad de Chagas, Instituto de Inmunología Clínica y Experimental de Rosario (IDICER-CONICET), Rosario, Argentina
| | - Brenda Dinatale
- Laboratorio de Estudios en Enfermedad de Chagas, Instituto de Inmunología Clínica y Experimental de Rosario (IDICER-CONICET), Rosario, Argentina
| | - Fernando E Herrera
- Área de Modelado Molecular, Departamento de Física, Facultad de Bioquímica y Ciencias, Universidad Nacional del Litoral, (CONICET), Santa Fe, Argentina
| | - Ana Rosa Perez
- Laboratorio de Estudios en Enfermedad de Chagas, Instituto de Inmunología Clínica y Experimental de Rosario (IDICER-CONICET), Rosario, Argentina
- Centro de Investigación y Producción de Reactivos Biológicos (CIPReB), Facultad de Ciencias Médicas, Universidad Nacional de Rosario, Rosario, Argentina
| | - Iván Marcipar
- Laboratorio de Tecnología Inmunológica, Facultad de Bioquímica y Ciencias Biológicas, Universidad Nacional del Litoral, Santa Fe, Argentina
| |
Collapse
|
20
|
Chen Z, Wang X, Chen X, Huang J, Wang C, Wang J, Wang Z. Accelerating therapeutic protein design with computational approaches toward the clinical stage. Comput Struct Biotechnol J 2023; 21:2909-2926. [PMID: 38213894 PMCID: PMC10781723 DOI: 10.1016/j.csbj.2023.04.027] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2023] [Revised: 04/11/2023] [Accepted: 04/27/2023] [Indexed: 01/13/2024] Open
Abstract
Therapeutic protein, represented by antibodies, is of increasing interest in human medicine. However, clinical translation of therapeutic protein is still largely hindered by different aspects of developability, including affinity and selectivity, stability and aggregation prevention, solubility and viscosity reduction, and deimmunization. Conventional optimization of the developability with widely used methods, like display technologies and library screening approaches, is a time and cost-intensive endeavor, and the efficiency in finding suitable solutions is still not enough to meet clinical needs. In recent years, the accelerated advancement of computational methodologies has ushered in a transformative era in the field of therapeutic protein design. Owing to their remarkable capabilities in feature extraction and modeling, the integration of cutting-edge computational strategies with conventional techniques presents a promising avenue to accelerate the progression of therapeutic protein design and optimization toward clinical implementation. Here, we compared the differences between therapeutic protein and small molecules in developability and provided an overview of the computational approaches applicable to the design or optimization of therapeutic protein in several developability issues.
Collapse
Affiliation(s)
- Zhidong Chen
- Department of Pathology, The Eighth Affiliated Hospital, Sun Yat-sen University, Shenzhen 518033, China
- School of Pharmaceutical Sciences, Shenzhen Campus of Sun Yat-sen University, Shenzhen 518107, China
| | - Xinpei Wang
- School of Pharmaceutical Sciences, Shenzhen Campus of Sun Yat-sen University, Shenzhen 518107, China
| | - Xu Chen
- School of Pharmaceutical Sciences, Shenzhen Campus of Sun Yat-sen University, Shenzhen 518107, China
| | - Juyang Huang
- School of Pharmaceutical Sciences, Shenzhen Campus of Sun Yat-sen University, Shenzhen 518107, China
| | - Chenglin Wang
- Shenzhen Qiyu Biotechnology Co., Ltd, Shenzhen 518107, China
| | - Junqing Wang
- School of Pharmaceutical Sciences, Shenzhen Campus of Sun Yat-sen University, Shenzhen 518107, China
| | - Zhe Wang
- Department of Pathology, The Eighth Affiliated Hospital, Sun Yat-sen University, Shenzhen 518033, China
| |
Collapse
|
21
|
Guo Z, Yamaguchi R. Machine learning methods for protein-protein binding affinity prediction in protein design. FRONTIERS IN BIOINFORMATICS 2022; 2:1065703. [PMID: 36591334 PMCID: PMC9800603 DOI: 10.3389/fbinf.2022.1065703] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2022] [Accepted: 12/01/2022] [Indexed: 12/23/2022] Open
Abstract
Protein-protein interactions govern a wide range of biological activity. A proper estimation of the protein-protein binding affinity is vital to design proteins with high specificity and binding affinity toward a target protein, which has a variety of applications including antibody design in immunotherapy, enzyme engineering for reaction optimization, and construction of biosensors. However, experimental and theoretical modelling methods are time-consuming, hinder the exploration of the entire protein space, and deter the identification of optimal proteins that meet the requirements of practical applications. In recent years, the rapid development in machine learning methods for protein-protein binding affinity prediction has revealed the potential of a paradigm shift in protein design. Here, we review the prediction methods and associated datasets and discuss the requirements and construction methods of binding affinity prediction models for protein design.
Collapse
Affiliation(s)
- Zhongliang Guo
- Division of Cancer Systems Biology, Aichi Cancer Center Research Institute, Nagoya, Aichi, Japan
| | - Rui Yamaguchi
- Division of Cancer Systems Biology, Aichi Cancer Center Research Institute, Nagoya, Aichi, Japan,Division of Cancer Informatics, Nagoya University Graduate School of Medicine, Nagoya, Aichi, Japan,*Correspondence: Rui Yamaguchi,
| |
Collapse
|
22
|
Akagawa M, Shirai T, Sada M, Nagasawa N, Kondo M, Takeda M, Nagasawa K, Kimura R, Okayama K, Hayashi Y, Sugai T, Tsugawa T, Ishii H, Kawashima H, Katayama K, Ryo A, Kimura H. Detailed Molecular Interactions between Respiratory Syncytial Virus Fusion Protein and the TLR4/MD-2 Complex In Silico. Viruses 2022; 14:v14112382. [PMID: 36366480 PMCID: PMC9694959 DOI: 10.3390/v14112382] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2022] [Revised: 10/19/2022] [Accepted: 10/27/2022] [Indexed: 01/31/2023] Open
Abstract
Molecular interactions between respiratory syncytial virus (RSV) fusion protein (F protein) and the cellular receptor Toll-like receptor 4 (TLR4) and myeloid differentiation factor-2 (MD-2) protein complex are unknown. Thus, to reveal the detailed molecular interactions between them, in silico analyses were performed using various bioinformatics techniques. The present simulation data showed that the neutralizing antibody (NT-Ab) binding sites in both prefusion and postfusion proteins at sites II and IV were involved in the interactions between them and the TLR4 molecule. Moreover, the binding affinity between postfusion proteins and the TLR4/MD-2 complex was higher than that between prefusion proteins and the TLR4/MD-2 complex. This increased binding affinity due to conformational changes in the F protein may be able to form syncytium in RSV-infected cells. These results may contribute to better understand the infectivity and pathogenicity (syncytium formation) of RSV.
Collapse
Affiliation(s)
- Mao Akagawa
- Department of Health Science, Graduate School of Health Sciences, Gunma Paz University, Takasaki-shi 370-0006, Japan
| | - Tatsuya Shirai
- Advanced Medical Science Research Center, Gunma Paz University Research Institute, Shibukawa-shi 377-0008, Japan
| | - Mitsuru Sada
- Department of Health Science, Graduate School of Health Sciences, Gunma Paz University, Takasaki-shi 370-0006, Japan
| | - Norika Nagasawa
- Department of Health Science, Graduate School of Health Sciences, Gunma Paz University, Takasaki-shi 370-0006, Japan
| | - Mayumi Kondo
- Department of Clinical Engineering, Faculty of Medical Technology, Gunma Paz University, Takasaki-shi 370-0006, Japan
| | - Makoto Takeda
- Department of Virology III, National Institute of Infectious Diseases, Musashimurayama-shi, Tokyo 208-0011, Japan
| | - Koo Nagasawa
- Department of Pediatrics, Graduate School of Medical Science, Chiba University, Chiba-shi 260-8670, Japan
| | - Ryusuke Kimura
- Advanced Medical Science Research Center, Gunma Paz University Research Institute, Shibukawa-shi 377-0008, Japan
- Department of Bacteriology, Graduate School of Medicine, Gunma University, Maebashi-shi 371-8514, Japan
| | - Kaori Okayama
- Department of Health Science, Graduate School of Health Sciences, Gunma Paz University, Takasaki-shi 370-0006, Japan
| | - Yuriko Hayashi
- Department of Health Science, Graduate School of Health Sciences, Gunma Paz University, Takasaki-shi 370-0006, Japan
| | - Toshiyuki Sugai
- Department of Nursing Science, Graduate School of Health Science, Hiroshima University, Hiroshima-shi 734-8551, Japan
| | - Takeshi Tsugawa
- Department of Pediatrics, School of Medicine, Sapporo Medical University, Sapporo-shi 060-8543, Japan
| | - Haruyuki Ishii
- Department of Respiratory Medicine, School of Medicine, Kyorin University, Mitaka-shi, Tokyo 181-8611, Japan
| | - Hisashi Kawashima
- Department of Pediatrics and Adolescent Medicine, Tokyo Medical University, Shinjuku-ku, Tokyo 160-0023, Japan
| | - Kazuhiko Katayama
- Laboratory of Viral Infection Control, Graduate School of Infection Control Sciences, Ōmura Satoshi Memorial Institute, Kitasato University, Minato-ku, Tokyo 108-8641, Japan
| | - Akihide Ryo
- Department of Microbiology, School of Medicine, Yokohama City University, Yokohama-shi 236-0004, Japan
| | - Hirokazu Kimura
- Department of Health Science, Graduate School of Health Sciences, Gunma Paz University, Takasaki-shi 370-0006, Japan
- Advanced Medical Science Research Center, Gunma Paz University Research Institute, Shibukawa-shi 377-0008, Japan
- Correspondence: ; Tel.: +81-27-365-3366; Fax: +81-42-247-8077
| |
Collapse
|
23
|
Chang L, Mondal A, Perez A. Towards rational computational peptide design. FRONTIERS IN BIOINFORMATICS 2022; 2:1046493. [PMID: 36338806 PMCID: PMC9634169 DOI: 10.3389/fbinf.2022.1046493] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2022] [Accepted: 10/11/2022] [Indexed: 11/16/2022] Open
Abstract
Peptides are prevalent in biology, mediating as many as 40% of protein-protein interactions, and involved in other cellular functions such as transport and signaling. Their ability to bind with high specificity make them promising therapeutical agents with intermediate properties between small molecules and large biologics. Beyond their biological role, peptides can be programmed to self-assembly, and they are already being used for functions as diverse as oligonuclotide delivery, tissue regeneration or as drugs. However, the transient nature of their interactions has limited the number of structures and knowledge of binding affinities available-and their flexible nature has limited the success of computational pipelines that predict the structures and affinities of these molecules. Fortunately, recent advances in experimental and computational pipelines are creating new opportunities for this field. We are starting to see promising predictions of complex structures, thermodynamic and kinetic properties. We believe in the following years this will lead to robust rational peptide design pipelines with success similar to those applied for small molecule drug discovery.
Collapse
Affiliation(s)
- Liwei Chang
- Department of Chemistry, University of Florida, Gainesville, FL, United States,Quantum Theory Project, University of Florida, Gainesville, FL, United States
| | - Arup Mondal
- Department of Chemistry, University of Florida, Gainesville, FL, United States,Quantum Theory Project, University of Florida, Gainesville, FL, United States
| | - Alberto Perez
- Department of Chemistry, University of Florida, Gainesville, FL, United States,Quantum Theory Project, University of Florida, Gainesville, FL, United States,*Correspondence: Alberto Perez,
| |
Collapse
|