1
|
Lyu Y, He R, Hu J, Wang C, Gong X. Prediction of the tetramer protein complex interaction based on CNN and SVM. Front Genet 2023; 14:1076904. [PMID: 36777731 PMCID: PMC9909274 DOI: 10.3389/fgene.2023.1076904] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2022] [Accepted: 01/16/2023] [Indexed: 01/27/2023] Open
Abstract
Protein-protein interactions play an important role in life activities. The study of protein-protein interactions helps to better understand the mechanism of protein complex interaction, which is crucial for drug design, protein function annotation and three-dimensional structure prediction of protein complexes. In this paper, we study the tetramer protein complex interaction. The research has two parts: The first part is to predict the interaction between chains of the tetramer protein complex. In this part, we proposed a feature map to represent a sample generated by two chains of the tetramer protein complex, and constructed a Convolutional Neural Network (CNN) model to predict the interaction between chains of the tetramer protein complex. The AUC value of testing set is 0.6263, which indicates that our model can be used to predict the interaction between chains of the tetramer protein complex. The second part is to predict the tetramer protein complex interface residue pairs. In this part, we proposed a Support Vector Machine (SVM) ensemble method based on under-sampling and ensemble method to predict the tetramer protein complex interface residue pairs. In the top 10 predictions, when at least one protein-protein interaction interface is correctly predicted, the accuracy of our method is 82.14%. The result shows that our method is effective for the prediction of the tetramer protein complex interface residue pairs.
Collapse
Affiliation(s)
- Yanfen Lyu
- Department of Mathematics and PhysicsScience and Engineering, Hebei University of Engineering, Handan, China
| | - Ruonan He
- School of Information, Renmin University of China, Beijing, China
| | - Jingjing Hu
- Department of Mathematics and PhysicsScience and Engineering, Hebei University of Engineering, Handan, China
| | - Chunxia Wang
- School of Landscape and Ecological Engineering, Hebei University of Engineering, Handan, China,*Correspondence: Chunxia Wang, ; Xinqi Gong,
| | - Xinqi Gong
- Mathematical Intelligence Application Lab, Institute for Mathematical Sciences, School of Math, Renmin University of China, Beijing, China,Beijing Academy of Artificial Intelligence, Beijing, China,*Correspondence: Chunxia Wang, ; Xinqi Gong,
| |
Collapse
|
2
|
Protein-protein interaction and non-interaction predictions using gene sequence natural vector. Commun Biol 2022; 5:652. [PMID: 35780196 PMCID: PMC9250521 DOI: 10.1038/s42003-022-03617-0] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2022] [Accepted: 06/21/2022] [Indexed: 12/02/2022] Open
Abstract
Predicting protein–protein interaction and non-interaction are two important different aspects of multi-body structure predictions, which provide vital information about protein function. Some computational methods have recently been developed to complement experimental methods, but still cannot effectively detect real non-interacting protein pairs. We proposed a gene sequence-based method, named NVDT (Natural Vector combine with Dinucleotide and Triplet nucleotide), for the prediction of interaction and non-interaction. For protein–protein non-interactions (PPNIs), the proposed method obtained accuracies of 86.23% for Homo sapiens and 85.34% for Mus musculus, and it performed well on three types of non-interaction networks. For protein-protein interactions (PPIs), we obtained accuracies of 99.20, 94.94, 98.56, 95.41, and 94.83% for Saccharomyces cerevisiae, Drosophila melanogaster, Helicobacter pylori, Homo sapiens, and Mus musculus, respectively. Furthermore, NVDT outperformed established sequence-based methods and demonstrated high prediction results for cross-species interactions. NVDT is expected to be an effective approach for predicting PPIs and PPNIs. Protein-protein non-interactions and interactions are distinguished and predicted by gene sequence using single nucleotide and contiguous nucleotides combined with machine learning models.
Collapse
|
3
|
Zhou X, Song H, Li J. Residue-Frustration-Based Prediction of Protein-Protein Interactions Using Machine Learning. J Phys Chem B 2022; 126:1719-1727. [PMID: 35170967 DOI: 10.1021/acs.jpcb.1c10525] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
The study of protein-protein interactions (PPIs) is important in understanding the function of proteins. However, it is still a challenge to investigate the transient protein-protein interaction by experiments. Hence, the computational prediction for protein-protein interactions draws growing attention. Statistics-based features have been widely used in the studies of protein structure prediction and protein folding. Due to the scarcity of experimental data of PPI, it is difficult to construct a conventional statistical feature for PPI prediction, and the application of statistics-based features is very limited in this field. In this paper, we explored the application of frustration, a statistical potential, in PPI prediction. By comparing the energetic contribution of the extra stabilization energy from a given residue pair in the native protein with the statistics of the energies, we obtained the residue pair's frustration index. By calculating the number of residue pairs with a high frustration index, the highly frustrated density, a residue-frustration-based feature, was then obtained to describe the tendency of residues to be involved in PPI. Highly frustrated density, as well as structure-based features, were then used to describe protein residues and combined with the long short-term memory (LSTM) neural network to predict PPI residue pairs. Our model correctly predicted 75% dimers when only the top 2‰ residue pairs were selected in each dimer. Our model, which considers the statistics-based features, is significantly different from the models based on the chemical features of residues. We found that frustration can effectively describe the tendency of residue to be involved in PPI. Frustration-based features can replace chemical features to combine with machine learning and realize the better performance of PPI prediction. It reveals the great potential of statistical potential such as frustration in PPI prediction.
Collapse
Affiliation(s)
- Xiaozhou Zhou
- Zhejiang Province Key Laboratory of Quantum Technology and Device, Institute of Quantitative Biology, Department of Physics, Zhejiang University, Hangzhou 310027, Zhejiang, China
| | - Haoyu Song
- Zhejiang Province Key Laboratory of Quantum Technology and Device, Institute of Quantitative Biology, Department of Physics, Zhejiang University, Hangzhou 310027, Zhejiang, China
| | - Jingyuan Li
- Zhejiang Province Key Laboratory of Quantum Technology and Device, Institute of Quantitative Biology, Department of Physics, Zhejiang University, Hangzhou 310027, Zhejiang, China
| |
Collapse
|
4
|
Hong Z, Liu J, Chen Y. An interpretable machine learning method for homo-trimeric protein interface residue-residue interaction prediction. Biophys Chem 2021; 278:106666. [PMID: 34418678 DOI: 10.1016/j.bpc.2021.106666] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2021] [Revised: 08/09/2021] [Accepted: 08/09/2021] [Indexed: 12/29/2022]
Abstract
Protein-protein interaction plays an important role in life activities. A more fine-grained analysis, such as residues and atoms level, will better benefit us to understand the mechanism for inter-protein interaction and drug design. The development of efficient computational methods to reduce trials and errors, as well as assisting experimental researchers to determine the complex structure are some of the ongoing studies in the field. The research of trimer protein interface, especially homotrimer, has been rarely studied. In this paper, we proposed an interpretable machine learning method for homo-trimeric protein interface residue pairs prediction. The structure, sequence, and physicochemical information are intergraded as feature input fed to model for training. Graph model is utilized to present spatial information for intra-protein. Matrix factorization captures the different features' interactions. Kernel function is designed to auto-acquire the adjacent information of our target residue pairs. The accuracy rate achieves 54.5% in an independent test set. Sequence and structure alignment exhibit the ability of model self-study. Our model indicates the biological significance between sequence and structure, and could be auxiliary for reducing trials and errors in the fields of protein complex determination and protein-protein docking, etc. SIGNIFICANCE: Protein complex structures are significant for understanding protein function and promising functional protein design. With data increasing, some computational tools have been developed for protein complex residue contact prediction, which is one of the most significant steps for complex structure prediction. But for homo-trimeric protein, the sequence-based deep learning predictors are infeasible for homologous sequences, and the algorithm black box prevents us from understanding of each step operation. In this way, we propose an interpreting machine learning method for homo-trimeric protein interface residue-residue interaction prediction, and the predictor shows a good performance. Our work provides a computational auxiliary way for determining the homo-trimeric proteins interface residue pairs which will be further verified by wet experiments, and and gives a hand for the downstream works, such as protein-protein docking, protein complex structure prediction and drug design.
Collapse
Affiliation(s)
- Zhonghua Hong
- Jiaxing Hospital of Traditional Chinese Medicine, Jiaxing University, Jiaxing 314001, PR China.
| | - Jiale Liu
- Academy for Advanced Interdisciplinary Studies, Peking University, Beijing 100871, PR China
| | - Yinggao Chen
- Shantou Central Hospital, Shantou 515041, PR China.
| |
Collapse
|
5
|
Gheyouche E, Bagueneau M, Loirand G, Offmann B, Téletchéa S. Structural Design and Analysis of the RHOA-ARHGEF1 Binding Mode: Challenges and Applications for Protein-Protein Interface Prediction. Front Mol Biosci 2021; 8:643728. [PMID: 34109211 PMCID: PMC8181724 DOI: 10.3389/fmolb.2021.643728] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2020] [Accepted: 04/13/2021] [Indexed: 01/02/2023] Open
Abstract
The interaction between two proteins may involve local movements, such as small side-chains re-positioning or more global allosteric movements, such as domain rearrangement. We studied how one can build a precise and detailed protein-protein interface using existing protein-protein docking methods, and how it can be possible to enhance the initial structures using molecular dynamics simulations and data-driven human inspection. We present how this strategy was applied to the modeling of RHOA-ARHGEF1 interaction using similar complexes of RHOA bound to other members of the Rho guanine nucleotide exchange factor family for comparative assessment. In parallel, a more crude approach based on structural superimposition and molecular replacement was also assessed. Both models were then successfully refined using molecular dynamics simulations leading to protein structures where the major data from scientific literature could be recovered. We expect that the detailed strategy used in this work will prove useful for other protein-protein interface design. The RHOA-ARHGEF1 interface modeled here will be extremely useful for the design of inhibitors targeting this protein-protein interaction (PPI).
Collapse
Affiliation(s)
| | | | - Gervaise Loirand
- Université de Nantes, CHU Nantes, CNRS, Inserm, L'institut Du Thorax, Nantes, France
| | | | | |
Collapse
|
6
|
A Two-Layer SVM Ensemble-Classifier to Predict Interface Residue Pairs of Protein Trimers. MOLECULES (BASEL, SWITZERLAND) 2020; 25:molecules25194353. [PMID: 32977371 PMCID: PMC7582526 DOI: 10.3390/molecules25194353] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/11/2020] [Revised: 09/16/2020] [Accepted: 09/18/2020] [Indexed: 11/29/2022]
Abstract
Study of interface residue pairs is important for understanding the interactions between monomers inside a trimer protein–protein complex. We developed a two-layer support vector machine (SVM) ensemble-classifier that considers physicochemical and geometric properties of amino acids and the influence of surrounding amino acids. Different descriptors and different combinations may give different prediction results. We propose feature combination engineering based on correlation coefficients and F-values. The accuracy of our method is 65.38% in independent test set, indicating biological significance. Our predictions are consistent with the experimental results. It shows the effectiveness and reliability of our method to predict interface residue pairs of protein trimers.
Collapse
|
7
|
Bardsley EN, Paterson DJ. Neurocardiac regulation: from cardiac mechanisms to novel therapeutic approaches. J Physiol 2020; 598:2957-2976. [PMID: 30307615 PMCID: PMC7496613 DOI: 10.1113/jp276962] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2018] [Accepted: 10/02/2018] [Indexed: 12/15/2022] Open
Abstract
Cardiac sympathetic overactivity is a well-established contributor to the progression of neurogenic hypertension and heart failure, yet the underlying pathophysiology remains unclear. Recent studies have highlighted the importance of acutely regulated cyclic nucleotides and their effectors in the control of intracellular calcium and exocytosis. Emerging evidence now suggests that a significant component of sympathetic overactivity and enhanced transmission may arise from impaired cyclic nucleotide signalling, resulting from compromised phosphodiesterase activity, as well as alterations in receptor-coupled G-protein activation. In this review, we address some of the key cellular and molecular pathways that contribute to sympathetic overactivity in hypertension and discuss their potential for therapeutic targeting.
Collapse
Affiliation(s)
- E. N. Bardsley
- Wellcome Trust OXION Initiative in Ion Channels and DiseaseOxfordUK
- Burdon Sanderson Cardiac Science Centre, Department of PhysiologyAnatomy and Genetics, University of OxfordOxfordOX1 3PTUK
| | - D. J. Paterson
- Wellcome Trust OXION Initiative in Ion Channels and DiseaseOxfordUK
- Burdon Sanderson Cardiac Science Centre, Department of PhysiologyAnatomy and Genetics, University of OxfordOxfordOX1 3PTUK
| |
Collapse
|
8
|
Lyu Y, Huang H, Gong X. A Novel Index of Contact Frequency from Noise Protein-Protein Interaction Data Help for Accurate Interface Residue Pair Prediction. Interdiscip Sci 2020; 12:204-216. [PMID: 32185690 DOI: 10.1007/s12539-020-00364-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2019] [Revised: 01/23/2020] [Accepted: 02/24/2020] [Indexed: 11/24/2022]
Abstract
Protein-protein interactions are important for most biological processes and have been studied for decades. However, the detailed formation mechanism of protein-protein interaction interface is still ambiguous, which makes it difficult to accurately predict the protein-protein interaction interface residue pairs. Here, we extract the interface residue-residue contacts from the decoys in the ZDOCK protein-protein complex decoy set with RMSD mostly larger than 3 Å. To accurately compute the interface residue-residue contacts, we define a new constant called interface residue pairs frequency, which counts the atom contact numbers between two interface residues. We normalize interface residue pairs frequency to pick out the top residue-residue pairs from all the possible pairs preferential to be on correct protein-protein interaction interface. When tested on 37 protein dimers from the decoy set where most decoys are incorrect, our method successfully predicts 30 protein dimers with a success rate of up to 81.1%. Higher accuracy than some other state-of-the-art methods confirmed the performance of our method.
Collapse
Affiliation(s)
- Yanfen Lyu
- Mathematical Intelligence Application Lab, Institute for Mathematical Sciences, School of Math, Renmin University of China, Beijing, 100872, China
| | - He Huang
- Mathematical Intelligence Application Lab, Institute for Mathematical Sciences, School of Math, Renmin University of China, Beijing, 100872, China
| | - Xinqi Gong
- Mathematical Intelligence Application Lab, Institute for Mathematical Sciences, School of Math, Renmin University of China, Beijing, 100872, China.
| |
Collapse
|
9
|
Liu J, Gong X. Attention mechanism enhanced LSTM with residual architecture and its application for protein-protein interaction residue pairs prediction. BMC Bioinformatics 2019; 20:609. [PMID: 31775612 PMCID: PMC6882172 DOI: 10.1186/s12859-019-3199-1] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2018] [Accepted: 11/06/2019] [Indexed: 11/25/2022] Open
Abstract
Background Recurrent neural network(RNN) is a good way to process sequential data, but the capability of RNN to compute long sequence data is inefficient. As a variant of RNN, long short term memory(LSTM) solved the problem in some extent. Here we improved LSTM for big data application in protein-protein interaction interface residue pairs prediction based on the following two reasons. On the one hand, there are some deficiencies in LSTM, such as shallow layers, gradient explosion or vanishing, etc. With a dramatic data increasing, the imbalance between algorithm innovation and big data processing has been more serious and urgent. On the other hand, protein-protein interaction interface residue pairs prediction is an important problem in biology, but the low prediction accuracy compels us to propose new computational methods. Results In order to surmount aforementioned problems of LSTM, we adopt the residual architecture and add attention mechanism to LSTM. In detail, we redefine the block, and add a connection from front to back in every two layers and attention mechanism to strengthen the capability of mining information. Then we use it to predict protein-protein interaction interface residue pairs, and acquire a quite good accuracy over 72%. What’s more, we compare our method with random experiments, PPiPP, standard LSTM, and some other machine learning methods. Our method shows better performance than the methods mentioned above. Conclusion We present an attention mechanism enhanced LSTM with residual architecture, and make deeper network without gradient vanishing or explosion to a certain extent. Then we apply it to a significant problem– protein-protein interaction interface residue pairs prediction and obtain a better accuracy than other methods. Our method provides a new approach for protein-protein interaction computation, which will be helpful for related biomedical researches.
Collapse
Affiliation(s)
- Jiale Liu
- Mathematics Intelligence Application Lab, Institute for Mathematical Sciences, Renmin University of China, No. 59 Zhongguancun Street,Haidian District, Beijing, China
| | - Xinqi Gong
- Mathematics Intelligence Application Lab, Institute for Mathematical Sciences, Renmin University of China, No. 59 Zhongguancun Street,Haidian District, Beijing, China. .,Center for Mathematical Sciences and Applications,Harvard University, Boston, MA02138, USA.
| |
Collapse
|
10
|
Molecular Dynamics Simulation Reveals Exposed Residues in the Ligand-Binding Domain of the Low-Density Lipoprotein Receptor that Interacts with Vesicular Stomatitis Virus-G Envelope. Viruses 2019; 11:v11111063. [PMID: 31731579 PMCID: PMC6893590 DOI: 10.3390/v11111063] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2019] [Revised: 10/26/2019] [Accepted: 10/28/2019] [Indexed: 01/20/2023] Open
Abstract
Familial hypercholesterolemia (FH) is an autosomal dominant disease most often caused by mutations in the low-density lipoprotein receptor (LDLR) gene, which consists of 18 exons spanning 45 kb and codes for a precursor protein of 860 amino acids. Mutations in the LDLR gene lead to a reduced hepatic clearance of LDL as well as a high risk of coronary artery disease (CAD) and sudden cardiac death (SCD). Recently, LDLR transgenes have generated interest as potential therapeutic agents. However, LDLR packaging using a lentiviral vector (LVV) system pseudotyped with a vesicular stomatitis virus (VSV)-G envelope is not efficient. In this study, we modified the LVV system to improve transduction efficiency and investigated the LDLR regions responsible for transduction inhibition. Transduction efficiency of 293T cells with a 5′-LDLReGFP-3′ fusion construct was only 1.55% compared to 42.32% for the eGFP construct. Moreover, co-expression of LDLR affected eGFP packaging. To determine the specific region of the LDLR protein responsible for packaging inhibition, we designed constructs with mutations or sequential deletions at the 3′ and 5′ ends of LDLR cDNA. All constructs except one without the ligand-binding domain (LBD) (pWoLBD–eGFP) resulted in low transduction efficiency, despite successful packaging of viral RNA in the VSV envelope, as confirmed through RT-PCR. When we evaluated a direct interaction between LDLR and the VSV envelope glycoprotein using MD simulation and protein–protein interactions, we uncovered Val119, Thr120, Thr67, and Thr118 as exposed residues in the LDLR receptor that interact with the VSV protein. Together, our results suggest that the LBD of LDLR interacts with the VSV-G protein during viral packaging, which significantly reduces transduction efficiency.
Collapse
|
11
|
Nilofer C, Sukhwal A, Mohanapriya A, Sakharkar MK, Kangueane P. Small protein-protein interfaces rich in electrostatic are often linked to regulatory function. J Biomol Struct Dyn 2019; 38:3260-3279. [PMID: 31495333 DOI: 10.1080/07391102.2019.1657040] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
Protein-protein interaction (PPI) is critical for several biological functions in living cells through the formation of an interface. Therefore, it is of interest to characterize protein-protein interfaces using an updated non-redundant structural dataset of 2557 homo (identical subunits) and 393 hetero (different subunits) dimer protein complexes determined by X-ray crystallography. We analyzed the interfaces using van der Waals (vdW), hydrogen bonding and electrostatic energies. Results show that on average homo and hetero interfaces are similar. Hence, we further grouped the 2950 interfaces based on percentage vdW to total energies into dominant (≥60%) and sub-dominant (<60%) vdW interfaces. Majority (92%) of interfaces have dominant vdW energy with large interface size (146 ± 87 (homo) and 137 ± 76 (hetero) residues) and interface area (1622 ± 1135 Å2 (homo) and 1579 ± 1060 Å2 (hetero)). However, a proportion (8%) of interfaces have sub-dominant vdW energy with small interface size (85 ± 46 (homo) and 88 ± 36 (hetero) residues) and interface area (823 ± 538 Å2 (homo) and 881 ± 377 Å2 (hetero)). It is found that large interfaces have two-fold more interface area and interface size than small interfaces with increasing hydrogen bonding energy to interface size. However, small interfaces have three-fold more electrostatics energy than large interfaces with increasing electrostatics to interface size. Thus, 8% of complexes having small interfaces with limited interface area and sub-dominant vdW energy are rich in electrostatics. It is interesting to observe that complexes having small interfaces are often associated with regulatory function. Hence, the observed structural features with known molecular function provide insights for the better understanding of PPI.Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Christina Nilofer
- Biomedical Informatics (P) Ltd., Pondicherry, India.,School of Biosciences & Technology, VIT University, Vellore, Tamil Nadu, India
| | - Anshul Sukhwal
- National Centre for Biological Sciences (NCBS), Bangalore, India
| | | | | | | |
Collapse
|
12
|
Cicaloni V, Trezza A, Pettini F, Spiga O. Applications of in Silico Methods for Design and Development of Drugs Targeting Protein-Protein Interactions. Curr Top Med Chem 2019; 19:534-554. [PMID: 30836920 DOI: 10.2174/1568026619666190304153901] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2018] [Revised: 01/02/2019] [Accepted: 01/25/2019] [Indexed: 11/22/2022]
Abstract
BACKGROUND Identification of Protein-Protein Interactions (PPIs) is a major challenge in modern molecular biology and biochemistry research, due to the unquestionable role of proteins in cells, biological process and pathological states. Over the past decade, the PPIs have evolved from being considered a highly challenging field of research to being investigated and examined as targets for pharmacological intervention. OBJECTIVE Comprehension of protein interactions is crucial to known how proteins come together to build signalling pathways, to carry out their functions, or to cause diseases, when deregulated. Multiplicity and great amount of PPIs structures offer a huge number of new and potential targets for the treatment of different diseases. METHODS Computational techniques are becoming predominant in PPIs studies for their effectiveness, flexibility, accuracy and cost. As a matter of fact, there are effective in silico approaches which are able to identify PPIs and PPI site. Such methods for computational target prediction have been developed through molecular descriptors and data-mining procedures. RESULTS In this review, we present different types of interactions between protein-protein and the application of in silico methods for design and development of drugs targeting PPIs. We described computational approaches for the identification of possible targets on protein surface and to detect of stimulator/ inhibitor molecules. CONCLUSION A deeper study of the most recent bioinformatics methodologies for PPIs studies is vital for a better understanding of protein complexes and for discover new potential PPI modulators in therapeutic intervention.
Collapse
Affiliation(s)
- Vittoria Cicaloni
- Department of Biotechnology, Chemistry and Pharmacy (Dept. of Excellence 2018-2022), University of Siena, via A. Moro 2, 53100 Siena, Italy.,Toscana Life Sciences Foundation, via Fiorentina 1, 53100 Siena, Italy
| | - Alfonso Trezza
- Department of Biotechnology, Chemistry and Pharmacy (Dept. of Excellence 2018-2022), University of Siena, via A. Moro 2, 53100 Siena, Italy
| | - Francesco Pettini
- Department of Biotechnology, Chemistry and Pharmacy (Dept. of Excellence 2018-2022), University of Siena, via A. Moro 2, 53100 Siena, Italy
| | - Ottavia Spiga
- Department of Biotechnology, Chemistry and Pharmacy (Dept. of Excellence 2018-2022), University of Siena, via A. Moro 2, 53100 Siena, Italy
| |
Collapse
|
13
|
The Symmetric Difference Distance: A New Way to Evaluate the Evolution of Interfaces along Molecular Dynamics Trajectories; Application to Influenza Hemagglutinin. Symmetry (Basel) 2019. [DOI: 10.3390/sym11050662] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
We propose a new and easy approach to evaluate structural dissimilarities between frames issued from molecular dynamics, and we test this methodology on human hemagglutinin. This protein is responsible for the entry of the influenza virus into the host cell by endocytosis, and this virus causes seasonal epidemics of infectious disease, which can be estimated to result in hundreds of thousands of deaths each year around the world. We computed the three interfaces between the three protomers of the hemagglutinin H1 homotrimer (PDB code: 1RU7) for each of its conformations generated from molecular dynamics simulation. For each conformation, we considered the set of residues involved in the union of these three interfaces. The dissimilarity between each pair of conformations was measured with our new methodology, the symmetric difference distance between the associated set of residues. The main advantages of the full procedure are: (i) it is parameter free; (ii) no spatial alignment is needed and (iii) it is simple enough so that it can be implemented by a beginner in programming. It is shown to be a relevant tool to follow the evolution of the conformation along the molecular dynamics trajectories.
Collapse
|
14
|
Wong AKC, Sze-To HY, Johanning GL. Pattern to Knowledge: Deep Knowledge-Directed Machine Learning for Residue-Residue Interaction Prediction. Sci Rep 2018; 8:14841. [PMID: 30287904 PMCID: PMC6172270 DOI: 10.1038/s41598-018-32834-z] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2018] [Accepted: 09/17/2018] [Indexed: 11/21/2022] Open
Abstract
Residue-residue close contact (R2R-C) data procured from three-dimensional protein-protein interaction (PPI) experiments is currently used for predicting residue-residue interaction (R2R-I) in PPI. However, due to complex physiochemical environments, R2R-I incidences, facilitated by multiple factors, are usually entangled in the source environment and masked in the acquired data. Here we present a novel method, P2K (Pattern to Knowledge), to disentangle R2R-I patterns and render much succinct discriminative information expressed in different specific R2R-I statistical/functional spaces. Since such knowledge is not visible in the data acquired, we refer to it as deep knowledge. Leveraging the deep knowledge discovered to construct machine learning models for sequence-based R2R-I prediction, without trial-and-error combination of the features over external knowledge of sequences, our R2R-I predictor was validated for its effectiveness under stringent leave-one-complex-out-alone cross-validation in a benchmark dataset, and was surprisingly demonstrated to perform better than an existing sequence-based R2R-I predictor by 28% (p: 1.9E-08). P2K is accessible via our web server on https://p2k.uwaterloo.ca .
Collapse
Affiliation(s)
- Andrew K C Wong
- Department of Systems Design Engineering, University of Waterloo, 200 University Avenue West, Waterloo, N2L 3G1, Ontario, Canada.
| | - Ho Yin Sze-To
- Department of Systems Design Engineering, University of Waterloo, 200 University Avenue West, Waterloo, N2L 3G1, Ontario, Canada
| | - Gary L Johanning
- Biosciences Division, SRI International, 333 Ravenswood Ave, Menlo Park, CA, USA
| |
Collapse
|