1
|
Osetrina DA, Kusova AM, Bikmullin AG, Klochkova EA, Yulmetov AR, Semenova EA, Mukhametzyanov TA, Usachev KS, Klochkov VV, Blokhin DS. Extent of N-Terminus Folding of Semenogelin 1 Cleavage Product Determines Tendency to Amyloid Formation. Int J Mol Sci 2023; 24:ijms24108949. [PMID: 37240295 DOI: 10.3390/ijms24108949] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2023] [Revised: 05/15/2023] [Accepted: 05/16/2023] [Indexed: 05/28/2023] Open
Abstract
It is known that four peptide fragments of predominant protein in human semen Semenogelin 1 (SEM1) (SEM1(86-107), SEM1(68-107), SEM1(49-107) and SEM1(45-107)) are involved in fertilization and amyloid formation processes. In this work, the structure and dynamic behavior of SEM1(45-107) and SEM1(49-107) peptides and their N-domains were described. According to ThT fluorescence spectroscopy data, it was shown that the amyloid formation of SEM1(45-107) starts immediately after purification, which is not observed for SEM1(49-107). Seeing that the peptide amino acid sequence of SEM1(45-107) differs from SEM1(49-107) only by the presence of four additional amino acid residues in the N domain, these domains of both peptides were obtained via solid-phase synthesis and the difference in their dynamics and structure was investigated. SEM1(45-67) and SEM1(49-67) showed no principal difference in dynamic behavior in water solution. Furthermore, we obtained mostly disordered structures of SEM1(45-67) and SEM1(49-67). However, SEM1(45-67) contains a helix (E58-K60) and helix-like (S49-Q51) fragments. These helical fragments may rearrange into β-strands during amyloid formation process. Thus, the difference in full-length peptides' (SEM1(45-107) and SEM1(49-107)) amyloid-forming behavior may be explained by the presence of a structured helix at the SEM1(45-107) N-terminus, which contributes to an increased rate of amyloid formation.
Collapse
Affiliation(s)
- Daria A Osetrina
- NMR Laboratory, Medical Physics Department, Institute of Physics, Kazan Federal University, Kremlevskaya Str., 18, Kazan 420008, Russia
| | - Aleksandra M Kusova
- NMR Laboratory, Medical Physics Department, Institute of Physics, Kazan Federal University, Kremlevskaya Str., 18, Kazan 420008, Russia
- Kazan Institute of Biochemistry and Biophysics, FRC Kazan Scientific Center, Russian Academy of Sciences, Kazan 420111, Russia
| | - Aydar G Bikmullin
- NMR Laboratory, Medical Physics Department, Institute of Physics, Kazan Federal University, Kremlevskaya Str., 18, Kazan 420008, Russia
- Laboratory of Structural Biology, Institute of Fundamental Medicine and Biology, Kazan Federal University, Kazan 420021, Russia
| | - Evelina A Klochkova
- NMR Laboratory, Medical Physics Department, Institute of Physics, Kazan Federal University, Kremlevskaya Str., 18, Kazan 420008, Russia
- Laboratory of Structural Biology, Institute of Fundamental Medicine and Biology, Kazan Federal University, Kazan 420021, Russia
| | - Aydar R Yulmetov
- NMR Laboratory, Medical Physics Department, Institute of Physics, Kazan Federal University, Kremlevskaya Str., 18, Kazan 420008, Russia
| | - Evgenia A Semenova
- NMR Laboratory, Medical Physics Department, Institute of Physics, Kazan Federal University, Kremlevskaya Str., 18, Kazan 420008, Russia
| | - Timur A Mukhametzyanov
- NMR Laboratory, Medical Physics Department, Institute of Physics, Kazan Federal University, Kremlevskaya Str., 18, Kazan 420008, Russia
| | - Konstantin S Usachev
- Laboratory of Structural Biology, Institute of Fundamental Medicine and Biology, Kazan Federal University, Kazan 420021, Russia
- Laboratory for Structural Analysis of Biomacromolecules, Federal Research Center "Kazan Scientific Center of Russian Academy of Sciences", Kazan 420111, Russia
| | - Vladimir V Klochkov
- NMR Laboratory, Medical Physics Department, Institute of Physics, Kazan Federal University, Kremlevskaya Str., 18, Kazan 420008, Russia
| | - Dmitriy S Blokhin
- NMR Laboratory, Medical Physics Department, Institute of Physics, Kazan Federal University, Kremlevskaya Str., 18, Kazan 420008, Russia
| |
Collapse
|
2
|
Muniyappan S, Rayan AXA, Varrieth GT. DTiGNN: Learning drug-target embedding from a heterogeneous biological network based on a two-level attention-based graph neural network. MATHEMATICAL BIOSCIENCES AND ENGINEERING : MBE 2023; 20:9530-9571. [PMID: 37161255 DOI: 10.3934/mbe.2023419] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/11/2023]
Abstract
MOTIVATION In vitro experiment-based drug-target interaction (DTI) exploration demands more human, financial and data resources. In silico approaches have been recommended for predicting DTIs to reduce time and cost. During the drug development process, one can analyze the therapeutic effect of the drug for a particular disease by identifying how the drug binds to the target for treating that disease. Hence, DTI plays a major role in drug discovery. Many computational methods have been developed for DTI prediction. However, the existing methods have limitations in terms of capturing the interactions via multiple semantics between drug and target nodes in a heterogeneous biological network (HBN). METHODS In this paper, we propose a DTiGNN framework for identifying unknown drug-target pairs. The DTiGNN first calculates the similarity between the drug and target from multiple perspectives. Then, the features of drugs and targets from each perspective are learned separately by using a novel method termed an information entropy-based random walk. Next, all of the learned features from different perspectives are integrated into a single drug and target similarity network by using a multi-view convolutional neural network. Using the integrated similarity networks, drug interactions, drug-disease associations, protein interactions and protein-disease association, the HBN is constructed. Next, a novel embedding algorithm called a meta-graph guided graph neural network is used to learn the embedding of drugs and targets. Then, a convolutional neural network is employed to infer new DTIs after balancing the sample using oversampling techniques. RESULTS The DTiGNN is applied to various datasets, and the result shows better performance in terms of the area under receiver operating characteristic curve (AUC) and area under precision-recall curve (AUPR), with scores of 0.98 and 0.99, respectively. There are 23,739 newly predicted DTI pairs in total.
Collapse
Affiliation(s)
- Saranya Muniyappan
- Computer Science and Engineering, CEG Campus, Anna University, Tamil Nadu, India
| | | | | |
Collapse
|
3
|
Das B, Kutsal M, Das R. A geometric deep learning model for display and prediction of potential drug-virus interactions against SARS-CoV-2. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS : AN INTERNATIONAL JOURNAL SPONSORED BY THE CHEMOMETRICS SOCIETY 2022; 229:104640. [PMID: 36042844 PMCID: PMC9400382 DOI: 10.1016/j.chemolab.2022.104640] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/31/2022] [Revised: 08/17/2022] [Accepted: 08/19/2022] [Indexed: 05/04/2023]
Abstract
Although the coronavirus epidemic spread rapidly with the Omicron variant, it lost its lethality rate with the effect of vaccine and immunity. The hospitalization and intense demand decreased. However, there is no definite information about when this disease will end or how dangerous the different variants could be. In addition, it is not possible to end the risk of variants that will continue to circulate among animals in nature. After this stage, drug-virus interactions should be examined in order to be able to prepare against possible new types of viruses and variants and to rapidly-produce drugs or vaccines against possible viruses. Despite experimental methods that are expensive, laborious, and time-consuming, geometric deep learning(GDL) is an alternative method that can be used to make this process faster and cheaper. In this study, we propose a new model based on geometric deep learning for the prediction of drug-virus interaction against COVID-19. First, we use the antiviral drug data in the SMILES molecular structure representation to generate too many features and better describe the structure of chemical species. Then the data is converted into a molecular representation and then into a graphical structure that the GDL model can understand. The node feature vectors are transferred to a different space with the Message Passing Neural Network (MPNN) for the training process to take place. We develop a geometric neural network architecture where the graph embedding values are passed through the fully connected layer and the prediction is actualized. The results indicate that the proposed method outperforms existing methods with 97% accuracy in predicting drug-virus interactions.
Collapse
Affiliation(s)
- Bihter Das
- Department of Software Engineering, Technology Faculty, Firat University, 23119, Elazig, Turkey
| | - Mucahit Kutsal
- Department of Software Engineering, Technology Faculty, Firat University, 23119, Elazig, Turkey
| | - Resul Das
- Department of Software Engineering, Technology Faculty, Firat University, 23119, Elazig, Turkey
| |
Collapse
|
4
|
Saha S, Chatterjee P, Halder AK, Nasipuri M, Basu S, Plewczynski D. ML-DTD: Machine Learning-Based Drug Target Discovery for the Potential Treatment of COVID-19. Vaccines (Basel) 2022; 10:1643. [PMID: 36298508 PMCID: PMC9607653 DOI: 10.3390/vaccines10101643] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2022] [Revised: 09/11/2022] [Accepted: 09/14/2022] [Indexed: 11/05/2022] Open
Abstract
Recent research has highlighted that a large section of druggable protein targets in the Human interactome remains unexplored for various diseases. It might lead to the drug repurposing study and help in the in-silico prediction of new drug-human protein target interactions. The same applies to the current pandemic of COVID-19 disease in global health issues. It is highly desirable to identify potential human drug targets for COVID-19 using a machine learning approach since it saves time and labor compared to traditional experimental methods. Structure-based drug discovery where druggability is determined by molecular docking is only appropriate for the protein whose three-dimensional structures are available. With machine learning algorithms, differentiating relevant features for predicting targets and non-targets can be used for the proteins whose 3-D structures are unavailable. In this research, a Machine Learning-based Drug Target Discovery (ML-DTD) approach is proposed where a machine learning model is initially built up and tested on the curated dataset consisting of COVID-19 human drug targets and non-targets formed by using the Therapeutic Target Database (TTD) and human interactome using several classifiers like XGBBoost Classifier, AdaBoost Classifier, Logistic Regression, Support Vector Classification, Decision Tree Classifier, Random Forest Classifier, Naive Bayes Classifier, and K-Nearest Neighbour Classifier (KNN). In this method, protein features include Gene Set Enrichment Analysis (GSEA) ranking, properties derived from the protein sequence, and encoded protein network centrality-based measures. Among all these, XGBBoost, KNN, and Random Forest models are satisfactory and consistent. This model is further used to predict novel COVID-19 human drug targets, which are further validated by target pathway analysis, the emergence of allied repurposed drugs, and their subsequent docking study.
Collapse
Affiliation(s)
- Sovan Saha
- Department of Computer Science & Engineering, Institute of Engineering & Management, Salt Lake Electronics Complex, Kolkata 700091, India
| | - Piyali Chatterjee
- Department of Computer Science & Engineering, Netaji Subhash Engineering College, Techno City, Panchpota, Garia, Kolkata 700152, India
| | - Anup Kumar Halder
- Faculty of Mathematics and Information Sciences, Warsaw University of Technology, Koszykowa 75, 00-662 Warsaw, Poland
- Laboratory of Functional and Structural Genomics, Centre of New Technologies, University of Warsaw, Banacha 2c Street, 02-097 Warsaw, Poland
| | - Mita Nasipuri
- Department of Computer Science & Engineering, Jadavpur University, 188, Raja S.C. Mallick Road, Kolkata 700032, India
| | - Subhadip Basu
- Department of Computer Science & Engineering, Jadavpur University, 188, Raja S.C. Mallick Road, Kolkata 700032, India
| | - Dariusz Plewczynski
- Faculty of Mathematics and Information Sciences, Warsaw University of Technology, Koszykowa 75, 00-662 Warsaw, Poland
- Laboratory of Functional and Structural Genomics, Centre of New Technologies, University of Warsaw, Banacha 2c Street, 02-097 Warsaw, Poland
| |
Collapse
|
5
|
Xuan P, Zhang X, Zhang Y, Hu K, Nakaguchi T, Zhang T. multi-type neighbors enhanced global topology and pairwise attribute learning for drug-protein interaction prediction. Brief Bioinform 2022; 23:6581435. [PMID: 35514190 DOI: 10.1093/bib/bbac120] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2022] [Revised: 03/07/2022] [Accepted: 03/15/2022] [Indexed: 11/13/2022] Open
Abstract
MOTIVATION Accurate identification of proteins interacted with drugs helps reduce the time and cost of drug development. Most of previous methods focused on integrating multisource data about drugs and proteins for predicting drug-target interactions (DTIs). There are both similarity connection and interaction connection between two drugs, and these connections reflect their relationships from different perspectives. Similarly, two proteins have various connections from multiple perspectives. However, most of previous methods failed to deeply integrate these connections. In addition, multiple drug-protein heterogeneous networks can be constructed based on multiple kinds of connections. The diverse topological structures of these networks are still not exploited completely. RESULTS We propose a novel model to extract and integrate multi-type neighbor topology information, diverse similarities and interactions related to drugs and proteins. Firstly, multiple drug-protein heterogeneous networks are constructed according to multiple kinds of connections among drugs and those among proteins. The multi-type neighbor node sequences of a drug node (or a protein node) are formed by random walks on each network and they reflect the hidden neighbor topological structure of the node. Secondly, a module based on graph neural network (GNN) is proposed to learn the multi-type neighbor topologies of each node. We propose attention mechanisms at neighbor node level and at neighbor type level to learn more informative neighbor nodes and neighbor types. A network-level attention is also designed to enhance the context dependency among multiple neighbor topologies of a pair of drug and protein nodes. Finally, the attribute embedding of the drug-protein pair is formulated by a proposed embedding strategy, and the embedding covers the similarities and interactions about the pair. A module based on three-dimensional convolutional neural networks (CNN) is constructed to deeply integrate pairwise attributes. Extensive experiments have been performed and the results indicate GCDTI outperforms several state-of-the-art prediction methods. The recall rate estimation over the top-ranked candidates and case studies on 5 drugs further demonstrate GCDTI's ability in discovering potential drug-protein interactions.
Collapse
Affiliation(s)
- Ping Xuan
- School of Computer Science and Technology, Heilongjiang University, Harbin 150080, China.,School of Computer Science, Shaanxi Normal University, Xi'an 710062, China
| | - Xiaowen Zhang
- School of Computer Science and Technology, Heilongjiang University, Harbin 150080, China
| | - Yu Zhang
- School of Computer Science and Technology, Heilongjiang University, Harbin 150080, China
| | - Kaimiao Hu
- School of Computer Science and Technology, Heilongjiang University, Harbin 150080, China
| | - Toshiya Nakaguchi
- Center for Frontier Medical Engineering, Chiba University, Chiba 2638522, Japan
| | - Tiangang Zhang
- School of Mathematical Science, Heilongjiang University, Harbin 150080, China
| |
Collapse
|
6
|
Deep Link-Prediction Based on the Local Structure of Bipartite Networks. ENTROPY 2022; 24:e24050610. [PMID: 35626496 PMCID: PMC9140406 DOI: 10.3390/e24050610] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/29/2022] [Revised: 04/21/2022] [Accepted: 04/26/2022] [Indexed: 01/04/2023]
Abstract
Link prediction based on bipartite networks can not only mine hidden relationships between different types of nodes, but also reveal the inherent law of network evolution. Existing bipartite network link prediction is mainly based on the global structure that cannot analyze the role of the local structure in link prediction. To tackle this problem, this paper proposes a deep link-prediction (DLP) method by leveraging the local structure of bipartite networks. The method first extracts the local structure between target nodes and observes structural information between nodes from a local perspective. Then, representation learning of the local structure is performed on the basis of the graph neural network to extract latent features between target nodes. Lastly, a deep-link prediction model is trained on the basis of latent features between target nodes to achieve link prediction. Experimental results on five datasets showed that DLP achieved significant improvement over existing state-of-the-art link prediction methods. In addition, this paper analyzes the relationship between local structure and link prediction, confirming the effectiveness of a local structure in link prediction.
Collapse
|
7
|
El-Behery H, Attia AF, El-Feshawy N, Torkey H. Efficient machine learning model for predicting drug-target interactions with case study for Covid-19. Comput Biol Chem 2021; 93:107536. [PMID: 34271420 PMCID: PMC8256690 DOI: 10.1016/j.compbiolchem.2021.107536] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2020] [Revised: 06/23/2021] [Accepted: 06/24/2021] [Indexed: 11/30/2022]
Abstract
BACKGROUND Discover possible Drug Target Interactions (DTIs) is a decisive step in the detection of the effects of drugs as well as drug repositioning. There is a strong incentive to develop effective computational methods that can effectively predict potential DTIs, as traditional DTI laboratory experiments are expensive, time-consuming, and labor-intensive. Some technologies have been developed for this purpose, however large numbers of interactions have not yet been detected, the accuracy of their prediction still low, and protein sequences and structured data are rarely used together in the prediction process. METHODS This paper presents DTIs prediction model that takes advantage of the special capacity of the structured form of proteins and drugs. Our model obtains features from protein amino-acid sequences using physical and chemical properties, and from drugs smiles (Simplified Molecular Input Line Entry System) strings using encoding techniques. Comparing the proposed model with different existing methods under K-fold cross validation, empirical results show that our model based on ensemble learning algorithms for DTI prediction provide more accurate results from both structures and features data. RESULTS The proposed model is applied on two datasets:Benchmark (feature only) datasets and DrugBank (Structure data) datasets. Experimental results obtained by Light-Boost and ExtraTree using structures and feature data results in 98 % accuracy and 0.97 f-score comparing to 94 % and 0.92 achieved by the existing methods. Moreover, our model can successfully predict more yet undiscovered interactions, and hence can be used as a practical tool to drug repositioning. A case study of applying our prediction model on the proteins that are known to be affected by Corona viruses in order to predict the possible interactions among these proteins and existing drugs is performed. Also, our model is applied on Covid-19 related drugs announced on DrugBank. The results show that some drugs like DB00691 and DB05203 are predicted with 100 % accuracy to interact with ACE2 protein. This protein is a self-membrane protein that enables Covid-19 infection. Hence, our model can be used as an effective tool in drug reposition to predict possible drug treatments for Covid-19.
Collapse
Affiliation(s)
- Heba El-Behery
- Department of Computer Science and Engineering, Faculty of Engineering, Kafrelsheikh University, Kafr_El_Sheikh, Egypt.
| | - Abdel-Fattah Attia
- Department of Computer Science and Engineering, Faculty of Engineering, Kafrelsheikh University, Kafr_El_Sheikh, Egypt.
| | - Nawal El-Feshawy
- Computer Science & Engineering Department, Faculty of Electronic Engineering, Menoufia University, Menouf, Egypt.
| | - Hanaa Torkey
- Computer Science & Engineering Department, Faculty of Electronic Engineering, Menoufia University, Menouf, Egypt.
| |
Collapse
|