1
|
Vijayakumar S, Kumar LL, Borkotoky S, Murali A. The Application of MD Simulation to Lead Identification, Vaccine Design, and Structural Studies in Combat against Leishmaniasis - A Review. Mini Rev Med Chem 2024; 24:1089-1111. [PMID: 37680156 DOI: 10.2174/1389557523666230901105231] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2023] [Revised: 06/07/2023] [Accepted: 07/14/2023] [Indexed: 09/09/2023]
Abstract
Drug discovery, vaccine design, and protein interaction studies are rapidly moving toward the routine use of molecular dynamics simulations (MDS) and related methods. As a result of MDS, it is possible to gain insights into the dynamics and function of identified drug targets, antibody-antigen interactions, potential vaccine candidates, intrinsically disordered proteins, and essential proteins. The MDS appears to be used in all possible ways in combating diseases such as cancer, however, it has not been well documented as to how effectively it is applied to infectious diseases such as Leishmaniasis. As a result, this review aims to survey the application of MDS in combating leishmaniasis. We have systematically collected articles that illustrate the implementation of MDS in drug discovery, vaccine development, and structural studies related to Leishmaniasis. Of all the articles reviewed, we identified that only a limited number of studies focused on the development of vaccines against Leishmaniasis through MDS. Also, the PCA and FEL studies were not carried out in most of the studies. These two were globally accepted utilities to understand the conformational changes and hence it is recommended that this analysis should be taken up in similar approaches in the future.
Collapse
Affiliation(s)
| | | | - Subhomoi Borkotoky
- Department of Biotechnology, Invertis University, Bareilly, Uttar Pradesh, India
| | - Ayaluru Murali
- Department of Bioinformatics, Pondicherry University, Puducherry, India
| |
Collapse
|
2
|
MOZART, a QSAR Multi-Target Web-Based Tool to Predict Multiple Drug-Enzyme Interactions. Molecules 2023; 28:molecules28031182. [PMID: 36770857 PMCID: PMC9921108 DOI: 10.3390/molecules28031182] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2022] [Revised: 01/09/2023] [Accepted: 01/16/2023] [Indexed: 01/27/2023] Open
Abstract
Developing models able to predict interactions between drugs and enzymes is a primary goal in computational biology since these models may be used for predicting both new active drugs and the interactions between known drugs on untested targets. With the compilation of a large dataset of drug-enzyme pairs (62,524), we recognized a unique opportunity to attempt to build a novel multi-target machine learning (MTML) quantitative structure-activity relationship (QSAR) model for probing interactions among different drugs and enzyme targets. To this end, this paper presents an MTML-QSAR model based on using the features of topological drugs together with the artificial neural network (ANN) multi-layer perceptron (MLP). Validation of the final best model found was carried out by internal cross-validation statistics and other relevant diagnostic statistical parameters. The overall accuracy of the derived model was found to be higher than 96%. Finally, to maximize the diffusion of this model, a public and accessible tool has been developed to allow users to perform their own predictions. The developed web-based tool is public accessible and can be downloaded as free open-source software.
Collapse
|
3
|
Akbar S, Hayat M, Iqbal M, Jan MA. iACP-GAEnsC: Evolutionary genetic algorithm based ensemble classification of anticancer peptides by utilizing hybrid feature space. Artif Intell Med 2017; 79:62-70. [PMID: 28655440 DOI: 10.1016/j.artmed.2017.06.008] [Citation(s) in RCA: 96] [Impact Index Per Article: 13.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2017] [Revised: 06/12/2017] [Accepted: 06/16/2017] [Indexed: 01/10/2023]
Abstract
Cancer is a fatal disease, responsible for one-quarter of all deaths in developed countries. Traditional anticancer therapies such as, chemotherapy and radiation, are highly expensive, susceptible to errors and ineffective techniques. These conventional techniques induce severe side-effects on human cells. Due to perilous impact of cancer, the development of an accurate and highly efficient intelligent computational model is desirable for identification of anticancer peptides. In this paper, evolutionary intelligent genetic algorithm-based ensemble model, 'iACP-GAEnsC', is proposed for the identification of anticancer peptides. In this model, the protein sequences are formulated, using three different discrete feature representation methods, i.e., amphiphilic Pseudo amino acid composition, g-Gap dipeptide composition, and Reduce amino acid alphabet composition. The performance of the extracted feature spaces are investigated separately and then merged to exhibit the significance of hybridization. In addition, the predicted results of individual classifiers are combined together, using optimized genetic algorithm and simple majority technique in order to enhance the true classification rate. It is observed that genetic algorithm-based ensemble classification outperforms than individual classifiers as well as simple majority voting base ensemble. The performance of genetic algorithm-based ensemble classification is highly reported on hybrid feature space, with an accuracy of 96.45%. In comparison to the existing techniques, 'iACP-GAEnsC' model has achieved remarkable improvement in terms of various performance metrics. Based on the simulation results, it is observed that 'iACP-GAEnsC' model might be a leading tool in the field of drug design and proteomics for researchers.
Collapse
Affiliation(s)
- Shahid Akbar
- Department of Computer Science, Abdul Wali Khan University Mardan, KP 23200, Pakistan.
| | - Maqsood Hayat
- Department of Computer Science, Abdul Wali Khan University Mardan, KP 23200, Pakistan.
| | - Muhammad Iqbal
- Department of Computer Science, Abdul Wali Khan University Mardan, KP 23200, Pakistan.
| | - Mian Ahmad Jan
- Department of Computer Science, Abdul Wali Khan University Mardan, KP 23200, Pakistan.
| |
Collapse
|
4
|
Bielińska-Wąż D, Wąż P. Spectral-dynamic representation of DNA sequences. J Biomed Inform 2017; 72:1-7. [PMID: 28587890 DOI: 10.1016/j.jbi.2017.06.001] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2017] [Revised: 05/03/2017] [Accepted: 06/01/2017] [Indexed: 11/25/2022]
Abstract
A graphical representation of DNA sequences in which the distribution of a particular base B=A,C,G,T is represented by a set of discrete lines has been formulated. The methodology of this approach has been borrowed from two areas of physics: spectroscopy and dynamics. Consequently, the set of discrete lines is referred to as the B-spectrum. Next, the B-spectrum is transformed to a rigid body composed of material points. In this way a dynamic representation of the DNA sequence has been obtained. The centers of mass of these rigid bodies, divided by their moments of inertia, have been taken as the descriptors of the spectra and, thus, of the DNA sequences. The performance of this method on a standard set of data commonly applied by authors introducing new approaches to bioinformatics (the first exons of β-globin genes of different species) proved to be very good.
Collapse
Affiliation(s)
- Dorota Bielińska-Wąż
- Department of Radiological Informatics and Statistics, Medical University of Gdańsk, Tuwima 15, 80-210 Gdańsk, Poland.
| | - Piotr Wąż
- Department of Nuclear Medicine, Medical University of Gdańsk, Tuwima 15, 80-210 Gdańsk, Poland.
| |
Collapse
|
5
|
20D-dynamic representation of protein sequences. Genomics 2016; 107:16-23. [DOI: 10.1016/j.ygeno.2015.12.003] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2015] [Revised: 12/10/2015] [Accepted: 12/14/2015] [Indexed: 11/23/2022]
|
6
|
Yao Y, Yan S, Han J, Dai Q, He PA. A novel descriptor of protein sequences and its application. J Theor Biol 2014; 347:109-17. [PMID: 24412564 DOI: 10.1016/j.jtbi.2014.01.001] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2013] [Revised: 11/10/2013] [Accepted: 01/01/2014] [Indexed: 02/05/2023]
Abstract
In this paper, a dynamic 3-D graphical representation of protein sequences is introduced based on three physical-chemical properties of amino acids. The coordinates of the graph have direct biological significance, which could reflect the innate structure of the proteins. The information of principal moments of inertia and range of axis coordinate are extracted as a novel mixed descriptor and proposed for the comparison of protein primary sequences. Meanwhile, the Euclidean distance of the normalized descriptor vectors which avoid the influence of the difference in length of protein sequences under consideration is employed as a quantitative measurement of the similarity of proteins. Finally, we take the nine ND5 (NADH dehydrogenase subunit 5) proteins for example and illustrate the effectiveness of our approach.
Collapse
Affiliation(s)
- Yuhua Yao
- College of Life Sciences, Zhejiang Sci-Tech University, Hangzhou 310018, People's Republic of China.
| | - Shoujiang Yan
- College of Life Sciences, Zhejiang Sci-Tech University, Hangzhou 310018, People's Republic of China
| | - Jianning Han
- College of Sciences, Zhejiang Sci-Tech University, Hangzhou 310018, People's Republic of China
| | - Qi Dai
- College of Life Sciences, Zhejiang Sci-Tech University, Hangzhou 310018, People's Republic of China
| | - Ping-an He
- College of Sciences, Zhejiang Sci-Tech University, Hangzhou 310018, People's Republic of China
| |
Collapse
|
7
|
Ziarek JJ, Liu Y, Smith E, Zhang G, Peterson FC, Chen J, Yu Y, Chen Y, Volkman BF, Li R. Fragment-based optimization of small molecule CXCL12 inhibitors for antagonizing the CXCL12/CXCR4 interaction. Curr Top Med Chem 2013; 12:2727-40. [PMID: 23368099 DOI: 10.2174/1568026611212240003] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2012] [Revised: 10/08/2012] [Accepted: 11/03/2012] [Indexed: 12/21/2022]
Abstract
The chemokine CXCL12 and its G protein-coupled receptor (GPCR) CXCR4 are high-priority clinical targets because of their involvement in metastatic cancers (also implicated in autoimmune disease and cardiovascular disease). Because chemokines interact with two distinct sites to bind and activate their receptors, both the GPCRs and chemokines are potential targets for small molecule inhibition. A number of chemokines have been validated as targets for drug development, but virtually all drug discovery efforts focus on the GPCRs. However, all CXCR4 receptor antagonists with the exception of MSX-122 have failed in clinical trials due to unmanageable toxicities, emphasizing the need for alternative strategies to interfere with CXCL12/CXCR4-guided metastatic homing. Although targeting the relatively featureless surface of CXCL12 was presumed to be challenging, focusing efforts at the sulfotyrosine (sY) binding pockets proved successful for procuring initial hits. Using a hybrid structure-based in silico/NMR screening strategy, we recently identified a ligand that occludes the receptor recognition site. From this initial hit, we designed a small fragment library containing only nine tetrazole derivatives using a fragment-based and bioisostere approach to target the sY binding sites of CXCL12. Compound binding modes and affinities were studied by 2D NMR spectroscopy, X-ray crystallography, molecular docking and cell-based functional assays. Our results demonstrate that the sY binding sites are conducive to the development of high affinity inhibitors with better ligand efficiency (LE) than typical protein-protein interaction inhibitors (LE ≤ 0.24). Our novel tetrazole-based fragment 18 was identified to bind the sY21 site with a K(d) of 24 μM (LE = 0.30). Optimization of 18 yielded compound 25 which specifically inhibits CXCL12-induced migration with an improvement in potency over the initial hit 9. The fragment from this library that exhibited the highest affinity and ligand efficiency (11: K(d) = 13 μM, LE = 0.33) may serve as a starting point for development of inhibitors targeting the sY12 site.
Collapse
Affiliation(s)
- Joshua J Ziarek
- Department of Biochemistry, Medical College of Wisconsin, 8701 Watertown Plank Road, Milwaukee, WI 53226, USA
| | | | | | | | | | | | | | | | | | | |
Collapse
|
8
|
Enhanced QSAR model performance by integrating structural and gene expression information. Molecules 2013; 18:10789-801. [PMID: 24008242 PMCID: PMC6270197 DOI: 10.3390/molecules180910789] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2013] [Revised: 07/20/2013] [Accepted: 07/26/2013] [Indexed: 11/29/2022] Open
Abstract
Despite decades of intensive research and a number of demonstrable successes, quantitative structure-activity relationship (QSAR) models still fail to yield predictions with reasonable accuracy in some circumstances, especially when the QSAR paradox occurs. In this study, to avoid the QSAR paradox, we proposed a novel integrated approach to improve the model performance through using both structural and biological information from compounds. As a proof-of-concept, the integrated models were built on a toxicological dataset to predict non-genotoxic carcinogenicity of compounds, using not only the conventional molecular descriptors but also expression profiles of significant genes selected from microarray data. For test set data, our results demonstrated that the prediction accuracy of QSAR model was dramatically increased from 0.57 to 0.67 with incorporation of expression data of just one selected signature gene. Our successful integration of biological information into classic QSAR model provided a new insight and methodology for building predictive models especially when QSAR paradox occurred.
Collapse
|
9
|
Chen W, Feng P, Lin H. Prediction of ketoacyl synthase family using reduced amino acid alphabets. J Ind Microbiol Biotechnol 2011; 39:579-84. [PMID: 22042516 DOI: 10.1007/s10295-011-1047-z] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2011] [Accepted: 10/04/2011] [Indexed: 11/28/2022]
Abstract
Ketoacyl synthases are enzymes involved in fatty acid synthesis and can be classified into five families based on primary sequence similarity. Different families have different catalytic mechanisms. Developing cost-effective computational models to identify the family of ketoacyl synthases will be helpful for enzyme engineering and in knowing individual enzymes' catalytic mechanisms. In this work, a support vector machine-based method was developed to predict ketoacyl synthase family using the n-peptide composition of reduced amino acid alphabets. In jackknife cross-validation, the model based on the 2-peptide composition of a reduced amino acid alphabet of size 13 yielded the best overall accuracy of 96.44% with average accuracy of 93.36%, which is superior to other state-of-the-art methods. This result suggests that the information provided by n-peptide compositions of reduced amino acid alphabets provides efficient means for enzyme family classification and that the proposed model can be efficiently used for ketoacyl synthase family annotation.
Collapse
Affiliation(s)
- Wei Chen
- Department of Physics, College of Sciences, Center for Genomics and Computational Biology, Hebei United University, Tangshan 063000, China.
| | | | | |
Collapse
|
10
|
On the information expressed in enzyme structure: more lessons from ribonuclease A. Mol Divers 2011; 15:769-79. [PMID: 21347658 DOI: 10.1007/s11030-011-9307-4] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2010] [Accepted: 02/05/2011] [Indexed: 01/17/2023]
Abstract
Brownian computations were directed at Ribonuclease A (RNase A) and variants in folded states so as to quantify information of the statistical type at the atom/covalent bond level. This advanced the research reported in this journal last year on the information properties of enzyme primary structure. Brownian computation data are illustrated for a sixteen-member library. The results identify signature traits that distinguish the folded wild type (WT) molecule from variants. The distinctions are explainable in terms of correlated information and dispersion energy. The Brownian tools used for this study can be directed at other protein families (e.g., kinases, isomerases, etc.) in rapid screening, QSAR, and design applications.
Collapse
|
11
|
Wang JF, Chou KC. Insights from studying the mutation-induced allostery in the M2 proton channel by molecular dynamics. Protein Eng Des Sel 2010; 23:663-6. [PMID: 20571121 DOI: 10.1093/protein/gzq040] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
As an essential component of the viral envelope, M2 proton channel plays a central role in the virus replications and has been a key target for drug design against the influenza A viruses. The adamantadine-based drugs, such as amantadine and rimantadine, were developed for blocking the channel so as to suppress the replication of viruses. However, patients, especially those infected by the H1N1 influenza A viruses, are increasingly suffering from the drug-resistance problem. According to the findings revealed recently by the high-resolution NMR studies, the drug-resistance problem is due to the structural allostery caused by some mutations, such as L26F, V27A and S31N, in the four-helix bundle of the channel. In this study, we are to address this problem from a dynamic point of view by conducting molecular dynamics (MD) simulations on both the open and the closed states of the wild-type (WT) and S31N mutant M2 channels in the presence of rimantadine. It was observed from the MD simulated structures that the mutant channel could still keep open even if binding with rimantadine, but the WT channel could not. This was because the mutation would destabilize the helix bundle and trigger it from a compact packing state to a loose one. It is anticipated that the findings may provide useful insights for in-depth understanding the action mechanism of the M2 channel and developing more-effective drugs against influenza A viruses.
Collapse
Affiliation(s)
- Jing-Fang Wang
- Shanghai Center for Systems Biomedicine, Shanghai Jiaotong University, 800 Dongchuan, Shanghai 200240, China.
| | | |
Collapse
|
12
|
On the information expressed in enzyme primary structure: lessons from Ribonuclease A. Mol Divers 2009; 14:673-86. [PMID: 19921453 DOI: 10.1007/s11030-009-9211-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2009] [Accepted: 10/24/2009] [Indexed: 10/20/2022]
Abstract
The information expressed in an enzyme's primary structure is investigated. Brownian computations are directed at Ribonuclease A (RNase A) so as to quantify the information at the atom/covalent bond level. The information content and distribution are crucial because the primary structure underpins the molecule's chemical functions. Brownian computation data are illustrated for the native protein, mutants, and sequence isomers. The results identify signature features of the active protein on new information grounds. The same tools offer rapid screening of proteins and polypeptides whereby several examples are illustrated.
Collapse
|