1
|
Genome mutations of the Turkish strain Leishmania infantum_TR01. INFECTION GENETICS AND EVOLUTION 2021; 92:104907. [PMID: 33971306 DOI: 10.1016/j.meegid.2021.104907] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/23/2021] [Revised: 04/28/2021] [Accepted: 05/07/2021] [Indexed: 11/22/2022]
Abstract
PURPOSE Today, almost 15 years have passed since the whole genome sequence (WGS) of a Leishmania parasite was completed for the first time. However, information on the genetics of these parasites remains to be elucidated. Genome-based studies may contribute to control strategies for leishmaniasis. The increase in genetically based studies, particularly whole-genome sequencing studies on Leishmania, will contribute to the control of leishmaniasis, which is a common and neglected disease worldwide. Our previous study obtained the Leishmania infantum_TR01 (Lin_TR01) genome sequence (Guldemir et al., 2020). In the present study, we focused on the mutations detected in the genome and aimed to investigate the effects of these mutations. METHODS In our previous study, the whole-genome sequence of the L. infantum_TR01 strain was obtained (Guldemir et al., 2020). In the present study, 3153 polymorphisms were detected in bioinformatics analysis performed on the Geneious 11.0.5. (www.geneious.com) platform. Herein, the L. infantum JPCM5 strain was used as the reference genome for genome mapping. Polymorphic regions were determined using the Find Variations/SNPs program on the Geneious platform. We further analyzed these polymorphisms detected in the previous study. Additionally, a literature review was performed by searching the PubMed database for proteins with polymorphisms. RESULTS In our previous study (Guldemir et al., 2020), the genomic DNA sequence was submitted to the NCBI GenBank (www.ncbi.nlm.nih.gov) database and registered under the name Leishmania infantum_TR01 (Lin_TR01) and project accession number PRJNA437593. As a result of the annotation of the genome, 3153 polymorphisms were identified. In this study, 166 protein-coding polymorphisms were found among the 3153 polymorphisms, affecting 63 different proteins. Fourteen of them were studied, and the remaining 49 proteins were not studied. The 14 proteins examined in terms of the mutations detected in this study were related to virulence (n = 5), vaccine candidates (n = 2), diagnosis/typing (n = 4), drug resistance (n = 2), drug targets (n = 3) and vital function (n = 1). CONCLUSION As mentioned previously, the acquisition of the Lin_TR01 genome was described in our previous study (Guldemir et al., 2020). The present meta-analytical study is the first comparison report of whole-genome sequence-based polymorphisms between the Turkish strain Leishmania infantum_TR01 and reference Leishmania infantum JPCM5 strain and evaluated polymorphisms and proteins. In this study, we focused on the mutations detected in the genome, and the effects of these mutations were investigated and evaluated together with the current literature. In our previous study, a high-quality WGS of Leishmania infantum was successfully obtained for the first time in Turkey (1). In this study, the comparison of both genomes will contribute to providing the scientific community with a solid infrastructure for postgenomic investigations of the parasite.
Collapse
|
2
|
Wu J, Mai G, Deng B, Younseo J, Du D, Chen F, Ma Q. Quantitative Structure-activity Relationship of Acetylcholinesterase Inhibitors based on mRMR Combined with Support Vector Regression. LETT ORG CHEM 2019. [DOI: 10.2174/1570178615666181008125341] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
In this work, support vector regression (SVR), an effective machine learning method, proposed by Vapnik was applied to establish QSAR model for a series of AchEI. Fourteen descriptors were selected for constructing the SVR mode by using mRMR-Forward feature selection method. The parameters (ε, C) were adjusted by leave-one-out cross validation (LOOCV) method which was used to judge the predictive power of different models. After optimization, one optimal SVR-QSAR model was attained, and the mean relative errors (MRE) of LOOCV by using SVR is 1.72%. As a result, LogP negatively affected the activity, Refractivity and Water Accessible Surface Area positively affected the activity.
Collapse
Affiliation(s)
- Jiaxiang Wu
- Shanghai Key Laboratory of Bio-Crops, College of Life Science, Shanghai University, Shanghai, China
| | - Guozhao Mai
- Department of Rehabilitation Medicine, The People's Hospital of Heshan, Guangdong, China
| | - Bowen Deng
- Shanghai Key Laboratory of Bio-Crops, College of Life Science, Shanghai University, Shanghai, China
| | - Jeong Younseo
- Center for Bioinformatics and Computational Biology, Pai Chai University, Daejeon, South Korea
| | - Dongsu Du
- Shanghai Key Laboratory of Bio-Crops, College of Life Science, Shanghai University, Shanghai, China
| | - Fuxue Chen
- Shanghai Key Laboratory of Bio-Crops, College of Life Science, Shanghai University, Shanghai, China
| | - Qiaorong Ma
- Department of Clinical Laboratory, Minzu Hospital of Guangxi Zhuang Autonomous Region, Affiliated Minzu Hospital of Guangxi Medical University, Nanning, Guangxi, China
| |
Collapse
|
3
|
Wang MY, Liang JW, Olounfeh KM, Sun Q, Zhao N, Meng FH. A Comprehensive In Silico Method to Study the QSTR of the Aconitine Alkaloids for Designing Novel Drugs. Molecules 2018; 23:E2385. [PMID: 30231506 PMCID: PMC6225272 DOI: 10.3390/molecules23092385] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2018] [Revised: 09/11/2018] [Accepted: 09/12/2018] [Indexed: 12/22/2022] Open
Abstract
A combined in silico method was developed to predict potential protein targets that are involved in cardiotoxicity induced by aconitine alkaloids and to study the quantitative structure⁻toxicity relationship (QSTR) of these compounds. For the prediction research, a Protein-Protein Interaction (PPI) network was built from the extraction of useful information about protein interactions connected with aconitine cardiotoxicity, based on nearly a decade of literature and the STRING database. The software Cytoscape and the PharmMapper server were utilized to screen for essential proteins in the constructed network. The Calcium-Calmodulin-Dependent Protein Kinase II alpha (CAMK2A) and gamma (CAMK2G) were identified as potential targets. To obtain a deeper insight on the relationship between the toxicity and the structure of aconitine alkaloids, the present study utilized QSAR models built in Sybyl software that possess internal robustness and external high predictions. The molecular dynamics simulation carried out here have demonstrated that aconitine alkaloids possess binding stability for the receptor CAMK2G. In conclusion, this comprehensive method will serve as a tool for following a structural modification of the aconitine alkaloids and lead to a better insight into the cardiotoxicity induced by the compounds that have similar structures to its derivatives.
Collapse
Affiliation(s)
- Ming-Yang Wang
- School of Pharmacy, China Medical University, Shenyang 110122, Liaoning, China.
| | - Jing-Wei Liang
- School of Pharmacy, China Medical University, Shenyang 110122, Liaoning, China.
| | | | - Qi Sun
- School of Pharmacy, China Medical University, Shenyang 110122, Liaoning, China.
| | - Nan Zhao
- School of Pharmacy, China Medical University, Shenyang 110122, Liaoning, China.
| | - Fan-Hao Meng
- School of Pharmacy, China Medical University, Shenyang 110122, Liaoning, China.
| |
Collapse
|
4
|
Prediction of the aquatic toxicity of aromatic compounds to tetrahymena pyriformis through support vector regression. Oncotarget 2018; 8:49359-49369. [PMID: 28467816 PMCID: PMC5564774 DOI: 10.18632/oncotarget.17210] [Citation(s) in RCA: 39] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2017] [Accepted: 03/30/2017] [Indexed: 01/24/2023] Open
Abstract
Toxicity evaluation is an extremely important process during drug development. It is usually initiated by experiments on animals, which is time-consuming and costly. To speed up such a process, a quantitative structure-activity relationship (QSAR) study was performed to develop a computational model for correlating the structures of 581 aromatic compounds with their aquatic toxicity to tetrahymena pyriformis. A set of 68 molecular descriptors derived solely from the structures of the aromatic compounds were calculated based on Gaussian 03, HyperChem 7.5, and TSAR V3.3. A comprehensive feature selection method, minimum Redundancy Maximum Relevance (mRMR)-genetic algorithm (GA)-support vector regression (SVR) method, was applied to select the best descriptor subset in QSAR analysis. The SVR method was employed to model the toxicity potency from a training set of 500 compounds. Five-fold cross-validation method was used to optimize the parameters of SVR model. The new SVR model was tested on an independent dataset of 81 compounds. Both high internal consistent and external predictive rates were obtained, indicating the SVR model is very promising to become an effective tool for fast detecting the toxicity.
Collapse
|
5
|
Patil RB, Barbosa EG, Sangshetti JN, Sawant SD, Zambre VP. 3D-QSAR with R: A new 3D-QSAR methodology applied to a set of DGAT1 inhibitors [corrected]. Comput Biol Chem 2018; 74:123-131. [PMID: 29602042 DOI: 10.1016/j.compbiolchem.2018.02.021] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2018] [Revised: 02/23/2018] [Accepted: 02/25/2018] [Indexed: 12/21/2022]
Abstract
The rapid advances in computational methods for the drug design have resulted in the accurate predictions of biological activities of ligands with or without the availability of enzyme structures. 3D-QSAR is one of the computational methods used for such purpose. Currently, freely available 3D-QSAR methods suffer the limitations like complex methodologies, difficulty in the analysis of results, applying the statistical methods and validations of models built. Present work describes simple and novel 3D-QSAR methodology, which uses bash scripts LQTA_R_LJ, LQTA_R_QQ and LQTA_R_HB using freely available R statistical program. These scripts then generate Leenard-Jones, Coulomb and Hydrogen bond descriptors. These descriptors provide the steric 3D property, electrostatic property and hydrogen bond formation capacity respectively. These scripts have been tested for the set of DGAT1 inhibitors and results showed that the 3D-QSAR models built have better predictive abilities in terms of R2 0.735, Q2loo 0.635 and R2ext 0.715. The 3D-QSAR model suggested that the substitutions of the alkyl group at the oxadiazolyl ring at the 6th position of the pyrrolo-pyridazine ring is undesirable, on the contrary, substituted phenyl ring at 7th position is responsible for the improved DGAT1 inhibitory activity. The analysis also suggested that 6th position could be substituted with the oxadiazolyl ring or analogous heterocyclic rings, where the 3rd position of such heterocyclic rings substituted with rigid hydrophobic substitute can improve DGAT1 activity.
Collapse
Affiliation(s)
- Rajesh B Patil
- Department of Pharmaceutical Chemistry, Sinhgad Technical Education Society's, Smt. Kashibai Navale College of Pharmacy, Pune-Saswad Road, Kondhwa (Bk.), Pune, 411048, Maharashtra, India.
| | - Euzebio G Barbosa
- Chemistry Institute, University of Campinas (UNICAMP), POB 6154, Campinas, SP, 13083-970, Brazil
| | - Jaiprakash N Sangshetti
- Department of Pharmaceutical Chemistry, Y. B. Chavan College of Pharmacy, Dr. Rafiq Zakaria Campus, Aurangabad, 431001, Maharashtra, India
| | - Sanjay D Sawant
- Department of Pharmaceutical Chemistry, Sinhgad Technical Education Society's, Smt. Kashibai Navale College of Pharmacy, Pune-Saswad Road, Kondhwa (Bk.), Pune, 411048, Maharashtra, India
| | - Vishal P Zambre
- Department of Pharmaceutical Chemistry, Sinhgad Technical Education Society's, Smt. Kashibai Navale College of Pharmacy, Pune-Saswad Road, Kondhwa (Bk.), Pune, 411048, Maharashtra, India
| |
Collapse
|
6
|
Jalilvand A, Akbari B, Zare Mirakabad F. S-FLN: A sequence-based hierarchical approach for functional linkage network construction. J Theor Biol 2018; 437:149-162. [PMID: 29080781 DOI: 10.1016/j.jtbi.2017.10.021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2016] [Revised: 07/27/2017] [Accepted: 10/18/2017] [Indexed: 11/24/2022]
Abstract
The functional linkage network (FLN) construction is a primary and important step in drug discovery and disease gene prioritization methods. In order to construct FLN, several methods have been introduced based on integration of various biological data. Although, there are impressive ideas behind these methods, they suffer from low quality of the biological data. In this paper, a hierarchical sequence-based approach is proposed to construct FLN. The proposed approach, denoted as S-FLN (Sequence-based Functional Linkage Network), uses the sequence of proteins as the primary data in three main steps. Firstly, the physicochemical properties of amino-acids are employed to describe the functionality of proteins. As the sequence of proteins is a more comprehensive and accurate primary data, more reliable relations are achieved. Secondly, seven different descriptor methods are used to extract feature vectors from the proteins sequences. Advantage of different descriptor methods lead to obtain diverse ensemble learners in the next step. Finally, a two-layer ensemble learning structure is proposed to calculated the score of protein pairs. The proposed approach has been evaluated using two biological datasets, S.Cerevisiae and H.Pylori, and resulted in 93.9% and 91.15% precision rates, respectively. The results of various experiments indicate the efficiency and validity of the proposed approach.
Collapse
Affiliation(s)
- A Jalilvand
- Department of Electronic and computer engineering,Tarbiat Modares University, Tehran, Iran
| | - B Akbari
- Department of Electronic and computer engineering,Tarbiat Modares University, Tehran, Iran.
| | - F Zare Mirakabad
- Department of Mathematics and Computer Science, Amirkabir University of Technology, Tehran, Iran
| |
Collapse
|
7
|
Bielińska-Wąż D, Wąż P. Spectral-dynamic representation of DNA sequences. J Biomed Inform 2017; 72:1-7. [PMID: 28587890 DOI: 10.1016/j.jbi.2017.06.001] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2017] [Revised: 05/03/2017] [Accepted: 06/01/2017] [Indexed: 11/25/2022]
Abstract
A graphical representation of DNA sequences in which the distribution of a particular base B=A,C,G,T is represented by a set of discrete lines has been formulated. The methodology of this approach has been borrowed from two areas of physics: spectroscopy and dynamics. Consequently, the set of discrete lines is referred to as the B-spectrum. Next, the B-spectrum is transformed to a rigid body composed of material points. In this way a dynamic representation of the DNA sequence has been obtained. The centers of mass of these rigid bodies, divided by their moments of inertia, have been taken as the descriptors of the spectra and, thus, of the DNA sequences. The performance of this method on a standard set of data commonly applied by authors introducing new approaches to bioinformatics (the first exons of β-globin genes of different species) proved to be very good.
Collapse
Affiliation(s)
- Dorota Bielińska-Wąż
- Department of Radiological Informatics and Statistics, Medical University of Gdańsk, Tuwima 15, 80-210 Gdańsk, Poland.
| | - Piotr Wąż
- Department of Nuclear Medicine, Medical University of Gdańsk, Tuwima 15, 80-210 Gdańsk, Poland.
| |
Collapse
|
8
|
Thillainayagam M, Anbarasu A, Ramaiah S. Comparative molecular field analysis and molecular docking studies on novel aryl chalcone derivatives against an important drug target cysteine protease in Plasmodium falciparum. J Theor Biol 2016; 403:110-128. [DOI: 10.1016/j.jtbi.2016.05.019] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2016] [Revised: 05/03/2016] [Accepted: 05/10/2016] [Indexed: 01/08/2023]
|
9
|
20D-dynamic representation of protein sequences. Genomics 2016; 107:16-23. [DOI: 10.1016/j.ygeno.2015.12.003] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2015] [Revised: 12/10/2015] [Accepted: 12/14/2015] [Indexed: 11/23/2022]
|
10
|
Studies of binding interactions between Dufulin and southern rice black-streaked dwarf virus P9-1. Bioorg Med Chem 2015; 23:3629-37. [DOI: 10.1016/j.bmc.2015.04.008] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2015] [Revised: 04/03/2015] [Accepted: 04/04/2015] [Indexed: 01/08/2023]
|
11
|
Fatemi MH, Heidari A, Gharaghani S. QSAR prediction of HIV-1 protease inhibitory activities using docking derived molecular descriptors. J Theor Biol 2015; 369:13-22. [PMID: 25600056 DOI: 10.1016/j.jtbi.2015.01.008] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2014] [Revised: 01/10/2015] [Accepted: 01/12/2015] [Indexed: 01/30/2023]
Abstract
In this study, application of a new hybrid docking-quantitative structure activity relationship (QSAR) methodology to model and predict the HIV-1 protease inhibitory activities of a series of newly synthesized chemicals is reported. This hybrid docking-QSAR approach can provide valuable information about the most important chemical and structural features of the ligands that affect their inhibitory activities. Docking studies were used to find the actual conformations of chemicals in active site of HIV-1 protease. Then the molecular descriptors were calculated from these conformations. Multiple linear regression (MLR) and least square support vector machine (LS-SVM) were used as QSAR models, respectively. The obtained results reveal that statistical parameters of the LS-SVM model are better than the MLR model, which indicate that there are some non-linear relations between selected molecular descriptors and anti-HIV activities of interested chemicals. The correlation coefficient (R), root mean square error (RMSE) and average absolute error (AAE) for LS-SVM are: R=0.988, RMSE=0.207 and AAE=0.145 for the training set, and R=0.965, RMSE=0.403 and AAE=0.338 for the test set. Leave one out cross validation test was used for assessment of the predictive power and validity of models which led to cross-validation correlation coefficient QUOTE of 0.864 and 0.850 and standardized predicted relative error sum of squares (SPRESS) of 0.553 and 0.581 for LS-SVM and MLR models, respectively.
Collapse
Affiliation(s)
- Mohammad H Fatemi
- Chemometrics Laboratory, Faculty of Chemistry, University of Mazandaran, Babolsar 47416-95447, Iran.
| | - Afsane Heidari
- Chemometrics Laboratory, Faculty of Chemistry, University of Mazandaran, Babolsar 47416-95447, Iran
| | - Sajjad Gharaghani
- Department of Bioinformatics, Laboratory of Chemoinformatics, Institute of Biochemistry and Biophysics, University of Tehran, Tehran, Iran
| |
Collapse
|
12
|
Gómez de León CT, Díaz Martín RD, Mendoza Hernández G, González Pozos S, Ambrosio JR, Mondragón Flores R. Proteomic characterization of the subpellicular cytoskeleton of Toxoplasma gondii tachyzoites. J Proteomics 2014; 111:86-99. [DOI: 10.1016/j.jprot.2014.03.008] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2013] [Revised: 02/27/2014] [Accepted: 03/07/2014] [Indexed: 01/09/2023]
|
13
|
Lin TH, Tsai TL. Constructing a linear QSAR for some metabolizable drugs by human or pig flavin-containing monooxygenases using some molecular features selected by a genetic algorithm trained SVM. J Theor Biol 2014; 356:85-97. [DOI: 10.1016/j.jtbi.2014.04.021] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2013] [Revised: 04/01/2014] [Accepted: 04/16/2014] [Indexed: 10/25/2022]
|
14
|
Yao Y, Yan S, Han J, Dai Q, He PA. A novel descriptor of protein sequences and its application. J Theor Biol 2014; 347:109-17. [PMID: 24412564 DOI: 10.1016/j.jtbi.2014.01.001] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2013] [Revised: 11/10/2013] [Accepted: 01/01/2014] [Indexed: 02/05/2023]
Abstract
In this paper, a dynamic 3-D graphical representation of protein sequences is introduced based on three physical-chemical properties of amino acids. The coordinates of the graph have direct biological significance, which could reflect the innate structure of the proteins. The information of principal moments of inertia and range of axis coordinate are extracted as a novel mixed descriptor and proposed for the comparison of protein primary sequences. Meanwhile, the Euclidean distance of the normalized descriptor vectors which avoid the influence of the difference in length of protein sequences under consideration is employed as a quantitative measurement of the similarity of proteins. Finally, we take the nine ND5 (NADH dehydrogenase subunit 5) proteins for example and illustrate the effectiveness of our approach.
Collapse
Affiliation(s)
- Yuhua Yao
- College of Life Sciences, Zhejiang Sci-Tech University, Hangzhou 310018, People's Republic of China.
| | - Shoujiang Yan
- College of Life Sciences, Zhejiang Sci-Tech University, Hangzhou 310018, People's Republic of China
| | - Jianning Han
- College of Sciences, Zhejiang Sci-Tech University, Hangzhou 310018, People's Republic of China
| | - Qi Dai
- College of Life Sciences, Zhejiang Sci-Tech University, Hangzhou 310018, People's Republic of China
| | - Ping-an He
- College of Sciences, Zhejiang Sci-Tech University, Hangzhou 310018, People's Republic of China
| |
Collapse
|
15
|
Diniz MC, Pacheco ACL, Farias KM, de Oliveira DM. The eukaryotic flagellum makes the day: novel and unforeseen roles uncovered after post-genomics and proteomics data. Curr Protein Pept Sci 2013; 13:524-46. [PMID: 22708495 PMCID: PMC3499766 DOI: 10.2174/138920312803582951] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2011] [Revised: 05/22/2012] [Accepted: 05/23/2012] [Indexed: 12/21/2022]
Abstract
This review will summarize and discuss the current biological understanding of the motile eukaryotic flagellum,
as posed out by recent advances enabled by post-genomics and proteomics approaches. The organelle, which is crucial
for motility, survival, differentiation, reproduction, division and feeding, among other activities, of many eukaryotes,
is a great example of a natural nanomachine assembled mostly by proteins (around 350-650 of them) that have been conserved
throughout eukaryotic evolution. Flagellar proteins are discussed in terms of their arrangement on to the axoneme,
the canonical “9+2” microtubule pattern, and also motor and sensorial elements that have been detected by recent proteomic
analyses in organisms such as Chlamydomonas reinhardtii, sea urchin, and trypanosomatids. Such findings can be
remarkably matched up to important discoveries in vertebrate and mammalian types as diverse as sperm cells, ciliated
kidney epithelia, respiratory and oviductal cilia, and neuro-epithelia, among others. Here we will focus on some exciting
work regarding eukaryotic flagellar proteins, particularly using the flagellar proteome of C. reinhardtii as a reference map
for exploring motility in function, dysfunction and pathogenic flagellates. The reference map for the eukaryotic flagellar
proteome consists of 652 proteins that include known structural and intraflagellar transport (IFT) proteins, less well-characterized
signal transduction proteins and flagellar associated proteins (FAPs), besides almost two hundred unannotated
conserved proteins, which lately have been the subject of intense investigation and of our present examination.
Collapse
Affiliation(s)
- Michely C Diniz
- Programa de Pós-Graduação em Biotecnologia-RENORBIO-Rede Nordeste de Biotecnologia, Universidade Estadual do Ceará-UECE, Av. Paranjana, 1700, Campus do Itaperi, Fortaleza, CE 60740-000 Brasil
| | | | | | | |
Collapse
|
16
|
Ziarek JJ, Liu Y, Smith E, Zhang G, Peterson FC, Chen J, Yu Y, Chen Y, Volkman BF, Li R. Fragment-based optimization of small molecule CXCL12 inhibitors for antagonizing the CXCL12/CXCR4 interaction. Curr Top Med Chem 2013; 12:2727-40. [PMID: 23368099 DOI: 10.2174/1568026611212240003] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2012] [Revised: 10/08/2012] [Accepted: 11/03/2012] [Indexed: 12/21/2022]
Abstract
The chemokine CXCL12 and its G protein-coupled receptor (GPCR) CXCR4 are high-priority clinical targets because of their involvement in metastatic cancers (also implicated in autoimmune disease and cardiovascular disease). Because chemokines interact with two distinct sites to bind and activate their receptors, both the GPCRs and chemokines are potential targets for small molecule inhibition. A number of chemokines have been validated as targets for drug development, but virtually all drug discovery efforts focus on the GPCRs. However, all CXCR4 receptor antagonists with the exception of MSX-122 have failed in clinical trials due to unmanageable toxicities, emphasizing the need for alternative strategies to interfere with CXCL12/CXCR4-guided metastatic homing. Although targeting the relatively featureless surface of CXCL12 was presumed to be challenging, focusing efforts at the sulfotyrosine (sY) binding pockets proved successful for procuring initial hits. Using a hybrid structure-based in silico/NMR screening strategy, we recently identified a ligand that occludes the receptor recognition site. From this initial hit, we designed a small fragment library containing only nine tetrazole derivatives using a fragment-based and bioisostere approach to target the sY binding sites of CXCL12. Compound binding modes and affinities were studied by 2D NMR spectroscopy, X-ray crystallography, molecular docking and cell-based functional assays. Our results demonstrate that the sY binding sites are conducive to the development of high affinity inhibitors with better ligand efficiency (LE) than typical protein-protein interaction inhibitors (LE ≤ 0.24). Our novel tetrazole-based fragment 18 was identified to bind the sY21 site with a K(d) of 24 μM (LE = 0.30). Optimization of 18 yielded compound 25 which specifically inhibits CXCL12-induced migration with an improvement in potency over the initial hit 9. The fragment from this library that exhibited the highest affinity and ligand efficiency (11: K(d) = 13 μM, LE = 0.33) may serve as a starting point for development of inhibitors targeting the sY12 site.
Collapse
Affiliation(s)
- Joshua J Ziarek
- Department of Biochemistry, Medical College of Wisconsin, 8701 Watertown Plank Road, Milwaukee, WI 53226, USA
| | | | | | | | | | | | | | | | | | | |
Collapse
|
17
|
Jin YY, Ma Y, Gao QX, Wang RL, Wang SQ, Xu WR. Design of specific inhibitors of the protein tyrosine phosphatase SHP-2 by virtual screening and core hopping method. MOLECULAR SIMULATION 2013. [DOI: 10.1080/08927022.2013.824573] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
|
18
|
Liu L, Ma Y, Wang RL, Xu WR, Wang SQ, Chou KC. Find novel dual-agonist drugs for treating type 2 diabetes by means of cheminformatics. Drug Des Devel Ther 2013; 7:279-88. [PMID: 23630413 PMCID: PMC3623550 DOI: 10.2147/dddt.s42113] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
The high prevalence of type 2 diabetes mellitus in the world as well as the increasing reports about the adverse side effects of the existing diabetes treatment drugs have made developing new and effective drugs against the disease a very high priority. In this study, we report ten novel compounds found by targeting peroxisome proliferator-activated receptors (PPARs) using virtual screening and core hopping approaches. PPARs have drawn increasing attention for developing novel drugs to treat diabetes due to their unique functions in regulating glucose, lipid, and cholesterol metabolism. The reported compounds are featured with dual functions, and hence belong to the category of dual agonists. Compared with the single PPAR agonists, the dual PPAR agonists, formed by combining the lipid benefit of PPARα agonists (such as fibrates) and the glycemic advantages of the PPARγ agonists (such as thiazolidinediones), are much more powerful in treating diabetes because they can enhance metabolic effects while minimizing the side effects. This was observed in the studies on molecular dynamics simulations, as well as on absorption, distribution, metabolism, and excretion, that these novel dual agonists not only possessed the same function as ragaglitazar (an investigational drug developed by Novo Nordisk for treating type 2 diabetes) did in activating PPARα and PPARγ, but they also had more favorable conformation for binding to the two receptors. Moreover, the residues involved in forming the binding pockets of PPARα and PPARγ among the top ten compounds are explicitly presented, and this will be very useful for the in-depth conduction of mutagenesis experiments. It is anticipated that the ten compounds may become potential drug candidates, or at the very least, the findings reported here may stimulate new strategies or provide useful insights for designing new and more powerful dual-agonist drugs for treating type 2 diabetes.
Collapse
Affiliation(s)
- Lei Liu
- PET/CT Center, General Hospital of Tianjin Medical University, Tianjin, People’s Republic of China
| | - Ying Ma
- Tianjin Key Laboratory on Technologies Enabling Development of Clinical Therapeutics and Diagnostics (Theranostics), School of Pharmacy, Tianjin Medical University, Tianjin, People’s Republic of China
| | - Run-Ling Wang
- Tianjin Key Laboratory on Technologies Enabling Development of Clinical Therapeutics and Diagnostics (Theranostics), School of Pharmacy, Tianjin Medical University, Tianjin, People’s Republic of China
| | - Wei-Ren Xu
- Tianjin Institute of Pharmaceutical Research (TIPR), Tianjin, People’s Republic of China
| | - Shu-Qing Wang
- Tianjin Key Laboratory on Technologies Enabling Development of Clinical Therapeutics and Diagnostics (Theranostics), School of Pharmacy, Tianjin Medical University, Tianjin, People’s Republic of China
- Gordon Life Science Institute, Belmont, MA, USA
| | - Kuo-Chen Chou
- Center of Excellence in Genomic Medicine Research (CEGMR), King Abdulaziz University, Jeddah, Saudi Arabia
- Gordon Life Science Institute, Belmont, MA, USA
| |
Collapse
|
19
|
Cao DS, Xu QS, Hu QN, Liang YZ. ChemoPy: freely available python package for computational biology and chemoinformatics. ACTA ACUST UNITED AC 2013; 29:1092-4. [PMID: 23493324 DOI: 10.1093/bioinformatics/btt105] [Citation(s) in RCA: 123] [Impact Index Per Article: 11.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023]
Abstract
MOTIVATION Molecular representation for small molecules has been routinely used in QSAR/SAR, virtual screening, database search, ranking, drug ADME/T prediction and other drug discovery processes. To facilitate extensive studies of drug molecules, we developed a freely available, open-source python package called chemoinformatics in python (ChemoPy) for calculating the commonly used structural and physicochemical features. It computes 16 drug feature groups composed of 19 descriptors that include 1135 descriptor values. In addition, it provides seven types of molecular fingerprint systems for drug molecules, including topological fingerprints, electro-topological state (E-state) fingerprints, MACCS keys, FP4 keys, atom pairs fingerprints, topological torsion fingerprints and Morgan/circular fingerprints. By applying a semi-empirical quantum chemistry program MOPAC, ChemoPy can also compute a large number of 3D molecular descriptors conveniently. AVAILABILITY The python package, ChemoPy, is freely available via http://code.google.com/p/pychem/downloads/list, and it runs on Linux and MS-Windows. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Dong-Sheng Cao
- Research Center of Modernization of Traditional Chinese Medicines, Central South University, Changsha, P. R. China
| | | | | | | |
Collapse
|
20
|
Characterization of structure–antioxidant activity relationship of peptides in free radical systems using QSAR models: Key sequence positions and their amino acid properties. J Theor Biol 2013; 318:29-43. [DOI: 10.1016/j.jtbi.2012.10.029] [Citation(s) in RCA: 130] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2012] [Revised: 10/21/2012] [Accepted: 10/22/2012] [Indexed: 11/22/2022]
|
21
|
Use of fast conformational sampling to improve the characterization of VEGF A–peptide interactions. J Theor Biol 2013; 317:293-300. [DOI: 10.1016/j.jtbi.2012.10.021] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2012] [Revised: 10/12/2012] [Accepted: 10/15/2012] [Indexed: 01/25/2023]
|
22
|
Ghosh A, Chattopadhyay S, Chawla-Sarkar M, Nandy P, Nandy A. In silico study of rotavirus VP7 surface accessible conserved regions for antiviral drug/vaccine design. PLoS One 2012; 7:e40749. [PMID: 22844409 PMCID: PMC3406019 DOI: 10.1371/journal.pone.0040749] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2012] [Accepted: 06/12/2012] [Indexed: 11/23/2022] Open
Abstract
Background Rotaviral diarrhoea kills about half a million children annually in developing countries and accounts for one third of diarrhea related hospitalizations. Drugs and vaccines against the rotavirus are handicapped, as in all viral diseases, by the rapid mutational changes that take place in the DNA and protein sequences rendering most of these ineffective. As of now only two vaccines are licensed and approved by the WHO (World Health Organization), but display reduced efficiencies in the underdeveloped countries where the disease is more prevalent. We approached this issue by trying to identify regions of surface exposed conserved segments on the surface glycoproteins of the virion, which may then be targeted by specific peptide vaccines. We had developed a bioinformatics protocol for these kinds of problems with reference to the influenza neuraminidase protein, which we have refined and expanded to analyze the rotavirus issue. Results Our analysis of 433 VP7 (Viral Protein 7 from rotavirus) surface protein sequences across 17 subtypes encompassing mammalian hosts using a 20D Graphical Representation and Numerical Characterization method, identified four possible highly conserved peptide segments. Solvent accessibility prediction servers were used to identify that these are predominantly surface situated. These regions analyzed through selected epitope prediction servers for their epitopic properties towards possible T-cell and B-cell activation showed good results as epitopic candidates (only dry lab confirmation). Conclusions The main reasons for the development of alternative vaccine strategies for the rotavirus are the failure of current vaccines and high production costs that inhibit their application in developing countries. We expect that it would be possible to use the protein surface exposed regions identified in our study as targets for peptide vaccines and drug designs for stable immunity against divergent strains of the rotavirus. Though this study is fully dependent on computational prediction algorithms, it provides a platform for wet lab experiments.
Collapse
Affiliation(s)
- Ambarnil Ghosh
- Physics Department, Jadavpur University, Kolkata, West Bengal, India
| | - Shiladitya Chattopadhyay
- Division of Virology, National Institute of Cholera and Enteric Diseases, Kolkata, West Bengal, India
| | - Mamta Chawla-Sarkar
- Division of Virology, National Institute of Cholera and Enteric Diseases, Kolkata, West Bengal, India
| | - Papiya Nandy
- Physics Department, Jadavpur University, Kolkata, West Bengal, India
| | - Ashesh Nandy
- Centre for Interdisciplinary Research and Education, Kolkata, West Bengal, India
- * E-mail:
| |
Collapse
|
23
|
A segmented principal component analysis—regression approach to QSAR study of peptides. J Theor Biol 2012; 305:37-44. [DOI: 10.1016/j.jtbi.2012.03.028] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2011] [Revised: 03/08/2012] [Accepted: 03/26/2012] [Indexed: 12/22/2022]
|
24
|
Ma Y, Wang SQ, Xu WR, Wang RL, Chou KC. Design novel dual agonists for treating type-2 diabetes by targeting peroxisome proliferator-activated receptors with core hopping approach. PLoS One 2012; 7:e38546. [PMID: 22685582 PMCID: PMC3369836 DOI: 10.1371/journal.pone.0038546] [Citation(s) in RCA: 80] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2012] [Accepted: 05/07/2012] [Indexed: 12/02/2022] Open
Abstract
Owing to their unique functions in regulating glucose, lipid and cholesterol metabolism, PPARs (peroxisome proliferator-activated receptors) have drawn special attention for developing drugs to treat type-2 diabetes. By combining the lipid benefit of PPAR-alpha agonists (such as fibrates) with the glycemic advantages of the PPAR-gamma agonists (such as thiazolidinediones), the dual PPAR agonists approach can both improve the metabolic effects and minimize the side effects caused by either agent alone, and hence has become a promising strategy for designing effective drugs against type-2 diabetes. In this study, by means of the powerful “core hopping” and “glide docking” techniques, a novel class of PPAR dual agonists was discovered based on the compound GW409544, a well-known dual agonist for both PPAR-alpha and PPAR-gamma modified from the farglitazar structure. It was observed by molecular dynamics simulations that these novel agonists not only possessed the same function as GW409544 did in activating PPAR-alpha and PPAR-gamma, but also had more favorable conformation for binding to the two receptors. It was further validated by the outcomes of their ADME (absorption, distribution, metabolism, and excretion) predictions that the new agonists hold high potential to become drug candidates. Or at the very least, the findings reported here may stimulate new strategy or provide useful insights for discovering more effective dual agonists for treating type-2 diabetes. Since the “core hopping” technique allows for rapidly screening novel cores to help overcome unwanted properties by generating new lead compounds with improved core properties, it has not escaped our notice that the current strategy along with the corresponding computational procedures can also be utilized to find novel and more effective drugs for treating other illnesses.
Collapse
Affiliation(s)
- Ying Ma
- Tianjin Key Laboratory on Technologies Enabling Development of Clinical Therapeutics and Diagnostics (Theranostics), School of Pharmacy, Tianjin Medical University, Tianjin, China
| | - Shu-Qing Wang
- Tianjin Key Laboratory on Technologies Enabling Development of Clinical Therapeutics and Diagnostics (Theranostics), School of Pharmacy, Tianjin Medical University, Tianjin, China
- Gordon Life Science Institute, San Diego, California, United States of America
- * E-mail: (SQW); (RLW)
| | - Wei-Ren Xu
- Tianjin Institute of Pharmaceutical Research (TIPR), Tianjin, China
| | - Run-Ling Wang
- Tianjin Key Laboratory on Technologies Enabling Development of Clinical Therapeutics and Diagnostics (Theranostics), School of Pharmacy, Tianjin Medical University, Tianjin, China
- * E-mail: (SQW); (RLW)
| | - Kuo-Chen Chou
- Gordon Life Science Institute, San Diego, California, United States of America
| |
Collapse
|
25
|
Hu LL, Chen C, Huang T, Cai YD, Chou KC. Predicting biological functions of compounds based on chemical-chemical interactions. PLoS One 2011; 6:e29491. [PMID: 22220213 PMCID: PMC3248422 DOI: 10.1371/journal.pone.0029491] [Citation(s) in RCA: 50] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2011] [Accepted: 11/29/2011] [Indexed: 11/18/2022] Open
Abstract
Given a compound, how can we effectively predict its biological function? It is a fundamentally important problem because the information thus obtained may benefit the understanding of many basic biological processes and provide useful clues for drug design. In this study, based on the information of chemical-chemical interactions, a novel method was developed that can be used to identify which of the following eleven metabolic pathway classes a query compound may be involved with: (1) Carbohydrate Metabolism, (2) Energy Metabolism, (3) Lipid Metabolism, (4) Nucleotide Metabolism, (5) Amino Acid Metabolism, (6) Metabolism of Other Amino Acids, (7) Glycan Biosynthesis and Metabolism, (8) Metabolism of Cofactors and Vitamins, (9) Metabolism of Terpenoids and Polyketides, (10) Biosynthesis of Other Secondary Metabolites, (11) Xenobiotics Biodegradation and Metabolism. It was observed that the overall success rate obtained by the method via the 5-fold cross-validation test on a benchmark dataset consisting of 3,137 compounds was 77.97%, which is much higher than 10.45%, the corresponding success rate obtained by the random guesses. Besides, to deal with the situation that some compounds may be involved with more than one metabolic pathway class, the method presented here is featured by the capacity able to provide a series of potential metabolic pathway classes ranked according to the descending order of their likelihood for each of the query compounds concerned. Furthermore, our method was also applied to predict 5,549 compounds whose metabolic pathway classes are unknown. Interestingly, the results thus obtained are quite consistent with the deductions from the reports by other investigators. It is anticipated that, with the continuous increase of the chemical-chemical interaction data, the current method will be further enhanced in its power and accuracy, so as to become a useful complementary vehicle in annotating uncharacterized compounds for their biological functions.
Collapse
Affiliation(s)
- Le-Le Hu
- Institute of Systems Biology, Shanghai University, Shanghai, China
- Department of Chemistry, College of Sciences, Shanghai University, Shanghai, China
| | - Chen Chen
- Department of Chemistry, College of Sciences, Shanghai University, Shanghai, China
| | - Tao Huang
- Key Laboratory of Systems Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, China
- Shanghai Center for Bioinformation Technology, Shanghai, China
| | - Yu-Dong Cai
- Institute of Systems Biology, Shanghai University, Shanghai, China
- Gordon Life Science Institute, San Diego, California, United States of America
- * E-mail:
| | - Kuo-Chen Chou
- Gordon Life Science Institute, San Diego, California, United States of America
| |
Collapse
|
26
|
Chen W, Feng P, Lin H. Prediction of ketoacyl synthase family using reduced amino acid alphabets. J Ind Microbiol Biotechnol 2011; 39:579-84. [PMID: 22042516 DOI: 10.1007/s10295-011-1047-z] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2011] [Accepted: 10/04/2011] [Indexed: 11/28/2022]
Abstract
Ketoacyl synthases are enzymes involved in fatty acid synthesis and can be classified into five families based on primary sequence similarity. Different families have different catalytic mechanisms. Developing cost-effective computational models to identify the family of ketoacyl synthases will be helpful for enzyme engineering and in knowing individual enzymes' catalytic mechanisms. In this work, a support vector machine-based method was developed to predict ketoacyl synthase family using the n-peptide composition of reduced amino acid alphabets. In jackknife cross-validation, the model based on the 2-peptide composition of a reduced amino acid alphabet of size 13 yielded the best overall accuracy of 96.44% with average accuracy of 93.36%, which is superior to other state-of-the-art methods. This result suggests that the information provided by n-peptide compositions of reduced amino acid alphabets provides efficient means for enzyme family classification and that the proposed model can be efficiently used for ketoacyl synthase family annotation.
Collapse
Affiliation(s)
- Wei Chen
- Department of Physics, College of Sciences, Center for Genomics and Computational Biology, Hebei United University, Tangshan 063000, China.
| | | | | |
Collapse
|
27
|
González-Díaz H, Prado-Prado F, Sobarzo-Sánchez E, Haddad M, Maurel Chevalley S, Valentin A, Quetin-Leclercq J, Dea-Ayuela MA, Teresa Gomez-Muños M, Munteanu CR, José Torres-Labandeira J, García-Mera X, Tapia RA, Ubeira FM. NL MIND-BEST: A web server for ligands and proteins discovery—Theoretic-experimental study of proteins of Giardia lamblia and new compounds active against Plasmodium falciparum. J Theor Biol 2011; 276:229-49. [DOI: 10.1016/j.jtbi.2011.01.010] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2010] [Revised: 12/02/2010] [Accepted: 01/10/2011] [Indexed: 10/18/2022]
|
28
|
González-Díaz H, Muíño L, Anadón AM, Romaris F, Prado-Prado FJ, Munteanu CR, Dorado J, Sierra AP, Mezo M, González-Warleta M, Gárate T, Ubeira FM. MISS-Prot: web server for self/non-self discrimination of protein residue networks in parasites; theory and experiments in Fasciola peptides and Anisakis allergens. MOLECULAR BIOSYSTEMS 2011; 7:1938-55. [PMID: 21468430 DOI: 10.1039/c1mb05069a] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Infections caused by human parasites (HPs) affect the poorest 500 million people worldwide but chemotherapy has become expensive, toxic, and/or less effective due to drug resistance. On the other hand, many 3D structures in Protein Data Bank (PDB) remain without function annotation. We need theoretical models to quickly predict biologically relevant Parasite Self Proteins (PSP), which are expressed differentially in a given parasite and are dissimilar to proteins expressed in other parasites and have a high probability to become new vaccines (unique sequence) or drug targets (unique 3D structure). We present herein a model for PSPs in eight different HPs (Ascaris, Entamoeba, Fasciola, Giardia, Leishmania, Plasmodium, Trypanosoma, and Toxoplasma) with 90% accuracy for 15 341 training and validation cases. The model combines protein residue networks, Markov Chain Models (MCM) and Artificial Neural Networks (ANN). The input parameters are the spectral moments of the Markov transition matrix for electrostatic interactions associated with the protein residue complex network calculated with the MARCH-INSIDE software. We implemented this model in a new web-server called MISS-Prot (MARCH-INSIDE Scores for Self-Proteins). MISS-Prot was programmed using PHP/HTML/Python and MARCH-INSIDE routines and is freely available at: . This server is easy to use by non-experts in Bioinformatics who can carry out automatic online upload and prediction with 3D structures deposited at PDB (mode 1). We can also study outcomes of Peptide Mass Fingerprinting (PMFs) and MS/MS for query proteins with unknown 3D structures (mode 2). We illustrated the use of MISS-Prot in experimental and/or theoretical studies of peptides from Fasciola hepatica cathepsin proteases or present on 10 Anisakis simplex allergens (Ani s 1 to Ani s 10). In doing so, we combined electrophoresis (1DE), MALDI-TOF Mass Spectroscopy, and MASCOT to seek sequences, Molecular Mechanics + Molecular Dynamics (MM/MD) to generate 3D structures and MISS-Prot to predict PSP scores. MISS-Prot also allows the prediction of PSP proteins in 16 additional species including parasite hosts, fungi pathogens, disease transmission vectors, and biotechnologically relevant organisms.
Collapse
Affiliation(s)
- Humberto González-Díaz
- Department of Microbiology & Parasitology, Faculty of Pharmacy, University of Santiago de Compostela, 15782, Santiago de Compostela, Spain.
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
29
|
Prado-Prado F, García-Mera X, Abeijón P, Alonso N, Caamaño O, Yáñez M, Gárate T, Mezo M, González-Warleta M, Muiño L, Ubeira FM, González-Díaz H. Using entropy of drug and protein graphs to predict FDA drug-target network: theoretic-experimental study of MAO inhibitors and hemoglobin peptides from Fasciola hepatica. Eur J Med Chem 2011; 46:1074-94. [PMID: 21315497 DOI: 10.1016/j.ejmech.2011.01.023] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2010] [Revised: 01/10/2011] [Accepted: 01/13/2011] [Indexed: 12/11/2022]
Abstract
There are many drugs described with very different affinity to a large number of receptors. In this work, we selected Drug-Target pairs (DTPs/nDTPs) of drugs with high affinity/non-affinity for different targets like proteins. Quantitative Structure-Activity Relationships (QSAR) models become a very useful tool in this context to substantially reduce time and resources consuming experiments. Unfortunately, most QSAR models predict activity against only one protein. To solve this problem, we developed here a multi-target QSAR (mt-QSAR) classifier using the MARCH-INSIDE technique to calculate structural parameters of drug and target plus one Artificial Neuronal Network (ANN) to seek the model. The best ANN model found is a Multi-Layer Perceptron (MLP) with profile MLP 32:32-15-1:1. This MLP classifies correctly 623 out of 678 DTPs (Sensitivity = 91.89%) and 2995 out of 3234 nDTPs (Specificity = 92.61%), corresponding to training Accuracy = 92.48%. The validation of the model was carried out by means of external predicting series. The model classifies correctly 313 out of 338 DTPs (Sensitivity = 92.60%) and 1411 out of 1534 nDTP (Specificity = 91.98%) in validation series, corresponding to total Accuracy = 92.09% for validation series (Predictability). This model favorably compares with other LDA and ANN models developed in this work and Machine Learning classifiers published before to address the same problem in different aspects. These mt-QSARs offer also a good opportunity to construct drug-protein Complex Networks (CNs) that can be used to explore large and complex drug-protein receptors databases. Finally, we illustrated two practical uses of this model with two different experiments. In experiment 1, we report prediction, synthesis, characterization, and MAO-A and MAO-B pharmacological assay of 10 rasagiline derivatives promising for anti-Parkinson drug design. In experiment 2, we report sampling, parasite culture, SEC and 1DE sample preparation, MALDI-TOF MS and MS/MS analysis, MASCOT search, MM/MD 3D structure modeling, and QSAR prediction for different peptides of hemoglobin found in the proteome of the human parasite Fasciola hepatica; which is promising for anti-parasite drug targets discovery.
Collapse
|
30
|
Ghosh A, Nandy A. Graphical representation and mathematical characterization of protein sequences and applications to viral proteins. ADVANCES IN PROTEIN CHEMISTRY AND STRUCTURAL BIOLOGY 2011; 83:1-42. [PMID: 21570664 PMCID: PMC7150266 DOI: 10.1016/b978-0-12-381262-9.00001-x] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/15/2023]
Abstract
Graphical representation and numerical characterization (GRANCH) of nucleotide and protein sequences is a new field that is showing a lot of promise in analysis of such sequences. While formulation and applications of GRANCH techniques for DNA/RNA sequences started just over a decade ago, analyses of protein sequences by these techniques are of more recent origin. The emphasis is still on developing the underlying technique, but significant results have been achieved in using these methods for protein phylogeny, mass spectral data of proteins and protein serum profiles in parasites, toxicoproteomics, determination of different indices for use in QSAR studies, among others. We briefly mention these in this chapter, with some details on protein phylogeny and viral diseases. In particular, we cover a systematic method developed in GRANCH to determine conserved surface exposed peptide segments in selected viral proteins that can be used for drug and vaccine targeting. The new GRANCH techniques and applications for DNAs and proteins are covered briefly to provide an overview to this nascent field.
Collapse
Affiliation(s)
- Ambarnil Ghosh
- Physics Department, Jadavpur University, Jadavpur, Kolkata, India
| | | |
Collapse
|
31
|
Wang Y, Yu L, Zhang L, Qu H, Cheng Y. A Novel Methodology for Multicomponent Drug Design and Its Application in Optimizing the Combination of Active Components from Chinese Medicinal FormulaShenmai. Chem Biol Drug Des 2010; 75:318-24. [DOI: 10.1111/j.1747-0285.2009.00934.x] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
32
|
Xi L, Du J, Li S, Li J, Liu H, Yao X. A combined molecular modeling study on gelatinases and their potent inhibitors. J Comput Chem 2010; 31:24-42. [DOI: 10.1002/jcc.21279] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
|
33
|
Concu R, Dea-Ayuela MA, Perez-Montoto LG, Bolas-Fernández F, Prado-Prado FJ, Podda G, Uriarte E, Ubeira FM, González-Díaz H. Prediction of enzyme classes from 3D structure: a general model and examples of experimental-theoretic scoring of peptide mass fingerprints of Leishmania proteins. J Proteome Res 2009; 8:4372-82. [PMID: 19603824 DOI: 10.1021/pr9003163] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
The number of protein and peptide structures included in Protein Data Bank (PDB) and Gen Bank without functional annotation has increased. Consequently, there is a high demand for theoretical models to predict these functions. Here, we trained and validated, with an external set, a Markov Chain Model (MCM) that classifies proteins by their possible mechanism of action according to Enzyme Classification (EC) number. The methodology proposed is essentially new, and enables prediction of all EC classes with a single equation without the need for an equation for each class or nonlinear models with multiple outputs. In addition, the model may be used to predict whether one peptide presents a positive or negative contribution of the activity of the same EC class. The model predicts the first EC number for 106 out of 151 (70.2%) oxidoreductases, 178/178 (100%) transferases, 223/223 (100%) hydrolases, 64/85 (75.3%) lyases, 74/74 (100%) isomerases, and 100/100 (100%) ligases, as well as 745/811 (91.9%) nonenzymes. It is important to underline that this method may help us predict new enzyme proteins or select peptide candidates that improve enzyme activity, which may be of interest for the prediction of new drugs or drug targets. To illustrate the model's application, we report the 2D-Electrophoresis (2DE) isolation from Leishmania infantum as well as MADLI TOF Mass Spectra characterization and theoretical study of the Peptide Mass Fingerprints (PMFs) of a new protein sequence. The theoretical study focused on MASCOT, BLAST alignment, and alignment-free QSAR prediction of the contribution of 29 peptides found in the PMF of the new protein to specific enzyme action. This combined strategy may be used to identify and predict peptides of prokaryote and eukaryote parasites and their hosts as well as other superior organisms, which may be of interest in drug development or target identification.
Collapse
Affiliation(s)
- Riccardo Concu
- Department of Microbiology & Parasitology, Faculty of Pharmacy, University of Santiago de Compostela, Santiago de Compostela, Spain
| | | | | | | | | | | | | | | | | |
Collapse
|
34
|
Yan SM, Wu G. Trends in global warming and evolution of matrix protein 2 family from influenza A virus. Interdiscip Sci 2009; 1:272-9. [PMID: 20640805 PMCID: PMC7091293 DOI: 10.1007/s12539-009-0053-6] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2009] [Revised: 05/22/2009] [Accepted: 05/25/2009] [Indexed: 05/29/2023]
Abstract
The global warming is an important factor affecting the biological evolution, and the influenza is an important disease that threatens humans with possible epidemics or pandemics. In this study, we attempted to analyze the trends in global warming and evolution of matrix protein 2 family from influenza A virus, because this protein is a target of anti-flu drug, and its mutation would have significant effect on the resistance to anti-flu drugs. The evolution of matrix protein 2 of influenza A virus from 1959 to 2008 was defined using the unpredictable portion of amino-acid pair predictability. Then the trend in this evolution was compared with the trend in the global temperature, the temperature in north and south hemispheres, and the temperature in influenza A virus sampling site, and species carrying influenza A virus. The results showed the similar trends in global warming and in evolution of M2 proteins although we could not correlate them at this stage of study. The study suggested the potential impact of global warming on the evolution of proteins from influenza A virus.
Collapse
Affiliation(s)
- Shao-Min Yan
- National Engineering Research Center for Non-food Biorefinery, Guangxi Academy of Sciences, Nanning, Guangxi 530007 P.R. China
| | - Guang Wu
- Computational Mutation Project, DreamSciTech Consulting, Shenzhen, Guangdong, 518054 P.R. China
| |
Collapse
|
35
|
3D entropy and moments prediction of enzyme classes and experimental-theoretic study of peptide fingerprints in Leishmania parasites. BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS 2009; 1794:1784-94. [DOI: 10.1016/j.bbapap.2009.08.020] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/27/2009] [Revised: 08/07/2009] [Accepted: 08/17/2009] [Indexed: 11/21/2022]
|
36
|
Shine Y, Kikuchi T. Estimation of relative binding free energy based on a free energy variational principle for quantitative structure activity relationship analyses. Chem Phys 2009. [DOI: 10.1016/j.chemphys.2009.09.023] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
37
|
Pérez-Montoto LG, Santana L, González-Díaz H. Scoring function for DNA-drug docking of anticancer and antiparasitic compounds based on spectral moments of 2D lattice graphs for molecular dynamics trajectories. Eur J Med Chem 2009; 44:4461-9. [PMID: 19604606 PMCID: PMC7127518 DOI: 10.1016/j.ejmech.2009.06.011] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2009] [Revised: 06/04/2009] [Accepted: 06/05/2009] [Indexed: 02/02/2023]
Abstract
We introduce here a new class of invariants for MD trajectories based on the spectral moments pi(k)(L) of the Markov matrix associated to lattice network-like (LN) graph representations of Molecular Dynamics (MD) trajectories. The procedure embeds the MD energy profiles on a 2D Cartesian coordinates system using simple heuristic rules. At the same time, we associate the LN with a Markov matrix that describes the probabilities of passing from one state to other in the new 2D space. We construct this type of LNs for 422 MD trajectories obtained in DNA-drug docking experiments of 57 furocoumarins. The combined use of psoralens+ultraviolet light (UVA) radiation is known as PUVA therapy. PUVA is effective in the treatment of skin diseases such as psoriasis and mycosis fungoides. PUVA is also useful to treat human platelet (PTL) concentrates in order to eliminate Leishmania spp. and Trypanosoma cruzi. Both are parasites that cause Leishmaniosis (a dangerous skin and visceral disease) and Chagas disease, respectively; and may circulate in blood products collected from infected donors. We included in this study both lineal (psoralens) and angular (angelicins) furocoumarins. In the study, we grouped the LNs on two sets; set1: DNA-drug complex MD trajectories for active compounds and set2: MD trajectories of non-active compounds or no-optimal MD trajectories of active compounds. We calculated the respective pi(k)(L) values for all these LNs and used them as inputs to train a new classifier that discriminate set1 from set2 cases. In training series the model correctly classifies 79 out of 80 (specificity=98.75%) set1 and 226 out of 238 (Sensitivity=94.96%) set2 trajectories. In independent validation series the model correctly classifies 26 out of 26 (specificity=100%) set1 and 75 out of 78 (sensitivity=96.15%) set2 trajectories. We propose this new model as a scoring function to guide DNA-docking studies in the drug design of new coumarins for anticancer or antiparasitic PUVA therapy.
Collapse
Affiliation(s)
- Lázaro G. Pérez-Montoto
- Department of Microbiology & Parasitology, and Department of Organic Chemistry
- Faculty of Pharmacy, University of Santiago de Compostela, 15782, Spain
| | - Lourdes Santana
- Faculty of Pharmacy, University of Santiago de Compostela, 15782, Spain
| | - Humberto González-Díaz
- Department of Microbiology & Parasitology, and Department of Organic Chemistry
- Faculty of Pharmacy, University of Santiago de Compostela, 15782, Spain
| |
Collapse
|
38
|
Hemmateenejad B, Mehdipour AR, Miri R, Shamsipur M. Application of MOLMAP approach for QSAR modeling of various biological activities using substituent electronic descriptors. J Comput Chem 2009; 30:2001-9. [DOI: 10.1002/jcc.21198] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
39
|
Viña D, Uriarte E, Orallo F, González-Díaz H. Alignment-free prediction of a drug-target complex network based on parameters of drug connectivity and protein sequence of receptors. Mol Pharm 2009; 6:825-35. [PMID: 19281186 DOI: 10.1021/mp800102c] [Citation(s) in RCA: 60] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
There are many drugs described with very different affinity to a large number of receptors. In this work, we selected drug-receptor pairs (DRPs) of affinity/nonaffinity drugs to similar/dissimilar receptors and we represented them as a large network, which may be used to identify drugs that can act on a receptor. Computational chemistry prediction of the biological activity based on quantitative structure-activity relationships (QSAR) substantially increases the potentialities of this kind of networks avoiding time- and resource-consuming experiments. Unfortunately, most QSAR models are unspecific or predict activity against only one receptor. To solve this problem, we developed here a multitarget QSAR (mt-QSAR) classification model. Overall model classification accuracy was 72.25% (1390/1924 compounds) in training, 72.28% (459/635) in cross-validation. Outputs of this mt-QSAR model were used as inputs to construct a network. The observed network has 1735 nodes (DRPs), 1754 edges or pairs of DRPs with similar drug-target affinity (sPDRPs), and low coverage density d = 0.12%. The predicted network has 1735 DRPs, 1857 sPDRPs, and also low coverage density d = 0.12%. After an edge-to-edge comparison (chi-square = 9420.3; p < 0.005), we have demonstrated that the predicted network is significantly similar to the one observed and both have a distribution closer to exponential than to normal.
Collapse
Affiliation(s)
- Dolores Viña
- Department of Organic Chemistry, University of Santiago de Compostela, 15782, Spain
| | | | | | | |
Collapse
|
40
|
Generalized lattice graphs for 2D-visualization of biological information. J Theor Biol 2009; 261:136-47. [PMID: 19646452 PMCID: PMC7094121 DOI: 10.1016/j.jtbi.2009.07.029] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2009] [Revised: 07/18/2009] [Accepted: 07/20/2009] [Indexed: 01/09/2023]
Abstract
Several graph representations have been introduced for different data in theoretical biology. For instance, complex networks based on Graph theory are used to represent the structure and/or dynamics of different large biological systems such as protein–protein interaction networks. In addition, Randic, Liao, Nandy, Basak, and many others developed some special types of graph-based representations. This special type of graph includes geometrical constrains to node positioning in space and adopts final geometrical shapes that resemble lattice-like patterns. Lattice networks have been used to visually depict DNA and protein sequences but they are very flexible. However, despite the proved efficacy of new lattice-like graph/networks to represent diverse systems, most works focus on only one specific type of biological data. This work proposes a generalized type of lattice and illustrates how to use it in order to represent and compare biological data from different sources. We exemplify the following cases: protein sequence; mass spectra (MS) of protein peptide mass fingerprints (PMF); molecular dynamic trajectory (MDTs) from structural studies; mRNA microarray data; single nucleotide polymorphisms (SNPs); 1D or 2D-Electrophoresis study of protein polymorphisms and protein-research patent and/or copyright information. We used data available from public sources for some examples but for other, we used experimental results reported herein for the first time. This work may break new ground for the application of Graph theory in theoretical biology and other areas of biomedical sciences.
Collapse
|
41
|
Pérez-Montoto LG, Dea-Ayuela MA, Prado-Prado FJ, Bolas-Fernández F, Ubeira FM, González-Díaz H. Study of peptide fingerprints of parasite proteins and drug-DNA interactions with Markov-Mean-Energy invariants of biopolymer molecular-dynamic lattice networks. POLYMER 2009; 50:3857-3870. [PMID: 32287404 PMCID: PMC7111648 DOI: 10.1016/j.polymer.2009.05.055] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2009] [Revised: 05/06/2009] [Accepted: 05/14/2009] [Indexed: 11/26/2022]
Abstract
Since the advent of Molecular Dynamics (MD) in biopolymers science with the study by Karplus et al. on protein dynamics, MD has become the by foremost well established, computational technique to investigate structure and function of biomolecules and their respective complexes and interactions. The analysis of the MD trajectories (MDTs) remains, however, the greatest challenge and requires a great deal of insight, experience, and effort. Here, we introduce a new class of invariants for MDTs based on the spatial distribution of Mean-Energy values ξk (L) on a 2D Euclidean space representation of the MDTs. The procedure forces one MD trajectory to fold into a 2D Cartesian coordinates system using a step-by-step procedure driven by simple rules. The ξk (L) values are invariants of a Markov matrix (1 Π), which describes the probabilities of transition between two states in the new 2D space; which is associated to a graph representation of MDTs similar to the lattice networks (LNs) of DNA and protein sequences. We also introduce a new algorithm to perform phylogenetic analysis of peptides based on MDTs instead of the sequence of the polypeptide. In a first experiment, we illustrate this algorithm for 35 peptides present on the Peptide Mass Fingerprint (PMF) of a new protein of Leishmania infantum studied in this work. We report, by the first time, 2D Electrophoresis isolation, MALDI TOF Mass Spectroscopy characterization, and MASCOT search results for this PMF. In a second experiment, we construct the LNs for 422 MDTs obtained in DNA-Drug Docking simulations of the interaction of 57 anticancer furocoumarins with a DNA oligonucleotide. We calculated the respective ξk (L) values for all these LNs and used them as inputs to train a new classifier with Accuracy = 85.44% and 84.91% in training and validation respectively. The new model can be used as scoring function to guide DNA-Drug Docking studies in drug design of new coumarins for PUVA therapy. The new phylogenetics analysis algorithms encode information different from sequence similarity and may be used to analyze MDTs obtained in Docking or modeling experiments for any classes of biopolymers. The work opens new perspective on the analysis and applications of MD in polymer sciences.
Collapse
Affiliation(s)
- Lázaro Guillermo Pérez-Montoto
- Department of Microbiology and Parasitology, Faculty of Pharmacy, University of Santiago de Compostela, 15782 Santiago de Compostela, Spain,Department of Organic Chemistry, Faculty of Pharmacy, University of Santiago de Compostela, 15782 Santiago de Compostela, Spain
| | - María Auxiliadora Dea-Ayuela
- Departamento de Atención Sanitaria, Salud Pública y Sanidad Animal, Facultad CC Experimentales y de La Salud, Universidad CEU Cardenal Herrera, 46113 Moncada (Valencia), Spain
| | - Francisco J. Prado-Prado
- Department of Microbiology and Parasitology, Faculty of Pharmacy, University of Santiago de Compostela, 15782 Santiago de Compostela, Spain,Department of Organic Chemistry, Faculty of Pharmacy, University of Santiago de Compostela, 15782 Santiago de Compostela, Spain
| | | | - Florencio M. Ubeira
- Department of Microbiology and Parasitology, Faculty of Pharmacy, University of Santiago de Compostela, 15782 Santiago de Compostela, Spain
| | - Humberto González-Díaz
- Department of Microbiology and Parasitology, Faculty of Pharmacy, University of Santiago de Compostela, 15782 Santiago de Compostela, Spain,Corresponding author. Department of Microbiology and Parasitology, Faculty of Pharmacy, University of Santiago de Compostela, 15782 Santiago de Compostela, Spain
| |
Collapse
|
42
|
González-Díaz H, Dea-Ayuela MA, Pérez-Montoto LG, Prado-Prado FJ, Agüero-Chapín G, Bolas-Fernández F, Vazquez-Padrón RI, Ubeira FM. QSAR for RNases and theoretic-experimental study of molecular diversity on peptide mass fingerprints of a new Leishmania infantum protein. Mol Divers 2009; 14:349-69. [PMID: 19578942 PMCID: PMC7088557 DOI: 10.1007/s11030-009-9178-0] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2009] [Accepted: 06/13/2009] [Indexed: 11/29/2022]
Abstract
The toxicity and low success of current treatments for Leishmaniosis determines the search of new peptide drugs and/or molecular targets in Leishmania pathogen species (L. infantum and L. major). For example, Ribonucleases (RNases) are enzymes relevant to several biologic processes; then, theoretical and experimental study of the molecular diversity of Peptide Mass Fingerprints (PMFs) of RNases is useful for drug design. This study introduces a methodology that combines QSAR models, 2D-Electrophoresis (2D-E), MALDI-TOF Mass Spectroscopy (MS), BLAST alignment, and Molecular Dynamics (MD) to explore PMFs of RNases. We illustrate this approach by investigating for the first time the PMFs of a new protein of L. infantum. Here we report and compare new versus old predictive models for RNases based on Topological Indices (TIs) of Markov Pseudo-Folding Lattices. These group of indices called Pseudo-folding Lattice 2D-TIs include: Spectral moments pi ( k )(x,y), Mean Electrostatic potentials xi ( k )(x,y), and Entropy measures theta ( k )(x,y). The accuracy of the models (training/cross-validation) was as follows: xi ( k )(x,y)-model (96.0%/91.7%)>pi ( k )(x,y)-model (84.7/83.3) > theta ( k )(x,y)-model (66.0/66.7). We also carried out a 2D-E analysis of biological samples of L. infantum promastigotes focusing on a 2D-E gel spot of one unknown protein with M<20, 100 and pI <7. MASCOT search identified 20 proteins with Mowse score >30, but not one >52 (threshold value), the higher value of 42 was for a probable DNA-directed RNA polymerase. However, we determined experimentally the sequence of more than 140 peptides. We used QSAR models to predict RNase scores for these peptides and BLAST alignment to confirm some results. We also calculated 3D-folding TIs based on MD experiments and compared 2D versus 3D-TIs on molecular phylogenetic analysis of the molecular diversity of these peptides. This combined strategy may be of interest in drug development or target identification.
Collapse
Affiliation(s)
- Humberto González-Díaz
- Department of Microbiology and Parasitology, and Department of Organic Chemistry, Faculty of Pharmacy, USC, 15782, Santiago de Compostela, Spain.
| | | | | | | | | | | | | | | |
Collapse
|
43
|
Wang JF, Gong K, Wei DQ, Li YX, Chou KC. Molecular dynamics studies on the interactions of PTP1B with inhibitors: from the first phosphate-binding site to the second one. Protein Eng Des Sel 2009; 22:349-55. [PMID: 19380334 DOI: 10.1093/protein/gzp012] [Citation(s) in RCA: 72] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Protein tyrosine phosphatases 1B (PTP1B) is a major negative regulator of both insulin and leptin signaling pathways. In view of this, it becomes an important target for drug development against cancers, diabetes and obesity. The aim of the current study is to use the long time-scale molecular dynamics (MD) simulations to investigate the structural and dynamic factors that cause its inhibition by INTA and INTB, the two most potent and highly selective PTP1B inhibitors known so far. In order to investigate the mode of collective motions that is vitally important to the biological function, the covariance matrix of C(alpha) atoms was introduced for performing the dynamic analysis of the inhibition systems. It has been observed that the conformational and dynamic features of WPD-Loop, R-Loop and S-Loop play a key role in providing a smooth entrance for the inhibitors moving into the binding pocket as well as a favorable microenvironment to stabilize them. Furthermore, the hydrogen bonding networks formed around the active site with INTA and INTB may be the main reason of why the inhibition of PTP1B by the two ligands is so potent and selective. All these findings might provide useful insights for developing novel and effective drugs to treat cancer, diabetes and obesity.
Collapse
Affiliation(s)
- Jing-Fang Wang
- Bioinformatics Center, Key Laboratory of Systems Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, Peoples Republic of China
| | | | | | | | | |
Collapse
|
44
|
Li G, Haney KM, Kellogg GE, Zhang Y. Comparative docking study of anibamine as the first natural product CCR5 antagonist in CCR5 homology models. J Chem Inf Model 2009; 49:120-32. [PMID: 19166361 DOI: 10.1021/ci800356a] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
Abstract
Anibamine, a novel pyridine quaternary alkaloid recently isolated from Aniba sp., has been found to effectively bind to the chemokine receptor CCR5 with an IC(50) at 1 microM in competition with (125)I-gp120, an HIV viral envelope protein binding to CCR5 with high affinity. Since CCR5, a G-protein-coupled receptor, is an essential coreceptor for the human immunodeficiency virus type I (HIV-1) entry to host cells, a CCR5 antagonist that inhibits the cellular entry of HIV-1 provides a new therapy choice for the treatment of HIV. Anibamine provides a novel structural skeleton that is remarkably different from all lead compounds previously identified as CCR5 antagonists. Here, we report comparative docking studies of anibamine with several other known CCR5 antagonists in two CCR5 homology models built based on the crystal structures of bovine rhodopsin and human beta(2)-adrenergic receptor. The binding pocket of anibamine has some common features shared with other high affinity CCR5 antagonists, suggesting that they may bind in similar binding sites and/or modes. At the same time, several unique binding features of anibamine were identified, and it will likely prove beneficial in future molecular design of novel CCR5 antagonists based on the anibamine scaffold.
Collapse
Affiliation(s)
- Guo Li
- Department of Medicinal Chemistry, School of Pharmacy, Virginia Commonwealth University, Richmond, Virginia 23298-0540, USA
| | | | | | | |
Collapse
|
45
|
Agüero-Chapin G, Varona-Santos J, de la Riva GA, Antunes A, González-Villa T, Uriarte E, González-Díaz H. Alignment-Free Prediction of Polygalacturonases with Pseudofolding Topological Indices: Experimental Isolation from Coffea arabica and Prediction of a New Sequence. J Proteome Res 2009; 8:2122-8. [DOI: 10.1021/pr800867y] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- Guillermín Agüero-Chapin
- Unit of Bioinformatics & Connectivity Analysis (UBICA), Institute of Industrial Pharmacy, Department of Organic Chemistry, Faculty of Pharmacy, University of Santiago de Compostela, Santiago de Compostela (USC), 15782, Spain, CBQ, Central University of Las Villas, 54830, Santa Clara, Cuba, CIMAR, Centro Interdisciplinar de Investigação Marinha e Ambiental, Universidade do Porto, Rua dos Bragas, 177, 4050-123 Porto, Portugal, Sylvester Comprehensive Cancer Center, University of Miami, Miami, Florida 33136
| | - Javier Varona-Santos
- Unit of Bioinformatics & Connectivity Analysis (UBICA), Institute of Industrial Pharmacy, Department of Organic Chemistry, Faculty of Pharmacy, University of Santiago de Compostela, Santiago de Compostela (USC), 15782, Spain, CBQ, Central University of Las Villas, 54830, Santa Clara, Cuba, CIMAR, Centro Interdisciplinar de Investigação Marinha e Ambiental, Universidade do Porto, Rua dos Bragas, 177, 4050-123 Porto, Portugal, Sylvester Comprehensive Cancer Center, University of Miami, Miami, Florida 33136
| | - Gustavo A. de la Riva
- Unit of Bioinformatics & Connectivity Analysis (UBICA), Institute of Industrial Pharmacy, Department of Organic Chemistry, Faculty of Pharmacy, University of Santiago de Compostela, Santiago de Compostela (USC), 15782, Spain, CBQ, Central University of Las Villas, 54830, Santa Clara, Cuba, CIMAR, Centro Interdisciplinar de Investigação Marinha e Ambiental, Universidade do Porto, Rua dos Bragas, 177, 4050-123 Porto, Portugal, Sylvester Comprehensive Cancer Center, University of Miami, Miami, Florida 33136
| | - Agostinho Antunes
- Unit of Bioinformatics & Connectivity Analysis (UBICA), Institute of Industrial Pharmacy, Department of Organic Chemistry, Faculty of Pharmacy, University of Santiago de Compostela, Santiago de Compostela (USC), 15782, Spain, CBQ, Central University of Las Villas, 54830, Santa Clara, Cuba, CIMAR, Centro Interdisciplinar de Investigação Marinha e Ambiental, Universidade do Porto, Rua dos Bragas, 177, 4050-123 Porto, Portugal, Sylvester Comprehensive Cancer Center, University of Miami, Miami, Florida 33136
| | - Tomás González-Villa
- Unit of Bioinformatics & Connectivity Analysis (UBICA), Institute of Industrial Pharmacy, Department of Organic Chemistry, Faculty of Pharmacy, University of Santiago de Compostela, Santiago de Compostela (USC), 15782, Spain, CBQ, Central University of Las Villas, 54830, Santa Clara, Cuba, CIMAR, Centro Interdisciplinar de Investigação Marinha e Ambiental, Universidade do Porto, Rua dos Bragas, 177, 4050-123 Porto, Portugal, Sylvester Comprehensive Cancer Center, University of Miami, Miami, Florida 33136
| | - Eugenio Uriarte
- Unit of Bioinformatics & Connectivity Analysis (UBICA), Institute of Industrial Pharmacy, Department of Organic Chemistry, Faculty of Pharmacy, University of Santiago de Compostela, Santiago de Compostela (USC), 15782, Spain, CBQ, Central University of Las Villas, 54830, Santa Clara, Cuba, CIMAR, Centro Interdisciplinar de Investigação Marinha e Ambiental, Universidade do Porto, Rua dos Bragas, 177, 4050-123 Porto, Portugal, Sylvester Comprehensive Cancer Center, University of Miami, Miami, Florida 33136
| | - Humberto González-Díaz
- Unit of Bioinformatics & Connectivity Analysis (UBICA), Institute of Industrial Pharmacy, Department of Organic Chemistry, Faculty of Pharmacy, University of Santiago de Compostela, Santiago de Compostela (USC), 15782, Spain, CBQ, Central University of Las Villas, 54830, Santa Clara, Cuba, CIMAR, Centro Interdisciplinar de Investigação Marinha e Ambiental, Universidade do Porto, Rua dos Bragas, 177, 4050-123 Porto, Portugal, Sylvester Comprehensive Cancer Center, University of Miami, Miami, Florida 33136
| |
Collapse
|
46
|
Gaussian process: an alternative approach for QSAM modeling of peptides. Amino Acids 2009; 38:199-212. [DOI: 10.1007/s00726-008-0228-1] [Citation(s) in RCA: 49] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2008] [Accepted: 12/18/2008] [Indexed: 10/21/2022]
|
47
|
Sagar S, Kaur M, Dawe A, Seshadri SV, Christoffels A, Schaefer U, Radovanovic A, Bajic VB. DDESC: Dragon database for exploration of sodium channels in human. BMC Genomics 2008; 9:622. [PMID: 19099596 PMCID: PMC2631582 DOI: 10.1186/1471-2164-9-622] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2008] [Accepted: 12/20/2008] [Indexed: 12/03/2022] Open
Abstract
Background Sodium channels are heteromultimeric, integral membrane proteins that belong to a superfamily of ion channels. The mutations in genes encoding for sodium channel proteins have been linked with several inherited genetic disorders such as febrile epilepsy, Brugada syndrome, ventricular fibrillation, long QT syndrome, or channelopathy associated insensitivity to pain. In spite of these significant effects that sodium channel proteins/genes could have on human health, there is no publicly available resource focused on sodium channels that would support exploration of the sodium channel related information. Results We report here Dragon Database for Exploration of Sodium Channels in Human (DDESC), which provides comprehensive information related to sodium channels regarding different entities, such as "genes and proteins", "metabolites and enzymes", "toxins", "chemicals with pharmacological effects", "disease concepts", "human anatomy", "pathways and pathway reactions" and their potential links. DDESC is compiled based on text- and data-mining. It allows users to explore potential associations between different entities related to sodium channels in human, as well as to automatically generate novel hypotheses. Conclusion DDESC is first publicly available resource where the information related to sodium channels in human can be explored at different levels. This database is freely accessible for academic and non-profit users via the worldwide web .
Collapse
Affiliation(s)
- Sunil Sagar
- South African National Bioinformatics Institute, University of the Western Cape, Private Bag- X17, Modderdam Road, Bellville, Cape Town 7535, South Africa.
| | | | | | | | | | | | | | | |
Collapse
|
48
|
Prediction of subcellular location apoptosis proteins with ensemble classifier and feature selection. Amino Acids 2008; 38:975-83. [PMID: 19048186 DOI: 10.1007/s00726-008-0209-4] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2008] [Accepted: 11/03/2008] [Indexed: 10/21/2022]
Abstract
Apoptosis proteins have a central role in the development and the homeostasis of an organism. These proteins are very important for understanding the mechanism of programmed cell death. The function of an apoptosis protein is closely related to its subcellular location. It is crucial to develop powerful tools to predict apoptosis protein locations for rapidly increasing gap between the number of known structural proteins and the number of known sequences in protein databank. In this study, amino acids pair compositions with different spaces are used to construct feature sets for representing sample of protein feature selection approach based on binary particle swarm optimization, which is applied to extract effective feature. Ensemble classifier is used as prediction engine, of which the basic classifier is the fuzzy K-nearest neighbor. Each basic classifier is trained with different feature sets. Two datasets often used in prior works are selected to validate the performance of proposed approach. The results obtained by jackknife test are quite encouraging, indicating that the proposed method might become a potentially useful tool for subcellular location of apoptosis protein, or at least can play a complimentary role to the existing methods in the relevant areas. The supplement information and software written in Matlab are available by contacting the corresponding author.
Collapse
|
49
|
Xiao X, Lin WZ. Application of protein grey incidence degree measure to predict protein quaternary structural types. Amino Acids 2008; 37:741-9. [DOI: 10.1007/s00726-008-0212-9] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2008] [Accepted: 11/10/2008] [Indexed: 10/21/2022]
|