1
|
Kleandrova VV, Cordeiro MNDS, Speck-Planche A. Optimizing drug discovery using multitasking models for quantitative structure-biological effect relationships: an update of the literature. Expert Opin Drug Discov 2023; 18:1231-1243. [PMID: 37639708 DOI: 10.1080/17460441.2023.2251385] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2023] [Accepted: 08/21/2023] [Indexed: 08/31/2023]
Abstract
INTRODUCTION Drug discovery has provided modern societies with the means to fight against many diseases. In this sense, computational methods have been at the forefront, playing an important role in rationalizing the search for novel drugs. Yet, tackling phenomena such as the multi-genic nature of diseases and drug resistance are limitations of the current computational methods. Multi-tasking models for quantitative structure-biological effect relationships (mtk-QSBER) have emerged to overcome such limitations. AREAS COVERED The present review describes an update on the fundamentals and applications of the mtk-QSBER models as tools to accelerate multiple stages/substages of the drug discovery process. EXPERT OPINION Computational approaches are extremely important for the rationalization of the search for novel and efficacious therapeutic agents. However, they need to focus more on the multi-target drug discovery paradigm. In this sense, mtk-QSBER models are particularly suited for multi-target drug discovery, offering encouraging opportunities across multiple therapeutic areas and scientific disciplines associated with drug discovery.
Collapse
Affiliation(s)
- Valeria V Kleandrova
- Laboratory of Fundamental and Applied Research of Quality and Technology of Food Production, Russian Biotechnological University, Moscow, Russian Federation
| | - M Natália D S Cordeiro
- LAQV@REQUIMTE/Department of Chemistry and Biochemistry, Faculty of Sciences, University of Porto, Porto, Portugal
| | - Alejandro Speck-Planche
- LAQV@REQUIMTE/Department of Chemistry and Biochemistry, Faculty of Sciences, University of Porto, Porto, Portugal
| |
Collapse
|
2
|
PTML Modeling for Pancreatic Cancer Research: In Silico Design of Simultaneous Multi-Protein and Multi-Cell Inhibitors. Biomedicines 2022; 10:biomedicines10020491. [PMID: 35203699 PMCID: PMC8962338 DOI: 10.3390/biomedicines10020491] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2022] [Revised: 02/10/2022] [Accepted: 02/15/2022] [Indexed: 02/07/2023] Open
Abstract
Pancreatic cancer (PANC) is a dangerous type of cancer that is a major cause of mortality worldwide and exhibits a remarkably poor prognosis. To date, discovering anti-PANC agents remains a very complex and expensive process. Computational approaches can accelerate the search for anti-PANC agents. We report for the first time two models that combined perturbation theory with machine learning via a multilayer perceptron network (PTML-MLP) to perform the virtual design and prediction of molecules that can simultaneously inhibit multiple PANC cell lines and PANC-related proteins, such as caspase-1, tumor necrosis factor-alpha (TNF-alpha), and the insulin-like growth factor 1 receptor (IGF1R). Both PTML-MLP models exhibited accuracies higher than 78%. Using the interpretation from one of the PTML-MLP models as a guideline, we extracted different molecular fragments desirable for the inhibition of the PANC cell lines and the aforementioned PANC-related proteins and then assembled some of those fragments to form three new molecules. The two PTML-MLP models predicted the designed molecules as potentially versatile anti-PANC agents through inhibition of the three PANC-related proteins and multiple PANC cell lines. Conclusions: This work opens new horizons for the application of the PTML modeling methodology to anticancer research.
Collapse
|
3
|
Speck-Planche A, Kleandrova VV, Scotti MT. In Silico Drug Repurposing for Anti-Inflammatory Therapy: Virtual Search for Dual Inhibitors of Caspase-1 and TNF-Alpha. Biomolecules 2021; 11:biom11121832. [PMID: 34944476 PMCID: PMC8699067 DOI: 10.3390/biom11121832] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2021] [Revised: 11/15/2021] [Accepted: 12/02/2021] [Indexed: 12/27/2022] Open
Abstract
Inflammation involves a complex biological response of the body tissues to damaging stimuli. When dysregulated, inflammation led by biomolecular mediators such as caspase-1 and tumor necrosis factor-alpha (TNF-alpha) can play a detrimental role in the progression of different medical conditions such as cancer, neurological disorders, autoimmune diseases, and cytokine storms caused by viral infections such as COVID-19. Computational approaches can accelerate the search for dual-target drugs able to simultaneously inhibit the aforementioned proteins, enabling the discovery of wide-spectrum anti-inflammatory agents. This work reports the first multicondition model based on quantitative structure–activity relationships and a multilayer perceptron neural network (mtc-QSAR-MLP) for the virtual screening of agency-regulated chemicals as versatile anti-inflammatory therapeutics. The mtc-QSAR-MLP model displayed accuracy higher than 88%, and was interpreted from a physicochemical and structural point of view. When using the mtc-QSAR-MLP model as a virtual screening tool, we could identify several agency-regulated chemicals as dual inhibitors of caspase-1 and TNF-alpha, and the experimental information later retrieved from the scientific literature converged with our computational results. This study supports the capabilities of our mtc-QSAR-MLP model in anti-inflammatory therapy with direct applications to current health issues such as the COVID-19 pandemic.
Collapse
Affiliation(s)
- Alejandro Speck-Planche
- Postgraduate Program in Natural and Synthetic Bioactive Products, Federal University of Paraíba, João Pessoa 58051-900, Brazil;
- Correspondence:
| | - Valeria V. Kleandrova
- Laboratory of Fundamental and Applied Research of Quality and Technology of Food Production, Moscow State University of Food Production, Volokolamskoe shosse 11, 125080 Moscow, Russia;
| | - Marcus T. Scotti
- Postgraduate Program in Natural and Synthetic Bioactive Products, Federal University of Paraíba, João Pessoa 58051-900, Brazil;
| |
Collapse
|
4
|
Kleandrova VV, Speck-Planche A. The QSAR Paradigm in Fragment-Based Drug Discovery: From the Virtual Generation of Target Inhibitors to Multi-Scale Modeling. Mini Rev Med Chem 2021; 20:1357-1374. [PMID: 32013845 DOI: 10.2174/1389557520666200204123156] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2019] [Revised: 10/21/2019] [Accepted: 10/28/2019] [Indexed: 12/24/2022]
Abstract
Fragment-Based Drug Design (FBDD) has established itself as a promising approach in modern drug discovery, accelerating and improving lead optimization, while playing a crucial role in diminishing the high attrition rates at all stages in the drug development process. On the other hand, FBDD has benefited from the application of computational methodologies, where the models derived from the Quantitative Structure-Activity Relationships (QSAR) have become consolidated tools. This mini-review focuses on the evolution and main applications of the QSAR paradigm in the context of FBDD in the last five years. This report places particular emphasis on the QSAR models derived from fragment-based topological approaches to extract physicochemical and/or structural information, allowing to design potentially novel mono- or multi-target inhibitors from relatively large and heterogeneous databases. Here, we also discuss the need to apply multi-scale modeling, to exemplify how different datasets based on target inhibition can be simultaneously integrated and predicted together with other relevant endpoints such as the biological activity against non-biomolecular targets, as well as in vitro and in vivo toxicity and pharmacokinetic properties. In this context, seminal papers are briefly analyzed. As huge amounts of data continue to accumulate in the domains of the chemical, biological and biomedical sciences, it has become clear that drug discovery must be viewed as a multi-scale optimization process. An ideal multi-scale approach should integrate diverse chemical and biological data and also serve as a knowledge generator, enabling the design of potentially optimal chemicals that may become therapeutic agents.
Collapse
Affiliation(s)
- Valeria V Kleandrova
- Laboratory of Fundamental and Applied Research of Quality and Technology of Food Production, Moscow State University of Food Production, Volokolamskoe Shosse 11, 125080, Moscow, Russian Federation
| | - Alejandro Speck-Planche
- Department of Chemistry, Institute of Pharmacy, I.M. Sechenov First Moscow State Medical University, Trubetskaya Str., 8, b. 2, 119992, Moscow, Russian Federation
| |
Collapse
|
5
|
Kino S, Hsu YT, Shiba K, Chien YS, Mita C, Kawachi I, Daoud A. A scoping review on the use of machine learning in research on social determinants of health: Trends and research prospects. SSM Popul Health 2021; 15:100836. [PMID: 34169138 PMCID: PMC8207228 DOI: 10.1016/j.ssmph.2021.100836] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2021] [Revised: 05/15/2021] [Accepted: 06/01/2021] [Indexed: 02/08/2023] Open
Abstract
Background Machine learning (ML) has spread rapidly from computer science to several disciplines. Given the predictive capacity of ML, it offers new opportunities for health, behavioral, and social scientists. However, it remains unclear how and to what extent ML is being used in studies of social determinants of health (SDH). Methods Using four search engines, we conducted a scoping review of studies that used ML to study SDH (published before May 1, 2020). Two independent reviewers analyzed the relevant studies. For each study, we identified the research questions, Results, data, and algorithms. We synthesized our findings in a narrative report. Results Of the initial 8097 hits, we identified 82 relevant studies. The number of publications has risen during the past decade. More than half of the studies (n = 46) used US data. About 80% (n = 66) utilized surveys, and 70% (n = 57) employed ML for common prediction tasks. Although the number of studies in ML and SDH is growing rapidly, only a few studies used ML to improve causal inference, curate data, or identify social bias in predictions (i.e., algorithmic fairness). Conclusions While ML equips researchers with new ways to measure health outcomes and their determinants from non-conventional sources such as text, audio, and image data, most studies still rely on traditional surveys. Although there are no guarantees that ML will lead to better social epidemiological research, the potential for innovation in SDH research is evident as a result of harnessing the predictive power of ML for causality, data curation, or algorithmic fairness.
Collapse
Affiliation(s)
- Shiho Kino
- Department of Social and Behavioral Sciences, Harvard T.H. Chan School of Public Health, Boston, MA, USA.,Department of Social Epidemiology, Kyoto University, Kyoto, Japan
| | - Yu-Tien Hsu
- Department of Social and Behavioral Sciences, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - Koichiro Shiba
- Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - Yung-Shin Chien
- Department of Social and Behavioral Sciences, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - Carol Mita
- Countway Library of Medicine, Harvard University, Boston, MA, USA
| | - Ichiro Kawachi
- Department of Social and Behavioral Sciences, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - Adel Daoud
- Center for Population and Development Studies, Harvard T.H. Chan School of Public Health, Harvard University, Boston, MA, USA.,Department of Sociology and Work Science, University of Gothenburg, Sweden.,The Division of Data Science and Artificial Intelligence of the Department of Computer Science and Engineering, Chalmers University of Technology, Sweden.,Institute for Analytical Sociology, Linköping University, Sweden
| |
Collapse
|
6
|
Ortega-Tenezaca B, González-Díaz H. IFPTML mapping of nanoparticle antibacterial activity vs. pathogen metabolic networks. NANOSCALE 2021; 13:1318-1330. [PMID: 33410431 DOI: 10.1039/d0nr07588d] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Nanoparticles are useful antimicrobial drug-release systems, but some nanoparticles also exhibit antibacterial activity. However, investigation of their antibacterial activity is a difficult and slow process due to the numerous combinations of nanoparticle size, shape, and composition vs. biological tests, assay organisms, and multiple activity parameters to be measured. Additionally, the overuse of antibiotics has led to the emergence of resistant bacterial strains with different metabolic networks. Computational models may speed up this process, but the models reported to date do not to consider all the previous factors, and the data sources are dispersed and not curated. Thus, herein, we used an information fusion, perturbation-theory machine learning (IFPTML) approach, which is introduced by us for the first time, to fit a model for the discovery of antibacterial nanoparticles. The dataset studied had 15 classes of nanoparticles (1-100 nm) with most cases in the range of 1-50 nm vs. >20 pathogenic bacteria species with different metabolic networks. The nanoparticles studied included metal nanoparticles of Au, Ag, and Cu; oxide nanoparticles of Zn, Cu, La, Al, Fe, Sn, Ti, Cd, and Si; and metal salt nanoparticles of CuI and CdS. We used the SOFT.PTML software (our own application) with a user-friendly interface for the IFPTML calculations and a control statistics package. Using SOFT.PTML, we found a linear logistic regression equation that could model 4 biological activity parameters using only 8 variables with χ2 = 2265.75, p-level <0.05, sensitivity, Sn = 79.4, and specificity, Sp = 99.3, for 3213 cases (nanoparticle-bacteria pairs) in the training series. The model had Sn = 80.8 and Sp = 99.3 for 2114 cases in the external validation series. We also developed a random forest non-linear model with higher values of Sn and Sp = 98-99% in the training/validation series, although it was more complicated to use. SOFT.PTML has been demonstrated to be a useful tool for the analysis of complex data in nanotechnology. We also introduced a new anabolism-catabolism unbalance index of metabolic networks to reveal the biological connotation of the IFPTML predictions for antibacterial nanoparticles. These new models open a new door for the discovery of NPs vs. new bacterial species and strains with different topological structures of their metabolic networks.
Collapse
Affiliation(s)
- Bernabé Ortega-Tenezaca
- RNASA-IMEDIR, Computer Science Faculty, University of A Coruna, 15071 A Coruña, Spain and Amazon State University UEA, Puyo, Pastaza, Ecuador and Department of Organic and Inorganic Chemistry, University of Basque Country UPV/EHU, 48940 Leioa, Spain. and Biomedical Research Institute of A Coruña (INIBIC), University Hospital Complex of A Coruña (CHUAC), 15006 A Coruña, Spain and Center for Investigation on Technologies of Information and Communication (CITIC), University of Coruña (UDC), Campus de Elviña s/n, 15071 A Coruña, Spain
| | - Humberto González-Díaz
- Department of Organic and Inorganic Chemistry, University of Basque Country UPV/EHU, 48940 Leioa, Spain. and Basque Center for Biophysics CSIC-UPVEH, University of Basque Country UPV/EHU, 48940 Leioa, Spain and IKERBASQUE, Basque Foundation for Science, 48011 Bilbao, Biscay, Spain
| |
Collapse
|
7
|
Kleandrova VV, Speck-Planche A. PTML Modeling for Alzheimer’s Disease: Design and Prediction of Virtual Multi-Target Inhibitors of GSK3B, HDAC1, and HDAC6. Curr Top Med Chem 2020; 20:1661-1676. [DOI: 10.2174/1568026620666200607190951] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2019] [Revised: 12/12/2019] [Accepted: 01/05/2020] [Indexed: 01/23/2023]
Abstract
Background:
Alzheimer’s disease is characterized by a progressive pattern of cognitive and
functional impairment, which ultimately leads to death. Computational approaches have played an important
role in the context of drug discovery for anti-Alzheimer's therapies. However, most of the computational
models reported to date have been focused on only one protein associated with Alzheimer's,
while relying on small datasets of structurally related molecules.
Objective:
We introduce the first model combining perturbation theory and machine learning based on
artificial neural networks (PTML-ANN) for simultaneous prediction and design of inhibitors of three
Alzheimer’s disease-related proteins, namely glycogen synthase kinase 3 beta (GSK3B), histone deacetylase
1 (HDAC1), and histone deacetylase 6 (HDAC6).
Methods:
The PTML-ANN model was obtained from a dataset retrieved from ChEMBL, and it relied on
a classification approach to predict chemicals as active or inactive.
Results:
The PTML-ANN model displayed sensitivity and specificity higher than 85% in both training
and test sets. The physicochemical and structural interpretation of the molecular descriptors in the model
permitted the direct extraction of fragments suggested to favorably contribute to enhancing the multitarget
inhibitory activity. Based on this information, we assembled ten molecules from several fragments
with positive contributions. Seven of these molecules were predicted as triple target inhibitors while the
remaining three were predicted as dual-target inhibitors. The estimated physicochemical properties of
the designed molecules complied with Lipinski’s rule of five and its variants.
Conclusion:
This work opens new horizons toward the design of multi-target inhibitors for anti- Alzheimer's
therapies.
Collapse
Affiliation(s)
- Valeria V. Kleandrova
- Laboratory of Fundamental and Applied Research of Quality and Technology of Food Production, Moscow State University of Food Production, Volokolamskoe Shosse 11, 125080, Moscow, Russian Federation
| | - Alejandro Speck-Planche
- Programa Institucional de Fomento a la Investigacion, Desarrollo e Innovacion, Universidad Tecnologica Metropolitana, Ignacio Valdivieso 2409, P.O. Box 8940577, San Joaquin, Santiago, Chile
| |
Collapse
|
8
|
Speck-Planche A. Multi-Scale Modeling in Drug Discovery Against Infectious Diseases. Mini Rev Med Chem 2019; 19:1560-1563. [PMID: 31833463 DOI: 10.2174/138955751919191024110000] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2019] [Revised: 01/13/2019] [Accepted: 01/29/2019] [Indexed: 02/02/2023]
Abstract
This work discusses the idea that drug discovery, instead of being performed through a series of filtering-based stages, should be viewed as a multi-scale optimization problem. Here, the most promising multi-scale models are analyzed in terms of their applications, advantages, and limitations in the search for more potent and safer chemicals against infectious diseases. Multi-scale de novo drug design is highlighted as an emerging paradigm, able to accelerate the discovery of more effective antimicrobial agents.
Collapse
Affiliation(s)
- Alejandro Speck-Planche
- Department of Chemistry, Institute of Pharmacy, I.M. Sechenov First Moscow State Medical University, Trubetskaya str., 8, b. 2, 119992, Moscow, Russian Federation
| |
Collapse
|
9
|
Quevedo-Tumailli VF, Ortega-Tenezaca B, González-Díaz H. Chromosome Gene Orientation Inversion Networks (GOINs) of Plasmodium Proteome. J Proteome Res 2018; 17:1258-1268. [DOI: 10.1021/acs.jproteome.7b00861] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Viviana F. Quevedo-Tumailli
- RNASA-IMEDIR,
Computer Science Faculty, University of A Coruña, 15071 A Coruña, Spain
- Universidad Estatal Amazónica UEA, Puyo, Pastaza, Ecuador
| | - Bernabé Ortega-Tenezaca
- RNASA-IMEDIR,
Computer Science Faculty, University of A Coruña, 15071 A Coruña, Spain
- Universidad Estatal Amazónica UEA, Puyo, Pastaza, Ecuador
- Universidad Regional Autónoma de los Andes UNIANDES-Puyo, Puyo, Pastaza,Ecuador
| | - Humbert González-Díaz
- Dept.
of Organic Chemistry II, University of the Basque Country UPV/EHU, 48940 Leioa, Biscay, Spain
- IKERBASQUE, Basque Foundation for Science, 48011 Bilbao, Biscay, Spain
| |
Collapse
|
10
|
Chun S, Muthu M, Gopal J, Paul D, Kim DH, Gansukh E, Anthonydhason V. The unequivocal preponderance of biocomputation in clinical virology. RSC Adv 2018; 8:17334-17345. [PMID: 35539262 PMCID: PMC9080393 DOI: 10.1039/c8ra00888d] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2018] [Accepted: 03/14/2018] [Indexed: 11/22/2022] Open
Abstract
Bioinformatics and computer based data simulation and modeling are captivating biological research, delivering great results already and promising to deliver more. As biological research is a complex, intricate, diverse field, any available support is gladly taken. With recent outbreaks and epidemics, pathogens are a constant threat to the global economy and security. Virus related plagues are somehow the most difficult to handle. Biocomputation has provided appreciable help in resolving clinical virology related issues. This review, for the first time, surveys the current status of the role of computation in virus related research. Advances made in the fields of clinical virology, antiviral drug design, viral immunology and viral oncology, through input from biocomputation, have been discussed. The amount of progress made and the software platforms available are consolidated in this review. The limitations of computation based methods are presented. Finally, the challenges facing the future of biocomputation in clinical virology are speculated upon. Biocomputation in clinical virology.![]()
Collapse
Affiliation(s)
- Sechul Chun
- Department of Environmental Health Science
- Konkuk University
- Seoul 143-701
- Korea
| | - Manikandan Muthu
- Department of Environmental Health Science
- Konkuk University
- Seoul 143-701
- Korea
| | - Judy Gopal
- Department of Environmental Health Science
- Konkuk University
- Seoul 143-701
- Korea
| | - Diby Paul
- Environmental Microbiology
- Department of Environmental Engineering
- Konkuk University
- Seoul 143-701
- Korea
| | - Doo Hwan Kim
- Department of Environmental Health Science
- Konkuk University
- Seoul 143-701
- Korea
| | - Enkhtaivan Gansukh
- Department of Environmental Health Science
- Konkuk University
- Seoul 143-701
- Korea
| | - Vimala Anthonydhason
- Department of Biotechnology
- Indian Institute of Technology-Madras
- Chennai 600036
- India
| |
Collapse
|
11
|
Speck-Planche A, Dias Soeiro Cordeiro MN. Speeding up Early Drug Discovery in Antiviral Research: A Fragment-Based in Silico Approach for the Design of Virtual Anti-Hepatitis C Leads. ACS COMBINATORIAL SCIENCE 2017; 19:501-512. [PMID: 28437091 DOI: 10.1021/acscombsci.7b00039] [Citation(s) in RCA: 49] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/24/2023]
Abstract
Hepatitis C constitutes an unresolved global health problem. This infectious disease is caused by the hepatotropic hepatitis C virus (HCV), and it can lead to the occurrence of life-threatening medical conditions, such as cirrhosis and liver cancer. Nowadays, major clinical concerns have arisen because of the appearance of multidrug resistance (MDR) and the side effects especially associated with long-term treatments. In this work, we report the first multitasking model for quantitative structure-biological effect relationships (mtk-QSBER), focused on the simultaneous exploration of anti-HCV activity and in vitro safety profiles related to the absorption, distribution, metabolism, elimination, and toxicity (ADMET). The mtk-QSBER model was created from a data set formed by 40 158 cases, displaying accuracy higher than 95% in both training and prediction (test) sets. Several molecular fragments were selected, and their quantitative contributions to anti-HCV activity and ADMET profiles were calculated. By combining the analysis of the fragments with positive contributions and the physicochemical meanings of the different molecular descriptors in the mtk-QSBER, six new molecules were designed. These new molecules were predicted to exhibit potent anti-HCV activity and desirable in vitro ADMET properties. In addition, the designed molecules have good druglikeness according to the Lipinski's rule of five and its variants.
Collapse
Affiliation(s)
- Alejandro Speck-Planche
- LAQV@REQUIMTE/Department
of Chemistry and Biochemistry, University of Porto, 4169-007 Porto, Portugal
| | | |
Collapse
|
12
|
Identification of DEP domain-containing proteins by a machine learning method and experimental analysis of their expression in human HCC tissues. Sci Rep 2016; 6:39655. [PMID: 28000796 PMCID: PMC5175133 DOI: 10.1038/srep39655] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2016] [Accepted: 11/24/2016] [Indexed: 12/23/2022] Open
Abstract
The Dishevelled/EGL-10/Pleckstrin (DEP) domain-containing (DEPDC) proteins have seven members. However, whether this superfamily can be distinguished from other proteins based only on the amino acid sequences, remains unknown. Here, we describe a computational method to segregate DEPDCs and non-DEPDCs. First, we examined the Pfam numbers of the known DEPDCs and used the longest sequences for each Pfam to construct a phylogenetic tree. Subsequently, we extracted 188-dimensional (188D) and 20D features of DEPDCs and non-DEPDCs and classified them with random forest classifier. We also mined the motifs of human DEPDCs to find the related domains. Finally, we designed experimental verification methods of human DEPDC expression at the mRNA level in hepatocellular carcinoma (HCC) and adjacent normal tissues. The phylogenetic analysis showed that the DEPDCs superfamily can be divided into three clusters. Moreover, the 188D and 20D features can both be used to effectively distinguish the two protein types. Motif analysis revealed that the DEP and RhoGAP domain was common in human DEPDCs, human HCC and the adjacent tissues that widely expressed DEPDCs. However, their regulation was not identical. In conclusion, we successfully constructed a binary classifier for DEPDCs and experimentally verified their expression in human HCC tissues.
Collapse
|
13
|
Drug-symptom networking: Linking drug-likeness screening to drug discovery. Pharmacol Res 2015; 103:105-13. [PMID: 26615785 DOI: 10.1016/j.phrs.2015.11.015] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/24/2015] [Revised: 10/26/2015] [Accepted: 11/11/2015] [Indexed: 01/19/2023]
Abstract
Understanding the relationships between drugs and symptoms has broad medical consequences, yet a comprehensive description of the drug-symptom associations is currently lacking. Here, 1441 FDA-approved drugs were collected, and PCA was used to extract 122 descriptors which explained 91% of the variance. Then, a k-means++ method was employed to partition the drug dataset into 3 clusters, and 3 corresponding SVDD models (drug-likeness screening models) were constructed with an overall accuracy of up to 95.6%. Furthermore, 6878 herbal molecules from the TcmSP™ database were screened by the above 3 SVDD model to obtain 5309 candidate drug molecules with highly accept classification of 77.19%. To assess the accuracy of the SVDD models, 8559 herbal molecule-symptom co-occurrences were mined from Pubmed abstracts, involving 697 herbal molecules and 314 symptoms. Most of the 697 herbal molecules could be found in the accepted SVDD data (5309 molecules), showing the potential of the SVDD for the screening of drug candidates. Moreover, a herbal molecule-herbal molecule network and a herbal molecule-symptom were constructed. Overall, the results provided a new drug-likeness screening approach independent to abnormal training data, and the comprehensive collection of herbal molecule-symptom associations formed a new data resource for systematic characterization of the symptom-oriented medicines.
Collapse
|