1
|
Insight into potent TLR2 inhibitors for the treatment of disease caused by Mycoplasma pneumoniae based on machine learning approaches. Mol Divers 2023; 27:371-387. [PMID: 35488091 DOI: 10.1007/s11030-022-10433-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2022] [Accepted: 04/01/2022] [Indexed: 02/08/2023]
Abstract
Mycoplasma pneumoniae (MP) is one of the most common pathogens that causes acute respiratory tract infections. Children experiencing MP infection often suffer severe complications, lung injury, and even death. Previous studies have demonstrated that Toll-like receptor 2 (TLR2) is a potential therapeutic target for treating the MP-induced inflammatory response. However, the screening of natural compounds has received more attention for the treatment of bacterial infections to reduce the likelihood of bacterial resistance. Herein, we screened compounds by combining molecular docking and machine learning approaches to find potential lead compounds for treating MP infection. First, all compounds were docked with the TLR2 receptor protein to screen for potential candidates. To predict drug bioactivity, a machine learning model (random forest) was trained for TLR2 inhibitors to obtain the predictive model. The model achieved significant squared correlation coefficient (R2) values for the training set (0.85) and validation set (0.84) of compounds. The developed machine learning model was then used to predict the pIC50 values of the top 50 candidates from the Traditional Chinese compounds and Discovery Diversity sets of compounds. As a result, these compounds are capable of inhibiting the inflammatory response induced by MP. However, prior to bringing these compounds to market, it is necessary to verify these results with additional biological testing, including preclinical and clinical studies. Moreover, the present study provides a theoretical basis for the use of natural compounds as potential candidates to treat pneumonia caused by MP.
Collapse
|
2
|
Zhu Z, Rahman Z, Aamir M, Shah SZA, Hamid S, Bilawal A, Li S, Ishfaq M. Insight into TLR4 receptor inhibitory activity via QSAR for the treatment of Mycoplasma pneumonia disease. RSC Adv 2023; 13:2057-2069. [PMID: 36712602 PMCID: PMC9833105 DOI: 10.1039/d2ra06178c] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2022] [Accepted: 12/14/2022] [Indexed: 01/13/2023] Open
Abstract
Mycoplasma pneumoniae (MP) is one of the most common pathogenic organisms causing upper and lower respiratory tract infections, lung injury, and even death in young children. Toll-like receptors (TLRs) play an important role in innate immunity by allowing the host to recognize pathogens invading the body. Previous studies demonstrated that TLR4 is a potential therapeutic target for the treatment of MP pneumonia. Therefore, the present study aimed to screen biologically active ingredients that target the TLR4 receptor pathway. We first used molecular docking to screen out the active compounds inhibiting the TLR4 pathway, and then used regression and classification machine learning algorithms to establish a quantitative structure-activity relationship (QSAR) model to predict the biological activity of the screened compounds. A total of 78 molecules were used in QSAR modelling, which were retrieved from the ChEMBL database. The QSAR models had acceptable correlation coefficients of R 2 on the training and testing dataset in the range of 0.96 to 0.91 and 0.93 to 0.76, respectively. The multiclass classification models showed accuracy on training and testing data within ranges of 1.0 to 0.70, 0.96 to 0.63, and log loss ranges from 0.27 to 8.63, respectively. In addition, molecular descriptors and fingerprints have been studied as structural elements involved in increased and decreased inhibitory activities. These results provide a quantitative analysis of QSAR and classification models applicable for high-throughput screening, as well as insights into the mechanisms of inhibition of TLR4 antagonists.
Collapse
Affiliation(s)
- Zemin Zhu
- College of Computer Science, Huanggang Normal UniversityHuanggang 438000China+86 15972855212
| | - Ziaur Rahman
- College of Computer Science, Huanggang Normal UniversityHuanggang 438000China+86 15972855212
| | - Muhammad Aamir
- College of Computer Science, Huanggang Normal UniversityHuanggang 438000China+86 15972855212
| | - Syed Zahid Ali Shah
- Department of Pathology, Faculty of Veterinary and Animal Sciences, The Islamia University of Bahawalpur-PakistanPakistan
| | - Sattar Hamid
- The University of Agriculture PeshawarKhyber Pakhtunkhwa25130Pakistan
| | - Akhunzada Bilawal
- College of Food Science, Northeast Agricultural UniversityHarbinChina
| | - Sihong Li
- Key Laboratory of Applied Technology on Green-Eco-Healthy Animal Husbandry of Zhejiang Province, College of Animal Science and Technology, College of Veterinary Medicine, Zhejiang A&F UniversityHangzhou 311300China
| | - Muhammad Ishfaq
- College of Computer Science, Huanggang Normal UniversityHuanggang 438000China+86 15972855212
| |
Collapse
|
3
|
Khatun MS, Shoombuatong W, Hasan MM, Kurata H. Evolution of Sequence-based Bioinformatics Tools for Protein-protein Interaction Prediction. Curr Genomics 2020; 21:454-463. [PMID: 33093807 PMCID: PMC7536797 DOI: 10.2174/1389202921999200625103936] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2020] [Revised: 03/19/2020] [Accepted: 05/27/2020] [Indexed: 12/22/2022] Open
Abstract
Protein-protein interactions (PPIs) are the physical connections between two or more proteins via electrostatic forces or hydrophobic effects. Identification of the PPIs is pivotal, which contributes to many biological processes including protein function, disease incidence, and therapy design. The experimental identification of PPIs via high-throughput technology is time-consuming and expensive. Bioinformatics approaches are expected to solve such restrictions. In this review, our main goal is to provide an inclusive view of the existing sequence-based computational prediction of PPIs. Initially, we briefly introduce the currently available PPI databases and then review the state-of-the-art bioinformatics approaches, working principles, and their performances. Finally, we discuss the caveats and future perspective of the next generation algorithms for the prediction of PPIs.
Collapse
Affiliation(s)
| | | | - Md. Mehedi Hasan
- Address correspondence to these authors at the Department of Bioscience and Bioinformatics, Kyushu Institute of Technology, 680-4 Kawazu, Iizuka, Fukuoka 820-8502, Japan; Japan Society for the Promotion of Science, 5-3-1 Kojimachi, Chiyoda-ku, Tokyo 102-0083, Japan; Tel: +81-948-297-828; E-mail: and Department of Bioscience and Bioinformatics, Kyushu Institute of Technology, 680-4 Kawazu, Iizuka, Fukuoka 820-8502, Japan; Biomedical Informatics R&D Center, Kyushu Institute of Technology, 680-4 Kawazu, Iizuka, Fukuoka 820-8502, Japan; Tel: +81-948-297-828; E-mail:
| | - Hiroyuki Kurata
- Address correspondence to these authors at the Department of Bioscience and Bioinformatics, Kyushu Institute of Technology, 680-4 Kawazu, Iizuka, Fukuoka 820-8502, Japan; Japan Society for the Promotion of Science, 5-3-1 Kojimachi, Chiyoda-ku, Tokyo 102-0083, Japan; Tel: +81-948-297-828; E-mail: and Department of Bioscience and Bioinformatics, Kyushu Institute of Technology, 680-4 Kawazu, Iizuka, Fukuoka 820-8502, Japan; Biomedical Informatics R&D Center, Kyushu Institute of Technology, 680-4 Kawazu, Iizuka, Fukuoka 820-8502, Japan; Tel: +81-948-297-828; E-mail:
| |
Collapse
|
4
|
Charoenkwan P, Kanthawong S, Schaduangrat N, Yana J, Shoombuatong W. PVPred-SCM: Improved Prediction and Analysis of Phage Virion Proteins Using a Scoring Card Method. Cells 2020; 9:E353. [PMID: 32028709 PMCID: PMC7072630 DOI: 10.3390/cells9020353] [Citation(s) in RCA: 39] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2019] [Revised: 01/20/2020] [Accepted: 01/27/2020] [Indexed: 12/16/2022] Open
Abstract
Although, existing methods have been successful in predicting phage (or bacteriophage) virion proteins (PVPs) using various types of protein features and complex classifiers, such as support vector machine and naïve Bayes, these two methods do not allow interpretability. However, the characterization and analysis of PVPs might be of great significance to understanding the molecular mechanisms of bacteriophage genetics and the development of antibacterial drugs. Hence, we herein proposed a novel method (PVPred-SCM) based on the scoring card method (SCM) in conjunction with dipeptide composition to identify and characterize PVPs. In PVPred-SCM, the propensity scores of 400 dipeptides were calculated using the statistical discrimination approach. Rigorous independent validation test showed that PVPred-SCM utilizing only dipeptide composition yielded an accuracy of 77.56%, indicating that PVPred-SCM performed well relative to the state-of-the-art method utilizing a number of protein features. Furthermore, the propensity scores of dipeptides were used to provide insights into the biochemical and biophysical properties of PVPs. Upon comparison, it was found that PVPred-SCM was superior to the existing methods considering its simplicity, interpretability, and implementation. Finally, in an effort to facilitate high-throughput prediction of PVPs, we provided a user-friendly web-server for identifying the likelihood of whether or not these sequences are PVPs. It is anticipated that PVPred-SCM will become a useful tool or at least a complementary existing method for predicting and analyzing PVPs.
Collapse
Affiliation(s)
- Phasit Charoenkwan
- Modern Management and Information Technology, College of Arts, Media and Technology, Chiang Mai University, Chiang Mai 50200, Thailand;
| | - Sakawrat Kanthawong
- Department of Microbiology, Faculty of Medicine, Khon Kaen University, Khon Kaen 40002, Thailand;
| | - Nalini Schaduangrat
- Center of Data Mining and Biomedical Informatics, Faculty of Medical Technology, Mahidol University, Bangkok 10700, Thailand;
| | - Janchai Yana
- Department of Chemistry, Faculty of Science and Technology, Chiang Mai Rajabhat University, Chiang Mai 50300, Thailand;
| | - Watshara Shoombuatong
- Center of Data Mining and Biomedical Informatics, Faculty of Medical Technology, Mahidol University, Bangkok 10700, Thailand;
| |
Collapse
|
5
|
iQSP: A Sequence-Based Tool for the Prediction and Analysis of Quorum Sensing Peptides via Chou's 5-Steps Rule and Informative Physicochemical Properties. Int J Mol Sci 2019; 21:ijms21010075. [PMID: 31861928 PMCID: PMC6981611 DOI: 10.3390/ijms21010075] [Citation(s) in RCA: 46] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2019] [Revised: 12/13/2019] [Accepted: 12/18/2019] [Indexed: 01/18/2023] Open
Abstract
Understanding of quorum-sensing peptides (QSPs) in their functional mechanism plays an essential role in finding new opportunities to combat bacterial infections by designing drugs. With the avalanche of the newly available peptide sequences in the post-genomic age, it is highly desirable to develop a computational model for efficient, rapid and high-throughput QSP identification purely based on the peptide sequence information alone. Although, few methods have been developed for predicting QSPs, their prediction accuracy and interpretability still requires further improvements. Thus, in this work, we proposed an accurate sequence-based predictor (called iQSP) and a set of interpretable rules (called IR-QSP) for predicting and analyzing QSPs. In iQSP, we utilized a powerful support vector machine (SVM) cooperating with 18 informative features from physicochemical properties (PCPs). Rigorous independent validation test showed that iQSP achieved maximum accuracy and MCC of 93.00% and 0.86, respectively. Furthermore, a set of interpretable rules IR-QSP was extracted by using random forest model and the 18 informative PCPs. Finally, for the convenience of experimental scientists, the iQSP web server was established and made freely available online. It is anticipated that iQSP will become a useful tool or at least as a complementary existing method for predicting and analyzing QSPs.
Collapse
|
6
|
Schaduangrat N, Nantasenamat C, Prachayasittikul V, Shoombuatong W. Meta-iAVP: A Sequence-Based Meta-Predictor for Improving the Prediction of Antiviral Peptides Using Effective Feature Representation. Int J Mol Sci 2019; 20:ijms20225743. [PMID: 31731751 PMCID: PMC6888698 DOI: 10.3390/ijms20225743] [Citation(s) in RCA: 74] [Impact Index Per Article: 14.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2019] [Revised: 11/07/2019] [Accepted: 11/13/2019] [Indexed: 12/31/2022] Open
Abstract
In spite of the large-scale production and widespread distribution of vaccines and antiviral drugs, viruses remain a prominent human disease. Recently, the discovery of antiviral peptides (AVPs) has become an influential antiviral agent due to their extraordinary advantages. With the avalanche of newly-found peptide sequences in the post-genomic era, there is a great demand to develop a sequence-based predictor for timely identifying AVPs as this information is very useful for both basic research and drug development. In this study, we propose a novel sequence-based meta-predictor with an effective feature representation, called Meta-iAVP, for the accurate prediction of AVPs from given peptide sequences. Herein, the effective feature representation was extracted from a set of prediction scores derived from various machine learning algorithms and types of features. To the best of our knowledge, the model proposed herein represents the first meta-based approach for the prediction of AVPs. An overall accuracy and Matthews correlation coefficient of 95.20% and 0.90, respectively, was achieved from the independent test set on an objective benchmark dataset. Comparative analysis suggested that Meta-iAVP was superior to that of existing methods and therefore represents a useful tool for AVP prediction. Finally, in an effort to facilitate high-throughput prediction of AVPs, the model was deployed as the Meta-iAVP web server and is made freely available online at http://codes.bio/meta-iavp/ where users can submit query peptide sequences for determining the likelihood of whether or not these peptides are AVPs.
Collapse
Affiliation(s)
- Nalini Schaduangrat
- Center of Data Mining and Biomedical Informatics, Faculty of Medical Technology, Mahidol University, Bangkok 10700, Thailand; (N.S.); (C.N.)
| | - Chanin Nantasenamat
- Center of Data Mining and Biomedical Informatics, Faculty of Medical Technology, Mahidol University, Bangkok 10700, Thailand; (N.S.); (C.N.)
| | - Virapong Prachayasittikul
- Department of Clinical Microbiology and Applied Technology, Faculty of Medical Technology, Mahidol University, Bangkok 10700, Thailand;
| | - Watshara Shoombuatong
- Center of Data Mining and Biomedical Informatics, Faculty of Medical Technology, Mahidol University, Bangkok 10700, Thailand; (N.S.); (C.N.)
- Correspondence: ; Tel.: +66-2441-4371 (ext. 2715)
| |
Collapse
|
7
|
Laengsri V, Shoombuatong W, Adirojananon W, Nantasenamat C, Prachayasittikul V, Nuchnoi P. ThalPred: a web-based prediction tool for discriminating thalassemia trait and iron deficiency anemia. BMC Med Inform Decis Mak 2019; 19:212. [PMID: 31699079 PMCID: PMC6836478 DOI: 10.1186/s12911-019-0929-2] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2017] [Accepted: 10/14/2019] [Indexed: 01/07/2023] Open
Abstract
Background The hypochromic microcytic anemia (HMA) commonly found in Thailand are iron deficiency anemia (IDA) and thalassemia trait (TT). Accurate discrimination between IDA and TT is an important issue and better methods are urgently needed. Although considerable RBC formulas and indices with various optimal cut-off values have been developed, distinguishing between IDA and TT is still a challenging problem due to the diversity of various anemic populations. To address this problem, it is desirable to develop an improved and automated prediction model for discriminating IDA from TT. Methods We retrospectively collected laboratory data of HMA found in Thai adults. Five machine learnings, including k-nearest neighbor (k-NN), decision tree, random forest (RF), artificial neural network (ANN) and support vector machine (SVM), were applied to construct a discriminant model. Performance was assessed and compared with thirteen existing discriminant formulas and indices. Results The data of 186 patients (146 patients with TT and 40 with IDA) were enrolled. The interpretable rules derived from the RF model were proposed to demonstrate the combination of RBC indices for discriminating IDA from TT. A web-based tool ‘ThalPred’ was implemented using an SVM model based on seven RBC parameters. ThalPred achieved prediction results with an external accuracy, MCC and AUC of 95.59, 0.87 and 0.98, respectively. Conclusion ThalPred and an interpretable rule were provided for distinguishing IDA from TT. For the convenience of health care team experimental scientists, a web-based tool has been established at http://codes.bio/thalpred/ by which users can easily get their desired screening test result without the need to go through the underlying mathematical and computational details.
Collapse
Affiliation(s)
- V Laengsri
- Center for Research and Innovation, Faculty of Medical Technology, Mahidol University, Bangkok, Thailand.,Department of Clinical Microscopy, Faculty of Medical Technology, Mahidol University, Bangkok, Thailand
| | - W Shoombuatong
- Center of Data Mining and Medical Informatics, Faculty of Medical Technology, Mahidol University, Bangkok, Thailand
| | - W Adirojananon
- Department of Clinical Microscopy, Faculty of Medical Technology, Mahidol University, Bangkok, Thailand
| | - C Nantasenamat
- Center of Data Mining and Medical Informatics, Faculty of Medical Technology, Mahidol University, Bangkok, Thailand
| | - V Prachayasittikul
- Department of Clinical Microbiology and Applied Technology, Faculty of Medical Technology, Mahidol University, Bangkok, Thailand
| | - P Nuchnoi
- Center for Research and Innovation, Faculty of Medical Technology, Mahidol University, Bangkok, Thailand. .,Department of Clinical Microscopy, Faculty of Medical Technology, Mahidol University, Bangkok, Thailand.
| |
Collapse
|
8
|
Pratiwi R, Prachayasittikul V, Prachayasittikul S, Nantasenamat C. Rational design of novel sirtuin 1 activators via structure-activity insights from application of QSAR modeling. EXCLI JOURNAL 2019; 18:207-222. [PMID: 31217784 PMCID: PMC6558509 DOI: 10.17179/excli2019-1274] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/12/2019] [Accepted: 03/20/2019] [Indexed: 12/13/2022]
Abstract
Sirtuin 1 (SIRT1) enzyme regulates major cell activities, and its activation offers lucrative therapeutic potentials for aging diseases including Alzheimer's disease (AD). Regarding the global aging society, continual attention has been given to various chemical scaffolds as a source for the discovery of novel SIRT1 activators since the discovery of the pioneer activator, resveratrol. Understanding structure-activity relationship (SAR) is essential for screening, designing as well as improving the properties of drugs. In this study, an in silico approach based on quantitative structure-activity relationship (QSAR) modeling, was employed for understanding the SAR of currently available SIRT1 fused-aromatic activators (i.e., imidazothiazole, oxazolopyridine, and azabenzimidazole analogs). Three QSAR models constructed using multiple linear regression (MLR) provided good predictive performance (R 2 LOOCV = 0.729 - 0.863 and RMSE LOOCV = 0.165 - 0.325). An additional novel set of 181 structurally modified compounds were rationally designed according to key descriptors deduced from the QSAR findings and their SIRT1 activities were predicted using the constructed models. In overview, the study provides insightful SAR findings of currently available SIRT1 activators that would be useful for guiding the rational design, screening, and development of further potent SIRT1 activators for managing age-related clinical conditions. A series of promising compounds as well as important scaffolds and molecular properties for potent SIRT1 activator were highlighted. This study demonstrated the efficacious role of QSAR-driven structural modification for the rational design of novel leads.
Collapse
Affiliation(s)
- Reny Pratiwi
- Center of Data Mining and Biomedical Informatics, Faculty of Medical Technology, Mahidol University, Bangkok 10700, Thailand.,Department of Medical Laboratory Technology, Faculty of Health Sciences, Setia Budi University, Surakarta 57127, Indonesia
| | - Veda Prachayasittikul
- Center of Data Mining and Biomedical Informatics, Faculty of Medical Technology, Mahidol University, Bangkok 10700, Thailand
| | - Supaluk Prachayasittikul
- Center of Data Mining and Biomedical Informatics, Faculty of Medical Technology, Mahidol University, Bangkok 10700, Thailand
| | - Chanin Nantasenamat
- Center of Data Mining and Biomedical Informatics, Faculty of Medical Technology, Mahidol University, Bangkok 10700, Thailand
| |
Collapse
|
9
|
Schaduangrat N, Nantasenamat C, Prachayasittikul V, Shoombuatong W. ACPred: A Computational Tool for the Prediction and Analysis of Anticancer Peptides. Molecules 2019; 24:E1973. [PMID: 31121946 PMCID: PMC6571645 DOI: 10.3390/molecules24101973] [Citation(s) in RCA: 120] [Impact Index Per Article: 24.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2019] [Revised: 05/07/2019] [Accepted: 05/17/2019] [Indexed: 01/01/2023] Open
Abstract
Anticancer peptides (ACPs) have emerged as a new class of therapeutic agent for cancer treatment due to their lower toxicity as well as greater efficacy, selectivity and specificity when compared to conventional small molecule drugs. However, the experimental identification of ACPs still remains a time-consuming and expensive endeavor. Therefore, it is desirable to develop and improve upon existing computational models for predicting and characterizing ACPs. In this study, we present a bioinformatics tool called the ACPred, which is an interpretable tool for the prediction and characterization of the anticancer activities of peptides. ACPred was developed by utilizing powerful machine learning models (support vector machine and random forest) and various classes of peptide features. It was observed by a jackknife cross-validation test that ACPred can achieve an overall accuracy of 95.61% in identifying ACPs. In addition, analysis revealed the following distinguishing characteristics that ACPs possess: (i) hydrophobic residue enhances the cationic properties of α-helical ACPs resulting in better cell penetration; (ii) the amphipathic nature of the α-helical structure plays a crucial role in its mechanism of cytotoxicity; and (iii) the formation of disulfide bridges on β-sheets is vital for structural maintenance which correlates with its ability to kill cancer cells. Finally, for the convenience of experimental scientists, the ACPred web server was established and made freely available online.
Collapse
Affiliation(s)
- Nalini Schaduangrat
- Center of Data Mining and Biomedical Informatics, Faculty of Medical Technology, Mahidol University, Bangkok 10700, Thailand.
| | - Chanin Nantasenamat
- Center of Data Mining and Biomedical Informatics, Faculty of Medical Technology, Mahidol University, Bangkok 10700, Thailand.
| | - Virapong Prachayasittikul
- Department of Clinical Microbiology and Applied Technology, Faculty of Medical Technology, Mahidol University, Bangkok 10700, Thailand.
| | - Watshara Shoombuatong
- Center of Data Mining and Biomedical Informatics, Faculty of Medical Technology, Mahidol University, Bangkok 10700, Thailand.
| |
Collapse
|
10
|
Shoombuatong W, Schaduangrat N, Nantasenamat C. Unraveling the bioactivity of anticancer peptides as deduced from machine learning. EXCLI JOURNAL 2018; 17:734-752. [PMID: 30190664 PMCID: PMC6123611 DOI: 10.17179/excli2018-1447] [Citation(s) in RCA: 48] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/20/2018] [Accepted: 07/10/2018] [Indexed: 12/13/2022]
Abstract
Cancer imposes a global health burden as it represents one of the leading causes of morbidity and mortality while also giving rise to significant economic burden owing to the associated expenditures for its monitoring and treatment. In spite of advancements in cancer therapy, the low success rate and recurrence of tumor has necessitated the ongoing search for new therapeutic agents. Aside from drugs based on small molecules and protein-based biopharmaceuticals, there has been an intense effort geared towards the development of peptide-based therapeutics owing to its favorable and intrinsic properties of being relatively small, highly selective, potent, safe and low in production costs. In spite of these advantages, there are several inherent weaknesses that are in need of attention in the design and development of therapeutic peptides. An abundance of data on bioactive and therapeutic peptides have been accumulated over the years and the burgeoning area of artificial intelligence has set the stage for the lucrative utilization of machine learning to make sense of these large and high-dimensional data. This review summarizes the current state-of-the-art on the application of machine learning for studying the bioactivity of anticancer peptides along with future outlook of the field. Data and R codes used in the analysis herein are available on GitHub at https://github.com/Shoombuatong2527/anticancer-peptides-review.
Collapse
Affiliation(s)
- Watshara Shoombuatong
- Center of Data Mining and Biomedical Informatics, Faculty of Medical Technology, Mahidol University, Bangkok 10700, Thailand
| | - Nalini Schaduangrat
- Center of Data Mining and Biomedical Informatics, Faculty of Medical Technology, Mahidol University, Bangkok 10700, Thailand
| | - Chanin Nantasenamat
- Center of Data Mining and Biomedical Informatics, Faculty of Medical Technology, Mahidol University, Bangkok 10700, Thailand
| |
Collapse
|
11
|
Thippakorn C, Schaduangrat N, Nantasenamat C. Proteomic and bioinformatic discovery of biomarkers for diabetic nephropathy. EXCLI JOURNAL 2018; 17:312-330. [PMID: 29805343 PMCID: PMC5962897 DOI: 10.17179/excli2018-1150] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/22/2018] [Accepted: 03/12/2018] [Indexed: 11/13/2022]
Abstract
Diabetes is associated with numerous metabolic and vascular risk factors that contribute to a high rate of micro-vascular and macro-vascular disorders leading to mortality and morbidity from diabetic complications. In this case, the major cause of death in overall diabetic patients results from diabetic nephropathy (DN) or renal failure. The risk factors and mechanisms that correspond to the development of DN are not fully understood and so far, no specific and sufficient diagnostic biomarkers are currently available other than micro- or macroalbuminuria. Therefore, this review describes current and novel protein biomarkers in the context of DN as well as probable proteins biomarkers associated with pathological processes for the early stage of DN via proteomics data together with bioinformatics. In addition, the mechanisms involved in early development of diabetic vascular disorders and complications resulting from glucose induced oxidative stress will also be explored.
Collapse
Affiliation(s)
- Chadinee Thippakorn
- Center for Research and Innovation, Faculty of Medical Technology, Mahidol University, Bangkok 10700, Thailand
| | - Nalini Schaduangrat
- Center of Data Mining and Biomedical Informatics, Faculty of Medical Technology, Mahidol University, Bangkok 10700, Thailand
| | - Chanin Nantasenamat
- Center of Data Mining and Biomedical Informatics, Faculty of Medical Technology, Mahidol University, Bangkok 10700, Thailand
| |
Collapse
|
12
|
Kavakiotis I, Tsave O, Salifoglou A, Maglaveras N, Vlahavas I, Chouvarda I. Machine Learning and Data Mining Methods in Diabetes Research. Comput Struct Biotechnol J 2017; 15:104-116. [PMID: 28138367 PMCID: PMC5257026 DOI: 10.1016/j.csbj.2016.12.005] [Citation(s) in RCA: 332] [Impact Index Per Article: 47.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2016] [Revised: 12/20/2016] [Accepted: 12/27/2016] [Indexed: 12/14/2022] Open
Abstract
The remarkable advances in biotechnology and health sciences have led to a significant production of data, such as high throughput genetic data and clinical information, generated from large Electronic Health Records (EHRs). To this end, application of machine learning and data mining methods in biosciences is presently, more than ever before, vital and indispensable in efforts to transform intelligently all available information into valuable knowledge. Diabetes mellitus (DM) is defined as a group of metabolic disorders exerting significant pressure on human health worldwide. Extensive research in all aspects of diabetes (diagnosis, etiopathophysiology, therapy, etc.) has led to the generation of huge amounts of data. The aim of the present study is to conduct a systematic review of the applications of machine learning, data mining techniques and tools in the field of diabetes research with respect to a) Prediction and Diagnosis, b) Diabetic Complications, c) Genetic Background and Environment, and e) Health Care and Management with the first category appearing to be the most popular. A wide range of machine learning algorithms were employed. In general, 85% of those used were characterized by supervised learning approaches and 15% by unsupervised ones, and more specifically, association rules. Support vector machines (SVM) arise as the most successful and widely used algorithm. Concerning the type of data, clinical datasets were mainly used. The title applications in the selected articles project the usefulness of extracting valuable knowledge leading to new hypotheses targeting deeper understanding and further investigation in DM.
Collapse
Affiliation(s)
- Ioannis Kavakiotis
- Department of Informatics, Aristotle University of Thessaloniki, Thessaloniki 54124, Greece
- Institute of Applied Biosciences, CERTH, Thessaloniki, Greece
| | - Olga Tsave
- Laboratory of Inorganic Chemistry, Department of Chemical Engineering, Aristotle University of Thessaloniki, Thessaloniki 54124, Greece
| | - Athanasios Salifoglou
- Laboratory of Inorganic Chemistry, Department of Chemical Engineering, Aristotle University of Thessaloniki, Thessaloniki 54124, Greece
| | - Nicos Maglaveras
- Institute of Applied Biosciences, CERTH, Thessaloniki, Greece
- Lab of Computing and Medical Informatics, Medical School, Aristotle University of Thessaloniki, Thessaloniki 54124, Greece
| | - Ioannis Vlahavas
- Department of Informatics, Aristotle University of Thessaloniki, Thessaloniki 54124, Greece
| | - Ioanna Chouvarda
- Institute of Applied Biosciences, CERTH, Thessaloniki, Greece
- Lab of Computing and Medical Informatics, Medical School, Aristotle University of Thessaloniki, Thessaloniki 54124, Greece
| |
Collapse
|
13
|
CryoProtect: A Web Server for Classifying Antifreeze Proteins from Nonantifreeze Proteins. J CHEM-NY 2017. [DOI: 10.1155/2017/9861752] [Citation(s) in RCA: 30] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
Antifreeze protein (AFP) is an ice-binding protein that protects organisms from freezing in extremely cold environments. AFPs are found across a diverse range of species and, therefore, significantly differ in their structures. As there are no consensus sequences available for determining the ice-binding domain of AFPs, thus the prediction and characterization of AFPs from their sequence is a challenging task. This study addresses this issue by predicting AFPs directly from sequence on a large set of 478 AFPs and 9,139 non-AFPs using machine learning (e.g., random forest) as a function of interpretable features (e.g., amino acid composition, dipeptide composition, and physicochemical properties). Furthermore, AFPs were characterized using propensity scores and important physicochemical properties via statistical and principal component analysis. The predictive model afforded high performance with an accuracy of 88.28% and results revealed that AFPs are likely to be composed of hydrophobic amino acids as well as amino acids with hydroxyl and sulfhydryl side chains. The predictive model is provided as a free publicly available web server called CryoProtect for classifying query protein sequence as being either AFP or non-AFP. The data set and source code are for reproducing the results which are provided on GitHub.
Collapse
|
14
|
Arulmozhiraja S, Matsuo N, Ishitsubo E, Okazaki S, Shimano H, Tokiwa H. Comparative Binding Analysis of Dipeptidyl Peptidase IV (DPP-4) with Antidiabetic Drugs - An Ab Initio Fragment Molecular Orbital Study. PLoS One 2016; 11:e0166275. [PMID: 27832184 PMCID: PMC5104442 DOI: 10.1371/journal.pone.0166275] [Citation(s) in RCA: 48] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2016] [Accepted: 10/25/2016] [Indexed: 02/07/2023] Open
Abstract
Dipeptidyl peptidase IV (DPP-4) enzyme is responsible for the degradation of incretins that stimulates insulin secretion and hence inhibition of DPP-4 becomes an established approach for the treatment of type 2 diabetics. We studied the interaction between DPP-4 and its inhibitor drugs (sitagliptin 1, linagliptin 2, alogliptin 3, and teneligliptin 4) quantitatively by using fragment molecular orbital calculations at the RI-MP2/cc-pVDZ level to analyze the inhibitory activities of the drugs. Apart from having common interactions with key residues, inhibitors encompassing the DPP-4 active site extensively interact widely with the hydrophobic pocket by their hydrophobic inhibitor moieties. The cumulative hydrophobic interaction becomes stronger for these inhibitors and hence linagliptin and teneligliptin have larger interaction energies, and consequently higher inhibitory activities, than their alogliptin and sitagliptin counterparts. Though effective interaction for both 2 and 3 is at [Formula: see text] subsite, 2 has a stronger binding to this subsite interacting with Trp629 and Tyr547 than 3 does. The presence of triazolopiperazine and piperazine moiety in 1 and 4, respectively, provides the interaction to the S2 extensive subsite; however, the latter's superior inhibitory activity is not only due to a relatively tighter binding to the S2 extensive subsite, but also due to the interactions to the S1 subsite. The calculated hydrophobic interfragment interaction energies correlate well with the experimental binding affinities (KD) and inhibitory activities (IC50) of the DPP-4 inhibitors.
Collapse
Affiliation(s)
- Sundaram Arulmozhiraja
- Department of Chemistry, Rikkyo University, 3-34-1 Nishi-Ikebukuro, Toshima-ku, Tokyo 171–8501, Japan
- Research Center for Smart molecules, Rikkyo University, 3-34-1 Nishi-Ikebukuro, Toshima-ku, Tokyo 171–8501, Japan
| | - Naoya Matsuo
- Department of Chemistry, Rikkyo University, 3-34-1 Nishi-Ikebukuro, Toshima-ku, Tokyo 171–8501, Japan
| | - Erika Ishitsubo
- Department of Chemistry, Rikkyo University, 3-34-1 Nishi-Ikebukuro, Toshima-ku, Tokyo 171–8501, Japan
| | - Seiji Okazaki
- Department of Chemistry, Rikkyo University, 3-34-1 Nishi-Ikebukuro, Toshima-ku, Tokyo 171–8501, Japan
- Research Center for Smart molecules, Rikkyo University, 3-34-1 Nishi-Ikebukuro, Toshima-ku, Tokyo 171–8501, Japan
| | - Hitoshi Shimano
- Department of Internal Medicine (Endocrinology and Metabolism), University of Tsukuba, Tsukuba 305–8575, Japan
- AMED-CREST, Tokyo, Japan
| | - Hiroaki Tokiwa
- Department of Chemistry, Rikkyo University, 3-34-1 Nishi-Ikebukuro, Toshima-ku, Tokyo 171–8501, Japan
- Research Center for Smart molecules, Rikkyo University, 3-34-1 Nishi-Ikebukuro, Toshima-ku, Tokyo 171–8501, Japan
- AMED-CREST, Tokyo, Japan
| |
Collapse
|
15
|
Anuwongcharoen N, Shoombuatong W, Tantimongcolwat T, Prachayasittikul V, Nantasenamat C. Exploring the chemical space of influenza neuraminidase inhibitors. PeerJ 2016; 4:e1958. [PMID: 27114890 PMCID: PMC4841240 DOI: 10.7717/peerj.1958] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2015] [Accepted: 03/31/2016] [Indexed: 12/01/2022] Open
Abstract
The fight against the emergence of mutant influenza strains has led to the screening of an increasing number of compounds for inhibitory activity against influenza neuraminidase. This study explores the chemical space of neuraminidase inhibitors (NAIs), which provides an opportunity to obtain further molecular insights regarding the underlying basis of their bioactivity. In particular, a large set of 347 and 175 NAIs against influenza A and B, respectively, was compiled from the literature. Molecular and quantum chemical descriptors were obtained from low-energy conformational structures geometrically optimized at the PM6 level. The bioactivities of NAIs were classified as active or inactive according to their half maximum inhibitory concentration (IC50) value in which IC50 < 1µM and ≥ 10µM were defined as active and inactive compounds, respectively. Interpretable decision rules were derived from a quantitative structure–activity relationship (QSAR) model established using a set of substructure descriptors via decision tree analysis. Univariate analysis, feature importance analysis from decision tree modeling and molecular scaffold analysis were performed on both data sets for discriminating important structural features amongst active and inactive NAIs. Good predictive performance was achieved as deduced from accuracy and Matthews correlation coefficient values in excess of 81% and 0.58, respectively, for both influenza A and B NAIs. Furthermore, molecular docking was employed to investigate the binding modes and their moiety preferences of active NAIs against both influenza A and B neuraminidases. Moreover, novel NAIs with robust binding fitness towards influenza A and B neuraminidase were generated via combinatorial library enumeration and their binding fitness was on par or better than FDA-approved drugs. The results from this study are anticipated to be beneficial for guiding the rational drug design of novel NAIs for treating influenza infections.
Collapse
Affiliation(s)
- Nuttapat Anuwongcharoen
- Center of Data Mining and Biomedical Informatics, Faculty of Medical Technology, Mahidol University, Bangkok, Thailand; Department of Clinical Microbiology and Applied Technology, Faculty of Medical Technology, Mahidol University, Bangkok, Thailand
| | - Watshara Shoombuatong
- Center of Data Mining and Biomedical Informatics, Faculty of Medical Technology, Mahidol University , Bangkok , Thailand
| | - Tanawut Tantimongcolwat
- Center for Research and Innovation, Faculty of Medical Technology, Mahidol University , Bangkok , Thailand
| | - Virapong Prachayasittikul
- Department of Clinical Microbiology and Applied Technology, Faculty of Medical Technology, Mahidol University , Bangkok , Thailand
| | - Chanin Nantasenamat
- Center of Data Mining and Biomedical Informatics, Faculty of Medical Technology, Mahidol University , Bangkok , Thailand
| |
Collapse
|