1
|
Dou B, Zhu Z, Merkurjev E, Ke L, Chen L, Jiang J, Zhu Y, Liu J, Zhang B, Wei GW. Machine Learning Methods for Small Data Challenges in Molecular Science. Chem Rev 2023; 123:8736-8780. [PMID: 37384816 PMCID: PMC10999174 DOI: 10.1021/acs.chemrev.3c00189] [Citation(s) in RCA: 44] [Impact Index Per Article: 22.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/01/2023]
Abstract
Small data are often used in scientific and engineering research due to the presence of various constraints, such as time, cost, ethics, privacy, security, and technical limitations in data acquisition. However, big data have been the focus for the past decade, small data and their challenges have received little attention, even though they are technically more severe in machine learning (ML) and deep learning (DL) studies. Overall, the small data challenge is often compounded by issues, such as data diversity, imputation, noise, imbalance, and high-dimensionality. Fortunately, the current big data era is characterized by technological breakthroughs in ML, DL, and artificial intelligence (AI), which enable data-driven scientific discovery, and many advanced ML and DL technologies developed for big data have inadvertently provided solutions for small data problems. As a result, significant progress has been made in ML and DL for small data challenges in the past decade. In this review, we summarize and analyze several emerging potential solutions to small data challenges in molecular science, including chemical and biological sciences. We review both basic machine learning algorithms, such as linear regression, logistic regression (LR), k-nearest neighbor (KNN), support vector machine (SVM), kernel learning (KL), random forest (RF), and gradient boosting trees (GBT), and more advanced techniques, including artificial neural network (ANN), convolutional neural network (CNN), U-Net, graph neural network (GNN), Generative Adversarial Network (GAN), long short-term memory (LSTM), autoencoder, transformer, transfer learning, active learning, graph-based semi-supervised learning, combining deep learning with traditional machine learning, and physical model-based data augmentation. We also briefly discuss the latest advances in these methods. Finally, we conclude the survey with a discussion of promising trends in small data challenges in molecular science.
Collapse
Affiliation(s)
- Bozheng Dou
- Research Center of Nonlinear Science, School of Mathematical and Physical Sciences,Wuhan Textile University, Wuhan 430200, P, R. China
| | - Zailiang Zhu
- Research Center of Nonlinear Science, School of Mathematical and Physical Sciences,Wuhan Textile University, Wuhan 430200, P, R. China
| | - Ekaterina Merkurjev
- Department of Mathematics, Michigan State University, East Lansing, Michigan 48824, United States
| | - Lu Ke
- Research Center of Nonlinear Science, School of Mathematical and Physical Sciences,Wuhan Textile University, Wuhan 430200, P, R. China
| | - Long Chen
- Research Center of Nonlinear Science, School of Mathematical and Physical Sciences,Wuhan Textile University, Wuhan 430200, P, R. China
| | - Jian Jiang
- Research Center of Nonlinear Science, School of Mathematical and Physical Sciences,Wuhan Textile University, Wuhan 430200, P, R. China
- Department of Mathematics, Michigan State University, East Lansing, Michigan 48824, United States
| | - Yueying Zhu
- Research Center of Nonlinear Science, School of Mathematical and Physical Sciences,Wuhan Textile University, Wuhan 430200, P, R. China
| | - Jie Liu
- Research Center of Nonlinear Science, School of Mathematical and Physical Sciences,Wuhan Textile University, Wuhan 430200, P, R. China
| | - Bengong Zhang
- Research Center of Nonlinear Science, School of Mathematical and Physical Sciences,Wuhan Textile University, Wuhan 430200, P, R. China
| | - Guo-Wei Wei
- Department of Mathematics, Michigan State University, East Lansing, Michigan 48824, United States
- Department of Electrical and Computer Engineering, Michigan State University, East Lansing, Michigan 48824, United States
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, Michigan 48824, United States
| |
Collapse
|
2
|
Perez JJ. Exploiting Knowledge on Structure-Activity Relationships for Designing Peptidomimetics of Endogenous Peptides. Biomedicines 2021; 9:651. [PMID: 34200402 PMCID: PMC8229937 DOI: 10.3390/biomedicines9060651] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2021] [Revised: 06/03/2021] [Accepted: 06/04/2021] [Indexed: 12/01/2022] Open
Abstract
Endogenous peptides are important mediators in cell communication, being consequently involved in many physiological processes. Their use as therapeutic agents is limited due to their poor pharmacokinetic profile. To circumvent this drawback, alternative diverse molecules based on the stereochemical features that confer their activity can be synthesized, using them as guidance; from peptide surrogates provided with a better pharmacokinetic profile, to small molecule peptidomimetics, through cyclic peptides. The design process requires a competent use of the structure-activity results available on individual peptides. Specifically, it requires synthesis and analysis of the activity of diverse analogs, biophysical information and computational work. In the present work, we show a general framework of the process and show its application to two specific examples: the design of selective AT1 antagonists of angiotensin and the design of selective B2 antagonists of bradykinin.
Collapse
Affiliation(s)
- Juan J Perez
- Department of Chemical Engineering, Universitat Politecnica de Catalunya, 08028 Barcelona, Spain
| |
Collapse
|
3
|
Lee YS, Remesic M, Ramos-Colon C, Wu Z, LaVigne J, Molnar G, Tymecka D, Misicka A, Streicher JM, Hruby VJ, Porreca F. Multifunctional Enkephalin Analogs with a New Biological Profile: MOR/DOR Agonism and KOR Antagonism. Biomedicines 2021; 9:biomedicines9060625. [PMID: 34072734 PMCID: PMC8229567 DOI: 10.3390/biomedicines9060625] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2021] [Revised: 05/27/2021] [Accepted: 05/28/2021] [Indexed: 11/25/2022] Open
Abstract
In our previous studies, we developed a series of mixed MOR/DOR agonists that are enkephalin-like tetrapeptide analogs with an N-phenyl-N-piperidin-4-ylpropionamide (Ppp) moiety at the C-terminus. Further SAR study on the analogs, initiated by the findings from off-target screening, resulted in the discovery of LYS744 (6, Dmt-DNle-Gly-Phe(p-Cl)-Ppp), a multifunctional ligand with MOR/DOR agonist and KOR antagonist activity (GTPγS assay: IC50 = 52 nM, Imax = 122% cf. IC50 = 59 nM, Imax = 100% for naloxone) with nanomolar range of binding affinity (Ki = 1.3 nM cf. Ki = 2.4 nM for salvinorin A). Based on its unique biological profile, 6 is considered to possess high therapeutic potential for the treatment of chronic pain by modulating pathological KOR activation while retaining analgesic efficacy attributed to its MOR/DOR agonist activity.
Collapse
Affiliation(s)
- Yeon Sun Lee
- Department of Pharmacology, University of Arizona, Tucson, AZ 85724, USA; (J.L.); (G.M.); (J.M.S.); (F.P.)
- Correspondence: ; Tel.: +1-520-626-2820
| | - Michael Remesic
- Department of Chemistry and Biochemistry, University of Arizona, Tucson, AZ 85721, USA; (M.R.); (C.R.-C.); (V.J.H.)
| | - Cyf Ramos-Colon
- Department of Chemistry and Biochemistry, University of Arizona, Tucson, AZ 85721, USA; (M.R.); (C.R.-C.); (V.J.H.)
| | - Zhijun Wu
- ABC Resource, Plainsboro, NJ 08536, USA;
| | - Justin LaVigne
- Department of Pharmacology, University of Arizona, Tucson, AZ 85724, USA; (J.L.); (G.M.); (J.M.S.); (F.P.)
| | - Gabriella Molnar
- Department of Pharmacology, University of Arizona, Tucson, AZ 85724, USA; (J.L.); (G.M.); (J.M.S.); (F.P.)
| | - Dagmara Tymecka
- Faculty of Chemistry, University of Warsaw, Pasteura, PL-02-093 Warsaw, Poland; (D.T.); (A.M.)
| | - Aleksandra Misicka
- Faculty of Chemistry, University of Warsaw, Pasteura, PL-02-093 Warsaw, Poland; (D.T.); (A.M.)
| | - John M. Streicher
- Department of Pharmacology, University of Arizona, Tucson, AZ 85724, USA; (J.L.); (G.M.); (J.M.S.); (F.P.)
| | - Victor J. Hruby
- Department of Chemistry and Biochemistry, University of Arizona, Tucson, AZ 85721, USA; (M.R.); (C.R.-C.); (V.J.H.)
| | - Frank Porreca
- Department of Pharmacology, University of Arizona, Tucson, AZ 85724, USA; (J.L.); (G.M.); (J.M.S.); (F.P.)
| |
Collapse
|
5
|
Wu Z, Hruby VJ. Toward a Universal μ-Agonist Template for Template-Based Alignment Modeling of Opioid Ligands. ACS OMEGA 2019; 4:17457-17476. [PMID: 31656918 PMCID: PMC6812133 DOI: 10.1021/acsomega.9b02244] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/19/2019] [Accepted: 09/25/2019] [Indexed: 05/04/2023]
Abstract
Opioid ligands are a large group of G-protein-coupled receptor ligands possessing high structural diversity, along with complicated structure-activity relationships (SARs). To better understand their structural correlations as well as the related SARs, we developed the innovative template-based alignment modeling in our recent studies on a variety of opioid ligands. As previously reported, this approach showed promise but also with limitations, which was mainly attributed to the small size of morphine as a template. With this study, we set out to construct an artificial μ-agonist template to overcome this limitation. The newly constructed template contained a largely extended scaffold, along with a few special μ-features relevant to the μ-selectivity of opioid ligands. As demonstrated in this paper, the new template showed significantly improved efficacy in facilitating the alignment modeling of a wide variety of opioid ligands. This report comprises of two main parts. Part 1 discusses the general construction process and the structural features as well as a few typical examples of the template applications and Part 2 focuses on the template refinement and validation.
Collapse
Affiliation(s)
- Zhijun Wu
- ABC Resource, Plainsboro, New Jersey 08536, United States
- E-mail:
| | - Victor J. Hruby
- Department of Chemistry and Biochemistry, University of Arizona, Tucson, Arizona 85716, United States
| |
Collapse
|
6
|
Deekonda S, Rankin D, Davis P, Lai J, Vanderah TW, Porecca F, Hruby VJ. Design synthesis and structure-activity relationship of 5-substituted (tetrahydronaphthalen-2yl)methyl with N-phenyl-N-(piperidin-2-yl)propionamide derivatives as opioid ligands. Bioorg Med Chem 2015; 24:85-91. [PMID: 26712115 DOI: 10.1016/j.bmc.2015.11.030] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2015] [Revised: 11/11/2015] [Accepted: 11/21/2015] [Indexed: 11/19/2022]
Abstract
Here, we report the design, synthesis and structure activity relationship of novel small molecule opioid ligands based on 5-amino substituted (tetrahydronaphthalen-2-yl)methyl moiety with N-phenyl-N-(piperidin-2-yl)propionamide derivatives. We synthesized various molecules including amino, amide and hydroxy substitution on the 5th position of the (tetrahydronaphthalen-2-yl)methyl moiety. In our further designs we replaced the (tetrahydronaphthalen-2-yl)methyl moiety with benzyl and phenethyl moiety. These N-phenyl-N-(piperidin-2-yl)propionamide analogues showed moderate to good binding affinities (850-4 nM) and were selective towards the μ opioid receptor over the δ opioid receptors. From the structure activity relationship studies, we found that a hydroxyl substitution at the 5th position of (tetrahydronapthalen-2yl)methyl group, ligands 19 and 20, showed excellent binding affinities 4 and 5 nM, respectively, and 1000 fold selectivity towards the μ opioid relative to the delta opioid receptor. The ligand 19 showed potent agonist activities 75±21 nM, and 190±42 nM in the GPI and MVD assays. Surprisingly the fluoro analogue 20 showed good agonist activities in MVD assays 170±42 nM, in contrast to its binding affinity results.
Collapse
Affiliation(s)
- Srinivas Deekonda
- Department of Chemistry and Biochemistry, University of Arizona, 1306 E. University Boulevard, Tucson, AZ 85721, USA
| | - David Rankin
- Department of Pharmacology, University of Arizona, Tucson, AZ 85721, USA
| | - Peg Davis
- Department of Pharmacology, University of Arizona, Tucson, AZ 85721, USA
| | - Josephine Lai
- Department of Pharmacology, University of Arizona, Tucson, AZ 85721, USA
| | - Todd W Vanderah
- Department of Pharmacology, University of Arizona, Tucson, AZ 85721, USA
| | - Frank Porecca
- Department of Pharmacology, University of Arizona, Tucson, AZ 85721, USA
| | - Victor J Hruby
- Department of Chemistry and Biochemistry, University of Arizona, 1306 E. University Boulevard, Tucson, AZ 85721, USA.
| |
Collapse
|
7
|
Baldo BA, Pham NH. Histamine-releasing and allergenic properties of opioid analgesic drugs: resolving the two. Anaesth Intensive Care 2012; 40:216-35. [PMID: 22417016 DOI: 10.1177/0310057x1204000204] [Citation(s) in RCA: 107] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Opioid analgesics are amongst the most commonly administered drugs in hospitals. Whether natural or synthetic, they show some common structural features, morphine-like pharmacological action and binding specificity for complementary opioid receptors. Tramadol differs from the other opioid analgesics in possessing monoaminergic activity in addition to its affinity for the µ opioid receptor. Many opioids are potent histamine releasers producing a variety of haemodynamic changes and anaphylactoid reactions, but the relationship of the appearance of these effects to the histamine plasma concentration is complex and there is no direct and invariable relationship between the two. Studies of the histamine-releasing effects, chiefly centred on morphine, reveal variable findings and conclusions often due to a range of factors including differences in technical measurements, dose, mode of administration, site of injection, the anatomical distribution of histamine receptors and heterogeneity of patient responses. Morphine itself has multiple direct effects on the vasculature and other haemodynamically-active mediators released along with histamine contribute to the variable responses to opioid drug administration. Despite their heavy use and occasional apparent anaphylactic-like side-effects, immunoglobulin E antibody-mediated immediate hypersensitivity reactions to the drugs are not often encountered. Uncertainties associated with skin testing with these known histamine-releasers, and the general unavailability of opioid drug-specific immunoglobulin E antibody tests contribute to the frequent failure to adequately investigate and establish underlying mechanisms of reactions by distinguishing anaphylactoid from true anaphylactic reactions. Clinical implications for diagnosis of reactions and some speculations on the rarity of true Type 1 allergies to these drugs are presented.
Collapse
|