1
|
Ethnicity is not biology: retinal pigment score to evaluate biological variability from ophthalmic imaging using machine learning. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.06.28.23291873. [PMID: 37461664 PMCID: PMC10350142 DOI: 10.1101/2023.06.28.23291873] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 07/24/2023]
Abstract
Background Few metrics exist to describe phenotypic diversity within ophthalmic imaging datasets, with researchers often using ethnicity as an inappropriate marker for biological variability. Methods We derived a continuous, measured metric, the retinal pigment score (RPS), that quantifies the degree of pigmentation from a colour fundus photograph of the eye. RPS was validated using two large epidemiological studies with demographic and genetic data (UK Biobank and EPIC-Norfolk Study). Findings A genome-wide association study (GWAS) of RPS from UK Biobank identified 20 loci with known associations with skin, iris and hair pigmentation, of which 8 were replicated in the EPIC-Norfolk cohort. There was a strong association between RPS and ethnicity, however, there was substantial overlap between each ethnicity and the respective distributions of RPS scores. Interpretation RPS serves to decouple traditional demographic variables, such as ethnicity, from clinical imaging characteristics. RPS may serve as a useful metric to quantify the diversity of the training, validation, and testing datasets used in the development of AI algorithms to ensure adequate inclusion and explainability of the model performance, critical in evaluating all currently deployed AI models. The code to derive RPS is publicly available at: https://github.com/uw-biomedical-ml/retinal-pigmentation-score. Funding The authors did not receive support from any organisation for the submitted work.
Collapse
|
2
|
Metabolic perturbations in fibrosis disease. Int J Biochem Cell Biol 2021; 139:106073. [PMID: 34461262 DOI: 10.1016/j.biocel.2021.106073] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2021] [Revised: 08/09/2021] [Accepted: 08/25/2021] [Indexed: 12/19/2022]
Abstract
Metabolic changes occur in all forms of disease but their impact on fibrosis is a relatively recent area of interest. This review provides an overview of the major metabolic pathways, glycolysis, amino acid metabolism and lipid metabolism, and highlights how they influence fibrosis at a cellular and tissue level, drawing on key discoveries in dermal, renal, pulmonary and hepatic fibrosis. The emerging influence of adipose tissue-derived cytokines is discussed and brings a link between fibrosis and systemic metabolism. To close, the concept of targeting metabolism for fibrotic therapy is reviewed, drawing on lessons from the more established field of cancer metabolism, with an emphasis on important considerations for clinical translation.
Collapse
|
3
|
The effect of water hardness on atopic eczema, skin barrier function: A systematic review, meta-analysis. Clin Exp Allergy 2020; 51:430-451. [PMID: 33259122 DOI: 10.1111/cea.13797] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2020] [Revised: 11/03/2020] [Accepted: 11/11/2020] [Indexed: 12/29/2022]
Abstract
BACKGROUND Hard domestic water has been reported to worsen atopic eczema (AE) and may contribute to its development in early life. OBJECTIVE To review the literature on the relationship between the effect of water hardness (high calcium carbonate; CaCO3 ) on (a) the risk of developing AE, (b) the treatment of existing AE and (c) skin barrier function in human and animal studies. DESIGN , DATA SOURCES AND ELIGIBILITY CRITERIA We systematically searched databases (MEDLINE, Embase, Cochrane CENTRAL, GREAT and Web of Science) from inception until 30/6/2020. Human and animal observational and experimental studies were included. The primary outcomes were risk of AE and skin barrier function. Studies were meta-analysed using a random effects model. Evidence certainty was evaluated using the Grading of Recommendations, Assessment, Development and Evaluations (GRADE) approach. RESULTS Sixteen studies were included. Pooled observational data from seven studies on 385,901 participants identified increased odds of AE in children exposed to harder versus softer water (odds ratio 1.28, 95% CI 1.09, 1.50; GRADE certainty: very low). Two mechanistic studies in humans reported higher deposition of the detergent sodium lauryl sulphate in those exposed to harder versus softer water. Two randomized controlled trials comparing water softeners with standard care did not show a significant difference in objective AE severity with softened water (standardized mean difference 0.06 standard deviations higher, 95% CI 0.16 lower to 0.27 higher; GRADE certainty: moderate). CONCLUSIONS & CLINICAL RELEVANCE There was a positive association between living in a hard water (range: 76 to > 350 mg/L CaCO3 ) area and AE in children. There is no evidence that domestic water softeners improve objective disease severity in established AE. There may be a role of water hardness in the initiation of skin inflammation in early life, but there is a need for further longitudinal and interventional studies.
Collapse
|
4
|
Time Series Integrative Analysis of RNA Sequencing and MicroRNA Expression Data Reveals Key Biologic Wound Healing Pathways in Keloid-Prone Individuals. J Invest Dermatol 2018; 138:2690-2693. [PMID: 29870686 DOI: 10.1016/j.jid.2018.05.017] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2018] [Revised: 05/22/2018] [Accepted: 05/24/2018] [Indexed: 12/11/2022]
|
5
|
Petri Net computational modelling of Langerhans cell Interferon Regulatory Factor Network predicts their role in T cell activation. Sci Rep 2017; 7:668. [PMID: 28386100 PMCID: PMC5428800 DOI: 10.1038/s41598-017-00651-5] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2016] [Accepted: 03/08/2017] [Indexed: 01/29/2023] Open
Abstract
Langerhans cells (LCs) are able to orchestrate adaptive immune responses in the skin by interpreting the microenvironmental context in which they encounter foreign substances, but the regulatory basis for this has not been established. Utilising systems immunology approaches combining in silico modelling of a reconstructed gene regulatory network (GRN) with in vitro validation of the predictions, we sought to determine the mechanisms of regulation of immune responses in human primary LCs. The key role of Interferon regulatory factors (IRFs) as controllers of the human Langerhans cell response to epidermal cytokines was revealed by whole transcriptome analysis. Applying Boolean logic we assembled a Petri net-based model of the IRF-GRN which provides molecular pathway predictions for the induction of different transcriptional programmes in LCs. In silico simulations performed after model parameterisation with transcription factor expression values predicted that human LC activation of antigen-specific CD8 T cells would be differentially regulated by epidermal cytokine induction of specific IRF-controlled pathways. This was confirmed by in vitro measurement of IFN-γ production by activated T cells. As a proof of concept, this approach shows that stochastic modelling of a specific immune networks renders transcriptome data valuable for the prediction of functional outcomes of immune responses.
Collapse
|
6
|
Primary cutaneous nodular amyloidosis associated with psoriasis. Clin Exp Dermatol 2014; 39:608-11. [PMID: 24888341 DOI: 10.1111/ced.12347] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/11/2014] [Indexed: 11/30/2022]
Abstract
Primary cutaneous nodular amyloidosis (PCNA) presents as solitary or multiple firm, waxy nodules with a predilection for acral areas. Histologically, PCNA can be identical to myeloma-associated systemic amyloidosis with monoclonal immunoglobulin light chain deposits. We describe a patient in whom PCNA developed in a scar in an area affected by chronic plaque psoriasis. PCNA has previously been associated with other autoimmune diseases, but to our knowledge, this is the first association with psoriasis. Interestingly, T helper (Th)17 cells, which are crucial in psoriasis pathogenesis, have recently been implicated in promotion of myeloma and plasma cell dyscrasias. The association of psoriasis and plasma-cell light chain production in the skin, as in this case, suggests a possible role for Th17 cells in PCNA formation. The dermatopathological literature of this rare but important disease is discussed.
Collapse
|
7
|
Machine learning approaches for predicting compounds that interact with therapeutic and ADMET related proteins. J Pharm Sci 2007; 96:2838-60. [PMID: 17786989 DOI: 10.1002/jps.20985] [Citation(s) in RCA: 46] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Computational methods for predicting compounds of specific pharmacodynamic and ADMET (absorption, distribution, metabolism, excretion and toxicity) property are useful for facilitating drug discovery and evaluation. Recently, machine learning methods such as neural networks and support vector machines have been explored for predicting inhibitors, antagonists, blockers, agonists, activators and substrates of proteins related to specific therapeutic and ADMET property. These methods are particularly useful for compounds of diverse structures to complement QSAR methods, and for cases of unavailable receptor 3D structure to complement structure-based methods. A number of studies have demonstrated the potential of these methods for predicting such compounds as substrates of P-glycoprotein and cytochrome P450 CYP isoenzymes, inhibitors of protein kinases and CYP isoenzymes, and agonists of serotonin receptor and estrogen receptor. This article is intended to review the strategies, current progresses and underlying difficulties in using machine learning methods for predicting these protein binders and as potential virtual screening tools. Algorithms for proper representation of the structural and physicochemical properties of compounds are also evaluated.
Collapse
|
8
|
Usefulness of traditionally defined herbal properties for distinguishing prescriptions of traditional Chinese medicine from non-prescription recipes. JOURNAL OF ETHNOPHARMACOLOGY 2007; 109:21-8. [PMID: 16884871 DOI: 10.1016/j.jep.2006.06.007] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/10/2006] [Revised: 05/31/2006] [Accepted: 06/14/2006] [Indexed: 05/11/2023]
Abstract
Traditional Chinese medicine (TCM) has been widely practiced and is considered as an attractive to conventional medicine. Multi-herb recipes have been routinely used in TCM. These have been formulated by using TCM-defined herbal properties (TCM-HPs), the scientific basis of which is unclear. The usefulness of TCM-HPs was evaluated by analyzing the distribution pattern of TCM-HPs of the constituent herbs in 1161 classical TCM prescriptions, which shows patterns of multi-herb correlation. Two artificial intelligence (AI) methods were used to examine whether TCM-HPs are capable of distinguishing TCM prescriptions from non-TCM recipes. Two AI systems were trained and tested by using 1161 TCM prescriptions, 11,202 non-TCM recipes, and two separate evaluation methods. These systems correctly classified 83.1-97.3% of the TCM prescriptions, 90.8-92.3% of the non-TCM recipes. These results suggest that TCM-HPs are capable of separating TCM prescriptions from non-TCM recipes, which are useful for formulating TCM prescriptions and consistent with the expected correlation between TCM-HPs and the physicochemical properties of herbal ingredients responsible for producing the collective pharmacological and other effects of specific TCM prescriptions.
Collapse
|
9
|
Computer prediction of cardiovascular and hematological agents by statistical learning methods. Cardiovasc Hematol Agents Med Chem 2007; 5:11-9. [PMID: 17266544 DOI: 10.2174/187152507779315787] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/13/2023]
Abstract
Computational methods have been explored for predicting agents that produce therapeutic or adverse effects in cardiovascular and hematological systems. The quantitative structure-activity relationship (QSAR) method is the first statistical learning methods successfully used for predicting various classes of cardiovascular and hematological agents. In recent years, more sophisticated statistical learning methods have been explored for predicting cardiovascular and hematological agents particularly those of diverse structures that might not be straightforwardly modelled by single QSAR models. These methods include partial least squares, multiple linear regressions, linear discriminant analysis, k-nearest neighbour, artificial neural networks and support vector machines. Their application potential has been exhibited in the prediction of various classes of cardiovascular and hematological agents including 1, 4-dihydropyridine calcium channel antagonists, angiotensin converting enzyme inhibitors, thrombin inhibitors, AchE inhibitors, HERG potassium channel inhibitors and blockers, potassium channel openers, platelet aggregation inhibitors, protein kinase inhibitors, dopamine antagonists and torsade de pointes causing agents. This article reviews the strategies, current progresses and problems in using statistical learning methods for predicting cardiovascular and hematological agents. It also evaluates algorithms for properly representing and extracting the structural and physicochemical properties of compounds relevant to the prediction of cardiovascular and hematological agents.
Collapse
|
10
|
Database of traditional Chinese medicine and its application to studies of mechanism and to prescription validation. Br J Pharmacol 2006; 149:1092-103. [PMID: 17088869 PMCID: PMC2014641 DOI: 10.1038/sj.bjp.0706945] [Citation(s) in RCA: 129] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022] Open
Abstract
BACKGROUND AND PURPOSE Traditional Chinese Medicine (TCM) is widely practised and is viewed as an attractive alternative to conventional medicine. Quantitative information about TCM prescriptions, constituent herbs and herbal ingredients is necessary for studying and exploring TCM. EXPERIMENTAL APPROACH We manually collected information on TCM in books and other printed sources in Medline. The Traditional Chinese Medicine Information Database TCM-ID, at http://tcm.cz3.nus.edu.sg/group/tcm-id/tcmid.asp, was introduced for providing comprehensive information about all aspects of TCM including prescriptions, constituent herbs, herbal ingredients, molecular structure and functional properties of active ingredients, therapeutic and side effects, clinical indication and application and related matters. RESULTS TCM-ID currently contains information for 1,588 prescriptions, 1,313 herbs, 5,669 herbal ingredients, and the 3D structure of 3,725 herbal ingredients. The value of the data in TCM-ID was illustrated by using some of the data for an in-silico study of molecular mechanism of the therapeutic effects of herbal ingredients and for developing a computer program to validate TCM multi-herb preparations. CONCLUSIONS AND IMPLICATIONS The development of systems biology has led to a new design principle for therapeutic intervention strategy, the concept of 'magic shrapnel' (rather than the 'magic bullet'), involving many drugs against multiple targets, administered in a single treatment. TCM offers an extensive source of examples of this concept in which several active ingredients in one prescription are aimed at numerous targets and work together to provide therapeutic benefit. The database and its mining applications described here represent early efforts toward exploring TCM for new theories in drug discovery.
Collapse
|
11
|
Classification of a diverse set of Tetrahymena pyriformis toxicity chemical compounds from molecular descriptors by statistical learning methods. Chem Res Toxicol 2006; 19:1030-9. [PMID: 16918241 DOI: 10.1021/tx0600550] [Citation(s) in RCA: 42] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
Abstract
Toxicity of various compounds has been measured in many studies by their toxic effects against Tetrahymena pyriformis. Efforts have also been made to use computational quantitative structure-activity relationship (QSAR) and statistical learning methods (SLMs) for predicting Tetrahymena pyriformis toxicity (TPT) at impressive accuracies. Because of the diversity of compounds and toxicity mechanisms, it is desirable to explore additional methods and to examine if these methods are applicable to more diverse sets of compounds. We tested several SLMs (logistic regression, C4.5 decision tree, k-nearest neighbor, probabilistic neural network, support vector machines) for their capability in predicting TPT by using 1129 compounds (841 TPT and 288 non-TPT agents) which are more diverse than those in other studies. A feature selection method was used for improving prediction performance and selecting molecular descriptors responsible for distinguishing TPT and non-TPT agents. The prediction accuracies are 86.9% approximately 94.2% for TPT and 71.2% approximately 87.5% for non-TPT agents based on 5-fold cross-validation studies, which are comparable to some of earlier studies despite the use of more diverse sets of compounds. The selected molecular descriptors are consistent with those used in other studies and experimental findings. These suggest that SLMs are useful for predicting TPT potential of diverse sets of compounds and for characterizing the molecular descriptors associated with TPT.
Collapse
|
12
|
Prediction of estrogen receptor agonists and characterization of associated molecular descriptors by statistical learning methods. J Mol Graph Model 2006; 25:313-23. [PMID: 16497524 DOI: 10.1016/j.jmgm.2006.01.007] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2005] [Revised: 12/21/2005] [Accepted: 01/19/2006] [Indexed: 01/04/2023]
Abstract
Specific estrogen receptor (ER) agonists have been used for hormone replacement therapy, contraception, osteoporosis prevention, and prostate cancer treatment. Some ER agonists and partial-agonists induce cancer and endocrine function disruption. Methods for predicting ER agonists are useful for facilitating drug discovery and chemical safety evaluation. Structure-activity relationships and rule-based decision forest models have been derived for predicting ER binders at impressive accuracies of 87.1-97.6% for ER binders and 80.2-96.0% for ER non-binders. However, these are not designed for identifying ER agonists and they were developed from a subset of known ER binders. This work explored several statistical learning methods (support vector machines, k-nearest neighbor, probabilistic neural network and C4.5 decision tree) for predicting ER agonists from comprehensive set of known ER agonists and other compounds. The corresponding prediction systems were developed and tested by using 243 ER agonists and 463 ER non-agonists, respectively, which are significantly larger in number and structural diversity than those in previous studies. A feature selection method was used for selecting molecular descriptors responsible for distinguishing ER agonists from non-agonists, some of which are consistent with those used in other studies and the findings from X-ray crystallography data. The prediction accuracies of these methods are comparable to those of earlier studies despite the use of significantly more diverse range of compounds. SVM gives the best accuracy of 88.9% for ER agonists and 98.1% for non-agonists. Our study suggests that statistical learning methods such as SVM are potentially useful for facilitating the prediction of ER agonists and for characterizing the molecular descriptors associated with ER agonists.
Collapse
|
13
|
Abstract
Pregnane X receptor (PXR) regulates drug metabolism and is involved in drug-drug interactions. Prediction of PXR activators is important for evaluating drug metabolism and toxicity. Computational pharmacophore and quantitative structure-activity relationship models have been developed for predicting PXR activators. Because of the structural diversity of PXR activators, more efforts are needed for exploring methods applicable to a broader spectrum of compounds. We explored three machine learning methods (MLMs) for predicting PXR activators, which were trained and tested by using significantly higher number of compounds, 128 PXR activators (98 human) and 77 PXR non-activators, than those of previous studies. The recursive feature-selection method was used to select molecular descriptors relevant to PXR activator prediction, which are consistent with conclusions from other computational and structural studies. In a 10-fold cross-validation test, our MLM systems correctly predicted 81.2 to 84.0% of PXR activators, 80.8 to 85.0% of hPXR activators, 61.2 to 70.3% of PXR nonactivators, and 67.7 to 73.6% of hPXR nonactivators. Our systems also correctly predicted 73.3 to 86.7% of 15 newly published hPXR activators. MLMs seem to be useful for predicting PXR activators and for providing clues to physicochemical features of PXR activation.
Collapse
|
14
|
Prediction of compounds with specific pharmacodynamic, pharmacokinetic or toxicological property by statistical learning methods. Mini Rev Med Chem 2006; 6:449-59. [PMID: 16613581 DOI: 10.2174/138955706776361501] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Computational methods for predicting compounds of specific pharmacodynamic, pharmacokinetic, or toxicological property are useful for facilitating drug discovery and drug safety evaluation. The quantitative structure-activity relationship (QSAR) and quantitative structure-property relationship (QSPR) methods are the most successfully used statistical learning methods for predicting compounds of specific property. More recently, other statistical learning methods such as neural networks and support vector machines have been explored for predicting compounds of higher structural diversity than those covered by QSAR and QSPR. These methods have shown promising potential in a number of studies. This article is intended to review the strategies, current progresses and underlying difficulties in using statistical learning methods for predicting compounds of specific property. It also evaluates algorithms commonly used for representing structural and physicochemical properties of compounds.
Collapse
|
15
|
Abstract
Various toxicological profiles, such as genotoxic potential, need to be studied in drug discovery processes and submitted to the drug regulatory authorities for drug safety evaluation. As part of the effort for developing low cost and efficient adverse drug reaction testing tools, several statistical learning methods have been used for developing genotoxicity prediction systems with an accuracy of up to 73.8% for genotoxic (GT+) and 92.8% for nongenotoxic (GT-) agents. These systems have been developed and tested by using less than 400 known GT+ and GT- agents, which is significantly less in number and diversity than the 860 GT+ and GT- agents known at present. There is a need to examine if a similar level of accuracy can be achieved for the more diverse set of molecules and to evaluate other statistical learning methods not yet applied to genotoxicity prediction. This work is intended for testing several statistical learning methods by using 860 GT+ and GT- agents, which include support vector machines (SVM), probabilistic neural network (PNN), k-nearest neighbor (k-NN), and C4.5 decision tree (DT). A feature selection method, recursive feature elimination, is used for selecting molecular descriptors relevant to genotoxicity study. The overall accuracies of SVM, k-NN, and PNN are comparable to and those of DT lower than the results from earlier studies, with SVM giving the highest accuracies of 77.8% for GT+ and 92.7% for GT- agents. Our study suggests that statistical learning methods, particularly SVM, k-NN, and PNN, are useful for facilitating the prediction of genotoxic potential of a diverse set of molecules.
Collapse
|
16
|
Computer automated prediction of potential therapeutic and toxicity protein targets of bioactive compounds from Chinese medicinal plants. THE AMERICAN JOURNAL OF CHINESE MEDICINE 2002; 30:139-54. [PMID: 12067089 DOI: 10.1142/s0192415x02000156] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Understanding the molecular mechanism and pharmacology of bioactive compounds from Chinese medicinal plants (CMP) is important in facilitating scientific evaluation of novel therapeutic approaches in traditional Chinese medicine. It is also of significance in new drug development based on the mechanism of Chinese medicine. A key step towards this task is the determination of the therapeutic and toxicity protein targets of CMP compounds. In this work, newly developed computer software INVDOCK is used for automated identification of potential therapeutic and toxicity targets of several bioactive compounds isolated from Chinese medicinal plants. This software searches a protein database to find proteins to which a CMP compound can bind or weakly bind. INVDOCK results on three CMP compounds (allicin, catechin and camptotecin) show that 60% of computer-identified potential therapeutic protein targets and 27% of computer-identified potential toxicity targets have been implicated or confirmed by experiments. This software may potentially be used as a relatively fast-speed and low-cost tool for facilitating the study of molecular mechanism and pharmacology of bioactive compounds from Chinese medicinal plants and natural products from other sources.
Collapse
|
17
|
Prediction of potential toxicity and side effect protein targets of a small molecule by a ligand-protein inverse docking approach. J Mol Graph Model 2002; 20:199-218. [PMID: 11766046 DOI: 10.1016/s1093-3263(01)00109-7] [Citation(s) in RCA: 112] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
Determination of potential drug toxicity and side effect in early stages of drug development is important in reducing the cost and time of drug discovery. In this work, we explore a computer method for predicting potential toxicity and side effect protein targets of a small molecule. A ligand-protein inverse docking approach is used for computer-automated search of a protein cavity database to identify protein targets. This database is developed from protein 3D structures in the protein data bank (PDB). Docking is conducted by a procedure involving multiple conformer shape-matching alignment of a molecule to a cavity followed by molecular-mechanics torsion optimization and energy minimization on both the molecule and the protein residues at the binding region. Potential protein targets are selected by evaluation of molecular mechanics energy and, while applicable, further analysis of its binding competitiveness against other ligands that bind to the same receptor site in at least one PDB entry. Our results on several drugs show that 83% of the experimentally known toxicity and side effect targets for these drugs are predicted. The computer search successfully predicted 38 and missed five experimentally confirmed or implicated protein targets with available structure and in which binding involves no covalent bond. There are additional 30 predicted targets yet to be validated experimentally. Application of this computer approach can potentially facilitate the prediction of toxicity and side effect of a drug or drug lead.
Collapse
|