Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Tetko IV, Sushko Y, Novotarskyi S, Patiny L, Kondratov I, Petrenko AE, Charochkina L, Asiri AM. How accurately can we predict the melting points of drug-like compounds? J Chem Inf Model 2014;54:3320-9. [PMID: 25489863 PMCID: PMC4702524 DOI: 10.1021/ci5005288] [Citation(s) in RCA: 58] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]

For:	Tetko IV, Sushko Y, Novotarskyi S, Patiny L, Kondratov I, Petrenko AE, Charochkina L, Asiri AM. How accurately can we predict the melting points of drug-like compounds? J Chem Inf Model 2014;54:3320-9. [PMID: 25489863 PMCID: PMC4702524 DOI: 10.1021/ci5005288] [Citation(s) in RCA: 58] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]

Number

Cited by Other Article(s)

Zhu X, Polyakov VR, Bajjuri K, Hu H, Maderna A, Tovee CA, Ward SC. Building Machine Learning Small Molecule Melting Points and Solubility Models Using CCDC Melting Points Dataset. J Chem Inf Model 2023;63:2948-2959. [PMID: 37125691 DOI: 10.1021/acs.jcim.3c00308] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/02/2023]

Syed TA, Ansari KB, Banerjee A, Wood DA, Khan MS, Al Mesfer MK. Machine‐learning predictions of caffeine co‐crystal formation accompanying experimental and molecular validations. J FOOD PROCESS ENG 2022. [DOI: 10.1111/jfpe.14230] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]

Li Y, Aslam A, Saeed S, Zhang G, Kanwal S. Targeting highly resisted anticancer drugs through topological descriptors using VIKOR multi-criteria decision analysis. EUROPEAN PHYSICAL JOURNAL PLUS 2022;137:1245. [PMID: 36405039 PMCID: PMC9667010 DOI: 10.1140/epjp/s13360-022-03469-x] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/10/2022] [Accepted: 11/06/2022] [Indexed: 06/16/2023]

Machine learning models for phase transition and decomposition temperature of ionic liquids. J Mol Liq 2022. [DOI: 10.1016/j.molliq.2022.120247] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023]

Carrera GVSM. The Melting Point Profile of Organic Molecules: A Chemoinformatic Approach. ADVANCED THEORY AND SIMULATIONS 2022. [DOI: 10.1002/adts.202200503] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]

Bujak M, Podsiadło M, Katrusiak A. Response to comment on Properties and interactions - melting point of tribromobenzene isomers. ACTA CRYSTALLOGRAPHICA SECTION B, STRUCTURAL SCIENCE, CRYSTAL ENGINEERING AND MATERIALS 2022;78:276-278. [PMID: 35411867 DOI: 10.1107/s2052520622003067] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]

Makarov D, Fadeeva Y, Shmukler L, Tetko I. Beware of proper validation of models for ionic Liquids! J Mol Liq 2021. [DOI: 10.1016/j.molliq.2021.117722] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]

On prediction of melting points without computer simulation: a focus on energetic molecular crystals. FIREPHYSCHEM 2021. [DOI: 10.1016/j.fpc.2021.11.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open

Xiang Y, Tang YH, Liu H, Lin G, Sun H. Predicting Single-Substance Phase Diagrams: A Kernel Approach on Graph Representations of Molecules. J Phys Chem A 2021;125:4488-4497. [PMID: 33999627 DOI: 10.1021/acs.jpca.1c02391] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]

Toropova AP, Toropov AA, Benfenati E. The self-organizing vector of atom-pairs proportions: use to develop models for melting points. Struct Chem 2021. [DOI: 10.1007/s11224-021-01778-y] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]

Sifain AE, Rice BM, Yalkowsky SH, Barnes BC. Machine learning transition temperatures from 2D structure. J Mol Graph Model 2021;105:107848. [PMID: 33667863 DOI: 10.1016/j.jmgm.2021.107848] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2020] [Revised: 01/11/2021] [Accepted: 01/19/2021] [Indexed: 10/22/2022]

Fu L, Yang ZY, Yang ZJ, Yin MZ, Lu AP, Chen X, Liu S, Hou TJ, Cao DS. QSAR-assisted-MMPA to expand chemical transformation space for lead optimization. Brief Bioinform 2021;22:6071857. [PMID: 33418563 DOI: 10.1093/bib/bbaa374] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2020] [Revised: 10/25/2020] [Accepted: 11/25/2020] [Indexed: 11/13/2022] Open

Abstract

Matched molecular pairs analysis (MMPA) has become a powerful tool for automatically and systematically identifying medicinal chemistry transformations from compound/property datasets. However, accurate determination of matched molecular pair (MMP) transformations largely depend on the size and quality of existing experimental data. Lack of high-quality experimental data heavily hampers the extraction of more effective medicinal chemistry knowledge. Here, we developed a new strategy called quantitative structure-activity relationship (QSAR)-assisted-MMPA to expand the number of chemical transformations and took the logD7.4 property endpoint as an example to demonstrate the reliability of the new method. A reliable logD7.4 consensus prediction model was firstly established, and its applicability domain was strictly assessed. By applying the reliable logD7.4 prediction model to screen two chemical databases, we obtained more high-quality logD7.4 data by defining a strict applicability domain threshold. Then, MMPA was performed on the predicted data and experimental data to derive more chemical rules. To validate the reliability of the chemical rules, we compared the magnitude and directionality of the property changes of the predicted rules with those of the measured rules. Then, we compared the novel chemical rules generated by our proposed approach with the published chemical rules, and found that the magnitude and directionality of the property changes were consistent, indicating that the proposed QSAR-assisted-MMPA approach has the potential to enrich the collection of rule types or even identify completely novel rules. Finally, we found that the number of the MMP rules derived from the experimental data could be amplified by the predicted data, which is helpful for us to analyze the medicinal chemical rules in local chemical environment. In summary, the proposed QSAR-assisted-MMPA approach could be regarded as a very promising strategy to expand the chemical transformation space for lead optimization, especially when no enough experimental data can support MMPA.

Collapse

Korshunova M, Ginsburg B, Tropsha A, Isayev O. OpenChem: A Deep Learning Toolkit for Computational Chemistry and Drug Design. J Chem Inf Model 2021;61:7-13. [PMID: 33393291 DOI: 10.1021/acs.jcim.0c00971] [Citation(s) in RCA: 27] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]

Crystalline tetrazepam as a case study on the volume change on melting of molecular organic compounds. Int J Pharm 2021;593:120124. [DOI: 10.1016/j.ijpharm.2020.120124] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2020] [Revised: 11/23/2020] [Accepted: 11/23/2020] [Indexed: 11/22/2022]

Yuan J, Liu X, Wang S, Chang C, Zeng Q, Song Z, Jin Y, Zeng Q, Sun G, Ruan S, Greenwell C, Abramov YA. Virtual coformer screening by a combined machine learning and physics-based approach. CrystEngComm 2021. [DOI: 10.1039/d1ce00587a] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]

Affiliation(s)

Jiuchuang Yuan XtalPi Inc., Shenzhen Jingtai Technology Co., Ltd., Floor 3, Sf Industrial Plant, No. 2 hongliu Road, Fubao Community, Fubao Street, Futian District, Shenzhen, 518100 China
Xuetao Liu XtalPi Inc., Shenzhen Jingtai Technology Co., Ltd., Floor 3, Sf Industrial Plant, No. 2 hongliu Road, Fubao Community, Fubao Street, Futian District, Shenzhen, 518100 China Lab of Computational Chemistry and Drug Design, State Key Laboratory of Chemical Oncogeomics, Peking University Shenzhen Graduate School, Shenzhen, 518055 China
Simin Wang XtalPi Inc., Shenzhen Jingtai Technology Co., Ltd., Floor 3, Sf Industrial Plant, No. 2 hongliu Road, Fubao Community, Fubao Street, Futian District, Shenzhen, 518100 China
Chao Chang XtalPi Inc., Shenzhen Jingtai Technology Co., Ltd., Floor 3, Sf Industrial Plant, No. 2 hongliu Road, Fubao Community, Fubao Street, Futian District, Shenzhen, 518100 China
Qiao Zeng XtalPi Inc., Shenzhen Jingtai Technology Co., Ltd., Floor 3, Sf Industrial Plant, No. 2 hongliu Road, Fubao Community, Fubao Street, Futian District, Shenzhen, 518100 China
Zhengtian Song XtalPi Inc., Shenzhen Jingtai Technology Co., Ltd., Floor 3, Sf Industrial Plant, No. 2 hongliu Road, Fubao Community, Fubao Street, Futian District, Shenzhen, 518100 China
Yingdi Jin XtalPi Inc., Shenzhen Jingtai Technology Co., Ltd., Floor 3, Sf Industrial Plant, No. 2 hongliu Road, Fubao Community, Fubao Street, Futian District, Shenzhen, 518100 China
Qun Zeng XtalPi Inc., Shenzhen Jingtai Technology Co., Ltd., Floor 3, Sf Industrial Plant, No. 2 hongliu Road, Fubao Community, Fubao Street, Futian District, Shenzhen, 518100 China
Guangxu Sun XtalPi Inc., Shenzhen Jingtai Technology Co., Ltd., Floor 3, Sf Industrial Plant, No. 2 hongliu Road, Fubao Community, Fubao Street, Futian District, Shenzhen, 518100 China
Shigang Ruan XtalPi Inc., Shenzhen Jingtai Technology Co., Ltd., Floor 3, Sf Industrial Plant, No. 2 hongliu Road, Fubao Community, Fubao Street, Futian District, Shenzhen, 518100 China
Chandler Greenwell XtalPi Inc, Cambridge, Massachusetts 02142, USA
Yuriy A. Abramov XtalPi Inc, Cambridge, Massachusetts 02142, USA Eshelman School of Pharmacy, University of North Carolina, Chapel Hill, North Carolina 27599, USA

Collapse

Boobier S, Hose DRJ, Blacker AJ, Nguyen BN. Machine learning with physicochemical relationships: solubility prediction in organic solvents and water. Nat Commun 2020;11:5753. [PMID: 33188226 PMCID: PMC7666209 DOI: 10.1038/s41467-020-19594-z] [Citation(s) in RCA: 77] [Impact Index Per Article: 19.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2020] [Accepted: 10/12/2020] [Indexed: 11/09/2022] Open

Sivaraman G, Jackson NE, Sanchez-Lengeling B, Vázquez-Mayagoitia Á, Aspuru-Guzik A, Vishwanath V, de Pablo JJ. A machine learning workflow for molecular analysis: application to melting points. MACHINE LEARNING: SCIENCE AND TECHNOLOGY 2020. [DOI: 10.1088/2632-2153/ab8aa3] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open

Tinworth CP, Young RJ. Facts, Patterns, and Principles in Drug Discovery: Appraising the Rule of 5 with Measured Physicochemical Data. J Med Chem 2020;63:10091-10108. [PMID: 32324397 DOI: 10.1021/acs.jmedchem.9b01596] [Citation(s) in RCA: 54] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]

Mathieu D. QSPR versus fragment-based methods to predict octanol-air partition coefficients: Revisiting a recent comparison of both approaches. CHEMOSPHERE 2020;245:125584. [PMID: 31864054 DOI: 10.1016/j.chemosphere.2019.125584] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/22/2019] [Revised: 12/04/2019] [Accepted: 12/07/2019] [Indexed: 06/10/2023]

Karpov P, Godin G, Tetko IV. Transformer-CNN: Swiss knife for QSAR modeling and interpretation. J Cheminform 2020;12:17. [PMID: 33431004 PMCID: PMC7079452 DOI: 10.1186/s13321-020-00423-w] [Citation(s) in RCA: 107] [Impact Index Per Article: 26.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2019] [Accepted: 03/09/2020] [Indexed: 01/03/2023] Open

Chen G, Shen Z, Iyer A, Ghumman UF, Tang S, Bi J, Chen W, Li Y. Machine-Learning-Assisted De Novo Design of Organic Molecules and Polymers: Opportunities and Challenges. Polymers (Basel) 2020;12:E163. [PMID: 31936321 PMCID: PMC7023065 DOI: 10.3390/polym12010163] [Citation(s) in RCA: 53] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2019] [Revised: 12/27/2019] [Accepted: 01/02/2020] [Indexed: 12/18/2022] Open

Abstract

Organic molecules and polymers have a broad range of applications in biomedical, chemical, and materials science fields. Traditional design approaches for organic molecules and polymers are mainly experimentally-driven, guided by experience, intuition, and conceptual insights. Though they have been successfully applied to discover many important materials, these methods are facing significant challenges due to the tremendous demand of new materials and vast design space of organic molecules and polymers. Accelerated and inverse materials design is an ideal solution to these challenges. With advancements in high-throughput computation, artificial intelligence (especially machining learning, ML), and the growth of materials databases, ML-assisted materials design is emerging as a promising tool to flourish breakthroughs in many areas of materials science and engineering. To date, using ML-assisted approaches, the quantitative structure property/activity relation for material property prediction can be established more accurately and efficiently. In addition, materials design can be revolutionized and accelerated much faster than ever, through ML-enabled molecular generation and inverse molecular design. In this perspective, we review the recent progresses in ML-guided design of organic molecules and polymers, highlight several successful examples, and examine future opportunities in biomedical, chemical, and materials science fields. We further discuss the relevant challenges to solve in order to fully realize the potential of ML-assisted materials design for organic molecules and polymers. In particular, this study summarizes publicly available materials databases, feature representations for organic molecules, open-source tools for feature generation, methods for molecular generation, and ML models for prediction of material properties, which serve as a tutorial for researchers who have little experience with ML before and want to apply ML for various applications. Last but not least, it draws insights into the current limitations of ML-guided design of organic molecules and polymers. We anticipate that ML-assisted materials design for organic molecules and polymers will be the driving force in the near future, to meet the tremendous demand of new materials with tailored properties in different fields.

Collapse

Bunally SB, Luscombe CN, Young RJ. Using Physicochemical Measurements to Influence Better Compound Design. SLAS DISCOVERY 2019;24:791-801. [PMID: 31429385 DOI: 10.1177/2472555219859845] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]

Dalavitsou A, Vasiliadis A, Mordos MD, Kouskoura MG, Markopoulou CK. Analytes’ Structure and Signal Response in Evaporating Light Scattering Detector (ELSD). CURR ANAL CHEM 2019. [DOI: 10.2174/1573411014666180330161557] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]

Deng T, Jia GZ. Prediction of aqueous solubility of compounds based on neural network. Mol Phys 2019. [DOI: 10.1080/00268976.2019.1600754] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]

Palmblad M. Visual and Semantic Enrichment of Analytical Chemistry Literature Searches by Combining Text Mining and Computational Chemistry. Anal Chem 2019;91:4312-4316. [PMID: 30835438 PMCID: PMC6448173 DOI: 10.1021/acs.analchem.8b05818] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]

Deep learning for molecular generation. Future Med Chem 2019;11:567-597. [DOI: 10.4155/fmc-2018-0358] [Citation(s) in RCA: 63] [Impact Index Per Article: 12.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023] Open

Brown TN, Armitage JM, Arnot JA. Application of an Iterative Fragment Selection (IFS) Method to Estimate Entropies of Fusion and Melting Points of Organic Chemicals. Mol Inform 2019;38:e1800160. [PMID: 30816634 DOI: 10.1002/minf.201800160] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2018] [Accepted: 02/10/2019] [Indexed: 11/09/2022]

Sakkiah S, Guo W, Pan B, Kusko R, Tong W, Hong H. Computational prediction models for assessing endocrine disrupting potential of chemicals. JOURNAL OF ENVIRONMENTAL SCIENCE AND HEALTH. PART C, ENVIRONMENTAL CARCINOGENESIS & ECOTOXICOLOGY REVIEWS 2019;36:192-218. [PMID: 30633647 DOI: 10.1080/10590501.2018.1537132] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]

Liu R, Wallqvist A. Molecular Similarity-Based Domain Applicability Metric Efficiently Identifies Out-of-Domain Compounds. J Chem Inf Model 2018;59:181-189. [DOI: 10.1021/acs.jcim.8b00597] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]

Popova M, Isayev O, Tropsha A. Deep reinforcement learning for de novo drug design. SCIENCE ADVANCES 2018;4:eaap7885. [PMID: 30050984 PMCID: PMC6059760 DOI: 10.1126/sciadv.aap7885] [Citation(s) in RCA: 499] [Impact Index Per Article: 83.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/26/2017] [Accepted: 06/13/2018] [Indexed: 05/20/2023]

Withnall M, Chen H, Tetko IV. Matched Molecular Pair Analysis on Large Melting Point Datasets: A Big Data Perspective. ChemMedChem 2018;13:599-606. [PMID: 28650584 PMCID: PMC5900986 DOI: 10.1002/cmdc.201700303] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2017] [Revised: 06/26/2017] [Indexed: 11/11/2022]

Tebes-Stevens C, Patel JM, Koopmans M, Olmstead J, Hilal SH, Pope N, Weber EJ, Wolfe K. Demonstration of a consensus approach for the calculation of physicochemical properties required for environmental fate assessments. CHEMOSPHERE 2018;194:94-106. [PMID: 29197820 PMCID: PMC6146973 DOI: 10.1016/j.chemosphere.2017.11.137] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/05/2017] [Revised: 11/21/2017] [Accepted: 11/22/2017] [Indexed: 05/21/2023]

Mathieu D. Atom Pair Contribution Method: Fast and General Procedure To Predict Molecular Formation Enthalpies. J Chem Inf Model 2018;58:12-26. [DOI: 10.1021/acs.jcim.7b00613] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

Mathieu D. Solubility of organic compounds in octanol: Improved predictions based on the geometrical fragment approach. CHEMOSPHERE 2017;182:399-405. [PMID: 28511135 DOI: 10.1016/j.chemosphere.2017.05.045] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/07/2017] [Revised: 05/05/2017] [Accepted: 05/08/2017] [Indexed: 06/07/2023]

Coley CW, Barzilay R, Green WH, Jaakkola TS, Jensen KF. Convolutional Embedding of Attributed Molecular Graphs for Physical Property Prediction. J Chem Inf Model 2017;57:1757-1772. [PMID: 28696688 DOI: 10.1021/acs.jcim.6b00601] [Citation(s) in RCA: 220] [Impact Index Per Article: 31.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]

Mathieu D, Bouteloup R. Reliable and Versatile Model for the Density of Liquids Based on Additive Volume Increments. Ind Eng Chem Res 2016. [DOI: 10.1021/acs.iecr.6b03809] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023]

Tetko IV, Maran U, Tropsha A. Public (Q)SAR Services, Integrated Modeling Environments, and Model Repositories on the Web: State of the Art and Perspectives for Future Development. Mol Inform 2016;36. [PMID: 27778468 DOI: 10.1002/minf.201600082] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2016] [Accepted: 10/03/2016] [Indexed: 01/08/2023]

Does ‘Big Data’ exist in medicinal chemistry, and if so, how can it be harnessed? Future Med Chem 2016;8:1801-1806. [DOI: 10.4155/fmc-2016-0163] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023] Open

Tetko IV, Engkvist O, Koch U, Reymond JL, Chen H. BIGCHEM: Challenges and Opportunities for Big Data Analysis in Chemistry. Mol Inform 2016;35:615-621. [PMID: 27464907 PMCID: PMC5129546 DOI: 10.1002/minf.201600073] [Citation(s) in RCA: 68] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2016] [Accepted: 07/06/2016] [Indexed: 01/19/2023]

Baskin II, Winkler D, Tetko IV. A renaissance of neural networks in drug discovery. Expert Opin Drug Discov 2016;11:785-95. [PMID: 27295548 DOI: 10.1080/17460441.2016.1201262] [Citation(s) in RCA: 123] [Impact Index Per Article: 15.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]

Mansouri K, Abdelaziz A, Rybacka A, Roncaglioni A, Tropsha A, Varnek A, Zakharov A, Worth A, Richard AM, Grulke CM, Trisciuzzi D, Fourches D, Horvath D, Benfenati E, Muratov E, Wedebye EB, Grisoni F, Mangiatordi GF, Incisivo GM, Hong H, Ng HW, Tetko IV, Balabin I, Kancherla J, Shen J, Burton J, Nicklaus M, Cassotti M, Nikolov NG, Nicolotti O, Andersson PL, Zang Q, Politi R, Beger RD, Todeschini R, Huang R, Farag S, Rosenberg SA, Slavov S, Hu X, Judson RS. CERAPP: Collaborative Estrogen Receptor Activity Prediction Project. ENVIRONMENTAL HEALTH PERSPECTIVES 2016;124:1023-33. [PMID: 26908244 PMCID: PMC4937869 DOI: 10.1289/ehp.1510267] [Citation(s) in RCA: 222] [Impact Index Per Article: 27.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/27/2015] [Revised: 10/05/2015] [Accepted: 02/08/2016] [Indexed: 05/18/2023]

Abstract

BACKGROUND

Humans are exposed to thousands of man-made chemicals in the environment. Some chemicals mimic natural endocrine hormones and, thus, have the potential to be endocrine disruptors. Most of these chemicals have never been tested for their ability to interact with the estrogen receptor (ER). Risk assessors need tools to prioritize chemicals for evaluation in costly in vivo tests, for instance, within the U.S. EPA Endocrine Disruptor Screening Program.

OBJECTIVES

We describe a large-scale modeling project called CERAPP (Collaborative Estrogen Receptor Activity Prediction Project) and demonstrate the efficacy of using predictive computational models trained on high-throughput screening data to evaluate thousands of chemicals for ER-related activity and prioritize them for further testing.

METHODS

CERAPP combined multiple models developed in collaboration with 17 groups in the United States and Europe to predict ER activity of a common set of 32,464 chemical structures. Quantitative structure-activity relationship models and docking approaches were employed, mostly using a common training set of 1,677 chemical structures provided by the U.S. EPA, to build a total of 40 categorical and 8 continuous models for binding, agonist, and antagonist ER activity. All predictions were evaluated on a set of 7,522 chemicals curated from the literature. To overcome the limitations of single models, a consensus was built by weighting models on scores based on their evaluated accuracies.

RESULTS

Individual model scores ranged from 0.69 to 0.85, showing high prediction reliabilities. Out of the 32,464 chemicals, the consensus model predicted 4,001 chemicals (12.3%) as high priority actives and 6,742 potential actives (20.8%) to be considered for further testing.

CONCLUSION

This project demonstrated the possibility to screen large libraries of chemicals using a consensus of different in silico approaches. This concept will be applied in future projects related to other end points.

CITATION

Mansouri K, Abdelaziz A, Rybacka A, Roncaglioni A, Tropsha A, Varnek A, Zakharov A, Worth A, Richard AM, Grulke CM, Trisciuzzi D, Fourches D, Horvath D, Benfenati E, Muratov E, Wedebye EB, Grisoni F, Mangiatordi GF, Incisivo GM, Hong H, Ng HW, Tetko IV, Balabin I, Kancherla J, Shen J, Burton J, Nicklaus M, Cassotti M, Nikolov NG, Nicolotti O, Andersson PL, Zang Q, Politi R, Beger RD, Todeschini R, Huang R, Farag S, Rosenberg SA, Slavov S, Hu X, Judson RS. 2016.

CERAPP

Collaborative Estrogen Receptor Activity Prediction Project. Environ Health Perspect 124:1023-1033; http://dx.doi.org/10.1289/ehp.1510267.

Collapse

Affiliation(s)

Kamel Mansouri National Center for Computational Toxicology, U.S. Environmental Protection Agency, Research Triangle Park, North Carolina, USA Oak Ridge Institute for Science and Education, Oak Ridge, Tennessee, USA
Ahmed Abdelaziz Institute of Structural Biology, Helmholtz Zentrum Muenchen-German Research Center for Environmental Health (GmbH), Neuherberg, Germany
Aleksandra Rybacka Chemistry Department, Umeå University, Umeå, Sweden
Alessandra Roncaglioni Environmental Chemistry and Toxicology Laboratory, IRCCS (Istituto di Ricovero e Cura a Carattere Scientifico)-Istituto di Ricerche Farmacologiche Mario Negri, Milan, Italy
Alexander Tropsha Laboratory for Molecular Modeling, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
Alexandre Varnek Laboratoire de Chemoinformatique, University of Strasbourg, Strasbourg, France
Alexey Zakharov National Cancer Institute, National Institutes of Health (NIH), Department of Health and Human Services (DHHS), Bethesda, Maryland, USA
Andrew Worth Institute for Health and Consumer Protection (IHCP), Joint Research Centre of the European Commission in Ispra, Ispra, Italy
Ann M. Richard National Center for Computational Toxicology, U.S. Environmental Protection Agency, Research Triangle Park, North Carolina, USA
Christopher M. Grulke National Center for Computational Toxicology, U.S. Environmental Protection Agency, Research Triangle Park, North Carolina, USA
Daniela Trisciuzzi Department of Pharmacy-Drug Sciences, University of Bari, Bari, Italy
Denis Fourches Laboratory for Molecular Modeling, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
Dragos Horvath Laboratoire de Chemoinformatique, University of Strasbourg, Strasbourg, France
Emilio Benfenati Environmental Chemistry and Toxicology Laboratory, IRCCS (Istituto di Ricovero e Cura a Carattere Scientifico)-Istituto di Ricerche Farmacologiche Mario Negri, Milan, Italy
Eugene Muratov Laboratory for Molecular Modeling, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
Eva Bay Wedebye Division of Toxicology and Risk Assessment, National Food Institute, Technical University of Denmark, Copenhagen, Denmark
Francesca Grisoni Milano Chemometrics and QSAR Research Group, University of Milano-Bicocca, Milan, Italy
Giuseppe F. Mangiatordi Department of Pharmacy-Drug Sciences, University of Bari, Bari, Italy
Giuseppina M. Incisivo Environmental Chemistry and Toxicology Laboratory, IRCCS (Istituto di Ricovero e Cura a Carattere Scientifico)-Istituto di Ricerche Farmacologiche Mario Negri, Milan, Italy
Huixiao Hong Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, U.S. Food and Drug Administration (USDA), Jefferson, Arizona, USA
Hui W. Ng Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, U.S. Food and Drug Administration (USDA), Jefferson, Arizona, USA
Igor V. Tetko Institute of Structural Biology, Helmholtz Zentrum Muenchen-German Research Center for Environmental Health (GmbH), Neuherberg, Germany BigChem GmbH, Neuherberg, Germany
Ilya Balabin High Performance Computing, Lockheed Martin, Research Triangle Park, North Carolina, USA
Jayaram Kancherla National Center for Computational Toxicology, U.S. Environmental Protection Agency, Research Triangle Park, North Carolina, USA
Jie Shen Research Institute for Fragrance Materials, Inc., Woodcliff Lake, New Jersey, USA
Julien Burton Institute for Health and Consumer Protection (IHCP), Joint Research Centre of the European Commission in Ispra, Ispra, Italy
Marc Nicklaus National Cancer Institute, National Institutes of Health (NIH), Department of Health and Human Services (DHHS), Bethesda, Maryland, USA
Matteo Cassotti Milano Chemometrics and QSAR Research Group, University of Milano-Bicocca, Milan, Italy
Nikolai G. Nikolov Division of Toxicology and Risk Assessment, National Food Institute, Technical University of Denmark, Copenhagen, Denmark
Orazio Nicolotti Department of Pharmacy-Drug Sciences, University of Bari, Bari, Italy
Patrik L. Andersson Chemistry Department, Umeå University, Umeå, Sweden
Qingda Zang Integrated Laboratory Systems, Inc., Research Triangle Park, North Carolina, USA
Regina Politi Laboratory for Molecular Modeling, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
Richard D. Beger Division of Systems Biology, National Center for Toxicological Research, USDA, Jefferson, Arizona, USA
Roberto Todeschini Milano Chemometrics and QSAR Research Group, University of Milano-Bicocca, Milan, Italy
Ruili Huang National Center for Advancing Translational Sciences, NIH, DHHS, Bethesda, Maryland, USA
Sherif Farag Laboratory for Molecular Modeling, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
Sine A. Rosenberg Division of Toxicology and Risk Assessment, National Food Institute, Technical University of Denmark, Copenhagen, Denmark
Svetoslav Slavov Integrated Laboratory Systems, Inc., Research Triangle Park, North Carolina, USA
Xin Hu National Center for Advancing Translational Sciences, NIH, DHHS, Bethesda, Maryland, USA
Richard S. Judson National Center for Computational Toxicology, U.S. Environmental Protection Agency, Research Triangle Park, North Carolina, USA Address correspondence to R.S. Judson, U.S. EPA, National Center for Computational Toxicology, 109 T.W. Alexander Dr., Research Triangle Park, NC 27711 USA. Telephone: (919) 541-3085. E-mail:

Collapse

Mathieu D. Physics-Based Modeling of Chemical Hazards in a Regulatory Framework: Comparison with Quantitative Structure–Property Relationship (QSPR) Methods for Impact Sensitivities. Ind Eng Chem Res 2016. [DOI: 10.1021/acs.iecr.6b01536] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]

Novotarskyi S, Abdelaziz A, Sushko Y, Körner R, Vogt J, Tetko IV. ToxCast EPA in Vitro to in Vivo Challenge: Insight into the Rank-I Model. Chem Res Toxicol 2016;29:768-75. [PMID: 27120770 PMCID: PMC5413193 DOI: 10.1021/acs.chemrestox.5b00481] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]

Buchholz H, Emel'yanenko VN, Lorenz H, Verevkin SP. An Examination of the Phase Transition Thermodynamics of (S)- and (RS)-Naproxen as a Basis for the Design of Enantioselective Crystallization Processes. J Pharm Sci 2016;105:1676-1683. [PMID: 27056629 DOI: 10.1016/j.xphs.2016.02.032] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2015] [Revised: 02/03/2016] [Accepted: 02/16/2016] [Indexed: 10/22/2022]

Tetko IV, Varbanov HP, Galanski MS, Talmaciu M, Platts JA, Ravera M, Gabano E. Prediction of logP for Pt(II) and Pt(IV) complexes: Comparison of statistical and quantum-chemistry based approaches. J Inorg Biochem 2016;156:1-13. [PMID: 26717258 DOI: 10.1016/j.jinorgbio.2015.12.006] [Citation(s) in RCA: 37] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2015] [Revised: 11/19/2015] [Accepted: 12/09/2015] [Indexed: 01/31/2023]

Tetko IV, M. Lowe D, Williams AJ. The development of models to predict melting and pyrolysis point data associated with several hundred thousand compounds mined from PATENTS. J Cheminform 2016;8:2. [PMID: 26807157 PMCID: PMC4724158 DOI: 10.1186/s13321-016-0113-y] [Citation(s) in RCA: 50] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2015] [Accepted: 01/08/2016] [Indexed: 11/18/2022] Open

Abstract

BACKGROUND

Melting point (MP) is an important property in regards to the solubility of chemical compounds. Its prediction from chemical structure remains a highly challenging task for quantitative structure-activity relationship studies. Success in this area of research critically depends on the availability of high quality MP data as well as accurate chemical structure representations in order to develop models. Currently, available datasets for MP predictions have been limited to around 50k molecules while lots more data are routinely generated following the synthesis of novel materials. Significant amounts of MP data are freely available within the patent literature and, if it were available in the appropriate form, could potentially be used to develop predictive models.

RESULTS

We have developed a pipeline for the automated extraction and annotation of chemical data from published PATENTS. Almost 300,000 data points have been collected and used to develop models to predict melting and pyrolysis (decomposition) points using tools available on the OCHEM modeling platform (http://ochem.eu). A number of technical challenges were simultaneously solved to develop models based on these data. These included the handing of sparse data matrices with >200,000,000,000 entries and parallel calculations using 32 × 6 cores per task using 13 descriptor sets totaling more than 700,000 descriptors. We showed that models developed using data collected from PATENTS had similar or better prediction accuracy compared to the highly curated data used in previous publications. The separation of data for chemicals that decomposed rather than melting, from compounds that did undergo a normal melting transition, was performed and models for both pyrolysis and MPs were developed. The accuracy of the consensus MP models for molecules from the drug-like region of chemical space was similar to their estimated experimental accuracy, 32 °C. Last but not least, important structural features related to the pyrolysis of chemicals were identified, and a model to predict whether a compound will decompose instead of melting was developed.

CONCLUSIONS

We have shown that automated tools for the analysis of chemical information have reached a mature stage allowing for the extraction and collection of high quality data to enable the development of structure-activity relationship models. The developed models and data are publicly available at http://ochem.eu/article/99826.

Collapse

Salmina ES, Haider N, Tetko IV. Extended Functional Groups (EFG): An Efficient Set for Chemical Characterization and Structure-Activity Relationship Studies of Chemical Compounds. Molecules 2015;21:E1. [PMID: 26703557 PMCID: PMC6273096 DOI: 10.3390/molecules21010001] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2015] [Revised: 12/09/2015] [Accepted: 12/15/2015] [Indexed: 11/16/2022] Open

Tales from the war on error: the art and science of curating QSAR data. J Comput Aided Mol Des 2015;29:897-910. [PMID: 26290258 DOI: 10.1007/s10822-015-9865-0] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2015] [Accepted: 08/07/2015] [Indexed: 10/23/2022]