Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

Number

Cited by Other Article(s)

Lourenço MP, Hostaš J, Bellinger C, Tchagang A, Salahub DR. Reinforcement learning for in silico determination of adsorbate-substrate structures. J Comput Chem 2024;45:1289-1302. [PMID: 38357973 DOI: 10.1002/jcc.27322] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2023] [Revised: 01/18/2024] [Accepted: 01/22/2024] [Indexed: 02/16/2024]

Das M, Ghosh A, Sunoj RB. Advances in machine learning with chemical language models in molecular property and reaction outcome predictions. J Comput Chem 2024;45:1160-1176. [PMID: 38299229 DOI: 10.1002/jcc.27315] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2023] [Revised: 01/06/2024] [Accepted: 01/09/2024] [Indexed: 02/02/2024]

Abstract

Molecular properties and reactions form the foundation of chemical space. Over the years, innumerable molecules have been synthesized, a smaller fraction of them found immediate applications, while a larger proportion served as a testimony to creative and empirical nature of the domain of chemical science. With increasing emphasis on sustainable practices, it is desirable that a target set of molecules are synthesized preferably through a fewer empirical attempts instead of a larger library, to realize an active candidate. In this front, predictive endeavors using machine learning (ML) models built on available data acquire high timely significance. Prediction of molecular property and reaction outcome remain one of the burgeoning applications of ML in chemical science. Among several methods of encoding molecular samples for ML models, the ones that employ language like representations are gaining steady popularity. Such representations would additionally help adopt well-developed natural language processing (NLP) models for chemical applications. Given this advantageous background, herein we describe several successful chemical applications of NLP focusing on molecular property and reaction outcome predictions. From relatively simpler recurrent neural networks (RNNs) to complex models like transformers, different network architecture have been leveraged for tasks such as de novo drug design, catalyst generation, forward and retro-synthesis predictions. The chemical language model (CLM) provides promising avenues toward a broad range of applications in a time and cost-effective manner. While we showcase an optimistic outlook of CLMs, attention is also placed on the persisting challenges in reaction domain, which would optimistically be addressed by advanced algorithms tailored to chemical language and with increased availability of high-quality datasets.

Collapse

Wang G, Wang C, Zhang X, Li Z, Zhou J, Sun Z. Machine learning interatomic potential: Bridge the gap between small-scale models and realistic device-scale simulations. iScience 2024;27:109673. [PMID: 38646181 PMCID: PMC11033164 DOI: 10.1016/j.isci.2024.109673] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/23/2024] Open

Ge M, Pan Y, Liu X, Zhao Z, Su D. Automatic center identification of electron diffraction with multi-scale transformer networks. Ultramicroscopy 2024;259:113926. [PMID: 38310650 DOI: 10.1016/j.ultramic.2024.113926] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2023] [Revised: 12/08/2023] [Accepted: 01/21/2024] [Indexed: 02/06/2024]

Huang Z, Wang Y, Li C, He H. Growing Like a Tree: Finding Trunks From Graph Skeleton Trees. IEEE Trans Pattern Anal Mach Intell 2024;46:2838-2851. [PMID: 38015698 DOI: 10.1109/tpami.2023.3336315] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/30/2023]

Wang HE, Triebkorn P, Breyton M, Dollomaja B, Lemarechal JD, Petkoski S, Sorrentino P, Depannemaecker D, Hashemi M, Jirsa VK. Virtual brain twins: from basic neuroscience to clinical use. Natl Sci Rev 2024;11:nwae079. [PMID: 38698901 PMCID: PMC11065363 DOI: 10.1093/nsr/nwae079] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2023] [Revised: 02/05/2024] [Accepted: 02/20/2024] [Indexed: 05/05/2024] Open

Affiliation(s)

Huifang E Wang Aix Marseille Université, Institut National de la Santé et de la Recherche Médicale, Institut de Neurosciences des Systèmes (INS) UMR1106; Marseille 13005, France
Paul Triebkorn Aix Marseille Université, Institut National de la Santé et de la Recherche Médicale, Institut de Neurosciences des Systèmes (INS) UMR1106; Marseille 13005, France
Martin Breyton Aix Marseille Université, Institut National de la Santé et de la Recherche Médicale, Institut de Neurosciences des Systèmes (INS) UMR1106; Marseille 13005, France Service de Pharmacologie Clinique et Pharmacosurveillance, AP–HM, Marseille, 13005, France
Borana Dollomaja Aix Marseille Université, Institut National de la Santé et de la Recherche Médicale, Institut de Neurosciences des Systèmes (INS) UMR1106; Marseille 13005, France
Jean-Didier Lemarechal Aix Marseille Université, Institut National de la Santé et de la Recherche Médicale, Institut de Neurosciences des Systèmes (INS) UMR1106; Marseille 13005, France
Spase Petkoski Aix Marseille Université, Institut National de la Santé et de la Recherche Médicale, Institut de Neurosciences des Systèmes (INS) UMR1106; Marseille 13005, France
Pierpaolo Sorrentino Aix Marseille Université, Institut National de la Santé et de la Recherche Médicale, Institut de Neurosciences des Systèmes (INS) UMR1106; Marseille 13005, France
Damien Depannemaecker Aix Marseille Université, Institut National de la Santé et de la Recherche Médicale, Institut de Neurosciences des Systèmes (INS) UMR1106; Marseille 13005, France
Meysam Hashemi Aix Marseille Université, Institut National de la Santé et de la Recherche Médicale, Institut de Neurosciences des Systèmes (INS) UMR1106; Marseille 13005, France
Viktor K Jirsa Aix Marseille Université, Institut National de la Santé et de la Recherche Médicale, Institut de Neurosciences des Systèmes (INS) UMR1106; Marseille 13005, France

Collapse

Ahmadi M, Alizadeh B, Ayyoubzadeh SM, Abiyarghamsari M. Predicting Pharmacokinetics of Drugs Using Artificial Intelligence Tools: A Systematic Review. Eur J Drug Metab Pharmacokinet 2024;49:249-262. [PMID: 38457092 DOI: 10.1007/s13318-024-00883-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/29/2024] [Indexed: 03/09/2024]

Abstract

BACKGROUND AND OBJECTIVE

Pharmacokinetic studies encompass the examination of the absorption, distribution, metabolism, and excretion of bioactive compounds. The pharmacokinetics of drugs exert a substantial influence on their efficacy and safety. Consequently, the investigation of pharmacokinetics holds great importance. However, laboratory-based assessment necessitates the use of numerous animals, various materials, and significant time. To mitigate these challenges, alternative methods such as artificial intelligence have emerged as a promising approach. This systematic review aims to review existing studies, focusing on the application of artificial intelligence tools in predicting the pharmacokinetics of drugs.

METHODS

A pre-prepared search strategy based on related keywords was used to search different databases (PubMed, Scopus, Web of Science). The process involved combining articles, eliminating duplicates, and screening articles based on their titles, abstracts, and full text. Articles were selected based on inclusion and exclusion criteria. Then, the quality of the included articles was assessed using an appraisal tool.

RESULTS

Ultimately, 23 relevant articles were included in this study. The clearance parameter received the highest level of investigation, followed by the area under the concentration-time curve (AUC) parameter, in pharmacokinetic studies. Among the various models employed in the articles, Random Forest and eXtreme Gradient Boosting (XGBoost) emerged as the most commonly utilized ones. Generalized Linear Models and Elastic Nets (GLMnet) and Random Forest models showed the most performance in predicting clearance.

CONCLUSION

Overall, artificial intelligence tools offer a robust, rapid, and precise means of predicting various pharmacokinetic parameters based on a dataset containing information of patients or drugs.

Collapse

Smart SE, Welakuh DM, Narang P. Many-Body Excited States with a Contracted Quantum Eigensolver. J Chem Theory Comput 2024. [PMID: 38693607 DOI: 10.1021/acs.jctc.4c00030] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/03/2024]

Ni HC, Yuan R, Zhang J, Zuo JM. Framework of compressive sensing and data compression for 4D-STEM. Ultramicroscopy 2024;259:113938. [PMID: 38359632 DOI: 10.1016/j.ultramic.2024.113938] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2023] [Revised: 01/28/2024] [Accepted: 02/08/2024] [Indexed: 02/17/2024]

Quetin S, Bahoric B, Maleki F, Enger SA. Deep learning for high-resolution dose prediction in high dose rate brachytherapy for breast cancer treatment. Phys Med Biol 2024;69:105011. [PMID: 38604185 DOI: 10.1088/1361-6560/ad3dbd] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2023] [Accepted: 04/11/2024] [Indexed: 04/13/2024]

Abstract

Objective.Monte Carlo (MC) simulations are the benchmark for accurate radiotherapy dose calculations, notably in patient-specific high dose rate brachytherapy (HDR BT), in cases where considering tissue heterogeneities is critical. However, the lengthy computational time limits the practical application of MC simulations. Prior research used deep learning (DL) for dose prediction as an alternative to MC simulations. While accurate dose predictions akin to MC were attained, graphics processing unit limitations constrained these predictions to large voxels of 3 mm × 3 mm × 3 mm. This study aimed to enable dose predictions as accurate as MC simulations in 1 mm × 1 mm × 1 mm voxels within a clinically acceptable timeframe.Approach.Computed tomography scans of 98 breast cancer patients treated with Iridium-192-based HDR BT were used: 70 for training, 14 for validation, and 14 for testing. A new cropping strategy based on the distance to the seed was devised to reduce the volume size, enabling efficient training of 3D DL models using 1 mm × 1 mm × 1 mm dose grids. Additionally, novel DL architecture with layer-level fusion were proposed to predict MC simulated dose to medium-in-medium (Dm,m). These architectures fuse information from TG-43 dose to water-in-water (Dw,w) with patient tissue composition at the layer-level. Different inputs describing patient body composition were investigated.Main results.The proposed approach demonstrated state-of-the-art performance, on par with the MCDm,mmaps, but 300 times faster. The mean absolute percent error for dosimetric indices between the MC and DL-predicted complete treatment plans was 0.17% ± 0.15% for the planning target volumeV100, 0.30% ± 0.32% for the skinD2cc, 0.82% ± 0.79% for the lungD2cc, 0.34% ± 0.29% for the chest wallD2ccand 1.08% ± 0.98% for the heartD2cc.Significance.Unlike the time-consuming MC simulations, the proposed novel strategy efficiently converts TG-43Dw,wmaps into preciseDm,mmaps at high resolution, enabling clinical integration.

Collapse

Harris SB, Biswas A, Yun SJ, Roccapriore KM, Rouleau CM, Puretzky AA, Vasudevan RK, Geohegan DB, Xiao K. Autonomous Synthesis of Thin Film Materials with Pulsed Laser Deposition Enabled by In Situ Spectroscopy and Automation. Small Methods 2024:e2301763. [PMID: 38678523 DOI: 10.1002/smtd.202301763] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/20/2023] [Revised: 04/10/2024] [Indexed: 05/01/2024]

Schwerdtfeger P, Wales DJ. 100 Years of the Lennard-Jones Potential. J Chem Theory Comput 2024. [PMID: 38669689 DOI: 10.1021/acs.jctc.4c00135] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/28/2024]

Roche ST, Bayer Q, Carlson BT, Ouligian WC, Serhiayenka P, Stelzer J, Hong TM. Nanosecond anomaly detection with decision trees and real-time application to exotic Higgs decays. Nat Commun 2024;15:3527. [PMID: 38664390 PMCID: PMC11045859 DOI: 10.1038/s41467-024-47704-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2023] [Accepted: 04/09/2024] [Indexed: 04/28/2024] Open

van Tilborg D, Brinkmann H, Criscuolo E, Rossen L, Özçelik R, Grisoni F. Deep learning for low-data drug discovery: Hurdles and opportunities. Curr Opin Struct Biol 2024;86:102818. [PMID: 38669740 DOI: 10.1016/j.sbi.2024.102818] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2024] [Revised: 03/27/2024] [Accepted: 03/29/2024] [Indexed: 04/28/2024]

Doucet M, Candeago R, Wang H, Browning JF, Su X. Studying Transient Phenomena in Thin Films with Reinforcement Learning. J Phys Chem Lett 2024;15:4444-4450. [PMID: 38626466 DOI: 10.1021/acs.jpclett.4c00467] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/18/2024]

Ge F, Wang R, Qu C, Zheng P, Nandi A, Conte R, Houston PL, Bowman JM, Dral PO. Tell Machine Learning Potentials What They Are Needed For: Simulation-Oriented Training Exemplified for Glycine. J Phys Chem Lett 2024;15:4451-4460. [PMID: 38626460 DOI: 10.1021/acs.jpclett.4c00746] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/18/2024]

Affiliation(s)

Fuchun Ge State Key Laboratory of Physical Chemistry of Solid Surfaces, College of Chemistry and Chemical Engineering, Fujian Provincial Key Laboratory of Theoretical and Computational Chemistry, and Innovation Laboratory for Sciences and Technologies of Energy Materials of Fujian Province (IKKEM), Xiamen University, Xiamen, Fujian 361005, China
Ran Wang State Key Laboratory of Physical Chemistry of Solid Surfaces, College of Chemistry and Chemical Engineering, Fujian Provincial Key Laboratory of Theoretical and Computational Chemistry, and Innovation Laboratory for Sciences and Technologies of Energy Materials of Fujian Province (IKKEM), Xiamen University, Xiamen, Fujian 361005, China
Chen Qu Independent Researcher, Toronto, Ontario M9B0E3, Canada
Peikun Zheng State Key Laboratory of Physical Chemistry of Solid Surfaces, College of Chemistry and Chemical Engineering, Fujian Provincial Key Laboratory of Theoretical and Computational Chemistry, and Innovation Laboratory for Sciences and Technologies of Energy Materials of Fujian Province (IKKEM), Xiamen University, Xiamen, Fujian 361005, China
Apurba Nandi Department of Chemistry and Cherry L. Emerson Center for Scientific Computation, Emory University, Atlanta, Georgia 30322, United States Department of Physics and Materials Science, University of Luxembourg, Luxembourg City L-1511, Luxembourg
Riccardo Conte Dipartimento di Chimica, Università degli Studi di Milano, via Golgi 19, 20133 Milano, Italy
Paul L Houston Department of Chemistry and Chemical Biology, Cornell University, Ithaca, New York 14853, United States
Joel M Bowman Department of Chemistry and Cherry L. Emerson Center for Scientific Computation, Emory University, Atlanta, Georgia 30322, United States
Pavlo O Dral State Key Laboratory of Physical Chemistry of Solid Surfaces, College of Chemistry and Chemical Engineering, Fujian Provincial Key Laboratory of Theoretical and Computational Chemistry, and Innovation Laboratory for Sciences and Technologies of Energy Materials of Fujian Province (IKKEM), Xiamen University, Xiamen, Fujian 361005, China

Collapse

Boldini D, Friedrich L, Kuhn D, Sieber SA. Machine Learning Assisted Hit Prioritization for High Throughput Screening in Drug Discovery. ACS Cent Sci 2024;10:823-832. [PMID: 38680560 PMCID: PMC11046457 DOI: 10.1021/acscentsci.3c01517] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/07/2023] [Revised: 03/01/2024] [Accepted: 03/01/2024] [Indexed: 05/01/2024]

Truex N, Mohapatra S, Melo M, Rodriguez J, Li N, Abraham W, Sementa D, Touti F, Keskin DB, Wu CJ, Irvine DJ, Gómez-Bombarelli R, Pentelute BL. Design of Cytotoxic T Cell Epitopes by Machine Learning of Human Degrons. ACS Cent Sci 2024;10:793-802. [PMID: 38680558 PMCID: PMC11046456 DOI: 10.1021/acscentsci.3c01544] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/11/2023] [Revised: 02/13/2024] [Accepted: 02/16/2024] [Indexed: 05/01/2024]

Affiliation(s)

Nicholas L. Truex Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States Department of Chemistry and Biochemistry, University of South Carolina, Columbia, South Carolina 29208, United States
Somesh Mohapatra Department of Materials Science and Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States Machine Intelligence and Manufacturing Operations Group, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
Mariane Melo The Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, Massachusetts 02142, United States Ragon Institute of Massachusetts General Hospital, Massachusetts Institute of Technology, and Harvard University, Cambridge, Massachusetts 02139, United States
Jacob Rodriguez Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
Na Li The Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, Massachusetts 02142, United States
Wuhbet Abraham The Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, Massachusetts 02142, United States
Deborah Sementa Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
Faycal Touti Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
Derin B. Keskin Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, Massachusetts 02215, United States Harvard Medical School, Boston, Massachusetts 02115, United States Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, United States Translational Immunogenomics Laboratory (TIGL), Dana-Farber Cancer Institute, Boston, Massachusetts 02215, United States Department of Computer Science, Metropolitan College, Boston University, Boston, Massachusetts 02215, United States Section for Bioinformatics, Department of Health Technology, Technical University of Denmark, Lyngby DK-2800, Denmark
Catherine J. Wu Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, Massachusetts 02215, United States Harvard Medical School, Boston, Massachusetts 02115, United States Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, United States Department of Medicine, Brigham and Women’s Hospital, Boston, Massachusetts 02115, United States
Darrell J. Irvine Department of Materials Science and Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States The Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, Massachusetts 02142, United States Ragon Institute of Massachusetts General Hospital, Massachusetts Institute of Technology, and Harvard University, Cambridge, Massachusetts 02139, United States Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States Howard Hughes Medical Institute, Chevy Chase, Maryland 20815, United States
Rafael Gómez-Bombarelli Department of Materials Science and Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
Bradley L. Pentelute Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States The Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, Massachusetts 02142, United States Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, United States Center for Environmental Health Sciences, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States

Collapse

Margraf JT. Neural graph distance embedding for molecular geometry generation. J Comput Chem 2024. [PMID: 38655845 DOI: 10.1002/jcc.27349] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2023] [Revised: 03/05/2024] [Accepted: 03/08/2024] [Indexed: 04/26/2024]

Gangwal A, Lavecchia A. Unlocking the potential of generative AI in drug discovery. Drug Discov Today 2024:103992. [PMID: 38663579 DOI: 10.1016/j.drudis.2024.103992] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2024] [Revised: 03/22/2024] [Accepted: 04/18/2024] [Indexed: 05/04/2024]

Shakiba M, Akimov AV. Machine-Learned Kohn-Sham Hamiltonian Mapping for Nonadiabatic Molecular Dynamics. J Chem Theory Comput 2024;20:2992-3007. [PMID: 38581699 DOI: 10.1021/acs.jctc.4c00008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/08/2024]

France-Lanord A, Vroylandt H, Salanne M, Rotenberg B, Saitta AM, Pietrucci F. Data-Driven Path Collective Variables. J Chem Theory Comput 2024;20:3069-3084. [PMID: 38619076 DOI: 10.1021/acs.jctc.4c00123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/16/2024]

Wang Y, Chen H, Xie L, Liu J, Zhang L, Yu J. Swarm Autonomy: From Agent Functionalization to Machine Intelligence. Adv Mater 2024:e2312956. [PMID: 38653192 DOI: 10.1002/adma.202312956] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/30/2023] [Revised: 04/17/2024] [Indexed: 04/25/2024]

Schmidt B, Hildebrandt A. From GPUs to AI and quantum: three waves of acceleration in bioinformatics. Drug Discov Today 2024;29:103990. [PMID: 38663581 DOI: 10.1016/j.drudis.2024.103990] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2023] [Revised: 04/05/2024] [Accepted: 04/17/2024] [Indexed: 05/01/2024]

Meewan I, Panmanee J, Petchyam N, Lertvilai P. HBCVTr: an end-to-end transformer with a deep neural network hybrid model for anti-HBV and HCV activity predictor from SMILES. Sci Rep 2024;14:9262. [PMID: 38649402 PMCID: PMC11035669 DOI: 10.1038/s41598-024-59933-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2024] [Accepted: 04/16/2024] [Indexed: 04/25/2024] Open

Westerlund AM, Manohar Koki S, Kancharla S, Tibo A, Saigiridharan L, Kabeshov M, Mercado R, Genheden S. Do Chemformers Dream of Organic Matter? Evaluating a Transformer Model for Multistep Retrosynthesis. J Chem Inf Model 2024;64:3021-3033. [PMID: 38602390 DOI: 10.1021/acs.jcim.3c01685] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/12/2024]

Gallegos M, Isamura BK, Popelier PLA, Martín Pendás Á. An Unsupervised Machine Learning Approach for the Automatic Construction of Local Chemical Descriptors. J Chem Inf Model 2024;64:3059-3079. [PMID: 38498942 DOI: 10.1021/acs.jcim.3c01906] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/20/2024]

Abstract

Condensing the many physical variables defining a chemical system into a fixed-size array poses a significant challenge in the development of chemical Machine Learning (ML). Atom Centered Symmetry Functions (ACSFs) offer an intuitive featurization approach by means of a tedious and labor-intensive selection of tunable parameters. In this work, we implement an unsupervised ML strategy relying on a Gaussian Mixture Model (GMM) to automatically optimize the ACSF parameters. GMMs effortlessly decompose the vastness of the chemical and conformational spaces into well-defined radial and angular clusters, which are then used to build tailor-made ACSFs. The unsupervised exploration of the space has demonstrated general applicability across a diverse range of systems, spanning from various unimolecular landscapes to heterogeneous databases. The impact of the sampling technique and temperature on space exploration is also addressed, highlighting the particularly advantageous role of high-temperature Molecular Dynamics (MD) simulations. The reliability of the resulting features is assessed through the estimation of the atomic charges of a prototypical capped amino acid and a heterogeneous collection of CHON molecules. The automatically constructed ACSFs serve as high-quality descriptors, consistently yielding typical prediction errors below 0.010 electrons bound for the reported atomic charges. Altering the spatial distribution of the functions with respect to the cluster highlights the critical role of symmetry rupture in achieving significantly improved features. More specifically, using two separate functions to describe the lower and upper tails of the cluster results in the best performing models with errors as low as 0.006 electrons. Finally, the effectiveness of finely tuned features was checked across different architectures, unveiling the superior performance of Gaussian Process (GP) models over Feed Forward Neural Networks (FFNNs), particularly in low-data regimes, with nearly a 2-fold increase in prediction quality. Altogether, this approach paves the way toward an easier construction of local chemical descriptors, while providing valuable insights into how radial and angular spaces should be mapped. Finally, this work opens the possibility of encoding many-body information beyond angular terms into upcoming ML features.

Collapse

Ding Y, Qiang B, Chen Q, Liu Y, Zhang L, Liu Z. Exploring Chemical Reaction Space with Machine Learning Models: Representation and Feature Perspective. J Chem Inf Model 2024;64:2955-2970. [PMID: 38489239 DOI: 10.1021/acs.jcim.4c00004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/17/2024]

Gou Q, Liu J, Su H, Guo Y, Chen J, Zhao X, Pu X. Exploring an accurate machine learning model to quickly estimate stability of diverse energetic materials. iScience 2024;27:109452. [PMID: 38523799 PMCID: PMC10960145 DOI: 10.1016/j.isci.2024.109452] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2023] [Revised: 01/27/2024] [Accepted: 03/06/2024] [Indexed: 03/26/2024] Open

Zills F, Schäfer MR, Segreto N, Kästner J, Holm C, Tovey S. Collaboration on Machine-Learned Potentials with IPSuite: A Modular Framework for Learning-on-the-Fly. J Phys Chem B 2024;128:3662-3676. [PMID: 38568231 DOI: 10.1021/acs.jpcb.3c07187] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/19/2024]

Qin W, Wang H, Zhang F, Ma W, Wang J, Huang T. Nonconvex Robust High-Order Tensor Completion Using Randomized Low-Rank Approximation. IEEE Trans Image Process 2024;33:2835-2850. [PMID: 38598373 DOI: 10.1109/tip.2024.3385284] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/12/2024]

Joshi KP, Adhikari G, Bhattarai D, Adhikari A, Lamichanne S. Forest fire vulnerability in Nepal's chure region: Investigating the influencing factors using generalized linear model. Heliyon 2024;10:e28525. [PMID: 38596031 PMCID: PMC11002069 DOI: 10.1016/j.heliyon.2024.e28525] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2023] [Revised: 03/20/2024] [Accepted: 03/20/2024] [Indexed: 04/11/2024] Open

Abstract

The Chure region, among the world's youngest mountains, stands out as highly susceptible to natural calamities, particularly forest fires. The region has consistently experienced forest fire incidents, resulting in the degradation of valuable natural and anthropogenic resources. Despite its vulnerability, there have been limited studies to understand the relationship of various causative factors for the recurring fire problem. Hence, to comprehend the influencing factors for the recurring forest fire problem and its extent, we utilized generalized linear modeling under binary logistic regression to combine the dependent variable of satellite detected fire points and various independent variables. We conducted a variance inflation factor (VIF) test and correlation matrix to identify the 14 suitable variables for the study. The analysis revealed that forest fires occurred mostly during the three pre-monsoon periods and had a significant positive relation with the area under forest, rangeland, bare-grounds, and Normalized Difference Vegetation Index (NDVI) (P < 0.05). Consequently, our model showed that the probability of fire incidents decreases with elevation, precipitation, and population density (P < 0.05). Among the significant variables, the forest areas emerges as the most influencing factor, followed by precipitation, elevation, area of rangeland, population density, NDVI, and the area of bare ground. The validation of the model was done through the area under the curve (AUC = 0.92) and accuracy (ACC = 0.89) assessments, which showed the model performed excellently in terms of predictive capabilities. The modeling result and the forest fire susceptible map provide valuable insights into the forest fire vulnerability in the region, offering baseline information about forest fires that will be helpful for line agencies to prepare management strategies to further prevent the deterioration of the region.

Collapse

Zhang HK, Liu S, Zhang SX. Absence of Barren Plateaus in Finite Local-Depth Circuits with Long-Range Entanglement. Phys Rev Lett 2024;132:150603. [PMID: 38682974 DOI: 10.1103/physrevlett.132.150603] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/06/2023] [Revised: 01/24/2024] [Accepted: 03/13/2024] [Indexed: 05/01/2024]

Drmota P, Nadlinger DP, Main D, Nichol BC, Ainley EM, Leichtle D, Mantri A, Kashefi E, Srinivas R, Araneda G, Ballance CJ, Lucas DM. Verifiable Blind Quantum Computing with Trapped Ions and Single Photons. Phys Rev Lett 2024;132:150604. [PMID: 38682960 DOI: 10.1103/physrevlett.132.150604] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/10/2023] [Accepted: 01/16/2024] [Indexed: 05/01/2024]

Choi S, Lee J, Seo J, Han SW, Lee SH, Seo JH, Seok J. Automated BigSMILES conversion workflow and dataset for homopolymeric macromolecules. Sci Data 2024;11:371. [PMID: 38605036 PMCID: PMC11009387 DOI: 10.1038/s41597-024-03212-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2023] [Accepted: 04/02/2024] [Indexed: 04/13/2024] Open

Panwar P, Yang Q, Martini A. Temperature-Dependent Density and Viscosity Prediction for Hydrocarbons: Machine Learning and Molecular Dynamics Simulations. J Chem Inf Model 2024;64:2760-2774. [PMID: 37582234 DOI: 10.1021/acs.jcim.3c00231] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/17/2023]

Abstract

Machine learning-based predictive models allow rapid and reliable prediction of material properties and facilitate innovative materials design. Base oils used in the formulation of lubricant products are complex hydrocarbons of varying sizes and structure. This study developed Gaussian process regression-based models to accurately predict the temperature-dependent density and dynamic viscosity of 305 complex hydrocarbons. In our approach, strongly correlated/collinear predictors were trimmed, important predictors were selected by least absolute shrinkage and selection operator (LASSO) regularization and prior domain knowledge, hyperparameters were systematically optimized by Bayesian optimization, and the models were interpreted. The approach provided versatile and quantitative structure-property relationship (QSPR) models with relatively simple predictors for determining the dynamic viscosity and density of complex hydrocarbons at any temperature. In addition, we developed molecular dynamics simulation-based descriptors and evaluated the feasibility and versatility of dynamic descriptors from simulations for predicting the material properties. It was found that the models developed using a comparably smaller pool of dynamic descriptors performed similarly in predicting density and viscosity to models based on many more static descriptors. The best models were shown to predict density and dynamic viscosity with coefficient of determination (R2) values of 99.6% and 97.7%, respectively, for all data sets, including a test data set of 45 molecules. Finally, partial dependency plots (PDPs), individual conditional expectation (ICE) plots, local interpretable model-agnostic explanation (LIME) values, and trimmed model R2 values were used to identify the most important static and dynamic predictors of the density and viscosity.

Collapse

Gao C, Bao W, Wang S, Zheng J, Wang L, Ren Y, Jiao L, Wang J, Wang X. DockingGA: enhancing targeted molecule generation using transformer neural network and genetic algorithm with docking simulation. Brief Funct Genomics 2024:elae011. [PMID: 38582610 DOI: 10.1093/bfgp/elae011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2023] [Revised: 02/25/2024] [Accepted: 03/13/2024] [Indexed: 04/08/2024] Open

Unke OT, Stöhr M, Ganscha S, Unterthiner T, Maennel H, Kashubin S, Ahlin D, Gastegger M, Medrano Sandonas L, Berryman JT, Tkatchenko A, Müller KR. Biomolecular dynamics with machine-learned quantum-mechanical force fields trained on diverse chemical fragments. Sci Adv 2024;10:eadn4397. [PMID: 38579003 DOI: 10.1126/sciadv.adn4397] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/10/2023] [Accepted: 02/29/2024] [Indexed: 04/07/2024]

Affiliation(s)

Oliver T Unke Google DeepMind, Tucholskystraße 2, 10117 Berlin, Germany and Brandschenkestrasse 110, 8002 Zürich, Switzerland Machine Learning Group, Technische Universität Berlin, 10587 Berlin, Germany DFG Cluster of Excellence "Unifying Systems in Catalysis" (UniSysCat), Technische Universität Berlin, 10623 Berlin, Germany
Martin Stöhr Department of Physics and Materials Science, University of Luxembourg, L-1511 Luxembourg City, Luxembourg
Stefan Ganscha Google DeepMind, Tucholskystraße 2, 10117 Berlin, Germany and Brandschenkestrasse 110, 8002 Zürich, Switzerland
Thomas Unterthiner Google DeepMind, Tucholskystraße 2, 10117 Berlin, Germany and Brandschenkestrasse 110, 8002 Zürich, Switzerland
Hartmut Maennel Google DeepMind, Tucholskystraße 2, 10117 Berlin, Germany and Brandschenkestrasse 110, 8002 Zürich, Switzerland
Sergii Kashubin Google DeepMind, Tucholskystraße 2, 10117 Berlin, Germany and Brandschenkestrasse 110, 8002 Zürich, Switzerland
Daniel Ahlin Google DeepMind, Tucholskystraße 2, 10117 Berlin, Germany and Brandschenkestrasse 110, 8002 Zürich, Switzerland
Michael Gastegger Machine Learning Group, Technische Universität Berlin, 10587 Berlin, Germany DFG Cluster of Excellence "Unifying Systems in Catalysis" (UniSysCat), Technische Universität Berlin, 10623 Berlin, Germany BASLEARN - TU Berlin/BASF Joint Lab for Machine Learning, Technische Universität Berlin, 10587 Berlin, Germany
Leonardo Medrano Sandonas Department of Physics and Materials Science, University of Luxembourg, L-1511 Luxembourg City, Luxembourg
Joshua T Berryman Department of Physics and Materials Science, University of Luxembourg, L-1511 Luxembourg City, Luxembourg
Alexandre Tkatchenko Department of Physics and Materials Science, University of Luxembourg, L-1511 Luxembourg City, Luxembourg
Klaus-Robert Müller Google DeepMind, Tucholskystraße 2, 10117 Berlin, Germany and Brandschenkestrasse 110, 8002 Zürich, Switzerland Machine Learning Group, Technische Universität Berlin, 10587 Berlin, Germany Department of Artificial Intelligence, Korea University, Anam-dong, Seongbuk-gu, Seoul 02841, Korea Max Planck Institute for Informatics, Stuhlsatzenhausweg, 66123 Saarbrücken, Germany BIFOLD - Berlin Institute for the Foundations of Learning and Data, Berlin, Germany

Collapse

Lu B, Xia Y, Ren Y, Xie M, Zhou L, Vinai G, Morton SA, Wee ATS, van der Wiel WG, Zhang W, Wong PKJ. When Machine Learning Meets 2D Materials: A Review. Adv Sci (Weinh) 2024;11:e2305277. [PMID: 38279508 PMCID: PMC10987159 DOI: 10.1002/advs.202305277] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/31/2023] [Revised: 10/21/2023] [Indexed: 01/28/2024]

Affiliation(s)

Bin Lu ARTIST Lab for Artificial Electronic Materials and Technologies, School of MicroelectronicsNorthwestern Polytechnical UniversityXi'an710072P. R. China Yangtze River Delta Research Institute of Northwestern Polytechnical UniversityTaicang215400P. R. China
Yuze Xia ARTIST Lab for Artificial Electronic Materials and Technologies, School of MicroelectronicsNorthwestern Polytechnical UniversityXi'an710072P. R. China Yangtze River Delta Research Institute of Northwestern Polytechnical UniversityTaicang215400P. R. China
Yuqian Ren ARTIST Lab for Artificial Electronic Materials and Technologies, School of MicroelectronicsNorthwestern Polytechnical UniversityXi'an710072P. R. China Yangtze River Delta Research Institute of Northwestern Polytechnical UniversityTaicang215400P. R. China
Miaomiao Xie ARTIST Lab for Artificial Electronic Materials and Technologies, School of MicroelectronicsNorthwestern Polytechnical UniversityXi'an710072P. R. China Yangtze River Delta Research Institute of Northwestern Polytechnical UniversityTaicang215400P. R. China
Liguo Zhou ARTIST Lab for Artificial Electronic Materials and Technologies, School of MicroelectronicsNorthwestern Polytechnical UniversityXi'an710072P. R. China Yangtze River Delta Research Institute of Northwestern Polytechnical UniversityTaicang215400P. R. China
Giovanni Vinai Instituto Officina dei Materiali (IOM)‐CNRLaboratorio TASCTriesteI‐34149Italy
Simon A. Morton Advanced Light Source (ALS)Lawrence Berkeley National LaboratoryBerkeleyCA94720USA
Andrew T. S. Wee Department of Physics and Centre for Advanced 2D Materials (CA2DM) and Graphene Research Centre (GRC)National University of SingaporeSingapore117542Singapore
Wilfred G. van der Wiel NanoElectronics Group, MESA+ Institute for Nanotechnology and BRAINS Center for Brain‐Inspired Nano SystemsUniversity of TwenteEnschede7500AEThe Netherlands Institute of PhysicsUniversity of Münster48149MünsterGermany
Wen Zhang ARTIST Lab for Artificial Electronic Materials and Technologies, School of MicroelectronicsNorthwestern Polytechnical UniversityXi'an710072P. R. China Yangtze River Delta Research Institute of Northwestern Polytechnical UniversityTaicang215400P. R. China NanoElectronics Group, MESA+ Institute for Nanotechnology and BRAINS Center for Brain‐Inspired Nano SystemsUniversity of TwenteEnschede7500AEThe Netherlands
Ping Kwan Johnny Wong ARTIST Lab for Artificial Electronic Materials and Technologies, School of MicroelectronicsNorthwestern Polytechnical UniversityXi'an710072P. R. China Yangtze River Delta Research Institute of Northwestern Polytechnical UniversityTaicang215400P. R. China NPU Chongqing Technology Innovation CenterChongqing400000P. R. China

Collapse

Guo Y, Zhang H, Yuan L, Chen W, Zhao H, Yu QQ, Shi W. Machine learning and new insights for breast cancer diagnosis. J Int Med Res 2024;52:3000605241237867. [PMID: 38663911 PMCID: PMC11047257 DOI: 10.1177/03000605241237867] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2023] [Accepted: 02/21/2024] [Indexed: 04/28/2024] Open

Munteanu V, Starostin V, Greco A, Pithan L, Gerlach A, Hinderhofer A, Kowarik S, Schreiber F. Neural network analysis of neutron and X-ray reflectivity data incorporating prior knowledge. J Appl Crystallogr 2024;57:456-469. [PMID: 38596736 PMCID: PMC11001411 DOI: 10.1107/s1600576724002115] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2023] [Accepted: 03/03/2024] [Indexed: 04/11/2024] Open

Yang Z, Zhang L, Liu T, Wang H, Tang Z, Zhao H, Yuan L, Zhang Z, Liu X. Alternating projection combined with fast gradient projection (FGP-AP) method for intensity-only measurement optical diffraction tomography in LED array microscopy. Biomed Opt Express 2024;15:2524-2542. [PMID: 38633101 PMCID: PMC11019679 DOI: 10.1364/boe.518955] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/17/2024] [Revised: 03/06/2024] [Accepted: 03/11/2024] [Indexed: 04/19/2024]

Martínez‐Mauricio KL, García‐Jacas CR, Cordoves‐Delgado G. Examining evolutionary scale modeling-derived different-dimensional embeddings in the antimicrobial peptide classification through a KNIME workflow. Protein Sci 2024;33:e4928. [PMID: 38501511 PMCID: PMC10949403 DOI: 10.1002/pro.4928] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2023] [Revised: 01/28/2024] [Accepted: 01/30/2024] [Indexed: 03/20/2024]

Abstract

Molecular features play an important role in different bio-chem-informatics tasks, such as the Quantitative Structure-Activity Relationships (QSAR) modeling. Several pre-trained models have been recently created to be used in downstream tasks, either by fine-tuning a specific model or by extracting features to feed traditional classifiers. In this regard, a new family of Evolutionary Scale Modeling models (termed as ESM-2 models) was recently introduced, demonstrating outstanding results in protein structure prediction benchmarks. Herein, we studied the usefulness of the different-dimensional embeddings derived from the ESM-2 models to classify antimicrobial peptides (AMPs). To this end, we built a KNIME workflow to use the same modeling methodology across experiments in order to guarantee fair analyses. As a result, the 640- and 1280-dimensional embeddings derived from the 30- and 33-layer ESM-2 models, respectively, are the most valuable since statistically better performances were achieved by the QSAR models built from them. We also fused features of the different ESM-2 models, and it was concluded that the fusion contributes to getting better QSAR models than using features of a single ESM-2 model. Frequency studies revealed that only a portion of the ESM-2 embeddings is valuable for modeling tasks since between 43% and 66% of the features were never used. Comparisons regarding state-of-the-art deep learning (DL) models confirm that when performing methodologically principled studies in the prediction of AMPs, non-DL based QSAR models yield comparable-to-superior performances to DL-based QSAR models. The developed KNIME workflow is available-freely at https://github.com/cicese-biocom/classification-QSAR-bioKom. This workflow can be valuable to avoid unfair comparisons regarding new computational methods, as well as to propose new non-DL based QSAR models.

Collapse

Ghiandoni GM, Evertsson E, Riley DJ, Tyrchan C, Rathi PC. Augmenting DMTA using predictive AI modelling at AstraZeneca. Drug Discov Today 2024;29:103945. [PMID: 38460568 DOI: 10.1016/j.drudis.2024.103945] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2023] [Revised: 02/27/2024] [Accepted: 03/05/2024] [Indexed: 03/11/2024]

Daniel DT, Mitra S, Eichel RA, Diddens D, Granwehr J. Machine Learning Isotropic g Values of Radical Polymers. J Chem Theory Comput 2024;20:2592-2604. [PMID: 38456629 DOI: 10.1021/acs.jctc.3c01252] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/09/2024]

Elgendy R, Younes A, Abu-Donia HM, Farouk RM. Efficient quantum algorithms for set operations. Sci Rep 2024;14:7015. [PMID: 38527996 DOI: 10.1038/s41598-024-56860-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2023] [Accepted: 03/12/2024] [Indexed: 03/27/2024] Open

Luo M, Lee SS. Tandem neural network-assisted inverse design of highly efficient diffractive slanted waveguide grating. Opt Express 2024;32:12587-12600. [PMID: 38571077 DOI: 10.1364/oe.514502] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/29/2023] [Accepted: 03/12/2024] [Indexed: 04/05/2024]

Korolev V, Mitrofanov A. Coarse-Grained Crystal Graph Neural Networks for Reticular Materials Design. J Chem Inf Model 2024;64:1919-1931. [PMID: 38456446 DOI: 10.1021/acs.jcim.3c02083] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/09/2024]

Lalith N, Singh AR, Gauthier JA. The Importance of Reaction Energy in Predicting Chemical Reaction Barriers with Machine Learning Models. Chemphyschem 2024:e202300933. [PMID: 38517585 DOI: 10.1002/cphc.202300933] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2023] [Revised: 03/21/2024] [Accepted: 03/22/2024] [Indexed: 03/24/2024]

Abstract

Improving our fundamental understanding of complex heterocatalytic processes increasingly relies on electronic structure simulations and microkinetic models based on calculated energy differences. In particular, calculation of activation barriers, usually achieved through compute-intensive saddle point search routines, remains a serious bottleneck in understanding trends in catalytic activity for highly branched reaction networks. Although the well-known Brønsted-Evans-Polyani (BEP) scaling - a one-feature linear regression model - has been widely applied in such microkinetic models, they still rely on calculated reaction energies and may not generalize beyond a single facet on a single class of materials, e. g., a terrace sites on transition metals. For highly branched and energetically shallow reaction networks, such as electrochemical CO2 reduction or wastewater remediation, calculating even reaction energies on many surfaces can become computationally intractable due to the combinatorial explosion of states that must be considered. Here, we investigate the feasibility of activation barrier prediction without knowledge of the reaction energy using linear and nonlinear machine learning (ML) models trained on a new database of over 500 dehydrogenation activation barriers. We also find that inclusion of the reaction energy significantly improves both classes of ML models, but complex nonlinear models can achieve performance similar to the simplest BEP scaling when predicting activation barriers on new systems. Additionally, inclusion of the reaction energy significantly improves generalizability to new systems beyond the training set. Our results suggest that the reaction energy is a critical feature to consider when building models to predict activation barriers, indicating that efforts to reliably predict reaction energies through, e. g., the Open Catalyst Project and others, will be an important route to effective model development for more complex systems.

Collapse

Sammüller F, Hermann S, Schmidt M. Why neural functionals suit statistical mechanics. J Phys Condens Matter 2024;36:243002. [PMID: 38467072 DOI: 10.1088/1361-648x/ad326f] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/29/2023] [Accepted: 03/11/2024] [Indexed: 03/13/2024]