1
|
Kadan A, Ryczko K, Wildman A, Wang R, Roitberg A, Yamazaki T. Accelerated Organic Crystal Structure Prediction with Genetic Algorithms and Machine Learning. J Chem Theory Comput 2023; 19:9388-9402. [PMID: 38059458 DOI: 10.1021/acs.jctc.3c00853] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/08/2023]
Abstract
We present a high-throughput, end-to-end pipeline for organic crystal structure prediction (CSP)─the problem of identifying the stable crystal structures that will form from a given molecule based only on its molecular composition. Our tool uses neural network potentials to allow for efficient screening and structural relaxation of generated crystal candidates. Our pipeline consists of two distinct stages: random search, whereby crystal candidates are randomly generated and screened, and optimization, where a genetic algorithm (GA) optimizes this screened population. We assess the performance of each stage of our pipeline on 21 molecules taken from the Cambridge Crystallographic Data Centre's CSP blind tests. We show that random search alone yields matches for ≈50% of targets. We then validate the potential of our full pipeline, making use of the GA to optimize the root-mean-square deviation between crystal candidates and the experimentally derived structure. With this approach, we are able to find matches for ≈80% of candidates with 10-100 times smaller initial population sizes than when using random search. Lastly, we run our full pipeline with an ANI model that is trained on a small data set of molecules extracted from crystal structures in the Cambridge Structural Database, generating ≈60% of targets. By leveraging machine learning models trained to predict energies at the density functional theory level, our pipeline has the potential to approach the accuracy of ab initio methods and the efficiency of empirical force fields.
Collapse
Affiliation(s)
- Amit Kadan
- Good Chemistry Company, 1285 W Pender Street, Vancouver, British Columbia V6E 4B1, Canada
| | - Kevin Ryczko
- Good Chemistry Company, 1285 W Pender Street, Vancouver, British Columbia V6E 4B1, Canada
| | - Andrew Wildman
- Good Chemistry Company, 1285 W Pender Street, Vancouver, British Columbia V6E 4B1, Canada
| | - Rodrigo Wang
- Good Chemistry Company, 1285 W Pender Street, Vancouver, British Columbia V6E 4B1, Canada
| | - Adrian Roitberg
- Department of Chemistry, University of Florida, P.O. Box 117200, Gainesville, Florida 32611-7200, United States
| | - Takeshi Yamazaki
- Good Chemistry Company, 1285 W Pender Street, Vancouver, British Columbia V6E 4B1, Canada
| |
Collapse
|
2
|
O’Connor D, Bier I, Tom R, Hiszpanski AM, Steele BA, Marom N. Ab Initio Crystal Structure Prediction of the Energetic Materials LLM-105, RDX, and HMX. CRYSTAL GROWTH & DESIGN 2023; 23:6275-6289. [PMID: 38173900 PMCID: PMC10763925 DOI: 10.1021/acs.cgd.3c00027] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/07/2023] [Revised: 08/02/2023] [Indexed: 01/05/2024]
Abstract
Crystal structure prediction (CSP) is performed for the energetic materials (EMs) LLM-105 and α-RDX, as well as the α and β conformational polymorphs of 1,3,5,7-tetranitro-1,3,5,7-tetraazacyclooctane (HMX), using the genetic algorithm (GA) code, GAtor, and its associated random structure generator, Genarris. Genarris and GAtor successfully generate the experimental structures of all targets. GAtor's symmetric crossover scheme, where the space group symmetries of parent structures are treated as genes inherited by offspring, is found to be particularly effective. However, conducting several GA runs with different settings is still important for achieving diverse samplings of the potential energy surface. For LLM-105 and α-RDX, the experimental structure is ranked as the most stable, with all of the dispersion-inclusive density functional theory (DFT) methods used here. For HMX, the α form was persistently ranked as more stable than the β form, in contrast to experimental observations, even when correcting for vibrational contributions and thermal expansion. This may be attributed to insufficient accuracy of dispersion-inclusive DFT methods or to kinetic effects not considered here. In general, the ranking of some putative structures is found to be sensitive to the choice of the DFT functional and the dispersion method. For LLM-105, GAtor generates a putative structure with a layered packing motif, which is desirable thanks to its correlation with low sensitivity. Our results demonstrate that CSP is a useful tool for studying the ubiquitous polymorphism of EMs and shows promise of becoming an integral part of the EM development pipeline.
Collapse
Affiliation(s)
- Dana O’Connor
- Department
of Materials Science and Engineering, Carnegie
Mellon University, Pittsburgh, Pennsylvania 15213, United States
| | - Imanuel Bier
- Department
of Materials Science and Engineering, Carnegie
Mellon University, Pittsburgh, Pennsylvania 15213, United States
| | - Rithwik Tom
- Department
of Physics, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213, United States
| | - Anna M. Hiszpanski
- Materials
Science Division, Lawrence Livermore National
Laboratory, Livermore, California 94550, United States
| | - Brad A. Steele
- Materials
Science Division, Lawrence Livermore National
Laboratory, Livermore, California 94550, United States
| | - Noa Marom
- Department
of Materials Science and Engineering, Carnegie
Mellon University, Pittsburgh, Pennsylvania 15213, United States
- Department
of Physics, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213, United States
- Department
of Chemistry, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213, United States
| |
Collapse
|
3
|
Wang J, Gao H, Han Y, Ding C, Pan S, Wang Y, Jia Q, Wang HT, Xing D, Sun J. MAGUS: machine learning and graph theory assisted universal structure searcher. Natl Sci Rev 2023; 10:nwad128. [PMID: 37332628 PMCID: PMC10275355 DOI: 10.1093/nsr/nwad128] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2023] [Revised: 03/30/2023] [Accepted: 04/28/2023] [Indexed: 06/20/2023] Open
Abstract
Crystal structure predictions based on first-principles calculations have gained great success in materials science and solid state physics. However, the remaining challenges still limit their applications in systems with a large number of atoms, especially the complexity of conformational space and the cost of local optimizations for big systems. Here, we introduce a crystal structure prediction method, MAGUS, based on the evolutionary algorithm, which addresses the above challenges with machine learning and graph theory. Techniques used in the program are summarized in detail and benchmark tests are provided. With intensive tests, we demonstrate that on-the-fly machine-learning potentials can be used to significantly reduce the number of expensive first-principles calculations, and the crystal decomposition based on graph theory can efficiently decrease the required configurations in order to find the target structures. We also summarized the representative applications of this method on several research topics, including unexpected compounds in the interior of planets and their exotic states at high pressure and high temperature (superionic, plastic, partially diffusive state, etc.); new functional materials (superhard, high-energy-density, superconducting, photoelectric materials), etc. These successful applications demonstrated that MAGUS code can help to accelerate the discovery of interesting materials and phenomena, as well as the significant value of crystal structure predictions in general.
Collapse
Affiliation(s)
| | | | | | - Chi Ding
- National Laboratory of Solid State Microstructures, School of Physics and Collaborative Innovation Center of Advanced Microstructures, Nanjing University, Nanjing 210093, China
| | - Shuning Pan
- National Laboratory of Solid State Microstructures, School of Physics and Collaborative Innovation Center of Advanced Microstructures, Nanjing University, Nanjing 210093, China
| | - Yong Wang
- National Laboratory of Solid State Microstructures, School of Physics and Collaborative Innovation Center of Advanced Microstructures, Nanjing University, Nanjing 210093, China
| | - Qiuhan Jia
- National Laboratory of Solid State Microstructures, School of Physics and Collaborative Innovation Center of Advanced Microstructures, Nanjing University, Nanjing 210093, China
| | - Hui-Tian Wang
- National Laboratory of Solid State Microstructures, School of Physics and Collaborative Innovation Center of Advanced Microstructures, Nanjing University, Nanjing 210093, China
| | - Dingyu Xing
- National Laboratory of Solid State Microstructures, School of Physics and Collaborative Innovation Center of Advanced Microstructures, Nanjing University, Nanjing 210093, China
| | | |
Collapse
|
4
|
Kilgour M, Rogal J, Tuckerman M. Geometric Deep Learning for Molecular Crystal Structure Prediction. J Chem Theory Comput 2023. [PMID: 37053511 PMCID: PMC10373482 DOI: 10.1021/acs.jctc.3c00031] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/15/2023]
Abstract
We develop and test new machine learning strategies for accelerating molecular crystal structure ranking and crystal property prediction using tools from geometric deep learning on molecular graphs. Leveraging developments in graph-based learning and the availability of large molecular crystal data sets, we train models for density prediction and stability ranking which are accurate, fast to evaluate, and applicable to molecules of widely varying size and composition. Our density prediction model, MolXtalNet-D, achieves state-of-the-art performance, with lower than 2% mean absolute error on a large and diverse test data set. Our crystal ranking tool, MolXtalNet-S, correctly discriminates experimental samples from synthetically generated fakes and is further validated through analysis of the submissions to the Cambridge Structural Database Blind Tests 5 and 6. Our new tools are computationally cheap and flexible enough to be deployed within an existing crystal structure prediction pipeline both to reduce the search space and score/filter crystal structure candidates.
Collapse
Affiliation(s)
- Michael Kilgour
- Department of Chemistry, New York University, New York, New York 10003, United States
| | - Jutta Rogal
- Department of Chemistry, New York University, New York, New York 10003, United States
- Fachbereich Physik, Freie Universität Berlin, 14195 Berlin, Germany
| | - Mark Tuckerman
- Department of Chemistry, New York University, New York, New York 10003, United States
- Courant Institute of Mathematical Sciences, New York University, New York, New York 10012, United States
- NYU-ECNU Center for Computational Chemistry at NYU Shanghai, 3663 Zhongshan Rd. North, Shanghai 200062, China
- Simons Center for Computational Physical Chemistry at New York University, New York, New York 10003, United States
| |
Collapse
|
5
|
Tom R, Gao S, Yang Y, Zhao K, Bier I, Buchanan EA, Zaykov A, Havlas Z, Michl J, Marom N. Inverse Design of Tetracene Polymorphs with Enhanced Singlet Fission Performance by Property-Based Genetic Algorithm Optimization. CHEMISTRY OF MATERIALS : A PUBLICATION OF THE AMERICAN CHEMICAL SOCIETY 2023; 35:1373-1386. [PMID: 36999121 PMCID: PMC10042130 DOI: 10.1021/acs.chemmater.2c03444] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/17/2022] [Revised: 01/06/2023] [Indexed: 06/19/2023]
Abstract
The efficiency of solar cells may be improved by using singlet fission (SF), in which one singlet exciton splits into two triplet excitons. SF occurs in molecular crystals. A molecule may crystallize in more than one form, a phenomenon known as polymorphism. Crystal structure may affect SF performance. In the common form of tetracene, SF is experimentally known to be slightly endoergic. A second, metastable polymorph of tetracene has been found to exhibit better SF performance. Here, we conduct inverse design of the crystal packing of tetracene using a genetic algorithm (GA) with a fitness function tailored to simultaneously optimize the SF rate and the lattice energy. The property-based GA successfully generates more structures predicted to have higher SF rates and provides insight into packing motifs associated with improved SF performance. We find a putative polymorph predicted to have superior SF performance to the two forms of tetracene, whose structures have been determined experimentally. The putative structure has a lattice energy within 1.5 kJ/mol of the most stable common form of tetracene.
Collapse
Affiliation(s)
- Rithwik Tom
- Department
of Physics, Carnegie Mellon University, Pittsburgh, Pennsylvania15213, United States
| | - Siyu Gao
- Department
of Materials Science and Engineering, Carnegie
Mellon University, Pittsburgh, Pennsylvania15213, United States
| | - Yi Yang
- Department
of Materials Science and Engineering, Carnegie
Mellon University, Pittsburgh, Pennsylvania15213, United States
| | - Kaiji Zhao
- Department
of Materials Science and Engineering, Carnegie
Mellon University, Pittsburgh, Pennsylvania15213, United States
| | - Imanuel Bier
- Department
of Materials Science and Engineering, Carnegie
Mellon University, Pittsburgh, Pennsylvania15213, United States
| | - Eric A. Buchanan
- Department
of Chemistry, University of Colorado, Boulder, Colorado80309, United States
| | - Alexandr Zaykov
- Institute
of Organic Chemistry and Biochemistry, Czech
Academy of Sciences, 16610Prague 6, Czech
Republic
- Department
of Physical Chemistry, University of Chemistry
and Technology, 166 28Prague 6, Czech Republic
| | - Zdeněk Havlas
- Institute
of Organic Chemistry and Biochemistry, Czech
Academy of Sciences, 16610Prague 6, Czech
Republic
| | - Josef Michl
- Department
of Chemistry, University of Colorado, Boulder, Colorado80309, United States
- Institute
of Organic Chemistry and Biochemistry, Czech
Academy of Sciences, 16610Prague 6, Czech
Republic
| | - Noa Marom
- Department
of Physics, Carnegie Mellon University, Pittsburgh, Pennsylvania15213, United States
- Department
of Materials Science and Engineering, Carnegie
Mellon University, Pittsburgh, Pennsylvania15213, United States
- Department
of Chemistry, Carnegie Mellon University, Pittsburgh, Pennsylvania15213, United States
| |
Collapse
|
6
|
Price AJA, Otero-de-la-Roza A, Johnson ER. XDM-corrected hybrid DFT with numerical atomic orbitals predicts molecular crystal lattice energies with unprecedented accuracy. Chem Sci 2023; 14:1252-1262. [PMID: 36756332 PMCID: PMC9891363 DOI: 10.1039/d2sc05997e] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2022] [Accepted: 12/13/2022] [Indexed: 12/23/2022] Open
Abstract
Molecular crystals are important for many applications, including energetic materials, organic semiconductors, and the development and commercialization of pharmaceuticals. The exchange-hole dipole moment (XDM) dispersion model has shown good performance in the calculation of relative and absolute lattice energies of molecular crystals, although it has traditionally been applied in combination with plane-wave/pseudopotential approaches. This has limited XDM to use with semilocal functional approximations, which suffer from delocalization error and poor quality conformational energies, and to systems with a few hundreds of atoms at most due to unfavorable scaling. In this work, we combine XDM with numerical atomic orbitals, which enable the efficient use of XDM-corrected hybrid functionals for molecular crystals. We test the new XDM-corrected functionals for their ability to predict the lattice energies of molecular crystals for the X23 set and 13 ice phases, the latter being a particularly stringent test. A composite approach using a XDM-corrected, 25% hybrid functional based on B86bPBE achieves a mean absolute error of 0.48 kcal mol-1 per molecule for the X23 set and 0.19 kcal mol-1 for the total lattice energies of the ice phases, compared to recent diffusion Monte-Carlo data. These results make the new XDM-corrected hybrids not only far more computationally efficient than previous XDM implementations, but also the most accurate density-functional methods for molecular crystal lattice energies to date.
Collapse
Affiliation(s)
- Alastair J. A. Price
- Department of Chemistry, Dalhousie University6274 Coburg RdHalifaxB3H 4R2Nova ScotiaCanada
| | - Alberto Otero-de-la-Roza
- Departamento de Química Física y Analítica and MALTA-Consolider Team, Facultad de Química, Universidad de Oviedo Oviedo 33006 Spain
| | - Erin R. Johnson
- Department of Chemistry, Dalhousie University6274 Coburg RdHalifaxB3H 4R2Nova ScotiaCanada
| |
Collapse
|
7
|
Rana B, Beran GJO, Herbert JM. Correcting π-delocalisation errors in conformational energies using density-corrected DFT, with application to crystal polymorphs. Mol Phys 2022. [DOI: 10.1080/00268976.2022.2138789] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/31/2022]
Affiliation(s)
- Bhaskar Rana
- Department of Chemistry and Biochemistry, The Ohio State University, Columbus, OH, USA
| | | | - John M. Herbert
- Department of Chemistry and Biochemistry, The Ohio State University, Columbus, OH, USA
| |
Collapse
|
8
|
Petsev ND, Nikoubashman A, Latinwo F, Stillinger FH, Debenedetti PG. Crystal Prediction via Genetic Algorithms in a Model Chiral System. J Phys Chem B 2022; 126:7771-7780. [PMID: 36162405 DOI: 10.1021/acs.jpcb.2c04501] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Chiral crystals and their constituent molecules play a prominent role in theories about the origin of biological homochirality and in drug discovery, design, and stability. Although the prediction and identification of stable chiral crystal structures is crucial for numerous technologies, including separation processes and polymorph selection and control, predictive ability is often complicated by a combination of many-body interactions and molecular complexity and handedness. In this work, we address these challenges by applying genetic algorithms to predict the ground-state crystal lattices formed by a chiral tetramer molecular model, which we have previously shown to exhibit complex fluid-phase behavior. Using this approach, we explore the relative stability and structures of the model's conglomerate and racemic crystals, and present a structural phase diagram for the stable Bravais crystal types in the zero-temperature limit.
Collapse
Affiliation(s)
- Nikolai D Petsev
- Department of Chemical and Biological Engineering, Princeton University, Princeton, New Jersey 08544, United States
| | - Arash Nikoubashman
- Institute of Physics, Johannes Gutenberg University Mainz, Staudingerweg 7, 55128 Mainz, Germany
| | - Folarin Latinwo
- Department of Chemical and Biological Engineering, Princeton University, Princeton, New Jersey 08544, United States.,Synopsys Inc., Austin, Texas 78746, United States
| | - Frank H Stillinger
- Department of Chemistry, Princeton University, Princeton, New Jersey 08544, United States
| | - Pablo G Debenedetti
- Department of Chemical and Biological Engineering, Princeton University, Princeton, New Jersey 08544, United States
| |
Collapse
|
9
|
Global analysis of the energy landscapes of molecular crystal structures by applying the threshold algorithm. Commun Chem 2022; 5:86. [PMID: 36697680 PMCID: PMC9814927 DOI: 10.1038/s42004-022-00705-4] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2021] [Accepted: 07/15/2022] [Indexed: 01/28/2023] Open
Abstract
Polymorphism in molecular crystals has important consequences for the control of materials properties and our understanding of crystallization. Computational methods, including crystal structure prediction, have provided important insight into polymorphism, but have usually been limited to assessing the relative energies of structures. We describe the implementation of the Monte Carlo threshold algorithm as a method to provide an estimate of the energy barriers separating crystal structures. By sampling the local energy minima accessible from multiple starting structures, the simulations yield a global picture of the crystal energy landscapes and provide valuable information on the depth of the energy minima associated with crystal structures. We present results from applying the threshold algorithm to four polymorphic organic molecular crystals, examine the influence of applying space group symmetry constraints during the simulations, and discuss the relationship between the structure of the energy landscape and the intermolecular interactions present in the crystals.
Collapse
|
10
|
Momenzadeh Abardeh Z, Salimi A, Oganov AR. Crystal structure prediction of N-halide phthalimide compounds: Halogen bond synthon as a touchstone. CrystEngComm 2022. [DOI: 10.1039/d2ce00476c] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
We address crystal structure prediction problem by combining evolutionary algorithm USPEX (used to predict sets of low-energy crystal structures) and synthon approach (extracting preferable supramolecular synthons from Cambridge Structural Database,...
Collapse
|