1
|
Manchev YT, Burn MJ, Popelier PLA. Ichor: A Python library for computational chemistry data management and machine learning force field development. J Comput Chem 2024; 45:2912-2928. [PMID: 39215569 DOI: 10.1002/jcc.27477] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2024] [Revised: 07/09/2024] [Accepted: 07/18/2024] [Indexed: 09/04/2024]
Abstract
We present ichor, an open-source Python library that simplifies data management in computational chemistry and streamlines machine learning force field development. Ichor implements many easily extensible file management tools, in addition to a lazy file reading system, allowing efficient management of hundreds of thousands of computational chemistry files. Data from calculations can be readily stored into databases for easy sharing and post-processing. Raw data can be directly processed by ichor to create machine learning-ready datasets. In addition to powerful data-related capabilities, ichor provides interfaces to popular workload management software employed by High Performance Computing clusters, making for effortless submission of thousands of separate calculations with only a single line of Python code. Furthermore, a simple-to-use command line interface has been implemented through a series of menu systems to further increase accessibility and efficiency of common important ichor tasks. Finally, ichor implements general tools for visualization and analysis of datasets and tools for measuring machine-learning model quality both on test set data and in simulations. With the current functionalities, ichor can serve as an end-to-end data procurement, data management, and analysis solution for machine-learning force-field development.
Collapse
Affiliation(s)
- Yulian T Manchev
- Department of Chemistry, The University of Manchester, Manchester, UK
| | - Matthew J Burn
- Department of Chemistry, The University of Manchester, Manchester, UK
| | - Paul L A Popelier
- Department of Chemistry, The University of Manchester, Manchester, UK
| |
Collapse
|
2
|
Neese F. A perspective on the future of quantum chemical software: the example of the ORCA program package. Faraday Discuss 2024; 254:295-314. [PMID: 39051881 DOI: 10.1039/d4fd00056k] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/27/2024]
Abstract
The field of computational chemistry has made an impressive impact on contemporary chemical research. In order to carry out computational studies on actual systems, sophisticated software is required in form of large-scale quantum chemical program packages. Given the enormous diversity and complexity of the methods that need to be implementation in such packages, it is evident that these software pieces are very large (millions of code lines) and extremely complex. Most of the packages in widespread use by the computational chemistry community have had a development history of decades. Given the rapid progress in the hardware and a lack of resources (time, workforce, money), it is not possible to keep redesigning these program packages from scratch in order to keep up with the ever more quickly shifting hardware landscape. In this perspective, some aspects of the multitude of challenges that the developer community faces are discussed. While the task at hand - to ensure that quantum chemical program packages can keep evolving and make best use of the available hardware - is daunting, there are also new evolving opportunities. The problems and potential cures are discussed with the example of the ORCA package that has been developed in our research group.
Collapse
Affiliation(s)
- Frank Neese
- Department of Molecular Theory and Spectroscopy, Kaiser-Wilhelm-Platz 1, D-45470 Mülheim an der Ruhr, Germany.
| |
Collapse
|
3
|
Kim I, Jeong D, Weisburn LP, Alexiu A, Van Voorhis T, Rhee YM, Son WJ, Kim HJ, Yim J, Kim S, Cho Y, Jang I, Lee S, Kim DS. Very-Large-Scale GPU-Accelerated Nuclear Gradient of Time-Dependent Density Functional Theory with Tamm-Dancoff Approximation and Range-Separated Hybrid Functionals. J Chem Theory Comput 2024; 20:9018-9031. [PMID: 39373529 DOI: 10.1021/acs.jctc.4c01003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/08/2024]
Abstract
Modern graphics processing units (GPUs) provide an unprecedented level of computing power. In this study, we present a high-performance, multi-GPU implementation of the analytical nuclear gradient for Kohn-Sham time-dependent density functional theory (TDDFT), employing the Tamm-Dancoff approximation (TDA) and Gaussian-type atomic orbitals as basis functions. We discuss GPU-efficient algorithms for the derivatives of electron repulsion integrals and exchange-correlation functionals within the range-separated scheme. As an illustrative example, we calculate the TDA-TDDFT gradient of the S1 state of a full-scale green fluorescent protein with explicit water solvent molecules, totaling 4353 atoms, at the ωB97X/def2-SVP level of theory. Our algorithm demonstrates favorable parallel efficiencies on a high-speed distributed system equipped with 256 Nvidia A100 GPUs, achieving >70% with up to 64 GPUs and 31% with 256 GPUs, effectively leveraging the capabilities of modern high-performance computing systems.
Collapse
Affiliation(s)
- Inkoo Kim
- Innovation Center, Samsung Electronics, Hwaseong 18448, Republic of Korea
- Department of Chemistry, Massachusetts Institute of Technology (MIT), Cambridge, Massachusetts 02139, United States
| | - Daun Jeong
- Innovation Center, Samsung Electronics, Hwaseong 18448, Republic of Korea
| | - Leah P Weisburn
- Department of Chemistry, Massachusetts Institute of Technology (MIT), Cambridge, Massachusetts 02139, United States
| | - Alexandra Alexiu
- Department of Chemistry, Massachusetts Institute of Technology (MIT), Cambridge, Massachusetts 02139, United States
| | - Troy Van Voorhis
- Department of Chemistry, Massachusetts Institute of Technology (MIT), Cambridge, Massachusetts 02139, United States
| | - Young Min Rhee
- Department of Chemistry, Korea Advanced Institute of Science and Technology (KAIST), Daejeon 34141, Republic of Korea
| | - Won-Joon Son
- Innovation Center, Samsung Electronics, Hwaseong 18448, Republic of Korea
| | - Hyung-Jin Kim
- Innovation Center, Samsung Electronics, Hwaseong 18448, Republic of Korea
| | - Jinkyu Yim
- Innovation Center, Samsung Electronics, Hwaseong 18448, Republic of Korea
| | - Sungmin Kim
- Samsung Advanced Institute of Technology, Samsung Electronics, Suwon 16678, Republic of Korea
| | - Yeonchoo Cho
- Samsung Advanced Institute of Technology, Samsung Electronics, Suwon 16678, Republic of Korea
| | - Inkook Jang
- Innovation Center, Samsung Electronics, Hwaseong 18448, Republic of Korea
| | - Seungmin Lee
- Innovation Center, Samsung Electronics, Hwaseong 18448, Republic of Korea
| | - Dae Sin Kim
- Innovation Center, Samsung Electronics, Hwaseong 18448, Republic of Korea
| |
Collapse
|
4
|
Bylaska EJ, Panyala A, Bauman NP, Peng B, Pathak H, Mejia-Rodriguez D, Govind N, Williams-Young DB, Aprà E, Bagusetty A, Mutlu E, Jackson KA, Baruah T, Yamamoto Y, Pederson MR, Withanage KPK, Pedroza-Montero JN, Bilbrey JA, Choudhury S, Firoz J, Herman KM, Xantheas SS, Rigor P, Vila FD, Rehr JJ, Fung M, Grofe A, Johnston C, Baker N, Kaneko K, Liu H, Kowalski K. Electronic structure simulations in the cloud computing environment. J Chem Phys 2024; 161:150902. [PMID: 39431777 DOI: 10.1063/5.0226437] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2024] [Accepted: 09/15/2024] [Indexed: 10/22/2024] Open
Abstract
The transformative impact of modern computational paradigms and technologies, such as high-performance computing (HPC), quantum computing, and cloud computing, has opened up profound new opportunities for scientific simulations. Scalable computational chemistry is one beneficiary of this technological progress. The main focus of this paper is on the performance of various quantum chemical formulations, ranging from low-order methods to high-accuracy approaches, implemented in different computational chemistry packages and libraries, such as NWChem, NWChemEx, Scalable Predictive Methods for Excitations and Correlated Phenomena, ExaChem, and Fermi-Löwdin orbital self-interaction correction on Azure Quantum Elements, Microsoft's cloud services platform for scientific discovery. We pay particular attention to the intricate workflows for performing complex chemistry simulations, associated data curation, and mechanisms for accuracy assessment, which is demonstrated with the Arrows automated workflow for high throughput simulations. Finally, we provide a perspective on the role of cloud computing in supporting the mission of leadership computational facilities.
Collapse
Affiliation(s)
- Eric J Bylaska
- Physical Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99354, USA
| | - Ajay Panyala
- Advanced Computing, Mathematics, and Data Division, Pacific Northwest National Laboratory, Richland, Washington 99354, USA
| | - Nicholas P Bauman
- Physical Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99354, USA
| | - Bo Peng
- Physical Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99354, USA
| | - Himadri Pathak
- Physical Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99354, USA
| | - Daniel Mejia-Rodriguez
- Physical Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99354, USA
| | - Niranjan Govind
- Physical Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99354, USA
| | - David B Williams-Young
- Applied Mathematics and Computational Research Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
| | - Edoardo Aprà
- Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, Washington 99352, USA
| | - Abhishek Bagusetty
- Argonne Leadership Computing Facility, Argonne National Laboratory, 9700 South Cass Avenue, Building 240, Argonne, Illinois 60439, USA
| | - Erdal Mutlu
- Advanced Computing, Mathematics, and Data Division, Pacific Northwest National Laboratory, Richland, Washington 99354, USA
| | - Koblar A Jackson
- Physics Department, Central Michigan University, Mt. Pleasant, Michigan 48859, USA
| | - Tunna Baruah
- Department of Physics, University of Texas at El Paso, El Paso, Texas 79968, USA
| | - Yoh Yamamoto
- Department of Physics, University of Texas at El Paso, El Paso, Texas 79968, USA
| | - Mark R Pederson
- Department of Physics, University of Texas at El Paso, El Paso, Texas 79968, USA
| | | | | | - Jenna A Bilbrey
- Artificial Intelligence and Data Analytics Division, Pacific Northwest National Laboratory, Richland, Washington 99352, USA
| | - Sutanay Choudhury
- Advanced Computing, Mathematics, and Data Division, Pacific Northwest National Laboratory, Richland, Washington 99354, USA
| | - Jesun Firoz
- Advanced Computing, Mathematics, and Data Division, Pacific Northwest National Laboratory, Richland, Washington 99354, USA
| | - Kristina M Herman
- Department of Chemistry, University of Washington, Seattle, Washington 98195, USA
| | - Sotiris S Xantheas
- Advanced Computing, Mathematics, and Data Division, Pacific Northwest National Laboratory, Richland, Washington 99354, USA
- Department of Chemistry, University of Washington, Seattle, Washington 98195, USA
| | - Paul Rigor
- Center for Cloud Computing, Pacific Northwest National Laboratory, Richland, Washington 99354, USA
| | - Fernando D Vila
- Department of Physics, University of Washington, Seattle, Washington 98195, USA
| | - John J Rehr
- Department of Physics, University of Washington, Seattle, Washington 98195, USA
| | - Mimi Fung
- Microsoft Azure Quantum, Redmond, Washington 98052, USA
| | - Adam Grofe
- Microsoft Azure Quantum, Redmond, Washington 98052, USA
| | | | - Nathan Baker
- Microsoft Azure Quantum, Redmond, Washington 98052, USA
| | - Ken Kaneko
- Microsoft Azure Quantum, Redmond, Washington 98052, USA
| | - Hongbin Liu
- Microsoft Azure Quantum, Redmond, Washington 98052, USA
| | - Karol Kowalski
- Physical Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99354, USA
| |
Collapse
|
5
|
Alkan M, Pham BQ, Del Angel Cruz D, Hammond JR, Barnes TA, Gordon MS. LibERI-A portable and performant multi-GPU accelerated library for electron repulsion integrals via OpenMP offloading and standard language parallelism. J Chem Phys 2024; 161:082501. [PMID: 39171700 DOI: 10.1063/5.0215352] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2024] [Accepted: 07/16/2024] [Indexed: 08/23/2024] Open
Abstract
A portable and performant graphics processing unit (GPU)-accelerated library for electron repulsion integral (ERI) evaluation, named LibERI, has been developed and implemented via directive-based (e.g., OpenMP and OpenACC) and standard language parallelism (e.g., Fortran DO CONCURRENT). Offloaded ERIs consist of integrals over low and high contraction s, p, and d functions using the rotated-axis and Rys quadrature methods. GPU codes are factorized based on previous developments [Pham et al., J. Chem. Theory Comput. 19(8), 2213-2221 (2023)] with two layers of integral screening and quartet presorting. In this work, the density screening is moved to the GPU to enhance the computational efficacy for large molecular systems. The L-shells in the Pople basis set are also separated into pure S and P shells to increase the ERI homogeneity and reduce atomic operations and the memory footprint. LibERI is compatible with any quantum chemistry drivers supporting the MolSSI Driver Interface. Benchmark calculations of LibERI interfaced with the GAMESS software package were carried out on various GPU architectures and molecular systems. The results show that the LibERI performance is comparable to other state-of-the-art GPU-accelerated codes (e.g., TeraChem and GMSHPC) and, in some cases, outperforms conventionally developed ERI CUDA kernels (e.g., QUICK) while fully maintaining portability.
Collapse
Affiliation(s)
- Melisa Alkan
- Department of Chemistry, Iowa State University and Ames National Laboratory, Ames, Iowa 50011, USA
- Department of Chemistry, Stanford University, Palo Alto, California 94305, USA
| | - Buu Q Pham
- Department of Chemistry, Iowa State University and Ames National Laboratory, Ames, Iowa 50011, USA
| | - Daniel Del Angel Cruz
- Department of Chemistry, Iowa State University and Ames National Laboratory, Ames, Iowa 50011, USA
| | | | - Taylor A Barnes
- Molecular Sciences Software Institute, Blacksburg, Virginia 24060, USA
| | - Mark S Gordon
- Department of Chemistry, Iowa State University and Ames National Laboratory, Ames, Iowa 50011, USA
| |
Collapse
|
6
|
Pederson JP, McDaniel JG. PyDFT-QMMM: A modular, extensible software framework for DFT-based QM/MM molecular dynamics. J Chem Phys 2024; 161:034103. [PMID: 39007371 DOI: 10.1063/5.0219851] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2024] [Accepted: 06/24/2024] [Indexed: 07/16/2024] Open
Abstract
PyDFT-QMMM is a Python-based package for performing hybrid quantum mechanics/molecular mechanics (QM/MM) simulations at the density functional level of theory. The program is designed to treat short-range and long-range interactions through user-specified combinations of electrostatic and mechanical embedding procedures within periodic simulation domains, providing necessary interfaces to external quantum chemistry and molecular dynamics software. To enable direct embedding of long-range electrostatics in periodic systems, we have derived and implemented force terms for our previously described QM/MM/PME approach [Pederson and McDaniel, J. Chem. Phys. 156, 174105 (2022)]. Communication with external software packages Psi4 and OpenMM is facilitated through Python application programming interfaces (APIs). The core library contains basic utilities for running QM/MM molecular dynamics simulations, and plug-in entry-points are provided for users to implement custom energy/force calculation and integration routines, within an extensible architecture. The user interacts with PyDFT-QMMM primarily through its Python API, allowing for complex workflow development with Python scripting, for example, interfacing with PLUMED for free energy simulations. We provide benchmarks of forces and energy conservation for the QM/MM/PME and alternative QM/MM electrostatic embedding approaches. We further demonstrate a simple example use case for water solute in a water solvent system, for which radial distribution functions are computed from 100 ps QM/MM simulations; in this example, we highlight how the solvation structure is sensitive to different basis-set choices due to under- or over-polarization of the QM water molecule's electron density.
Collapse
Affiliation(s)
- John P Pederson
- School of Chemistry and Biochemistry, Georgia Institute of Technology, Atlanta, Georgia 30332-0400, USA
| | - Jesse G McDaniel
- School of Chemistry and Biochemistry, Georgia Institute of Technology, Atlanta, Georgia 30332-0400, USA
| |
Collapse
|
7
|
Carrasco-Busturia D, Ippoliti E, Meloni S, Rothlisberger U, Olsen JMH. Multiscale biomolecular simulations in the exascale era. Curr Opin Struct Biol 2024; 86:102821. [PMID: 38688076 DOI: 10.1016/j.sbi.2024.102821] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2024] [Revised: 04/03/2024] [Accepted: 04/04/2024] [Indexed: 05/02/2024]
Abstract
The complexity of biological systems and processes, spanning molecular to macroscopic scales, necessitates the use of multiscale simulations to get a comprehensive understanding. Quantum mechanics/molecular mechanics (QM/MM) molecular dynamics (MD) simulations are crucial for capturing processes beyond the reach of classical MD simulations. The advent of exascale computing offers unprecedented opportunities for scientific exploration, not least within life sciences, where simulations are essential to unravel intricate molecular mechanisms underlying biological processes. However, leveraging the immense computational power of exascale computing requires innovative algorithms and software designs. In this context, we discuss the current status and future prospects of multiscale biomolecular simulations on exascale supercomputers with a focus on QM/MM MD. We highlight our own efforts in developing a versatile and high-performance multiscale simulation framework with the aim of efficient utilization of state-of-the-art supercomputers. We showcase its application in uncovering complex biological mechanisms and its potential for leveraging exascale computing.
Collapse
Affiliation(s)
- David Carrasco-Busturia
- DTU Chemistry, Technical University of Denmark (DTU), Kongens Lyngby, DK-2800, Denmark. https://twitter.com/@DavidCdeB
| | - Emiliano Ippoliti
- Computational Biomedicine, Institute of Advanced Simulations IAS-5/Institute for Neuroscience and Medicine INM-9, Forschungszentrum Jülich GmbH, Jülich, DE-52428, Germany
| | - Simone Meloni
- Dipartimento di Scienze Chimiche, Farmaceutiche ed Agrarie (DOCPAS), Università degli Studi di Ferrara (Unife), Ferrara, I-44121, Italy. https://twitter.com/@smeloni99
| | - Ursula Rothlisberger
- Laboratory of Computational Chemistry and Biochemistry, Institute of Chemical Sciences and Engineering, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, CH-1015, Switzerland. https://twitter.com/@lcbc_epfl
| | | |
Collapse
|
8
|
Pracht P, Grimme S, Bannwarth C, Bohle F, Ehlert S, Feldmann G, Gorges J, Müller M, Neudecker T, Plett C, Spicher S, Steinbach P, Wesołowski PA, Zeller F. CREST-A program for the exploration of low-energy molecular chemical space. J Chem Phys 2024; 160:114110. [PMID: 38511658 DOI: 10.1063/5.0197592] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2024] [Accepted: 02/29/2024] [Indexed: 03/22/2024] Open
Abstract
Conformer-rotamer sampling tool (CREST) is an open-source program for the efficient and automated exploration of molecular chemical space. Originally developed in Pracht et al. [Phys. Chem. Chem. Phys. 22, 7169 (2020)] as an automated driver for calculations at the extended tight-binding level (xTB), it offers a variety of molecular- and metadynamics simulations, geometry optimization, and molecular structure analysis capabilities. Implemented algorithms include automated procedures for conformational sampling, explicit solvation studies, the calculation of absolute molecular entropy, and the identification of molecular protonation and deprotonation sites. Calculations are set up to run concurrently, providing efficient single-node parallelization. CREST is designed to require minimal user input and comes with an implementation of the GFNn-xTB Hamiltonians and the GFN-FF force-field. Furthermore, interfaces to any quantum chemistry and force-field software can easily be created. In this article, we present recent developments in the CREST code and show a selection of applications for the most important features of the program. An important novelty is the refactored calculation backend, which provides significant speed-up for sampling of small or medium-sized drug molecules and allows for more sophisticated setups, for example, quantum mechanics/molecular mechanics and minimum energy crossing point calculations.
Collapse
Affiliation(s)
- Philipp Pracht
- Yusuf Hamied Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom
| | - Stefan Grimme
- Mulliken Center for Theoretical Chemistry, Institute for Physical and Theoretical Chemistry, University of Bonn, Beringstr. 4, 53115 Bonn, Germany
| | - Christoph Bannwarth
- Institute for Physical Chemistry, RWTH Aachen University, Melatener Str. 20, 52056 Aachen, Germany
| | - Fabian Bohle
- Mulliken Center for Theoretical Chemistry, Institute for Physical and Theoretical Chemistry, University of Bonn, Beringstr. 4, 53115 Bonn, Germany
| | - Sebastian Ehlert
- AI4Science, Microsoft Research, Evert van de Beekstraat 354, 1118 CZ Schiphol, The Netherlands
| | - Gereon Feldmann
- Institute for Physical Chemistry, RWTH Aachen University, Melatener Str. 20, 52056 Aachen, Germany
| | - Johannes Gorges
- Mulliken Center for Theoretical Chemistry, Institute for Physical and Theoretical Chemistry, University of Bonn, Beringstr. 4, 53115 Bonn, Germany
| | - Marcel Müller
- Mulliken Center for Theoretical Chemistry, Institute for Physical and Theoretical Chemistry, University of Bonn, Beringstr. 4, 53115 Bonn, Germany
| | - Tim Neudecker
- Institute for Physical and Theoretical Chemistry, University of Bremen, 28359 Bremen, Germany
| | - Christoph Plett
- Mulliken Center for Theoretical Chemistry, Institute for Physical and Theoretical Chemistry, University of Bonn, Beringstr. 4, 53115 Bonn, Germany
| | | | - Pit Steinbach
- Institute for Physical Chemistry, RWTH Aachen University, Melatener Str. 20, 52056 Aachen, Germany
| | - Patryk A Wesołowski
- Yusuf Hamied Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom
| | - Felix Zeller
- Institute for Physical and Theoretical Chemistry, University of Bremen, 28359 Bremen, Germany
| |
Collapse
|
9
|
Dral PO. AI in computational chemistry through the lens of a decade-long journey. Chem Commun (Camb) 2024; 60:3240-3258. [PMID: 38444290 DOI: 10.1039/d4cc00010b] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/07/2024]
Abstract
This article gives a perspective on the progress of AI tools in computational chemistry through the lens of the author's decade-long contributions put in the wider context of the trends in this rapidly expanding field. This progress over the last decade is tremendous: while a decade ago we had a glimpse of what was to come through many proof-of-concept studies, now we witness the emergence of many AI-based computational chemistry tools that are mature enough to make faster and more accurate simulations increasingly routine. Such simulations in turn allow us to validate and even revise experimental results, deepen our understanding of the physicochemical processes in nature, and design better materials, devices, and drugs. The rapid introduction of powerful AI tools gives rise to unique challenges and opportunities that are discussed in this article too.
Collapse
Affiliation(s)
- Pavlo O Dral
- State Key Laboratory of Physical Chemistry of Solid Surfaces, College of Chemistry and Chemical Engineering, Fujian Provincial Key Laboratory of Theoretical and Computational Chemistry, and Innovation Laboratory for Sciences and Technologies of Energy Materials of Fujian Province (IKKEM), Xiamen University, Xiamen, Fujian 361005, China.
| |
Collapse
|
10
|
Crandall Z, Windus TL, Richard RM. CMaize: Simplifying inter-package modularity from the build up. J Chem Phys 2024; 160:092502. [PMID: 38445730 DOI: 10.1063/5.0196384] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2024] [Accepted: 02/18/2024] [Indexed: 03/07/2024] Open
Abstract
There is a growing desire for inter-package modularity within the chemistry software community to reuse encapsulated code units across a variety of software packages. Most comprehensive efforts at achieving inter-package modularity will quickly run afoul of a very practical problem, being able to cohesively build the modules. Writing and maintaining build systems has long been an issue for many scientific software packages that rely on compiled languages such as C/C++. The push for inter-package modularity compounds this issue by additionally requiring binary artifacts from disparate developers to interoperate at a binary level. Thankfully, the de facto build tool for C/C++, CMake, is more than capable of supporting the myriad of edge cases that complicate writing robust build systems. Unfortunately, writing and maintaining a robust CMake build system can be a laborious endeavor because CMake provides few abstractions to aid the developer. The need to significantly simplify the process of writing robust CMake-based build systems, especially in inter-package builds, motivated us to write CMaize. In addition to describing the architecture and design of CMaize, the article also demonstrates how CMaize is used in production-level software.
Collapse
Affiliation(s)
- Zachery Crandall
- Chemical and Biological Sciences, Ames National Laboratory, Ames, Iowa 50011, USA
- Department of Chemistry, Iowa State University, Ames, Iowa 50011, USA
| | - Theresa L Windus
- Chemical and Biological Sciences, Ames National Laboratory, Ames, Iowa 50011, USA
- Department of Chemistry, Iowa State University, Ames, Iowa 50011, USA
| | - Ryan M Richard
- Chemical and Biological Sciences, Ames National Laboratory, Ames, Iowa 50011, USA
- Department of Chemistry, Iowa State University, Ames, Iowa 50011, USA
| |
Collapse
|
11
|
Marx V. Collaborate, disseminate, accelerate. Nat Methods 2024; 21:372-376. [PMID: 38413712 DOI: 10.1038/s41592-024-02202-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/29/2024]
|
12
|
Lehtola S. A call to arms: Making the case for more reusable libraries. J Chem Phys 2023; 159:180901. [PMID: 37947507 DOI: 10.1063/5.0175165] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2023] [Accepted: 10/23/2023] [Indexed: 11/12/2023] Open
Abstract
The traditional foundation of science lies on the cornerstones of theory and experiment. Theory is used to explain experiment, which in turn guides the development of theory. Since the advent of computers and the development of computational algorithms, computation has risen as the third cornerstone of science, joining theory and experiment on an equal footing. Computation has become an essential part of modern science, amending experiment by enabling accurate comparison of complicated theories to sophisticated experiments, as well as guiding by triage both the design and targets of experiments and the development of novel theories and computational methods. Like experiment, computation relies on continued investment in infrastructure: it requires both hardware (the physical computer on which the calculation is run) as well as software (the source code of the programs that performs the wanted simulations). In this Perspective, I discuss present-day challenges on the software side in computational chemistry, which arise from the fast-paced development of algorithms, programming models, as well as hardware. I argue that many of these challenges could be solved with reusable open source libraries, which are a public good, enhance the reproducibility of science, and accelerate the development and availability of state-of-the-art methods and improved software.
Collapse
Affiliation(s)
- Susi Lehtola
- Department of Chemistry, University of Helsinki, P.O. Box 55, FI-00014 Helsinki, Finland
| |
Collapse
|