1
|
Sturm PO, Silva SJ. A Nudge to the Truth: Atom Conservation as a Hard Constraint in Models of Atmospheric Composition Using a Species-Weighted Correction. ACS ES&T AIR 2025; 2:99-108. [PMID: 39817258 PMCID: PMC11730974 DOI: 10.1021/acsestair.4c00220] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/28/2024] [Revised: 11/14/2024] [Accepted: 11/15/2024] [Indexed: 01/18/2025]
Abstract
Computational models of atmospheric composition are not always physically consistent. For example, not all models respect fundamental conservation laws such as conservation of atoms in an interconnected chemical system. In well performing models, these unphysical deviations are often ignored because they are frequently minor, and thus only need a small nudge to perfectly conserve mass. Here we introduce a method that anchors a prediction from any numerical model to physically consistent hard constraints, nudging concentrations to the nearest solution that respects the conservation laws. This closed-form model-agnostic correction uses a single matrix operation to minimally perturb the predicted concentrations to ensure that atoms are conserved to machine precision. To demonstrate this approach, we train a gradient boosting decision tree ensemble to emulate a small reference model of ozone photochemistry and test the effect of the correction on accurate but nonconservative predictions. The nudging approach minimally perturbs the already well-predicted results for most species, but decreases the accuracy of important oxidants, including radicals. We develop a weighted extension of this nudging approach that considers the uncertainty and magnitude of each species in the correction. This species-level weighting approach is essential to accurately predict important low concentration species such as radicals. We find that applying the species-weighted correction slightly improves overall accuracy by nudging unphysical predictions to a more likely mass-conserving solution.
Collapse
Affiliation(s)
- Patrick Obin Sturm
- Department
of Earth Sciences, University of Southern
California, Los Angeles, California 90089, United States
| | - Sam J. Silva
- Department
of Earth Sciences, University of Southern
California, Los Angeles, California 90089, United States
- Department
of Environmental Engineering, University
of Southern California, Los Angeles, California 90089, United States
- Department
of Population and Public Health Sciences, University of Southern California, Los Angeles, California 90032, United States
| |
Collapse
|
2
|
Raguram ER, Dahl JC, Jensen KF, Buchwald SL. Kinetic Modeling Enables Understanding of Off-Cycle Processes in Pd-Catalyzed Amination of Five-Membered Heteroaryl Halides. J Am Chem Soc 2024. [PMID: 39566015 DOI: 10.1021/jacs.4c10488] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2024]
Abstract
The mechanism of Pd-catalyzed amination of five-membered heteroaryl halides was investigated by integrating experimental kinetic analysis with kinetic modeling through predictive testing and likelihood ratio analysis, revealing an atypical productive coupling pathway and multiple off-cycle events. The GPhos-supported Pd catalyst, along with the moderate-strength base NaOTMS, was previously found to promote efficient coupling between five-membered heteroaryl halides and secondary amines. However, slight deviations from the optimal concentration, temperature, and/or solvent resulted in significantly lower yields, contrary to typical reaction optimization trends. We found that the coupling of 4-bromothiazole with piperidine proceeds through an uncommon mechanism in which the NaOTMS base, rather than the amine, binds first to the oxidative addition complex; the resulting OTMS-bound Pd species is the resting state. Formation of the Pd-amido complex via base/amine exchange was identified as the turnover-limiting step, unlike other reported catalyst systems for which reductive elimination is turnover-limiting. We determined that the amine-bound Pd complex, usually an on-cycle intermediate, is instead a reversibly generated off-cycle species, and that base-mediated decomposition of 4-bromothiazole is the primary irreversible catalyst deactivation pathway. Predictive testing and kinetic modeling were key to the identification of these off-cycle processes, providing insight into minor mechanistic pathways that are difficult to observe experimentally. Collectively, this report reveals the unique enabling features of the Pd-GPhos/NaOTMS system, implementing mechanistic insights to improve the yields of particularly challenging coupling reactions. Moreover, these findings highlight the utility of applying predictive tests to kinetic models for the rapid evaluation of mechanistic possibilities in small-molecule catalytic systems.
Collapse
Affiliation(s)
- Elaine Reichert Raguram
- Department of Chemistry, Massachusetts Institute of Technology, 77 Massachusetts Ave, Cambridge, Massachusetts 02139, United States
| | - Jakob C Dahl
- Department of Chemistry, Massachusetts Institute of Technology, 77 Massachusetts Ave, Cambridge, Massachusetts 02139, United States
- Department of Chemical Engineering, Massachusetts Institute of Technology, 77 Massachusetts Ave, Cambridge, Massachusetts 02139, United States
| | - Klavs F Jensen
- Department of Chemical Engineering, Massachusetts Institute of Technology, 77 Massachusetts Ave, Cambridge, Massachusetts 02139, United States
| | - Stephen L Buchwald
- Department of Chemistry, Massachusetts Institute of Technology, 77 Massachusetts Ave, Cambridge, Massachusetts 02139, United States
| |
Collapse
|
3
|
Avram F, Adenane R, Neagu M. Advancing Mathematical Epidemiology and Chemical Reaction Network Theory via Synergies Between Them. ENTROPY (BASEL, SWITZERLAND) 2024; 26:936. [PMID: 39593882 PMCID: PMC11592501 DOI: 10.3390/e26110936] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/01/2024] [Revised: 10/25/2024] [Accepted: 10/28/2024] [Indexed: 11/28/2024]
Abstract
Our paper reviews some key concepts in chemical reaction network theory and mathematical epidemiology, and examines their intersection, with three goals. The first is to make the case that mathematical epidemiology (ME), and also related sciences like population dynamics, virology, ecology, etc., could benefit by adopting the universal language of essentially non-negative kinetic systems as developed by chemical reaction network (CRN) researchers. In this direction, our investigation of the relations between CRN and ME lead us to propose for the first time a definition of ME models, stated in Open Problem 1. Our second goal is to inform researchers outside ME of the convenient next generation matrix (NGM) approach for studying the stability of boundary points, which do not seem sufficiently well known. Last but not least, we want to help students and researchers who know nothing about either ME or CRN to learn them quickly, by offering them a Mathematica package "bootcamp", including illustrating notebooks (and certain sections below will contain associated suggested notebooks; however, readers with experience may safely skip the bootcamp). We hope that the files indicated in the titles of various sections will be helpful, though of course improvement is always possible, and we ask the help of the readers for that.
Collapse
Affiliation(s)
- Florin Avram
- Laboratoire de Mathématiques Appliquées, Université de Pau, 64000 Pau, France
| | - Rim Adenane
- Département des Mathématiques, Faculté des Sciences, Université Ibn-Tofail, 14000 Kenitra, Morocco;
| | - Mircea Neagu
- Department of Mathematics and Computer Science, Transilvania University of Braşov, 500091 Braşov, Romania;
| |
Collapse
|
4
|
Bourdais T, Batlle P, Yang X, Baptista R, Rouquette N, Owhadi H. Codiscovering graphical structure and functional relationships within data: A Gaussian Process framework for connecting the dots. Proc Natl Acad Sci U S A 2024; 121:e2403449121. [PMID: 39088394 PMCID: PMC11317615 DOI: 10.1073/pnas.2403449121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2024] [Accepted: 06/26/2024] [Indexed: 08/03/2024] Open
Abstract
Most problems within and beyond the scientific domain can be framed into one of the following three levels of complexity of function approximation. Type 1: Approximate an unknown function given input/output data. Type 2: Consider a collection of variables and functions, some of which are unknown, indexed by the nodes and hyperedges of a hypergraph (a generalized graph where edges can connect more than two vertices). Given partial observations of the variables of the hypergraph (satisfying the functional dependencies imposed by its structure), approximate all the unobserved variables and unknown functions. Type 3: Expanding on Type 2, if the hypergraph structure itself is unknown, use partial observations of the variables of the hypergraph to discover its structure and approximate its unknown functions. These hypergraphs offer a natural platform for organizing, communicating, and processing computational knowledge. While most scientific problems can be framed as the data-driven discovery of unknown functions in a computational hypergraph whose structure is known (Type 2), many require the data-driven discovery of the structure (connectivity) of the hypergraph itself (Type 3). We introduce an interpretable Gaussian Process (GP) framework for such (Type 3) problems that does not require randomization of the data, access to or control over its sampling, or sparsity of the unknown functions in a known or learned basis. Its polynomial complexity, which contrasts sharply with the super-exponential complexity of causal inference methods, is enabled by the nonlinear ANOVA capabilities of GPs used as a sensing mechanism.
Collapse
Affiliation(s)
- Théo Bourdais
- Computing and Mathematical Sciences, California Institute of Technology, Pasadena, CA91125
| | - Pau Batlle
- Computing and Mathematical Sciences, California Institute of Technology, Pasadena, CA91125
| | - Xianjin Yang
- Computing and Mathematical Sciences, California Institute of Technology, Pasadena, CA91125
| | - Ricardo Baptista
- Computing and Mathematical Sciences, California Institute of Technology, Pasadena, CA91125
| | - Nicolas Rouquette
- Jet Propulsion Laboratory, California Institute of Technology, Pasadena, CA91109
| | - Houman Owhadi
- Computing and Mathematical Sciences, California Institute of Technology, Pasadena, CA91125
| |
Collapse
|
5
|
Gilkes J, Storr MT, Maurer RJ, Habershon S. Predicting Long-Time-Scale Kinetics under Variable Experimental Conditions with Kinetica.jl. J Chem Theory Comput 2024; 20:5196-5214. [PMID: 38829777 PMCID: PMC11209948 DOI: 10.1021/acs.jctc.4c00333] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2024] [Revised: 05/13/2024] [Accepted: 05/13/2024] [Indexed: 06/05/2024]
Abstract
Predicting the degradation processes of molecules over long time scales is a key aspect of industrial materials design. However, it is made computationally challenging by the need to construct large networks of chemical reactions that are relevant to the experimental conditions that kinetic models must mirror, with every reaction requiring accurate kinetic data. Here, we showcase Kinetica.jl, a new software package for constructing large-scale chemical reaction networks in a fully automated fashion by exploring chemical reaction space with a kinetics-driven algorithm; coupled to efficient machine-learning models of activation energies for sampled elementary reactions, we show how this approach readily enables generation and kinetic characterization of networks containing ∼103 chemical species and ≃104-105 reactions. Symbolic-numeric modeling of the generated reaction networks is used to allow for flexible, efficient computation of kinetic profiles under experimentally realizable conditions such as continuously variable temperature regimes, enabling direct connection between bottom-up reaction networks and experimental observations. Highly efficient propagation of long-time-scale kinetic profiles is required for automated reaction network refinement and is enabled here by a new discrete kinetic approximation. The resulting Kinetica.jl simulation package therefore enables automated generation, characterization, and long-time-scale modeling of complex chemical reaction systems. We demonstrate this for hydrocarbon pyrolysis simulated over time scales of seconds, using transient temperature profiles representing those of tubular flow reactor experiments.
Collapse
Affiliation(s)
- Joe Gilkes
- Department
of Chemistry, University of Warwick, Gibbet Hill Road, CV4 7AL Coventry, U.K.
- EPSRC
HetSys Centre for Doctoral Training, University
of Warwick, Gibbet Hill
Rd, CV4 7AL Coventry, U.K.
| | | | - Reinhard J. Maurer
- Department
of Chemistry, University of Warwick, Gibbet Hill Road, CV4 7AL Coventry, U.K.
- Department
of Physics, University of Warwick, Gibbet Hill Road, CV4 7AL Coventry, U.K.
| | - Scott Habershon
- Department
of Chemistry, University of Warwick, Gibbet Hill Road, CV4 7AL Coventry, U.K.
| |
Collapse
|
6
|
Piho P, Thomas P. Feedback between stochastic gene networks and population dynamics enables cellular decision-making. SCIENCE ADVANCES 2024; 10:eadl4895. [PMID: 38787956 PMCID: PMC11122677 DOI: 10.1126/sciadv.adl4895] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/20/2023] [Accepted: 04/24/2024] [Indexed: 05/26/2024]
Abstract
Phenotypic selection occurs when genetically identical cells are subject to different reproductive abilities due to cellular noise. Such noise arises from fluctuations in reactions synthesizing proteins and plays a crucial role in how cells make decisions and respond to stress or drugs. We propose a general stochastic agent-based model for growing populations capturing the feedback between gene expression and cell division dynamics. We devise a finite state projection approach to analyze gene expression and division distributions and infer selection from single-cell data in mother machines and lineage trees. We use the theory to quantify selection in multi-stable gene expression networks and elucidate that the trade-off between phenotypic switching and selection enables robust decision-making essential for synthetic circuits and developmental lineage decisions. Using live-cell data, we demonstrate that combining theory and inference provides quantitative insights into bet-hedging-like response to DNA damage and adaptation during antibiotic exposure in Escherichia coli.
Collapse
Affiliation(s)
- Paul Piho
- Department of Mathematics, Imperial College London, London, UK
| | | |
Collapse
|
7
|
de Wit XM, Fruchart M, Khain T, Toschi F, Vitelli V. Pattern formation by turbulent cascades. Nature 2024; 627:515-521. [PMID: 38509279 PMCID: PMC10954557 DOI: 10.1038/s41586-024-07074-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2023] [Accepted: 01/15/2024] [Indexed: 03/22/2024]
Abstract
Fully developed turbulence is a universal and scale-invariant chaotic state characterized by an energy cascade from large to small scales at which the cascade is eventually arrested by dissipation1-6. Here we show how to harness these seemingly structureless turbulent cascades to generate patterns. Pattern formation entails a process of wavelength selection, which can usually be traced to the linear instability of a homogeneous state7. By contrast, the mechanism we propose here is fully nonlinear. It is triggered by the non-dissipative arrest of turbulent cascades: energy piles up at an intermediate scale, which is neither the system size nor the smallest scales at which energy is usually dissipated. Using a combination of theory and large-scale simulations, we show that the tunable wavelength of these cascade-induced patterns can be set by a non-dissipative transport coefficient called odd viscosity, ubiquitous in chiral fluids ranging from bioactive to quantum systems8-12. Odd viscosity, which acts as a scale-dependent Coriolis-like force, leads to a two-dimensionalization of the flow at small scales, in contrast with rotating fluids in which a two-dimensionalization occurs at large scales4. Apart from odd viscosity fluids, we discuss how cascade-induced patterns can arise in natural systems, including atmospheric flows13-19, stellar plasma such as the solar wind20-22, or the pulverization and coagulation of objects or droplets in which mass rather than energy cascades23-25.
Collapse
Affiliation(s)
- Xander M de Wit
- Department of Applied Physics and Science Education, Eindhoven University of Technology, Eindhoven, The Netherlands
| | - Michel Fruchart
- Gulliver, ESPCI Paris, Université PSL, CNRS, Paris, France
- James Franck Institute, The University of Chicago, Chicago, IL, USA
| | - Tali Khain
- James Franck Institute, The University of Chicago, Chicago, IL, USA
| | - Federico Toschi
- Department of Applied Physics and Science Education, Eindhoven University of Technology, Eindhoven, The Netherlands.
- CNR-IAC, Rome, Italy.
| | - Vincenzo Vitelli
- James Franck Institute, The University of Chicago, Chicago, IL, USA.
- Kadanoff Center for Theoretical Physics, The University of Chicago, Chicago, IL, USA.
| |
Collapse
|
8
|
Lang PF, Jain A, Rackauckas C. SBMLToolkit.jl: a Julia package for importing SBML into the SciML ecosystem. J Integr Bioinform 2024; 21:jib-2024-0003. [PMID: 38801698 PMCID: PMC11294517 DOI: 10.1515/jib-2024-0003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2024] [Accepted: 03/21/2024] [Indexed: 05/29/2024] Open
Abstract
Julia is a general purpose programming language that was designed for simplifying and accelerating numerical analysis and computational science. In particular the Scientific Machine Learning (SciML) ecosystem of Julia packages includes frameworks for high-performance symbolic-numeric computations. It allows users to automatically enhance high-level descriptions of their models with symbolic preprocessing and automatic sparsification and parallelization of computations. This enables performant solution of differential equations, efficient parameter estimation and methodologies for automated model discovery with neural differential equations and sparse identification of nonlinear dynamics. To give the systems biology community easy access to SciML, we developed SBMLToolkit.jl. SBMLToolkit.jl imports dynamic SBML models into the SciML ecosystem to accelerate model simulation and fitting of kinetic parameters. By providing computational systems biologists with easy access to the open-source Julia ecosystevnm, we hope to catalyze the development of further Julia tools in this domain and the growth of the Julia bioscience community. SBMLToolkit.jl is freely available under the MIT license. The source code is available at https://github.com/SciML/SBMLToolkit.jl.
Collapse
Affiliation(s)
| | | | - Christopher Rackauckas
- JuliaHub, Boston, USA
- Computer Science and Artificial Intelligence Laboratory (CSAIL), Massachusetts Institute of Technology, Boston, USA
| |
Collapse
|