1
|
Ahmad F, Muhmood T. Clinical translation of nanomedicine with integrated digital medicine and machine learning interventions. Colloids Surf B Biointerfaces 2024; 241:114041. [PMID: 38897022 DOI: 10.1016/j.colsurfb.2024.114041] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2024] [Revised: 06/11/2024] [Accepted: 06/13/2024] [Indexed: 06/21/2024]
Abstract
Nanomaterials based therapeutics transform the ways of disease prevention, diagnosis and treatment with increasing sophistications in nanotechnology at a breakneck pace, but very few could reach to the clinic due to inconsistencies in preclinical studies followed by regulatory hinderances. To tackle this, integrating the nanomedicine discovery with digital medicine provide technologies as tools of specific biological activity measurement. Hence, overcome the redundancies in nanomedicine discovery by the on-site data acquisition and analytics through integrating intelligent sensors and artificial intelligence (AI) or machine learning (ML). Integrated AI/ML wearable sensors directly gather clinically relevant biochemical information from the subject's body and process data for physicians to make right clinical decision(s) in a time and cost-effective way. This review summarizes insights and recommend the infusion of actionable big data computation enabled sensors in burgeoning field of nanomedicine at academia, research institutes, and pharmaceutical industries, with a potential of clinical translation. Furthermore, many blind spots are present in modern clinically relevant computation, one of which could prevent ML-guided low-cost new nanomedicine development from being successfully translated into the clinic was also discussed.
Collapse
Affiliation(s)
- Farooq Ahmad
- State Key Laboratory of Chemistry and Utilization of Carbon Based Energy Resources, College of Chemistry, Xinjiang University, Urumqi 830017, China.
| | - Tahir Muhmood
- International Iberian Nanotechnology Laboratory (INL), Avenida Mestre José Veiga, Braga 4715-330, Portugal.
| |
Collapse
|
2
|
Christiansen R, Pfister N, Jakobsen ME, Gnecco N, Peters J. A Causal Framework for Distribution Generalization. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2022; 44:6614-6630. [PMID: 34232865 DOI: 10.1109/tpami.2021.3094760] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
We consider the problem of predicting a response Y from a set of covariates X when test- and training distributions differ. Since such differences may have causal explanations, we consider test distributions that emerge from interventions in a structural causal model, and focus on minimizing the worst-case risk. Causal regression models, which regress the response on its direct causes, remain unchanged under arbitrary interventions on the covariates, but they are not always optimal in the above sense. For example, for linear models and bounded interventions, alternative solutions have been shown to be minimax prediction optimal. We introduce the formal framework of distribution generalization that allows us to analyze the above problem in partially observed nonlinear models for both direct interventions on X and interventions that occur indirectly via exogenous variables A. It takes into account that, in practice, minimax solutions need to be identified from data. Our framework allows us to characterize under which class of interventions the causal function is minimax optimal. We prove sufficient conditions for distribution generalization and present corresponding impossibility results. We propose a practical method, NILE, that achieves distribution generalization in a nonlinear IV setting with linear extrapolation. We prove consistency and present empirical results.
Collapse
|
3
|
Song C, Simmons BI, Fortin MJ, Gonzalez A. Generalism drives abundance: A computational causal discovery approach. PLoS Comput Biol 2022; 18:e1010302. [PMID: 36173959 PMCID: PMC9521805 DOI: 10.1371/journal.pcbi.1010302] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2022] [Accepted: 06/14/2022] [Indexed: 11/29/2022] Open
Abstract
A ubiquitous pattern in ecological systems is that more abundant species tend to be more generalist; that is, they interact with more species or can occur in wider range of habitats. However, there is no consensus on whether generalism drives abundance (a selection process) or abundance drives generalism (a drift process). As it is difficult to conduct direct experiments to solve this chicken-and-egg dilemma, previous studies have used a causal discovery method based on formal logic and have found that abundance drives generalism. Here, we refine this method by correcting its bias regarding skewed distributions, and employ two other independent causal discovery methods based on nonparametric regression and on information theory, respectively. Contrary to previous work, all three independent methods strongly indicate that generalism drives abundance when applied to datasets on plant-hummingbird communities and reef fishes. Furthermore, we find that selection processes are more important than drift processes in structuring multispecies systems when the environment is variable. Our results showcase the power of the computational causal discovery approach to aid ecological research. Ever since Aristotle, the chicken-or-egg causality dilemma has baffled researchers. Such causality dilemmas are abundant in ecological research, where causal directions are often assumed but not tested. An archetypal example is whether being a generalist causes a species to be more abundant, or whether being more abundant causes a species to be generalists. Without doubt, the gold standard to establish causal directions is controlled experiments. However, controlled experiments that can disentangle the direction of causality in this case are challenging because it involves controlling biotic or abiotic niche breadth. These challenges create an opportunity for computational tools to detect the most likely causal direction. Here, by adapting a set of recently developed computational methods, we provide strong evidence that generalism drives abundance, overturning the previously established direction. We hope our work raises awareness of the potential for computational discovery methods to address long-standing questions in ecology, especially increasingly large datasets become available.
Collapse
Affiliation(s)
- Chuliang Song
- Department of Biology, Quebec Centre for Biodiversity Science, McGill University, Montreal, Canada
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, Canada
- * E-mail:
| | - Benno I. Simmons
- Department of Ecology and Conservation, University of Exeter, Cornwall Campus, Penryn, United Kingdom
| | - Marie-Josée Fortin
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, Canada
| | - Andrew Gonzalez
- Department of Biology, Quebec Centre for Biodiversity Science, McGill University, Montreal, Canada
| |
Collapse
|
4
|
Wu Y, Judge MT, Edison AS, Arnold J. Uncovering in vivo biochemical patterns from time-series metabolic dynamics. PLoS One 2022; 17:e0268394. [PMID: 35550643 PMCID: PMC9098013 DOI: 10.1371/journal.pone.0268394] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2021] [Accepted: 04/28/2022] [Indexed: 11/19/2022] Open
Abstract
System biology relies on holistic biomolecule measurements, and untangling biochemical networks requires time-series metabolomics profiling. With current metabolomic approaches, time-series measurements can be taken for hundreds of metabolic features, which decode underlying metabolic regulation. Such a metabolomic dataset is untargeted with most features unannotated and inaccessible to statistical analysis and computational modeling. The high dimensionality of the metabolic space also causes mechanistic modeling to be rather cumbersome computationally. We implemented a faster exploratory workflow to visualize and extract chemical and biochemical dependencies. Time-series metabolic features (about 300 for each dataset) were extracted by Ridge Tracking-based Extract (RTExtract) on measurements from continuous in vivo monitoring of metabolism by NMR (CIVM-NMR) in Neurospora crassa under different conditions. The metabolic profiles were then smoothed and projected into lower dimensions, enabling a comparison of metabolic trends in the cultures. Next, we expanded incomplete metabolite annotation using a correlation network. Lastly, we uncovered meaningful metabolic clusters by estimating dependencies between smoothed metabolic profiles. We thus sidestepped the processes of time-consuming mechanistic modeling, difficult global optimization, and labor-intensive annotation. Multiple clusters guided insights into central energy metabolism and membrane synthesis. Dense connections with glucose 1-phosphate indicated its central position in metabolism in N. crassa. Our approach was benchmarked on simulated random network dynamics and provides a novel exploratory approach to analyzing high-dimensional metabolic dynamics.
Collapse
Affiliation(s)
- Yue Wu
- Institute of Bioinformatics, University of Georgia, Athens, GA, United States of America
| | - Michael T. Judge
- Department of Genetics, University of Georgia, Athens, GA, United States of America
| | - Arthur S. Edison
- Institute of Bioinformatics, University of Georgia, Athens, GA, United States of America
- Department of Genetics, University of Georgia, Athens, GA, United States of America
- Complex Carbohydrate Research Center, University of Georgia, Athens, GA, United States of America
- Department of Biochemistry and Molecular Biology, University of Georgia, Athens, GA, United States of America
- * E-mail: (ASE); (JA)
| | - Jonathan Arnold
- Institute of Bioinformatics, University of Georgia, Athens, GA, United States of America
- Department of Genetics, University of Georgia, Athens, GA, United States of America
- Department of Statistics, University of Georgia, Athens, GA, United States of America
- Department of Physics and Astronomy, University of Georgia, Athens, GA, United States of America
- * E-mail: (ASE); (JA)
| |
Collapse
|
5
|
Serov N, Vinogradov V. Artificial intelligence to bring nanomedicine to life. Adv Drug Deliv Rev 2022; 184:114194. [PMID: 35283223 DOI: 10.1016/j.addr.2022.114194] [Citation(s) in RCA: 27] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2021] [Revised: 03/04/2022] [Accepted: 03/07/2022] [Indexed: 12/13/2022]
Abstract
The technology of drug delivery systems (DDSs) has demonstrated an outstanding performance and effectiveness in production of pharmaceuticals, as it is proved by many FDA-approved nanomedicines that have an enhanced selectivity, manageable drug release kinetics and synergistic therapeutic actions. Nonetheless, to date, the rational design and high-throughput development of nanomaterial-based DDSs for specific purposes is far from a routine practice and is still in its infancy, mainly due to the limitations in scientists' capabilities to effectively acquire, analyze, manage, and comprehend complex and ever-growing sets of experimental data, which is vital to develop DDSs with a set of desired functionalities. At the same time, this task is feasible for the data-driven approaches, high throughput experimentation techniques, process automatization, artificial intelligence (AI) technology, and machine learning (ML) approaches, which is referred to as The Fourth Paradigm of scientific research. Therefore, an integration of these approaches with nanomedicine and nanotechnology can potentially accelerate the rational design and high-throughput development of highly efficient nanoformulated drugs and smart materials with pre-defined functionalities. In this Review, we survey the important results and milestones achieved to date in the application of data science, high throughput, as well as automatization approaches, combined with AI and ML to design and optimize DDSs and related nanomaterials. This manuscript mission is not only to reflect the state-of-art in data-driven nanomedicine, but also show how recent findings in the related fields can transform the nanomedicine's image. We discuss how all these results can be used to boost nanomedicine translation to the clinic, as well as highlight the future directions for the development, data-driven, high throughput experimentation-, and AI-assisted design, as well as the production of nanoformulated drugs and smart materials with pre-defined properties and behavior. This Review will be of high interest to the chemists involved in materials science, nanotechnology, and DDSs development for biomedical applications, although the general nature of the presented approaches enables knowledge translation to many other fields of science.
Collapse
Affiliation(s)
- Nikita Serov
- International Institute "Solution Chemistry of Advanced Materials and Technologies", ITMO University, Saint-Petersburg 191002, Russian Federation
| | - Vladimir Vinogradov
- International Institute "Solution Chemistry of Advanced Materials and Technologies", ITMO University, Saint-Petersburg 191002, Russian Federation.
| |
Collapse
|
6
|
Abstract
Ordinary differential equation (ODE) is widely used in modeling biological and physical processes in science. In this article, we propose a new reproducing kernel-based approach for estimation and inference of ODE given noisy observations. We do not assume the functional forms in ODE to be known, or restrict them to be linear or additive, and we allow pairwise interactions. We perform sparse estimation to select individual functionals, and construct confidence intervals for the estimated signal trajectories. We establish the estimation optimality and selection consistency of kernel ODE under both the low-dimensional and high-dimensional settings, where the number of unknown functionals can be smaller or larger than the sample size. Our proposal builds upon the smoothing spline analysis of variance (SS-ANOVA) framework, but tackles several important problems that are not yet fully addressed, and thus extends the scope of existing SS-ANOVA as well. We demonstrate the efficacy of our method through numerous ODE examples.
Collapse
Affiliation(s)
- Xiaowu Dai
- Department of Economics and Simons Institute for the Theory of Computing, the University of California, Berkeley, Berkeley, CA
| | - Lexin Li
- Department of Economics and Simons Institute for the Theory of Computing, the University of California, Berkeley, Berkeley, CA
- Department of Biostatistics and Epidemiology, the University of California, Berkeley, Berkeley, CA
| |
Collapse
|
7
|
Chen Z, Liu Y, Sun H. Physics-informed learning of governing equations from scarce data. Nat Commun 2021; 12:6136. [PMID: 34675223 PMCID: PMC8531004 DOI: 10.1038/s41467-021-26434-1] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2021] [Accepted: 10/04/2021] [Indexed: 11/23/2022] Open
Abstract
Harnessing data to discover the underlying governing laws or equations that describe the behavior of complex physical systems can significantly advance our modeling, simulation and understanding of such systems in various science and engineering disciplines. This work introduces a novel approach called physics-informed neural network with sparse regression to discover governing partial differential equations from scarce and noisy data for nonlinear spatiotemporal systems. In particular, this discovery approach seamlessly integrates the strengths of deep neural networks for rich representation learning, physics embedding, automatic differentiation and sparse regression to approximate the solution of system variables, compute essential derivatives, as well as identify the key derivative terms and parameters that form the structure and explicit expression of the equations. The efficacy and robustness of this method are demonstrated, both numerically and experimentally, on discovering a variety of partial differential equation systems with different levels of data scarcity and noise accounting for different initial/boundary conditions. The resulting computational framework shows the potential for closed-form model discovery in practical applications where large and accurate datasets are intractable to capture.
Collapse
Affiliation(s)
- Zhao Chen
- grid.261112.70000 0001 2173 3359Department of Civil and Environmental Engineering, Northeastern University, Boston, MA 02115 USA
| | - Yang Liu
- grid.261112.70000 0001 2173 3359Department of Mechanical and Industrial Engineering, Northeastern University, Boston, MA 02115 USA
| | - Hao Sun
- grid.24539.390000 0004 0368 8103Gaoling School of Artificial Intelligence, Renmin University of China, 100872 Beijing, China ,grid.24539.390000 0004 0368 8103Beijing Key Laboratory of Big Data Management and Analysis Methods, 100872 Beijing, China ,grid.116068.80000 0001 2341 2786Department of Civil and Environmental Engineering, MIT, Cambridge, MA 02139 USA
| |
Collapse
|
8
|
Bongers S, Forré P, Peters J, Mooij JM. Foundations of structural causal models with cycles and latent variables. Ann Stat 2021. [DOI: 10.1214/21-aos2064] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Affiliation(s)
| | | | - Jonas Peters
- Department of Mathematical Sciences, University of Copenhagen
| | | |
Collapse
|
9
|
Pfister N, Williams EG, Peters J, Aebersold R, Bühlmann P. Stabilizing variable selection and regression. Ann Appl Stat 2021. [DOI: 10.1214/21-aoas1487] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Affiliation(s)
- Niklas Pfister
- Department of Mathematical Sciences, University of Copenhagen
| | - Evan G. Williams
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg
| | - Jonas Peters
- Department of Mathematical Sciences, University of Copenhagen
| | | | | |
Collapse
|
10
|
Rothenhäusler D, Meinshausen N, Bühlmann P, Peters J. Anchor regression: Heterogeneous data meet causality. J R Stat Soc Series B Stat Methodol 2021. [DOI: 10.1111/rssb.12398] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
| | | | | | - Jonas Peters
- Department of Mathematical Sciences University of Copenhagen Copenhagen Denmark
| |
Collapse
|
11
|
Bühlmann P, Ćevid D. Deconfounding and Causal Regularisation for Stability and External Validity. Int Stat Rev 2020. [DOI: 10.1111/insr.12426] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
| | - Domagoj Ćevid
- Seminar for Statistics ETH Zürich Zürich Switzerland
| |
Collapse
|
12
|
Weichwald S, Peters J. Causality in Cognitive Neuroscience: Concepts, Challenges, and Distributional Robustness. J Cogn Neurosci 2020; 33:226-247. [PMID: 32812827 DOI: 10.1162/jocn_a_01623] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]
Abstract
Whereas probabilistic models describe the dependence structure between observed variables, causal models go one step further: They predict, for example, how cognitive functions are affected by external interventions that perturb neuronal activity. In this review and perspective article, we introduce the concept of causality in the context of cognitive neuroscience and review existing methods for inferring causal relationships from data. Causal inference is an ambitious task that is particularly challenging in cognitive neuroscience. We discuss two difficulties in more detail: the scarcity of interventional data and the challenge of finding the right variables. We argue for distributional robustness as a guiding principle to tackle these problems. Robustness (or invariance) is a fundamental principle underlying causal methodology. A (correctly specified) causal model of a target variable generalizes across environments or subjects as long as these environments leave the causal mechanisms of the target intact. Consequently, if a candidate model does not generalize, then either it does not consist of the target variable's causes or the underlying variables do not represent the correct granularity of the problem. In this sense, assessing generalizability may be useful when defining relevant variables and can be used to partially compensate for the lack of interventional data.
Collapse
|