1
|
Han BA, Varshney KR, LaDeau S, Subramaniam A, Weathers KC, Zwart J. A synergistic future for AI and ecology. Proc Natl Acad Sci U S A 2023; 120:e2220283120. [PMID: 37695904 PMCID: PMC10515155 DOI: 10.1073/pnas.2220283120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/13/2023] Open
Abstract
Research in both ecology and AI strives for predictive understanding of complex systems, where nonlinearities arise from multidimensional interactions and feedbacks across multiple scales. After a century of independent, asynchronous advances in computational and ecological research, we foresee a critical need for intentional synergy to meet current societal challenges against the backdrop of global change. These challenges include understanding the unpredictability of systems-level phenomena and resilience dynamics on a rapidly changing planet. Here, we spotlight both the promise and the urgency of a convergence research paradigm between ecology and AI. Ecological systems are a challenge to fully and holistically model, even using the most prominent AI technique today: deep neural networks. Moreover, ecological systems have emergent and resilient behaviors that may inspire new, robust AI architectures and methodologies. We share examples of how challenges in ecological systems modeling would benefit from advances in AI techniques that are themselves inspired by the systems they seek to model. Both fields have inspired each other, albeit indirectly, in an evolution toward this convergence. We emphasize the need for more purposeful synergy to accelerate the understanding of ecological resilience whilst building the resilience currently lacking in modern AI systems, which have been shown to fail at times because of poor generalization in different contexts. Persistent epistemic barriers would benefit from attention in both disciplines. The implications of a successful convergence go beyond advancing ecological disciplines or achieving an artificial general intelligence-they are critical for both persisting and thriving in an uncertain future.
Collapse
Affiliation(s)
| | - Kush R. Varshney
- IBM Research - Thomas J. Watson Research Center, Yorktown Heights, NY10598
| | | | - Ajit Subramaniam
- Lamont-Doherty Earth Observatory, Columbia University, Palisades, NY10964
| | | | - Jacob Zwart
- U.S. Geological Survey, Water Resources Mission Area, Integrated Information Dissemination Division, San Francisco, CA94116
| |
Collapse
|
2
|
Dautle M, Zhang S, Chen Y. scTIGER: A Deep-Learning Method for Inferring Gene Regulatory Networks from Case versus Control scRNA-seq Datasets. Int J Mol Sci 2023; 24:13339. [PMID: 37686146 PMCID: PMC10488287 DOI: 10.3390/ijms241713339] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2023] [Revised: 08/06/2023] [Accepted: 08/23/2023] [Indexed: 09/10/2023] Open
Abstract
Inferring gene regulatory networks (GRNs) from single-cell RNA-seq (scRNA-seq) data is an important computational question to find regulatory mechanisms involved in fundamental cellular processes. Although many computational methods have been designed to predict GRNs from scRNA-seq data, they usually have high false positive rates and none infer GRNs by directly using the paired datasets of case-versus-control experiments. Here we present a novel deep-learning-based method, named scTIGER, for GRN detection by using the co-differential relationships of gene expression profiles in paired scRNA-seq datasets. scTIGER employs cell-type-based pseudotiming, an attention-based convolutional neural network method and permutation-based significance testing for inferring GRNs among gene modules. As state-of-the-art applications, we first applied scTIGER to scRNA-seq datasets of prostate cancer cells, and successfully identified the dynamic regulatory networks of AR, ERG, PTEN and ATF3 for same-cell type between prostatic cancerous and normal conditions, and two-cell types within the prostatic cancerous environment. We then applied scTIGER to scRNA-seq data from neurons with and without fear memory and detected specific regulatory networks for BDNF, CREB1 and MAPK4. Additionally, scTIGER demonstrates robustness against high levels of dropout noise in scRNA-seq data.
Collapse
Affiliation(s)
- Madison Dautle
- Department of Biological and Biomedical Sciences, Rowan University, Glassboro, NJ 08028, USA;
| | - Shaoqiang Zhang
- College of Computer and Information Engineering, Tianjin Normal University, Tianjin 300387, China
| | - Yong Chen
- Department of Biological and Biomedical Sciences, Rowan University, Glassboro, NJ 08028, USA;
| |
Collapse
|
3
|
Domova V, Vrotsou K. A Model for Types and Levels of Automation in Visual Analytics: A Survey, a Taxonomy, and Examples. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS 2023; 29:3550-3568. [PMID: 35358047 DOI: 10.1109/tvcg.2022.3163765] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
The continuous growth in availability and access to data presents a major challenge to the human analyst. As the manual analysis of large and complex datasets is nowadays practically impossible, the need for assisting tools that can automate the analysis process while keeping the human analyst in the loop is imperative. A large and growing body of literature recognizes the crucial role of automation in Visual Analytics and suggests that automation is among the most important constituents for effective Visual Analytics systems. Today, however, there is no appropriate taxonomy nor terminology for assessing the extent of automation in a Visual Analytics system. In this article, we aim to address this gap by introducing a model of levels of automation tailored for the Visual Analytics domain. The consistent terminology of the proposed taxonomy could provide a ground for users/readers/reviewers to describe and compare automation in Visual Analytics systems. Our taxonomy is grounded on a combination of several existing and well-established taxonomies of levels of automation in the human-machine interaction domain and relevant models within the visual analytics field. To exemplify the proposed taxonomy, we selected a set of existing systems from the event-sequence analytics domain and mapped the automation of their visual analytics process stages against the automation levels in our taxonomy.
Collapse
|
4
|
Castro M, Mendes Júnior PR, Soriano-Vargas A, de Oliveira Werneck R, Moreira Gonçalves M, Lusquino Filho L, Moura R, Zampieri M, Linares O, Ferreira V, Ferreira A, Davólio A, Schiozer D, Rocha A. Time series causal relationships discovery through feature importance and ensemble models. Sci Rep 2023; 13:11402. [PMID: 37452079 PMCID: PMC10349147 DOI: 10.1038/s41598-023-37929-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2023] [Accepted: 06/29/2023] [Indexed: 07/18/2023] Open
Abstract
Inferring causal relationships from observational data is a key challenge in understanding the interpretability of Machine Learning models. Given the ever-increasing amount of observational data available in many areas, Machine Learning algorithms used for forecasting have become more complex, leading to a less understandable path of how a decision is made by the model. To address this issue, we propose leveraging ensemble models, e.g., Random Forest, to assess which input features the trained model prioritizes when making a forecast and, in this way, establish causal relationships between the variables. The advantage of these algorithms lies in their ability to provide feature importance, which allows us to build the causal network. We present our methodology to estimate causality in time series from oil field production. As it is difficult to extract causal relations from a real field, we also included a synthetic oil production dataset and a weather dataset, which is also synthetic, to provide the ground truth. We aim to perform causal discovery, i.e., establish the existing connections between the variables in each dataset. Through an iterative process of improving the forecasting of a target's value, we evaluate whether the forecasting improves by adding information from a new potential driver; if so, we state that the driver causally affects the target. On the oil field-related datasets, our causal analysis results agree with the interwell connections already confirmed by tracer information; whenever the tracer data are available, we used it as our ground truth. This consistency between both estimated and confirmed connections provides us the confidence about the effectiveness of our proposed methodology. To our knowledge, this is the first time causal analysis using solely production data is employed to discover interwell connections in an oil field dataset.
Collapse
Affiliation(s)
- Manuel Castro
- Artificial Intelligence Lab., Recod.ai, Institute of Computing, University of Campinas (Unicamp), 13083-852, Campinas, SP, Brazil.
| | - Pedro Ribeiro Mendes Júnior
- Artificial Intelligence Lab., Recod.ai, Institute of Computing, University of Campinas (Unicamp), 13083-852, Campinas, SP, Brazil
| | - Aurea Soriano-Vargas
- Artificial Intelligence Lab., Recod.ai, Institute of Computing, University of Campinas (Unicamp), 13083-852, Campinas, SP, Brazil
| | - Rafael de Oliveira Werneck
- Artificial Intelligence Lab., Recod.ai, Institute of Computing, University of Campinas (Unicamp), 13083-852, Campinas, SP, Brazil
| | - Maiara Moreira Gonçalves
- Center for Petroleum Engineering (CEPETRO), University of Campinas (Unicamp), 13083-970, Campinas, SP, Brazil
| | - Leopoldo Lusquino Filho
- Group of Automation and Integrated Systems, São Paulo State University (Unesp), 18087-180, Sorocaba, SP, Brazil
| | - Renato Moura
- Artificial Intelligence Lab., Recod.ai, Institute of Computing, University of Campinas (Unicamp), 13083-852, Campinas, SP, Brazil
| | - Marcelo Zampieri
- Artificial Intelligence Lab., Recod.ai, Institute of Computing, University of Campinas (Unicamp), 13083-852, Campinas, SP, Brazil
| | - Oscar Linares
- Artificial Intelligence Lab., Recod.ai, Institute of Computing, University of Campinas (Unicamp), 13083-852, Campinas, SP, Brazil
| | - Vitor Ferreira
- Center for Petroleum Engineering (CEPETRO), University of Campinas (Unicamp), 13083-970, Campinas, SP, Brazil
| | - Alexandre Ferreira
- Artificial Intelligence Lab., Recod.ai, Institute of Computing, University of Campinas (Unicamp), 13083-852, Campinas, SP, Brazil
| | - Alessandra Davólio
- Center for Petroleum Engineering (CEPETRO), University of Campinas (Unicamp), 13083-970, Campinas, SP, Brazil
| | - Denis Schiozer
- School of Mechanical Engineering (FEM), University of Campinas (Unicamp), 13083-970, Campinas, SP, Brazil
| | - Anderson Rocha
- Artificial Intelligence Lab., Recod.ai, Institute of Computing, University of Campinas (Unicamp), 13083-852, Campinas, SP, Brazil
| |
Collapse
|
5
|
Bermeo C, Michell K, Kristjanpoller W. Estimation of causality in economic growth and expansionary policies using uplift modeling. Neural Comput Appl 2023. [DOI: 10.1007/s00521-023-08397-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/19/2023]
|
6
|
Chuang KC, Ramakrishnapillai S, Madden K, St Amant J, McKlveen K, Gwizdala K, Dhullipudi R, Bazzano L, Carmichael O. Brain effective connectivity and functional connectivity as markers of lifespan vascular exposures in middle-aged adults: The Bogalusa Heart Study. Front Aging Neurosci 2023; 15:1110434. [PMID: 36998317 PMCID: PMC10043334 DOI: 10.3389/fnagi.2023.1110434] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2022] [Accepted: 02/22/2023] [Indexed: 03/16/2023] Open
Abstract
IntroductionEffective connectivity (EC), the causal influence that functional activity in a source brain location exerts over functional activity in a target brain location, has the potential to provide different information about brain network dynamics than functional connectivity (FC), which quantifies activity synchrony between locations. However, head-to-head comparisons between EC and FC from either task-based or resting-state functional MRI (fMRI) data are rare, especially in terms of how they associate with salient aspects of brain health.MethodsIn this study, 100 cognitively-healthy participants in the Bogalusa Heart Study aged 54.2 ± 4.3years completed Stroop task-based fMRI, resting-state fMRI. EC and FC among 24 regions of interest (ROIs) previously identified as involved in Stroop task execution (EC-task and FC-task) and among 33 default mode network ROIs (EC-rest and FC-rest) were calculated from task-based and resting-state fMRI using deep stacking networks and Pearson correlation. The EC and FC measures were thresholded to generate directed and undirected graphs, from which standard graph metrics were calculated. Linear regression models related graph metrics to demographic, cardiometabolic risk factors, and cognitive function measures.ResultsWomen and whites (compared to men and African Americans) had better EC-task metrics, and better EC-task metrics associated with lower blood pressure, white matter hyperintensity volume, and higher vocabulary score (maximum value of p = 0.043). Women had better FC-task metrics, and better FC-task metrics associated with APOE-ε4 3–3 genotype and better hemoglobin-A1c, white matter hyperintensity volume and digit span backwards score (maximum value of p = 0.047). Better EC rest metrics associated with lower age, non-drinker status, and better BMI, white matter hyperintensity volume, logical memory II total score, and word reading score (maximum value of p = 0.044). Women and non-drinkers had better FC-rest metrics (value of p = 0.004).DiscussionIn a diverse, cognitively healthy, middle-aged community sample, EC and FC based graph metrics from task-based fMRI data, and EC based graph metrics from resting-state fMRI data, were associated with recognized indicators of brain health in differing ways. Future studies of brain health should consider taking both task-based and resting-state fMRI scans and measuring both EC and FC analyses to get a more complete picture of functional networks relevant to brain health.
Collapse
Affiliation(s)
- Kai-Cheng Chuang
- Department of Physics & Astronomy, Louisiana State University, Baton Rouge, LA, United States
- Pennington Biomedical Research Center, Baton Rouge, LA, United States
- *Correspondence: Kai-Cheng Chuang,
| | - Sreekrishna Ramakrishnapillai
- Pennington Biomedical Research Center, Baton Rouge, LA, United States
- Department of Electrical and Computer Engineering, Louisiana State University, Baton Rouge, LA, United States
| | - Kaitlyn Madden
- Pennington Biomedical Research Center, Baton Rouge, LA, United States
| | - Julia St Amant
- Pennington Biomedical Research Center, Baton Rouge, LA, United States
| | - Kevin McKlveen
- Pennington Biomedical Research Center, Baton Rouge, LA, United States
| | - Kathryn Gwizdala
- Pennington Biomedical Research Center, Baton Rouge, LA, United States
| | | | - Lydia Bazzano
- Department of Epidemiology, Tulane University School of Public Health and Tropical Medicine, New Orleans, LA, United States
| | - Owen Carmichael
- Pennington Biomedical Research Center, Baton Rouge, LA, United States
| |
Collapse
|
7
|
Chew LY, Ong YS. Granger causality using Jacobian in neural networks. CHAOS (WOODBURY, N.Y.) 2023; 33:023126. [PMID: 36859223 DOI: 10.1063/5.0106666] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/29/2022] [Accepted: 01/16/2023] [Indexed: 06/18/2023]
Abstract
Granger causality is a commonly used method for uncovering information flow and dependencies in a time series. Here, we introduce JGC (Jacobian Granger causality), a neural network-based approach to Granger causality using the Jacobian as a measure of variable importance, and propose a variable selection procedure for inferring Granger causal variables with this measure, using criteria of significance and consistency. The resulting approach performs consistently well compared to other approaches in identifying Granger causal variables, the associated time lags, as well as interaction signs. In addition, we also discuss the need for contemporaneous variables in Granger causal modeling as well as how these neural network-based approaches reduce the impact of nonseparability in dynamical systems, a problem where predictive information on a target variable is not unique to its causes, but also contained in the history of the target variable itself.
Collapse
Affiliation(s)
- Lock Yue Chew
- School of Physical and Mathematical Sciences, Nanyang Technological University, Singapore 637371
| | - Yew-Soon Ong
- School of Computer Science and Engineering, Nanyang Technological University, Singapore 639798
| |
Collapse
|
8
|
Assaad CK, Devijver E, Gaussier E. Entropy-Based Discovery of Summary Causal Graphs in Time Series. ENTROPY (BASEL, SWITZERLAND) 2022; 24:1156. [PMID: 36010820 PMCID: PMC9407574 DOI: 10.3390/e24081156] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/30/2022] [Revised: 08/05/2022] [Accepted: 08/14/2022] [Indexed: 06/15/2023]
Abstract
This study addresses the problem of learning a summary causal graph on time series with potentially different sampling rates. To do so, we first propose a new causal temporal mutual information measure for time series. We then show how this measure relates to an entropy reduction principle that can be seen as a special case of the probability raising principle. We finally combine these two ingredients in PC-like and FCI-like algorithms to construct the summary causal graph. There algorithm are evaluated on several datasets, which shows both their efficacy and efficiency.
Collapse
Affiliation(s)
- Charles K. Assaad
- R&D Department, EasyVista, 38000 Grenoble, France
- Department of Mathematics, Information and Communication Sciences, University of Grenoble Alpes, CNRS, Grenoble INP, LIG, 38000 Grenoble, France
| | - Emilie Devijver
- Department of Mathematics, Information and Communication Sciences, University of Grenoble Alpes, CNRS, Grenoble INP, LIG, 38000 Grenoble, France
| | - Eric Gaussier
- Department of Mathematics, Information and Communication Sciences, University of Grenoble Alpes, CNRS, Grenoble INP, LIG, 38000 Grenoble, France
| |
Collapse
|
9
|
Chaibub Neto E, Perumal TM, Pratap A, Tediarjo A, Bot BM, Mangravite L, Omberg L. Disentangling personalized treatment effects from “time-of-the-day” confounding in mobile health studies. PLoS One 2022; 17:e0271766. [PMID: 35925980 PMCID: PMC9352058 DOI: 10.1371/journal.pone.0271766] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2021] [Accepted: 07/06/2022] [Indexed: 11/18/2022] Open
Abstract
Ideally, a patient’s response to medication can be monitored by measuring changes in performance of some activity. In observational studies, however, any detected association between treatment (“on-medication” vs “off-medication”) and the outcome (performance in the activity) might be due to confounders. In particular, causal inferences at the personalized level are especially vulnerable to confounding effects that arise in a cyclic fashion. For quick acting medications, effects can be confounded by circadian rhythms and daily routines. Using the time-of-the-day as a surrogate for these confounders and the performance measurements as captured on a smartphone, we propose a personalized statistical approach to disentangle putative treatment and “time-of-the-day” effects, that leverages conditional independence relations spanned by causal graphical models involving the treatment, time-of-the-day, and outcome variables. Our approach is based on conditional independence tests implemented via standard and temporal linear regression models. Using synthetic data, we investigate when and how residual autocorrelation can affect the standard tests, and how time series modeling (namely, ARIMA and robust regression via HAC covariance matrix estimators) can remedy these issues. In particular, our simulations illustrate that when patients perform their activities in a paired fashion, positive autocorrelation can lead to conservative results for the standard regression approach (i.e., lead to deflated true positive detection), whereas negative autocorrelation can lead to anticonservative behavior (i.e., lead to inflated false positive detection). The adoption of time series methods, on the other hand, leads to well controlled type I error rates. We illustrate the application of our methodology with data from a Parkinson’s disease mobile health study.
Collapse
Affiliation(s)
- Elias Chaibub Neto
- Sage Bionetworks, Seattle, Washington, United States of America
- * E-mail:
| | | | - Abhishek Pratap
- Sage Bionetworks, Seattle, Washington, United States of America
| | - Aryton Tediarjo
- Sage Bionetworks, Seattle, Washington, United States of America
| | - Brian M. Bot
- Sage Bionetworks, Seattle, Washington, United States of America
| | - Lara Mangravite
- Sage Bionetworks, Seattle, Washington, United States of America
| | - Larsson Omberg
- Sage Bionetworks, Seattle, Washington, United States of America
| |
Collapse
|
10
|
Going deep into schizophrenia with artificial intelligence. Schizophr Res 2022; 245:122-140. [PMID: 34103242 DOI: 10.1016/j.schres.2021.05.018] [Citation(s) in RCA: 19] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/14/2020] [Revised: 05/24/2021] [Accepted: 05/27/2021] [Indexed: 12/30/2022]
Abstract
Despite years of research, the mechanisms governing the onset, relapse, symptomatology, and treatment of schizophrenia (SZ) remain elusive. The lack of appropriate analytic tools to deal with the heterogeneity and complexity of SZ may be one of the reasons behind this situation. Deep learning, a subfield of artificial intelligence (AI) inspired by the nervous system, has recently provided an accessible way of modeling and analyzing complex, high-dimensional, nonlinear systems. The unprecedented accuracy of deep learning algorithms in classification and prediction tasks has revolutionized a wide range of scientific fields and is rapidly permeating SZ research. Deep learning has the potential of becoming a valuable aid for clinicians in the prediction, diagnosis, and treatment of SZ, especially in combination with principles from Bayesian statistics. Furthermore, deep learning could become a powerful tool for uncovering the mechanisms underlying SZ thanks to a growing number of techniques designed for improving model interpretability and causal reasoning. The purpose of this article is to introduce SZ researchers to the field of deep learning and review its latest applications in SZ research. In general, existing studies have yielded impressive results in classification and outcome prediction tasks. However, methodological concerns related to the assessment of model performance in several studies, the widespread use of small training datasets, and the little clinical value of some models suggest that some of these results should be taken with caution.
Collapse
|
11
|
Learning dynamic causal mechanisms from non-stationary data. APPL INTELL 2022. [DOI: 10.1007/s10489-022-03843-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
12
|
Kumar P, Kuttippurath J, Mitra A. Causal discovery of drivers of surface ozone variability in Antarctica using a deep learning algorithm. ENVIRONMENTAL SCIENCE. PROCESSES & IMPACTS 2022; 24:447-459. [PMID: 35156666 DOI: 10.1039/d1em00383f] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
The discovery of causal structures behind a phenomenon under investigation has been at the heart of scientific inquiry since the beginning. Randomized control trials, the gold standard for causal analysis, may not always be feasible, such as in the domain of climate sciences. In the absence of interventional data, we are forced to depend only on observational data. This study demonstrates the application of one such causal discovery algorithm using a neural network for identifying the drivers of surface ozone variability in Antarctica. The analyses reveal the overarching influence of the stratosphere on the surface ozone variability in Antarctica, buttressed by the southern annular mode and tropospheric wave forcing in mid-latitudes. We find no significant and robust evidence for the influence of tropical teleconnection on the ground-level ozone in Antarctica. As the field of atmospheric science is now replete with a massive stock of observational data, both satellite and ground-based, this tool for automated causal structure discovery might prove to be invaluable for scientific investigation and flawless decision making.
Collapse
Affiliation(s)
- P Kumar
- CORAL, Indian Institute of Technology Kharagpur, Kharagpur, West Bengal 721302, India.
| | - J Kuttippurath
- CORAL, Indian Institute of Technology Kharagpur, Kharagpur, West Bengal 721302, India.
| | - A Mitra
- Centre of Excellence in Artificial Intelligence, Indian Institute of Technology Kharagpur, Kharagpur, West Bengal 721302, India
| |
Collapse
|
13
|
Neural Networks for Directed Connectivity Estimation in Source-Reconstructed EEG Data. APPLIED SCIENCES-BASEL 2022. [DOI: 10.3390/app12062889] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/10/2022]
Abstract
Directed connectivity between brain sources identified from scalp electroencephalography (EEG) can shed light on the brain’s information flows and provide a biomarker of neurological disorders. However, as volume conductance results in scalp activity being a mix of activities originating from multiple sources, the correct interpretation of their connectivity is a formidable challenge despite source localization being applied with some success. Traditional connectivity approaches rely on statistical assumptions that usually do not hold for EEG, calling for a model-free approach. We investigated several types of Artificial Neural Networks in estimating Directed Connectivity between Reconstructed EEG Sources and assessed their accuracy with respect to several ground truths. We show that a Long Short-Term Memory neural network with Non-Uniform Embedding yields the most promising results due to its relative robustness to differing dipole locations. We conclude that certain network architectures can compete with the already established methods for brain connectivity analysis.
Collapse
|
14
|
Mansouri M, Khakabimamaghani S, Chindelevitch L, Ester M. Aristotle: stratified causal discovery for omics data. BMC Bioinformatics 2022; 23:42. [PMID: 35033007 PMCID: PMC8760642 DOI: 10.1186/s12859-021-04521-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2021] [Accepted: 12/08/2021] [Indexed: 11/29/2022] Open
Abstract
BACKGROUND There has been a simultaneous increase in demand and accessibility across genomics, transcriptomics, proteomics and metabolomics data, known as omics data. This has encouraged widespread application of omics data in life sciences, from personalized medicine to the discovery of underlying pathophysiology of diseases. Causal analysis of omics data may provide important insight into the underlying biological mechanisms. Existing causal analysis methods yield promising results when identifying potential general causes of an observed outcome based on omics data. However, they may fail to discover the causes specific to a particular stratum of individuals and missing from others. METHODS To fill this gap, we introduce the problem of stratified causal discovery and propose a method, Aristotle, for solving it. Aristotle addresses the two challenges intrinsic to omics data: high dimensionality and hidden stratification. It employs existing biological knowledge and a state-of-the-art patient stratification method to tackle the above challenges and applies a quasi-experimental design method to each stratum to find stratum-specific potential causes. RESULTS Evaluation based on synthetic data shows better performance for Aristotle in discovering true causes under different conditions compared to existing causal discovery methods. Experiments on a real dataset on Anthracycline Cardiotoxicity indicate that Aristotle's predictions are consistent with the existing literature. Moreover, Aristotle makes additional predictions that suggest further investigations.
Collapse
Affiliation(s)
- Mehrdad Mansouri
- grid.61971.380000 0004 1936 7494School of Computing Science, Simon Fraser University, 8888 University Drive, Burnaby, CA USA
| | - Sahand Khakabimamaghani
- grid.61971.380000 0004 1936 7494School of Computing Science, Simon Fraser University, 8888 University Drive, Burnaby, CA USA
| | - Leonid Chindelevitch
- grid.61971.380000 0004 1936 7494School of Computing Science, Simon Fraser University, 8888 University Drive, Burnaby, CA USA
| | - Martin Ester
- grid.61971.380000 0004 1936 7494School of Computing Science, Simon Fraser University, 8888 University Drive, Burnaby, CA USA
| |
Collapse
|
15
|
Moharamzadeh N, Motie Nasrabadi A. A fuzzy sensitivity analysis approach to estimate brain effective connectivity and its application to epileptic seizure detection. BIOMED ENG-BIOMED TE 2021; 67:19-32. [PMID: 34953180 DOI: 10.1515/bmt-2021-0058] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2021] [Accepted: 11/26/2021] [Indexed: 11/15/2022]
Abstract
The brain is considered to be the most complicated organ in human body. Inferring and quantification of effective (causal) connectivity among regions of the brain is an important step in characterization of its complicated functions. The proposed method is comprised of modeling multivariate time series with Adaptive Neurofuzzy Inference System (ANFIS) and carrying out a sensitivity analysis using Fuzzy network parameters as a new approach to introduce a connectivity measure for detecting causal interactions between interactive input time series. The results of simulations indicate that this method is successful in detecting causal connectivity. After validating the performance of the proposed method on synthetic linear and nonlinear interconnected time series, it is applied to epileptic intracranial Electroencephalography (EEG) signals. The result of applying the proposed method on Freiburg epileptic intracranial EEG data recorded during seizure shows that the proposed method is capable of discriminating between the seizure and non-seizure states of the brain.
Collapse
Affiliation(s)
- Nader Moharamzadeh
- Department of Biomedical Engineering, Faculty of Engineering, Shahed University, Tehran, Iran
| | - Ali Motie Nasrabadi
- Department of Biomedical Engineering, Faculty of Engineering, Shahed University, Tehran, Iran
| |
Collapse
|
16
|
Duggento A, Guerrisi M, Toschi N. Echo state network models for nonlinear Granger causality. PHILOSOPHICAL TRANSACTIONS. SERIES A, MATHEMATICAL, PHYSICAL, AND ENGINEERING SCIENCES 2021; 379:20200256. [PMID: 34689621 DOI: 10.1098/rsta.2020.0256] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Accepted: 06/15/2021] [Indexed: 06/13/2023]
Abstract
While Granger causality (GC) has been often employed in network neuroscience, most GC applications are based on linear multivariate autoregressive (MVAR) models. However, real-life systems like biological networks exhibit notable nonlinear behaviour, hence undermining the validity of MVAR-based GC (MVAR-GC). Most nonlinear GC estimators only cater for additive nonlinearities or, alternatively, are based on recurrent neural networks or long short-term memory networks, which present considerable training difficulties and tailoring needs. We reformulate the GC framework in terms of echo-state networks-based models for arbitrarily complex networks, and characterize its ability to capture nonlinear causal relations in a network of noisy Duffing oscillators, showing a net advantage of echo state GC (ES-GC) in detecting nonlinear, causal links. We then explore the structure of ES-GC networks in the human brain employing functional MRI data from 1003 healthy subjects drawn from the human connectome project, demonstrating the existence of previously unknown directed within-brain interactions. In addition, we examine joint brain-heart signals in 15 subjects where we explore directed interaction between brain networks and central vagal cardiac control in order to investigate the so-called central autonomic network in a causal manner. This article is part of the theme issue 'Advanced computation in cardiovascular physiology: new challenges and opportunities'.
Collapse
Affiliation(s)
- Andrea Duggento
- Department of Biomedicine and Prevention, University of Rome Tor Vergata, Rome, Italy
| | - Maria Guerrisi
- Department of Biomedicine and Prevention, University of Rome Tor Vergata, Rome, Italy
| | - Nicola Toschi
- Department of Biomedicine and Prevention, University of Rome Tor Vergata, Rome, Italy
- Department of Radiology, Athinoula A. Martinos Center for Biomedical Imaging, Boston, MA, USA
| |
Collapse
|
17
|
Koutroulis G, Botler L, Mutlu B, Diwold K, Römer K, Kern R. KOMPOS: Connecting Causal Knots in Large Nonlinear Time Series with Non-Parametric Regression Splines. ACM T INTEL SYST TEC 2021. [DOI: 10.1145/3480971] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
Abstract
Recovering causality from copious time series data beyond mere correlations has been an important contributing factor in numerous scientific fields. Most existing works assume linearity in the data that may not comply with many real-world scenarios. Moreover, it is usually not sufficient to solely infer the causal relationships. Identifying the correct time delay of cause-effect is extremely vital for further insight and effective policies in inter-disciplinary domains. To bridge this gap, we propose KOMPOS, a novel algorithmic framework that combines a powerful concept from causal discovery of additive noise models with graphical ones. We primarily build our structural causal model from multivariate adaptive regression splines with inherent additive local nonlinearities, which render the underlying causal structure more easily identifiable. In contrast to other methods, our approach is not restricted to Gaussian or non-Gaussian noise due to the non-parametric attribute of the regression method. We conduct extensive experiments on both synthetic and real-world datasets, demonstrating the superiority of the proposed algorithm over existing causal discovery methods, especially for the challenging cases of autocorrelated and non-stationary time series.
Collapse
Affiliation(s)
| | - Leo Botler
- Graz University of Technology, Graz, Austria
| | | | - Konrad Diwold
- Pro2Future GmbH and Graz University of Technology, Graz, Austria
| | - Kay Römer
- Graz University of Technology, Graz, Austria
| | - Roman Kern
- Graz University of Technology, Graz, Austria
| |
Collapse
|
18
|
Menegozzo G, Dall'Alba D, Fiorini P. Industrial Time Series Modeling With Causal Precursors and Separable Temporal Convolutions. IEEE Robot Autom Lett 2021. [DOI: 10.1109/lra.2021.3095907] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
|
19
|
Jin Z, Guo S, Chen N, Weiskopf D, Gotz D, Cao N. Visual Causality Analysis of Event Sequence Data. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS 2021; 27:1343-1352. [PMID: 33048746 DOI: 10.1109/tvcg.2020.3030465] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Causality is crucial to understanding the mechanisms behind complex systems and making decisions that lead to intended outcomes. Event sequence data is widely collected from many real-world processes, such as electronic health records, web clickstreams, and financial transactions, which transmit a great deal of information reflecting the causal relations among event types. Unfortunately, recovering causalities from observational event sequences is challenging, as the heterogeneous and high-dimensional event variables are often connected to rather complex underlying event excitation mechanisms that are hard to infer from limited observations. Many existing automated causal analysis techniques suffer from poor explainability and fail to include an adequate amount of human knowledge. In this paper, we introduce a visual analytics method for recovering causalities in event sequence data. We extend the Granger causality analysis algorithm on Hawkes processes to incorporate user feedback into causal model refinement. The visualization system includes an interactive causal analysis framework that supports bottom-up causal exploration, iterative causal verification and refinement, and causal comparison through a set of novel visualizations and interactions. We report two forms of evaluation: a quantitative evaluation of the model improvements resulting from the user-feedback mechanism, and a qualitative evaluation through case studies in different application domains to demonstrate the usefulness of the system.
Collapse
|
20
|
Barić D, Fumić P, Horvatić D, Lipic T. Benchmarking Attention-Based Interpretability of Deep Learning in Multivariate Time Series Predictions. ENTROPY (BASEL, SWITZERLAND) 2021; 23:143. [PMID: 33503822 PMCID: PMC7912396 DOI: 10.3390/e23020143] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/19/2020] [Revised: 01/20/2021] [Accepted: 01/21/2021] [Indexed: 11/26/2022]
Abstract
The adaptation of deep learning models within safety-critical systems cannot rely only on good prediction performance but needs to provide interpretable and robust explanations for their decisions. When modeling complex sequences, attention mechanisms are regarded as the established approach to support deep neural networks with intrinsic interpretability. This paper focuses on the emerging trend of specifically designing diagnostic datasets for understanding the inner workings of attention mechanism based deep learning models for multivariate forecasting tasks. We design a novel benchmark of synthetically designed datasets with the transparent underlying generating process of multiple time series interactions with increasing complexity. The benchmark enables empirical evaluation of the performance of attention based deep neural networks in three different aspects: (i) prediction performance score, (ii) interpretability correctness, (iii) sensitivity analysis. Our analysis shows that although most models have satisfying and stable prediction performance results, they often fail to give correct interpretability. The only model with both a satisfying performance score and correct interpretability is IMV-LSTM, capturing both autocorrelations and crosscorrelations between multiple time series. Interestingly, while evaluating IMV-LSTM on simulated data from statistical and mechanistic models, the correctness of interpretability increases with more complex datasets.
Collapse
Affiliation(s)
- Domjan Barić
- Department of Physics, Faculty of Science, University of Zagreb, Bijenička cesta 32, 10000 Zagreb, Croatia; (D.B.); (P.F.)
| | - Petar Fumić
- Department of Physics, Faculty of Science, University of Zagreb, Bijenička cesta 32, 10000 Zagreb, Croatia; (D.B.); (P.F.)
| | - Davor Horvatić
- Department of Physics, Faculty of Science, University of Zagreb, Bijenička cesta 32, 10000 Zagreb, Croatia; (D.B.); (P.F.)
| | - Tomislav Lipic
- Division of Electronics, Ruđer Bošković Institute, Bijenička cesta 54, 10000 Zagreb, Croatia
| |
Collapse
|
21
|
Wang Y, You L, Chyr J, Lan L, Zhao W, Zhou Y, Xu H, Noble P, Zhou X. Causal Discovery in Radiographic Markers of Knee Osteoarthritis and Prediction for Knee Osteoarthritis Severity With Attention-Long Short-Term Memory. Front Public Health 2020; 8:604654. [PMID: 33409263 PMCID: PMC7779681 DOI: 10.3389/fpubh.2020.604654] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2020] [Accepted: 11/09/2020] [Indexed: 01/08/2023] Open
Abstract
The goal of this study is to build a prognostic model to predict the severity of radiographic knee osteoarthritis (KOA) and to identify long-term disease progression risk factors for early intervention and treatment. We designed a long short-term memory (LSTM) model with an attention mechanism to predict Kellgren/Lawrence (KL) grade for knee osteoarthritis patients. The attention scores reveal a time-associated impact of different variables on KL grades. We also employed a fast causal inference (FCI) algorithm to estimate the causal relation of key variables, which will aid in clinical interpretability. Based on the clinical information of current visits, we accurately predicted the KL grade of the patient's next visits with 90% accuracy. We found that joint space narrowing was a major contributor to KOA progression. Furthermore, our causal structure model indicated that knee alignments may lead to joint space narrowing, while symptoms (swelling, grinding, catching, and limited mobility) have little impact on KOA progression. This study evaluated a broad spectrum of potential risk factors from clinical data, questionnaires, and radiographic markers that are rarely considered in previous studies. Using our statistical model, providers are able to predict the risk of the future progression of KOA, which will provide a basis for selecting proper interventions, such as proceeding to joint arthroplasty for patients. Our causal model suggests that knee alignment should be considered in the primary treatment and KOA progression was independent of clinical symptoms.
Collapse
Affiliation(s)
- Yanfei Wang
- School of Biomedical Informatics, University of Texas Health Science Center at Houston, Houston, TX, United States
| | - Lei You
- School of Biomedical Informatics, University of Texas Health Science Center at Houston, Houston, TX, United States
| | - Jacqueline Chyr
- School of Biomedical Informatics, University of Texas Health Science Center at Houston, Houston, TX, United States
| | - Lan Lan
- School of Biomedical Informatics, University of Texas Health Science Center at Houston, Houston, TX, United States
| | - Weiling Zhao
- School of Biomedical Informatics, University of Texas Health Science Center at Houston, Houston, TX, United States
| | - Yujia Zhou
- School of Biomedical Informatics, University of Texas Health Science Center at Houston, Houston, TX, United States
| | - Hua Xu
- School of Biomedical Informatics, University of Texas Health Science Center at Houston, Houston, TX, United States
| | - Philip Noble
- McGovern Medical School, University of Texas Health Science Center at Houston, Houston, TX, United States
| | - Xiaobo Zhou
- School of Biomedical Informatics, University of Texas Health Science Center at Houston, Houston, TX, United States.,McGovern Medical School, University of Texas Health Science Center at Houston, Houston, TX, United States
| |
Collapse
|
22
|
Khan S, Hauptman R, Kelly L. Engineering the Microbiome to Prevent Adverse Events: Challenges and Opportunities. Annu Rev Pharmacol Toxicol 2020; 61:159-179. [PMID: 33049161 DOI: 10.1146/annurev-pharmtox-031620-031509] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
In the past decade of microbiome research, we have learned about numerous adverse interactions between the microbiome and medical interventions such as drugs, radiation, and surgery. What if we could alter our microbiomes to prevent these events? In this review, we discuss potential routes to mitigate microbiome adverse events, including applications from the emerging field of microbiome engineering. We highlight cases where the microbiome acts directly on a treatment, such as via differential drug metabolism, and cases where a treatment directly harms the microbiome, such as in radiation therapy. Understanding and preventing microbiome adverse events is a difficult challenge that will require a data-driven approach involving causal statistics, multiomics techniques, and a personalized means of mitigating adverse events. We propose research considerations to encourage productive work in preventing microbiome adverse events, and we highlight the many challenges and opportunities that await.
Collapse
Affiliation(s)
- Saad Khan
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, New York, NY 10461, USA;
| | - Ruth Hauptman
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, New York, NY 10461, USA;
| | - Libusha Kelly
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, New York, NY 10461, USA; .,Department of Microbiology and Immunology, Albert Einstein College of Medicine, New York, NY 10461, USA
| |
Collapse
|
23
|
Attention-Based SeriesNet: An Attention-Based Hybrid Neural Network Model for Conditional Time Series Forecasting. INFORMATION 2020. [DOI: 10.3390/info11060305] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
Traditional time series forecasting techniques can not extract good enough sequence data features, and their accuracies are limited. The deep learning structure SeriesNet is an advanced method, which adopts hybrid neural networks, including dilated causal convolutional neural network (DC-CNN) and Long-short term memory recurrent neural network (LSTM-RNN), to learn multi-range and multi-level features from multi-conditional time series with higher accuracy. However, they didn’t consider the attention mechanisms to learn temporal features. Besides, the conditioning method for CNN and RNN is not specific, and the number of parameters in each layer is tremendous. This paper proposes the conditioning method for two types of neural networks, and respectively uses the gated recurrent unit network (GRU) and the dilated depthwise separable temporal convolutional networks (DDSTCNs) instead of LSTM and DC-CNN for reducing the parameters. Furthermore, this paper presents the lightweight RNN-based hidden state attention module (HSAM) combined with the proposed CNN-based convolutional block attention module (CBAM) for time series forecasting. Experimental results show our model is superior to other models from the viewpoint of forecasting accuracy and computation efficiency.
Collapse
|
24
|
He R, Chen G, Sun S, Dong C, Jiang S. Attention-Based Long Short-Term Memory Method for Alarm Root-Cause Diagnosis in Chemical Processes. Ind Eng Chem Res 2020. [DOI: 10.1021/acs.iecr.0c00417] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
- Rui He
- Centre for Offshore Engineering and Safety Technology (COEST), China University of Petroleum (East China), Qingdao, Shandong 266580, P.R. China
| | - Guoming Chen
- Centre for Offshore Engineering and Safety Technology (COEST), China University of Petroleum (East China), Qingdao, Shandong 266580, P.R. China
| | - Shufeng Sun
- Centre for Offshore Engineering and Safety Technology (COEST), China University of Petroleum (East China), Qingdao, Shandong 266580, P.R. China
| | - Che Dong
- Centre for Offshore Engineering and Safety Technology (COEST), China University of Petroleum (East China), Qingdao, Shandong 266580, P.R. China
| | - Shengyu Jiang
- Centre for Offshore Engineering and Safety Technology (COEST), China University of Petroleum (East China), Qingdao, Shandong 266580, P.R. China
| |
Collapse
|
25
|
Multi-Step Short-Term Wind Speed Prediction Using a Residual Dilated Causal Convolutional Network with Nonlinear Attention. ENERGIES 2020. [DOI: 10.3390/en13071772] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]
Abstract
Wind energy is the most used renewable energy worldwide second only to hydropower. However, the stochastic nature of wind speed makes it harder for wind farms to manage the future power production and maintenance schedules efficiently. Many wind speed prediction models exist that focus on advance neural networks and/or preprocessing techniques to improve the accuracy. Since most of these models require a large amount of historic wind data and are validated using the data split method, the application to real-world scenarios cannot be determined. In this paper, we present a multi-step univariate prediction model for wind speed data inspired by the residual U-net architecture of the convolutional neural network (CNN). We propose a residual dilated causal convolutional neural network (Res-DCCNN) with nonlinear attention for multi-step-ahead wind speed forecasting. Our model can outperform long-term short-term memory networks (LSTM), gated recurrent units (GRU), and Res-DCCNN using sliding window validation techniques for 50-step-ahead wind speed prediction. We tested the performance of the proposed model on six real-world wind speed datasets with different probability distributions to confirm its effectiveness, and using several error metrics, we demonstrated that our proposed model was robust, precise, and applicable to real-world cases.
Collapse
|
26
|
Abstract
Causality is the most important topic in the history of western science, and since the beginning of the statistical paradigm, its meaning has been reconceptualized many times. Causality entered into the realm of multi-causal and statistical scenarios some centuries ago. Despite widespread critics, today deep learning and machine learning advances are not weakening causality but are creating a new way of finding correlations between indirect factors. This process makes it possible for us to talk about approximate causality, as well as about a situated causality.
Collapse
|