1
|
Lyu R, Qu Y, Divaris K, Wu D. Methodological Considerations in Longitudinal Analyses of Microbiome Data: A Comprehensive Review. Genes (Basel) 2023; 15:51. [PMID: 38254941 PMCID: PMC11154524 DOI: 10.3390/genes15010051] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2023] [Revised: 12/22/2023] [Accepted: 12/26/2023] [Indexed: 01/24/2024] Open
Abstract
Biological processes underlying health and disease are inherently dynamic and are best understood when characterized in a time-informed manner. In this comprehensive review, we discuss challenges inherent in time-series microbiome data analyses and compare available approaches and methods to overcome them. Appropriate handling of longitudinal microbiome data can shed light on important roles, functions, patterns, and potential interactions between large numbers of microbial taxa or genes in the context of health, disease, or interventions. We present a comprehensive review and comparison of existing microbiome time-series analysis methods, for both preprocessing and downstream analyses, including differential analysis, clustering, network inference, and trait classification. We posit that the careful selection and appropriate utilization of computational tools for longitudinal microbiome analyses can help advance our understanding of the dynamic host-microbiome relationships that underlie health-maintaining homeostases, progressions to disease-promoting dysbioses, as well as phases of physiologic development like those encountered in childhood.
Collapse
Affiliation(s)
- Ruiqi Lyu
- Computational Biology Department, Carnegie Mellon University, Pittsburgh, PA 15213, USA;
| | - Yixiang Qu
- Department of Biostatistics, Gillings School of Global Public Health, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA;
| | - Kimon Divaris
- Division of Pediatric and Public Health, Adams School of Dentistry, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA;
- Department of Epidemiology, Gillings School of Global Public Health, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Di Wu
- Department of Biostatistics, Gillings School of Global Public Health, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA;
- Division of Oral and Craniofacial Health Sciences, Adams School of Dentistry, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
- Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| |
Collapse
|
2
|
Xu CCY, Lemoine J, Albert A, Whirter ÉM, Barrett RDH. Community assembly of the human piercing microbiome. Proc Biol Sci 2023; 290:20231174. [PMID: 38018103 PMCID: PMC10685111 DOI: 10.1098/rspb.2023.1174] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2023] [Accepted: 11/03/2023] [Indexed: 11/30/2023] Open
Abstract
Predicting how biological communities respond to disturbance requires understanding the forces that govern their assembly. We propose using human skin piercings as a model system for studying community assembly after rapid environmental change. Local skin sterilization provides a 'clean slate' within the novel ecological niche created by the piercing. Stochastic assembly processes can dominate skin microbiomes due to the influence of environmental exposure on local dispersal, but deterministic processes might play a greater role within occluded skin piercings if piercing habitats impose strong selection pressures on colonizing species. Here we explore the human ear-piercing microbiome and demonstrate that community assembly is predominantly stochastic but becomes significantly more deterministic with time, producing increasingly diverse and ecologically complex communities. We also observed changes in two dominant and medically relevant antagonists (Cutibacterium acnes and Staphylococcus epidermidis), consistent with competitive exclusion induced by a transition from sebaceous to moist environments. By exploiting this common yet uniquely human practice, we show that skin piercings are not just culturally significant but also represent ecosystem engineering on the human body. The novel habitats and communities that skin piercings produce may provide general insights into biological responses to environmental disturbances with implications for both ecosystem and human health.
Collapse
Affiliation(s)
- Charles C. Y. Xu
- Redpath Museum, McGill University, 859 Sherbrooke Street West, Montreal, Quebec, Canada H3A 0C4
- Department of Biology, McGill University, Montreal, Quebec, Canada H3A 1B1
| | - Juliette Lemoine
- Redpath Museum, McGill University, 859 Sherbrooke Street West, Montreal, Quebec, Canada H3A 0C4
- Department of Biology, McGill University, Montreal, Quebec, Canada H3A 1B1
- Department of Ecology and Evolution, University of Lausanne, Lausanne 1015, Switzerland
| | - Avery Albert
- Redpath Museum, McGill University, 859 Sherbrooke Street West, Montreal, Quebec, Canada H3A 0C4
- Department of Natural Resource Sciences, McGill University, Sainte-Anne-de-Bellevue, Quebec, Canada H9X 3V9
- Trottier Space Institute, McGill University, Montreal, Quebec, Canada H3A 2A7
| | | | - Rowan D. H. Barrett
- Redpath Museum, McGill University, 859 Sherbrooke Street West, Montreal, Quebec, Canada H3A 0C4
- Department of Biology, McGill University, Montreal, Quebec, Canada H3A 1B1
| |
Collapse
|
3
|
Ai D, Chen L, Xie J, Cheng L, Zhang F, Luan Y, Li Y, Hou S, Sun F, Xia LC. Identifying local associations in biological time series: algorithms, statistical significance, and applications. Brief Bioinform 2023; 24:bbad390. [PMID: 37930023 DOI: 10.1093/bib/bbad390] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2023] [Revised: 08/21/2023] [Accepted: 09/14/2023] [Indexed: 11/07/2023] Open
Abstract
Local associations refer to spatial-temporal correlations that emerge from the biological realm, such as time-dependent gene co-expression or seasonal interactions between microbes. One can reveal the intricate dynamics and inherent interactions of biological systems by examining the biological time series data for these associations. To accomplish this goal, local similarity analysis algorithms and statistical methods that facilitate the local alignment of time series and assess the significance of the resulting alignments have been developed. Although these algorithms were initially devised for gene expression analysis from microarrays, they have been adapted and accelerated for multi-omics next generation sequencing datasets, achieving high scientific impact. In this review, we present an overview of the historical developments and recent advances for local similarity analysis algorithms, their statistical properties, and real applications in analyzing biological time series data. The benchmark data and analysis scripts used in this review are freely available at http://github.com/labxscut/lsareview.
Collapse
Affiliation(s)
- Dongmei Ai
- School of Mathematics and Physics, University of Science and Technology Beijing, Beijing 100083, China
| | - Lulu Chen
- School of Mathematics and Physics, University of Science and Technology Beijing, Beijing 100083, China
| | - Jiemin Xie
- Department of Statistics and Financial Mathematics, School of Mathematics, South China University of Technology, Guangzhou 510641, China
| | - Longwei Cheng
- School of Mathematics and Physics, University of Science and Technology Beijing, Beijing 100083, China
| | - Fang Zhang
- Shenwan Hongyuan Securities Co. Ltd., Shanghai 200031, China
| | - Yihui Luan
- School of Mathematics, Shandong University, Jinan 250100, China
| | - Yang Li
- Department of Statistics and Financial Mathematics, School of Mathematics, South China University of Technology, Guangzhou 510641, China
| | - Shengwei Hou
- Department of Ocean Science and Engineering, Southern University of Science and Technology, Shenzhen, 518055, China
| | - Fengzhu Sun
- Department of Quantitative and Computational Biology, University of Southern California, California, 90007, USA
| | - Li Charlie Xia
- Department of Statistics and Financial Mathematics, School of Mathematics, South China University of Technology, Guangzhou 510641, China
| |
Collapse
|
4
|
Gonzalez JM, Aranda B. Microbial Growth under Limiting Conditions-Future Perspectives. Microorganisms 2023; 11:1641. [PMID: 37512814 PMCID: PMC10383181 DOI: 10.3390/microorganisms11071641] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2023] [Revised: 06/02/2023] [Accepted: 06/20/2023] [Indexed: 07/30/2023] Open
Abstract
Microorganisms rule the functioning of our planet and each one of the individual macroscopic living creature. Nevertheless, microbial activity and growth status have always been challenging tasks to determine both in situ and in vivo. Microbial activity is generally related to growth, and the growth rate is a result of the availability of nutrients under adequate or adverse conditions faced by microbial cells in a changing environment. Most studies on microorganisms have been carried out under optimum or near-optimum growth conditions, but scarce information is available about microorganisms at slow-growing states (i.e., near-zero growth and maintenance metabolism). This study aims to better understand microorganisms under growth-limiting conditions. This is expected to provide new perspectives on the functions and relevance of the microbial world. This is because (i) microorganisms in nature frequently face conditions of severe growth limitation, (ii) microorganisms activate singular pathways (mostly genes remaining to be functionally annotated), resulting in a broad range of secondary metabolites, and (iii) the response of microorganisms to slow-growth conditions remains to be understood, including persistence strategies, gene expression, and cell differentiation both within clonal populations and due to the complexity of the environment.
Collapse
Affiliation(s)
- Juan M Gonzalez
- Instituto de Recursos Naturales y Agrobiología de Sevilla, Consejo Superior de Investigaciones Científicas, IRNAS-CSIC, E-41012 Sevilla, Spain
| | - Beatriz Aranda
- Instituto de Recursos Naturales y Agrobiología de Sevilla, Consejo Superior de Investigaciones Científicas, IRNAS-CSIC, E-41012 Sevilla, Spain
| |
Collapse
|
5
|
Chew LY, Ong YS. Granger causality using Jacobian in neural networks. CHAOS (WOODBURY, N.Y.) 2023; 33:023126. [PMID: 36859223 DOI: 10.1063/5.0106666] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/29/2022] [Accepted: 01/16/2023] [Indexed: 06/18/2023]
Abstract
Granger causality is a commonly used method for uncovering information flow and dependencies in a time series. Here, we introduce JGC (Jacobian Granger causality), a neural network-based approach to Granger causality using the Jacobian as a measure of variable importance, and propose a variable selection procedure for inferring Granger causal variables with this measure, using criteria of significance and consistency. The resulting approach performs consistently well compared to other approaches in identifying Granger causal variables, the associated time lags, as well as interaction signs. In addition, we also discuss the need for contemporaneous variables in Granger causal modeling as well as how these neural network-based approaches reduce the impact of nonseparability in dynamical systems, a problem where predictive information on a target variable is not unique to its causes, but also contained in the history of the target variable itself.
Collapse
Affiliation(s)
- Lock Yue Chew
- School of Physical and Mathematical Sciences, Nanyang Technological University, Singapore 637371
| | - Yew-Soon Ong
- School of Computer Science and Engineering, Nanyang Technological University, Singapore 639798
| |
Collapse
|
6
|
Yuan AE, Shou W. Data-driven causal analysis of observational biological time series. eLife 2022; 11:e72518. [PMID: 35983746 PMCID: PMC9391047 DOI: 10.7554/elife.72518] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2021] [Accepted: 01/23/2022] [Indexed: 11/28/2022] Open
Abstract
Complex systems are challenging to understand, especially when they defy manipulative experiments for practical or ethical reasons. Several fields have developed parallel approaches to infer causal relations from observational time series. Yet, these methods are easy to misunderstand and often controversial. Here, we provide an accessible and critical review of three statistical causal discovery approaches (pairwise correlation, Granger causality, and state space reconstruction), using examples inspired by ecological processes. For each approach, we ask what it tests for, what causal statement it might imply, and when it could lead us astray. We devise new ways of visualizing key concepts, describe some novel pathologies of existing methods, and point out how so-called 'model-free' causality tests are not assumption-free. We hope that our synthesis will facilitate thoughtful application of methods, promote communication across different fields, and encourage explicit statements of assumptions. A video walkthrough is available (Video 1 or https://youtu.be/AIV0ttQrjK8).
Collapse
Affiliation(s)
- Alex Eric Yuan
- Molecular and Cellular Biology PhD program, University of WashingtonSeattleUnited States
- Basic Sciences Division, Fred Hutchinson Cancer Research CenterSeattleUnited States
| | - Wenying Shou
- Centre for Life’s Origins and Evolution, Department of Genetics, Evolution and Environment, University College LondonLondonUnited Kingdom
| |
Collapse
|
7
|
Cai W, Han X, Sangeetha T, Yao H. Causality and correlation analysis for deciphering the microbial interactions in activated sludge. Front Microbiol 2022; 13:870766. [PMID: 35992723 PMCID: PMC9387910 DOI: 10.3389/fmicb.2022.870766] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2022] [Accepted: 07/04/2022] [Indexed: 12/02/2022] Open
Abstract
Time series data has been considered to be a massive information provider for comprehending more about microbial dynamics and interaction, leading to a causality inference in a complex microbial community. Granger causality and correlation analysis have been investigated and applied for the construction of a microbial causal correlation network (MCCN) and efficient prediction of the ecological interaction within activated sludge, which thereby exhibited ecological interactions at the OTU-level. Application of MCCN to a time series of activated sludge data revealed that the hub species OTU56, classified as the one belonging to the genus Nitrospira, was responsible for nitrification in activated sludge and interaction with Proteobacteria and Bacteroidetes in the form of amensal and commensal relationships, respectively. The phylogenetic tree suggested a mutualistic relationship between Nitrospira and denitrifiers. Zoogloea displayed the highest ncf value within the classified OTUs of the MCCN, indicating that it could be a foundation for activated sludge through the formation of characteristic cell aggregate matrices where other organisms embed during floc formation. Inclusively, the research outcomes of this study have provided a deep insight into the ecological interactions within the communities of activated sludge.
Collapse
Affiliation(s)
- Weiwei Cai
- School of Civil Engineering, Beijing Jiaotong University, Beijing, China
| | - Xiangyu Han
- School of Civil Engineering, Beijing Jiaotong University, Beijing, China
| | - Thangavel Sangeetha
- Research Center of Energy Conservation for New Generation of Residential, Commercial, and Industrial Sectors, National Taipei University of Technology, Taipei, Taiwan
- Department of Energy and Refrigerating Air-Conditioning Engineering, National Taipei University of Technology, Taipei, Taiwan
| | - Hong Yao
- School of Civil Engineering, Beijing Jiaotong University, Beijing, China
- *Correspondence: Hong Yao,
| |
Collapse
|
8
|
Park SY, Ufondu A, Lee K, Jayaraman A. Emerging computational tools and models for studying gut microbiota composition and function. Curr Opin Biotechnol 2020; 66:301-311. [PMID: 33248408 PMCID: PMC7744364 DOI: 10.1016/j.copbio.2020.10.005] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2020] [Revised: 10/15/2020] [Accepted: 10/16/2020] [Indexed: 02/06/2023]
Abstract
The gut microbiota and its metabolites play critical roles in human health and disease. Advances in high-throughput sequencing, mass spectrometry, and other omics assay platforms have improved our ability to generate large volumes of data exploring the temporal variations in the compositions and functions of microbial communities. To elucidate mechanisms, methods and tools are needed that can rigorously model the dependencies within time-series data. Longitudinal data are often sparse and unevenly sampled, and nontrivial challenges remain in determining statistical significance, normalization across different data types, and model validation. In this review, we highlight recent developments in models and software tools for the analysis of time series microbiome and metabolome data, as well as integration of these data.
Collapse
Affiliation(s)
- Seo-Young Park
- Department of Chemical and Biological Engineering, Tufts University, Medford, MA, USA
| | - Arinzechukwu Ufondu
- Artie McFerrin Department of Chemical Engineering, Texas A&M University, College Station, TX, USA
| | - Kyongbum Lee
- Department of Chemical and Biological Engineering, Tufts University, Medford, MA, USA.
| | - Arul Jayaraman
- Artie McFerrin Department of Chemical Engineering, Texas A&M University, College Station, TX, USA; Department of Biomedical Engineering, Texas A&M University, College Station, TX, USA.
| |
Collapse
|
9
|
Use and abuse of correlation analyses in microbial ecology. ISME JOURNAL 2019; 13:2647-2655. [PMID: 31253856 DOI: 10.1038/s41396-019-0459-z] [Citation(s) in RCA: 118] [Impact Index Per Article: 23.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/31/2019] [Revised: 05/25/2019] [Accepted: 06/03/2019] [Indexed: 12/14/2022]
Abstract
Correlation analyses are often included in bioinformatic pipelines as methods for inferring taxon-taxon interactions. In this perspective, we highlight the pitfalls of inferring interactions from covariance and suggest methods, study design considerations, and additional data types for improving high-throughput interaction inferences. We conclude that correlation, even when augmented by other data types, almost never provides reliable information on direct biotic interactions in real-world ecosystems. These bioinformatically inferred associations are useful for reducing the number of potential hypotheses that we might test, but will never preclude the necessity for experimental validation.
Collapse
|