1
|
Jang H, Koh H. A unified web cloud computing platform MiMedSurv for microbiome causal mediation analysis with survival responses. Sci Rep 2024; 14:20650. [PMID: 39232070 PMCID: PMC11374894 DOI: 10.1038/s41598-024-71852-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2024] [Accepted: 08/31/2024] [Indexed: 09/06/2024] Open
Abstract
In human microbiome studies, mediation analysis has recently been spotlighted as a practical and powerful analytic tool to survey the causal roles of the microbiome as a mediator to explain the observed relationships between a medical treatment/environmental exposure and a human disease. We also note that, in a clinical research, investigators often trace disease progression sequentially in time; as such, time-to-event (e.g., time-to-disease, time-to-cure) responses, known as survival responses, are prevalent as a surrogate variable for human health or disease. In this paper, we introduce a web cloud computing platform, named as microbiome mediation analysis with survival responses (MiMedSurv), for comprehensive microbiome mediation analysis with survival responses on user-friendly web environments. MiMedSurv is an extension of our prior web cloud computing platform, named as microbiome mediation analysis (MiMed), for survival responses. The two main features that are well-distinguished are as follows. First, MiMedSurv conducts some baseline exploratory non-mediational survival analysis, not involving microbiome, to survey the disparity in survival response between medical treatments/environmental exposures. Then, MiMedSurv identifies the mediating roles of the microbiome in various aspects: (i) as a microbial ecosystem using ecological indices (e.g., alpha and beta diversity indices) and (ii) as individual microbial taxa in various hierarchies (e.g., phyla, classes, orders, families, genera, species). To illustrate its use, we survey the mediating roles of the gut microbiome between antibiotic treatment and time-to-type 1 diabetes. MiMedSurv is freely available on our web server ( http://mimedsurv.micloud.kr ).
Collapse
Affiliation(s)
- Hyojung Jang
- Department of Applied Mathematics and Statistics, The State University of New York, Korea, Incheon, South Korea
| | - Hyunwook Koh
- Department of Applied Mathematics and Statistics, The State University of New York, Korea, Incheon, South Korea.
| |
Collapse
|
2
|
Kim J, Jang H, Koh H. MiMultiCat: A Unified Cloud Platform for the Analysis of Microbiome Data with Multi-Categorical Responses. Bioengineering (Basel) 2024; 11:60. [PMID: 38247937 PMCID: PMC10813402 DOI: 10.3390/bioengineering11010060] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2023] [Revised: 12/21/2023] [Accepted: 12/31/2023] [Indexed: 01/23/2024] Open
Abstract
The field of the human microbiome is rapidly growing due to the recent advances in high-throughput sequencing technologies. Meanwhile, there have also been many new analytic pipelines, methods and/or tools developed for microbiome data preprocessing and analytics. They are usually focused on microbiome data with continuous (e.g., body mass index) or binary responses (e.g., diseased vs. healthy), yet multi-categorical responses that have more than two categories are also common in reality. In this paper, we introduce a new unified cloud platform, named MiMultiCat, for the analysis of microbiome data with multi-categorical responses. The two main distinguishing features of MiMultiCat are as follows: First, MiMultiCat streamlines a long sequence of microbiome data preprocessing and analytic procedures on user-friendly web interfaces; as such, it is easy to use for many people in various disciplines (e.g., biology, medicine, public health). Second, MiMultiCat performs both association testing and prediction modeling extensively. For association testing, MiMultiCat handles both ecological (e.g., alpha and beta diversity) and taxonomical (e.g., phylum, class, order, family, genus, species) contexts through covariate-adjusted or unadjusted analysis. For prediction modeling, MiMultiCat employs the random forest and gradient boosting algorithms that are well suited to microbiome data while providing nice visual interpretations. We demonstrate its use through the reanalysis of gut microbiome data on obesity with body mass index categories. MiMultiCat is freely available on our web server.
Collapse
Affiliation(s)
| | | | - Hyunwook Koh
- Department of Applied Mathematics and Statistics, The State University of New York (SUNY), Incheon 21985, Republic of Korea
| |
Collapse
|
3
|
Kim J, Koh H. MiTree: A Unified Web Cloud Analytic Platform for User-Friendly and Interpretable Microbiome Data Mining Using Tree-Based Methods. Microorganisms 2023; 11:2816. [PMID: 38004827 PMCID: PMC10672986 DOI: 10.3390/microorganisms11112816] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2023] [Revised: 11/05/2023] [Accepted: 11/17/2023] [Indexed: 11/26/2023] Open
Abstract
The advent of next-generation sequencing has greatly accelerated the field of human microbiome studies. Currently, investigators are seeking, struggling and competing to find new ways to diagnose, treat and prevent human diseases through the human microbiome. Machine learning is a promising approach to help such an effort, especially due to the high complexity of microbiome data. However, many of the current machine learning algorithms are in a "black box", i.e., they are difficult to understand and interpret. In addition, clinicians, public health practitioners and biologists are not usually skilled at computer programming, and they do not always have high-end computing devices. Thus, in this study, we introduce a unified web cloud analytic platform, named MiTree, for user-friendly and interpretable microbiome data mining. MiTree employs tree-based learning methods, including decision tree, random forest and gradient boosting, that are well understood and suited to human microbiome studies. We also stress that MiTree can address both classification and regression problems through covariate-adjusted or unadjusted analysis. MiTree should serve as an easy-to-use and interpretable data mining tool for microbiome-based disease prediction modeling, and should provide new insights into microbiome-based diagnostics, treatment and prevention. MiTree is an open-source software that is available on our web server.
Collapse
|
4
|
Jang H, Park S, Koh H. Comprehensive microbiome causal mediation analysis using MiMed on user-friendly web interfaces. Biol Methods Protoc 2023; 8:bpad023. [PMID: 37840574 PMCID: PMC10576642 DOI: 10.1093/biomethods/bpad023] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2023] [Revised: 09/19/2023] [Accepted: 10/02/2023] [Indexed: 10/17/2023] Open
Abstract
It is a central goal of human microbiome studies to see the roles of the microbiome as a mediator that transmits environmental, behavioral, or medical exposures to health or disease outcomes. Yet, mediation analysis is not used as much as it should be. One reason is because of the lack of carefully planned routines, compilers, and automated computing systems for microbiome mediation analysis (MiMed) to perform a series of data processing, diversity calculation, data normalization, downstream data analysis, and visualizations. Many researchers in various disciplines (e.g. clinicians, public health practitioners, and biologists) are not also familiar with related statistical methods and programming languages on command-line interfaces. Thus, in this article, we introduce a web cloud computing platform, named as MiMed, that enables comprehensive MiMed on user-friendly web interfaces. The main features of MiMed are as follows. First, MiMed can survey the microbiome in various spheres (i) as a whole microbial ecosystem using different ecological measures (e.g. alpha- and beta-diversity indices) or (ii) as individual microbial taxa (e.g. phyla, classes, orders, families, genera, and species) using different data normalization methods. Second, MiMed enables covariate-adjusted analysis to control for potential confounding factors (e.g. age and gender), which is essential to enhance the causality of the results, especially for observational studies. Third, MiMed enables a breadth of statistical inferences in both mediation effect estimation and significance testing. Fourth, MiMed provides flexible and easy-to-use data processing and analytic modules and creates nice graphical representations. Finally, MiMed employs ChatGPT to search for what has been known about the microbial taxa that are found significantly as mediators using artificial intelligence technologies. For demonstration purposes, we applied MiMed to the study on the mediating roles of oral microbiome in subgingival niches between e-cigarette smoking and gingival inflammation. MiMed is freely available on our web server (http://mimed.micloud.kr).
Collapse
Affiliation(s)
- Hyojung Jang
- Department of Applied Mathematics and Statistics, The State University of New York, Korea, Incheon, South Korea
| | - Solha Park
- Department of Applied Mathematics and Statistics, The State University of New York, Korea, Incheon, South Korea
| | - Hyunwook Koh
- Department of Applied Mathematics and Statistics, The State University of New York, Korea, Incheon, South Korea
| |
Collapse
|
5
|
Gu W, Koh H, Jang H, Lee B, Kang B. MiSurv: an Integrative Web Cloud Platform for User-Friendly Microbiome Data Analysis with Survival Responses. Microbiol Spectr 2023; 11:e0505922. [PMID: 37039671 PMCID: PMC10269532 DOI: 10.1128/spectrum.05059-22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2022] [Accepted: 03/12/2023] [Indexed: 04/12/2023] Open
Abstract
Investigators have studied the treatment effects on human health or disease, the treatment effects on human microbiome, and the roles of the microbiome on human health or disease. Especially, in a clinical trial, investigators commonly trace disease status over a lengthy period to survey the sequential disease progression for different treatment groups (e.g., treatment versus placebo, new treatment versus old treatment). Hence, disease responses are often available in the form of survival (i.e., time-to-event) responses stratified by treatment groups. While the recent web cloud platforms have enabled user-friendly microbiome data processing and analytics, there is currently no web cloud platform to analyze microbiome data with survival responses. Therefore, we introduce here an integrative web cloud platform, called MiSurv, for comprehensive microbiome data analysis with survival responses. IMPORTANCE MiSurv consists of a data processing module and its following four data analytic modules: (i) Module 1: Comparative survival analysis between treatment groups, (ii) Module 2: Comparative analysis in microbial composition between treatment groups, (iii) Module 3: Association testing between microbial composition and survival responses, (iv) Module 4: Prediction modeling using microbial taxa on survival responses. We demonstrate its use through an example trial on the effects of antibiotic use on the survival rate against type 1 diabetes (T1D) onset and gut microbiome composition, respectively, and the effects of the gut microbiome on the survival rate against T1D onset. MiSurv is freely available on our web server (http://misurv.micloud.kr) or can alternatively run on the user's local computer (https://github.com/wg99526/MiSurvGit).
Collapse
Affiliation(s)
- Won Gu
- Department of Applied Mathematics and Statistics, The State University of New York, Korea, Incheon, South Korea
| | - Hyunwook Koh
- Department of Applied Mathematics and Statistics, The State University of New York, Korea, Incheon, South Korea
| | - Hyojung Jang
- Department of Applied Mathematics and Statistics, The State University of New York, Korea, Incheon, South Korea
| | - Byungho Lee
- Department of Applied Mathematics and Statistics, The State University of New York, Korea, Incheon, South Korea
| | - Byungkon Kang
- Department of Computer Science, The State University of New York, Korea, Incheon, South Korea
| |
Collapse
|
6
|
Jang H, Koh H, Gu W, Kang B. Integrative web cloud computing and analytics using MiPair for design-based comparative analysis with paired microbiome data. Sci Rep 2022; 12:20465. [PMID: 36443470 PMCID: PMC9705534 DOI: 10.1038/s41598-022-25093-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2022] [Accepted: 11/24/2022] [Indexed: 11/29/2022] Open
Abstract
Pairing (or blocking) is a design technique that is widely used in comparative microbiome studies to efficiently control for the effects of potential confounders (e.g., genetic, environmental, or behavioral factors). Some typical paired (block) designs for human microbiome studies are repeated measures designs that profile each subject's microbiome twice (or more than twice) (1) for pre and post treatments to see the effects of a treatment on microbiome, or (2) for different organs of the body (e.g., gut, mouth, skin) to see the disparity in microbiome between (or across) body sites. Researchers have developed a sheer number of web-based tools for user-friendly microbiome data processing and analytics, though there is no web-based tool currently available for such paired microbiome studies. In this paper, we thus introduce an integrative web-based tool, named MiPair, for design-based comparative analysis with paired microbiome data. MiPair is a user-friendly web cloud service that is built with step-by-step data processing and analytic procedures for comparative analysis between (or across) groups or between baseline and other groups. MiPair employs parametric and non-parametric tests for complete or incomplete block designs to perform comparative analyses with respect to microbial ecology (alpha- and beta-diversity) and taxonomy (e.g., phylum, class, order, family, genus, species). We demonstrate its usage through an example clinical trial on the effects of antibiotics on gut microbiome. MiPair is an open-source software that can be run on our web server ( http://mipair.micloud.kr ) or on user's computer ( https://github.com/yj7599/mipairgit ).
Collapse
Affiliation(s)
- Hyojung Jang
- grid.410685.e0000 0004 7650 0888Department of Applied Mathematics and Statistics, The State University of New York, Korea, Incheon, South Korea
| | - Hyunwook Koh
- grid.410685.e0000 0004 7650 0888Department of Applied Mathematics and Statistics, The State University of New York, Korea, Incheon, South Korea
| | - Won Gu
- grid.410685.e0000 0004 7650 0888Department of Applied Mathematics and Statistics, The State University of New York, Korea, Incheon, South Korea
| | - Byungkon Kang
- grid.410685.e0000 0004 7650 0888Department of Computer Science, The State University of New York, Korea, Incheon, South Korea
| |
Collapse
|