Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For:	[Subscribe] [Scholar Register]

Number

Cited by Other Article(s)

Bai X, Ren J, Fan Y, Sun F. KIMI: Knockoff Inference for Motif Identification from molecular sequences with controlled false discovery rate. Bioinformatics 2021;37:759-766. [PMID: 33119059 PMCID: PMC8599924 DOI: 10.1093/bioinformatics/btaa912] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2020] [Revised: 09/11/2020] [Accepted: 10/14/2020] [Indexed: 01/09/2023] Open

Abstract

MOTIVATION

The rapid development of sequencing technologies has enabled us to generate a large number of metagenomic reads from genetic materials in microbial communities, making it possible to gain deep insights into understanding the differences between the genetic materials of different groups of microorganisms, such as bacteria, viruses, plasmids, etc. Computational methods based on k-mer frequencies have been shown to be highly effective for classifying metagenomic sequencing reads into different groups. However, such methods usually use all the k-mers as features for prediction without selecting relevant k-mers for the different groups of sequences, i.e. unique nucleotide patterns containing biological significance.

RESULTS

To select k-mers for distinguishing different groups of sequences with guaranteed false discovery rate (FDR) control, we develop KIMI, a general framework based on model-X Knockoffs regarded as the state-of-the-art statistical method for FDR control, for sequence motif discovery with arbitrary target FDR level, such that reproducibility can be theoretically guaranteed. KIMI is shown through simulation studies to be effective in simultaneously controlling FDR and yielding high power, outperforming the broadly used Benjamini-Hochberg procedure and the q-value method for FDR control. To illustrate the usefulness of KIMI in analyzing real datasets, we take the viral motif discovery problem as an example and implement KIMI on a real dataset consisting of viral and bacterial contigs. We show that the accuracy of predicting viral and bacterial contigs can be increased by training the prediction model only on relevant k-mers selected by KIMI.

AVAILABILITYAND IMPLEMENTATION

Our implementation of KIMI is available at https://github.com/xinbaiusc/KIMI.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

Collapse

Murga-Garrido SM, Hong Q, Cross TWL, Hutchison ER, Han J, Thomas SP, Vivas EI, Denu J, Ceschin DG, Tang ZZ, Rey FE. Gut microbiome variation modulates the effects of dietary fiber on host metabolism. MICROBIOME 2021;9:117. [PMID: 34016169 PMCID: PMC8138933 DOI: 10.1186/s40168-021-01061-6] [Citation(s) in RCA: 65] [Impact Index Per Article: 21.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/19/2020] [Accepted: 03/24/2021] [Indexed: 05/11/2023]

Abstract

BACKGROUND

There is general consensus that consumption of dietary fermentable fiber improves cardiometabolic health, in part by promoting mutualistic microbes and by increasing production of beneficial metabolites in the distal gut. However, human studies have reported variations in the observed benefits among individuals consuming the same fiber. Several factors likely contribute to this variation, including host genetic and gut microbial differences. We hypothesized that gut microbial metabolism of dietary fiber represents an important and differential factor that modulates how dietary fiber impacts the host.

RESULTS

We examined genetically identical gnotobiotic mice harboring two distinct complex gut microbial communities and exposed to four isocaloric diets, each containing different fibers: (i) cellulose, (ii) inulin, (iii) pectin, (iv) a mix of 5 fermentable fibers (assorted fiber). Gut microbiome analysis showed that each transplanted community preserved a core of common taxa across diets that differentiated it from the other community, but there were variations in richness and bacterial taxa abundance within each community among the different diet treatments. Host epigenetic, transcriptional, and metabolomic analyses revealed diet-directed differences between animals colonized with the two communities, including variation in amino acids and lipid pathways that were associated with divergent health outcomes.

CONCLUSION

This study demonstrates that interindividual variation in the gut microbiome is causally linked to differential effects of dietary fiber on host metabolic phenotypes and suggests that a one-fits-all fiber supplementation approach to promote health is unlikely to elicit consistent effects across individuals. Overall, the presented results underscore the importance of microbe-diet interactions on host metabolism and suggest that gut microbes modulate dietary fiber efficacy. Video abstract.

Collapse

Uh HW, Klarić L, Ugrina I, Lauc G, Smilde AK, Houwing-Duistermaat JJ. Choosing proper normalization is essential for discovery of sparse glycan biomarkers. Mol Omics 2021;16:231-242. [PMID: 32211690 DOI: 10.1039/c9mo00174c] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]

Fiksel J, Zeger S, Datta A. A transformation-free linear regression for compositional outcomes and predictors. Biometrics 2021;78:974-987. [PMID: 33788259 DOI: 10.1111/biom.13465] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2020] [Revised: 03/09/2021] [Accepted: 03/15/2021] [Indexed: 11/29/2022]

Wu X, Liang R, Yang H. Penalized and constrained LAD estimation in fixed and high dimension. Stat Pap (Berl) 2021;63:53-95. [PMID: 33814727 PMCID: PMC8009762 DOI: 10.1007/s00362-021-01229-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2020] [Revised: 02/27/2021] [Indexed: 11/26/2022]

Shi P, Zhou Y, Zhang AR. High-dimensional log-error-in-variable regression with applications to microbial compositional data analysis. Biometrika 2021. [DOI: 10.1093/biomet/asab020] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open

Zhou F, He K, Li Q, Chapkin RS, Ni Y. Bayesian biclustering for microbial metagenomic sequencing data via multinomial matrix factorization. Biostatistics 2021;23:891-909. [PMID: 33634824 DOI: 10.1093/biostatistics/kxab002] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2020] [Revised: 10/08/2020] [Accepted: 01/10/2021] [Indexed: 12/26/2022] Open

Zhang H, Chen J, Feng Y, Wang C, Li H, Liu L. Mediation effect selection in high-dimensional and compositional microbiome data. Stat Med 2021;40:885-896. [PMID: 33205470 PMCID: PMC7855955 DOI: 10.1002/sim.8808] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2020] [Revised: 08/31/2020] [Accepted: 10/16/2020] [Indexed: 01/08/2023]

Yang F, Zou Q, Gao B. GutBalance: a server for the human gut microbiome-based disease prediction and biomarker discovery with compositionality addressed. Brief Bioinform 2021;22:6123951. [PMID: 33515036 DOI: 10.1093/bib/bbaa436] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2020] [Revised: 12/17/2020] [Accepted: 12/26/2020] [Indexed: 02/07/2023] Open

Abstract

The compositionality of the microbiome data is well-known but often neglected. The compositional transformation pertains to the supervised learning of microbiome data and is a critical step that decides the performance and reliability of the disease classifiers. We value the excellent performance of the distal discriminative balance analysis (DBA) method, which selects distal balances of pairs and trios of bacteria, in addressing the classification of high-dimensional microbiome data. By applying this method to the species-level abundances of all the disease phenotypes in the GMrepo database, we build a balance-based model repository for the classification of human gut microbiome-related diseases. The model repository supports the prediction of disease risks for new sample(s). More importantly, we highlight the concept of balance-disease associations rather than the conventional microbe-disease associations and develop the human Gut Balance-Disease Association Database (GBDAD). Each predictable balance for each disease model indicates a potential biomarker-disease relationship and can be interpreted as a bacteria ratio positively or negatively correlated with the disease. Furthermore, by linking the balance-disease associations to the evidenced microbe-disease associations in MicroPhenoDB, we surprisingly found that most species-disease associations inferred from the shotgun metagenomic datasets can be validated by external evidence beyond MicroPhenoDB. The balance-based species-disease association inference will accelerate the generation of new microbe-disease association hypotheses in gastrointestinal microecology research and clinical trials. The model repository and the GBDAD database are deployed on the GutBalance server, which supports interactive visualization and systematic interrogation of the disease models, disease-related balances and disease-related species of interest.

Collapse

Li Z, Tian L, O’Malley AJ, Karagas MR, Hoen AG, Christensen BC, Madan JC, Wu Q, Gharaibeh RZ, Jobin C, Li H. IFAA: Robust Association Identification and Inference for Absolute Abundance in Microbiome Analyses. J Am Stat Assoc 2021;116:1595-1608. [PMID: 35241863 PMCID: PMC8890673 DOI: 10.1080/01621459.2020.1860770] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2019] [Revised: 09/30/2020] [Accepted: 12/03/2020] [Indexed: 12/15/2022]

Ma X, Zhang P. Quantile regression for compositional covariates. COMMUN STAT-SIMUL C 2021. [DOI: 10.1080/03610918.2020.1862231] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]

Sun Z, Xu W, Cong X, Li G, Chen K. LOG-CONTRAST REGRESSION WITH FUNCTIONAL COMPOSITIONAL PREDICTORS: LINKING PRETERM INFANT'S GUT MICROBIOME TRAJECTORIES TO NEUROBEHAVIORAL OUTCOME. Ann Appl Stat 2020;14:1535-1556. [PMID: 34163544 DOI: 10.1214/20-aoas1357] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]

Yan X, Bien J. Rare Feature Selection in High Dimensions. J Am Stat Assoc 2020. [DOI: 10.1080/01621459.2020.1796677] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]

Koslovsky MD, Hoffman KL, Daniel CR, Vannucci M. A Bayesian model of microbiome data for simultaneous identification of covariate associations and prediction of phenotypic outcomes. Ann Appl Stat 2020. [DOI: 10.1214/20-aoas1354] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]

Srinivasan S, Chambers LC, Tapia KA, Hoffman NG, Munch MM, Morgan JL, Domogala D, Sylvan Lowens M, Proll S, Huang ML, Soge OO, Jerome KR, Golden MR, Hughes JP, Fredricks DN, Manhart LE. Urethral Microbiota in Men: Association of Haemophilus influenzae and Mycoplasma penetrans With Nongonococcal Urethritis. Clin Infect Dis 2020;73:e1684-e1693. [PMID: 32750107 DOI: 10.1093/cid/ciaa1123] [Citation(s) in RCA: 29] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2020] [Accepted: 07/30/2020] [Indexed: 01/15/2023] Open

Affiliation(s)

Sujatha Srinivasan Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, Washington, USA
Laura C Chambers Department of Epidemiology, University of Washington, Seattle, Washington, USA
Kenneth A Tapia Department of Global Health, University of Washington, Seattle, Washington, USA
Noah G Hoffman Department of Laboratory Medicine, University of Washington, Seattle, Washington, USA
Matthew M Munch Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, Washington, USA
Jennifer L Morgan Public Health-Seattle & King County HIV/STD Program, Seattle, Washington, USA
Daniel Domogala Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, Washington, USA
M Sylvan Lowens Public Health-Seattle & King County HIV/STD Program, Seattle, Washington, USA
Sean Proll Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, Washington, USA
Meei-Li Huang Department of Laboratory Medicine, University of Washington, Seattle, Washington, USA
Olusegun O Soge Department of Global Health, University of Washington, Seattle, Washington, USA.,Department of Medicine, University of Washington, Seattle, Washington, USA
Keith R Jerome Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, Washington, USA.,Department of Laboratory Medicine, University of Washington, Seattle, Washington, USA
Matthew R Golden Public Health-Seattle & King County HIV/STD Program, Seattle, Washington, USA.,Department of Medicine, University of Washington, Seattle, Washington, USA
James P Hughes Department of Biostatistics, University of Washington, Seattle, Washington, USA
David N Fredricks Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, Washington, USA.,Department of Medicine, University of Washington, Seattle, Washington, USA
Lisa E Manhart Department of Epidemiology, University of Washington, Seattle, Washington, USA.,Department of Global Health, University of Washington, Seattle, Washington, USA

Collapse

Jeon JJ, Kim Y, Won S, Choi H. Primal path algorithm for compositional data analysis. Comput Stat Data Anal 2020. [DOI: 10.1016/j.csda.2020.106958] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]

Chen J, Zhang X, Hron K. Partial least squares regression with compositional response variables and covariates. J Appl Stat 2020;48:3130-3149. [DOI: 10.1080/02664763.2020.1795813] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]

Zhang L, Shi Y, Jenq RR, Do KA, Peterson CB. Bayesian compositional regression with structured priors for microbiome feature selection. Biometrics 2020;77:824-838. [PMID: 32686846 DOI: 10.1111/biom.13335] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2019] [Accepted: 07/13/2020] [Indexed: 01/10/2023]

Wang S, Cai TT, Li H. Hypothesis testing for phylogenetic composition: a minimum-cost flow perspective. Biometrika 2020;108:17-36. [PMID: 33716568 DOI: 10.1093/biomet/asaa061] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2019] [Indexed: 12/30/2022] Open

Regression Models for Compositional Data: General Log-Contrast Formulations, Proximal Optimization, and Microbiome Data Applications. STATISTICS IN BIOSCIENCES 2020. [DOI: 10.1007/s12561-020-09283-2] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]

Liu T, Zhao H, Wang T. An empirical Bayes approach to normalization and differential abundance testing for microbiome data. BMC Bioinformatics 2020;21:225. [PMID: 32493208 PMCID: PMC7268703 DOI: 10.1186/s12859-020-03552-z] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2019] [Accepted: 05/18/2020] [Indexed: 12/14/2022] Open

Altenbuchinger M, Weihs A, Quackenbush J, Grabe HJ, Zacharias HU. Gaussian and Mixed Graphical Models as (multi-)omics data analysis tools. BIOCHIMICA ET BIOPHYSICA ACTA. GENE REGULATORY MECHANISMS 2020;1863:194418. [PMID: 31639475 PMCID: PMC7166149 DOI: 10.1016/j.bbagrm.2019.194418] [Citation(s) in RCA: 27] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/31/2019] [Revised: 08/21/2019] [Accepted: 08/21/2019] [Indexed: 11/30/2022]

Susin A, Wang Y, Lê Cao KA, Calle ML. Variable selection in microbiome compositional data analysis. NAR Genom Bioinform 2020;2:lqaa029. [PMID: 33575585 PMCID: PMC7671404 DOI: 10.1093/nargab/lqaa029] [Citation(s) in RCA: 33] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2019] [Revised: 03/13/2020] [Accepted: 04/29/2020] [Indexed: 12/25/2022] Open

Xia Y. Correlation and association analyses in microbiome study integrating multiomics in health and disease. PROGRESS IN MOLECULAR BIOLOGY AND TRANSLATIONAL SCIENCE 2020;171:309-491. [PMID: 32475527 DOI: 10.1016/bs.pmbts.2020.04.003] [Citation(s) in RCA: 37] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]

Abstract

Correlation and association analyses are one of the most widely used statistical methods in research fields, including microbiome and integrative multiomics studies. Correlation and association have two implications: dependence and co-occurrence. Microbiome data are structured as phylogenetic tree and have several unique characteristics, including high dimensionality, compositionality, sparsity with excess zeros, and heterogeneity. These unique characteristics cause several statistical issues when analyzing microbiome data and integrating multiomics data, such as large p and small n, dependency, overdispersion, and zero-inflation. In microbiome research, on the one hand, classic correlation and association methods are still applied in real studies and used for the development of new methods; on the other hand, new methods have been developed to target statistical issues arising from unique characteristics of microbiome data. Here, we first provide a comprehensive view of classic and newly developed univariate correlation and association-based methods. We discuss the appropriateness and limitations of using classic methods and demonstrate how the newly developed methods mitigate the issues of microbiome data. Second, we emphasize that concepts of correlation and association analyses have been shifted by introducing network analysis, microbe-metabolite interactions, functional analysis, etc. Third, we introduce multivariate correlation and association-based methods, which are organized by the categories of exploratory, interpretive, and discriminatory analyses and classification methods. Fourth, we focus on the hypothesis testing of univariate and multivariate regression-based association methods, including alpha and beta diversities-based, count-based, and relative abundance (or compositional)-based association analyses. We demonstrate the characteristics and limitations of each approaches. Fifth, we introduce two specific microbiome-based methods: phylogenetic tree-based association analysis and testing for survival outcomes. Sixth, we provide an overall view of longitudinal methods in analysis of microbiome and omics data, which cover standard, static, regression-based time series methods, principal trend analysis, and newly developed univariate overdispersed and zero-inflated as well as multivariate distance/kernel-based longitudinal models. Finally, we comment on current association analysis and future direction of association analysis in microbiome and multiomics studies.

Collapse

Louzada F, Shimizu TK, Suzuki AK. The Spike-and-Slab Lasso regression modeling with compositional covariates: An application on Brazilian children malnutrition data. Stat Methods Med Res 2020;29:1434-1446. [PMID: 31333069 DOI: 10.1177/0962280219863817] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

McGregor DE, Palarea-Albaladejo J, Dall PM, Hron K, Chastin S. Cox regression survival analysis with compositional covariates: Application to modelling mortality risk from 24-h physical activity patterns. Stat Methods Med Res 2020;29:1447-1465. [PMID: 31342855 DOI: 10.1177/0962280219864125] [Citation(s) in RCA: 31] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]

Interpretable Log Contrasts for the Classification of Health Biomarkers: a New Approach to Balance Selection. mSystems 2020;5:5/2/e00230-19. [PMID: 32265314 PMCID: PMC7141889 DOI: 10.1128/msystems.00230-19] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open

Abstract

High-throughput sequencing provides an easy and cost-effective way to measure the relative abundance of bacteria in any environmental or biological sample. When these samples come from humans, the microbiome signatures can act as biomarkers for disease prediction. However, because bacterial abundance is measured as a composition, the data have unique properties that make conventional analyses inappropriate. To overcome this, analysts often use cumbersome normalizations. This article proposes an alternative method that identifies pairs and trios of bacteria whose stoichiometric presence can differentiate between diseased and nondiseased samples. By using interpretable log contrasts called balances, we developed an entirely normalization-free classification procedure that reduces the feature space and improves the interpretability, without sacrificing classifier performance.

Since the turn of the century, technological advances have made it possible to obtain the molecular profile of any tissue in a cost-effective manner. Among these advances are sophisticated high-throughput assays that measure the relative abundances of microorganisms, RNA molecules, and metabolites. While these data are most often collected to gain new insights into biological systems, they can also be used as biomarkers to create clinically useful diagnostic classifiers. How best to classify high-dimensional -omics data remains an area of active research. However, few explicitly model the relative nature of these data and instead rely on cumbersome normalizations. This report (i) emphasizes the relative nature of health biomarkers, (ii) discusses the literature surrounding the classification of relative data, and (iii) benchmarks how different transformations perform for regularized logistic regression across multiple biomarker types. We show how an interpretable set of log contrasts, called balances, can prepare data for classification. We propose a simple procedure, called discriminative balance analysis, to select groups of 2 and 3 bacteria that can together discriminate between experimental conditions. Discriminative balance analysis is a fast, accurate, and interpretable alternative to data normalization.

IMPORTANCE High-throughput sequencing provides an easy and cost-effective way to measure the relative abundance of bacteria in any environmental or biological sample. When these samples come from humans, the microbiome signatures can act as biomarkers for disease prediction. However, because bacterial abundance is measured as a composition, the data have unique properties that make conventional analyses inappropriate. To overcome this, analysts often use cumbersome normalizations. This article proposes an alternative method that identifies pairs and trios of bacteria whose stoichiometric presence can differentiate between diseased and nondiseased samples. By using interpretable log contrasts called balances, we developed an entirely normalization-free classification procedure that reduces the feature space and improves the interpretability, without sacrificing classifier performance.

Collapse

Chen X, Ma X, Zhou W. Kernel density regression. J Stat Plan Inference 2020. [DOI: 10.1016/j.jspi.2019.09.001] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]

Bar HY, Booth JG, Wells MT. A Scalable Empirical Bayes Approach to Variable Selection in Generalized Linear Models. J Comput Graph Stat 2020;29:535-546. [PMID: 38919169 PMCID: PMC11198964 DOI: 10.1080/10618600.2019.1706542] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2018] [Revised: 04/20/2019] [Accepted: 12/11/2019] [Indexed: 10/25/2022]

Lausser L, Szekely R, Klimmek A, Schmid F, Kestler HA. Constraining classifiers in molecular analysis: invariance and robustness. J R Soc Interface 2020;17:20190612. [PMID: 32019472 PMCID: PMC7061712 DOI: 10.1098/rsif.2019.0612] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2020] [Accepted: 01/09/2020] [Indexed: 12/02/2022] Open

Cao Y, Zhang A, Li H. Multisample estimation of bacterial composition matrices in metagenomics data. Biometrika 2019. [DOI: 10.1093/biomet/asz062] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023] Open

Wang Y, LêCao KA. Managing batch effects in microbiome data. Brief Bioinform 2019;21:1954-1970. [PMID: 31776547 DOI: 10.1093/bib/bbz105] [Citation(s) in RCA: 53] [Impact Index Per Article: 10.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2019] [Revised: 07/24/2019] [Indexed: 12/20/2022] Open

Jiang D, Armour CR, Hu C, Mei M, Tian C, Sharpton TJ, Jiang Y. Microbiome Multi-Omics Network Analysis: Statistical Considerations, Limitations, and Opportunities. Front Genet 2019;10:995. [PMID: 31781153 PMCID: PMC6857202 DOI: 10.3389/fgene.2019.00995] [Citation(s) in RCA: 83] [Impact Index Per Article: 16.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2019] [Accepted: 09/18/2019] [Indexed: 12/21/2022] Open

A multi-source data integration approach reveals novel associations between metabolites and renal outcomes in the German Chronic Kidney Disease study. Sci Rep 2019;9:13954. [PMID: 31562371 PMCID: PMC6764972 DOI: 10.1038/s41598-019-50346-2] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2019] [Accepted: 09/09/2019] [Indexed: 01/25/2023] Open

Randomized Lasso Links Microbial Taxa with Aquatic Functional Groups Inferred from Flow Cytometry. mSystems 2019;4:4/5/e00093-19. [PMID: 31506260 PMCID: PMC6739098 DOI: 10.1128/msystems.00093-19] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open

Abstract

A major goal in microbial ecology is to understand how microbial community structure influences ecosystem functioning. Various methods to directly associate bacterial taxa to functional groups in the environment are being developed. In this study, we applied machine learning methods to relate taxonomic data obtained from marker gene surveys to functional groups identified by flow cytometry. This allowed us to identify the taxa that are associated with heterotrophic productivity in freshwater lakes and indicated that the key contributors were highly system specific, regularly rare members of the community, and that some could possibly switch between being low and high contributors. Our approach provides a promising framework to identify taxa that contribute to ecosystem functioning and can be further developed to explore microbial contributions beyond heterotrophic production.

High-nucleic-acid (HNA) and low-nucleic-acid (LNA) bacteria are two operational groups identified by flow cytometry (FCM) in aquatic systems. A number of reports have shown that HNA cell density correlates strongly with heterotrophic production, while LNA cell density does not. However, which taxa are specifically associated with these groups, and by extension, productivity has remained elusive. Here, we addressed this knowledge gap by using a machine learning-based variable selection approach that integrated FCM and 16S rRNA gene sequencing data collected from 14 freshwater lakes spanning a broad range in physicochemical conditions. There was a strong association between bacterial heterotrophic production and HNA absolute cell abundances (R² = 0.65), but not with the more abundant LNA cells. This solidifies findings, mainly from marine systems, that HNA and LNA bacteria could be considered separate functional groups, the former contributing a disproportionately large share of carbon cycling. Taxa selected by the models could predict HNA and LNA absolute cell abundances at all taxonomic levels. Selected operational taxonomic units (OTUs) ranged from low to high relative abundance and were mostly lake system specific (89.5% to 99.2%). A subset of selected OTUs was associated with both LNA and HNA groups (12.5% to 33.3%), suggesting either phenotypic plasticity or within-OTU genetic and physiological heterogeneity. These findings may lead to the identification of system-specific putative ecological indicators for heterotrophic productivity. Generally, our approach allows for the association of OTUs with specific functional groups in diverse ecosystems in order to improve our understanding of (microbial) biodiversity-ecosystem functioning relationships.

IMPORTANCE A major goal in microbial ecology is to understand how microbial community structure influences ecosystem functioning. Various methods to directly associate bacterial taxa to functional groups in the environment are being developed. In this study, we applied machine learning methods to relate taxonomic data obtained from marker gene surveys to functional groups identified by flow cytometry. This allowed us to identify the taxa that are associated with heterotrophic productivity in freshwater lakes and indicated that the key contributors were highly system specific, regularly rare members of the community, and that some could possibly switch between being low and high contributors. Our approach provides a promising framework to identify taxa that contribute to ecosystem functioning and can be further developed to explore microbial contributions beyond heterotrophic production.

Collapse

Wang C, Hu J, Blaser MJ, Li H. Estimating and testing the microbial causal mediation effect with high-dimensional and compositional microbiome data. Bioinformatics 2019;36:347-355. [PMID: 31329243 PMCID: PMC7867996 DOI: 10.1093/bioinformatics/btz565] [Citation(s) in RCA: 33] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2019] [Revised: 06/17/2019] [Accepted: 07/16/2019] [Indexed: 02/07/2023] Open

Abstract

MOTIVATION

Recent microbiome association studies have revealed important associations between microbiome and disease/health status. Such findings encourage scientists to dive deeper to uncover the causal role of microbiome in the underlying biological mechanism, and have led to applying statistical models to quantify causal microbiome effects and to identify the specific microbial agents. However, there are no existing causal mediation methods specifically designed to handle high dimensional and compositional microbiome data.

RESULTS

We propose a rigorous Sparse Microbial Causal Mediation Model (SparseMCMM) specifically designed for the high dimensional and compositional microbiome data in a typical three-factor (treatment, microbiome and outcome) causal study design. In particular, linear log-contrast regression model and Dirichlet regression model are proposed to estimate the causal direct effect of treatment and the causal mediation effects of microbiome at both the community and individual taxon levels. Regularization techniques are used to perform the variable selection in the proposed model framework to identify signature causal microbes. Two hypothesis tests on the overall mediation effect are proposed and their statistical significance is estimated by permutation procedures. Extensive simulated scenarios show that SparseMCMM has excellent performance in estimation and hypothesis testing. Finally, we showcase the utility of the proposed SparseMCMM method in a study which the murine microbiome has been manipulated by providing a clear and sensible causal path among antibiotic treatment, microbiome composition and mouse weight.

AVAILABILITY AND IMPLEMENTATION

https://sites.google.com/site/huilinli09/software and https://github.com/chanw0/SparseMCMM.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

Collapse

Compositional data: the sample space and its structure. TEST-SPAIN 2019. [DOI: 10.1007/s11749-019-00670-6] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]

Yoon G, Gaynanova I, Müller CL. Microbial Networks in SPRING - Semi-parametric Rank-Based Correlation and Partial Correlation Estimation for Quantitative Microbiome Data. Front Genet 2019;10:516. [PMID: 31244881 PMCID: PMC6563871 DOI: 10.3389/fgene.2019.00516] [Citation(s) in RCA: 42] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2019] [Accepted: 05/13/2019] [Indexed: 12/15/2022] Open

Abstract

High-throughput microbial sequencing techniques, such as targeted amplicon-based and metagenomic profiling, provide low-cost genomic survey data of microbial communities in their natural environment, ranging from marine ecosystems to host-associated habitats. While standard microbiome profiling data can provide sparse relative abundances of operational taxonomic units or genes, recent advances in experimental protocols give a more quantitative picture of microbial communities by pairing sequencing-based techniques with orthogonal measurements of microbial cell counts from the same sample. These tandem measurements provide absolute microbial count data albeit with a large excess of zeros due to limited sequencing depth. In this contribution we consider the fundamental statistical problem of estimating correlations and partial correlations from such quantitative microbiome data. To this end, we propose a semi-parametric rank-based approach to correlation estimation that can naturally deal with the excess zeros in the data. Combining this estimator with sparse graphical modeling techniques leads to the Semi-Parametric Rank-based approach for INference in Graphical model (SPRING). SPRING enables inference of statistical microbial association networks from quantitative microbiome data which can serve as high-level statistical summary of the underlying microbial ecosystem and can provide testable hypotheses for functional species-species interactions. Due to the absence of verified microbial associations we also introduce a novel quantitative microbiome data generation mechanism which mimics empirical marginal distributions of measured count data while simultaneously allowing user-specified dependencies among the variables. SPRING shows superior network recovery performance on a wide range of realistic benchmark problems with varying network topologies and is robust to misspecifications of the total cell count estimate. To highlight SPRING's broad applicability we infer taxon-taxon associations from the American Gut Project data and genus-genus associations from a recent quantitative gut microbiome dataset. We believe that, as quantitative microbiome profiling data will become increasingly available, the semi-parametric estimators for correlation and partial correlation estimation introduced here provide an important tool for reliable statistical analysis of quantitative microbiome data.

Collapse

Tang ZZ, Chen G, Hong Q, Huang S, Smith HM, Shah RD, Scholz M, Ferguson JF. Multi-Omic Analysis of the Microbiome and Metabolome in Healthy Subjects Reveals Microbiome-Dependent Relationships Between Diet and Metabolites. Front Genet 2019;10:454. [PMID: 31164901 PMCID: PMC6534069 DOI: 10.3389/fgene.2019.00454] [Citation(s) in RCA: 67] [Impact Index Per Article: 13.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2019] [Accepted: 04/30/2019] [Indexed: 12/22/2022] Open

Zhang J, Lin W. Scalable estimation and regularization for the logistic normal multinomial model. Biometrics 2019;75:1098-1108. [PMID: 31009062 DOI: 10.1111/biom.13071] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2018] [Revised: 02/15/2019] [Accepted: 04/02/2019] [Indexed: 11/29/2022]

Wang T, Yang C, Zhao H. Prediction analysis for microbiome sequencing data. Biometrics 2019;75:875-884. [PMID: 30994187 DOI: 10.1111/biom.13061] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2017] [Revised: 03/08/2019] [Accepted: 03/13/2019] [Indexed: 01/22/2023]

Bates S, Tibshirani R. Log-ratio lasso: Scalable, sparse estimation for log-ratio models. Biometrics 2019;75:613-624. [PMID: 30387139 DOI: 10.1111/biom.12995] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2018] [Accepted: 10/16/2018] [Indexed: 11/28/2022]

Sohn MB, Li H. Compositional mediation analysis for microbiome studies. Ann Appl Stat 2019. [DOI: 10.1214/18-aoas1210] [Citation(s) in RCA: 34] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]

Wang H, Wang Z, Wang S. Sliced inverse regression method for multivariate compositional data modeling. Stat Pap (Berl) 2019. [DOI: 10.1007/s00362-019-01093-z] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]

Towards Quantitative Microbiome Community Profiling Using Internal Standards. Appl Environ Microbiol 2019;85:AEM.02634-18. [PMID: 30552195 DOI: 10.1128/aem.02634-18] [Citation(s) in RCA: 33] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2018] [Accepted: 12/10/2018] [Indexed: 12/16/2022] Open

Abstract

An inherent issue in high-throughput rRNA gene tag sequencing microbiome surveys is that they provide compositional data in relative abundances. This often leads to spurious correlations, making the interpretation of relationships to biogeochemical rates challenging. To overcome this issue, we quantitatively estimated the abundance of microorganisms by spiking in known amounts of internal DNA standards. Using a 3-year sample set of diverse microbial communities from the Western Antarctica Peninsula, we demonstrated that the internal standard method yielded community profiles and taxon cooccurrence patterns substantially different from those derived using relative abundances. We found that the method provided results consistent with the traditional CHEMTAX analysis of pigments and total bacterial counts by flow cytometry. Using the internal standard method, we also showed that chloroplast 16S rRNA gene data in microbial surveys can be used to estimate abundances of certain eukaryotic phototrophs such as cryptophytes and diatoms. In Phaeocystis, scatter in the 16S/18S rRNA gene ratio may be explained by physiological adaptation to environmental conditions. We conclude that the internal standard method, when applied to rRNA gene microbial community profiling, is quantitative and that its application will substantially improve our understanding of microbial ecosystems.IMPORTANCE High-throughput-sequencing-based marine microbiome profiling is rapidly expanding and changing how we study the oceans. Although powerful, the technique is not fully quantitative; it provides taxon counts only in relative abundances. In order to address this issue, we present a method to quantitatively estimate microbial abundances per unit volume of seawater filtered by spiking known amounts of internal DNA standards into each sample. We validated this method by comparing the calculated abundances to other independent estimates, including chemical markers (pigments) and total bacterial cell counts by flow cytometry. The internal standard approach allows us to quantitatively estimate and compare marine microbial community profiles, with important implications for linking environmental microbiomes to quantitative processes such as metabolic and biogeochemical rates.

Collapse

Sinha R, Ahsan H, Blaser M, Caporaso JG, Carmical JR, Chan AT, Fodor A, Gail MH, Harris CC, Helzlsouer K, Huttenhower C, Knight R, Kong HH, Lai GY, Hutchinson DLS, Le Marchand L, Li H, Orlich MJ, Shi J, Truelove A, Verma M, Vogtmann E, White O, Willett W, Zheng W, Mahabir S, Abnet C. Next steps in studying the human microbiome and health in prospective studies, Bethesda, MD, May 16-17, 2017. MICROBIOME 2018;6:210. [PMID: 30477563 PMCID: PMC6257978 DOI: 10.1186/s40168-018-0596-z] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/22/2018] [Accepted: 11/15/2018] [Indexed: 06/09/2023]

Affiliation(s)

Rashmi Sinha Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, 20892, USA.
Habibul Ahsan Comprehensive Cancer Center University of Chicago Medicine and Biological Sciences, Chicago, IL, 60615, USA
Martin Blaser Departments of Medicine and Microbiology, New York University Langone Medical Center, New York, NY, 10016, USA
J Gregory Caporaso Pathogen and Microbiome Institute and Department of Biological Sciences, Northern Arizona University, Flagstaff, AZ, 86011, USA
Joseph Russell Carmical Department of Molecular Virology and Microbiology, Baylor College of Medicine, Houston, TX, 77030, USA
Andrew T Chan Clinical and Translational Epidemiology Unit, Massachusetts General Hospital and Harvard Medical School, Boston, MA, 02114, USA Division of Gastroenterology, Massachusetts General Hospital, Boston, MA, 02114, USA Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital, and Harvard Medical School, Boston, MA, 02115, USA Department of Immunology and Infectious Diseases, Harvard T.H. Chan School of Public Health, Boston, MA, 02115, USA Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, MA, 02142, USA
Anthony Fodor Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, Charlotte, NC, 28223, USA
Mitchell H Gail Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, 20892, USA
Curtis C Harris Laboratory of Human Carcinogenesis, National Cancer Institute, National Institutes of Health, Bethesda, MD, 20892, USA
Kathy Helzlsouer Division of Cancer Control and Population Sciences, National Cancer Institute, National Cancer Institute, National Institutes of Health, Bethesda, MD, 20892, USA
Curtis Huttenhower Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, MA, 02142, USA Department of Biostatistics, Harvard T. H. Chan School of Public Health, Boston, MA, 02115, USA
Rob Knight Center for Microbiome Innovation, and Departments of Pediatrics and Computer Science and Engineering, University of California San Diego, San Diego, CA, 92093, USA
Heidi H Kong Dermatology Branch, National Cancer Institute, National Cancer Institute, National Institutes of Health, Bethesda, MD, 20892, USA
Gabriel Y Lai Environmental Epidemiology Branch, National Cancer Institute, Bethesda, MD, 20892, USA
Diane Leigh Smith Hutchinson Alkek Center for Metagenomics and Microbiome Research, Department of Molecular Virology and Microbiology, Baylor College of Medicine, Houston, TX, 77030, USA
Loic Le Marchand Cancer Epidemiology Program, University of Hawaii Cancer Center, Honolulu, HI, 96813, USA
Hongzhe Li Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, 19104, USA
Michael J Orlich School of Public Health and Department of Preventive Medicine, School of Medicine, Loma Linda University, Loma Linda, CA, 92350, USA
Jianxin Shi Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, 20892, USA
Ann Truelove Westat, Rockville, MD, 20850, USA
Mukesh Verma Division of Cancer Control and Population Sciences, National Cancer Institute, National Cancer Institute, National Institutes of Health, Bethesda, MD, 20892, USA
Emily Vogtmann Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, 20892, USA
Owen White Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD, 21201, USA
Walter Willett Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital, and Harvard Medical School, Boston, MA, 02115, USA Departments of Epidemiology and Nutrition, Harvard T.H. Chan School of Public Health, Boston, MA, 02115, USA
Wei Zheng Division of Epidemiology, Vanderbilt University Medical Center, Nashville, TN, 37232, USA
Somdat Mahabir Division of Cancer Control and Population Sciences, National Cancer Institute, National Cancer Institute, National Institutes of Health, Bethesda, MD, 20892, USA
Christian Abnet Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, 20892, USA

Collapse

Zacharias HU, Altenbuchinger M, Gronwald W. Statistical Analysis of NMR Metabolic Fingerprints: Established Methods and Recent Advances. Metabolites 2018;8:E47. [PMID: 30154338 PMCID: PMC6161311 DOI: 10.3390/metabo8030047] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2018] [Revised: 08/01/2018] [Accepted: 08/18/2018] [Indexed: 01/02/2023] Open

Fröhlich H, Balling R, Beerenwinkel N, Kohlbacher O, Kumar S, Lengauer T, Maathuis MH, Moreau Y, Murphy SA, Przytycka TM, Rebhan M, Röst H, Schuppert A, Schwab M, Spang R, Stekhoven D, Sun J, Weber A, Ziemek D, Zupan B. From hype to reality: data science enabling personalized medicine. BMC Med 2018;16:150. [PMID: 30145981 PMCID: PMC6109989 DOI: 10.1186/s12916-018-1122-7] [Citation(s) in RCA: 187] [Impact Index Per Article: 31.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/28/2018] [Accepted: 07/09/2018] [Indexed: 02/08/2023] Open

Affiliation(s)

Holger Fröhlich UCB Biosciences GmbH, Alfred-Nobel-Str. Str. 10, 40789 Monheim, Germany University of Bonn, Bonn-Aachen International Center for IT, Endenicher Allee 19c, 53115 Bonn, Germany
Rudi Balling University of Luxembourg, 6 avenue du Swing, 4367 Belvaux, Luxembourg
Niko Beerenwinkel Department of Biosciences and Engineering, ETH Zurich, Mattenstr. 26, 4058 Basel, Switzerland
Oliver Kohlbacher University of Tübingen, WSI/ZBIT, Sand 14, 72076 Tübingen, Germany Max Planck Institute for Developmental Biology, Max-Planck-Ring 5, 72076 Tübingen, Germany Quantitative Biology Center, University of Tübingen, Auf der Morgenstelle 8, 72076 Tübingen, Germany Institute for Translational Bioinformatics, University Medical Center Tübingen, Sand 14, 72076 Tübingen, Germany
Santosh Kumar Department of Computer Science, University of Memphis, 2222 Dunn Hall, Memphis, TN 38152 USA
Thomas Lengauer Max-Planck-Institute for Informatics, 66123 Saarbrücken, Germany
Marloes H. Maathuis ETH Zurich, Seminar für Statistik, Rämistrasse 101, 8092 Zurich, Switzerland
Yves Moreau University of Leuven, ESAT, Kasteelpark Arenberg 10, 3001 Leuven, Belgium
Susan A. Murphy Harvard University, Science Center 400 Suite, Oxford Street, Cambridge, MA 02138-2901 USA
Teresa M. Przytycka National Center of Biotechnology Information, National Institute of Health, 8600 Rockville Pike, Bethesda, MD 20894-6075 USA
Michael Rebhan Novartis Institutes for Biomedical Research, 4056 Basel, Switzerland
Hannes Röst Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, 160 College Street, Toronto, ON M5S 3E1 Canada
Andreas Schuppert RWTH Aachen, Joint Research Center for Computational Biomedicine, Pauwelsstrasse 19, 52074 Aachen, Germany
Matthias Schwab Dr. Margarete Fischer-Bosch Institute of Clinical Pharmacology, Aucherbachstrasse 112, 70376 Stuttgart, Germany University of Tübingen, Departments of Clinical Pharmacology and of Pharmacy and Biochemistry, Tübingen, Germany
Rainer Spang University of Regensburg, Institute of Functional Genomics, Am BioPark 9, 93053 Regensburg, Germany
Daniel Stekhoven ETH Zurich, NEXUS Personalized Health Technol., Otto-Stern-Weg 7, 8093 Zurich, Switzerland
Jimeng Sun Georgia Tech University, 801 Atlantic Drive, Atlanta, GA 30332-0280 USA
Andreas Weber Institute for Computer Science, University of Bonn, Endenicher Allee 19a, 53115 Bonn, Germany
Daniel Ziemek Pfizer, Worldwide Research and Development, Linkstraße 10, 10785 Berlin, Germany
Blaz Zupan Faculty of Computer and Information Science, University of Ljubljana, Večna pot 113, SI-1000 Ljubljana, Slovenia

Collapse

Lu J, Shi P, Li H. Generalized linear models with linear constraints for microbiome compositional data. Biometrics 2018;75:235-244. [PMID: 30039859 DOI: 10.1111/biom.12956] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2017] [Revised: 06/01/2018] [Accepted: 06/01/2018] [Indexed: 01/04/2023]

100

Gaines BR, Kim J, Zhou H. Algorithms for Fitting the Constrained Lasso. J Comput Graph Stat 2018;27:861-871. [PMID: 30618485 PMCID: PMC6320228 DOI: 10.1080/10618600.2018.1473777] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2016] [Revised: 03/01/2018] [Indexed: 01/22/2023]