1
|
Edelmann D, Welchowski T, Benner A. A consistent version of distance covariance for right-censored survival data and its application in hypothesis testing. Biometrics 2021; 78:867-879. [PMID: 33847373 DOI: 10.1111/biom.13470] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2020] [Revised: 12/24/2020] [Accepted: 03/31/2021] [Indexed: 11/25/2022]
Abstract
Distance covariance is a powerful new dependence measure that was recently introduced by Székely et al. and Székely and Rizzo. In this work, the concept of distance covariance is extended to measuring dependence between a covariate vector and a right-censored survival endpoint by establishing an estimator based on an inverse-probability-of-censoring weighted U-statistic. The consistency of the novel estimator is derived. In a large simulation study, it is shown that induced distance covariance permutation tests show a good performance in detecting various complex associations. Applying the distance covariance permutation tests on a gene expression dataset from breast cancer patients outlines its potential for biostatistical practice.
Collapse
Affiliation(s)
- Dominic Edelmann
- Division of Biostatistics, German Cancer Research Center, Heidelberg, Germany
| | - Thomas Welchowski
- Institute of Medical Biometry, Informatics and Epidemiology, University Hospital of Bonn, Bonn, Germany
| | - Axel Benner
- Division of Biostatistics, German Cancer Research Center, Heidelberg, Germany
| |
Collapse
|
2
|
Abstract
AbstractIn this paper we propose a sufficient dimension reduction algorithm based on the difference of inverse medians. The classic methodology based on inverse means in each slice was recently extended, by using inverse medians, to robustify existing methodology at the presence of outliers. Our effort is focused on using differences between inverse medians in pairs of slices. We demonstrate that our method outperforms existing methods at the presence of outliers. We also propose a second algorithm which is not affected by the ordering of slices when the response variable is categorical with no underlying ordering of its values.
Collapse
|
3
|
Huo L, Wen XM, Yu Z. A model-free conditional screening approach via sufficient dimension reduction. J Nonparametr Stat 2020. [DOI: 10.1080/10485252.2020.1834554] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
Affiliation(s)
- Lei Huo
- Department of Mathematics and Statistics, Missouri University of Science and Technology, Rolla, MO, USA
| | - Xuerong Meggie Wen
- Department of Mathematics and Statistics, Missouri University of Science and Technology, Rolla, MO, USA
| | - Zhou Yu
- School of Statistics, East China Normal University, Shanghai, People's Republic of China
| |
Collapse
|
4
|
|
5
|
Alothman A, Dong Y, Artemiou A. On dual model-free variable selection with two groups of variables. J MULTIVARIATE ANAL 2018. [DOI: 10.1016/j.jmva.2018.06.003] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
6
|
Vepakomma P, Tonde C, Elgammal A. Supervised dimensionality reduction via distance correlation maximization. Electron J Stat 2018. [DOI: 10.1214/18-ejs1403] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
7
|
|
8
|
Persson E, Häggström J, Waernbaum I, de Luna X. Data-driven algorithms for dimension reduction in causal inference. Comput Stat Data Anal 2017. [DOI: 10.1016/j.csda.2016.08.012] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
|
9
|
Yu Z, Dong Y, Shao J. On marginal sliced inverse regression for ultrahigh dimensional model-free feature selection. Ann Stat 2016. [DOI: 10.1214/15-aos1424] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
10
|
Dong Y, Yang C, Yu Z. On permutation tests for predictor contribution in sufficient dimension reduction. J MULTIVARIATE ANAL 2016. [DOI: 10.1016/j.jmva.2016.02.019] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
11
|
|
12
|
Yoo JK. Tutorial: Methodologies for sufficient dimension reduction in regression. COMMUNICATIONS FOR STATISTICAL APPLICATIONS AND METHODS 2016. [DOI: 10.5351/csam.2016.23.2.105] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
|
13
|
Hsueh HM, Tsai CA. Gene set analysis using sufficient dimension reduction. BMC Bioinformatics 2016; 17:74. [PMID: 26852017 PMCID: PMC4744442 DOI: 10.1186/s12859-016-0928-6] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2015] [Accepted: 02/01/2016] [Indexed: 01/31/2023] Open
Abstract
BACKGROUND Gene set analysis (GSA) aims to evaluate the association between the expression of biological pathways, or a priori defined gene sets, and a particular phenotype. Numerous GSA methods have been proposed to assess the enrichment of sets of genes. However, most methods are developed with respect to a specific alternative scenario, such as a differential mean pattern or a differential coexpression. Moreover, a very limited number of methods can handle either binary, categorical, or continuous phenotypes. In this paper, we develop two novel GSA tests, called SDRs, based on the sufficient dimension reduction technique, which aims to capture sufficient information about the relationship between genes and the phenotype. The advantages of our proposed methods are that they allow for categorical and continuous phenotypes, and they are also able to identify a variety of enriched gene sets. RESULTS Through simulation studies, we compared the type I error and power of SDRs with existing GSA methods for binary, triple, and continuous phenotypes. We found that SDR methods adequately control the type I error rate at the pre-specified nominal level, and they have a satisfactory power to detect gene sets with differential coexpression and to test non-linear associations between gene sets and a continuous phenotype. In addition, the SDR methods were compared with seven widely-used GSA methods using two real microarray datasets for illustration. CONCLUSIONS We concluded that the SDR methods outperform the others because of their flexibility with regard to handling different kinds of phenotypes and their power to detect a wide range of alternative scenarios. Our real data analysis highlights the differences between GSA methods for detecting enriched gene sets.
Collapse
Affiliation(s)
- Huey-Miin Hsueh
- Department of Statistics, National Chengchi UniversityZhinan Road, Taipei116, Taiwan, Taipei, 116, Taiwan.
| | - Chen-An Tsai
- Department of Agronomy, National Taiwan University, No. 1, Section 4, Roosevelt Road, Taipei, 106, Taiwan.
| |
Collapse
|
14
|
|
15
|
Yoo JK, Jeong S. A Note on Bootstrapping in Sufficient Dimension Reduction. COMMUNICATIONS FOR STATISTICAL APPLICATIONS AND METHODS 2015. [DOI: 10.5351/csam.2015.22.3.285] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Affiliation(s)
- Jae Keun Yoo
- Department of Statistics, Ewha Womans University, Korea
| | | |
Collapse
|
16
|
Lue HH. An Inverse-regression Method of Dependent Variable Transformation for Dimension Reduction with Non-linear Confounding. Scand Stat Theory Appl 2014. [DOI: 10.1111/sjos.12135] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
17
|
Belcher C, Jill Heatley J, Petzinger C, Hoppes S, Larner CD, Sheather SJ, Macfarlane RD. Evaluation of Plasma Cholesterol, Triglyceride, and Lipid Density Profiles in Captive Monk Parakeets (Myiopsitta monachus). J Exot Pet Med 2014. [DOI: 10.1053/j.jepm.2013.11.017] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
|
18
|
Lindsey CD, Sheather SJ, McKean JW. Using sliced mean variance–covariance inverse regression for classification and dimension reduction. Comput Stat 2013. [DOI: 10.1007/s00180-013-0460-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
19
|
Lee K, Oh S, Yoo JK. Method-Free Permutation Predictor Hypothesis Tests in Sufficient Dimension Reduction. COMMUNICATIONS FOR STATISTICAL APPLICATIONS AND METHODS 2013. [DOI: 10.5351/csam.2013.20.4.291] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
|
20
|
|
21
|
Yu Z, Zhu L, Wen XM. On model-free conditional coordinate tests for regressions. J MULTIVARIATE ANAL 2012. [DOI: 10.1016/j.jmva.2012.02.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
|
22
|
Pfeiffer RM, Forzani L, Bura E. Sufficient dimension reduction for longitudinally measured predictors. Stat Med 2011; 31:2414-27. [PMID: 22161635 DOI: 10.1002/sim.4437] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2011] [Accepted: 09/19/2011] [Indexed: 11/12/2022]
Abstract
We propose a method to combine several predictors (markers) that are measured repeatedly over time into a composite marker score without assuming a model and only requiring a mild condition on the predictor distribution. Assuming that the first and second moments of the predictors can be decomposed into a time and a marker component via a Kronecker product structure that accommodates the longitudinal nature of the predictors, we develop first-moment sufficient dimension reduction techniques to replace the original markers with linear transformations that contain sufficient information for the regression of the predictors on the outcome. These linear combinations can then be combined into a score that has better predictive performance than a score built under a general model that ignores the longitudinal structure of the data. Our methods can be applied to either continuous or categorical outcome measures. In simulations, we focus on binary outcomes and show that our method outperforms existing alternatives by using the AUC, the area under the receiver-operator characteristics (ROC) curve, as a summary measure of the discriminatory ability of a single continuous diagnostic marker for binary disease outcomes.
Collapse
Affiliation(s)
- Ruth M Pfeiffer
- Biostatistics Branch, National Cancer Institute, Bethesda, MD 20892-7244, USA.
| | | | | |
Collapse
|
23
|
Yoo JK. Unified predictor hypothesis tests in sufficient dimension reduction: A bootstrap approach. J Korean Stat Soc 2011. [DOI: 10.1016/j.jkss.2010.09.006] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|
24
|
Bura E, Yang J. Dimension estimation in sufficient dimension reduction: A unifying approach. J MULTIVARIATE ANAL 2011. [DOI: 10.1016/j.jmva.2010.08.007] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
|
25
|
Sliced inverse moment regression using weighted chi-squared tests for dimension reduction. J Stat Plan Inference 2010. [DOI: 10.1016/j.jspi.2010.04.015] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
|
26
|
|
27
|
Guan Y, Wang H. Sufficient dimension reduction for spatial point processes directed by Gaussian random fields. J R Stat Soc Series B Stat Methodol 2010. [DOI: 10.1111/j.1467-9868.2010.00738.x] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
28
|
|
29
|
|
30
|
Wen XM, Cook RD. New approaches to model-free dimension reduction for bivariate regression. J Stat Plan Inference 2009. [DOI: 10.1016/j.jspi.2008.01.017] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]
|
31
|
Lue HH. Sliced Average Variance Estimation for Censored Data. COMMUN STAT-THEOR M 2008. [DOI: 10.1080/03610920802101555] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|