1
|
Qiu SF, Zhang XL, Qu YQ, Han YQ. Multiple test procedures of disease prevalence based on stratified partially validated series in the presence of a gold standard. J Biopharm Stat 2024; 34:753-774. [PMID: 37853747 DOI: 10.1080/10543406.2023.2269262] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2022] [Accepted: 10/05/2023] [Indexed: 10/20/2023]
Abstract
This paper discusses the problem of disease prevalence in clinical studies, focusing on multiple comparisons based on stratified partially validated series in the presence of a gold standard. Five test statistics, including two Wald-type test statistics, the inverse hyperbolic tangent transformation test statistic, likelihood ratio test statistic, and score test statistic, are proposed to conduct multiple comparisons. To control the overall type I error rate, several adjustment procedures are developed, namely the Bonferroni, Single-step adjusted MaxT, Single-step adjusted MinP, Holm's Step-down, and Hochberg's step-up procedures, based on these test statistics. The performance of the proposed methods is evaluated through simulation studies in terms of the empirical type I error rate and empirical power. Simulation results show that the Single-step adjusted MaxT procedure and Single-step adjusted MinP procedure generally outperform the other three procedures, and these two test procedures based on all test statistics have satisfactory performance. Notably, the Single-step adjusted MinP procedure tends to exhibit higher empirical power than the Single-step adjusted MaxT procedure. Furthermore, the Step-down and Step-up procedures show greater power compared to the Bonferroni method. The study also observes that as the validated ratio increases, the empirical type I errors of all test procedures approach the nominal level while maintaining higher power. Two real examples are presented to illustrate the proposed methods.
Collapse
Affiliation(s)
- Shi-Fang Qiu
- Department of Statistics and Data Science, Chongqing University of Technology, Chongqing, China
| | - Xiao-Liang Zhang
- Department of Statistics and Data Science, Chongqing University of Technology, Chongqing, China
- School of Iflytfk Data Science, Chongqing City Vocational College, Chongqing, China
| | - Ying-Qiu Qu
- Department of Statistics and Data Science, Chongqing University of Technology, Chongqing, China
- School of Finance, Chongqing College of Finance and Economics, Chongqing, China
| | - Yuan-Quan Han
- Department of Statistics and Data Science, Chongqing University of Technology, Chongqing, China
| |
Collapse
|
2
|
Qiu SF, Wang LM, Tang ML, Poon WY. Confidence interval construction for proportion difference from partially validated series with two fallible classifiers. J Biopharm Stat 2022; 32:871-896. [PMID: 35536693 DOI: 10.1080/10543406.2022.2058527] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Abstract
This article investigates the confidence interval (CI) construction of proportion difference for two independent partially validated series under the double-sampling scheme in which both classifiers are fallible. Several CIs based on the variance estimates recovery method of combining confidence limits from asymptotic, bootstrap, and Bayesian methods for two independent binomial proportions are developed under two models. Simulation results show that all CIs except for the bootstrap percentile-t CI and Bayesian credible interval with uniform prior under the independence model and all CIs under the dependence model generally perform well and are recommended. Two examples are used to illustrate the methodologies.
Collapse
Affiliation(s)
- Shi-Fang Qiu
- Department of Statistics and Data Science, Chongqing University of Technology, Chongqing, China
| | - Li-Ming Wang
- Department of Statistics and Data Science, Chongqing University of Technology, Chongqing, China.,Chongqing Industry Polytechnic College, China
| | - Man-Lai Tang
- Department of Mathematics, Statistics and Insurance, Hang Seng University of Hong Kong, Hong Kong, China
| | - Wai-Yin Poon
- Department of Statistics, The Chinese University of Hong Kong, Hong Kong, China
| |
Collapse
|
3
|
Liu D, Li S, Yu Y, Moustaki I. Assessing Partial Association Between Ordinal Variables: Quantification, Visualization, and Hypothesis Testing. J Am Stat Assoc 2020. [DOI: 10.1080/01621459.2020.1796394] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
Affiliation(s)
- Dungang Liu
- Department of Operations, Business Analytics and Information Systems, University of Cincinnati Lindner College of Business, Cincinnati, OH
| | - Shaobo Li
- University of Kansas School of Business, Lawrence, KS
| | - Yan Yu
- Department of Operations, Business Analytics and Information Systems, University of Cincinnati Lindner College of Business, Cincinnati, OH
| | - Irini Moustaki
- Department of Statistics, London School of Economics, London, UK
| |
Collapse
|
4
|
Qiu SF, Fu QX. Homogeneity testing for binomial proportions under stratified double-sampling scheme with two fallible classifiers. Stat Methods Med Res 2020; 29:3547-3568. [PMID: 32640937 DOI: 10.1177/0962280220932601] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
This article investigates the homogeneity testing problem of binomial proportions for stratified partially validated data obtained by double-sampling method with two fallible classifiers. Several test procedures, including the weighted-least-squares test with/without log-transformation, logit-transformation and double log-transformation, and likelihood ratio test and score test, are developed to test the homogeneity under two models, distinguished by conditional independence assumption of two classifiers. Simulation results show that score test performs better than other tests in the sense that the empirical size is generally controlled around the nominal level, and hence be recommended to practical applications. Other tests also perform well when both binomial proportions and sample sizes are not small. Approximate sample sizes based on score test, likelihood ratio test and the weighted-least-squares test with double log-transformation are generally accurate in terms of the empirical power and type I error rate with the estimated sample sizes, and hence be recommended. An example from the malaria study is illustrated by the proposed methodologies.
Collapse
Affiliation(s)
- Shi-Fang Qiu
- Department of Statistics, Chongqing University of Technology, Chongqing, China
| | - Qi-Xiang Fu
- Department of Statistics, Chongqing University of Technology, Chongqing, China
| |
Collapse
|
5
|
Han Y, Lu ZH, Poon WY. Noninferiority testing for matched-pair ordinal data with misclassification. Stat Med 2019; 38:5332-5349. [PMID: 31637752 DOI: 10.1002/sim.8364] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2018] [Revised: 07/21/2019] [Accepted: 08/18/2019] [Indexed: 11/11/2022]
Abstract
New treatments that are noninferior or equivalent to-but not necessarily superior to-the reference treatment may still be beneficial to patients because they have fewer side effects, are more convenient, take less time, or cost less. The noninferiority test is widely used in medical research to provide guidance in such situation. In addition, categorical variables are frequently encountered in medical research, such as in studies involving patient-reported outcomes. In this paper, we develop a noninferiority testing procedure for correlated ordinal categorical variables based on a paired design with a latent normal distribution approach. Misclassification is frequently encountered in the collection of ordinal categorical data; therefore, we further extend the procedure to account for misclassification using information in the partially validated data. Simulation studies are conducted to investigate the accuracy of the estimates, the type I error rates, and the power of the proposed procedure. Finally, we analyze one substantive example to demonstrate the utility of the proposed approach.
Collapse
Affiliation(s)
- Yuanyuan Han
- Department of Biostatistics, St. Jude Children's Research Hospital, Memphis, Tennessee
| | - Zhao-Hua Lu
- Department of Biostatistics, St. Jude Children's Research Hospital, Memphis, Tennessee
| | - Wai-Yin Poon
- Department of Statistics, The Chinese University of Hong Kong, Hong Kong
| |
Collapse
|
6
|
Qiu SF, He J, Tao JR, Tang ML, Poon WY. Comparison of disease prevalence in two populations under double-sampling scheme with two fallible classifiers. J Appl Stat 2019; 47:1375-1401. [DOI: 10.1080/02664763.2019.1679727] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
Affiliation(s)
- Shi-Fang Qiu
- Department of Statistics, Chongqing University of Technology, Chongqing, People's Republic of China
| | - Jie He
- Department of Statistics, Chongqing University of Technology, Chongqing, People's Republic of China
| | - Ji-Ran Tao
- School of Mathematics and Statistics, Beijing Institute of Technology, Beijing, People's Republic of China
| | - Man-Lai Tang
- Department of Mathematics and Statistics, Hang Seng University of Hong Kong, Hong Kong, People's Republic of China
| | - Wai-Yin Poon
- Department of Statistics, The Chinese University of Hong Kong, Hong Kong, People's Republic of China
| |
Collapse
|
7
|
Qiu SF, Poon WY, Tang ML. Confidence intervals for an ordinal effect size measure based on partially validated series. Comput Stat Data Anal 2016. [DOI: 10.1016/j.csda.2016.05.006] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
8
|
Roy S, Rana S, Das K. Clustered data analysis under miscategorized ordinal outcomes and missing covariates. Stat Med 2016. [PMID: 26215983 DOI: 10.1002/sim.6599] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
The primary objective in this article is to look into the analysis of clustered ordinal model where complete information on one or more covariates cease to occur. In addition, we also focus on the analysis of miscategorized data that occur in many situations as outcomes are often classified into a category that does not truly reflect its actual state. A general model structure is assumed to accommodate the information that is obtained via surrogate variables. The theoretical motivation actually developed while encountering an orthodontic data to investigate the effects of age, sex and food habit on the extent of plaque deposit. The model we propose is quite flexible and is capable of tackling those additional noises like miscategorization and missingness, which occur in the data most frequently. A new two-step approach has been proposed to estimate the parameters of model framed. A rigorous simulation study has also been carried out to justify the validity of the model taken up for analysis. Copyright © 2015 John Wiley & Sons, Ltd.
Collapse
Affiliation(s)
- Surupa Roy
- Department of Statistics, St. Xavier's College, Kolkata, India
| | - Subrata Rana
- Department of Statistics, Ballygunge Science College, University of Calcutta, 35, Ballygunge Circular Road, Kolkata, 700019, India
| | - Kalyan Das
- Department of Statistics, Ballygunge Science College, University of Calcutta, 35, Ballygunge Circular Road, Kolkata, 700019, India
| |
Collapse
|
9
|
Qiu SF, Poon WY, Tang ML. Confidence intervals for proportion difference from two independent partially validated series. Stat Methods Med Res 2016; 25:2250-2273. [DOI: 10.1177/0962280213519718] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Partially validated series are common when a gold-standard test is too expensive to be applied to all subjects, and hence a fallible device is used accordingly to measure the presence of a characteristic of interest. In this article, confidence interval construction for proportion difference between two independent partially validated series is studied. Ten confidence intervals based on the method of variance estimates recovery (MOVER) are proposed, with each using the confidence limits for the two independent binomial proportions obtained by the asymptotic, Logit-transformation, Agresti–Coull and Bayesian methods. The performances of the proposed confidence intervals and three likelihood-based intervals available in the literature are compared with respect to the empirical coverage probability, confidence width and ratio of mesial non-coverage to non-coverage probability. Our empirical results show that (1) all confidence intervals exhibit good performance in large samples; (2) confidence intervals based on MOVER combining the confidence limits for binomial proportions based on Wilson, Agresti–Coull, Logit-transformation, Bayesian (with three priors) methods perform satisfactorily from small to large samples, and hence can be recommended for practical applications. Two real data sets are analysed to illustrate the proposed methods.
Collapse
Affiliation(s)
- Shi-Fang Qiu
- Department of Statistics, Chongqing University of Technology, Chongqing, China
| | - Wai-Yin Poon
- Department of Statistics, The Chinese University of Hong Kong, Hong Kong, China
| | - Man-Lai Tang
- Department of Mathematics, Hong Kong Baptist University, Kowloon Tong, Hong Kong, China
| |
Collapse
|
10
|
Comparison of disease prevalence in two populations in the presence of misclassification. Biom J 2012; 54:786-807. [DOI: 10.1002/bimj.201100216] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2011] [Revised: 05/22/2012] [Accepted: 07/19/2012] [Indexed: 11/07/2022]
|
11
|
Tang ML, Qiu SF, Poon WY. Confidence interval construction for disease prevalence based on partial validation series. Comput Stat Data Anal 2012. [DOI: 10.1016/j.csda.2011.02.010] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
12
|
Qiu SF, Poon WY, Tang ML. Sample size determination for disease prevalence studies with partially validated data. Stat Methods Med Res 2012; 25:37-63. [DOI: 10.1177/0962280212439576] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Summary Disease prevalence is an important topic in medical research, and its study is based on data that are obtained by classifying subjects according to whether a disease has been contracted. Classification can be conducted with high-cost gold standard tests or low-cost screening tests, but the latter are subject to the misclassification of subjects. As a compromise between the two, many research studies use partially validated datasets in which all data points are classified by fallible tests, and some of the data points are validated in the sense that they are also classified by the completely accurate gold-standard test. In this article, we investigate the determination of sample sizes for disease prevalence studies with partially validated data. We use two approaches. The first is to find sample sizes that can achieve a pre-specified power of a statistical test at a chosen significance level, and the second is to find sample sizes that can control the width of a confidence interval with a pre-specified confidence level. Empirical studies have been conducted to demonstrate the performance of various testing procedures with the proposed sample sizes. The applicability of the proposed methods are illustrated by a real-data example.
Collapse
Affiliation(s)
- Shi-Fang Qiu
- Department of Statistics, Chongqing University of Technology, China
| | - Wai-Yin Poon
- Department of Statistics, The Chinese University of Hong Kong, China
| | - Man-Lai Tang
- Department of Mathematics, Hong Kong Baptist University, China
| |
Collapse
|
13
|
Poon WY, Wang HB. Analysis of ordinal categorical data with misclassification. THE BRITISH JOURNAL OF MATHEMATICAL AND STATISTICAL PSYCHOLOGY 2010; 63:17-42. [PMID: 19364445 DOI: 10.1348/000711008x401314] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/27/2023]
Abstract
We develop a method for the analysis of multivariate ordinal categorical data with misclassification based on the latent normal variable approach. Misclassification arises if a subject has been classified into a category that does not truly reflect its actual state, and can occur with one or more variables. A basic framework is developed to enable the analysis of two types of data. The first corresponds to a single sample that is obtained from a fallible design that may lead to misclassified data. The other corresponds to data that is obtained by double sampling. Double sampling data consists of two parts: a sample that is obtained by classifying subjects using the fallible design only and a sample that is obtained by classifying subjects using both fallible and true designs, which is assumed to have no misclassification. A unified expectation-maximization approach is developed to find the maximum likelihood estimate of model parameters. Simulation studies and examples that are based on real data are used to demonstrate the applicability and practicability of the proposed methods.
Collapse
Affiliation(s)
- Wai-Yin Poon
- Department of Statistics, Chinese University of Hong Kong, Shatin, Hong Kong, People's Republic of China.
| | | |
Collapse
|