1
|
Li S, Hu T, Wang L, McMahan CS, Tebbs JM. Regression analysis of group-tested current status data. Biometrika 2024; 111:1047-1061. [PMID: 39691693 PMCID: PMC11648127 DOI: 10.1093/biomet/asae006] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2022] [Indexed: 12/19/2024] Open
Abstract
Group testing is an effective way to reduce the time and cost associated with conducting large-scale screening for infectious diseases. Benefits are realized through testing pools formed by combining specimens, such as blood or urine, from different individuals. In some studies, individuals are assessed only once and a time-to-event endpoint is recorded, for example, the time until infection. Combining group testing with this type of endpoint results in group-tested current status data (Petito & Jewell, 2016). To analyse these complex data, we propose methods that estimate a proportional hazard regression model based on test outcomes from measuring the pools. A sieve maximum likelihood estimation approach is developed that approximates the cumulative baseline hazard function with a piecewise constant function. To identify the sieve estimator, a computationally efficient expectation-maximization algorithm is derived by using data augmentation. Asymptotic properties of both the parametric and nonparametric components of the sieve estimator are then established by applying modern empirical process theory. Numerical results from simulation studies show that our proposed method performs nominally and has advantages over the corresponding estimation method based on individual testing results. We illustrate our work by analysing a chlamydia dataset collected by the State Hygienic Laboratory at the University of Iowa.
Collapse
Affiliation(s)
- Shuwei Li
- School of Economics and Statistics, Guangzhou University, Daxuecheng Road 230, Guangzhou, Guangdong 510006, China
| | - Tao Hu
- School of Mathematical Sciences, Capital Normal University, Beijing 100048, China
| | - Lianming Wang
- Department of Statistics, University of South Carolina, 209A LeConte College, Columbia, South Carolina 29208, USA
| | - Christopher S McMahan
- School of Mathematical and Statistical Sciences, Clemson University, Martin Hall, Clemson, South Carolina 29634, USA
| | - Joshua M Tebbs
- Department of Statistics, University of South Carolina, 217 LeConte College, Columbia, South Carolina 29208, USA
| |
Collapse
|
2
|
Delaigle A, Tan R. Group testing regression analysis with covariates and specimens subject to missingness. Stat Med 2023; 42:731-744. [PMID: 36646446 DOI: 10.1002/sim.9640] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2022] [Revised: 09/06/2022] [Accepted: 12/16/2022] [Indexed: 01/18/2023]
Abstract
We develop parametric estimators of a conditional prevalence in the group testing context. Group testing is applied when a binary outcome variable, often a disease indicator, is assessed by testing a specimen for the presence of the disease. Instead of testing all individual specimens separately, these are pooled in groups and the grouped specimens are tested for the disease, which permits to significantly reduce the number of tests to be performed. Various techniques have been developed in the literature for estimating a conditional prevalence from group testing data, but most of them are not valid when the data are subject to missingness. We consider this problem in the case where the specimen and the covariates are subject to nonmonotone missingness. We propose parametric estimators of the conditional prevalence, establish identifiability conditions for a logistic missing not at random model, and introduce an ignorable missing at random model. In theory, our estimators could be applied with multiple covariates missing, but in practice, they face numerical challenges when more than one covariate is missing for given individuals. We illustrate the method on simulated data and on a dataset from the Demographics and Health Survey.
Collapse
Affiliation(s)
- Aurore Delaigle
- School of Mathematics and Statistics, University of Melbourne, 3010, Victoria, Parkville, Australia
| | - Ruoxu Tan
- School of Mathematics and Statistics, University of Melbourne, 3010, Victoria, Parkville, Australia
- Department of Statistics and Actuarial Science, University of Hong Kong, Hong Kong SAR, China
| |
Collapse
|
3
|
Warasi MS. groupTesting: an R package for group testing estimation. COMMUN STAT-SIMUL C 2021. [DOI: 10.1080/03610918.2021.2009867] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
Affiliation(s)
- Md S. Warasi
- Department of Mathematics and Statistics, Radford University, Radford, VA, USA
| |
Collapse
|
4
|
Yu J, Huang Y, Shen ZJ. Optimizing and evaluating PCR-based pooled screening during COVID-19 pandemics. Sci Rep 2021; 11:21460. [PMID: 34728759 PMCID: PMC8564549 DOI: 10.1038/s41598-021-01065-0] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2020] [Accepted: 10/19/2021] [Indexed: 12/13/2022] Open
Abstract
Population screening played a substantial role in safely reopening the economy and avoiding new outbreaks of COVID-19. PCR-based pooled screening makes it possible to test the population with limited resources by pooling multiple individual samples. Our study compared different population-wide screening methods as transmission-mitigating interventions, including pooled PCR, individual PCR, and antigen screening. Incorporating testing-isolation process and individual-level viral load trajectories into an epidemic model, we further studied the impacts of testing-isolation on test sensitivities. Results show that the testing-isolation process could maintain a stable test sensitivity during the outbreak by removing most infected individuals, especially during the epidemic decline. Moreover, we compared the efficiency, accuracy, and cost of different screening methods during the pandemic. Our results show that PCR-based pooled screening is cost-effective in reversing the pandemic at low prevalence. When the prevalence is high, PCR-based pooled screening may not stop the outbreak. In contrast, antigen screening with sufficient frequency could reverse the epidemic, despite the high cost and the large numbers of false positives in the screening process.
Collapse
Affiliation(s)
- Jiali Yu
- Tsinghua-Berkeley Shenzhen Institute (TBSI), Tsinghua University, Shenzhen, China
| | - Yiduo Huang
- Department of Civil and Environmental Engineering, University of California Berkeley, Berkeley, CA, USA
| | - Zuo-Jun Shen
- College of Engineering, University of California Berkeley, Berkeley, CA, USA.
- Faculty of Engineering and Faculty of Business and Economics, University of Hong Kong, Hong Kong, China.
| |
Collapse
|
5
|
Zhang W, Liu A, Li Q, Albert PS. Nonparametric estimation of distributions and diagnostic accuracy based on group-tested results with differential misclassification. Biometrics 2020; 76:1147-1156. [PMID: 32083733 PMCID: PMC8581970 DOI: 10.1111/biom.13236] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2019] [Revised: 12/06/2019] [Accepted: 01/27/2020] [Indexed: 11/30/2022]
Abstract
This article concerns the problem of estimating a continuous distribution in a diseased or nondiseased population when only group-based test results on the disease status are available. The problem is challenging in that individual disease statuses are not observed and testing results are often subject to misclassification, with further complication that the misclassification may be differential as the group size and the number of the diseased individuals in the group vary. We propose a method to construct nonparametric estimation of the distribution and obtain its asymptotic properties. The performance of the distribution estimator is evaluated under various design considerations concerning group sizes and classification errors. The method is exemplified with data from the National Health and Nutrition Examination Survey study to estimate the distribution and diagnostic accuracy of C-reactive protein in blood samples in predicting chlamydia incidence.
Collapse
Affiliation(s)
- Wei Zhang
- LSC, NCMIS, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, China
| | - Aiyi Liu
- Biostatistics and Bioinformatics Branch, Eunice Kennedy Shriver National Institute of Child Health and Human Development, National Institutes of Health, Bethesda, Maryland
| | - Qizhai Li
- LSC, NCMIS, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, China
| | - Paul S. Albert
- Biostatistics Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Bethesda, Maryland
| |
Collapse
|
6
|
Wang D, Mou X, Li X, Huang X. Local polynomial regression for pooled response data. J Nonparametr Stat 2020; 32:814-837. [PMID: 33762800 PMCID: PMC7986571 DOI: 10.1080/10485252.2020.1834104] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2020] [Accepted: 10/03/2020] [Indexed: 10/23/2022]
Abstract
We propose local polynomial estimators for the conditional mean of a continuous response when only pooled response data are collected under different pooling designs. Asymptotic properties of these estimators are investigated and compared. Extensive simulation studies are carried out to compare finite sample performance of the proposed estimators under various model settings and pooling strategies. We apply the proposed local polynomial regression methods to two real-life applications to illustrate practical implementation and performance of the estimators for the mean function.
Collapse
Affiliation(s)
- Dewei Wang
- Department of Statistics, University of South Carolina, Columbia, South Carolina, U.S.A
| | - Xichen Mou
- Division of Epidemiology, Biostatistics, and Environmental Health, University of Memphis, Memphis, Tennessee, U.S.A
| | - Xiang Li
- JPMorgan Chase, Jersey City, New Jersey 07310, U.S.A
| | - Xianzheng Huang
- Department of Statistics, University of South Carolina, Columbia, South Carolina, U.S.A
| |
Collapse
|
7
|
Joyner CN, McMahan CS, Tebbs JM, Bilder CR. From mixed effects modeling to spike and slab variable selection: A Bayesian regression model for group testing data. Biometrics 2020; 76:913-923. [PMID: 31729015 PMCID: PMC7944974 DOI: 10.1111/biom.13176] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2019] [Revised: 10/22/2019] [Accepted: 10/29/2019] [Indexed: 12/20/2022]
Abstract
Due to reductions in both time and cost, group testing is a popular alternative to individual-level testing for disease screening. These reductions are obtained by testing pooled biospecimens (eg, blood, urine, swabs, etc.) for the presence of an infectious agent. However, these reductions come at the expense of data complexity, making the task of conducting disease surveillance more tenuous when compared to using individual-level data. This is because an individual's disease status may be obscured by a group testing protocol and the effect of imperfect testing. Furthermore, unlike individual-level testing, a given participant could be involved in multiple testing outcomes and/or may never be tested individually. To circumvent these complexities and to incorporate all available information, we propose a Bayesian generalized linear mixed model that accommodates data arising from any group testing protocol, estimates unknown assay accuracy probabilities and accounts for potential heterogeneity in the covariate effects across population subgroups (eg, clinic sites, etc.); this latter feature is of key interest to practitioners tasked with conducting disease surveillance. To achieve model selection, our proposal uses spike and slab priors for both fixed and random effects. The methodology is illustrated through numerical studies and is applied to chlamydia surveillance data collected in Iowa.
Collapse
Affiliation(s)
- Chase N. Joyner
- School of Mathematical and Statistical Sciences, Clemson University, Clemson, SC 29634, U.S.A
| | - Christopher S. McMahan
- School of Mathematical and Statistical Sciences, Clemson University, Clemson, SC 29634, U.S.A
| | - Joshua M. Tebbs
- Department of Statistics, University of South Carolina, Columbia, SC 29208, U.S.A
| | | |
Collapse
|
8
|
|
9
|
Hepworth G, Katholi CR. Mid‐
P
confidence intervals for group testing based on the total number of positive groups. Biom J 2019; 61:688-697. [DOI: 10.1002/bimj.201700190] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2017] [Revised: 06/24/2018] [Accepted: 08/29/2018] [Indexed: 11/06/2022]
Affiliation(s)
- Graham Hepworth
- School of Mathematics and StatisticsThe University of MelbourneVictoria Australia
| | - Charles R. Katholi
- Department of BiostatisticsUniversity of Alabama at BirminghamAlabama USA
| |
Collapse
|
10
|
Determination of Varying Group Sizes for Pooling Procedure. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2019; 2019:4381084. [PMID: 31065292 PMCID: PMC6466917 DOI: 10.1155/2019/4381084] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/15/2018] [Revised: 01/17/2019] [Accepted: 02/05/2019] [Indexed: 11/17/2022]
Abstract
Pooling is an attractive strategy in screening infected specimens, especially for rare diseases. An essential step of performing the pooled test is to determine the group size. Sometimes, equal group size is not appropriate due to population heterogeneity. In this case, varying group sizes are preferred and could be determined while individual information is available. In this study, we propose a sequential procedure to determine varying group sizes through fully utilizing available information. This procedure is data driven. Simulations show that it has good performance in estimating parameters.
Collapse
|
11
|
Affiliation(s)
- Gregory Haber
- Department of Mathematics and Statistics, University of Maryland, Baltimore County, Baltimore, Maryland, USA
| | - Yaakov Malinovsky
- Department of Mathematics and Statistics, University of Maryland, Baltimore County, Baltimore, Maryland, USA
| | - Paul S. Albert
- Biostatistics Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, Maryland, USA
| |
Collapse
|
12
|
Warasi MS, McMahan CS, Tebbs JM, Bilder CR. Group testing regression models with dilution submodels. Stat Med 2017; 36:4860-4872. [PMID: 28856774 DOI: 10.1002/sim.7455] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2017] [Revised: 05/27/2017] [Accepted: 08/11/2017] [Indexed: 12/21/2022]
Abstract
Group testing, where specimens are tested initially in pools, is widely used to screen individuals for sexually transmitted diseases. However, a common problem encountered in practice is that group testing can increase the number of false negative test results. This occurs primarily when positive individual specimens within a pool are diluted by negative ones, resulting in positive pools testing negatively. If the goal is to estimate a population-level regression model relating individual disease status to observed covariates, severe bias can result if an adjustment for dilution is not made. Recognizing this as a critical issue, recent binary regression approaches in group testing have utilized continuous biomarker information to acknowledge the effect of dilution. In this paper, we have the same overall goal but take a different approach. We augment existing group testing regression models (that assume no dilution) with a parametric dilution submodel for pool-level sensitivity and estimate all parameters using maximum likelihood. An advantage of our approach is that it does not rely on external biomarker test data, which may not be available in surveillance studies. Furthermore, unlike previous approaches, our framework allows one to formally test whether dilution is present based on the observed group testing data. We use simulation to illustrate the performance of our estimation and inference methods, and we apply these methods to 2 infectious disease data sets.
Collapse
Affiliation(s)
- Md S Warasi
- Department of Mathematics and Statistics, Radford University, Radford, VA 24142, USA
| | | | - Joshua M Tebbs
- Department of Statistics, University of South Carolina, Columbia, SC 29208, USA
| | - Christopher R Bilder
- Department of Statistics, University of Nebraska-Lincoln, Lincoln, NE 68583, NE, USA
| |
Collapse
|
13
|
McMahan CS, McLain AC, Gallagher CM, Schisterman EF. Estimating covariate-adjusted measures of diagnostic accuracy based on pooled biomarker assessments. Biom J 2016; 58:944-61. [PMID: 26927583 DOI: 10.1002/bimj.201500195] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2015] [Revised: 12/31/2015] [Accepted: 01/06/2016] [Indexed: 11/10/2022]
Abstract
There is a need for epidemiological and medical researchers to identify new biomarkers (biological markers) that are useful in determining exposure levels and/or for the purposes of disease detection. Often this process is stunted by high testing costs associated with evaluating new biomarkers. Traditionally, biomarker assessments are individually tested within a target population. Pooling has been proposed to help alleviate the testing costs, where pools are formed by combining several individual specimens. Methods for using pooled biomarker assessments to estimate discriminatory ability have been developed. However, all these procedures have failed to acknowledge confounding factors. In this paper, we propose a regression methodology based on pooled biomarker measurements that allow the assessment of the discriminatory ability of a biomarker of interest. In particular, we develop covariate-adjusted estimators of the receiver-operating characteristic curve, the area under the curve, and Youden's index. We establish the asymptotic properties of these estimators and develop inferential techniques that allow one to assess whether a biomarker is a good discriminator between cases and controls, while controlling for confounders. The finite sample performance of the proposed methodology is illustrated through simulation. We apply our methods to analyze myocardial infarction (MI) data, with the goal of determining whether the pro-inflammatory cytokine interleukin-6 is a good predictor of MI after controlling for the subjects' cholesterol levels.
Collapse
Affiliation(s)
| | - Alexander C McLain
- Department of Epidemiology and Biostatistics, University of South Carolina, Columbia, SC 29208, USA
| | - Colin M Gallagher
- Department of Mathematical Sciences, Clemson University, Clemson, SC 29634, USA
| | - Enrique F Schisterman
- Epidemiology Branch, Division of Intramural Population Health Research, Eunice Kennedy Shriver National Institute of Child Health and Human Development, Bethesda, MD 20892, USA
| |
Collapse
|
14
|
Delaigle A, Zhou WX. Nonparametric and Parametric Estimators of Prevalence From Group Testing Data With Aggregated Covariates. J Am Stat Assoc 2016. [DOI: 10.1080/01621459.2015.1054491] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
|
15
|
Wang D, McMahan CS, Gallagher CM. A general regression framework for group testing data, which incorporates pool dilution effects. Stat Med 2015; 34:3606-21. [PMID: 26173957 DOI: 10.1002/sim.6578] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2014] [Revised: 04/21/2015] [Accepted: 06/15/2015] [Indexed: 01/01/2023]
Abstract
Group testing, through the use of pooling, has been widely implemented as a more efficient means to screen individuals for infectious diseases. Typically, in these settings, practitioners are tasked with the complimentary goals of both case identification and estimation. For these purposes, many group testing strategies have been proposed, which address issues such as preserving anonymity in estimation studies, quality control, and classification. In general, these strategies require that a significant number of the individuals be retested, either in pools or individually. In order to provide practitioners with a general methodology that can be used to accurately and precisely analyze data of this form, herein, we propose a binary regression framework that can incorporate data arising from any group testing strategy. Further, we relax previously made assumptions regarding testing error rates by relating the diagnostic testing results to the latent biological marker levels of the individuals being tested. We investigate the finite sample performance of our proposed methodology through simulation and by applying our techniques to hepatitis B data collected as part of a study involving Irish prisoners.
Collapse
Affiliation(s)
- Dewei Wang
- Department of Statistics, University of South Carolina, Columbia, SC 29028, U.S.A
| | | | - Colin M Gallagher
- Department of Mathematical Sciences, Clemson University, Clemson, SC 29634, U.S.A
| |
Collapse
|
16
|
Noguchi A, Akiyama H, Nakamura K, Sakata K, Minegishi Y, Mano J, Takabatake R, Futo S, Kitta K, Teshima R, Kondo K, Nishimaki-Mogami T. A novel trait-specific real-time PCR method enables quantification of genetically modified (GM) maize content in ground grain samples containing stacked GM maize. Eur Food Res Technol 2014. [DOI: 10.1007/s00217-014-2340-7] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
17
|
Delaigle A, Hall P, Wishart JR. New approaches to nonparametric and semiparametric regression for univariate and multivariate group testing data. Biometrika 2014. [DOI: 10.1093/biomet/asu025] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
|
18
|
Wang D, McMahan CS, Gallagher CM, Kulasekera KB. Semiparametric group testing regression models. Biometrika 2014. [DOI: 10.1093/biomet/asu007] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
|
19
|
Zhang B, Bilder CR, Tebbs JM. Regression analysis for multiple-disease group testing data. Stat Med 2013; 32:4954-66. [PMID: 23703944 PMCID: PMC4301740 DOI: 10.1002/sim.5858] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2012] [Accepted: 04/29/2013] [Indexed: 11/06/2022]
Abstract
Group testing, where individual specimens are composited into groups to test for the presence of a disease (or other binary characteristic), is a procedure commonly used to reduce the costs of screening a large number of individuals. Group testing data are unique in that only group responses may be available, but inferences are needed at the individual level. A further methodological challenge arises when individuals are tested in groups for multiple diseases simultaneously, because unobserved individual disease statuses are likely correlated. In this paper, we propose new regression techniques for multiple-disease group testing data. We develop an expectation-solution based algorithm that provides consistent parameter estimates and natural large-sample inference procedures. We apply our proposed methodology to chlamydia and gonorrhea screening data collected in Nebraska as part of the Infertility Prevention Project and to prenatal infectious disease screening data from Kenya.
Collapse
Affiliation(s)
- Boan Zhang
- Department of Statistics, University of Nebraska-Lincoln, Lincoln, NE 68583, USA
| | | | - Joshua M. Tebbs
- Department of Statistics, University of South Carolina, Columbia, SC 29208, USA
| |
Collapse
|
20
|
Wang D, Zhou H, Kulasekera KB. A semi-local likelihood regression estimator of the proportion based on group testing data. J Nonparametr Stat 2013. [DOI: 10.1080/10485252.2012.750726] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
21
|
Hund L, Pagano M. Estimating HIV prevalence from surveys with low individual consent rates: annealing individual and pooled samples. Emerg Themes Epidemiol 2013; 10:2. [PMID: 23446064 PMCID: PMC3649931 DOI: 10.1186/1742-7622-10-2] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2012] [Accepted: 02/20/2013] [Indexed: 11/30/2022] Open
Abstract
Many HIV prevalence surveys are plagued by the problem that a sizeable number of surveyed individuals do not consent to contribute blood samples for testing. One can ignore this problem, as is often done, but the resultant bias can be of sufficient magnitude to invalidate the results of the survey, especially if the number of non-responders is high and the reason for refusing to participate is related to the individual’s HIV status. One reason for refusing to participate may be for reasons of privacy. For those individuals, we suggest offering the option of being tested in a pool. This form of testing is less certain than individual testing, but, if it convinces more people to submit to testing, it should reduce the potential for bias and give a cleaner answer to the question of prevalence. This paper explores the logistics of implementing a combined individual and pooled testing approach and evaluates the analytical advantages to such a combined testing strategy. We quantify improvements in a prevalence estimator based on this combined testing strategy, relative to an individual testing only approach and a pooled testing only approach. Minimizing non-response is key for reducing bias, and, if pooled testing assuages privacy concerns, offering a pooled testing strategy has the potential to substantially improve HIV prevalence estimates.
Collapse
Affiliation(s)
- Lauren Hund
- Department of Family and Community Medicine, University of New Mexico, 2400 Tucker NE, Albuquerque, NM 87106, USA.
| | | |
Collapse
|
22
|
|
23
|
Abstract
In situations where individuals are screened for an infectious disease or other binary characteristic and where resources for testing are limited, group testing can offer substantial benefits. Group testing, where subjects are tested in groups (pools) initially, has been successfully applied to problems in blood bank screening, public health, drug discovery, genetics, and many other areas. In these applications, often the goal is to identify each individual as positive or negative using initial group tests and subsequent retests of individuals within positive groups. Many group testing identification procedures have been proposed; however, the vast majority of them fail to incorporate heterogeneity among the individuals being screened. In this paper, we present a new approach to identify positive individuals when covariate information is available on each. This covariate information is used to structure how retesting is implemented within positive groups; therefore, we call this new approach "informative retesting." We derive closed-form expressions and implementation algorithms for the probability mass functions for the number of tests needed to decode positive groups. These informative retesting procedures are illustrated through a number of examples and are applied to chlamydia and gonorrhea testing in Nebraska for the Infertility Prevention Project. Overall, our work shows compelling evidence that informative retesting can dramatically decrease the number of tests while providing accuracy similar to established non-informative retesting procedures.
Collapse
Affiliation(s)
- Christopher R Bilder
- Associate Professor of Statistics ( , Website: www.chrisbilder.com ) at the University of Nebraska-Lincoln, Lincoln, NE 68583
| | | | | |
Collapse
|
24
|
Mano J, Yanaka Y, Ikezu Y, Onishi M, Futo S, Minegishi Y, Ninomiya K, Yotsuyanagi Y, Spiegelhalter F, Akiyama H, Teshima R, Hino A, Naito S, Koiwa T, Takabatake R, Furui S, Kitta K. Practicable group testing method to evaluate weight/weight GMO content in maize grains. JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY 2011; 59:6856-6863. [PMID: 21604714 DOI: 10.1021/jf200212v] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/30/2023]
Abstract
Because of the increasing use of maize hybrids with genetically modified (GM) stacked events, the established and commonly used bulk sample methods for PCR quantification of GM maize in non-GM maize are prone to overestimate the GM organism (GMO) content, compared to the actual weight/weight percentage of GM maize in the grain sample. As an alternative method, we designed and assessed a group testing strategy in which the GMO content is statistically evaluated based on qualitative analyses of multiple small pools, consisting of 20 maize kernels each. This approach enables the GMO content evaluation on a weight/weight basis, irrespective of the presence of stacked-event kernels. To enhance the method's user-friendliness in routine application, we devised an easy-to-use PCR-based qualitative analytical method comprising a sample preparation step in which 20 maize kernels are ground in a lysis buffer and a subsequent PCR assay in which the lysate is directly used as a DNA template. This method was validated in a multilaboratory collaborative trial.
Collapse
Affiliation(s)
- Junichi Mano
- National Food Research Institute, National Agriculture and Food Research Organization, Kannondai, Tsukuba, Ibaraki 305-8642, Japan
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
25
|
|
26
|
Abstract
Group testing, where subjects are tested in pools rather than individually, has a long history of successful application in infectious disease screening. In this article, we develop group testing regression models to include covariate effects that are best regarded as random. We present approaches to fit mixed effects models using maximum likelihood, investigate likelihood ratio and score tests for variance components, and evaluate small sample performance using simulation. We illustrate our methods using chlamydia and gonorrhea data collected by the state of Nebraska as part of the Infertility Prevention Project.
Collapse
Affiliation(s)
- Peng Chen
- Department of Statistics, University of South Carolina, Columbia, South Carolina 29208, USA
| | | | | |
Collapse
|
27
|
Chen P, Tebbs JM, Bilder CR. Global goodness-of-fit tests for group testing regression models. Stat Med 2009; 28:2912-28. [PMID: 19610130 DOI: 10.1002/sim.3678] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
In a variety of biomedical applications, particularly those involving screening for infectious diseases, testing individuals (e.g. blood/urine samples, etc.) in pools has become a standard method of data collection. This experimental design, known as group testing (or pooled testing), can provide a large reduction in testing costs and can offer nearly the same precision as individual testing. To account for covariate information on individual subjects, regression models for group testing data have been proposed recently. However, there are currently no tools available to check the adequacy of these models. In this paper, we present various global goodness-of-fit tests for regression models with group testing data. We use simulation to examine the small-sample size and power properties of the tests for different pool composition strategies. We illustrate our methods using two infectious disease data sets, one from an HIV study in Kenya and one from the Infertility Prevention Project.
Collapse
Affiliation(s)
- Peng Chen
- Takeda Global Research and Development Center, Inc., 675 North Field Drive, Lake Forest, IL 60045, USA
| | | | | |
Collapse
|