1
|
Delaigle A, Tan R. Group testing regression analysis with covariates and specimens subject to missingness. Stat Med 2023; 42:731-744. [PMID: 36646446 DOI: 10.1002/sim.9640] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2022] [Revised: 09/06/2022] [Accepted: 12/16/2022] [Indexed: 01/18/2023]
Abstract
We develop parametric estimators of a conditional prevalence in the group testing context. Group testing is applied when a binary outcome variable, often a disease indicator, is assessed by testing a specimen for the presence of the disease. Instead of testing all individual specimens separately, these are pooled in groups and the grouped specimens are tested for the disease, which permits to significantly reduce the number of tests to be performed. Various techniques have been developed in the literature for estimating a conditional prevalence from group testing data, but most of them are not valid when the data are subject to missingness. We consider this problem in the case where the specimen and the covariates are subject to nonmonotone missingness. We propose parametric estimators of the conditional prevalence, establish identifiability conditions for a logistic missing not at random model, and introduce an ignorable missing at random model. In theory, our estimators could be applied with multiple covariates missing, but in practice, they face numerical challenges when more than one covariate is missing for given individuals. We illustrate the method on simulated data and on a dataset from the Demographics and Health Survey.
Collapse
Affiliation(s)
- Aurore Delaigle
- School of Mathematics and Statistics, University of Melbourne, 3010, Victoria, Parkville, Australia
| | - Ruoxu Tan
- School of Mathematics and Statistics, University of Melbourne, 3010, Victoria, Parkville, Australia
- Department of Statistics and Actuarial Science, University of Hong Kong, Hong Kong SAR, China
| |
Collapse
|
2
|
Mehta T, Malinovsky Y, Abnet CC, Albert PS. Using group testing in a two-phase epidemiologic design to identify the effects of a large number of antibody reactions on disease risk. BMC Med Res Methodol 2022; 22:324. [PMID: 36526967 PMCID: PMC9756457 DOI: 10.1186/s12874-022-01798-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2022] [Accepted: 11/21/2022] [Indexed: 12/23/2022] Open
Abstract
BACKGROUND The role of immunological responses to exposed bacteria on disease incidence is increasingly under investigation. With many bacterial species, and many potential antibody reactions to a particular species, the large number of assays required for this type of discovery can make it prohibitively expensive. We propose a two-phase group testing design to more efficiently screen numerous antibody effects in a case-control setting. METHODS Phase 1 uses group testing to select antibodies that are differentially expressed between cases and controls. The selected antibodies go on to Phase 2 individual testing. RESULTS We evaluate the two-phase group testing design through simulations and example data and find that it substantially reduces the number of assays required relative to standard case-control and group testing designs, while maintaining similar statistical properties. CONCLUSION The proposed two-phase group testing design can dramatically reduce the number of assays required, while providing comparable results to a case-control design.
Collapse
Affiliation(s)
- Tanvi Mehta
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, 9609 Medical Center Drive, Room SG/7E146, Rockville, MD, 20850, USA
| | - Yaakov Malinovsky
- Department of Mathematics and Statistics, University of Maryland, Baltimore County, Baltimore, MD, USA
| | - Christian C Abnet
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, 9609 Medical Center Drive, Room SG/7E146, Rockville, MD, 20850, USA
| | - Paul S Albert
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, 9609 Medical Center Drive, Room SG/7E146, Rockville, MD, 20850, USA.
| |
Collapse
|
3
|
Zhong Y, Xu P, Zhong S, Ding J. A sequential decoding procedure for pooled quantitative measure. Seq Anal 2022. [DOI: 10.1080/07474946.2022.2043049] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Affiliation(s)
- Yunning Zhong
- School of Mathematics and Statistics, Fujian Normal University, Fuzhou, Fujian, China
| | - Ping Xu
- School of Mathematics and Statistics, Guangxi Normal University, Guilin, Guangxi, China
| | - Siming Zhong
- School of Mathematics and Statistics, Guangxi Normal University, Guilin, Guangxi, China
| | - Juan Ding
- School of Mathematics and Statistics, Guangxi Normal University, Guilin, Guangxi, China
| |
Collapse
|
4
|
Best AF, Malinovsky Y, Albert PS. The efficient design of Nested Group Testing algorithms for disease identification in clustered data. J Appl Stat 2022; 50:2228-2245. [PMID: 37434628 PMCID: PMC10332225 DOI: 10.1080/02664763.2022.2071419] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2021] [Accepted: 04/23/2022] [Indexed: 10/18/2022]
Abstract
Group testing study designs have been used since the 1940s to reduce screening costs for uncommon diseases; for rare diseases, all cases are identifiable with substantially fewer tests than the population size. Substantial research has identified efficient designs under this paradigm. However, little work has focused on the important problem of disease screening among clustered data, such as geographic heterogeneity in HIV prevalence. We evaluated designs where we first estimate disease prevalence and then apply efficient group testing algorithms using these estimates. Specifically, we evaluate prevalence using individual testing on a fixed-size subset of each cluster and use these prevalence estimates to choose group sizes that minimize the corresponding estimated average number of tests per subject. We compare designs where we estimate cluster-specific prevalences as well as a common prevalence across clusters, use different group testing algorithms, construct groups from individuals within and in different clusters, and consider misclassification. For diseases with low prevalence, our results suggest that accounting for clustering is unnecessary. However, for diseases with higher prevalence and sizeable between-cluster heterogeneity, accounting for clustering in study design and implementation improves efficiency. We consider the practical aspects of our design recommendations with two examples with strong clustering effects: (1) Identification of HIV carriers in the US population and (2) Laboratory screening of anti-cancer compounds using cell lines.
Collapse
Affiliation(s)
- Ana F. Best
- Biostatistics Branch, Biometrics Research Program, Division of Cancer Treatment and Diagnosis, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
| | - Yaakov Malinovsky
- Department of Mathematics and Statistics, University of Maryland Baltimore County, Baltimore, MD, USA
| | - Paul S. Albert
- Biostatistics Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
| |
Collapse
|
5
|
Verdun CM, Fuchs T, Harar P, Elbrächter D, Fischer DS, Berner J, Grohs P, Theis FJ, Krahmer F. Group Testing for SARS-CoV-2 Allows for Up to 10-Fold Efficiency Increase Across Realistic Scenarios and Testing Strategies. Front Public Health 2021; 9:583377. [PMID: 34490172 PMCID: PMC8416485 DOI: 10.3389/fpubh.2021.583377] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2020] [Accepted: 07/26/2021] [Indexed: 11/24/2022] Open
Abstract
Background: Due to the ongoing COVID-19 pandemic, demand for diagnostic testing has increased drastically, resulting in shortages of necessary materials to conduct the tests and overwhelming the capacity of testing laboratories. The supply scarcity and capacity limits affect test administration: priority must be given to hospitalized patients and symptomatic individuals, which can prevent the identification of asymptomatic and presymptomatic individuals and hence effective tracking and tracing policies. We describe optimized group testing strategies applicable to SARS-CoV-2 tests in scenarios tailored to the current COVID-19 pandemic and assess significant gains compared to individual testing. Methods: We account for biochemically realistic scenarios in the context of dilution effects on SARS-CoV-2 samples and consider evidence on specificity and sensitivity of PCR-based tests for the novel coronavirus. Because of the current uncertainty and the temporal and spatial changes in the prevalence regime, we provide analysis for several realistic scenarios and propose fast and reliable strategies for massive testing procedures. Key Findings: We find significant efficiency gaps between different group testing strategies in realistic scenarios for SARS-CoV-2 testing, highlighting the need for an informed decision of the pooling protocol depending on estimated prevalence, target specificity, and high- vs. low-risk population. For example, using one of the presented methods, all 1.47 million inhabitants of Munich, Germany, could be tested using only around 141 thousand tests if the infection rate is below 0.4% is assumed. Using 1 million tests, the 6.69 million inhabitants from the city of Rio de Janeiro, Brazil, could be tested as long as the infection rate does not exceed 1%. Moreover, we provide an interactive web application, available at www.grouptexting.com, for visualizing the different strategies and designing pooling schemes according to specific prevalence scenarios and test configurations. Interpretation: Altogether, this work may help provide a basis for an efficient upscaling of current testing procedures, which takes the population heterogeneity into account and is fine-grained towards the desired study populations, e.g., mild/asymptomatic individuals vs. symptomatic ones but also mixtures thereof. Funding: German Science Foundation (DFG), German Federal Ministry of Education and Research (BMBF), Chan Zuckerberg Initiative DAF, and Austrian Science Fund (FWF).
Collapse
Affiliation(s)
- Claudio M. Verdun
- Department of Mathematics, Technical University of Munich, Garching, Germany
- Department of Electrical and Computer Engineering, Technical University of Munich, Munich, Germany
| | - Tim Fuchs
- Department of Mathematics, Technical University of Munich, Garching, Germany
| | - Pavol Harar
- Research Network Data Science, University of Vienna, Vienna, Austria
- Department of Telecommunications, Brno University of Technology, Brno, Czechia
| | | | - David S. Fischer
- Institute of Computational Biology, Helmholtz Zentrum München, Munich, Germany
| | - Julius Berner
- Faculty of Mathematics, University of Vienna, Vienna, Austria
| | - Philipp Grohs
- Research Network Data Science, University of Vienna, Vienna, Austria
- Faculty of Mathematics, University of Vienna, Vienna, Austria
- Johann Radon Institute for Computational and Applied Mathematics, Austrian Academy of Sciences, Linz, Austria
| | - Fabian J. Theis
- Department of Mathematics, Technical University of Munich, Garching, Germany
- Institute of Computational Biology, Helmholtz Zentrum München, Munich, Germany
| | - Felix Krahmer
- Department of Mathematics, Technical University of Munich, Garching, Germany
- Munich Data Science Institute, Technical University of Munich, Garching, Germany
| |
Collapse
|
6
|
Čižikovienė U, Skorniakov V. On a couple of unresolved group testing conjectures. COMMUN STAT-THEOR M 2021. [DOI: 10.1080/03610926.2021.1953531] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
Affiliation(s)
- Ugnė Čižikovienė
- Institute of Applied Mathematics, Faculty of Mathematics and Informatics, Vilnius University, Vilnius, Lithuania
| | - Viktor Skorniakov
- Institute of Applied Mathematics, Faculty of Mathematics and Informatics, Vilnius University, Vilnius, Lithuania
| |
Collapse
|
7
|
|