Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Tao R, Zeng D, Lin DY. Efficient Semiparametric Inference Under Two-Phase Sampling, With Applications to Genetic Association Studies. J Am Stat Assoc 2017;112:1468-1476. [PMID: 29479125 DOI: 10.1080/01621459.2017.1295864] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]

For:	Tao R, Zeng D, Lin DY. Efficient Semiparametric Inference Under Two-Phase Sampling, With Applications to Genetic Association Studies. J Am Stat Assoc 2017;112:1468-1476. [PMID: 29479125 DOI: 10.1080/01621459.2017.1295864] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]

Number

Cited by Other Article(s)

Kang K, Seidlitz J, Bethlehem RAI, Xiong J, Jones MT, Mehta K, Keller AS, Tao R, Randolph A, Larsen B, Tervo-Clemmens B, Feczko E, Dominguez OM, Nelson SM, Schildcrout J, Fair DA, Satterthwaite TD, Alexander-Bloch A, Vandekar S. Study design features increase replicability in brain-wide association studies. Nature 2024;636:719-727. [PMID: 39604734 PMCID: PMC11655360 DOI: 10.1038/s41586-024-08260-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2023] [Accepted: 10/21/2024] [Indexed: 11/29/2024]

Affiliation(s)

Kaidi Kang Department of Biostatistics, Vanderbilt University Medical Center, Nashville, TN, USA.
Jakob Seidlitz Department of Child and Adolescent Psychiatry and Behavioral Sciences, The Children's Hospital of Philadelphia, Philadelphia, PA, USA Department of Psychiatry, University of Pennsylvania, Philadelphia, PA, USA Lifespan Brain Institute of The Children's Hospital of Philadelphia and Penn Medicine, Philadelphia, PA, USA
Richard A I Bethlehem Department of Psychology, University of Cambridge, Cambridge, UK
Jiangmei Xiong Department of Biostatistics, Vanderbilt University Medical Center, Nashville, TN, USA
Megan T Jones Department of Biostatistics, Vanderbilt University Medical Center, Nashville, TN, USA
Kahini Mehta Department of Psychiatry, University of Pennsylvania, Philadelphia, PA, USA Lifespan Brain Institute of The Children's Hospital of Philadelphia and Penn Medicine, Philadelphia, PA, USA Penn Lifespan Informatics and Neuroimaging Center (PennLINC), Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
Arielle S Keller Department of Psychological Sciences, University of Connecticut, Mansfield, CT, USA Institute for the Brain and Cognitive Sciences, University of Connecticut, Mansfield, CT, USA
Ran Tao Department of Biostatistics, Vanderbilt University Medical Center, Nashville, TN, USA Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN, USA
Anita Randolph Department of Pediatrics, University of Minnesota Medical School, Minneapolis, MN, USA Masonic Institute for the Developing Brain, University of Minnesota, Minneapolis, MN, USA
Bart Larsen Department of Pediatrics, University of Minnesota Medical School, Minneapolis, MN, USA Masonic Institute for the Developing Brain, University of Minnesota, Minneapolis, MN, USA
Brenden Tervo-Clemmens Masonic Institute for the Developing Brain, University of Minnesota, Minneapolis, MN, USA Department of Psychiatry and Behavioral Sciences, University of Minnesota Medical School, Minneapolis, MN, USA
Eric Feczko Department of Pediatrics, University of Minnesota Medical School, Minneapolis, MN, USA Masonic Institute for the Developing Brain, University of Minnesota, Minneapolis, MN, USA
Oscar Miranda Dominguez Department of Pediatrics, University of Minnesota Medical School, Minneapolis, MN, USA Masonic Institute for the Developing Brain, University of Minnesota, Minneapolis, MN, USA
Steven M Nelson Department of Pediatrics, University of Minnesota Medical School, Minneapolis, MN, USA Masonic Institute for the Developing Brain, University of Minnesota, Minneapolis, MN, USA
Jonathan Schildcrout Department of Biostatistics, Vanderbilt University Medical Center, Nashville, TN, USA
Damien A Fair Department of Pediatrics, University of Minnesota Medical School, Minneapolis, MN, USA Masonic Institute for the Developing Brain, University of Minnesota, Minneapolis, MN, USA Institute of Child Development, University of Minnesota, Minneapolis, MN, USA
Theodore D Satterthwaite Department of Psychiatry, University of Pennsylvania, Philadelphia, PA, USA Lifespan Brain Institute of The Children's Hospital of Philadelphia and Penn Medicine, Philadelphia, PA, USA Penn Lifespan Informatics and Neuroimaging Center (PennLINC), Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
Aaron Alexander-Bloch Department of Child and Adolescent Psychiatry and Behavioral Sciences, The Children's Hospital of Philadelphia, Philadelphia, PA, USA Department of Psychiatry, University of Pennsylvania, Philadelphia, PA, USA Lifespan Brain Institute of The Children's Hospital of Philadelphia and Penn Medicine, Philadelphia, PA, USA
Simon Vandekar Department of Biostatistics, Vanderbilt University Medical Center, Nashville, TN, USA.

Collapse

Hasler J, Ma Y, Wei Y, Parikh R, Chen J. A SEMIPARAMETRIC METHOD FOR RISK PREDICTION USING INTEGRATED ELECTRONIC HEALTH RECORD DATA. Ann Appl Stat 2024;18:3318-3337. [PMID: 40134753 PMCID: PMC11934126 DOI: 10.1214/24-aoas1938] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/27/2025]

Kang K, Seidlitz J, Bethlehem RA, Xiong J, Jones MT, Mehta K, Keller AS, Tao R, Randolph A, Larsen B, Tervo-Clemmens B, Feczko E, Miranda Dominguez O, Nelson S, Lifespan Brain Chart Consortium, 3R-BRAIN, AIBL, Alzheimer’s Disease Neuroimaging Initiative, Alzheimer’s Disease Repository Without Borders Investigators, CALM Team, CCNP, COBRE, cVEDA, Harvard Aging Brain Study, IMAGEN, POND, The PREVENT-AD Research Group, Schildcrout J, Fair D, Satterthwaite TD, Alexander-Bloch A, Vandekar S. Study design features increase replicability in cross-sectional and longitudinal brain-wide association studies. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.05.29.542742. [PMID: 37398345 PMCID: PMC10312450 DOI: 10.1101/2023.05.29.542742] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/04/2023]

Abstract

Brain-wide association studies (BWAS) are a fundamental tool in discovering brain-behavior associations. Several recent studies showed that thousands of study participants are required for good replicability of BWAS because the standardized effect sizes (ESs) are much smaller than the reported standardized ESs in smaller studies. Here, we perform analyses and meta-analyses of a robust effect size index using 63 longitudinal and cross-sectional magnetic resonance imaging studies from the Lifespan Brain Chart Consortium (77,695 total scans) to demonstrate that optimizing study design is critical for increasing standardized ESs and replicability in BWAS. A meta-analysis of brain volume associations with age indicates that BWAS with larger variability in covariate have larger reported standardized ES. In addition, the longitudinal studies we examined reported systematically larger standardized ES than cross-sectional studies. Analyzing age effects on global and regional brain measures from the United Kingdom Biobank and the Alzheimer's Disease Neuroimaging Initiative, we show that modifying longitudinal study design through sampling schemes improves the standardized ESs and replicability. Sampling schemes that improve standardized ESs and replicability include increasing between-subject age variability in the sample and adding a single additional longitudinal measurement per subject. To ensure that our results are generalizable, we further evaluate these longitudinal sampling schemes on cognitive, psychopathology, and demographic associations with structural and functional brain outcome measures in the Adolescent Brain and Cognitive Development dataset. We demonstrate that commonly used longitudinal models can, counterintuitively, reduce standardized ESs and replicability. The benefit of conducting longitudinal studies depends on the strengths of the between- versus within-subject associations of the brain and non-brain measures. Explicitly modeling between- versus within-subject effects avoids averaging the effects and allows optimizing the standardized ESs for each separately. Together, these results provide guidance for study designs that improve the replicability of BWAS.

Collapse

Affiliation(s)

Kaidi Kang Department of Biostatistics, Vanderbilt University Medical Center
Jakob Seidlitz Department of Child and Adolescent Psychiatry and Behavioral Sciences, The Children’s Hospital of Philadelphia Department of Psychiatry, University of Pennsylvania Lifespan Brain Institute of The Children’s Hospital of Philadelphia and Penn Medicine
Richard A.I. Bethlehem Department of Psychology, University of Cambridge
Jiangmei Xiong Department of Biostatistics, Vanderbilt University Medical Center
Megan T. Jones Department of Biostatistics, Vanderbilt University Medical Center
Kahini Mehta Department of Psychiatry, University of Pennsylvania Lifespan Brain Institute of The Children’s Hospital of Philadelphia and Penn Medicine Penn Lifespan Informatics and Neuroimaging Center (PennLINC), Perelman School of Medicine, University of Pennsylvania
Arielle S. Keller Department of Psychiatry, University of Pennsylvania Lifespan Brain Institute of The Children’s Hospital of Philadelphia and Penn Medicine Penn Lifespan Informatics and Neuroimaging Center (PennLINC), Perelman School of Medicine, University of Pennsylvania
Ran Tao Department of Biostatistics, Vanderbilt University Medical Center
Anita Randolph Department of Pediatrics, University of Minnesota Medical School
Bart Larsen Department of Pediatrics, University of Minnesota Medical School
Brenden Tervo-Clemmens Department of Department of Psychiatry & Behavioral Sciences, University of Minnesota Medical School
Eric Feczko Department of Pediatrics, University of Minnesota Medical School
Oscar Miranda Dominguez Department of Pediatrics, University of Minnesota Medical School
Steve Nelson Department of Pediatrics, University of Minnesota Medical School
Lifespan Brain Chart Consortium
3R-BRAIN
AIBL
Alzheimer’s Disease Neuroimaging Initiative
Alzheimer’s Disease Repository Without Borders Investigators
CALM Team
CCNP
COBRE
cVEDA
Harvard Aging Brain Study
IMAGEN
POND
The PREVENT-AD Research Group
Jonathan Schildcrout Department of Biostatistics, Vanderbilt University Medical Center
Damien Fair Department of Pediatrics, University of Minnesota Medical School
Theodore D. Satterthwaite Department of Psychiatry, University of Pennsylvania Lifespan Brain Institute of The Children’s Hospital of Philadelphia and Penn Medicine Penn Lifespan Informatics and Neuroimaging Center (PennLINC), Perelman School of Medicine, University of Pennsylvania
Aaron Alexander-Bloch Department of Child and Adolescent Psychiatry and Behavioral Sciences, The Children’s Hospital of Philadelphia Department of Psychiatry, University of Pennsylvania Lifespan Brain Institute of The Children’s Hospital of Philadelphia and Penn Medicine
Simon Vandekar Department of Biostatistics, Vanderbilt University Medical Center

Collapse

Di Gravio C, Schildcrout JS, Tao R. Efficient designs and analysis of two-phase studies with longitudinal binary data. Biometrics 2024;80:ujad010. [PMID: 38364804 PMCID: PMC10871867 DOI: 10.1093/biomtc/ujad010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2023] [Revised: 08/23/2023] [Accepted: 11/09/2023] [Indexed: 02/18/2024]

Lee M, Chen J, Zeleniuch-Jacquotte A, Liu M. Goodness-of-fit two-phase sampling designs for time-to-event outcomes: a simulation study based on New York University Women's Health Study for breast cancer. BMC Med Res Methodol 2023;23:119. [PMID: 37208600 DOI: 10.1186/s12874-023-01950-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2022] [Accepted: 05/11/2023] [Indexed: 05/21/2023] Open

Abstract

BACKGROUND

Sub-cohort sampling designs such as a case-cohort study play a key role in studying biomarker-disease associations due to their cost effectiveness. Time-to-event outcome is often the focus in cohort studies, and the research goal is to assess the association between the event risk and risk factors. In this paper, we propose a novel goodness-of-fit two-phase sampling design for time-to-event outcomes when some covariates (e.g., biomarkers) can only be measured on a subgroup of study subjects.

METHODS

Assuming that an external model, which can be the well-established risk models such as the Gail model for breast cancer, Gleason score for prostate cancer, and Framingham risk models for heart diseases, or built from preliminary data, is available to relate the outcome and complete covariates, we propose to oversample subjects with worse goodness-of-fit (GOF) based on an external survival model and time-to-event. With the cases and controls sampled using the GOF two-phase design, the inverse sampling probability weighting method is used to estimate the log hazard ratio of both incomplete and complete covariates. We conducted extensive simulations to evaluate the efficiency gain of our proposed GOF two-phase sampling designs over case-cohort study designs.

RESULTS

Through extensive simulations based on a dataset from the New York University Women's Health Study, we showed that the proposed GOF two-phase sampling designs were unbiased and generally had higher efficiency compared to the standard case-cohort study designs.

CONCLUSION

In cohort studies with rare outcomes, an important design question is how to select informative subjects to reduce sampling costs while maintaining statistical efficiency. Our proposed goodness-of-fit two-phase design provides efficient alternatives to standard case-cohort designs for assessing the association between time-to-event outcome and risk factors. This method is conveniently implemented in standard software.

Collapse

Maronge JM, Schildcrout JS, Rathouz PJ. Model misspecification and robust analysis for outcome-dependent sampling designs under generalized linear models. Stat Med 2023;42:1338-1352. [PMID: 36757145 PMCID: PMC10883476 DOI: 10.1002/sim.9673] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2022] [Revised: 12/19/2022] [Accepted: 01/13/2023] [Indexed: 02/10/2023]

Abstract

Outcome-dependent sampling (ODS) is a commonly used class of sampling designs to increase estimation efficiency in settings where response information (and possibly adjuster covariates) is available, but the exposure is expensive and/or cumbersome to collect. We focus on ODS within the context of a two-phase study, where in Phase One the response and adjuster covariate information is collected on a large cohort that is representative of the target population, but the expensive exposure variable is not yet measured. In Phase Two, using response information from Phase One, we selectively oversample a subset of informative subjects in whom we collect expensive exposure information. Importantly, the Phase Two sample is no longer representative, and we must use ascertainment-correcting analysis procedures for valid inferences. In this paper, we focus on likelihood-based analysis procedures, particularly a conditional-likelihood approach and a full-likelihood approach. Whereas the full-likelihood retains incomplete Phase One data for subjects not selected into Phase Two, the conditional-likelihood explicitly conditions on Phase Two sample selection (ie, it is a "complete case" analysis procedure). These designs and analysis procedures are typically implemented assuming a known, parametric model for the response distribution. However, in this paper, we approach analyses implementing a novel semi-parametric extension to generalized linear models (SPGLM) to develop likelihood-based procedures with improved robustness to misspecification of distributional assumptions. We specifically focus on the common setting where standard GLM distributional assumptions are not satisfied (eg, misspecified mean/variance relationship). We aim to provide practical design guidance and flexible tools for practitioners in these settings.

Collapse

Ryan B, Nirmalkanna A, Cigsar C, Yilmaz YE. Evaluation of Designs and Estimation Methods Under Response-Dependent Two-Phase Sampling for Genetic Association Studies. STATISTICS IN BIOSCIENCES 2023. [DOI: 10.1007/s12561-023-09369-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/05/2023]

Maronge JM, Tao R, Schildcrout JS, Rathouz PJ. Generalized case-control sampling under generalized linear models. Biometrics 2023;79:332-343. [PMID: 34586638 PMCID: PMC9358725 DOI: 10.1111/biom.13571] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2020] [Revised: 08/17/2021] [Accepted: 09/14/2021] [Indexed: 12/01/2022]

Che M, Han P, Lawless JF. Improving estimation efficiency for two-phase, outcome-dependent sampling studies. Electron J Stat 2023. [DOI: 10.1214/23-ejs2124] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/08/2023]

Lotspeich SC, Shepherd BE, Amorim GGC, Shaw PA, Tao R. Efficient odds ratio estimation under two-phase sampling using error-prone data from a multi-national HIV research cohort. Biometrics 2022;78:1674-1685. [PMID: 34213008 PMCID: PMC8720323 DOI: 10.1111/biom.13512] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2020] [Revised: 05/19/2021] [Accepted: 06/17/2021] [Indexed: 12/30/2022]

Chen T, Lumley T. Optimal sampling for design-based estimators of regression models. Stat Med 2022;41:1482-1497. [PMID: 34989429 PMCID: PMC8918008 DOI: 10.1002/sim.9300] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2021] [Revised: 12/02/2021] [Accepted: 12/10/2021] [Indexed: 11/05/2022]

Gravio CD, Tao R, Schildcrout JS. Design and analysis of two-phase studies with multivariate longitudinal data. Biometrics 2022. [PMID: 35014029 DOI: 10.1111/biom.13616] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2021] [Revised: 11/03/2021] [Accepted: 12/10/2021] [Indexed: 11/27/2022]

Cao Y, Haneuse S, Zheng Y, Chen J. Two-phase stratified sampling and analysis for predicting binary outcomes. Biostatistics 2021:6470040. [PMID: 34923588 DOI: 10.1093/biostatistics/kxab044] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2021] [Revised: 11/03/2021] [Accepted: 11/22/2021] [Indexed: 11/13/2022] Open

Amorim G, Tao R, Lotspeich S, Shaw PA, Lumley T, Shepherd BE. Two-Phase Sampling Designs for Data Validation in Settings with Covariate Measurement Error and Continuous Outcome. JOURNAL OF THE ROYAL STATISTICAL SOCIETY. SERIES A, (STATISTICS IN SOCIETY) 2021;184:1368-1389. [PMID: 34975235 PMCID: PMC8715909 DOI: 10.1111/rssa.12689] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]

Le Guen Y, Belloy ME, Napolioni V, Eger SJ, Kennedy G, Tao R, He Z, Greicius MD. A novel age-informed approach for genetic association analysis in Alzheimer's disease. Alzheimers Res Ther 2021;13:72. [PMID: 33794991 PMCID: PMC8017764 DOI: 10.1186/s13195-021-00808-5] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2021] [Accepted: 03/11/2021] [Indexed: 01/17/2023]

Tao R, Lotspeich SC, Amorim G, Shaw PA, Shepherd BE. Efficient semiparametric inference for two-phase studies with outcome and covariate measurement errors. Stat Med 2021;40:725-738. [PMID: 33145800 PMCID: PMC8214478 DOI: 10.1002/sim.8799] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2020] [Revised: 09/07/2020] [Accepted: 10/20/2020] [Indexed: 11/07/2022]

Tao R, Mercaldo ND, Haneuse S, Maronge JM, Rathouz PJ, Heagerty PJ, Schildcrout JS. Two-wave two-phase outcome-dependent sampling designs, with applications to longitudinal binary data. Stat Med 2021;40:1863-1876. [PMID: 33442883 DOI: 10.1002/sim.8876] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2020] [Revised: 12/07/2020] [Accepted: 12/25/2020] [Indexed: 12/26/2022]

Chen T, Lumley T. Optimal multiwave sampling for regression modeling in two-phase designs. Stat Med 2020;39:4912-4921. [PMID: 33016376 PMCID: PMC7902311 DOI: 10.1002/sim.8760] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2020] [Revised: 08/27/2020] [Accepted: 09/08/2020] [Indexed: 11/09/2022]

Han K, Lumley T, Shepherd BE, Shaw PA. Two-phase analysis and study design for survival models with error-prone exposures. Stat Methods Med Res 2020;30:962280220978500. [PMID: 33327876 PMCID: PMC8715910 DOI: 10.1177/0962280220978500] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/10/2023]

Shepherd BE, Shaw PA. Errors in multiple variables in human immunodeficiency virus (HIV) cohort and electronic health record data: statistical challenges and opportunities. STATISTICAL COMMUNICATIONS IN INFECTIOUS DISEASES 2020;12:20190015. [PMID: 35880997 PMCID: PMC9204761 DOI: 10.1515/scid-2019-0015] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/15/2019] [Accepted: 08/21/2020] [Indexed: 06/15/2023]

Che M, Lawless JF, Han P. Empirical and conditional likelihoods for two‐phase studies. CAN J STAT 2020. [DOI: 10.1002/cjs.11566] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]

Wang L, Williams ML, Chen Y, Chen J. Novel two-phase sampling designs for studying binary outcomes. Biometrics 2020;76:210-223. [PMID: 31449330 PMCID: PMC7042058 DOI: 10.1111/biom.13140] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2017] [Accepted: 08/06/2019] [Indexed: 11/26/2022]

Schildcrout JS, Haneuse S, Tao R, Zelnick LR, Schisterman EF, Garbett SP, Mercaldo ND, Rathouz PJ, Heagerty PJ. Two-Phase, Generalized Case-Control Designs for the Study of Quantitative Longitudinal Outcomes. Am J Epidemiol 2020;189:81-90. [PMID: 31165875 DOI: 10.1093/aje/kwz127] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2018] [Revised: 05/06/2019] [Accepted: 05/14/2019] [Indexed: 01/30/2023] Open

Flanders WD. Invited Commentary: Two-Phase, Generalized Case-Control Designs for Quantitative Longitudinal Outcomes and Evolution of the Case-Control Study. Am J Epidemiol 2020;189:91-94. [PMID: 31566676 DOI: 10.1093/aje/kwz200] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2019] [Revised: 08/23/2019] [Accepted: 08/27/2019] [Indexed: 11/12/2022] Open

Tao R, Zeng D, Lin DY. Optimal Designs of Two-Phase Studies. J Am Stat Assoc 2019;115:1946-1959. [PMID: 33716361 DOI: 10.1080/01621459.2019.1671200] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]

Ni A, Satagopan JM. Estimating Additive Interaction Effect in Stratified Two-Phase Case-Control Design. Hum Hered 2019;84:90-108. [PMID: 31634888 PMCID: PMC6925975 DOI: 10.1159/000502738] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2018] [Accepted: 08/15/2019] [Indexed: 11/19/2022] Open

Abstract

BACKGROUND AND AIMS

There is considerable interest in epidemiology to estimate an additive interaction effect between two risk factors in case-control studies. An additive interaction is defined as the differential reduction in absolute risk associated with one factor between different levels of the other factor. A stratified two-phase case-control design is commonly used in epidemiology to reduce the cost of assembling covariates. It is crucial to obtain valid estimates of the model parameters by accounting for the underlying stratification scheme to obtain accurate and precise estimates of additive interaction effects. The aim of this paper is to examine the properties of different methods for estimating model parameters and additive interaction effects under a stratified two-phase case-control design.

METHODS

Using simulations, we investigate the properties of three existing methods, namely stratum-specific offset, inverse-probability weighting, and multiple imputation for estimating model parameters and additive interaction effects. We also illustrate these properties using data from two published epidemiology studies.

RESULTS

Simulation studies show that the multiple imputation method performs well when both the true and analysis models are additive (i.e., does not include multiplicative interaction terms) but does not provide a discernible advantage over the offset method when the analysis models are non-additive (i.e., includes multiplicative interaction terms). The offset method exhibits the best overall properties when the analysis model contains multiplicative interaction effects.

CONCLUSION

When estimating additive interaction between risk factors in stratified two-phase case-control studies, we recommend estimating model parameters using multiple imputation when the analysis model is additive, and we recommend the offset method when the analysis model is non-additive.

Collapse

Bjørnland T, Bye A, Ryeng E, Wisløff U, Langaas M. Powerful extreme phenotype sampling designs and score tests for genetic association studies. Stat Med 2018;37:4234-4251. [PMID: 30088284 DOI: 10.1002/sim.7914] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2017] [Revised: 06/20/2018] [Accepted: 06/25/2018] [Indexed: 12/15/2022]