Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Alonzo TA, Brinton JT, Ringham BM, Glueck DH. Bias in estimating accuracy of a binary screening test with differential disease verification. Stat Med 2011;30:1852-64. [PMID: 21495059 DOI: 10.1002/sim.4232] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2010] [Accepted: 01/18/2011] [Indexed: 12/14/2022]

For:	Alonzo TA, Brinton JT, Ringham BM, Glueck DH. Bias in estimating accuracy of a binary screening test with differential disease verification. Stat Med 2011;30:1852-64. [PMID: 21495059 DOI: 10.1002/sim.4232] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2010] [Accepted: 01/18/2011] [Indexed: 12/14/2022]

Number

Cited by Other Article(s)

Fuld S, Constantinescu G, Pamporaki C, Peitzsch M, Schulze M, Yang J, Müller L, Prejbisz A, Januszewicz A, Remde H, Kürzinger L, Dischinger U, Ernst M, Gruber S, Reincke M, Beuschlein F, Lenders JWM, Eisenhofer G. Screening for Primary Aldosteronism by Mass Spectrometry Versus Immunoassay Measurements of Aldosterone: A Prospective Within-Patient Study. J Appl Lab Med 2024;9:752-766. [PMID: 38532521 DOI: 10.1093/jalm/jfae017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2023] [Accepted: 01/18/2024] [Indexed: 03/28/2024]

Abstract

BACKGROUND

Measurements of aldosterone by mass spectrometry are more accurate and less prone to interferences than immunoassay measurements, and may produce a more accurate aldosterone:renin ratio (ARR) when screening for primary aldosteronism (PA).

METHODS

Differences in diagnostic performance of the ARR using mass spectrometry vs immunoassay measurements of aldosterone were examined in 710 patients screened for PA. PA was confirmed in 153 patients and excluded in 451 others. Disease classifications were not achieved in 106 patients. Areas under receiver-operating characteristic curves (AUROC) and other measures were used to compare diagnostic performance.

RESULTS

Mass spectrometry-based measurements yielded lower plasma aldosterone concentrations than immunoassay measurements. For the ARR based on immunoassay measurements of aldosterone, AUROCs were slightly lower (P = 0.018) than those using mass spectrometry measurements (0.895 vs 0.906). The cutoff for the ARR to reach a sensitivity of 95% was 30 and 21.5 pmol/mU by respective immunoassay and mass spectrometry-based measurements, which corresponded to specificities of 57% for both. With data restricted to patients with unilateral PA, diagnostic sensitivities of 94% with specificities >81% could be achieved at cutoffs of 68 and 52 pmol/mU for respective immunoassay and mass spectrometry measurements.

CONCLUSIONS

Mass spectrometry-based measurements of aldosterone for the ARR provide no clear diagnostic advantage over immunoassay-based measurements. Both approaches offer limited diagnostic accuracy for the ARR as a screening test. One solution is to employ the higher cutoffs to triage patients likely to have unilateral PA for further tests and possible adrenalectomy, while using the lower cutoffs to identify others for targeted medical therapy.German Clinical Trials Register ID: DRKS00017084.

Collapse

Affiliation(s)

Sybille Fuld Department of Medicine III, University Hospital Carl Gustav Carus, Technische Universität Dresden, Dresden, Germany
Georgiana Constantinescu Department of Medicine III, University Hospital Carl Gustav Carus, Technische Universität Dresden, Dresden, Germany
Christina Pamporaki Department of Medicine III, University Hospital Carl Gustav Carus, Technische Universität Dresden, Dresden, Germany
Mirko Peitzsch Institute of Clinical Chemistry and Laboratory Medicine, University Hospital Carl Gustav Carus, Technische Universität Dresden, Dresden, Germany
Manuel Schulze Center for Interdisciplinary Digital Sciences, Department Information Services and High Performance Computing, Technische Universität Dresden, Dresden, Germany
Jun Yang Centre for Endocrinology and Metabolism, Hudson Institute of Medical Research, Clayton, Australia
Lisa Müller Department of Medicine IV, University Hospital, Ludwig Maximilian University Munich, Munich, Germany
Aleksander Prejbisz Department of Epidemiology, Cardiovascular Prevention and Health Promotion, National Institute of Cardiology, Warsaw, Poland
Andrzej Januszewicz Department of Hypertension, National Institute of Cardiology, Warsaw, Poland
Hanna Remde Department of Internal Medicine I, Division of Endocrinology and Diabetes, University Hospital, University of Würzburg, Würzburg, Germany
Lydia Kürzinger Department of Internal Medicine I, Division of Endocrinology and Diabetes, University Hospital, University of Würzburg, Würzburg, Germany
Ulrich Dischinger Department of Internal Medicine I, Division of Endocrinology and Diabetes, University Hospital, University of Würzburg, Würzburg, Germany
Matthias Ernst Department of Endocrinology, Diabetology and Clinical Nutrition, University Hospital Zurich (USZ) and University of Zurich (UZH), Zurich, Switzerland
Sven Gruber Department of Endocrinology, Diabetology and Clinical Nutrition, University Hospital Zurich (USZ) and University of Zurich (UZH), Zurich, Switzerland
Martin Reincke Department of Medicine IV, University Hospital, Ludwig Maximilian University Munich, Munich, Germany
Felix Beuschlein Department of Medicine IV, University Hospital, Ludwig Maximilian University Munich, Munich, Germany Department of Endocrinology, Diabetology and Clinical Nutrition, University Hospital Zurich (USZ) and University of Zurich (UZH), Zurich, Switzerland The LOOP Medical Research Center, Zurich, Switzerland
Jacques W M Lenders Department of Medicine III, University Hospital Carl Gustav Carus, Technische Universität Dresden, Dresden, Germany Department of Internal Medicine, Radboud University Medical Center, Nijmegen, the Netherlands
Graeme Eisenhofer Department of Medicine III, University Hospital Carl Gustav Carus, Technische Universität Dresden, Dresden, Germany

Collapse

Chubak J, Burnett-Hartman AN, Barlow WE, Corley DA, Croswell JM, Neslund-Dudas C, Vachani A, Silver MI, Tiro JA, Kamineni A. Estimating Cancer Screening Sensitivity and Specificity Using Healthcare Utilization Data: Defining the Accuracy Assessment Interval. Cancer Epidemiol Biomarkers Prev 2022;31:1517-1520. [PMID: 35916602 PMCID: PMC9484579 DOI: 10.1158/1055-9965.epi-22-0232] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2022] [Revised: 04/29/2022] [Accepted: 05/23/2022] [Indexed: 11/16/2022] Open

Day E, Eldred-Evans D, Prevost AT, Ahmed HU, Fiorentino F. Adjusting for verification bias in diagnostic accuracy measures when comparing multiple screening tests - an application to the IP1-PROSTAGRAM study. BMC Med Res Methodol 2022;22:70. [PMID: 35300611 PMCID: PMC8932251 DOI: 10.1186/s12874-021-01481-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2021] [Accepted: 11/18/2021] [Indexed: 11/29/2022] Open

Abstract

Introduction

Novel screening tests used to detect a target condition are compared against either a reference standard or other existing screening methods. However, as it is not always possible to apply the reference standard on the whole population under study, verification bias is introduced. Statistical methods exist to adjust estimates to account for this bias. We extend common methods to adjust for verification bias when multiple tests are compared to a reference standard using data from a prospective double blind screening study for prostate cancer.

Methods

Begg and Greenes method and multiple imputation are extended to include the results of multiple screening tests which determine condition verification status. These two methods are compared to the complete case analysis using the IP1-PROSTAGRAM study data. IP1-PROSTAGRAM used a paired-cohort double-blind design to evaluate the use of imaging as alternative tests to screen for prostate cancer, compared to a blood test called prostate specific antigen (PSA). Participants with positive imaging (index) and/or PSA (control) underwent a prostate biopsy (reference standard).

Results

When comparing complete case results to Begg and Greenes and methods of multiple imputation there is a statistically significant increase in the specificity estimates for all screening tests. Sensitivity estimates remained similar across the methods, with completely overlapping 95% confidence intervals. Negative predictive value (NPV) estimates were higher when adjusting for verification bias, compared to complete case analysis, even though the 95% confidence intervals overlap. Positive predictive value (PPV) estimates were similar across all methods.

Conclusion

Statistical methods are required to adjust for verification bias in accuracy estimates of screening tests. Expanding Begg and Greenes method to include multiple screening tests can be computationally intensive, hence multiple imputation is recommended, especially as it can be modified for low prevalence of the target condition.

Supplementary Information

The online version contains supplementary material available at 10.1186/s12874-021-01481-w.

Collapse

Umemneku Chikere CM, Wilson K, Graziadio S, Vale L, Allen AJ. Diagnostic test evaluation methodology: A systematic review of methods employed to evaluate diagnostic tests in the absence of gold standard - An update. PLoS One 2019;14:e0223832. [PMID: 31603953 PMCID: PMC6788703 DOI: 10.1371/journal.pone.0223832] [Citation(s) in RCA: 101] [Impact Index Per Article: 20.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2019] [Accepted: 09/29/2019] [Indexed: 12/29/2022] Open

Abstract

OBJECTIVE

To systematically review methods developed and employed to evaluate the diagnostic accuracy of medical test when there is a missing or no gold standard.

STUDY DESIGN AND SETTINGS

Articles that proposed or applied any methods to evaluate the diagnostic accuracy of medical test(s) in the absence of gold standard were reviewed. The protocol for this review was registered in PROSPERO (CRD42018089349).

RESULTS

Identified methods were classified into four main groups: methods employed when there is a missing gold standard; correction methods (which make adjustment for an imperfect reference standard with known diagnostic accuracy measures); methods employed to evaluate a medical test using multiple imperfect reference standards; and other methods, like agreement studies, and a mixed group of alternative study designs. Fifty-one statistical methods were identified from the review that were developed to evaluate medical test(s) when the true disease status of some participants is unverified with the gold standard. Seven correction methods were identified and four methods were identified to evaluate medical test(s) using multiple imperfect reference standards. Flow-diagrams were developed to guide the selection of appropriate methods.

CONCLUSION

Various methods have been proposed to evaluate medical test(s) in the absence of a gold standard for some or all participants in a diagnostic accuracy study. These methods depend on the availability of the gold standard, its' application to the participants in the study and the availability of alternative reference standard(s). The clinical application of some of these methods, especially methods developed when there is missing gold standard is however limited. This may be due to the complexity of these methods and/or a disconnection between the fields of expertise of those who develop (e.g. mathematicians) and those who employ the methods (e.g. clinical researchers). This review aims to help close this gap with our classification and guidance tools.

Collapse

Naaktgeboren CA, de Groot JAH, Rutjes AWS, Bossuyt PMM, Reitsma JB, Moons KGM. Anticipating missing reference standard data when planning diagnostic accuracy studies. BMJ 2016;352:i402. [PMID: 26861453 PMCID: PMC4772780 DOI: 10.1136/bmj.i402] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

Sun X, Allison C, Matthews FE, Zhang Z, Auyeung B, Baron-Cohen S, Brayne C. Exploring the Underdiagnosis and Prevalence of Autism Spectrum Conditions in Beijing. Autism Res 2015;8:250-60. [PMID: 25952676 PMCID: PMC4690159 DOI: 10.1002/aur.1441] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2014] [Accepted: 11/17/2014] [Indexed: 01/07/2023]

Margel D, Benjaminov O, Ozalvo R, Shavit Grievink L, Kedar I, Yerushalmi R, Ben-Aharon I, Neiman V, Yossepowitch O, Kedar D, Levy Z, Shohat M, Brenner B, Baniel J, Rosenbaum E. Personalized prostate cancer screening among men with high risk genetic predisposition- study protocol for a prospective cohort study. BMC Cancer 2014;14:528. [PMID: 25047061 PMCID: PMC4223504 DOI: 10.1186/1471-2407-14-528] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2014] [Accepted: 07/10/2014] [Indexed: 12/24/2022] Open

Collins J, Huynh M. Estimation of diagnostic test accuracy without full verification: a review of latent class methods. Stat Med 2014;33:4141-69. [PMID: 24910172 DOI: 10.1002/sim.6218] [Citation(s) in RCA: 74] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2013] [Revised: 05/02/2014] [Accepted: 05/05/2014] [Indexed: 11/09/2022]

Whiting PF, Rutjes AWS, Westwood ME, Mallett S. A systematic review classifies sources of bias and variation in diagnostic test accuracy studies. J Clin Epidemiol 2013;66:1093-104. [PMID: 23958378 DOI: 10.1016/j.jclinepi.2013.05.014] [Citation(s) in RCA: 190] [Impact Index Per Article: 17.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2012] [Revised: 05/08/2013] [Accepted: 05/15/2013] [Indexed: 11/15/2022]

Abbey CK, Eckstein MP, Boone JM. Estimating the relative utility of screening mammography. Med Decis Making 2013;33:510-20. [PMID: 23295543 DOI: 10.1177/0272989x12470756] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]