1
|
Schwäble Santamaria A, Grassi M, Meeusen JW, Lieske JC, Scott R, Robertson A, Schiffer E. Performance of Nuclear Magnetic Resonance-Based Estimated Glomerular Filtration Rate in a Real-World Setting. Bioengineering (Basel) 2023; 10:717. [PMID: 37370648 DOI: 10.3390/bioengineering10060717] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2023] [Revised: 05/25/2023] [Accepted: 06/06/2023] [Indexed: 06/29/2023] Open
Abstract
An accurate estimate of glomerular filtration rate (eGFR) is essential for proper clinical management, especially in patients with kidney dysfunction. This prospective observational study evaluated the real-world performance of the nuclear magnetic resonance (NMR)-based GFRNMR equation, which combines creatinine, cystatin C, valine, and myo-inositol with age and sex. We compared GFRNMR performance to that of the 2021 CKD-EPI creatinine and creatinine-cystatin C equations (CKD-EPI2021Cr and CKD-EPI2021CrCys), using 115 fresh routine samples of patients scheduled for urinary iothalamate clearance measurement (mGFR). Median bias to mGFR of the three eGFR equations was comparably low, ranging from 0.4 to 2.0 mL/min/1.73 m2. GFRNMR outperformed the 2021 CKD-EPI equations in terms of precision (interquartile range to mGFR of 10.5 vs. 17.9 mL/min/1.73 m2 for GFRNMR vs. CKD-EPI2021CrCys; p = 0.01) and accuracy (P15, P20, and P30 of 66.1% vs. 48.7% [p = 0.007], 80.0% vs. 60.0% [p < 0.001] and 95.7% vs. 86.1% [p = 0.006], respectively, for GFRNMR vs. CKD-EPI2021CrCys). Clinical parameters such as etiology, comorbidities, or medications did not significantly alter the performance of the three eGFR equations. Altogether, this study confirmed the utility of GFRNMR for accurate GFR estimation, and its potential value in routine clinical practice for improved medical care.
Collapse
Affiliation(s)
| | - Marcello Grassi
- Department of Research and Development, Numares AG, 93053 Regensburg, Germany
| | - Jeffrey W Meeusen
- Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, MN 55905, USA
| | - John C Lieske
- Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, MN 55905, USA
- Division of Nephrology and Hypertension, Mayo Clinic, Rochester, MN 55905, USA
| | - Renee Scott
- Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, MN 55905, USA
| | - Andrew Robertson
- Department of Research and Development, Numares AG, 93053 Regensburg, Germany
| | - Eric Schiffer
- Department of Research and Development, Numares AG, 93053 Regensburg, Germany
| |
Collapse
|
2
|
Jiang S, Cook RJ. The polytomous discrimination index for prediction involving multistate processes under intermittent observation. Stat Med 2022; 41:3661-3678. [PMID: 35596238 PMCID: PMC9308735 DOI: 10.1002/sim.9441] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2021] [Revised: 04/19/2022] [Accepted: 05/10/2022] [Indexed: 11/09/2022]
Abstract
With the increasing importance of predictive modeling in health research comes the need for methods to rigorously assess predictive accuracy. We consider the problem of evaluating the accuracy of predictive models for nominal outcomes when outcome data are coarsened at random. We first consider the problem in the context of a multinomial response modeled by polytomous logistic regression. Attention is then directed to the motivating setting in which class membership corresponds to the state occupied in a multistate disease process at a time horizon of interest. Here, class (state) membership may be unknown at the time horizon since disease processes are under intermittent observation. We propose a novel extension to the polytomous discrimination index to address this and evaluate the predictive accuracy of an intensity-based model in the context of a study involving patients with arthritis from a registry at the University of Toronto Centre for Prognosis Studies in Rheumatic Diseases.
Collapse
Affiliation(s)
- Shu Jiang
- Division of Public Health Sciences, Washington University School of Medicine in St. Louis, MO, USA
| | - Richard J. Cook
- Department of Statistics and Actuarial Science, University of Waterloo, ON, Canada
| |
Collapse
|
3
|
Bantis LE, Tsimikas JV. On optimal biomarker cutoffs accounting for misclassification costs in diagnostic trilemmas with applications to pancreatic cancer. Stat Med 2022; 41:3527-3546. [PMID: 35543227 PMCID: PMC9707502 DOI: 10.1002/sim.9432] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2020] [Revised: 03/29/2022] [Accepted: 04/06/2022] [Indexed: 11/11/2022]
Abstract
Pancreatic ductal adenocarcinoma (PDAC) is the most deadly cancer and currently there is strong clinical interest in novel biomarkers that contribute to its early detection. Assessing appropriately the accuracy of such biomarkers is a crucial issue and often one needs to take into account that many assays include biospecimens of individuals coming from three groups: healthy, chronic pancreatitis, and PDAC. The ROC surface is an appropriate tool for assessing the overall accuracy of a marker employed under such trichotomous settings. A decision/classification rule is often based on the so-called Youden index and its three-dimensional generalization. However, both the clinical and the statistical literature have not paid the necessary attention to the underlying false classification (FC) rates that are of equal or even greater importance. In this article we provide a framework to make inferences around all classification rates as well as comparisons. We explore the trinormal model, flexible models based on power transformations, and robust non-parametric alternatives. We provide a full framework for the construction of confidence intervals, regions, and spaces for joint inferences or for clinically meaningful points of interest. We further discuss the implications of costs related to different FCs. We evaluate our approaches through extensive simulations and illustrate them using data from a recent PDAC study conducted at the MD Anderson Cancer Center.
Collapse
Affiliation(s)
- Leonidas E. Bantis
- Dept. of Biostatistics and Data Science, University of Kansas Medical Center, Kansas City, U.S.A
| | - John V. Tsimikas
- Dept of Statistics and Actuarial-Financial Mathematics, University of the Aegean, Samos, Greece
| |
Collapse
|
4
|
Kazianka H, Morgenbesser A, Nowak T. Assessing the discriminatory power of loss given default models. J Appl Stat 2022; 49:2700-2716. [DOI: 10.1080/02664763.2021.1910936] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
Affiliation(s)
- Hannes Kazianka
- Department of Statistics, University of Klagenfurt, Klagenfurt, Austria
| | | | | |
Collapse
|
5
|
Sener H, Gulmez Sevim D, Erkilic K, Oner A, Gunay Sener AB. Evaluation of Ring Amplitude and Factors Affecting Ring Amplitude in Multifocal Electroretinography in Diabetic Eyes. Semin Ophthalmol 2022; 37:895-901. [PMID: 35834721 DOI: 10.1080/08820538.2022.2100714] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]
Abstract
PURPOSE The aim of this paper was to evaluate the ring amplitudes in diabetic patients and to evaluate the effect of the risk factors for diabetic retinopathy on the ring amplitudes. We also aimed to investigate the success of ring amplitudes in classifying diabetic retinopathy. METHODS The study included 32 eyes of 32 diabetic patients without retinopathy (DM), 34 eyes of 34 patients with mild non-proliferative diabetic retinopathy (NPDR) without macular edema, and 62 eyes of 62 age- and sex-matched controls (CG). All subjects were evaluated using mfERG. The relationship between age, diabetes duration, HbA1c and ring amplitudes and the effect of diabetes and hypertension on ring amplitudes were evaluated. Three-way ROC analysis was performed to evaluate the discrimination power of the ring amplitudes. RESULTS In the comparison of the ring amplitudes, the amplitudes of the DM and NPDR groups were statistically significantly decreased compared to the CG (p < .05). A moderate to strong correlation was found between the duration of diabetes, HbA1c and ring amplitudes (p < .05). The effect of diabetes decreased towards the peripheral rings and hypertension did not affect ring amplitudes. Volume under the ROC surface of R1 = 0.65 had p < .05 and 95% CI [0.50-0.72], and the best cut-off point pair to differentiate the three classes was found to be c1 = 217.3, c2 = 151.2 in three-way ROC analysis. CONCLUSION In conclusion, the effects of diabetes are unevenly distributed on the retina topographically. Diabetes affects the central rings more than peripheral rings in multifocal ERG. Both ring densities and ring ratios are effective ways to identify early changes in retinal function.
Collapse
Affiliation(s)
- Hidayet Sener
- Department of Ophthalmology, Cukurca State Hospital, Hakkari, Turkey
| | - Duygu Gulmez Sevim
- Department of Ophthalmology, Erciyes University School of Medicine, Kayseri, Turkey
| | - Kuddusi Erkilic
- Department of Ophthalmology, Erciyes University School of Medicine, Kayseri, Turkey
| | - Ayse Oner
- Department of Ophthalmology, Acibadem Hospital, Kayseri, Turkey
| | - Ayse Busra Gunay Sener
- Department of Medical Informatics and Biostatistics, Erciyes University School of Medicine, Kayseri, Turkey
| |
Collapse
|
6
|
Kersey J, Samawi H, Yin J, Rochani H, Zhang X. On diagnostic accuracy measure with cut-points criterion for ordinal disease classification based on concordance and discordance. J Appl Stat 2022. [DOI: 10.1080/02664763.2022.2041567] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
Affiliation(s)
- Jing Kersey
- Department of Biostatistics, Epidemiology and Environmental Sciences, Jiann-Ping Hsu College of Public Health, Georgia Southern University, Statesboro
| | - Hani Samawi
- Department of Biostatistics, Epidemiology and Environmental Sciences, Jiann-Ping Hsu College of Public Health, Georgia Southern University, Statesboro
| | - Jingjing Yin
- Department of Biostatistics, Epidemiology and Environmental Sciences, Jiann-Ping Hsu College of Public Health, Georgia Southern University, Statesboro
| | - Haresh Rochani
- Department of Biostatistics, Epidemiology and Environmental Sciences, Jiann-Ping Hsu College of Public Health, Georgia Southern University, Statesboro
| | - Xinyan Zhang
- Department of Biostatistics, Epidemiology and Environmental Sciences, Jiann-Ping Hsu College of Public Health, Georgia Southern University, Statesboro
| |
Collapse
|
7
|
Yim S, Kim S, Kim I, Park JW, Cho JH, Hong M, Kang KH, Kim M, Kim SJ, Kim YJ, Kim YH, Lim SH, Sung SJ, Kim N, Baek SH. Accuracy of one-step automated orthodontic diagnosis model using a convolutional neural network and lateral cephalogram images with different qualities obtained from nationwide multi-hospitals. Korean J Orthod 2022; 52:3-19. [PMID: 35046138 PMCID: PMC8770967 DOI: 10.4041/kjod.2022.52.1.3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2021] [Revised: 06/01/2021] [Accepted: 07/02/2021] [Indexed: 11/10/2022] Open
Abstract
Objective The purpose of this study was to investigate the accuracy of one-step automated orthodontic diagnosis of skeletodental discrepancies using a convolutional neural network (CNN) and lateral cephalogram images with different qualities from nationwide multi-hospitals. Methods Among 2,174 lateral cephalograms, 1,993 cephalograms from two hospitals were used for training and internal test sets and 181 cephalograms from eight other hospitals were used for an external test set. They were divided into three classification groups according to anteroposterior skeletal discrepancies (Class I, II, and III), vertical skeletal discrepancies (normodivergent, hypodivergent, and hyperdivergent patterns), and vertical dental discrepancies (normal overbite, deep bite, and open bite) as a gold standard. Pre-trained DenseNet-169 was used as a CNN classifier model. Diagnostic performance was evaluated by receiver operating characteristic (ROC) analysis, t-stochastic neighbor embedding (t-SNE), and gradient-weighted class activation mapping (Grad-CAM). Results In the ROC analysis, the mean area under the curve and the mean accuracy of all classifications were high with both internal and external test sets (all, > 0.89 and > 0.80). In the t-SNE analysis, our model succeeded in creating good separation between three classification groups. Grad-CAM figures showed differences in the location and size of the focus areas between three classification groups in each diagnosis. Conclusions Since the accuracy of our model was validated with both internal and external test sets, it shows the possible usefulness of a one-step automated orthodontic diagnosis tool using a CNN model. However, it still needs technical improvement in terms of classifying vertical dental discrepancies.
Collapse
Affiliation(s)
- Sunjin Yim
- Department of Orthodontics, School of Dentistry, Seoul National University, Seoul, Korea
| | - Sungchul Kim
- Department of Biomedical Engineering, Asan Medical Institute of Convergence Science and Technology, Asan Medical Center, University of Ulsan College of Medicine, Seoul, Korea
| | - Inhwan Kim
- Department of Biomedical Engineering, Asan Medical Institute of Convergence Science and Technology, Asan Medical Center, University of Ulsan College of Medicine, Seoul, Korea
| | | | - Jin-Hyoung Cho
- Department of Orthodontics, Chonnam National University School of Dentistry, Gwangju, Korea
| | - Mihee Hong
- Department of Orthodontics, School of Dentistry, Kyungpook National University, Daegu, Korea
| | - Kyung-Hwa Kang
- Department of Orthodontics, School of Dentistry, Wonkwang University, Iksan, Korea
| | - Minji Kim
- Department of Orthodontics, College of Medicine, Ewha Womans University, Seoul, Korea
| | - Su-Jung Kim
- Department of Orthodontics, Kyung Hee University School of Dentistry, Seoul, Korea
| | - Yoon-Ji Kim
- Department of Orthodontics, Asan Medical Center, University of Ulsan College of Medicine, Seoul, Korea
| | - Young Ho Kim
- Department of Orthodontics, Institute of Oral Health Science, Ajou University School of Medicine, Suwon, Korea
| | - Sung-Hoon Lim
- Department of Orthodontics, College of Dentistry, Chosun University, Gwangju, Korea
| | - Sang Jin Sung
- Department of Orthodontics, Asan Medical Center, University of Ulsan College of Medicine, Seoul, Korea
| | - Namkug Kim
- Department of Convergence Medicine, Asan Medical Center, University of Ulsan College of Medicine, Seoul, Korea
| | - Seung-Hak Baek
- Department of Orthodontics, School of Dentistry, Dental Research Institute, Seoul National University, Seoul, Korea
| |
Collapse
|
8
|
Xiong C, Luo J, Agboola F, Grant E, Morris JC. A family of estimators to diagnostic accuracy when candidate tests are subject to detection limits-Application to diagnosing early stage Alzheimer disease. Stat Methods Med Res 2022; 31:882-898. [PMID: 35044258 DOI: 10.1177/09622802211072511] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
In disease diagnosis, individuals are usually assumed to be one of the two basic types, healthy or diseased, as typically based on an established gold standard. Candidate markers for diagnosing a disease often are much cheaper and less invasive than the gold standard but must be evaluated against the gold standard for their sensitivity and specificity to accurately diagnose the disease. When candidate diagnostic markers are fully measured, receiver operating characteristic curves have been the standard approaches for assessing diagnostic accuracy. However, full measurements of diagnostic markers may not be available above or below certain limits due to various practical and technical limitations. For example, in the diagnosis of Alzheimer disease using cerebrospinal fluid biomarkers, the Roche Elecsys® immunoassays have a measuring range for multiple cerebrospinal fluid molecular concentrations. Many cognitive tests used in diagnosing dementia due to Alzheimer disease are also subject to detection limits, often referred to as the floor and ceiling effects in the neuropsychological literature. We propose a new statistical methodology for estimating the diagnostic accuracy when a diagnostic marker is subject to detection limits by dividing the entire study sample into two sub-samples by a threshold of the diagnostic marker. We then propose a family of estimators to the area under the receiver operating characteristic curve by combining a conditional nonparametric estimator and another conditional semi-parametric estimator derived from Cox's proportional hazards model. We derive the variance to the proposed estimators, and further, assess the performance of the proposed estimators as a function of possible thresholds through an extensive simulation study, and recommend the optimum thresholds. Finally, we apply the proposed methodology to assess the ability of several cerebrospinal fluid biomarkers and cognitive tests in diagnosing early stage Alzheimer disease dementia.
Collapse
Affiliation(s)
- Chengjie Xiong
- Division of Biostatistics, Washington University School of Medicine, St. Louis, MO, USA.,Knight Alzheimer Disease Research Center, Washington University School of Medicine, St. Louis, MO, USA
| | - Jingqin Luo
- Division of Biostatistics, Washington University School of Medicine, St. Louis, MO, USA.,Division of Public Health Sciences, Department of Surgery, Washington University School of Medicine, St. Louis, MO, USA.,Siteman Cancer Center Biostatistics Core, Washington University School of Medicine, St. Louis, MO, USA
| | - Folasade Agboola
- Division of Biostatistics, Washington University School of Medicine, St. Louis, MO, USA.,Knight Alzheimer Disease Research Center, Washington University School of Medicine, St. Louis, MO, USA
| | - Elizabeth Grant
- Division of Biostatistics, Washington University School of Medicine, St. Louis, MO, USA.,Knight Alzheimer Disease Research Center, Washington University School of Medicine, St. Louis, MO, USA
| | - John C Morris
- Knight Alzheimer Disease Research Center, Washington University School of Medicine, St. Louis, MO, USA.,Department of Neurology, Washington University School of Medicine, St. Louis, MO, USA.,Departments of Pathology and Immunology, Washington University School of Medicine, St. Louis, MO, USA
| |
Collapse
|
9
|
Zhang S, Qu Y, Cheng Y, Lopez OL, Wahed AS. Prognostic accuracy for predicting ordinal competing risk outcomes using ROC surfaces. LIFETIME DATA ANALYSIS 2022; 28:1-22. [PMID: 34807357 DOI: 10.1007/s10985-021-09539-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/05/2020] [Accepted: 10/08/2021] [Indexed: 06/13/2023]
Abstract
Many medical conditions are marked by a sequence of events in association with continuous changes in biomarkers. Few works have evaluated the overall accuracy of a biomarker in predicting disease progression. We thus extend the concept of receiver operating characteristic (ROC) surface and the volume under the surface (VUS) from multi-category outcomes to ordinal competing-risk outcomes that are also subject to noninformative censoring. Two VUS estimators are considered. One is based on the definition of the ROC surface and obtained by integrating the estimated ROC surface. The other is an inverse probability weighted U estimator that is built upon the equivalence of the VUS to the concordance probability between the marker and sequential outcomes. Both estimators have nice asymptotic results that can be derived using counting process techniques and U-statistics theory. We illustrate their good practical performances through simulations and applications to two studies of cognition and a transplant dataset.
Collapse
Affiliation(s)
- Song Zhang
- Department of Biostatistics, University of Pittsburgh, Pennsylvania, PA, 15260, USA
| | - Yang Qu
- Department of Statistics, University of Pittsburgh, Pennsylvania, PA, 15260, USA
| | - Yu Cheng
- Departments of Statistics and Biostatistics, University of Pittsburgh, Pennsylvania, PA, 15260, USA.
| | - Oscar L Lopez
- Departments of Neurology, Psychiatry, and Clinical and Translational Sciences, University of Pittsburgh, Pennsylvania, PA, 15260, USA
| | - Abdus S Wahed
- Department of Biostatistics, University of Pittsburgh, Pennsylvania, PA, 15260, USA
| |
Collapse
|
10
|
Rahman H, Zhao Y. Empirical likelihood confidence interval for sensitivity to the early disease stage. Pharm Stat 2021; 21:566-583. [PMID: 34962077 PMCID: PMC9050820 DOI: 10.1002/pst.2186] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2021] [Revised: 08/30/2021] [Accepted: 10/02/2021] [Indexed: 11/09/2022]
Abstract
Disease status can naturally be classified into three or more ordinal stages rather than just being binary stages. Many works have been done for the estimation and inference procedure regarding three ordinal disease stages, which are non-disease, early disease, and full disease stages. The early disease stage can be very important for therapeutic intervention and prevention potentiality. As a diagnostic measure, sensitivity to the early disease stage is often used. In this article, we propose confidence intervals for the sensitivity to early disease stage based on given target specificity for non-disease stage and target sensitivity to full disease stage using both empirical likelihood (EL) and adjusted EL procedures. We compare the performance of the proposed EL confidence intervals with other procedures in our simulation study. The proposed procedures are further applied to Alzheimer's Disease Neuroimaging Initiative data set.
Collapse
Affiliation(s)
- Husneara Rahman
- Department of Mathematics and Statistics, Georgia State University, Atlanta, Georgia, USA
| | - Yichuan Zhao
- Department of Mathematics and Statistics, Georgia State University, Atlanta, Georgia, USA
| | | |
Collapse
|
11
|
Stämmler F, Grassi M, Meeusen JW, Lieske JC, Dasari S, Dubourg L, Lemoine S, Ehrich J, Schiffer E. Estimating Glomerular Filtration Rate from Serum Myo-Inositol, Valine, Creatinine and Cystatin C. Diagnostics (Basel) 2021; 11:2291. [PMID: 34943527 PMCID: PMC8700166 DOI: 10.3390/diagnostics11122291] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2021] [Revised: 11/26/2021] [Accepted: 11/30/2021] [Indexed: 11/22/2022] Open
Abstract
Assessment of renal function relies on the estimation of the glomerular filtration rate (eGFR). Existing eGFR equations, usually based on serum levels of creatinine and/or cystatin C, are not uniformly accurate across patient populations. In the present study, we expanded a recent proof-of-concept approach to optimize an eGFR equation targeting the adult population with and without chronic kidney disease (CKD), based on a nuclear magnetic resonance spectroscopy (NMR) derived 'metabolite constellation' (GFRNMR). A total of 1855 serum samples were partitioned into development, internal validation and external validation datasets. The new GFRNMR equation used serum myo-inositol, valine, creatinine and cystatin C plus age and sex. GFRNMR had a lower bias to tracer measured GFR (mGFR) than existing eGFR equations, with a median bias (95% confidence interval [CI]) of 0.0 (-1.0; 1.0) mL/min/1.73 m2 for GFRNMR vs. -6.0 (-7.0; -5.0) mL/min/1.73 m2 for the Chronic Kidney Disease Epidemiology Collaboration equation that combines creatinine and cystatin C (CKD-EPI2012) (p < 0.0001). Accuracy (95% CI) within 15% of mGFR (1-P15) was 38.8% (34.3; 42.5) for GFRNMR vs. 47.3% (43.2; 51.5) for CKD-EPI2012 (p < 0.010). Thus, GFRNMR holds promise as an alternative way to assess eGFR with superior accuracy in adult patients with and without CKD.
Collapse
Affiliation(s)
- Frank Stämmler
- Department of Research and Development, numares AG, 93053 Regensburg, Germany; (F.S.); (M.G.)
| | - Marcello Grassi
- Department of Research and Development, numares AG, 93053 Regensburg, Germany; (F.S.); (M.G.)
| | - Jeffrey W. Meeusen
- Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, MN 55905, USA; (J.W.M.); (J.C.L.)
| | - John C. Lieske
- Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, MN 55905, USA; (J.W.M.); (J.C.L.)
- Division of Nephrology and Hypertension, Mayo Clinic, Rochester, MN 55905, USA
| | - Surendra Dasari
- Division of Endocrinology, Diabetes, Metabolism, and Nutrition, Mayo Clinic, Rochester, MN 55905, USA;
| | - Laurence Dubourg
- Service d’Explorations Fonctionnelles Rénales et Métaboliques, Hôpital Edouard Herriot, 69437 Lyon, France; (L.D.); (S.L.)
| | - Sandrine Lemoine
- Service d’Explorations Fonctionnelles Rénales et Métaboliques, Hôpital Edouard Herriot, 69437 Lyon, France; (L.D.); (S.L.)
| | - Jochen Ehrich
- Children’s Hospital, Hannover Medical School, 30625 Hannover, Germany;
| | - Eric Schiffer
- Department of Research and Development, numares AG, 93053 Regensburg, Germany; (F.S.); (M.G.)
| |
Collapse
|
12
|
Feng Y, Tian L. Flexible diagnostic measures and new cut-point selection methods under multiple ordered classes. Pharm Stat 2021; 21:220-240. [PMID: 34449107 DOI: 10.1002/pst.2166] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2021] [Revised: 07/21/2021] [Accepted: 08/01/2021] [Indexed: 11/08/2022]
Abstract
Medical diagnosis is essentially a classification problem and usually it is done with multiple ordered classes. For example, cancer diagnosis might be "non-malignant," "early stage," or "late stage." Therefore, appropriate measures are needed to assess the accuracy of diagnostic markers under multiple ordered classes. However, all existing measures fail to differentiate among some distinctly different biomarkers. This paper presents a multi-step procedure for evaluating biomarker accuracy under multiple ordered classes. This procedure leads to two new flexible overall measures as well as three new cut-point selection methods with great computational ease. The performance of proposed measures and cut-point selection methods are numerically explored via a simulation study. In the end, an ovarian cancer dataset from the Prostate, Lung, Colorectal, and Ovarian cancer study is analyzed. The proposed accuracy measures were estimated for markers CA125 and HE4, and cut-points were estimated for the risk of ovarian malignancy algorithm score.
Collapse
Affiliation(s)
- Yingdong Feng
- Department of Biostatistics, University at Buffalo, Buffalo, New York, USA
| | - Lili Tian
- Department of Biostatistics, University at Buffalo, Buffalo, New York, USA
| |
Collapse
|
13
|
Li H, Leurgans S, Elm J, Gebregziabher M. Statistical Methodology for Multiclass Classifications: Applications to Dementia. J Alzheimers Dis 2020; 68:173-186. [PMID: 30741679 DOI: 10.3233/jad-180580] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Alzheimer's disease (AD) is a common, devastating disease which carries a heavy economic burden. Accelerated efforts to identify presymptomatic stages of AD and biomarkers to classify the disease are urgent needs. Currently, no biomarkers can perfectly discriminate individuals into multiple disease categories of AD (no cognitive impairment, mild cognitive impairment, and dementia). Although many biomarkers for diagnosis and their various features are being studied, we lack advanced statistical methods which can fully utilize biomarkers to classify AD accurately, thereby facilitating evaluation of putative markers both alone and in combination. In this paper, we propose two approaches: 1) a forward addition procedure in which we adapt an additive logistic regression model to the setting for disease with ordered multiple categories. Using this approach, we select and combine multiple cross-sectional biomarkers to improve diagnostic accuracy, and 2) a method by extending the Neyman-Pearson Lemma to the ordered three disease categories to construct optimal cutoff points to distinguish multiple disease categories. We evaluate the robustness of the proposed model using a simulation study. Then we apply these two methods to data from the Religious Orders Study to examine the feasibility of combining biomarkers, and compare the diagnostic accuracy between the proposed methods and existing methods including model-based methods (ordinal logistic regression and quadratic discriminant analysis), a tree-based method CART, and the Youden index method. The two proposed methods facilitate evaluations of biomarkers for conditions with graded, rather than binary, classifications. The evaluation of the performance of different approaches provides guidance of how to choose approaches to address research questions.
Collapse
Affiliation(s)
- Hong Li
- Department of Public Health Sciences, Medical University of South Carolina, Charleston, SC, USA
| | - Sue Leurgans
- Department of Neurological Sciences & Preventive Medicine, Rush University Medical Center Chicago, IL, USA
| | - Jordan Elm
- Department of Public Health Sciences, Medical University of South Carolina, Charleston, SC, USA
| | - Mulugeta Gebregziabher
- Department of Public Health Sciences, Medical University of South Carolina, Charleston, SC, USA
| |
Collapse
|
14
|
Noll S, Furrer R, Reiser B, Nakas CT. Inference in receiver operating characteristic surface analysis via a trinormal model‐based testing approach. Stat (Int Stat Inst) 2020. [DOI: 10.1002/sta4.249] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Affiliation(s)
- Samuel Noll
- Department of MathematicsUniversity of Zurich Zurich Switzerland
| | - Reinhard Furrer
- Department of MathematicsUniversity of Zurich Zurich Switzerland
- Department of Computational ScienceUniversity of Zurich Zurich Switzerland
| | | | - Christos T. Nakas
- Department of Agriculture, Crop Production and Rural EnvironmentUniversity of Thessaly Volos 38446 Greece
- Department of Clinical ChemistryInselspital, Bern University Hospital, University of Bern Bern Switzerland
| |
Collapse
|
15
|
deCastro BR. Cumulative ROC curves for discriminating three or more ordinal outcomes with cutpoints on a shared continuous measurement scale. PLoS One 2019; 14:e0221433. [PMID: 31469848 PMCID: PMC6716631 DOI: 10.1371/journal.pone.0221433] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2019] [Accepted: 08/06/2019] [Indexed: 11/19/2022] Open
Abstract
Cumulative receiver operator characteristic (ROC) curve analysis extends classic ROC curve analysis to discriminate three or more ordinal outcome levels on a shared continuous scale. The procedure combines cumulative logit regression with a cumulative extension to the ROC curve and performs as expected with ternary (three-level) ordinal outcomes under a variety of simulated conditions (unbalanced data, proportional and non-proportional odds, areas under the ROC curve [AUCs] from 0.70 to 0.95). Simulations also compared several criteria for selecting cutpoints to discriminate outcome levels: the Youden Index, Matthews Correlation Coefficient, Total Accuracy, and Markedness. Total Accuracy demonstrated the least absolute percent-bias. Cutpoints computed from maximum likelihood regression parameters demonstrated bias that was often negligible. The procedure was also applied to publicly available data related to computer imaging and biomarker exposure science, yielding good to excellent AUCs, as well as cutpoints with sensitivities and specificities of commensurate quality. Implementation of cumulative ROC curve analysis and extension to more than three outcome levels are straightforward. The author's programs for ternary ordinal outcomes are publicly available.
Collapse
Affiliation(s)
- B. Rey deCastro
- National Center for Environmental Health, Centers for Disease Control and Prevention, Atlanta, Georgia, United States of America
| |
Collapse
|
16
|
Zhu R, Ghosal S. Bayesian nonparametric estimation of ROC surface under verification bias. Stat Med 2019; 38:3361-3377. [DOI: 10.1002/sim.8181] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2018] [Revised: 02/19/2019] [Accepted: 04/06/2019] [Indexed: 12/11/2022]
Affiliation(s)
- Rui Zhu
- Department of StatisticsNorth Carolina State University Raleigh North Carolina
| | - Subhashis Ghosal
- Department of StatisticsNorth Carolina State University Raleigh North Carolina
| |
Collapse
|
17
|
Bantis LE, Feng Z. Comparison of two correlated ROC surfaces at a given pair of true classification rates. Stat Med 2018; 37:4022-4035. [DOI: 10.1002/sim.7894] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2017] [Revised: 01/04/2018] [Accepted: 03/08/2018] [Indexed: 11/12/2022]
Affiliation(s)
- Leonidas E. Bantis
- Department of Biostatistics; The University of Texas MD Anderson Cancer Center; Houston Texas 77030
| | - Ziding Feng
- Department of Biostatistics; The University of Texas MD Anderson Cancer Center; Houston Texas 77030
| |
Collapse
|
18
|
Wang D, Feng Y, Attwood K, Tian L. Optimal threshold selection methods under tree or umbrella ordering. J Biopharm Stat 2018; 29:98-114. [DOI: 10.1080/10543406.2018.1489410] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
Affiliation(s)
- Dan Wang
- TTx/Biomarker Statistics, Eli Lilly and Company, Lilly Corporate Center, Indianapolis, IN, USA
| | - Yingdong Feng
- Department of Biostatistics, University at Buffalo, Buffalo, NY, USA
| | - Kristopher Attwood
- Department of Biostatistics and Bioinformatics, Roswell Park Cancer Institute, Buffalo, NY, USA
| | - Lili Tian
- Department of Biostatistics, University at Buffalo, Buffalo, NY, USA
| |
Collapse
|
19
|
Obuchowski NA, Bullen JA. Receiver operating characteristic (ROC) curves: review of methods with applications in diagnostic medicine. ACTA ACUST UNITED AC 2018; 63:07TR01. [DOI: 10.1088/1361-6560/aab4b1] [Citation(s) in RCA: 149] [Impact Index Per Article: 24.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
|
20
|
Xiong C, Luo J, Chen L, Gao F, Liu J, Wang G, Bateman R, Morris JC. Estimating diagnostic accuracy for clustered ordinal diagnostic groups in the three-class case-Application to the early diagnosis of Alzheimer disease. Stat Methods Med Res 2018; 27:701-714. [PMID: 29182052 PMCID: PMC5841923 DOI: 10.1177/0962280217742539] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Many medical diagnostic studies involve three ordinal diagnostic populations in which the diagnostic accuracy can be summarized by the volume or partial volume under the receiver operating characteristic surface for a diagnostic marker. When the diagnostic populations are clustered, e.g. by families, we propose to model the diagnostic marker by a general linear mixed model that takes into account of the correlation on the diagnostic marker from members of the same clusters. This model then facilitates the maximum likelihood estimation and statistical inferences of the diagnostic accuracy for the diagnostic marker. This approach naturally allows the incorporation of covariates as well as missing data when some clusters do not have subjects on all diagnostic groups in the estimation of, and the subsequent inferences on the diagnostic accuracy. We further study the performance of the proposed methods in a large simulation study with clustered data. Finally, we apply the proposed methodology to the data of several biomarkers collected by the Dominantly Inherited Alzheimer Network, an international family-clustered registry to study autosomal dominant Alzheimer disease which is a rare form of Alzheimer disease caused by mutations in any of the three genes including the amyloid precursor protein, presenilin 1 and presenilin 2. We estimate the accuracy of several cerebrospinal fluid and neuroimaging biomarkers in differentiating three diagnostic and genetic populations: normal non-mutation carriers, asymptomatic mutation carriers, and symptomatic mutation carriers.
Collapse
Affiliation(s)
- Chengjie Xiong
- Division of Biostatistics, Washington University in St. Louis, St. Louis, MO, U.S.A
- Department of Neurology, Washington University in St. Louis, St. Louis, MO, U.S.A
| | - Jingqin Luo
- Division of Public health, Department of Surgery, Washington University in St. Louis, St. Louis, MO, U.S.A
- Biostatistics Core, Siteman Cancer Center, Washington University in St. Louis, St. Louis, MO, U.S.A
| | - Ling Chen
- Division of Biostatistics, Washington University in St. Louis, St. Louis, MO, U.S.A
| | - Feng Gao
- Division of Public health, Department of Surgery, Washington University in St. Louis, St. Louis, MO, U.S.A
- Biostatistics Core, Siteman Cancer Center, Washington University in St. Louis, St. Louis, MO, U.S.A
| | - Jingxia Liu
- Division of Public health, Department of Surgery, Washington University in St. Louis, St. Louis, MO, U.S.A
- Biostatistics Core, Siteman Cancer Center, Washington University in St. Louis, St. Louis, MO, U.S.A
| | - Guoqiao Wang
- Division of Biostatistics, Washington University in St. Louis, St. Louis, MO, U.S.A
- Department of Neurology, Washington University in St. Louis, St. Louis, MO, U.S.A
| | - Randall Bateman
- Department of Neurology, Washington University in St. Louis, St. Louis, MO, U.S.A
| | - John C. Morris
- Department of Neurology, Washington University in St. Louis, St. Louis, MO, U.S.A
- Departments of Pathology and Immunology, Washington University in St. Louis, St. Louis, MO, U.S.A
| |
Collapse
|
21
|
Feng Y, Tian L. Measuring diagnostic accuracy for biomarkers under tree-ordering. Stat Methods Med Res 2018; 28:1328-1346. [DOI: 10.1177/0962280218755810] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023]
Abstract
In the field of diagnostic studies for tree or umbrella ordering, under which the marker measurement for one class is lower or higher than those for the rest unordered classes, there exist a few diagnostic measures such as the naive AUC ( NAUC), the umbrella volume ( UV), and the recently proposed TAUC, i.e. area under a ROC curve for tree or umbrella ordering (TROC). However, an important characteristic about tree or umbrella ordering has been neglected. This paper mainly focuses on promoting the use of the integrated false negative rate under tree ordering ( ITFNR) as an additional diagnostic measure besides TAUC, and proposing the idea of using ( TAUC, ITFNR) instead of TAUC to evaluate the diagnostic accuracy of a biomarker under tree or umbrella ordering. Parametric and non-parametric approaches for constructing joint confidence region of ( TAUC, ITFNR) are proposed. Simulation studies under a variety of settings are carried out to assess and compare the performance of these methods. In the end, a published microarray data set is analyzed.
Collapse
Affiliation(s)
- Yingdong Feng
- Department of Biostatistics, University at Buffalo, Buffalo, NY, USA
| | - Lili Tian
- Department of Biostatistics, University at Buffalo, Buffalo, NY, USA
| |
Collapse
|
22
|
Farahani ES, Choudhury SH, Cortese F, Costello F, Goodyear B, Smith MR. Three-way ROC validation of rs-fMRI visual information propagation transfer functions used to differentiate between RRMS and CIS optic neuritis patients. ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. ANNUAL INTERNATIONAL CONFERENCE 2017; 2017:525-528. [PMID: 29059925 DOI: 10.1109/embc.2017.8036877] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/28/2023]
Abstract
Resting-state fMRI (rs-fMRI) measures the temporal synchrony between different brain regions while the subject is at rest. We present an investigation using visual information propagation transfer functions as potential optic neuritis (ON) markers for the pathways between the lateral geniculate nuclei, the primary visual cortex, the lateral occipital cortex and the superior parietal cortex. We investigate marker reliability in differentiating between healthy controls and ON patients with clinically isolated syndrome (CIS), and relapsing-remitting multiple sclerosis (RRMS) using a three-way receiver operating characteristics analysis. We identify useful and reliable three-way ON related metrics in the rs-fMRI low-frequency band 0.0 Hz to 0.1 Hz, with potential markers associated with the higher frequency harmonics of these signals in the 0.1 Hz to 0.2 Hz and 0.2 Hz to 0.3 Hz bands.
Collapse
|
23
|
Li J, Fine JP, Pencina MJ. Multi-category diagnostic accuracy based on logistic regression. ACTA ACUST UNITED AC 2017. [DOI: 10.1080/24754269.2017.1319105] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Affiliation(s)
- Jialiang Li
- Department of Statistics and Applied Probability, Duke-NUS Graduate Medical School, Singapore Eye Research Institute, National University of Singapore, Singapore
| | - Jason P. Fine
- Department of Biostatistics, University of North Carolina, Chapel Hill, NC, USA
| | | |
Collapse
|
24
|
Balaguer M, Alejandre C, Vila D, Esteban E, Carrasco JL, Cambra FJ, Jordan I. Bronchiolitis Score of Sant Joan de Déu: BROSJOD Score, validation and usefulness. Pediatr Pulmonol 2017; 52:533-539. [PMID: 28328090 DOI: 10.1002/ppul.23546] [Citation(s) in RCA: 35] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/26/2016] [Revised: 06/15/2016] [Accepted: 07/18/2016] [Indexed: 11/10/2022]
Abstract
OBJECTIVE To validate the bronchiolitis score of Sant Joan de Déu (BROSJOD) and to examine the previously defined scoring cutoff. PATIENTS AND METHODS Prospective, observational study. BROSJOD scoring was done by two independent physicians (at admission, 24 and 48 hr). Internal consistency of the score was assessed using Cronbach's α. To determine inter-rater reliability, the concordance correlation coefficient estimated as an intraclass correlation coefficient (CCC) and limits of agreement estimated as the 90% total deviation index (TDI) were estimated. An expert opinion was used to classify patients according to clinical severity. A validity analysis was conducted comparing the 3-level classification score to that expert opinion. Volume under the surface (VUS), predictive values, and probability of correct classification (PCC) were measured to assess discriminant validity. RESULTS About 112 patients were recruited, 62 of them (55.4%) males. Median age: 52.5 days (IQR: 32.75-115.25). The admission Cronbach's α was 0.77 (CI95%: 0.71; 0.82) and at 24 hr it was 0.65 (CI95%: 0.48; 0.7). The inter-rater reliability analysis was: CCC at admission 0.96 (95%CI 0.94-0.97), at 24 h 0.77 (95%CI 0.65-0.86), and at 48 hr 0.94 (95%CI 0.94-0.97); TDI 90%: 1.6, 2.9, and 1.57, respectively. The discriminant validity at admission: VUS of 0.8 (95%CI 0.70-0.90), at 24 h 0.92 (95%CI 0.85-0.99), and at 48 hr 0.93 (95%CI 0.87-0.99). The predictive values and PCC values were within 38-100% depending on the level of clinical severity. CONCLUSION There is a high inter-rater reliability, showing the BROSJOD score to be reliable and valid, even when different observers apply it. Pediatr Pulmonol. 2017;52:533-539. © 2016 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- Mònica Balaguer
- Pediatric Critical Care Unit, Hospital Sant Joan de Deu, Pg. Sant Joan de Deu n°2, Esplugues de Llobregat 08950, Barcelona, Spain
| | - Carme Alejandre
- Pediatric Critical Care Unit, Hospital Sant Joan de Deu, Pg. Sant Joan de Deu n°2, Esplugues de Llobregat 08950, Barcelona, Spain
| | - David Vila
- Pediatric Critical Care Unit, Hospital Sant Joan de Deu, Pg. Sant Joan de Deu n°2, Esplugues de Llobregat 08950, Barcelona, Spain
| | - Elisabeth Esteban
- Pediatric Critical Care Unit, Hospital Sant Joan de Deu, Pg. Sant Joan de Deu n°2, Esplugues de Llobregat 08950, Barcelona, Spain
| | - Josep L Carrasco
- Biostatistics, Public Health Department, University of Barcelona, Barcelona, Spain
| | - Francisco José Cambra
- Pediatric Critical Care Unit, Hospital Sant Joan de Deu, Pg. Sant Joan de Deu n°2, Esplugues de Llobregat 08950, Barcelona, Spain
| | - Iolanda Jordan
- Pediatric Critical Care Unit, Hospital Sant Joan de Deu, Pg. Sant Joan de Deu n°2, Esplugues de Llobregat 08950, Barcelona, Spain.,Paediatric Intensive Care Unit, CIBERESP, Agrupación Hospitalaria Clínic-Sant Joan de Déu, Esplugues de Llobregat, Barcelona, Spain
| |
Collapse
|
25
|
Wellens HLL, BeGole EA, Kuijpers-Jagtman AM. ROC surface assessment of the ANB angle and Wits appraisal’s diagnostic performance with a statistically derived ‘gold standard’: does normalizing measurements have any merit? Eur J Orthod 2017; 39:358-364. [DOI: 10.1093/ejo/cjx002] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
|
26
|
Li J, Feng Q, Fine JP, Pencina MJ, Van Calster B. Nonparametric estimation and inference for polytomous discrimination index. Stat Methods Med Res 2017; 27:3092-3103. [DOI: 10.1177/0962280217692830] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
Polytomous discrimination index is a novel and important diagnostic accuracy measure for multi-category classification. After reconstructing its probabilistic definition, we propose a nonparametric approach to the estimation of polytomous discrimination index based on an empirical sample of biomarker values. In this paper, we provide the finite-sample and asymptotic properties of the proposed estimators and such analytic results may facilitate the statistical inference. Simulation studies are performed to examine the performance of the nonparametric estimators. Two real data examples are analysed to illustrate our methodology.
Collapse
Affiliation(s)
- Jialiang Li
- National University of Singapore, Singapore, Singapore
- Duke-NUS Graduate Medical School, Singapore, Singapore
- Singapore Eye Research Institute, Singapore, Singapore
| | - Qunqiang Feng
- National University of Singapore, Singapore, Singapore
- University of Science and Technology of China, Hefei Shi, China
| | | | | | | |
Collapse
|
27
|
Xu W, Chen Z, Zhang Y, Cheng L. Order Statistics Concordance Coefficient With Applications to Multichannel Biosignal Analysis. IEEE J Biomed Health Inform 2016; 21:1206-1215. [PMID: 27740502 DOI: 10.1109/jbhi.2016.2616512] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
In this paper, we propose a novel concordance coefficient, called order statistics concordance coefficient (OSCOC), to quantify the association among multichannel biosignals. To uncover its properties, we compare OSCOC with three other similar indexes, i.e., average Pearson's product moment correlation coefficient (APPMCC), Kendall's concordance coefficients (KCC), and average Kendall's tau (AKT), under a multivariate normal model (MNM), linear model (LM), and nonlinear model. To further demonstrate its usefulness, we present an example on atrial arrhythmia analysis based on real-world multichannel cardiac signals. Theoretical derivations as well as numerical results suggest that 1) under MNM and LM, OSCOC performs equally well with APPMCC, and outperforms the other two methods, 2) in nonlinear case, OSCOC even has better performance than KCC and AKT, which are well known to be robust under increasing nonlinear transformations, and 3) OSCOC performs the best in the case study of arrhythmia analysis in terms of the volume under the surface.
Collapse
|
28
|
Samhaber KT, Buhl T, Brauns B, Hofmann L, Mitteldorf C, Seitz CS, Schön MP, Rosenberger A, Haenssle HA. Morphologic criteria of vesiculobullous skin disorders by in vivo reflectance confocal microscopy. J Dtsch Dermatol Ges 2016; 14:797-805. [DOI: 10.1111/ddg.13058] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
Affiliation(s)
- Kinga T. Samhaber
- Department of Dermatology, Venereology, and Allergology; University Medical Center; Göttingen Germany
| | - Timo Buhl
- Department of Dermatology, Venereology, and Allergology; University Medical Center; Göttingen Germany
| | - Birka Brauns
- Department of Dermatology, Venereology, and Allergology; University Medical Center; Rostock Germany
| | - Lars Hofmann
- Department of Dermatology; University Hospital Erlangen, Friedrich-Alexander-University Erlangen-Nuremberg; Germany
| | - Christina Mitteldorf
- Department of Dermatology, Venereology, and Allergology; Medical Center Hildesheim GmbH; Hildesheim Germany
| | - Cornelia S. Seitz
- Department of Dermatology, Venereology, and Allergology; University Medical Center; Göttingen Germany
| | - Michael P. Schön
- Department of Dermatology, Venereology, and Allergology; University Medical Center; Göttingen Germany
| | - Albert Rosenberger
- Institute of Genetic Epidemiology; University Medical Center; Göttingen Germany
| | - Holger A. Haenssle
- Department of Dermatology, Venereology, and Allergology; University Medical Center; Heidelberg Germany
| |
Collapse
|
29
|
Samhaber KT, Buhl T, Brauns B, Hofmann L, Mitteldorf C, Seitz CS, Schön MP, Rosenberger A, Haenssle HA. Morphologische Kriterien vesikulobullöser Hauterkrankungen in der konfokalen In-vivo-Laserscanmikroskopie. J Dtsch Dermatol Ges 2016; 14:797-806. [DOI: 10.1111/ddg.13058_g] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
- Kinga T. Samhaber
- Klinik für Dermatologie, Venerologie und Allergologie; Universitätsmedizin Göttingen; Deutschland
| | - Timo Buhl
- Klinik für Dermatologie, Venerologie und Allergologie; Universitätsmedizin Göttingen; Deutschland
| | - Birka Brauns
- Klinik für Dermatologie, Venerologie und Allergologie; Universitätsmedizin Göttingen; Deutschland
| | - Lars Hofmann
- Hautklinik; Universitätsklinikum Erlangen, Friedrich-Alexander-Universität, Erlangen-Nürnberg; Deutschland
| | - Christina Mitteldorf
- Klinik für Dermatologie, Venerologie und Allergologie; Universitätsmedizin Göttingen; Deutschland
| | - Cornelia S. Seitz
- Klinik für Dermatologie, Venerologie und Allergologie; Universitätsmedizin Göttingen; Deutschland
| | - Michael P. Schön
- Klinik für Dermatologie, Venerologie und Allergologie; Universitätsmedizin Göttingen; Deutschland
| | - Albert Rosenberger
- Abteilung Genetische Epidemiologie; Universitätsmedizin Göttingen; Deutschland
| | - Holger A. Haenssle
- Klinik für Dermatologie, Venerologie und Allergologie; Universitätsmedizin Göttingen; Deutschland
| |
Collapse
|
30
|
Zhang Y, Alonzo TA. Inverse probability weighting estimation of the volume under the ROC surface in the presence of verification bias. Biom J 2016; 58:1338-1356. [PMID: 27338713 DOI: 10.1002/bimj.201500225] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2015] [Revised: 02/28/2016] [Accepted: 03/10/2016] [Indexed: 11/08/2022]
Abstract
In diagnostic medicine, the volume under the receiver operating characteristic (ROC) surface (VUS) is a commonly used index to quantify the ability of a continuous diagnostic test to discriminate between three disease states. In practice, verification of the true disease status may be performed only for a subset of subjects under study since the verification procedure is invasive, risky, or expensive. The selection for disease examination might depend on the results of the diagnostic test and other clinical characteristics of the patients, which in turn can cause bias in estimates of the VUS. This bias is referred to as verification bias. Existing verification bias correction in three-way ROC analysis focuses on ordinal tests. We propose verification bias-correction methods to construct ROC surface and estimate the VUS for a continuous diagnostic test, based on inverse probability weighting. By applying U-statistics theory, we develop asymptotic properties for the estimator. A Jackknife estimator of variance is also derived. Extensive simulation studies are performed to evaluate the performance of the new estimators in terms of bias correction and variance. The proposed methods are used to assess the ability of a biomarker to accurately identify stages of Alzheimer's disease.
Collapse
Affiliation(s)
- Ying Zhang
- Department of Biostatistics, University of Southern California, Keck School of Medicine, Los Angeles, California 90033, USA.
| | - Todd A Alonzo
- Department of Biostatistics, University of Southern California, Keck School of Medicine, Los Angeles, California 90033, USA
| | | |
Collapse
|
31
|
Ossima ADN, Belkacemi MC, Daurès JP. Using Adjacent-category Logits Procedure for Estimating Receiver Operating Characteristic Surface. COMMUN STAT-SIMUL C 2016. [DOI: 10.1080/03610918.2013.879888] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
Affiliation(s)
- Arnaud D. Nze Ossima
- Laboratoire de Biostatistiques, Épidémiologie, et Santé Publique, Institut Universitaire de Recherche Clinique, Université de Montpellier 1, Montpellier, France
| | - Mohamed C. Belkacemi
- Laboratoire de Biostatistiques, Épidémiologie, et Santé Publique, Institut Universitaire de Recherche Clinique, Université de Montpellier 1, Montpellier, France
| | - Jean-Pierre Daurès
- Laboratoire de Biostatistiques, Épidémiologie, et Santé Publique, Institut Universitaire de Recherche Clinique, Université de Montpellier 1, Montpellier, France
| |
Collapse
|
32
|
Hong C, Won C. Parameter estimation for the imbalanced credit scoring data using AUC maximization. KOREAN JOURNAL OF APPLIED STATISTICS 2016. [DOI: 10.5351/kjas.2016.29.2.309] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
|
33
|
Wang D, Attwood K, Tian L. Receiver operating characteristic analysis under tree orderings of disease classes. Stat Med 2015; 35:1907-26. [DOI: 10.1002/sim.6843] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2015] [Revised: 11/15/2015] [Accepted: 11/19/2015] [Indexed: 11/11/2022]
Affiliation(s)
- Dan Wang
- Department of Biostatistics & Bioinformatics; Roswell Park Cancer Institute; Elm and Carlton Streets Buffalo 14263 NY U.S.A
- Department of Biostatistics; SUNY University at Buffalo; 3435 Main St. Buffalo 14214 NY U.S.A
| | - Kristopher Attwood
- Department of Biostatistics & Bioinformatics; Roswell Park Cancer Institute; Elm and Carlton Streets Buffalo 14263 NY U.S.A
| | - Lili Tian
- Department of Biostatistics & Bioinformatics; Roswell Park Cancer Institute; Elm and Carlton Streets Buffalo 14263 NY U.S.A
- Department of Biostatistics; SUNY University at Buffalo; 3435 Main St. Buffalo 14214 NY U.S.A
| |
Collapse
|
34
|
Dong T, Attwood K, Hutson A, Liu S, Tian L. A new diagnostic accuracy measure and cut-point selection criterion. Stat Methods Med Res 2015; 26:2832-2852. [PMID: 26486150 DOI: 10.1177/0962280215611631] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023]
Abstract
Most diagnostic accuracy measures and criteria for selecting optimal cut-points are only applicable to diseases with binary or three stages. Currently, there exist two diagnostic measures for diseases with general k stages: the hypervolume under the manifold and the generalized Youden index. While hypervolume under the manifold cannot be used for cut-points selection, generalized Youden index is only defined upon correct classification rates. This paper proposes a new measure named maximum absolute determinant for diseases with k stages ([Formula: see text]). This comprehensive new measure utilizes all the available classification information and serves as a cut-points selection criterion as well. Both the geometric and probabilistic interpretations for the new measure are examined. Power and simulation studies are carried out to investigate its performance as a measure of diagnostic accuracy as well as cut-points selection criterion. A real data set from Alzheimer's Disease Neuroimaging Initiative is analyzed using the proposed maximum absolute determinant.
Collapse
Affiliation(s)
- Tuochuan Dong
- 1 Department of Biostatistics, University at Buffalo, Buffalo, NY, USA
| | - Kristopher Attwood
- 2 Department of Biostatistics and Bioinformatics, Roswell Park Cancer Institute, Buffalo, NY, USA
| | - Alan Hutson
- 1 Department of Biostatistics, University at Buffalo, Buffalo, NY, USA
| | - Song Liu
- 2 Department of Biostatistics and Bioinformatics, Roswell Park Cancer Institute, Buffalo, NY, USA
| | - Lili Tian
- 1 Department of Biostatistics, University at Buffalo, Buffalo, NY, USA
| |
Collapse
|
35
|
Wan S, Zhang B. Using proportional odds models for semiparametric ROC surface estimation. Stat Probab Lett 2015. [DOI: 10.1016/j.spl.2015.06.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
36
|
Chang CH, Lin LC, Chen IC, Yen CH. Assessing the suitability of sets-based approaches: estimating the discriminative power of risk models for ordinal outcome treatments. Int Health 2015; 9:69-75. [PMID: 26409872 DOI: 10.1093/inthealth/ihv058] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2015] [Revised: 06/29/2015] [Accepted: 07/14/2015] [Indexed: 11/14/2022] Open
Abstract
BACKGROUND In order to evaluate the discrimination performance of an ordinal model for improved disease screening, a new test was proposed where information was obtained across all samples simultaneously. METHODS The ordinal c-index builds upon the volume under the surface methodology without focusing on the accompanying receiver operating characteristic surfaces. However, it can be simplified to an average of pairwise c-indexes. In this paper, a set-based estimate (information was obtained across all samples simultaneously) was proposed by summing all correctly ordered groups. The asymptotic distribution of this proposed estimate was derived using U-statistics. RESULTS A predictive model was applied using the blood urea nitrogen/creatinine ratio to discriminate stroke in evolution in acute ischemic stroke patients, which could potentially be life-saving in emergency departments. CONCLUSIONS By conducting Monte Carlo simulations, it was concluded that the measure proposed herein is a better choice for clinical use because of the asymmetry of the predicted probabilities of groups.
Collapse
Affiliation(s)
- Chia-Hao Chang
- College of Nursing & the Chronic Diseases and Health Promotion Research Center, Chang Gung University of Science and Technology, Chiayi Campus, Chiayi, Taiwan .,Department of Nursing, Chang Gung University of Science and Technology, Chiayi Campus, Chiayi, Taiwan
| | - Leng-Chieh Lin
- Department of Nursing, Chang Gung University of Science and Technology, Chiayi Campus, Chiayi, Taiwan.,Department of Emergency Medicine, Chang Gung Memorial Hospital, Chiayi, Taiwan
| | - I-Chuan Chen
- Department of Nursing, Chang Gung University of Science and Technology, Chiayi Campus, Chiayi, Taiwan.,Department of Emergency Medicine, Chang Gung Memorial Hospital, Chiayi, Taiwan
| | - Ching-Ho Yen
- Department of Industrial Engineering & Management Information, Huafan University, New Taipei City, Taiwan
| |
Collapse
|
37
|
Hong CS, Cho MH. VUS and HUM Represented with Mann-Whitney Statistic. COMMUNICATIONS FOR STATISTICAL APPLICATIONS AND METHODS 2015. [DOI: 10.5351/csam.2015.22.3.223] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Affiliation(s)
- Chong Sun Hong
- Department of Statistics, Sungkyunkwan University, Korea
| | - Min Ho Cho
- Department of Statistics, Sungkyunkwan University, Korea
| |
Collapse
|
38
|
Sultana and Jialiang Li MP, Hu J. Comparison of three-dimensional ROC surfaces for clustered and correlated markers, with a proteomics application. STAT NEERL 2015. [DOI: 10.1111/stan.12065] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
| | - Jianhua Hu
- Department of Biostatistics; The University of Texas M.D. Anderson Cancer Center; Houston 77030 TX USA
| |
Collapse
|
39
|
Wu Y, Jiang X, Wang S, Jiang W, Li P, Ohno-Machado L. Grid multi-category response logistic models. BMC Med Inform Decis Mak 2015; 15:10. [PMID: 25886151 PMCID: PMC4342889 DOI: 10.1186/s12911-015-0133-y] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2014] [Accepted: 01/15/2015] [Indexed: 11/28/2022] Open
Abstract
Background Multi-category response models are very important complements to binary logistic models in medical decision-making. Decomposing model construction by aggregating computation developed at different sites is necessary when data cannot be moved outside institutions due to privacy or other concerns. Such decomposition makes it possible to conduct grid computing to protect the privacy of individual observations. Methods This paper proposes two grid multi-category response models for ordinal and multinomial logistic regressions. Grid computation to test model assumptions is also developed for these two types of models. In addition, we present grid methods for goodness-of-fit assessment and for classification performance evaluation. Results Simulation results show that the grid models produce the same results as those obtained from corresponding centralized models, demonstrating that it is possible to build models using multi-center data without losing accuracy or transmitting observation-level data. Two real data sets are used to evaluate the performance of our proposed grid models. Conclusions The grid fitting method offers a practical solution for resolving privacy and other issues caused by pooling all data in a central site. The proposed method is applicable for various likelihood estimation problems, including other generalized linear models. Electronic supplementary material The online version of this article (doi:10.1186/s12911-015-0133-y) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Yuan Wu
- Department of Biostatistics & Bioinformatics, Duke University, Durham, NC, 27708, USA.
| | - Xiaoqian Jiang
- Division of Biomedical Informatics, Department of Medicine, University of California, San Diego, La Jolla, CA, 92093, USA
| | - Shuang Wang
- Division of Biomedical Informatics, Department of Medicine, University of California, San Diego, La Jolla, CA, 92093, USA
| | - Wenchao Jiang
- Department of Electronic Engineering, Shanghai Jiaotong University, Shanghai, 200240, China
| | - Pinghao Li
- Department of Electronic Engineering, Shanghai Jiaotong University, Shanghai, 200240, China
| | - Lucila Ohno-Machado
- Division of Biomedical Informatics, Department of Medicine, University of California, San Diego, La Jolla, CA, 92093, USA
| |
Collapse
|
40
|
Nze Ossima AD, Daurès JP, Bessaoud F, Trétarre B. The generalized Lehmann ROC curves: Lehmann family of ROC surfaces. J STAT COMPUT SIM 2015. [DOI: 10.1080/00949655.2013.831863] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
|
41
|
Dong T, Tian L. Confidence Interval Estimation for Sensitivity to the Early Diseased Stage Based on Empirical Likelihood. J Biopharm Stat 2014; 25:1215-33. [PMID: 25372999 PMCID: PMC5540368 DOI: 10.1080/10543406.2014.971173] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
Abstract
Many disease processes can be divided into three stages: the non-diseased stage: the early diseased stage, and the fully diseased stage. To assess the accuracy of diagnostic tests for such diseases, various summary indexes have been proposed, such as volume under the surface (VUS), partial volume under the surface (PVUS), and the sensitivity to the early diseased stage given specificity and the sensitivity to the fully diseased stage (P2). This paper focuses on confidence interval estimation for P2 based on empirical likelihood. Simulation studies are carried out to assess the performance of the new methods compared to the existing parametric and nonparametric ones. A real dataset from Alzheimer's Disease Neuroimaging Initiative (ADNI) is analyzed.
Collapse
Affiliation(s)
- Tuochuan Dong
- Department of Biostatistics, University at Buffalo, Buffalo, NY 14214, USA
| | - Lili Tian
- Department of Biostatistics, University at Buffalo, Buffalo, NY 14214, USA
| |
Collapse
|
42
|
Schubert CM, Guennel T. Comparing Performance of Multiclass Classification Systems with ROC Manifolds: When Volume and Correct Classification Fails. COMMUN STAT-SIMUL C 2014. [DOI: 10.1080/03610918.2013.794284] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|
43
|
Chang CH, Yang JT, Lee MH. A Novel “Maximizing Kappa” Approach for Assessing the Ability of a Diagnostic Marker and Its Optimal Cutoff Value. J Biopharm Stat 2014; 25:1005-19. [DOI: 10.1080/10543406.2014.920347] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|
44
|
|
45
|
Wan S, Zhang B. Semiparametric ROC surface estimation for continuous diagnostic tests via polytomous logistic regression procedures. J STAT COMPUT SIM 2013. [DOI: 10.1080/00949655.2012.684096] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
|
46
|
Dong T, Kang L, Hutson A, Xiong C, Tian L. Confidence interval estimation of the difference between two sensitivities to the early disease stage. Biom J 2013; 56:270-86. [PMID: 24265123 DOI: 10.1002/bimj.201200012] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2012] [Revised: 06/18/2013] [Accepted: 08/26/2013] [Indexed: 11/11/2022]
Abstract
Although most of the statistical methods for diagnostic studies focus on disease processes with binary disease status, many diseases can be naturally classified into three ordinal diagnostic categories, that is normal, early stage, and fully diseased. For such diseases, the volume under the ROC surface (VUS) is the most commonly used index of diagnostic accuracy. Because the early disease stage is most likely the optimal time window for therapeutic intervention, the sensitivity to the early diseased stage has been suggested as another diagnostic measure. For the purpose of comparing the diagnostic abilities on early disease detection between two markers, it is of interest to estimate the confidence interval of the difference between sensitivities to the early diseased stage. In this paper, we present both parametric and non-parametric methods for this purpose. An extensive simulation study is carried out for a variety of settings for the purpose of evaluating and comparing the performance of the proposed methods. A real example of Alzheimer's disease (AD) is analyzed using the proposed approaches.
Collapse
Affiliation(s)
- Tuochuan Dong
- Department of Biostatistics, University at Buffalo, Buffalo, NY 14214, USA
| | | | | | | | | |
Collapse
|
47
|
Carnicelli AP, Stone JJ, Doyle A, Chowdhry AK, Mix D, Ellis J, Gillespie DL, Chandra A. Cross-sectional area for the calculation of carotid artery stenosis on computed tomographic angiography. J Vasc Surg 2013; 58:659-65. [DOI: 10.1016/j.jvs.2013.02.031] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2012] [Revised: 12/05/2012] [Accepted: 02/12/2013] [Indexed: 11/24/2022]
|
48
|
Validation of Monte Carlo estimates of three-class ideal observer operating points for normal data. Acad Radiol 2013; 20:908-14. [PMID: 23747155 DOI: 10.1016/j.acra.2013.04.002] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2013] [Revised: 04/15/2013] [Accepted: 04/16/2013] [Indexed: 11/23/2022]
Abstract
RATIONALE AND OBJECTIVES Traditional two-class receiver operating characteristic (ROC) analysis is inadequate for the complete evaluation of observer performance in tasks with more than two classes. MATERIALS AND METHODS Here, a Monte Carlo estimation method for operating point coordinates on a three-class ROC surface is developed and compared with analytically calculated coordinates in two special cases: (1) univariate and (2) restricted bivariate trinormal underlying data. RESULTS In both cases, the statistical estimates were found to be good in the sense that the analytical values lay within the 95% confidence interval of the estimated values about 95% of the time. CONCLUSIONS The statistical estimation method should be key in the development of a pragmatic performance metric for evaluation of observers in classification tasks with three or more classes.
Collapse
|
49
|
|
50
|
Assessing the discriminative ability of risk models for more than two outcome categories. Eur J Epidemiol 2012; 27:761-70. [PMID: 23054032 DOI: 10.1007/s10654-012-9733-3] [Citation(s) in RCA: 62] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2012] [Accepted: 09/14/2012] [Indexed: 12/21/2022]
Abstract
The discriminative ability of risk models for dichotomous outcomes is often evaluated with the concordance index (c-index). However, many medical prediction problems are polytomous, meaning that more than two outcome categories need to be predicted. Unfortunately such problems are often dichotomized in prediction research. We present a perspective on the evaluation of discriminative ability of polytomous risk models, which may instigate researchers to consider polytomous prediction models more often. First, we suggest a "discrimination plot" as a tool to visualize the model's discriminative ability. Second, we discuss the use of one overall polytomous c-index versus a set of dichotomous measures to summarize the performance of the model. Third, we address several aspects to consider when constructing a polytomous c-index. These involve the assessment of concordance in pairs versus sets of patients, weighting by outcome prevalence, the value related to models with random performance, the reduction to the dichotomous c-index for dichotomous problems, and interpretation. We illustrate these issues on case studies dealing with ovarian cancer (four outcome categories) and testicular cancer (three categories). We recommend the use of a discrimination plot together with an overall c-index such as the Polytomous Discrimination Index. If the overall c-index suggests that the model has relevant discriminative ability, pairwise c-indexes for each pair of outcome categories are informative. For pairwise c-indexes we recommend the 'conditional-risk' method which is consistent with the analytical approach of the multinomial logistic regression used to develop polytomous risk models.
Collapse
|