1
|
Klinkhammer H, Staerk C, Maj C, Krawitz PM, Mayr A. Genetic Prediction Modeling in Large Cohort Studies via Boosting Targeted Loss Functions. Stat Med 2024. [PMID: 39440393 DOI: 10.1002/sim.10249] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2024] [Revised: 09/11/2024] [Accepted: 10/01/2024] [Indexed: 10/25/2024]
Abstract
Polygenic risk scores (PRS) aim to predict a trait from genetic information, relying on common genetic variants with low to medium effect sizes. As genotype data are high-dimensional in nature, it is crucial to develop methods that can be applied to large-scale data (largen $$ n $$ and largep $$ p $$ ). Many PRS tools aggregate univariate summary statistics from genome-wide association studies into a single score. Recent advancements allow simultaneous modeling of variant effects from individual-level genotype data. In this context, we introduced snpboost, an algorithm that applies statistical boosting on individual-level genotype data to estimate PRS via multivariable regression models. By processing variants iteratively in batches, snpboost can deal with large-scale cohort data. Having solved the technical obstacles due to data dimensionality, the methodological scope can now be broadened-focusing on key objectives for the clinical application of PRS. Similar to most methods in this context, snpboost has, so far, been restricted to quantitative and binary traits. Now, we incorporate more advanced alternatives-targeted to the particular aim and outcome. Adapting the loss function extends the snpboost framework to further data situations such as time-to-event and count data. Furthermore, alternative loss functions for continuous outcomes allow us to focus not only on the mean of the conditional distribution but also on other aspects that may be more helpful in the risk stratification of individual patients and can quantify prediction uncertainty, for example, median or quantile regression. This work enhances PRS fitting across multiple model classes previously unfeasible for this data type.
Collapse
Affiliation(s)
- Hannah Klinkhammer
- Institute of Medical Biometry, Informatics and Epidemiology, Medical Faculty, University of Bonn, Bonn, Germany
- Institute of Genomic Statistics and Bioinformatics, Medical Faculty, University of Bonn, Bonn, Germany
| | - Christian Staerk
- Institute of Medical Biometry, Informatics and Epidemiology, Medical Faculty, University of Bonn, Bonn, Germany
- IUF - Leibniz Research Institute for Environmental Medicine, Düsseldorf, Germany
- Department of Statistics, TU Dortmund University, Dortmund, Germany
| | - Carlo Maj
- Institute of Genomic Statistics and Bioinformatics, Medical Faculty, University of Bonn, Bonn, Germany
- Center for Human Genetics, Philipps-University Marburg, Marburg, Germany
| | - Peter M Krawitz
- Institute of Genomic Statistics and Bioinformatics, Medical Faculty, University of Bonn, Bonn, Germany
| | - Andreas Mayr
- Institute of Medical Biometry, Informatics and Epidemiology, Medical Faculty, University of Bonn, Bonn, Germany
| |
Collapse
|
2
|
Li Y, Herold T, Mansmann U, Hornung R. Does combining numerous data types in multi-omics data improve or hinder performance in survival prediction? Insights from a large-scale benchmark study. BMC Med Inform Decis Mak 2024; 24:244. [PMID: 39223659 PMCID: PMC11370316 DOI: 10.1186/s12911-024-02642-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2024] [Accepted: 08/21/2024] [Indexed: 09/04/2024] Open
Abstract
BACKGROUND Predictive modeling based on multi-omics data, which incorporates several types of omics data for the same patients, has shown potential to outperform single-omics predictive modeling. Most research in this domain focuses on incorporating numerous data types, despite the complexity and cost of acquiring them. The prevailing assumption is that increasing the number of data types necessarily improves predictive performance. However, the integration of less informative or redundant data types could potentially hinder this performance. Therefore, identifying the most effective combinations of omics data types that enhance predictive performance is critical for cost-effective and accurate predictions. METHODS In this study, we systematically evaluated the predictive performance of all 31 possible combinations including at least one of five genomic data types (mRNA, miRNA, methylation, DNAseq, and copy number variation) using 14 cancer datasets with right-censored survival outcomes, publicly available from the TCGA database. We employed various prediction methods and up-weighted clinical data in every model to leverage their predictive importance. Harrell's C-index and the integrated Brier Score were used as performance measures. To assess the robustness of our findings, we performed a bootstrap analysis at the level of the included datasets. Statistical testing was conducted for key results, limiting the number of tests to ensure a low risk of false positives. RESULTS Contrary to expectations, we found that using only mRNA data or a combination of mRNA and miRNA data was sufficient for most cancer types. For some cancer types, the additional inclusion of methylation data led to improved prediction results. Far from enhancing performance, the introduction of more data types most often resulted in a decline in performance, which varied between the two performance measures. CONCLUSIONS Our findings challenge the prevailing notion that combining multiple omics data types in multi-omics survival prediction improves predictive performance. Thus, the widespread approach in multi-omics prediction of incorporating as many data types as possible should be reconsidered to avoid suboptimal prediction results and unnecessary expenditure.
Collapse
Affiliation(s)
- Yingxia Li
- Institute for Medical Information Processing, Biometry and Epidemiology, LMU Munich, Marchioninistr. 15, 81377, Munich, Germany.
| | - Tobias Herold
- Laboratory for Leukemia Diagnostics, Department of Medicine III, LMU University Hospital, LMU Munich, Munich, Germany
| | - Ulrich Mansmann
- Institute for Medical Information Processing, Biometry and Epidemiology, LMU Munich, Marchioninistr. 15, 81377, Munich, Germany
| | - Roman Hornung
- Institute for Medical Information Processing, Biometry and Epidemiology, LMU Munich, Marchioninistr. 15, 81377, Munich, Germany
- Munich Center for Machine Learning (MCML), Munich, Germany
| |
Collapse
|
3
|
Koutakis P, Hernandez H, Miserlis D, Thompson JR, Papoutsi E, Mietus CJ, Haynatzki G, Kim JK, Casale GP, Pipinos II. Oxidative damage in the gastrocnemius predicts long-term survival in patients with peripheral artery disease. NPJ AGING 2024; 10:21. [PMID: 38580664 PMCID: PMC10997596 DOI: 10.1038/s41514-024-00147-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/24/2023] [Accepted: 03/20/2024] [Indexed: 04/07/2024]
Abstract
Patients with peripheral artery disease (PAD) have increased mortality rates and a myopathy in their affected legs which is characterized by increased oxidative damage, reduced antioxidant enzymatic activity and defective mitochondrial bioenergetics. This study evaluated the hypothesis that increased levels of oxidative damage in gastrocnemius biopsies from patients with PAD predict long-term mortality rates. Oxidative damage was quantified as carbonyl adducts in myofibers of the gastrocnemius of PAD patients. The oxidative stress data were grouped into tertiles and the 5-year, all-cause mortality for each tertile was determined by Kaplan-Meier curves and compared by the Modified Peto test. A Cox-regression model was used to control the effects of clinical characteristics. Results were adjusted for age, sex, race, body mass index, ankle-brachial index, smoking, physical activity, and comorbidities. Of the 240 study participants, 99 died during a mean follow up of 37.8 months. Patients in the highest tertile of oxidative damage demonstrated the highest 5-year mortality rate. The mortality hazard ratios (HR) from the Cox analysis were statistically significant for oxidative damage (lowest vs middle tertile; HR = 6.33; p = 0.0001 and lowest vs highest; HR = 8.37; p < 0.0001). Survival analysis of a contemporaneous population of PAD patients identifies abundance of carbonyl adducts in myofibers of their gastrocnemius as a predictor of mortality rate independently of ankle-brachial index, disease stage and other clinical and myopathy-related covariates.
Collapse
Affiliation(s)
- Panagiotis Koutakis
- Department of Biology, Baylor University, Waco, TX, USA.
- Department of Surgery, University of Nebraska Medical Center, Omaha, NE, USA.
| | - Hernan Hernandez
- Department of Surgery, University of Nebraska Medical Center, Omaha, NE, USA
| | - Dimitrios Miserlis
- Department of Surgery, University of Nebraska Medical Center, Omaha, NE, USA
- Department of Surgery and Perioperative Care, University of Texas at Austin, Austin, TX, USA
| | - Jonathan R Thompson
- Department of Surgery, University of Nebraska Medical Center, Omaha, NE, USA
| | - Evlampia Papoutsi
- Department of Biology, Baylor University, Waco, TX, USA
- Department of Surgery, University of Nebraska Medical Center, Omaha, NE, USA
| | - Constance J Mietus
- Department of Surgery, University of Nebraska Medical Center, Omaha, NE, USA
- Department of Neurological Surgery, University of Massachusetts Medical School, Worcester, MA, USA
| | - Gleb Haynatzki
- Department of Biostatistics, University of Nebraska Medical Center, Omaha, NE, USA
| | - Julian K Kim
- Department of Biology, Baylor University, Waco, TX, USA
| | - George P Casale
- Department of Surgery, University of Nebraska Medical Center, Omaha, NE, USA
| | - Iraklis I Pipinos
- Department of Surgery, University of Nebraska Medical Center, Omaha, NE, USA.
- Department of Surgery and VA Research Service, VA Nebraska-Western Iowa Health Care System, Omaha, NE, USA.
| |
Collapse
|
4
|
Schmid M, Friede T, Klein N, Weinhold L. Accounting for time dependency in meta-analyses of concordance probability estimates. Res Synth Methods 2023; 14:807-823. [PMID: 37429580 DOI: 10.1002/jrsm.1655] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2022] [Revised: 04/21/2023] [Accepted: 06/19/2023] [Indexed: 07/12/2023]
Abstract
Recent years have seen the development of many novel scoring tools for disease prognosis and prediction. To become accepted for use in clinical applications, these tools have to be validated on external data. In practice, validation is often hampered by logistical issues, resulting in multiple small-sized validation studies. It is therefore necessary to synthesize the results of these studies using techniques for meta-analysis. Here we consider strategies for meta analyzing the concordance probability for time-to-event data ("C-index"), which has become a popular tool to evaluate the discriminatory power of prediction models with a right-censored outcome. We show that standard meta-analysis of the C-index may lead to biased results, as the magnitude of the concordance probability depends on the length of the time interval used for evaluation (defined e.g., by the follow-up time, which might differ considerably between studies). To address this issue, we propose a set of methods for random-effects meta-regression that incorporate time directly as covariate in the model equation. In addition to analyzing nonlinear time trends via fractional polynomial, spline, and exponential decay models, we provide recommendations on suitable transformations of the C-index before meta-regression. Our results suggest that the C-index is best meta-analyzed using fractional polynomial meta-regression with logit-transformed C-index values. Classical random-effects meta-analysis (not considering time as covariate) is demonstrated to be a suitable alternative when follow-up times are small. Our findings have implications for the reporting of C-index values in future studies, which should include information on the length of the time interval underlying the calculations.
Collapse
Affiliation(s)
- Matthias Schmid
- Department of Medical Biometry, Informatics, and Epidemiology, University Hospital Bonn, Bonn, Germany
| | - Tim Friede
- Department of Medical Statistics, University Medical Center Göttingen, Göttingen, Germany
| | - Nadja Klein
- Research Center for Trustworthy Data Science and Security, UA Ruhr/Department of Statistics, Technische Universität Dortmund, Dortmund, Germany
| | - Leonie Weinhold
- Department of Medical Biometry, Informatics, and Epidemiology, University Hospital Bonn, Bonn, Germany
| |
Collapse
|
5
|
Li R, Ning J, Feng Z. Estimation and inference of predictive discrimination for survival outcome risk prediction models. LIFETIME DATA ANALYSIS 2022; 28:219-240. [PMID: 35061146 PMCID: PMC10084512 DOI: 10.1007/s10985-022-09545-9] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/04/2021] [Accepted: 12/28/2021] [Indexed: 06/14/2023]
Abstract
Accurate risk prediction has been the central goal in many studies of survival outcomes. In the presence of multiple risk factors, a censored regression model can be employed to estimate a risk prediction rule. Before the prediction tool can be popularized for practical use, it is crucial to rigorously assess its prediction performance. In our motivating example, researchers are interested in developing and validating a risk prediction tool to identify future lung cancer cases by integrating demographic information, disease characteristics and smoking-related data. Considering the long latency period of cancer, it is desirable for a prediction tool to achieve discriminative performance that does not weaken over time. We propose estimation and inferential procedures to comprehensively assess both the overall predictive discrimination and the temporal pattern of an estimated prediction rule. The proposed methods readily accommodate commonly used censored regression models, including the Cox proportional hazards model and the accelerated failure time model. The estimators are consistent and asymptotically normal, and reliable variance estimators are also developed. The proposed methods offer an informative tool for inferring time-dependent predictive discrimination, as well as for comparing the discrimination performance between candidate models. Applications of the proposed methods demonstrate enduring performance of the risk prediction tool in the PLCO study and detected decaying performance in a study of liver disease.
Collapse
Affiliation(s)
- Ruosha Li
- Department of Biostatistics and Data Science, The University of Texas Health Science Center at Houston, Houston, TX 77030, USA
| | - Jing Ning
- Department of Biostatistics, The University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA
| | - Ziding Feng
- Division of Public Health Sciences, Fred Hutchison Cancer Research Center, Seattle, WA, USA
| |
Collapse
|
6
|
Linden T, Hanses F, Domingo-Fernández D, DeLong LN, Kodamullil AT, Schneider J, Vehreschild MJGT, Lanznaster J, Ruethrich MM, Borgmann S, Hower M, Wille K, Feldt T, Rieg S, Hertenstein B, Wyen C, Roemmele C, Vehreschild JJ, Jakob CEM, Stecher M, Kuzikov M, Zaliani A, Fröhlich H. Machine Learning Based Prediction of COVID-19 Mortality Suggests Repositioning of Anticancer Drug for Treating Severe Cases. ARTIFICIAL INTELLIGENCE IN THE LIFE SCIENCES 2021; 1:100020. [PMID: 34988543 PMCID: PMC8677630 DOI: 10.1016/j.ailsci.2021.100020] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/08/2021] [Revised: 11/22/2021] [Accepted: 11/22/2021] [Indexed: 02/08/2023]
Abstract
Despite available vaccinations COVID-19 case numbers around the world are still growing, and effective medications against severe cases are lacking. In this work, we developed a machine learning model which predicts mortality for COVID-19 patients using data from the multi-center 'Lean European Open Survey on SARS-CoV-2-infected patients' (LEOSS) observational study (>100 active sites in Europe, primarily in Germany), resulting into an AUC of almost 80%. We showed that molecular mechanisms related to dementia, one of the relevant predictors in our model, intersect with those associated to COVID-19. Most notably, among these molecules was tyrosine kinase 2 (TYK2), a protein that has been patented as drug target in Alzheimer's Disease but also genetically associated with severe COVID-19 outcomes. We experimentally verified that anti-cancer drugs Sorafenib and Regorafenib showed a clear anti-cytopathic effect in Caco2 and VERO-E6 cells and can thus be regarded as potential treatments against COVID-19. Altogether, our work demonstrates that interpretation of machine learning based risk models can point towards drug targets and new treatment options, which are strongly needed for COVID-19.
Collapse
Affiliation(s)
- Thomas Linden
- Fraunhofer Institute for Algorithms and Scientific Computing (SCAI), Schloss Birlinghoven, 53757 Sankt Augustin, Germany
- University of Bonn, Bonn-Aachen International Center for IT, Friedrich Hirzebruch-Allee 6, 53115 Bonn, Germany
| | - Frank Hanses
- Emergency Department, University Hospital Regensburg, 93053 Regensburg, Germany
- Department for Infectious Diseases and Infection Control, University Hospital Regensburg, Germany
| | - Daniel Domingo-Fernández
- Fraunhofer Institute for Algorithms and Scientific Computing (SCAI), Schloss Birlinghoven, 53757 Sankt Augustin, Germany
| | - Lauren Nicole DeLong
- Fraunhofer Institute for Algorithms and Scientific Computing (SCAI), Schloss Birlinghoven, 53757 Sankt Augustin, Germany
- University of Bonn, Bonn-Aachen International Center for IT, Friedrich Hirzebruch-Allee 6, 53115 Bonn, Germany
| | - Alpha Tom Kodamullil
- Fraunhofer Institute for Algorithms and Scientific Computing (SCAI), Schloss Birlinghoven, 53757 Sankt Augustin, Germany
| | - Jochen Schneider
- Technical University of Munich, School of Medicine, University Hospital rechts der Isar, Department of Internal Medicine II, 81675 Munich, Germany
| | - Maria J G T Vehreschild
- Department II of Internal Medicine, Infectious Diseases, University Hospital Frankfurt, Goethe University, 60590 Frankfurt, Germany
| | - Julia Lanznaster
- Department of Internal Medicine II, Hospital Passau, Innstraße 76, 94032 Passau, Germany
| | - Maria Madeleine Ruethrich
- Institute for Infection Medicine and Hospital Hygiene, University Hospital Jena, 07743 Jena, Germany
| | - Stefan Borgmann
- Department of Infectious Diseases and Infection Control, Hospital Ingolstadt, 85049 Ingolstadt, Germany
| | - Martin Hower
- Department of Pneumology, Infectious Diseases and Intensive Care, Klinikum Dortmund gGmbH, Hospital of University Witten / Herdecke, 44137 Dortmund, Germany
| | - Kai Wille
- University Clinic for Haematology, Oncology, Haemostaseology and Palliative Care, Johannes Wesling Medical Centre Minden, 32429 Minden, Germany
| | - Torsten Feldt
- Department of Gastroenterology, Hepatology and Infectious Diseases, University Hospital Düsseldorf, Medical Faculty of Heinrich Heine University Düsseldorf, Moorenstrasse 5, 40225 Düsseldorf, Germany
| | - Siegbert Rieg
- Department of Medicine II, University Hospital Freiburg, 79110 Freiburg, Germany
| | - Bernd Hertenstein
- Department of Medicine II, University Hospital Freiburg, 79110 Freiburg, Germany
| | - Christoph Wyen
- Christoph Wyen, Praxis am Ebertplatz Cologne, 50668 Cologne, Germany
| | - Christoph Roemmele
- Internal Medicine III - Gastroenterology and Infectious Diseases, University Hospital Augsburg, 86156 Augsburg, Germany
| | - Jörg Janne Vehreschild
- Department II of Internal Medicine, Infectious Diseases, University Hospital Frankfurt, Goethe University, 60590 Frankfurt, Germany
| | - Carolin E M Jakob
- Department I for Internal Medicine, University Hospital of Cologne, University of Cologne, 50931 Cologne, Germany
| | - Melanie Stecher
- Fraunhofer Institute for Translational Medicine and Pharmacologie (ITMP), VolksparkLabs, Schnackenburgallee 114, 22535 Hamburg, Germany
| | - Maria Kuzikov
- Department for Infectious Diseases and Infection Control, University Hospital Regensburg, Germany
| | - Andrea Zaliani
- Department for Infectious Diseases and Infection Control, University Hospital Regensburg, Germany
| | - Holger Fröhlich
- Fraunhofer Institute for Algorithms and Scientific Computing (SCAI), Schloss Birlinghoven, 53757 Sankt Augustin, Germany
- University of Bonn, Bonn-Aachen International Center for IT, Friedrich Hirzebruch-Allee 6, 53115 Bonn, Germany
| |
Collapse
|
7
|
Zadeh SG, Schmid M. Bias in Cross-Entropy-Based Training of Deep Survival Networks. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2021; 43:3126-3137. [PMID: 32149626 DOI: 10.1109/tpami.2020.2979450] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Over the last years, utilizing deep learning for the analysis of survival data has become attractive to many researchers. This has led to the advent of numerous network architectures for the prediction of possibly censored time-to-event variables. Unlike networks for cross-sectional data (used e.g., in classification), deep survival networks require the specification of a suitably defined loss function that incorporates typical characteristics of survival data such as censoring and time-dependent features. Here, we provide an in-depth analysis of the cross-entropy loss function, which is a popular loss function for training deep survival networks. For each time point t, the cross-entropy loss is defined in terms of a binary outcome with levels "event at or before t" and "event after t". Using both theoretical and empirical approaches, we show that this definition may result in a high prediction error and a heavy bias in the predicted survival probabilities. To overcome this problem, we analyze an alternative loss function that is derived from the negative log-likelihood function of a discrete time-to-event model. We show that replacing the cross-entropy loss by the negative log-likelihood loss results in much better calibrated prediction rules and also in an improved discriminatory power, as measured by the concordance index.
Collapse
|
8
|
Chauzeix J, Pastoret C, Donaty L, Gachard N, Fest T, Feuillard J, Rizzo D. A reduced panel of eight genes (ATM, SF3B1, NOTCH1, BIRC3, XPO1, MYD88, TNFAIP3, and TP53) as an estimator of the tumor mutational burden in chronic lymphocytic leukemia. Int J Lab Hematol 2021; 43:683-692. [PMID: 33325634 PMCID: PMC8451785 DOI: 10.1111/ijlh.13435] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2020] [Revised: 11/10/2020] [Accepted: 11/24/2020] [Indexed: 12/30/2022]
Abstract
INTRODUCTION Mutational complexity or tumor mutational burden (TMB) influences the course of chronic lymphocytic leukemia (CLL). However, this information is not routinely used because TMB is usually obtained from whole genome or exome, or from large gene panel high-throughput sequencing. METHODS Here, we used the C-Harrel concordance index to determine the minimum panel of genes for which mutations predict treatment-free survival (TFS) as well as large resequencing panels. RESULTS An eight gene estimator was defined encompassing ATM, SF3B1, NOTCH1, BIRC3, XPO1, MYD88, TNFAIP3, and TP53. TMB estimated from either a large panel of genes or the eight gene estimator was increased in treated patients or in those with a short TFS (<2 years), unmutated IGHV gene or with an unfavorable karyotype. Being an independent prognostic parameter, any mutation in the eight gene estimator predicted a shorter TFS better than Binet stage and IGHV mutational status among patients with an apparently non-progressive disease (TFS >6 months). Strikingly, the eight gene estimator was also highly informative for patients with Binet stage A CLL or with a good prognosis karyotype. CONCLUSION These results suggest that the eight gene estimator, that is easily achievable by high-throughput resequencing, brings robust and valuable information that predicts evolution of untreated patients at diagnosis better than any other parameter.
Collapse
Affiliation(s)
- Jasmine Chauzeix
- Laboratoire d'Hématologie etUMR CNRS 7276/INSERM 1262CRIBLCentre de Biologie et de Recherche en SantéCHU et Université de LimogesLimogesFrance
| | - Cédric Pastoret
- InsermMICMAC ‐ UMR_S 1236CHU RennesUniversité Rennes 1RennesFrance
| | - Lucie Donaty
- Laboratoire d'Hématologie etUMR CNRS 7276/INSERM 1262CRIBLCentre de Biologie et de Recherche en SantéCHU et Université de LimogesLimogesFrance
| | - Nathalie Gachard
- Laboratoire d'Hématologie etUMR CNRS 7276/INSERM 1262CRIBLCentre de Biologie et de Recherche en SantéCHU et Université de LimogesLimogesFrance
| | - Thierry Fest
- InsermMICMAC ‐ UMR_S 1236CHU RennesUniversité Rennes 1RennesFrance
| | - Jean Feuillard
- Laboratoire d'Hématologie etUMR CNRS 7276/INSERM 1262CRIBLCentre de Biologie et de Recherche en SantéCHU et Université de LimogesLimogesFrance
| | - David Rizzo
- Laboratoire d'Hématologie etUMR CNRS 7276/INSERM 1262CRIBLCentre de Biologie et de Recherche en SantéCHU et Université de LimogesLimogesFrance
| |
Collapse
|
9
|
van Geloven N, He Y, Zwinderman A, Putter H. Estimation of incident dynamic AUC in practice. Comput Stat Data Anal 2021. [DOI: 10.1016/j.csda.2020.107095] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
|
10
|
Gomez DE, Bedford S, Darby S, Palmisano M, MacKay RJ, Renaud DL. Acid-base disorders in sick goats and their association with mortality: A simplified strong ion difference approach. J Vet Intern Med 2020; 34:2776-2786. [PMID: 33140905 PMCID: PMC7694813 DOI: 10.1111/jvim.15956] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2020] [Revised: 10/23/2020] [Accepted: 10/23/2020] [Indexed: 12/16/2022] Open
Abstract
OBJECTIVES To investigate the acid-base status of sick goats using the simplified strong ion difference (sSID) approach, to establish the quantitative contribution of sSID variables to changes in blood pH and HCO3 - and to determine whether clinical, acid-base, and biochemical variables on admission are associated with the mortality of sick goats. ANIMALS One hundred forty-three sick goats. METHODS Retrospective study. Calculated sSID variables included SID using 6 electrolytes unmeasured strong ions (USI) and the total nonvolatile buffer ion concentration in plasma (Atot ). The relationship between measured blood pH and HCO3 - , and the sSID variables was examined using forward stepwise linear regression. Cox proportional hazard models were constructed to assess associations between potential predictor variables and mortality of goats during hospitalization. RESULTS Hypocapnia, hypokalemia, hyperchloremia, hyperlactatemia, and hyperproteinemia were common abnormalities identified in sick goats. Respiratory alkalosis, strong ion acidosis, and Atot acidosis were acid-base disorders frequently encountered in sick goats. In sick goats, the sSID variables explained 97% and 100% of the changes in blood pH and HCO3 - , respectively. The results indicated that changes in the respiratory rate (<16 respirations per minute), USI, and pH at admission were associated with increased hazard of hospital mortality in sick goats. CONCLUSIONS AND CLINICAL IMPORTANCE The sSID approach is a useful methodology to quantify acid-base disorders in goats and to determine the mechanisms of their development. Clinicians should consider calculation of USI in sick goats as part of the battery of information required to establish prognosis.
Collapse
Affiliation(s)
- Diego E. Gomez
- Department of Large Animal Clinical SciencesCollege of Veterinary Medicine, University of FloridaGainesvilleFloridaUSA
- Department of Clinical StudiesOntario Veterinary College, University of GuelphGuelphOntarioCanada
| | - Sofia Bedford
- Department of Clinical StudiesOntario Veterinary College, University of GuelphGuelphOntarioCanada
| | - Shannon Darby
- Department of Large Animal Clinical SciencesCollege of Veterinary Medicine, University of FloridaGainesvilleFloridaUSA
| | - Megan Palmisano
- Department of Large Animal Clinical SciencesCollege of Veterinary Medicine, University of FloridaGainesvilleFloridaUSA
| | - Robert J. MacKay
- Department of Large Animal Clinical SciencesCollege of Veterinary Medicine, University of FloridaGainesvilleFloridaUSA
| | - David L. Renaud
- Department of Population MedicineOntario Veterinary College, University of GuelphGuelphOntarioCanada
| |
Collapse
|
11
|
Dey R, Sebastiani G, Saha-Chaudhuri P. Inference about time-dependent prognostic accuracy measures in the presence of competing risks. BMC Med Res Methodol 2020; 20:219. [PMID: 32859153 PMCID: PMC7456384 DOI: 10.1186/s12874-020-01100-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2020] [Accepted: 08/12/2020] [Indexed: 01/08/2023] Open
Abstract
BACKGROUND Evaluating a candidate marker or developing a model for predicting risk of future conditions is one of the major goals in medicine. However, model development and assessment for a time-to-event outcome may be complicated in the presence of competing risks. In this manuscript, we propose a local and a global estimators of cause-specific AUC for right-censored survival times in the presence of competing risks. METHODS The local estimator - cause-specific weighted mean rank (cWMR) - is a local average of time-specific observed cause-specific AUCs within a neighborhood of given time t. The global estimator - cause-specific fractional polynomials (cFPL) - is based on modelling the cause-specific AUC as a function of t through fractional polynomials. RESULTS We investigated the performance of the proposed cWMR and cFPL estimators through simulation studies and real-life data analysis. The estimators perform well in small samples, have minimal bias and appropriate coverage. CONCLUSIONS The local estimator cWMR and the global estimator cFPL will provide computationally efficient options for assessing the prognostic accuracy of markers for time-to-event outcome in the presence of competing risks in many practical settings.
Collapse
Affiliation(s)
- Rajib Dey
- Department of Epidemiology, Biostatistics and Occupational Health, McGill University, Montreal, Canada
| | - Giada Sebastiani
- Division of Gastroenterology and Hepatology, McGill University Health Centre, Montreal, Canada
| | | |
Collapse
|
12
|
Longato E, Vettoretti M, Di Camillo B. A practical perspective on the concordance index for the evaluation and selection of prognostic time-to-event models. J Biomed Inform 2020; 108:103496. [PMID: 32652236 DOI: 10.1016/j.jbi.2020.103496] [Citation(s) in RCA: 79] [Impact Index Per Article: 19.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2019] [Revised: 05/12/2020] [Accepted: 06/24/2020] [Indexed: 11/18/2022]
Abstract
Developing a prognostic model for biomedical applications typically requires mapping an individual's set of covariates to a measure of the risk that he or she may experience the event to be predicted. Many scenarios, however, especially those involving adverse pathological outcomes, are better described by explicitly accounting for the timing of these events, as well as their probability. As a result, in these cases, traditional classification or ranking metrics may be inadequate to inform model evaluation or selection. To address this limitation, it is common practice to reframe the problem in the context of survival analysis, and resort, instead, to the concordance index (C-index), which summarises how well a predicted risk score describes an observed sequence of events. A practically meaningful interpretation of the C-index, however, may present several difficulties and pitfalls. Specifically, we identify two main issues: i) the C-index remains implicitly, and subtly, dependent on time, and ii) its relationship with the number of subjects whose risk was incorrectly predicted is not straightforward. Failure to consider these two aspects may introduce undesirable and unwanted biases in the evaluation process, and even result in the selection of a suboptimal model. Hence, here, we discuss ways to obtain a meaningful interpretation in spite of these difficulties. Aiming to assist experimenters regardless of their familiarity with the C-index, we start from an introductory-level presentation of its most popular estimator, highlighting the latter's temporal dependency, and suggesting how it might be correctly used to inform model selection. We also address the nonlinearity of the C-index with respect to the number of correct risk predictions, elaborating a simplified framework that may enable an easier interpretation and quantification of C-index improvements or deteriorations.
Collapse
Affiliation(s)
- Enrico Longato
- Department of Information Engineering, University of Padova, Padova, Italy.
| | - Martina Vettoretti
- Department of Information Engineering, University of Padova, Padova, Italy.
| | - Barbara Di Camillo
- Department of Information Engineering, University of Padova, Padova, Italy.
| |
Collapse
|
13
|
Meier A, Nekolla K, Hewitt LC, Earle S, Yoshikawa T, Oshima T, Miyagi Y, Huss R, Schmidt G, Grabsch HI. Hypothesis-free deep survival learning applied to the tumour microenvironment in gastric cancer. JOURNAL OF PATHOLOGY CLINICAL RESEARCH 2020; 6:273-282. [PMID: 32592447 PMCID: PMC7578283 DOI: 10.1002/cjp2.170] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/05/2019] [Revised: 04/14/2020] [Accepted: 04/20/2020] [Indexed: 02/06/2023]
Abstract
The biological complexity reflected in histology images requires advanced approaches for unbiased prognostication. Machine learning and particularly deep learning methods are increasingly applied in the field of digital pathology. In this study, we propose new ways to predict risk for cancer‐specific death from digital images of immunohistochemically (IHC) stained tissue microarrays (TMAs). Specifically, we evaluated a cohort of 248 gastric cancer patients using convolutional neural networks (CNNs) in an end‐to‐end weakly supervised scheme independent of subjective pathologist input. To account for the time‐to‐event characteristic of the outcome data, we developed new survival models to guide the network training. In addition to the standard H&E staining, we investigated the prognostic value of a panel of immune cell markers (CD8, CD20, CD68) and a proliferation marker (Ki67). Our CNN‐derived risk scores provided additional prognostic value when compared to the gold standard prognostic tool TNM stage. The CNN‐derived risk scores were also shown to be superior when systematically compared to cell density measurements or a CNN score derived from binary 5‐year survival classification, which ignores time‐to‐event. To better understand the underlying biological mechanisms, we qualitatively investigated risk heat maps for each marker which visualised the network output. We identified patterns of biological interest that were related to low risk of cancer‐specific death such as the presence of B‐cell predominated clusters and Ki67 positive sub‐regions and showed that the corresponding risk scores had prognostic value in multivariate Cox regression analyses (Ki67&CD20 risks: hazard ratio (HR) = 1.47, 95% confidence interval (CI) = 1.15–1.89, p = 0.002; CD20&CD68 risks: HR = 1.33, 95% CI = 1.07–1.67, p = 0.009). Our study demonstrates the potential additional value that deep learning in combination with a panel of IHC markers can bring to the field of precision oncology.
Collapse
Affiliation(s)
- Armin Meier
- Image Data Sciences, Definiens GmbH, Munich, Germany
| | | | - Lindsay C Hewitt
- Department of Pathology, GROW School for Oncology and Developmental Biology, Maastricht University Medical Center+, Maastricht, The Netherlands
| | - Sophie Earle
- Division of Pathology and Data Analytics, Leeds Institute of Medical Research at St. James's , University of Leeds, Leeds, UK
| | - Takaki Yoshikawa
- Department of Gastric Surgery, National Cancer Center Hospital, Tokyo, Japan
| | - Takashi Oshima
- Department of Gastrointestinal Surgery, Kanagawa Cancer Center Hospital, Yokohama, Japan
| | - Yohei Miyagi
- Molecular Pathology and Genetics Division, Kanagawa Cancer Center Research Institute, Yokohama, Japan
| | - Ralf Huss
- Institute of Pathology and Molecular Diagnostic, University Hospital Augsburg, Augsburg, Germany
| | | | - Heike I Grabsch
- Department of Pathology, GROW School for Oncology and Developmental Biology, Maastricht University Medical Center+, Maastricht, The Netherlands.,Division of Pathology and Data Analytics, Leeds Institute of Medical Research at St. James's , University of Leeds, Leeds, UK
| |
Collapse
|
14
|
Clinical-learning versus machine-learning for transdiagnostic prediction of psychosis onset in individuals at-risk. Transl Psychiatry 2019; 9:259. [PMID: 31624229 PMCID: PMC6797779 DOI: 10.1038/s41398-019-0600-9] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/09/2019] [Revised: 05/03/2019] [Accepted: 05/31/2019] [Indexed: 02/08/2023] Open
Abstract
Predicting the onset of psychosis in individuals at-risk is based on robust prognostic model building methods including a priori clinical knowledge (also termed clinical-learning) to preselect predictors or machine-learning methods to select predictors automatically. To date, there is no empirical research comparing the prognostic accuracy of these two methods for the prediction of psychosis onset. In a first experiment, no improved performance was observed when machine-learning methods (LASSO and RIDGE) were applied-using the same predictors-to an individualised, transdiagnostic, clinically based, risk calculator previously developed on the basis of clinical-learning (predictors: age, gender, age by gender, ethnicity, ICD-10 diagnostic spectrum), and externally validated twice. In a second experiment, two refined versions of the published model which expanded the granularity of the ICD-10 diagnosis were introduced: ICD-10 diagnostic categories and ICD-10 diagnostic subdivisions. Although these refined versions showed an increase in apparent performance, their external performance was similar to the original model. In a third experiment, the three refined models were analysed under machine-learning and clinical-learning with a variable event per variable ratio (EPV). The best performing model under low EPVs was obtained through machine-learning approaches. The development of prognostic models on the basis of a priori clinical knowledge, large samples and adequate events per variable is a robust clinical prediction method to forecast psychosis onset in patients at-risk, and is comparable to machine-learning methods, which are more difficult to interpret and implement. Machine-learning methods should be preferred for high dimensional data when no a priori knowledge is available.
Collapse
|
15
|
Beyene KM, El Ghouch A, Oulhaj A. On the validity of time-dependent AUC estimation in the presence of cure fraction. Biom J 2019; 61:1430-1447. [PMID: 31310019 DOI: 10.1002/bimj.201800376] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2018] [Revised: 04/16/2019] [Accepted: 06/04/2019] [Indexed: 11/09/2022]
Abstract
During the last decades, several approaches have been proposed to estimate the time-dependent area under the receiver operating characteristic curve (AUC) of risk tools derived from survival data. The validity of these estimators relies on some regularity assumptions among which a survival function being proper. In practice, this assumption is not always satisfied because a fraction of the population may not be susceptible to experience the event of interest even for long follow-up. Studying the sensitivity of the proposed estimators to the violation of this assumption is of substantial interest. In this paper, we investigate the performance of a nonparametric simple estimator, developed for classical survival data, in the case when the population exhibits a cure fraction. Motivated from the current practice of deriving risk tools in oncology and cardiovascular disease prevention, we also assess the loss, in terms of predictive performance, when deriving risk tools from survival models that do not acknowledge the presence of cure. The simulation results show that the investigated method is valid even under the presence of cure. They also show that risk tools derived from survival models that ignore the presence of cure have smaller AUC compared to those derived from survival models that acknowledge the presence of cure. This was also attested with a real data analysis from a breast cancer study.
Collapse
Affiliation(s)
- Kassu M Beyene
- Institute of Statistics, Biostatistics and Actuarial Sciences, Catholic University of Louvain, Louvain la Neuve, Belgium
| | - Anouar El Ghouch
- Institute of Statistics, Biostatistics and Actuarial Sciences, Catholic University of Louvain, Louvain la Neuve, Belgium
| | - Abderrahim Oulhaj
- Institute of Public Health, College of Medicine and Health Sciences, UAE University, Al-Ain, United Arab Emirates
| |
Collapse
|
16
|
Gleiss A, Gnant M, Schemper M. Explained variation in shared frailty models. Stat Med 2018; 37:1482-1490. [PMID: 29282754 DOI: 10.1002/sim.7592] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2017] [Revised: 11/21/2017] [Accepted: 11/30/2017] [Indexed: 11/06/2022]
Abstract
Explained variation measures the relative gain in predictive accuracy when prediction based on prognostic factors replaces unconditional prediction. The factors may be measured on different scales or may be of different types (dichotomous, qualitative, or continuous). Thus, explained variation permits to establish a ranking of the importance of factors, even if predictive accuracy is too low to be helpful in clinical practice. In this contribution, the explained variation measure by Schemper and Henderson (2000) is extended to accommodate random factors, such as center effects in multicenter studies. This permits a direct comparison of the importance of centers and of other prognostic factors. We develop this extension for a shared frailty Cox model and provide an SAS macro and an R function to facilitate its application. Interesting empirical properties of the variation explained by a random factor are explored by a Monte Carlo study. Advantages of the approach are exemplified by an Austrian multicenter study of colon cancer.
Collapse
Affiliation(s)
- Andreas Gleiss
- Center for Medical Statistics, Informatics, and Intelligent Systems, Medical University of Vienna, Vienna, Austria
| | - Michael Gnant
- Department of Surgery, Medical University of Vienna, Vienna, Austria
| | - Michael Schemper
- Center for Medical Statistics, Informatics, and Intelligent Systems, Medical University of Vienna, Vienna, Austria
| |
Collapse
|
17
|
Abstract
In modeling censored data, survival forest models are a competitive nonparametric alternative to traditional parametric or semiparametric models when the function forms are possibly misspecified or the underlying assumptions are violated. In this work, we propose a survival forest approach with trees constructed using a novel pseudo R2 splitting rules. By studying the well-known benchmark data sets, we find that the proposed model generally outperforms popular survival models such as random survival forest with different splitting rules, Cox proportional hazard model, and generalized boosted model in terms of C-index metric.
Collapse
Affiliation(s)
- Hong Wang
- School of Mathematics and Statistics, Central South University, Changsha, China
| | - Xiaolin Chen
- School of Statistics, Qufu Normal University, Qufu, China
| | - Gang Li
- Department of Biostatistics, School of Public Health, University of California at Los Angeles, Los Angeles, California
| |
Collapse
|
18
|
Binder H, Gefeller O, Schmid M, Mayr A. Extending Statistical Boosting. Methods Inf Med 2018; 53:428-35. [DOI: 10.3414/me13-01-0123] [Citation(s) in RCA: 30] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2013] [Accepted: 05/02/2014] [Indexed: 11/09/2022]
Abstract
SummaryBackground: Boosting algorithms to simultaneously estimate and select predictor effects in statistical models have gained substantial interest during the last decade.Objectives: This review highlights recent methodological developments regarding boosting algorithms for statistical modelling especially focusing on topics relevant for biomedical research.Methods: We suggest a unified framework for gradient boosting and likelihood-based boosting (statistical boosting) which have been addressed separately in the literature up to now.Results: The methodological developments on statistical boosting during the last ten years can be grouped into three different lines of research: i) efforts to ensure variable selection leading to sparser models, ii) developments regarding different types of predictor effects and how to choose them, iii) approaches to extend the statistical boosting framework to new regression settings.Conclusions: Statistical boosting algorithms have been adapted to carry out unbiased variable selection and automated model choice during the fitting process and can nowadays be applied in almost any regression setting in combination with a large amount of different types of predictor effects.
Collapse
|
19
|
Rahman MS, Ambler G, Choodari-Oskooei B, Omar RZ. Review and evaluation of performance measures for survival prediction models in external validation settings. BMC Med Res Methodol 2017; 17:60. [PMID: 28420338 PMCID: PMC5395888 DOI: 10.1186/s12874-017-0336-2] [Citation(s) in RCA: 45] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2017] [Accepted: 04/03/2017] [Indexed: 01/09/2023] Open
Abstract
BACKGROUND When developing a prediction model for survival data it is essential to validate its performance in external validation settings using appropriate performance measures. Although a number of such measures have been proposed, there is only limited guidance regarding their use in the context of model validation. This paper reviewed and evaluated a wide range of performance measures to provide some guidelines for their use in practice. METHODS An extensive simulation study based on two clinical datasets was conducted to investigate the performance of the measures in external validation settings. Measures were selected from categories that assess the overall performance, discrimination and calibration of a survival prediction model. Some of these have been modified to allow their use with validation data, and a case study is provided to describe how these measures can be estimated in practice. The measures were evaluated with respect to their robustness to censoring and ease of interpretation. All measures are implemented, or are straightforward to implement, in statistical software. RESULTS Most of the performance measures were reasonably robust to moderate levels of censoring. One exception was Harrell's concordance measure which tended to increase as censoring increased. CONCLUSIONS We recommend that Uno's concordance measure is used to quantify concordance when there are moderate levels of censoring. Alternatively, Gönen and Heller's measure could be considered, especially if censoring is very high, but we suggest that the prediction model is re-calibrated first. We also recommend that Royston's D is routinely reported to assess discrimination since it has an appealing interpretation. The calibration slope is useful for both internal and external validation settings and recommended to report routinely. Our recommendation would be to use any of the predictive accuracy measures and provide the corresponding predictive accuracy curves. In addition, we recommend to investigate the characteristics of the validation data such as the level of censoring and the distribution of the prognostic index derived in the validation setting before choosing the performance measures.
Collapse
Affiliation(s)
- M. Shafiqur Rahman
- Institute of Statistical Research and Training, University of Dhaka, Dhaka, Bangladesh
| | - Gareth Ambler
- Department of Statistical Science, University College London, London, UK
| | | | - Rumana Z. Omar
- Department of Statistical Science, University College London, London, UK
| |
Collapse
|
20
|
Wang H, Li G. A Selective Review on Random Survival Forests for High Dimensional Data. QUANTITATIVE BIO-SCIENCE 2017; 36:85-96. [PMID: 30740388 PMCID: PMC6364686 DOI: 10.22283/qbs.2017.36.2.85] [Citation(s) in RCA: 35] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Over the past decades, there has been considerable interest in applying statistical machine learning methods in survival analysis. Ensemble based approaches, especially random survival forests, have been developed in a variety of contexts due to their high precision and non-parametric nature. This article aims to provide a timely review on recent developments and applications of random survival forests for time-to-event data with high dimensional covariates. This selective review begins with an introduction to the random survival forest framework, followed by a survey of recent developments on splitting criteria, variable selection, and other advanced topics of random survival forests for time-to-event data in high dimensional settings. We also discuss potential research directions for future research.
Collapse
Affiliation(s)
- Hong Wang
- School of Mathematics and Statistics, Central South University, Hunan 410083, China
| | - Gang Li
- Department of Biostatistics and Biomathematics, School of Public Health, University of California at Los Angeles, CA 90095, USA
| |
Collapse
|
21
|
Devlin SM, Ostrovnaya I, Gönen M. Boomerang: A method for recursive reclassification. Biometrics 2016; 72:995-1002. [PMID: 26754051 PMCID: PMC4940305 DOI: 10.1111/biom.12469] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/01/2015] [Revised: 11/01/2015] [Accepted: 11/01/2015] [Indexed: 11/29/2022]
Abstract
While there are many validated prognostic classifiers used in practice, often their accuracy is modest and heterogeneity in clinical outcomes exists in one or more risk subgroups. Newly available markers, such as genomic mutations, may be used to improve the accuracy of an existing classifier by reclassifying patients from a heterogenous group into a higher or lower risk category. The statistical tools typically applied to develop the initial classifiers are not easily adapted toward this reclassification goal. In this article, we develop a new method designed to refine an existing prognostic classifier by incorporating new markers. The two-stage algorithm called Boomerang first searches for modifications of the existing classifier that increase the overall predictive accuracy and then merges to a prespecified number of risk groups. Resampling techniques are proposed to assess the improvement in predictive accuracy when an independent validation data set is not available. The performance of the algorithm is assessed under various simulation scenarios where the marker frequency, degree of censoring, and total sample size are varied. The results suggest that the method selects few false positive markers and is able to improve the predictive accuracy of the classifier in many settings. Lastly, the method is illustrated on an acute myeloid leukemia data set where a new refined classifier incorporates four new mutations into the existing three category classifier and is validated on an independent data set.
Collapse
Affiliation(s)
- Sean M Devlin
- Memorial Sloan Kettering Cancer Center, New York, New York 10065, U.S.A..
| | - Irina Ostrovnaya
- Memorial Sloan Kettering Cancer Center, New York, New York 10065, U.S.A
| | - Mithat Gönen
- Memorial Sloan Kettering Cancer Center, New York, New York 10065, U.S.A
| |
Collapse
|
22
|
Mayr A, Hofner B, Schmid M. Boosting the discriminatory power of sparse survival models via optimization of the concordance index and stability selection. BMC Bioinformatics 2016; 17:288. [PMID: 27444890 PMCID: PMC4957316 DOI: 10.1186/s12859-016-1149-8] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2015] [Accepted: 07/13/2016] [Indexed: 12/15/2022] Open
Abstract
Background When constructing new biomarker or gene signature scores for time-to-event outcomes, the underlying aims are to develop a discrimination model that helps to predict whether patients have a poor or good prognosis and to identify the most influential variables for this task. In practice, this is often done fitting Cox models. Those are, however, not necessarily optimal with respect to the resulting discriminatory power and are based on restrictive assumptions. We present a combined approach to automatically select and fit sparse discrimination models for potentially high-dimensional survival data based on boosting a smooth version of the concordance index (C-index). Due to this objective function, the resulting prediction models are optimal with respect to their ability to discriminate between patients with longer and shorter survival times. The gradient boosting algorithm is combined with the stability selection approach to enhance and control its variable selection properties. Results The resulting algorithm fits prediction models based on the rankings of the survival times and automatically selects only the most stable predictors. The performance of the approach, which works best for small numbers of informative predictors, is demonstrated in a large scale simulation study: C-index boosting in combination with stability selection is able to identify a small subset of informative predictors from a much larger set of non-informative ones while controlling the per-family error rate. In an application to discover biomarkers for breast cancer patients based on gene expression data, stability selection yielded sparser models and the resulting discriminatory power was higher than with lasso penalized Cox regression models. Conclusion The combination of stability selection and C-index boosting can be used to select small numbers of informative biomarkers and to derive new prediction rules that are optimal with respect to their discriminatory power. Stability selection controls the per-family error rate which makes the new approach also appealing from an inferential point of view, as it provides an alternative to classical hypothesis tests for single predictor effects. Due to the shrinkage and variable selection properties of statistical boosting algorithms, the latter tests are typically unfeasible for prediction models fitted by boosting. Electronic supplementary material The online version of this article (doi:10.1186/s12859-016-1149-8) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Andreas Mayr
- Institut für Medizininformatik, Biometrie und Epidemiologie, Friedrich-Alexander-Universität Erlangen-Nürnberg (FAU), Waldstr. 6, Erlangen, 91054, Germany. .,Institut für Medizinische Biometrie, Informatik und Epidemiologie, Rheinische Friedrich-Wilhelms-Universität Bonn, Sigmund-Freud-Str. 25, Bonn, 53105, Germany.
| | - Benjamin Hofner
- Institut für Medizininformatik, Biometrie und Epidemiologie, Friedrich-Alexander-Universität Erlangen-Nürnberg (FAU), Waldstr. 6, Erlangen, 91054, Germany
| | - Matthias Schmid
- Institut für Medizinische Biometrie, Informatik und Epidemiologie, Rheinische Friedrich-Wilhelms-Universität Bonn, Sigmund-Freud-Str. 25, Bonn, 53105, Germany
| |
Collapse
|
23
|
Gleiss A, Zeillinger R, Braicu EI, Trillsch F, Vergote I, Schemper M. Statistical controversies in clinical research: the importance of importance. Ann Oncol 2016; 27:1185-9. [PMID: 27052655 DOI: 10.1093/annonc/mdw159] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2015] [Accepted: 03/30/2016] [Indexed: 11/13/2022] Open
Abstract
We define the notion of 'importance' of prognostic factors in studies of survival and suggest quantifying it by the Schemper-Henderson measure of explained variation. Conceptual differences to the standard approach for the statistical analysis of oncologic studies of survival are discussed and exemplified by means of a study of ovarian cancer. Explained variation permits to establish a ranking of the importance of factors, also if measured on different scales, or of different types (dichotomous, qualitative or continuous), and permits to compare groups of related factors. In practice, the importance of prognostic factors often is disappointingly low. From this, it follows that even strong and highly significant prognostic factors often do not translate into close determination of individual survival of patients.
Collapse
Affiliation(s)
- A Gleiss
- Section for Clinical Biometrics, Center for Medical Statistics, Informatics, and Intelligent Systems
| | - R Zeillinger
- Molecular Oncology Group, Department of Obstetrics and Gynecology, Medical University of Vienna, Vienna, Austria
| | - E I Braicu
- Department of Gynecology, Tumor Bank Ovarian Cancer (TOC), European Competence Center for Ovarian Cancer, Campus Virchow Klinikum, Charité-Universitätsmedizin Berlin, Berlin
| | - F Trillsch
- Department of Gynecology, University Medical Center Hamburg-Eppendorf, Hamburg, Germany
| | - I Vergote
- Department of Gynecologic Oncology, Leuven Cancer Institute, University Hospitals Leuven, KU Leuven, Leuven, Belgium
| | - M Schemper
- Section for Clinical Biometrics, Center for Medical Statistics, Informatics, and Intelligent Systems
| |
Collapse
|
24
|
Ternès N, Rotolo F, Michiels S. Empirical extensions of the lasso penalty to reduce the false discovery rate in high-dimensional Cox regression models. Stat Med 2016; 35:2561-73. [PMID: 26970107 DOI: 10.1002/sim.6927] [Citation(s) in RCA: 66] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2014] [Revised: 02/11/2016] [Accepted: 02/13/2016] [Indexed: 01/15/2023]
Abstract
Correct selection of prognostic biomarkers among multiple candidates is becoming increasingly challenging as the dimensionality of biological data becomes higher. Therefore, minimizing the false discovery rate (FDR) is of primary importance, while a low false negative rate (FNR) is a complementary measure. The lasso is a popular selection method in Cox regression, but its results depend heavily on the penalty parameter λ. Usually, λ is chosen using maximum cross-validated log-likelihood (max-cvl). However, this method has often a very high FDR. We review methods for a more conservative choice of λ. We propose an empirical extension of the cvl by adding a penalization term, which trades off between the goodness-of-fit and the parsimony of the model, leading to the selection of fewer biomarkers and, as we show, to the reduction of the FDR without large increase in FNR. We conducted a simulation study considering null and moderately sparse alternative scenarios and compared our approach with the standard lasso and 10 other competitors: Akaike information criterion (AIC), corrected AIC, Bayesian information criterion (BIC), extended BIC, Hannan and Quinn information criterion (HQIC), risk information criterion (RIC), one-standard-error rule, adaptive lasso, stability selection, and percentile lasso. Our extension achieved the best compromise across all the scenarios between a reduction of the FDR and a limited raise of the FNR, followed by the AIC, the RIC, and the adaptive lasso, which performed well in some settings. We illustrate the methods using gene expression data of 523 breast cancer patients. In conclusion, we propose to apply our extension to the lasso whenever a stringent FDR with a limited FNR is targeted. Copyright © 2016 John Wiley & Sons, Ltd.
Collapse
Affiliation(s)
- Nils Ternès
- Université Paris-Saclay, Univ. Paris-Sud, UVSQ, CESP, INSERM, F-94805, Villejuif, France.,Gustave Roussy, Service de biostatistique et d'épidémiologie, F-94805, Villejuif, France
| | - Federico Rotolo
- Université Paris-Saclay, Univ. Paris-Sud, UVSQ, CESP, INSERM, F-94805, Villejuif, France.,Gustave Roussy, Service de biostatistique et d'épidémiologie, F-94805, Villejuif, France
| | - Stefan Michiels
- Université Paris-Saclay, Univ. Paris-Sud, UVSQ, CESP, INSERM, F-94805, Villejuif, France.,Gustave Roussy, Service de biostatistique et d'épidémiologie, F-94805, Villejuif, France
| |
Collapse
|
25
|
Dorajoo SR, Tan WJH, Koo SX, Tan WS, Chew MH, Tang CL, Wee HL, Yap CW. A scoring model for predicting survival following primary tumour resection in stage IV colorectal cancer patients with unresectable metastasis. Int J Colorectal Dis 2016; 31:235-45. [PMID: 26490055 DOI: 10.1007/s00384-015-2419-z] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 10/14/2015] [Indexed: 02/04/2023]
Abstract
BACKGROUND Stage IV colorectal cancer patients with unresectable metastasis who undergo elective primary tumour resection experience heterogeneous post-operative survival. We aimed to develop a scoring model for predicting post-operative survival using pre-operative variables to identify patients who are least likely to experience extended survival following the procedure. METHODS Survival data were collected from stage IV colorectal cancer patients who had undergone elective primary tumour resection between January 1999 and December 2007. Coefficients of significant covariates from the multivariate Cox regression model were used to compute individual survival scores to classify patients into three prognostic groups. A survival function was derived for each group via Kaplan-Meier estimation. Internal validation was performed. RESULTS Advanced age (hazard ratio, HR 1.43 (1.16-1.78)); poorly differentiated tumour (HR 2.72 (1.49-5.04)); metastasis to liver (HR 1.76 (1.33-2.33)), lung (HR 1.37 (1.10-1.71)) and bone (HR 2.08 ((1.16-3.71)); carcinomatosis (HR 1.68 (1.30-2.16)); hypoalbuminaemia (HR 1.30 (1.04-1.61) and elevated carcinoembryonic antigen levels (HR 1.89 (1.49-2.39)) significantly shorten post-operative survival. The scoring model separated patients into three prognostic groups with distinct median survival lengths of 4.8, 12.4 and 18.6 months (p < 0.0001). Internal validation revealed a concordance probability estimate of 0.65 and a time-dependent area under receiver operating curve of 0.75 at 6 months. Temporal split-sample validation implied good local generalizability to future patient populations (p < 0.0001). CONCLUSION Predicting survival following elective primary tumour resection using pre-operative variables has been demonstrated with the scoring model developed. Model-based survival prognostication can support clinical decisions on elective primary tumour resection eligibility.
Collapse
|
26
|
Weiß V, Schmidt M, Hellmich M. A novel nonparametric measure of explained variation for survival data with an easy graphical interpretation. GERMAN MEDICAL SCIENCE : GMS E-JOURNAL 2015; 13:Doc18. [PMID: 26550007 PMCID: PMC4633600 DOI: 10.3205/000222] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/08/2015] [Revised: 09/17/2015] [Indexed: 11/29/2022]
Abstract
INTRODUCTION For survival data the coefficient of determination cannot be used to describe how good a model fits to the data. Therefore, several measures of explained variation for survival data have been proposed in recent years. METHODS We analyse an existing measure of explained variation with regard to minimisation aspects and demonstrate that these are not fulfilled for the measure. RESULTS In analogy to the least squares method from linear regression analysis we develop a novel measure for categorical covariates which is based only on the Kaplan-Meier estimator. Hence, the novel measure is a completely nonparametric measure with an easy graphical interpretation. For the novel measure different weighting possibilities are available and a statistical test of significance can be performed. Eventually, we apply the novel measure and further measures of explained variation to a dataset comprising persons with a histopathological papillary thyroid carcinoma. CONCLUSION We propose a novel measure of explained variation with a comprehensible derivation as well as a graphical interpretation, which may be used in further analyses with survival data.
Collapse
Affiliation(s)
- Verena Weiß
- Institute of Medical Statistics, Informatics and Epidemiology, University of Cologne, Germany
| | | | - Martin Hellmich
- Institute of Medical Statistics, Informatics and Epidemiology, University of Cologne, Germany,*To whom correspondence should be addressed: Martin Hellmich, Institute of Medical Statistics, Informatics and Epidemiology, University of Cologne, Kerpener Straße 62, 50924 Köln, Germany, Phone: +49 221 478 6509/6501, Fax: +49 221 478 6520, E-mail:
| |
Collapse
|
27
|
Choodari-Oskooei B, Royston P, Parmar MKB. The extension of total gain (TG) statistic in survival models: properties and applications. BMC Med Res Methodol 2015; 15:50. [PMID: 26126418 PMCID: PMC4486698 DOI: 10.1186/s12874-015-0042-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2014] [Accepted: 06/12/2015] [Indexed: 01/19/2023] Open
Abstract
BACKGROUND The results of multivariable regression models are usually summarized in the form of parameter estimates for the covariates, goodness-of-fit statistics, and the relevant p-values. These statistics do not inform us about whether covariate information will lead to any substantial improvement in prediction. Predictive ability measures can be used for this purpose since they provide important information about the practical significance of prognostic factors. R (2)-type indices are the most familiar forms of such measures in survival models, but they all have limitations and none is widely used. METHODS In this paper, we extend the total gain (TG) measure, proposed for a logistic regression model, to survival models and explore its properties using simulations and real data. TG is based on the binary regression quantile plot, otherwise known as the predictiveness curve. Standardised TG ranges from 0 (no explanatory power) to 1 ('perfect' explanatory power). RESULTS The results of our simulations show that unlike many of the other R (2)-type predictive ability measures, TG is independent of random censoring. It increases as the effect of a covariate increases and can be applied to different types of survival models, including models with time-dependent covariate effects. We also apply TG to quantify the predictive ability of multivariable prognostic models developed in several disease areas. CONCLUSIONS Overall, TG performs well in our simulation studies and can be recommended as a measure to quantify the predictive ability in survival models.
Collapse
Affiliation(s)
| | - Patrick Royston
- MRC Clinical Trials Unit at UCL, Aviation House, 125 Kingsway, London, WC2B 6NH, UK.
| | - Mahesh K B Parmar
- MRC Clinical Trials Unit at UCL, Aviation House, 125 Kingsway, London, WC2B 6NH, UK.
| |
Collapse
|
28
|
Cho JK, Hyun SH, Choi N, Kim MJ, Padera TP, Choi JY, Jeong HS. Significance of lymph node metastasis in cancer dissemination of head and neck cancer. Transl Oncol 2015; 8:119-25. [PMID: 25926078 PMCID: PMC4415144 DOI: 10.1016/j.tranon.2015.03.001] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2014] [Revised: 03/05/2015] [Accepted: 03/09/2015] [Indexed: 01/13/2023] Open
Abstract
Lymph node metastasis (LNM) in many solid cancers is a well-known prognostic factor; however, it has been debated whether regional LNM simply reflects tumor aggressiveness or is a source for further tumor dissemination. Similarly, the metastatic process in head and neck cancer (HNC) has not been fully evaluated. Thus, we aimed to investigate the relative significance of LNM in metastatic cascade of HNC using functional imaging of HNC patients and molecular imaging in in vivo models. First, we analyzed 18Fluorodeoxyglucose positron emission tomography (PET) parameters of 117 patients with oral cancer. The primary tumor and nodal PET parameters were measured separately, and survival analyses were conducted on the basis of clinical and PET variables to identify significant prognostic factors. In multivariate analyses, we found that only the metastatic node PET values were significant. Next, we compared the relative frequency of lung metastasis in primary ear tumors versus lymph node (LN) tumors, and we tested the rate of lung metastasis in another animal model, in which each animal had both primary and LN tumors that were expressing different colors. As a result, LN tumors showed higher frequencies of lung metastasis compared to orthotopic primary tumors. In color-matched comparisons, the relative contribution to lung metastasis was higher in LN tumors than in primary tumors, although both primary and LN tumors caused lung metastases. In summary, tumors growing in the LN microenvironment spread to systemic sites more commonly than primary tumors in HNC, suggesting that the adequate management of LNM can reduce further systemic metastasis.
Collapse
Affiliation(s)
- Jae-Keun Cho
- Department of Otorhinolaryngology-Head and Neck Surgery, Pusan National University, Pusan, Korea
| | - Seung Hyup Hyun
- Department of Nuclear Medicine, Samsung Medical Center, Sungkyunkwan University School of Medicine, Seoul, Korea
| | - Nayeon Choi
- Department of Otorhinolaryngology-Head and Neck Surgery, Samsung Medical Center, Sungkyunkwan University School of Medicine, Seoul, Korea
| | - Min-Ji Kim
- Biostatistics and Clinical Epidemiology Center, Research Institute for Future Medicine, Samsung Medical Center, Seoul, Korea
| | - Timothy P Padera
- Edwin L. Steele Laboratory, Department of Radiation Oncology, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA
| | - Joon Young Choi
- Department of Nuclear Medicine, Samsung Medical Center, Sungkyunkwan University School of Medicine, Seoul, Korea.
| | - Han-Sin Jeong
- Department of Otorhinolaryngology-Head and Neck Surgery, Samsung Medical Center, Sungkyunkwan University School of Medicine, Seoul, Korea.
| |
Collapse
|
29
|
Boosting the concordance index for survival data--a unified framework to derive and evaluate biomarker combinations. PLoS One 2014; 9:e84483. [PMID: 24400093 PMCID: PMC3882229 DOI: 10.1371/journal.pone.0084483] [Citation(s) in RCA: 60] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2013] [Accepted: 11/14/2013] [Indexed: 11/30/2022] Open
Abstract
The development of molecular signatures for the prediction of time-to-event outcomes is a methodologically challenging task in bioinformatics and biostatistics. Although there are numerous approaches for the derivation of marker combinations and their evaluation, the underlying methodology often suffers from the problem that different optimization criteria are mixed during the feature selection, estimation and evaluation steps. This might result in marker combinations that are suboptimal regarding the evaluation criterion of interest. To address this issue, we propose a unified framework to derive and evaluate biomarker combinations. Our approach is based on the concordance index for time-to-event data, which is a non-parametric measure to quantify the discriminatory power of a prediction rule. Specifically, we propose a gradient boosting algorithm that results in linear biomarker combinations that are optimal with respect to a smoothed version of the concordance index. We investigate the performance of our algorithm in a large-scale simulation study and in two molecular data sets for the prediction of survival in breast cancer patients. Our numerical results show that the new approach is not only methodologically sound but can also lead to a higher discriminatory power than traditional approaches for the derivation of gene signatures.
Collapse
|
30
|
Hayashi K. <b>BIAS REDUCTION IN ESTIMATING A CONCORDANCE FOR </b><b>CENSORED TIME-TO-EVENT RESPONSES </b>. JOURNAL JAPANESE SOCIETY OF COMPUTATIONAL STATISTICS 2014. [DOI: 10.5183/jjscs.1312001_209] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
|
31
|
Abstract
Recent developments in molecular biology have led to the massive discovery of new marker candidates for the prediction of patient survival. To evaluate the predictive value of these markers, statistical tools for measuring the performance of survival models are needed. We consider estimators of discrimination measures, which are a popular approach to evaluate survival predictions in biomarker studies. Estimators of discrimination measures are usually based on regularity assumptions such as the proportional hazards assumption. Based on two sets of molecular data and a simulation study, we show that violations of the regularity assumptions may lead to over-optimistic estimates of prediction accuracy and may therefore result in biased conclusions regarding the clinical utility of new biomarkers. In particular, we demonstrate that biased medical decision making is possible even if statistical checks indicate that all regularity assumptions are satisfied.
Collapse
|