Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: McShane LM, Polley MYC. Development of omics-based clinical tests for prognosis and therapy selection: the challenge of achieving statistical robustness and clinical utility. Clin Trials 2013;10:653-65. [PMID: 24000377 DOI: 10.1177/1740774513499458] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]

For:	McShane LM, Polley MYC. Development of omics-based clinical tests for prognosis and therapy selection: the challenge of achieving statistical robustness and clinical utility. Clin Trials 2013;10:653-65. [PMID: 24000377 DOI: 10.1177/1740774513499458] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]

Number

Cited by Other Article(s)

Dong Y, Gottardo R. An approach for integrating multimodal omics data into sparse and interpretable models. CELL REPORTS METHODS 2024;4:100718. [PMID: 38412832 PMCID: PMC10921032 DOI: 10.1016/j.crmeth.2024.100718] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/05/2024] [Revised: 02/06/2024] [Accepted: 02/06/2024] [Indexed: 02/29/2024]

Rahnenführer J, De Bin R, Benner A, Ambrogi F, Lusa L, Boulesteix AL, Migliavacca E, Binder H, Michiels S, Sauerbrei W, McShane L. Statistical analysis of high-dimensional biomedical data: a gentle introduction to analytical goals, common approaches and challenges. BMC Med 2023;21:182. [PMID: 37189125 DOI: 10.1186/s12916-023-02858-y] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/28/2022] [Accepted: 04/03/2023] [Indexed: 05/17/2023] Open

Abstract

BACKGROUND

In high-dimensional data (HDD) settings, the number of variables associated with each observation is very large. Prominent examples of HDD in biomedical research include omics data with a large number of variables such as many measurements across the genome, proteome, or metabolome, as well as electronic health records data that have large numbers of variables recorded for each patient. The statistical analysis of such data requires knowledge and experience, sometimes of complex methods adapted to the respective research questions.

METHODS

Advances in statistical methodology and machine learning methods offer new opportunities for innovative analyses of HDD, but at the same time require a deeper understanding of some fundamental statistical concepts. Topic group TG9 "High-dimensional data" of the STRATOS (STRengthening Analytical Thinking for Observational Studies) initiative provides guidance for the analysis of observational studies, addressing particular statistical challenges and opportunities for the analysis of studies involving HDD. In this overview, we discuss key aspects of HDD analysis to provide a gentle introduction for non-statisticians and for classically trained statisticians with little experience specific to HDD.

RESULTS

The paper is organized with respect to subtopics that are most relevant for the analysis of HDD, in particular initial data analysis, exploratory data analysis, multiple testing, and prediction. For each subtopic, main analytical goals in HDD settings are outlined. For each of these goals, basic explanations for some commonly used analysis methods are provided. Situations are identified where traditional statistical methods cannot, or should not, be used in the HDD setting, or where adequate analytic tools are still lacking. Many key references are provided.

CONCLUSIONS

This review aims to provide a solid statistical foundation for researchers, including statisticians and non-statisticians, who are new to research with HDD or simply want to better evaluate and understand the results of HDD analyses.

Collapse

Xu Z, De A. Assessing model accuracy using random data split: a simulation study. J Biopharm Stat 2023;33:131-139. [PMID: 35730900 DOI: 10.1080/10543406.2022.2089158] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]

De A. Statistical Considerations and Challenges for Pivotal Clinical Studies of Artificial Intelligence Medical Tests for Widespread Use: Opportunities for Inter-Disciplinary Collaboration. Stat Biopharm Res 2023. [DOI: 10.1080/19466315.2023.2169752] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023]

Saeed RF, Awan UA, Saeed S, Mumtaz S, Akhtar N, Aslam S. Targeted Therapy and Personalized Medicine. Cancer Treat Res 2023;185:177-205. [PMID: 37306910 DOI: 10.1007/978-3-031-27156-4_10] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]

Grobe N, Scheiber J, Zhang H, Garbe C, Wang X. Omics and Artificial Intelligence in Kidney Diseases. ADVANCES IN KIDNEY DISEASE AND HEALTH 2023;30:47-52. [PMID: 36723282 DOI: 10.1053/j.akdh.2022.11.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/06/2022] [Revised: 10/28/2022] [Accepted: 11/16/2022] [Indexed: 01/20/2023]

Diakou I, Papakonstantinou E, Papageorgiou L, Pierouli K, Dragoumani K, Spandidos DA, Bacopoulou F, Chrousos GP, Goulielmos GΝ, Eliopoulos E, Vlachakis D. Multiple sclerosis and computational biology (Review). Biomed Rep 2022;17:96. [PMID: 36382258 PMCID: PMC9634047 DOI: 10.3892/br.2022.1579] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2022] [Accepted: 09/27/2022] [Indexed: 12/02/2022] Open

Affiliation(s)

Io Diakou Laboratory of Genetics, Department of Biotechnology, School of Applied Biology and Biotechnology, Agricultural University of Athens, 11855 Athens, Greece
Eleni Papakonstantinou Laboratory of Genetics, Department of Biotechnology, School of Applied Biology and Biotechnology, Agricultural University of Athens, 11855 Athens, Greece
Louis Papageorgiou Laboratory of Genetics, Department of Biotechnology, School of Applied Biology and Biotechnology, Agricultural University of Athens, 11855 Athens, Greece
Katerina Pierouli Laboratory of Genetics, Department of Biotechnology, School of Applied Biology and Biotechnology, Agricultural University of Athens, 11855 Athens, Greece
Konstantina Dragoumani Laboratory of Genetics, Department of Biotechnology, School of Applied Biology and Biotechnology, Agricultural University of Athens, 11855 Athens, Greece
Demetrios A. Spandidos Laboratory of Clinical Virology, School of Medicine, University of Crete, 71003 Heraklion, Greece
Flora Bacopoulou University Research Institute of Maternal and Child Health and Precision Medicine, and UNESCO Chair on Adolescent Health Care, National and Kapodistrian University of Athens, ‘Aghia Sophia’ Children's Hospital, 11527 Athens, Greece
George P. Chrousos University Research Institute of Maternal and Child Health and Precision Medicine, and UNESCO Chair on Adolescent Health Care, National and Kapodistrian University of Athens, ‘Aghia Sophia’ Children's Hospital, 11527 Athens, Greece
Georges Ν. Goulielmos Section of Molecular Pathology and Human Genetics, Department of Internal Medicine, School of Medicine, University of Crete, 71003 Heraklion, Greece
Elias Eliopoulos Laboratory of Genetics, Department of Biotechnology, School of Applied Biology and Biotechnology, Agricultural University of Athens, 11855 Athens, Greece
Dimitrios Vlachakis Laboratory of Genetics, Department of Biotechnology, School of Applied Biology and Biotechnology, Agricultural University of Athens, 11855 Athens, Greece University Research Institute of Maternal and Child Health and Precision Medicine, and UNESCO Chair on Adolescent Health Care, National and Kapodistrian University of Athens, ‘Aghia Sophia’ Children's Hospital, 11527 Athens, Greece Division of Endocrinology and Metabolism, Center of Clinical, Experimental Surgery and Translational Research, Biomedical Research Foundation of The Academy of Athens, 11527 Athens, Greece

Collapse

Gharipour M, Nezafati P, Sadeghian L, Eftekhari A, Rothenberg I, Jahanfar S. Precision medicine and metabolic syndrome. ARYA ATHEROSCLEROSIS 2022;18:1-10. [PMID: 36817343 PMCID: PMC9937665 DOI: 10.22122/arya.2022.26215] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Subscribe] [Scholar Register] [Received: 07/18/2021] [Accepted: 10/09/2021] [Indexed: 02/24/2023]

Dobbin KK, McShane LM. Sample size methods for evaluation of predictive biomarkers. Stat Med 2022;41:3199-3210. [DOI: 10.1002/sim.9412] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2021] [Revised: 03/24/2022] [Accepted: 04/02/2022] [Indexed: 11/09/2022]

Polley MYC, Dignam JJ. Statistical Considerations in the Evaluation of Continuous Biomarkers. J Nucl Med 2021;62:605-611. [PMID: 33579807 DOI: 10.2967/jnumed.120.251520] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2020] [Accepted: 01/19/2021] [Indexed: 01/02/2023] Open

Abstract

Discovery of biomarkers has been steadily increasing over the past decade. Although a plethora of biomarkers has been reported in the biomedical literature, few have been sufficiently validated for broader clinical applications. One particular challenge that may have hindered the adoption of biomarkers into practice is the lack of reproducible biomarker cut points. In this article, we attempt to identify some common statistical issues related to biomarker cut point identification and provide guidance on proper evaluation, interpretation, and validation of such cut points. First, we illustrate how discretization of a continuous biomarker using sample percentiles results in significant information loss and should be avoided. Second, we review the popular "minimal-P-value" approach for cut point identification and show that this method results in highly unstable P values and unduly increases the chance of significant findings when the biomarker is not associated with outcome. Third, we critically review a common analysis strategy by which the selected biomarker cut point is used to categorize patients into different risk categories and then the difference in survival curves among these risk groups in the same dataset is claimed as the evidence supporting the biomarker's prognostic strength. We show that this method yields an exaggerated P value and overestimates the prognostic impact of the biomarker. We illustrate that the degree of the optimistic bias increases with the number of variables being considered in a risk model. Finally, we discuss methods to appropriately ascertain the additional prognostic contribution of the new biomarker in disease settings where standard prognostic factors already exist. Throughout the article, we use real examples in oncology to highlight relevant methodologic issues, and when appropriate, we use simulations to illustrate more abstract statistical concepts.

Collapse

Pires JG, da Silva GF, Weyssow T, Conforte AJ, Pagnoncelli D, da Silva FAB, Carels N. Galaxy and MEAN Stack to Create a User-Friendly Workflow for the Rational Optimization of Cancer Chemotherapy. Front Genet 2021;12:624259. [PMID: 33679888 PMCID: PMC7935533 DOI: 10.3389/fgene.2021.624259] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2020] [Accepted: 01/22/2021] [Indexed: 12/24/2022] Open

Identification of robust reference genes for studies of gene expression in FFPE melanoma samples and melanoma cell lines. Melanoma Res 2020;30:26-38. [PMID: 31567589 PMCID: PMC6940030 DOI: 10.1097/cmr.0000000000000644] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/03/2022]

Challenges and Opportunities in Clinical Applications of Blood-Based Proteomics in Cancer. Cancers (Basel) 2020;12:cancers12092428. [PMID: 32867043 PMCID: PMC7564506 DOI: 10.3390/cancers12092428] [Citation(s) in RCA: 42] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2020] [Revised: 08/23/2020] [Accepted: 08/25/2020] [Indexed: 12/12/2022] Open

Abstract

Simple Summary

The traditional approach in identifying cancer related protein biomarkers has focused on evaluation of a single peptide/protein in tissue or circulation. At best, this approach has had limited success for clinical applications, since multiple pathological tumor pathways may be involved during initiation or progression of cancer which diminishes the significance of a single candidate protein/peptide. Emerging sensitive proteomic based technologies like liquid chromatography mass spectrometry (LC-MS)-based quantitative proteomics can provide a platform for evaluating serial serum or plasma samples to interrogate secreted products of tumor–host interactions, thereby revealing a more “complete” repertoire of biological variables encompassing heterogeneous tumor biology. However, several challenges need to be met for successful application of serum/plasma based proteomics. These include uniform pre-analyte processing of specimens, sensitive and specific proteomic analytical platforms and adequate attention to study design during discovery phase followed by validation of discovery-level signatures for prognostic, predictive, and diagnostic cancer biomarker applications.

Abstract

Blood is a readily accessible biofluid containing a plethora of important proteins, nucleic acids, and metabolites that can be used as clinical diagnostic tools in diseases, including cancer. Like the on-going efforts for cancer biomarker discovery using the liquid biopsy detection of circulating cell-free and cell-based tumor nucleic acids, the circulatory proteome has been underexplored for clinical cancer biomarker applications. A comprehensive proteome analysis of human serum/plasma with high-quality data and compelling interpretation can potentially provide opportunities for understanding disease mechanisms, although several challenges will have to be met. Serum/plasma proteome biomarkers are present in very low abundance, and there is high complexity involved due to the heterogeneity of cancers, for which there is a compelling need to develop sensitive and specific proteomic technologies and analytical platforms. To date, liquid chromatography mass spectrometry (LC-MS)-based quantitative proteomics has been a dominant analytical workflow to discover new potential cancer biomarkers in serum/plasma. This review will summarize the opportunities of serum proteomics for clinical applications; the challenges in the discovery of novel biomarkers in serum/plasma; and current proteomic strategies in cancer research for the application of serum/plasma proteomics for clinical prognostic, predictive, and diagnostic applications, as well as for monitoring minimal residual disease after treatments. We will highlight some of the recent advances in MS-based proteomics technologies with appropriate sample collection, processing uniformity, study design, and data analysis, focusing on how these integrated workflows can identify novel potential cancer biomarkers for clinical applications.

Collapse

Biomarker development for axial spondyloarthritis. Nat Rev Rheumatol 2020;16:448-463. [PMID: 32606474 DOI: 10.1038/s41584-020-0450-0] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/28/2020] [Indexed: 01/10/2023]

Gal J, Bailleux C, Chardin D, Pourcher T, Gilhodes J, Jing L, Guigonis JM, Ferrero JM, Milano G, Mograbi B, Brest P, Chateau Y, Humbert O, Chamorey E. Comparison of unsupervised machine-learning methods to identify metabolomic signatures in patients with localized breast cancer. Comput Struct Biotechnol J 2020;18:1509-1524. [PMID: 32637048 PMCID: PMC7327012 DOI: 10.1016/j.csbj.2020.05.021] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2020] [Revised: 05/15/2020] [Accepted: 05/16/2020] [Indexed: 02/08/2023] Open

Abstract

Genomics and transcriptomics have led to the widely-used molecular classification of breast cancer (BC). However, heterogeneous biological behaviors persist within breast cancer subtypes. Metabolomics is a rapidly-expanding field of study dedicated to cellular metabolisms affected by the environment. The aim of this study was to compare metabolomic signatures of BC obtained by 5 different unsupervised machine learning (ML) methods. Fifty-two consecutive patients with BC with an indication for adjuvant chemotherapy between 2013 and 2016 were retrospectively included. We performed metabolomic profiling of tumor resection samples using liquid chromatography-mass spectrometry. Here, four hundred and forty-nine identified metabolites were selected for further analysis. Clusters obtained using 5 unsupervised ML methods (PCA k-means, sparse k-means, spectral clustering, SIMLR and k-sparse) were compared in terms of clinical and biological characteristics. With an optimal partitioning parameter k = 3, the five methods identified three prognosis groups of patients (favorable, intermediate, unfavorable) with different clinical and biological profiles. SIMLR and K-sparse methods were the most effective techniques in terms of clustering. In-silico survival analysis revealed a significant difference for 5-year predicted OS between the 3 clusters. Further pathway analysis using the 449 selected metabolites showed significant differences in amino acid and glucose metabolism between BC histologic subtypes. Our results provide proof-of-concept for the use of unsupervised ML metabolomics enabling stratification and personalized management of BC patients. The design of novel computational methods incorporating ML and bioinformatics techniques should make available tools particularly suited to improving the outcome of cancer treatment and reducing cancer-related mortalities.

Collapse

Affiliation(s)

Jocelyn Gal University Côte d’Azur, Epidemiology and Biostatistics Department, Centre Antoine Lacassagne, Nice F-06189, France
Caroline Bailleux University Côte d’Azur, Medical Oncology Department Centre Antoine Lacassagne, Nice F-06189, France
David Chardin University Côte d’Azur, Nuclear Medicine Department, Centre Antoine Lacassagne, Nice F-06189, France University Côte d’Azur, Commissariat à l’Energie Atomique, Institut de Biosciences et Biotechnologies d'Aix-Marseille, Laboratory Transporters in Imaging and Radiotherapy in Oncology, Faculty of Medicine, Nice F-06100, France
Thierry Pourcher University Côte d’Azur, Commissariat à l’Energie Atomique, Institut de Biosciences et Biotechnologies d'Aix-Marseille, Laboratory Transporters in Imaging and Radiotherapy in Oncology, Faculty of Medicine, Nice F-06100, France
Julia Gilhodes Department of Biostatistics, Institut Claudius Regaud, IUCT-O Toulouse, France
Lun Jing University Côte d’Azur, Commissariat à l’Energie Atomique, Institut de Biosciences et Biotechnologies d'Aix-Marseille, Laboratory Transporters in Imaging and Radiotherapy in Oncology, Faculty of Medicine, Nice F-06100, France
Jean-Marie Guigonis University Côte d’Azur, Commissariat à l’Energie Atomique, Institut de Biosciences et Biotechnologies d'Aix-Marseille, Laboratory Transporters in Imaging and Radiotherapy in Oncology, Faculty of Medicine, Nice F-06100, France
Jean-Marc Ferrero University Côte d’Azur, Medical Oncology Department Centre Antoine Lacassagne, Nice F-06189, France
Gerard Milano University Côte d’Azur, Centre Antoine Lacassagne, Oncopharmacology Unit, Nice F-06189, France
Baharia Mograbi University Côte d’Azur, CNRS UMR7284, INSERM U1081, IRCAN TEAM4 Centre Antoine Lacassagne FHU-Oncoage, Nice F-06189, France
Patrick Brest University Côte d’Azur, CNRS UMR7284, INSERM U1081, IRCAN TEAM4 Centre Antoine Lacassagne FHU-Oncoage, Nice F-06189, France
Yann Chateau University Côte d’Azur, Epidemiology and Biostatistics Department, Centre Antoine Lacassagne, Nice F-06189, France
Olivier Humbert University Côte d’Azur, Nuclear Medicine Department, Centre Antoine Lacassagne, Nice F-06189, France University Côte d’Azur, Commissariat à l’Energie Atomique, Institut de Biosciences et Biotechnologies d'Aix-Marseille, Laboratory Transporters in Imaging and Radiotherapy in Oncology, Faculty of Medicine, Nice F-06100, France
Emmanuel Chamorey University Côte d’Azur, Epidemiology and Biostatistics Department, Centre Antoine Lacassagne, Nice F-06189, France

Collapse

Varela N, Lanas F, Salazar LA, Zambrano T. The Current State of MicroRNAs as Restenosis Biomarkers. Front Genet 2020;10:1247. [PMID: 31998354 PMCID: PMC6967329 DOI: 10.3389/fgene.2019.01247] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2019] [Accepted: 11/13/2019] [Indexed: 12/21/2022] Open

Sparano J, O'Neill A, Alpaugh K, Wolff AC, Northfelt DW, Dang CT, Sledge GW, Miller KD. Association of Circulating Tumor Cells With Late Recurrence of Estrogen Receptor-Positive Breast Cancer: A Secondary Analysis of a Randomized Clinical Trial. JAMA Oncol 2019;4:1700-1706. [PMID: 30054636 DOI: 10.1001/jamaoncol.2018.2574] [Citation(s) in RCA: 132] [Impact Index Per Article: 26.4] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]

Abstract

Importance

Late recurrence 5 or more years after diagnosis accounts for at least one-half of all cases of recurrent hormone receptor-positive breast cancer.

Objective

To determine whether the presence of circulating tumor cells (CTCs) in a peripheral blood sample obtained approximately 5 years after diagnosis was associated with late clinical recurrence of operable human epidermal growth factor receptor 2-negative breast cancer.

Design, Setting, and Participants

This per-protocol secondary analysis of the Double-Blind Phase III Trial of Doxorubicin and Cyclophosphamide Followed by Paclitaxel With Bevacizumab or Placebo in Patients With Lymph Node Positive and High Risk Lymph Node Negative Breast Cancer enrolled patients from 2007 to 2011 who were without clinical evidence of recurrence between 4.5 and 7.5 years after primary surgical treatment of human epidermal growth factor receptor 2-negative stage II-III breast cancer followed by adjuvant systemic therapy. Patients were enrolled in a subprotocol for secondary analysis from February 25, 2013, to July 29, 2016, after signing consent for the subprotocol. The analysis was performed in April 2018.

Interventions

A blood sample was obtained for identification and enumeration of CTCs.

Main Outcome and Measures

The association between a positive CTC assay result (at least 1 CTC per 7.5 mL of blood) and clinical recurrence.

Results

Among 547 women included in this analysis, the results of the CTC assay were positive for 18 of 353 with hormone receptor-positive disease (5.1% [95% CI, 3.0%-7.9%]); 23 of 353 patients (6.5% [95% CI, 4.2%-9.6%]) had a clinical recurrence. The recurrence rates per person-year of follow-up in the CTC-positive and CTC-negative groups were 21.4% (7 recurrences per 32.7 person-years) and 2.0% (16 recurrences per 796.3 person-years), respectively. In multivariate models including clinical covariates, a positive CTC assay result was associated with a 13.1-fold higher risk of recurrence (hazard ratio point estimate, 13.1; 95% CI, 4.7-36.3). Seven of 23 patients (30.4% [95% CI, 13.2%-52.9%]) with recurrence had a positive CTC assay result at a median of 2.8 years (range, 0.1-2.8 years) before clinical recurrence. The CTC assay result was also positive for 8 of 193 patients (4.1% [95% CI, 1.8%-8.0%]) with hormone receptor-negative disease, although only 1 patient (0.5% [95% CI, 0%-2.9%]) experienced disease recurrence (this patient was CTC negative).

Conclusions and Relevance

A single positive CTC assay result 5 years after diagnosis of hormone receptor-positive breast cancer provided independent prognostic information for late clinical recurrence, which provides proof of concept that liquid-based biomarkers may be used to risk stratify for late recurrence and guide therapy.

Trial Registration

ClinicalTrials.gov identifier: NCT00433511.

Collapse

Krzyszczyk P, Acevedo A, Davidoff EJ, Timmins LM, Marrero-Berrios I, Patel M, White C, Lowe C, Sherba JJ, Hartmanshenn C, O'Neill KM, Balter ML, Fritz ZR, Androulakis IP, Schloss RS, Yarmush ML. The growing role of precision and personalized medicine for cancer treatment. TECHNOLOGY 2018;6:79-100. [PMID: 30713991 PMCID: PMC6352312 DOI: 10.1142/s2339547818300020] [Citation(s) in RCA: 196] [Impact Index Per Article: 32.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/05/2023]

Affiliation(s)

Paulina Krzyszczyk Department of Biomedical Engineering, Rutgers University, 599 Taylor Road, Piscataway, NJ 08854, USA
Alison Acevedo Department of Biomedical Engineering, Rutgers University, 599 Taylor Road, Piscataway, NJ 08854, USA
Erika J Davidoff Department of Biomedical Engineering, Rutgers University, 599 Taylor Road, Piscataway, NJ 08854, USA
Lauren M Timmins Department of Biomedical Engineering, Rutgers University, 599 Taylor Road, Piscataway, NJ 08854, USA
Ileana Marrero-Berrios Department of Biomedical Engineering, Rutgers University, 599 Taylor Road, Piscataway, NJ 08854, USA
Misaal Patel Department of Biomedical Engineering, Rutgers University, 599 Taylor Road, Piscataway, NJ 08854, USA
Corina White Department of Biomedical Engineering, Rutgers University, 599 Taylor Road, Piscataway, NJ 08854, USA
Christopher Lowe Department of Biomedical Engineering, Rutgers University, 599 Taylor Road, Piscataway, NJ 08854, USA
Joseph J Sherba Department of Biomedical Engineering, Rutgers University, 599 Taylor Road, Piscataway, NJ 08854, USA
Clara Hartmanshenn Department of Chemical & Biochemical Engineering, Rutgers University, 98 Brett Road, Piscataway, NJ 08854, USA
Kate M O'Neill Department of Biomedical Engineering, Rutgers University, 599 Taylor Road, Piscataway, NJ 08854, USA
Max L Balter Department of Biomedical Engineering, Rutgers University, 599 Taylor Road, Piscataway, NJ 08854, USA
Zachary R Fritz Department of Biomedical Engineering, Rutgers University, 599 Taylor Road, Piscataway, NJ 08854, USA
Ioannis P Androulakis Department of Biomedical Engineering, Rutgers University, 599 Taylor Road, Piscataway, NJ 08854, USA Department of Chemical & Biochemical Engineering, Rutgers University, 98 Brett Road, Piscataway, NJ 08854, USA
Rene S Schloss Department of Biomedical Engineering, Rutgers University, 599 Taylor Road, Piscataway, NJ 08854, USA
Martin L Yarmush Department of Biomedical Engineering, Rutgers University, 599 Taylor Road, Piscataway, NJ 08854, USA Department of Chemical & Biochemical Engineering, Rutgers University, 98 Brett Road, Piscataway, NJ 08854, USA

Collapse

Cuzick J. Prognosis vs Treatment Interaction. JNCI Cancer Spectr 2018;2:pky006. [PMID: 31360838 PMCID: PMC6649762 DOI: 10.1093/jncics/pky006] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2018] [Revised: 02/12/2018] [Accepted: 02/21/2018] [Indexed: 11/18/2022] Open

CD8+ T cell infiltration in breast and colon cancer: A histologic and statistical analysis. PLoS One 2018;13:e0190158. [PMID: 29320521 PMCID: PMC5761898 DOI: 10.1371/journal.pone.0190158] [Citation(s) in RCA: 35] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2017] [Accepted: 12/08/2017] [Indexed: 12/27/2022] Open

Kang T, Ding W, Zhang L, Ziemek D, Zarringhalam K. A biological network-based regularized artificial neural network model for robust phenotype prediction from gene expression data. BMC Bioinformatics 2017;18:565. [PMID: 29258445 PMCID: PMC5735940 DOI: 10.1186/s12859-017-1984-2] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2017] [Accepted: 12/05/2017] [Indexed: 12/21/2022] Open

Abstract

BACKGROUND

Stratification of patient subpopulations that respond favorably to treatment or experience and adverse reaction is an essential step toward development of new personalized therapies and diagnostics. It is currently feasible to generate omic-scale biological measurements for all patients in a study, providing an opportunity for machine learning models to identify molecular markers for disease diagnosis and progression. However, the high variability of genetic background in human populations hampers the reproducibility of omic-scale markers. In this paper, we develop a biological network-based regularized artificial neural network model for prediction of phenotype from transcriptomic measurements in clinical trials. To improve model sparsity and the overall reproducibility of the model, we incorporate regularization for simultaneous shrinkage of gene sets based on active upstream regulatory mechanisms into the model.

RESULTS

We benchmark our method against various regression, support vector machines and artificial neural network models and demonstrate the ability of our method in predicting the clinical outcomes using clinical trial data on acute rejection in kidney transplantation and response to Infliximab in ulcerative colitis. We show that integration of prior biological knowledge into the classification as developed in this paper, significantly improves the robustness and generalizability of predictions to independent datasets. We provide a Java code of our algorithm along with a parsed version of the STRING DB database.

CONCLUSION

In summary, we present a method for prediction of clinical phenotypes using baseline genome-wide expression data that makes use of prior biological knowledge on gene-regulatory interactions in order to increase robustness and reproducibility of omic-scale markers. The integrated group-wise regularization methods increases the interpretability of biological signatures and gives stable performance estimates across independent test sets.

Collapse

Robles AI, Harris CC. Integration of multiple "OMIC" biomarkers: A precision medicine strategy for lung cancer. Lung Cancer 2017;107:50-58. [PMID: 27344275 PMCID: PMC5156586 DOI: 10.1016/j.lungcan.2016.06.003] [Citation(s) in RCA: 38] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2016] [Revised: 06/07/2016] [Accepted: 06/10/2016] [Indexed: 12/17/2022]

Abstract

More than half of all new lung cancer diagnoses are made in patients with locally advanced or metastatic disease, at which point therapeutic options are scarce. It is anticipated, however, that the widespread use of Low-Dose Computed Tomography (LDCT) screening, will lead to a greater proportion of lung cancers being diagnosed at an early, operable, stage. Still, the overall rate of recurrence for surgically treated Stage I lung cancer patients is up to 30% within 5 years of diagnosis. Thus, the identification and clinical application of biomarkers of early stage lung cancer are a pressing medical need. The integrative analysis of "omic," clinical and epidemiological data for single patients is a core principle of precision medicine. Through rigorous bioinformatics and statistical analyses we have identified biomarkers of early-stage lung cancer based on DNA methylation, expression of mRNA and miRNA, inflammatory cytokines, and urinary metabolites. Beyond a more comprehensive understanding of the molecular taxonomy of lung cancer, these biomarkers can have very practical implications in the context of unmet clinical needs of early stage lung cancer patients: First, current guidelines for LDCT screening broadly include individuals based on age and history of heavy smoking. Tumor-derived circulating biomarkers in the blood and urine associated with lung cancer risk could narrow and prioritize individuals for LDCT screening. Second, a high number of nodules are identified by LDCT, of which fewer than 5% are finally diagnosed as lung cancer. Biomarkers may help discriminate malignant nodules from benign or indolent lesions. Third, the expected rise in the numbers of lung cancer patients diagnosed at an early stage will necessitate new treatment options. Circulating, urinary and tissue-based biomarkers that molecularly categorize Stage I patients after tumor resection can help identify high-risk patients who may benefit from adjuvant chemotherapy or innovative immunotherapy regimens.

Collapse

Quezada H, Guzmán-Ortiz AL, Díaz-Sánchez H, Valle-Rios R, Aguirre-Hernández J. Omics-based biomarkers: current status and potential use in the clinic. ACTA ACUST UNITED AC 2017. [DOI: 10.1016/j.bmhime.2017.11.030] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]

Omics-based biomarkers: current status and potential use in the clinic. BOLETIN MEDICO DEL HOSPITAL INFANTIL DE MEXICO 2017;74:219-226. [DOI: 10.1016/j.bmhimx.2017.03.003] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2017] [Accepted: 03/17/2017] [Indexed: 12/20/2022] Open

Delmar P, Irl C, Tian L. Innovative methods for the identification of predictive biomarker signatures in oncology: Application to bevacizumab. Contemp Clin Trials Commun 2017;5:107-115. [PMID: 29740627 PMCID: PMC5936698 DOI: 10.1016/j.conctc.2017.01.007] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2015] [Revised: 12/06/2016] [Accepted: 01/17/2017] [Indexed: 11/26/2022] Open

Abstract

Current methods for subgroup analyses of data collected from randomized clinical trials (RCTs) may lead to false-positives from multiple testing, lack power to detect moderate but clinically meaningful differences, or be too simplistic in characterizing patients who may benefit from treatment. Herein, we present a general procedure based on a set of newly developed statistical methods for the identification and evaluation of complex multivariate predictors of treatment effect. Furthermore, we implemented this procedure to identify a subgroup of patients who may receive the largest benefit from bevacizumab treatment using a panel of 10 biomarkers measured at baseline in patients enrolled on two RCTs investigating bevacizumab in metastatic breast cancer. Data were collected from patients with human epidermal growth factor receptor 2 (HER2)-negative (AVADO) and HER2-positive (AVEREL) metastatic breast cancer. We first developed a classification rule based on an estimated individual scoring system, using data from the AVADO study only. The classification rule takes into consideration a panel of biomarkers, including vascular endothelial growth factor (VEGF)-A. We then classified the patients in the independent AVEREL study into patient groups according to “promising” or “not-promising” treatment benefit based on this rule and conducted a statistical analysis within these subgroups to compute point estimates, confidence intervals, and p-values for treatment effect and its interaction. In the group with promising treatment benefit in the AVEREL study, the estimated hazard ratio of bevacizumab versus placebo for progression-free survival was 0.687 (95% confidence interval [CI]: 0.462–1.024, p = 0.065), while in the not-promising group the hazard ratio (HR) was 1.152 (95% CI: 0.526–2.524, p = 0.723). Using the median level of VEGF-A from the AVEREL study to divide the study population, then the HR becomes 0.711 (95% CI: 0.435–1.163, p = 0.174) in the promising group and 0.828 (95% CI: 0.496–1.380, p = 0.468) in the not-promising group. Similar results were obtained with the median VEGF-A levels from the AVADO study (“promising” group: HR = 0.709, 95%CI: 0.444–1.133, p = 0.151; “not-promising” group: HR = 0.851, 95% CI: 0.497–1.458, p = 0.556). Our analysis shows it is feasible to employ statistical methods for empirically constructing and validating a scoring system based on a panel of biomarkers. This scoring system can be used to estimate the treatment effect for individual patients and identify a subgroup of patients who may benefit from treatment. The proposed procedure can provide a general framework to organize many statistical methods (existing or to be developed) into a coherent set of analyses for the development of personalized medicines and has the potential of broad applications.

Collapse

Rankin NJ, Preiss D, Welsh P, Sattar N. Applying metabolomics to cardiometabolic intervention studies and trials: past experiences and a roadmap for the future. Int J Epidemiol 2016;45:1351-1371. [PMID: 27789671 PMCID: PMC5100629 DOI: 10.1093/ije/dyw271] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/01/2016] [Indexed: 12/22/2022] Open

Abstract

Metabolomics and lipidomics are emerging methods for detailed phenotyping of small molecules in samples. It is hoped that such data will: (i) enhance baseline prediction of patient response to pharmacotherapies (beneficial or adverse); (ii) reveal changes in metabolites shortly after initiation of therapy that may predict patient response, including adverse effects, before routine biomarkers are altered; and( iii) give new insights into mechanisms of drug action, particularly where the results of a trial of a new agent were unexpected, and thus help future drug development. In these ways, metabolomics could enhance research findings from intervention studies. This narrative review provides an overview of metabolomics and lipidomics in early clinical intervention studies for investigation of mechanisms of drug action and prediction of drug response (both desired and undesired). We highlight early examples from drug intervention studies associated with cardiometabolic disease. Despite the strengths of such studies, particularly the use of state-of-the-art technologies and advanced statistical methods, currently published studies in the metabolomics arena are largely underpowered and should be considered as hypothesis-generating. In order for metabolomics to meaningfully improve stratified medicine approaches to patient treatment, there is a need for higher quality studies, with better exploitation of biobanks from randomized clinical trials i.e. with large sample size, adjudicated outcomes, standardized procedures, validation cohorts, comparison witth routine biochemistry and both active and control/placebo arms. On the basis of this review, and based on our research experience using clinically established biomarkers, we propose steps to more speedily advance this area of research towards potential clinical impact.

Collapse

Korenkova V, Slyskova J, Novosadova V, Pizzamiglio S, Langerova L, Bjorkman J, Vycital O, Liska V, Levy M, Veskrna K, Vodicka P, Vodickova L, Kubista M, Verderio P. The focus on sample quality: Influence of colon tissue collection on reliability of qPCR data. Sci Rep 2016;6:29023. [PMID: 27383461 PMCID: PMC4935944 DOI: 10.1038/srep29023] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2016] [Accepted: 06/14/2016] [Indexed: 01/12/2023] Open

Affiliation(s)

Vlasta Korenkova Institute of Biotechnology, BIOCEV Centre, Czech Academy of Sciences, Průmyslová 595, 252 42, Vestec u Prahy, Czech Republic
Jana Slyskova Institute of Experimental Medicine, Czech Academy of Sciences, Prague, Czech Republic
Vendula Novosadova Institute of Biotechnology, BIOCEV Centre, Czech Academy of Sciences, Průmyslová 595, 252 42, Vestec u Prahy, Czech Republic
Sara Pizzamiglio Unit of Medical Statistics, Biometry and Bioinformatics, Fondazione Istituto di Ricovero e Cura a Carattere Scientifico (IRCCS) Istituto Nazionale dei Tumori, Milan, Italy
Lucie Langerova Institute of Biotechnology, BIOCEV Centre, Czech Academy of Sciences, Průmyslová 595, 252 42, Vestec u Prahy, Czech Republic
Jens Bjorkman TATAA Biocenter AB, Göteborg, Sweden
Ondrej Vycital Deparment of Surgery, Teaching Hospital and Medical School Pilsen, Charles University in Prague, Pilsen, Czech Republic.,Biomedical Centre, Medical School Pilsen, Charles University in Prague, Pilsen, Czech Republic
Vaclav Liska Deparment of Surgery, Teaching Hospital and Medical School Pilsen, Charles University in Prague, Pilsen, Czech Republic.,Biomedical Centre, Medical School Pilsen, Charles University in Prague, Pilsen, Czech Republic
Miroslav Levy Surgical Department, Thomayer Hospital, First Faculty of Medicine, Charles University in Prague, Prague, Czech Republic
Karel Veskrna Surgical Department, Thomayer Hospital, First Faculty of Medicine, Charles University in Prague, Prague, Czech Republic
Pavel Vodicka Institute of Experimental Medicine, Czech Academy of Sciences, Prague, Czech Republic.,Biomedical Centre, Medical School Pilsen, Charles University in Prague, Pilsen, Czech Republic.,Institute of Biology and Medical Genetics, First Faculty of Medicine, Charles University in Prague, Prague, Czech Republic
Ludmila Vodickova Institute of Experimental Medicine, Czech Academy of Sciences, Prague, Czech Republic.,Biomedical Centre, Medical School Pilsen, Charles University in Prague, Pilsen, Czech Republic.,Institute of Biology and Medical Genetics, First Faculty of Medicine, Charles University in Prague, Prague, Czech Republic
Mikael Kubista Institute of Biotechnology, BIOCEV Centre, Czech Academy of Sciences, Průmyslová 595, 252 42, Vestec u Prahy, Czech Republic.,TATAA Biocenter AB, Göteborg, Sweden
Paolo Verderio Unit of Medical Statistics, Biometry and Bioinformatics, Fondazione Istituto di Ricovero e Cura a Carattere Scientifico (IRCCS) Istituto Nazionale dei Tumori, Milan, Italy

Collapse

Lerner SP, Bajorin DF, Dinney CP, Efstathiou JA, Groshen S, Hahn NM, Hansel D, Kwiatkowski D, O’Donnell M, Rosenberg J, Svatek R, Abrams JS, Al-Ahmadie H, Apolo AB, Bellmunt J, Callahan M, Cha EK, Drake C, Jarow J, Kamat A, Kim W, Knowles M, Mann B, Marchionni L, McConkey D, McShane L, Ramirez N, Sharabi A, Sharpe AH, Solit D, Tangen CM, Amiri AT, Van Allen E, West PJ, Witjes JA, Quale DZ. Summary and Recommendations from the National Cancer Institute's Clinical Trials Planning Meeting on Novel Therapeutics for Non-Muscle Invasive Bladder Cancer. Bladder Cancer 2016;2:165-202. [PMID: 27376138 PMCID: PMC4927845 DOI: 10.3233/blc-160053] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]

Abstract

The NCI Bladder Cancer Task Force convened a Clinical Trials Planning Meeting (CTPM) Workshop focused on Novel Therapeutics for Non-Muscle Invasive Bladder Cancer (NMIBC). Meeting attendees included a broad and multi-disciplinary group of clinical and research stakeholders and included leaders from NCI, FDA, National Clinical Trials Network (NCTN), advocacy and the pharmaceutical and biotech industry. The meeting goals and objectives were to: 1) create a collaborative environment in which the greater bladder research community can pursue future optimally designed novel clinical trials focused on the theme of molecular targeted and immune-based therapies in NMIBC; 2) frame the clinical and translational questions that are of highest priority; and 3) develop two clinical trial designs focusing on immunotherapy and molecular targeted therapy. Despite successful development and implementation of large Phase II and Phase III trials in bladder and upper urinary tract cancers, there are no active and accruing trials in the NMIBC space within the NCTN. Disappointingly, there has been only one new FDA approved drug (Valrubicin) in any bladder cancer disease state since 1998. Although genomic-based data for bladder cancer are increasingly available, translating these discoveries into practice changing treatment is still to come. Recently, major efforts in defining the genomic characteristics of NMIBC have been achieved. Aligned with these data is the growing number of targeted therapy agents approved and/or in development in other organ site cancers and the multiple similarities of bladder cancer with molecular subtypes in these other cancers. Additionally, although bladder cancer is one of the more immunogenic tumors, some tumors have the ability to attenuate or eliminate host immune responses. Two trial concepts emerged from the meeting including a window of opportunity trial (Phase 0) testing an FGFR3 inhibitor and a second multi-arm multi-stage trial testing combinations of BCG or radiotherapy and immunomodulatory agents in patients who recur after induction BCG (BCG failure).

Collapse

Affiliation(s)

Seth P. Lerner Baylor College of Medicine, Houston, TX, USA
Dean F. Bajorin Memorial Sloan Kettering Cancer Center, New York, NY, USA Weill Medical College of Cornell University, New York, NY, USA
Colin P. Dinney The University of Texas MD Anderson Cancer Center, Houston, TX, USA
Jason A. Efstathiou Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
Susan Groshen USC Norris Comprehensive Cancer Center, University of Southern California, Los Angeles, CA, USA
Noah M. Hahn Johns Hopkins Sidney Kimmel Comprehensive Cancer Center, Baltimore, MD, USA
Donna Hansel University of California, La Jolla, San Diego, CA, USA
David Kwiatkowski Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, USA
Michael O’Donnell The University of Iowa, IA, USA
Jonathan Rosenberg Memorial Sloan Kettering Cancer Center, New York, NY, USA Weill Medical College of Cornell University, New York, NY, USA
Robert Svatek UT Health Science Center San Antonio, San Antonio, TX, USA
Jeffrey S. Abrams Cancer Therapy Evaluation Program, Division of Cancer Treatment and Diagnosis, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
Hikmat Al-Ahmadie Memorial Sloan Kettering Cancer Center, New York, NY, USA
Andrea B. Apolo Genitourinary Malignancies Branch, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
Joaquim Bellmunt Dana-Farber Cancer Institute and Harvard Medical School, Boston, MA, USA
Margaret Callahan Memorial Sloan Kettering Cancer Center, New York, NY, USA Weill Medical College of Cornell University, New York, NY, USA
Eugene K. Cha Memorial Sloan Kettering Cancer Center, New York, NY, USA
Charles Drake Johns Hopkins Sidney Kimmel Comprehensive Cancer Center, Baltimore, MD, USA
Jonathan Jarow Office of Hematology and Oncology Products, U.S. Food and Drug Administration, Silver Spring, MD, USA
Ashish Kamat The University of Texas MD Anderson Cancer Center, Houston, TX, USA
William Kim University of North Carolina Lineberger Comprehensive Cancer Center, Chapel Hill, NC, USA
Margaret Knowles Leeds Institute of Cancer and Pathology, University of Leeds, Leeds, UK
Bhupinder Mann Cancer Therapy Evaluation Program, Division of Cancer Treatment and Diagnosis, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
Luigi Marchionni Johns Hopkins Sidney Kimmel Comprehensive Cancer Center, Baltimore, MD, USA
David McConkey The University of Texas MD Anderson Cancer Center, Houston, TX, USA
Lisa McShane Biometric Research Branch, Division of Cancer Treatment and Diagnosis, National Cancer Institute, Bethesda, MD, USA
Nilsa Ramirez The Research Institute at Nationwide Children’s Hospital, Columbus, OH, USA
Andrew Sharabi USC Norris Comprehensive Cancer Center, University of Southern California, Los Angeles, CA, USA Johns Hopkins Sidney Kimmel Comprehensive Cancer Center, Baltimore, MD, USA
Arlene H. Sharpe Dana-Farber Cancer Institute and Harvard Medical School, Boston, MA, USA
David Solit Memorial Sloan Kettering Cancer Center, New York, NY, USA Weill Medical College of Cornell University, New York, NY, USA
Catherine M. Tangen SWOG Statistical Center, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
Abdul Tawab Amiri Baylor College of Medicine, Houston, TX, USA
Eliezer Van Allen Dana-Farber Cancer Institute and Harvard Medical School, Boston, MA, USA
Pamela J. West The Emmes Corporation, Rockville, MD, USA
J. A. Witjes Department of Urology, Radboud UMC, Nijmegen, The Netherlands
Diane Zipursky Quale Bladder Cancer Advocacy Network, Bethesda, MD, USA

Collapse

Kim S, Lin CW, Tseng GC. MetaKTSP: a meta-analytic top scoring pair method for robust cross-study validation of omics prediction analysis. Bioinformatics 2016;32:1966-73. [PMID: 27153719 DOI: 10.1093/bioinformatics/btw115] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2015] [Accepted: 02/19/2016] [Indexed: 01/08/2023] Open

Abstract

MOTIVATION

Supervised machine learning is widely applied to transcriptomic data to predict disease diagnosis, prognosis or survival. Robust and interpretable classifiers with high accuracy are usually favored for their clinical and translational potential. The top scoring pair (TSP) algorithm is an example that applies a simple rank-based algorithm to identify rank-altered gene pairs for classifier construction. Although many classification methods perform well in cross-validation of single expression profile, the performance usually greatly reduces in cross-study validation (i.e. the prediction model is established in the training study and applied to an independent test study) for all machine learning methods, including TSP. The failure of cross-study validation has largely diminished the potential translational and clinical values of the models. The purpose of this article is to develop a meta-analytic top scoring pair (MetaKTSP) framework that combines multiple transcriptomic studies and generates a robust prediction model applicable to independent test studies.

RESULTS

We proposed two frameworks, by averaging TSP scores or by combining P-values from individual studies, to select the top gene pairs for model construction. We applied the proposed methods in simulated data sets and three large-scale real applications in breast cancer, idiopathic pulmonary fibrosis and pan-cancer methylation. The result showed superior performance of cross-study validation accuracy and biomarker selection for the new meta-analytic framework. In conclusion, combining multiple omics data sets in the public domain increases robustness and accuracy of the classification model that will ultimately improve disease understanding and clinical treatment decisions to benefit patients.

AVAILABILITY AND IMPLEMENTATION

An R package MetaKTSP is available online. (http://tsenglab.biostat.pitt.edu/software.htm).

CONTACT

ctseng@pitt.edu

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

Collapse

Tang R, Pennello G. Validation of Prognostic Marker Tests: Statistical Lessons Learned From Regulatory Experience. Ther Innov Regul Sci 2016;50:241-252. [DOI: 10.1177/2168479015601721] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]