1
|
Jopek MA, Pastuszak K, Sieczczyński M, Cygert S, Żaczek AJ, Rondina MT, Supernat A. Improving platelet-RNA-based diagnostics: a comparative analysis of machine learning models for cancer detection and multiclass classification. Mol Oncol 2024. [PMID: 38887841 DOI: 10.1002/1878-0261.13689] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2023] [Revised: 05/15/2024] [Accepted: 06/05/2024] [Indexed: 06/20/2024] Open
Abstract
Liquid biopsy demonstrates excellent potential in patient management by providing a minimally invasive and cost-effective approach to detecting and monitoring cancer, even at its early stages. Due to the complexity of liquid biopsy data, machine-learning techniques are increasingly gaining attention in sample analysis, especially for multidimensional data such as RNA expression profiles. Yet, there is no agreement in the community on which methods are the most effective or how to process the data. To circumvent this, we performed a large-scale study using various machine-learning techniques. First, we took a closer look at existing datasets and filtered out some patients to assert data collection quality. The final data collection included platelet RNA samples acquired from 1397 cancer patients (17 types of cancer) and 354 asymptomatic, presumed healthy, donors. Then, we assessed an array of different machine-learning models and techniques (e.g., feature selection of RNA transcripts) in pan-cancer detection and multiclass classification. Our results show that simple logistic regression performs the best, reaching a 68% cancer detection rate at a 99% specificity level, and multiclass classification accuracy of 79.38% when distinguishing between five cancer types. In summary, by revisiting classical machine-learning models, we have exceeded the previously used method by 5% and 9.65% in cancer detection and multiclass classification, respectively. To ease further research, we open-source our code and data processing pipelines (https://gitlab.com/jopekmaksym/improving-platelet-rna-based-diagnostics), which we hope will serve the community as a strong baseline.
Collapse
Affiliation(s)
- Maksym A Jopek
- Laboratory of Translational Oncology, Intercollegiate Faculty of Biotechnology of the University of Gdańsk and the Medical University of Gdańsk, Poland
- Centre of Biostatistics and Bioinformatics, Medical University of Gdańsk, Poland
| | - Krzysztof Pastuszak
- Laboratory of Translational Oncology, Intercollegiate Faculty of Biotechnology of the University of Gdańsk and the Medical University of Gdańsk, Poland
- Centre of Biostatistics and Bioinformatics, Medical University of Gdańsk, Poland
- Department of Algorithms and Systems Modelling, Faculty of Electronics, Telecommunications and Informatics, Gdańsk University of Technology, Poland
| | - Michał Sieczczyński
- Laboratory of Translational Oncology, Intercollegiate Faculty of Biotechnology of the University of Gdańsk and the Medical University of Gdańsk, Poland
- Centre of Biostatistics and Bioinformatics, Medical University of Gdańsk, Poland
| | - Sebastian Cygert
- Department of Multimedia Systems, Faculty of Electronics, Telecommunications and Informatics, Gdańsk University of Technology, Poland
- Ideas, NCBR, Warsaw, Poland
| | - Anna J Żaczek
- Laboratory of Translational Oncology, Intercollegiate Faculty of Biotechnology of the University of Gdańsk and the Medical University of Gdańsk, Poland
| | - Matthew T Rondina
- Molecular Medicine Program, University of Utah, Salt Lake City, UT, USA
- George E. Wahlen Veterans Affairs Medical Center Department of Internal Medicine and the Geriatric Research Education and Clinical Center (GRECC), Salt Lake City, UT, USA
- Department of Pathology, University of Utah, Salt Lake City, UT, USA
- Division of General Internal Medicine, Department of Internal Medicine, University of Utah, Salt Lake City, UT, USA
| | - Anna Supernat
- Laboratory of Translational Oncology, Intercollegiate Faculty of Biotechnology of the University of Gdańsk and the Medical University of Gdańsk, Poland
- Centre of Biostatistics and Bioinformatics, Medical University of Gdańsk, Poland
| |
Collapse
|
2
|
van der Leest P, Schuuring E. Critical Factors in the Analytical Work Flow of Circulating Tumor DNA-Based Molecular Profiling. Clin Chem 2024; 70:220-233. [PMID: 38175597 DOI: 10.1093/clinchem/hvad194] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2023] [Accepted: 10/30/2023] [Indexed: 01/05/2024]
Abstract
BACKGROUND Liquid biopsy testing, especially molecular tumor profiling of circulating tumor DNA (ctDNA) in cell-free plasma, has received increasing interest in recent years as it serves as a reliable alternative for the detection of tumor-specific aberrations to guide treatment decision-making in oncology. Many (commercially available) applications have been developed, however, broad divergences in (pre)analytical work flows and lack of universally applied guidelines impede routine clinical implementation. In this review, critical factors in the blood-based ctDNA liquid biopsy work flow are evaluated. CONTENT In the preanalytical phase, several aspects (e.g., blood collection tubes [BCTs], plasma processing, and extraction method) affect the quantity and quality of the circulating cell-free DNA (ccfDNA) applicable for subsequent molecular analyses and should meet certain standards to be applied in diagnostic work flows. Analytical considerations, such as analytical input and choice of assay, might vary based on the clinical application (i.e., screening, primary diagnosis, minimal residual disease [MRD], response monitoring, and resistance identification). In addition to practical procedures, variant interpretation and reporting ctDNA results should be harmonized. Collaborative efforts in (inter)national consortia and societies are essential for the establishment of standard operating procedures (SOPs) in attempts to standardize the plasma-based ctDNA analysis work flow. SUMMARY Development of universally applicable guidelines regarding the critical factors in liquid biopsy testing are necessary to pave the way to clinical implementation for routine diagnostics.
Collapse
Affiliation(s)
- Paul van der Leest
- Department of Pathology, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands
- Department of Laboratory Medicine, Netherlands Cancer Institute, Amsterdam, the Netherlands
| | - Ed Schuuring
- Department of Pathology, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands
| |
Collapse
|
3
|
Jiang L, Yang H, Cheng W, Ni Z, Xiang N. Droplet microfluidics for CTC-based liquid biopsy: a review. Analyst 2023; 148:203-221. [PMID: 36508171 DOI: 10.1039/d2an01747d] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
Abstract
Circulating tumor cells (CTCs) are important biomarkers of liquid biopsy. The number and heterogeneity of CTCs play an important role in cancer diagnosis and personalized medicine. However, owing to the low-abundance biomarkers of CTCs, conventional assays are only able to detect CTCs at the population level. Therefore, there is a pressing need for a highly sensitive method to analyze CTCs at the single-cell level. As an important branch of microfluidics, droplet microfluidics is a high-throughput and sensitive single-cell analysis platform for the quantitative detection and heterogeneity analysis of CTCs. In this review, we focus on the quantitative detection and heterogeneity analysis of CTCs using droplet microfluidics. Technologies that enable droplet microfluidics, particularly high-throughput droplet generation and high-efficiency droplet manipulation, are first discussed. Then, recent advances in detecting and analyzing CTCs using droplet microfluidics from the different aspects of nucleic acids, proteins, and metabolites are introduced. The purpose of this review is to provide guidance for the continued study of droplet microfluidics for CTC-based liquid biopsy.
Collapse
Affiliation(s)
- Lin Jiang
- School of Mechanical Engineering, and Jiangsu Key Laboratory for Design and Manufacture of Micro-Nano Biomedical Instruments, Southeast University, Nanjing, 211189, China.
| | - Hang Yang
- School of Mechanical Engineering, and Jiangsu Key Laboratory for Design and Manufacture of Micro-Nano Biomedical Instruments, Southeast University, Nanjing, 211189, China.
| | - Weiqi Cheng
- School of Mechanical Engineering, and Jiangsu Key Laboratory for Design and Manufacture of Micro-Nano Biomedical Instruments, Southeast University, Nanjing, 211189, China.
| | - Zhonghua Ni
- School of Mechanical Engineering, and Jiangsu Key Laboratory for Design and Manufacture of Micro-Nano Biomedical Instruments, Southeast University, Nanjing, 211189, China.
| | - Nan Xiang
- School of Mechanical Engineering, and Jiangsu Key Laboratory for Design and Manufacture of Micro-Nano Biomedical Instruments, Southeast University, Nanjing, 211189, China.
| |
Collapse
|
4
|
Drandi D, Decruyenaere P, Ferrante M, Offner F, Vandesompele J, Ferrero S. Nucleic Acid Biomarkers in Waldenström Macroglobulinemia and IgM-MGUS: Current Insights and Clinical Relevance. Diagnostics (Basel) 2022; 12:diagnostics12040969. [PMID: 35454017 PMCID: PMC9028641 DOI: 10.3390/diagnostics12040969] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2022] [Revised: 04/07/2022] [Accepted: 04/09/2022] [Indexed: 12/13/2022] Open
Abstract
Waldenström Macroglobulinemia (WM) is an indolent lymphoplasmacytic lymphoma, characterized by the production of excess immunoglobulin M monoclonal protein. WM belongs to the spectrum of IgM gammopathies, ranging from asymptomatic IgM monoclonal gammopathy of undetermined significance (IgM-MGUS), through IgM-related disorders and asymptomatic WM to symptomatic WM. In recent years, its complex genomic and transcriptomic landscape has been extensively explored, hereby elucidating the biological mechanisms underlying disease onset, progression and therapy response. An increasing number of mutations, cytogenetic abnormalities, and molecular signatures have been described that have diagnostic, phenotype defining or prognostic implications. Moreover, cell-free nucleic acid biomarkers are increasingly being investigated, benefiting the patient in a minimally invasive way. This review aims to provide an extensive overview of molecular biomarkers in WM and IgM-MGUS, considering current shortcomings, as well as potential future applications in a precision medicine approach.
Collapse
Affiliation(s)
- Daniela Drandi
- Department of Molecular Biotechnology and Health Sciences, Hematology Division, University of Torino, 10126 Torino, Italy; (M.F.); (S.F.)
- Correspondence: (D.D.); (P.D.)
| | - Philippe Decruyenaere
- Department of Hematology, Ghent University Hospital, 9000 Ghent, Belgium;
- OncoRNALab, Cancer Research Institute Ghent (CRIG), 9000 Ghent, Belgium;
- Department of Biomolecular Medicine, Ghent University, 9000 Ghent, Belgium
- Correspondence: (D.D.); (P.D.)
| | - Martina Ferrante
- Department of Molecular Biotechnology and Health Sciences, Hematology Division, University of Torino, 10126 Torino, Italy; (M.F.); (S.F.)
| | - Fritz Offner
- Department of Hematology, Ghent University Hospital, 9000 Ghent, Belgium;
| | - Jo Vandesompele
- OncoRNALab, Cancer Research Institute Ghent (CRIG), 9000 Ghent, Belgium;
- Department of Biomolecular Medicine, Ghent University, 9000 Ghent, Belgium
| | - Simone Ferrero
- Department of Molecular Biotechnology and Health Sciences, Hematology Division, University of Torino, 10126 Torino, Italy; (M.F.); (S.F.)
| |
Collapse
|