1
|
Hamidi F, Gilani N, Arabi Belaghi R, Yaghoobi H, Babaei E, Sarbakhsh P, Malakouti J. Identifying potential circulating miRNA biomarkers for the diagnosis and prediction of ovarian cancer using machine-learning approach: application of Boruta. Front Digit Health 2023; 5:1187578. [PMID: 37621964 PMCID: PMC10445490 DOI: 10.3389/fdgth.2023.1187578] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2023] [Accepted: 07/20/2023] [Indexed: 08/26/2023] Open
Abstract
Introduction In gynecologic oncology, ovarian cancer is a great clinical challenge. Because of the lack of typical symptoms and effective biomarkers for noninvasive screening, most patients develop advanced-stage ovarian cancer by the time of diagnosis. MicroRNAs (miRNAs) are a type of non-coding RNA molecule that has been linked to human cancers. Specifying diagnostic biomarkers to determine non-cancer and cancer samples is difficult. Methods By using Boruta, a novel random forest-based feature selection in the machine-learning techniques, we aimed to identify biomarkers associated with ovarian cancer using cancerous and non-cancer samples from the Gene Expression Omnibus (GEO) database: GSE106817. In this study, we used two independent GEO data sets as external validation, including GSE113486 and GSE113740. We utilized five state-of-the-art machine-learning algorithms for classification: logistic regression, random forest, decision trees, artificial neural networks, and XGBoost. Results Four models discovered in GSE113486 had an AUC of 100%, three in GSE113740 with AUC of over 94%, and four in GSE113486 with AUC of over 94%. We identified 10 miRNAs to distinguish ovarian cancer cases from normal controls: hsa-miR-1290, hsa-miR-1233-5p, hsa-miR-1914-5p, hsa-miR-1469, hsa-miR-4675, hsa-miR-1228-5p, hsa-miR-3184-5p, hsa-miR-6784-5p, hsa-miR-6800-5p, and hsa-miR-5100. Our findings suggest that miRNAs could be used as possible biomarkers for ovarian cancer screening, for possible intervention.
Collapse
Affiliation(s)
- Farzaneh Hamidi
- Department of Statistics and Epidemiology, Faculty of Health, Tabriz University of Medical Sciences, Tabriz, Iran
| | - Neda Gilani
- Department of Statistics and Epidemiology, Faculty of Health, Tabriz University of Medical Sciences, Tabriz, Iran
- Road Traffic Injury Research Center, Tabriz University of Medical Sciences, Tabriz, Iran
| | - Reza Arabi Belaghi
- Department of Mathematics, Applied Mathematics and Statistics, Uppsala University, Uppsala, Sweden
- Department of Statistics, Faculty of Mathematical Science, University of Tabriz, Tabriz, Iran
- Department of Energy and Technology, Swedish Agricultural University, Uppsala, Sweden
| | - Hanif Yaghoobi
- Department of Biological Sciences, School of Natural Sciences, University of Tabriz, Tabriz, Iran
| | - Esmaeil Babaei
- Department of Biological Sciences, School of Natural Sciences, University of Tabriz, Tabriz, Iran
- Interfaculty Institute for Bioinformatics and Medical Informatics (IBMI), University of Tübingen, Tübingen, Germany
| | - Parvin Sarbakhsh
- Department of Statistics and Epidemiology, Faculty of Health, Tabriz University of Medical Sciences, Tabriz, Iran
| | - Jamileh Malakouti
- Department of Midwifery, Faculty of Nursing and Midwifery, Tabriz University of Medical Science, Tabriz, Iran
| |
Collapse
|
2
|
Menna G, Piaser Guerrato G, Bilgin L, Ceccarelli GM, Olivi A, Della Pepa GM. Is There a Role for Machine Learning in Liquid Biopsy for Brain Tumors? A Systematic Review. Int J Mol Sci 2023; 24:9723. [PMID: 37298673 PMCID: PMC10253654 DOI: 10.3390/ijms24119723] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2023] [Revised: 05/29/2023] [Accepted: 05/31/2023] [Indexed: 06/12/2023] Open
Abstract
The paucity of studies available in the literature on brain tumors demonstrates that liquid biopsy (LB) is not currently applied for central nervous system (CNS) cancers. The purpose of this systematic review focused on the application of machine learning (ML) to LB for brain tumors to provide practical guidance for neurosurgeons to understand the state-of-the-art practices and open challenges. The herein presented study was conducted in accordance with the PRISMA-P (preferred reporting items for systematic review and meta-analysis protocols) guidelines. An online literature search was launched on PubMed/Medline, Scopus, and Web of Science databases using the following query: "((Liquid biopsy) AND (Glioblastoma OR Brain tumor) AND (Machine learning OR Artificial Intelligence))". The last database search was conducted in April 2023. Upon the full-text review, 14 articles were included in the study. These were then divided into two subgroups: those dealing with applications of machine learning to liquid biopsy in the field of brain tumors, which is the main aim of this review (n = 8); and those dealing with applications of machine learning to liquid biopsy in the diagnosis of other tumors (n = 6). Although studies on the application of ML to LB in the field of brain tumors are still in their infancy, the rapid development of new techniques, as evidenced by the increase in publications on the subject in the past two years, may in the future allow for rapid, accurate, and noninvasive analysis of tumor data. Thus making it possible to identify key features in the LB samples that are associated with the presence of a brain tumor. These features could then be used by doctors for disease monitoring and treatment planning.
Collapse
|
3
|
Brito-Rocha T, Constâncio V, Henrique R, Jerónimo C. Shifting the Cancer Screening Paradigm: The Rising Potential of Blood-Based Multi-Cancer Early Detection Tests. Cells 2023; 12:cells12060935. [PMID: 36980276 PMCID: PMC10047029 DOI: 10.3390/cells12060935] [Citation(s) in RCA: 13] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2023] [Revised: 03/14/2023] [Accepted: 03/15/2023] [Indexed: 03/30/2023] Open
Abstract
Cancer remains a leading cause of death worldwide, partly owing to late detection which entails limited and often ineffective therapeutic options. Most cancers lack validated screening procedures, and the ones available disclose several drawbacks, leading to low patient compliance and unnecessary workups, adding up the costs to healthcare systems. Hence, there is a great need for innovative, accurate, and minimally invasive tools for early cancer detection. In recent years, multi-cancer early detection (MCED) tests emerged as a promising screening tool, combining molecular analysis of tumor-related markers present in body fluids with artificial intelligence to simultaneously detect a variety of cancers and further discriminate the underlying cancer type. Herein, we aim to provide a highlight of the variety of strategies currently under development concerning MCED, as well as the major factors which are preventing clinical implementation. Although MCED tests depict great potential for clinical application, large-scale clinical validation studies are still lacking.
Collapse
Affiliation(s)
- Tiago Brito-Rocha
- Cancer Biology and Epigenetics Group, Research Center (CI-IPOP)/RISE@CI-IPOP (Health Research Network), Portuguese Oncology Institute of Porto (IPO-Porto)/Porto Comprehensive Cancer Center Raquel Seruca (P.CCC), Rua Dr. António Bernardino de Almeida, 4200-072 Porto, Portugal
- Master Program in Oncology, School of Medicine & Biomedical Sciences, University of Porto (ICBAS-UP), Rua Jorge Viterbo Ferreira 228, 4050-513 Porto, Portugal
| | - Vera Constâncio
- Cancer Biology and Epigenetics Group, Research Center (CI-IPOP)/RISE@CI-IPOP (Health Research Network), Portuguese Oncology Institute of Porto (IPO-Porto)/Porto Comprehensive Cancer Center Raquel Seruca (P.CCC), Rua Dr. António Bernardino de Almeida, 4200-072 Porto, Portugal
- Doctoral Program in Biomedical Sciences, School of Medicine & Biomedical Sciences, University of Porto (ICBAS-UP), Rua Jorge Viterbo Ferreira 228, 4050-513 Porto, Portugal
| | - Rui Henrique
- Cancer Biology and Epigenetics Group, Research Center (CI-IPOP)/RISE@CI-IPOP (Health Research Network), Portuguese Oncology Institute of Porto (IPO-Porto)/Porto Comprehensive Cancer Center Raquel Seruca (P.CCC), Rua Dr. António Bernardino de Almeida, 4200-072 Porto, Portugal
- Department of Pathology, Portuguese Oncology Institute of Porto (IPO-Porto), Rua Dr. António Bernardino de Almeida, 4200-072 Porto, Portugal
- Department of Pathology and Molecular Immunology, School of Medicine & Biomedical Sciences, University of Porto (ICBAS-UP), Rua Jorge Viterbo Ferreira 228, 4050-513 Porto, Portugal
| | - Carmen Jerónimo
- Cancer Biology and Epigenetics Group, Research Center (CI-IPOP)/RISE@CI-IPOP (Health Research Network), Portuguese Oncology Institute of Porto (IPO-Porto)/Porto Comprehensive Cancer Center Raquel Seruca (P.CCC), Rua Dr. António Bernardino de Almeida, 4200-072 Porto, Portugal
- Department of Pathology and Molecular Immunology, School of Medicine & Biomedical Sciences, University of Porto (ICBAS-UP), Rua Jorge Viterbo Ferreira 228, 4050-513 Porto, Portugal
| |
Collapse
|
4
|
Huang X, Su B, Zhu C, He X, Lin X. Dynamic Network Construction for Identifying Early Warning Signals Based On a Data-Driven Approach: Early Diagnosis Biomarker Discovery for Gastric Cancer. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2023; 20:923-931. [PMID: 35594220 DOI: 10.1109/tcbb.2022.3176319] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
During the development of complex diseases, there is a critical transition from one status to another at a tipping point, which can be an early indicator of disease deterioration. To effectively enhance the performance of early risk identification, a novel dynamic network construction algorithm for identifying early warning signals based on a data-driven approach (EWS-DDA) was proposed. In EWS-DDA, the shrunken centroid was introduced to measure dynamic expression changes in assumed pathway reactions during the progression of complex disease for network construction and to define early warning signals by means of a data-driven approach. We applied EWS-DDA to perform a comprehensive analysis of gene expression profiles of gastric cancer (GC) from The Cancer Genome Atlas database and the Gene Expression Omnibus database. Six crucial genes were selected as potential biomarkers for the early diagnosis of GC. The experimental results of statistical analysis and biological analysis suggested that the six genes play important roles in GC occurrence and development. Then, EWS-DDA was compared with other state-of-the-art network methods to validate its performance. The theoretical analysis and comparison results suggested that EWS-DDA has great potential for a more complete presentation of disease deterioration and effective extraction of early warning information.
Collapse
|
5
|
Machine learning-based exceptional response prediction of nivolumab monotherapy with circulating microRNAs in non-small cell lung cancer. Lung Cancer 2022; 173:107-115. [DOI: 10.1016/j.lungcan.2022.09.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2022] [Revised: 08/29/2022] [Accepted: 09/06/2022] [Indexed: 11/24/2022]
|
6
|
Beheshti Z. BMPA-TVSinV: A Binary Marine Predators Algorithm using time-varying sine and V-shaped transfer functions for wrapper-based feature selection. Knowl Based Syst 2022. [DOI: 10.1016/j.knosys.2022.109446] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]
|
7
|
Zhou X, Ding S, Wang D, Chen L, Feng K, Huang T, Li Z, Cai Y. Identification of Cell Markers and Their Expression Patterns in Skin Based on Single-Cell RNA-Sequencing Profiles. Life (Basel) 2022; 12:life12040550. [PMID: 35455041 PMCID: PMC9025372 DOI: 10.3390/life12040550] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2022] [Revised: 03/27/2022] [Accepted: 04/04/2022] [Indexed: 12/19/2022] Open
Abstract
Atopic dermatitis and psoriasis are members of a family of inflammatory skin disorders. Cellular immune responses in skin tissues contribute to the development of these diseases. However, their underlying immune mechanisms remain to be fully elucidated. We developed a computational pipeline for analyzing the single-cell RNA-sequencing profiles of the Human Cell Atlas skin dataset to investigate the pathological mechanisms of skin diseases. First, we applied the maximum relevance criterion and the Boruta feature selection method to exclude irrelevant gene features from the single-cell gene expression profiles of inflammatory skin disease samples and healthy controls. The retained gene features were ranked by using the Monte Carlo feature selection method on the basis of their importance, and a feature list was compiled. This list was then introduced into the incremental feature selection method that combined the decision tree and random forest algorithms to extract important cell markers and thus build excellent classifiers and decision rules. These cell markers and their expression patterns have been analyzed and validated in recent studies and are potential therapeutic and diagnostic targets for skin diseases because their expression affects the pathogenesis of inflammatory skin diseases.
Collapse
Affiliation(s)
- Xianchao Zhou
- School of Life Sciences, Shanghai University, Shanghai 200444, China; (X.Z.); (S.D.)
- Center for Single-Cell Omics, School of Public Health, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China
| | - Shijian Ding
- School of Life Sciences, Shanghai University, Shanghai 200444, China; (X.Z.); (S.D.)
| | - Deling Wang
- State Key Laboratory of Oncology in South China, Collaborative Innovation Center for Cancer Medicine, Department of Medical Imaging, Sun Yat-sen University Cancer Center, Guangzhou 510060, China;
| | - Lei Chen
- College of Information Engineering, Shanghai Maritime University, Shanghai 201306, China;
| | - Kaiyan Feng
- Department of Computer Science, Guangdong AIB Polytechnic College, Guangzhou 510507, China;
| | - Tao Huang
- Bio-Med Big Data Center, CAS Key Laboratory of Computational Biology, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
- CAS Key Laboratory of Tissue Microenvironment and Tumor, Shanghai Institute of Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, China
- Correspondence: (T.H.); (Z.L.); (Y.C.); Tel.: +86-21-54923269 (T.H.); +86-21-66136132 (Y.C.)
| | - Zhandong Li
- College of Food Engineering, Jilin Engineering Normal University, Changchun 130052, China
- Correspondence: (T.H.); (Z.L.); (Y.C.); Tel.: +86-21-54923269 (T.H.); +86-21-66136132 (Y.C.)
| | - Yudong Cai
- School of Life Sciences, Shanghai University, Shanghai 200444, China; (X.Z.); (S.D.)
- Correspondence: (T.H.); (Z.L.); (Y.C.); Tel.: +86-21-54923269 (T.H.); +86-21-66136132 (Y.C.)
| |
Collapse
|
8
|
Yang Y, Chen L. Identification of Drug-Disease Associations by Using Multiple Drug and
Disease Networks. Curr Bioinform 2022. [DOI: 10.2174/1574893616666210825115406] [Citation(s) in RCA: 34] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Background:
Drug repositioning is a new research area in drug development. It aims to discover
novel therapeutic uses of existing drugs. It could accelerate the process of designing novel drugs
for some diseases and considerably decrease the cost. The traditional method to determine novel therapeutic
uses of an existing drug is quite laborious. It is alternative to design computational methods to
overcome such defect.
Objective:
This study aims to propose a novel model for the identification of drug–disease associations.
Method:
Twelve drug networks and three disease networks were built, which were fed into a powerful
network-embedding algorithm called Mashup to produce informative drug and disease features. These
features were combined to represent each drug–disease association. Classic classification algorithm,
random forest, was used to build the model.
Results:
Tenfold cross-validation results indicated that the MCC, AUROC, and AUPR were 0.7156,
0.9280, and 0.9191, respectively.
Conclusion:
The proposed model showed good performance. Some tests indicated that a small dimension
of drug features and a large dimension of disease features were beneficial for constructing the
model. Moreover, the model was quite robust even if some drug or disease properties were not available.
Collapse
Affiliation(s)
- Ying Yang
- College of Information Engineering, Shanghai Maritime University, Shanghai 201306, China
| | - Lei Chen
- College of Information Engineering, Shanghai Maritime University, Shanghai 201306, China
| |
Collapse
|
9
|
Convolutional neural network for human cancer types prediction by integrating protein interaction networks and omics data. Sci Rep 2021; 11:20691. [PMID: 34667236 PMCID: PMC8526703 DOI: 10.1038/s41598-021-98814-y] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2020] [Accepted: 09/14/2021] [Indexed: 02/07/2023] Open
Abstract
Many studies have proven the power of gene expression profile in cancer identification, however, the explosive growth of genomics data increasing needs of tools for cancer diagnosis and prognosis in high accuracy and short times. Here, we collected 6136 human samples from 11 cancer types, and integrated their gene expression profiles and protein-protein interaction (PPI) network to generate 2D images with spectral clustering method. To predict normal samples and 11 cancer tumor types, the images of these 6136 human cancer network were separated into training and validation dataset to develop convolutional neural network (CNN). Our model showed 97.4% and 95.4% accuracies in identification of normal versus tumors and 11 cancer types, respectively. We also provided the results that tumors located in neighboring tissues or in the same cell types, would induce machine make error classification due to the similar gene expression profiles. Furthermore, we observed some patients may exhibit better prognosis if their tumors often misjudged into normal samples. As far as we know, we are the first to generate thousands of cancer networks to predict and classify multiple cancer types with CNN architecture. We believe that our model not only can be applied to cancer diagnosis and prognosis, but also promote the discovery of multiple cancer biomarkers.
Collapse
|
10
|
Identifying Infliximab- (IFX-) Responsive Blood Signatures for the Treatment of Rheumatoid Arthritis. BIOMED RESEARCH INTERNATIONAL 2021. [DOI: 10.1155/2021/5556784] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Rheumatoid arthritis (RA) is a severe chronic pathogenic inflammatory abnormality that damages small joints. Comprehensive diagnosis and treatment procedures for RA have been established because of its severe symptoms and relatively high morbidity. Medication and surgery are the two major therapeutic approaches. Infliximab (IFX) is a novel biological agent applied for the treatment of RA. IFX improves physical functions and benefits the achievement of clinical remission even under discontinuous medication. However, not all patients react to IFX, and distinguishing IFX-sensitive and IFX-resistant patients is quite difficult. Thus, how to predict the therapeutic effects of IFX on patients with RA is one of the urgent translational medicine problems in the clinical treatment of RA. In this study, we present a novel computational method for the identification of the applicable and substantial blood gene signatures of IFX sensitivity by liquid biopsy, which may assist in the establishment of a clinical drug sensitivity test standard for RA and contribute to the revelation of unique IFX-associated pharmacological mechanisms.
Collapse
|