1
|
Mathur A, Arya N, Pasupa K, Saha S, Roy Dey S, Saha S. Breast cancer prognosis through the use of multi-modal classifiers: current state of the art and the way forward. Brief Funct Genomics 2024:elae015. [PMID: 38688724 DOI: 10.1093/bfgp/elae015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2023] [Revised: 03/01/2024] [Accepted: 04/09/2024] [Indexed: 05/02/2024] Open
Abstract
We present a survey of the current state-of-the-art in breast cancer detection and prognosis. We analyze the evolution of Artificial Intelligence-based approaches from using just uni-modal information to multi-modality for detection and how such paradigm shift facilitates the efficacy of detection, consistent with clinical observations. We conclude that interpretable AI-based predictions and ability to handle class imbalance should be considered priority.
Collapse
Affiliation(s)
- Archana Mathur
- Department of Information Science and Engineering, Nitte Meenakshi Institute of Technology, Yelahanka, 560064, Karnataka, India
| | - Nikhilanand Arya
- School of Computer Engineering, Kalinga Institute of Industrial Technology, Deemed to be University, Bhubaneshwar, 751024, Odisha, India
| | - Kitsuchart Pasupa
- School of Information Technology, King Mongkut's Institute of Technology Ladkrabang, 1 Soi Chalongkrung 1, 10520, Bangkok, Thailand
| | - Sriparna Saha
- Computer Science and Engineering, Indian Institute of Technology Patna, Bihta, 801106, Bihar, India
| | - Sudeepa Roy Dey
- Department of Computer Science and Engineering, PES University, Hosur Road, 560100, Karnataka, India
| | - Snehanshu Saha
- CSIS and APPCAIR, BITS Pilani K.K Birla Goa Campus, Goa, 403726, Goa, India
- Div of AI Research, HappyMonk AI, Bangalore, 560078, Karnataka, India
| |
Collapse
|
2
|
Abbasi EY, Deng Z, Ali Q, Khan A, Shaikh A, Reshan MSA, Sulaiman A, Alshahrani H. A machine learning and deep learning-based integrated multi-omics technique for leukemia prediction. Heliyon 2024; 10:e25369. [PMID: 38352790 PMCID: PMC10862685 DOI: 10.1016/j.heliyon.2024.e25369] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2023] [Revised: 12/13/2023] [Accepted: 01/25/2024] [Indexed: 02/16/2024] Open
Abstract
In recent years, scientific data on cancer has expanded, providing potential for a better understanding of malignancies and improved tailored care. Advances in Artificial Intelligence (AI) processing power and algorithmic development position Machine Learning (ML) and Deep Learning (DL) as crucial players in predicting Leukemia, a blood cancer, using integrated multi-omics technology. However, realizing these goals demands novel approaches to harness this data deluge. This study introduces a novel Leukemia diagnosis approach, analyzing multi-omics data for accuracy using ML and DL algorithms. ML techniques, including Random Forest (RF), Naive Bayes (NB), Decision Tree (DT), Logistic Regression (LR), Gradient Boosting (GB), and DL methods such as Recurrent Neural Networks (RNN) and Feedforward Neural Networks (FNN) are compared. GB achieved 97 % accuracy in ML, while RNN outperformed by achieving 98 % accuracy in DL. This approach filters unclassified data effectively, demonstrating the significance of DL for leukemia prediction. The testing validation was based on 17 different features such as patient age, sex, mutation type, treatment methods, chromosomes, and others. Our study compares ML and DL techniques and chooses the best technique that gives optimum results. The study emphasizes the implications of high-throughput technology in healthcare, offering improved patient care.
Collapse
Affiliation(s)
- Erum Yousef Abbasi
- State Key Laboratory of Wireless Network Positioning and Communication Engineering Integration Research, School of Electronics Engineering, Beijing University of Posts and Telecommunications, Beijing, China
| | - Zhongliang Deng
- State Key Laboratory of Wireless Network Positioning and Communication Engineering Integration Research, School of Electronics Engineering, Beijing University of Posts and Telecommunications, Beijing, China
| | - Qasim Ali
- Department of Software Engineering, Mehran University of Engineering and Technology, Jamshoro, Pakistan
| | - Adil Khan
- State Key Laboratory of Wireless Network Positioning and Communication Engineering Integration Research, School of Electronics Engineering, Beijing University of Posts and Telecommunications, Beijing, China
| | - Asadullah Shaikh
- Department of Information Systems, College of Computer Science and Information Systems, Najran University, Najran, 61441, Saudi Arabia
| | - Mana Saleh Al Reshan
- Department of Information Systems, College of Computer Science and Information Systems, Najran University, Najran, 61441, Saudi Arabia
- Scientific and Engineering Research Centre, Najran University, Najran, 61441, Saudi Arabia
| | - Adel Sulaiman
- Department of Computer Science, College of Computer Science and Information Systems, Najran University, Najran, 61441, Saudi Arabia
| | - Hani Alshahrani
- Department of Computer Science, College of Computer Science and Information Systems, Najran University, Najran, 61441, Saudi Arabia
| |
Collapse
|
3
|
Ferjani HL, Dhia SB, Nessib DB, Dghaies A, Kaffel D, Maatallah K, Hamdi W. The childhood arthritis radiographic score of the hip: the proposal cut-off value using cluster analysis. Clin Rheumatol 2024; 43:465-472. [PMID: 37635192 DOI: 10.1007/s10067-023-06749-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2023] [Revised: 08/08/2023] [Accepted: 08/18/2023] [Indexed: 08/29/2023]
Abstract
BACKGROUND Juvenile idiopathic arthritis (JIA) is a chronic rheumatic disease that affects children. It is crucial to detect and treat hip involvement in JIA early to prevent functional impairment and reduced quality of life. The Childhood Arthritis Radiographic Score of the Hip (CARSH) is a validated radiographic scoring system used to assess hip involvement in JIA. In this study, we aimed to determine cut-off values for CARSH scores using cluster analysis. METHODS The study was conducted as a cross-sectional analysis and included JIA patients with hip involvement who underwent a pelvic radiograph. The same pelvic radiograph was interpreted by two experienced pediatric rheumatologists at baseline and after 3 weeks by both readers for reliability. The CARSH scores were calculated for each hip four times (twice by each reader). For the 50 hips, a total of 200 interpretations of the CARSH score were obtained. Model-based clustering was employed to identify distinct groups of CARSH score interpretations and characterize the phenotype of each cluster. RESULTS Twenty-five children with hip involvement were included. The mean age was 13.9 ± 4.6 years. JIA subtypes were as follows: ERA in 64%, oligoarthritis in 16%, psoriatic arthritis in 12%, polyarthritis RF + in 4%, and RF - in 4% of patients. For the 200 hip interpretations, three clusters based on the level of the CARSH were identified by model-based clustering. Cluster 1 consisted of 17 CARSH score interpretations with a median score of 7 ± 3 (ranging from 1 to 15). This group primarily comprised patients with enthesitis-related arthritis (ERA) and psoriatic arthritis. Patients in cluster 1 were generally older, experienced longer diagnostic delays, and had a longer disease duration compared to the other clusters. Cluster 2 exhibited a moderate CARSH score, with an average score of 4 ± 3 (1 to 15). Patients in this group had significantly higher body weight compared to the other clusters. Cluster 3 represented the group with the least severe hip involvement, characterized by CARSH scores of 2 ± 1 (ranging from 0 to 9). This cluster had a higher proportion of male patients and higher C-reactive protein (CRP) levels than the other clusters. Regarding the individual items of the CARSH score, cluster 1 showed higher percentages of hip radiograph abnormalities such as joint space narrowing, erosions, growth abnormalities, and subchondral cysts. Cluster 2 was characterized by a high rate of acetabular sclerosis, with little to no abnormalities in other CARSH score items. Cluster 3 was the only group that exhibited hip subluxation, with minimal abnormalities in the other score items. In conclusion, this study identified three distinct groups of CARSH scores, representing varying levels of severity in hip involvement in JIA. These findings provide valuable insights for clinicians in assessing and managing JIA patients with hip involvement, enabling tailored treatment strategies based on the severity of the condition. Key Points • While a Childhood Arthritis Radiographic Score of the Hip (CARSH) is a valid and reliable tool in hip-related juvenile idiopathic arthritis, its use is limited in daily practice due to the lack of available cut-off values. • The cluster analysis defined three clusters based on the CARSH levels. • Cluster 1 exhibited the highest score with more damage and disability. Cluster 2 involved a moderate score and more overweight patients. Cluster 3 included the least level of the score but with an active disease parameter.
Collapse
Affiliation(s)
- Hanene Lassoued Ferjani
- Pediatric and Adult Rheumatology Department, Kassab Institute of Orthopedics, Ksar Saïd, Tunis, Tunisia.
- Faculty of Medicine of Tunis, University of Tunis El Manar, Tunis, Tunisia.
- Research Unit UR17SP04, Ksar Saïd, 20102010, Tunis, Tunisia.
| | - Siwar Ben Dhia
- Pediatric and Adult Rheumatology Department, Kassab Institute of Orthopedics, Ksar Saïd, Tunis, Tunisia
| | - Dorra Ben Nessib
- Pediatric and Adult Rheumatology Department, Kassab Institute of Orthopedics, Ksar Saïd, Tunis, Tunisia
- Faculty of Medicine of Tunis, University of Tunis El Manar, Tunis, Tunisia
- Research Unit UR17SP04, Ksar Saïd, 20102010, Tunis, Tunisia
| | - Abir Dghaies
- Pediatric and Adult Rheumatology Department, Kassab Institute of Orthopedics, Ksar Saïd, Tunis, Tunisia
| | - Dhia Kaffel
- Pediatric and Adult Rheumatology Department, Kassab Institute of Orthopedics, Ksar Saïd, Tunis, Tunisia
- Faculty of Medicine of Tunis, University of Tunis El Manar, Tunis, Tunisia
- Research Unit UR17SP04, Ksar Saïd, 20102010, Tunis, Tunisia
| | - Kaouther Maatallah
- Pediatric and Adult Rheumatology Department, Kassab Institute of Orthopedics, Ksar Saïd, Tunis, Tunisia
- Faculty of Medicine of Tunis, University of Tunis El Manar, Tunis, Tunisia
- Research Unit UR17SP04, Ksar Saïd, 20102010, Tunis, Tunisia
| | - Wafa Hamdi
- Pediatric and Adult Rheumatology Department, Kassab Institute of Orthopedics, Ksar Saïd, Tunis, Tunisia
- Faculty of Medicine of Tunis, University of Tunis El Manar, Tunis, Tunisia
- Research Unit UR17SP04, Ksar Saïd, 20102010, Tunis, Tunisia
| |
Collapse
|
4
|
Dai M, Feng X, Yu H, Guo W, Li X. A Monte Carlo manifold spectral clustering algorithm based on emotional preference and migratory behavior. APPL INTELL 2023. [DOI: 10.1007/s10489-023-04484-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/29/2023]
|
5
|
Benkirane H, Pradat Y, Michiels S, Cournède PH. CustOmics: A versatile deep-learning based strategy for multi-omics integration. PLoS Comput Biol 2023; 19:e1010921. [PMID: 36877736 PMCID: PMC10019780 DOI: 10.1371/journal.pcbi.1010921] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2022] [Revised: 03/16/2023] [Accepted: 02/04/2023] [Indexed: 03/07/2023] Open
Abstract
The availability of patient cohorts with several types of omics data opens new perspectives for exploring the disease's underlying biological processes and developing predictive models. It also comes with new challenges in computational biology in terms of integrating high-dimensional and heterogeneous data in a fashion that captures the interrelationships between multiple genes and their functions. Deep learning methods offer promising perspectives for integrating multi-omics data. In this paper, we review the existing integration strategies based on autoencoders and propose a new customizable one whose principle relies on a two-phase approach. In the first phase, we adapt the training to each data source independently before learning cross-modality interactions in the second phase. By taking into account each source's singularity, we show that this approach succeeds at taking advantage of all the sources more efficiently than other strategies. Moreover, by adapting our architecture to the computation of Shapley additive explanations, our model can provide interpretable results in a multi-source setting. Using multiple omics sources from different TCGA cohorts, we demonstrate the performance of the proposed method for cancer on test cases for several tasks, such as the classification of tumor types and breast cancer subtypes, as well as survival outcome prediction. We show through our experiments the great performances of our architecture on seven different datasets with various sizes and provide some interpretations of the results obtained. Our code is available on (https://github.com/HakimBenkirane/CustOmics).
Collapse
Affiliation(s)
- Hakim Benkirane
- Université Paris-Saclay, CentraleSupélec, Lab of Mathematics and Informatics (MICS), Gif-sur-Yvette, France
- Oncostat U1018, Inserm, Université Paris-Saclay, Équipe Labellisée Ligue Contre le Cancer, CESP, Villejuif, France
| | - Yoann Pradat
- Université Paris-Saclay, CentraleSupélec, Lab of Mathematics and Informatics (MICS), Gif-sur-Yvette, France
| | - Stefan Michiels
- Oncostat U1018, Inserm, Université Paris-Saclay, Équipe Labellisée Ligue Contre le Cancer, CESP, Villejuif, France
- Bureau de Biostatistique et d’Épidémiologie, Gustave Roussy, Université Paris-Saclay, Villejuif, France
| | - Paul-Henry Cournède
- Université Paris-Saclay, CentraleSupélec, Lab of Mathematics and Informatics (MICS), Gif-sur-Yvette, France
- * E-mail:
| |
Collapse
|
6
|
Mohammed MA, Abdulkareem KH, Dinar AM, Zapirain BG. Rise of Deep Learning Clinical Applications and Challenges in Omics Data: A Systematic Review. Diagnostics (Basel) 2023; 13:diagnostics13040664. [PMID: 36832152 PMCID: PMC9955380 DOI: 10.3390/diagnostics13040664] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2022] [Revised: 02/05/2023] [Accepted: 02/07/2023] [Indexed: 02/12/2023] Open
Abstract
This research aims to review and evaluate the most relevant scientific studies about deep learning (DL) models in the omics field. It also aims to realize the potential of DL techniques in omics data analysis fully by demonstrating this potential and identifying the key challenges that must be addressed. Numerous elements are essential for comprehending numerous studies by surveying the existing literature. For example, the clinical applications and datasets from the literature are essential elements. The published literature highlights the difficulties encountered by other researchers. In addition to looking for other studies, such as guidelines, comparative studies, and review papers, a systematic approach is used to search all relevant publications on omics and DL using different keyword variants. From 2018 to 2022, the search procedure was conducted on four Internet search engines: IEEE Xplore, Web of Science, ScienceDirect, and PubMed. These indexes were chosen because they offer enough coverage and linkages to numerous papers in the biological field. A total of 65 articles were added to the final list. The inclusion and exclusion criteria were specified. Of the 65 publications, 42 are clinical applications of DL in omics data. Furthermore, 16 out of 65 articles comprised the review publications based on single- and multi-omics data from the proposed taxonomy. Finally, only a small number of articles (7/65) were included in papers focusing on comparative analysis and guidelines. The use of DL in studying omics data presented several obstacles related to DL itself, preprocessing procedures, datasets, model validation, and testbed applications. Numerous relevant investigations were performed to address these issues. Unlike other review papers, our study distinctly reflects different observations on omics with DL model areas. We believe that the result of this study can be a useful guideline for practitioners who look for a comprehensive view of the role of DL in omics data analysis.
Collapse
Affiliation(s)
- Mazin Abed Mohammed
- College of Computer Science and Information Technology, University of Anbar, Anbar 31001, Iraq
- eVIDA Lab, University of Deusto, 48007 Bilbao, Spain
- Correspondence: (M.A.M.); (B.G.Z.)
| | - Karrar Hameed Abdulkareem
- College of Agriculture, Al-Muthanna University, Samawah 66001, Iraq
- College of Engineering, University of Warith Al-Anbiyaa, Karbala 56001, Iraq
| | - Ahmed M. Dinar
- Computer Engineering Department, University of Technology- Iraq, Baghdad 19006, Iraq
| | | |
Collapse
|