1
|
Priyadharshini S, Ramkumar K, Vairavasundaram S, Narasimhan K, Venkatesh S, Madhavasarma P, Kotecha K. Bio-inspired feature selection for early diagnosis of Parkinson's disease through optimization of deep 3D nested learning. Sci Rep 2024; 14:23394. [PMID: 39379451 PMCID: PMC11461848 DOI: 10.1038/s41598-024-74405-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2024] [Accepted: 09/25/2024] [Indexed: 10/10/2024] Open
Abstract
Parkinson's disease (PD) is one of the most common neurodegenerative disorders that affect the quality of human life of millions of people throughout the world. The probability of getting affected by this disease increases with age, and it is common among the elderly population. Early detection can help in initiating medications at an earlier stage. It can significantly slow down the progression of this disease, assisting the patient to maintain a good quality of life for a more extended period. Magnetic resonance imaging (MRI)-based brain imaging is an area of active research that is used to diagnose PD disease early and to understand the key biomarkers. The prior research investigations using MRI data mainly focus on volume, structural, and morphological changes in the basal ganglia (BG) region for diagnosing PD. Recently, researchers have emphasized the significance of studying other areas of the human brain for a more comprehensive understanding of PD and also to analyze changes happening in brain tissue. Thus, to perform accurate diagnosis and treatment planning for early identification of PD, this work focuses on learning the onset of PD from images taken from whole-brain MRI using a novel 3D-convolutional neural network (3D-CNN) deep learning architecture. The conventional 3D-Resent deep learning model, after various hyper-parameter tuning and architectural changes, has achieved an accuracy of 90%. In this work, a novel 3D-CNN architecture was developed, and after several ablation studies, the model yielded results with an improved accuracy of 93.4%. Combining features from the 3D-CNN and 3D ResNet models using Canonical Correlation Analysis (CCA) resulted in 95% accuracy. For further enhancements of the model performance, feature fusion with optimization was employed, utilizing various optimization techniques. Whale optimization based on a biologically inspired approach was selected on the basis of a convergence diagram. The performance of this approach is compared to other methods and has given an accuracy of 97%. This work represents a critical advancement in improving PD diagnosis techniques and emphasizing the importance of deep nested 3D learning and bio-inspired feature selection.
Collapse
Affiliation(s)
- S Priyadharshini
- School of Electrical and Electronics Engineering, SASTRA Deemed University, Thanjavur, India
| | - K Ramkumar
- School of Computing, SASTRA Deemed University, Thanjavur, India
| | | | - K Narasimhan
- School of Electrical and Electronics Engineering, SASTRA Deemed University, Thanjavur, India
| | - S Venkatesh
- School of Electrical and Electronics Engineering, SASTRA Deemed University, Thanjavur, India
| | - P Madhavasarma
- School of Electrical and Electronics Engineering, SASTRA Deemed University, Thanjavur, India
| | - Ketan Kotecha
- Symbiosis Centre for Applied Artificial, Symbiosis Institute of Technology, Symbiosis International (Deemed University), Pune, India.
| |
Collapse
|
2
|
Miah H, Kollias D, Pedone GL, Provan D, Chen F. Can Machine Learning Assist in Diagnosis of Primary Immune Thrombocytopenia? A Feasibility Study. Diagnostics (Basel) 2024; 14:1352. [PMID: 39001244 PMCID: PMC11240714 DOI: 10.3390/diagnostics14131352] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2024] [Revised: 06/11/2024] [Accepted: 06/13/2024] [Indexed: 07/16/2024] Open
Abstract
Primary Immune Thrombocytopenia (ITP) is a rare autoimmune disease characterised by the immune-mediated destruction of peripheral blood platelets in patients leading to low platelet counts and bleeding. The diagnosis and effective management of ITP are challenging because there is no established test to confirm the disease and no biomarker with which one can predict the response to treatment and outcome. In this work, we conduct a feasibility study to check if machine learning can be applied effectively for the diagnosis of ITP using routine blood tests and demographic data in a non-acute outpatient setting. Various ML models, including Logistic Regression, Support Vector Machine, k-Nearest Neighbor, Decision Tree and Random Forest, were applied to data from the UK Adult ITP Registry and a general haematology clinic. Two different approaches were investigated: a demographic-unaware and a demographic-aware one. We conduct extensive experiments to evaluate the predictive performance of these models and approaches, as well as their bias. The results revealed that Decision Tree and Random Forest models were both superior and fair, achieving nearly perfect predictive and fairness scores, with platelet count identified as the most significant variable. Models not provided with demographic information performed better in terms of predictive accuracy but showed lower fairness scores, illustrating a trade-off between predictive performance and fairness.
Collapse
Affiliation(s)
- Haroon Miah
- Centre of Immunobiology, Blizard Institute, Queen Mary University of London, London E1 2AT, UK; (H.M.); (D.P.)
- Haematology Department, Barts Health NHS Trust, London E1 1BB, UK;
| | - Dimitrios Kollias
- School of Electronic Engineering & Computer Science, Queen Mary University of London, London E1 4NS, UK
| | | | - Drew Provan
- Centre of Immunobiology, Blizard Institute, Queen Mary University of London, London E1 2AT, UK; (H.M.); (D.P.)
- Haematology Department, Barts Health NHS Trust, London E1 1BB, UK;
| | - Frederick Chen
- Centre of Immunobiology, Blizard Institute, Queen Mary University of London, London E1 2AT, UK; (H.M.); (D.P.)
- Haematology Department, Barts Health NHS Trust, London E1 1BB, UK;
| |
Collapse
|
3
|
Duwe G, Mercier D, Wiesmann C, Kauth V, Moench K, Junker M, Neumann CCM, Haferkamp A, Dengel A, Höfner T. Challenges and perspectives in use of artificial intelligence to support treatment recommendations in clinical oncology. Cancer Med 2024; 13:e7398. [PMID: 38923826 PMCID: PMC11196383 DOI: 10.1002/cam4.7398] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2024] [Revised: 05/31/2024] [Accepted: 06/06/2024] [Indexed: 06/28/2024] Open
Abstract
Artificial intelligence (AI) promises to be the next revolutionary step in modern society. Yet, its role in all fields of industry and science need to be determined. One very promising field is represented by AI-based decision-making tools in clinical oncology leading to more comprehensive, personalized therapy approaches. In this review, the authors provide an overview on all relevant technical applications of AI in oncology, which are required to understand the future challenges and realistic perspectives for decision-making tools. In recent years, various applications of AI in medicine have been developed focusing on the analysis of radiological and pathological images. AI applications encompass large amounts of complex data supporting clinical decision-making and reducing errors by objectively quantifying all aspects of the data collected. In clinical oncology, almost all patients receive a treatment recommendation in a multidisciplinary cancer conference at the beginning and during their treatment periods. These highly complex decisions are based on a large amount of information (of the patients and of the various treatment options), which need to be analyzed and correctly classified in a short time. In this review, the authors describe the technical and medical requirements of AI to address these scientific challenges in a multidisciplinary manner. Major challenges in the use of AI in oncology and decision-making tools are data security, data representation, and explainability of AI-based outcome predictions, in particular for decision-making processes in multidisciplinary cancer conferences. Finally, limitations and potential solutions are described and compared for current and future research attempts.
Collapse
Affiliation(s)
- Gregor Duwe
- Department of Urology and Pediatric UrologyUniversity Medical Center, Johannes Gutenberg UniversityMainzGermany
| | - Dominique Mercier
- Research Unit Smart Data and Knowledge ServicesGerman Research Center for Artificial IntelligenceKaiserslauternGermany
| | - Crispin Wiesmann
- Department of Urology and Pediatric UrologyUniversity Medical Center, Johannes Gutenberg UniversityMainzGermany
| | - Verena Kauth
- Department of Urology and Pediatric UrologyUniversity Medical Center, Johannes Gutenberg UniversityMainzGermany
| | - Kerstin Moench
- Department of Urology and Pediatric UrologyUniversity Medical Center, Johannes Gutenberg UniversityMainzGermany
| | - Markus Junker
- Research Unit Smart Data and Knowledge ServicesGerman Research Center for Artificial IntelligenceKaiserslauternGermany
| | - Christopher C. M. Neumann
- Department of Hematology, Oncology and Tumor ImmunologyCharité‐Universitätsmedizin Berlin, Freie Universität Berlin, Humboldt‐Universität zu BerlinBerlinGermany
| | - Axel Haferkamp
- Department of Urology and Pediatric UrologyUniversity Medical Center, Johannes Gutenberg UniversityMainzGermany
| | - Andreas Dengel
- Research Unit Smart Data and Knowledge ServicesGerman Research Center for Artificial IntelligenceKaiserslauternGermany
| | - Thomas Höfner
- Department of Urology and Pediatric UrologyUniversity Medical Center, Johannes Gutenberg UniversityMainzGermany
- Department of Urology, Ordensklinikum Linz ElisabethinenLinzAustria
| |
Collapse
|
4
|
Morani K, Ayana EK, Kollias D, Unay D. COVID-19 Detection from Computed Tomography Images Using Slice Processing Techniques and a Modified Xception Classifier. Int J Biomed Imaging 2024; 2024:9962839. [PMID: 38883272 PMCID: PMC11178392 DOI: 10.1155/2024/9962839] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2023] [Revised: 04/09/2024] [Accepted: 05/06/2024] [Indexed: 06/18/2024] Open
Abstract
This paper extends our previous method for COVID-19 diagnosis, proposing an enhanced solution for detecting COVID-19 from computed tomography (CT) images using a lean transfer learning-based model. To decrease model misclassifications, two key steps of image processing were employed. Firstly, the uppermost and lowermost slices were removed, preserving sixty percent of each patient's slices. Secondly, all slices underwent manual cropping to emphasize the lung areas. Subsequently, resized CT scans (224 × 224) were input into an Xception transfer learning model with a modified output. Both Xception's architecture and pretrained weights were leveraged in the method. A big and rigorously annotated database of CT images was used to verify the method. The number of patients/subjects in the dataset is more than 5000, and the number and shape of the slices in each CT scan varies greatly. Verification was made both on the validation partition and on the test partition of unseen images. Results on the COV19-CT database showcased not only improvement from our previous solution and the baseline but also comparable performance to the highest-achieving methods on the same dataset. Further validation studies could explore the scalability and adaptability of the developed methodologies across diverse healthcare settings and patient populations. Additionally, investigating the integration of advanced image processing techniques, such as automated region of interest detection and segmentation algorithms, could enhance the efficiency and accuracy of COVID-19 diagnosis.
Collapse
Affiliation(s)
- Kenan Morani
- Izmir Democracy University Uckuyular, Gursel Aksel Blv No: 14 35140, Karabaglar, Izmir, Türkiye
| | - Esra Kaya Ayana
- Yildiz Technical University Yildiz 34349 Besiktas, Istanbul, Türkiye
| | | | - Devrim Unay
- Izmir Democracy University Uckuyular, Gursel Aksel Blv No: 14 35140, Karabaglar, Izmir, Türkiye
| |
Collapse
|
5
|
Berenguer AD, Kvasnytsia M, Bossa MN, Mukherjee T, Deligiannis N, Sahli H. Semi-supervised medical image classification via distance correlation minimization and graph attention regularization. Med Image Anal 2024; 94:103107. [PMID: 38401269 DOI: 10.1016/j.media.2024.103107] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2023] [Revised: 12/11/2023] [Accepted: 02/13/2024] [Indexed: 02/26/2024]
Abstract
We propose a novel semi-supervised learning method to leverage unlabeled data alongside minimal annotated data and improve medical imaging classification performance in realistic scenarios with limited labeling budgets to afford data annotations. Our method introduces distance correlation to minimize correlations between feature representations from different views of the same image encoded with non-coupled deep neural networks architectures. In addition, it incorporates a data-driven graph-attention based regularization strategy to model affinities among images within the unlabeled data by exploiting their inherent relational information in the feature space. We conduct extensive experiments on four medical imaging benchmark data sets involving X-ray, dermoscopic, magnetic resonance, and computer tomography imaging on single and multi-label medical imaging classification scenarios. Our experiments demonstrate the effectiveness of our method in achieving very competitive performance and outperforming several state-of-the-art semi-supervised learning methods. Furthermore, they confirm the suitability of distance correlation as a versatile dependence measure and the benefits of the proposed graph-attention based regularization for semi-supervised learning in medical imaging analysis.
Collapse
Affiliation(s)
- Abel Díaz Berenguer
- Vrije Universiteit Brussel (VUB), Department of Electronics and Informatics (ETRO), Pleinlaan 2, 1050 Brussels, Belgium.
| | - Maryna Kvasnytsia
- Vrije Universiteit Brussel (VUB), Department of Electronics and Informatics (ETRO), Pleinlaan 2, 1050 Brussels, Belgium
| | - Matías Nicolás Bossa
- Vrije Universiteit Brussel (VUB), Department of Electronics and Informatics (ETRO), Pleinlaan 2, 1050 Brussels, Belgium
| | - Tanmoy Mukherjee
- Vrije Universiteit Brussel (VUB), Department of Electronics and Informatics (ETRO), Pleinlaan 2, 1050 Brussels, Belgium
| | - Nikos Deligiannis
- Vrije Universiteit Brussel (VUB), Department of Electronics and Informatics (ETRO), Pleinlaan 2, 1050 Brussels, Belgium; Interuniversity Microelectronics Centre (IMEC), Kapeldreef 75, 3001 Heverlee, Belgium
| | - Hichem Sahli
- Vrije Universiteit Brussel (VUB), Department of Electronics and Informatics (ETRO), Pleinlaan 2, 1050 Brussels, Belgium; Interuniversity Microelectronics Centre (IMEC), Kapeldreef 75, 3001 Heverlee, Belgium
| |
Collapse
|
6
|
Park S, No C, Kim S, Han K, Jung JM, Kwon KY, Lee M. A multimodal screening system for elderly neurological diseases based on deep learning. Sci Rep 2023; 13:21013. [PMID: 38030653 PMCID: PMC10687257 DOI: 10.1038/s41598-023-48071-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2023] [Accepted: 11/22/2023] [Indexed: 12/01/2023] Open
Abstract
In this paper, we propose a deep-learning-based algorithm for screening neurological diseases. We proposed various examination protocols for screening neurological diseases and collected data by video-recording persons performing these protocols. We converted video data into human landmarks that capture action information with a much smaller data dimension. We also used voice data which are also effective indicators of neurological disorders. We designed a subnetwork for each protocol to extract features from landmarks or voice and a feature aggregator that combines all the information extracted from the protocols to make a final decision. Multitask learning was applied to screen two neurological diseases. To capture meaningful information about these human landmarks and voices, we applied various pre-trained models to extract preliminary features. The spatiotemporal characteristics of landmarks are extracted using a pre-trained graph neural network, and voice features are extracted using a pre-trained time-delay neural network. These extracted high-level features are then passed onto the subnetworks and an additional feature aggregator that are simultaneously trained. We also used various data augmentation techniques to overcome the shortage of data. Using a frame-length staticizer that considers the characteristics of the data, we can capture momentary tremors without wasting information. Finally, we examine the effectiveness of different protocols and different modalities (different body parts and voice) through extensive experiments. The proposed method achieves AUC scores of 0.802 for stroke and 0.780 for Parkinson's disease, which is effective for a screening system.
Collapse
Affiliation(s)
- Sangyoung Park
- Department of Electrical and Electronic Engineering, Hanyang University ERICA, Ansan, 15588, South Korea
| | - Changho No
- Department of Electrical and Electronic Engineering, Hanyang University ERICA, Ansan, 15588, South Korea
| | - Sora Kim
- Department of Electrical and Electronic Engineering, Hanyang University ERICA, Ansan, 15588, South Korea
| | - Kyoungmin Han
- Department of Electrical and Electronic Engineering, Hanyang University ERICA, Ansan, 15588, South Korea
| | - Jin-Man Jung
- Department of Neurology, Korea University Ansan Hospital, Ansan, 15355, South Korea
| | - Kyum-Yil Kwon
- Department of Neurology, Soonchunhyang University Seoul Hospital, Seoul, 04401, South Korea
| | - Minsik Lee
- Department of Electrical and Electronic Engineering, Hanyang University ERICA, Ansan, 15588, South Korea.
| |
Collapse
|
7
|
Yang M, Huang X, Huang L, Cai G. Diagnosis of Parkinson’s disease based on 3D ResNet: The frontal lobe is crucial. Biomed Signal Process Control 2023. [DOI: 10.1016/j.bspc.2023.104904] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/29/2023]
|
8
|
Automated methods for diagnosis of Parkinson’s disease and predicting severity level. Neural Comput Appl 2022. [DOI: 10.1007/s00521-021-06626-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|
9
|
Gaur L, Bhandari M, Razdan T, Mallik S, Zhao Z. Explanation-Driven Deep Learning Model for Prediction of Brain Tumour Status Using MRI Image Data. Front Genet 2022; 13:822666. [PMID: 35360838 PMCID: PMC8964286 DOI: 10.3389/fgene.2022.822666] [Citation(s) in RCA: 19] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2021] [Accepted: 02/17/2022] [Indexed: 11/13/2022] Open
Abstract
Cancer research has seen explosive development exploring deep learning (DL) techniques for analysing magnetic resonance imaging (MRI) images for predicting brain tumours. We have observed a substantial gap in explanation, interpretability, and high accuracy for DL models. Consequently, we propose an explanation-driven DL model by utilising a convolutional neural network (CNN), local interpretable model-agnostic explanation (LIME), and Shapley additive explanation (SHAP) for the prediction of discrete subtypes of brain tumours (meningioma, glioma, and pituitary) using an MRI image dataset. Unlike previous models, our model used a dual-input CNN approach to prevail over the classification challenge with images of inferior quality in terms of noise and metal artifacts by adding Gaussian noise. Our CNN training results reveal 94.64% accuracy as compared to other state-of-the-art methods. We used SHAP to ensure consistency and local accuracy for interpretation as Shapley values examine all future predictions applying all possible combinations of inputs. In contrast, LIME constructs sparse linear models around each prediction to illustrate how the model operates in the immediate area. Our emphasis for this study is interpretability and high accuracy, which is critical for realising disparities in predictive performance, helpful in developing trust, and essential in integration into clinical practice. The proposed method has a vast clinical application that could potentially be used for mass screening in resource-constraint countries.
Collapse
Affiliation(s)
- Loveleen Gaur
- Amity International Business School, Amity University, Noida, India
| | - Mohan Bhandari
- Nepal College of Information Technology, Lalitpur, Nepal
| | - Tanvi Razdan
- Amity International Business School, Amity University, Noida, India
| | - Saurav Mallik
- Center for Precision Health, School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, United States
| | - Zhongming Zhao
- Center for Precision Health, School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, United States
- Human Genetics Center, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, United States
- *Correspondence: Zhongming Zhao,
| |
Collapse
|
10
|
Deng X, Yin Q, Guo P. Efficient structural pseudoinverse learning-based hierarchical representation learning for skin lesion classification. COMPLEX INTELL SYST 2021. [DOI: 10.1007/s40747-021-00588-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
Abstract
AbstractThe success of deep learning in skin lesion classification mainly depends on the ultra-deep neural network and the significantly large training data set. Deep learning training is usually time-consuming, and large datasets with labels are hard to obtain, especially skin lesion images. Although pre-training and data augmentation can alleviate these issues, there are still some problems: (1) the data domain is not consistent, resulting in the slow convergence; and (2) low robustness to confusing skin lesions. To solve these problems, we propose an efficient structural pseudoinverse learning-based hierarchical representation learning method. Preliminary feature extraction, shallow network feature extraction and deep learning feature extraction are carried out respectively before the classification of skin lesion images. Gabor filter and pre-trained deep convolutional neural network are used for preliminary feature extraction. The structural pseudoinverse learning (S-PIL) algorithm is used to extract the shallow features. Then, S-PIL preliminarily identifies the skin lesion images that are difficult to be classified to form a new training set for deep learning feature extraction. Through the hierarchical representation learning, we analyze the features of skin lesion images layer by layer to improve the final classification. Our method not only avoid the slow convergence caused by inconsistency of data domain but also enhances the training of confusing examples. Without using additional data, our approach outperforms existing methods in the ISIC 2017 and ISIC 2018 datasets.
Collapse
|
11
|
Mahendran N, Vincent PMDR, Srinivasan K, Chang CY. Improving the Classification of Alzheimer's Disease Using Hybrid Gene Selection Pipeline and Deep Learning. Front Genet 2021; 12:784814. [PMID: 34868275 PMCID: PMC8632950 DOI: 10.3389/fgene.2021.784814] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2021] [Accepted: 10/20/2021] [Indexed: 11/13/2022] Open
Abstract
Alzheimer’s is a progressive, irreversible, neurodegenerative brain disease. Even with prominent symptoms, it takes years to notice, decode, and reveal Alzheimer’s. However, advancements in technologies, such as imaging techniques, help in early diagnosis. Still, sometimes the results are inaccurate, which delays the treatment. Thus, the research in recent times focused on identifying the molecular biomarkers that differentiate the genotype and phenotype characteristics. However, the gene expression dataset’s generated features are huge, 1,000 or even more than 10,000. To overcome such a curse of dimensionality, feature selection techniques are introduced. We designed a gene selection pipeline combining a filter, wrapper, and unsupervised method to select the relevant genes. We combined the minimum Redundancy and maximum Relevance (mRmR), Wrapper-based Particle Swarm Optimization (WPSO), and Auto encoder to select the relevant features. We used the GSE5281 Alzheimer’s dataset from the Gene Expression Omnibus We implemented an Improved Deep Belief Network (IDBN) with simple stopping criteria after choosing the relevant genes. We used a Bayesian Optimization technique to tune the hyperparameters in the Improved Deep Belief Network. The tabulated results show that the proposed pipeline shows promising results.
Collapse
Affiliation(s)
- Nivedhitha Mahendran
- School of Information Technology and Engineering, Vellore Institute of Technology, Vellore, India
| | - P M Durai Raj Vincent
- School of Information Technology and Engineering, Vellore Institute of Technology, Vellore, India
| | - Kathiravan Srinivasan
- School of Computer Science and Engineering, Vellore Institute of Technology, Vellore, India
| | - Chuan-Yu Chang
- Department of Computer Science and Information Engineering, National Yunlin University of Science and Technology, Yunlin, Taiwan
| |
Collapse
|
12
|
Anwar T. COVID19 Diagnosis using AutoML from 3D CT scans. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW) 2021. [DOI: 10.1109/iccvw54120.2021.00061] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/01/2023]
|
13
|
Zhang Z, Li G, Xu Y, Tang X. Application of Artificial Intelligence in the MRI Classification Task of Human Brain Neurological and Psychiatric Diseases: A Scoping Review. Diagnostics (Basel) 2021; 11:1402. [PMID: 34441336 PMCID: PMC8392727 DOI: 10.3390/diagnostics11081402] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2021] [Revised: 07/21/2021] [Accepted: 07/21/2021] [Indexed: 12/12/2022] Open
Abstract
Artificial intelligence (AI) for medical imaging is a technology with great potential. An in-depth understanding of the principles and applications of magnetic resonance imaging (MRI), machine learning (ML), and deep learning (DL) is fundamental for developing AI-based algorithms that can meet the requirements of clinical diagnosis and have excellent quality and efficiency. Moreover, a more comprehensive understanding of applications and opportunities would help to implement AI-based methods in an ethical and sustainable manner. This review first summarizes recent research advances in ML and DL techniques for classifying human brain magnetic resonance images. Then, the application of ML and DL methods to six typical neurological and psychiatric diseases is summarized, including Alzheimer's disease (AD), Parkinson's disease (PD), major depressive disorder (MDD), schizophrenia (SCZ), attention-deficit/hyperactivity disorder (ADHD), and autism spectrum disorder (ASD). Finally, the limitations of the existing research are discussed, and possible future research directions are proposed.
Collapse
Affiliation(s)
- Zhao Zhang
- 715-3 Teaching Building No.5, Department of Biomedical Engineering, School of Life Sciences, Beijing Institute of Technology, 5 South Zhongguancun Road, Haidian District, Beijing 100081, China; (Z.Z.); (G.L.)
| | - Guangfei Li
- 715-3 Teaching Building No.5, Department of Biomedical Engineering, School of Life Sciences, Beijing Institute of Technology, 5 South Zhongguancun Road, Haidian District, Beijing 100081, China; (Z.Z.); (G.L.)
| | - Yong Xu
- Department of Cardiology, Chinese PLA General Hospital, Beijing 100853, China;
| | - Xiaoying Tang
- 715-3 Teaching Building No.5, Department of Biomedical Engineering, School of Life Sciences, Beijing Institute of Technology, 5 South Zhongguancun Road, Haidian District, Beijing 100081, China; (Z.Z.); (G.L.)
| |
Collapse
|
14
|
UAVs in Disaster Management: Application of Integrated Aerial Imagery and Convolutional Neural Network for Flood Detection. SUSTAINABILITY 2021. [DOI: 10.3390/su13147547] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Floods have been a major cause of destruction, instigating fatalities and massive damage to the infrastructure and overall economy of the affected country. Flood-related devastation results in the loss of homes, buildings, and critical infrastructure, leaving no means of communication or travel for the people stuck in such disasters. Thus, it is essential to develop systems that can detect floods in a region to provide timely aid and relief to stranded people, save their livelihoods, homes, and buildings, and protect key city infrastructure. Flood prediction and warning systems have been implemented in developed countries, but the manufacturing cost of such systems is too high for developing countries. Remote sensing, satellite imagery, global positioning system, and geographical information systems are currently used for flood detection to assess the flood-related damages. These techniques use neural networks, machine learning, or deep learning methods. However, unmanned aerial vehicles (UAVs) coupled with convolution neural networks have not been explored in these contexts to instigate a swift disaster management response to minimize damage to infrastructure. Accordingly, this paper uses UAV-based aerial imagery as a flood detection method based on Convolutional Neural Network (CNN) to extract flood-related features from the images of the disaster zone. This method is effective in assessing the damage to local infrastructures in the disaster zones. The study area is based on a flood-prone region of the Indus River in Pakistan, where both pre-and post-disaster images are collected through UAVs. For the training phase, 2150 image patches are created by resizing and cropping the source images. These patches in the training dataset train the CNN model to detect and extract the regions where a flood-related change has occurred. The model is tested against both pre-and post-disaster images to validate it, which has positive flood detection results with an accuracy of 91%. Disaster management organizations can use this model to assess the damages to critical city infrastructure and other assets worldwide to instigate proper disaster responses and minimize the damages. This can help with the smart governance of the cities where all emergent disasters are addressed promptly.
Collapse
|
15
|
Abstract
This paper provides a review of an emerging field in the food processing sector, referring to efficient and safe food supply chains, ’from farm to fork’, as enabled by Artificial Intelligence (AI). The field is of great significance from economic, food safety and public health points of views. The paper focuses on effective food production, food maintenance energy management and food retail packaging labeling control, using recent advances in machine learning. Appropriate deep neural architectures are adopted and used for this purpose, including Fully Convolutional Networks, Long Short-Term Memories and Recurrent Neural Networks, Auto-Encoders and Attention mechanisms, Latent Variable extraction and clustering, as well as Domain Adaptation. Three experimental studies are presented, illustrating the ability of these AI methodologies to produce state-of-the-art performance in the whole food supply chain. In particular, these concern: (i) predicting plant growth and tomato yield in greenhouses, thus matching food production to market needs and reducing food waste or food unavailability; (ii) optimizing energy consumption across large networks of food retail refrigeration systems, through optimal selection of systems that can be shut-down and through prediction of the respective food de-freezing times, during peaks of power demand load; (iii) optical recognition and verification of food consumption expiry date in automatic inspection of retail packaged food, thus ensuring safety of food and people’s health.
Collapse
|
16
|
Comparative performance analysis of quantum machine learning with deep learning for diabetes prediction. COMPLEX INTELL SYST 2021. [DOI: 10.1007/s40747-021-00398-7] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
Abstract
Abstract
Background
Diabetes, the fastest growing health emergency, has created several life-threatening challenges to public health globally. It is a metabolic disorder and triggers many other chronic diseases such as heart attack, diabetic nephropathy, brain strokes, etc. The prime objective of this work is to develop a prognosis tool based on the PIMA Indian Diabetes dataset that will help medical practitioners in reducing the lethality associated with diabetes.
Methods
Based on the features present in the dataset, two prediction models have been proposed by employing deep learning (DL) and quantum machine learning (QML) techniques. The accuracy has been used to evaluate the prediction capability of these developed models. The outlier rejection, filling missing values, and normalization have been used to uplift the discriminatory performance of these models. Also, the performance of these models has been compared against state-of-the-art models.
Results
The performance measures such as precision, accuracy, recall, F1 score, specificity, balanced accuracy, false detection rate, missed detection rate, and diagnostic odds ratio have been achieved as 0.90, 0.95, 0.95, 0.93, 0.95, 0.95, 0.03, 0.02, and 399.00 for DL model respectively, However for QML, these measures have been computed as 0.74, 0.86, 0.85, 0.79, 0.86, 0.86, 0.11, 0.05, and 35.89 respectively.
Conclusion
The proposed DL model has a high diabetes prediction accuracy as compared with the developed QML and existing state-of-the-art models. It also uplifts the performance by 1.06% compared to reported work. However, the performance of the QML model has been found as satisfactory and comparable with existing literature.
Collapse
|
17
|
Habib M, Faris M, Qaddoura R, Alomari M, Alomari A, Faris H. Toward an Automatic Quality Assessment of Voice-Based Telemedicine Consultations: A Deep Learning Approach. SENSORS 2021; 21:s21093279. [PMID: 34068602 PMCID: PMC8126050 DOI: 10.3390/s21093279] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/28/2021] [Revised: 05/03/2021] [Accepted: 05/05/2021] [Indexed: 11/16/2022]
Abstract
Maintaining a high quality of conversation between doctors and patients is essential in telehealth services, where efficient and competent communication is important to promote patient health. Assessing the quality of medical conversations is often handled based on a human auditory-perceptual evaluation. Typically, trained experts are needed for such tasks, as they follow systematic evaluation criteria. However, the daily rapid increase of consultations makes the evaluation process inefficient and impractical. This paper investigates the automation of the quality assessment process of patient–doctor voice-based conversations in a telehealth service using a deep-learning-based classification model. For this, the data consist of audio recordings obtained from Altibbi. Altibbi is a digital health platform that provides telemedicine and telehealth services in the Middle East and North Africa (MENA). The objective is to assist Altibbi’s operations team in the evaluation of the provided consultations in an automated manner. The proposed model is developed using three sets of features: features extracted from the signal level, the transcript level, and the signal and transcript levels. At the signal level, various statistical and spectral information is calculated to characterize the spectral envelope of the speech recordings. At the transcript level, a pre-trained embedding model is utilized to encompass the semantic and contextual features of the textual information. Additionally, the hybrid of the signal and transcript levels is explored and analyzed. The designed classification model relies on stacked layers of deep neural networks and convolutional neural networks. Evaluation results show that the model achieved a higher level of precision when compared with the manual evaluation approach followed by Altibbi’s operations team.
Collapse
Affiliation(s)
- Maria Habib
- Altibbi, King Hussein Business Park, Amman 11831, Jordan; (M.H.); (M.F.); (M.A.); (A.A.)
| | - Mohammad Faris
- Altibbi, King Hussein Business Park, Amman 11831, Jordan; (M.H.); (M.F.); (M.A.); (A.A.)
| | - Raneem Qaddoura
- Faculty of Information Technology, Philadelphia University, Amman 19392, Jordan;
| | - Manal Alomari
- Altibbi, King Hussein Business Park, Amman 11831, Jordan; (M.H.); (M.F.); (M.A.); (A.A.)
| | - Alaa Alomari
- Altibbi, King Hussein Business Park, Amman 11831, Jordan; (M.H.); (M.F.); (M.A.); (A.A.)
| | - Hossam Faris
- Altibbi, King Hussein Business Park, Amman 11831, Jordan; (M.H.); (M.F.); (M.A.); (A.A.)
- King Abdullah II School for Information Technology, The University of Jordan, Amman 11942, Jordan
- School of Computing and Informatics, Al Hussein Technical University, Amman 11831, Jordan
- Correspondence: or
| |
Collapse
|
18
|
3D-semantic segmentation and classification of stomach infections using uncertainty aware deep neural networks. COMPLEX INTELL SYST 2021. [DOI: 10.1007/s40747-021-00328-7] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
AbstractWireless capsule endoscopy (WCE) might move through human body and captures the small bowel and captures the video and require the analysis of all frames of video due to which the diagnosis of gastrointestinal infections by the physician is a tedious task. This tiresome assignment has fuelled the researcher’s efforts to present an automated technique for gastrointestinal infections detection. The segmentation of stomach infections is a challenging task because the lesion region having low contrast and irregular shape and size. To handle this challenging task, in this research work a new deep semantic segmentation model is suggested for 3D-segmentation of the different types of stomach infections. In the segmentation model, deep labv3 is employed as a backbone of the ResNet-50 model. The model is trained with ground-masks and accurately performs pixel-wise classification in the testing phase. Similarity among the different types of stomach lesions accurate classification is a difficult task, which is addressed in this reported research by extracting deep features from global input images using a pre-trained ResNet-50 model. Furthermore, the latest advances in the estimation of uncertainty and model interpretability in the classification of different types of stomach infections is presented. The classification results estimate uncertainty related to the vital features in input and show how uncertainty and interpretability might be modeled in ResNet-50 for the classification of the different types of stomach infections. The proposed model achieved up to 90% prediction scores to authenticate the method performance.
Collapse
|
19
|
Hu H, Wei Y, Zhou Y. Product-harm crisis intelligent warning system design based on fine-grained sentiment analysis of automobile complaints. COMPLEX INTELL SYST 2021. [DOI: 10.1007/s40747-021-00306-z] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
AbstractDue to the huge losses caused by product-harm crises and subsequent recalls in the automobile industry, companies must urgently design a product-harm crisis warning system. However, the designs of existing warning systems use the recurrent neural network algorithm, which suffers from gradient disappearance and gradient explosion issues. To compensate for these defects, this study uses a long and short-term memory algorithm to achieve a final prediction accuracy of 90%. This study contributes to the research and design of automatic crisis warning systems by considering sentiment and improving the accuracy of automobile product-harm crisis prediction.
Collapse
|
20
|
Abstract
A novel machine learning approach is presented in this paper, based on extracting latent information and using it to assist decision making on ambulance attendance and conveyance to a hospital. The approach includes two steps: in the first, a forward model analyzes the clinical and, possibly, non-clinical factors (explanatory variables), predicting whether positive decisions (response variables) should be given to the ambulance call, or not; in the second, a backward model analyzes the latent variables extracted from the forward model to infer the decision making procedure. The forward model is implemented through a machine, or deep learning technique, whilst the backward model is implemented through unsupervised learning. An experimental study is presented, which illustrates the obtained results, by investigating emergency ambulance calls to people in nursing and residential care homes, over a one-year period, using an anonymized data set provided by East Midlands Ambulance Service in United Kingdom.
Collapse
|
21
|
Knowledge from the original network: restore a better pruned network with knowledge distillation. COMPLEX INTELL SYST 2021. [DOI: 10.1007/s40747-020-00248-y] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
AbstractTo deploy deep neural networks to edge devices with limited computation and storage costs, model compression is necessary for the application of deep learning. Pruning, as a traditional way of model compression, seeks to reduce the parameters of model weights. However, when a deep neural network is pruned, the accuracy of the network will significantly decrease. The traditional way to decrease the accuracy loss is fine-tuning. When over many parameters are pruned, the pruned network’s capacity is reduced heavily and cannot recover to high accuracy. In this paper, we apply the knowledge distillation strategy to abate the accuracy loss of pruned models. The original network of the pruned network was used as the teacher network, aiming to transfer the dark knowledge from the original network to the pruned sub-network. We have applied three mainstream knowledge distillation methods: response-based knowledge, feature-based knowledge, and relation-based knowledge (Gou et al. in Knowledge distillation: a survey. arXiv:200605525, 2020), and compare the result to the traditional fine-tuning method with grand-truth labels. Experiments have been done on the CIFAR100 dataset with several deep convolution neural network. Results show that the pruned network recovered by knowledge distillation with its original network performs better accuracy than it recovered by fine-tuning with sample labels. It has also been validated in this paper that the original network as the teacher performs better than differently structured networks with same accuracy as the teacher.
Collapse
|
22
|
LSTM-based Deep Learning Model for Stock Prediction and Predictive Optimization Model. EURO JOURNAL ON DECISION PROCESSES 2021. [DOI: 10.1016/j.ejdp.2021.100001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
|
23
|
Shankar K, Perumal E. A novel hand-crafted with deep learning features based fusion model for COVID-19 diagnosis and classification using chest X-ray images. COMPLEX INTELL SYST 2020; 7:1277-1293. [PMID: 34777955 PMCID: PMC7659408 DOI: 10.1007/s40747-020-00216-6] [Citation(s) in RCA: 31] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2020] [Accepted: 10/06/2020] [Indexed: 11/25/2022]
Abstract
COVID-19 pandemic is increasing in an exponential rate, with restricted accessibility of rapid test kits. So, the design and implementation of COVID-19 testing kits remain an open research problem. Several findings attained using radio-imaging approaches recommend that the images comprise important data related to coronaviruses. The application of recently developed artificial intelligence (AI) techniques, integrated with radiological imaging, is helpful in the precise diagnosis and classification of the disease. In this view, the current research paper presents a novel fusion model hand-crafted with deep learning features called FM-HCF-DLF model for diagnosis and classification of COVID-19. The proposed FM-HCF-DLF model comprises three major processes, namely Gaussian filtering-based preprocessing, FM for feature extraction and classification. FM model incorporates the fusion of handcrafted features with the help of local binary patterns (LBP) and deep learning (DL) features and it also utilizes convolutional neural network (CNN)-based Inception v3 technique. To further improve the performance of Inception v3 model, the learning rate scheduler using Adam optimizer is applied. At last, multilayer perceptron (MLP) is employed to carry out the classification process. The proposed FM-HCF-DLF model was experimentally validated using chest X-ray dataset. The experimental outcomes inferred that the proposed model yielded superior performance with maximum sensitivity of 93.61%, specificity of 94.56%, precision of 94.85%, accuracy of 94.08%, F score of 93.2% and kappa value of 93.5%.
Collapse
Affiliation(s)
- K Shankar
- Department of Computer Applications, Alagappa University, Karaikudi, India
| | - Eswaran Perumal
- Department of Computer Applications, Alagappa University, Karaikudi, India
| |
Collapse
|
24
|
Noor MBT, Zenia NZ, Kaiser MS, Mamun SA, Mahmud M. Application of deep learning in detecting neurological disorders from magnetic resonance images: a survey on the detection of Alzheimer's disease, Parkinson's disease and schizophrenia. Brain Inform 2020; 7:11. [PMID: 33034769 PMCID: PMC7547060 DOI: 10.1186/s40708-020-00112-2] [Citation(s) in RCA: 78] [Impact Index Per Article: 19.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2020] [Accepted: 09/17/2020] [Indexed: 12/12/2022] Open
Abstract
Neuroimaging, in particular magnetic resonance imaging (MRI), has been playing an important role in understanding brain functionalities and its disorders during the last couple of decades. These cutting-edge MRI scans, supported by high-performance computational tools and novel ML techniques, have opened up possibilities to unprecedentedly identify neurological disorders. However, similarities in disease phenotypes make it very difficult to detect such disorders accurately from the acquired neuroimaging data. This article critically examines and compares performances of the existing deep learning (DL)-based methods to detect neurological disorders-focusing on Alzheimer's disease, Parkinson's disease and schizophrenia-from MRI data acquired using different modalities including functional and structural MRI. The comparative performance analysis of various DL architectures across different disorders and imaging modalities suggests that the Convolutional Neural Network outperforms other methods in detecting neurological disorders. Towards the end, a number of current research challenges are indicated and some possible future research directions are provided.
Collapse
Affiliation(s)
- Manan Binth Taj Noor
- Institute of Information Technology, Jahangirnagar University, Savar, 1342, Dhaka, Bangladesh
| | - Nusrat Zerin Zenia
- Institute of Information Technology, Jahangirnagar University, Savar, 1342, Dhaka, Bangladesh
| | - M Shamim Kaiser
- Institute of Information Technology, Jahangirnagar University, Savar, 1342, Dhaka, Bangladesh.
| | - Shamim Al Mamun
- Institute of Information Technology, Jahangirnagar University, Savar, 1342, Dhaka, Bangladesh
| | - Mufti Mahmud
- Department of Computing & Technology, Nottingham Trent University, NG11 8NS, Nottingham, UK.
| |
Collapse
|
25
|
Kheradvar A, Jafarkhani H, Guy TS, Finn JP. Prospect of artificial intelligence for the assessment of cardiac function and treatment of cardiovascular disease. Future Cardiol 2020; 17:183-187. [PMID: 32933328 DOI: 10.2217/fca-2020-0128] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Affiliation(s)
- Arash Kheradvar
- The Edwards Lifesciences Center for Advanced Cardiovascular Technology, University of California, Irvine, Irvine, CA 92697, USA
| | - Hamid Jafarkhani
- Center for Pervasive Communications & Computing, University of California, Irvine, Irvine, CA 92697, USA
| | - Thomas Sloane Guy
- Division of Cardiac Surgery, Department of Surgery, Thomas Jefferson University, Philadelphia, PA 19107, USA
| | - John Paul Finn
- Department of Radiological Sciences, David Geffen School of Medicine, UCLA, Los Angeles, CA 90095, USA
| |
Collapse
|
26
|
MR Image Synthesis Using Generative Adversarial Networks for Parkinson’s Disease Classification. ACTA ACUST UNITED AC 2020. [DOI: 10.1007/978-981-15-4992-2_30] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/21/2023]
|
27
|
Abstract
AbstractAn important area of computer vision is real-time object tracking, which is now widely used in intelligent transportation and smart industry technologies. Although the correlation filter object tracking methods have a good real-time tracking effect, it still faces many challenges such as scale variation, occlusion, and boundary effects. Many scholars have continuously improved existing methods for better efficiency and tracking performance in some aspects. To provide a comprehensive understanding of the background, key technologies and algorithms of single object tracking, this article focuses on the correlation filter-based object tracking algorithms. Specifically, the background and current advancement of the object tracking methodologies, as well as the presentation of the main datasets are introduced. All kinds of methods are summarized to present tracking results in various vision problems, and a visual tracking method based on reliability is observed.
Collapse
|
28
|
Abstract
AbstractThis paper discusses the problem of decoding gestures represented by surface electromyography (sEMG) signals in the presence of variable force levels. It is an attempt that multi-task learning (MTL) is proposed to recognize gestures and force levels synchronously. First, methods of gesture recognition with different force levels are investigated. Then, MTL framework is presented to improve the gesture recognition performance and give information about force levels. Last but not least, to solve the problem that using the greedy principle in MTL, a modified pseudo-task augmentation (PTA) trajectory is introduced. Experiments conducted on two representative datasets demonstrate that compared with other methods, frequency domain information with convolutional neural network (CNN) is more suitable for gesture recognition with variable force levels. Besides, the feasibility of extracting features that are closely related to both gestures and force levels is verified via MTL. By influencing learning dynamics, the proposed PTA method can improve the results of all tasks, and make it applicable to the case where the main tasks and auxiliary tasks are clear.
Collapse
|
29
|
Dupuy-Chessa S, Proper HA. Medical Dialogue Summarization for Automated Reporting in Healthcare. ADVANCED INFORMATION SYSTEMS ENGINEERING WORKSHOPS 2020. [PMCID: PMC7225507 DOI: 10.1007/978-3-030-49165-9_7] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
Healthcare providers generally spend excessive time on administrative tasks at the expense of direct patient care. The emergence of new artificial intelligence and natural language processing technologies gives rise to innovations that could relieve them of this burden. In this paper, we present a pipeline structure for building dialogue summarization systems. Our pipeline summarizes a consultation of a patient with a care provider and automatically generates a report compliant with medical formats. Four pipeline components are used to generate a report based on audio input. The outputs of each component are analyzed to determine the most important challenges and issues. The current proof-of-concept, which was applied to eight doctor-to-patient sessions concerning ear infection, shows that automatic dialogue summarization and reporting is achievable, but requires improvements to increase completeness.
Collapse
Affiliation(s)
| | - Henderik A. Proper
- Luxembourg Institute of Science and Technology, Esch-sur-Alzette, Luxembourg
| |
Collapse
|
30
|
Jmaiel M, Mokhtari M, Abdulrazak B, Aloulou H, Kallel S. Alzheimer’s Disease Early Detection Using a Low Cost Three-Dimensional Densenet-121 Architecture. LECTURE NOTES IN COMPUTER SCIENCE 2020. [PMCID: PMC7313295 DOI: 10.1007/978-3-030-51517-1_1] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/03/2022]
Abstract
The objective of this work is to detect Alzheimer’s disease using Magnetic Resonance Imaging. For this, we use a three-dimensional densenet-121 architecture. With the use of only freely available tools, we obtain good results: a deep neural network showing metrics of 87% accuracy, 87% sensitivity (micro-average), 88% specificity (micro-average), and 92% AUROC (micro-average) for the task of classifying five different classes (disease stages). The use of tools available for free means that this work can be replicated in developing countries.
Collapse
|
31
|
Detecting Neurodegenerative Disease from MRI: A Brief Review on a Deep Learning Perspective. Brain Inform 2019. [DOI: 10.1007/978-3-030-37078-7_12] [Citation(s) in RCA: 52] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
|