1
|
Swain BK, Mohapatra S, Mishra M, Sharma R. A unified approach for Parkinson's disease recognition: imbalance mitigation and grid search optimized boosting with LightGBM. Med Biol Eng Comput 2024; 62:3471-3491. [PMID: 38874706 DOI: 10.1007/s11517-024-03139-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2023] [Accepted: 05/24/2024] [Indexed: 06/15/2024]
Abstract
The work elucidates the importance of accurate Parkinson's disease classification within medical diagnostics and introduces a novel framework for achieving this goal. Specifically, the study focuses on enhancing disease identification accuracy utilizing boosting methods. A standout contribution of this work lies in the utilization of a light gradient boosting machine (LGBM) coupled with hyperparameter tuning through grid search optimization (GSO) on the Parkinson's disease dataset derived from speech recording signals. In addition, the Synthetic Minority Over-sampling Technique (SMOTE) has also been employed as a pre-processing technique to balance the dataset, enhancing the robustness and reliability of the analysis. This approach is a novel addition to the study and underscores its potential to enhance disease identification accuracy. The datasets employed in this work include both gender-specific and combined cases, utilizing several distinctive feature subsets including baseline, Mel-frequency cepstral coefficients (MFCC), time-frequency, wavelet transform (WT), vocal fold, and tunable-Q-factor wavelet transform (TQWT). Comparative analyses against state-of-the-art boosting methods, such as AdaBoost and XG-Boost, reveal the superior performance of our proposed approach across diverse datasets and metrics. Notably, on the male cohort dataset, our method achieves exceptional results, demonstrating an accuracy of 0.98, precision of 1.00, sensitivity of 0.97, F1-Score of 0.98, and specificity of 1.00 when utilizing all features with GSO-LGBM. In comparison to AdaBoost and XGBoost, the proposed framework utilizing LGBM demonstrates superior accuracy, achieving an average improvement of 5% in classification accuracy across all feature subsets and datasets. These findings underscore the potential of the proposed methodology to enhance disease identification accuracy and provide valuable insights for further advancements in medical diagnostics.
Collapse
Affiliation(s)
- Bhanja Kishor Swain
- Department of Electrical Engineering, Siksha O Anusandhan University, Bhubaneswar, 751030, India
- Center for Internet of Things, Siksha O Anusandhan University, Bhubaneswar, 751030, India
| | - Subhashree Mohapatra
- Department of Computer Science and Engineering, Siksha O Anusandhan University, Bhubaneswar, 751030, India
| | - Manohar Mishra
- Department of Electrical and Electronics Engineering, Siksha O Anusandhan University, Bhubaneswar, 751030, India.
| | - Renu Sharma
- Department of Electrical Engineering, Siksha O Anusandhan University, Bhubaneswar, 751030, India
| |
Collapse
|
2
|
Jeong SM, Kim S, Lee EC, Kim HJ. Exploring Spectrogram-Based Audio Classification for Parkinson's Disease: A Study on Speech Classification and Qualitative Reliability Verification. SENSORS (BASEL, SWITZERLAND) 2024; 24:4625. [PMID: 39066023 PMCID: PMC11280556 DOI: 10.3390/s24144625] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/12/2024] [Revised: 07/15/2024] [Accepted: 07/16/2024] [Indexed: 07/28/2024]
Abstract
Patients suffering from Parkinson's disease suffer from voice impairment. In this study, we introduce models to classify normal and Parkinson's patients using their speech. We used an AST (audio spectrogram transformer), a transformer-based speech classification model that has recently outperformed CNN-based models in many fields, and a CNN-based PSLA (pretraining, sampling, labeling, and aggregation), a high-performance model in the existing speech classification field, for the study. This study compares and analyzes the models from both quantitative and qualitative perspectives. First, qualitatively, PSLA outperformed AST by more than 4% in accuracy, and the AUC was also higher, with 94.16% for AST and 97.43% for PSLA. Furthermore, we qualitatively evaluated the ability of the models to capture the acoustic features of Parkinson's through various CAM (class activation map)-based XAI (eXplainable AI) models such as GradCAM and EigenCAM. Based on PSLA, we found that the model focuses well on the muffled frequency band of Parkinson's speech, and the heatmap analysis of false positives and false negatives shows that the speech features are also visually represented when the model actually makes incorrect predictions. The contribution of this paper is that we not only found a suitable model for diagnosing Parkinson's through speech using two different types of models but also validated the predictions of the model in practice.
Collapse
Affiliation(s)
- Seung-Min Jeong
- Department of AI & Informatics, Graduate School, Sangmyung University, Hongjimun 2-gil 20, Jongno-gu, Seoul 03016, Republic of Korea; (S.-M.J.); (S.K.)
| | - Seunghyun Kim
- Department of AI & Informatics, Graduate School, Sangmyung University, Hongjimun 2-gil 20, Jongno-gu, Seoul 03016, Republic of Korea; (S.-M.J.); (S.K.)
| | - Eui Chul Lee
- Department of Human-Centered Artificial Intelligence, Sangmyung University, Hongjimun 2-gil 20, Jongno-gu, Seoul 03016, Republic of Korea
| | - Han Joon Kim
- Department of Neurology, Seoul National University College of Medicine, Seoul National University Hospital, Daehak-ro 101, Jongno-gu, Seoul 03080, Republic of Korea
| |
Collapse
|
3
|
Bhidayasiri R, Sringean J, Phumphid S, Anan C, Thanawattano C, Deoisres S, Panyakaew P, Phokaewvarangkul O, Maytharakcheep S, Buranasrikul V, Prasertpan T, Khontong R, Jagota P, Chaisongkram A, Jankate W, Meesri J, Chantadunga A, Rattanajun P, Sutaphan P, Jitpugdee W, Chokpatcharavate M, Avihingsanon Y, Sittipunt C, Sittitrai W, Boonrach G, Phonsrithong A, Suvanprakorn P, Vichitcholchai J, Bunnag T. The rise of Parkinson's disease is a global challenge, but efforts to tackle this must begin at a national level: a protocol for national digital screening and "eat, move, sleep" lifestyle interventions to prevent or slow the rise of non-communicable diseases in Thailand. Front Neurol 2024; 15:1386608. [PMID: 38803644 PMCID: PMC11129688 DOI: 10.3389/fneur.2024.1386608] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2024] [Accepted: 04/19/2024] [Indexed: 05/29/2024] Open
Abstract
The rising prevalence of Parkinson's disease (PD) globally presents a significant public health challenge for national healthcare systems, particularly in low-to-middle income countries, such as Thailand, which may have insufficient resources to meet these escalating healthcare needs. There are also many undiagnosed cases of early-stage PD, a period when therapeutic interventions would have the most value and least cost. The traditional "passive" approach, whereby clinicians wait for patients with symptomatic PD to seek treatment, is inadequate. Proactive, early identification of PD will allow timely therapeutic interventions, and digital health technologies can be scaled up in the identification and early diagnosis of cases. The Parkinson's disease risk survey (TCTR20231025005) aims to evaluate a digital population screening platform to identify undiagnosed PD cases in the Thai population. Recognizing the long prodromal phase of PD, the target demographic for screening is people aged ≥ 40 years, approximately 20 years before the usual emergence of motor symptoms. Thailand has a highly rated healthcare system with an established universal healthcare program for citizens, making it ideal for deploying a national screening program using digital technology. Designed by a multidisciplinary group of PD experts, the digital platform comprises a 20-item questionnaire about PD symptoms along with objective tests of eight digital markers: voice vowel, voice sentences, resting and postural tremor, alternate finger tapping, a "pinch-to-size" test, gait and balance, with performance recorded using a mobile application and smartphone's sensors. Machine learning tools use the collected data to identify subjects at risk of developing, or with early signs of, PD. This article describes the selection and validation of questionnaire items and digital markers, with results showing the chosen parameters and data analysis methods to be robust, reliable, and reproducible. This digital platform could serve as a model for similar screening strategies for other non-communicable diseases in Thailand.
Collapse
Affiliation(s)
- Roongroj Bhidayasiri
- Chulalongkorn Centre of Excellence for Parkinson’s Disease and Related Disorders, Department of Medicine, Faculty of Medicine, Chulalongkorn University and King Chulalongkorn Memorial Hospital, Thai Red Cross Society, Bangkok, Thailand
- The Academy of Science, The Royal Society of Thailand, Bangkok, Thailand
| | - Jirada Sringean
- Chulalongkorn Centre of Excellence for Parkinson’s Disease and Related Disorders, Department of Medicine, Faculty of Medicine, Chulalongkorn University and King Chulalongkorn Memorial Hospital, Thai Red Cross Society, Bangkok, Thailand
| | - Saisamorn Phumphid
- Chulalongkorn Centre of Excellence for Parkinson’s Disease and Related Disorders, Department of Medicine, Faculty of Medicine, Chulalongkorn University and King Chulalongkorn Memorial Hospital, Thai Red Cross Society, Bangkok, Thailand
| | - Chanawat Anan
- Chulalongkorn Centre of Excellence for Parkinson’s Disease and Related Disorders, Department of Medicine, Faculty of Medicine, Chulalongkorn University and King Chulalongkorn Memorial Hospital, Thai Red Cross Society, Bangkok, Thailand
| | | | - Suwijak Deoisres
- National Electronics and Computer Technology Centre, Pathum Thani, Thailand
| | - Pattamon Panyakaew
- Chulalongkorn Centre of Excellence for Parkinson’s Disease and Related Disorders, Department of Medicine, Faculty of Medicine, Chulalongkorn University and King Chulalongkorn Memorial Hospital, Thai Red Cross Society, Bangkok, Thailand
| | - Onanong Phokaewvarangkul
- Chulalongkorn Centre of Excellence for Parkinson’s Disease and Related Disorders, Department of Medicine, Faculty of Medicine, Chulalongkorn University and King Chulalongkorn Memorial Hospital, Thai Red Cross Society, Bangkok, Thailand
| | - Suppata Maytharakcheep
- Chulalongkorn Centre of Excellence for Parkinson’s Disease and Related Disorders, Department of Medicine, Faculty of Medicine, Chulalongkorn University and King Chulalongkorn Memorial Hospital, Thai Red Cross Society, Bangkok, Thailand
| | - Vijittra Buranasrikul
- Chulalongkorn Centre of Excellence for Parkinson’s Disease and Related Disorders, Department of Medicine, Faculty of Medicine, Chulalongkorn University and King Chulalongkorn Memorial Hospital, Thai Red Cross Society, Bangkok, Thailand
| | - Tittaya Prasertpan
- Chulalongkorn Centre of Excellence for Parkinson’s Disease and Related Disorders, Department of Medicine, Faculty of Medicine, Chulalongkorn University and King Chulalongkorn Memorial Hospital, Thai Red Cross Society, Bangkok, Thailand
- Sawanpracharak Hospital, Nakhon Sawan, Thailand
| | | | - Priya Jagota
- Chulalongkorn Centre of Excellence for Parkinson’s Disease and Related Disorders, Department of Medicine, Faculty of Medicine, Chulalongkorn University and King Chulalongkorn Memorial Hospital, Thai Red Cross Society, Bangkok, Thailand
| | - Araya Chaisongkram
- Chulalongkorn Centre of Excellence for Parkinson’s Disease and Related Disorders, Department of Medicine, Faculty of Medicine, Chulalongkorn University and King Chulalongkorn Memorial Hospital, Thai Red Cross Society, Bangkok, Thailand
| | - Worawit Jankate
- Chulalongkorn Centre of Excellence for Parkinson’s Disease and Related Disorders, Department of Medicine, Faculty of Medicine, Chulalongkorn University and King Chulalongkorn Memorial Hospital, Thai Red Cross Society, Bangkok, Thailand
| | - Jeeranun Meesri
- Chulalongkorn Centre of Excellence for Parkinson’s Disease and Related Disorders, Department of Medicine, Faculty of Medicine, Chulalongkorn University and King Chulalongkorn Memorial Hospital, Thai Red Cross Society, Bangkok, Thailand
| | - Araya Chantadunga
- Chulalongkorn Centre of Excellence for Parkinson’s Disease and Related Disorders, Department of Medicine, Faculty of Medicine, Chulalongkorn University and King Chulalongkorn Memorial Hospital, Thai Red Cross Society, Bangkok, Thailand
| | - Piyaporn Rattanajun
- Chulalongkorn Centre of Excellence for Parkinson’s Disease and Related Disorders, Department of Medicine, Faculty of Medicine, Chulalongkorn University and King Chulalongkorn Memorial Hospital, Thai Red Cross Society, Bangkok, Thailand
| | - Phantakarn Sutaphan
- Chulalongkorn Centre of Excellence for Parkinson’s Disease and Related Disorders, Department of Medicine, Faculty of Medicine, Chulalongkorn University and King Chulalongkorn Memorial Hospital, Thai Red Cross Society, Bangkok, Thailand
| | - Weerachai Jitpugdee
- Department of Rehabilitation Medicine, King Chulalongkorn Memorial Hospital, Thai Red Cross Society, Bangkok, Thailand
| | - Marisa Chokpatcharavate
- Chulalongkorn Parkinson's Disease Support Group, Department of Medicine, Faculty of Medicine, Chulalongkorn Centre of Excellence for Parkinson's Disease and Related Disorders, Chulalongkorn University and King Chulalongkorn Memorial Hospital, Bangkok, Thailand
| | - Yingyos Avihingsanon
- Faculty of Medicine, Chulalongkorn University, Bangkok, Thailand
- Thai Red Cross Society, Bangkok, Thailand
| | - Chanchai Sittipunt
- Faculty of Medicine, Chulalongkorn University, Bangkok, Thailand
- Thai Red Cross Society, Bangkok, Thailand
| | | | | | | | | | | | - Tej Bunnag
- Thai Red Cross Society, Bangkok, Thailand
| |
Collapse
|
4
|
Ameli A, Peña-Castillo L, Usefi H. Assessing the reproducibility of machine-learning-based biomarker discovery in Parkinson's disease. Comput Biol Med 2024; 174:108407. [PMID: 38603902 DOI: 10.1016/j.compbiomed.2024.108407] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2023] [Revised: 03/21/2024] [Accepted: 04/01/2024] [Indexed: 04/13/2024]
Abstract
Feature selection and machine learning algorithms can be used to analyze Single Nucleotide Polymorphisms (SNPs) data and identify potential disease biomarkers. Reproducibility of identified biomarkers is critical for them to be useful for clinical research; however, genotyping platforms and selection criteria for individuals to be genotyped affect the reproducibility of identified biomarkers. To assess biomarkers reproducibility, we collected five SNPs datasets from the database of Genotypes and Phenotypes (dbGaP) and explored several data integration strategies. While combining datasets can lead to a reduction in classification accuracy, it has the potential to improve the reproducibility of potential biomarkers. We evaluated the agreement among different strategies in terms of the SNPs that were identified as potential Parkinson's disease (PD) biomarkers. Our findings indicate that, on average, 93% of the SNPs identified in a single dataset fail to be identified in other datasets. However, through dataset integration, this lack of replication is reduced to 62%. We discovered fifty SNPs that were identified at least twice, which could potentially serve as novel PD biomarkers. These SNPs are indirectly linked to PD in the literature but have not been directly associated with PD before. These findings open up new potential avenues of investigation.
Collapse
Affiliation(s)
- Ali Ameli
- Department of Computer Science, Memorial University of Newfoundland, 230 Elizabeth Ave, St. John's, A1C5S7, NL, Canada
| | - Lourdes Peña-Castillo
- Department of Computer Science, Memorial University of Newfoundland, 230 Elizabeth Ave, St. John's, A1C5S7, NL, Canada; Department of Biology, Memorial University of Newfoundland, 230 Elizabeth Ave, St. John's, A1C5S7, NL, Canada.
| | - Hamid Usefi
- Department of Computer Science, Memorial University of Newfoundland, 230 Elizabeth Ave, St. John's, A1C5S7, NL, Canada; Department of Mathematics and Statistics, Memorial University of Newfoundland, 230 Elizabeth Ave, St. John's, A1C5S7, NL, Canada.
| |
Collapse
|
5
|
Abumalloh RA, Nilashi M, Samad S, Ahmadi H, Alghamdi A, Alrizq M, Alyami S. Parkinson's disease diagnosis using deep learning: A bibliometric analysis and literature review. Ageing Res Rev 2024; 96:102285. [PMID: 38554785 DOI: 10.1016/j.arr.2024.102285] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2023] [Revised: 03/20/2024] [Accepted: 03/24/2024] [Indexed: 04/02/2024]
Abstract
Parkinson's Disease (PD) is a progressive neurodegenerative illness triggered by decreased dopamine secretion. Deep Learning (DL) has gained substantial attention in PD diagnosis research, with an increase in the number of published papers in this discipline. PD detection using DL has presented more promising outcomes as compared with common machine learning approaches. This article aims to conduct a bibliometric analysis and a literature review focusing on the prominent developments taking place in this area. To achieve the target of the study, we retrieved and analyzed the available research papers in the Scopus database. Following that, we conducted a bibliometric analysis to inspect the structure of keywords, authors, and countries in the surveyed studies by providing visual representations of the bibliometric data using VOSviewer software. The study also provides an in-depth review of the literature focusing on different indicators of PD, deployed approaches, and performance metrics. The outcomes indicate the firm development of PD diagnosis using DL approaches over time and a large diversity of studies worldwide. Additionally, the literature review presented a research gap in DL approaches related to incremental learning, particularly in relation to big data analysis.
Collapse
Affiliation(s)
- Rabab Ali Abumalloh
- Department of Computer Science and Engineering, Qatar University, Doha 2713, Qatar
| | - Mehrbakhsh Nilashi
- Institute of Research and Development, Duy Tan University, Da Nang, Vietnam; School of Computer Science, Duy Tan University, Da Nang, Vietnam; UCSI Graduate Business School, UCSI University, No. 1 Jalan Menara Gading, UCSI Heights, Cheras, Kuala Lumpur 56000, Malaysia; Centre for Global Sustainability Studies (CGSS), Universiti Sains Malaysia, Penang 11800, Malaysia.
| | - Sarminah Samad
- Faculty of Business, UNITAR International University, Tierra Crest, Jalan SS6/3, Petaling Jaya, Selangor 47301, Malaysia
| | - Hossein Ahmadi
- Centre for Health Technology, Faculty of Health, University of Plymouth, Plymouth PL4 8AA, UK
| | - Abdullah Alghamdi
- Information Systems Dept., College of Computer Science and Information Systems, Najran University, Najran, Saudi Arabia; AI Lab, Scientific and Engineering Research Center (SERC), Najran University, Najran, Saudi Arabia
| | - Mesfer Alrizq
- Information Systems Dept., College of Computer Science and Information Systems, Najran University, Najran, Saudi Arabia; AI Lab, Scientific and Engineering Research Center (SERC), Najran University, Najran, Saudi Arabia
| | - Sultan Alyami
- AI Lab, Scientific and Engineering Research Center (SERC), Najran University, Najran, Saudi Arabia; Computer Science Dept., College of Computer Science and Information Systems, Najran University, Najran, Saudi Arabia
| |
Collapse
|
6
|
Malekroodi HS, Madusanka N, Lee BI, Yi M. Leveraging Deep Learning for Fine-Grained Categorization of Parkinson's Disease Progression Levels through Analysis of Vocal Acoustic Patterns. Bioengineering (Basel) 2024; 11:295. [PMID: 38534569 DOI: 10.3390/bioengineering11030295] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2024] [Revised: 03/18/2024] [Accepted: 03/18/2024] [Indexed: 03/28/2024] Open
Abstract
Speech impairments often emerge as one of the primary indicators of Parkinson's disease (PD), albeit not readily apparent in its early stages. While previous studies focused predominantly on binary PD detection, this research explored the use of deep learning models to automatically classify sustained vowel recordings into healthy controls, mild PD, or severe PD based on motor symptom severity scores. Popular convolutional neural network (CNN) architectures, VGG and ResNet, as well as vision transformers, Swin, were fine-tuned on log mel spectrogram image representations of the segmented voice data. Furthermore, the research investigated the effects of audio segment lengths and specific vowel sounds on the performance of these models. The findings indicated that implementing longer segments yielded better performance. The models showed strong capability in distinguishing PD from healthy subjects, achieving over 95% precision. However, reliably discriminating between mild and severe PD cases remained challenging. The VGG16 achieved the best overall classification performance with 91.8% accuracy and the largest area under the ROC curve. Furthermore, focusing analysis on the vowel /u/ could further improve accuracy to 96%. Applying visualization techniques like Grad-CAM also highlighted how CNN models focused on localized spectrogram regions while transformers attended to more widespread patterns. Overall, this work showed the potential of deep learning for non-invasive screening and monitoring of PD progression from voice recordings, but larger multi-class labeled datasets are needed to further improve severity classification.
Collapse
Affiliation(s)
- Hadi Sedigh Malekroodi
- Industry 4.0 Convergence Bionics Engineering, Pukyong National University, Busan 48513, Republic of Korea
| | - Nuwan Madusanka
- Digital of Healthcare Research Center, Institute of Information Technology and Convergence, Pukyong National University, Busan 48513, Republic of Korea
| | - Byeong-Il Lee
- Industry 4.0 Convergence Bionics Engineering, Pukyong National University, Busan 48513, Republic of Korea
- Digital of Healthcare Research Center, Institute of Information Technology and Convergence, Pukyong National University, Busan 48513, Republic of Korea
- Division of Smart Healthcare, Pukyong National University, Busan 48513, Republic of Korea
| | - Myunggi Yi
- Industry 4.0 Convergence Bionics Engineering, Pukyong National University, Busan 48513, Republic of Korea
- Digital of Healthcare Research Center, Institute of Information Technology and Convergence, Pukyong National University, Busan 48513, Republic of Korea
- Division of Smart Healthcare, Pukyong National University, Busan 48513, Republic of Korea
| |
Collapse
|
7
|
Wang Y, Li F, Zhang X, Wang P, Li Y, Zhang Y. Intra-subject enveloped multilayer fuzzy sample compression for speech diagnosis of Parkinson's disease. Med Biol Eng Comput 2024; 62:371-388. [PMID: 37874453 DOI: 10.1007/s11517-023-02944-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2023] [Accepted: 10/02/2023] [Indexed: 10/25/2023]
Abstract
Machine learning-based Parkinson's disease (PD) speech diagnosis is a current research hotspot. However, existing methods use each corpus sample as the base unit for modeling. Since different corpus samples within the same subject have different sensitive speech features, it is difficult to obtain unified and stable sensitive speech features (diagnostic markers) that reflect the pathology of the whole subject. Therefore, this study aims at compressing the corpus samples within the subject to facilitate the search for diagnostic markers with high diagnostic accuracy. A two-step sample compression module (TSCM) can solve the problem above. It includes two major parts: sample pruning module (SPM) and sample fuzzy clustering mechanism (SFCMD). Based on stacking multiple TSCMs, a multilayer sample compression module (MSCM) is formed to obtain multilayer compression samples. After that, simultaneous sample/feature selection mechanism (SS/FSM) is designed for feature selection. Based on the multilayer compression samples processed by MSCM and SS/FSM, a novel ensemble learning algorithm (EMSFE) is designed with sparse fusion ensemble learning mechanism (SFELM). The proposed EMSFE is validated by visualization of extracted features and performance comparison with related algorithms. The experimental results show that the proposed algorithm can effectively extract the stable diagnostic markers by compressing the corpus samples within the subject. Furthermore, based on LOSO cross validation, the proposed algorithm with extreme learning machine (ELM) classifier can achieve the accuracy of 92.5%, 93.75% and 91.67% on three datasets, respectively. The proposed EMSFE can extract unified and stable sensitive features that accurately reflect the overall pathology of the subject, which can better meet the requirements of clinical applications.The code and datasets can be found in: https://github.com/wywwwww/EMSFE-supplementary-material.git.
Collapse
Affiliation(s)
- Yiwen Wang
- School of Microelectornics and Communication Engineering, Chongqing University, Chongqing, 400044, China
| | - Fan Li
- School of Microelectornics and Communication Engineering, Chongqing University, Chongqing, 400044, China
| | - Xiaoheng Zhang
- School of Microelectornics and Communication Engineering, Chongqing University, Chongqing, 400044, China
| | - Pin Wang
- School of Microelectornics and Communication Engineering, Chongqing University, Chongqing, 400044, China
| | - Yongming Li
- School of Microelectornics and Communication Engineering, Chongqing University, Chongqing, 400044, China.
| | - Yanling Zhang
- Department of Neurology, Southwest Hospital, Army Medical University, Chongqing, 400038, China
| |
Collapse
|
8
|
Ali L, Javeed A, Noor A, Rauf HT, Kadry S, Gandomi AH. Parkinson's disease detection based on features refinement through L1 regularized SVM and deep neural network. Sci Rep 2024; 14:1333. [PMID: 38228772 PMCID: PMC10791701 DOI: 10.1038/s41598-024-51600-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2023] [Accepted: 01/07/2024] [Indexed: 01/18/2024] Open
Abstract
In previous studies, replicated and multiple types of speech data have been used for Parkinson's disease (PD) detection. However, two main problems in these studies are lower PD detection accuracy and inappropriate validation methodologies leading to unreliable results. This study discusses the effects of inappropriate validation methodologies used in previous studies and highlights the use of appropriate alternative validation methods that would ensure generalization. To enhance PD detection accuracy, we propose a two-stage diagnostic system that refines the extracted set of features through [Formula: see text] regularized linear support vector machine and classifies the refined subset of features through a deep neural network. To rigorously evaluate the effectiveness of the proposed diagnostic system, experiments are performed on two different voice recording-based benchmark datasets. For both datasets, the proposed diagnostic system achieves 100% accuracy under leave-one-subject-out (LOSO) cross-validation (CV) and 97.5% accuracy under k-fold CV. The results show that the proposed system outperforms the existing methods regarding PD detection accuracy. The results suggest that the proposed diagnostic system is essential to improving non-invasive diagnostic decision support in PD.
Collapse
Affiliation(s)
- Liaqat Ali
- Department of Electrical Engineering, University of Science and Technology Bannu, Bannu, Pakistan
| | - Ashir Javeed
- Aging Research Center, Karolinska Institutet, Solna, Sweden
| | - Adeeb Noor
- Department of Information Technology, Faculty of Computing and Information Technology, King Abdulaziz University, 80221, Jeddah, Saudi Arabia
| | | | - Seifedine Kadry
- Department of Applied Data Science, Noroff University College, Kristiansand, Norway
- Artificial Intelligence Research Center (AIRC), Ajman University, Ajman, 346, United Arab Emirates
- Department of Electrical and Computer Engineering, Lebanese American University, Byblos, Lebanon
| | - Amir H Gandomi
- Faculty of Engineering and Information Technology, University of Technology Sydney, Ultimo, NSW, 2007, Australia.
- University Research and Innovation Center (EKIK), Óbuda University, Budapest, 1034, Hungary.
| |
Collapse
|
9
|
Ibarra EJ, Arias-Londoño JD, Zañartu M, Godino-Llorente JI. Towards a Corpus (and Language)-Independent Screening of Parkinson's Disease from Voice and Speech through Domain Adaptation. Bioengineering (Basel) 2023; 10:1316. [PMID: 38002440 PMCID: PMC10669342 DOI: 10.3390/bioengineering10111316] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2023] [Revised: 11/03/2023] [Accepted: 11/10/2023] [Indexed: 11/26/2023] Open
Abstract
End-to-end deep learning models have shown promising results for the automatic screening of Parkinson's disease by voice and speech. However, these models often suffer degradation in their performance when applied to scenarios involving multiple corpora. In addition, they also show corpus-dependent clusterings. These facts indicate a lack of generalisation or the presence of certain shortcuts in the decision, and also suggest the need for developing new corpus-independent models. In this respect, this work explores the use of domain adversarial training as a viable strategy to develop models that retain their discriminative capacity to detect Parkinson's disease across diverse datasets. The paper presents three deep learning architectures and their domain adversarial counterparts. The models were evaluated with sustained vowels and diadochokinetic recordings extracted from four corpora with different demographics, dialects or languages, and recording conditions. The results showed that the space distribution of the embedding features extracted by the domain adversarial networks exhibits a higher intra-class cohesion. This behaviour is supported by a decrease in the variability and inter-domain divergence computed within each class. The findings suggest that domain adversarial networks are able to learn the common characteristics present in Parkinsonian voice and speech, which are supposed to be corpus, and consequently, language independent. Overall, this effort provides evidence that domain adaptation techniques refine the existing end-to-end deep learning approaches for Parkinson's disease detection from voice and speech, achieving more generalizable models.
Collapse
Affiliation(s)
- Emiro J. Ibarra
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Avenida España 1680, Casilla 110-V, Valparaíso 2390123, Chile; (E.J.I.); (M.Z.)
| | - Julián D. Arias-Londoño
- Escuela Técnica Superior de Ingeneiros de Telecomunicación, Universidad Politécnica de Madrid, Avda, Ciudad Universitaria, 30, 28040 Madrid, Spain;
| | - Matías Zañartu
- Department of Electronic Engineering, Universidad Técnica Federico Santa María, Avenida España 1680, Casilla 110-V, Valparaíso 2390123, Chile; (E.J.I.); (M.Z.)
| | - Juan I. Godino-Llorente
- Escuela Técnica Superior de Ingeneiros de Telecomunicación, Universidad Politécnica de Madrid, Avda, Ciudad Universitaria, 30, 28040 Madrid, Spain;
| |
Collapse
|
10
|
Warule P, Mishra SP, Deb S. Time-frequency analysis of speech signal using Chirplet transform for automatic diagnosis of Parkinson's disease. Biomed Eng Lett 2023; 13:613-623. [PMID: 37872998 PMCID: PMC10590362 DOI: 10.1007/s13534-023-00283-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2023] [Revised: 04/22/2023] [Accepted: 04/25/2023] [Indexed: 10/25/2023] Open
Abstract
Parkinson's disease (PD) is the second most prevalent neurodegenerative disorder in the world after Alzheimer's disease. Early diagnosing PD is challenging as it evolved slowly, and its symptoms eventuate gradually. Recent studies have demonstrated that changes in speech may be utilized as an excellent biomarker for the early diagnosis of PD. In this study, we have proposed a Chirplet transform (CT) based novel approach for diagnosing PD using speech signals. We employed CT to get the time-frequency matrix (TFM) of each speech recording, and we extracted time-frequency based entropy (TFE) features from the TFM. The statistical analysis demonstrates that the TFE features reflect the changes in speech that occurs in the speech due to PD, hence can be used for classifying the PD and healthy control (HC) individuals. The effectiveness of the proposed framework is validated using the vowels and words from the PC-GITA database. The genetic algorithm is utilized to select the optimum features subset, while a support vector machine (SVM), decision tree (DT), K-Nearest Neighbor (KNN), and Naïve Bayes (NB) classifiers are employed for classification. The TFE features outperform the breathiness and Mel frequency cepstral coefficients (MFCC) features. The SVM classifier is most effective compared to other machine-learning classifiers. The highest classification accuracy rates of 98% and 99% are achieved using the vowel /a/ and word /atleta/, respectively. The results reveal that the proposed CT-based entropy features effectively diagnose PD using the speech of a person.
Collapse
Affiliation(s)
- Pankaj Warule
- Department of Electronics Engineering, Sardar Vallabhbhai National Institute of Technology, Surat, India
| | - Siba Prasad Mishra
- Department of Electronics Engineering, Sardar Vallabhbhai National Institute of Technology, Surat, India
| | - Suman Deb
- Department of Electronics Engineering, Sardar Vallabhbhai National Institute of Technology, Surat, India
| |
Collapse
|
11
|
Favaro A, Tsai YT, Butala A, Thebaud T, Villalba J, Dehak N, Moro-Velázquez L. Interpretable speech features vs. DNN embeddings: What to use in the automatic assessment of Parkinson's disease in multi-lingual scenarios. Comput Biol Med 2023; 166:107559. [PMID: 37852107 DOI: 10.1016/j.compbiomed.2023.107559] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2023] [Revised: 10/07/2023] [Accepted: 10/09/2023] [Indexed: 10/20/2023]
Abstract
Speech-based approaches for assessing Parkinson's Disease (PD) often rely on feature extraction for automatic classification or detection. While many studies prioritize accuracy by using non-interpretable embeddings from Deep Neural Networks, this work aims to explore the predictive capabilities and language robustness of both feature types in a systematic fashion. As interpretable features, prosodic, linguistic, and cognitive descriptors were adopted, while x-vectors, Wav2Vec 2.0, HuBERT, and TRILLsson representations were used as non-interpretable features. Mono-lingual, multi-lingual, and cross-lingual machine learning experiments were conducted leveraging six data sets comprising speech recordings from various languages: American English, Castilian Spanish, Colombian Spanish, Italian, German, and Czech. For interpretable feature-based models, the mean of the best F1-scores obtained from each language was 81% in mono-lingual, 81% in multi-lingual, and 71% in cross-lingual experiments. For non-interpretable feature-based models, instead, they were 85% in mono-lingual, 88% in multi-lingual, and 79% in cross-lingual experiments. Firstly, models based on non-interpretable features outperformed interpretable ones, especially in cross-lingual experiments. Specifically, TRILLsson provided the most stable and accurate results across tasks and data sets. Conversely, the two types of features adopted showed some level of language robustness in multi-lingual and cross-lingual experiments. Overall, these results suggest that interpretable feature-based models can be used by clinicians to evaluate the deterioration of the speech of patients with PD, while non-interpretable feature-based models can be leveraged to achieve higher detection accuracy.
Collapse
Affiliation(s)
- Anna Favaro
- Department of Electrical and Computer Engineering, The Johns Hopkins University, Baltimore, 21218, MD, United States of America.
| | - Yi-Ting Tsai
- Department of Electrical and Computer Engineering, The Johns Hopkins University, Baltimore, 21218, MD, United States of America
| | - Ankur Butala
- Department of Neurology, The Johns Hopkins University, Baltimore, 21218, MD, United States of America; Department of Psychiatry and Behavioral Sciences, The Johns Hopkins University, Baltimore, 21218, MD, United States of America
| | - Thomas Thebaud
- Department of Electrical and Computer Engineering, The Johns Hopkins University, Baltimore, 21218, MD, United States of America
| | - Jesús Villalba
- Department of Electrical and Computer Engineering, The Johns Hopkins University, Baltimore, 21218, MD, United States of America
| | - Najim Dehak
- Department of Electrical and Computer Engineering, The Johns Hopkins University, Baltimore, 21218, MD, United States of America
| | - Laureano Moro-Velázquez
- Department of Electrical and Computer Engineering, The Johns Hopkins University, Baltimore, 21218, MD, United States of America
| |
Collapse
|
12
|
Dhanalakshmi S, Maanasaa RS, Maalikaa RS, Senthil R. A review of emergent intelligent systems for the detection of Parkinson's disease. Biomed Eng Lett 2023; 13:591-612. [PMID: 37872986 PMCID: PMC10590348 DOI: 10.1007/s13534-023-00319-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2023] [Revised: 08/11/2023] [Accepted: 09/07/2023] [Indexed: 10/25/2023] Open
Abstract
Parkinson's disease (PD) is a neurodegenerative disorder affecting people worldwide. The PD symptoms are divided into motor and non-motor symptoms. Detection of PD is very crucial and essential. Such challenges can be overcome by applying artificial intelligence to diagnose PD. Many studies have also proposed the implementation of computer-aided diagnosis for the detection of PD. This systematic review comprehensively analyzed all appropriate algorithms for detecting and assessing PD based on the literature from 2012 to 2023 which are conducted as per PRISMA model. This review focused on motor symptoms, namely handwriting dynamics, voice impairments and gait, multimodal features, and brain observation using single photon emission computed tomography, magnetic resonance and electroencephalogram signals. The significant challenges are critically analyzed, and appropriate recommendations are provided. The critical discussion of this review article can be helpful in today's PD community in such a way that it allows clinicians to provide proper treatment and timely medication.
Collapse
Affiliation(s)
- Samiappan Dhanalakshmi
- Department of Electronics and Communication Engineering, SRM Institute of Science and Technology, Kattankulathur, Chennai, 603203 India
| | - Ramesh Sai Maanasaa
- Department of Electronics and Communication Engineering, SRM Institute of Science and Technology, Kattankulathur, Chennai, 603203 India
| | - Ramesh Sai Maalikaa
- Department of Electronics and Communication Engineering, SRM Institute of Science and Technology, Kattankulathur, Chennai, 603203 India
| | - Ramalingam Senthil
- Department of Mechanical Engineering, SRM Institute of Science and Technology, Kattankulathur, Chennai, 603203 India
| |
Collapse
|
13
|
Zhang S, Yang J, Zhang Y, Zhong J, Hu W, Li C, Jiang J. The Combination of a Graph Neural Network Technique and Brain Imaging to Diagnose Neurological Disorders: A Review and Outlook. Brain Sci 2023; 13:1462. [PMID: 37891830 PMCID: PMC10605282 DOI: 10.3390/brainsci13101462] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2023] [Revised: 10/06/2023] [Accepted: 10/12/2023] [Indexed: 10/29/2023] Open
Abstract
Neurological disorders (NDs), such as Alzheimer's disease, have been a threat to human health all over the world. It is of great importance to diagnose ND through combining artificial intelligence technology and brain imaging. A graph neural network (GNN) can model and analyze the brain, imaging from morphology, anatomical structure, function features, and other aspects, thus becoming one of the best deep learning models in the diagnosis of ND. Some researchers have investigated the application of GNN in the medical field, but the scope is broad, and its application to NDs is less frequent and not detailed enough. This review focuses on the research progress of GNNs in the diagnosis of ND. Firstly, we systematically investigated the GNN framework of ND, including graph construction, graph convolution, graph pooling, and graph prediction. Secondly, we investigated common NDs using the GNN diagnostic model in terms of data modality, number of subjects, and diagnostic accuracy. Thirdly, we discussed some research challenges and future research directions. The results of this review may be a valuable contribution to the ongoing intersection of artificial intelligence technology and brain imaging.
Collapse
Affiliation(s)
- Shuoyan Zhang
- School of Communication and Information Engineering, Shanghai University, Shanghai 200444, China
| | - Jiacheng Yang
- School of Life Sciences, Shanghai University, Shanghai 200444, China
| | - Ying Zhang
- School of Communication and Information Engineering, Shanghai University, Shanghai 200444, China
| | - Jiayi Zhong
- School of Life Sciences, Shanghai University, Shanghai 200444, China
| | - Wenjing Hu
- School of Life Sciences, Shanghai University, Shanghai 200444, China
| | - Chenyang Li
- School of Life Sciences, Shanghai University, Shanghai 200444, China
| | - Jiehui Jiang
- Shanghai Institute of Biomedical Engineering, Shanghai University, Shanghai 200444, China
| |
Collapse
|
14
|
Nijhawan R, Kumar M, Arya S, Mendirtta N, Kumar S, Towfek SK, Khafaga DS, Alkahtani HK, Abdelhamid AA. A Novel Artificial-Intelligence-Based Approach for Classification of Parkinson's Disease Using Complex and Large Vocal Features. Biomimetics (Basel) 2023; 8:351. [PMID: 37622956 PMCID: PMC10452203 DOI: 10.3390/biomimetics8040351] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2023] [Revised: 07/28/2023] [Accepted: 07/31/2023] [Indexed: 08/26/2023] Open
Abstract
Parkinson's disease (PD) affects a large proportion of elderly people. Symptoms include tremors, slow movement, rigid muscles, and trouble speaking. With the aging of the developed world's population, this number is expected to rise. The early detection of PD and avoiding its severe consequences require a precise and efficient system. Our goal is to create an accurate AI model that can identify PD using human voices. We developed a transformer-based method for detecting PD by retrieving dysphonia measures from a subject's voice recording. It is uncommon to use a neural network (NN)-based solution for tabular vocal characteristics, but it has several advantages over a tree-based approach, including compatibility with continuous learning and the network's potential to be linked with an image/voice encoder for a more accurate multi modal solution, shifting SOTA approach from tree-based to a neural network (NN) is crucial for advancing research in multimodal solutions. Our method outperforms the state of the art (SOTA), namely Gradient-Boosted Decision Trees (GBDTs), by at least 1% AUC, and the precision and recall scores are also improved. We additionally offered an XgBoost-based feature-selection method and a fully connected NN layer technique for including continuous dysphonia measures, in addition to the solution network. We also discussed numerous important discoveries relating to our suggested solution and deep learning (DL) and its application to dysphonia measures, such as how a transformer-based network is more resilient to increased depth compared to a simple MLP network. The performance of the proposed approach and conventional machine learning techniques such as MLP, SVM, and Random Forest (RF) have also been compared. A detailed performance comparison matrix has been added to this article, along with the proposed solution's space and time complexity.
Collapse
Affiliation(s)
- Rahul Nijhawan
- Computer Science and Engineering, Thapar Institute of Engineering and Technology, Patiala 147004, India
| | - Mukul Kumar
- Blackstraw Technologies Pvt Ltd., Chennai 160015, India
| | | | - Neha Mendirtta
- Computer Science and Engineering, Chandigarh University, Ajitgarh 140413, India
| | - Sunil Kumar
- Department of Computer Science and Artificial Intelligence, SR University, Warangal 506371, India
- Department of Computer Science, Graphic Era Hill University, Dehradun 248001, India
| | - S. K. Towfek
- Computer Science and Intelligent Systems Research Center, Blacksburg, VA 24060, USA
- Department of Communications and Electronics, Delta Higher Institute of Engineering and Technology, Mansoura 35111, Egypt
| | - Doaa Sami Khafaga
- Computer Sciences, College of Computer and Information Sciences, Princess Nourah bint Abdulrahman University, P.O. Box 84428, Riyadh 11671, Saudi Arabia;
| | - Hend K. Alkahtani
- Department of Information Systems, College of Computer and Information Sciences, Princess Nourah bint Abdulrahman University, P.O. Box 84428, Riyadh 11671, Saudi Arabia
| | - Abdelaziz A. Abdelhamid
- Department of Computer Science, College of Computing and Information Technology, Shaqra University, Shaqra 11961, Saudi Arabia;
- Department of Computer Science, Faculty of Computer and Information Sciences, Ain Shams University, Cairo 11566, Egypt
| |
Collapse
|
15
|
Idrisoglu A, Dallora AL, Anderberg P, Berglund JS. Applied Machine Learning Techniques to Diagnose Voice-Affecting Conditions and Disorders: Systematic Literature Review. J Med Internet Res 2023; 25:e46105. [PMID: 37467031 PMCID: PMC10398366 DOI: 10.2196/46105] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2023] [Revised: 04/26/2023] [Accepted: 05/23/2023] [Indexed: 07/20/2023] Open
Abstract
BACKGROUND Normal voice production depends on the synchronized cooperation of multiple physiological systems, which makes the voice sensitive to changes. Any systematic, neurological, and aerodigestive distortion is prone to affect voice production through reduced cognitive, pulmonary, and muscular functionality. This sensitivity inspired using voice as a biomarker to examine disorders that affect the voice. Technological improvements and emerging machine learning (ML) technologies have enabled possibilities of extracting digital vocal features from the voice for automated diagnosis and monitoring systems. OBJECTIVE This study aims to summarize a comprehensive view of research on voice-affecting disorders that uses ML techniques for diagnosis and monitoring through voice samples where systematic conditions, nonlaryngeal aerodigestive disorders, and neurological disorders are specifically of interest. METHODS This systematic literature review (SLR) investigated the state of the art of voice-based diagnostic and monitoring systems with ML technologies, targeting voice-affecting disorders without direct relation to the voice box from the point of view of applied health technology. Through a comprehensive search string, studies published from 2012 to 2022 from the databases Scopus, PubMed, and Web of Science were scanned and collected for assessment. To minimize bias, retrieval of the relevant references in other studies in the field was ensured, and 2 authors assessed the collected studies. Low-quality studies were removed through a quality assessment and relevant data were extracted through summary tables for analysis. The articles were checked for similarities between author groups to prevent cumulative redundancy bias during the screening process, where only 1 article was included from the same author group. RESULTS In the analysis of the 145 included studies, support vector machines were the most utilized ML technique (51/145, 35.2%), with the most studied disease being Parkinson disease (PD; reported in 87/145, 60%, studies). After 2017, 16 additional voice-affecting disorders were examined, in contrast to the 3 investigated previously. Furthermore, an upsurge in the use of artificial neural network-based architectures was observed after 2017. Almost half of the included studies were published in last 2 years (2021 and 2022). A broad interest from many countries was observed. Notably, nearly one-half (n=75) of the studies relied on 10 distinct data sets, and 11/145 (7.6%) used demographic data as an input for ML models. CONCLUSIONS This SLR revealed considerable interest across multiple countries in using ML techniques for diagnosing and monitoring voice-affecting disorders, with PD being the most studied disorder. However, the review identified several gaps, including limited and unbalanced data set usage in studies, and a focus on diagnostic test rather than disorder-specific monitoring. Despite the limitations of being constrained by only peer-reviewed publications written in English, the SLR provides valuable insights into the current state of research on ML-based voice-affecting disorder diagnosis and monitoring and highlighting areas to address in future research.
Collapse
Affiliation(s)
- Alper Idrisoglu
- Department of Health, Blekinge Institute of Technology, Karslkrona, Sweden
| | - Ana Luiza Dallora
- Department of Health, Blekinge Institute of Technology, Karslkrona, Sweden
| | - Peter Anderberg
- Department of Health, Blekinge Institute of Technology, Karslkrona, Sweden
- School of Health Sciences, University of Skövde, Skövde, Sweden
| | | |
Collapse
|
16
|
Wu P, Cao B, Liang Z, Wu M. The advantages of artificial intelligence-based gait assessment in detecting, predicting, and managing Parkinson's disease. Front Aging Neurosci 2023; 15:1191378. [PMID: 37502426 PMCID: PMC10368956 DOI: 10.3389/fnagi.2023.1191378] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2023] [Accepted: 06/05/2023] [Indexed: 07/29/2023] Open
Abstract
Background Parkinson's disease is a neurological disorder that can cause gait disturbance, leading to mobility issues and falls. Early diagnosis and prediction of freeze episodes are essential for mitigating symptoms and monitoring the disease. Objective This review aims to evaluate the use of artificial intelligence (AI)-based gait evaluation in diagnosing and managing Parkinson's disease, and to explore the potential benefits of this technology for clinical decision-making and treatment support. Methods A thorough review of published literature was conducted to identify studies, articles, and research related to AI-based gait evaluation in Parkinson's disease. Results AI-based gait evaluation has shown promise in preventing freeze episodes, improving diagnosis, and increasing motor independence in patients with Parkinson's disease. Its advantages include higher diagnostic accuracy, continuous monitoring, and personalized therapeutic interventions. Conclusion AI-based gait evaluation systems hold great promise for managing Parkinson's disease and improving patient outcomes. They offer the potential to transform clinical decision-making and inform personalized therapies, but further research is needed to determine their effectiveness and refine their use.
Collapse
Affiliation(s)
- Peng Wu
- College of Acupuncture and Orthopedics, Hubei University of Chinese Medicine, Wuhan, Hubei, China
| | - Biwei Cao
- Hubei Provincial Hospital of Traditional Chinese Medicine, Wuhan, China
- Affiliated Hospital of Hubei University of Chinese Medicine, Wuhan, Hubei, China
- Hubei Academy of Traditional Chinese Medicine, Wuhan, Hubei, China
| | - Zhendong Liang
- College of Acupuncture and Orthopedics, Hubei University of Chinese Medicine, Wuhan, Hubei, China
| | - Miao Wu
- Hubei Provincial Hospital of Traditional Chinese Medicine, Wuhan, China
- Affiliated Hospital of Hubei University of Chinese Medicine, Wuhan, Hubei, China
- Hubei Academy of Traditional Chinese Medicine, Wuhan, Hubei, China
| |
Collapse
|
17
|
Motin MA, Pah ND, Kumar DK. Monitoring the Effect of Levodopa Using Sustained Phonemes in Parkinson's Disease Patients. ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. ANNUAL INTERNATIONAL CONFERENCE 2023; 2023:1-4. [PMID: 38083746 DOI: 10.1109/embc40787.2023.10340507] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/18/2023]
Abstract
Parkinson's disease (PD) is a neurological disease identified by multiple symptoms, and levodopa is one of the most effective medications for treating the disease. To determine the dosage of levodopa, it is necessary to meet on a regular basis and observe motor function. The early detection and progression of the disease have been proposed using hypokinetic dysarthria. However, previous studies have not examined the effects of levodopa on speech rigorously and have provided inconsistent results. In this study, three sustained phonemes of PD patients were investigated for the effect of medication. A set of features characterizing vocal fold dynamics as well as the vocal tract coordinators were extracted from the sustained phonemes /of 28 PD patients during levodopa medication off and on states. All the features were statistically investigated and classified using a linear discriminant analysis (LDA) classifier. LDA classifier identified medication on from medication off based on the combined features from phoneme /a/, /o/ and /m/ with the accuracy=82.75% and F1-score=82.18%. Voice recording of PD patients during sustained phonemes /a/, /o/ and /m/ has the potential for identifying whether the patients are in On state or Off state of medication.Clinical Relevance- The outcomes of this study have the potential to monitor the effect and progress of levodopa on PD patients.
Collapse
|
18
|
Pah ND, Motin MA, Oliveira GC, Kumar DK. The Change of Vocal Tract Length in People with Parkinson's Disease. ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. ANNUAL INTERNATIONAL CONFERENCE 2023; 2023:1-4. [PMID: 38082914 DOI: 10.1109/embc40787.2023.10340263] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/18/2023]
Abstract
Hypokinetic dysarthria is one of the early symptoms of Parkinson's disease (PD) and has been proposed for early detection and also for monitoring of the progression of the disease. PD reduces the control of vocal tract muscles such as the tongue and lips and, therefore the length of the active vocal tract is altered. However, the change in the vocal tract length due to the disease has not been investigated. The aim of this study was to determine the difference in the apparent vocal tract length (AVTL) between people with PD and age-matched control healthy people. The phoneme, /a/ from the UCI Parkinson's Disease Classification Dataset and the Italian Parkinson's Voice and Speech Dataset were used and AVTL was calculated based on the first four formants of the sustained phoneme (F1-F4). The results show a correlation between Parkinson's disease and an increase in vocal tract length. The most sensitive feature was the AVTL calculated using the first formants of sustained phonemes (F1). The other significant finding reported in this article is that the difference is significant and only appeared in the male participants. However, the size of the database is not sufficiently large to identify the possible confounding factors such as the severity and duration of the disease, medication, age, and comorbidity factors.Clinical relevance-The outcomes of this research have the potential to improve the identification of early Parkinsonian dysarthria and monitor PD progression.
Collapse
|
19
|
Doumari SA, Berahmand K, Ebadi MJ. Early and High-Accuracy Diagnosis of Parkinson's Disease: Outcomes of a New Model. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2023; 2023:1493676. [PMID: 37304324 PMCID: PMC10256450 DOI: 10.1155/2023/1493676] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/03/2022] [Revised: 03/02/2023] [Accepted: 03/06/2023] [Indexed: 06/13/2023]
Abstract
Parkinson's disease (PD) is one of the significant common neurological disorders of the current age that causes uncontrollable movements like shaking, stiffness, and difficulty. The early clinical diagnosis of this disease is essential for preventing the progression of PD. Hence, an innovative method is proposed here based on combining the crow search algorithm and decision tree (CSADT) for the early PD diagnosis. This approach is used on four crucial Parkinson's datasets, including meander, spiral, voice, and speech-Sakar. Using the presented method, PD is effectively diagnosed by evaluating each dataset's critical features and extracting the primary practical outcomes. The used algorithm was compared with other machine learning algorithms of k-nearest neighbor (KNN), support vector machine (SVM), naive Baye (NB), multilayer perceptron (MLP), decision tree (DT), random tree, logistic regression, support vector machine of radial base functions (SVM of RBFs), and combined classifier in terms of accuracy, recall, and combination measure F1. The analytical results emphasize the used algorithm's superiority over the other selected ones. The proposed model yields nearly 100% accuracy through various trials on the datasets. Notably, a high detection speed achieved the lowest detection time of 2.6 seconds. The main novelty of this paper is attributed to the accuracy of the presented PD diagnosis method, which is much higher than its counterparts.
Collapse
Affiliation(s)
- Sajjad Amiri Doumari
- Department of Mathematics and Computer Science, Sirjan University of Technology, Sirjan, Iran
| | - Kamal Berahmand
- Department of Information Technology and Communications, Azarbaijan Shahid Madani University, Tabriz, Iran
| | - M. J. Ebadi
- Department of Mathematics, Chabahar Maritime University, Chabahar, Iran
| |
Collapse
|
20
|
Alalayah KM, Senan EM, Atlam HF, Ahmed IA, Shatnawi HSA. Automatic and Early Detection of Parkinson's Disease by Analyzing Acoustic Signals Using Classification Algorithms Based on Recursive Feature Elimination Method. Diagnostics (Basel) 2023; 13:diagnostics13111924. [PMID: 37296776 DOI: 10.3390/diagnostics13111924] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2023] [Revised: 05/23/2023] [Accepted: 05/27/2023] [Indexed: 06/12/2023] Open
Abstract
Parkinson's disease (PD) is a neurodegenerative condition generated by the dysfunction of brain cells and their 60-80% inability to produce dopamine, an organic chemical responsible for controlling a person's movement. This condition causes PD symptoms to appear. Diagnosis involves many physical and psychological tests and specialist examinations of the patient's nervous system, which causes several issues. The methodology method of early diagnosis of PD is based on analysing voice disorders. This method extracts a set of features from a recording of the person's voice. Then machine-learning (ML) methods are used to analyse and diagnose the recorded voice to distinguish Parkinson's cases from healthy ones. This paper proposes novel techniques to optimize the techniques for early diagnosis of PD by evaluating selected features and hyperparameter tuning of ML algorithms for diagnosing PD based on voice disorders. The dataset was balanced by the synthetic minority oversampling technique (SMOTE) and features were arranged according to their contribution to the target characteristic by the recursive feature elimination (RFE) algorithm. We applied two algorithms, t-distributed stochastic neighbour embedding (t-SNE) and principal component analysis (PCA), to reduce the dimensions of the dataset. Both t-SNE and PCA finally fed the resulting features into the classifiers support-vector machine (SVM), K-nearest neighbours (KNN), decision tree (DT), random forest (RF), and multilayer perception (MLP). Experimental results proved that the proposed techniques were superior to existing studies in which RF with the t-SNE algorithm yielded an accuracy of 97%, precision of 96.50%, recall of 94%, and F1-score of 95%. In addition, MLP with the PCA algorithm yielded an accuracy of 98%, precision of 97.66%, recall of 96%, and F1-score of 96.66%.
Collapse
Affiliation(s)
- Khaled M Alalayah
- Department of Computer Science, Faculty of Science and Arts, Najran University, Sharurah 68341, Saudi Arabia
| | - Ebrahim Mohammed Senan
- Department of Artificial Intelligence, Faculty of Computer Science and Information Technology, Alrazi University, Sana'a, Yemen
| | - Hany F Atlam
- Cyber Security Centre, WMG, University of Warwick, Coventry CV4 7AL, UK
| | | | | |
Collapse
|
21
|
Campi M, Peters GW, Toczydlowska D. Ataxic speech disorders and Parkinson's disease diagnostics via stochastic embedding of empirical mode decomposition. PLoS One 2023; 18:e0284667. [PMID: 37099544 PMCID: PMC10132693 DOI: 10.1371/journal.pone.0284667] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2023] [Accepted: 04/05/2023] [Indexed: 04/27/2023] Open
Abstract
Medical diagnostic methods that utilise modalities of patient symptoms such as speech are increasingly being used for initial diagnostic purposes and monitoring disease state progression. Speech disorders are particularly prevalent in neurological degenerative diseases such as Parkinson's disease, the focus of the study undertaken in this work. We will demonstrate state-of-the-art statistical time-series methods that combine elements of statistical time series modelling and signal processing with modern machine learning methods based on Gaussian process models to develop methods to accurately detect a core symptom of speech disorder in individuals who have Parkinson's disease. We will show that the proposed methods out-perform standard best practices of speech diagnostics in detecting ataxic speech disorders, and we will focus the study, particularly on a detailed analysis of a well regarded Parkinson's data speech study publicly available making all our results reproducible. The methodology developed is based on a specialised technique not widely adopted in medical statistics that found great success in other domains such as signal processing, seismology, speech analysis and ecology. In this work, we will present this method from a statistical perspective and generalise it to a stochastic model, which will be used to design a test for speech disorders when applied to speech time series signals. As such, this work is making contributions both of a practical and statistical methodological nature.
Collapse
Affiliation(s)
- Marta Campi
- CERIAH, Institut de L’Audition, Institut Pasteur, Paris, France
| | - Gareth W. Peters
- Department of Statistics & Applied Probability, University of California, Santa Barbara (UCSB), Santa Barbara, California, United States of America
| | - Dorota Toczydlowska
- School of Mathematics and Physical Science, University of Technology Sydney, Sydney, Australia
| |
Collapse
|
22
|
Khaskhoussy R, Ayed YB. Improving Parkinson’s disease recognition through voice analysis using deep learning. Pattern Recognit Lett 2023. [DOI: 10.1016/j.patrec.2023.03.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/12/2023]
|
23
|
Ge W, Lueck C, Suominen H, Apthorp D. Has machine learning over-promised in healthcare? A critical analysis and a proposal for improved evaluation, with evidence from Parkinson’s disease. Artif Intell Med 2023; 139:102524. [PMID: 37100503 DOI: 10.1016/j.artmed.2023.102524] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2022] [Revised: 02/22/2023] [Accepted: 02/28/2023] [Indexed: 03/17/2023]
Abstract
Adoption of artificial intelligence (AI) by the medical community has long been anticipated, endorsed by a stream of machine learning literature showcasing AI systems that yield extraordinary performance. However, many of these systems are likely over-promising and will under-deliver in practice. One key reason is the community's failure to acknowledge and address the presence of inflationary effects in the data. These simultaneously inflate evaluation performance and prevent a model from learning the underlying task, thus severely misrepresenting how that model would perform in the real world. This paper investigated the impact of these inflationary effects on healthcare tasks, as well as how these effects can be addressed. Specifically, we defined three inflationary effects that occur in medical data sets and allow models to easily reach small training losses and prevent skillful learning. We investigated two data sets of sustained vowel phonation from participants with and without Parkinson's disease, and revealed that published models which have achieved high classification performances on these were artificially enhanced due to the inflationary effects. Our experiments showed that removing each inflationary effect corresponded with a decrease in classification accuracy, and that removing all inflationary effects reduced the evaluated performance by up to 30%. Additionally, the performance on a more realistic test set increased, suggesting that the removal of these inflationary effects enabled the model to better learn the underlying task and generalize. Source code is available at https://github.com/Wenbo-G/pd-phonation-analysis under the MIT license.
Collapse
|
24
|
Zhang J, Wu J, Qiu Y, Song A, Li W, Li X, Liu Y. Intelligent speech technologies for transcription, disease diagnosis, and medical equipment interactive control in smart hospitals: A review. Comput Biol Med 2023; 153:106517. [PMID: 36623438 PMCID: PMC9814440 DOI: 10.1016/j.compbiomed.2022.106517] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2022] [Revised: 12/23/2022] [Accepted: 12/31/2022] [Indexed: 01/07/2023]
Abstract
The growing and aging of the world population have driven the shortage of medical resources in recent years, especially during the COVID-19 pandemic. Fortunately, the rapid development of robotics and artificial intelligence technologies help to adapt to the challenges in the healthcare field. Among them, intelligent speech technology (IST) has served doctors and patients to improve the efficiency of medical behavior and alleviate the medical burden. However, problems like noise interference in complex medical scenarios and pronunciation differences between patients and healthy people hamper the broad application of IST in hospitals. In recent years, technologies such as machine learning have developed rapidly in intelligent speech recognition, which is expected to solve these problems. This paper first introduces IST's procedure and system architecture and analyzes its application in medical scenarios. Secondly, we review existing IST applications in smart hospitals in detail, including electronic medical documentation, disease diagnosis and evaluation, and human-medical equipment interaction. In addition, we elaborate on an application case of IST in the early recognition, diagnosis, rehabilitation training, evaluation, and daily care of stroke patients. Finally, we discuss IST's limitations, challenges, and future directions in the medical field. Furthermore, we propose a novel medical voice analysis system architecture that employs active hardware, active software, and human-computer interaction to realize intelligent and evolvable speech recognition. This comprehensive review and the proposed architecture offer directions for future studies on IST and its applications in smart hospitals.
Collapse
Affiliation(s)
- Jun Zhang
- The State Key Laboratory of Bioelectronics, School of Instrument Science and Engineering, Southeast University, Nanjing, 210096, China,Corresponding author
| | - Jingyue Wu
- The State Key Laboratory of Bioelectronics, School of Instrument Science and Engineering, Southeast University, Nanjing, 210096, China
| | - Yiyi Qiu
- The State Key Laboratory of Bioelectronics, School of Instrument Science and Engineering, Southeast University, Nanjing, 210096, China
| | - Aiguo Song
- The State Key Laboratory of Bioelectronics, School of Instrument Science and Engineering, Southeast University, Nanjing, 210096, China
| | - Weifeng Li
- Department of Emergency Medicine, Guangdong Provincial People's Hospital, Guangdong Academy of Medical Sciences, Guangzhou, 510080, China
| | - Xin Li
- Department of Emergency Medicine, Guangdong Provincial People's Hospital, Guangdong Academy of Medical Sciences, Guangzhou, 510080, China
| | - Yecheng Liu
- Emergency Department, State Key Laboratory of Complex Severe and Rare Diseases, Peking Union Medical College Hospital, Chinese Academy of Medical Science and Peking Union Medical College, Beijing, 100730, China
| |
Collapse
|
25
|
Zhao H, Cao J, Xie J, Liao WH, Lei Y, Cao H, Qu Q, Bowen C. Wearable sensors and features for diagnosis of neurodegenerative diseases: A systematic review. Digit Health 2023; 9:20552076231173569. [PMID: 37214662 PMCID: PMC10192816 DOI: 10.1177/20552076231173569] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2023] [Accepted: 04/17/2023] [Indexed: 05/24/2023] Open
Abstract
Objective Neurodegenerative diseases affect millions of families around the world, while various wearable sensors and corresponding data analysis can be of great support for clinical diagnosis and health assessment. This systematic review aims to provide a comprehensive overview of the existing research that uses wearable sensors and features for the diagnosis of neurodegenerative diseases. Methods A systematic review was conducted of studies published between 2015 and 2022 in major scientific databases such as Web of Science, Google Scholar, PubMed, and Scopes. The obtained studies were analyzed and organized into the process of diagnosis: wearable sensors, feature extraction, and feature selection. Results The search led to 171 eligible studies included in this overview. Wearable sensors such as force sensors, inertial sensors, electromyography, electroencephalography, acoustic sensors, optical fiber sensors, and global positioning systems were employed to monitor and diagnose neurodegenerative diseases. Various features including physical features, statistical features, nonlinear features, and features from the network can be extracted from these wearable sensors, and the alteration of features toward neurodegenerative diseases was illustrated. Moreover, different kinds of feature selection methods such as filter, wrapper, and embedded methods help to find the distinctive indicator of the diseases and benefit to a better diagnosis performance. Conclusions This systematic review enables a comprehensive understanding of wearable sensors and features for the diagnosis of neurodegenerative diseases.
Collapse
Affiliation(s)
- Huan Zhao
- School of Mechanical Engineering, Xi’an Jiaotong University, Xi'an, P.R. China
| | - Junyi Cao
- School of Mechanical Engineering, Xi’an Jiaotong University, Xi'an, P.R. China
| | - Junxiao Xie
- School of Mechanical Engineering, Xi’an Jiaotong University, Xi'an, P.R. China
| | - Wei-Hsin Liao
- Department of Mechanical and Automation
Engineering, The Chinese University of Hong
Kong, Shatin, N.T., Hong Kong, China
| | - Yaguo Lei
- School of Mechanical Engineering, Xi’an Jiaotong University, Xi'an, P.R. China
| | - Hongmei Cao
- Department of Neurology, The First
Affiliated Hospital of Xi’an Jiaotong University, Xi’an, P.R. China
| | - Qiumin Qu
- Department of Neurology, The First
Affiliated Hospital of Xi’an Jiaotong University, Xi’an, P.R. China
| | - Chris Bowen
- Department of Mechanical Engineering, University of Bath, Bath, UK
| |
Collapse
|
26
|
Li Y, Liu C, Wang P, Zhang H, Wei A, Zhang Y. Envelope multi-type transformation ensemble algorithm of Parkinson speech samples. APPL INTELL 2022. [DOI: 10.1007/s10489-022-04345-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/02/2022]
|
27
|
Barukab O, Ahmad A, Khan T, Thayyil Kunhumuhammed MR. Analysis of Parkinson's Disease Using an Imbalanced-Speech Dataset by Employing Decision Tree Ensemble Methods. Diagnostics (Basel) 2022; 12:diagnostics12123000. [PMID: 36553007 PMCID: PMC9776735 DOI: 10.3390/diagnostics12123000] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2022] [Revised: 11/07/2022] [Accepted: 11/24/2022] [Indexed: 12/05/2022] Open
Abstract
Parkinson's disease (PD) currently affects approximately 10 million people worldwide. The detection of PD positive subjects is vital in terms of disease prognostics, diagnostics, management and treatment. Different types of early symptoms, such as speech impairment and changes in writing, are associated with Parkinson disease. To classify potential patients of PD, many researchers used machine learning algorithms in various datasets related to this disease. In our research, we study the dataset of the PD vocal impairment feature, which is an imbalanced dataset. We propose comparative performance evaluation using various decision tree ensemble methods, with or without oversampling techniques. In addition, we compare the performance of classifiers with different sizes of ensembles and various ratios of the minority class and the majority class with oversampling and undersampling. Finally, we combine feature selection with best-performing ensemble classifiers. The result shows that AdaBoost, random forest, and decision tree developed for the RUSBoost imbalanced dataset perform well in performance metrics such as precision, recall, F1-score, area under the receiver operating characteristic curve (AUROC) and the geometric mean. Further, feature selection methods, namely lasso and information gain, were used to screen the 10 best features using the best ensemble classifiers. AdaBoost with information gain feature selection method is the best performing ensemble method with an F1-score of 0.903.
Collapse
Affiliation(s)
- Omar Barukab
- Department of Information Technology, Faculty of Computing and Information Technology in Rabigh (FCITR), King Abdulaziz University, Jeddah 21589, Saudi Arabia
- Correspondence:
| | - Amir Ahmad
- College of Information Technology, United Arab Emirates University, Al Ain P.O. Box 15551, United Arab Emirates
| | - Tabrej Khan
- Department of Information Systems, Faculty of Computing and Information Technology in Rabigh (FCITR), King Abdulaziz University, Jeddah 21589, Saudi Arabia
| | - Mujeeb Rahiman Thayyil Kunhumuhammed
- Department of Computer Science, Faculty of Computing and Information Technology in Rabigh (FCITR), King Abdulaziz University, Jeddah 21589, Saudi Arabia
| |
Collapse
|
28
|
Rana A, Dumka A, Singh R, Panda MK, Priyadarshi N. A Computerized Analysis with Machine Learning Techniques for the Diagnosis of Parkinson's Disease: Past Studies and Future Perspectives. Diagnostics (Basel) 2022; 12:2708. [PMID: 36359550 PMCID: PMC9689408 DOI: 10.3390/diagnostics12112708] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2022] [Revised: 10/30/2022] [Accepted: 11/02/2022] [Indexed: 08/03/2023] Open
Abstract
According to the World Health Organization (WHO), Parkinson's disease (PD) is a neurodegenerative disease of the brain that causes motor symptoms including slower movement, rigidity, tremor, and imbalance in addition to other problems like Alzheimer's disease (AD), psychiatric problems, insomnia, anxiety, and sensory abnormalities. Techniques including artificial intelligence (AI), machine learning (ML), and deep learning (DL) have been established for the classification of PD and normal controls (NC) with similar therapeutic appearances in order to address these problems and improve the diagnostic procedure for PD. In this article, we examine a literature survey of research articles published up to September 2022 in order to present an in-depth analysis of the use of datasets, various modalities, experimental setups, and architectures that have been applied in the diagnosis of subjective disease. This analysis includes a total of 217 research publications with a list of the various datasets, methodologies, and features. These findings suggest that ML/DL methods and novel biomarkers hold promising results for application in medical decision-making, leading to a more methodical and thorough detection of PD. Finally, we highlight the challenges and provide appropriate recommendations on selecting approaches that might be used for subgrouping and connection analysis with structural magnetic resonance imaging (sMRI), DaTSCAN, and single-photon emission computerized tomography (SPECT) data for future Parkinson's research.
Collapse
Affiliation(s)
- Arti Rana
- Computer Science & Engineering, Veer Madho Singh Bhandari Uttarakhand Technical University, Dehradun 248007, Uttarakhand, India
| | - Ankur Dumka
- Department of Computer Science and Engineering, Women Institute of Technology, Dehradun 248007, Uttarakhand, India
- Department of Computer Science & Engineering, Graphic Era Deemed to be University, Dehradun 248001, Uttarakhand, India
| | - Rajesh Singh
- Division of Research and Innovation, Uttaranchal Institute of Technology, Uttaranchal University, Dehradun 248007, Uttarakhand, India
- Department of Project Management, Universidad Internacional Iberoamericana, Campeche 24560, Mexico
| | - Manoj Kumar Panda
- Department of Electrical Engineering, G.B. Pant Institute of Engineering and Technology, Pauri 246194, Uttarakhand, India
| | - Neeraj Priyadarshi
- Department of Electrical Engineering, JIS College of Engineering, Kolkata 741235, West Bengal, India
| |
Collapse
|
29
|
Macedo F, Valadas R, Carrasquinha E, Oliveira MR, Pacheco A. Feature selection using Decomposed Mutual Information Maximization. Neurocomputing 2022. [DOI: 10.1016/j.neucom.2022.09.101] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
30
|
Ngo QC, Motin MA, Pah ND, Drotár P, Kempster P, Kumar D. Computerized analysis of speech and voice for Parkinson's disease: A systematic review. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2022; 226:107133. [PMID: 36183641 DOI: 10.1016/j.cmpb.2022.107133] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/23/2022] [Revised: 09/13/2022] [Accepted: 09/13/2022] [Indexed: 06/16/2023]
Abstract
BACKGROUND AND OBJECTIVE Speech impairment is an early symptom of Parkinson's disease (PD). This study has summarized the literature related to speech and voice in detecting PD and assessing its severity. METHODS A systematic review of the literature from 2010 to 2021 to investigate analysis methods and signal features. The keywords "Automatic analysis" in conjunction with "PD speech" or "PD voice" were used, and the PubMed and ScienceDirect databases were searched. A total of 838 papers were found on the first run, of which 189 were selected. One hundred and forty-seven were found to be suitable for the review. The different datasets, recording protocols, signal analysis methods and features that were reported are listed. Values of the features that separate PD patients from healthy controls were tabulated. Finally, the barriers that limit the wide use of computerized speech analysis are discussed. RESULTS Speech and voice may be valuable markers for PD. However, large differences between the datasets make it difficult to compare different studies. In addition, speech analytic methods that are not informed by physiological understanding may alienate clinicians. CONCLUSIONS The potential usefulness of speech and voice for the detection and assessment of PD is confirmed by evidence from the classification and correlation results.
Collapse
Affiliation(s)
| | - Mohammod Abdul Motin
- Biosignals Lab, RMIT University, Melbourne, Australia; Department of Electrical & Electronic Engineering, Rajshahi University of Engineering & Technology, Rajshahi 6204, Bangladesh
| | - Nemuel Daniel Pah
- Biosignals Lab, RMIT University, Melbourne, Australia; Universitas Surabaya, Indonesia
| | - Peter Drotár
- Intelligent Information Systems Lab, Technical University of Kosice, Letna 9, 42001, Kosice, Slovakia
| | - Peter Kempster
- Neurosciences Department, Monash Health, Clayton, VIC, Australia; Department of Medicine, School of Clinical Sciences, Monash University, Clayton, VIC, Australia
| | - Dinesh Kumar
- Biosignals Lab, RMIT University, Melbourne, Australia.
| |
Collapse
|
31
|
Bárcenas R, Fuentes-García R, Naranjo L. Mixed kernel SVR addressing Parkinson's progression from voice features. PLoS One 2022; 17:e0275721. [PMID: 36206238 PMCID: PMC9543766 DOI: 10.1371/journal.pone.0275721] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2022] [Accepted: 09/21/2022] [Indexed: 11/07/2022] Open
Abstract
Parkinson’s Disease (PD) is a progressive neurodegenerative disease with multiple motor and non-motor characteristics. PD patients commonly face vocal impairments during the early stages of the disease. In this article, the aim is to explain the Unified Parkinson’s Disease Rating Scale (UPDRS) as a measure of the progression of Parkinson’s disease using a set of covariates obtained from voice signals. In particular, a Support Vector Regression (SVR) model based on a combination of kernel functions is introduced. Theoretically, this proposal, that relies on a mixed kernel (global and local) produces an admissible kernel function. The optimal fitting was obtained for the combination given by the product of radial and polynomial basis. Important results are the non-linear relationships inferred from the features to the response, as well as a considerable improvement in prediction performance metrics, when compared to other learning approaches. Furthermore, with knowledge on factors such as age and gender, it is possible to describe the dynamics of patients’ UPDRS from the data collected during their monitoring. In summary, these advances could expand learning processes and intelligent systems to assist in monitoring the evolution of Parkinson’s disease.
Collapse
Affiliation(s)
- Roberto Bárcenas
- Departamento de Matemáticas, Facultad de Ciencias, Universidad Nacional Autónoma de México, Ciudad Universitaria, Mexico City, Mexico
- * E-mail:
| | - Ruth Fuentes-García
- Departamento de Matemáticas, Facultad de Ciencias, Universidad Nacional Autónoma de México, Ciudad Universitaria, Mexico City, Mexico
| | - Lizbeth Naranjo
- Departamento de Matemáticas, Facultad de Ciencias, Universidad Nacional Autónoma de México, Ciudad Universitaria, Mexico City, Mexico
| |
Collapse
|
32
|
Kamikubo R, Wang L, Marte C, Mahmood A, Kacorri H. Data Representativeness in Accessibility Datasets: A Meta-Analysis. ASSETS. ANNUAL ACM CONFERENCE ON ASSISTIVE TECHNOLOGIES 2022; 2022:8. [PMID: 36939417 PMCID: PMC10024595 DOI: 10.1145/3517428.3544826] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/21/2023]
Abstract
As data-driven systems are increasingly deployed at scale, ethical concerns have arisen around unfair and discriminatory outcomes for historically marginalized groups that are underrepresented in training data. In response, work around AI fairness and inclusion has called for datasets that are representative of various demographic groups. In this paper, we contribute an analysis of the representativeness of age, gender, and race & ethnicity in accessibility datasets-datasets sourced from people with disabilities and older adults-that can potentially play an important role in mitigating bias for inclusive AI-infused applications. We examine the current state of representation within datasets sourced by people with disabilities by reviewing publicly-available information of 190 datasets, we call these accessibility datasets. We find that accessibility datasets represent diverse ages, but have gender and race representation gaps. Additionally, we investigate how the sensitive and complex nature of demographic variables makes classification difficult and inconsistent (e.g., gender, race & ethnicity), with the source of labeling often unknown. By reflecting on the current challenges and opportunities for representation of disabled data contributors, we hope our effort expands the space of possibility for greater inclusion of marginalized communities in AI-infused systems.
Collapse
Affiliation(s)
- Rie Kamikubo
- College of Information Studies, University of Maryland, College Park, United States
| | - Lining Wang
- Department of Computer Science, University of Maryland, College Park, United States
| | - Crystal Marte
- College of Information Studies, University of Maryland, College Park, United States
| | - Amnah Mahmood
- Department of Mathematics, University of Maryland, College Park, United States
| | - Hernisa Kacorri
- College of Information Studies, University of Maryland, College Park, United States
| |
Collapse
|
33
|
Herath HMDPM, Weraniyagoda WASA, Rajapaksha RTM, Wijesekara PADSN, Sudheera KLK, Chong PHJ. Automatic Assessment of Aphasic Speech Sensed by Audio Sensors for Classification into Aphasia Severity Levels to Recommend Speech Therapies. SENSORS (BASEL, SWITZERLAND) 2022; 22:6966. [PMID: 36146316 PMCID: PMC9501827 DOI: 10.3390/s22186966] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/11/2022] [Revised: 09/01/2022] [Accepted: 09/08/2022] [Indexed: 06/16/2023]
Abstract
Aphasia is a type of speech disorder that can cause speech defects in a person. Identifying the severity level of the aphasia patient is critical for the rehabilitation process. In this research, we identify ten aphasia severity levels motivated by specific speech therapies based on the presence or absence of identified characteristics in aphasic speech in order to give more specific treatment to the patient. In the aphasia severity level classification process, we experiment on different speech feature extraction techniques, lengths of input audio samples, and machine learning classifiers toward classification performance. Aphasic speech is required to be sensed by an audio sensor and then recorded and divided into audio frames and passed through an audio feature extractor before feeding into the machine learning classifier. According to the results, the mel frequency cepstral coefficient (MFCC) is the most suitable audio feature extraction method for the aphasic speech level classification process, as it outperformed the classification performance of all mel-spectrogram, chroma, and zero crossing rates by a large margin. Furthermore, the classification performance is higher when 20 s audio samples are used compared with 10 s chunks, even though the performance gap is narrow. Finally, the deep neural network approach resulted in the best classification performance, which was slightly better than both K-nearest neighbor (KNN) and random forest classifiers, and it was significantly better than decision tree algorithms. Therefore, the study shows that aphasia level classification can be completed with accuracy, precision, recall, and F1-score values of 0.99 using MFCC for 20 s audio samples using the deep neural network approach in order to recommend corresponding speech therapy for the identified level. A web application was developed for English-speaking aphasia patients to self-diagnose the severity level and engage in speech therapies.
Collapse
Affiliation(s)
| | | | | | | | | | - Peter Han Joo Chong
- Department of Electrical and Electronic Engineering, Auckland University of Technology, Auckland 1010, New Zealand
| |
Collapse
|
34
|
Fang L, Liang X. A Novel Method Based on Nonlinear Binary Grasshopper Whale Optimization Algorithm for Feature Selection. JOURNAL OF BIONIC ENGINEERING 2022; 20:237-252. [PMID: 36089930 PMCID: PMC9449924 DOI: 10.1007/s42235-022-00253-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/18/2022] [Revised: 07/15/2022] [Accepted: 07/20/2022] [Indexed: 06/02/2023]
Abstract
Feature Selection (FS) is considered as an important preprocessing step in data mining and is used to remove redundant or unrelated features from high-dimensional data. Most optimization algorithms for FS problems are not balanced in search. A hybrid algorithm called nonlinear binary grasshopper whale optimization algorithm (NL-BGWOA) is proposed to solve the problem in this paper. In the proposed method, a new position updating strategy combining the position changes of whales and grasshoppers population is expressed, which optimizes the diversity of searching in the target domain. Ten distinct high-dimensional UCI datasets, the multi-modal Parkinson's speech datasets, and the COVID-19 symptom dataset are used to validate the proposed method. It has been demonstrated that the proposed NL-BGWOA performs well across most of high-dimensional datasets, which shows a high accuracy rate of up to 0.9895. Furthermore, the experimental results on the medical datasets also demonstrate the advantages of the proposed method in actual FS problem, including accuracy, size of feature subsets, and fitness with best values of 0.913, 5.7, and 0.0873, respectively. The results reveal that the proposed NL-BGWOA has comprehensive superiority in solving the FS problem of high-dimensional data.
Collapse
Affiliation(s)
- Lingling Fang
- Department of Computing and Information Technology, Liaoning Normal University, Dalian, 116081 China
| | - Xiyue Liang
- Department of Computing and Information Technology, Liaoning Normal University, Dalian, 116081 China
| |
Collapse
|
35
|
Attribute Reduction Based on Lift and Random Sampling. Symmetry (Basel) 2022. [DOI: 10.3390/sym14091828] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
As one of the key topics in the development of neighborhood rough set, attribute reduction has attracted extensive attentions because of its practicability and interpretability for dimension reduction or feature selection. Although the random sampling strategy has been introduced in attribute reduction to avoid overfitting, uncontrollable sampling may still affect the efficiency of search reduct. By utilizing inherent characteristics of each label, Multi-label learning with Label specIfic FeaTures (Lift) algorithm can improve the performance of mathematical modeling. Therefore, here, it is attempted to use Lift algorithm to guide the sampling for reduce the uncontrollability of sampling. In this paper, an attribute reduction algorithm based on Lift and random sampling called ARLRS is proposed, which aims to improve the efficiency of searching reduct. Firstly, Lift algorithm is used to choose the samples from the dataset as the members of the first group, then the reduct of the first group is calculated. Secondly, random sampling strategy is used to divide the rest of samples into groups which have symmetry structure. Finally, the reducts are calculated group-by-group, which is guided by the maintenance of the reducts’ classification performance. Comparing with other 5 attribute reduction strategies based on rough set theory over 17 University of California Irvine (UCI) datasets, experimental results show that: (1) ARLRS algorithm can significantly reduce the time consumption of searching reduct; (2) the reduct derived from ARLRS algorithm can provide satisfying performance in classification tasks.
Collapse
|
36
|
Automatic Parkinson’s disease detection based on the combination of long-term acoustic features and Mel frequency cepstral coefficients (MFCC). Biomed Signal Process Control 2022. [DOI: 10.1016/j.bspc.2022.104013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
37
|
Yousif NR, Balaha HM, Haikal AY, El-Gendy EM. A generic optimization and learning framework for Parkinson disease via speech and handwritten records. JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING 2022; 14:1-21. [PMID: 36042792 PMCID: PMC9411848 DOI: 10.1007/s12652-022-04342-6] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/15/2021] [Accepted: 07/11/2022] [Indexed: 06/15/2023]
Abstract
Parkinson's disease (PD) is a neurodegenerative disorder with slow progression whose symptoms can be identified at late stages. Early diagnosis and treatment of PD can help to relieve the symptoms and delay progression. However, this is very challenging due to the similarities between the symptoms of PD and other diseases. The current study proposes a generic framework for the diagnosis of PD using handwritten images and (or) speech signals. For the handwriting images, 8 pre-trained convolutional neural networks (CNN) via transfer learning tuned by Aquila Optimizer were trained on the NewHandPD dataset to diagnose PD. For the speech signals, features from the MDVR-KCL dataset are extracted numerically using 16 feature extraction algorithms and fed to 4 different machine learning algorithms tuned by Grid Search algorithm, and graphically using 5 different techniques and fed to the 8 pretrained CNN structures. The authors propose a new technique in extracting the features from the voice dataset based on the segmentation of variable speech-signal-segment-durations, i.e., the use of different durations in the segmentation phase. Using the proposed technique, 5 datasets with 281 numerical features are generated. Results from different experiments are collected and recorded. For the NewHandPD dataset, the best-reported metric is 99.75% using the VGG19 structure. For the MDVR-KCL dataset, the best-reported metrics are 99.94% using the KNN and SVM ML algorithms and the combined numerical features; and 100% using the combined the mel-specgram graphical features and VGG19 structure. These results are better than other state-of-the-art researches.
Collapse
Affiliation(s)
- Nada R. Yousif
- Computer and Control Systems Engineering Department, Faculty of Engineering, Mansoura University, Mansoura, Egypt
| | - Hossam Magdy Balaha
- Computer and Control Systems Engineering Department, Faculty of Engineering, Mansoura University, Mansoura, Egypt
| | - Amira Y. Haikal
- Computer and Control Systems Engineering Department, Faculty of Engineering, Mansoura University, Mansoura, Egypt
| | - Eman M. El-Gendy
- Computer and Control Systems Engineering Department, Faculty of Engineering, Mansoura University, Mansoura, Egypt
| |
Collapse
|
38
|
Rana A, Dumka A, Singh R, Panda MK, Priyadarshi N, Twala B. Imperative Role of Machine Learning Algorithm for Detection of Parkinson’s Disease: Review, Challenges and Recommendations. Diagnostics (Basel) 2022; 12:diagnostics12082003. [PMID: 36010353 PMCID: PMC9407112 DOI: 10.3390/diagnostics12082003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2022] [Revised: 08/08/2022] [Accepted: 08/11/2022] [Indexed: 11/24/2022] Open
Abstract
Parkinson’s disease (PD) is a neurodegenerative disease that affects the neural, behavioral, and physiological systems of the brain. This disease is also known as tremor. The common symptoms of this disease are a slowness of movement known as ‘bradykinesia’, loss of automatic movements, speech/writing changes, and difficulty with walking at early stages. To solve these issues and to enhance the diagnostic process of PD, machine learning (ML) algorithms have been implemented for the categorization of subjective disease and healthy controls (HC) with comparable medical appearances. To provide a far-reaching outline of data modalities and artificial intelligence techniques that have been utilized in the analysis and diagnosis of PD, we conducted a literature analysis of research papers published up until 2022. A total of 112 research papers were included in this study, with an examination of their targets, data sources and different types of datasets, ML algorithms, and associated outcomes. The results showed that ML approaches and new biomarkers have a lot of promise for being used in clinical decision-making, resulting in a more systematic and informed diagnosis of PD. In this study, some major challenges were addressed along with a future recommendation.
Collapse
Affiliation(s)
- Arti Rana
- Computer Science & Engineering, Veer Madho Singh Bhandari Uttarakhand Technical University, Dehradun 248007, Uttarakhand, India
| | - Ankur Dumka
- Department of Computer Science and Engineering, Women Institute of Technology, Uttarakhand Technical University (UTU), Dehradun 248007, Uttarakhand, India
| | - Rajesh Singh
- Division of Research and Innovation, Uttaranchal Institute of Technology, Uttaranchal University, Dehradun 248007, Uttarakhand, India
- Department of Project Management, Universidad Internacional Iberoamericana, Campeche 24560, Mexico
| | - Manoj Kumar Panda
- Department of Electrical Engineering, G.B. Pant Institute of Engineering and Technology, Pauri 246194, Uttarakhand, India
| | - Neeraj Priyadarshi
- Department of Electrical Engineering, JIS College of Engineering, Kolkata 741235, West Bengal, India
| | - Bhekisipho Twala
- Digital Transformation Portfolio, Tshwane University of Technology, Staatsartillerie Rd, Pretoria West, Pretoria 0183, South Africa
- Correspondence:
| |
Collapse
|
39
|
Hecker P, Steckhan N, Eyben F, Schuller BW, Arnrich B. Voice Analysis for Neurological Disorder Recognition–A Systematic Review and Perspective on Emerging Trends. Front Digit Health 2022; 4:842301. [PMID: 35899034 PMCID: PMC9309252 DOI: 10.3389/fdgth.2022.842301] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2021] [Accepted: 05/25/2022] [Indexed: 11/25/2022] Open
Abstract
Quantifying neurological disorders from voice is a rapidly growing field of research and holds promise for unobtrusive and large-scale disorder monitoring. The data recording setup and data analysis pipelines are both crucial aspects to effectively obtain relevant information from participants. Therefore, we performed a systematic review to provide a high-level overview of practices across various neurological disorders and highlight emerging trends. PRISMA-based literature searches were conducted through PubMed, Web of Science, and IEEE Xplore to identify publications in which original (i.e., newly recorded) datasets were collected. Disorders of interest were psychiatric as well as neurodegenerative disorders, such as bipolar disorder, depression, and stress, as well as amyotrophic lateral sclerosis amyotrophic lateral sclerosis, Alzheimer's, and Parkinson's disease, and speech impairments (aphasia, dysarthria, and dysphonia). Of the 43 retrieved studies, Parkinson's disease is represented most prominently with 19 discovered datasets. Free speech and read speech tasks are most commonly used across disorders. Besides popular feature extraction toolkits, many studies utilise custom-built feature sets. Correlations of acoustic features with psychiatric and neurodegenerative disorders are presented. In terms of analysis, statistical analysis for significance of individual features is commonly used, as well as predictive modeling approaches, especially with support vector machines and a small number of artificial neural networks. An emerging trend and recommendation for future studies is to collect data in everyday life to facilitate longitudinal data collection and to capture the behavior of participants more naturally. Another emerging trend is to record additional modalities to voice, which can potentially increase analytical performance.
Collapse
Affiliation(s)
- Pascal Hecker
- Digital Health – Connected Healthcare, Hasso Plattner Institute, University of Potsdam, Potsdam, Germany
- audEERING GmbH, Gilching, Germany
- *Correspondence: Pascal Hecker ; orcid.org/0000-0001-6604-1671
| | - Nico Steckhan
- Digital Health – Connected Healthcare, Hasso Plattner Institute, University of Potsdam, Potsdam, Germany
| | | | - Björn W. Schuller
- audEERING GmbH, Gilching, Germany
- EIHW – Chair of Embedded Intelligence for Health Care and Wellbeing, University of Augsburg, Augsburg, Germany
- GLAM – Group on Language, Audio, & Music, Imperial College London, London, United Kingdom
| | - Bert Arnrich
- Digital Health – Connected Healthcare, Hasso Plattner Institute, University of Potsdam, Potsdam, Germany
| |
Collapse
|
40
|
She Y, Wang Z, Shen J. Gaining Outlier Resistance With Progressive Quantiles: Fast Algorithms and Theoretical Studies. J Am Stat Assoc 2022. [DOI: 10.1080/01621459.2020.1850460] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
Affiliation(s)
- Yiyuan She
- Department of Statistics, Florida State University, Tallahassee, FL
| | - Zhifeng Wang
- Department of Statistics, Florida State University, Tallahassee, FL
| | - Jiahui Shen
- Department of Statistics, Florida State University, Tallahassee, FL
| |
Collapse
|
41
|
Phonemes based detection of parkinson's disease for telehealth applications. Sci Rep 2022; 12:9687. [PMID: 35690657 PMCID: PMC9188600 DOI: 10.1038/s41598-022-13865-z] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2021] [Accepted: 05/30/2022] [Indexed: 12/22/2022] Open
Abstract
Dysarthria is an early symptom of Parkinson’s disease (PD) which has been proposed for detection and monitoring of the disease with potential for telehealth. However, with inherent differences between voices of different people, computerized analysis have not demonstrated high performance that is consistent for different datasets. The aim of this study was to improve the performance in detecting PD voices and test this with different datasets. This study has investigated the effectiveness of three groups of phoneme parameters, i.e. voice intensity variation, perturbation of glottal vibration, and apparent vocal tract length (VTL) for differentiating people with PD from healthy subjects using two public databases. The parameters were extracted from five sustained phonemes; /a/, /e/, /i/, /o/, and /u/, recorded from 50 PD patients and 50 healthy subjects of PC-GITA dataset. The features were statistically investigated, and then classified using Support Vector Machine (SVM). This was repeated on Viswanathan dataset with smartphone-based recordings of /a/, /o/, and /m/ of 24 PD and 22 age-matched healthy people. VTL parameters gave the highest difference between voices of people with PD and healthy subjects; classification accuracy with the five vowels of PC-GITA dataset was 84.3% while the accuracy for other features was between 54% and 69.2%. The accuracy for Viswanathan’s dataset was 96.0%. This study has demonstrated that VTL obtained from the recording of phonemes using smartphone can accurately identify people with PD. The analysis was fully computerized and automated, and this has the potential for telehealth diagnosis for PD.
Collapse
|
42
|
Senturk ZK. Layer recurrent neural network-based diagnosis of Parkinson’s disease using voice features. BIOMED ENG-BIOMED TE 2022; 67:249-266. [DOI: 10.1515/bmt-2022-0022] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2022] [Accepted: 05/18/2022] [Indexed: 12/13/2022]
Abstract
Abstract
Parkinson’s disease (PD), a slow-progressing neurological disease, affects a large percentage of the world’s elderly population, and this population is expected to grow over the next decade. As a result, early detection is crucial for community health and the future of the globe in order to take proper safeguards and have a less arduous treatment procedure. Recent research has begun to focus on the motor system deficits caused by PD. Because practically most of the PD patients suffer from voice abnormalities, researchers working on automated diagnostic systems investigate vocal impairments. In this paper, we undertake extensive experiments with features extracted from voice signals. We propose a layer Recurrent Neural Network (RNN) based diagnosis for PD. To prove the efficiency of the model, different network models are compared. To the best of our knowledge, several neural network topologies, namely RNN, Cascade Forward Neural Networks (CFNN), and Feed Forward Neural Networks (FFNN), are used and compared for voice-based PD detection for the first time. In addition, the impacts of data normalization and feature selection (FS) are thoroughly examined. The findings reveal that normalization increases classifier performance and Laplacian-based FS outperforms. The proposed RNN model with 300 voice features achieves 99.74% accuracy.
Collapse
Affiliation(s)
- Zehra Karapinar Senturk
- Computer Engineering Department , Faculty of Engineering, Duzce University , 81620 , Duzce , Turkey
| |
Collapse
|
43
|
FT4cip: A new functional tree for classification in class imbalance problems. Knowl Based Syst 2022. [DOI: 10.1016/j.knosys.2022.109294] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
|
44
|
Diagnostic classification of Parkinson’s disease based on non-motor manifestations and machine learning strategies. Neural Comput Appl 2022. [DOI: 10.1007/s00521-022-07256-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Abstract
AbstractNon-motor manifestations of Parkinson’s disease (PD) appear early and have a significant impact on the quality of life of patients, but few studies have evaluated their predictive potential with machine learning algorithms. We evaluated 9 algorithms for discriminating PD patients from controls using a wide collection of non-motor clinical PD features from two databases: Biocruces (96 subjects) and PPMI (687 subjects). In addition, we evaluated whether the combination of both databases could improve the individual results. For each database 2 versions with different granularity were created and a feature selection process was performed. We observed that most of the algorithms were able to detect PD patients with high accuracy (>80%). Support Vector Machine and Multi-Layer Perceptron obtained the best performance, with an accuracy of 86.3% and 84.7%, respectively. Likewise, feature selection led to a significant reduction in the number of variables and to better performance. Besides, the enrichment of Biocruces database with data from PPMI moderately benefited the performance of the classification algorithms, especially the recall and to a lesser extent the accuracy, while the precision worsened slightly. The use of interpretable rules obtained by the RIPPER algorithm showed that simply using two variables (autonomic manifestations and olfactory dysfunction), it was possible to achieve an accuracy of 84.4%. Our study demonstrates that the analysis of non-motor parameters of PD through machine learning techniques can detect PD patients with high accuracy and recall, and allows us to select the most discriminative non-motor variables to create potential tools for PD screening.
Collapse
|
45
|
Prediction of Parkinson’s disease based on artificial neural networks using speech datasets. JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING 2022. [DOI: 10.1007/s12652-022-03825-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
|
46
|
Alghamdi NS, Zakariah M, Hoang VT, Elahi MM. Neurogenerative Disease Diagnosis in Cepstral Domain Using MFCC with Deep Learning. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2022; 2022:4364186. [PMID: 35419079 PMCID: PMC9001083 DOI: 10.1155/2022/4364186] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/12/2022] [Revised: 03/13/2022] [Accepted: 03/17/2022] [Indexed: 11/18/2022]
Abstract
Because underlying cognitive and neuromuscular activities regulate speech signals, biomarkers in the human voice can provide insight into neurological illnesses. Multiple motor and nonmotor aspects of neurologic voice disorders arise from an underlying neurologic condition such as Parkinson's disease, multiple sclerosis, myasthenia gravis, or ALS. Voice problems can be caused by disorders that affect the corticospinal system, cerebellum, basal ganglia, and upper or lower motoneurons. According to a new study, voice pathology detection technologies can successfully aid in the assessment of voice irregularities and enable the early diagnosis of voice pathology. In this paper, we offer two deep-learning-based computational models, 1-dimensional convolutional neural network (1D CNN) and 2-dimensional convolutional neural network (2D CNN), that simultaneously detect voice pathologies caused by neurological illnesses or other causes. From the German corpus Saarbruecken Voice Database (SVD), we used voice recordings of sustained vowel /a/ generated at normal pitch. The collected voice signals are padded and segmented to maintain homogeneity and increase the number of samples. Convolutional layers are applied to raw data, and MFCC features are extracted in this project. Although the 1D CNN had the maximum accuracy of 93.11% on test data, model training produced overfitting and 2D CNN, which generalized the data better and had lower train and validation loss despite having an accuracy of 84.17% on test data. Also, 2D CNN outperforms state-of-the-art studies in the field, implying that a model trained on handcrafted features is better for speech processing than a model that extracts features directly.
Collapse
Affiliation(s)
- Norah Saleh Alghamdi
- Department of Computer Sciences, College of Computer and Information Sciences, Princess Nourah Bint Abdulrahman University, P.O.Box 84428, Riyadh 11671, Saudi Arabia
| | - Mohammed Zakariah
- Department of Computer Science, College of Computer and Information Sciences, King Saud University, P.O. Box 57168, Riyadh 21574, Saudi Arabia
| | - Vinh Truong Hoang
- Faculty of Computer Science, Ho Chi Minh City Open University, 97 Vo Van Tan, Ward Vo Thi Sau, District 3. Ho Chi Minh City: 70000, Vietnam
| | - Mohammad Mamun Elahi
- Department of Computer Science and Engineering, United International University, Dhaka, Bangladesh
| |
Collapse
|
47
|
Quan C, Ren K, Luo Z, Chen Z, Ling Y. End-to-end deep learning approach for Parkinson’s disease detection from speech signals. Biocybern Biomed Eng 2022. [DOI: 10.1016/j.bbe.2022.04.002] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
48
|
Omberg L, Chaibub Neto E, Perumal TM, Pratap A, Tediarjo A, Adams J, Bloem BR, Bot BM, Elson M, Goldman SM, Kellen MR, Kieburtz K, Klein A, Little MA, Schneider R, Suver C, Tarolli C, Tanner CM, Trister AD, Wilbanks J, Dorsey ER, Mangravite LM. Remote smartphone monitoring of Parkinson's disease and individual response to therapy. Nat Biotechnol 2022; 40:480-487. [PMID: 34373643 DOI: 10.1038/s41587-021-00974-9] [Citation(s) in RCA: 48] [Impact Index Per Article: 24.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2018] [Accepted: 06/04/2021] [Indexed: 02/07/2023]
Abstract
Remote health assessments that gather real-world data (RWD) outside clinic settings require a clear understanding of appropriate methods for data collection, quality assessment, analysis and interpretation. Here we examine the performance and limitations of smartphones in collecting RWD in the remote mPower observational study of Parkinson's disease (PD). Within the first 6 months of study commencement, 960 participants had enrolled and performed at least five self-administered active PD symptom assessments (speeded tapping, gait/balance, phonation or memory). Task performance, especially speeded tapping, was predictive of self-reported PD status (area under the receiver operating characteristic curve (AUC) = 0.8) and correlated with in-clinic evaluation of disease severity (r = 0.71; P < 1.8 × 10-6) when compared with motor Movement Disorder Society-Unified Parkinson's Disease Rating Scale (MDS-UPDRS). Although remote assessment requires careful consideration for accurate interpretation of RWD, our results support the use of smartphones and wearables in objective and personalized disease assessments.
Collapse
Affiliation(s)
| | | | | | - Abhishek Pratap
- Sage Bionetworks, Seattle, WA, USA.,Department of Biomedical Informatics and Medical Education, University of Washington, Seattle, WA, USA
| | | | - Jamie Adams
- Center for Health and Technology, University of Rochester Medical Center, Rochester, NY, USA.,Department of Neurology, University of Rochester Medical Center, Rochester, NY, USA
| | - Bastiaan R Bloem
- Radboud University Medical Center; Donders Institute for Brain, Cognition and Behaviour; Department of Neurology, Nijmegen, the Netherlands
| | | | - Molly Elson
- Center for Health and Technology, University of Rochester Medical Center, Rochester, NY, USA
| | - Samuel M Goldman
- Department of Neurology, University of California-San Francisco and Parkinson's Disease Research, Education and Clinical Center, San Francisco Veterans Affairs Health Care System, San Francisco, CA, USA
| | | | - Karl Kieburtz
- Center for Health and Technology, University of Rochester Medical Center, Rochester, NY, USA.,Department of Neurology, University of Rochester Medical Center, Rochester, NY, USA
| | | | - Max A Little
- School of Computer Science, University of Birmingham, Birmingham, UK.,Media Lab, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Ruth Schneider
- Center for Health and Technology, University of Rochester Medical Center, Rochester, NY, USA.,Department of Neurology, University of Rochester Medical Center, Rochester, NY, USA
| | | | - Christopher Tarolli
- Center for Health and Technology, University of Rochester Medical Center, Rochester, NY, USA.,Department of Neurology, University of Rochester Medical Center, Rochester, NY, USA
| | - Caroline M Tanner
- Department of Neurology, University of California-San Francisco and Parkinson's Disease Research, Education and Clinical Center, San Francisco Veterans Affairs Health Care System, San Francisco, CA, USA
| | | | | | - E Ray Dorsey
- Center for Health and Technology, University of Rochester Medical Center, Rochester, NY, USA.,Department of Neurology, University of Rochester Medical Center, Rochester, NY, USA
| | | |
Collapse
|
49
|
Karan B, Sahu SS, Orozco-Arroyave JR. An investigation about the relationship between dysarthria level of speech and the neurological state of Parkinson’s patients. Biocybern Biomed Eng 2022. [DOI: 10.1016/j.bbe.2022.04.003] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
|
50
|
Lima AA, Mridha MF, Das SC, Kabir MM, Islam MR, Watanobe Y. A Comprehensive Survey on the Detection, Classification, and Challenges of Neurological Disorders. BIOLOGY 2022; 11:469. [PMID: 35336842 PMCID: PMC8945195 DOI: 10.3390/biology11030469] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/13/2022] [Revised: 03/12/2022] [Accepted: 03/14/2022] [Indexed: 01/19/2023]
Abstract
Neurological disorders (NDs) are becoming more common, posing a concern to pregnant women, parents, healthy infants, and children. Neurological disorders arise in a wide variety of forms, each with its own set of origins, complications, and results. In recent years, the intricacy of brain functionalities has received a better understanding due to neuroimaging modalities, such as magnetic resonance imaging (MRI), magnetoencephalography (MEG), and positron emission tomography (PET), etc. With high-performance computational tools and various machine learning (ML) and deep learning (DL) methods, these modalities have discovered exciting possibilities for identifying and diagnosing neurological disorders. This study follows a computer-aided diagnosis methodology, leading to an overview of pre-processing and feature extraction techniques. The performance of existing ML and DL approaches for detecting NDs is critically reviewed and compared in this article. A comprehensive portion of this study also shows various modalities and disease-specified datasets that detect and records images, signals, and speeches, etc. Limited related works are also summarized on NDs, as this domain has significantly fewer works focused on disease and detection criteria. Some of the standard evaluation metrics are also presented in this study for better result analysis and comparison. This research has also been outlined in a consistent workflow. At the conclusion, a mandatory discussion section has been included to elaborate on open research challenges and directions for future work in this emerging field.
Collapse
Affiliation(s)
- Aklima Akter Lima
- Department of Computer Science and Engineering, Bangladesh University of Business and Technology, Dhaka 1216, Bangladesh; (A.A.L.); (M.F.M.); (S.C.D.); (M.M.K.)
| | - M. Firoz Mridha
- Department of Computer Science and Engineering, Bangladesh University of Business and Technology, Dhaka 1216, Bangladesh; (A.A.L.); (M.F.M.); (S.C.D.); (M.M.K.)
| | - Sujoy Chandra Das
- Department of Computer Science and Engineering, Bangladesh University of Business and Technology, Dhaka 1216, Bangladesh; (A.A.L.); (M.F.M.); (S.C.D.); (M.M.K.)
| | - Muhammad Mohsin Kabir
- Department of Computer Science and Engineering, Bangladesh University of Business and Technology, Dhaka 1216, Bangladesh; (A.A.L.); (M.F.M.); (S.C.D.); (M.M.K.)
| | - Md. Rashedul Islam
- Department of Computer Science and Engineering, University of Asia Pacific, Dhaka 1216, Bangladesh
| | - Yutaka Watanobe
- Department of Computer Science and Engineering, University of Aizu, Aizu-Wakamatsu 965-8580, Japan;
| |
Collapse
|