Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Hadjitodorov S, Boyanov B, Teston B. Laryngeal pathology detection by means of class-specific neural maps. IEEE Trans Inf Technol Biomed 2000;4:68-73. [PMID: 10761776 DOI: 10.1109/4233.826861] [Citation(s) in RCA: 54] [Impact Index Per Article: 2.3] [Indexed: 11/10/2022]

Cited by Other Article(s)

Shaikh AAS, Bhargavi MS, Naik GR. Unraveling the complexities of pathological voice through saliency analysis. Comput Biol Med 2023;166:107566. [PMID: 37857135 DOI: 10.1016/j.compbiomed.2023.107566] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/02/2023] [Revised: 09/14/2023] [Accepted: 10/10/2023] [Indexed: 10/21/2023]

Abstract

The human voice is an essential communication tool, but various disorders and habits can disrupt it. Diagnosis of pathological and abnormal voices is very important. Conventional diagnosis of these voice pathologies can be invasive and costly. Voice pathology disorders can be effectively detected using Artificial Intelligence and computer-aided voice pathology classification tools. Previous studies focused primarily on binary classification, leaving limited attention to multi-class classification. This study proposes three different neural network architectures to investigate the feature characteristics of three voice pathologies (Hyperkinetic Dysphonia, Hypokinetic Dysphonia, and Reflux Laryngitis) and healthy voices using multi-class classification and the Voice ICar fEDerico II (VOICED) dataset. The study proposes UNet++ autoencoder-based denoiser techniques for accurate feature extraction to overcome noisy data. The architectures include a Multi-Layer Perceptron (MLP) trained on structured feature sets, a Short-Time Fourier Transform (STFT) model, and a Mel-Frequency Cepstral Coefficients (MFCC) model. The MLP model on 143 features achieved 97.1% accuracy, while the STFT model showed similar performance with increased sensitivity of 99.8%. The MFCC model maintained 97.1% accuracy but with a smaller model size and improved accuracy on the Reflux Laryngitis class. The study identifies crucial features through saliency analysis and reveals that detecting voice abnormalities requires the identification of regions of inaudible high-pitch sounds. Additionally, the study highlights the challenges posed by limited and disjointed pathological voice databases and proposes solutions for enhancing the performance of voice abnormality classification. Overall, the study's findings have potential applications in clinical settings and specialized audio-capturing tools.

Alhussain G, Shuweihdi F, Abd-alrazaq A, Alali H, Househ M. The Effectiveness of Supervised Machine Learning in Screening and Diagnosing Voice Disorders: A Systematic Review and Meta-Analysis (Preprint). [DOI: 10.2196/preprints.38472] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 09/02/2023]

Abstract

BACKGROUND

Voice screening and diagnosis are processes that are used during voice disorders investigations. Both have limited standardized tests, which are affected by the clinician’s experience and subjective judgment. Machine learning (ML) algorithms were introduced and employed in screening/diagnosing patients’ voices as an objective tool. The effectiveness of ML algorithms in assessing and diagnosing voice disorders has been investigated by numerous studies.

OBJECTIVE

This systematic review aims to assess the effectiveness of ML algorithms in screening and diagnosing voice disorders.

METHODS

An electronic search was conducted in five databases. We included studies that examined the performance (accuracy, sensitivity, and specificity) of any ML algorithms in detecting abnormal voice samples. Two reviewers independently selected the studies, extracted data from the included studies, and assessed the risk of bias in the included studies. The methodological quality of each study was assessed using the QUADAS-2 tool. Characteristics of studies, population, and index tests were extracted. Meta-analyses were conducted for pooling accuracy, sensitivity, and specificity of ML techniques. Sources of heterogeneity were addressed by excluding some studies and discussing the possible sources of it.

RESULTS

Out of 1409 records retrieved, 13 studies were included (participants: 4079) in this review. Thirteen machine learning techniques were used in the included studies, but the most commonly used technique was SVM. The pooled accuracy, sensitivity, and specificity of ML techniques in screening voice disorders were 93%, 96%, and 93%, respectively. LS-SVM had the highest accuracy (99%) while K-NN had the highest sensitivity (98%) and specificity (98%). Quadric Discriminant analysis (QDA) achieved the lowest accuracy (91%), sensitivity (89%), and specificity (89%).

CONCLUSIONS

ML showed promising findings in screening voice disorders. However, the findings could not be conclusive in diagnosing voice disorders due to the limited number of studies that used ML for diagnosing purposes, thus, more investigations need to be made. Accordingly, it might not be possible to use ML as a substitution for the current diagnostic tools. Instead, it might be used as a decision support tool for clinicians to assess their patients, this could improve the management process for voice disorders assessment.

Al-Hussain G, Shuweihdi F, Alali H, Househ M, Abd-Alrazaq A. The Effectiveness of Supervised Machine Learning in Screening and Diagnosing Voice Disorders: A Systematic Review and Meta-Analysis (Preprint). J Med Internet Res 2022;24:e38472. [PMID: 36239999 PMCID: PMC9617188 DOI: 10.2196/38472] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Received: 04/04/2022] [Revised: 06/17/2022] [Accepted: 07/28/2022] [Indexed: 11/13/2022] Open

Diagnosis of Parkinson’s Disease at an Early Stage Using Volume Rendering SPECT Image Slices. ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING 2019. [DOI: 10.1007/s13369-019-04152-7] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Indexed: 12/15/2022]

On the design of automatic voice condition analysis systems. Part I: Review of concepts and an insight to the state of the art. Biomed Signal Process Control 2019. [DOI: 10.1016/j.bspc.2018.12.024] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Indexed: 12/14/2022]

Hegde S, Shetty S, Rai S, Dodderi T. A Survey on Machine Learning Approaches for Automatic Detection of Voice Disorders. J Voice 2018;33:947.e11-947.e33. [PMID: 30316551 DOI: 10.1016/j.jvoice.2018.07.014] [Citation(s) in RCA: 50] [Impact Index Per Article: 8.3] [Received: 03/29/2018] [Revised: 07/06/2018] [Accepted: 07/10/2018] [Indexed: 10/28/2022]

Deshpande PS, Manikandan MS. Effective Glottal Instant Detection and Electroglottographic Parameter Extraction for Automated Voice Pathology Assessment. IEEE J Biomed Health Inform 2018;22:398-408. [DOI: 10.1109/jbhi.2017.2654683] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Indexed: 11/08/2022]

An Expert Diagnosis System for Parkinson Disease Based on Genetic Algorithm-Wavelet Kernel-Extreme Learning Machine. PARKINSONS DISEASE 2016;2016:5264743. [PMID: 27274882 PMCID: PMC4871978 DOI: 10.1155/2016/5264743] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.1] [Received: 01/05/2016] [Revised: 03/24/2016] [Accepted: 03/29/2016] [Indexed: 11/17/2022]

Diagnosing Parkinson's Diseases Using Fuzzy Neural System. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2016;2016:1267919. [PMID: 26881009 PMCID: PMC4736962 DOI: 10.1155/2016/1267919] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.1] [Received: 09/30/2015] [Accepted: 12/14/2015] [Indexed: 11/25/2022]

Mehta DD, Van Stan JH, Zañartu M, Ghassemi M, Guttag JV, Espinoza VM, Cortés JP, Cheyne HA, Hillman RE. Using Ambulatory Voice Monitoring to Investigate Common Voice Disorders: Research Update. Front Bioeng Biotechnol 2015;3:155. [PMID: 26528472 PMCID: PMC4607864 DOI: 10.3389/fbioe.2015.00155] [Citation(s) in RCA: 80] [Impact Index Per Article: 8.9] [Received: 06/17/2015] [Accepted: 09/23/2015] [Indexed: 11/28/2022] Open

Jothilakshmi S. Automatic system to detect the type of voice pathology. Appl Soft Comput 2014. [DOI: 10.1016/j.asoc.2014.03.036] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Indexed: 10/25/2022]

Akbari A, Arjmandi MK. An efficient voice pathology classification scheme based on applying multi-layer linear discriminant analysis to wavelet packet-based features. Biomed Signal Process Control 2014. [DOI: 10.1016/j.bspc.2013.11.002] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.0] [Indexed: 11/17/2022]

Roy N, Barkmeier-Kraemer J, Eadie T, Sivasankar MP, Mehta D, Paul D, Hillman R. Evidence-based clinical voice assessment: a systematic review. AMERICAN JOURNAL OF SPEECH-LANGUAGE PATHOLOGY 2013. [PMID: 23184134 DOI: 10.1044/1058-0360(2012/12-0014)] [Citation(s) in RCA: 203] [Impact Index Per Article: 18.5] [Indexed: 05/16/2023]

An optimum algorithm in pathological voice quality assessment using wavelet-packet-based features, linear discriminant analysis and support vector machine. Biomed Signal Process Control 2012. [DOI: 10.1016/j.bspc.2011.03.010] [Citation(s) in RCA: 61] [Impact Index Per Article: 5.1] [Indexed: 11/22/2022]

Pathological Likelihood Index as a Measurement of the Degree of Voice Normality and Perceived Hoarseness. J Voice 2010;24:667-77. [DOI: 10.1016/j.jvoice.2009.04.003] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.6] [Received: 02/26/2009] [Accepted: 04/20/2009] [Indexed: 11/22/2022]

Verikas A, Gelzinis A, Bacauskiene M, Hållander M, Uloza V, Kaseta M. Combining image, voice, and the patient’s questionnaire data to categorize laryngeal disorders. Artif Intell Med 2010;49:43-50. [DOI: 10.1016/j.artmed.2010.02.002] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Received: 09/16/2008] [Revised: 01/19/2010] [Accepted: 02/16/2010] [Indexed: 11/28/2022]

Fonseca E, Pereira J. Normal versus pathological voice signals. IEEE Eng Med Biol Mag 2010;28:44-8. [PMID: 19775956 DOI: 10.1109/memb.2009.934248] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.5] [Indexed: 11/10/2022]

Godino-Llorente JI, Osma-Ruiz V, Sáenz-Lechón N, Gómez-Vilda P, Blanco-Velasco M, Cruz-Roldán F. The Effectiveness of the Glottal to Noise Excitation Ratio for the Screening of Voice Disorders. J Voice 2010;24:47-56. [DOI: 10.1016/j.jvoice.2008.04.006] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.8] [Received: 03/07/2008] [Accepted: 04/22/2008] [Indexed: 10/21/2022]

Khadivi Heris H, Seyed Aghazadeh B, Nikkhah-Bahrami M. Optimal feature selection for the assessment of vocal fold disorders. Comput Biol Med 2009;39:860-8. [DOI: 10.1016/j.compbiomed.2009.06.014] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.5] [Received: 08/14/2007] [Revised: 02/22/2009] [Accepted: 06/25/2009] [Indexed: 11/16/2022]

Advances in laryngeal imaging. Eur Arch Otorhinolaryngol 2009;266:1509-20. [PMID: 19618198 DOI: 10.1007/s00405-009-1050-4] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.9] [Received: 12/05/2008] [Accepted: 07/07/2009] [Indexed: 10/20/2022]

Godino-Llorente J, Fraile R, Sáenz-Lechón N, Osma-Ruiz V, Gómez-Vilda P. Automatic detection of voice impairments from text-dependent running speech. Biomed Signal Process Control 2009. [DOI: 10.1016/j.bspc.2009.01.007] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.2] [Indexed: 11/25/2022]

Little MA, McSharry PE, Hunter EJ, Spielman J, Ramig LO. Suitability of dysphonia measurements for telemonitoring of Parkinson's disease. IEEE Trans Biomed Eng 2009;56:1015. [PMID: 21399744 PMCID: PMC3051371 DOI: 10.1109/tbme.2008.2005954] [Citation(s) in RCA: 229] [Impact Index Per Article: 15.3] [Indexed: 11/07/2022]

Verikas A, Gelzinis A, Bacauskiene M, Uloza V, Kaseta M. Using the patient's questionnaire data to screen laryngeal disorders. Comput Biol Med 2009;39:148-55. [PMID: 19144329 DOI: 10.1016/j.compbiomed.2008.11.008] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Received: 06/30/2008] [Revised: 11/28/2008] [Accepted: 11/28/2008] [Indexed: 10/21/2022]

Wormald RN, Moran RJ, Reilly RB, Lacy PD. Performance of an Automated, Remote System to Detect Vocal Fold Paralysis. Ann Otol Rhinol Laryngol 2008;117:834-8. [DOI: 10.1177/000348940811701107] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.1] [Indexed: 11/16/2022]

Gelzinis A, Verikas A, Bacauskiene M. Automated speech analysis applied to laryngeal disease categorization. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2008;91:36-47. [PMID: 18346812 DOI: 10.1016/j.cmpb.2008.01.008] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.6] [Received: 09/08/2006] [Revised: 01/22/2008] [Accepted: 01/31/2008] [Indexed: 05/26/2023]

Crovato CDP, Schuck A. The use of wavelet packet transform and artificial neural networks in analysis and classification of dysphonic voices. IEEE Trans Biomed Eng 2007;54:1898-900. [PMID: 17926690 DOI: 10.1109/tbme.2006.889780] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.2] [Indexed: 11/06/2022]

Fonseca ES, Guido RC, Scalassara PR, Maciel CD, Pereira JC. Wavelet time-frequency analysis and least squares support vector machines for the identification of voice disorders. Comput Biol Med 2007;37:571-8. [PMID: 17078942 DOI: 10.1016/j.compbiomed.2006.08.008] [Citation(s) in RCA: 90] [Impact Index Per Article: 5.3] [Indexed: 11/29/2022]

Godino-Llorente JI, Gómez-Vilda P, Blanco-Velasco M. Dimensionality Reduction of a Pathological Voice Quality Assessment System Based on Gaussian Mixture Models and Short-Term Cepstral Parameters. IEEE Trans Biomed Eng 2006;53:1943-53. [PMID: 17019858 DOI: 10.1109/tbme.2006.871883] [Citation(s) in RCA: 93] [Impact Index Per Article: 5.2] [Indexed: 11/05/2022]

Moran RJ, Reilly RB, de Chazal P, Lacy PD. Telephony-Based Voice Pathology Assessment Using Automated Speech Analysis. IEEE Trans Biomed Eng 2006;53:468-77. [PMID: 16532773 DOI: 10.1109/tbme.2005.869776] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.1] [Indexed: 11/08/2022]

Godino-Llorente JI, Gómez-Vilda P. Automatic detection of voice impairments by means of short-term cepstral parameters and neural network based detectors. IEEE Trans Biomed Eng 2004;51:380-4. [PMID: 14765711 DOI: 10.1109/tbme.2003.820386] [Citation(s) in RCA: 76] [Impact Index Per Article: 3.8] [Indexed: 11/07/2022]