Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Vaziri G, Almasganj F, Behroozmand R. Pathological assessment of patients’ speech signals using nonlinear dynamical analysis. Comput Biol Med 2010;40:54-63. [PMID: 19962694 DOI: 10.1016/j.compbiomed.2009.10.011] [Citation(s) in RCA: 51] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2008] [Revised: 09/26/2009] [Accepted: 10/27/2009] [Indexed: 11/26/2022]

For:	Vaziri G, Almasganj F, Behroozmand R. Pathological assessment of patients’ speech signals using nonlinear dynamical analysis. Comput Biol Med 2010;40:54-63. [PMID: 19962694 DOI: 10.1016/j.compbiomed.2009.10.011] [Citation(s) in RCA: 51] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2008] [Revised: 09/26/2009] [Accepted: 10/27/2009] [Indexed: 11/26/2022]

Number

Cited by Other Article(s)

Kuo HC, Hsieh YP, Tseng HH, Wang CT, Fang SH, Tsao Y. Toward Real-World Voice Disorder Classification. IEEE Trans Biomed Eng 2023;70:2922-2932. [PMID: 37099463 DOI: 10.1109/tbme.2023.3270532] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/27/2023]

Abstract

OBJECTIVE

Voice disorders significantly compromise individuals' ability to speak in their daily lives. Without early diagnosis and treatment, these disorders may deteriorate drastically. Thus, automatic classification systems at home are desirable for people who are inaccessible to clinical disease assessments. However, the performance of such systems may be weakened due to the constrained resources and domain mismatch between the clinical data and noisy real-world data.

METHODS

This study develops a compact and domain-robust voice disorder classification system to identify the utterances of health, neoplasm, and benign structural diseases. Our proposed system utilizes a feature extractor model composed of factorized convolutional neural networks and subsequently deploys domain adversarial training to reconcile the domain mismatch by extracting domain-invariant features.

RESULTS

The results show that the unweighted average recall in the noisy real-world domain improved by 13% and remained at 80% in the clinic domain with only slight degradation. The domain mismatch was effectively eliminated. Moreover, the proposed system reduced the usage of both memory and computation by over 73.9%.

CONCLUSION

By deploying factorized convolutional neural networks and domain adversarial training, domain-invariant features can be derived for voice disorder classification with limited resources. The promising results confirm that the proposed system can significantly reduce resource consumption and improve classification accuracy by considering the domain mismatch.

SIGNIFICANCE

To the best of our knowledge, this is the first study that jointly considers real-world model compression and noise-robustness issues in voice disorder classification. The proposed system is intended for application to embedded systems with limited resources.

Collapse

Shahbazi-Gahrouei D, Bagherzadeh S, Torabinezhad F, Mahdavi SM, Fadavi P, Salmanian S. Binary logistic regression modeling of voice impairment and voice assessment in iranian patients with nonlaryngeal head-and-neck cancers after chemoradiation therapy: Objective and subjective voice evaluation. JOURNAL OF MEDICAL SIGNALS & SENSORS 2023. [DOI: 10.4103/jmss.jmss_143_21] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/29/2023]

Rong P, Hansen O, Heidrick L. Relationship between rate-elicited changes in muscular-kinematic control strategies and acoustic performance in individuals with ALS-A multimodal investigation. JOURNAL OF COMMUNICATION DISORDERS 2022;99:106253. [PMID: 36007484 DOI: 10.1016/j.jcomdis.2022.106253] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/24/2022] [Revised: 08/08/2022] [Accepted: 08/09/2022] [Indexed: 06/15/2023]

Abstract

INTRODUCTION

As a key control variable, duration has been long suspected to mediate the organization of speech motor control strategies, which has management implications for neuromotor speech disorders. This study aimed to experimentally delineate the role of duration in organizing speech motor control in neurologically healthy and impaired speakers using a voluntary speaking rate manipulation paradigm.

METHODS

Thirteen individuals with amyotrophic lateral sclerosis (ALS) and 10 healthy controls performed a sentence reading task three times, first at their habitual rate, then at a slower rate. A multimodal approach combining surface electromyography, kinematic, and acoustic technologies was used to record jaw muscle activities, jaw kinematics, and speech acoustics. Six muscular-kinematic features were extracted and factor-analyzed to characterize the organization of the mandibular control hierarchy. Five acoustic features were extracted, measuring the spectrotemporal properties of the diphthong /ɑɪ/ and the plosives /t/ and /k/.

RESULTS

The muscular-kinematic features converged into two interpretable latent factors, reflecting the level and cohesiveness/flexibility of mandibular control, respectively. Voluntary rate reduction led to a trend toward (1) finer, less cohesive, and more flexible mandibular control, and (2) increased range and decreased transition slope of the diphthong formants, across neurologically healthy and impaired groups. Differential correlations were found between the rate-elicited changes in mandibular control and acoustic performance for neurologically healthy and impaired speakers.

CONCLUSIONS

The results provided empirical evidence for the long-suspected but previously unsubstantiated role of duration in (re)organizing speech motor control strategies. The rate-elicited reorganization of muscular-kinematic control contributed to the acoustic performance of healthy speakers, in ways consistent with theoretical predictions. Such contributions were less consistent in impaired speakers, implying the complex nature of speaking rate reduction in ALS, possibly reflecting an interplay of disease-related constraints and volitional duration control. This information may help to stratify and identify candidates for the rate manipulation therapy.

Collapse

Bao G, Lin M, Sang X, Hou Y, Liu Y, Wu Y. Classification of Dysphonic Voices in Parkinson's Disease with Semi-Supervised Competitive Learning Algorithm. BIOSENSORS 2022;12:502. [PMID: 35884305 PMCID: PMC9312485 DOI: 10.3390/bios12070502] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/10/2022] [Revised: 07/04/2022] [Accepted: 07/07/2022] [Indexed: 06/15/2023]

Ghasemzadeh H, Doyle PC, Searl J. Image representation of the acoustic signal: An effective tool for modeling spectral and temporal dynamics of connected speech. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2022;152:580. [PMID: 35931551 PMCID: PMC9458292 DOI: 10.1121/10.0012734] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/10/2022] [Revised: 06/09/2022] [Accepted: 06/30/2022] [Indexed: 06/15/2023]

Romana A, Bandon J, Carlozzi N, Roberts A, Provost EM. Classification of Manifest Huntington Disease using Vowel Distortion Measures. INTERSPEECH 2020;2020:4966-4970. [PMID: 33244474 PMCID: PMC7685306 DOI: 10.21437/interspeech.2020-2724] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]

Chen L, Chen J. Deep Neural Network for Automatic Classification of Pathological Voice Signals. J Voice 2020;36:288.e15-288.e24. [PMID: 32660846 DOI: 10.1016/j.jvoice.2020.05.029] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2020] [Revised: 05/17/2020] [Accepted: 05/26/2020] [Indexed: 10/23/2022]

Complexity Measures of Voice Recordings as a Discriminative Tool for Parkinson's Disease. BIOSENSORS-BASEL 2019;10:bios10010001. [PMID: 31861890 PMCID: PMC7168233 DOI: 10.3390/bios10010001] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/03/2019] [Revised: 12/17/2019] [Accepted: 12/17/2019] [Indexed: 11/24/2022]

Abstract

In this paper, we have investigated the differences in the voices of Parkinson’s disease (PD) and age-matched control (CO) subjects when uttering three phonemes using two complexity measures: fractal dimension (FD) and normalised mutual information (NMI). Three sustained phonetic voice recordings, /a/, /u/ and /m/, from 22 CO (mean age = 66.91) and 24 PD (mean age = 71.83) participants were analysed. FD was first computed for PD and CO voice recordings, followed by the computation of NMI between the test groups: PD–CO, PD–PD and CO–CO. Four features reported in the literature—normalised pitch period entropy (Norm. PPE), glottal-to-noise excitation ratio (GNE), detrended fluctuation analysis (DFA) and glottal closing quotient (ClQ)—were also computed for comparison with the proposed complexity measures. The statistical significance of the features was tested using a one-way ANOVA test. Support vector machine (SVM) with a linear kernel was used to classify the test groups, using a leave-one-out validation method. The results showed that PD voice recordings had lower FD compared to CO (p < 0.008). It was also observed that the average NMI between CO voice recordings was significantly lower compared with the CO–PD and PD–PD groups (p < 0.036) for the three phonetic sounds. The average NMI and FD demonstrated higher accuracy (>80%) in differentiating the test groups compared with other speech feature-based classifications. This study has demonstrated that the voices of PD patients has reduced FD, and NMI between voice recordings of PD–CO and PD–PD is higher compared with CO–CO. This suggests that the use of NMI obtained from the sample voice, when paired with known groups of CO and PD, can be used to identify PD voices. These findings could have applications for population screening.

Collapse

On the design of automatic voice condition analysis systems. Part I: Review of concepts and an insight to the state of the art. Biomed Signal Process Control 2019. [DOI: 10.1016/j.bspc.2018.12.024] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]

Detection of Pathological Voice Using Cepstrum Vectors: A Deep Learning Approach. J Voice 2018;33:634-641. [PMID: 29567049 DOI: 10.1016/j.jvoice.2018.02.003] [Citation(s) in RCA: 56] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2017] [Accepted: 02/06/2018] [Indexed: 01/20/2023]

Wu Y, Chen P, Yao Y, Ye X, Xiao Y, Liao L, Wu M, Chen J. Dysphonic Voice Pattern Analysis of Patients in Parkinson's Disease Using Minimum Interclass Probability Risk Feature Selection and Bagging Ensemble Learning Methods. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2017;2017:4201984. [PMID: 28553366 PMCID: PMC5434464 DOI: 10.1155/2017/4201984] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/21/2016] [Revised: 03/08/2017] [Accepted: 04/06/2017] [Indexed: 11/17/2022]

Lopes LW, Batista Simões L, Delfino da Silva J, da Silva Evangelista D, da Nóbrega e Ugulino AC, Oliveira Costa Silva P, Jefferson Dias Vieira V. Accuracy of Acoustic Analysis Measurements in the Evaluation of Patients With Different Laryngeal Diagnoses. J Voice 2017;31:382.e15-382.e26. [DOI: 10.1016/j.jvoice.2016.08.015] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2016] [Revised: 08/20/2016] [Accepted: 08/23/2016] [Indexed: 11/29/2022]

Speech disorders in Parkinson’s disease: early diagnostics and effects of medication and brain stimulation. J Neural Transm (Vienna) 2017;124:303-334. [PMID: 28101650 DOI: 10.1007/s00702-017-1676-0] [Citation(s) in RCA: 98] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2016] [Accepted: 01/04/2017] [Indexed: 01/31/2023]

Mekyska J, Janousova E, Gomez-Vilda P, Smekal Z, Rektorova I, Eliasova I, Kostalova M, Mrackova M, Alonso-Hernandez JB, Faundez-Zanuy M, López-de-Ipiña K. Robust and complex approach of pathological speech signal analysis. Neurocomputing 2015. [DOI: 10.1016/j.neucom.2015.02.085] [Citation(s) in RCA: 77] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]

Ghasemzadeh H, Tajik Khass M, Khalil Arjmandi M, Pooyan M. Detection of vocal disorders based on phase space parameters and Lyapunov spectrum. Biomed Signal Process Control 2015. [DOI: 10.1016/j.bspc.2015.07.002] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]

Nonlinear dynamics characterization of emotional speech. Neurocomputing 2014. [DOI: 10.1016/j.neucom.2012.05.037] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]

Yang S, Zheng F, Luo X, Cai S, Wu Y, Liu K, Wu M, Chen J, Krishnan S. Effective dysphonia detection using feature dimension reduction and kernel density estimation for patients with Parkinson's disease. PLoS One 2014;9:e88825. [PMID: 24586406 PMCID: PMC3930574 DOI: 10.1371/journal.pone.0088825] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2013] [Accepted: 01/12/2014] [Indexed: 11/19/2022] Open

Erfanian Saeedi N, Almasganj F. Wavelet adaptation for automatic voice disorders sorting. Comput Biol Med 2013;43:699-704. [PMID: 23668345 DOI: 10.1016/j.compbiomed.2013.03.006] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2011] [Revised: 11/22/2012] [Accepted: 03/17/2013] [Indexed: 10/27/2022]

Todder D, Avissar S, Schreiber G. Non-Linear Dynamic Analysis of Inter-Word Time Intervals in Psychotic Speech. IEEE JOURNAL OF TRANSLATIONAL ENGINEERING IN HEALTH AND MEDICINE-JTEHM 2013;1:2200107. [PMID: 27170852 PMCID: PMC4819231 DOI: 10.1109/jtehm.2013.2268850] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/12/2013] [Revised: 05/27/2013] [Accepted: 06/07/2013] [Indexed: 11/23/2022]

Henríquez Rodríguez P, Alonso Hernández JB, Ferrer Ballester MA, Travieso González CM, Orozco-Arroyave JR. Global Selection of Features for Nonlinear Dynamics Characterization of Emotional Speech. Cognit Comput 2012. [DOI: 10.1007/s12559-012-9157-0] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]

Veiga J, Lopes AJ, Jansen JM, Melo PL. Airflow pattern complexity and airway obstruction in asthma. J Appl Physiol (1985) 2011;111:412-9. [PMID: 21565988 DOI: 10.1152/japplphysiol.00267.2011] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open

Arias-Londoño JD, Godino-Llorente JI, Sáenz-Lechón N, Osma-Ruiz V, Castellanos-Domínguez G. Automatic detection of pathological voices using complexity measures, noise parameters, and mel-cepstral coefficients. IEEE Trans Biomed Eng 2011;58:370-9. [PMID: 21257362 DOI: 10.1109/tbme.2010.2089052] [Citation(s) in RCA: 96] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Application of Nonlinear Dynamics Characterization to Emotional Speech. ACTA ACUST UNITED AC 2011. [DOI: 10.1007/978-3-642-25020-0_17] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register]