Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Zhang Z. Mechanics of human voice production and control. J Acoust Soc Am 2016;140:2614. [PMID: 27794319 PMCID: PMC5412481 DOI: 10.1121/1.4964509] [Citation(s) in RCA: 164] [Impact Index Per Article: 20.5] [Indexed: 05/07/2023]

Cited by Other Article(s)

Luchesi LC, Cavalcanti JC, Lucci TK, David VF, Otta E, Monticelli PF. Zygosity Effects on Human Voice: Fundamental Frequency Analysis of Brazilian Twins' Speech. Twin Res Hum Genet 2024:1-8. [PMID: 39355961 DOI: 10.1017/thg.2024.33] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 10/03/2024]

Calvache C, Castillo-Triana N, Aguirre FD, Leguízamo P, Rojas S, Valenzuela P, Piedrahita MM, Ardila MDPR, Pérez DVB. Integration of Dysphagia Therapy Techniques into Voice Rehabilitation: Design and Content Validation of a Cross-Therapy Protocol. J Voice 2024:S0892-1997(24)00235-2. [PMID: 39244386 DOI: 10.1016/j.jvoice.2024.07.024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/13/2024] [Revised: 06/06/2024] [Accepted: 07/22/2024] [Indexed: 09/09/2024]

Abstract

BACKGROUND

The intricate relationship between swallowing and phonation, sharing anatomical and physiological substrates, underscores a clinical demand for integrated therapeutic approaches. Existing interventions often address these functions in isolation, overlooking their interconnected dynamics.

OBJECTIVE

To design and validate a cross-therapy protocol incorporating dysphagia therapy techniques (maneuvers/exercises) into voice rehabilitation. This protocol aims to exploit the shared biomechanical components of swallowing and phonation to improve both functions simultaneously in patients with underlying hypofunctional laryngeal pathology.

METHODS

A descriptive research design was employed, consisting of three phases: a comprehensive literature review and expert discussions in a German seminar format to conceptualize the protocol; detailed analysis and categorization of swallowing maneuvers/exercises; and content validation by a panel of seven experts through a structured evaluation instrument. The process integrated motor learning and exercise physiology principles to ensure the protocol's clinical applicability and theoretical coherence.

RESULTS

The developed cross-therapy protocol incorporates four core swallowing therapy techniques into voice therapy procedures. The selected swallowing therapy techniques target laryngeal excursion and vocal fold closure because they are critical components of both swallowing and phonation. Expert validation yielded a Content Validity Coefficient exceeding 0.90 for most items, indicating high consensus on the protocol's relevance, clarity, and applicability. Adjustments were made based on feedback, enhancing the protocol's precision and user-friendliness.

CONCLUSION

We present a novel, evidence-based therapy protocol for voice and swallowing difficulties resulting from hypofunctional laryngeal pathology. Its development marks a significant step toward bridging the gap between swallowing and voice therapy. Future empirical studies are needed to assess its effectiveness in clinical settings.

Manda Y, Kodama N, Mori K, Adachi R, Matsugishi M, Minagi S. Basic characteristics of tongue pressure and electromyography generated by articulation of a syllable using the posterior part of the tongue. Sci Rep 2024;14:20756. [PMID: 39237702 PMCID: PMC11377720 DOI: 10.1038/s41598-024-71909-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/03/2023] [Accepted: 09/02/2024] [Indexed: 09/07/2024]

Payten CL, Chiapello G, Weir KA, Madill CJ. Frameworks, Terminology and Definitions Used for the Classification of Voice Disorders: A Scoping Review. J Voice 2024;38:1070-1087. [PMID: 35317970 DOI: 10.1016/j.jvoice.2022.02.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/13/2021] [Revised: 02/03/2022] [Accepted: 02/06/2022] [Indexed: 10/18/2022]

Abstract

BACKGROUND

A challenge for clinicians and researchers in laryngology is a lack of international consensus for an agreed framework to classify homogenous groups of voice disorders. Consistency in terminology and agreement in how conditions are classified will provide greater clarity for clinicians and researchers.

OBJECTIVE

This scoping review aimed to examine the published literature on frameworks, terminology, and criteria for the classification of voice disorders.

DESIGN

Seven online databases (MEDLINE, Embase, CINAHL, PsycInfo, Scopus, Cochrane Collaboration, Web of Science) and grey literature sources were searched. Studies published from 1940 to 2021 were included if they provided a descriptive detail of a classification framework structure and described the methodological approaches to determine classification. A narrative synthesis of the main concepts including terminology, classification criteria, grouping of conditions, critical appraisal items and gaps in research was undertaken.

RESULTS

A total of 2,675 publications were screened. Twenty sources met inclusion criteria, including published articles and grey literature. Thirty-five classification groups and over 150 sub-groups were described. The classification group labels and the criteria for inclusion of conditions varied across the frameworks. Several key themes in terminology and criteria useful for classification are discussed, and a core set of suggested terms and definitions is presented.

CONCLUSIONS

The quality of research on classification frameworks for voice disorders is low and not one system encompasses all voice disorders across the whole spectrum. Continued high quality research using consensus methodology and inter-rater reliability scores is recommended to develop and test an internationally agreed classification framework for voice disorders.

Jing H, Ge H, Tang H, Weng W, Choi S, Wang C, Wang L, Cui X. Assessing respiratory airflow unsteadiness under different tidal respiratory frequencies using large eddy simulation method. Comput Biol Med 2024;179:108834. [PMID: 38996553 DOI: 10.1016/j.compbiomed.2024.108834] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/04/2024] [Revised: 06/11/2024] [Accepted: 06/29/2024] [Indexed: 07/14/2024]

Abstract

Unsteady respiratory airflow characteristics play a crucial role in understanding the deposition of toxic particles and inhaled aerosol drugs in the human respiratory tract. Considering the variations in respiratory flow rate and glottis motion under different respiratory frequencies, these respiratory airflow characteristics are studied by large-eddy simulations, including pressure field, power loss, modal spatial patterns, and vortex structures. Firstly, the results reveal that varying respiratory frequencies significantly affect airflow unsteadiness, turbulent evolution, and vortex structure dissipation, as they increase the complexity and butterfly effect introduced by the turbulent disturbance. Secondly, the pressure drop and flow rate at the glottis also conform to a power-law relationship considering the respiratory physiological characteristics, especially under low respiratory frequencies. Glottis motion plays different roles in energy consumption during inspiration and expiration, and its magnitude can be predicted using a polynomial function based on glottis area and respiratory flow rate under different respiratory frequencies. Finally, modal decomposition can be effectively applied to the study of respiratory flow characteristics, but we recommend studying inspiration and expiration separately. The spatial distribution of the dominant mode characterizes the majority of respiratory flow characteristics and is influenced by respiratory frequency. Spectral entropy results indicate that glottis motion and slow breathing both delay the transitions in the upper respiratory tract during inspiration and expiration. These results confirm that the respiratory physiology characteristics under different respiratory frequencies have a significant impact on the unsteady respiratory airflow characteristics and warrant further study.

Zhang Z. Contribution of Undesired Medial Surface Shape to Suboptimal Voice Outcome After Medialization Laryngoplasty. J Voice 2024;38:1220-1226. [PMID: 35410779 DOI: 10.1016/j.jvoice.2022.03.010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/03/2022] [Revised: 03/09/2022] [Accepted: 03/10/2022] [Indexed: 10/18/2022]

Michaud-Dorko J, Farbos de Luzan C, Dion GR, Gutmark E, Oren L. Comparison of Aerodynamic and Elastic Properties in Tissue and Synthetic Models of Vocal Fold Vibrations. Bioengineering (Basel) 2024;11:834. [PMID: 39199792 PMCID: PMC11351855 DOI: 10.3390/bioengineering11080834] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/17/2024] [Revised: 08/13/2024] [Accepted: 08/14/2024] [Indexed: 09/01/2024]

Deng JJ, Peterson SD. Sensitivity of Phonation Onset Pressure to Vocal Fold Stiffness Distribution. J Biomech Eng 2024;146:081003. [PMID: 38345603 DOI: 10.1115/1.4064718] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/07/2023] [Indexed: 03/22/2024]

Parra JA, Calvache C, Alzamendi GA, Ibarra EJ, Soláque L, Peterson SD, Zañartu M. Asymmetric triangular body-cover model of the vocal folds with bilateral intrinsic muscle activation. J Acoust Soc Am 2024;156:939-953. [PMID: 39133633 DOI: 10.1121/10.0028164] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/18/2024] [Accepted: 07/12/2024] [Indexed: 08/21/2024]

Thomson SL. Synthetic, self-oscillating vocal fold models for voice production research. J Acoust Soc Am 2024;156:1283-1308. [PMID: 39172710 PMCID: PMC11348498 DOI: 10.1121/10.0028267] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/31/2023] [Revised: 07/26/2024] [Accepted: 07/30/2024] [Indexed: 08/24/2024]

Cecchin-Albertoni C, Deny O, Planat-Bénard V, Guissard C, Paupert J, Vaysse F, Marty M, Casteilla L, Monsarrat P, Kémoun P. The oral organ: A new vision of the mouth as a whole for a gerophysiological approach to healthy aging. Ageing Res Rev 2024;99:102360. [PMID: 38821417 DOI: 10.1016/j.arr.2024.102360] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/25/2024] [Revised: 05/07/2024] [Accepted: 05/28/2024] [Indexed: 06/02/2024]

Affiliation(s)

Chiara Cecchin-Albertoni Oral Medicine Department and CHU de Toulouse, Toulouse Institute of Oral Medicine and Science, Toulouse, France; RESTORE Research Center, Université de Toulouse, INSERM, CNRS, EFS, ENVT, Université P. Sabatier, Toulouse, France
Olivier Deny Oral Medicine Department and CHU de Toulouse, Toulouse Institute of Oral Medicine and Science, Toulouse, France; RESTORE Research Center, Université de Toulouse, INSERM, CNRS, EFS, ENVT, Université P. Sabatier, Toulouse, France
Valérie Planat-Bénard RESTORE Research Center, Université de Toulouse, INSERM, CNRS, EFS, ENVT, Université P. Sabatier, Toulouse, France
Christophe Guissard Oral Medicine Department and CHU de Toulouse, Toulouse Institute of Oral Medicine and Science, Toulouse, France; RESTORE Research Center, Université de Toulouse, INSERM, CNRS, EFS, ENVT, Université P. Sabatier, Toulouse, France
Jenny Paupert RESTORE Research Center, Université de Toulouse, INSERM, CNRS, EFS, ENVT, Université P. Sabatier, Toulouse, France
Frédéric Vaysse Oral Medicine Department and CHU de Toulouse, Toulouse Institute of Oral Medicine and Science, Toulouse, France
Mathieu Marty Oral Medicine Department and CHU de Toulouse, Toulouse Institute of Oral Medicine and Science, Toulouse, France; LIRDEF, Faculty of Educational Sciences, Paul Valery University, Montpellier CEDEX 5 34199, France
Louis Casteilla RESTORE Research Center, Université de Toulouse, INSERM, CNRS, EFS, ENVT, Université P. Sabatier, Toulouse, France
Paul Monsarrat Oral Medicine Department and CHU de Toulouse, Toulouse Institute of Oral Medicine and Science, Toulouse, France; RESTORE Research Center, Université de Toulouse, INSERM, CNRS, EFS, ENVT, Université P. Sabatier, Toulouse, France; Artificial and Natural Intelligence Toulouse Institute ANITI, Toulouse, France
Philippe Kémoun Oral Medicine Department and CHU de Toulouse, Toulouse Institute of Oral Medicine and Science, Toulouse, France; RESTORE Research Center, Université de Toulouse, INSERM, CNRS, EFS, ENVT, Université P. Sabatier, Toulouse, France

Zhang Z. Principal dimensions of voice production and their role in vocal expression. J Acoust Soc Am 2024;156:278-283. [PMID: 38980102 PMCID: PMC11236430 DOI: 10.1121/10.0027913] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/08/2024] [Revised: 06/20/2024] [Accepted: 06/24/2024] [Indexed: 07/10/2024]

Borjon JI, Abney DH, Yu C, Smith LB. Infant vocal productions coincide with body movements. Dev Sci 2024;27:e13491. [PMID: 38433472 PMCID: PMC11161311 DOI: 10.1111/desc.13491] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/12/2023] [Revised: 02/14/2024] [Accepted: 02/21/2024] [Indexed: 03/05/2024]

Patel RR, Döllinger M, Jakubaß B, Pinhack H, Katz U, Semmler M. Analyzing Vocal Fold Frequency Dynamics Using High-Speed 3D Laser Video Endoscopy. Laryngoscope 2024;134:3267-3276. [PMID: 38481073 PMCID: PMC11182720 DOI: 10.1002/lary.31394] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/16/2023] [Revised: 02/24/2024] [Accepted: 02/29/2024] [Indexed: 06/18/2024]

Jia SJ, Jing JQ, Yang CJ. A Review on Autism Spectrum Disorder Screening by Artificial Intelligence Methods. J Autism Dev Disord 2024:10.1007/s10803-024-06429-9. [PMID: 38842671 DOI: 10.1007/s10803-024-06429-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Accepted: 05/30/2024] [Indexed: 06/07/2024]

Gao S, Ma EPM. The Relationship Between Voice Parameters and Speech Intelligibility: A Scoping Review. J Voice 2024:S0892-1997(24)00130-9. [PMID: 38755076 DOI: 10.1016/j.jvoice.2024.04.013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/09/2023] [Revised: 04/07/2024] [Accepted: 04/08/2024] [Indexed: 05/18/2024]

Abstract

OBJECTIVE

To synthesize existing evidence of the relationship between voice parameters and speech intelligibility.

METHODS

Following Preferred Reporting Items for Systematic Reviews and Meta-Analysis extension for Scoping Review (PRISMA-ScR) guidelines, 13 databases were searched and a manual search was conducted. A narrative synthesis of methodological quality, study characteristics, participant demographics, voice parameter categorization, and their relationship to speech intelligibility was conducted. A Grading of Recommendations Assessment, Development, and Evaluation (GRADE) assessment was also performed.

RESULTS

A total of 5593 studies were retrieved, and 30 eligible studies were included in the final scoping review. The studies were given scores of 10-25 (average 16.93) out of 34 in the methodological quality assessment. Research that analyzed voice parameters related to speech intelligibility, encompassing perceptual, acoustic, and aerodynamic parameters, was included. Validated and nonvalidated perceptual voice assessments showed divergent results regarding the relationship between perceptual parameters and speech intelligibility. The relationship between acoustic parameters and speech intelligibility was found to be complex and the results were inconsistent. The limited research on aerodynamic parameters did not reach a consensus on their relationship with speech intelligibility. Studies in which listeners were not speech-language pathologists (SLPs) far outnumbered those with SLP listeners, and research conducted in English contexts significantly exceeded that in non-English contexts. The GRADE evaluation indicated that the quality of evidence varied from low to moderate.

DISCUSSION

The results for the relationship between voice parameters and intelligibility showed significant heterogeneity. Future research should consider age-related voice changes and include diverse age groups. To enhance validity and comparability, it will be necessary to report effect sizes, tool validity, inter-rater reliability, and calibration procedures. Voice assessments should account for the validation status of tools because of their potential impact on the outcomes. The linguistic context may also influence the results.

Bonini LDS, Dos Santos AP, Vitor JDS, Brasolotto AG, Antonetti-Carvalho AE, Silverio KCA. Water Resistance Therapy in Individuals with Parkinson's Disease: A Session-by-Session Analysis of the Vocal Quality. J Voice 2024:S0892-1997(24)00106-1. [PMID: 38735802 DOI: 10.1016/j.jvoice.2024.03.031] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/15/2024] [Revised: 03/23/2024] [Accepted: 03/26/2024] [Indexed: 05/14/2024]

Abstract

OBJECTIVES

To verify the session-by-session effects of water resistance therapy (WRT) on the vocal quality of individuals with Parkinson's disease (PD).

METHODS

This is a retrospective analytical study. The samples were drawn from a database of 10 men aged 50 to 90 years diagnosed with PD. The participants underwent WRT with a resonance tube and were guided to perform the following phonatory tasks: comfortable pitch and loudness, high pitch, low pitch, ascending and descending glissandos, and sentence uttering. Tube depth ranged from 2 cm to 9 cm. WRT was implemented twice per week for a total of eight sessions, each lasting 45 minutes. Participants were assessed before and after each therapy session. The data were assessed with spectrographic analysis, vocal intensity, cepstral peak prominence-smoothed, alpha ratio, L1-L0, oscillatory frequency, and auditory-perceptual assessment of overall degree, roughness, breathiness, and instability. One-way repeated measures analysis of variance and Friedman tests were applied (P < 0.05), with Holm-Sidak and Tukey tests as post hoc tests.

RESULTS

After the sixth session, the spectrographic analysis revealed that the tracing color intensity of medium frequencies darkened, with a better result observed after the eighth session. Vocal intensity improved from the third session onward, and L1-L0 followed the same pattern. The overall degree auditory-perceptual assessment revealed the best results only after the second, third, and fourth sessions; however, after the eighth session, the instability increased.

CONCLUSIONS

WRT allowed better results from the third session, with some improvements in the sixth session. However, the instability increased after the eighth session; thus, it is important to review the phonatory tasks and session numbers to avoid an overload in the phonatory system.

Robotti C, Costantini G, Saggio G, Cesarini V, Calastri A, Maiorano E, Piloni D, Perrone T, Sabatini U, Ferretti VV, Cassaniti I, Baldanti F, Gravina A, Sakib A, Alessi E, Pietrantonio F, Pascucci M, Casali D, Zarezadeh Z, Zoppo VD, Pisani A, Benazzo M. Machine Learning-based Voice Assessment for the Detection of Positive and Recovered COVID-19 Patients. J Voice 2024;38:796.e1-796.e13. [PMID: 34965907 PMCID: PMC8616736 DOI: 10.1016/j.jvoice.2021.11.004] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Received: 09/28/2021] [Revised: 11/17/2021] [Accepted: 11/18/2021] [Indexed: 12/12/2022]

Abstract

Many virological tests have been implemented during the Coronavirus Disease 2019 (COVID-19) pandemic for diagnostic purposes, but they appear unsuitable for screening purposes. Furthermore, current screening strategies are not accurate enough to effectively curb the spread of the disease. Therefore, the present study was conducted within a controlled clinical environment to determine possible detectable variations in the voice of COVID-19 patients, recovered and healthy subjects, and also to determine whether machine learning-based voice assessment (MLVA) can accurately discriminate between them, thus potentially serving as a more effective mass-screening tool. Three different subpopulations were consecutively recruited: positive COVID-19 patients, recovered COVID-19 patients and healthy individuals as controls. Positive patients were recruited within 10 days from nasal swab positivity. Recovery from COVID-19 was established clinically, virologically and radiologically. Healthy individuals reported no COVID-19 symptoms and yielded negative results at serological testing. All study participants provided three trials for multiple vocal tasks (sustained vowel phonation, speech, cough). All recordings were initially divided into three different binary classifications with a feature selection, ranking and cross-validated RBF-SVM pipeline. This yielded a mean accuracy of 90.24%, a mean sensitivity of 91.15%, a mean specificity of 89.13% and a mean AUC of 0.94 across all tasks and all comparisons, and identified the sustained vowel as the most effective vocal task for COVID discrimination. Moreover, a three-way classification was carried out on an external test set comprising 30 subjects, 10 per class, with a mean accuracy of 80% and an accuracy of 100% for the detection of positive subjects. Within this assessment, recovered individuals proved to be the most difficult class to identify, and all the misclassified subjects were declared positive; this might be related to mid- and short-term vocal traces of COVID-19, even after the clinical resolution of the infection. In conclusion, MLVA may accurately discriminate between positive COVID-19 patients, recovered COVID-19 patients and healthy individuals. Further studies should test MLVA among larger populations and asymptomatic positive COVID-19 patients to validate this novel screening technology and test its application as a potentially more effective surveillance strategy for COVID-19.

Affiliation(s)

Carlo Robotti Department of Otolaryngology - Head and Neck Surgery, Fondazione IRCCS Policlinico San Matteo, Pavia, Italy; Department of Clinical, Surgical, Diagnostic and Pediatric Sciences, University of Pavia, Pavia, Italy
Giovanni Costantini Department of Electronic Engineering, University of Rome Tor Vergata, Rome, Italy
Giovanni Saggio Department of Electronic Engineering, University of Rome Tor Vergata, Rome, Italy
Valerio Cesarini Department of Electronic Engineering, University of Rome Tor Vergata, Rome, Italy
Anna Calastri Department of Otolaryngology - Head and Neck Surgery, Fondazione IRCCS Policlinico San Matteo, Pavia, Italy
Eugenia Maiorano Department of Otolaryngology - Head and Neck Surgery, Fondazione IRCCS Policlinico San Matteo, Pavia, Italy
Davide Piloni Pneumology Unit, Fondazione IRCCS Policlinico San Matteo, Pavia, Italy
Tiziano Perrone Department of Internal Medicine, Fondazione IRCCS Policlinico San Matteo, University of Pavia, Pavia, Italy
Umberto Sabatini Department of Internal Medicine, Fondazione IRCCS Policlinico San Matteo, University of Pavia, Pavia, Italy
Virginia Valeria Ferretti Clinical Epidemiology and Biometry Unit, Fondazione IRCCS Policlinico San Matteo Foundation, Pavia, Italy
Irene Cassaniti Molecular Virology Unit, Microbiology and Virology Department, Fondazione IRCCS Policlinico San Matteo, Pavia, Italy
Fausto Baldanti Department of Clinical, Surgical, Diagnostic and Pediatric Sciences, University of Pavia, Pavia, Italy; Molecular Virology Unit, Microbiology and Virology Department, Fondazione IRCCS Policlinico San Matteo, Pavia, Italy
Andrea Gravina Otorhinolaryngology Department, University of Rome Tor Vergata, Rome, Italy
Ahmed Sakib Otorhinolaryngology Department, University of Rome Tor Vergata, Rome, Italy
Elena Alessi Internal Medicine Unit, Ospedale dei Castelli ASL Roma 6, Ariccia, Italy
Filomena Pietrantonio Internal Medicine Unit, Ospedale dei Castelli ASL Roma 6, Ariccia, Italy
Matteo Pascucci Internal Medicine Unit, Ospedale dei Castelli ASL Roma 6, Ariccia, Italy
Daniele Casali Department of Electronic Engineering, University of Rome Tor Vergata, Rome, Italy
Zakarya Zarezadeh Department of Electronic Engineering, University of Rome Tor Vergata, Rome, Italy
Vincenzo Del Zoppo Department of Electronic Engineering, University of Rome Tor Vergata, Rome, Italy
Antonio Pisani Department of Brain and Behavioral Sciences, University of Pavia, Pavia, Italy; IRCCS Mondino Foundation, Pavia, Italy
Marco Benazzo Department of Otolaryngology - Head and Neck Surgery, Fondazione IRCCS Policlinico San Matteo, Pavia, Italy; Department of Clinical, Surgical, Diagnostic and Pediatric Sciences, University of Pavia, Pavia, Italy

Cao S, Rosenzweig I, Bilotta F, Jiang H, Xia M. Automatic detection of obstructive sleep apnea based on speech or snoring sounds: a narrative review. J Thorac Dis 2024;16:2654-2667. [PMID: 38738242 PMCID: PMC11087644 DOI: 10.21037/jtd-24-310] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/26/2024] [Accepted: 04/15/2024] [Indexed: 05/14/2024]

Oreskovic J, Kaufman J, Fossat Y. Impact of Audio Data Compression on Feature Extraction for Vocal Biomarker Detection: Validation Study. JMIR Biomed Eng 2024;9:e56246. [PMID: 38875677 PMCID: PMC11058552 DOI: 10.2196/56246] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/10/2024] [Revised: 02/28/2024] [Accepted: 03/23/2024] [Indexed: 06/16/2024]

Abstract

BACKGROUND

Vocal biomarkers, derived from acoustic analysis of vocal characteristics, offer noninvasive avenues for medical screening, diagnostics, and monitoring. Previous research demonstrated the feasibility of predicting type 2 diabetes mellitus through acoustic analysis of smartphone-recorded speech. Building upon this work, this study explores the impact of audio data compression on acoustic vocal biomarker development, which is critical for broader applicability in health care.

OBJECTIVE

The objective of this research is to analyze how common audio compression algorithms (MP3, M4A, and WMA) applied by 3 different conversion tools at 2 bitrates affect features crucial for vocal biomarker detection.

METHODS

The impact of audio data compression on acoustic vocal biomarker development was investigated using uncompressed voice samples converted into MP3, M4A, and WMA formats at 2 bitrates (320 and 128 kbps) with MediaHuman (MH) Audio Converter, WonderShare (WS) UniConverter, and Fast Forward Moving Picture Experts Group (FFmpeg). The data set comprised recordings from 505 participants, totaling 17,298 audio files, collected using a smartphone. Participants recorded a fixed English sentence up to 6 times daily for up to 14 days. Feature extraction, including pitch, jitter, intensity, and Mel-frequency cepstral coefficients (MFCCs), was conducted using Python and Parselmouth. The Wilcoxon signed rank test and the Bonferroni correction for multiple comparisons were used for statistical analysis.

RESULTS

In this study, 36,970 audio files were initially recorded from 505 participants, with 17,298 recordings meeting the fixed sentence criteria after screening. Differences between the audio conversion software, MH, WS, and FFmpeg, were notable, impacting compression outcomes such as constant or variable bitrates. Analysis encompassed diverse data compression formats and a wide array of voice features and MFCCs. Wilcoxon signed rank tests yielded P values, with those below the Bonferroni-corrected significance level indicating significant alterations due to compression. The results indicated feature-specific impacts of compression across formats and bitrates. MH-converted files exhibited greater resilience compared to WS-converted files. Bitrate also influenced feature stability, with 38 cases affected uniquely by a single bitrate. Notably, voice features showed greater stability than MFCCs across conversion methods.

CONCLUSIONS

Compression effects were found to be feature specific, with MH and FFmpeg showing greater resilience. Some features were consistently affected, emphasizing the importance of understanding feature resilience for diagnostic applications. Considering the implementation of vocal biomarkers in health care, finding features that remain consistent through compression for data storage or transmission purposes is valuable. Focused on specific features and formats, future research could broaden the scope to include diverse features, real-time compression algorithms, and various recording methods. This study enhances our understanding of audio compression's influence on voice features and MFCCs, providing insights for developing applications across fields. The research underscores the significance of feature stability in working with compressed audio data, laying a foundation for informed voice data use in evolving technological landscapes.

Li Z, Zhang D, Chen H, Liu Y, Wang HC. Voice Pitch Shaping and Genderization: New Needs of Cosmetic Phonoplastic Surgery. Aesthetic Plast Surg 2024:10.1007/s00266-024-03919-0. [PMID: 38565723 DOI: 10.1007/s00266-024-03919-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/18/2023] [Accepted: 02/08/2024] [Indexed: 04/04/2024]

Cruz DRD, Zheng A, Debele T, Larson P, Dion GR, Park YC. Drug delivery systems for wound healing treatment of upper airway injury. Expert Opin Drug Deliv 2024;21:573-591. [PMID: 38588553 PMCID: PMC11208077 DOI: 10.1080/17425247.2024.2340653] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2023] [Accepted: 04/04/2024] [Indexed: 04/10/2024]

Sarmet M, Santos DB, Mangilli LD, Million JL, Maldaner V, Zeredo JL. Chronic respiratory failure negatively affects speech function in patients with bulbar and spinal onset amyotrophic lateral sclerosis: retrospective data from a tertiary referral center. LOGOP PHONIATR VOCO 2024;49:17-26. [PMID: 35767076 DOI: 10.1080/14015439.2022.2092209] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2021] [Revised: 02/04/2022] [Accepted: 06/15/2022] [Indexed: 10/17/2022]

Abstract

Background: Although dysarthria and respiratory failure are widely described in the literature as part of the natural history of amyotrophic lateral sclerosis (ALS), the specific interaction between them has been little explored. Aim: To investigate the relationship between chronic respiratory failure and the speech of ALS patients. Materials and methods: In this cross-sectional retrospective study, we reviewed the medical records of all patients diagnosed with ALS who were followed at a tertiary referral center. To determine the presence and degree of speech impairment, the Amyotrophic Lateral Sclerosis Functional Rating Scale-Revised (ALSFRS-R) speech subscale was used. Respiratory function was assessed through spirometry and through venous blood gasometry obtained from a morning peripheral venous sample. To determine whether differences among groups classified by speech function were significant, maximum and mean spirometry values of participants were compared using multivariate analysis of variance (MANOVA) with Tukey's post hoc test. Results: Seventy-five cases were selected, of which 73.3% presented speech impairment and 70.7% respiratory impairment. Respiratory and speech functions were moderately correlated (seated FVC r = 0.64; supine FVC r = 0.60; seated FEV1 r = 0.59 and supine FEV1 r = 0.54, p < .001). Multivariable logistic regression revealed that the following variables were significantly associated with the presence of speech impairment after adjusting for other risk factors: seated FVC (odds ratio [OR] = 0.862) and seated FEV1 (OR = 1.106). The final model was 81.1% predictive of speech impairment. The presence of daytime hypercapnia was not correlated with increasing speech impairment. Conclusion: The restrictive pattern developed by ALS patients negatively influences speech function. Speech is a complex and multifactorial process, and lung volume plays a pivotal role in its function. Lung volumes correlated significantly with speech function, especially in patients with bulbar onset and respiratory impairment. Neurobiological and physiological aspects of this relationship should be explored in further studies with the ALS population.


Franzone R, Petrigna L, Signorelli D, Musumeci G. The Relationship between Posture and Muscle Tensive Dysphonia in Teachers: A Systematic Scoping Review. J Funct Morphol Kinesiol 2024;9:60. [PMID: 38651418 PMCID: PMC11036206 DOI: 10.3390/jfmk9020060] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/31/2024] [Revised: 03/26/2024] [Accepted: 03/27/2024] [Indexed: 04/25/2024] Open

Che Z, Wan X, Xu J, Duan C, Zheng T, Chen J. Speaking without vocal folds using a machine-learning-assisted wearable sensing-actuation system. Nat Commun 2024;15:1873. [PMID: 38472193 PMCID: PMC10933441 DOI: 10.1038/s41467-024-45915-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2023] [Accepted: 02/06/2024] [Indexed: 03/14/2024] Open

Park J, Choi S, Takatoh J, Zhao S, Harrahill A, Han BX, Wang F. Brainstem control of vocalization and its coordination with respiration. Science 2024;383:eadi8081. [PMID: 38452069 PMCID: PMC11223444 DOI: 10.1126/science.adi8081] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2023] [Accepted: 01/18/2024] [Indexed: 03/09/2024]

Schlegel P, Rhyn Chung H, Döllinger M, Chhetri DK. Reconstruction of Vocal Fold Medial Surface 3D Trajectories: Effects of Neuromuscular Stimulation and Airflow. Laryngoscope 2024;134:1249-1257. [PMID: 37672673 PMCID: PMC10915101 DOI: 10.1002/lary.31029] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2023] [Revised: 08/12/2023] [Accepted: 08/22/2023] [Indexed: 09/08/2023]

Abstract

INTRODUCTION

Analysis of medial surface dynamics of the vocal folds (VF) is critical to understanding voice production and treatment of voice disorders. We analyzed VF medial surface vibratory dynamics, evaluating the effects of airflow and nerve stimulation using 3D reconstruction and empirical eigenfunctions (EEF).

STUDY DESIGN

In vivo canine hemilarynx phonation.

METHODS

An in vivo canine hemilarynx was phonated while graded stimulation of the recurrent and superior laryngeal nerves (RLN and SLN) was performed. For each phonatory condition, vibratory cycles were 3D reconstructed from tattooed landmarks on the VF medial surface at low, medium, and high airflows. Parameters describing medial surface trajectory shape were calculated, and underlying patterns were emphasized using EEFs. Fundamental frequency and smoothed cepstral peak prominence (CPPS) were calculated from acoustic data.

RESULTS

Convex-hull area of landmark trajectories increased with increasing flow and decreasing nerve activation level. Trajectory shapes observed included circular, ellipsoid, bent, and figure-eight. They were more circular on the superior and anterior VF, and more elliptical and line-like on the inferior and posterior VF. The EEFs capturing synchronal opening and closing (EEF1) and alternating convergent/divergent (EEF2) glottis shapes were mostly unaffected by flow and nerve stimulation levels. CPPS increased with higher airflow except for low RLN activation and very dominant SLN stimulation.

CONCLUSION

We analyzed VF vibration as a function of neuromuscular stimulation and airflow levels. Oscillation patterns such as figure-eight and bent trajectories were linked to high nerve activation and flow. Further studies investigating longer sections of 3D reconstructed oscillations are needed.

LEVEL OF EVIDENCE

N/A, Basic Science. Laryngoscope, 134:1249-1257, 2024.
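The convex-hull area used in this abstract as a measure of trajectory size can be illustrated with a short sketch. It assumes each landmark trajectory has already been projected onto a 2D plane (the abstract does not say how the area was computed from the 3D reconstruction), and the function names are hypothetical. The hull is built with Andrew's monotone chain algorithm and its area is taken with the shoelace formula.

```python
def convex_hull(points):
    """Return the convex hull of 2D points in counter-clockwise order."""
    pts = sorted(set(points))
    if len(pts) <= 2:
        return pts

    def cross(o, a, b):
        # Positive when o -> a -> b makes a counter-clockwise turn.
        return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])

    lower = []
    for p in pts:
        while len(lower) >= 2 and cross(lower[-2], lower[-1], p) <= 0:
            lower.pop()
        lower.append(p)

    upper = []
    for p in reversed(pts):
        while len(upper) >= 2 and cross(upper[-2], upper[-1], p) <= 0:
            upper.pop()
        upper.append(p)

    # The last point of each chain is the first point of the other.
    return lower[:-1] + upper[:-1]


def polygon_area(vertices):
    """Area of a simple polygon via the shoelace formula."""
    n = len(vertices)
    twice_area = 0.0
    for i in range(n):
        x1, y1 = vertices[i]
        x2, y2 = vertices[(i + 1) % n]
        twice_area += x1 * y2 - x2 * y1
    return abs(twice_area) / 2


def convex_hull_area(trajectory):
    """Convex-hull area of one landmark trajectory given as (x, y) tuples."""
    return polygon_area(convex_hull(trajectory))
```

A line-like trajectory yields an area near zero under this measure, while a circular one of the same extent yields a large area, which is consistent with how the abstract contrasts trajectory shapes.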


Schlegel P, Berry DA, Moffatt C, Zhang Z, Chhetri DK. Register transitions in an in vivo canine model as a function of intrinsic laryngeal muscle stimulation, fundamental frequency, and sound pressure level. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2024;155:2139-2150. [PMID: 38498507 PMCID: PMC10954347 DOI: 10.1121/10.0025135] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/12/2023] [Revised: 01/09/2024] [Accepted: 02/16/2024] [Indexed: 03/20/2024]

Riede T, Kobrina A, Pasch B. Anatomy and mechanisms of vocal production in harvest mice. J Exp Biol 2024;227:jeb246553. [PMID: 38269528 DOI: 10.1242/jeb.246553] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2023] [Accepted: 01/18/2024] [Indexed: 01/26/2024]

Elemans CPH, Jiang W, Jensen MH, Pichler H, Mussman BR, Nattestad J, Wahlberg M, Zheng X, Xue Q, Fitch WT. Evolutionary novelties underlie sound production in baleen whales. Nature 2024;627:123-129. [PMID: 38383781 DOI: 10.1038/s41586-024-07080-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2023] [Accepted: 01/16/2024] [Indexed: 02/23/2024]

Kreiman J. Information conveyed by voice quality. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2024;155:1264-1271. [PMID: 38345424 DOI: 10.1121/10.0024609] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/21/2023] [Accepted: 01/09/2024] [Indexed: 02/15/2024]

Delviniotis DS, Theodoridis S, Delvinioti N. Aerodynamic Parameters in Byzantine Chant Voices: Comparisons Across Pitch and Loudness. J Voice 2024:S0892-1997(23)00413-7. [PMID: 38246827 DOI: 10.1016/j.jvoice.2023.12.021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2023] [Revised: 12/27/2023] [Accepted: 12/27/2023] [Indexed: 01/23/2024]

Abstract

OBJECTIVE

This study was designed to assess the impact of phonation frequency and loudness increase on aerodynamic parameters of the singing voice in Byzantine chant (BC).

DESIGN

Aerodynamic measurements in BC were obtained and statistically analyzed.

METHOD

Fifteen experienced BC chanters, all baritones, performed the ascending notes G2, C3, E3, G3, C4, E4, and G4, at normal and high levels of loudness within a mask, while repeating strings of /pi/ syllables. The parameters of airflow (FR), subglottal pressure (Psub), and sound pressure level (SPL) were directly measured, and from them, the glottal flow resistance (Rg) and vocal efficiency (VE) were calculated. All the parameters' values were statistically analyzed.

RESULTS

Statistically significant differences for FR, Psub, and SPL parameters in BC between the two loudness levels, at constant pitch, and for Psub, SPL, Rg, and VE among different pitches, at constant loudness levels were detected. When loudness increases, a) only the mean values of FR, Psub, and SPL, within C3-C4, increase, whereas those of Rg and VE do not show any change, and b) at G2, only the mean Psub increases, while in the upper range E4-G4, both mean SPL and mean VE decrease. When pitch is raised, a) for each level of loudness, within G2-E4 pitch range, the means of Psub, SPL, Rg, and VE increase while this is not the case for FR, and b) in the highest range (E4-G4), average SPL and VE drop while Rg and Psub remain stable. Our findings suggest that: a) most participants increase Psub and SPL without modification of Rg when loudness increases, and b) most participants increase both SPL and Psub while changing Rg with pitch rise. Idiosyncratic differences among the participants were detected in Rg and Psub, because of pitch rise, and, also, in Rg and VE due to loudness increase.

CONCLUSIONS

The results from this study reveal that, within the C3-C4 pitch range: a) there is independent control between the loudness and glottal adduction, and b) Psub is the main tool for increasing both the loudness and SPL. For some exceptions among the participants, either the Rg alteration or other modifications of the vocal system are, possibly, the cause of the loudness increase. The increased mean values of SPL, Rg, and Psub with pitch rise, for most participants, suggest that both glottal adduction and Psub increase together with the SPL and pitch increase. The VE increase within G2-E4 pitches reaches a maximum value at E4. Some exceptions among the participants exist that suggest the possible use of different phonatory strategies when changing either the pitch or the vocal loudness.
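The two derived quantities in this abstract, glottal flow resistance (Rg) and vocal efficiency (VE), can be sketched from their common textbook definitions: Rg as subglottal pressure divided by airflow, and VE as radiated acoustic power divided by aerodynamic power. The abstract does not state the exact formulas or units the authors used, so the units, the spherical-radiation assumption, the free-field treatment of SPL as intensity level, and the function names below are all assumptions for illustration.

```python
import math

CMH2O_TO_PA = 98.0665        # 1 cmH2O expressed in pascals
REFERENCE_INTENSITY = 1e-12  # W/m^2, reference for sound intensity level


def glottal_flow_resistance(psub_cmh2o, flow_l_per_s):
    """Rg in cmH2O/(L/s): subglottal pressure divided by mean airflow."""
    return psub_cmh2o / flow_l_per_s


def vocal_efficiency(spl_db, distance_m, psub_cmh2o, flow_l_per_s):
    """Ratio of radiated acoustic power to aerodynamic power (dimensionless).

    Assumes free-field spherical radiation, so the intensity at the
    microphone is spread over a sphere of radius distance_m.
    """
    intensity = REFERENCE_INTENSITY * 10 ** (spl_db / 10)        # W/m^2
    acoustic_power = intensity * 4 * math.pi * distance_m ** 2   # W
    pressure_pa = psub_cmh2o * CMH2O_TO_PA                       # Pa
    flow_m3_per_s = flow_l_per_s * 1e-3                          # m^3/s
    aerodynamic_power = pressure_pa * flow_m3_per_s              # W
    return acoustic_power / aerodynamic_power
```

Under these definitions, a louder tone produced by raising Psub with proportionally higher airflow leaves Rg unchanged, which is the pattern the abstract reports for most participants within C3-C4.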


Sauder CL, Kapsner-Smith MR, Simmons E, Meyer T, Doyle PC, Eadie TL. The Effect of Rating Method on Reliability of Judgments of Strain Across Populations. AMERICAN JOURNAL OF SPEECH-LANGUAGE PATHOLOGY 2024;33:393-405. [PMID: 38060689 PMCID: PMC11000812 DOI: 10.1044/2023_ajslp-23-00174] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/16/2023] [Revised: 08/17/2023] [Accepted: 10/17/2023] [Indexed: 01/05/2024]

Abstract

PURPOSE

Variability in auditory-perceptual ratings of voice limits their utility, with the poorest reliability often noted for vocal strain. The purpose of this study was to determine whether an experimental method, called visual sort and rate (VSR), promoted stronger rater reliability than visual analog scale (VAS), for ratings of strain in two clinical populations: adductor laryngeal dystonia (ADLD) and vocal hyperfunction (VH).

METHOD

Connected speech samples from speakers with ADLD and VH as well as age- and sex-matched controls were selected from a database. Fifteen inexperienced listeners rated strain for two speaker sets (25 ADLD speakers and five controls; 25 VH speakers and five controls) across four rating blocks: VAS-ADLD, VSR-ADLD, VAS-VH, and VSR-VH. For the VAS task, listeners rated each speaker for strain using a vertically oriented 100-mm VAS. For the VSR task, stimuli were distributed into sets of samples with a range of severities in each set. Listeners sorted and ranked samples for strain within each set, and final ratings were captured on a vertically oriented 100-mm VAS. Intrarater reliability (Pearson's r) and interrater variability (mean of the squared differences between a listener's ratings and group mean ratings) were compared across rating methods and populations using two repeated-measures analyses of variance.

RESULTS

Intrarater reliability of strain was significantly stronger when listeners used VSR compared to VAS; listeners also showed significantly better intrarater reliability in ADLD than VH. Listeners demonstrated significantly less interrater variability (better reliability) when using VSR compared to VAS. No significant effect of population or interactions was found between listeners for measures of interrater variability.

CONCLUSIONS

VSR increases intrarater reliability for ratings of vocal strain in speakers with VH and ADLD. VSR decreases variability of auditory-perceptual judgments of strain between inexperienced listeners in these clinical populations. Future research should determine whether benefits of VSR extend to voice clinicians and/or clinical settings.
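The two reliability measures named in the method (Pearson's r for intrarater reliability, and the mean of the squared differences between a listener's ratings and the group mean ratings for interrater variability) can be sketched as below. The data layout, a dictionary mapping each listener to one rating per speaker, and the function names are assumptions made for illustration.

```python
import math


def pearson_r(first_pass, second_pass):
    """Pearson correlation between a listener's first and repeated ratings."""
    n = len(first_pass)
    mean_a = sum(first_pass) / n
    mean_b = sum(second_pass) / n
    cov = sum((a - mean_a) * (b - mean_b) for a, b in zip(first_pass, second_pass))
    var_a = sum((a - mean_a) ** 2 for a in first_pass)
    var_b = sum((b - mean_b) ** 2 for b in second_pass)
    return cov / math.sqrt(var_a * var_b)


def interrater_variability(ratings_by_listener):
    """Mean squared difference between each listener's ratings and the group mean.

    ratings_by_listener maps a listener ID to a list with one rating per
    speaker, in the same speaker order for every listener. Lower values
    mean closer agreement with the group.
    """
    listeners = list(ratings_by_listener)
    n_speakers = len(ratings_by_listener[listeners[0]])
    group_mean = [
        sum(ratings_by_listener[name][i] for name in listeners) / len(listeners)
        for i in range(n_speakers)
    ]
    return {
        name: sum((r - g) ** 2 for r, g in zip(ratings_by_listener[name], group_mean))
        / n_speakers
        for name in listeners
    }
```

Note that the two measures answer different questions: a listener who is consistently 10 mm above the group on the 100-mm scale can have perfect intrarater correlation while still contributing to interrater variability.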


Chung HR, Reddy NK, Manzoor D, Schlegel P, Zhang Z, Chhetri DK. Histologic Examination of Vocal Fold Mucosal Wave and Vibration. Laryngoscope 2024;134:264-271. [PMID: 37522475 PMCID: PMC10828106 DOI: 10.1002/lary.30928] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2023] [Revised: 05/29/2023] [Accepted: 07/18/2023] [Indexed: 08/01/2023]

Abstract

OBJECTIVES

Despite gross anatomic and histologic differences between human and canine vocal folds, similar wave patterns have been described yet not fully characterized. We reconstructed vocal fold (VF) vibration in a canine hemilarynx and performed histologic examination of the same vocal fold. We demonstrate comparable wave patterns while exploring the importance of certain anatomic architectures.

METHODS

An in vivo canine hemilarynx was phonated against a glass prism at low and high muscle activation conditions. Vibration was captured using high-speed video, and trajectories of VF medial surface tattooed landmarks were 3D-reconstructed. The method of empirical eigenfunctions was used to capture the essential dynamics of vibratory movement. Histologic examination of the hemilarynx was performed.

RESULTS

Oscillation patterns were highly similar between the in vivo canine and previous reports of ex vivo human models. The two most dominant eigenfunctions comprised over 90% of total variance of movement, representing opening/closing and convergent/divergent movement patterns, respectively. We demonstrate a vertical phase difference during the glottal cycle. The time delay between the inferior and superior VF was greater during opening than closing for both activation conditions. Histological examination of canine VF showed not only a thicker lamina propria layer superiorly but also a distinct pattern of thyroarytenoid muscle fibers and fascicles as described in human studies.

CONCLUSIONS

Histologic and vibratory examination of the canine vocal fold demonstrated human vocal fold vibratory patterns despite certain microstructural differences. This study suggests that the multilayered lamina propria may not be fundamental to vibratory patterns necessary for human-like voice production.

LEVEL OF EVIDENCE

NA (Basic science study). Laryngoscope, 134:264-271, 2024.
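The method of empirical eigenfunctions mentioned in this abstract is closely related to principal component analysis of the landmark coordinates over time, and the reported share of total variance per eigenfunction follows from the eigenvalues of the covariance matrix. The sketch below is a dependency-free illustration under that assumption, not the authors' implementation: it uses power iteration with deflation where a linear-algebra library would normally be used, and the function name and data layout (one row of landmark coordinates per video frame) are hypothetical.

```python
import math
import random


def empirical_eigenfunctions(frames, n_modes=2, iterations=200, seed=0):
    """Return [(mode_shape, variance_fraction), ...] for the leading modes.

    frames: list of rows, one per time frame, each holding all landmark
    coordinates for that frame. Shapes are unit vectors, defined up to sign.
    """
    rng = random.Random(seed)
    n_frames, n_coords = len(frames), len(frames[0])

    # Remove the temporal mean so the modes describe motion, not position.
    means = [sum(f[j] for f in frames) / n_frames for j in range(n_coords)]
    centered = [[f[j] - means[j] for j in range(n_coords)] for f in frames]

    cov = [[sum(row[i] * row[j] for row in centered) / n_frames
            for j in range(n_coords)] for i in range(n_coords)]
    total_variance = sum(cov[i][i] for i in range(n_coords))
    if total_variance == 0.0:
        raise ValueError("frames contain no motion")

    modes = []
    for _mode in range(n_modes):
        v = [rng.uniform(-1.0, 1.0) for _ in range(n_coords)]
        for _step in range(iterations):  # power iteration
            w = [sum(cov[i][j] * v[j] for j in range(n_coords))
                 for i in range(n_coords)]
            norm = math.sqrt(sum(c * c for c in w))
            if norm == 0.0:
                break
            v = [c / norm for c in w]
        eigenvalue = sum(v[i] * cov[i][j] * v[j]
                         for i in range(n_coords) for j in range(n_coords))
        modes.append((v, eigenvalue / total_variance))
        # Deflate: remove the found mode so the next pass finds the next one.
        cov = [[cov[i][j] - eigenvalue * v[i] * v[j]
                for j in range(n_coords)] for i in range(n_coords)]
    return modes
```

Summing the variance fractions of the first two modes gives the kind of figure the abstract reports (over 90% of total variance for the opening/closing and convergent/divergent patterns).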


Sundberg J, Salomão GL, Scherer KR. Emotional expressivity in singing. Assessing physiological and acoustic indicators of two opera singers' voice characteristics. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2024;155:18-28. [PMID: 38169520 DOI: 10.1121/10.0023938] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/24/2023] [Accepted: 11/21/2023] [Indexed: 01/05/2024]

Cavalcanti JC, Eriksson A, Barbosa PA. Multiparametric Analysis of Speaking Fundamental Frequency in Genetically Related Speakers Using Different Speech Materials: Some Forensic Implications. J Voice 2024;38:243.e11-243.e29. [PMID: 34629229 DOI: 10.1016/j.jvoice.2021.08.013] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2021] [Revised: 08/03/2021] [Accepted: 08/09/2021] [Indexed: 11/18/2022]

Luizard P, Bailly L, Yousefi-Mashouf H, Girault R, Orgéas L, Henrich Bernardoni N. Flow-induced oscillations of vocal-fold replicas with tuned extensibility and material properties. Sci Rep 2023;13:22658. [PMID: 38114547 PMCID: PMC10730560 DOI: 10.1038/s41598-023-48080-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2023] [Accepted: 11/22/2023] [Indexed: 12/21/2023] Open

Mandour YMH, El Hamshary A, Abdel-Elhay SA, Abdel-Hamid MS, Gomaa M. Laryngeal Changes After Septoplasty and Turbinectomy. Indian J Otolaryngol Head Neck Surg 2023;75:3242-3247. [PMID: 37974822 PMCID: PMC10645820 DOI: 10.1007/s12070-023-03951-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2023] [Accepted: 06/08/2023] [Indexed: 11/19/2023] Open

Burk F, Traser L, Burdumy M, Richter B, Echternach M. Dynamic changes of vocal tract dimensions with sound pressure level during messa di voce. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2023;154:3595-3603. [PMID: 38038612 DOI: 10.1121/10.0022582] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/12/2023] [Accepted: 11/14/2023] [Indexed: 12/02/2023]

Paiva GM, Silva POC, Silva LJAD, Nascimento KA, Silva ABDVE, Abreu SRD, Almeida AAFD, Lopes LW. Spectral and cepstral measurements in women with behavioral dysphonia. Codas 2023;36:e20220327. [PMID: 37970895 DOI: 10.1590/2317-1782/20232022327pt] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2022] [Accepted: 03/20/2023] [Indexed: 11/19/2023] Open

Bouhabel S, Park S, Kolosova K, Latifi N, Kost K, Li-Jessen NYK, Mongeau L. Functional Analysis of Injectable Substance Treatment on Surgically Injured Rabbit Vocal Folds. J Voice 2023;37:829-839. [PMID: 34353684 PMCID: PMC8807745 DOI: 10.1016/j.jvoice.2021.06.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2019] [Revised: 05/28/2021] [Accepted: 06/02/2021] [Indexed: 02/04/2023]

Abstract

OBJECTIVES

The objective of this study was to evaluate the efficacy of immediate injection treatments of dexamethasone, hyaluronic acid (HA)/gelatin (Ge) hydrogel and glycol-chitosan solution on the phonatory function of rabbit larynges at 42 days after surgical injury of the vocal folds, piloting a novel ex vivo phonatory functional analysis protocol.

METHODS

A modified microflap procedure was performed on the left vocal fold of 12 rabbits to induce an acute injury. Animals were randomized into one of four treatment groups with 0.1 mL injections of dexamethasone, HA/Ge hydrogel, glycol-chitosan or saline as control. The left mid vocal fold lamina propria was injected immediately following injury. The right vocal fold served as an uninjured control. Larynges were harvested at Day 42 after injection, then were subjected to airflow-bench evaluation. Acoustic, aerodynamic and laryngeal high-speed videoendoscopy (HSV) analyses were performed. HSV segments of the vibrating vocal folds were rated by three expert laryngologists. Six parameters related to vocal fold vibratory characteristics were evaluated on a Likert scale.

RESULTS

The fundamental frequency, one possible surrogate of vocal fold stiffness and scarring, was lower in the dexamethasone and HA/Ge hydrogel treatment groups compared to that of the saline control (411.52±11.63 Hz). The lowest fundamental frequency value was observed in the dexamethasone group (348.79±14.99 Hz). Expert visual ratings of the HSV segments indicated an overall positive outcome in the dexamethasone treatment group, though the impacts were below statistical significance.

CONCLUSION

Dexamethasone injections might be used as an adjunctive option for iatrogenic vocal fold scarring. An increased sample size, histological correlate, and experimental method improvements will be needed to confirm this finding. Results suggested a promising use of HSV and acoustic analysis techniques to identify and monitor post-surgical vocal fold repair and scarring, providing a useful tool for future studies of vocal fold scar treatments.


Albino DDO, do Nascimento UN, Plec EMRL, Santos MAR, Gama ACC. Comparison between the acoustic fundamental frequency of the voice and the vibration frequency of the vocal folds analyzed by digital kymography. Codas 2023;35:e20220173. [PMID: 37909493 PMCID: PMC10702710 DOI: 10.1590/2317-1782/20232022173pt] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2022] [Accepted: 11/07/2022] [Indexed: 11/03/2023] Open

Abstract

PURPOSE

To compare the frequency of vocal fold opening variation, analyzed by digital kymography, with the fundamental voice frequency obtained by acoustic analysis, in individuals without laryngeal alteration.

METHODS

Observational analytical cross-sectional study. The participants were 48 women and 38 men from 18 to 55 years of age. Voice acoustic analysis used the habitual emission of the vowel /a/ for 3 seconds and recitation of the days of the week; digital kymography (DKG) used the habitual emission of the vowels /i/ and /ɛ/. The measurements analyzed were the acoustic fundamental frequency (f0), extracted by the Computerized Speech Lab (CSL) program, and the dominant frequency of the variation of right (R-freq) and left (L-freq) vocal fold opening, obtained through the KIPS image processing program. Kymograms were constructed by manually demarcating the region with vertical lines delimiting its width and horizontal lines separating the posterior, middle, and anterior thirds of the rima glottidis. In the statistical analysis, the Anderson-Darling test was used to verify the normality of the sample. The ANOVA and Tukey tests were performed for the comparison of measurements between the groups. For the comparison of age between the groups, the Mann-Whitney test was used.

RESULTS

No differences were found between the frequency values measured by digital kymography and the acoustic fundamental frequency in individuals without laryngeal alteration.

CONCLUSION

The values of the dominant frequency of the vocal folds opening variation, as assessed by digital kymography, and the acoustic fundamental frequency of the voice are similar, allowing comparison between these measurements in the multidimensional evaluation of the voice, in individuals without laryngeal alteration.
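The notion of a dominant frequency of vocal fold opening variation can be illustrated by taking the strongest spectral peak of a glottal-opening time series. This sketch is not the KIPS procedure, whose internals the abstract does not describe: it assumes the opening width has already been extracted per video frame, uses a plain discrete Fourier transform, and the function name and frame rate are hypothetical.

```python
import cmath
import math


def dominant_frequency(opening_width, frame_rate_hz):
    """Frequency (Hz) of the largest non-DC peak in the DFT of the signal.

    opening_width: one glottal-opening measurement per video frame.
    Resolution is frame_rate_hz / len(opening_width).
    """
    n = len(opening_width)
    mean = sum(opening_width) / n
    centered = [x - mean for x in opening_width]  # drop the constant offset

    best_bin, best_magnitude = 0, -1.0
    for k in range(1, n // 2 + 1):  # positive frequencies up to Nyquist
        coefficient = sum(
            x * cmath.exp(-2j * math.pi * k * i / n)
            for i, x in enumerate(centered)
        )
        if abs(coefficient) > best_magnitude:
            best_bin, best_magnitude = k, abs(coefficient)
    return best_bin * frame_rate_hz / n
```

Comparing this value with the acoustic f0 of the same phonation is the kind of check the study performs; for a healthy, periodic vibration the two are expected to coincide within the frequency resolution of the analysis.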


Jiang W, Zheng X, Farbos de Luzan C, Oren L, Gutmark E, Xue Q. The Effects of Negative Pressure Induced by Flow Separation Vortices on Vocal Fold Dynamics during Voice Production. Bioengineering (Basel) 2023;10:1215. [PMID: 37892945 PMCID: PMC10604472 DOI: 10.3390/bioengineering10101215] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2023] [Revised: 09/22/2023] [Accepted: 10/17/2023] [Indexed: 10/29/2023] Open

Zhang Z. The influence of source-filter interaction on the voice source in a three-dimensional computational model of voice production. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2023;154:2462-2475. [PMID: 37855666 PMCID: PMC10589054 DOI: 10.1121/10.0021879] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/12/2023] [Revised: 09/28/2023] [Accepted: 09/30/2023] [Indexed: 10/20/2023]

Vurma A, Meister E, Meister L, Ross J, Raju M, Kala V, Dede T. The intensities of vowels and plosive bursts and their impact on text intelligibility in singing. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2023;154:2653-2664. [PMID: 37877771 DOI: 10.1121/10.0021968] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/30/2023] [Accepted: 10/05/2023] [Indexed: 10/26/2023]

Behlau M, Madazio G, Yamasaki R. Dynamic vocal analysis: vocal functionality evaluation. Codas 2023;35:e20210083. [PMID: 37729254 PMCID: PMC10546986 DOI: 10.1590/2317-1782/20232021083pt] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2021] [Accepted: 10/10/2022] [Indexed: 09/22/2023] Open

Qayyum U, Mumtaz N, Saqulain G. Vocal health of parents of children with hearing assistive devices. Pak J Med Sci 2023;39:1434-1439. [PMID: 37680838 PMCID: PMC10480716 DOI: 10.12669/pjms.39.5.7570] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2023] [Revised: 05/30/2023] [Accepted: 06/16/2023] [Indexed: 09/09/2023] Open

Rai S, Ramdas D, Jacob NL, Bajaj G, Balasubramanium RK, Bhat JS. Normative data for certain vocal fold biomarkers among young normophonic adults using ultrasonography. Eur Arch Otorhinolaryngol 2023;280:4165-4173. [PMID: 37221308 PMCID: PMC10382443 DOI: 10.1007/s00405-023-08025-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2023] [Accepted: 05/09/2023] [Indexed: 05/25/2023]

Chan MPY, Kuang J. The effect of tone language background on cue integration in pitch perception. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2023;154:819-830. [PMID: 37563829 DOI: 10.1121/10.0020565] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/09/2023] [Accepted: 07/18/2023] [Indexed: 08/12/2023]

Idrisoglu A, Dallora AL, Anderberg P, Berglund JS. Applied Machine Learning Techniques to Diagnose Voice-Affecting Conditions and Disorders: Systematic Literature Review. J Med Internet Res 2023;25:e46105. [PMID: 37467031 PMCID: PMC10398366 DOI: 10.2196/46105] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2023] [Revised: 04/26/2023] [Accepted: 05/23/2023] [Indexed: 07/20/2023] Open

Abstract

BACKGROUND

Normal voice production depends on the synchronized cooperation of multiple physiological systems, which makes the voice sensitive to changes. Any systematic, neurological, and aerodigestive distortion is prone to affect voice production through reduced cognitive, pulmonary, and muscular functionality. This sensitivity inspired using voice as a biomarker to examine disorders that affect the voice. Technological improvements and emerging machine learning (ML) technologies have enabled possibilities of extracting digital vocal features from the voice for automated diagnosis and monitoring systems.

OBJECTIVE

This study aims to summarize a comprehensive view of research on voice-affecting disorders that uses ML techniques for diagnosis and monitoring through voice samples where systematic conditions, nonlaryngeal aerodigestive disorders, and neurological disorders are specifically of interest.

METHODS

This systematic literature review (SLR) investigated the state of the art of voice-based diagnostic and monitoring systems with ML technologies, targeting voice-affecting disorders without direct relation to the voice box from the point of view of applied health technology. Through a comprehensive search string, studies published from 2012 to 2022 from the databases Scopus, PubMed, and Web of Science were scanned and collected for assessment. To minimize bias, retrieval of the relevant references in other studies in the field was ensured, and 2 authors assessed the collected studies. Low-quality studies were removed through a quality assessment and relevant data were extracted through summary tables for analysis. The articles were checked for similarities between author groups to prevent cumulative redundancy bias during the screening process, where only 1 article was included from the same author group.

RESULTS

In the analysis of the 145 included studies, support vector machines were the most utilized ML technique (51/145, 35.2%), with the most studied disease being Parkinson disease (PD; reported in 87/145, 60%, studies). After 2017, 16 additional voice-affecting disorders were examined, in contrast to the 3 investigated previously. Furthermore, an upsurge in the use of artificial neural network-based architectures was observed after 2017. Almost half of the included studies were published in the last 2 years (2021 and 2022). A broad interest from many countries was observed. Notably, nearly one-half (n=75) of the studies relied on 10 distinct data sets, and 11/145 (7.6%) used demographic data as an input for ML models.

CONCLUSIONS

This SLR revealed considerable interest across multiple countries in using ML techniques for diagnosing and monitoring voice-affecting disorders, with PD being the most studied disorder. However, the review identified several gaps, including limited and unbalanced data set usage in studies, and a focus on diagnostic tests rather than disorder-specific monitoring. Despite being limited to peer-reviewed publications written in English, the SLR provides valuable insights into the current state of research on ML-based voice-affecting disorder diagnosis and monitoring and highlights areas to address in future research.
