1
|
Nawas KK, Shahina A, Balachandar K, Maadeshwaran P, Devanathan NG, Kumar N, Khan AN. Recurrence plot embeddings as short segment nonlinear features for multimodal speaker identification using air, bone and throat microphones. Sci Rep 2024; 14:12513. [PMID: 38822054 PMCID: PMC11143305 DOI: 10.1038/s41598-024-62406-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2023] [Accepted: 05/16/2024] [Indexed: 06/02/2024] Open
Abstract
Speech is produced by a nonlinear, dynamical Vocal Tract (VT) system, and is transmitted through multiple (air, bone and skin conduction) modes, as captured by the air, bone and throat microphones respectively. Speaker specific characteristics that capture this nonlinearity are rarely used as stand-alone features for speaker modeling, and at best have been used in tandem with well known linear spectral features to produce tangible results. This paper proposes Recurrent Plot (RP) embeddings as stand-alone, non-linear speaker-discriminating features. Two datasets, the continuous multimodal TIMIT speech corpus and the consonant-vowel unimodal syllable dataset, are used in this study for conducting closed-set speaker identification experiments. Experiments with unimodal speaker recognition systems show that RP embeddings capture the nonlinear dynamics of the VT system which are unique to every speaker, in all the modes of speech. The Air (A), Bone (B) and Throat (T) microphone systems, trained purely on RP embeddings perform with an accuracy of 95.81%, 98.18% and 99.74%, respectively. Experiments using the joint feature space of combined RP embeddings for bimodal (A-T, A-B, B-T) and trimodal (A-B-T) systems show that the best trimodal system (99.84% accuracy) performs on par with trimodal systems using spectrogram (99.45%) and MFCC (99.98%). The 98.84% performance of the B-T bimodal system shows the efficacy of a speaker recognition system based entirely on alternate (bone and throat) speech, in the absence of the standard (air) speech. The results underscore the significance of the RP embedding, as a nonlinear feature representation of the dynamical VT system that can act independently for speaker recognition. It is envisaged that speech recognition too will benefit from this nonlinear feature.
Collapse
Affiliation(s)
- K Khadar Nawas
- School of Computer Science and Engineering, Vellore Institute of Technology, Chennai, Tamilnadu, 600127, India
| | - A Shahina
- Department of Information Technology, Sri Sivasubramaniya Nadar College of Engineering, Kalavakkam, Tamil Nadu, 603110, India
| | - Keshav Balachandar
- Department of Information Technology, Sri Sivasubramaniya Nadar College of Engineering, Kalavakkam, Tamil Nadu, 603110, India
| | - P Maadeshwaran
- Department of Information Technology, Sri Sivasubramaniya Nadar College of Engineering, Kalavakkam, Tamil Nadu, 603110, India
| | - N G Devanathan
- Department of Information Technology, Sri Sivasubramaniya Nadar College of Engineering, Kalavakkam, Tamil Nadu, 603110, India
| | - Navein Kumar
- Department of Information Technology, Sri Sivasubramaniya Nadar College of Engineering, Kalavakkam, Tamil Nadu, 603110, India
| | - A Nayeemulla Khan
- School of Computer Science and Engineering, Vellore Institute of Technology, Chennai, Tamilnadu, 600127, India.
| |
Collapse
|
2
|
Tseng WH, Chiu HL, Hsiao TY, Yang TL, Shih PJ. Identification and analysis of Nonlinear behaviors of vocal fold biomechanics during phonation to assess efficacy of surgery for benign laryngeal Diseases. Comput Biol Med 2024; 169:107946. [PMID: 38176211 DOI: 10.1016/j.compbiomed.2024.107946] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2023] [Revised: 12/21/2023] [Accepted: 01/01/2024] [Indexed: 01/06/2024]
Abstract
BACKGROUND Current voice assessments focus on perceptive evaluation and acoustic analysis. The interaction of vocal tract pressure (PVT) and vocal fold (VF) vibrations are important for volume and pitch control. However, there are currently little non-invasive ways to measure PVT. Limited information has been provided by previous human trials, and interactions between PVT and VF vibrations and the potential clinical application remain unclear. Here, we propose a non-invasive method for monitoring the nonlinear characteristics of PVT and VF vibrations, analyze voices from pathological and healthy individuals, and evaluate treatment efficacy. METHOD Healthy volunteers and patients with benign laryngeal lesions were recruited for this study. PVT was estimated using an airflow interruption method, VF vibrational frequency was calculated from accelerometer signals, and nonlinear relationships between PVT and VF vibrations were analyzed. Results from healthy volunteers and patients, as well as pre- and post-operation for the patients, were compared. RESULTS For healthy volunteers, nonlinearity was exhibited as an initial increase and then prompt decrease in vibrational frequency at the end of phonation, coinciding with PVT equilibrating with the subglottal pressure upon airflow interruption. For patients, nonlinearity was present throughout the phonation period pre-operatively, but showed a similar trend to healthy volunteers post-operatively. CONCLUSION This novel method simultaneously monitors PVT and VF vibration and helps clarify the role of PVT. The results demonstrate differences in nonlinear characteristics between healthy volunteers and patients, and pre-/post-operation in patients. The method may serve as an analysis tool for clinicians to assess pathological phonation and treatment efficacy.
Collapse
Affiliation(s)
- Wen-Hsuan Tseng
- Department of Otolaryngology, National Taiwan University Hospital and National Taiwan University College of Medicine, Taipei, Taiwan; Graduate Institute of Clinical Medicine, National Taiwan University College of Medicine, Taipei, Taiwan
| | - Hsiang-Ling Chiu
- Department of Otolaryngology, National Taiwan University Hospital and National Taiwan University College of Medicine, Taipei, Taiwan
| | - Tzu-Yu Hsiao
- Department of Otolaryngology, National Taiwan University Hospital and National Taiwan University College of Medicine, Taipei, Taiwan
| | - Tsung-Lin Yang
- Department of Otolaryngology, National Taiwan University Hospital and National Taiwan University College of Medicine, Taipei, Taiwan; Graduate Institute of Clinical Medicine, National Taiwan University College of Medicine, Taipei, Taiwan; Research Center for Developmental Biology and Regenerative Medicine, National Taiwan University, Taipei, Taiwan
| | - Po-Jen Shih
- Department of Biomedical Engineering, National Taiwan University, Taipei, Taiwan.
| |
Collapse
|
3
|
Ikuma T, McWhorter AJ, Adkins L, Kunduk M. Investigation of Vocal Bifurcations and Voice Patterns Induced by Asymmetry of Pathological Vocal Folds. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2023; 66:48-60. [PMID: 36472934 DOI: 10.1044/2022_jslhr-21-00499] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/17/2023]
Abstract
PURPOSE Vocal fold asymmetry creates irregular entrainments and modulations in voice, which may lead to rough perceptual quality. The presence of asymmetry can also cause mid-phonation bifurcations where a small change in the phonatory system causes a drastic change in vibration pattern, resulting in transitions in and out of rough voice. This study surveys sustained phonation recordings of speakers with the diagnoses of vocal fold polyp or unilateral vocal fold paralysis to investigate the resulting voice patterns. METHOD This retrospective study observed 71 sustained phonation recordings from 48 patients. Segments with distinctive signal patterns were identified within each recording with narrowband spectrogram and computer-assisted analysis of spectral peaks. RESULTS Phonation segmentation yielded 240 segments across all the recordings. Five voice patterns were recognized: (regularly or irregularly) entrained, modulated, uncoupled, unstable, and pulsed. Thirty-six patients (75%) exhibited irregular patterns. No single irregular pattern lasted for the entire phonation and was always accompanied by at least one mid-phonation bifurcation. Durations of the irregular segments (M = 0.4 s) were significantly shorter than the segments with the regular pattern (M = 1.4 s). CONCLUSIONS The results suggest that vocal fold pathology frequently introduces dynamic vibratory patterns that affect both the acoustic signals and perceptions. Due to these abnormalities, it is important for clinical voice assessment protocols, both perceptual and acoustic, to account for these possible bifurcations, irregular signal patterns, and their tendencies.
Collapse
Affiliation(s)
- Takeshi Ikuma
- Department of Otolaryngology-Head and Neck Surgery, Louisiana State University Health Sciences Center, New Orleans
- Voice Center, The Our Lady of the Lake Regional Medical Center, Baton Rouge, LA
| | - Andrew J McWhorter
- Department of Otolaryngology-Head and Neck Surgery, Louisiana State University Health Sciences Center, New Orleans
- Voice Center, The Our Lady of the Lake Regional Medical Center, Baton Rouge, LA
| | - Lacey Adkins
- Department of Otolaryngology-Head and Neck Surgery, Louisiana State University Health Sciences Center, New Orleans
- Voice Center, The Our Lady of the Lake Regional Medical Center, Baton Rouge, LA
| | - Melda Kunduk
- Department of Otolaryngology-Head and Neck Surgery, Louisiana State University Health Sciences Center, New Orleans
- Voice Center, The Our Lady of the Lake Regional Medical Center, Baton Rouge, LA
- Department of Communication Sciences & Disorders, Louisiana State University, Baton Rouge
| |
Collapse
|
4
|
Herbst CT, Nishimura T, Garcia M, Migimatsu K, Tokuda IT. Effect of Ventricular Folds on Vocalization Fundamental Frequency in Domestic Pigs (Sus scrofa domesticus). J Voice 2020; 35:805.e1-805.e15. [PMID: 33388229 DOI: 10.1016/j.jvoice.2020.01.013] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2019] [Revised: 01/10/2020] [Accepted: 01/16/2020] [Indexed: 10/22/2022]
Abstract
This study investigates the effect of the ventricular folds on fundamental frequency (fo) in the voice production of domestic pigs (Sus scrofa domesticus). The excised larynges of six subadult pigs were phonated in two preparation stages, with the ventricular folds present (PS1) and removed (PS2). Vocal fold resonances were tested with a laser vibrometer, and a four-mass computational model was created. Highly significant fo differences were found between PS1 and PS2 (means at 93.7 and 409.3 Hz, respectively). Two tissue resonances were found at 115 Hz and 250-290 Hz. The computational model had unique solutions for abducted and adducted ventricular folds at about 150 and 400 Hz, roughly matching the fo measured ex vivo for PS1 and PS2. The differing fo encountered across preparation stages PS1 and PS2 is explained by distinct activation of either a high or a low eigenfrequency mode, depending on the engagement of the ventricular folds. The inability of the investigated larynges to vibrate at frequencies below 250 Hz in PS2 suggests that in vivo low-frequency calls of domestic pigs (pre-eminently grunts) are likely produced with engaged ventricular folds. Allometric comparison suggests that the special, mechanically coupled "double oscillator" has evolved to prevent signaling disadvantages. Given these traits, the porcine larynx might - apart from special applications relating to the involvement of ventricular folds - not be an ideal candidate for emulating human voice production in excised larynx experimentation.
Collapse
Affiliation(s)
- Christian T Herbst
- Antonio Salieri Department of Vocal Studies and Vocal Research in Music Education, University of Music and Performing Arts Vienna, Vienna, Austria.
| | | | - Maxime Garcia
- ENES Lab, Université Lyon/Saint-Etienne, Neuro-PSI, CNRS UMR 9197, Saint-Etienne, France; Department of Evolutionary Biology and Environmental Studies, University of Zurich, Zürich, Switzerland
| | - Kishin Migimatsu
- Department of Mechanical Engineering, Ritsumeikan University, Kusatsu, Shiga, Japan
| | - Isao T Tokuda
- Department of Mechanical Engineering, Ritsumeikan University, Kusatsu, Shiga, Japan
| |
Collapse
|
5
|
GARCIA MAXIME, HERBST CHRISTIANT. Excised larynx experimentation: history, current developments, and prospects for bioacoustic research. ANTHROPOL SCI 2018. [DOI: 10.1537/ase.171216] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/01/2022]
Affiliation(s)
- MAXIME GARCIA
- ENES Lab, Université Lyon/Saint-Etienne, Neuro-PSI, CNRS UMR 9197, Saint-Etienne
| | | |
Collapse
|
6
|
Charlton BD, Watchorn DJ, Whisson DA. Subharmonics increase the auditory impact of female koala rejection calls. Ethology 2017. [DOI: 10.1111/eth.12628] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- Benjamin D. Charlton
- School of Biology and Environmental Science; University College Dublin (UCD); Dublin Ireland
| | - Darcy J. Watchorn
- School of Life and Environmental Sciences, Centre for Integrative Ecology; Deakin University; Geelong VIC Australia
| | - Desley A. Whisson
- School of Life and Environmental Sciences, Centre for Integrative Ecology; Deakin University; Geelong VIC Australia
| |
Collapse
|
7
|
Birk V, Döllinger M, Sutor A, Berry DA, Gedeon D, Traxdorf M, Wendler O, Bohr C, Kniesburges S. Automated setup for ex vivo larynx experiments. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2017; 141:1349. [PMID: 28372097 PMCID: PMC6909984 DOI: 10.1121/1.4976085] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/11/2023]
Abstract
Ex vivo larynx experiments are limited in time due to degeneration of the laryngeal tissues. In order to acquire a significant and comparable amount of data, automatization of current manual experimental procedures is desirable. A computer controlled, electro-mechanical setup was developed for time-dependent variation of specific physiological parameters, including adduction and elongation level of the vocal folds and glottal flow. The setup offers a standardized method to induce defined forces on the laryngeal cartilages. Furthermore, phonation onset is detected automatically and the subsequent measurement procedure is automated and standardized to improve the efficiency of the experimental process. The setup was validated using four ex vivo porcine larynges, whereas each validation measurement series was executed with one separate larynx. Altogether 31 single measurements were undertaken, which can be summed up to a total experimental time of about 4 min. Vocal fold elongation and adduction lead both to an increase in fundamental frequency and subglottal pressure. Measurement procedures like applying defined subglottal pressure steps and onset-offset detection were reliably executed. The setup allows for a computer-based parameter control, which enables fast experimental execution over a wide range of laryngeal configurations. This maximizes the number of measurements and reduces personal effort compared with manual procedures.
Collapse
Affiliation(s)
- Veronika Birk
- Medical School, Division of Phoniatrics and Pediatric Audiology at the Department of Otorhinolaryngology Head and Neck Surgery, University Hospital Erlangen, Raumerstrasse 1a, 91054 Erlangen, Germany
| | - Michael Döllinger
- Medical School, Division of Phoniatrics and Pediatric Audiology at the Department of Otorhinolaryngology Head and Neck Surgery, University Hospital Erlangen, Raumerstrasse 1a, 91054 Erlangen, Germany
| | - Alexander Sutor
- Chair of Sensor Technology, Friedrich-Alexander-University Erlangen-Nürnberg, Paul-Gordan-Strasse 3/5, 91052 Erlangen, Germany
| | - David A Berry
- Laryngeal Dynamics Laboratory, Division of Head and Neck Surgery, David Geffen School of Medicine at UCLA, 10833 Le Conte Avenue, Los Angeles, California 90095-1624, USA
| | - Dominik Gedeon
- Chair of Sensor Technology, Friedrich-Alexander-University Erlangen-Nürnberg, Paul-Gordan-Strasse 3/5, 91052 Erlangen, Germany
| | - Maximilian Traxdorf
- Department of Otorhinolaryngology Head and Neck Surgery, University Hospital Erlangen, Waldstrasse 1, 91054 Erlangen, Germany
| | - Olaf Wendler
- Laboratory for Molecular Biology at the Department of Otorhinolaryngology Head and Neck Surgery, University Hospital Erlangen, Waldstrasse 1, 91054 Erlangen, Germany
| | - Christopher Bohr
- Medical School, Division of Phoniatrics and Pediatric Audiology at the Department of Otorhinolaryngology Head and Neck Surgery, University Hospital Erlangen, Raumerstrasse 1a, 91054 Erlangen, Germany
| | - Stefan Kniesburges
- Medical School, Division of Phoniatrics and Pediatric Audiology at the Department of Otorhinolaryngology Head and Neck Surgery, University Hospital Erlangen, Raumerstrasse 1a, 91054 Erlangen, Germany
| |
Collapse
|
8
|
Maxfield L, Palaparthi A, Titze I. New Evidence That Nonlinear Source-Filter Coupling Affects Harmonic Intensity and fo Stability During Instances of Harmonics Crossing Formants. J Voice 2016; 31:149-156. [PMID: 27501922 DOI: 10.1016/j.jvoice.2016.04.010] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2016] [Revised: 04/19/2016] [Accepted: 04/19/2016] [Indexed: 11/15/2022]
Abstract
The traditional source-filter theory of voice production describes a linear relationship between the source (glottal flow pulse) and the filter (vocal tract). Such a linear relationship does not allow for nor explain how changes in the filter may impact the stability and regularity of the source. The objective of this experiment was to examine what effect unpredictable changes to vocal tract dimensions could have on fo stability and individual harmonic intensities in situations in which low frequency harmonics cross formants in a fundamental frequency glide. To determine these effects, eight human subjects (five male, three female) were recorded producing fo glides while their vocal tracts were artificially lengthened by a section of vinyl tubing inserted into the mouth. It was hypothesized that if the source and filter operated as a purely linear system, harmonic intensities would increase and decrease at nearly the same rates as they passed through a formant bandwidth, resulting in a relatively symmetric peak on an intensity-time contour. Additionally, fo stability should not be predictably perturbed by formant/harmonic crossings in a linear system. Acoustic analysis of these recordings, however, revealed that harmonic intensity peaks were asymmetric in 76% of cases, and that 85% of fo instabilities aligned with a crossing of one of the first four harmonics with the first three formants. These results provide further evidence that nonlinear dynamics in the source-filter relationship can impact fo stability as well as harmonic intensities as harmonics cross through formant bandwidths.
Collapse
Affiliation(s)
- Lynn Maxfield
- National Center for Voice and Speech, University of Utah, Salt Lake City, UT.
| | - Anil Palaparthi
- National Center for Voice and Speech, University of Utah, Salt Lake City, UT
| | - Ingo Titze
- National Center for Voice and Speech, University of Utah, Salt Lake City, UT; Department of Communication Sciences and Disorders, University of Iowa, Iowa City, IA
| |
Collapse
|
9
|
Lagier A, Legou T, Galant C, Amy de La Bretèque B, Meynadier Y, Giovanni A. The shouted voice: A pilot study of laryngeal physiology under extreme aerodynamic pressure. LOGOP PHONIATR VOCO 2016; 42:141-145. [PMID: 27484505 DOI: 10.1080/14015439.2016.1211735] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
Abstract
INTRODUCTION The objective was to study the behavior of the larynx during shouted voice production, when the larynx is exposed to extremely high subglottic pressure. MATERIALS AND METHODS The study involved electroglottographic, acoustic, and aerodynamic analyses of shouts produced at maximum effort by three male participants. RESULTS AND DISCUSSION Under a normal speaking voice, the voice sound pressure level (SPL) is proportional to the subglottic pressure. However, when the subglottic pressure reached high levels, the voice SPL reached a maximum value and then decreased as subglottic pressure increased further. Furthermore, the electroglottographic signal sometimes lost its periodicity during the shout, suggesting irregular vocal fold vibration.
Collapse
Affiliation(s)
- Aude Lagier
- a Service d'ORL et Chirurgie cervico-faciale, CHU de Liège , Belgium.,b Laboratoire Parole et Langage, UMR 7309, CNRS-AMU , Aix en Provence , France
| | - Thierry Legou
- b Laboratoire Parole et Langage, UMR 7309, CNRS-AMU , Aix en Provence , France
| | - Camille Galant
- b Laboratoire Parole et Langage, UMR 7309, CNRS-AMU , Aix en Provence , France.,c Service d'ORL et Chirurgie cervico-faciale, CHU Timone, AP-HM , Marseille , France
| | - Benoit Amy de La Bretèque
- b Laboratoire Parole et Langage, UMR 7309, CNRS-AMU , Aix en Provence , France.,c Service d'ORL et Chirurgie cervico-faciale, CHU Timone, AP-HM , Marseille , France
| | - Yohann Meynadier
- b Laboratoire Parole et Langage, UMR 7309, CNRS-AMU , Aix en Provence , France
| | - Antoine Giovanni
- b Laboratoire Parole et Langage, UMR 7309, CNRS-AMU , Aix en Provence , France.,c Service d'ORL et Chirurgie cervico-faciale, CHU Timone, AP-HM , Marseille , France
| |
Collapse
|
10
|
Maunsell R, Ouaknine M, Giovanni A, Crespo A. Vibratory pattern of vocal folds under tension asymmetry. Otolaryngol Head Neck Surg 2016; 135:438-44. [PMID: 16949979 DOI: 10.1016/j.otohns.2006.05.023] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2005] [Accepted: 05/08/2006] [Indexed: 11/28/2022]
Abstract
Objective The aim of this study was to describe and analyze the vibratory pattern of vocal folds in an asymmetric situation. Study Design and Setting Cricothyroid muscle unilateral action was simulated on excised larynges on an experimental bench. Increasing airflow rates were applied to achieve vocal fold vibration. Electroglottography and an optoreflectometer device allowed analysis of separate and simultaneous vocal fold vibration. Spectra of the signals were obtained for each level of airflow variation. Results All experiments showed periodic vibration. A phase shift was noted between the two vocal folds. Subharmonics and biphonation were identified in all the experiments. Conclusion Lax vocal folds were more susceptible to spectral changes with increasing airflow. Significance Knowledge of the consequences of mass, tension, and position asymmetries of the vocal folds is crucial for diagnosis making and defining therapeutic strategies in dysphonic patients. This study may contribute to the understanding of physiology of vocal fold interaction and its compensatory mechanisms.
Collapse
Affiliation(s)
- Rebecca Maunsell
- Otorhinolaryngology Department, Universidade Estadual de Campinas, Cidade Universitária Zeferino Vaz, CEP-13083-970 Campinas, São Paulo, Brazil.
| | | | | | | |
Collapse
|
11
|
Orozco-Arroyave JR, Belalcazar-Bolanos EA, Arias-Londono JD, Vargas-Bonilla JF, Skodda S, Rusz J, Daqrouq K, Honig F, Noth E. Characterization Methods for the Detection of Multiple Voice Disorders: Neurological, Functional, and Laryngeal Diseases. IEEE J Biomed Health Inform 2015; 19:1820-8. [DOI: 10.1109/jbhi.2015.2467375] [Citation(s) in RCA: 79] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
|
12
|
Charlton BD. The Acoustic Structure and Information Content of Female Koala Vocal Signals. PLoS One 2015; 10:e0138670. [PMID: 26465340 PMCID: PMC4605621 DOI: 10.1371/journal.pone.0138670] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2015] [Accepted: 09/02/2015] [Indexed: 11/28/2022] Open
Abstract
Determining the information content of animal vocalisations can give valuable insights into the potential functions of vocal signals. The source-filter theory of vocal production allows researchers to examine the information content of mammal vocalisations by linking variation in acoustic features with variation in relevant physical characteristics of the caller. Here I used a source-filter theory approach to classify female koala vocalisations into different call-types, and determine which acoustic features have the potential to convey important information about the caller to other conspecifics. A two-step cluster analysis classified female calls into bellows, snarls and tonal rejection calls. Additional results revealed that female koala vocalisations differed in their potential to provide information about a given caller’s phenotype that may be of importance to receivers. Female snarls did not contain reliable acoustic cues to the caller’s identity and age. In contrast, female bellows and tonal rejection calls were individually distinctive, and the tonal rejection calls of older female koalas had consistently lower mean, minimum and maximum fundamental frequency. In addition, female bellows were significantly shorter in duration and had higher fundamental frequency, formant frequencies, and formant frequency spacing than male bellows. These results indicate that female koala vocalisations have the potential to signal the caller’s identity, age and sex. I go on to discuss the anatomical basis for these findings, and consider the possible functional relevance of signalling this type of information in the koala’s natural habitat.
Collapse
Affiliation(s)
- Benjamin D. Charlton
- School of Biology and Environmental Science, University College Dublin (UCD), Belfield, Dublin, Ireland
- * E-mail:
| |
Collapse
|
13
|
Awan SN, Novaleski CK, Rousseau B. Nonlinear analyses of elicited modal, raised, and pressed rabbit phonation. J Voice 2014; 28:538-47. [PMID: 24836360 DOI: 10.1016/j.jvoice.2014.01.015] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2013] [Accepted: 01/28/2014] [Indexed: 11/30/2022]
Abstract
OBJECTIVES/HYPOTHESIS The purpose of this study was to use nonlinear dynamic analysis methods such as phase space portraits and correlation dimension (D2) as well as descriptive spectrographic analyses to characterize acoustic signals produced during evoked rabbit phonation. METHODS Seventeen New Zealand white breeder rabbits were used to perform the study. A Grass S-88 stimulator (SA Instrumentation, Encinitas, CA) and constant current isolation unit (Grass Telefactor, model PSIU6; West Warwick, RI) were used to provide electrical stimulation to laryngeal musculature, and transglottal airflow rate and stimulation current (mA) were manipulated to elicit modal, raised intensity, and pressed phonations. Central 1 second portions of the most stable portion of the acoustic waveform for modal, raised intensity, and pressed phonations were edited and then analyzed via phase space portraits, Poincaré sections, and the estimation of the D2. In an attempt to limit the effects of the highly variable and nonstationary characteristics of some of the signals being analyzed, D2 analysis was also performed on the most stable central 200-millisecond portion of the acoustic waveform. Descriptive analysis of each phonation was also conducted using sound spectrograms. RESULTS Results showed that the complexity of phonation and the subsequent acoustic waveform is increased as transglottal airflow rate and degree of glottal adduction are manipulated in the evoked rabbit phonation model. In particular, phonatory complexity, as quantified via D2 analyses and demonstrated via spectrographic characteristics, increases from "modal" (ie, phonation elicited at just above the phonation threshold pressure) to raised intensity (phonation elicited by increasing transglottal airflow rate) to pressed (phonation elicited by increasing the stimulation current delivered to the larynx). Variations in a single dynamic dimension (airflow rate or adductory force) resulted in significantly increased productions of nonlinear phenomenon, including bifurcations from periodicity to regions of subharmonic content, fundamental frequency, and harmonic jumps, and evidence of periodicity within aperiodic regions ("chaos"). CONCLUSIONS The evoked rabbit phonation model described in this study allows for the elicitation of various types of phonations under controlled conditions and, therefore, has the potential to provide insight regarding important variables that may elicit examples of nonlinear phenomena such as subharmonics and deterministic chaos.
Collapse
Affiliation(s)
- Shaheen N Awan
- Department of Audiology and Speech Pathology, Bloomsburg University of Pennsylvania, Bloomsburg, Pennsylvania.
| | - Carolyn K Novaleski
- Department of Hearing and Speech Sciences, Vanderbilt University, Nashville, Tennessee
| | - Bernard Rousseau
- Department of Otolaryngology, Hearing and Speech Sciences, and Mechanical Engineering, Vanderbilt University School of Medicine, Nashville, Tennessee
| |
Collapse
|
14
|
Chhetri DK, Neubauer J, Bergeron JL, Sofer E, Peng KA, Jamal N. Effects of asymmetric superior laryngeal nerve stimulation on glottic posture, acoustics, vibration. Laryngoscope 2013; 123:3110-6. [PMID: 23712542 DOI: 10.1002/lary.24209] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2013] [Revised: 04/26/2013] [Accepted: 04/26/2013] [Indexed: 11/12/2022]
Abstract
OBJECTIVES/HYPOTHESIS Evaluate the effects of asymmetric superior laryngeal nerve stimulation on the vibratory phase, laryngeal posture, and acoustics. STUDY DESIGN Basic science study using an in vivo canine model. METHODS The superior laryngeal nerves were symmetrically and asymmetrically stimulated over eight activation levels to mimic laryngeal asymmetries representing various levels of superior laryngeal nerve paresis and paralysis conditions. Glottal posture change, vocal fold speed, and vibration of these 64 distinct laryngeal-activation conditions were evaluated by high speed video and concurrent acoustic and aerodynamic recordings. Assessments were made at phonation onset. RESULTS Vibratory phase was symmetric in all symmetric activation conditions, but consistent phase asymmetry toward the vocal fold with higher superior laryngeal-nerve activation was observed. Superior laryngeal nerve paresis and paralysis conditions had reduced vocal fold strain and fundamental frequency. Superior laryngeal nerve activation increased vocal fold closure speed, but this effect was more pronounced for the ipsilateral vocal fold. Increasing asymmetry led to aperiodic and chaotic vibration. CONCLUSIONS This study directly links vocal-fold tension asymmetry with vibratory phase asymmetry, in particular the side with greater tension leads in the opening phase. The clinical observations of vocal fold lag, reduced vocal range, and aperiodic voice in superior laryngeal paresis and paralysis is also supported.
Collapse
Affiliation(s)
- Dinesh K Chhetri
- Department of Head and Neck Surgery, Laryngeal Physiology Laboratory, CHS 62-132, Los Angeles, California, U.S.A
| | | | | | | | | | | |
Collapse
|
15
|
Pinheiro AP, Kerschen G. Vibrational dynamics of vocal folds using nonlinear normal modes. Med Eng Phys 2013; 35:1079-88. [DOI: 10.1016/j.medengphy.2012.11.002] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2012] [Revised: 10/08/2012] [Accepted: 11/04/2012] [Indexed: 10/27/2022]
|
16
|
Nicollas R, Vicente J, Brutin D, Giordano J, Medale M, Giovanni A, Ouaknine M, Triglia JM. The very first cry: a multidisciplinary approach toward a model. Ann Otol Rhinol Laryngol 2013; 121:821-6. [PMID: 23342556 DOI: 10.1177/000348941212101209] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
OBJECTIVES In previous work, we showed that a rigid larynx-like geometry can generate a sound by itself. However, very little is known about the exact mechanisms and control of the larynx during the first cry of life. The goal of this work was to understand how the very first cry is generated. METHODS Simultaneous high-speed imaging and sound recording on 2 excised 38-week term human fetus larynges were performed. The behaviors of the vocal folds and the false vocal folds were studied separately. The behavior of the vocal folds after resection of the supraglottic structures was also analyzed. A comparative acoustic analysis of the first cry and of the sound generated by the excised organs was performed. RESULTS Our data showed that the vocal folds in a larynx with the pressure conditions of the first cry do not generate sound themselves, but induce aerodynamic conditions leading to vibrations of other parts of the larynx. CONCLUSIONS The similarities between the sound generated by an excised larynx and the first cry suggest a lack of neurologic control of the larynx during production of the first cry. A model-algorithm is proposed.
Collapse
Affiliation(s)
- Richard Nicollas
- Energetic Dynamics Laboratory of IUSTI (University Institute of Industrial and Thermal Systems), UMR 6595 CNRS Université de Provence, Marseille, France
| | | | | | | | | | | | | | | |
Collapse
|
17
|
Analysis of Speech from People with Parkinson’s Disease through Nonlinear Dynamics. ADVANCES IN NONLINEAR SPEECH PROCESSING 2013. [DOI: 10.1007/978-3-642-38847-7_15] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
|
18
|
New Cues in Low-Frequency of Speech for Automatic Detection of Parkinson’s Disease. NATURAL AND ARTIFICIAL MODELS IN COMPUTATION AND BIOLOGY 2013. [DOI: 10.1007/978-3-642-38637-4_29] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/10/2023]
|
19
|
|
20
|
Döllinger M, Kobler J, Berry DA, Mehta DD, Luegmair G, Bohr C. Experiments on Analysing Voice Production: Excised (Human, Animal) and In Vivo (Animal) Approaches. Curr Bioinform 2011; 6:286-304. [PMID: 26581597 DOI: 10.2174/157489311796904673] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Experiments on human and on animal excised specimens as well as in vivo animal preparations are so far the most realistic approaches to simulate the in vivo process of human phonation. These experiments do not have the disadvantage of limited space within the neck and enable studies of the actual organ necessary for phonation, i.e., the larynx. The studies additionally allow the analysis of flow, vocal fold dynamics, and resulting acoustics in relation to well-defined laryngeal alterations. PURPOSE OF REVIEW This paper provides an overview of the applications and usefulness of excised (human/animal) specimen and in vivo animal experiments in voice research. These experiments have enabled visualization and analysis of dehydration effects, vocal fold scarring, bifurcation and chaotic vibrations, three-dimensional vibrations, aerodynamic effects, and mucosal wave propagation along the medial surface. Quantitative data will be shown to give an overview of measured laryngeal parameter values. As yet, a full understanding of all existing interactions in voice production has not been achieved, and thus, where possible, we try to indicate areas needing further study. RECENT FINDINGS A further motivation behind this review is to highlight recent findings and technologies related to the study of vocal fold dynamics and its applications. For example, studies of interactions between vocal tract airflow and generation of acoustics have recently shown that airflow superior to the glottis is governed by not only vocal fold dynamics but also by subglottal and supraglottal structures. In addition, promising new methods to investigate kinematics and dynamics have been reported recently, including dynamic optical coherence tomography, X-ray stroboscopy and three-dimensional reconstruction with laser projection systems. Finally, we touch on the relevance of vocal fold dynamics to clinical laryngology and to clinically-oriented research.
Collapse
Affiliation(s)
- Michael Döllinger
- University Hospital Erlangen, Medical School, Laboratory for Computational Medicine, Department for Phoniatrics and Pediatric Audiology, Bohlenplatz 21, 91054 Erlangen, Germany
| | - James Kobler
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, 620 Thier Building, 55 Fruit Street, Boston, Massachusetts 02114, USA
| | - David A Berry
- The Laryngeal Dynamics Laboratory, Division of Head & Neck Surgery, UCLA School of Medicine, 31-24 Rehab Center, 1000 Veteran Ave., Los Angeles, CA, 90095-1794, USA
| | - Daryush D Mehta
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, One Bowdoin Square, 11 Floor, Boston, Massachusetts 02114, USA
| | - Georg Luegmair
- University Hospital Erlangen, Medical School, Laboratory for Computational Medicine, Department for Phoniatrics and Pediatric Audiology, Bohlenplatz 21, 91054 Erlangen, Germany
| | - Christopher Bohr
- University Hospital Erlangen, Medical School, ENT-Hospital, Waldstrasse 1, 91054 Erlangen, Germany
| |
Collapse
|
21
|
Nonlinear Dynamic Analysis of Disordered Voice: The Relationship Between the Correlation Dimension (D2) and Pre-/Post-Treatment Change in Perceived Dysphonia Severity. J Voice 2010; 24:285-93. [DOI: 10.1016/j.jvoice.2008.11.003] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2008] [Accepted: 11/21/2008] [Indexed: 11/17/2022]
|
22
|
Murugappan S, Boyce S, Khosla S, Kelchner L, Gutmark E. Acoustic characteristics of phonation in "wet voice" conditions. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2010; 127:2578-89. [PMID: 20370039 PMCID: PMC2865707 DOI: 10.1121/1.3308478] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/11/2009] [Revised: 01/11/2010] [Accepted: 01/13/2010] [Indexed: 05/29/2023]
Abstract
A perceptible change in phonation characteristics after a swallow has long been considered evidence that food and/or drink material has entered the laryngeal vestibule and is on the surface of the vocal folds as they vibrate. The current paper investigates the acoustic characteristics of phonation when liquid material is present on the vocal folds, using ex vivo porcine larynges as a model. Consistent with instrumental examinations of swallowing disorders or dysphagia in humans, three liquids of different Varibar viscosity ("thin liquid," "nectar," and "honey") were studied at constant volume. The presence of materials on the folds during phonation was generally found to suppress the higher frequency harmonics and generate intermittent additional frequencies in the low and high end of the acoustic spectrum. Perturbation measures showed a higher percentage of jitter and shimmer when liquid material was present on the folds during phonation, but they were unable to differentiate statistically between the three fluid conditions. The finite correlation dimension and positive Lyapunov exponent measures indicated that the presence of materials on the vocal folds excited a chaotic system. Further, these measures were able to reliably differentiate between the baseline and different types of liquid on the vocal folds.
Collapse
Affiliation(s)
- Shanmugam Murugappan
- Department of Otolaryngology, Head and Neck Surgery, University of Cincinnati Medical Center, 231 Albert B. Sabin Way, Cincinnati, OH 45267-0528, USA.
| | | | | | | | | |
Collapse
|
23
|
Vocal-type classification as a tool to identify stress in piglets under on-farm conditions. Anim Welf 2009. [DOI: 10.1017/s0962728600000816] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
AbstractPrevious studies have shown that the analysis of high frequency stress calls in pigs can serve as a reliable tool in welfare research. Our study focuses on the classification of three different classes of piglet vocalisation: grunting, squealing and screaming. In a castration experiment (Experiment 1), 3,285 vocalisations from 42 piglets were analysed for 21 different vocal characteristics. A first discriminant function for the three vocal types was derived from recordings made under laboratory-like conditions. A second discriminant function was derived from non-calibrated measurements of the relative sound energy content. These two classifications revealed 86.7% identical assignments of vocalisations to the three vocal types. The second classification allowed for vocalisation analyses of animals under on-farm recording conditions. This technique was validated during an open-field test (Experiment 2) with piglets housed in two different farrowing systems (11,089 vocalisations, 22 piglets). The proportion of screaming sounds was lower for piglets from a group-farrowing (GF) system than for those from a single-farrowing (SF) system. Sound properties showed differences between as well as within the two experiments for all three vocal types. Vocalisations from SF and GF piglets differed significantly in the duration, energy, and relative maximum levels. We conclude that vocal-type analysis can not only help to identify vocalisation indicative of pain during castration, but also vocal behaviour changes indicating separation stress during the open-field test. Therefore, classification of vocal types can add valuable information to studies that use pig vocalisation for the assessment of welfare.
Collapse
|
24
|
Validation of the newborn larynx modeling with aerodynamical experimental data. Med Eng Phys 2009; 31:547-52. [DOI: 10.1016/j.medengphy.2008.08.009] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2007] [Revised: 07/20/2008] [Accepted: 08/30/2008] [Indexed: 11/19/2022]
|
25
|
Nicollas R, Garrel R, Ouaknine M, Giovanni A, Nazarian B, Triglia JM. Normal Voice in Children Between 6 and 12 Years of Age: Database and Nonlinear Analysis. J Voice 2008; 22:671-5. [PMID: 17509823 DOI: 10.1016/j.jvoice.2007.01.009] [Citation(s) in RCA: 42] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2006] [Accepted: 01/23/2007] [Indexed: 11/16/2022]
Abstract
This study was carried out using a transversal design. It aimed to investigate possible changes of the normal voice in children before mutation, to create a database for the parameters used in the study, and to examine the use of fractal dimension and the largest Lyapunov exponent (LLE) in the assessment of nonpathological phenomena. Two hundred twelve children were enrolled: 111 females and 101 males; and 9 six-year-olds, 24 seven-year-olds, 18 eight-year-olds, 25 nine-year-olds, 27 ten-year-olds, 55 eleven-year-olds, and 54 twelve-year-olds. Fundamental frequency (Fo) decreased with age and was lower in boys than in girls. Jitter and shimmer did not significantly differ with age or gender. Fractal dimension and LLE were significantly lower in boys; LLE decreased with age. The present series confirmed the established findings that Fo is lower in boys than in girls, even before mutation, and decreases with age; two other classical voice analysis parameters, jitter and shimmer, also showed the same behavior as described in the literature. The study of nonlinear parameters (fractal dimension and LLE) showed that laryngeal dynamics is more stable in boys than in girls, and that stability is correlated with age.
Collapse
Affiliation(s)
- Richard Nicollas
- Laboratoire d'Audiophonologie Expérimentale et Clinique du Laboratoire Parole et Langage, Université de la Méditerranée, Marseille, France.
| | | | | | | | | | | |
Collapse
|
26
|
Zollinger SA, Riede T, Suthers RA. Two-voice complexity from a single side of the syrinx in northern mockingbird Mimus polyglottos vocalizations. ACTA ACUST UNITED AC 2008; 211:1978-91. [PMID: 18515729 DOI: 10.1242/jeb.014092] [Citation(s) in RCA: 53] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
The diverse vocal signals of songbirds are produced by highly coordinated motor patterns of syringeal and respiratory muscles. These muscles control separate sound generators on the right and left side of the duplex vocal organ, the syrinx. Whereas most song is under active neural control, there has been a growing interest in a different class of nonlinear vocalizations consisting of frequency jumps, subharmonics, biphonation and deterministic chaos that are also present in the vocal repertoires of many vertebrates, including many birds. These nonlinear phenomena may not require active neural control, depending instead on the intrinsic nonlinear dynamics of the oscillators housed within each side of the syrinx. This study investigates the occurrence of these phenomena in the vocalizations of intact northern mockingbirds Mimus polyglottos. By monitoring respiratory pressure and airflow on each side of the syrinx, we provide the first analysis of the contribution made by each side of the syrinx to the production of nonlinear phenomena and are able to reliably discriminate two-voice vocalizations from potentially similar appearing, unilaterally produced, nonlinear events. We present the first evidence of syringeal lateralization of nonlinear dynamics during bilaterally produced chaotic calls. The occurrence of unilateral nonlinear events was not consistently correlated with fluctuations in air sac pressure or the rate of syringeal airflow. Our data support previous hypotheses for mechanical and acoustic coupling between the two sides of the syrinx. These results help lay a foundation upon which to understand the communicative functions of nonlinear phenomena.
Collapse
Affiliation(s)
- Sue Anne Zollinger
- Department of Biology, Jordan Hall, Indiana University, Bloomington, IN 47405, USA.
| | | | | |
Collapse
|
27
|
Yan Y, Damrose E, Bless D. Functional Analysis of Voice Using Simultaneous High-Speed Imaging and Acoustic Recordings. J Voice 2007; 21:604-16. [PMID: 16968665 DOI: 10.1016/j.jvoice.2006.05.011] [Citation(s) in RCA: 46] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2006] [Accepted: 05/09/2006] [Indexed: 11/19/2022]
Abstract
We present a comprehensive, functional analysis of clinical voice data derived from both high-speed digital imaging (HSDI) of the larynx and simultaneously acquired acoustic recordings. The goals of this study are to: (1) correlate dynamic characteristics of the vocal folds derived from direct laryngeal imaging with indirectly acquired acoustic measurements; (2) define the advantages of using a combined imaging/acoustic approach for the analysis of voice condition; and (3) identify new quantitative measures to evaluate the regularity of the vocal fold vibration and the complexity of the vocal output -- these measures will be key to successful diagnosis of vocal abnormalities. Image- and acoustic-based analyses are performed using an analytic phase plot approach previously introduced by our group (referred to as 'Nyquist' plot). Fast Fourier Transform (FFT) spectral analyses are performed on the same data for a comparison. Clinical HSDI and acoustic recordings from subjects having normal and specific voice pathologies, including muscular tension dysphonia (MTD) and recurrent respiratory papillomatosis (RRP) were analyzed using the Nyquist plot approach. The results of these analyses show that a combined imaging/acoustic analysis approach provides better characterization of the vibratory behavior of the vocal folds as it correlates with vocal output and pathology.
Collapse
Affiliation(s)
- Yuling Yan
- Department of Otolaryngology-Head & Neck Surgery, Stanford University School of Medicine, Stanford, CA 94305, USA.
| | | | | |
Collapse
|
28
|
Garrel R, Nicollas R, Chapus E, Ouaknine M, Giovanni A. Voice improvement in unilateral laryngeal paralysis during loud voicing: theoretical impact. Eur Arch Otorhinolaryngol 2007; 264:1201-5. [PMID: 17558508 DOI: 10.1007/s00405-007-0348-3] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2006] [Accepted: 05/04/2007] [Indexed: 10/23/2022]
Abstract
Voice of patient with unilateral laryngeal paralysis (ULP) shows a nonlinear behaviour with sudden octave jumps, bifurcations and chaos. Such a behaviour may be due to an increased number of freedom degrees in the glottal system. We hypothesized that voice intensity (with increasing sub glottal pressure) could improve vocal signal stability with less freedom degrees in vibrating system, and then a decrease of nonlinearities. A prospective study of 32 consecutive voices of patients with ULP and severe dysphonia was conducted. Jitter and Lyapunov exponent from vocal signals were compared at comfortable and loud voicing with Wilcoxon's test. In 23 out of 32 patients, jitter significantly decreased from 5 (median) in normal voice to 1.2 in loud voice (P < 10(-3)), Lyapunov exponent decreased from 1,495 bit/s (median) to 708 bit/s (P < 10(-4)). Two patients had paradoxical results regarding jitter (higher in loud voice) and 2 regarding Lyapunov exponent. From the 23 cases of voice improvement, 15 cases showing a marked improvement of the acoustic analysis supported our hypothesis (65%). Nonlinear phenomena detected in vocal signals of ULP with severe dysphonia may be reduced in loud voice.
Collapse
Affiliation(s)
- Renaud Garrel
- Service ORL, CHU Gui de Chauliac, 34295, Montpellier, France.
| | | | | | | | | |
Collapse
|
29
|
Riede T, Owren MJ, Arcadi AC. Nonlinear acoustics in pant hoots of common chimpanzees (Pan troglodytes): frequency jumps, subharmonics, biphonation, and deterministic chaos. Am J Primatol 2005; 64:277-91. [PMID: 15538766 DOI: 10.1002/ajp.20078] [Citation(s) in RCA: 66] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
The pant hoot calls produced by common chimpanzees (Pan troglodytes) are multi-call vocalizations that have figured prominently in investigations of acoustic communication in this species. Although pant hoots are predominantly harmonically structured, they can exhibit an acoustic complexity that has recently been linked to nonlinearity in the vocal-fold dynamics underlying typical mammalian sound production. We examined the occurrence of these sorts of nonlinear phenomena in pant hoot vocalizations, contrasting quieter and lower-pitched "introduction" components with loud and high-pitched "climax" calls in the same bouts. Spectrographic evidence revealed four kinds of nonlinear phenomena, including discrete frequency jumps, subharmonics, biphonation, and deterministic chaos. While these events were virtually never observed during the introduction, they occurred in more than half of the climax calls. Biphonation was by far the most common phenomenon, followed by subharmonics, chaos, and frequency jumps. Individual callers varied in the degree to which their climax calls exhibited nonlinear phenomena, but were consistent in showing more biphonation than other forms. These outcomes show that nonlinear phenomena are routinely present in chimpanzee pant hoots, and help lay the foundation for investigating the function of such events.
Collapse
Affiliation(s)
- Tobias Riede
- Department of Psychology, Cornell University, Ithaca, New York, USA.
| | | | | |
Collapse
|
30
|
Ayache S, Ouaknine M, Dejonkere P, Prindere P, Giovanni A. Experimental study of the effects of surface mucus viscosity on the glottic cycle. J Voice 2004; 18:107-15. [PMID: 15070230 DOI: 10.1016/j.jvoice.2003.07.004] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/29/2003] [Indexed: 11/28/2022]
Abstract
Numerous clinical findings indicate that viscosity of laryngeal mucosa is a crucial factor in glottal perfomance. Experience using experimental test benches has shown the importance of humidifying air stream used to induce vibration in excised larynges. Nevertheless, there is a lack of knowledge particularly regarding the physicochemical properties of laryngeal mucus. The purpose of this study was to research vocal fold vibration in excised larynges using artificial mucus of precisely known viscosity. Eight freshly harvested porcine larynges were examined. Parameters measured were Fo and vocal fold contact time. Measurements were performed under three conditions: basal (no fluid application on vocal cord surface), after application of a fluid of 60cP viscosity (Visc60), and after application of a fluid of 100cP viscosity (Visc100). Electroglottographic measurements were performed at two different times for each condition: 1 s after airflow onset (T1) and 6 seconds after airflow onset (T2). Statistical analysis consisted of comparing data obtained under each condition at T1 and T2. The results showed a significant decrease in Fo after application of Visc60 and Visc100 fluids and a decrease in Fo at T2. Closure time was significantly higher under Visc60 conditions and under Visc100 conditions than under basal conditions. Application of artificial mucus to the mucosa of the vocal folds lowered vibratory frequency and prolonged the contact phase. Our interpretation of this data is that the presence of mucus on the surface of the vocal folds generated superficial tension and caused adhesion, which is a source of nonlinearity in vocal vibration.
Collapse
Affiliation(s)
- Stéphane Ayache
- Laboratoire d'Audio-Phonologie Clinique de l'Université de la Méditerranée (UPRES-EA 2668), Hŏpital Universitaire Timone, Marseille, France
| | | | | | | | | |
Collapse
|
31
|
Brown CH, Alipour F, Berry DA, Montequin D. Laryngeal biomechanics and vocal communication in the squirrel monkey (Saimiri boliviensis). THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2003; 113:2114-2126. [PMID: 12703722 DOI: 10.1121/1.1528930] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
The larynges of eight squirrel monkeys were harvested, dissected, mounted on a pseudotracheal tube, and phonated using compressed air. Patterns of vocal fold oscillation were compared with sound spectrograms of calls recorded from monkeys in our colony. Four different regimes of vocal fold activation were identified. Regime 1 resembled typical human vowel production, with regular vocal-fold vibration, a prominent fundamental frequency, and an accompanying series of harmonic overtones. This regime is likely to give rise to squirrel monkey "cackles," as well as a variety of other harmonically structured calls. In regime 2, the pattern of vibrations exhibited the presence of two or more unrelated frequencies (biphonation). This regime of glottal activity resembled the biphonation observed in many exemplars of "twitter" and "kecker" calls. The vocal folds oscillated continuously in regime 3, but produced glottal pulses whose amplitudes waxed and waned rhythmically. This phenomenon resulted in the percept of a series of discrete pulses, and may give rise to "errs," "churrs," and other calls composed of a rapid sequence of acoustic elements. In regime 4, the period of each oscillation was quasi-irregular. Shrieks and other broadband calls or call elements that lack an apparent fundamental frequency may be produced in this manner.
Collapse
Affiliation(s)
- Charles H Brown
- Department of Psychology, University of South Alabama, Mobile, Alabama 36688, USA.
| | | | | | | |
Collapse
|
32
|
Yu P, Ouaknine M, Revis J, Giovanni A. Objective voice analysis for dysphonic patients: a multiparametric protocol including acoustic and aerodynamic measurements. J Voice 2001; 15:529-42. [PMID: 11792029 DOI: 10.1016/s0892-1997(01)00053-4] [Citation(s) in RCA: 92] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
Abstract
The purpose was to determine the clinical value of a multiparametric objective voice evaluation protocol including acoustic and aerodynamic parameters measured mainly on a sustained /a/. This was done by comparison with perceptual analysis of continuous speech by a jury composed of 6 experienced listeners. Voice samples (continuous speech) from 63 male patients with dysphonia and 21 control subjects with normal voices were recorded and assesed by a jury of listeners. The jury was instructed to classify voice samples according to the G (overall dysphonia) component of the GRBAS score on a 4-point scale ranging from 0 for normal to 3 for severe dysphonia. Objective parameters were recorded on an EVA workstation. As usual with this type of system, parameters were measured mainly on a sustained /a/. Measured parameters included fundamental frequency (F0), intensity, jitter, shimmer, signal-to-noise ratio, Lyapunov coefficient (LC), oral airflow (OAF), maximum phonatory time (MPT), and vocal range (range). Estimated subglottic pressure (ESGP) was determined on a series of /pa/. Discriminant analysis was performed to detect correlation between jury classification and combinations of parameters. Results showed that a nonlinear combination of only six parameters (range, LC, ESGP, MPT, signal-to-noise ratio, and F0) allowed 86% concordance with jury classification. Discussion deals with the relative importance of the different objective parameters for discriminant analysis. Special emphasis is placed on two measurements rarely made in routine clinical workup, i.e., estimated subglottic pressure and Lyapunov coefficient.
Collapse
Affiliation(s)
- P Yu
- Laboratoire d'Audio-Phonologie Clinique, Centre Hospitalier Universitaire Timone, Marseille, France
| | | | | | | |
Collapse
|
33
|
Surgical management of nonneoplastic vocal fold lesions: laser versus cold knife excision. Curr Opin Otolaryngol Head Neck Surg 2000. [DOI: 10.1097/00020840-200012000-00014] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|