1
|
Gençağa D. Confounding Factor Analysis for Vocal Fold Oscillations. ENTROPY (BASEL, SWITZERLAND) 2023; 25:1577. [PMID: 38136457 PMCID: PMC10742717 DOI: 10.3390/e25121577] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/28/2023] [Revised: 11/12/2023] [Accepted: 11/15/2023] [Indexed: 12/24/2023]
Abstract
This paper provides a methodology to better understand the relationships between different aspects of vocal fold motion, which are used as features in machine learning-based approaches for detecting respiratory infections from voice recordings. The relationships are derived through a joint multivariate analysis of the vocal fold oscillations of speakers. Specifically, the multivariate setting explores the displacements and velocities of the left and right vocal folds derived from recordings of five extended vowel sounds for each speaker (/aa/, /iy/, /ey/, /uw/, and /ow/). In this multivariate setting, the differences between the bivariate and conditional interactions are analyzed by information-theoretic quantities based on transfer entropy. Incorporation of the conditional quantities reveals information regarding the confounding factors that can influence the statistical interactions among other pairs of variables. This is demonstrated on a vector autoregressive process where the analytical derivations can be carried out. As a proof of concept, the methodology is applied on a clinically curated dataset of COVID-19. The findings suggest that the interaction between the vocal fold oscillations can change according to individuals and presence of any respiratory infection, such as COVID-19. The results are important in the sense that the proposed approach can be utilized to determine the selection of appropriate features as a supplementary or early detection tool in voice-based diagnostics in future studies.
Collapse
Affiliation(s)
- Deniz Gençağa
- Department of Electrical and Electronics Engineering, Antalya Bilim University, Antalya 07190, Turkey;
- Language Technologies Institute, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| |
Collapse
|
2
|
Barsties V Latoszek B, Englert M, Lucero JC, Behlau M. The Performance of the Acoustic Voice Quality Index and Acoustic Breathiness Index in Synthesized Voices. J Voice 2023; 37:804.e21-804.e28. [PMID: 34218968 DOI: 10.1016/j.jvoice.2021.05.005] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2021] [Revised: 05/07/2021] [Accepted: 05/11/2021] [Indexed: 11/25/2022]
Abstract
OBJECTIVE The aim of the present study was to investigate the performance of the Acoustic Voice Quality Index (AVQI) and the Acoustic Breathiness Index (ABI) in synthesized voice samples. METHOD The validity of the AVQI and ABI performances was analyzed in synthesized voice samples controlling the degree of predefined deviations for overall voice quality (G-scale) and breathiness (B-scale). A range of 26 synthesized voice samples with various severity degrees in G-scale with and without prominence of breathiness for male and female voices were created. RESULTS ABI received higher validity in the evaluation of breathiness than AVQI. Furthermore, ABI evaluated accurately breathiness degrees without considering roughness effects in voice samples and confirmed the findings of other studies with natural voices. Furthermore, ABI was more robust than AVQI in the evaluation of severe voice-disordered voice samples. Finally, AVQI represented moreover overall voice quality with an emphasis of breathiness evaluation and less roughness although roughness had a necessary component in overall voice quality evaluation. CONCLUSION AVQI and ABI are two robust measurements in the evaluation of voice quality. However, ABI received fewer errors than AVQI in the analyses of higher abnormalities in the voice signal. Disturbances of other subtypes of abnormal overall voice quality such as roughness were not demonstrated in the results of ABI.
Collapse
Affiliation(s)
- Ben Barsties V Latoszek
- Speech-Language Pathology, SRH University of Applied Health Sciences, Düsseldorf, Germany; Department of Phoniatrics and Pediatric Audiology, University Hospital Münster, University of Münster, Münster, Germany.
| | - Marina Englert
- Human Communication Disorders, Universidade Federal de São Paulo -UNIFESP, São Paulo, Sao Paulo, Brazil; Centro de Estudos da Voz - CEV, São Paulo ,SP, Brazil
| | - Jorge C Lucero
- Department of Computer Science, Universidade de Brasília - UnB, Brasília, Federal District, Brazil
| | - Mara Behlau
- Human Communication Disorders, Universidade Federal de São Paulo -UNIFESP, São Paulo, Sao Paulo, Brazil; Centro de Estudos da Voz - CEV, São Paulo ,SP, Brazil
| |
Collapse
|
3
|
Zhao W, Singh R. Deriving Vocal Fold Oscillation Information from Recorded Voice Signals Using Models of Phonation. ENTROPY (BASEL, SWITZERLAND) 2023; 25:1039. [PMID: 37509986 PMCID: PMC10378572 DOI: 10.3390/e25071039] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/17/2023] [Revised: 07/03/2023] [Accepted: 07/07/2023] [Indexed: 07/30/2023]
Abstract
During phonation, the vocal folds exhibit a self-sustained oscillatory motion, which is influenced by the physical properties of the speaker's vocal folds and driven by the balance of bio-mechanical and aerodynamic forces across the glottis. Subtle changes in the speaker's physical state can affect voice production and alter these oscillatory patterns. Measuring these can be valuable in developing computational tools that analyze voice to infer the speaker's state. Traditionally, vocal fold oscillations (VFOs) are measured directly using physical devices in clinical settings. In this paper, we propose a novel analysis-by-synthesis approach that allows us to infer the VFOs directly from recorded speech signals on an individualized, speaker-by-speaker basis. The approach, called the ADLES-VFT algorithm, is proposed in the context of a joint model that combines a phonation model (with a glottal flow waveform as the output) and a vocal tract acoustic wave propagation model such that the output of the joint model is an estimated waveform. The ADLES-VFT algorithm is a forward-backward algorithm which minimizes the error between the recorded waveform and the output of this joint model to estimate its parameters. Once estimated, these parameter values are used in conjunction with a phonation model to obtain its solutions. Since the parameters correlate with the physical properties of the vocal folds of the speaker, model solutions obtained using them represent the individualized VFOs for each speaker. The approach is flexible and can be applied to various phonation models. In addition to presenting the methodology, we show how the VFOs can be quantified from a dynamical systems perspective for classification purposes. Mathematical derivations are provided in an appendix for better readability.
Collapse
Affiliation(s)
- Wayne Zhao
- Department of Electrical and Computer Engineering, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | - Rita Singh
- School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| |
Collapse
|
4
|
Ikuma T, McWhorter AJ, Adkins L, Kunduk M. Investigation of Vocal Bifurcations and Voice Patterns Induced by Asymmetry of Pathological Vocal Folds. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2023; 66:48-60. [PMID: 36472934 DOI: 10.1044/2022_jslhr-21-00499] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/17/2023]
Abstract
PURPOSE Vocal fold asymmetry creates irregular entrainments and modulations in voice, which may lead to rough perceptual quality. The presence of asymmetry can also cause mid-phonation bifurcations where a small change in the phonatory system causes a drastic change in vibration pattern, resulting in transitions in and out of rough voice. This study surveys sustained phonation recordings of speakers with the diagnoses of vocal fold polyp or unilateral vocal fold paralysis to investigate the resulting voice patterns. METHOD This retrospective study observed 71 sustained phonation recordings from 48 patients. Segments with distinctive signal patterns were identified within each recording with narrowband spectrogram and computer-assisted analysis of spectral peaks. RESULTS Phonation segmentation yielded 240 segments across all the recordings. Five voice patterns were recognized: (regularly or irregularly) entrained, modulated, uncoupled, unstable, and pulsed. Thirty-six patients (75%) exhibited irregular patterns. No single irregular pattern lasted for the entire phonation and was always accompanied by at least one mid-phonation bifurcation. Durations of the irregular segments (M = 0.4 s) were significantly shorter than the segments with the regular pattern (M = 1.4 s). CONCLUSIONS The results suggest that vocal fold pathology frequently introduces dynamic vibratory patterns that affect both the acoustic signals and perceptions. Due to these abnormalities, it is important for clinical voice assessment protocols, both perceptual and acoustic, to account for these possible bifurcations, irregular signal patterns, and their tendencies.
Collapse
Affiliation(s)
- Takeshi Ikuma
- Department of Otolaryngology-Head and Neck Surgery, Louisiana State University Health Sciences Center, New Orleans
- Voice Center, The Our Lady of the Lake Regional Medical Center, Baton Rouge, LA
| | - Andrew J McWhorter
- Department of Otolaryngology-Head and Neck Surgery, Louisiana State University Health Sciences Center, New Orleans
- Voice Center, The Our Lady of the Lake Regional Medical Center, Baton Rouge, LA
| | - Lacey Adkins
- Department of Otolaryngology-Head and Neck Surgery, Louisiana State University Health Sciences Center, New Orleans
- Voice Center, The Our Lady of the Lake Regional Medical Center, Baton Rouge, LA
| | - Melda Kunduk
- Department of Otolaryngology-Head and Neck Surgery, Louisiana State University Health Sciences Center, New Orleans
- Voice Center, The Our Lady of the Lake Regional Medical Center, Baton Rouge, LA
- Department of Communication Sciences & Disorders, Louisiana State University, Baton Rouge
| |
Collapse
|
5
|
Aichinger P, Kumar SP, Lehoux S, Švec JG. Simulated Laryngeal High-Speed Videos for the Study of Normal and Dysphonic Vocal Fold Vibration. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2022; 65:2431-2445. [PMID: 35772399 DOI: 10.1044/2022_jslhr-21-00673] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
PURPOSE Laryngeal high-speed videoendoscopy (LHSV) has been recognized as a highly valuable modality for the scientific investigations of vocal fold (VF) vibrations. In contrast to stroboscopic imaging, LHSV enables visualizing aperiodic VF vibrations. However, the technique is less well established in the clinical care of disordered voices, partly because the properties of aperiodic vibration patterns are not yet described comprehensively. To address this, a computer model for simulation of VF vibration patterns observed in a variety of different phonation types is proposed. METHOD A previously published kinematic model of mucosal wave phenomena is generalized to be capable of left-right asymmetry and to simulate endoscopic videos instead of only kymograms of VF vibrations at single sagittal positions. The most influential control parameters are the glottal halfwidths, the oscillation frequencies, the amplitudes, and the phase delays. RESULTS The presented videos demonstrate zipper-like vibration, pressed voice, voice onset, constant and time-varying left-right and anterior-posterior phase differences, as well as left-right frequency differences of the VF vibration. Video frames, videokymograms, phonovibrograms, glottal area waveforms, and waveforms of VF contact area relating to electroglottograms are shown, as well as selected kinematic parameters. CONCLUSION The presented videos demonstrate the ability to produce vibration patterns that are similar to those typically seen in endoscopic videos obtained from vocally healthy and dysphonic speakers. SUPPLEMENTAL MATERIAL https://doi.org/10.23641/asha.20151833.
Collapse
Affiliation(s)
- Philipp Aichinger
- Division of Phoniatrics-Logopedics, Department of Otorhinolaryngology, Medical University of Vienna, Austria
| | - S Pravin Kumar
- Department of Biomedical Engineering, Sri Sivasubramaniya Nadar College of Engineering, Chennai, India
| | - Sarah Lehoux
- Voice Research Laboratory, Department of Experimental Physics, Faculty of Science, Palacký University, Olomouc, Czech Republic
| | - Jan G Švec
- Voice Research Laboratory, Department of Experimental Physics, Faculty of Science, Palacký University, Olomouc, Czech Republic
- Voice and Hearing Centre Prague, Medical Healthcom, Ltd., Czech Republic
| |
Collapse
|
6
|
Stewart ME, Erath BD. Investigating blunt force trauma to the larynx: The role of inferior-superior vocal fold displacement on phonation. J Biomech 2021; 121:110377. [PMID: 33819698 DOI: 10.1016/j.jbiomech.2021.110377] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2020] [Revised: 02/24/2021] [Accepted: 03/01/2021] [Indexed: 11/26/2022]
Abstract
Blunt force trauma to the larynx, which may result from motor vehicle collisions, sports activities, etc., can cause significant damage, often leading to displaced fractures of the laryngeal cartilages, thereby disrupting vocal function. Current surgical interventions primarily focus on airway restoration to stabilize the patient, with restoration of vocal function usually being a secondary consideration. Due to laryngeal fracture, asymmetric vertical misalignment of the left or right vocal fold (VF) in the inferior-superior direction often occurs. This affects VF closure and can lead to a weak, breathy voice requiring increased vocal effort. It is unclear, however, how much vertical VF misalignment can be tolerated before voice quality degrades significantly. To address this need, the influence of inferior-superior VF displacement on phonation is investigated in 1.0mm increments using synthetic, self-oscillating VF models in a physiologically-representative facility. Acoustic (SPL, frequency, H1-H2, jitter, and shimmer), kinematic (amplitude and phase differences), and aerodynamic parameters (flow rate and subglottal pressure) are investigated as a function of inferior-superior vertical displacement. Significant findings include that once the inferior-superior medial length of the VF is surpassed, sustained phonation degrades precipitously, becoming severely pathological. If laryngeal reconstruction approaches can ensure VF contact is maintained during phonation (i.e., vertical displacement doesn't surpass VF medial length), improved vocal outcomes are expected.
Collapse
Affiliation(s)
- Molly E Stewart
- Department of Mechanical and Aeronautical Engineering, Clarkson University, 8 Clarkson Ave, Potsdam, NY 13699, United States
| | - Byron D Erath
- Department of Mechanical and Aeronautical Engineering, Clarkson University, 8 Clarkson Ave, Potsdam, NY 13699, United States.
| |
Collapse
|
7
|
Drioli C, Aichinger P. Modelling sagittal and vertical phase differences in a lumped and distributed elements vocal fold model. Biomed Signal Process Control 2021. [DOI: 10.1016/j.bspc.2020.102309] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
|
8
|
|
9
|
Bouvet A, Tokuda I, Pelorson X, Van Hirtum A. Influence of level difference due to vocal folds angular asymmetry on auto-oscillating replicas. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2020; 147:1136. [PMID: 32113292 DOI: 10.1121/10.0000742] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/11/2019] [Accepted: 01/28/2020] [Indexed: 06/10/2023]
Abstract
Dysphonia is often caused by level difference between left and right vocal folds, which are positioned on different angles with respect to the transverse plane, resulting in angular asymmetry. Unilateral vocal fold paralysis may cause such angular asymmetry. In this case, the normal vocal fold is located on the transverse plane, whereas the paralyzed vocal fold is rotated in the sagittal plane as its posterior edge is moved up to the superior direction. The effect of such angular asymmetry (up to 25°) between the left and right vocal fold on the auto-oscillation is experimentally studied using mechanical replicas. For all replicas, it is observed that, as full contact between vocal folds is lost, increase of angular asymmetry results in a decrease of the signal-to-noise ratio, an increase of the total harmonic distortion rate, and an increase of the oscillation threshold pressure. These general tendencies are in agreement with clinical findings reported for vertical level difference during phonation. In analogy to the preceding experimental study in which vocal folds are spaced in parallel with a vertical trade-off, a formula is proposed to describe the oscillation threshold as a function of angular asymmetry.
Collapse
Affiliation(s)
- Anne Bouvet
- LEGI, UMR CNRS 5519, Grenoble Alpes University, France
| | - Isao Tokuda
- Department of Mechanical Engineering, Ritsumeikan University, Nojihigashi, Kusatsu, Shiga 525-8577, Japan
| | | | | |
Collapse
|
10
|
Migimatsu K, Tokuda IT. Experimental study on nonlinear source-filter interaction using synthetic vocal fold models. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2019; 146:983. [PMID: 31472538 DOI: 10.1121/1.5120618] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/12/2018] [Accepted: 07/18/2019] [Indexed: 06/10/2023]
Abstract
Under certain conditions, e.g., singing voice, the fundamental frequency of the vocal folds can go up and interfere with the formant frequencies. Acoustic feedback from the vocal tract filter to the vocal fold source then becomes strong and non-negligible. An experimental study was presented on such source-filter interaction using three types of synthetic vocal fold models. Asymmetry was also created between the left and right vocal folds. The experiment reproduced various nonlinear phenomena, such as frequency jump and quenching, as reported in humans. Increase in phonation threshold pressure was also observed when resonant frequency of the vocal tract and fundamental frequency of the vocal folds crossed each other. As a combined effect, the phonation threshold pressure was further increased by the left-right asymmetry. Simulation of the asymmetric two-mass model reproduced the experiments to some extent. One of the intriguing findings of this study is the variable strength of the source-filter interaction over different model types. Among the three models, two models were strongly influenced by the vocal tract, while no clear effect of the vocal tract was observed in the other model. This implies that the level of source-filter interaction may vary considerably from one subject to another in humans.
Collapse
Affiliation(s)
- Kishin Migimatsu
- Graduate School of Science and Engineering, Ritsumeikan University, Noji-higashi, Kusatsu, Shiga 525-8577, Japan
| | - Isao T Tokuda
- Graduate School of Science and Engineering, Ritsumeikan University, Noji-higashi, Kusatsu, Shiga 525-8577, Japan
| |
Collapse
|
11
|
Cohen JT, Cohen A, Benyamini L, Adi Y, Keshet J. Predicting glottal closure insufficiency using fundamental frequency contour analysis. Head Neck 2019; 41:2324-2331. [PMID: 30763459 DOI: 10.1002/hed.25709] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2018] [Revised: 12/27/2018] [Accepted: 01/29/2019] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Voice analysis has a limited role in a day-to-day voice clinic. We developed objective measurements of vocal folds (VF) glottal closure insufficiency (GCI) during phonation. METHODS We examined 18 subjects with no history of voice impairment and 20 patients with unilateral VF paralysis before and after injection medialization laryngoplasty. Voice analysis was extracted. We measured settling time, slope, and area under the fundamental frequency curve from the phonation onset to its settling-time. RESULTS The measured parameters, settling time, slope, and area under the curve were in correlation with the traditional acoustic voice assessments and clinical findings before treatment and after injection medialization laryngoplasty. CONCLUSION We found that the fundamental frequency curve has several typical contours which correspond to different glottal closure conditions. We proposed a new set of parameters that captures the contour type, and showed that they could be used to quantitatively assess individuals with GCI.
Collapse
Affiliation(s)
- Jacob T Cohen
- Department of Otolaryngology Head and Neck Surgery, Rambam Health Care Campus, Haifa, Israel
| | - Alma Cohen
- Berglass School of Economics, Tel-Aviv University, Tel Aviv, Israel
| | - Limor Benyamini
- Department of Otolaryngology Head and Neck Surgery, Rambam Health Care Campus, Haifa, Israel
| | - Yossi Adi
- Department of Computer Science, Bar-Ilan University, Ramat-Gan, Israel
| | - Joseph Keshet
- Department of Computer Science, Bar-Ilan University, Ramat-Gan, Israel
| |
Collapse
|
12
|
Tokuda IT, Shimamura R. Effect of level difference between left and right vocal folds on phonation: Physical experiment and theoretical study. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2017; 142:482. [PMID: 28863607 DOI: 10.1121/1.4996105] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]
Abstract
As an alternative factor to produce asymmetry between left and right vocal folds, the present study focuses on level difference, which is defined as the distance between the upper surfaces of the bilateral vocal folds in the inferior-superior direction. Physical models of the vocal folds were utilized to study the effect of the level difference on the phonation threshold pressure. A vocal tract model was also attached to the vocal fold model. For two types of different models, experiments revealed that the phonation threshold pressure tended to increase as the level difference was extended. Based upon a small amplitude approximation of the vocal fold oscillations, a theoretical formula was derived for the phonation threshold pressure. This theory agrees with the experiments, especially when the phase difference between the left and right vocal folds is not extensive. Furthermore, an asymmetric two-mass model was simulated with a level difference to validate the experiments as well as the theory. The primary conclusion is that the level difference has a potential effect on voice production especially for patients with an extended level of vertical difference in the vocal folds, which might be taken into account for the diagnosis of voice disorders.
Collapse
Affiliation(s)
- Isao T Tokuda
- Graduate School of Science and Engineering, Ritsumeikan University, Noji-higashi, Kusatsu, Shiga 525-8577, Japan
| | - Ryo Shimamura
- Graduate School of Science and Engineering, Ritsumeikan University, Noji-higashi, Kusatsu, Shiga 525-8577, Japan
| |
Collapse
|
13
|
Perceptual Error Analysis of Human and Synthesized Voices. J Voice 2017; 31:516.e5-516.e18. [DOI: 10.1016/j.jvoice.2016.12.015] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2016] [Accepted: 12/21/2016] [Indexed: 11/19/2022]
|
14
|
Luizard P, Pelorson X. Threshold of oscillation of a vocal fold replica with unilateral surface growths. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2017; 141:3050. [PMID: 28599523 DOI: 10.1121/1.4979935] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]
Abstract
Among vocal fold diseases, the presence of a surface growth is often encountered and can be considered a public health issue. While more energy is required to achieve phonation than in healthy cases, this situation can lead to a wide range of voice perturbations, from a change of voice quality to aphonia. The present study aims at providing finer comprehension of the physical phenomena underlying this type of pathological phonation process. A vocal fold replica is used to perform measurements of mechanical responses of each vocal fold as well as of the subglottal pressure in both healthy and pathological configurations. Besides these physical measurements, a theoretical model is derived, using the one-mass-delayed model involving asymmetry of mass and geometry in order to simulate pressure signals. The theoretical model parameters are determined according to mechanical measurements on the replica. Results from measurements and simulations show that this unique vocal fold replica behaves in a manner comparable to clinical observations. The energy required to produce sound increases in the presence of a growth as well as with the size of the growth. Further investigation tends to show that the contact of the growth on the opposite vocal fold, considered as additional damping, plays a critical role.
Collapse
Affiliation(s)
- Paul Luizard
- GIPSA-lab UMR 5216, 11 rue des mathématiques, BP 46, 38402 Saint-Martin-D'Hères, France
| | - Xavier Pelorson
- GIPSA-lab UMR 5216, 11 rue des mathématiques, BP 46, 38402 Saint-Martin-D'Hères, France
| |
Collapse
|
15
|
Sorokin VN, Leonov AS. Determination of a vocal source by the spectral ratio method. PATTERN RECOGNITION AND IMAGE ANALYSIS 2017. [DOI: 10.1134/s105466181701014x] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
|
16
|
Zhang Z. Mechanics of human voice production and control. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2016; 140:2614. [PMID: 27794319 PMCID: PMC5412481 DOI: 10.1121/1.4964509] [Citation(s) in RCA: 164] [Impact Index Per Article: 20.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/07/2023]
Abstract
As the primary means of communication, voice plays an important role in daily life. Voice also conveys personal information such as social status, personal traits, and the emotional state of the speaker. Mechanically, voice production involves complex fluid-structure interaction within the glottis and its control by laryngeal muscle activation. An important goal of voice research is to establish a causal theory linking voice physiology and biomechanics to how speakers use and control voice to communicate meaning and personal information. Establishing such a causal theory has important implications for clinical voice management, voice training, and many speech technology applications. This paper provides a review of voice physiology and biomechanics, the physics of vocal fold vibration and sound production, and laryngeal muscular control of the fundamental frequency of voice, vocal intensity, and voice quality. Current efforts to develop mechanical and computational models of voice production are also critically reviewed. Finally, issues and future challenges in developing a causal theory of voice production and perception are discussed.
Collapse
Affiliation(s)
- Zhaoyan Zhang
- Department of Head and Neck Surgery, University of California, Los Angeles, 31-24 Rehabilitation Center, 1000 Veteran Avenue, Los Angeles, California 90095-1794, USA
| |
Collapse
|
17
|
Alonso RG, Kopuchian C, Amador A, Suarez MDLA, Tubaro PL, Mindlin GB. Difference between the vocalizations of two sister species of pigeons explained in dynamical terms. J Comp Physiol A Neuroethol Sens Neural Behav Physiol 2016; 202:361-70. [PMID: 27033354 DOI: 10.1007/s00359-016-1082-3] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2015] [Revised: 03/07/2016] [Accepted: 03/09/2016] [Indexed: 11/30/2022]
Abstract
Vocal communication is an unique example, where the nonlinear nature of the periphery can give rise to complex sounds even when driven by simple neural instructions. In this work we studied the case of two close-related bird species, Patagioenas maculosa and Patagioenas picazuro, whose vocalizations differ only in the timbre. The temporal modulation of the fundamental frequency is similar in both cases, differing only in the existence of sidebands around the fundamental frequency in the P. maculosa. We tested the hypothesis that the qualitative difference between these vocalizations lies in the nonlinear nature of the syrinx. In particular, we propose that the roughness of maculosa's vocalizations is due to an asymmetry between the right and left vibratory membranes, whose nonlinear dynamics generate the sound. To test the hypothesis, we generated a biomechanical model for vocal production with an asymmetric parameter Q with which we can control the level of asymmetry between these membranes. Using this model we generated synthetic vocalizations with the principal acoustic features of both species. In addition, we confirmed the anatomical predictions by making post mortem inspection of the syrinxes, showing that the species with tonal song (picazuro) has a more symmetrical pair of membranes compared to maculosa.
Collapse
Affiliation(s)
- R Gogui Alonso
- Department of Physics, FCEN, University of Buenos Aires and IFIBA, CONICET, Buenos Aires, Argentina.
| | - Cecilia Kopuchian
- División Ornitología, Museo Argentino de Ciencias Naturales "Bernardino Rivadavia" (MACN - CONICET), Avenida Ángel Gallardo 470, C1405DJR, Buenos Aires, Argentina.,Laboratorio de Biología de la Conservación, Centro de Ecología Aplicada del Litoral (CECOAL - CONICET), Ruta Provincial 5, Km 2.5, CEP 3400, Corrientes, Argentina
| | - Ana Amador
- Department of Physics, FCEN, University of Buenos Aires and IFIBA, CONICET, Buenos Aires, Argentina
| | | | - Pablo L Tubaro
- División Ornitología, Museo Argentino de Ciencias Naturales "Bernardino Rivadavia" (MACN - CONICET), Avenida Ángel Gallardo 470, C1405DJR, Buenos Aires, Argentina
| | - Gabriel B Mindlin
- Department of Physics, FCEN, University of Buenos Aires and IFIBA, CONICET, Buenos Aires, Argentina
| |
Collapse
|