1
|
Mohd Khairuddin KA, Ahmad K, Proehoeman SC, Mohd Ibrahim H, Yan Y. Preliminary Findings of Vocal Fold Vibratory Characteristics of Singers Analyzed by Laryngeal High-Speed Videoendoscopy. J Voice 2024:S0892-1997(24)00173-5. [PMID: 38902142 DOI: 10.1016/j.jvoice.2024.06.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2024] [Revised: 05/31/2024] [Accepted: 06/01/2024] [Indexed: 06/22/2024]
Abstract
OBJECTIVES This study investigates the vocal fold vibratory dynamics of singers, which are postulated to differ from those of normal speakers due to the singers' regular vocal training. The measurement of vocal fold vibration involved the utilization of laryngeal high-speed videoendoscopy (LHSV) and subsequent LHSV-based analysis. The focus of the present study is to characterize and compare the LHSV-based measures derived from the glottal area waveform (GAW), namely fundamental frequency (F0GAW), glottal perturbation (jitterGAW and shimmerGAW), open quotient (OQGAW), and Nyquist plots, between singers and normal speakers across genders. METHODS Participants comprised 13 singers from a local cultural and heritage academy and 56 normal speakers from a local university, all were evaluated to have normal voices. Each participant underwent LHSV procedures to capture images of vocal fold vibration, which were subsequently analyzed to generate the LHSV-based measures. RESULTS Male singers exhibited lower F0GAW, jitterGAW, shimmerGAW, and OQGAW than female singers. When compared to normal speakers, male singers demonstrated higher F0GAW, and lower jitterGAW and shimmerGAW. No difference in OQGAW was found between male singers and normal speakers. Female singers exhibited lower jitterGAW compared to normal speakers, but no differences were observed in shimmerGAW and OQGAW. The results of Nyquist plots indicated no gender-related associations between types of rim width and among singers. However, for rim pattern, male singers were associated with a higher percentage of clustered rim, suggesting more regular vocal fold vibration, compared to female singers and normal male speakers. CONCLUSIONS Singers, particularly male singers, demonstrate distinct and potentially superior vocal fold vibrations compared to normal speakers, likely attributed to their regular vocal training, resulting in refined vocal fold configurations even during speaking. Despite the limited sample of singers, the study offers valuable insights into the vocal fold vibratory behaviors in singers analyzed using LHSV.
Collapse
Affiliation(s)
- Khairy Anuar Mohd Khairuddin
- Speech Pathology Program, School of Health Sciences, Universiti Sains Malaysia, Kubang Kerian, Kelantan, Malaysia; Speech Sciences Program, Faculty of Health Sciences, Universiti Kebangsaan Malaysia, Kuala Lumpur, Malaysia.
| | - Kartini Ahmad
- Speech Sciences Program, Faculty of Health Sciences, Universiti Kebangsaan Malaysia, Kuala Lumpur, Malaysia
| | | | - Hasherah Mohd Ibrahim
- Speech Sciences Program, Faculty of Health Sciences, Universiti Kebangsaan Malaysia, Kuala Lumpur, Malaysia
| | - Yuling Yan
- Department of Bioengineering, School of Engineering, Santa Clara University, Santa Clara, California
| |
Collapse
|
2
|
Grama M, Barrichelo-Lindström V, Englert M, Kinghorn D, Behlau M. Resonant Voice: Perceptual and Acoustic Analysis After an Intensive Lessac Kinesensic Training Workshop. J Voice 2024; 38:541.e1-541.e12. [PMID: 34785115 DOI: 10.1016/j.jvoice.2021.09.021] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2021] [Revised: 09/10/2021] [Accepted: 09/10/2021] [Indexed: 11/28/2022]
Abstract
OBJECTIVE To investigate whether actors' and actresses' voices post an Intensive Lessac Kinesensic Training Workshop - ILKTW can be perceptually identified as more resonant, and to assess if the Acoustic Voice Quality Index (AVQI), the Acoustic Breathing Index (ABI), and their acoustic measures are able to indicate the classified voices as more resonant. METHODS Eight vocally healthy English first-language actors and actresses, participants of the same ILKTW, were asked to sustain the vowel /a/ and to read a piece of the Rainbow Passage, pre and postworkshop, at a self-selected habitual frequency and intensity. The readings were divided into initial and final parts, both with 34 syllables, and combined with the vowel /a/ (3s). The pre and postworkshop recordings (pairs of the initial and final parts followed by the /a/ vowel) of each participant were randomly presented to a voice expert who rated the samples as more resonant or similar. The same samples were analyzed using the AVQI and ABI scripts. RESULTS 87,5% of the initial parts and 100% of the final in postworkshop were rated by the voice expert as more resonant. Tilt was higher for the postworkshop initial parts (P = 0.036), but a correlation with the perceptual rating wasn't found. HNR was higher (P = 0.018) and jitter was lower (P = 0.017) for the postworkshop final parts. The statistical analysis comparing the perceptive and acoustic data for the final samples couldn't be applied. CONCLUSION It seems that the ILKTW has a positive impact on the development of a resonant voice and that the perceptual auditory rating was more effective to describe resonant voices than the AVQI, ABI, and its acoustic measures.
Collapse
Affiliation(s)
- Marilene Grama
- Centro de Estudos da Voz - CEV, 361 Machado Bittencourt street, 10th floor. São Paulo, São Paulo, 04044-001, Brazil.
| | - Viviane Barrichelo-Lindström
- Centro de Estudos da Voz - CEV, 361 Machado Bittencourt street, 10th floor. São Paulo, São Paulo, 04044-001, Brazil
| | - Marina Englert
- Centro de Estudos da Voz - CEV, 361 Machado Bittencourt street, 10th floor. São Paulo, São Paulo, 04044-001, Brazil
| | - Deborah Kinghorn
- Lessac Training and Research Institute, 60 Seaman Avenue, New York, NY, 10034, United States
| | - Mara Behlau
- Centro de Estudos da Voz - CEV, 361 Machado Bittencourt street, 10th floor. São Paulo, São Paulo, 04044-001, Brazil
| |
Collapse
|
3
|
Shu M, Zhang Y, Jiang JJ. The Effect of Mandarin Vowels on Acoustic Analysis: A Prospective Observational Study. J Voice 2022:S0892-1997(22)00104-7. [PMID: 35508424 DOI: 10.1016/j.jvoice.2022.03.028] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2022] [Revised: 03/30/2022] [Accepted: 03/30/2022] [Indexed: 10/18/2022]
Abstract
OBJECTIVES Although vowels are of interest for acoustic analysis in clinics, there is no consensus regarding the effect of vowel selection on acoustic perturbation parameters. This study aimed to reveal the effects of Mandarin vowels on acoustic measurements. STUDY DESIGN A prospective observational study. METHODS This prospective observational study enrolled normal phonation Mandarin speakers at the Otolaryngology Department of the Eye & ENT Hospital affiliated with Fudan University from December 2020 to August 2021. This study recruited 107 normal-voiced Mandarin speakers (59 women and 49 men) with a median age of 26 (22, 33) years old. The objective measures included traditional acoustic parameters (fundamental frequency, harmonic-to-noise ratio, percent jitter, and percent shimmer) and cepstral analysis (smoothed cepstral peak prominence) of six Mandarin vowels (ɑ /a/, o /o/, e /ɤ/, i /i/, u /u/, ü /y/). RESULTS The acoustic analysis revealed no significant differences in the fundamental frequency among vowels. The low vowel /a/ had the highest values for percent jitter and percent shimmer and the lowest harmonic-to-noise ratio value. The back vowel /u/ had the lowest cepstral measures (P < 0.05). CONCLUSIONS The acoustic analysis significantly varied across the different Mandarin vowels, and these differences must be considered for the effective clinical application of objective evaluations.
Collapse
Affiliation(s)
- Min Shu
- Eye & ENT Hospital of Fudan University, Department of Otolaryngology Head & Neck Surgery, China.
| | - Yi Zhang
- Eye & ENT Hospital of Fudan University, Department of Otolaryngology Head & Neck Surgery, China
| | - Jack J Jiang
- Eye & ENT Hospital of Fudan University, Department of Otolaryngology Head & Neck Surgery, China; Otolaryngology-Head and Neck Surgery, Department of Surgery, University of Wisconsin School of Medicine and Public Health, Madison, Wisconsin
| |
Collapse
|
4
|
Look C, McCabe P, Heard R, Madill CJ. Show and Tell: Video Modeling and Instruction Without Feedback Improves Performance but Is Not Sufficient for Retention of a Complex Voice Motor Skill. J Voice 2019; 33:239-249. [DOI: 10.1016/j.jvoice.2017.09.023] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2017] [Revised: 09/26/2017] [Accepted: 09/27/2017] [Indexed: 10/18/2022]
|
5
|
Ehrlich B, Lin L, Jiang J. Concatenation of the Moving Window Technique for Auditory-Perceptual Analysis of Voice Quality. AMERICAN JOURNAL OF SPEECH-LANGUAGE PATHOLOGY 2018; 27:1426-1433. [PMID: 30304342 PMCID: PMC6436458 DOI: 10.1044/2018_ajslp-17-0103] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/17/2017] [Revised: 12/08/2017] [Accepted: 05/13/2018] [Indexed: 06/08/2023]
Abstract
PURPOSE The purpose of this study is to develop a program to concatenate acoustic vowel segments that were selected with the moving window technique, a previously developed technique used to segment and select the least perturbed segment from a sustained vowel segment. The concatenated acoustic segments were compared with the nonconcatenated, short, individual acoustic segments for their ability to differentiate normal and pathological voices. The concatenation process sometimes created a clicking noise or beat, which was also analyzed to determine any confounding effects. METHOD A program was developed to concatenate the moving window segments. Listeners with no previous rating experience were trained and, then, rated 20 normal and 20 pathological voice segments, both concatenated (2 s) and short (0.2 s) for a total of 80 segments. Listeners evaluated these segments on both the Grade, Roughness, Breathiness, Asthenia, and Strain scale (GRBAS; 8 listeners) and the Consensus Auditory-Perceptual Evaluation of Voice (Kempster, Gerratt, Abbott, Barkmeier-Kraemer, & Hillman, 2009) scale (7 listeners). The sensitivity and specificity of these ratings were analyzed using a receiver-operating characteristic curve. To evaluate if there were increases in particular criteria due to the beat, differences between beat and nonbeat ratings were compared using a 2-tailed analysis of variance. RESULTS Concatenated segments had a higher sensitivity and specificity for distinguishing pathological and normal voices than short segments. Compared with nonbeat segments, the beat had statistically similar increases for all criteria across Consensus Auditory-Perceptual Evaluation of Voice and GRBAS scales, except pitch and loudness. CONCLUSIONS The concatenated moving window method showed improved sensitivity and specificity for detecting voice disorders using auditory-perceptual analysis, compared with the short moving window segment. It is a helpful tool for perceptual analytic protocols, allowing for voice evaluation using standardized and automated voice-segmenting procedures. SUPPLEMENTAL MATERIAL https://doi.org/10.23641/asha.7100951.
Collapse
Affiliation(s)
- Benjamin Ehrlich
- School of Medicine and Public Health, University of Wisconsin–Madison
| | - Liyu Lin
- School of Medicine and Public Health, University of Wisconsin–Madison
| | - Jack Jiang
- School of Medicine and Public Health, University of Wisconsin–Madison
| |
Collapse
|
6
|
Van Stan JH, Park SW, Jarvis M, Mehta DD, Hillman RE, Sternad D. Measuring vocal motor skill with a virtual voice-controlled slingshot. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2017; 142:1199. [PMID: 28964079 PMCID: PMC5648563 DOI: 10.1121/1.5000233] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/09/2017] [Revised: 08/08/2017] [Accepted: 08/09/2017] [Indexed: 05/30/2023]
Abstract
Successful voice training (e.g., singing lessons) and vocal rehabilitation (e.g., therapy for a voice disorder) involve learning complex, vocal behaviors. However, there are no metrics describing how humans learn new vocal skills or predicting how long the improved behavior will persist post-therapy. To develop measures capable of describing and predicting vocal motor learning, a theory-based paradigm from limb motor control inspired the development of a virtual task where subjects throw projectiles at a target via modifications in vocal pitch and loudness. Ten subjects with healthy voices practiced this complex vocal task for five days. The many-to-one mapping between the execution variables pitch and loudness and resulting target error was evaluated using an analysis that quantified distributional properties of variability: Tolerance, noise, covariation costs (TNC costs). Lag-1 autocorrelation (AC1) and detrended-fluctuation-analysis scaling index (SCI) analyzed temporal aspects of variability. Vocal data replicated limb-based findings: TNC costs were positively correlated with error; AC1 and SCI were modulated in relation to the task's solution manifold. The data suggests that vocal and limb motor learning are similar in how the learner navigates the solution space. Future work calls for investigating the game's potential to improve voice disorder diagnosis and treatment.
Collapse
Affiliation(s)
- Jarrad H Van Stan
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, Massachusetts 02114, USA
| | - Se-Woong Park
- Department of Biology, Northeastern University, Boston, Massachusetts 02115, USA
| | | | - Daryush D Mehta
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, Massachusetts 02114, USA
| | - Robert E Hillman
- Center for Laryngeal Surgery and Voice Rehabilitation, Massachusetts General Hospital, Boston, Massachusetts 02114, USA
| | - Dagmar Sternad
- Departments of Biology, Electrical and Computer Engineering, and Physics, Northeastern University, Boston, Massachusetts 02115, USA
| |
Collapse
|
7
|
Van Stan JH, Mehta DD, Sternad D, Petit R, Hillman RE. Ambulatory Voice Biofeedback: Relative Frequency and Summary Feedback Effects on Performance and Retention of Reduced Vocal Intensity in the Daily Lives of Participants With Normal Voices. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2017; 60:853-864. [PMID: 28329366 PMCID: PMC5548081 DOI: 10.1044/2016_jslhr-s-16-0164] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/22/2016] [Revised: 07/14/2016] [Accepted: 10/08/2016] [Indexed: 05/16/2023]
Abstract
Purpose Ambulatory voice biofeedback has the potential to significantly improve voice therapy effectiveness by targeting carryover of desired behaviors outside the therapy session (i.e., retention). This study applies motor learning concepts (reduced frequency and delayed, summary feedback) that demonstrate increased retention to ambulatory voice monitoring for training nurses to talk softer during work hours. Method Forty-eight nurses with normal voices wore the Voice Health Monitor (Mehta, Zañartu, Feng, Cheyne, & Hillman, 2012) for 6 days: 3 baseline days, 1 biofeedback day, 1 short-term retention day, and 1 long-term retention day. Participants were block-randomized into 3 different biofeedback groups: 100%, 25%, and Summary. Performance was measured in terms of compliance time below a participant-specific vocal intensity threshold. Results All participants exhibited a significant increase in compliance time (Cohen's d = 4.5) during biofeedback days compared with baseline days. The Summary feedback group exhibited statistically smaller performance reduction during both short-term (d = 1.14) and long-term (d = 1.04) retention days compared with the 100% feedback group. Conclusions These findings suggest that modifications in feedback frequency and timing affect retention of a modified vocal behavior in daily life. Future work calls for studying the potential beneficial impact of ambulatory voice biofeedback in participants with behaviorally based voice disorders.
Collapse
Affiliation(s)
- Jarrad H. Van Stan
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
- Harvard Medical School, Boston, MA
| | - Daryush D. Mehta
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
- Harvard Medical School, Boston, MA
| | | | | | - Robert E. Hillman
- Massachusetts General Hospital, Boston
- MGH Institute of Health Professions, Boston, MA
- Harvard Medical School, Boston, MA
| |
Collapse
|
8
|
Van Stan JH, Mehta DD, Petit RJ, Sternad D, Muise J, Burns JA, Hillman RE. Integration of Motor Learning Principles Into Real-Time Ambulatory Voice Biofeedback and Example Implementation Via a Clinical Case Study With Vocal Fold Nodules. AMERICAN JOURNAL OF SPEECH-LANGUAGE PATHOLOGY 2017; 26:1-10. [PMID: 28124070 PMCID: PMC5533549 DOI: 10.1044/2016_ajslp-15-0187] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/02/2015] [Revised: 03/18/2016] [Accepted: 06/01/2016] [Indexed: 05/28/2023]
Abstract
PURPOSE Ambulatory voice biofeedback (AVB) has the potential to significantly improve voice therapy effectiveness by targeting one of the most challenging aspects of rehabilitation: carryover of desired behaviors outside of the therapy session. Although initial evidence indicates that AVB can alter vocal behavior in daily life, retention of the new behavior after biofeedback has not been demonstrated. Motor learning studies repeatedly have shown retention-related benefits when reducing feedback frequency or providing summary statistics. Therefore, novel AVB settings that are based on these concepts are developed and implemented. METHOD The underlying theoretical framework and resultant implementation of innovative AVB settings on a smartphone-based voice monitor are described. A clinical case study demonstrates the functionality of the new relative frequency feedback capabilities. RESULTS With new technical capabilities, 2 aspects of feedback are directly modifiable for AVB: relative frequency and summary feedback. Although reduced-frequency AVB was associated with improved carryover of a therapeutic vocal behavior (i.e., reduced vocal intensity) in a patient post-excision of vocal fold nodules, causation cannot be assumed. CONCLUSIONS Timing and frequency of AVB schedules can be manipulated to empirically assess generalization of motor learning principles to vocal behavior modification and test the clinical effectiveness of AVB with various feedback schedules.
Collapse
Affiliation(s)
- Jarrad H Van Stan
- Massachusetts General Hospital, BostonMGH Institute of Health Professions, Boston, MA
| | - Daryush D Mehta
- Massachusetts General Hospital, BostonMGH Institute of Health Professions, Boston, MAHarvard Medical School, Boston, MA
| | | | | | | | - James A Burns
- Massachusetts General Hospital, BostonHarvard Medical School, Boston, MA
| | - Robert E Hillman
- Massachusetts General Hospital, BostonMGH Institute of Health Professions, Boston, MAHarvard Medical School, Boston, MA
| |
Collapse
|
9
|
Van Stan JH, Mehta DD, Hillman RE. The Effect of Voice Ambulatory Biofeedback on the Daily Performance and Retention of a Modified Vocal Motor Behavior in Participants With Normal Voices. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2015; 58:713-21. [PMID: 25765862 PMCID: PMC4492465 DOI: 10.1044/2015_jslhr-s-14-0159] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/10/2014] [Accepted: 01/10/2015] [Indexed: 05/13/2023]
Abstract
PURPOSE Ambulatory biofeedback has potential to improve carryover of newly established vocal motor behaviors into daily life outside of the clinic and warrants systematic research that is lacking in the literature. This proof-of-concept study was designed to establish an empirical basis for future work in this area by formally assessing whether ambulatory biofeedback reduces daily vocal intensity (performance) and the extent to which this change remains after biofeedback removal (retention). METHOD Six participants with normal voices wore the KayPENTAX Ambulatory Phonation Monitor for 3 baseline days followed by 4 days with biofeedback provided on odd days. RESULTS Compared to baseline days, participants exhibited a statistically significant decrease in mean vocal intensity (4.4 dB) and an increase in compliance (16.8 percentage points) when biofeedback was provided above a participant-specific intensity threshold. After biofeedback removal, mean vocal intensity and compliance reverted back to baseline levels. CONCLUSIONS These findings suggest that although current ambulatory biofeedback approaches have potential to modify a vocal motor behavior, the modified behavior may not be retained after biofeedback removal. Future work calls for the testing of more innovative ambulatory biofeedback approaches on the basis of motor control and learning theories to improve retention of a desired vocal motor behavior.
Collapse
|
10
|
Shu M, Jiang JJ, Willey M. The Effect of Moving Window on Acoustic Analysis. J Voice 2015; 30:5-10. [PMID: 25998407 DOI: 10.1016/j.jvoice.2014.11.008] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2014] [Accepted: 11/20/2014] [Indexed: 11/26/2022]
Abstract
OBJECTIVE To investigate the effects of the moving window method on acoustic measures and discrimination ability between normal and disordered voices. METHODS Fifty-three normal voices and 50 disordered voices were recruited. Three selection methods, the moving window method, the mid-vowel method, and the whole vowel method, were applied to each raw audio signal to determine the most stable segment of each signal. Acoustic parameters such as percent jitter, percent shimmer, signal-to-noise ratio (SNR), cepstral peak prominence (CPP), and correlation dimension (D2) were calculated. The Wilcoxon test was used to compare the stability of these segments across different methods. An artificial neural network was used for estimating how well disordered voices were discriminated from normal ones. RESULTS Segments selected using the moving window method were more stable than those selected using the other two methods, meaning lower perturbation and nonlinear dynamic measurements as well as higher SNR and CPP values. The discrimination accuracy rate for the moving window method was 91.90 ± 8.73%, whereas the mid-vowel method and the whole vowel method were 72.34 ± 12.94% and 70.34 ± 5.24%, respectively. CONCLUSION The moving window method is capable of providing a more stable audio segment and can discriminate disordered voices from normal ones more effectively.
Collapse
Affiliation(s)
- Min Shu
- Department of Otolaryngology Head & Neck Surgery, Eye, Ear, Nose and Throat Hospital, Fudan University, Shanghai, China
| | - Jack J Jiang
- Department of Otolaryngology Head & Neck Surgery, Eye, Ear, Nose and Throat Hospital, Fudan University, Shanghai, China; Division of Otolaryngology-Head and Neck Surgery, Department of Surgery, University of Wisconsin School of Medicine and Public Health, Madison, Wisconsin.
| | - Malachi Willey
- Division of Otolaryngology-Head and Neck Surgery, Department of Surgery, University of Wisconsin School of Medicine and Public Health, Madison, Wisconsin
| |
Collapse
|
11
|
Iwarsson J. Facilitating behavioral learning and habit change in voice therapy—theoretic premises and practical strategies. LOGOP PHONIATR VOCO 2014; 40:179-86. [DOI: 10.3109/14015439.2014.936498] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
|
12
|
Awan SN, Miesemer SA, Nicolia TA. An Examination of Intrasubject Variability on the Dysphonia Severity Index. J Voice 2012; 26:814.e21-5. [DOI: 10.1016/j.jvoice.2012.04.004] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2012] [Accepted: 04/18/2012] [Indexed: 10/28/2022]
|
13
|
Voice parameters and videonasolaryngoscopy in children with vocal nodules: a longitudinal study, before and after voice therapy. Int J Pediatr Otorhinolaryngol 2012; 76:1361-5. [PMID: 22770591 DOI: 10.1016/j.ijporl.2012.06.007] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/08/2012] [Revised: 06/12/2012] [Accepted: 06/14/2012] [Indexed: 11/22/2022]
Abstract
BACKGROUND Vocal Nodules (VN) are a functional voice disorder associated with voice misuse and abuse in children. There are few reports addressing vocal parameters in children with VN, especially after a period of vocal rehabilitation. OBJECTIVE The purpose of this study is to describe measurements of vocal parameters including Fundamental Frequency (FF), Shimmer (S), and Jitter (J), videonasolaryngoscopy examination and clinical perceptual assessment, before and after voice therapy in children with VN. Voice therapy was provided using visual support through Speech-Viewer software. MATERIALS AND METHODS Twenty patients with VN were studied. An acoustical analysis of voice was performed and compared with data from subjects from a control group matched by age and gender. Also, clinical perceptual assessment of voice and videonasolaryngoscopy were performed to all patients with VN. After a period of voice therapy, provided with visual support using Speech Viewer-III (SV-III-IBM) software, new acoustical analyses, perceptual assessments and videonasolaryngoscopies were performed. RESULTS Before the onset of voice therapy, there was a significant difference (p<0.05) in mean FF, S and J, between the patients with VN and subjects from the control group. After the voice therapy period, a significant improvement (p<0.05) was found in all acoustic voice parameters. Moreover, perceptual voice analysis demonstrated improvement in all cases. Finally, videonasolaryngoscopy demonstrated that vocal nodules were no longer discernible on the vocal folds in any of the cases. CONCLUSIONS SV-III software seems to be a safe and reliable method for providing voice therapy in children with VN. Acoustic voice parameters, perceptual data and videonasolaryngoscopy were significantly improved after the speech therapy period was completed.
Collapse
|
14
|
Chan KMK, Li M, Law TY, Yiu EML. Effects of immediate feedback on learning auditory perceptual voice quality evaluation. INTERNATIONAL JOURNAL OF SPEECH-LANGUAGE PATHOLOGY 2012; 14:363-369. [PMID: 22646315 DOI: 10.3109/17549507.2012.679746] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/01/2023]
Abstract
The study investigated the effect of immediate feedback in training listeners to perceive subtle differences in voice quality, a perceptual skill that is important for speech-language pathologists. Sixty naive listeners were randomly assigned to a feedback group (Group F), a no feedback group (Group NF), and a no training group acting as a control group (Group C). The task was to evaluate the severity of a perceptual voice quality (breathiness) by using a reference-matching paradigm. All participants took part in three rating sessions (pre-training, 2 days after training and 1 week after training). Group F and Group NF participated in a training session immediately after the first rating session, where Group F practiced with immediate feedback given and Group NF practice with no immediate feedback given. The results showed that Group F and Group NF had significant improvement after training, but Group F did not retain the improvement in the third rating session. The use of a reference-matching training paradigm without giving frequent immediate feedback is suggested for auditory-perceptual voice evaluation training. The most effective frequency of immediate feedback is yet to be determined.
Collapse
|
15
|
Prakup B. Acoustic Measures of the Voices of Older Singers and Nonsingers. J Voice 2012; 26:341-50. [DOI: 10.1016/j.jvoice.2011.05.007] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2011] [Accepted: 05/17/2011] [Indexed: 10/17/2022]
|
16
|
Choi SH, Lee J, Sprecher AJ, Jiang JJ. The effect of segment selection on acoustic analysis. J Voice 2011; 26:1-7. [PMID: 21889300 DOI: 10.1016/j.jvoice.2010.10.009] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2010] [Accepted: 10/11/2010] [Indexed: 11/29/2022]
Abstract
OBJECTIVE/HYPOTHESIS Acoustic analysis is a commonly used method for quantitatively measuring vocal fold function. Voice signals are analyzed by selecting a waveform segment and using various algorithms to arrive at parameters such as jitter, shimmer, and signal-to-noise ratio (SNR). Accurate and reliable methods for selecting a representative vowel segment have not been established. STUDY DESIGN Prospective repeated-measure experiment. METHODS We applied a moving window method by isolating consecutive, overlapping segments of the raw voice signal from onset through offset. Ten normal voice signals were analyzed using acoustic measures calculated from the moving window. The location and value of minimum perturbation/maximum SNR was compared across individuals. The moving window method was compared with data from the whole vowel excluding onset and offset, the mid-vowel, and the visually selected steadiest portion of the voice signal. RESULTS Results showed that the steadiest portion of the waveforms, as defined by minimum perturbation and maximum SNR values, was not consistent across individuals. Perturbation and nonlinear dynamic values differed significantly based on what segment of the waveform was used. Other commonly used segment selection methods resulted in significantly higher perturbation values and significantly lower SNR values than those determined by the moving window method (P<0.001). CONCLUSIONS The selection of a sample for acoustic analysis can introduce significant inconsistencies into the analysis procedure. The moving window technique may provide more accurate and reliable acoustic measures by objectively identifying the steadiest segment of the voice sample.
Collapse
Affiliation(s)
- Seong Hee Choi
- Department of Audiology and Speech-Language Pathology, Catholic University of Daegu, Gyeongsan, Korea
| | | | | | | |
Collapse
|
17
|
Olszewski AE, Shen L, Jiang JJ. Objective methods of sample selection in acoustic analysis of voice. Ann Otol Rhinol Laryngol 2011; 120:155-61. [PMID: 21510140 DOI: 10.1177/000348941112000303] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
OBJECTIVES In acoustic voice analysis, the fact that reproducible methods of sample selection have not been defined impedes research study generalizability and clinical assessment of treatment efficacy. Because perturbation results differ along a single signal, this study sought to establish objective methods of sample selection by use of a moving window to determine the most stable regions of phonation. METHODS Voice signals obtained from 21 patients affected by laryngeal conditions associated with Parkinson's disease were analyzed to study jitter, shimmer, signal-to-noise ratio, and correlation dimension parameters when various sample selection procedures were used. Objectively selected voice samples were chosen based upon 5%, 10%, and 20% variance from a signal's minimum perturbation value. The stability of these samples, defined by the standard deviations of the acoustic measurements, was compared to the stability of unselected samples and subjectively selected samples. RESULTS A significant decrease in standard deviation values of acoustic parameters was found in comparing the objectively selected samples (particularly those selected with 5% and 10% variance) to the subjectively selected and unselected samples. CONCLUSIONS These results suggest that the development of an objective sample selection method may have significant effects on the stability and reliability of acoustic voice measurements.
Collapse
Affiliation(s)
- Aleksandra E Olszewski
- Department of Surgery, Division of Otolaryngology-Head and Neck Surgery, University of Wisconsin School of Medicine and Public Health, Madison, Wisconsin 53706-1532, USA
| | | | | |
Collapse
|
18
|
Grillo EU, Fugowski J. Voice Characteristics of Female Physical Education Student Teachers. J Voice 2011; 25:e149-57. [DOI: 10.1016/j.jvoice.2009.12.001] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2009] [Accepted: 12/07/2009] [Indexed: 12/01/2022]
|
19
|
Brockmann M, Storck C, Carding PN, Drinnan MJ. Voice loudness and gender effects on jitter and shimmer in healthy adults. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2008; 51:1152-60. [PMID: 18664710 DOI: 10.1044/1092-4388(2008/06-0208)] [Citation(s) in RCA: 71] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/09/2023]
Abstract
PURPOSE The aim of this study was to investigate voice loudness and gender effects on jitter and shimmer in healthy young adults because previous descriptions have been inconsistent. METHOD Fifty-seven healthy adults (28 women, 29 men) aged 20-40 years were included in this cross-sectional single-cohort study. Three phonations of /a/ at soft, medium, and loud individual loudness were recorded and analyzed using PRAAT software (P. Boersma & D. Weeninkk, 2006). Voice loudness and gender effects on measured sound pressure level, fundamental frequency, jitter, and shimmer were assessed through the use of descriptive and inferential (analysis of variance) statistics. RESULTS Jitter and shimmer significantly increased with decreasing voice loudness, especially in phonations below 75 dB and 80 dB. In soft and medium phonation, men were generally louder and showed significantly less shimmer. However, men had higher jitter measures when phonating softly. Gender differences in jitter and shimmer at medium loudness may be mainly linked to different habitual voice loudness levels. CONCLUSION This pragmatic study shows significant voice loudness and gender effects on perturbation. In clinical assessment, requesting phonations above 80 dB at comparable loudness between genders would enhance measurement reliability. However, voice loudness and gender effects in other age groups, in disordered voices, or when a minimal loudness is requested should be further investigated.
Collapse
Affiliation(s)
- Meike Brockmann
- Head of Speech Pathology Section, Clinic of Otorhinolaryngology, Head and Neck Surgery, University Hospital Zurich, Frauenklinikstrasse 24, 8091 Zürich, Switzerland.
| | | | | | | |
Collapse
|
20
|
Moura CP, Cunha LM, Vilarinho H, Cunha MJ, Freitas D, Palha M, Pueschel SM, Pais-Clemente M. Voice Parameters in Children With Down Syndrome. J Voice 2008; 22:34-42. [PMID: 17014985 DOI: 10.1016/j.jvoice.2006.08.011] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2006] [Accepted: 08/16/2006] [Indexed: 11/26/2022]
Abstract
Down syndrome (DS) is the most frequent chromosomal disorder. Commonly, individuals with DS have difficulties with speech and show an unusual quality in the voice. Their phenotypic characteristics include general hypotonia and maxillary hypoplasia with relative macroglossia, and these contribute to particular acoustic alterations. Subjective perceptual and acoustic assessments of the voice (Praat-4.1 software) were performed in 66 children with DS, 36 boys and 30 girls, aged 3 to 8 years. These data were compared with those of an age-matched group of children from the general population. Perceptual evaluations showed significant differences in the group of children with DS. The voice of children with DS presented a lower fundamental frequency (F(0)) with elevated dispersion. The conjunction of frequencies for formants (F(1) and F(2)) revealed a decreased distinction between the vowels, reflecting the loss of articulatory processing. The DS vocalic anatomical functional ratio represents the main distinctive parameter between the two groups studied, and it may be useful in conducting assessments.
Collapse
Affiliation(s)
- Carla Pinto Moura
- Department of Otolaryngology, Hospital São João, Porto Medical School, Porto University, Porto, Portugal.
| | | | | | | | | | | | | | | |
Collapse
|
21
|
Zraick RI, Gentry MA, Smith-Olinde L, Gregg BA. The Effect of Speaking Context on Elicitation of Habitual Pitch. J Voice 2006; 20:545-54. [PMID: 16274957 DOI: 10.1016/j.jvoice.2005.08.008] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2005] [Accepted: 08/24/2005] [Indexed: 10/25/2022]
Abstract
The purpose of this study was to investigate if there was an effect of speaking context on the elicitation of habitual pitch [speaking fundamental frequency (SFF)]. Six simulated speaking contexts were created (speaking during a voice evaluation, speaking in public, speaking to a peer, speaking to a superior, speaking to a subordinate, and speaking to a parent or spouse), and the SFF for 30 adult women with normal voice was compared across these contexts. A one-way analysis of variance (ANOVA) revealed a statistically significant (P < 0.001) effect of simulated speaking context on SFF, with post hoc analyses indicating a statistically significant difference in SFF while "speaking to a superior" (P < 0.001) and "speaking to a subordinate" (P < 0.001). Possible reasons for an effect of speaking context are discussed. Also, the implications of the use of varied speaking contexts when eliciting SFF are discussed, as is the possibility of an effect of speaking context on the elicitation of other clinically useful voice parameters.
Collapse
Affiliation(s)
- Richard I Zraick
- University of Arkansas for Medical Sciences, Little Rock, AR 72205, USA.
| | | | | | | |
Collapse
|
22
|
Zraick RI, Birdwell KY, Smith-Olinde L. The Effect of Speaking Sample Duration on Determination of Habitual Pitch. J Voice 2005; 19:197-201. [PMID: 15907434 DOI: 10.1016/j.jvoice.2004.01.010] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/30/2004] [Indexed: 11/26/2022]
Abstract
The purpose of this study was to investigate if there was an effect of duration of speaking on determination of habitual pitch. Five speaking periods commonly used to elicit habitual pitch in clinical voice evaluations were compared (1, 5, 15, 30, and 60 seconds). Thirty female speakers with normal voices participated. Results of a within-subject univariate F-test revealed a statistically significant (p < 0.001) difference in habitual pitch among the speaking periods. Habitual pitch for the 1-second and 60-second speaking periods were found to be statistically significantly (p < 0.05) different than all remaining speaking periods, and the habitual pitch for the 30-second speaking period was found to be statistically significantly (p < 0.05) different than 60-second speaking period. Implications for the use of various speaking durations when determining habitual pitch are discussed, as is the possibility of a speaking duration effect on determination of other pitch-related voice parameters.
Collapse
Affiliation(s)
- Richard I Zraick
- University of Arkansas for Medical Sciences, Little Rock, Arkansas 72205, USA.
| | | | | |
Collapse
|
23
|
Abstract
The purpose of this study was to investigate if there was an effect of task on determination of habitual pitch, or speaking fundamental frequency (SFF). Seven tasks commonly used to elicit habitual pitch in clinical voice evaluations were compared. Three groups of normal subjects (N = 36) were examined (adult males, adult females, and male and female prepubescent children). For the adult male (n = 12) and pediatric (n = 12) subjects, no significant effect of task was found. For the adult female (n = 12) subjects, a statistically significant (P < .001) effect of task was found. All tasks were found to have high (>0.90) test-retest reliability for all subjects. Implications of the use of one task for determination of habitual pitch are discussed, as is the possibility of a task effect on determination of other voice parameters.
Collapse
Affiliation(s)
- R I Zraick
- University of Arkansas for Medical Sciences, Little Rock 72205, USA.
| | | | | |
Collapse
|
24
|
Steinhauer K, Grayhack JP. The role of knowledge of results in performance and learning of a voice motor task. J Voice 2000; 14:137-45. [PMID: 10875564 DOI: 10.1016/s0892-1997(00)80020-x] [Citation(s) in RCA: 40] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
The relationship between the provision of Knowledge of Results and the performance and learning of a voice motor task was examined. Thirty adult subjects, randomly assigned to a 100%, 50%, or No Knowledge of Results group, practiced a novel vowel nasalization task. Measures of accuracy and variability obtained during the practice session indicated influence of knowledge of results schedule on the transient effects of motor performance. Deviations from the nasalance target during the retention phase, 5 minutes later, and during a transfer phase, 24 hours later, indicated influence of knowledge of results schedule on the permanent effects of motor learning. Collective results revealed that an increase in relative frequency of knowledge of results led to a decrease in motor performance and learning of a vowel nasalization task: Both accuracy and variability were degraded as knowledge of results increased, with those subjects in the 100% group exhibiting the poorest scores.
Collapse
Affiliation(s)
- K Steinhauer
- Department of Communication Science and Disorders, University of Pittsburgh, Pennsylvania 15260, USA
| | | |
Collapse
|
25
|
Abstract
Forty normally speaking women sustained /a/ five times in two conditions, untimed and timed. In the untimed condition, phonation began whenever the subject felt ready. The timed task required subjects to begin phonation as quickly as possible after the examiner's signal. Conditions were counterbalanced across subjects. Jitter and shimmer values were obtained on a Kay Elemetrics Computerized Speech Lab (Pine Brook, NJ). Three distinct patterns of change emerged for jitter and shimmer from the untimed to the timed condition, reflecting either decreases, increases, or no change in values. These changes in jitter and shimmer values across conditions are related to a reaction time model of "bias toward accuracy."
Collapse
Affiliation(s)
- C T Ferrand
- Hofstra University, Hempstead, New York 11550-1090, USA
| |
Collapse
|