1
|
Mohd Khairuddin KA, Ahmad K, Proehoeman SC, Mohd Ibrahim H, Yan Y. Preliminary Findings of Vocal Fold Vibratory Characteristics of Singers Analyzed by Laryngeal High-Speed Videoendoscopy. J Voice 2024:S0892-1997(24)00173-5. [PMID: 38902142 DOI: 10.1016/j.jvoice.2024.06.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2024] [Revised: 05/31/2024] [Accepted: 06/01/2024] [Indexed: 06/22/2024]
Abstract
OBJECTIVES This study investigates the vocal fold vibratory dynamics of singers, which are postulated to differ from those of normal speakers due to the singers' regular vocal training. The measurement of vocal fold vibration involved the utilization of laryngeal high-speed videoendoscopy (LHSV) and subsequent LHSV-based analysis. The focus of the present study is to characterize and compare the LHSV-based measures derived from the glottal area waveform (GAW), namely fundamental frequency (F0GAW), glottal perturbation (jitterGAW and shimmerGAW), open quotient (OQGAW), and Nyquist plots, between singers and normal speakers across genders. METHODS Participants comprised 13 singers from a local cultural and heritage academy and 56 normal speakers from a local university, all were evaluated to have normal voices. Each participant underwent LHSV procedures to capture images of vocal fold vibration, which were subsequently analyzed to generate the LHSV-based measures. RESULTS Male singers exhibited lower F0GAW, jitterGAW, shimmerGAW, and OQGAW than female singers. When compared to normal speakers, male singers demonstrated higher F0GAW, and lower jitterGAW and shimmerGAW. No difference in OQGAW was found between male singers and normal speakers. Female singers exhibited lower jitterGAW compared to normal speakers, but no differences were observed in shimmerGAW and OQGAW. The results of Nyquist plots indicated no gender-related associations between types of rim width and among singers. However, for rim pattern, male singers were associated with a higher percentage of clustered rim, suggesting more regular vocal fold vibration, compared to female singers and normal male speakers. CONCLUSIONS Singers, particularly male singers, demonstrate distinct and potentially superior vocal fold vibrations compared to normal speakers, likely attributed to their regular vocal training, resulting in refined vocal fold configurations even during speaking. Despite the limited sample of singers, the study offers valuable insights into the vocal fold vibratory behaviors in singers analyzed using LHSV.
Collapse
Affiliation(s)
- Khairy Anuar Mohd Khairuddin
- Speech Pathology Program, School of Health Sciences, Universiti Sains Malaysia, Kubang Kerian, Kelantan, Malaysia; Speech Sciences Program, Faculty of Health Sciences, Universiti Kebangsaan Malaysia, Kuala Lumpur, Malaysia.
| | - Kartini Ahmad
- Speech Sciences Program, Faculty of Health Sciences, Universiti Kebangsaan Malaysia, Kuala Lumpur, Malaysia
| | | | - Hasherah Mohd Ibrahim
- Speech Sciences Program, Faculty of Health Sciences, Universiti Kebangsaan Malaysia, Kuala Lumpur, Malaysia
| | - Yuling Yan
- Department of Bioengineering, School of Engineering, Santa Clara University, Santa Clara, California
| |
Collapse
|
2
|
Jiang F, Ng ML, Song Y, Chen Y. Effect of Face Masks on Voice Quality Associated with Young and Older Chinese Adult Speakers. J Voice 2024:S0892-1997(24)00144-9. [PMID: 38834373 DOI: 10.1016/j.jvoice.2024.04.025] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2024] [Revised: 04/20/2024] [Accepted: 04/22/2024] [Indexed: 06/06/2024]
Abstract
OBJECTIVES Face masks have become important after the pandemic, but the change in voice when wearing a face mask is still unclear. The study investigated the effect of face masks on the acoustic and perceptual characteristics of voice quality associated with young and older Chinese adults. METHODS Voice samples of the sustained vowel /a/ and continuous speech produced by 44 older and 61 young adults with and without an ASTM level-3 surgical face mask were recorded and analyzed. Perceptual and acoustic parameters including mean fundamental frequency (F0) and intensity, perturbation measures (jitter and shimmer), harmonic-to-noise ratio (HNR), smoothed cepstral peak prominence (CPPs), and long-term average spectrum (LTAS) measures were obtained and compared. RESULTS When comparing masked to unmasked voices, for both male and female speakers, F0 and intensity showed no significant changes, except for F0 of continuous speech, which increased significantly. Meanwhile, perturbation measures such as jitter and shimmer were reduced, while HNR and CPPs increased. In addition, LTAS measures included low-frequency mean spectral energy (MSE), high-frequency MSE, and spectral tilt (ST), which were different. For perceptual measures, the overall grade of dysphonia, and roughness were reduced, except for the breathiness among older male speakers, while the other vocal qualities were not changed. Between young and older speakers, significant differences in shimmer, CPPs, and perceived breathiness among male speakers, and low-frequency MSE among female speakers were found. CONCLUSION Wearing a surgical mask appeared to change the perceived voice quality. This is supported by the change in perturbation and LTAS measures, and HNR and CPPs values. In addition, some differences between young and older adults were observed. Oral Communication effectiveness may be affected when wearing surgical masks due to changes in voice quality. Additionally, clinicians need to exercise hightened caution in evaluating the voice quality of clients when wearing face masks.
Collapse
Affiliation(s)
- Feiyun Jiang
- Duquesne-China Health Institute, John G. Rangos, Sr. School of Health Sciences, Duquesne University, Pittsburgh, Pennsylvania; School of Rehabilitation, Kunming Medical University, Kunming, China.
| | - Manwa L Ng
- Duquesne-China Health Institute, John G. Rangos, Sr. School of Health Sciences, Duquesne University, Pittsburgh, Pennsylvania; Speech Science Laboratory, Faculty of Education, University of Hong Kong, Shatin, NT, Hong Kong, China.
| | - Yuting Song
- Duquesne-China Health Institute, John G. Rangos, Sr. School of Health Sciences, Duquesne University, Pittsburgh, Pennsylvania; School of Rehabilitation, Kunming Medical University, Kunming, China.
| | - Yang Chen
- Duquesne-China Health Institute, John G. Rangos, Sr. School of Health Sciences, Duquesne University, Pittsburgh, Pennsylvania.
| |
Collapse
|
3
|
Kankare E, Laukkanen AM. Validation of the Acoustic Breathiness Index in Speakers of Finnish Language. J Clin Med 2023; 12:7607. [PMID: 38137676 PMCID: PMC10743974 DOI: 10.3390/jcm12247607] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2023] [Revised: 11/27/2023] [Accepted: 12/07/2023] [Indexed: 12/24/2023] Open
Abstract
Breathiness (perception of turbulence noise in the voice) is one of the major components of hoarseness in dysphonic voices. This study aims to validate a multiparameter analysis tool, the Acoustic Breathiness Index (ABI), for quantification of breathiness in the speaking voice, including both sustained vowels and continuous speech. One hundred and eight speakers with dysphonia (28 M, 80 F, mean age 50, SD 15.4 years) and 87 non-dysphonic controls (18 M, 69 F, mean age 42, SD 14 years) volunteered as participants. They read a standard text and sustained vowel /a:/. Acoustic recordings were made using a head-mounted microphone. Acoustic samples were evaluated perceptually by nine voice experts of different backgrounds (speech therapists, vocologists and laryngologists). Breathiness (B) from the GRBAS scale was rated. Headphones were used in the perceptual analysis. The dysphonic and non-dysphonic speakers differed significantly from each other in the auditory perceptual evaluation of breathiness. A significant difference was also found for ABI, which had a mean value of 2.26 (SD 1.15) for non-dysphonic and 3.07 (SD 1.75) for dysphonic speakers. ABI correlated strongly with B (rs = 0.823, p = 0.01). ABI's power to distinguish the groups was high (88.6%). The highest sensitivity and specificity of ABI (80%) was obtained at threshold value 2.68. ABI is a valid tool for differentiating breathiness in non-dysphonic and dysphonic speakers of Finnish.
Collapse
Affiliation(s)
- Elina Kankare
- Department of Rehabilitation and Psychosocial Support, Logopedics, Phoniatrics, Tampere University Hospital, 33520 Tampere, Finland
| | - Anne-Maria Laukkanen
- Speech and Voice Research Laboratory, Tampere University, 33100 Tampere, Finland;
| |
Collapse
|
4
|
Baker CP, Brockmann-Bauser M, Purdy SC, Rakena TO. High and Wide: An In Silico Investigation of Frequency, Intensity, and Vibrato Effects on Widely Applied Acoustic Voice Perturbation and Noise Measures. J Voice 2023:S0892-1997(23)00316-8. [PMID: 37925330 DOI: 10.1016/j.jvoice.2023.10.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2023] [Revised: 10/05/2023] [Accepted: 10/05/2023] [Indexed: 11/06/2023]
Abstract
OBJECTIVES This in silico study explored the effects of a wide range of fundamental frequency (fo), source-spectrum tilt (SST), and vibrato extent (VE) on commonly used frequency and amplitude perturbation and noise measures. METHOD Using 53 synthesized tones produced in Madde, the effects of stepwise increases in fo, intensity (modeled by decreasing SST), and VE on the PRAAT parameters jitter % (local), relative average perturbation (RAP) %, shimmer % (local), amplitude perturbation quotient 3 (APQ3) %, and harmonics-to-noise ratio (HNR) dB were investigated. A secondary experiment was conducted to determine whether any fo effects on jitter, RAP, shimmer, APQ3, and HNR were stable. A total of 10 sinewaves were synthesized in Sopran from 100 to 1000 Hz using formant frequencies for /a/, /i/, and /u/-like vowels, respectively. All effects were statistically assessed with Kendall's tau-b and partial correlation. RESULTS Increasing fo resulted in an overall increase in jitter, RAP, shimmer, and APQ3 values, respectively (P < 0.01). Oscillations of the data across the explored fo range were observed in all measurement outputs. In the Sopran tests, the oscillatory pattern seen in the Madde fo condition remained and showed differences between vowel conditions. Increasing intensity (decreasing SST) led to reduced pitch and amplitude perturbation and HNR (P < 0.05). Increasing VE led to lower HNR and an almost linear increase of all other measures (P < 0.05). CONCLUSION These novel data offer a controlled demonstration for the behavior of jitter (local) %, RAP %, shimmer (local) %, APQ3 %, and HNR (dB) when varying fo, SST, and VE in synthesized tones. Since humans will vary in all of these aspects in spoken language and vowel phonation, researchers should take potential resonance-harmonics type effects into account when comparing intersubject or preintervention and postintervention data using these measures.
Collapse
Affiliation(s)
- Calvin Peter Baker
- Speech Science, School of Psychology, University of Auckland, Auckland, New Zealand; School of Music, University of Auckland, Auckland, New Zealand.
| | - Meike Brockmann-Bauser
- Department of Phoniatrics and Speech Pathology, Clinic for Otorhinolaryngology, Head and Neck Surgery, University Hospital Zurich, University of Zurich, Zurich, Switzerland
| | - Suzanne C Purdy
- Speech Science, School of Psychology, University of Auckland, Auckland, New Zealand
| | - Te Oti Rakena
- School of Music, University of Auckland, Auckland, New Zealand
| |
Collapse
|
5
|
de Oliveira Florencio V, Almeida AA, Balata P, Nascimento S, Brockmann-Bauser M, Lopes LW. Differences and Reliability of Linear and Nonlinear Acoustic Measures as a Function of Vocal Intensity in Individuals With Voice Disorders. J Voice 2023; 37:663-681. [PMID: 34116889 DOI: 10.1016/j.jvoice.2021.04.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2021] [Revised: 03/30/2021] [Accepted: 04/05/2021] [Indexed: 11/17/2022]
Abstract
PURPOSE Linear acoustic indices are significantly influenced by speaking voice intensity. The main aim of this work was to compare acoustic measures based on linear and nonlinear models in different speaking voice intensity levels and to analyze the reliability of those measures in different intensity levels in subjects with voice disorders. METHODS 435 samples from subjects (314 women, 121 men with a mean age of 41.07 ± 13.73) diagnosed with various voice disorders were used. In total, 17 acoustic measures were derived from the vowel /ɛ/ sustained at three intensity levels (soft, comfortable, and loud). Five were linear (standard deviation of the fundamental frequency (f0), jitter, shimmer, harmonics-to-noise ratio (HNR) and smoothed cepstral peak prominence (CPPS)), and twelve were nonlinear measures, namely correlation dimension (D2), correlation entropy (H2), first minimum of the mutual information function (FMMI), relative entropy (ENTR-R), largest Lyapunov exponent (Lyap), determinism (DET), transitivity, mean diagonal line length (Lmed), Shannon entropy (ENTR-S), mean length of vertical structures, also known as trapping time (TT), laminarity (LAM) and recurrence period density entropy (RPDE). Differences between speaking voice intensity levels were assessed by Friedman's test and Nemenyi as posthoc test. Intraclass correlation coefficient was used to investigate if each acoustic measure remains in agreement (reliability) between different voice intensity levels. RESULTS There were significant differences in all acoustic measures about vocal intensity level (P < 0.001). Intraclass correlation coefficient was very good for HNR (>0.61) and good for Lyap, DET, ENTR-S, Lmed, RPDE, and TT (0.41-0.60). CONCLUSIONS All acoustic measures varied as a function of vocal intensity in voice disordered adults, while this relation was different for linear and nonlinear measures. Only the measures HNR, Lyap, DET, ENTR-S, Lmed, RPDE and TT had an acceptable reliability between different voice intensity levels. Therefore, patient`s voice SPL should be controlled or indicated during acoustic vocal assessment.
Collapse
Affiliation(s)
- Vanessa de Oliveira Florencio
- Graduate Program in Speech, Language and Hearing Sciences, Universidade Federal da Paraíba (UFPB), João Pessoa, Paraíba, Brazil
| | - Anna Alice Almeida
- Department of Speech, Language and Hearing Sciences, Universidade Federal da Paraíba (UFPB), Graduate Program in Speech, Language and Hearing Sciences, UFPB, João Pessoa, Paraíba, Brazil
| | - Patrícia Balata
- Department of Speech, Language and Hearing Sciences, Universidade Federal da Paraíba (UFPB), Graduate Program in Speech, Language and Hearing Sciences, UFPB, João Pessoa, Paraíba, Brazil
| | - Silvana Nascimento
- Electrical Engineering Department, Instituto Federal de Educação, Ciência e Tecnologia (IFPB), João Pessoa, Paraíba, Brazil
| | - Meike Brockmann-Bauser
- Department of Phoniatrics and Speech Pathology, Clinic for Otorhinolaryngology, Head and Neck Surgery, University Hospital Zurich, University of Zurich, Frauenklinikstrasse 24, 8091 Zurich, Switzerland and University of Zurich, Zurich, Switzerland
| | - Leonardo Wanderley Lopes
- Department of Speech, Language and Hearing Sciences, Universidade Federal da Paraíba (UFPB), Graduate Program in Speech, Language and Hearing Sciences, UFPB, João Pessoa, Paraíba, Brazil.
| |
Collapse
|
6
|
Laukkanen AM, Rantala L. Does the Acoustic Voice Quality Index (AVQI) Correlate with Perceived Creak and Strain in Normophonic Young Adult Finnish Females? Folia Phoniatr Logop 2021; 74:62-69. [PMID: 33784702 DOI: 10.1159/000514796] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2020] [Accepted: 01/25/2021] [Indexed: 11/19/2022] Open
Abstract
BACKGROUND The Acoustic Voice Quality Index (AVQI) is a correlate of dysphonia. It has been found to differentiate between dysphonic and normophonic speakers and to indicate the effects of voice therapy. This study investigates how the AVQI reacts towards creak and strain, which are common in normophonic speakers. METHODS The material was obtained from an earlier study on 104 Finnish female university students (mean age 24.3 years, SD 6.3 years) with no known pathology of voice or hearing and a perceptually normal voice (G = 0 in GRBAS), who were recorded while reading aloud a standard text and sustaining the vowel [a:]. Perceptual analysis for the amount of creak and strain was carried out by 2 expert listeners. In this study, the AVQI v03.01 was analyzed and correlated with perceptual evaluations. Samples with low and high amounts of creak and strain were compared with t tests. RESULTS On average, the AVQI was below the threshold value of dysphonia in the Finnish population. The AVQI (ρ = 0.35, p = 0.000) and its subparameters, smoothed cepstral peak prominence (CPPS; ρ = -0.35, p = 0.000) and harmonics-to-noise ratio (HNR; ρ = -0.30, p = 0.002) showed low but significant correlations with creak. Strain had low but significant correlations with spectral Slope (ρ = 0.38, p = 0.000) and Tilt (ρ = -0.40, p = 0.009). The AVQI was lower (better) in samples that were evaluated as having a high amount of strain, but the difference was not significant. Only CPPS differentiated significantly between low and high amounts of creak. CONCLUSION The AVQI does not seem to differentiate between high and low amounts of creak and strain in normophonic speakers.
Collapse
Affiliation(s)
- Anne-Maria Laukkanen
- Speech and Voice Research Laboratory, Faculty of Social Sciences, Tampere University, Tampere, Finland
| | - Leena Rantala
- Speech and Voice Research Laboratory, Faculty of Social Sciences, Tampere University, Tampere, Finland
| |
Collapse
|
7
|
Humming Facilitates a Gradual Increase in Vocal Intensity by Alleviating the Enhancement of Vocal Fold Contact and Supraglottic Constriction. J Voice 2021; 35:156.e1-156.e13. [DOI: 10.1016/j.jvoice.2019.05.004] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2019] [Revised: 05/09/2019] [Accepted: 05/09/2019] [Indexed: 11/23/2022]
|
8
|
Tracy LF, Segina RK, Cadiz MD, Stepp CE. The Impact of Communication Modality on Voice Production. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2020; 63:2913-2920. [PMID: 32762517 PMCID: PMC7890225 DOI: 10.1044/2020_jslhr-20-00161] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/09/2020] [Revised: 06/04/2020] [Accepted: 06/18/2020] [Indexed: 06/11/2023]
Abstract
Purpose Communicating remotely using audio and audiovisual technology is ubiquitous in modern work and social environments. Remote communication is increasing in medicine and in voice therapy delivery, and this evolution may have an impact on speakers' voices. This study sought to determine whether these communication modalities impact the voice production of typical speakers. Method The speech acoustics of 12 participants with healthy voices were recorded as they held standardized conversations with a single investigator using three communication modalities: in-person, remote-audio, and remote-audiovisual. Participants rated their vocal effort on a 100-mm visual analog scale. Results Compared to in-person communication, self-ratings of vocal effort were statistically significantly increased for remote-audiovisual communication; vocal effort during remote-audio and in-person communication were not significantly different. In comparison to in-person communication, vocal intensity and smoothed cepstral peak prominence (CPPS) were statistically significantly higher during remote-audio and remote-audiovisual communication. Effect sizes for CPPS changes were larger than for sound pressure level (SPL), and changes in CPPS and SPL between in-person and remote-audiovisual communication were not significantly correlated. Conclusions Vocal effort and SPL were increased when using remote-audio and remote-audiovisual communication in comparison to in-person communication. Voice quality was also impacted by technology use, with changes in CPPS that were consistent with, but not fully explained by, increases in SPL. This may impact the telepractice delivery of voice therapy, and further investigation is warranted.
Collapse
Affiliation(s)
- Lauren F. Tracy
- Department of Otolaryngology—Head and Neck Surgery, Boston University School of Medicine, MA
| | - Roxanne K. Segina
- Department of Speech, Language and Hearing Sciences, Boston University, MA
| | - Manuel Diaz Cadiz
- Department of Speech, Language and Hearing Sciences, Boston University, MA
| | - Cara E. Stepp
- Department of Otolaryngology—Head and Neck Surgery, Boston University School of Medicine, MA
- Department of Speech, Language and Hearing Sciences, Boston University, MA
- Department of Biomedical Engineering, Boston University, MA
| |
Collapse
|
9
|
P S S, Pebbili GK. Cepstral Analysis of Voice in Young Adults. J Voice 2020; 36:43-49. [PMID: 32336570 DOI: 10.1016/j.jvoice.2020.03.010] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2019] [Revised: 03/09/2020] [Accepted: 03/13/2020] [Indexed: 11/25/2022]
Abstract
The fundamental frequency and harmonic organization of voice can be quantified using cepstral measure. The two commonly studied cepstral measures are Cepstral Peak Prominence (CPP) and smoothened Cepstral peak prominence (sCPP). CPP is determined by measuring the amplitude difference from the highest peak of the cepstrum to the corresponding regression line, drawn directly below the cepstral peaks and when a smoothening factor is applied, sCPP is obtained. Cepstral analysis has been reported to be a reliable and valid measure for voice evaluation even for signals with high level of aperiodicity, which the time based measures fails to evaluate. Thus, the present study aimed to investigate the reference values for CPP and sCPP in young adults with normal voice in the age range of 20-40 years. Hundred adults in the age range of 20-40 years with equal number of males and females were considered and the task includes to phonate vowels /a/, /i/ and /u/ for five seconds each and to read "300 word Kannada reading passage" (Savithri & Jayaram, 2005) and the Bengaluru passage. The cepstral analysis was done using speech tool software (version 1.65). The data was subjected to statistical analysis using SPSS (version 21). The present study provides the reference values for CPP and sCPP across various stimuli.
Collapse
Affiliation(s)
- Sujitha P S
- CDMRP, Speech Language Pathology and Audiology Unit, Department of Psychology, University of Calicut, Kerala, India.
| | - Gopi Kishore Pebbili
- Department of speech language pathology, All India Institute of Speech and Hearing, Mysore, India
| |
Collapse
|
10
|
Faham M, Laukkanen AM, Ikävalko T, Rantala L, Geneid A, Holmqvist-Jämsén S, Ruusuvirta K, Pirilä S. Acoustic Voice Quality Index as a Potential Tool for Voice Screening. J Voice 2019; 35:226-232. [PMID: 31582330 DOI: 10.1016/j.jvoice.2019.08.017] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2019] [Revised: 08/18/2019] [Accepted: 08/19/2019] [Indexed: 10/25/2022]
Abstract
INTRODUCTION To diminish the risk of voice disorders in people who are highly dependent on their voices, such as teachers, vocal screening is important already at the beginning of such individuals' professional studies. A reliable, specified screening tool is needed. The Acoustic Voice Quality Index (AVQI) has been found to differentiate normal voices from abnormal voices and to serve as a treatment outcome measure. This study investigated whether AVQI could be a screening tool in combination with auditory- and self-perception of the voice to discriminate normal from slightly poor voices. TYPE OF STUDY Experimental. METHODS Some 128 female teaching students (mean age 26.39 years, SD 9.80 years) with no diagnosed voice disorders participated in this study. They read aloud a text in Finnish, sustained the vowel /a:/, and filled the Voice Handicap Index (VHI) questionnaire. Voice samples were recorded with an AKG C544L headset microphone, iFocusrite soundcard, and Praat software using a 44100 sample rate and 16-bit amplitude quantization. Five expert voice therapists evaluated the samples to determine the grade of dysphonia (G) using a scale of 0-0.5 (=normal), 0.5-1 (=mild), 1-2 (=moderate), and 2-3 (=severe). Three medial seconds of [a:] and the first 31 syllables of the text were analyzed using AVQI script version 03.01 in Praat (5.3.55). The analysis gives one AVQI score per participant (scale 0-10). The AVQI threshold of normal and disordered voices for Finnish speakers is 1.83; a Gmean = 0.0-0.5 and VHI score <19 were considered normal. Statistical analysis was done using the receiver operating characteristic (ROC) curve, Spearman's correlation coefficient, and the independent samples t test. RESULTS According to the AVQI results, the area under the curve (AROC) was 0.554, which is fair. The Youden index gave a cutoff value of 0.30 with a sensitivity of 85% and a specificity of 81.1%. There were weak but significant correlations between Gmean and AVQI and two AVQI parameters, smoothed cepstral peak prominence and harmonic-to-noise ratio (r = 0.27; --0.24; -0.20, respectively; |P < 0.05); and between total VHI and AVQI score and cepstral peak prominence (r = 0.21; 0.20, respectively; P < 0.05). Furthermore, the AVQI scores differed significantly between the groups with a VHI total score <19 and ≥19. CONCLUSIONS AVQI did not differentiate between voices that had been perceptually judged as normal or slightly abnormal, but a combination of perceptual assessment in the form of AVQI and VHI could better screen slightly deviant voices.
Collapse
Affiliation(s)
- Maryam Faham
- Department of Speech and Language Pathology Rehabilitation Faculty, Shiraz University of Medical Sciences, Shiraz, Iran; Rehabilitation Sciences Research Center, Shiraz University of Medical Sciences, Shiraz, Iran.
| | - Anne-Maria Laukkanen
- Speech and Voice Research Laboratory, Faculty of Social Sciences, Tampere University, Tampere, Finland
| | - Tero Ikävalko
- Speech and Voice Research Laboratory, Faculty of Social Sciences, Tampere University, Tampere, Finland
| | - Leena Rantala
- Unit of Logopedics, Faculty of Social Sciences, Tampere University, Tampere, Finland
| | - Ahmed Geneid
- Department of Otorhinolaryngology and Phoniatrics - Head and Neck Surgery, Helsinki University Hospital and University of Helsinki, Helsinki, Finland
| | | | - Kaarina Ruusuvirta
- Department of Otorhinolaryngology and Phoniatrics - Head and Neck Surgery, Helsinki University Hospital and University of Helsinki, Helsinki, Finland
| | - Sirpa Pirilä
- Faculty of Humanities/Logopedics, Child Language Research, University of Oulu, Oulu, Finland
| |
Collapse
|
11
|
Dynamically Monitoring Vocal Fatigue and Recovery Using Aerodynamic, Acoustic, and Subjective Self-Rating Measurements. J Voice 2019; 33:809.e11-809.e18. [DOI: 10.1016/j.jvoice.2018.03.014] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2018] [Accepted: 03/19/2018] [Indexed: 11/23/2022]
|
12
|
Guidi A, Gentili C, Scilingo E, Vanello N. Analysis of speech features and personality traits. Biomed Signal Process Control 2019. [DOI: 10.1016/j.bspc.2019.01.027] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
13
|
Kankare E, Barsties V. Latoszek B, Maryn Y, Asikainen M, Rorarius E, Vilpas S, Ilomäki I, Tyrmi J, Rantala L, Laukkanen AM. The acoustic voice quality index version 02.02 in the Finnish-speaking population. LOGOP PHONIATR VOCO 2019; 45:49-56. [DOI: 10.1080/14015439.2018.1556332] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Affiliation(s)
- Elina Kankare
- Department of Phoniatrics, Tampere University Hospital, Tampere, Finland
- Speech and Voice Research Laboratory, University of Tampere, Tampere, Finland
| | | | - Youri Maryn
- Faculty of Medicine and Health Sciences, University of Antwerp, Antwerp, Belgium
- Sint-Augustinus Hospital, European Institute for ORL-HNS, Antwerp, Belgium
- Faculty of Education Health & Social Work, University College Ghent, Ghent, Belgium
- Department of Speech Language and Hearing Sciences, University of Ghent, Ghent, Belgium
| | - Marja Asikainen
- Department of Phoniatrics, Tampere University Hospital, Tampere, Finland
| | - Eija Rorarius
- Department of Phoniatrics, Tampere University Hospital, Tampere, Finland
| | - Sarkku Vilpas
- Department of Phoniatrics, Tampere University Hospital, Tampere, Finland
| | - Irma Ilomäki
- Speech and Voice Research Laboratory, University of Tampere, Tampere, Finland
| | - Jaana Tyrmi
- Speech and Voice Research Laboratory, University of Tampere, Tampere, Finland
| | - Leena Rantala
- Logopedics, School of Social Sciences and Humanities, University of Tampere, Tampere, Finland
| | | |
Collapse
|
14
|
Mezzedimi C, Spinosi MC, Massaro T, Ferretti F, Cambi J. Singing voice: acoustic parameters after vocal warm-up and cool-down. LOGOP PHONIATR VOCO 2018; 45:57-65. [DOI: 10.1080/14015439.2018.1545865] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Affiliation(s)
| | | | | | - Fabio Ferretti
- Legal Medicine Department, Policlinico S.M. alle Scotte, Siena, Italy
| | - Jacopo Cambi
- ENT Department, Policlinico S.M. alle Scotte, Siena, Italy
| |
Collapse
|
15
|
Meerschman I, D'haeseleer E, Catry T, Ruigrok B, Claeys S, Van Lierde K. Effect of two isolated vocal facilitating techniques glottal fry and yawn-sigh on the phonation of female speech-language pathology students: A pilot study. JOURNAL OF COMMUNICATION DISORDERS 2017; 66:40-50. [PMID: 28412598 DOI: 10.1016/j.jcomdis.2017.03.004] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/14/2016] [Revised: 03/16/2017] [Accepted: 03/21/2017] [Indexed: 06/07/2023]
Abstract
OBJECTIVE The purpose of this study was to determine the effect of two isolated vocal facilitating techniques, glottal fry and yawn-sigh, on the phonation of vocally healthy female speech-language pathology (SLP) students. STUDY DESIGN A multigroup pretest-posttest design was used. METHODS A group of thirty-six healthy female SLP students with a mean age of 18.1 years were assigned into three groups: a glottal fry group (practicing the facilitating technique glottal fry across 18 weeks, n=12), a yawn-sigh group (practicing the facilitating technique yawn-sigh across 18 weeks, n=12) and a control group (receiving no facilitating techniques, n=12). To compare vocal measures before and after this training period, an identical objective voice assessment protocol (maximum performance task, acoustic analysis, voice range profile and Dysphonia Severity Index) was performed in the three groups. Groups were compared over time using linear mixed models. Within-group effects of time were determined using post hoc pairwise comparisons. RESULTS Glottal fry resulted in a significant decrease in lowest and highest intensity. Yawn-sigh resulted in a significant increase in fundamental frequency, a significant decrease in shimmer and noise-to-harmonic ratio, and a significant increase in highest intensity. CONCLUSIONS Yawn-sigh may have a positive effect on the phonation of female vocally healthy future SLPs, whereas results are less supportive for using glottal fry in training this population's voice.
Collapse
Affiliation(s)
- I Meerschman
- Department of Speech Language and Hearing Sciences, Ghent University, Belgium.
| | - E D'haeseleer
- Department of Speech Language and Hearing Sciences, Ghent University, Belgium
| | - T Catry
- Department of Speech Language and Hearing Sciences, Ghent University, Belgium
| | - B Ruigrok
- Department of Speech Language and Hearing Sciences, Ghent University, Belgium
| | - S Claeys
- Department of Otorhinolaryngology, Ghent University, Belgium
| | - K Van Lierde
- Department of Speech Language and Hearing Sciences, Ghent University, Belgium; Department of Speech-Language Pathology and Audiology, University of Pretoria, Pretoria, South Africa
| |
Collapse
|
16
|
The Moderating Effect of Frequent Singing on Voice Aging. J Voice 2017; 31:112.e1-112.e12. [DOI: 10.1016/j.jvoice.2016.02.015] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2015] [Accepted: 02/18/2016] [Indexed: 11/30/2022]
|
17
|
Fusaroli R, Lambrechts A, Bang D, Bowler DM, Gaigg SB. “Is voice a marker for Autism spectrum disorder? A systematic review and meta-analysis”. Autism Res 2016; 10:384-407. [DOI: 10.1002/aur.1678] [Citation(s) in RCA: 72] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2016] [Revised: 06/24/2016] [Accepted: 07/01/2016] [Indexed: 12/31/2022]
Affiliation(s)
| | | | - Dan Bang
- The Interacting Minds Centre; Aarhus University; Aarhus Denmark
- Wellcome Trust Centre for Neuroimaging; Institute of Neurology, University College London; London UK
- Calleva Research Centre for Evolution and Human Sciences; Magdalen College, University of Oxford; Oxford UK
| | | | | |
Collapse
|
18
|
Guzman M, Laukkanen AM, Traser L, Geneid A, Richter B, Muñoz D, Echternach M. The influence of water resistance therapy on vocal fold vibration: a high-speed digital imaging study. LOGOP PHONIATR VOCO 2016; 42:99-107. [PMID: 27484690 DOI: 10.1080/14015439.2016.1207097] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
Abstract
PURPOSE This study investigated the influence of tube phonation into water on vocal fold vibration. METHOD Eight participants were analyzed via high-speed digital imaging while phonating into a silicon tube with the free end submerged into water. Two test sequences were studied: (1) phonation pre, during, and post tube submerged 5 cm into water; and (2) phonation into tube submerged 5 cm, 10 cm, and 18 cm into water. Several glottal area parameters were calculated using phonovibrograms. RESULTS The results showed individual differences. However, certain trends were possible to identify based on similar results found for the majority of participants. Amplitude-to-length ratio, harmonic-to-noise ratio, and spectral flatness (derived from glottal area) decreased for all tube immersion depths, while glottal closing quotient increased for 10 cm immersion and contact quotient for 18 cm immersion. Closed quotient decreased during phonation into the tube at 5 cm depth, and jitter decreased during and after it. CONCLUSION Results suggest that the depth of tube submersion appears to have an effect on phonation. Shallow immersion seems to promote smoother and more stable phonation, while deeper immersion may involve increased respiratory and glottal effort to compensate for the increased supraglottal resistance. This disparity, which is dependent upon the degree of flow resistance, should be considered when choosing treatment exercises for patients with various diagnoses, namely hyperfunctional or hypofunctional dysphonia.
Collapse
Affiliation(s)
- Marco Guzman
- a Department of Communication Sciences and Disorders , University of Chile , Santiago , Chile.,b Department of Otolaryngology , Las Condes Clinic , Santiago , Chile
| | - Anne-Maria Laukkanen
- c Speech and Voice Research Laboratory, School of Education , University of Tampere , Tampere , Finland
| | - Louisa Traser
- d Institute of Musicians' Medicine, Freiburg University Medical Center , Freiburg , Germany
| | - Ahmed Geneid
- e Department of Otorhinolaryngology and Phoniatrics-Head and Neck Surgery , Helsinki University Central Hospital, University of Helsinki , Helsinki , Finland
| | - Bernhard Richter
- d Institute of Musicians' Medicine, Freiburg University Medical Center , Freiburg , Germany
| | - Daniel Muñoz
- f Department of Otolaryngology , University of Chile , Santiago , Chile
| | - Matthias Echternach
- d Institute of Musicians' Medicine, Freiburg University Medical Center , Freiburg , Germany
| |
Collapse
|
19
|
|
20
|
Dallaston K, Rumbach AF. Vocal Performance of Group Fitness Instructors Before and After Instruction: Changes in Acoustic Measures and Self-Ratings. J Voice 2016; 30:127.e1-8. [DOI: 10.1016/j.jvoice.2015.02.007] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2014] [Accepted: 02/12/2015] [Indexed: 11/26/2022]
|
21
|
Hohm J, Döllinger M, Bohr C, Kniesburges S, Ziethe A. Influence of F0 and Sequence Length of Audio and Electroglottographic Signals on Perturbation Measures for Voice Assessment. J Voice 2015; 29:517.e11-21. [PMID: 25944290 DOI: 10.1016/j.jvoice.2014.10.001] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2014] [Accepted: 10/01/2014] [Indexed: 12/01/2022]
Abstract
OBJECTIVE Within the functional assessment of voice disorders, an objective analysis of measured parameters from audio, electroglottographic (EGG), or visual signals is desired. In a typical clinical situation, reliable objective analysis is not always possible due to missing standardization and unknown stability of the clinical parameters. The aim of this study was to investigate the robustness/stability of measured clinical parameters of the audio and EGG signals in a typical clinical setting to ensure a reliable objective analysis. In particular, the influence of F0 and of the sequence length on several definitions of jitter and shimmer will be analyzed. PATIENTS AND METHODS Seventy-four young healthy women produced a sustained vowel /a/ and an upward triad with abrupt changeovers. Different sequence lengths (100, 150, 500, and 1000 ms) of sustained phonation and triads (100 and 150 ms) were extracted from the audio and EGG signals. In total, six variations of jitter and four variations of shimmer parameters were analyzed. RESULTS Jitter%, Jitter11p, and JitterPPQ of the audio signal as well as Jittermean, Shimmer, and Shimmer11p of the EGG signal are unaffected by both sequence length and F0. CONCLUSIONS Influence of F0 and sequence length on several perturbation measures of the audio and EGG signals was identified. For an objective clinical voice assessment, unaffected definitions of jitter and shimmer should be preferred and applied to enable comparability between different recordings, examinations, and studies.
Collapse
Affiliation(s)
- Julian Hohm
- Department for Phoniatrics and Pediatric Audiology, University Hospital Erlangen, Erlangen, Germany
| | - Michael Döllinger
- Department for Phoniatrics and Pediatric Audiology, University Hospital Erlangen, Erlangen, Germany
| | - Christopher Bohr
- Department for Phoniatrics and Pediatric Audiology, University Hospital Erlangen, Erlangen, Germany
| | - Stefan Kniesburges
- Department for Phoniatrics and Pediatric Audiology, University Hospital Erlangen, Erlangen, Germany
| | - Anke Ziethe
- Department for Phoniatrics and Pediatric Audiology, University Hospital Erlangen, Erlangen, Germany.
| |
Collapse
|
22
|
Lee YS, Kim YH, Kwon M, Ryu IS, Jung GE, Kim ST, Roh JL, Choi SH, Kim SY, Nam SY. Short-Term Treatment Results for Unilateral Vocal Fold Palsy Induced by Mediastinal Lesions. J Voice 2014; 28:809-15. [DOI: 10.1016/j.jvoice.2014.04.003] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2014] [Accepted: 04/01/2014] [Indexed: 10/25/2022]
|
23
|
Maruthy S, Ravibabu P. Comparison of dysphonia severity index between younger and older carnatic classical singers and nonsingers. J Voice 2014; 29:65-70. [PMID: 25179778 DOI: 10.1016/j.jvoice.2014.05.001] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2014] [Accepted: 05/01/2014] [Indexed: 12/01/2022]
Abstract
OBJECTIVES The main objective of this study was to compare dysphonia severity index (DSI) and its parameters between Carnatic classical singers and nonsingers. The second objective was to investigate the effect of age on DSI and its parameters in both singers and nonsingers. STUDY DESIGN This study included 30 female Carnatic classical singers who were subdivided into two groups based on their age: 15 younger singers and 15 older singers. These 30 singers were compared with 30 age and gender matched nonsingers. METHODS From each participant, measures of highest phonational frequency, lowest intensity, jitter, and maximum phonation time were obtained. Using these, DSI values were calculated. RESULTS Singers had significantly greater highest phonational frequency, longer maximum phonation time, and higher DSI values. When compared with the younger participants, older participants had significantly reduced highest phonational frequency, maximum phonation time, and DSI values. CONCLUSIONS The results of this study suggest that DSI values vary between Carnatic classical singers and nonsingers, and hence separate normative data may need to be established for this group of singers for clinical comparison purposes.
Collapse
Affiliation(s)
- Santosh Maruthy
- Department of Speech Language Sciences, All India Institute of Speech and Hearing, Mysore, Karnataka, India.
| | - Preethi Ravibabu
- Department of Speech Language Sciences, All India Institute of Speech and Hearing, Mysore, Karnataka, India
| |
Collapse
|
24
|
Bohr C, Kräck A, Dubrovskiy D, Eysholdt U, Svec J, Psychogios G, Ziethe A, Döllinger M. Spatiotemporal analysis of high-speed videolaryngoscopic imaging of organic pathologies in males. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2014; 57:1148-1161. [PMID: 24686496 DOI: 10.1044/2014_jslhr-s-12-0076] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]
Abstract
PURPOSE The aim of this study was to identify parameters that would differentiate healthy from pathological organic-based vocal fold vibrations to emphasize clinical usefulness of high-speed imaging. METHOD Fifty-five men (M age = 36 years, SD = 20 years) were examined and separated into 4 groups: 1 healthy (26 individuals) and 3 pathological (10 individuals with contact granuloma, 12 with polyps, and 7 with cysts). Vocal fold vibrations were recorded using a high-speed camera during sustained phonation. Twenty objective glottal area waveform and 24 phonovibrogram parameters representing spatiotemporal characteristics were analyzed. Statistical group comparisons were performed to document spatiotemporal changes for organic lesions that cannot be determined visually. To look for specific pattern profiles within organic lesions, the authors performed linear discriminant analysis. RESULTS Thirteen parameters showed significant differences between the healthy group and at least 1 pathological group. The differences occurred more in temporal than in spatial parameters. Contact granuloma showed the fewest statistical differences (3 parameters), followed by cysts (9 parameters), and polyps (10 parameters). Linear discriminant analysis achieved accuracy performance of 76% (all groups separated) and 82% (healthy vs. pathological). CONCLUSION The results suggest that for males, the differences between healthy voices and organic voice disorders may be more pronounced within temporal characteristics that cannot be visually detected without high-speed imaging.
Collapse
|
25
|
Lu FL, Matteson S. Speech tasks and interrater reliability in perceptual voice evaluation. J Voice 2014; 28:725-32. [PMID: 24841668 DOI: 10.1016/j.jvoice.2014.01.018] [Citation(s) in RCA: 40] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2013] [Accepted: 01/31/2014] [Indexed: 11/26/2022]
Abstract
OBJECTIVE/HYPOTHESIS The optimal selection of speech task is essential for more reliable perceptual ratings and a better understanding of the perceptual qualities of pathologic voices. Nevertheless, researchers have rarely explored this issue using the GRBAS scale. This study investigates the effect of speech task selection on interrater reliability during perceptual voice assessment. STUDY DESIGN Experimental study. METHODS Sixty subjects, 39 dysphonic subjects and 21 normal controls, performed 13 speech tasks including three 5-second sustained vowel sounds (/ɑ/, /i/, and /u/) each at three pitch levels (high, habitual, and low), maximum phonation of the vowel /ɑ/, pitch glide, counting from 1 to 10, and oral reading of the Rainbow Passage. A group of 18 graduate students in speech-language pathology served as perceptual judges and rated the dysphonic severity for the speech samples based on three parameters in the GRBAS scale-Grade, Roughness, and Breathiness. The formalism of the AC1 statistic proposed by Gwet was applied to determine relative reliability between the speech tasks and the raters. RESULTS The counting task and sustained vowel /ɑ/ in high, habitual, and low registers exhibited the most reproducibility and consequently the highest reliability statistic. CONCLUSIONS The counting task and sustained /ɑ/ phonation are the optimal tasks for perceptual voice judgment in regard to interrater reliability. Future perceptional studies may benefit from this finding to determine the relationship between speech task selection and the validity of any given perceptual rating system in terms of sensitivity and specificity.
Collapse
Affiliation(s)
- Fang-Ling Lu
- Department of Speech and Hearing Sciences, University of North Texas, Denton, Texas.
| | - Samuel Matteson
- Department of Physics, University of North Texas, Denton, Texas
| |
Collapse
|
26
|
Goy H, Fernandes DN, Pichora-Fuller MK, van Lieshout P. Normative Voice Data for Younger and Older Adults. J Voice 2013; 27:545-55. [DOI: 10.1016/j.jvoice.2013.03.002] [Citation(s) in RCA: 122] [Impact Index Per Article: 11.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2012] [Accepted: 03/07/2013] [Indexed: 11/17/2022]
|
27
|
Bohr C, Kraeck A, Eysholdt U, Ziethe A, Döllinger M. Quantitative analysis of organic vocal fold pathologies in females by high-speed endoscopy. Laryngoscope 2013; 123:1686-93. [PMID: 23649746 DOI: 10.1002/lary.23783] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2012] [Revised: 08/23/2012] [Accepted: 09/18/2012] [Indexed: 11/11/2022]
Abstract
OBJECTIVES/HYPOTHESIS Quantitative analysis of endoscopic high-speed video recordings of vocal fold vibrations has been growing in importance in recent years. The videos have mainly been analyzed using subjective evaluation, but this is examiner dependent, and the results show inadequate interobserver agreement. The aims of this study were therefore to identify appropriate objective parameters for analyzing high-speed recordings to differentiate healthy voice production from organic disorders. STUDY DESIGN METHODS A total of 152 females were examined, divided into 77 healthy and 75 with four different pathological conditions: laryngeal epithelial thickening, Reinke edema, vocal fold polyps, and vocal fold cysts. Vocal fold vibrations were recorded with a high-speed camera (4,000 Hz, 256 × 256 pixels) during sustained phonation. Parameters computed from the glottal area waveform (GAW) and from phonovibrogram (PVG) were analyzed. Multiparametric linear discriminant analysis was performed to classify pathological conditions versus the healthy group. RESULTS Twenty of 44 parameters were identified that are capable of distinguishing between the individual types of pathology. PVG parameters showed better performance than GAW parameters. Parameters representing vibrational periodicity via standard deviation showed better performance than absolute parameters. In addition, linear discriminant analysis achieved reliable differentiation between healthy and pathological vocal fold vibrations: 72% for the five-class problem (all groups separately) and 88% for the two-class problem (healthy vs. all pathologies taken as one class). CONCLUSIONS The study succeeded in defining objective parameters for analyzing endoscopic high-speed videos and suggesting first parameters for differentiation between healthy dynamics and dynamics of organic pathologies.
Collapse
Affiliation(s)
- Christopher Bohr
- Department of Otorhinolaryngology, Head and Neck Surgery, Erlangen University Hospital, Erlangen, Germany.
| | | | | | | | | |
Collapse
|
28
|
Awan SN, Giovinco A, Owens J. Effects of Vocal Intensity and Vowel Type on Cepstral Analysis of Voice. J Voice 2012; 26:670.e15-20. [DOI: 10.1016/j.jvoice.2011.12.001] [Citation(s) in RCA: 74] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2011] [Accepted: 12/02/2011] [Indexed: 11/16/2022]
|
29
|
Guzman MA, Dowdall J, Rubin AD, Maki A, Levin S, Mayerhoff R, Jackson-Menaldi MC. Influence of emotional expression, loudness, and gender on the acoustic parameters of vibrato in classical singers. J Voice 2012; 26:675.e5-11. [PMID: 22727122 DOI: 10.1016/j.jvoice.2012.02.006] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2011] [Accepted: 02/22/2012] [Indexed: 11/20/2022]
Abstract
OBJECTIVES Vibrato is considered one of the most essential characteristics of the classical singing voice. Vibrato can be acoustically described by the rate, extent, onset, and regularity. The aim of this study was to determine the influence of emotional expression on acoustic parameters of vibrato in classically trained singers. STUDY DESIGN A prospective cohort study was performed. METHODS Thirty healthy classical singers were recruited for this study, 29 singers met inclusion criteria. Inclusion criteria for this study were as follow: 1) no history of vocal pathology in the past year, 2) to have at least 5 years of classical singing training. Each subject was asked to sing the phrase "I Love You," while expressing four different emotions (tenderness, anger, happiness, and sadness) and without emotion (neutral state). The musical tonality of the phrase was adapted to each singer's vocal classification. Subjects were also recorded at three levels of loudness (pianissimo, mezzo forte, and fortissimo), while expressing each emotion. Acoustical analysis was performed during the vowel /o/ of the word "Love" to determine rate of vibrato, the extent of vibrato, and vibrato jitter. RESULTS Vibrato parameters did not vary significantly when different emotions were expressed. However, vibrato jitter and extent did vary significantly between different levels of loudness. Significant differences were also noted in both rate and extent of vibrato when compared between sexes. CONCLUSIONS Expression of emotions does not affect the acoustic parameters of vibrato, although some parameters are affected by loudness and gender.
Collapse
Affiliation(s)
- Marco A Guzman
- Lakeshore Professional Voice Center, Lakeshore Ear, Nose, and Throat Center, St. Clair Shores, Michigan, USA.
| | | | | | | | | | | | | |
Collapse
|
30
|
Dehqan A, Scherer RC, Dashti G, Ansari-Moghaddam A, Fanaie S. The Effects of Aging on Acoustic Parameters of Voice. Folia Phoniatr Logop 2012; 64:265-70. [DOI: 10.1159/000343998] [Citation(s) in RCA: 50] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
|
31
|
Abstract
BACKGROUND/AIM A school teacher's occupation is one of the most vocal demanding professions. Vocal tiredness affects acoustic characteristics of voice, leads to the change in acoustic quality during vocal production. The aim of this study was to examine the influence of vocal tiredness on voice acoustic characteristics of school teachers. METHODS The study included 36 school teachers (16 male and 20 female) of 27-58 years of age. Vocal tiredness was evaluated by analyzing acoustic parameters of voice, using computerized laboratory "Kay Elemetrics". The voices were recorded before the first class on the beginning of a working week and after the last class at the end of a working week. Signal, noise and tremor parameters were processed. In their analysis the methods of descriptive statistics, as well as the analytical statistics (Student's t-test for paired samples, chi2 test, Mann Whitney U-test and Pearson's correlation coefficient) were applied. RESULTS The obtained results showed statistically significant differences in average values of parameters as a consequence of the vocal tiredness (p < 0.01), while gender, age and being smoker or not, did not have a significant effect on voice acoustic characteristics in this group of examinees (p > 0.05). CONCLUSION The results of the study suggest a casual relationship between vocal tiredness and voice quality in vocal professionals.
Collapse
|
32
|
Reliable Jitter and Shimmer Measurements in Voice Clinics: The Relevance of Vowel, Gender, Vocal Intensity, and Fundamental Frequency Effects in a Typical Clinical Task. J Voice 2011; 25:44-53. [DOI: 10.1016/j.jvoice.2009.07.002] [Citation(s) in RCA: 130] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2009] [Accepted: 07/01/2009] [Indexed: 11/18/2022]
|
33
|
Physiological Attributes of Vocal Fatigue and Their Acoustic Effects: A Synthesis of Findings for a Criterion-Based Prevention of Acquired Voice Disorders. J Voice 2010; 24:324-36. [DOI: 10.1016/j.jvoice.2008.10.001] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2008] [Accepted: 10/02/2008] [Indexed: 11/19/2022]
|
34
|
Clinical value of acoustic voice measures: a retrospective study. Eur Arch Otorhinolaryngol 2010; 267:1261-71. [DOI: 10.1007/s00405-010-1214-2] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2009] [Accepted: 02/02/2010] [Indexed: 10/19/2022]
|
35
|
Xue A, Muelle PB. Acoustic and perceptual characteristics of the voices of sedentary and physically active elderly speakers. LOGOP PHONIATR VOCO 2009. [DOI: 10.3109/14015439709075315] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
|
36
|
Dejonckere P. Effect of louder voicing on acoustical measurements in dysphonic patients. LOGOP PHONIATR VOCO 2009. [DOI: 10.1080/140154398434239] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]
|
37
|
Laukkanen AM. Voiced bilabial fricative /ß:/ as a vocal exercise: An electroglottographic and acoustic investigation. ACTA ACUST UNITED AC 2009. [DOI: 10.3109/14015439209098736] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
|
38
|
Brockmann M, Storck C, Carding PN, Drinnan MJ. Voice loudness and gender effects on jitter and shimmer in healthy adults. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2008; 51:1152-60. [PMID: 18664710 DOI: 10.1044/1092-4388(2008/06-0208)] [Citation(s) in RCA: 71] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/09/2023]
Abstract
PURPOSE The aim of this study was to investigate voice loudness and gender effects on jitter and shimmer in healthy young adults because previous descriptions have been inconsistent. METHOD Fifty-seven healthy adults (28 women, 29 men) aged 20-40 years were included in this cross-sectional single-cohort study. Three phonations of /a/ at soft, medium, and loud individual loudness were recorded and analyzed using PRAAT software (P. Boersma & D. Weeninkk, 2006). Voice loudness and gender effects on measured sound pressure level, fundamental frequency, jitter, and shimmer were assessed through the use of descriptive and inferential (analysis of variance) statistics. RESULTS Jitter and shimmer significantly increased with decreasing voice loudness, especially in phonations below 75 dB and 80 dB. In soft and medium phonation, men were generally louder and showed significantly less shimmer. However, men had higher jitter measures when phonating softly. Gender differences in jitter and shimmer at medium loudness may be mainly linked to different habitual voice loudness levels. CONCLUSION This pragmatic study shows significant voice loudness and gender effects on perturbation. In clinical assessment, requesting phonations above 80 dB at comparable loudness between genders would enhance measurement reliability. However, voice loudness and gender effects in other age groups, in disordered voices, or when a minimal loudness is requested should be further investigated.
Collapse
Affiliation(s)
- Meike Brockmann
- Head of Speech Pathology Section, Clinic of Otorhinolaryngology, Head and Neck Surgery, University Hospital Zurich, Frauenklinikstrasse 24, 8091 Zürich, Switzerland.
| | | | | | | |
Collapse
|
39
|
Laukkanen AM, Ilomäki I, Leppänen K, Vilkman E. Acoustic Measures and Self-reports of Vocal Fatigue by Female Teachers. J Voice 2008; 22:283-9. [PMID: 17134877 DOI: 10.1016/j.jvoice.2006.10.001] [Citation(s) in RCA: 140] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2006] [Accepted: 10/03/2006] [Indexed: 12/01/2022]
Abstract
This study investigated the relation of symptoms of vocal fatigue to acoustic variables reflecting type of voice production and the effects of vocal loading. Seventy-nine female primary school teachers volunteered as subjects. Before and after a working day, (1) a 1-minute text reading sample was recorded at habitual loudness and loudly (as in large classroom), (2) a prolonged phonation on [a:] was recorded at habitual speaking pitch and loudness, and (3) a questionnaire about voice quality, ease, or difficulty of phonation and tiredness of throat was completed. The samples were analyzed for average fundamental frequency (F0), sound pressure level (SPL), and phonation type reflecting alpha ratio (SPL [1-5 kHz]-SPL [50 Hz-1 kHz]). The vowel samples were additionally analyzed for perturbation (jitter and shimmer). After a working day, F0, SPL, and alpha ratio were higher, jitter and shimmer values were lower, and more tiredness of throat was reported. The average levels of the acoustic parameters did not correlate with the symptoms. Increase in jitter and mean F0 in loud reading correlated with tiredness of throat. The results seem to suggest that, at least among experienced vocal professionals, voice production type had little relevance from the point of view of vocal fatigue reported. Differences in the acoustic parameters after a vocally loading working day mainly seem to reflect increased muscle activity as a consequence of vocal loading.
Collapse
Affiliation(s)
- Anne-Maria Laukkanen
- Department of Speech Communication and Voice Research, University of Tampere, Tampere, Finland.
| | | | | | | |
Collapse
|
40
|
Acoustic analysis of voice using WPCVox: a comparative study with Multi Dimensional Voice Program. Eur Arch Otorhinolaryngol 2007; 265:465-76. [DOI: 10.1007/s00405-007-0467-x] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2007] [Accepted: 09/13/2007] [Indexed: 11/28/2022]
|
41
|
Chen SH, Hsiao TY, Hsiao LC, Chung YM, Chiang SC. Outcome of Resonant Voice Therapy for Female Teachers With Voice Disorders: Perceptual, Physiological, Acoustic, Aerodynamic, and Functional Measurements. J Voice 2007; 21:415-25. [PMID: 16581227 DOI: 10.1016/j.jvoice.2006.02.001] [Citation(s) in RCA: 83] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2005] [Accepted: 02/22/2006] [Indexed: 11/15/2022]
Abstract
Teachers have a high percentage of voice problems. For voice disordered teachers, resonant voice therapy is hypothesized to reduce voice problems. No research has been done on the physiological, acoustic, and aerodynamic effects of resonant voice therapy for school teachers. The purpose of this study is to investigate resonant voice therapy outcome from perceptual, physiological, acoustic, aerodynamic, and functional aspects for female teachers with voice disorders. A prospective study was designed for this research. The research subjects were 24 female teachers in Taipei. All subjects received resonant voice therapy in groups of 4 subjects, 90 minutes per session, and 1 session per week for 8 weeks. The outcome of resonant voice therapy was assessed from auditory perceptual judgment, videostroboscopic examination, acoustic measurements, aerodynamic measurements, and functional measurements before and after therapy. After therapy the severity of roughness, strain, monotone, resonance, hard attack, and glottal fry in auditory perceptual judgments, the severity of vocal fold pathology, mucosal wave, amplitude, and vocal fold closure in videostroboscopic examinations, phonation threshold pressure, and the score of physical scale in the Voice Handicap Index were significantly reduced. The speaking Fo, maximum range of speaking Fo, and maximum range of speaking intensity were significantly increased after therapy. No significant change was found in perturbation and breathiness measurements after therapy. Resonant voice therapy is effective for school teachers and is suggested as one of the therapy approaches in clinics for this population.
Collapse
Affiliation(s)
- Sheng Hwa Chen
- Department of Speech and Hearing Disorders and Sciences, National Taipei College of Nursing, Taipei, Taiwan.
| | | | | | | | | |
Collapse
|
42
|
Laukkanen AM, Kankare E. Vocal Loading-Related Changes in Male Teachers’ Voices Investigated before and after a Working Day. Folia Phoniatr Logop 2006; 58:229-39. [PMID: 16825776 DOI: 10.1159/000093180] [Citation(s) in RCA: 49] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Abstract
Vocal loading-related changes have mainly been investigated in female voice users. The present study investigated male teachers' voices before and after a working day. A questionnaire was used to select 22 male teachers as subjects from a larger group. Ten reported suffering often from symptoms of vocal fatigue (MC = multiple complaints group), 12 reported few vocal complaints (FC group). The subjects recorded a text reading sample at habitual loudness and loudly, and sustained vowel [a:] before and after an approximately 6-hour working day. Text samples were analyzed for total sound pressure level (SPL) and SPL at three frequency regions (50-1,000 Hz, 1-2 kHz, 2-5 kHz), fundamental frequency (F0) and alpha ratio [(SPL 1-5 kHz) - SPL (50 Hz-1 kHz)]. Jitter and shimmer were calculated from the vowel. The subjects filled in a questionnaire about vocal sensations. The MC group reported more symptoms of vocal fatigue, and the symptoms increased during the working day. F0 and SPL increased in both groups. Alpha ratio increased in the MC group but remained the same in the FC group. The MC group had higher values of jitter and shimmer. Jitter diminished in the FC group but did not change significantly in the MC group. The differences between the groups reflect either different strategies for coping with vocal loading or different loading-induced changes in the vocal organ.
Collapse
Affiliation(s)
- Anne-Maria Laukkanen
- Department of Speech Communication and Voice Research, University of Tampere, Tampere, Finland.
| | | |
Collapse
|
43
|
González J, Cervera T, Miralles JL. [Acoustic voice analysis: reliability of a set of multi-dimensional parameters]. ACTA OTORRINOLARINGOLOGICA ESPANOLA 2002; 53:256-68. [PMID: 12185903 DOI: 10.1016/s0001-6519(02)78309-x] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
A correlational study of the reliability of acoustic voice parameters was made of 148 healthy adults. Acoustic analysis was performed with MDVP-Multi-Dimensional Voice Program implemented in a CSL-Computerized Speech Lab of Kay Elemetrics. A set of 29 voice parameters were obtained from two samples of sustained vowel /a/ recorded from each subject. General results separated by sex are showed and the test-retest reliability in each pair of measures was calculated. Data show a high intra-subject stability of Frequency Fundamental parameters; acceptable stability in parameters of Frequency and Amplitude Perturbation, Noise, Subharmonics and Voice irregularities; and a very low consistency in Tremor parameters. Parameters related with shimmer were more reliable than parameters related with jitter. According to results several conclusions are reported.
Collapse
Affiliation(s)
- J González
- Departamento de Psicología Básica, Clínica y Psicobiología, Universidad Jaume I, 12080 Castellón.
| | | | | |
Collapse
|
44
|
Wolfe V, Long J, Youngblood HC, Williford H, Olson MS. Vocal parameters of aerobic instructors with and without voice problems. J Voice 2002; 16:52-60. [PMID: 12002887 DOI: 10.1016/s0892-1997(02)00072-3] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
Abstract
Aerobic instructors frequently experience vocal fatigue and are at risk for the development of vocal fold pathology. Six female aerobic instructors, three with self-reported voice problems and three without, served as subjects. Measures of vocal function (perturbation and EGG) were obtained before and after a 30-minute exercise session. Results showed that the group with self-reported voice problems had greater amounts of jitter, lower harmonic-to-noise ratios, and less periodicity in sustained vowels overall, but no significant differences in measures of perturbation and EGG were found before and immediately after instruction. Measures of vocal parameters showed that subjects with self-reported voice problems projected with relatively greater vocal intensity and phonated for a greater percentage of time across beginning, middle, and ending periods of aerobic instruction than subjects with no reported voice problems.
Collapse
|
45
|
Vieira MN, McInnes FR, Jack MA. On the influence of laryngeal pathologies on acoustic and electroglottographic jitter measures. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2002; 111:1045-1055. [PMID: 11863161 DOI: 10.1121/1.1430686] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Abstract
This study compared acoustic and electroglottographic (EGG) jitter from [a] vowels of 103 dysphonic speakers. The EGG recordings were chosen according to their intensity, signal-to-noise ratio, and percentage of unvoiced intervals, while acoustic signals were selected based on voicing detection and the reliability of jitter extraction. The agreement between jitter measures was expressed numerically as a normalized difference. In 63.1% (65/103) of the cases the differences fell within +/-22.5%. Positive differences above +22.5% were associated with increased acoustic jitter and occurred in 12.6% (13/103) of the speakers. These were, typically, cases of small nodular lesions without problems in the posterior larynx. On the other hand, substantial rises in EGG jitter leading to differences below -22.5% took place in 24.3% (25/103) of the speakers and were related to hyperfunctional voices, creaky-like voices, small laryngeal asymmetries affecting the arytenoids, or small-to-moderate glottal chinks. A clinically relevant outcome of the study was the possibility of detecting gentle laryngeal asymmetries among cases of large unilateral increase in EGG jitter. These asymmetries can be linked with vocal problems that are often overlooked in endoscopic examinations.
Collapse
Affiliation(s)
- Maurílio N Vieira
- Departamento de Física/ICEx, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil.
| | | | | |
Collapse
|
46
|
Baumgartner CA, Sapir S, Ramig TO. Voice Quality Changes Following Phonatory-Respiratory Effort Treatment (LSVT®) Versus Respiratory Effort Treatment for Individuals with Parkinson Disease. J Voice 2001; 15:105-14. [PMID: 12269625 DOI: 10.1016/s0892-1997(01)00010-8] [Citation(s) in RCA: 77] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
Perceptual ratings of hoarseness and breathiness were used to assess the efficacy of two intensive methods for treating dysarthrophonia in individuals with idiopathic Parkinson disease. One method emphasized phonatory-respiratory effort (the Lee Silverman Voice Treatment, LSVT) and the other emphasized respiratory effort alone (RET). Perceptual ratings were performed by two expert listeners based on random order presentation of the patients' pretreatment and posttreatment recordings of the "Rainbow Passage." The listeners were blinded to the patients and their treatment group. Statistically significant pretreatment to posttreatment improvement in hoarseness and breathiness was observed in the LSVT group but not in the RET group. The present findings are consistent with acoustic and physiologic findings reported previously, providing further evidence for the efficacy of the LSVT.
Collapse
Affiliation(s)
- C A Baumgartner
- Wilbur James Gould Voice Research Center, Denver Center for the Performing Arts, Colorado 80204, USA
| | | | | |
Collapse
|
47
|
Rantala L, Vilkman E. Relationship between subjective voice complaints and acoustic parameters in female teachers' voices. J Voice 1999; 13:484-95. [PMID: 10622515 DOI: 10.1016/s0892-1997(99)80004-6] [Citation(s) in RCA: 91] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
Abstract
The aim of the study was to identify the acoustic correlates of female teachers' subjective voice complaints by recording their voices in their working environment. The subjects made recordings during lessons (N = 10) and breaks (N = 11). The subjects were divided into 2 groups: those with few voice complaints (FC group) and those with many voice complaints (MC group). The speech sample made in the breaks was maximally sustained /a/, from which fundamental frequency (F0), jitter, and shimmer were analyzed. The classroom samples were analyzed for F0, sound pressure level (SPL), and F0 time (the active vibration time of the vocal folds). Additionally, an index for assessing voice loading is presented. The results revealed a tendency of the MC group to have higher F0 and lower SPL and perturbation values than the FC group. The index values correlated moderately with the subjective vocal complaints.
Collapse
Affiliation(s)
- L Rantala
- Department of Finnish, Saamni and Logopedics, University of Oulu, Finland.
| | | |
Collapse
|
48
|
Dromey C, Ramig LO. Intentional changes in sound pressure level and rate: their impact on measures of respiration, phonation, and articulation. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 1998; 41:1003-1018. [PMID: 9771624 DOI: 10.1044/jslhr.4105.1003] [Citation(s) in RCA: 66] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/22/2023]
Abstract
The purpose of the study was to compare the effects of changing sound pressure level (SPL) and rate on respiratory, phonatory, and articulatory behavior during sentence production. Ten subjects, 5 men and 5 women, repeated the sentence, "I sell a sapapple again," under 5 SPL and 5 rate conditions. From a multi-channel recording, measures were made of lung volume (LV), SPL, fundamental frequency (F0), semitone standard deviation (STSD), and upper and lower lip displacements and peak velocities. Loud speech led to increases in LV initiation, LV termination, F0, STSD, and articulatory displacements and peak velocities for both lips. Token-to-token variability in these articulatory measures generally decreased as SPL increased, whereas rate increases were associated with increased lip movement variability. LV excursion decreased as rate increased. F0 for the men and STSD for both genders increased with rate. Lower lip displacements became smaller for faster speech. The interspeaker differences in velocity change as a function of rate contrasted with the more consistent velocity performance across speakers for changes in SPL. Because SPL and rate change are targeted in therapy for dysarthria, the present data suggest directions for future research with disordered speakers.
Collapse
Affiliation(s)
- C Dromey
- Toronto Hospital, University of Toronto, Ontario, Canada.
| | | |
Collapse
|
49
|
Abstract
Forty normally speaking women sustained /a/ five times in two conditions, untimed and timed. In the untimed condition, phonation began whenever the subject felt ready. The timed task required subjects to begin phonation as quickly as possible after the examiner's signal. Conditions were counterbalanced across subjects. Jitter and shimmer values were obtained on a Kay Elemetrics Computerized Speech Lab (Pine Brook, NJ). Three distinct patterns of change emerged for jitter and shimmer from the untimed to the timed condition, reflecting either decreases, increases, or no change in values. These changes in jitter and shimmer values across conditions are related to a reaction time model of "bias toward accuracy."
Collapse
Affiliation(s)
- C T Ferrand
- Hofstra University, Hempstead, New York 11550-1090, USA
| |
Collapse
|
50
|
Kitch JA, Oates J, Greenwood K. Performance effects on the voices of 10 choral tenors: acoustic and perceptual findings. J Voice 1996; 10:217-27. [PMID: 8865092 DOI: 10.1016/s0892-1997(96)80002-6] [Citation(s) in RCA: 19] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
Abstract
Trained choral tenors performed a series of vocal tasks before and after a "live" performance. Acoustic (perturbation, harmonic-to-noise ratio, pitch and amplitude ranges) and perceptual analyses (auditory and proprioceptive/kinesthetic) were undertaken to detect changes from pre- to postperformance. Individuality of response to the performance was revealed, with the majority of subjects showing vocal deterioration after performance. The most sensitive vocal tasks were the comfortably pitched notes, high soft notes, and the bottom notes in scale singing. The most sensitive acoustic measure in detecting change from pre- to postperformance was harmonic-to-noise ratio. In contrast to the demonstrated acoustic changes, no significant differences in perceptual ratings were evident after the performance. Perceptual ratings did not reflect the acoustic analysis results. The present study highlights the need to establish further normative data for the singing voice and to consider individual differences in vocal characteristics in future studies of the singing voice.
Collapse
Affiliation(s)
- J A Kitch
- Speech Pathology Department, St. Vincents Hospital, Fitzroy, Victoria, Australia
| | | | | |
Collapse
|