1
|
Aghaei F, Khoramshahi H, Zamani P, Dehqan A, Hesam S. A Cepstral Peak Prominence (CPP) Voice Analysis in Iranian Post-lingual Deaf Adult Cochlear Implant Users. J Voice 2024; 38:795.e11-795.e20. [PMID: 34857450 DOI: 10.1016/j.jvoice.2021.10.021] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2021] [Revised: 10/18/2021] [Accepted: 10/18/2021] [Indexed: 11/17/2022]
Abstract
OBJECTIVE In standardized connected speech samples, cepstral peak prominence (CPP) and smoothed CPP (CPPS) have been described as accurate parameters to evaluate voice quality. Lack of normal auditory feedback in post-lingually deaf CI users might influence tuning the acoustic parameters in speech production. Based on shreds of evidence, normal hearing results in suitable vocal control through the sensory-motor linkage. The main aim of the present study was to compare the cepstral values between the Iranian cochlear implant group and normal peers. METHOD Persian CAPE-V sentences were recorded from 30 CI users and 30 healthy speakers (mean age=36.7 years, SD=13.5, range=18-60 years). Thirteen /a/vowels were extracted manually from syllables. Each subject phonated sustained /a/vowel for 5 seconds. PRAAT was used to calculate CPP and CPPS. To compare two age- and gender-matched groups, the independent sample t-test was applied. Then, ANCOVA was used to assess the impact of demographic factors on cepstral scores in CI participants. RESULTS Significant differences between the CI group and normal peers were discovered based on CPP and CPPS in both tasks (reading sentences and sustained vowel) (P < 0.05). Overall, CI users showed higher cepstral values. The implanted ear and prosthesis model had no significant impact on both CPP and CPPS (P ≥ 0.8). CONCLUSION Higher CPP and CPPS values in the CI users might be due to increased phonatory instability and spectral noise, with the possibility of decreased vocal control and its quality. The outcome suggests that CI group uses a different voice control strategy. These findings should be kept in mind for intervention methods, especially by assessing vocal characteristics and considering the voice quality in adult CI users.
Collapse
Affiliation(s)
- Fatemeh Aghaei
- Department of Speech Therapy, Ahvaz Jundishapur University of Medical Sciences, Iran
| | - Hassan Khoramshahi
- Musculoskeletal Rehabilitation Research Center, Ahvaz Jundishapur University of Medical Sciences, Iran; Department of Speech Therapy, School of Rehabilitation, Babol University of Medical Sciences, Babol, Iran
| | - Peyman Zamani
- Musculoskeletal Rehabilitation Research Center, Ahvaz Jundishapur University of Medical Sciences, Ahvaz, Iran.
| | - Ali Dehqan
- Department of Speech Therapy, Rehabilitation Faculty, Zahedan University of Medical Sciences, Zahedan, Iran
| | - Saeed Hesam
- Hearing Research Center, Clinical Sciences Research Institute, Ahvaz Jundishapur University of Medical Sciences, Ahvaz, Iran
| |
Collapse
|
2
|
Gao Y, Feng Y, Wu D, Lu F, He H, Tian C. Effect of Wearing Different Masks on Acoustic, Aerodynamic, and Formant Parameters. J Voice 2023:S0892-1997(23)00191-1. [PMID: 37541926 DOI: 10.1016/j.jvoice.2023.06.018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2023] [Revised: 06/16/2023] [Accepted: 06/16/2023] [Indexed: 08/06/2023]
Abstract
OBJECTIVE This study aimed to investigate the effects of different types of masks on acoustic, aerodynamic, and formant parameters in healthy people. METHODS Our study involved 30 healthy participants, 15 of each gender, aged 20-40 years. The tests were conducted under four conditions: without a mask, after wearing a surgical mask, after wearing a head-mounted N95 mask, and after wearing an ear-mounted N95 mask. Voice recording was done with the mask on. The acoustic parameters include mean fundamental frequency (F0), mean intensity, percentage of jitter (local), percentage of shimmer (local), mean noise to harmonic ratio (NHR), aerodynamic parameter, maximum phonation time (MPT), and formant parameters (/a/, /i/, /u/ three vowels F1, F2). RESULTS The main effect of mask type was significant in MPT, mean F0, mean HNR, /a/F1, /a/F2, /i/F2. However, the effect sizes and power in /a/F2, /i/F2 were low. MPT, mean F0 and mean HNR significantly increased and /a/F1 significantly decreased after wearing the head-mounted n95 mask. The mean F0 and mean HNR increased significantly after wearing the ear-mounted n95 mask. No significant changes were observed in parameters after wearing the surgical mask in this study. When the statistics are performed separately for males and females, the results obtained are similar to those previously obtained for unspecified males and females. CONCLUSION After wearing the surgical mask, this study found insignificant changes in mean F0, jitter (local), shimmer (local), mean NHR, mean intensity, MPT, and the vowels F1 and F2. This may be due to the looser design of the surgical mask and the relatively small attenuation of sound. N95 masks have a greater effect on vocalization than surgical masks and may cause changes in F0 and HNR after wearing an N95 mask. In the present study, no significant changes in jitter and shimmer were observed after wearing the mask. In addition, there was a significant reduction in /a/F1 after wearing the N95 headgear mask may owing to its high restriction of jaw mobility. In future studies, the change in jaw movement amplitude after wearing the mouthpiece can be added to investigate.
Collapse
Affiliation(s)
- Yongxiang Gao
- Department of Hearing and Language Rehabilitation, Ningbo Rehabilitation Hospital, Ningbo, Zhejiang Province, China
| | - Yan Feng
- School of Basic Medical Sciences, Zhejiang Chinese Medical University, Hangzhou, Zhejiang Province, China
| | - Di Wu
- College of Life and Environmental Sciences, Hangzhou Normal University, Hangzhou, Zhejiang Province, China
| | - Feiao Lu
- Center for Rehabilitation Medicine, Rehabilitation & Sports Medicine Research Institute of Zhejiang Province, Department of Rehabilitation Medicine, Zhejiang Provincial People's Hospital(Affiliated People's Hospital), Hangzhou Medical College, Hangzhou, Zhejiang, China
| | - Hualan He
- Department of Hearing and Language Rehabilitation, Ningbo Rehabilitation Hospital, Ningbo, Zhejiang Province, China
| | - Chenghua Tian
- College of Medical Technology and Information Engineering, Zhejiang Chinese Medical University, Hangzhou, Zhejiang Province, China.
| |
Collapse
|
3
|
Impact of Instructed Laryngeal Manipulation on Acoustic Measures of Voice-Preliminary Results. J Voice 2023; 37:143.e1-143.e11. [PMID: 33288382 DOI: 10.1016/j.jvoice.2020.11.004] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2020] [Revised: 11/10/2020] [Accepted: 11/11/2020] [Indexed: 01/11/2023]
Abstract
BACKGROUND Control of laryngeal muscles is required to manipulate pitch, volume, and voice quality. False vocal fold activity (FVFA) refers to the constriction and release of constriction of the false vocal folds. True vocal fold mass (TVFM) represents the cross-sectional thickness of the vocal folds. Larynx height (LH) refers to the vertical position of the larynx in the neck. To date, studies of voice control have examined the effects of these parameters separately. No study has investigated the impact of instructed systematic manipulation of these parameters on acoustic voice measures in vocally healthy trained subjects. AIMS This study examined the effects of systematically manipulating FVFA, TVFM, and LH on several acoustic voice measures. METHOD Twelve vocally trained speakers were instructed to use specific techniques to achieve experimental conditions of constriction and release of constriction of FVFA, thicker and thinner TVFM, and normal and low LH. Each condition was implemented in combination with manipulating the other parameters. Voice recordings of sustained vowel /a/ and Rainbow Passage were obtained for all laryngeal manipulation conditions and underwent acoustic analyses for fundamental frequency (F0), signal typing, harmonics-to-noise ratio (HNR), cepstral peak prominence (CPP), and vocal relative intensity. RESULTS Constricted FVFA caused more aperiodicity in the signals, lower CPP, and lower vocal relative intensity than release of constriction. Thicker TVFM resulted in significantly higher CPP and vocal relative intensity than thinner TVFM. Modifying TVFM did not affect F0 and HNR. Low LH had significantly lower F0 but did not impact on HNR, CPP, and intensity. CONCLUSIONS The effects of systematic manipulation of each laryngeal parameter resulted in independent acoustic effects without measurable interaction. Release of constriction of FVFA, thicker TVFM, and low LH were configurations that resulted in more optimal acoustic signals.
Collapse
|
4
|
Schenck A, Hilger AI, Levant S, Kim JH, Lester-Smith RA, Larson C. The Effect of Pitch and Loudness Auditory Feedback Perturbations on Vocal Quality During Sustained Phonation. J Voice 2023; 37:37-47. [PMID: 33191054 DOI: 10.1016/j.jvoice.2020.11.001] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2020] [Revised: 10/31/2020] [Accepted: 11/02/2020] [Indexed: 01/11/2023]
Abstract
OBJECTIVE Dysphonia is a reduction in vocal quality that impacts communication and is often an early sign of a voice disorder. There is little information regarding the effects of auditory feedback control of loudness and pitch on voice quality. In this study, we used both loudness-shift and pitch-shift paradigms to study the relationship between auditory feedback control and vocal quality as measured by smoothed cepstral peak prominence (CPPS), which reflects the harmonicity of the voice signal. STUDY DESIGN Experimental, mixed design. METHODS We applied 200 ms loudness-shifts (± 0, 3, or 6 dB) and pitch-shifts (± 0, 50, and 100 cents) to auditory feedback during sustained vowel production in 25 healthy adults. We then measured CPPS before and after the loudness-shift or pitch-shift to investigate the effect of changes in auditory feedback on vocal harmonicity. RESULTS & CONCLUSIONS Results showed that, on average, CPPS significantly decreased between the first half of the measured segment and the last half of the segment in the absence of auditory feedback shifts, suggesting that voice quality may be reduced across longer vowels over time. Upward and downward shifts in loudness auditory feedback caused a relative increase in CPPS, indicating an improvement in vocal harmonicity, even in cases when vocal intensity was reduced. Pitch alterations had inconsistent and minimal effects. We propose that there may be a control mechanism for voice quality that increases harmonicity of the voice signal to improve voice audibility (ie, ability to be heard) in the presence of unpredictable variability in voice intensity.
Collapse
Affiliation(s)
- Alexandra Schenck
- Department of Communication Sciences and Disorders, Northwestern University, Evanston, Illinois
| | - Allison I Hilger
- Department of Communication Sciences and Disorders, Northwestern University, Evanston, Illinois
| | - Samuel Levant
- Department of Communication Sciences and Disorders, Northwestern University, Evanston, Illinois
| | - Jason H Kim
- Department of Communication Sciences and Disorders, Northwestern University, Evanston, Illinois
| | - Rosemary A Lester-Smith
- Department of Physical Medicine and Rehabilitation, Northwestern University Feinberg School of Medicine, Chicago, Illinois
| | - Charles Larson
- Department of Communication Sciences and Disorders, Northwestern University, Evanston, Illinois.
| |
Collapse
|
5
|
Liu S, Shao J. [Current methods of acoustic analysis of voice: a review]. LIN CHUANG ER BI YAN HOU TOU JING WAI KE ZA ZHI = JOURNAL OF CLINICAL OTORHINOLARYNGOLOGY, HEAD, AND NECK SURGERY 2022; 36:966-970;976. [PMID: 36543409 PMCID: PMC10128270 DOI: 10.13201/j.issn.2096-7993.2022.12.016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Subscribe] [Scholar Register] [Received: 03/12/2022] [Indexed: 12/24/2022]
Abstract
Acoustic analysis of the voice, as an objective, quantitative, non-invasive and reproducible method for the evaluation of voice quality, can be used to detect and analyze the acoustic characteristics of normal, artistic or pathological voice. With the development of medicine, physics, statistics, and artificial intelligence technology, there are new advances in the study of voice acoustic analysis, especially in terms of acoustic parameters. In addition, artificial neural networks can be used to perform complex multi-parameter analysis, which greatly improves the efficiency of acoustic analysis. This paper provides an overview of the methods of acoustic analysis and its latest development.
Collapse
Affiliation(s)
- Siwei Liu
- Department of Otolaryngology,Eye&ENT Hospital,Fudan University,Shanghai,200031,China
| | - Jun Shao
- Department of Otolaryngology,Eye&ENT Hospital,Fudan University,Shanghai,200031,China
| |
Collapse
|
6
|
Heller Murray ES, Chao A, Colletti L. A Practical Guide to Calculating Cepstral Peak Prominence in Praat. J Voice 2022:S0892-1997(22)00275-2. [PMID: 36210224 DOI: 10.1016/j.jvoice.2022.09.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2022] [Revised: 09/01/2022] [Accepted: 09/02/2022] [Indexed: 11/05/2022]
Abstract
The acoustic measure of cepstral peak prominence (CPP) is recommended for the analysis of dysphonia. Yet, clinical use of this measure is not universal, as clinicians and researchers are still learning the strengths and limitations of this measure. Furthermore, affordable access to specialized acoustic software is a significant barrier to universal CPP use. This article will provide a guide on how to calculate CPP in Praat, a free software program, using a new CPP plugin. Important external factors that could influence CPP measures are discussed, and suggestions for clinical use are provided. As CPP becomes more widely used by clinicians and researchers, it is important to consider external factors that may inadvertently influence CPP values. Controlling for these external factors will aid in reducing variability across CPP values, which will make CPP a valuable tool for both clinical and research purposes.
Collapse
Affiliation(s)
- Elizabeth S Heller Murray
- Department of Communication Sciences and Disorders, College of Public Health, Temple University, Philadelphia, Pennsylvania.
| | - Andie Chao
- Department of Communication Sciences and Disorders, College of Public Health, Temple University, Philadelphia, Pennsylvania
| | - Lauren Colletti
- Department of Communication Sciences and Disorders, College of Public Health, Temple University, Philadelphia, Pennsylvania
| |
Collapse
|
7
|
Keung LC, Richardson K, Sharp Matheron D, Martel-Sauvageau V. A Comparison of Healthy and Disordered Voices Using Multi-Dimensional Voice Program, Praat, and TF32. J Voice 2022:S0892-1997(22)00011-X. [PMID: 35246346 DOI: 10.1016/j.jvoice.2022.01.010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2021] [Revised: 01/06/2022] [Accepted: 01/10/2022] [Indexed: 10/19/2022]
Abstract
PURPOSE Instrumental voice assessment plays a critical role in identifying vocal issues and for documenting treatment outcomes. The reported voice data, however, are sensitive to the algorithm used by each acoustic analysis software program (AASP) to analyze the corresponding waveform. In the present study, five acoustic measures were compared across healthy speakers and speakers with dysphonia for three AASPs commonly used in research, education, and clinical practice: Multidimensional Voice Program (MDVP) by Computerized Speech Lab, Praat, and TF32. MATERIALS AND METHODS Sustained vowel phonations for the quantal vowels /ɑ/, /i/, and /u/ were analyzed for 80 speakers with organic dysphonia and 60 age- and sex-matched healthy controls. Descriptive, inferential, and correlation data are reported for mean fundamental frequency (mean F0), standard deviation of fundamental frequency (SD F0), short-term perturbation measures of jitter and shimmer, and harmonic-to-noise ratio (HNR). RESULTS The present study replicated previous findings of interprogram differences for healthy speakers, with MDVP consistently yielding higher values than Praat and TF32 for SD F0, jitter, and shimmer and lower values for HNR. Similar, but magnified patterns of results were observed for speakers with dysphonia. CONCLUSION The variation observed across programs calls into question the validity in comparing voice outcomes reported by one AASP to those previously obtained by another, particularly for acoustic signals with aperiodic components that are commonly present in disordered voices. It is advised that waveforms be visually inspected prior to conducting acoustic analysis, and that voice outcomes not be combined or compared across AASPs.
Collapse
Affiliation(s)
- Lap-Ching Keung
- Department of Communication Disorders, University of Massachusetts Amherst, Amherst, Massachusetts
| | - Kelly Richardson
- Department of Communication Disorders, University of Massachusetts Amherst, Amherst, Massachusetts.
| | - Deborah Sharp Matheron
- Communication Disorders and Sciences Department, State University of New York College at Cortland, Cortland, New York
| | - Vincent Martel-Sauvageau
- Rehabilitation Department, Speech-Language Pathology Program, Université Laval, Quebec City, Quebec, Canada
| |
Collapse
|
8
|
Eshghi M, Connaghan KP, Gutz SE, Berry JD, Yunusova Y, Green JR. Co-Occurrence of Hypernasality and Voice Impairment in Amyotrophic Lateral Sclerosis: Acoustic Quantification. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2021; 64:4772-4783. [PMID: 34714698 PMCID: PMC9150680 DOI: 10.1044/2021_jslhr-21-00123] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/01/2021] [Revised: 07/22/2021] [Accepted: 07/23/2021] [Indexed: 05/31/2023]
Abstract
PURPOSE Hypernasality and atypical voice characteristics are common features of dysarthric speech due to amyotrophic lateral sclerosis (ALS). Existing acoustic measures have been developed to primarily target either hypernasality or voice impairment, and the effects of co-occurring hypernasality-voice problems on these measures are unknown. This report explores (a) the extent to which acoustic measures are affected by concurrent perceptually identified hypernasality and voice impairment due to ALS and (b) candidate acoustic measures of early indicators of hypernasality and voice impairment in the presence of multisystem involvement in individuals with ALS. METHOD Two expert listeners rated severity of hypernasality and voice impairment in sentences produced by individuals with ALS (n = 27). The samples were stratified based on perceptual ratings: voice/hypernasality asymptomatic, predominantly hypernasal, predominantly voice impairment, and mixed (co-occurring hypernasality and voice impairment). Groups were compared using established acoustic measures of hypernasality (one-third octave analysis) and voice (cepstral/spectral analysis) impairment. RESULTS The one-third octave analysis differentiated all groups; the cepstral peak prominence differentiated all groups except asymptomatic versus mixed, whereas the low-to-high spectral ratio did not differ among groups. Additionally, one-third octave analyses demonstrated promising speech diagnostic potential. CONCLUSIONS The results highlight the need to consider the validity of measures in the context of multisubsystem involvement. Our preliminary findings further suggest that the one-third octave analysis may be an optimal approach to quantify hypernasality and voice abnormalities in the presence of multisystem speech impairment. Future evaluation of the diagnostic accuracy of the one-third octave analysis is warranted.
Collapse
Affiliation(s)
- Marziye Eshghi
- Speech and Feeding Disorders Lab, MGH Institute of Health Professions, Boston, MA
| | - Kathryn P. Connaghan
- Speech and Feeding Disorders Lab, MGH Institute of Health Professions, Boston, MA
| | - Sarah E. Gutz
- Program in Speech and Hearing Bioscience and Technology, Harvard University, Boston, MA
| | - James D. Berry
- Sean M. Healey and AMG Center for ALS, Department of Neurology, Massachusetts General Hospital, Boston
| | - Yana Yunusova
- Department of Speech-Language Pathology, Rehabilitation Sciences Institute, University of Toronto, Ontario, Canada
- Hurvitz Brain Sciences Program, Sunnybrook Research Institute, Toronto, Ontario, Canada
- Toronto Rehabilitation Institute (KITE), University Health Network, Ontario, Canada
| | - Jordan R. Green
- Speech and Feeding Disorders Lab, MGH Institute of Health Professions, Boston, MA
- Program in Speech and Hearing Bioscience and Technology, Harvard University, Boston, MA
| |
Collapse
|
9
|
Does the wearing of masks change voice and speech parameters? Eur Arch Otorhinolaryngol 2021; 279:1701-1708. [PMID: 34550454 PMCID: PMC8456395 DOI: 10.1007/s00405-021-07086-9] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2021] [Accepted: 09/10/2021] [Indexed: 11/01/2022]
Abstract
PURPOSE The authors aim to review available reports on the potential effects of masks on voice and speech parameters. METHODS A literature search was conducted using MEDLINE and Google Scholar databases through July 2021. Several targeted populations, mask scenarios and methodologies were approached. The assessed voice parameters were divided into self-reported, acoustic and aerodynamic. RESULTS It was observed that the wearing of a face mask has been shown to induce several changes in voice parameters: (1) self-reported-significantly increased vocal effort and fatigue, increased vocal tract discomfort and increased values of voice handicap index (VHI) were observed; (2) acoustics-increased voice intensity, altered formants frequency (F2 and F3) with no changes in fundamental frequency, increased harmonics-to-noise ratio (HNR) and increased mean spectral values in high-frequency levels (1000-8000 Hz), especially with KN95 mask; (3) aerodynamics-maximum phonatory time was assessed in only two reports, and showed no alterations. CONCLUSION Despite the different populations, mask-type scenarios and methodologies described by each study, the results of this review outline the significant changes in voice characteristics with the use of face masks. Wearing a mask shows to increase the perception of vocal effort and an alteration of the vocal tract length and speech articulatory movements, leading to spectral sound changes, impaired communication and perception. Studies analyzing the effect of masks on voice aerodynamics are lacking. Further research is required to study the long-term effects of face masks on the potential development of voice pathology.
Collapse
|
10
|
Icht M, Wiznitser Ressis-Tal H, Lotan M. Can the Vocal Expression of Intellectually Disabled Individuals Be Used as a Pain Indicator? Initial Findings Supporting a Possible Novice Assessment Method. Front Psychol 2021; 12:655202. [PMID: 34366973 PMCID: PMC8339267 DOI: 10.3389/fpsyg.2021.655202] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2021] [Accepted: 06/22/2021] [Indexed: 11/23/2022] Open
Abstract
Pain is difficult to assess in non-verbal populations such as individuals with intellectual and developmental disability (IDD). Due to scarce research in this area, pain assessment for individuals with IDD is still lacking, leading to maltreatment. To improve medical care for individuals with IDD, immediate, reliable, easy to use pain detection methods should be developed. The goal of this preliminary study was to examine the sensitivity of acoustic features of vocal expressions in identifying pain for adults with IDD, assessing their feasibility as a pain detection indicator for those individuals. Such unique pain related vocal characteristics may be used to develop objective pain detection means. Adults with severe-profound IDD level (N = 9) were recorded in daily activities associated with pain (during diaper changes), or without pain (at rest). Spontaneous vocal expressions were acoustically analyzed to assess several voice characteristics. Analyzing the data revealed that pain related vocal expressions were characterized by significantly higher number of pulses and higher shimmer values relative to no-pain vocal expressions. Pain related productions were also characterized by longer duration, higher jitter and Cepstral Peak Prominence values, lower Harmonic-Noise Ratio, lower difference between the amplitude of the 1st and 2nd harmonic (corrected for vocal tract influence; H1H2c), and higher mean and standard deviation of voice fundamental frequency relative to no-pain related vocal productions, yet these findings were not statistically significant, possibly due to the small and heterogeneous sample. These initial results may prompt further research to explore the possibility to use pain related vocal output as an objective and easily identifiable indicator of pain in this population.
Collapse
Affiliation(s)
- Michal Icht
- Department of Communication Disorders, Ariel University, Ariel, Israel
| | | | - Meir Lotan
- Department of Physiotherapy, Ariel University, Ariel, Israel
| |
Collapse
|
11
|
Nguyen DD, McCabe P, Thomas D, Purcell A, Doble M, Novakovic D, Chacon A, Madill C. Acoustic voice characteristics with and without wearing a facemask. Sci Rep 2021; 11:5651. [PMID: 33707509 PMCID: PMC7970997 DOI: 10.1038/s41598-021-85130-8] [Citation(s) in RCA: 33] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2020] [Accepted: 02/19/2021] [Indexed: 01/31/2023] Open
Abstract
Facemasks are essential for healthcare workers but characteristics of the voice whilst wearing this personal protective equipment are not well understood. In the present study, we compared acoustic voice measures in recordings of sixteen adults producing standardised vocal tasks with and without wearing either a surgical mask or a KN95 mask. Data were analysed for mean spectral levels at 0-1 kHz and 1-8 kHz regions, an energy ratio between 0-1 and 1-8 kHz (LH1000), harmonics-to-noise ratio (HNR), smoothed cepstral peak prominence (CPPS), and vocal intensity. In connected speech there was significant attenuation of mean spectral level at 1-8 kHz region and there was no significant change in this measure at 0-1 kHz. Mean spectral levels of vowel did not change significantly in mask-wearing conditions. LH1000 for connected speech significantly increased whilst wearing either a surgical mask or KN95 mask but no significant change in this measure was found for vowel. HNR was higher in the mask-wearing conditions than the no-mask condition. CPPS and vocal intensity did not change in mask-wearing conditions. These findings implied an attenuation effects of wearing these types of masks on the voice spectra with surgical mask showing less impact than the KN95.
Collapse
Affiliation(s)
- Duy Duong Nguyen
- grid.1013.30000 0004 1936 834XVoice Research Laboratory, Faculty of Medicine and Health, D18, Susan Wakil Health Building, Camperdown Campus, The University of Sydney, Western Avenue, Sydney, NSW 2006 Australia
| | - Patricia McCabe
- grid.1013.30000 0004 1936 834XVoice Research Laboratory, Faculty of Medicine and Health, D18, Susan Wakil Health Building, Camperdown Campus, The University of Sydney, Western Avenue, Sydney, NSW 2006 Australia
| | - Donna Thomas
- grid.1013.30000 0004 1936 834XVoice Research Laboratory, Faculty of Medicine and Health, D18, Susan Wakil Health Building, Camperdown Campus, The University of Sydney, Western Avenue, Sydney, NSW 2006 Australia
| | - Alison Purcell
- grid.1013.30000 0004 1936 834XVoice Research Laboratory, Faculty of Medicine and Health, D18, Susan Wakil Health Building, Camperdown Campus, The University of Sydney, Western Avenue, Sydney, NSW 2006 Australia
| | - Maree Doble
- grid.1013.30000 0004 1936 834XVoice Research Laboratory, Faculty of Medicine and Health, D18, Susan Wakil Health Building, Camperdown Campus, The University of Sydney, Western Avenue, Sydney, NSW 2006 Australia
| | - Daniel Novakovic
- grid.1013.30000 0004 1936 834XVoice Research Laboratory, Faculty of Medicine and Health, D18, Susan Wakil Health Building, Camperdown Campus, The University of Sydney, Western Avenue, Sydney, NSW 2006 Australia
| | - Antonia Chacon
- grid.1013.30000 0004 1936 834XVoice Research Laboratory, Faculty of Medicine and Health, D18, Susan Wakil Health Building, Camperdown Campus, The University of Sydney, Western Avenue, Sydney, NSW 2006 Australia
| | - Catherine Madill
- grid.1013.30000 0004 1936 834XVoice Research Laboratory, Faculty of Medicine and Health, D18, Susan Wakil Health Building, Camperdown Campus, The University of Sydney, Western Avenue, Sydney, NSW 2006 Australia
| |
Collapse
|
12
|
Hosseinabad HH, Kummer AW, Boyce S. Nasal rustle: The retrospective and prospective investigation of effects of bubbling of secretions on speech. Int J Pediatr Otorhinolaryngol 2021; 140:110480. [PMID: 33187722 DOI: 10.1016/j.ijporl.2020.110480] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/17/2020] [Revised: 10/26/2020] [Accepted: 10/28/2020] [Indexed: 10/23/2022]
Abstract
OBJECTIVE "Nasal rustle" is a type of nasal emission associated with a small velopharyngeal (VP) gap and distracting loud noise. Currently, the mechanisms behind noise generation are unclear. In this study, we use a combination of retrospective and prospective data to test the hypotheses that bubbling of secretions could be a source of audible noise. DESIGN Retrospective: Nasopharyngoscopy records of 151 patients with nasal rustle were reviewed to determine if bubbling occurred during their nasopharyngoscopy examination. Prospective: Nine children with nasal rustle and bubbling of secretions were suctioned with the scope in place to assure removal of secretions. The Nasometer II was used to record the children's production of oral sentences prior to and post suctioning. All sentences were analyzed for the presence or absence of noise, nasalance scores, and Cepstral Peak Prominence (CPP). Intra-and inter-judge reliability of coding was high. RESULTS Retrospective: 70% of the patients with nasal rustle had bubbling of secretions during nasopharyngoscopy. Prospective: Percentages of audible noise were reduced significantly post suctioning (Friedman's Test, Chi-square = 24.5, p = 0.001) with the greatest decrease in syllables with fricatives and bilabial stops (p < 0.05). The average CPP and nasalance scores pre-vs post-suctioning showed no significant differences (p = 0.91, 0.29). CONCLUSIONS Retrospective: The high percentage of patients with nasal rustle had bubbling of secretions when producing speech in nasopharyngoscopy evaluations. Prospective: The incidence of audible noise was reduced as a result of suctioning. This suggests that the presence of secretions contributes to the production of nasal rustle.
Collapse
Affiliation(s)
| | - Ann W Kummer
- Division of Speech-Language Pathology, Cincinnati Children's Hospital Medical Center, OH, USA
| | - Suzanne Boyce
- Department of Communication Sciences & Disorders, University of Cincinnati, OH, USA
| |
Collapse
|
13
|
Saggio G, Costantini G. Worldwide Healthy Adult Voice Baseline Parameters: A Comprehensive Review. J Voice 2020; 36:637-649. [PMID: 33039203 DOI: 10.1016/j.jvoice.2020.08.028] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2020] [Revised: 08/20/2020] [Accepted: 08/21/2020] [Indexed: 12/17/2022]
Abstract
The voice results in acoustic signals analyzed and synthetized at first for telecommunication matters, and more recently investigated for medical purposes. In particular, voice signal characteristics can evidence individual health conditions useful for screening, diagnostic and remote monitoring aims. Within this frame, the knowledge of baseline features of healthy voice is mandatory, in order to balance a comparison with their unhealthy counterpart. However, the baseline features of the human voice depend on gender, age-range and ethnicity and, as far as we know, no work reports as those features spread worldwide. This paper intends to cover this lack. Our database research yielded 179 relevant published studies, retrieved using digital libraries of IEEE Xplore, Scopus, Web of Science, Iop Science, Taylor and Francis Online, and Scitepress. These relevant studies report different features, among which here we consider the most investigated ones, within the most investigated age-range. In particular, the features are the fundamental frequency, the jitter, the shimmer, the harmonic-to-noise ratio, and the cepstral peak prominence, the most investigated age-range is within 20-40 years and, related to the ethnicity, 20 countries are considered.
Collapse
Affiliation(s)
- Giovanni Saggio
- Department of Electronic Engineering, University of Rome Tor Vergata, Rome, Italy.
| | - Giovanni Costantini
- Department of Electronic Engineering, University of Rome Tor Vergata, Rome, Italy
| |
Collapse
|
14
|
Sampaio M, Vaz Masson ML, de Paula Soares MF, Bohlender JE, Brockmann-Bauser M. Effects of Fundamental Frequency, Vocal Intensity, Sample Duration, and Vowel Context in Cepstral and Spectral Measures of Dysphonic Voices. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2020; 63:1326-1339. [PMID: 32348195 DOI: 10.1044/2020_jslhr-19-00049] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Purpose Smoothed cepstral peak prominence (CPPS) and harmonics-to-noise ratio (HNR) are acoustic measures related to the periodicity, harmonicity, and noise components of an acoustic signal. To date, there is little evidence about the advantages of CPPS over HNR in voice diagnostics. Recent studies indicate that voice fundamental frequency (F0) and intensity (sound pressure level [SPL]), sample duration (DUR), vowel context (speech vs. sustained phonation), and syllable stress (SS) may influence CPPS and HNR results. The scope of this work was to investigate the effects of voice F0 and SPL, DUR, SS, and token on CPPS and HNR in dysphonic voices. Method In this retrospective study, 27 Brazilian Portuguese speakers with voice disorders were investigated. Recordings of sustained vowels (SVs) /a:/ and manually extracted vowels (EVs) /a/ from Consensus Auditory-Perceptual Evaluation of Voice sentences were acoustically analyzed with the Praat program. Results There was a highly significant effect of F0, SPL, and DUR on both CPPS and HNR (p < .001), whereas SS and vowel context significantly affected CPPS only (p < .05). Higher SPL, F0, and lower DUR were related to higher CPPS and HNR. SVs moderately-to-highly correlated with EVs for CPPS, whereas HNR had few and moderate correlations. In addition, CPPS and HNR highly correlated in SVs and seven EVs (p < .05). Conclusion Speaking prosodic variations of F0, SPL, and DUR influenced both CPPS and HNR measures and led to acoustic differences between sustained and excised vowels, especially in CPPS. Vowel context, prosodic factors, and token type should be controlled for in clinical acoustic voice assessment.
Collapse
Affiliation(s)
- Marília Sampaio
- Department of Speech, Language and Hearing Sciences, Institute of Health Sciences, Federal University of Bahia, Salvador, Brazil
- Department of Phoniatrics and Speech Pathology, Clinic for Otorhinolaryngology, Head and Neck Surgery, University Hospital Zurich, Switzerland
| | - Maria Lúcia Vaz Masson
- Department of Speech, Language and Hearing Sciences, Institute of Health Sciences, Federal University of Bahia, Salvador, Brazil
| | - Maria Francisca de Paula Soares
- Department of Speech, Language and Hearing Sciences, Institute of Health Sciences, Federal University of Bahia, Salvador, Brazil
| | - Jörg Edgar Bohlender
- Department of Phoniatrics and Speech Pathology, Clinic for Otorhinolaryngology, Head and Neck Surgery, University Hospital Zurich, Switzerland
- University of Zurich, Switzerland
| | - Meike Brockmann-Bauser
- Department of Phoniatrics and Speech Pathology, Clinic for Otorhinolaryngology, Head and Neck Surgery, University Hospital Zurich, Switzerland
- University of Zurich, Switzerland
| |
Collapse
|