1
Hosseiniheshmatian M, Darouie A, Hasanvand A, Bakhshi E. A Comparison of the Dysphonia Severity Index and Singing Voice Handicap Index Between Trained and Untrained Male Iranian Traditional Singers. J Voice 2024:S0892-1997(24)00071-7. [PMID: 38714438 DOI: 10.1016/j.jvoice.2024.02.032] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/24/2023] [Revised: 02/29/2024] [Accepted: 02/29/2024] [Indexed: 05/09/2024]
Abstract
OBJECTIVES This study aimed to compare trained and untrained Iranian traditional singers on the Dysphonia Severity Index (DSI) and the Singing Voice Handicap Index. The second objective was to examine the relationship between the DSI and Persian version of the Singing Voice Handicap Index (P-SVHI) scores in each group. STUDY DESIGN Comparative cross-sectional study. METHODS This study included 17 trained male Iranian traditional singers, who were compared with 17 age-matched untrained ones. All participants completed the P-SVHI. Measures of jitter, lowest intensity (I-low), highest phonational frequency (F0-high), and maximum phonation time (MPT) were obtained from each participant, and the DSI scores were calculated from these values. RESULTS Trained male traditional singers had an average age of 33.76 ± 7.45 years, 3.24 ± 1.85 hours of daily practice, 5.24 ± 3.78 years of training, and 3.06 ± 2.65 professional performances; untrained singers had an average age of 32.76 ± 12.92 years, 1.53 ± 1.17 hours of daily practice, and 0.88 ± 1.65 professional performances. Trained singers had lower P-SVHI scores and sub-scores, longer MPT, higher F0-high, lower jitter, and lower I-low, resulting in higher DSI values (P < 0.05). There was no significant correlation between P-SVHI scores and DSI values in either group (P > 0.05). CONCLUSIONS The results of this study show that the vocal abilities of professional vocalists are enhanced through voice training (higher DSI scores in trained singers vs untrained ones). Voice training can also help to lessen the perception of a handicap related to the singing voice. Hence, it may be necessary to consider alternative norms for the DSI and P-SVHI when administering them to patients who have undergone guided vocal training, such as voice/singing lessons. The current research indicates that the perception of singing voice handicap and DSI values are two distinct characteristics that do not have a significant correlation.
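The abstract above says the DSI is calculated from MPT, highest frequency, lowest intensity, and jitter, but does not give the weights. As a minimal sketch, the Python below assumes the commonly cited weighting from the original DSI publication (Wuyts et al., 2000); the function name and the input values are hypothetical and are not taken from this study.

```python
def dysphonia_severity_index(mpt_s, f0_high_hz, i_low_db, jitter_pct):
    """Weighted combination commonly cited for the DSI (assumed, not from this abstract).

    mpt_s      -- maximum phonation time in seconds
    f0_high_hz -- highest phonational frequency in Hz
    i_low_db   -- lowest intensity in dB
    jitter_pct -- jitter in percent
    """
    return (0.13 * mpt_s
            + 0.0053 * f0_high_hz
            - 0.26 * i_low_db
            - 1.18 * jitter_pct
            + 12.4)


# Hypothetical measurements for one speaker (illustration only).
example_dsi = dysphonia_severity_index(mpt_s=20, f0_high_hz=600, i_low_db=50, jitter_pct=0.5)
print(round(example_dsi, 2))
```

With this weighting, a longer MPT and higher F0-high raise the score, while a higher I-low and higher jitter lower it, which matches the direction of the group differences reported in the abstract.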
Affiliation(s)
- Akbar Darouie
- Department of Speech Therapy, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
- Arezoo Hasanvand
- Department of Speech Therapy, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
- Enayatollah Bakhshi
- Department of Biostatistics and Epidemiology, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran
2
Asiaee M, Nourbakhsh M, Vahedian-Azimi A, Zare M, Jafari R, Atashi SS, Keramatfar A. The feasibility of using acoustic measures for predicting the Total Opacity Scores of chest computed tomography scans in patients with COVID-19. CLINICAL LINGUISTICS & PHONETICS 2024; 38:97-115. [PMID: 36592050 DOI: 10.1080/02699206.2022.2160659] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/30/2021] [Revised: 12/10/2022] [Accepted: 12/13/2022] [Indexed: 06/17/2023]
Abstract
This study examined the possibility of using acoustic parameters, i.e., the Acoustic Voice Quality Index (AVQI) and Maximum Phonation Time (MPT), to predict the degree of lung involvement in COVID-19 patients. This cross-sectional case-control study was conducted on voice samples collected from 163 healthy individuals and 181 patients with COVID-19. Each participant produced a sustained vowel /a/ and a phonetically balanced Persian text containing 36 syllables. AVQI and MPT were measured using Praat scripts. Each patient underwent a non-enhanced chest computed tomographic scan, and the Total Opacity (TO) score was rated to assess the degree of lung involvement. The results revealed significant differences between patients with COVID-19 and healthy individuals in terms of AVQI and MPT. A significant difference was also observed between male and female participants in AVQI and MPT. The receiver operating characteristic curve analysis and area under the curve indicated that MPT (0.909) had higher diagnostic accuracy than AVQI (0.771). A significant relationship was observed between AVQI and TO scores; no such relationship was observed for MPT. The findings indicated that MPT was a better classifier than AVQI in differentiating patients from healthy individuals. The results also showed that AVQI can be used as a predictor of the degree of lung involvement in patients and recovered individuals. A formula is suggested for calculating the degree of lung involvement using AVQI.
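The area-under-the-curve values quoted above can be read as the probability that a randomly chosen patient scores higher than a randomly chosen healthy individual, with ties counted as half. The sketch below computes the AUC that way in plain Python. The function name and the scores are hypothetical, and the abstract does not say which software the authors used for their ROC analysis.

```python
def auc_from_scores(positive_scores, negative_scores):
    """AUC as the fraction of (positive, negative) pairs ranked correctly.

    Assumes higher scores indicate the positive class; ties count as 0.5.
    """
    wins = 0.0
    for p in positive_scores:
        for n in negative_scores:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positive_scores) * len(negative_scores))


# Hypothetical index values (illustration only, not study data).
patients = [5.1, 4.2, 3.9]
healthy = [2.0, 3.0, 4.0]
print(auc_from_scores(patients, healthy))
```

For a measure where lower values indicate pathology (for example, a shorter MPT), the two argument lists would be swapped or the scores negated before calling the function.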
Affiliation(s)
- Maral Asiaee
- Department of Linguistics, Faculty of Literature, Alzahra University, Tehran, Iran
- Mandana Nourbakhsh
- Department of Linguistics, Faculty of Literature, Alzahra University, Tehran, Iran
- Amir Vahedian-Azimi
- Trauma Research Center, Nursing faculty, Baqiyatallah University of Medical Sciences, Tehran, Iran
- Mohammad Zare
- Department of Statistics, Faculty of Mathematics and Computer Science, Amirkabir University of Technology (Tehran Polytechnic), Tehran, Iran
- Ramezan Jafari
- Department of Radiology, Health Research Center, Lifestyle Institute, Baqiyatallah University of Medical Sciences, Tehran, Iran
- Seyed Shahab Atashi
- Department of Food and Drug control, Jundishapour University of Medical Sciences, Ahvaz, Iran
3
Cantor-Cutiva LC, Ramani SA, Walden PR, Hunter EJ. Screening of Voice Pathologies: Identifying the Predictive Value of Voice Acoustic Parameters for Common Voice Pathologies. J Voice 2023:S0892-1997(23)00390-9. [PMID: 38143203 PMCID: PMC11193840 DOI: 10.1016/j.jvoice.2023.12.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/24/2023] [Revised: 12/01/2023] [Accepted: 12/04/2023] [Indexed: 12/26/2023]
Abstract
BACKGROUND Voice acoustic analysis is important for objectively assessing voice production and diagnosing voice disorders. AIM This study aimed to investigate the sensitivity of various voice acoustic parameters in differentiating common voice pathology types. METHODS Data from the publicly available Perceptual Voice Qualities Database were analyzed; the database includes recordings of participants with and without voice disorders. A wide range of acoustic parameters was estimated from the recordings, such as alpha ratio, harmonics-to-noise ratio (HNR), cepstral peak prominence smoothed (CPPS), pitch period entropy (PPE), fundamental frequency, jitter, shimmer, and sound pressure levels. The predictive capabilities of the parameters were evaluated using receiver operating characteristic curves. Linear regression analysis determined the associations between parameters and voice disorders. Principal component analysis was conducted to identify important parameters for distinguishing voice disorders. RESULTS AND CONCLUSION This study identified significant differences in acoustic parameters between those with and without voice disorders. Notably, the combination of five parameters (PPE, shimmer, jitter, CPPS, and HNR) was identified as a strong predictor in voice disorder screening. These findings contribute to the field of voice disorders, offering insights for screening and diagnosis.
Affiliation(s)
- Sai Aishwarya Ramani
- Department of Communicative Sciences and Disorders, Michigan State University, East Lansing, Michigan
- Eric J Hunter
- Department of Communication Sciences and Disorders, University of Iowa, Iowa City, Iowa
4
Uloza V, Pribuišis K, Ulozaite-Staniene N, Petrauskas T, Damaševičius R, Maskeliūnas R. Accuracy Analysis of the Multiparametric Acoustic Voice Indices, the VWI, AVQI, ABI, and DSI Measures, in Differentiating between Normal and Dysphonic Voices. J Clin Med 2023; 13:99. [PMID: 38202106 PMCID: PMC10779457 DOI: 10.3390/jcm13010099] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/16/2023] [Revised: 12/21/2023] [Accepted: 12/22/2023] [Indexed: 01/12/2024] Open
Abstract
The study aimed to investigate and compare the accuracy and robustness of the multiparametric acoustic voice indices (MAVIs), namely the Dysphonia Severity Index (DSI), Acoustic Voice Quality Index (AVQI), Acoustic Breathiness Index (ABI), and Voice Wellness Index (VWI), in differentiating normal and dysphonic voices. The study group consisted of 129 adult individuals, including 49 with normal voices and 80 patients with pathological voices. The diagnostic accuracy of the investigated MAVIs in differentiating between normal and pathological voices was assessed using receiver operating characteristics (ROC). Moderate to strong positive linear correlations were observed between different MAVIs. The ROC analysis revealed that all measures used showed a high level of accuracy (area under the curve (AUC) of 0.80 and greater) and an acceptable level of sensitivity and specificity in discriminating between normal and pathological voices. However, with an AUC of 0.99, the VWI demonstrated the highest diagnostic accuracy. The highest Youden index was 0.93, revealing that a VWI cut-off of 4.45 corresponds with highly acceptable sensitivity (97.50%) and specificity (95.92%). In conclusion, the VWI was found to be beneficial in describing differences in voice quality status and discriminating between normal and dysphonic voices based on clinical diagnosis, i.e., dysphonia type, implying the VWI's reliable voice screening potential.
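The Youden index reported above follows directly from the quoted sensitivity and specificity, since it is defined as J = sensitivity + specificity - 1. The short sketch below checks that arithmetic; the function name is an illustrative choice, and the two inputs are the percentages from the abstract expressed as fractions.

```python
def youden_index(sensitivity, specificity):
    """Youden's J statistic for one cut-off: sensitivity + specificity - 1."""
    return sensitivity + specificity - 1.0


# Sensitivity 97.50% and specificity 95.92% at the VWI cut-off of 4.45.
j = youden_index(0.9750, 0.9592)
print(round(j, 2))  # 0.93, matching the value reported in the abstract
```

In an ROC analysis the same quantity is evaluated at every candidate cut-off, and the cut-off with the largest J is taken as the optimal threshold.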
Affiliation(s)
- Virgilijus Uloza
- Department of Otorhinolaryngology, Lithuanian University of Health Sciences, 50061 Kaunas, Lithuania; (V.U.); (K.P.); (T.P.)
- Kipras Pribuišis
- Department of Otorhinolaryngology, Lithuanian University of Health Sciences, 50061 Kaunas, Lithuania; (V.U.); (K.P.); (T.P.)
- Nora Ulozaite-Staniene
- Department of Otorhinolaryngology, Lithuanian University of Health Sciences, 50061 Kaunas, Lithuania; (V.U.); (K.P.); (T.P.)
- Tadas Petrauskas
- Department of Otorhinolaryngology, Lithuanian University of Health Sciences, 50061 Kaunas, Lithuania; (V.U.); (K.P.); (T.P.)
- Rytis Maskeliūnas
- Faculty of Informatics, Kaunas University of Technology, 51368 Kaunas, Lithuania
5
Uloza V, Ulozaitė-Stanienė N, Petrauskas T, Pribuišis K, Ulozienė I, Blažauskas T, Damaševičius R, Maskeliūnas R. Smartphone-Based Voice Wellness Index Application for Dysphonia Screening and Assessment: Development and Reliability. J Voice 2023:S0892-1997(23)00330-2. [PMID: 37980209 DOI: 10.1016/j.jvoice.2023.10.021] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 07/17/2023] [Revised: 10/12/2023] [Accepted: 10/12/2023] [Indexed: 11/20/2023]
Abstract
OBJECTIVE This study aimed to develop a Voice Wellness Index (VWI) application combining the acoustic voice quality index (AVQI) and glottal function index (GFI) data and to evaluate its reliability in quantitative voice assessment and normal versus pathological voice differentiation. STUDY DESIGN Cross-sectional study. METHODS A total of 135 adult participants (86 patients with voice disorders and 49 participants with normal voices) were included in this study. Five iOS and Android smartphones with the "Voice Wellness Index" app installed were used to estimate VWI. The VWI data obtained using smartphones were compared with VWI measurements computed from voice recordings collected from a reference studio microphone. The diagnostic efficacy of VWI in differentiating between normal and disordered voices was assessed using receiver operating characteristics (ROC). RESULTS With a Cronbach's alpha of 0.972 and an ICC of 0.972 (0.964-0.979), the VWI scores of the individual smartphones demonstrated remarkable inter-smartphone agreement and reliability. The VWI data obtained from different smartphones and a studio microphone showed nearly perfect direct linear correlations (r = 0.993-0.998). Depending on the individual smartphone device used, the cutoff scores of VWI for differentiating between normal and pathological voice groups were calculated as 5.6-6.0, with the best balance between sensitivity (94.10-95.15%) and specificity (93.68-95.72%). The diagnostic accuracy was excellent in all cases, with an area under the curve (AUC) of 0.970-0.974. CONCLUSION The "Voice Wellness Index" application is an accurate and reliable tool for voice quality measurement and normal versus pathological voice screening and has considerable potential to be used by healthcare professionals and patients for voice assessment.
Affiliation(s)
- Virgilijus Uloza
- Department of Otorhinolaryngology, Lithuanian University of Health Sciences, Kaunas, Lithuania
- Nora Ulozaitė-Stanienė
- Department of Otorhinolaryngology, Lithuanian University of Health Sciences, Kaunas, Lithuania
- Tadas Petrauskas
- Department of Otorhinolaryngology, Lithuanian University of Health Sciences, Kaunas, Lithuania
- Kipras Pribuišis
- Department of Otorhinolaryngology, Lithuanian University of Health Sciences, Kaunas, Lithuania
- Ingrida Ulozienė
- Department of Otorhinolaryngology, Lithuanian University of Health Sciences, Kaunas, Lithuania
- Tomas Blažauskas
- Faculty of Informatics, Kaunas University of Technology, Kaunas, Lithuania
- Rytis Maskeliūnas
- Faculty of Informatics, Kaunas University of Technology, Kaunas, Lithuania
6
Barsties V Latoszek B, Englert M, Lucero JC, Behlau M. The Performance of the Acoustic Voice Quality Index and Acoustic Breathiness Index in Synthesized Voices. J Voice 2023; 37:804.e21-804.e28. [PMID: 34218968 DOI: 10.1016/j.jvoice.2021.05.005] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 04/10/2021] [Revised: 05/07/2021] [Accepted: 05/11/2021] [Indexed: 11/25/2022]
Abstract
OBJECTIVE The aim of the present study was to investigate the performance of the Acoustic Voice Quality Index (AVQI) and the Acoustic Breathiness Index (ABI) in synthesized voice samples. METHOD The validity of the AVQI and ABI was analyzed in synthesized voice samples with controlled degrees of predefined deviation in overall voice quality (G-scale) and breathiness (B-scale). A set of 26 synthesized voice samples with various severity degrees on the G-scale, with and without prominent breathiness, was created for male and female voices. RESULTS ABI showed higher validity than AVQI in the evaluation of breathiness. Furthermore, ABI accurately evaluated breathiness degrees without being affected by roughness in the voice samples, confirming the findings of other studies with natural voices. ABI was also more robust than AVQI in the evaluation of severely disordered voice samples. Finally, AVQI mainly represented overall voice quality, with more emphasis on breathiness than on roughness, although roughness is a necessary component of overall voice quality evaluation. CONCLUSION AVQI and ABI are two robust measurements in the evaluation of voice quality. However, ABI produced fewer errors than AVQI in the analysis of higher abnormalities in the voice signal. Disturbances from other subtypes of abnormal overall voice quality, such as roughness, were not reflected in the ABI results.
Affiliation(s)
- Ben Barsties V Latoszek
- Speech-Language Pathology, SRH University of Applied Health Sciences, Düsseldorf, Germany; Department of Phoniatrics and Pediatric Audiology, University Hospital Münster, University of Münster, Münster, Germany.
- Marina Englert
- Human Communication Disorders, Universidade Federal de São Paulo - UNIFESP, São Paulo, São Paulo, Brazil; Centro de Estudos da Voz - CEV, São Paulo, SP, Brazil
- Jorge C Lucero
- Department of Computer Science, Universidade de Brasília - UnB, Brasília, Federal District, Brazil
- Mara Behlau
- Human Communication Disorders, Universidade Federal de São Paulo - UNIFESP, São Paulo, São Paulo, Brazil; Centro de Estudos da Voz - CEV, São Paulo, SP, Brazil
7
Gierlich J, Barsties V Latoszek B. Test-Retest Reliability of the Acoustic Voice Quality Index and the Acoustic Breathiness Index. J Voice 2023:S0892-1997(23)00223-0. [PMID: 37596099 DOI: 10.1016/j.jvoice.2023.07.013] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Received: 05/28/2023] [Revised: 07/16/2023] [Accepted: 07/17/2023] [Indexed: 08/20/2023]
Abstract
OBJECTIVE The acoustic voice quality index (AVQI) and the acoustic breathiness index (ABI) are two validated measurements for the objective acoustic assessment of voice quality. The aim of this study was to evaluate the relative and absolute reliability of AVQI and ABI and to detect the natural variability of vocally healthy voices. METHODS In total, 39 vocally healthy participants received weekly AVQI and ABI measurements using the freeware VOXplot over a period of 3 weeks. The study used a within-subject design to assess significant differences between the test and retest measurements (test and mean of two retests) of AVQI and ABI with the paired sample t test and to investigate the reliability of these two acoustic parameters with the intraclass correlation coefficient (ICC), Bland-Altman plot, and the minimal detectable change (MDC). RESULTS No significant differences between the test-retest measurements of AVQI and ABI were revealed (all P-values > 0.05). The relative reliability of AVQI and ABI showed good results (ICC = 0.81 and ICC = 0.92, respectively). Absolute reliability measured with the MDC yielded a test-retest value of 0.78 and 0.66 for the AVQI and ABI, respectively. Deviations within this range may be due to natural variations of the voice (random bias). CONCLUSION AVQI and ABI showed high reliability in test-retest measurements of vocally healthy participants.
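The abstract reports ICC and MDC values but not how the MDC was derived. A minimal sketch of one widely used formulation follows: the standard error of measurement is SEM = SD x sqrt(1 - ICC), and the 95% minimal detectable change is MDC95 = 1.96 x sqrt(2) x SEM. Whether the authors used exactly this formulation is an assumption, and the standard deviation passed in below is hypothetical because the abstract does not report one.

```python
import math


def standard_error_of_measurement(sd, icc):
    """SEM from the between-subject SD and the ICC (a common formulation)."""
    return sd * math.sqrt(1.0 - icc)


def minimal_detectable_change_95(sd, icc):
    """MDC at the 95% confidence level: 1.96 * sqrt(2) * SEM."""
    return 1.96 * math.sqrt(2.0) * standard_error_of_measurement(sd, icc)


# Hypothetical SD of 1.0 index points with an ICC of 0.75 (illustration only).
print(minimal_detectable_change_95(sd=1.0, icc=0.75))
```

Under this formulation a higher ICC shrinks the SEM, so a smaller change between two sessions can be distinguished from measurement noise.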
8
Uloza V, Ulozaitė-Stanienė N, Petrauskas T, Pribuišis K, Blažauskas T, Damaševičius R, Maskeliūnas R. Reliability of Universal-Platform-Based Voice Screen Application in AVQI Measurements Captured with Different Smartphones. J Clin Med 2023; 12:4119. [PMID: 37373811 DOI: 10.3390/jcm12124119] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/05/2023] [Revised: 06/15/2023] [Accepted: 06/16/2023] [Indexed: 06/29/2023] Open
Abstract
The aim of the study was to develop a universal-platform-based (UPB) application suitable for different smartphones for estimation of the Acoustic Voice Quality Index (AVQI) and to evaluate its reliability in AVQI measurements and in differentiating normal and pathological voices. The study group consisted of 135 adult individuals, including 49 with normal voices and 86 patients with pathological voices. The developed UPB "Voice Screen" application, installed on five iOS and Android smartphones, was used for AVQI estimation. The AVQI measures calculated from voice recordings obtained from a reference studio microphone were compared with AVQI results obtained using smartphones. The diagnostic accuracy of differentiating normal and pathological voices was evaluated by applying receiver-operating characteristics. One-way ANOVA did not detect statistically significant differences between mean AVQI scores obtained using a studio microphone and different smartphones (F = 0.759; p = 0.58). Almost perfect direct linear correlations (r = 0.987-0.991) were observed between the AVQI results obtained with a studio microphone and different smartphones. The AVQI showed an acceptable level of precision in discriminating between normal and pathological voices, with areas under the curve (AUC) of 0.834-0.862. There were no statistically significant differences between the AUCs (p > 0.05) obtained from the studio and smartphone microphones; the largest difference between the AUCs was only 0.028. The UPB "Voice Screen" application represented an accurate and robust tool for voice quality measurements and normal vs. pathological voice screening purposes, demonstrating the potential to be used by patients and clinicians for voice assessment, employing both iOS and Android smartphones.
Affiliation(s)
- Virgilijus Uloza
- Department of Otorhinolaryngology, Lithuanian University of Health Sciences, 50061 Kaunas, Lithuania
- Nora Ulozaitė-Stanienė
- Department of Otorhinolaryngology, Lithuanian University of Health Sciences, 50061 Kaunas, Lithuania
- Tadas Petrauskas
- Department of Otorhinolaryngology, Lithuanian University of Health Sciences, 50061 Kaunas, Lithuania
- Kipras Pribuišis
- Department of Otorhinolaryngology, Lithuanian University of Health Sciences, 50061 Kaunas, Lithuania
- Tomas Blažauskas
- Faculty of Informatics, Kaunas University of Technology, 51368 Kaunas, Lithuania
- Rytis Maskeliūnas
- Faculty of Informatics, Kaunas University of Technology, 51368 Kaunas, Lithuania
9
An iOS-based VoiceScreen application: feasibility for use in clinical settings-a pilot study. Eur Arch Otorhinolaryngol 2023; 280:277-284. [PMID: 35906420 PMCID: PMC9811036 DOI: 10.1007/s00405-022-07546-w] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Received: 04/04/2022] [Accepted: 07/06/2022] [Indexed: 01/07/2023]
Abstract
OBJECTIVES To develop a smartphone application for estimating the Acoustic Voice Quality Index (AVQI) and to evaluate its usability in the clinical setting. METHODS AVQI automatization and background noise monitoring functions were implemented in a mobile "VoiceScreen" application running on the iOS operating system. The study group consisted of 103 adults: 30 individuals with normal voices and 73 patients with pathological voices. Voice recordings were performed in the clinical setting with the "VoiceScreen" app using iPhone 8 microphones. The voices of 30 patients were recorded before and 1 month after phonosurgical intervention. To evaluate the diagnostic accuracy in differentiating normal and pathological voices, receiver-operating characteristic statistics, i.e., area under the curve (AUC), sensitivity and specificity, and correct classification rate (CCR), were used. RESULTS The AVQI showed a high level of precision in discriminating between normal and dysphonic voices, with a corresponding AUC of 0.937. The AVQI cutoff score of 3.4 demonstrated a sensitivity of 86.3% and specificity of 95.6%, with a CCR of 89.2%. The preoperative mean AVQI of 6.01 (SD 2.39) in the post-phonosurgical follow-up group decreased to 2.00 (SD 1.08). No statistically significant difference (p = 0.216) was revealed between AVQI measurements in the normal voice group and the 1-month post-phonosurgery follow-up group. CONCLUSIONS The "VoiceScreen" app represents an accurate and robust tool for voice quality measurement and demonstrates the potential to be used in clinical settings as a sensitive measure of voice changes across phonosurgical treatment outcomes.
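Sensitivity, specificity, and correct classification rate (CCR) all come from the same four counts once a cutoff is fixed. The sketch below shows those definitions. The function name and the counts are hypothetical and are not the study's confusion matrix, which the abstract does not report.

```python
def screening_metrics(true_pos, false_neg, true_neg, false_pos):
    """Return (sensitivity, specificity, CCR) from confusion-matrix counts.

    sensitivity = pathological voices correctly flagged / all pathological voices
    specificity = normal voices correctly passed / all normal voices
    CCR         = all correct decisions / all participants
    """
    sensitivity = true_pos / (true_pos + false_neg)
    specificity = true_neg / (true_neg + false_pos)
    total = true_pos + false_neg + true_neg + false_pos
    ccr = (true_pos + true_neg) / total
    return sensitivity, specificity, ccr


# Hypothetical counts (illustration only): 100 pathological and 50 normal voices.
sens, spec, ccr = screening_metrics(true_pos=90, false_neg=10, true_neg=45, false_pos=5)
print(sens, spec, ccr)
```

Because the CCR pools both groups, it is pulled toward the metric of the larger group when the groups differ in size.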
10
The effect of type 1 diabetes mellitus on voice in pediatric patients. Eur Arch Otorhinolaryngol 2023; 280:269-275. [PMID: 35829805 DOI: 10.1007/s00405-022-07543-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/12/2022] [Accepted: 07/04/2022] [Indexed: 01/12/2023]
Abstract
PURPOSE The aim of this prospective case-control study was to determine the effect of type 1 diabetes mellitus (T1DM) on the voice in paediatric patients. MATERIALS AND METHODS The study included patients aged 6-18 years who had been followed up for at least 1 year because of T1DM, and a control group of age- and gender-matched healthy volunteers. Following an Ear, Nose, and Throat (ENT) examination, all subjects underwent flexible endoscopic laryngeal examination. Fasting venous blood samples were taken in the morning for the examination of fasting blood glucose (FBG), HgbA1c, and kidney, liver, and thyroid function tests. Age, gender, comorbidities, and the development of diabetes-related complications were recorded from the patient files. Voice recordings were taken and the Voice Handicap Index (VHI)-10 form was completed. The patient and control groups were compared in respect of fundamental frequency, jitter, shimmer, and acoustic voice quality index (AVQI). RESULTS A total of 64 children and adolescents were evaluated: 32 in the patient group (Group 1) and 32 healthy control subjects (Group 2). Group 1 comprised 17 females and 15 males with a mean age of 12.75 ± 3.23 years. Group 2 comprised 17 females and 15 males with a mean age of 12.75 ± 3.33 years. In Group 1, mean disease duration was 5.21 ± 3.17 years (range, 1-13 years), mean FBG was 216.6 ± 122.3 mg/dl, and mean HgbA1c was 10.7 ± 2.8 (≤ 7 in 4 patients, 7-9 in 4, and > 9 in 24). Maximum phonation time (MPT) was 10.66 ± 3.6 s in Group 1 and 12.11 ± 4.43 s in Group 2. VHI was 2.33 ± 3 in Group 1 and 2.31 ± 2.77 in Group 2. No statistically significant difference was found between the groups in respect of acoustic analysis, perturbation parameters, AVQI, or body mass index. CONCLUSIONS This study is the first to have investigated the effects of T1DM on the voice in paediatric patients. The results showed that the AVQI value was higher in the patient group, but not to a statistically significant level. Therefore, there is a need for further studies with larger samples. The current study can be of guidance for further studies in this field.
11
Bhatt SS, Kabra S, Chatterjee I. A Comparative Study on Acoustic Voice Quality Index Between the Subjects with Spasmodic Dysphonia and Normophonia. Indian J Otolaryngol Head Neck Surg 2022; 74:4927-4932. [PMID: 36742688 PMCID: PMC9895368 DOI: 10.1007/s12070-021-02448-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 11/30/2020] [Accepted: 02/05/2021] [Indexed: 02/07/2023] Open
Abstract
Spasmodic dysphonia is a rare, chronic neurologic voice disorder caused by excessive or inappropriate contraction of the laryngeal muscles. The condition is more common in females than in males. Assessment of such disorders requires acoustic, perceptual, instrumental, and self-reported rating scales for effective diagnosis and intervention. The Acoustic Voice Quality Index (AVQI) is a recent multivariate acoustic measure of dysphonia that quantifies overall voice quality from both vowels and connected speech, computed in Praat software using the AVQI script. The study aimed to compare, observe, and analyse the AVQI thresholds obtained in the two groups. The study also aimed to show a correlation between the GRBAS scale and AVQI thresholds using the sustained vowel /α/ and a Bengali oral passage. A total of 14 participants aged 20-50 years were chosen; 7 participants were normophonic and 7 had spasmodic dysphonia. Comparison between the 2 groups showed a significant difference (p < 0.05). A strong correlation (α = 0.98) was obtained between the two scales. Thus, the AVQI serves as an appropriate diagnostic tool for scoring the dysphonic population and needs to be explored in other voice problems.
Affiliation(s)
- Shubhangi Shree Bhatt
- Department of Speech Language and Hearing Sciences, Ali Yavar Jung National Institute of Speech and Hearing (RC), Bonhoogly BT Road, Kolkata, India
- Shruti Kabra
- Department of Speech Language and Hearing Sciences, Ali Yavar Jung National Institute of Speech and Hearing (RC), Bonhoogly BT Road, Kolkata, India
- Indranil Chatterjee
- Department of Speech Language and Hearing Sciences, Ali Yavar Jung National Institute of Speech and Hearing (RC), Bonhoogly BT Road, Kolkata, India
12
Shabnam S, Pushpavathi M. Effect of Gender on Acoustic Voice Quality Index 02.03 and Dysphonia Severity Index in Indian Normophonic Adults. Indian J Otolaryngol Head Neck Surg 2022; 74:5052-5059. [PMID: 36742738 PMCID: PMC9895247 DOI: 10.1007/s12070-021-02712-8] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Received: 03/10/2021] [Accepted: 06/20/2021] [Indexed: 02/07/2023] Open
Abstract
The Acoustic Voice Quality Index (AVQI) and Dysphonia Severity Index (DSI) are the two most widely used multiparametric objective measures that can successfully quantify voice quality. Latoszek et al. (2019) reported that gender had no influence on either AVQI or DSI in the Dutch population. However, there is a lack of research investigating the gender effect on DSI and AVQI, and the relationship between DSI and AVQI, in the Indian adult population. The present study aimed to investigate the effect of gender on DSI and AVQI and to explore the relation between the two in the Indian adult population. A total of 138 normophonic individuals (74 females and 64 males) participated. Phonation of /a/ and reading samples were used as stimuli for extracting the AVQI, analysed using AVQI script version 02.03. The raw values of MPT, highest frequency, lowest intensity, and Jitter% were obtained from the VRP module and MDVP module of the Computerized Speech Lab to calculate the DSI. The results revealed that gender has a significant effect on CPPs, HNR, TiltLTAS, ShimLocal, ShimdB, MPT, F0-High, and Jitter%, while the overall AVQI, DSI, SlopeLTAS, and I-Low were independent of gender. To conclude, the present study provides reference data for AVQI v.02.03 and DSI for healthy Indian adults and discusses the influence of gender on AVQI, DSI, and their constituent parameters.
Affiliation(s)
- Srushti Shabnam
- Department of Speech Language Pathology, All India Institute of Speech and Hearing, University of Mysore, Manasagnagothri, Mysuru, Karnataka India
- M. Pushpavathi
- Department of Speech Language Pathology, All India Institute of Speech and Hearing, University of Mysore, Manasagnagothri, Mysuru, Karnataka India
13
Zainaee S, Khadivi E, Jamali J, Sobhani-Rad D, Maryn Y, Ghaemi H. The acoustic voice quality index, version 2.06 and 3.01, for the Persian-speaking population. JOURNAL OF COMMUNICATION DISORDERS 2022; 100:106279. [PMID: 36399989 DOI: 10.1016/j.jcomdis.2022.106279] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/31/2022] [Revised: 11/05/2022] [Accepted: 11/07/2022] [Indexed: 06/16/2023]
Abstract
INTRODUCTION Dysphonia assessment includes approaches such as acoustic analysis, which is non-invasive, easy to use, and provides an understandable numerical output. The Acoustic Voice Quality Index (AVQI) is an acoustic model that can detect dysphonia. Persian is spoken by around 70,000,000 native speakers. Since AVQI versions 2.06 and 3.01 have not yet been validated for Persian, this study investigated their concurrent validity and diagnostic accuracy in the Persian-speaking population. METHODS This scale development study was conducted from 2020 to 2021 on 180 normophonic and dysphonic native Persian-speaking residents of Mashhad, Iran. To investigate the concurrent validity of both versions, five raters judged the samples auditory-perceptually, rating Grade from the Grade-Rough-Breathy-Asthenic-Strained scale (an ordinal scale) and overall dysphonia severity from the Persian version of the Consensus Auditory-Perceptual Evaluation of Voice (a continuous scale). The intra- and inter-rater reliability and concurrent validity were evaluated for both scales. The diagnostic accuracy of both versions was assessed with receiver operating characteristic analysis, and the optimal thresholds were determined. RESULTS AVQI-version-2-Persian thresholds of 3.47 and 4.04 provided sensitivity of 88.30% and 85.53% and specificity of 79.07% and 85.58% against the ordinal and continuous scales, respectively. AVQI-version-3-Persian thresholds of 3.07 and 3.03 yielded sensitivity of 74.47% and 85.53% and specificity of 97.67% and 91.35% against the ordinal and continuous scales, respectively. CONCLUSION The significant concurrent validity and diagnostic accuracy of both versions of AVQI-Persian confirmed that the index can discriminate between normal and pathological voices in the Persian-speaking population. Hence, it can be used for screening or diagnostic purposes.
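The abstract reports sensitivity and specificity at "optimal" AVQI thresholds without naming the selection criterion. The sketch below shows one common choice, Youden's J (sensitivity + specificity - 1), as an assumption; the function names, the rule that a score at or above the threshold counts as dysphonic, and the toy data are all hypothetical.

```python
def sens_spec(scores, is_dysphonic, threshold):
    """Sensitivity and specificity when score >= threshold is called dysphonic."""
    pairs = list(zip(scores, is_dysphonic))
    tp = sum(1 for s, d in pairs if d and s >= threshold)
    fn = sum(1 for s, d in pairs if d and s < threshold)
    tn = sum(1 for s, d in pairs if not d and s < threshold)
    fp = sum(1 for s, d in pairs if not d and s >= threshold)
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one dysphonic and one normophonic sample")
    return tp / (tp + fn), tn / (tn + fp)


def youden_threshold(scores, is_dysphonic):
    """Observed score that maximizes Youden's J; ties resolve to the lowest score."""
    def j(threshold):
        sens, spec = sens_spec(scores, is_dysphonic, threshold)
        return sens + spec - 1.0
    return max(sorted(set(scores)), key=j)


# Toy AVQI-like scores, for illustration only.
scores = [1.2, 2.0, 2.8, 3.1, 3.6, 4.4, 5.0, 2.5]
labels = [False, False, False, False, True, True, True, True]
cutoff = youden_threshold(scores, labels)
print(cutoff, sens_spec(scores, labels, cutoff))
```

Other criteria (for example, the point closest to the top-left corner of the ROC curve, or a likelihood-ratio target) can give different cutoffs on the same data, so thresholds from different studies are only comparable when the criterion is known.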
Affiliation(s)
- Shahryar Zainaee
  - Department of Speech Therapy, School of Paramedical Sciences, Mashhad University of Medical Sciences, Mashhad, Iran
- Ehsan Khadivi
  - Sinus and Surgical Endoscopic Research Center, Faculty of Medicine, Mashhad University of Medical Sciences, Mashhad, Iran
- Jamshid Jamali
  - Department of Biostatistics, School of Health, Mashhad University of Medical Sciences, Mashhad, Iran
- Davood Sobhani-Rad
  - Department of Speech Therapy, School of Paramedical Sciences, Mashhad University of Medical Sciences, Mashhad, Iran
- Youri Maryn
  - Department of Speech, Language and Hearing Sciences, Faculty of Medicine and Health Sciences, University of Ghent, Ghent, Belgium
- Hamide Ghaemi
  - Department of Speech Therapy, School of Paramedical Sciences, Mashhad University of Medical Sciences, Mashhad, Iran
14
Pommée T, Morsomme D. Voice Quality in Telephone Interviews: A Preliminary Acoustic Investigation. J Voice 2022:S0892-1997(22)00268-5. [PMID: 36192289 DOI: 10.1016/j.jvoice.2022.08.027] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/20/2022] [Revised: 08/24/2022] [Accepted: 08/25/2022] [Indexed: 10/07/2022]
Abstract
OBJECTIVES To investigate the impact of standardized mobile phone recordings passed through a telecom channel on acoustic markers of voice quality and on its perception by voice experts in normophonic speakers. METHODS Continuous speech and a sustained vowel were recorded for fourteen female and ten male normophonic speakers. The recordings were done simultaneously with a head-mounted high-quality microphone and through the telephone network on a receiving smartphone. Twenty-two acoustic voice quality, breathiness and pitch-related measures were extracted from the recordings. Nine vocologists perceptually rated the G, R and B parameters of the GRBAS scale on each voice sample. The reproducibility, the recording type, the stimulus type and the gender effects, as well as the correlation between acoustic and perceptual measures were investigated. RESULTS The sustained vowel samples are damped after one second. Only the frequencies between 100 and 3700Hz are passed through the telecom channel and the frequency response is characterized by peaks and troughs. The acoustic measures show a good reproducibility over the three repetitions. All measures significantly differ between the recording types, except for the local jitter, the harmonics-to-noise ratio by Dejonckere and Lebacq, the period standard deviation and all six pitch measures. The AVQI score is higher in telephone recordings, while the ABI score is lower. Significant differences between genders are also found for most of the measures; while the AVQI is similar in men and women, the ABI is higher in women in both recording types. For the perceptual assessment, the interrater agreement is rather low, while the reproducibility over the three repetitions is good. Few significant differences between recording types are observed, except for lower breathiness ratings on telephone recordings. G ratings are significantly more severe on the sustained vowel on both recording types, R ratings only on telephone recordings. 
While roughness is rated higher in men on telephone recordings by most experts, no gender effect is observed for breathiness on either recording type. Finally, neither the AVQI nor the ABI yields strong correlations with any of the perceptual parameters. CONCLUSIONS Our results show that passing a voice signal through a telecom channel induces filter and noise effects that limit the use of common acoustic voice quality measures and indexes. The AVQI and ABI are both significantly impacted by the recording type. The most reliable acoustic measures seem to be pitch perturbation (local jitter and period standard deviation) as well as the harmonics-to-noise ratio from Dejonckere and Lebacq. Our results also underline that raters are not equally sensitive to the various factors, including the recording type, the stimulus type and the gender effects. None of the three perceptual parameters G, R and B seems to be reliably measurable on telephone recordings using the two investigated acoustic indexes. Future studies investigating the impact of voice quality in telephone conversations should thus focus on acoustic measures on continuous speech samples that are limited to the frequency response of the telecom channel and that are not too sensitive to environmental and additive noise.
Affiliation(s)
- Timothy Pommée
  - Research Unit for a life-Course perspective on Health and Education, Voice Unit, University of Liège, Belgium
- Dominique Morsomme
  - Research Unit for a life-Course perspective on Health and Education, Voice Unit, University of Liège, Belgium
15
Walden PR, Rau S. Individual Voice Dimensions' Prediction of Overall Dysphonia Severity on Two Auditory-Perceptual Scales. Journal of Speech, Language, and Hearing Research: JSLHR 2022; 65:2759-2777. [PMID: 35868295 DOI: 10.1044/2022_jslhr-21-00689] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Indexed: 06/15/2023]
Abstract
BACKGROUND Auditory-perceptual evaluation of dysphonic voice is an essential clinical activity that characterizes the nature of dysphonia and aids in planning its clinical management. Although there are multidimensional acoustic measures that correlate well with overall severity ratings, they tend to include measures that have only small or moderate correlations with individual voice characteristics frequently perceptually measured (e.g., breathiness or roughness). Given this difference between perceptual and acoustic measures, it is unclear how much individual voice characteristics contribute to a listener's perception of overall severity of dysphonia. PURPOSE The purpose of this study was to explore individual voice characteristics' relative contribution to the rating of overall dysphonia severity and to explore sex-related differences. METHOD Two hundred ninety-six voice samples were accessed from the Perceptual Voice Qualities Database. Roughness, breathiness, asthenia, strain, pitch, and loudness ratings from the Grade, Roughness, Breathiness, Asthenia, Strain and Consensus Auditory-Perceptual Evaluation of Voice scales were used to predict overall voice quality severity in linear regression with bootstrapped coefficients. RESULTS Roughness, breathiness, and strain were the strongest predictors of overall severity. Asthenia and, to a lesser extent, pitch were also significant predictors of overall severity. Loudness was not a significant predictor. There were several sex-related differences noted, as well as differences related to the scale used. CONCLUSIONS Breathiness, roughness, and strain were all important predictors of overall severity for all regressions. Clinicians should be aware of scale-related differences if they are using auditory-perceptual measures to choose voice therapy targets. Analyses accounting for perceptual strategy differences were recommended for future studies.
Affiliation(s)
- Sydney Rau
  - Department of Communication Sciences and Disorders, St. John's University, Queens, NY
16
魏 梅, 杜 建, 耿 磊, 王 巍. [Detection of speech pathology based on parameters of analysis of dysphonia in speech and voice]. Lin Chuang Er Bi Yan Hou Tou Jing Wai Ke Za Zhi = Journal of Clinical Otorhinolaryngology, Head, and Neck Surgery 2022; 36:492-496. [PMID: 35822373 PMCID: PMC10128384 DOI: 10.13201/j.issn.2096-7993.2022.07.002] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 04/06/2022] [Indexed: 06/15/2023]
Abstract
Objective: To analyze speech pathology using Analysis of Dysphonia in Speech and Voice (ADSV). Methods: Acoustic signals of sustained vowels and continuous speech were collected from 113 patients (93 with vocal cord polyps and 20 with glottic laryngeal carcinoma) and 47 volunteers without speech sound disorders. Cepstral peak prominence (CPP), CPP standard deviation (CPP SD), L/H spectral ratio (L/H ratio), L/H ratio standard deviation (L/H ratio SD) and the cepstral/spectral index of dysphonia (CSID) were analyzed with ADSV to explore the role of these parameters in recognizing speech pathology. Results: In the sustained vowel signals, CPP and L/H ratio were higher in the normal group than in the pathological voice group (P<0.001), while CPP SD and CSID were lower (P<0.001). The areas under the ROC curve for CPP and CSID were 0.95 and 0.99, respectively, making them important acoustic parameters for diagnosing pathological voice. In the continuous speech signals, CPP, CPP SD and L/H ratio were all higher in the normal group than in the pathological voice group (P<0.001), and the area under the curve for CPP SD was 0.90, indicating high accuracy in diagnosing pathological voice. The ADSV parameters CPP, CPP SD, CSID, and L/H ratio also differed significantly between the vocal cord polyp group and the glottic laryngeal cancer group. The discriminant analysis model showed that ADSV voice parameters can distinguish vocal cord polyps from laryngeal cancer. Conclusion: The ADSV voice analysis parameters can distinguish not only the voice signals of the normal and pathological groups but also different types of pathological voice. ADSV has high sensitivity and specificity in diagnosing pathological voices.
Affiliation(s)
- 梅 魏
  - Department of Otorhinolaryngology Head and Neck Surgery, Tianjin First Central Hospital, Institute of Otolaryngology of Tianjin, China Key Laboratory of Auditory Speech and Balance Medicine, Key Medical Discipline of Tianjin [Otolaryngology], China Quality Control Centre of Otolaryngology, Tianjin, 300192, China
- 建群 杜
  - Department of Otorhinolaryngology Head and Neck Surgery, Tianjin First Central Hospital, Institute of Otolaryngology of Tianjin, China Key Laboratory of Auditory Speech and Balance Medicine, Key Medical Discipline of Tianjin [Otolaryngology], China Quality Control Centre of Otolaryngology, Tianjin, 300192, China
- 磊 耿
  - School of Life Sciences, Tianjin University of Technology, Tianjin Key Laboratory of Photoelectric Detection Technology and System
- 巍 王
  - Department of Otorhinolaryngology Head and Neck Surgery, Tianjin First Central Hospital, Institute of Otolaryngology of Tianjin, China Key Laboratory of Auditory Speech and Balance Medicine, Key Medical Discipline of Tianjin [Otolaryngology], China Quality Control Centre of Otolaryngology, Tianjin, 300192, China
17
Batthyany C, Latoszek BBV, Maryn Y. Meta-Analysis on the Validity of the Acoustic Voice Quality Index. J Voice 2022:S0892-1997(22)00132-1. [PMID: 35752532 DOI: 10.1016/j.jvoice.2022.04.022] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Received: 01/20/2022] [Revised: 04/27/2022] [Accepted: 04/27/2022] [Indexed: 02/01/2023]
Abstract
BACKGROUND Acoustic measurements are useful tools to objectively measure overall voice quality. The Acoustic Voice Quality Index (AVQI) has shown to be a valid multiparametric tool to objectify dysphonia severity. The increasing number of validity studies investigating AVQI's validity demands a comprehensive synthesis of the available outcomes. OBJECTIVE OF REVIEW The aim of the present meta-analysis is to quantify the evidence for the diagnostic accuracy of the AVQI, including its sensitivity, specificity and likelihood ratio statistics, and its concurrent validity and sensitivity to changes in auditory-perceptual voice quality ratings. TYPE OF REVIEW Meta-analysis SEARCH STRATEGY: MEDLINE, EMBASE, the Cochrane library and Web of Science were searched from 2010 till April 2021 with an additional manual search, using keywords related to AVQI and common terminologies of validity outcomes. Studies considering the clinical validity of AVQI (ie, diagnostic accuracy, concurrent validity and sensitivity to change), using auditory-perceptual voice quality evaluation as reference, were included. EVALUATION METHOD The Preferred Reporting Items for Systematic reviews and Meta-Analyses of Diagnostic Test Accuracy Studies (PRISMA-DTA) guidelines were used. Quality assessment of included studies was conducted using the QUADAS-2 tool. For the diagnostic accuracy of AVQI, the pooled sensitivity, specificity and likelihood ratio statistics were determined using a summary receiver operating characteristic approach. Weighted correlation coefficient measures (rW¯) were used to assess the concurrent validity and sensitivity to change. RESULTS A total of 198 studies were screened and 33 articles were included. In total, voice samples of 11447, 10272, and 367 different subjects were considered for analysis of diagnostic accuracy, concurrent validity and change responsiveness, respectively. 
Satisfying diagnostic accuracy results were found with a pooled sensitivity of 0.83 (95% CI: 0.82-0.83), a pooled specificity of 0.89 (95% CI: 0.88-0.90), a pooled positive LR of 7.75 (95% CI: 6.04-9.95), a pooled negative LR of 0.20 (95% CI: 0.16-0.23), and a pooled diagnostic odds ratio of 47.13 (95% CI: 34.82-63.79). Summary receiver operating characteristic curve analysis showed an excellent AUC value of 0.937 and Q* index of 0.874. Strong correlations of rW¯ = 0.838 for concurrent validity and rW¯ = 0.796 for sensitivity to change were found. CONCLUSIONS Our results confirm the general clinical utility of the AVQI as a robust and valid objective measure for evaluating overall dysphonia severity across languages and study methods.
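The likelihood ratios and diagnostic odds ratio reported above are all functions of sensitivity and specificity. The following sketch applies the standard definitions to the pooled sensitivity and specificity from the abstract; the function name is hypothetical. In a meta-analysis each statistic is pooled separately across studies, so the values derived here are not expected to reproduce the pooled LRs and DOR exactly.

```python
def likelihood_ratios(sensitivity, specificity):
    """Standard single-table definitions.

    LR+ = sens / (1 - spec)
    LR- = (1 - sens) / spec
    DOR = LR+ / LR-
    """
    lr_pos = sensitivity / (1.0 - specificity)
    lr_neg = (1.0 - sensitivity) / specificity
    dor = lr_pos / lr_neg
    return lr_pos, lr_neg, dor


# Pooled sensitivity and specificity as reported in the abstract.
lr_pos, lr_neg, dor = likelihood_ratios(0.83, 0.89)
print(f"LR+ = {lr_pos:.2f}, LR- = {lr_neg:.2f}, DOR = {dor:.1f}")
```

The derived values fall close to the pooled LR+ and LR- in the abstract, while the derived DOR is noticeably lower than the pooled DOR, which illustrates why pooled statistics should be read from the meta-analysis itself and not recomputed from pooled sensitivity and specificity.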
Affiliation(s)
- Christina Batthyany
  - GZA Sint-Augustinus, Department of Otorhinolaryngology and Head & Neck Surgery, European Institute of ORL-HNS, Antwerp, Belgium
- Ben Barsties V Latoszek
  - SRH University of Applied Health Sciences, Speech-Language Pathology, Düsseldorf, Germany; University of Münster, University Hospital Münster, Department of Phoniatrics and Pediatric Audiology, Münster, Germany
- Youri Maryn
  - GZA Sint-Augustinus, Department of Otorhinolaryngology and Head & Neck Surgery, European Institute of ORL-HNS, Antwerp, Belgium; Ghent University, Faculty of Medicine and Health Sciences, Department of Rehabilitation Sciences, Ghent, Belgium; University College Ghent, Department of Speech-Language Therapy and Audiology, Ghent, Belgium; Université Catholique de Louvain, Faculty of Psychology and Pedagogical Sciences, School of Logopedics, Ottignies-Louvain-La-Neuve, Belgium; Phonanium, Lokeren, Belgium
18
Jayakumar T, Benoy JJ. Validation of Acoustic Breathiness Index (ABI) in the South Indian Population. J Voice 2022:S0892-1997(22)00139-4. [PMID: 35690529 DOI: 10.1016/j.jvoice.2022.05.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/09/2022] [Revised: 05/10/2022] [Accepted: 05/10/2022] [Indexed: 11/20/2022]
Abstract
BACKGROUND Acoustic Breathiness Index (ABI) is an acoustic model which equates to the auditory-perceptual (A-P) measure of breathiness. Globally, ABI has been validated in multiple languages. To date, empirical studies validating ABI in the South Indian population have not been carried out. Moreover, there is only minimal evidence on the impact of language and gender on ABI. OBJECTIVES This study attempted to validate ABI in languages of Malayalam and Kannada with "B" and "G" of GRBAS. The authors also examined the influence of language and gender on ABI. METHODS The authors retrospectively analyzed a dataset consisting of 160 voice samples (91 normophonic and 69 dysphonic). Five speech-language pathologists perceptually evaluated these samples on the GRBAS scale. PRAAT software was used to calculate ABI for each voice sample. The authors also determined the effects of language and gender on ABI. Following this, ABI's concurrent validity and diagnostic accuracy were examined relating to "B" and "G" of the GRBAS scale. RESULTS The judges reached a moderate agreement on the perceptual evaluation of voice quality. The authors observed neither language nor gender differences in ABI. The concurrent validity of ABI was strong (r = 0.761) when correlated against "B". Further, evaluations revealed that an ABI value of >3.66 could distinguish between breathy and non-breathy voice samples with an area under the curve of 0.816 (Sensitivity = 61.73%; Specificity = 94.93%). On the other hand, the concurrent validity of ABI in correlation with "G" did not yield the optimal result. CONCLUSIONS ABI is a valid acoustic tool for identifying breathiness in the languages of Malayalam and Kannada. ABI correlates better with "B" than "G" on the GRBAS scale. ABI is independent of language and gender.
Affiliation(s)
- T Jayakumar
  - Associate Professor Speech Sciences, Department of Speech-Language Sciences, All India Institute of Speech and Hearing, University of Mysore, Mysuru, Karnataka, India
- Jesnu Jose Benoy
  - Junior Research Fellow, Department of Speech-Language Sciences, All India Institute of Speech and Hearing, University of Mysore, Mysuru, Karnataka, India
19
Jayakumar T, Benoy JJ. Acoustic Voice Quality Index (AVQI) in the Measurement of Voice Quality: A Systematic Review and Meta-Analysis. J Voice 2022:S0892-1997(22)00084-4. [PMID: 35461729 DOI: 10.1016/j.jvoice.2022.03.018] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Received: 01/07/2022] [Revised: 03/16/2022] [Accepted: 03/18/2022] [Indexed: 01/09/2023]
Abstract
INTRODUCTION Acoustic Voice Quality Index (AVQI) is a multiparametric construct of voice quality recognized for its clinical and research applications around the globe. This study aimed to review the validity and diagnostic accuracy of AVQI (v02&03) and determine the effects of age and gender. METHODS This is a systematic review and meta-analysis registered with the PROSPERO registry. The authors searched two databases (PubMed and Cochrane Library database) for relevant studies. Studies selected for the systematic review were grouped based on study objectives. To determine the quality of the selected studies, the authors utilized the QUADAS-2 tool. RESULTS Meta-analysis of seven studies on AVQIv02 revealed a diagnostic threshold ranging from 2.72 to 3.33 for AVQIv02. In comparison, eight studies investigating AVQIv03 suggested a diagnostic threshold ranging from 1.33 to 3.15 for AVQIv03. Altogether, these studies demonstrated a pooled sensitivity and specificity of 0.85 and 0.92 for AVQIv02 and 0.82 and 0.92 for AVQIv03. The Area under the Curve was slightly better for AVQIv03 (0.94) than AVQIv02 (0.92). Three studies investigating the effect of age and gender on AVQI had a consensus that AVQI is independent of gender. However, findings were contradictory about the impact of age on AVQI. CONCLUSIONS AVQI is found to be a valid tool for the assessment of voice quality. AVQIv03 is slightly better than AVQIv02 in its diagnostic accuracy. AVQI is independent of gender. Because of the contradictory evidence, additional research on the effects of age on AVQI is necessary.
Affiliation(s)
- Thirunavukkarasu Jayakumar
  - Department of Speech-Language Sciences, All India Institute of Speech and Hearing, University of Mysore, Mysuru, Karnataka, India
- Jesnu Jose Benoy
  - Department of Speech-Language Sciences, All India Institute of Speech and Hearing, University of Mysore, Mysuru, Karnataka, India
20
Sensitivity of Machine Learning Approaches to Fake and Untrusted Data in Healthcare Domain. Journal of Sensor and Actuator Networks 2022. [DOI: 10.3390/jsan11020021] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Indexed: 02/04/2023]
Abstract
Machine Learning models are susceptible to attacks, such as noise, privacy invasion, replay, false data injection, and evasion attacks, which affect their reliability and trustworthiness. Evasion attacks, performed to probe and identify potential ML-trained models’ vulnerabilities, and poisoning attacks, performed to obtain skewed models whose behavior could be driven when specific inputs are submitted, represent a severe and open issue to face in order to assure security and reliability to critical domains and systems that rely on ML-based or other AI solutions, such as healthcare and justice, for example. In this study, we aimed to perform a comprehensive analysis of the sensitivity of Artificial Intelligence approaches to corrupted data in order to evaluate their reliability and resilience. These systems need to be able to understand what is wrong, figure out how to overcome the resulting problems, and then leverage what they have learned to overcome those challenges and improve their robustness. The main research goal pursued was the evaluation of the sensitivity and responsiveness of Artificial Intelligence algorithms to poisoned signals by comparing several models solicited with both trusted and corrupted data. A case study from the healthcare domain was provided to support the pursued analyses. The results achieved with the experimental campaign were evaluated in terms of accuracy, specificity, sensitivity, F1-score, and ROC area.
21
Barsties V Latoszek B, Mathmann P, Neumann K. The cepstral spectral index of dysphonia, the acoustic voice quality index and the acoustic breathiness index as novel multiparametric indices for acoustic assessment of voice quality. Curr Opin Otolaryngol Head Neck Surg 2021; 29:451-457. [PMID: 34334615 DOI: 10.1097/moo.0000000000000743] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Indexed: 11/25/2022]
Abstract
PURPOSE OF REVIEW The objective assessment of voice quality using acoustic measures is an important pillar of voice diagnostics. This article reviews three recent acoustic measures and their clinical use in phoniatrics and laryngology. RECENT FINDINGS Two acoustic parameters, the cepstral spectral index of dysphonia (CSID) and the acoustic voice quality index (AVQI), have gained importance as validated multiparametric indices in the objective assessment of hoarseness because they include both continuous speech and sustained vowels. The acoustic breathiness index (ABI), another multiparametric index, assesses breathiness admixture during phonation and identifies it robustly, unaffected by other characteristics of dysphonia such as roughness. SUMMARY Acoustic measurements are useful diagnostic tools when used correctly with an appropriate recording system, consideration of environment and use of software programs. CSID, AVQI and ABI objectively improve the detection of voice quality abnormalities. In addition to their proven validity, their application is simple and their usability for clinicians is high.
Affiliation(s)
- Ben Barsties V Latoszek
  - Department of Phoniatrics and Pediatric Audiology, University Hospital Münster, University of Münster, Münster
  - Speech-Language Pathology, SRH University of Applied Health Sciences, Düsseldorf, Germany
- Philipp Mathmann
  - Department of Phoniatrics and Pediatric Audiology, University Hospital Münster, University of Münster, Münster
- Katrin Neumann
  - Department of Phoniatrics and Pediatric Audiology, University Hospital Münster, University of Münster, Münster
22
Demirci AN, Köse A, Aydinli FE, İncebay Ö, Yilmaz T. Investigating the cepstral acoustic characteristics of voice in healthy children. Int J Pediatr Otorhinolaryngol 2021; 148:110815. [PMID: 34217000 DOI: 10.1016/j.ijporl.2021.110815] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Received: 04/20/2021] [Revised: 06/16/2021] [Accepted: 06/24/2021] [Indexed: 10/21/2022]
Abstract
OBJECTIVES This study aimed to determine the cepstral acoustic parameters that vary with age and gender in vocally healthy children, and to establish normative data for cepstral analysis. BACKGROUND Cepstral measurements are among the strongest predictors of auditory-perceptual evaluation of voice and differentiate between healthy and dysphonic voices. More specifically, cepstral peak prominence is accepted as a strong acoustic predictor of breathiness and overall severity of dysphonia. Cepstral measures determine voice quality reliably not only in sustained vowel samples but also in running speech samples. Determining the parameters related to the acoustic profile of children with normal voices can lead to a better understanding of the effect of changes in the larynx and vocal fold structure during growth and development. Only a limited number of normative studies have examined the cepstral acoustic properties of the pediatric voice. Determining normative values and clinical guidelines for cepstral acoustics according to age and gender in vocally healthy children is of utmost importance. METHODS A total of 160 vocally healthy children were divided into four age groups: Group I, 4-7 years; Group II, 7-11 years; Group III, 11-14 years; and Group IV, 14-18 years. An equal number of male and female participants were assigned to each group. A PENTAX Medical CSL Model 4500 was used for recording all tasks. For acoustic analysis, the Multi-Dimensional Voice Program and Analysis of Dysphonia in Speech and Voice were used. RESULTS Cepstral Peak Prominence (CPP), Cepstral Peak Prominence Standard Deviation (CPP SD), and Low-to-High Spectral Ratio (L/H Ratio) increased with age. The CPP of all-voiced sentences and nasal-weighted sentences increased with age in boys, while no significant pattern was observed in any sample for girls.
For L/H ratio, it can be said that there is a general increase with age in all speech samples, except for the vowel-weighted and voiceless plosive sentence samples, evident especially in the group above the age of 15 years. This study concluded that the CPP SD parameter in the vowel-weighted sentences increased with age in boys. It was also noticed in this study that CPP F0 standard deviation (SD) intervals were narrower in vowel-weighted, easy onset, and voiceless plosive sentence samples than in all-voiced, hard glottal attack and nasal-weighted sentence samples. CONCLUSION This study established cepstral acoustic normative values for a wide age range of the pediatric population. It is thought that age and gender specific cepstral acoustic findings presented in this study contributed to the related literature. In addition, to our knowledge, this is the first study that provides a normative cepstral acoustic database of the CAPE-V/Turkish sentences in the pediatric population.
Affiliation(s)
- Ayşe Nur Demirci
  - Department of Speech and Language Therapy, Hacettepe University Faculty of Health Sciences, Hacettepe, Ankara, Turkey
- Ayşen Köse
  - Department of Speech and Language Therapy, Hacettepe University Faculty of Health Sciences, Hacettepe, Ankara, Turkey
- Fatma Esen Aydinli
  - Department of Speech and Language Therapy, Hacettepe University Faculty of Health Sciences, Hacettepe, Ankara, Turkey
- Önal İncebay
  - Department of Speech and Language Therapy, Hacettepe University Faculty of Health Sciences, Hacettepe, Ankara, Turkey
- Taner Yilmaz
  - Department of Otolaryngology-Head and Neck Surgery, Hacettepe University Faculty of Medicine, Hacettepe, Ankara, Turkey
23
Validation of Acoustic Voice Quality Index Version 3.01 and Acoustic Breathiness Index in Korean Population. J Voice 2021; 35:660.e9-660.e18. [DOI: 10.1016/j.jvoice.2019.10.005] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Received: 07/24/2019] [Revised: 10/08/2019] [Accepted: 10/10/2019] [Indexed: 11/21/2022]
24
Graf S, Kirschstein L, Knopf A, Mansour N, Jeleff-Wölfler O, Buchberger AMS, Hofauer B. Systematic evaluation of laryngeal impairment in Sjögren's syndrome. Eur Arch Otorhinolaryngol 2021; 278:2421-2428. [PMID: 33765165 PMCID: PMC8165056 DOI: 10.1007/s00405-021-06746-0] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Received: 12/29/2020] [Accepted: 03/10/2021] [Indexed: 12/30/2022]
Abstract
INTRODUCTION Sjögren's syndrome (SjS) causes malfunction of the salivary and lacrimal glands. Consequently, patients suffer from xerostomia and keratoconjunctivitis sicca. This can further affect voice and swallowing function, resulting in an impaired quality of life. The aim of this study is the systematic evaluation of the impact on voice- and swallowing-related quality of life in patients with SjS. MATERIAL AND METHODS SjS patients were classified according to the American-European Consensus Group (AECG) criteria; antibodies to Ro (SS-A) or La (SS-B) antigens were detected, and the ESSPRI was completed. We used the following quality of life questionnaires: EORTC QLQ H&N 35, Anderson Dysphagia Inventory (ADI) and Voice Handicap Index (VHI). Patients additionally received a detailed phoniatric examination (auditory perception, videostroboscopy, acoustic analysis, Dysphonia Severity Index (DSI), aerodynamic measurements). RESULTS Almost all of the 54 patients (96.3%) had a limited quality of life due to their swallowing problems, and 48% due to their voice problems. Both values correlated significantly with the degree of xerostomia. In the phoniatric examination, 77.8% had an increased DSI and two-thirds had abnormalities in videostroboscopy. CONCLUSIONS A reasonable impairment of quality of life in patients with SjS due to limitations in voice and swallowing function was observed. As SjS does not limit life expectancy, preservation of quality of life is important. Voice and swallowing problems should be looked for as potential reasons for quality of life impairment and, if diagnosed, treated accordingly.
Affiliation(s)
- S Graf
  - Otorhinolaryngology/Phoniatrics, Klinikum rechts der Isar, Technical University Munich, Ismaningerstr. 22, 81675, Munich, Germany
- L Kirschstein
  - Otorhinolaryngology/Phoniatrics, Klinikum rechts der Isar, Technical University Munich, Ismaningerstr. 22, 81675, Munich, Germany
- A Knopf
  - Otorhinolaryngology/Head and Neck Surgery, University Medical Center Freiburg, University of Freiburg, Freiburg im Breisgau, Germany
- N Mansour
  - Otorhinolaryngology/Head and Neck Surgery, University Medical Center Freiburg, University of Freiburg, Freiburg im Breisgau, Germany
- O Jeleff-Wölfler
  - Otorhinolaryngology/Phoniatrics, Klinikum rechts der Isar, Technical University Munich, Ismaningerstr. 22, 81675, Munich, Germany
- A M S Buchberger
  - Otorhinolaryngology/Phoniatrics, Klinikum rechts der Isar, Technical University Munich, Ismaningerstr. 22, 81675, Munich, Germany
- B Hofauer
  - Otorhinolaryngology/Phoniatrics, Klinikum rechts der Isar, Technical University Munich, Ismaningerstr. 22, 81675, Munich, Germany
|
25
|
Uloza V, Ulozaitė-Stanienė N, Petrauskas T, Kregždytė R. Accuracy of Acoustic Voice Quality Index Captured With a Smartphone - Measurements With Added Ambient Noise. J Voice 2021; 37:465.e19-465.e26. [PMID: 33676807 DOI: 10.1016/j.jvoice.2021.01.025] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 12/15/2020] [Revised: 01/22/2021] [Accepted: 01/26/2021] [Indexed: 11/27/2022]
Abstract
OBJECTIVE To evaluate the accuracy of Acoustic Voice Quality Index (AVQI) measures obtained from voice recordings made simultaneously with oral and smartphone microphones in a sound-proof room, and to compare them with AVQIs obtained from the same smartphone voice recordings with added ambient noise. METHODS A study group of 183 subjects with normal voices (n = 86) and various voice disorders (n = 97) was asked to read aloud a standard text and sustain the vowel /a/. Controlled ambient noise averaging 29.61 dB SPL was added digitally to the smartphone voice recordings. Repeated measures analysis of variance (ANOVA) with Greenhouse-Geisser correction was used to evaluate AVQI changes within subjects. To evaluate the level of agreement between AVQI measurements obtained from different voice recordings, Bland-Altman plots were used. RESULTS Repeated measures ANOVA showed that differences among AVQI results obtained from recordings made with an oral studio microphone, with a smartphone microphone, and with a smartphone microphone with added ambient noise were not statistically significant (P = 0.07). No significant systematic differences and an acceptable level of random errors in AVQI measurements of voice recordings made with oral and smartphone microphones (including added noise) were revealed. CONCLUSION The AVQI measures obtained from smartphone microphone voice recordings with experimentally added ambient noise showed acceptable agreement with the results of oral microphone recordings, thus suggesting the suitability of smartphone microphone recordings, performed even in the presence of acceptable ambient noise, for estimation of AVQI.
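The Bland-Altman agreement analysis named in this abstract can be illustrated with a minimal sketch. It assumes the conventional 95% limits of agreement (mean difference ± 1.96 × sample SD of the differences); the function name `bland_altman` and the AVQI values are hypothetical and are not taken from the study.

```python
import statistics

def bland_altman(a, b):
    """Return (bias, lower_limit, upper_limit) for paired measurements.

    bias is the mean of the pairwise differences a[i] - b[i]; the limits of
    agreement are bias +/- 1.96 * sample SD of those differences.
    """
    if len(a) != len(b) or len(a) < 2:
        raise ValueError("need two equally long sequences with at least two pairs")
    diffs = [x - y for x, y in zip(a, b)]
    bias = statistics.mean(diffs)
    sd = statistics.stdev(diffs)  # sample standard deviation (n - 1)
    return bias, bias - 1.96 * sd, bias + 1.96 * sd

# Hypothetical AVQI scores for the same four speakers (illustrative only).
studio_avqi = [2.0, 3.0, 4.0, 5.0]
phone_avqi = [2.5, 3.0, 4.5, 5.0]

bias, lower, upper = bland_altman(phone_avqi, studio_avqi)
print(f"bias = {bias:.2f}, limits of agreement = [{lower:.2f}, {upper:.2f}]")
```

A bias close to zero points to no systematic difference between the two recording conditions, while the width of the limits reflects random error.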
Affiliation(s)
- Virgilijus Uloza
  - Department of Otorhinolaryngology, Lithuanian University of Health Sciences, Kaunas, Lithuania
- Nora Ulozaitė-Stanienė
  - Department of Otorhinolaryngology, Lithuanian University of Health Sciences, Kaunas, Lithuania
- Tadas Petrauskas
  - Department of Otorhinolaryngology, Lithuanian University of Health Sciences, Kaunas, Lithuania
- Rima Kregždytė
  - Department of Preventive Medicine, Lithuanian University of Health Sciences, Kaunas, Lithuania
|
26
|
Grillo EU, Wolfberg J. An Assessment of Different Praat Versions for Acoustic Measures Analyzed Automatically by VoiceEvalU8 and Manually by Two Raters. J Voice 2020; 37:17-25. [PMID: 33384248 PMCID: PMC8236489 DOI: 10.1016/j.jvoice.2020.12.003] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Received: 09/17/2020] [Revised: 12/03/2020] [Accepted: 12/04/2020] [Indexed: 01/17/2023]
Abstract
INTRODUCTION The purpose of the study was to assess acoustic measures of fundamental frequency (fo), standard deviation of fo (SD of fo), jitter%, shimmer%, noise-to-harmonic ratio (NHR), smoothed cepstral peak prominence (CPPS), and acoustic voice quality index analyzed through multiple Praat versions automatically by VoiceEvalU8 or manually by two raters. In addition, default settings to calculate CPPS in two Praat versions manually analyzed by two raters were compared to the Maryn and Weenink [20] procedures for CPPS automatically analyzed by VoiceEvalU8. METHODS Nineteen vocally healthy females used VoiceEvalU8 to record three 5-s sustained /a/ trials, the all-voiced phrase "we were away a year ago," and a 15-s speech sample twice a day for five consecutive days. Two raters manually completed acoustic analysis using different versions of Praat and compared that analysis to measures automatically generated through a version of Praat used by VoiceEvalU8. One-way analyses of variance were run for all acoustic measures with post-hoc testing by the Bonferroni method. For acoustic measures that demonstrated significant differences, intraclass correlation coefficients were calculated. RESULTS Results showed no significant differences across automatic and manual analysis for different versions of Praat for all acoustic measures during /a/, for fo, jitter%, shimmer%, and NHR during the phrase, for jitter%, shimmer%, NHR, and CPPS during speech, and for acoustic voice quality index calculated from both sustained /a/ and the phrase. The default Praat settings for CPPS were not significantly different from the Maryn and Weenink [20] procedures for sustained /a/ and speech. Significant differences were present for SD of fo and CPPS during the phrase and for fo and SD of fo during speech. SD of fo and CPPS in the phrase were moderately correlated, and fo and SD of fo during speech demonstrated good to excellent correlations across the different versions of Praat. CONCLUSIONS Acoustic measures analyzed through sustained /a/ and some of the acoustic measures during the phrase and speech were not different across multiple versions of Praat. Automatic analysis by VoiceEvalU8 produced mean values similar to those of manual analysis by two raters. Even though SD of fo and CPPS in the phrase and fo and SD of fo in speech were different across the versions of Praat, the measures demonstrated moderate to excellent reliability.
Affiliation(s)
- Elizabeth U. Grillo
  - West Chester University, Department of Communication Sciences and Disorders, 201 Carter Drive, Suite 413, West Chester, PA, 19383, USA
- Jeremy Wolfberg
  - Massachusetts General Hospital Institute of Health Professions, Speech-Language Pathology Master's Program, Boston, MA, USA
|
27
|
Jayakumar T, Benoy JJ, Yasin HM. Effect of Age and Gender on Acoustic Voice Quality Index Across Lifespan: A Cross-sectional Study in Indian Population. J Voice 2020; 36:436.e1-436.e8. [DOI: 10.1016/j.jvoice.2020.05.025] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Received: 01/14/2020] [Revised: 05/20/2020] [Accepted: 05/26/2020] [Indexed: 10/24/2022]
|
28
|
Stappenbeck L, Barsties v. Latoszek B, Janotte B, Lehnert B. Acoustic Voice Quality Index and Acoustic Breathiness Index as two examples for strengths and weaknesses of free software in medicine. Biomed Signal Process Control 2020. [DOI: 10.1016/j.bspc.2020.101938] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Indexed: 01/19/2023]
|
29
|
Monitoring the Outcome of Phonosurgery and Vocal Exercises with Established and New Diagnostic Tools. BIOMED RESEARCH INTERNATIONAL 2020; 2020:4208189. [PMID: 32090091 PMCID: PMC6998760 DOI: 10.1155/2020/4208189] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Received: 08/14/2019] [Accepted: 12/20/2019] [Indexed: 12/26/2022]
Abstract
Instrument-assisted measuring procedures expand the options within phoniatric diagnostics by quantifying the condition of the voice. The aim of this study was to examine objective treatment-associated changes of the recently developed vocal extent measure (VEM) and the established dysphonia severity index (DSI) in relation to subjective tools, i.e., self-evaluation via voice handicap index (VHI-12) and external evaluation via auditory-perceptual assessment of hoarseness (H). The findings for H (3 raters' group assessment), VHI-12, DSI, and VEM in 152 patients of both sexes (age range 16–75 years), taken before and 3 months after phonosurgery or vocal exercises, were compared and correlated. Posttherapeutically, all of the recorded parameters improved (p < 0.001). The degree of H decreased on average by 0.5 and the VHI-12 score fell by 5 points, while DSI and VEM rose by 1.5 and 19, respectively. The correlations of these changes were significant but showed gradual differences between H and VHI-12 (r = 0.3), H and DSI (r = −0.3), and H and VEM (r = −0.4). We conclude that all investigated parameters are adequate to verify therapeutic outcomes but represent different dimensions of the voice. However, changes in the degree of H as gold standard were best recognized with the new VEM.
|
30
|
Faham M, Laukkanen AM, Ikävalko T, Rantala L, Geneid A, Holmqvist-Jämsén S, Ruusuvirta K, Pirilä S. Acoustic Voice Quality Index as a Potential Tool for Voice Screening. J Voice 2019; 35:226-232. [PMID: 31582330 DOI: 10.1016/j.jvoice.2019.08.017] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Received: 03/13/2019] [Revised: 08/18/2019] [Accepted: 08/19/2019] [Indexed: 10/25/2022]
Abstract
INTRODUCTION To diminish the risk of voice disorders in people who are highly dependent on their voices, such as teachers, vocal screening is important already at the beginning of such individuals' professional studies. A reliable, specified screening tool is needed. The Acoustic Voice Quality Index (AVQI) has been found to differentiate normal voices from abnormal voices and to serve as a treatment outcome measure. This study investigated whether AVQI could be a screening tool in combination with auditory- and self-perception of the voice to discriminate normal from slightly poor voices. TYPE OF STUDY Experimental. METHODS Some 128 female teaching students (mean age 26.39 years, SD 9.80 years) with no diagnosed voice disorders participated in this study. They read aloud a text in Finnish, sustained the vowel /a:/, and completed the Voice Handicap Index (VHI) questionnaire. Voice samples were recorded with an AKG C544L headset microphone, iFocusrite soundcard, and Praat software using a 44100 Hz sample rate and 16-bit amplitude quantization. Five expert voice therapists evaluated the samples to determine the grade of dysphonia (G) using a scale of 0-0.5 (=normal), 0.5-1 (=mild), 1-2 (=moderate), and 2-3 (=severe). Three medial seconds of [a:] and the first 31 syllables of the text were analyzed using AVQI script version 03.01 in Praat (5.3.55). The analysis gives one AVQI score per participant (scale 0-10). The AVQI threshold of normal and disordered voices for Finnish speakers is 1.83; a Gmean = 0.0-0.5 and VHI score <19 were considered normal. Statistical analysis was done using the receiver operating characteristic (ROC) curve, Spearman's correlation coefficient, and the independent samples t test. RESULTS According to the AVQI results, the area under the curve (AROC) was 0.554, which is fair. The Youden index gave a cutoff value of 0.30 with a sensitivity of 85% and a specificity of 81.1%. There were weak but significant correlations between Gmean and AVQI and two AVQI parameters, smoothed cepstral peak prominence and harmonic-to-noise ratio (r = 0.27; -0.24; -0.20, respectively; P < 0.05); and between total VHI and AVQI score and cepstral peak prominence (r = 0.21; 0.20, respectively; P < 0.05). Furthermore, the AVQI scores differed significantly between the groups with a VHI total score <19 and ≥19. CONCLUSIONS AVQI did not differentiate between voices that had been perceptually judged as normal or slightly abnormal, but a combination of perceptual assessment in the form of AVQI and VHI could better screen slightly deviant voices.
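How a Youden-index cutoff is usually derived from ROC data can be sketched as follows. This is a generic illustration under stated assumptions, not the authors' analysis: it assumes higher scores indicate a more deviant voice, takes the Youden index as J = sensitivity + specificity − 1, and uses a hypothetical function name (`youden_cutoff`) and invented scores and labels.

```python
def youden_cutoff(scores, labels):
    """Return (cutoff, sensitivity, specificity, J) maximising Youden's J.

    labels: 1 = deviant voice, 0 = normal voice.
    A score >= cutoff is classified as deviant. Ties keep the lowest cutoff.
    """
    positives = sum(labels)
    negatives = len(labels) - positives
    if positives == 0 or negatives == 0:
        raise ValueError("both classes must be present")
    best = None
    for cutoff in sorted(set(scores)):
        true_pos = sum(1 for s, y in zip(scores, labels) if y == 1 and s >= cutoff)
        true_neg = sum(1 for s, y in zip(scores, labels) if y == 0 and s < cutoff)
        sensitivity = true_pos / positives
        specificity = true_neg / negatives
        j = sensitivity + specificity - 1
        if best is None or j > best[3]:
            best = (cutoff, sensitivity, specificity, j)
    return best

# Invented index scores and perceptual labels (illustrative only).
scores = [1.0, 1.5, 2.0, 2.5, 3.0, 3.5]
labels = [0, 0, 1, 0, 1, 1]

cutoff, sensitivity, specificity, j = youden_cutoff(scores, labels)
print(f"cutoff = {cutoff}, sensitivity = {sensitivity:.2f}, specificity = {specificity:.2f}")
```

Scanning every observed score as a candidate threshold is enough for small samples; each candidate corresponds to one point on the empirical ROC curve.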
Affiliation(s)
- Maryam Faham
  - Department of Speech and Language Pathology Rehabilitation Faculty, Shiraz University of Medical Sciences, Shiraz, Iran; Rehabilitation Sciences Research Center, Shiraz University of Medical Sciences, Shiraz, Iran
- Anne-Maria Laukkanen
  - Speech and Voice Research Laboratory, Faculty of Social Sciences, Tampere University, Tampere, Finland
- Tero Ikävalko
  - Speech and Voice Research Laboratory, Faculty of Social Sciences, Tampere University, Tampere, Finland
- Leena Rantala
  - Unit of Logopedics, Faculty of Social Sciences, Tampere University, Tampere, Finland
- Ahmed Geneid
  - Department of Otorhinolaryngology and Phoniatrics - Head and Neck Surgery, Helsinki University Hospital and University of Helsinki, Helsinki, Finland
- Kaarina Ruusuvirta
  - Department of Otorhinolaryngology and Phoniatrics - Head and Neck Surgery, Helsinki University Hospital and University of Helsinki, Helsinki, Finland
- Sirpa Pirilä
  - Faculty of Humanities/Logopedics, Child Language Research, University of Oulu, Oulu, Finland
|
31
|
Hosokawa K, von Latoszek BB, Ferrer-Riesgo CA, Iwahashi T, Iwahashi M, Iwaki S, Kato C, Yoshida M, Umatani M, Miyauchi A, Matsushiro N, Inohara H, Ogawa M, Maryn Y. Acoustic Breathiness Index for the Japanese-Speaking Population: Validation Study and Exploration of Affecting Factors. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2019; 62:2617-2631. [PMID: 31296106 DOI: 10.1044/2019_jslhr-s-19-0077] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Indexed: 06/09/2023]
Abstract
Objectives The purposes of this study were to validate the Acoustic Breathiness Index (ABI) for the Japanese-speaking population and to determine whether it is independent of factors such as sex, age, and perceptual ratings of roughness. Method First, the concurrent validity of the ABI for perceptual breathiness was evaluated on the concatenations of continuous speech and sustained vowels from 288 patients with varying degrees of dysphonia. The diagnostic accuracy was examined on 343 samples with 55 additional normophonic speakers. Second, the validity related to responsiveness-to-change was estimated on 222 samples obtained before and after interventions for 111 voice-disordered patients. Third, the relationships between the ABI and other variables (i.e., perceptual hoarseness/breathiness/roughness, sex, and age) were explored using bivariate and multivariate analyses for the 288 patients. Results First, the concurrent validity and the responsiveness-to-change validity were confirmed by strong correlation coefficients of .890 and .878, respectively. Second, the receiver operating characteristic analysis showed the area under the curve to be 0.939, indicating excellent accuracy. The ABI of 3.44 exhibited a sensitivity of 76.3% and a specificity of 94.1%. Third, although bivariate analyses revealed a weak relationship between ABI and roughness and an ABI difference by age, multiple regression analyses showed a strong relation between only ABI and breathiness, without a meaningful contribution from roughness, sex, and age factors. Conclusion The study confirmed that the ABI is an accurate and specific tool to estimate breathiness levels in the Japanese-speaking population and neither roughness, sex, nor age significantly affects the ABI.
Affiliation(s)
- Kiyohito Hosokawa
  - Department of Otorhinolaryngology, Japan Community Health Care Organization, Osaka Hospital, Japan
  - Department of Otorhinolaryngology, Osaka Police Hospital, Japan
  - Department of Otorhinolaryngology and Head & Neck Surgery, Osaka University Graduate School of Medicine, Japan
- Carlos Ariel Ferrer-Riesgo
  - Informatics Research Center, Central University of Las Villas, Santa Clara, Cuba
  - Department of Computer Science, Friedrich-Alexander University Erlangen-Nürnberg, Germany
- Toshihiko Iwahashi
  - Department of Otorhinolaryngology and Head & Neck Surgery, Osaka University Graduate School of Medicine, Japan
- Shinobu Iwaki
  - Department of Otorhinolaryngology and Head & Neck Surgery, Kobe University Graduate School of Medicine, Hyogo, Japan
- Chieri Kato
  - Department of Otorhinolaryngology and Head & Neck Surgery, Osaka University Graduate School of Medicine, Japan
- Misao Yoshida
  - Department of Rehabilitation, Nishinomiya Kaisei Hospital, Hyogo, Japan
- Masanori Umatani
  - Department of Otorhinolaryngology and Head & Neck Surgery, Osaka University Graduate School of Medicine, Japan
- Naoki Matsushiro
  - Department of Otorhinolaryngology, Osaka Police Hospital, Japan
  - Department of Otorhinolaryngology and Head & Neck Surgery, Osaka University Graduate School of Medicine, Japan
- Hidenori Inohara
  - Department of Otorhinolaryngology and Head & Neck Surgery, Osaka University Graduate School of Medicine, Japan
- Makoto Ogawa
  - Department of Otorhinolaryngology, Japan Community Health Care Organization, Osaka Hospital, Japan
  - Department of Otorhinolaryngology and Head & Neck Surgery, Osaka University Graduate School of Medicine, Japan
- Youri Maryn
  - Speech-Language Pathology, SRH University of Applied Health Sciences, Gera, Thuringia, Germany
  - European Institute for ORL, Sint-Augustinus Hospital, Antwerp, Belgium
  - Faculty of Education, Health & Social Work, University College Ghent, Belgium
|
32
|
A Case of Specificity: How Does the Acoustic Voice Quality Index Perform in Normophonic Subjects? APPLIED SCIENCES-BASEL 2019. [DOI: 10.3390/app9122527] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Indexed: 11/16/2022]
Abstract
The acoustic voice quality index (AVQI) is a multiparametric tool based on six acoustic measurements to quantify overall voice quality in an objective manner, with the smoothed version of the cepstral peak prominence (CPPS) as its main contributor. In the last decade, many studies demonstrated its robust diagnostic accuracy and high sensitivity to voice changes across voice therapy in different languages. The aim of the present study was to provide information regarding AVQI's and CPPS's performance in normophonic non-treatment-seeking subjects, since these data are still scarce. Concatenated voice samples, consisting of sustained vowel phonation and continuous speech, from 123 subjects (72 females, 51 males; between 20 and 60 years old) without vocally relevant complaints were evaluated by three raters and analyzed with AVQI v.02.06. According to this auditory-perceptual evaluation, two cohorts were set up (normophonia versus slight perceived dysphonia). First, gender effects were investigated. Secondly, between-cohort differences in AVQI and CPPS were investigated. Thirdly, with the number of judges giving G = 1 to partition three sub-levels of slight hoarseness as an independent factor, differences in AVQI and CPPS across these sub-levels were investigated. For AVQI, no significant gender effect was found, whereas, for CPPS, significant trends were observed. For both AVQI and CPPS, no significant differences were found between normophonic and slightly dysphonic subjects. For AVQI, however, this difference did approach significance. These findings emphasize the need for a normative study with a greater sample size and subsequently greater statistical power to detect possible significant effects and differences.
|
33
|
Freymann ML, Mathmann P, Rummich J, Müller C, Neumann K, Nawka T, Caffier PP. Gender-specific reference ranges of the vocal extent measure in young and healthy adults. LOGOP PHONIATR VOCO 2019; 45:73-81. [PMID: 31157590 DOI: 10.1080/14015439.2019.1617894] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Indexed: 10/26/2022]
Abstract
Objective: The recently developed vocal extent measure (VEM) quantifies a patient's vocal capacity as documented in the voice range profile (VRP). This study presents the first reference ranges of the VEM for young subjects without voice complaints. Furthermore, this study investigates the influence of gender on the VEM as well as the correlation of the VEM with the dysphonia severity index (DSI). Patients and methods: Reference ranges were captured by combining a retrospective analysis of subjects who received a medical fitness certificate of a healthy voice (n = 135) and a prospective analysis of adult volunteers without voice complaints (n = 67). Every participant obtained a standardized voice assessment comprising videolaryngostroboscopy, auditory-perceptual analysis, acoustic analysis, VRP, and the Voice Handicap Index (VHI-9i). Results: A total of 202 subjects were recruited and investigated. Due to our stringent selection criteria, 51 participants had to be excluded from further analysis. The remaining data of 151 participants (52 males, 99 females), aged 18-39 years (mean 24, SD 5), were analysed in more detail. The mean of the VEM amounted to 123.7 (SD 12.6) for males and 114.4 (SD 13.3) for females. The values differed significantly between both sexes and correlated significantly with the corresponding DSI values. Conclusion: By introducing the first reference values, this study represents the next step of implementing the VEM in daily phoniatric diagnostics. These values serve as a basis to interpret the VEM regarding the degree of severity of voice disorders and to evaluate treatment success.
Affiliation(s)
- Marie-Louise Freymann
  - Department of Audiology and Phoniatrics, Charité - University Medicine Berlin, Berlin, Germany
- Philipp Mathmann
  - Department of Audiology and Phoniatrics, Charité - University Medicine Berlin, Berlin, Germany
- Julius Rummich
  - Department of Audiology and Phoniatrics, Charité - University Medicine Berlin, Berlin, Germany
- Constanze Müller
  - Department of Internal Medicine, HELIOS Kliniken Schwerin, Schwerin, Germany
- Konrad Neumann
  - Institute of Biometry and Clinical Epidemiology, Charité - University Medicine Berlin, Berlin, Germany
- Tadeus Nawka
  - Department of Audiology and Phoniatrics, Charité - University Medicine Berlin, Berlin, Germany
- Philipp P Caffier
  - Department of Audiology and Phoniatrics, Charité - University Medicine Berlin, Berlin, Germany
|
34
|
Ulozaite-Staniene N, Petrauskas T, Šaferis V, Uloza V. Exploring the feasibility of the combination of acoustic voice quality index and glottal function index for voice pathology screening. Eur Arch Otorhinolaryngol 2019; 276:1737-1745. [DOI: 10.1007/s00405-019-05433-5] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Received: 12/10/2018] [Accepted: 04/12/2019] [Indexed: 11/25/2022]
|
35
|
Priss I, Barsties V Latoszek B, Jäger-Priss U, Lehnert B. [Questionnaire for the assessment of the voice self-concept in a neurological practice : Applicability for the identification of patients with high consultation needs]. DER NERVENARZT 2018; 90:601-608. [PMID: 30470870 DOI: 10.1007/s00115-018-0642-x] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Indexed: 11/24/2022]
Abstract
INTRODUCTION Hoarseness occurs as a symptom of a multitude of neurological diseases. The importance people attribute to their voice differs greatly and it seems advisable to take this into account during consultation. The Questionnaire for the assessment of the voice self-concept (FESS) is able to measure this importance. The aim of this study was to determine the applicability of the FESS in a neurological and psychiatric medical practice. MATERIAL AND METHODS A total of 556 patients of a neurological and psychiatric medical practice completed the FESS, and the severity of hoarseness was measured objectively by means of the Acoustic Voice Quality Index (AVQI). Another 75 patients only completed the questionnaire. The value distribution of the questionnaire, the correlation with age and gender, the voice quality as well as the daily strain on the voice were investigated. RESULTS The value distribution ranges of all three scales of the FESS are completely utilized. Floor and ceiling effects do not play a relevant role. Gender and age do not seem to be of significance. The influence of the measurable voice quality is minor. In contrast, a correlation between voice self-concept and the daily voice strain was detected. DISCUSSION The FESS has the potential to be an interesting instrument for the measurement of the subjective importance of a patient's voice. Here reference values are generated, against which individual results can be correlated. The AVQI provides an inexpensive, fast instrument for the objective measurement of hoarseness in a neurological and psychiatric medical practice.
Affiliation(s)
- Ina Priss
  - Klinik für Hals- Nasen- und Ohrenkrankheiten, Kopf- und Halschirurgie, Universitätsmedizin Greifswald, Ferdinand-Sauerbruch-Straße, 17475, Greifswald, Deutschland
- Bernhard Lehnert
  - Klinik für Hals- Nasen- und Ohrenkrankheiten, Kopf- und Halschirurgie, Universitätsmedizin Greifswald, Ferdinand-Sauerbruch-Straße, 17475, Greifswald, Deutschland
|