1
|
Yun D, Shen Y, Lentz JJ. Verification of Estimated Output Signal-to-Noise Ratios From a Phase Inversion Technique Using a Simulated Hearing Aid. Am J Audiol 2023; 32:197-209. [PMID: 36706459 PMCID: PMC10166192 DOI: 10.1044/2022_aja-22-00023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2022] [Revised: 07/15/2022] [Accepted: 10/24/2022] [Indexed: 01/28/2023] Open
Abstract
PURPOSE The signal-to-noise ratio (SNR) for speech presented in background noise may vary after being processed by digital hearing aids with nonlinear signal processing algorithms, such as wide dynamic range compression (WDRC). A phase inversion technique has been previously developed to assess the output SNR of hearing aids. However, systematic validations of this technique have not been conducted. This study aims to validate the phase inversion technique. METHOD A simulated hearing aid with multichannel WDRC was implemented, from which the output SNRs, computed via shadow filtering, for connected speech in background noise were directly computed. The agreement between the shadow filter output SNRs and those estimated using the phase inversion technique for the same stimuli was utilized to validate the phase inversion technique. The background noise was 2- or 20-talker babble noise, and the speech stimuli were presented at SNRs of -10 to +10 dB at the input of the simulated hearing aid. The simulated hearing aid was configured to provide amplification for four representative audiograms, and the WDRC was set to be fast or slow acting. To investigate the effects of additive noise, independent of the presented noise stimulus, on the phase inversion estimated output SNR, the same simulated hearing aid was implemented with an additive Gaussian noise at its input (45 and 60 dB SPL). RESULTS Results showed that the phase inversion technique could either overestimate or underestimate output SNR, depending on the test condition; the estimation errors tended to coincide with temporal landmarks, such as natural pauses between consecutive sentences or fricatives; and increasing the simulated noise led to poorer estimates of output SNR. CONCLUSIONS Results imply that the accuracy of the phase inversion technique is dependent on the test conditions. Thus, the phase inversion technique should be used with caution, and its validity should be evaluated further.
Collapse
Affiliation(s)
- Donghyeon Yun
- Department of Speech, Language and Hearing Sciences, Indiana University Bloomington
| | - Yi Shen
- Department of Speech and Hearing Sciences, University of Washington, Seattle
| | - Jennifer J. Lentz
- Department of Speech, Language and Hearing Sciences, Indiana University Bloomington
| |
Collapse
|
2
|
Souza PE, Ellis G, Marks K, Wright R, Gallun F. Does the Speech Cue Profile Affect Response to Amplitude Envelope Distortion? JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2021; 64:2053-2069. [PMID: 34019777 PMCID: PMC8740712 DOI: 10.1044/2021_jslhr-20-00481] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/15/2020] [Revised: 11/13/2020] [Accepted: 02/11/2021] [Indexed: 06/12/2023]
Abstract
Purpose A broad area of interest to our group is to understand the consequences of the "cue profile" (a measure of how well a listener can utilize audible temporal and/or spectral cues for listening scenarios in which a subset of cues is distorted. The study goal was to determine if listeners whose cue profile indicated that they primarily used temporal cues for recognition would respond differently to speech-envelope distortion than listeners who utilized both spectral and temporal cues. Method Twenty-five adults with sensorineural hearing loss participated in the study. The listener's cue profile was measured by analyzing identification patterns for a set of synthetic syllables in which envelope rise time and formant transitions were varied. A linear discriminant analysis quantified the relative contributions of spectral and temporal cues to identification patterns. Low-context sentences in noise were processed with time compression, wide-dynamic range compression, or a combination of time compression and wide-dynamic range compression to create a range of speech-envelope distortions. An acoustic metric, a modified version of the Spectral Correlation Index, was calculated to quantify envelope distortion. Results A binomial generalized linear mixed-effects model indicated that envelope distortion, the cue profile, the interaction between envelope distortion and the cue profile, and the pure-tone average were significant predictors of sentence recognition. Conclusions The listeners with good perception of spectro-temporal contrasts were more resilient to the detrimental effects of envelope compression than listeners who used temporal cues to a greater extent. The cue profile may provide information about individual listening that can direct choice of hearing aid parameters, especially those parameters that affect the speech envelope.
Collapse
|
3
|
Kowalewski B, Dau T, May T. Perceptual Evaluation of Signal-to-Noise-Ratio-Aware Dynamic Range Compression in Hearing Aids. Trends Hear 2020; 24:2331216520930531. [PMID: 32573373 PMCID: PMC7313326 DOI: 10.1177/2331216520930531] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
Dynamic range compression is a compensation strategy commonly used in modern hearing aids. Fast-acting systems respond relatively quickly to the fluctuations in the input level. This allows for more effective compression of the dynamic range of speech and hence enhanced the audibility of its low-intensity components. However, such processing also amplifies the background noise, distorts the modulation spectra of both the speech and the background, and can reduce the output signal-to-noise ratio (SNR). Recently, May et al. proposed a novel SNR-aware compression strategy, in which the compression speed is adapted depending on whether speech is present or absent. Fast-acting compression is applied to speech-dominated time–frequency (T-F) units, while noise-dominated T-F units are processed using slow-acting compression. It has been shown that this strategy provides a similar effective compression of the speech dynamic range as conventional fast-acting compression, while introducing fewer distortions of the modulation spectrum of the background and providing an improved output SNR. In this study, this SNR-aware compression strategy was compared with conventional fast- and slow-acting compression in terms of speech intelligibility and subjective preference in a group of 17 hearing-impaired listeners with varying degree of hearing loss. The results show a speech intelligibility benefit of the SNR-aware compression strategy over the conventional slow-acting system. Furthermore, the SNR-aware approach demonstrates an increased subjective preference compared with both conventional fast- and slow-acting systems.
Collapse
Affiliation(s)
- Borys Kowalewski
- Hearing Systems Section, Department of Health Technology, Technical University of Denmark
| | - Torsten Dau
- Hearing Systems Section, Department of Health Technology, Technical University of Denmark
| | - Tobias May
- Hearing Systems Section, Department of Health Technology, Technical University of Denmark
| |
Collapse
|
4
|
Shetty HN, Puttabasappa M. Representation of amplified speech at cortical level in good and poor hearing aid performers. Braz J Otorhinolaryngol 2019; 86:558-567. [PMID: 31122881 PMCID: PMC9422500 DOI: 10.1016/j.bjorl.2019.02.010] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2018] [Revised: 09/02/2018] [Accepted: 02/17/2019] [Indexed: 11/27/2022] Open
Abstract
Introduction Hearing aid users reject their own hearing aid because of annoyance with background noise. The reason for dissatisfaction is located anywhere from the hearing aid microphone to the integrity of neurons along the auditory pathway. In this preview, the output of hearing aid was recorded at the level of ear canal and at auditory cortex in good and poor hearing aid users, who were classified using acceptable noise level. Objective To study the representation of amplified speech in good and poor hearing aid performers. Methods A total of 60 participants (age ranged 15–65 years) with moderate bilateral sensorineural hearing impairment grouped into good (n = 35) and poor (n = 25) hearing aid performers. Gap detection test and aided SNR 50 were administered. In addition, ear canal acoustic measures and cortical auditory evoked potentials were recorded in unaided and aided conditions at 65 dB SPL. Results Hearing aid minimally alters temporal contrast of speech reflected in envelope difference index. Although having similar temporal impairment, acoustic characteristics of amplified speech sounds and SNR 50 scores from both groups, the aided cortical auditory evoked potentials surprisingly showed significant earlier latencies and higher amplitudes in good performers than poor performers. In addition, good and poor performers classified based on annoyance level was predicted by latencies of 2N1 and 2P2 components of acoustic change complex. Further, a follow-up revealed hearing aid use has relation with acceptance towards noise. Conclusion Participants who are willing to accept noise from those who are not willing to accept noise have subtle physiological changes evident at the auditory cortex, which supports the hearing aid usage.
Collapse
Affiliation(s)
| | - Manjula Puttabasappa
- All India Institute of Speech and Hearing, Department of Audiology, Karnataka, India
| |
Collapse
|
5
|
Kowalewski B, Zaar J, Fereczkowski M, MacDonald EN, Strelcyk O, May T, Dau T. Effects of Slow- and Fast-Acting Compression on Hearing-Impaired Listeners' Consonant-Vowel Identification in Interrupted Noise. Trends Hear 2019; 22:2331216518800870. [PMID: 30311552 PMCID: PMC6187394 DOI: 10.1177/2331216518800870] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
There is conflicting evidence about the relative benefit of slow- and fast-acting compression for speech intelligibility. It has been hypothesized that fast-acting compression improves audibility at low signal-to-noise ratios (SNRs) but may distort the speech envelope at higher SNRs. The present study investigated the effects of compression with a nearly instantaneous attack time but either fast (10 ms) or slow (500 ms) release times on consonant identification in hearing-impaired listeners. Consonant–vowel speech tokens were presented at a range of presentation levels in two conditions: in the presence of interrupted noise and in quiet (with the compressor “shadow-controlled” by the corresponding mixture of speech and noise). These conditions were chosen to disentangle the effects of consonant audibility and noise-induced forward masking on speech intelligibility. A small but systematic intelligibility benefit of fast-acting compression was found in both the quiet and the noisy conditions for the lower speech levels. No detrimental effects of fast-acting compression were observed when the speech level exceeded the level of the noise. These findings suggest that fast-acting compression provides an audibility benefit in fluctuating interferers when compared with slow-acting compression while not substantially affecting the perception of consonants at higher SNRs.
Collapse
Affiliation(s)
- Borys Kowalewski
- 1 Hearing Systems Group, Department of Electrical Engineering, Technical University of Denmark, Lyngby, Denmark
| | - Johannes Zaar
- 1 Hearing Systems Group, Department of Electrical Engineering, Technical University of Denmark, Lyngby, Denmark
| | - Michal Fereczkowski
- 1 Hearing Systems Group, Department of Electrical Engineering, Technical University of Denmark, Lyngby, Denmark
| | - Ewen N MacDonald
- 1 Hearing Systems Group, Department of Electrical Engineering, Technical University of Denmark, Lyngby, Denmark
| | - Olaf Strelcyk
- 2 Sonova U.S. Corporate Services, Warrenville, IL, USA
| | - Tobias May
- 1 Hearing Systems Group, Department of Electrical Engineering, Technical University of Denmark, Lyngby, Denmark
| | - Torsten Dau
- 1 Hearing Systems Group, Department of Electrical Engineering, Technical University of Denmark, Lyngby, Denmark
| |
Collapse
|
6
|
May T, Kowalewski B, Dau T. Signal-to-Noise-Ratio-Aware Dynamic Range Compression in Hearing Aids. Trends Hear 2018; 22:2331216518790903. [PMID: 30117366 PMCID: PMC6100123 DOI: 10.1177/2331216518790903] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
Fast-acting dynamic range compression is a level-dependent amplification scheme which aims to restore audibility for hearing-impaired listeners. However, when being applied to noisy speech at positive signal-to-noise ratios (SNRs), the gain function typically changes rapidly over time as it is driven by the short-term fluctuations of the speech signal. This leads to an amplification of the noise components in the speech gaps, which reduces the output SNR and distorts the acoustic properties of the background noise. An adaptive compression scheme is proposed here which utilizes information about the SNR in different frequency channels to adaptively change the characteristics of the compressor. Specifically, fast-acting compression is applied to speech-dominated time-frequency (T-F) units where the SNR is high, while slow-acting compression is used to effectively linearize the processing for noise-dominated T-F units where the SNR is low. A systematic evaluation of this SNR-aware compression scheme showed that the effective compression of speech components embedded in noise was similar to that of a conventional fast-acting system, whereas natural fluctuations in the background noise were preserved in a similar way as when a slow-acting compressor was applied.
Collapse
Affiliation(s)
- Tobias May
- 1 Hearing Systems Group, Department of Electrical Engineering, Technical University of Denmark, Lyngby, Denmark
| | - Borys Kowalewski
- 1 Hearing Systems Group, Department of Electrical Engineering, Technical University of Denmark, Lyngby, Denmark
| | - Torsten Dau
- 1 Hearing Systems Group, Department of Electrical Engineering, Technical University of Denmark, Lyngby, Denmark
| |
Collapse
|
7
|
Davies-Venn E, Souza P. The role of spectral resolution, working memory, and audibility in explaining variance in susceptibility to temporal envelope distortion. J Am Acad Audiol 2018; 25:592-604. [PMID: 25313549 DOI: 10.3766/jaaa.25.6.9] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
BACKGROUND Several studies have shown that hearing thresholds alone cannot adequately predict listeners' success with hearing-aid amplification. Furthermore, previous studies have shown marked differences in listeners' susceptibility to distortions introduced by certain nonlinear amplification parameters. PURPOSE The purpose of this study was to examine the role of spectral resolution, working memory, and audibility in explaining perceptual susceptibility to temporal envelope and other hearing-aid compression-induced distortions for listeners with mild to moderate and moderate to severe hearing loss. RESEARCH DESIGN A between-subjects repeated-measures design was used to compare speech recognition scores with linear versus compression amplification, for listeners with mild to moderate and moderate to severe hearing loss. STUDY SAMPLE The study included 15 adult listeners with mild to moderate hearing loss and 13 adults with moderate to severe hearing loss. DATA COLLECTION/ANALYSIS Speech recognition scores were measured for vowel-consonant-vowel syllables processed with linear, moderate compression, and extreme compression amplification. Perceptual susceptibility to compression-induced temporal envelope distortion was defined as the difference in scores between linear and compression amplification. Both overall scores and consonant feature scores (i.e., place, manner, and voicing) were analyzed. Narrowband spectral resolution was measured using individual measures of auditory filter bandwidth at 2000 Hz. Working memory was measured using the reading span test. Signal audibility was quantified using the Aided Audibility Index. Multiple linear regression was used to determine the predictive role of spectral resolution, working memory, and audibility benefit on listeners' susceptibility to compression-induced distortions. RESULTS For all listeners, spectral resolution, working memory, and audibility benefit were significant predictors of overall distortion scores. For listeners with moderate to severe hearing loss, spectral resolution and audibility benefit predicted distortion scores for consonant place and manner of articulation features, and audibility benefit predicted distortion scores for consonant voicing features. For listeners with mild to moderate hearing loss, the model did not predict distortion scores for overall or consonant feature scores. CONCLUSIONS The results from this study suggest that when audibility is adequately controlled, measures of spectral resolution may identify the listeners who are most susceptible to compression-induced distortions. Working memory appears to modulate the negative effect of these distortions for listeners with moderate to severe hearing loss.
Collapse
Affiliation(s)
| | - Pamela Souza
- Communication Sciences and Disorders and Knowles Hearing Center, Northwestern University, Evanston, IL
| |
Collapse
|
8
|
Alexander JM, Rallapalli V. Acoustic and perceptual effects of amplitude and frequency compression on high-frequency speech. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2017; 142:908. [PMID: 28863610 DOI: 10.1121/1.4997938] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]
Abstract
This study investigated how six different amplification methods influence acoustic properties, and subsequently perception, of high-frequency cues in fricatives that have been processed with conventional full bandwidth amplification or nonlinear frequency compression (NFC)-12 conditions total. Amplification methods included linear gain, fast/slow-acting wide dynamic range compression crossed with fixed/individualized compression parameters, and a method with adaptive time constants. Twenty-one hearing-impaired listeners identified seven fricatives in nonsense syllables produced by female talkers. For NFC stimuli, frequency-compressed filters that precisely aligned 1/3-octave bands between input and output were used to quantify effective compression ratio, audibility, and temporal envelope modulation relative to the input. Results indicated significant relationships between these acoustic properties, each of which contributed significantly to fricative recognition across the entire corpus of stimuli. Recognition was significantly better for NFC stimuli compared with full bandwidth stimuli, regardless of the amplification method, which had complementary effects on audibility and envelope modulation. Furthermore, while there were significant differences in recognition across the amplification methods, they were not consistent across phonemes. Therefore, neither recognition nor acoustic data overwhelmingly suggest that one amplification method should be used over another for transmission of high-frequency cues in isolated syllables. Longer duration stimuli and more realistic listening conditions should be examined.
Collapse
Affiliation(s)
- Joshua M Alexander
- Department of Speech, Language, and Hearing Sciences, Purdue University, West Lafayette, Indiana 47907, USA
| | - Varsha Rallapalli
- Department of Speech, Language, and Hearing Sciences, Purdue University, West Lafayette, Indiana 47907, USA
| |
Collapse
|
9
|
Reinhart PN, Souza PE, Srinivasan NK, Gallun FJ. Effects of Reverberation and Compression on Consonant Identification in Individuals with Hearing Impairment. Ear Hear 2016; 37:144-52. [PMID: 26462171 PMCID: PMC4767560 DOI: 10.1097/aud.0000000000000229] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
OBJECTIVES Hearing aids are frequently used in reverberant environments; however, relatively little is known about how reverberation affects the processing of signals by modern hearing-aid algorithms. The purpose of this study was to investigate the acoustic and behavioral effects of reverberation and wide-dynamic range compression (WDRC) in hearing aids on consonant identification for individuals with hearing impairment. DESIGN Twenty-three listeners with mild to moderate sloping sensorineural hearing loss were tested monaurally under varying degrees of reverberation and WDRC conditions. Listeners identified consonants embedded within vowel-consonant-vowel nonsense syllables. Stimuli were processed to simulate a range of realistic reverberation times and WDRC release times using virtual acoustic simulations. In addition, the effects of these processing conditions were acoustically analyzed using a model of envelope distortion to examine the effects on the temporal envelope. RESULTS Aided consonant identification significantly decreased as reverberation time increased. Consonant identification was also significantly affected by WDRC release time. This relationship was such that individuals tended to perform significantly better with longer release times. There was no significant interaction between reverberation and WDRC. The application of the acoustic model to the processed signal showed a close relationship between trends in the behavioral performance and distortion to the temporal envelope resulting from reverberation and WDRC. The results of the acoustic model demonstrated the same trends found in the behavioral data for both reverberation and WDRC. CONCLUSIONS Reverberation and WDRC release time both affect aided consonant identification for individuals with hearing impairment, and these condition effects are associated with alterations to the temporal envelope. There was no significant interaction between reverberation and WDRC release time.
Collapse
Affiliation(s)
- Paul N Reinhart
- 1Department of Communication Sciences and Disorders, Northwestern University, Evanston, Illinois, USA; 2Knowles Hearing Center, Evanston, Illinois, USA; 3National Center for Rehabilitative Auditory Research, VA Portland Health Care System, Portland, Oregon, USA; and 4Department of Otolaryngology/Head & Neck Surgery, Oregon Health and Science University, Portland, Oregon, USA
| | | | | | | |
Collapse
|
10
|
Souza PE, Wright RA, Blackburn MC, Tatman R, Gallun FJ. Individual sensitivity to spectral and temporal cues in listeners with hearing impairment. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2015; 58:520-34. [PMID: 25629388 PMCID: PMC4462137 DOI: 10.1044/2015_jslhr-h-14-0138] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/22/2014] [Revised: 10/14/2014] [Accepted: 12/18/2014] [Indexed: 05/26/2023]
Abstract
PURPOSE The present study was designed to evaluate use of spectral and temporal cues under conditions in which both types of cues were available. METHOD Participants included adults with normal hearing and hearing loss. We focused on 3 categories of speech cues: static spectral (spectral shape), dynamic spectral (formant change), and temporal (amplitude envelope). Spectral and/or temporal dimensions of synthetic speech were systematically manipulated along a continuum, and recognition was measured using the manipulated stimuli. Level was controlled to ensure cue audibility. Discriminant function analysis was used to determine to what degree spectral and temporal information contributed to the identification of each stimulus. RESULTS Listeners with normal hearing were influenced to a greater extent by spectral cues for all stimuli. Listeners with hearing impairment generally utilized spectral cues when the information was static (spectral shape) but used temporal cues when the information was dynamic (formant transition). The relative use of spectral and temporal dimensions varied among individuals, especially among listeners with hearing loss. CONCLUSION Information about spectral and temporal cue use may aid in identifying listeners who rely to a greater extent on particular acoustic cues and applying that information toward therapeutic interventions.
Collapse
Affiliation(s)
- Pamela E. Souza
- Northwestern University, Evanston, IL
- Knowles Hearing Center, Northwestern University, Evanston, IL
| | | | | | | | - Frederick J. Gallun
- National Center for Rehabilitative Auditory Research, Portland VA Medical Center, OR
- Oregon Health & Science University, Portland
| |
Collapse
|
11
|
Geetha C, Manjula P. Effect of Compression, Digital Noise Reduction and Directionality on Envelope Difference Index, Log-Likelihood Ratio and Perceived Quality. Audiol Res 2014; 4:110. [PMID: 26557357 PMCID: PMC4627141 DOI: 10.4081/audiores.2014.110] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2014] [Revised: 08/27/2014] [Accepted: 10/01/2014] [Indexed: 12/03/2022] Open
Abstract
The aim of the present study was to evaluate the use of the envelope difference index (EDI) and log-likelihood ratio (LLR) to quantify the independent and interactive effects of wide dynamic range compression, digital noise reduction and directionality, and to carry out self-rated quality measures. A recorded sentence embedded in speech spectrum noise at +5 dB signal to noise ratio was presented to a four channel digital hearing aid and the output was recorded with different combinations of algorithms at 30, 45 and 70 dB HL levels of presentation through a 2 cc coupler. EDI and LLR were obtained in comparison with the original signal using MATLAB software. In addition, thirty participants with normal hearing sensitivity rated the output on the loudness and clarity parameters of quality. The results revealed that the temporal changes happening at the output is independent of the number of algorithms activated together in a hearing aid. However, at a higher level of presentation, temporal cues are better preserved if all of these algorithms are deactivated. The spectral components speech tend to get affected by the presentation level. The results also indicate the importance of quality rating as this helps in considering whether the spectral and/or temporal deviations created in the hearing aid are desirable or not.
Collapse
Affiliation(s)
- Chinnaraj Geetha
- Department of Audiology, All India Institute of Speech and Hearing , Mysore, India
| | - Puttabasappa Manjula
- Department of Audiology, All India Institute of Speech and Hearing , Mysore, India
| |
Collapse
|
12
|
Hoover EC, Souza PE, Gallun FJ. The consonant-weighted envelope difference index (cEDI): a proposed technique for quantifying envelope distortion. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2012; 55:1802-1806. [PMID: 22411284 PMCID: PMC3538866 DOI: 10.1044/1092-4388(2012/11-0255)] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/28/2023]
Abstract
PURPOSE The benefits of amplitude compression in hearing aids may be limited by distortion resulting from rapid gain adjustment. To evaluate this, it is convenient to quantify distortion by using a metric that is sensitive to the changes in the processed signal that decrease consonant recognition, such as the Envelope Difference Index (EDI; Fortune, Woodruff, & Preves, 1994). However, the EDI relies on the entire duration of the signal, including portions irrelevant to consonant recognition. METHOD This note describes a computationally efficient method of automatically segmenting speech in time according to the syllable structure. Our technique uses the 1st derivative of the envelope as a basis. Peaks located in the derivative were used to generate a weighting function for the computation of a metric of signal distortion. RESULTS The weighting function significantly improved the variance explained in consonant recognition scores over previous methods. However, only 3.2% of the variance was explained in the revised model. CONCLUSION This technique was effective in focusing the analysis of distortion on specific segments of the signal. Use of the technique has implications for speech analysis. The difference in the amplitude envelope of consonants is not a robust model of the effect of hearing aid compression on consonant recognition.
Collapse
|