1
|
Wasiuk PA, Calandruccio L, Oleson JJ, Buss E. Predicting speech-in-speech recognition: Short-term audibility and spatial separation. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2023; 154:1827-1837. [PMID: 37728286 DOI: 10.1121/10.0021069] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/18/2023] [Accepted: 08/28/2023] [Indexed: 09/21/2023]
Abstract
Quantifying the factors that predict variability in speech-in-speech recognition represents a fundamental challenge in auditory science. Stimulus factors associated with energetic and informational masking (IM) modulate variability in speech-in-speech recognition, but energetic effects can be difficult to estimate in spectro-temporally dynamic speech maskers. The current experiment characterized the effects of short-term audibility and differences in target and masker location (or perceived location) on the horizontal plane for sentence recognition in two-talker speech. Thirty young adults with normal hearing (NH) participated. Speech reception thresholds and keyword recognition at a fixed signal-to-noise ratio (SNR) were measured in each spatial condition. Short-term audibility for each keyword was quantified using a glimpsing model. Results revealed that speech-in-speech recognition depended on the proportion of audible glimpses available in the target + masker keyword stimulus in each spatial condition, even across stimuli presented at a fixed global SNR. Short-term audibility requirements were greater for colocated than spatially separated speech-in-speech recognition, and keyword recognition improved more rapidly as a function of increases in target audibility with spatial separation. Results indicate that spatial cues enhance glimpsing efficiency in competing speech for young adults with NH and provide a quantitative framework for estimating IM for speech-in-speech recognition in different spatial configurations.
Collapse
Affiliation(s)
- Peter A Wasiuk
- Department of Communication Disorders, 493 Fitch Street, Southern Connecticut State University, New Haven, Connecticut 06515, USA
| | - Lauren Calandruccio
- Department of Psychological Sciences, 11635 Euclid Avenue, Case Western Reserve University, Cleveland, Ohio 44106, USA
| | - Jacob J Oleson
- Department of Biostatistics, 145 North Riverside Drive N300, College of Public Health, University of Iowa, Iowa City, Iowa 52242, USA
| | - Emily Buss
- Department of Otolaryngology/Head and Neck Surgery, 170 Manning Drive, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, USA
| |
Collapse
|
2
|
Vicente T, Buchholz JM, Lavandier M. Modelling binaural unmasking and the intelligibility of speech in noise and reverberation for normal-hearing and hearing-impaired listeners. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2021; 150:3275. [PMID: 34852607 DOI: 10.1121/10.0006736] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/03/2021] [Accepted: 09/26/2021] [Indexed: 05/25/2023]
Abstract
This study investigated the effect of hearing loss on binaural unmasking (BU) for the intelligibility of speech in noise. Speech reception thresholds (SRTs) were measured with normal-hearing (NH) listeners and older mildly hearing-impaired (HI) listeners while varying the presentation level of the stimuli, reverberation, modulation of the noise masker, and spatial separation of the speech and noise sources. On average across conditions, the NH listeners benefited more (by 0.6 dB) from BU than HI listeners. The binaural intelligibility model developed by Vicente, Lavandier, and Buchholz [J. Acoust. Soc. Am. 148, 3305-3317 (2020)] was used to describe the data, accurate predictions were obtained for the conditions considering moderate noise levels [50 and 60 dB sound pressure level (SPL)]. The interaural jitters that were involved in the prediction of BU had to be revised to describe the data measured at a lower level (40 dB SPL). Across all tested conditions, the correlation between the measured and predicted SRTs was 0.92, whereas the mean prediction error was 0.9 dB.
Collapse
Affiliation(s)
- Thibault Vicente
- Department of Linguistics-Audiology, Australian Hearing Hub, Macquarie University, New South Wales, 2109, Australia
| | - Jörg M Buchholz
- Department of Linguistics-Audiology, Australian Hearing Hub, Macquarie University, New South Wales, 2109, Australia
| | - Mathieu Lavandier
- Univ. Lyon, ENTPE, Laboratoire de Tribologie et Dynamique des Systèmes UMR 5513, Rue M. Audin, 69518 Vaulx-en-Velin Cedex, France
| |
Collapse
|
3
|
Patro C, Kreft HA, Wojtczak M. The search for correlates of age-related cochlear synaptopathy: Measures of temporal envelope processing and spatial release from speech-on-speech masking. Hear Res 2021; 409:108333. [PMID: 34425347 PMCID: PMC8424701 DOI: 10.1016/j.heares.2021.108333] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/02/2020] [Revised: 07/17/2021] [Accepted: 08/04/2021] [Indexed: 01/13/2023]
Abstract
Older adults often experience difficulties understanding speech in adverse listening conditions. It has been suggested that for listeners with normal and near-normal audiograms, these difficulties may, at least in part, arise from age-related cochlear synaptopathy. The aim of this study was to assess if performance on auditory tasks relying on temporal envelope processing reveal age-related deficits consistent with those expected from cochlear synaptopathy. Listeners aged 20 to 66 years were tested using a series of psychophysical, electrophysiological, and speech-perception measures using stimulus configurations that promote coding by medium- and low-spontaneous-rate auditory-nerve fibers. Cognitive measures of executive function were obtained to control for age-related cognitive decline. Results from the different tests were not significantly correlated with each other despite a presumed reliance on common mechanisms involved in temporal envelope processing. Only gap-detection thresholds for a tone in noise and spatial release from speech-on-speech masking were significantly correlated with age. Increasing age was related to impaired cognitive executive function. Multivariate regression analyses showed that individual differences in hearing sensitivity, envelope-based measures, and scores from nonauditory cognitive tests did not significantly contribute to the variability in spatial release from speech-on-speech masking for small target/masker spatial separation, while age was a significant contributor.
Collapse
Affiliation(s)
- Chhayakanta Patro
- Department of Psychology, University of Minnesota, N640 Elliott Hall, 75 East River Parkway, Minneapolis, MN 55455, USA.
| | - Heather A Kreft
- Department of Psychology, University of Minnesota, N640 Elliott Hall, 75 East River Parkway, Minneapolis, MN 55455, USA
| | - Magdalena Wojtczak
- Department of Psychology, University of Minnesota, N640 Elliott Hall, 75 East River Parkway, Minneapolis, MN 55455, USA
| |
Collapse
|
4
|
Wasiuk PA, Lavandier M, Buss E, Oleson J, Calandruccio L. The effect of fundamental frequency contour similarity on multi-talker listening in older and younger adults. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2020; 148:3527. [PMID: 33379934 PMCID: PMC7863686 DOI: 10.1121/10.0002661] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
Older adults with hearing loss have greater difficulty recognizing target speech in multi-talker environments than young adults with normal hearing, especially when target and masker speech streams are perceptually similar. A difference in fundamental frequency (f0) contour depth is an effective stream segregation cue for young adults with normal hearing. This study examined whether older adults with varying degrees of sensorineural hearing loss are able to utilize differences in target/masker f0 contour depth to improve speech recognition in multi-talker listening. Speech recognition thresholds (SRTs) were measured for speech mixtures composed of target/masker streams with flat, normal, and exaggerated speaking styles, in which f0 contour depth systematically varied. Computational modeling estimated differences in energetic masking across listening conditions. Young adults had lower SRTs than older adults; a result that was partially explained by differences in audibility predicted by the model. However, audibility differences did not explain why young adults experienced a benefit from mismatched target/masker f0 contour depth, while in most conditions, older adults did not. Reduced ability to use segregation cues (differences in target/masker f0 contour depth), and deficits grouping speech with variable f0 contours likely contribute to difficulties experienced by older adults in challenging acoustic environments.
Collapse
Affiliation(s)
- Peter A Wasiuk
- Department of Psychological Sciences, 11635 Euclid Avenue, Case Western Reserve University, Cleveland, Ohio 44106, USA
| | - Mathieu Lavandier
- Univ. Lyon, ENTPE, Laboratoire Génie Civil et Bâtiment, Rue M. Audin, Vaulx-en-Velin Cedex, 69518, France
| | - Emily Buss
- Department of Otolaryngology/Head and Neck Surgery, University of North Carolina, CB#7070, Chapel Hill, North Carolina 27599, USA
| | - Jacob Oleson
- Department of Biostatistics, N300 CPHB, University of Iowa, 145 North Riverside Drive, Iowa City, Iowa 52242-2007, USA
| | - Lauren Calandruccio
- Department of Psychological Sciences, 11635 Euclid Avenue, Case Western Reserve University, Cleveland, Ohio 44106, USA
| |
Collapse
|
5
|
Glyde H, Dillon H, Young T, Seeto M, Roup C. Determining unilateral or bilateral hearing aid preference in adults: a prospective study. Int J Audiol 2020; 60:341-349. [PMID: 33030067 DOI: 10.1080/14992027.2020.1828629] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
Abstract
OBJECTIVE Despite high rates of bilateral hearing aid fitting globally, a number of adults continue to reject one hearing aid. The current study aimed to identify a clinically suitable tool for determining, pre-fitting, which clients might prefer one hearing aid. Study Sample: Ninety-five new adult hearing aid candidates, aged 49-87 years, were assessed prior to a first hearing aid fitting. Sixty-eight participants adhered to the prescribed protocol for both bilateral and unilateral hearing aid use. DESIGN Performance was assessed on a modified version of the Listening in Spatialised Noise - Sentences test (LiSN-S), the Dichotic Digits difference Test, the Experiential Hearing Aid simulator, and the Grooved Pegboard Test. All participants were fitted bilaterally, but were instructed to alternate between unilateral and bilateral hearing aid use over fourteen weeks post-fitting. Participants' wearing preferences were assessed via a short questionnaire. RESULTS 78% of participants expressed an overall preference for bilateral hearing aid use. Only the LiSN-S bilateral advantage test outcomes significantly correlated with overall wearing preference. CONCLUSIONS Although the LiSN-S bilateral advantage score related to overall wearing preference, the accuracy of the predictor was too low to warrant implementation of this test prior to hearing aid fitting. The current practice of recommending bilateral hearing aid use continues to be the best option for clinicians.
Collapse
Affiliation(s)
- Helen Glyde
- HEARing Cooperative Research Centre, Melbourne, Australia.,National Acoustic Laboratories, Sydney, Australia.,Department of Speech and Hearing Science, University of Manchester, Manchester, United Kingdom
| | - Harvey Dillon
- HEARing Cooperative Research Centre, Melbourne, Australia.,National Acoustic Laboratories, Sydney, Australia.,Department of Speech and Hearing Science, University of Manchester, Manchester, United Kingdom.,Department of Speech and Hearing Science, Macquarie University, Sydney, Australia
| | - Taegan Young
- HEARing Cooperative Research Centre, Melbourne, Australia.,National Acoustic Laboratories, Sydney, Australia
| | - Mark Seeto
- HEARing Cooperative Research Centre, Melbourne, Australia.,National Acoustic Laboratories, Sydney, Australia
| | | |
Collapse
|
6
|
Zhang J, Wang X, Wang NY, Fu X, Gan T, Galvin JJ, Willis S, Xu K, Thomas M, Fu QJ. Tonal Language Speakers Are Better Able to Segregate Competing Speech According to Talker Sex Differences. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2020; 63:2801-2810. [PMID: 32692939 PMCID: PMC7872724 DOI: 10.1044/2020_jslhr-19-00421] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/29/2019] [Revised: 04/01/2020] [Accepted: 05/15/2020] [Indexed: 06/01/2023]
Abstract
Purpose The aim of this study was to compare release from masking (RM) between Mandarin-speaking and English-speaking listeners with normal hearing for competing speech when target-masker sex cues, spatial cues, or both were available. Method Speech recognition thresholds (SRTs) for competing speech were measured in 21 Mandarin-speaking and 15 English-speaking adults with normal hearing using a modified coordinate response measure task. SRTs were measured for target sentences produced by a male talker in the presence of two masker talkers (different male talkers or female talkers). The target sentence was always presented directly in front of the listener, and the maskers were either colocated with the target or were spatially separated from the target (+90°, -90°). Stimuli were presented via headphones and were virtually spatialized using head-related transfer functions. Three masker conditions were used to measure RM relative to the baseline condition: (a) talker sex cues, (b) spatial cues, or (c) combined talker sex and spatial cues. Results The results showed large amounts of RM according to talker sex and/or spatial cues. There was no significant difference in SRTs between Chinese and English listeners for the baseline condition, where no talker sex or spatial cues were available. Furthermore, there was no significant difference in RM between Chinese and English listeners when spatial cues were available. However, RM was significantly larger for Chinese listeners when talker sex cues or combined talker sex and spatial cues were available. Conclusion Listeners who speak a tonal language such as Mandarin Chinese may be able to take greater advantage of talker sex cues than listeners who do not speak a tonal language.
Collapse
Affiliation(s)
- Juan Zhang
- Department of Otolaryngology, Head and Neck Surgery, Beijing Chaoyang Hospital, Capital Medical University, China
| | - Xing Wang
- Department of Otolaryngology, Head and Neck Surgery, Beijing Chaoyang Hospital, Capital Medical University, China
| | - Ning-yu Wang
- Department of Otolaryngology, Head and Neck Surgery, Beijing Chaoyang Hospital, Capital Medical University, China
| | - Xin Fu
- Department of Otolaryngology, Head and Neck Surgery, Beijing Chaoyang Hospital, Capital Medical University, China
| | - Tian Gan
- Department of Otolaryngology, Head and Neck Surgery, Beijing Chaoyang Hospital, Capital Medical University, China
| | | | - Shelby Willis
- Department of Head and Neck Surgery, David Geffen School of Medicine, University of California, Los Angeles
| | - Kevin Xu
- Department of Head and Neck Surgery, David Geffen School of Medicine, University of California, Los Angeles
| | - Mathew Thomas
- Department of Head and Neck Surgery, David Geffen School of Medicine, University of California, Los Angeles
| | - Qian-Jie Fu
- Department of Head and Neck Surgery, David Geffen School of Medicine, University of California, Los Angeles
| |
Collapse
|
7
|
Baltzell LS, Swaminathan J, Cho AY, Lavandier M, Best V. Binaural sensitivity and release from speech-on-speech masking in listeners with and without hearing loss. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2020; 147:1546. [PMID: 32237845 PMCID: PMC7060089 DOI: 10.1121/10.0000812] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/30/2019] [Revised: 02/07/2020] [Accepted: 02/11/2020] [Indexed: 05/29/2023]
Abstract
Listeners with sensorineural hearing loss routinely experience less spatial release from masking (SRM) in speech mixtures than listeners with normal hearing. Hearing-impaired listeners have also been shown to have degraded temporal fine structure (TFS) sensitivity, a consequence of which is degraded access to interaural time differences (ITDs) contained in the TFS. Since these "binaural TFS" cues are critical for spatial hearing, it has been hypothesized that degraded binaural TFS sensitivity accounts for the limited SRM experienced by hearing-impaired listeners. In this study, speech stimuli were noise-vocoded using carriers that were systematically decorrelated across the left and right ears, thus simulating degraded binaural TFS sensitivity. Both (1) ITD sensitivity in quiet and (2) SRM in speech mixtures spatialized using ITDs (or binaural release from masking; BRM) were measured as a function of TFS interaural decorrelation in young normal-hearing and hearing-impaired listeners. This allowed for the examination of the relationship between ITD sensitivity and BRM over a wide range of ITD thresholds. This paper found that, for a given ITD sensitivity, hearing-impaired listeners experienced less BRM than normal-hearing listeners, suggesting that binaural TFS sensitivity can account for only a modest portion of the BRM deficit in hearing-impaired listeners. However, substantial individual variability was observed.
Collapse
Affiliation(s)
- Lucas S Baltzell
- Department of Speech, Language, and Hearing Sciences, Boston University, 635 Commonwealth Avenue, Boston, Massachusetts 02215, USA
| | - Jayaganesh Swaminathan
- Department of Speech, Language, and Hearing Sciences, Boston University, 635 Commonwealth Avenue, Boston, Massachusetts 02215, USA
| | - Adrian Y Cho
- Department of Speech, Language, and Hearing Sciences, Boston University, 635 Commonwealth Avenue, Boston, Massachusetts 02215, USA
| | - Mathieu Lavandier
- University of Lyon, ENTPE, Laboratoire Génie Civil et Bâtiment, Rue Maurice Audin, F-69518 Vaulx-en-Velin Cedex, France
| | - Virginia Best
- Department of Speech, Language, and Hearing Sciences, Boston University, 635 Commonwealth Avenue, Boston, Massachusetts 02215, USA
| |
Collapse
|
8
|
Domingo Y, Holmes E, Macpherson E, Johnsrude IS. Using spatial release from masking to estimate the magnitude of the familiar-voice intelligibility benefit. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2019; 146:3487. [PMID: 31795686 DOI: 10.1121/1.5133628] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/18/2019] [Accepted: 10/23/2019] [Indexed: 06/10/2023]
Abstract
The ability to segregate simultaneous speech streams is crucial for successful communication. Recent studies have demonstrated that participants can report 10%-20% more words spoken by naturally familiar (e.g., friends or spouses) than unfamiliar talkers in two-voice mixtures. This benefit is commensurate with one of the largest benefits to speech intelligibility currently known-that which is gained by spatially separating two talkers. However, because of differences in the methods of these previous studies, the relative benefits of spatial separation and voice familiarity are unclear. Here, the familiar-voice benefit and spatial release from masking are directly compared, and it is examined if and how these two cues interact with one another. Talkers were recorded while speaking sentences from a published closed-set "matrix" task, and then listeners were presented with three different sentences played simultaneously. Each target sentence was played at 0° azimuth, and two masker sentences were symmetrically separated about the target. On average, participants reported 10%-30% more words correctly when the target sentence was spoken in a familiar than unfamiliar voice (collapsed over spatial separation conditions); it was found that participants gain a similar benefit from a familiar target as when an unfamiliar voice is separated from two symmetrical maskers by approximately 15° azimuth.
Collapse
Affiliation(s)
- Ysabel Domingo
- Brain and Mind Institute, University of Western Ontario, London, Ontario, Canada
| | - Emma Holmes
- Brain and Mind Institute, University of Western Ontario, London, Ontario, Canada
| | - Ewan Macpherson
- School of Communication Sciences and Disorders, University of Western Ontario, London, Ontario, Canada
| | - Ingrid S Johnsrude
- Brain and Mind Institute, University of Western Ontario, London, Ontario, Canada
| |
Collapse
|
9
|
Best V, Roverud E, Baltzell L, Rennies J, Lavandier M. The importance of a broad bandwidth for understanding "glimpsed" speech. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2019; 146:3215. [PMID: 31795657 PMCID: PMC6847933 DOI: 10.1121/1.5131651] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
When a target talker speaks in the presence of competing talkers, the listener must not only segregate the voices but also understand the target message based on a limited set of spectrotemporal regions ("glimpses") in which the target voice dominates the acoustic mixture. Here, the hypothesis that a broad audible bandwidth is more critical for these sparse representations of speech than it is for intact speech is tested. Listeners with normal hearing were presented with sentences that were either intact, or progressively "glimpsed" according to a competing two-talker masker presented at various levels. This was achieved by using an ideal binary mask to exclude time-frequency units in the target that would be dominated by the masker in the natural mixture. In each glimpsed condition, speech intelligibility was measured for a range of low-pass conditions (cutoff frequencies from 500 to 8000 Hz). Intelligibility was poorer for sparser speech, and the bandwidth required for optimal intelligibility increased with the sparseness of the speech. The combined effects of glimpsing and bandwidth reduction were well captured by a simple metric based on the proportion of audible target glimpses retained. The findings may be relevant for understanding the impact of high-frequency hearing loss on everyday speech communication.
Collapse
Affiliation(s)
- Virginia Best
- Department of Speech, Language and Hearing Sciences, Boston University, 635 Commonwealth Avenue, Boston, Massachusetts 02215, USA
| | - Elin Roverud
- Department of Speech, Language and Hearing Sciences, Boston University, 635 Commonwealth Avenue, Boston, Massachusetts 02215, USA
| | - Lucas Baltzell
- Department of Speech, Language and Hearing Sciences, Boston University, 635 Commonwealth Avenue, Boston, Massachusetts 02215, USA
| | - Jan Rennies
- Department of Speech, Language and Hearing Sciences, Boston University, 635 Commonwealth Avenue, Boston, Massachusetts 02215, USA
| | - Mathieu Lavandier
- Department of Speech, Language and Hearing Sciences, Boston University, 635 Commonwealth Avenue, Boston, Massachusetts 02215, USA
| |
Collapse
|
10
|
Evaluating the Performance of a Visually Guided Hearing Aid Using a Dynamic Auditory-Visual Word Congruence Task. Ear Hear 2019; 39:756-769. [PMID: 29252977 DOI: 10.1097/aud.0000000000000532] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
OBJECTIVES The "visually guided hearing aid" (VGHA), consisting of a beamforming microphone array steered by eye gaze, is an experimental device being tested for effectiveness in laboratory settings. Previous studies have found that beamforming without visual steering can provide significant benefits (relative to natural binaural listening) for speech identification in spatialized speech or noise maskers when sound sources are fixed in location. The aim of the present study was to evaluate the performance of the VGHA in listening conditions in which target speech could switch locations unpredictably, requiring visual steering of the beamforming. To address this aim, the present study tested an experimental simulation of the VGHA in a newly designed dynamic auditory-visual word congruence task. DESIGN Ten young normal-hearing (NH) and 11 young hearing-impaired (HI) adults participated. On each trial, three simultaneous spoken words were presented from three source positions (-30, 0, and 30 azimuth). An auditory-visual word congruence task was used in which participants indicated whether there was a match between the word printed on a screen at a location corresponding to the target source and the spoken target word presented acoustically from that location. Performance was compared for a natural binaural condition (stimuli presented using impulse responses measured on KEMAR), a simulated VGHA condition (BEAM), and a hybrid condition that combined lowpass-filtered KEMAR and highpass-filtered BEAM information (BEAMAR). In some blocks, the target remained fixed at one location across trials, and in other blocks, the target could transition in location between one trial and the next with a fixed but low probability. RESULTS Large individual variability in performance was observed. There were significant benefits for the hybrid BEAMAR condition relative to the KEMAR condition on average for both NH and HI groups when the targets were fixed. Although not apparent in the averaged data, some individuals showed BEAM benefits relative to KEMAR. Under dynamic conditions, BEAM and BEAMAR performance dropped significantly immediately following a target location transition. However, performance recovered by the second word in the sequence and was sustained until the next transition. CONCLUSIONS When performance was assessed using an auditory-visual word congruence task, the benefits of beamforming reported previously were generally preserved under dynamic conditions in which the target source could move unpredictably from one location to another (i.e., performance recovered rapidly following source transitions) while the observer steered the beamforming via eye gaze, for both young NH and young HI groups.
Collapse
|
11
|
Kidd G, Mason CR, Best V, Roverud E, Swaminathan J, Jennings T, Clayton K, Steven Colburn H. Determining the energetic and informational components of speech-on-speech masking in listeners with sensorineural hearing loss. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2019; 145:440. [PMID: 30710924 PMCID: PMC6347574 DOI: 10.1121/1.5087555] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/20/2018] [Revised: 11/19/2018] [Accepted: 12/18/2018] [Indexed: 05/20/2023]
Abstract
The ability to identify the words spoken by one talker masked by two or four competing talkers was tested in young-adult listeners with sensorineural hearing loss (SNHL). In a reference/baseline condition, masking speech was colocated with target speech, target and masker talkers were female, and the masker was intelligible. Three comparison conditions included replacing female masker talkers with males, time-reversal of masker speech, and spatial separation of sources. All three variables produced significant release from masking. To emulate energetic masking (EM), stimuli were subjected to ideal time-frequency segregation retaining only the time-frequency units where target energy exceeded masker energy. Subjects were then tested with these resynthesized "glimpsed stimuli." For either two or four maskers, thresholds only varied about 3 dB across conditions suggesting that EM was roughly equal. Compared to normal-hearing listeners from an earlier study [Kidd, Mason, Swaminathan, Roverud, Clayton, and Best, J. Acoust. Soc. Am. 140, 132-144 (2016)], SNHL listeners demonstrated both greater energetic and informational masking as well as higher glimpsed thresholds. Individual differences were correlated across masking release conditions suggesting that listeners could be categorized according to their general ability to solve the task. Overall, both peripheral and central factors appear to contribute to the higher thresholds for SNHL listeners.
Collapse
Affiliation(s)
- Gerald Kidd
- Department of Speech, Language and Hearing Sciences, Boston University, Boston, Massachusetts 02215, USA
| | - Christine R Mason
- Department of Speech, Language and Hearing Sciences, Boston University, Boston, Massachusetts 02215, USA
| | - Virginia Best
- Department of Speech, Language and Hearing Sciences, Boston University, Boston, Massachusetts 02215, USA
| | - Elin Roverud
- Department of Speech, Language and Hearing Sciences, Boston University, Boston, Massachusetts 02215, USA
| | - Jayaganesh Swaminathan
- Department of Speech, Language and Hearing Sciences, Boston University, Boston, Massachusetts 02215, USA
| | - Todd Jennings
- Department of Speech, Language and Hearing Sciences, Boston University, Boston, Massachusetts 02215, USA
| | - Kameron Clayton
- Department of Speech, Language and Hearing Sciences, Boston University, Boston, Massachusetts 02215, USA
| | - H Steven Colburn
- Department of Biomedical Engineering, Boston University, Boston, Massachusetts 02215, USA
| |
Collapse
|
12
|
Rana B, Buchholz JM. Effect of improving audibility on better-ear glimpsing using non-linear amplification. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2018; 144:3465. [PMID: 30599669 DOI: 10.1121/1.5083823] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/02/2017] [Accepted: 11/30/2018] [Indexed: 06/09/2023]
Abstract
Better-ear glimpsing (BEG) utilizes interaural level differences (ILDs) to improve speech intelligibility in noise. This spatial benefit is reduced in most hearing-impaired (HI) listeners due to their increased hearing loss at high frequencies. Even though this benefit can be improved by providing increased amplification, the improvement is limited by loudness discomfort. An alternative solution therefore extends ILDs to low frequencies, which has been shown to provide a substantial benefit from BEG. In contrast to previous studies, which only applied linear stimulus manipulations, wide dynamic range compression was applied here to improve the audibility of soft sounds while ensuring loudness comfort for loud sounds. Performance in both speech intelligibility and BEG was measured in 13 HI listeners at three different masker levels and for different interaural stimulus manipulations. The results revealed that at low signal levels, performance substantially improved with increasing masker level, but this improvement was reduced by the compressive behaviour at higher levels. Moreover, artificially extending ILDs by applying infinite (broadband) ILDs provided an extra spatial benefit in speech reception thresholds of up to 5 dB on top of that already provided by natural ILDs and interaural time differences, which increased with increasing signal level.
Collapse
Affiliation(s)
- Baljeet Rana
- Department of Linguistics, 16 University Avenue, Macquarie University, NSW 2109, Australia
| | - Jörg M Buchholz
- Department of Linguistics, 16 University Avenue, Macquarie University, NSW 2109, Australia
| |
Collapse
|
13
|
Hu H, Dietz M, Williges B, Ewert SD. Better-ear glimpsing with symmetrically-placed interferers in bilateral cochlear implant users. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2018; 143:2128. [PMID: 29716260 DOI: 10.1121/1.5030918] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
For a frontal target in spatially symmetrically placed interferers, normal hearing (NH) listeners can use "better-ear glimpsing" to select time-frequency segments with favorable signal-to-noise ratio in either ear. With an ideal monaural better-ear mask (IMBM) processing, some studies showed that NH listeners can reach similar performance as in the natural binaural listening condition, although interaural phase differences at low frequencies can further improve performance. In principle, bilateral cochlear implant (BiCI) listeners could use the same better-ear glimpsing, albeit without exploiting interaural phase differences. Speech reception thresholds of NH and BiCI listeners were measured in three interferers (speech-shaped stationary noise, nonsense speech, or single talker) either co-located with the target, symmetrically placed at ±60°, or independently presented to each ear, with and without IMBM processing. Furthermore, a bilateral noise vocoder based on the BiCI electrodogram was used in the same NH listeners. Headphone presentation and direct stimulation with head-related transfer functions for spatialization were used in NH and BiCI listeners, respectively. Compared to NH listeners, both NH listeners with vocoder and BiCI listeners showed strongly reduced binaural benefit from spatial separation. However, both groups greatly benefited from IMBM processing as part of the stimulation strategy.
Collapse
Affiliation(s)
- Hongmei Hu
- Medizinische Physik, Carl von Ossietzky Universität Oldenburg and Cluster of Excellence "Hearing4all," Küpkersweg 74, 26129, Oldenburg, Germany
| | - Mathias Dietz
- Medizinische Physik, Carl von Ossietzky Universität Oldenburg and Cluster of Excellence "Hearing4all," Küpkersweg 74, 26129, Oldenburg, Germany
| | - Ben Williges
- Medizinische Physik, Carl von Ossietzky Universität Oldenburg and Cluster of Excellence "Hearing4all," Küpkersweg 74, 26129, Oldenburg, Germany
| | - Stephan D Ewert
- Medizinische Physik, Carl von Ossietzky Universität Oldenburg and Cluster of Excellence "Hearing4all," Küpkersweg 74, 26129, Oldenburg, Germany
| |
Collapse
|
14
|
Rana B, Buchholz JM. Effect of audibility on better-ear glimpsing as a function of frequency in normal-hearing and hearing-impaired listeners. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2018; 143:2195. [PMID: 29716302 DOI: 10.1121/1.5031007] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/25/2023]
Abstract
Better-ear glimpsing (BEG) is an auditory phenomenon that helps understanding speech in noise by utilizing interaural level differences (ILDs). The benefit provided by BEG is limited in hearing-impaired (HI) listeners by reduced audibility at high frequencies. Rana and Buchholz [(2016). J. Acoust. Soc. Am. 140(2), 1192-1205] have shown that artificially enhancing ILDs at low and mid frequencies can help HI listeners understanding speech in noise, but the achieved benefit is smaller than in normal-hearing (NH) listeners. To understand how far this difference is explained by differences in audibility, audibility was carefully controlled here in ten NH and ten HI listeners and speech reception thresholds (SRTs) in noise were measured in a spatially separated and co-located condition as a function of frequency and sensation level. Maskers were realized by noise-vocoded speech and signals were spatialized using artificially generated broadband ILDs. The spatial benefit provided by BEG and SRTs improved consistently with increasing sensation level, but was limited in the HI listeners by loudness discomfort. Further, the HI listeners performed similar to NH listeners when differences in audibility were compensated. The results help to understand the hearing aid gain that is required to maximize the spatial benefit provided by ILDs as a function of frequency.
Collapse
Affiliation(s)
- Baljeet Rana
- National Acoustic Laboratories, 16 University Avenue, Macquarie University, Sydney, New South Wales 2109, Australia
| | - Jörg M Buchholz
- National Acoustic Laboratories, 16 University Avenue, Macquarie University, Sydney, New South Wales 2109, Australia
| |
Collapse
|
15
|
Best V, Roverud E, Mason CR, Kidd G. Examination of a hybrid beamformer that preserves auditory spatial cues. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2017; 142:EL369. [PMID: 29092558 PMCID: PMC5724719 DOI: 10.1121/1.5007279] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/03/2017] [Revised: 08/31/2017] [Accepted: 09/30/2017] [Indexed: 06/01/2023]
Abstract
A hearing-aid strategy that combines a beamforming microphone array in the high frequencies with natural binaural signals in the low frequencies was examined. This strategy attempts to balance the benefits of beamforming (improved signal-to-noise ratio) with the benefits of binaural listening (spatial awareness and location-based segregation). The crossover frequency was varied from 200 to 1200 Hz, and performance was compared to full-spectrum binaural and beamformer conditions. Speech intelligibility in the presence of noise or competing speech was measured in listeners with and without hearing loss. Results showed that the optimal crossover frequency depended on the listener and the nature of the interference.
Collapse
MESH Headings
- Acoustic Stimulation
- Adult
- Audiometry, Speech
- Case-Control Studies
- Comprehension
- Correction of Hearing Impairment/instrumentation
- Cues
- Equipment Design
- Female
- Hearing
- Hearing Aids
- Hearing Loss, Bilateral/diagnosis
- Hearing Loss, Bilateral/physiopathology
- Hearing Loss, Bilateral/psychology
- Hearing Loss, Bilateral/rehabilitation
- Hearing Loss, Sensorineural/diagnosis
- Hearing Loss, Sensorineural/physiopathology
- Hearing Loss, Sensorineural/psychology
- Hearing Loss, Sensorineural/rehabilitation
- Humans
- Male
- Noise/adverse effects
- Perceptual Masking
- Persons With Hearing Impairments/psychology
- Persons With Hearing Impairments/rehabilitation
- Signal Processing, Computer-Assisted
- Sound Localization
- Speech Intelligibility
- Speech Perception
Collapse
Affiliation(s)
- Virginia Best
- Department of Speech, Language and Hearing Sciences, Boston University, Boston, Massachusetts 02215, USA , , ,
| | - Elin Roverud
- Department of Speech, Language and Hearing Sciences, Boston University, Boston, Massachusetts 02215, USA , , ,
| | - Christine R Mason
- Department of Speech, Language and Hearing Sciences, Boston University, Boston, Massachusetts 02215, USA , , ,
| | - Gerald Kidd
- Department of Speech, Language and Hearing Sciences, Boston University, Boston, Massachusetts 02215, USA , , ,
| |
Collapse
|
16
|
King A, Hopkins K, Plack CJ, Pontoppidan NH, Bramsløw L, Hietkamp RK, Vatti M, Hafez A. The effect of tone-vocoding on spatial release from masking for old, hearing-impaired listeners. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2017; 141:2591. [PMID: 28464637 DOI: 10.1121/1.4979593] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]
Abstract
Old, hearing-impaired listeners generally benefit little from lateral separation of multiple talkers when listening to one of them. This study aimed to determine how spatial release from masking (SRM) in such listeners is affected when the interaural time differences (ITDs) in the temporal fine structure (TFS) are manipulated by tone-vocoding (TVC) at the ears by a master hearing aid system. Word recall was compared, with and without TVC, when target and masker sentences from a closed set were played simultaneously from the front loudspeaker (co-located) and when the maskers were played 45° to the left and right of the listener (separated). For 20 hearing-impaired listeners aged 64 to 86, SRM was 3.7 dB smaller with TVC than without TVC. This difference in SRM correlated with mean audiometric thresholds below 1.5 kHz, even when monaural TFS sensitivity (discrimination of frequency-shifts in identically filtered complexes) was partialed out, suggesting that low-frequency audiometric thresholds may be a good indicator of candidacy for hearing aids that preserve ITDs. The TVC difference in SRM was not correlated with age, pure-tone ITD thresholds, nor fundamental frequency difference limens, and only with monaural TFS sensitivity before control for low-frequency audiometric thresholds.
Collapse
MESH Headings
- Acoustic Stimulation
- Age Factors
- Aged
- Aged, 80 and over
- Aging/psychology
- Audiometry, Pure-Tone
- Audiometry, Speech
- Auditory Threshold
- Correction of Hearing Impairment/instrumentation
- Cues
- Female
- Hearing
- Hearing Aids
- Hearing Loss, Bilateral/diagnosis
- Hearing Loss, Bilateral/physiopathology
- Hearing Loss, Bilateral/psychology
- Hearing Loss, Bilateral/rehabilitation
- Hearing Loss, Sensorineural/diagnosis
- Hearing Loss, Sensorineural/physiopathology
- Hearing Loss, Sensorineural/psychology
- Hearing Loss, Sensorineural/rehabilitation
- Humans
- Male
- Middle Aged
- Perceptual Masking
- Persons With Hearing Impairments/psychology
- Persons With Hearing Impairments/rehabilitation
- Pitch Discrimination
- Psychoacoustics
- Signal Processing, Computer-Assisted
- Sound Localization
- Speech Perception
Collapse
Affiliation(s)
- Andrew King
- Manchester Centre for Audiology and Deafness, University of Manchester, Manchester Academic Health Science Centre, Manchester, United Kingdom
| | - Kathryn Hopkins
- Manchester Centre for Audiology and Deafness, University of Manchester, Manchester Academic Health Science Centre, Manchester, United Kingdom
| | - Christopher J Plack
- Manchester Centre for Audiology and Deafness, University of Manchester, Manchester Academic Health Science Centre, Manchester, United Kingdom
| | | | - Lars Bramsløw
- Eriksholm Research Centre, Oticon A/S, Rørtangvej 20, Snekkersten, Denmark
| | - Renskje K Hietkamp
- Eriksholm Research Centre, Oticon A/S, Rørtangvej 20, Snekkersten, Denmark
| | - Marianna Vatti
- Eriksholm Research Centre, Oticon A/S, Rørtangvej 20, Snekkersten, Denmark
| | - Atefeh Hafez
- Eriksholm Research Centre, Oticon A/S, Rørtangvej 20, Snekkersten, Denmark
| |
Collapse
|
17
|
Westermann A, Buchholz JM. The effect of hearing loss on source-distance dependent speech intelligibility in rooms. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2017; 141:EL140. [PMID: 28253708 DOI: 10.1121/1.4976191] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/12/2016] [Revised: 12/20/2016] [Accepted: 12/24/2016] [Indexed: 05/25/2023]
Abstract
Westermann and Buchholz [(2015). J. Acoust. Soc. Am. 137(2), 757-767] found substantial improvements in speech reception thresholds (SRTs) for normal hearing listeners in a reverberant auditorium when the target talker was separated in distance from a two-talker masker. This study applied similar methodology, but tested listeners with a hearing impairment. On average, the participants received a 7 dB benefit in SRTs when the target was fixed at 0.5 m and the masker was moved from 0.5 to 10 m. But when the target was moved away, the SRTs increased by 5 dB. This indicates that hearing impaired listeners have difficulties suppressing nearby maskers while focusing attention on a far target.
Collapse
Affiliation(s)
- A Westermann
- National Acoustic Laboratories, Australian Hearing, 16 University Avenue, Macquarie University, New South Wales 2109, Australia ,
| | - J M Buchholz
- National Acoustic Laboratories, Australian Hearing, 16 University Avenue, Macquarie University, New South Wales 2109, Australia ,
| |
Collapse
|
18
|
Best V, Mason CR, Swaminathan J, Roverud E, Kidd G. Use of a glimpsing model to understand the performance of listeners with and without hearing loss in spatialized speech mixtures. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2017; 141:81. [PMID: 28147587 PMCID: PMC5392092 DOI: 10.1121/1.4973620] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/13/2023]
Abstract
In many situations, listeners with sensorineural hearing loss demonstrate reduced spatial release from masking compared to listeners with normal hearing. This deficit is particularly evident in the "symmetric masker" paradigm in which competing talkers are located to either side of a central target talker. However, there is some evidence that reduced target audibility (rather than a spatial deficit per se) under conditions of spatial separation may contribute to the observed deficit. In this study a simple "glimpsing" model (applied separately to each ear) was used to isolate the target information that is potentially available in binaural speech mixtures. Intelligibility of these glimpsed stimuli was then measured directly. Differences between normally hearing and hearing-impaired listeners observed in the natural binaural condition persisted for the glimpsed condition, despite the fact that the task no longer required segregation or spatial processing. This result is consistent with the idea that the performance of listeners with hearing loss in the spatialized mixture was limited by their ability to identify the target speech based on sparse glimpses, possibly as a result of some of those glimpses being inaudible.
Collapse
Affiliation(s)
- Virginia Best
- Department of Speech, Language, and Hearing Sciences, Boston University, 635 Commonwealth Avenue, Boston, Massachusetts 02215, USA
| | - Christine R Mason
- Department of Speech, Language, and Hearing Sciences, Boston University, 635 Commonwealth Avenue, Boston, Massachusetts 02215, USA
| | - Jayaganesh Swaminathan
- Department of Speech, Language, and Hearing Sciences, Boston University, 635 Commonwealth Avenue, Boston, Massachusetts 02215, USA
| | - Elin Roverud
- Department of Speech, Language, and Hearing Sciences, Boston University, 635 Commonwealth Avenue, Boston, Massachusetts 02215, USA
| | - Gerald Kidd
- Department of Speech, Language, and Hearing Sciences, Boston University, 635 Commonwealth Avenue, Boston, Massachusetts 02215, USA
| |
Collapse
|
19
|
On the Contribution of Target Audibility to Performance in Spatialized Speech Mixtures. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2016. [PMID: 27080649 DOI: 10.1007/978-3-319-25474-6_10] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register]
Abstract
Hearing loss has been shown to reduce speech understanding in spatialized multitalker listening situations, leading to the common belief that spatial processing is disrupted by hearing loss. This paper describes related studies from three laboratories that explored the contribution of reduced target audibility to this deficit. All studies used a stimulus configuration in which a speech target presented from the front was masked by speech maskers presented symmetrically from the sides. Together these studies highlight the importance of adequate stimulus audibility for optimal performance in spatialized speech mixtures and suggest that reduced access to target speech information might explain a substantial portion of the "spatial" deficit observed in listeners with hearing loss.
Collapse
|
20
|
Rana B, Buchholz JM. Better-ear glimpsing at low frequencies in normal-hearing and hearing-impaired listeners. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2016; 140:1192. [PMID: 27586748 DOI: 10.1121/1.4961006] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
Better-ear glimpsing is an auditory process that takes advantage of short-term interaural level differences (ILDs) to improve the understanding of speech in spatial fluctuating noise. Since ILDs are mainly present at high frequencies, where most hearing-impaired (HI) listeners have the strongest hearing loss, HI individuals cannot fully utilize ILDs for better-ear glimpsing, which may lead to poorer understanding of speech in noise. This problem may be alleviated by hearing aids that artificially generate ILDs at low frequencies where hearing is typically less impaired. The present study therefore investigated the spatial benefit in speech intelligibility that is provided by better-ear glimpsing with low-frequency extended ILDs in a symmetric two-distractor speech background. Speech reception thresholds were measured in a spatially co-located and separated condition as a function of frequency region in ten normal-hearing (NH) and ten mild-to-moderate sensorineural HI subjects. In both groups the extended ILDs provided a substantial spatial advantage on top of the advantage already provided by natural ILDs. Moreover, the spatial advantage was largely independent of frequency region, suggesting that both NH and HI subjects can utilize low-frequency ILDs for improving speech understanding in noise. Overall performance as well as spatial advantage was reduced in the HI group.
Collapse
Affiliation(s)
- Baljeet Rana
- National Acoustic Laboratories, 16 University Avenue, Macquarie University, Sydney, New South Wales 2109, Australia
| | - Jörg M Buchholz
- National Acoustic Laboratories, 16 University Avenue, Macquarie University, Sydney, New South Wales 2109, Australia
| |
Collapse
|