1
Nagels L, Gaudrain E, Vickers D, Hendriks P, Başkent D. Prelingually Deaf Children With Cochlear Implants Show Better Perception of Voice Cues and Speech in Competing Speech Than Postlingually Deaf Adults With Cochlear Implants. Ear Hear 2024; 45:952-968. [PMID: 38616318 PMCID: PMC11175806 DOI: 10.1097/aud.0000000000001489] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/29/2022] [Accepted: 01/10/2024] [Indexed: 04/16/2024]
Abstract
OBJECTIVES Postlingually deaf adults with cochlear implants (CIs) have difficulties with perceiving differences in speakers' voice characteristics and benefit little from voice differences for the perception of speech in competing speech. However, not much is known yet about the perception and use of voice characteristics in prelingually deaf implanted children with CIs. Unlike CI adults, most CI children became deaf during the acquisition of language. Extensive neuroplastic changes during childhood could make CI children better at using the available acoustic cues than CI adults, or the lack of exposure to a normal acoustic speech signal could make it more difficult for them to learn which acoustic cues they should attend to. This study aimed to examine to what degree CI children can perceive voice cues and benefit from voice differences for perceiving speech in competing speech, comparing their abilities to those of normal-hearing (NH) children and CI adults. DESIGN CI children's voice cue discrimination (experiment 1), voice gender categorization (experiment 2), and benefit from target-masker voice differences for perceiving speech in competing speech (experiment 3) were examined in three experiments. The main focus was on the perception of mean fundamental frequency (F0) and vocal-tract length (VTL), the primary acoustic cues related to speakers' anatomy and perceived voice characteristics, such as voice gender. RESULTS CI children's F0 and VTL discrimination thresholds indicated lower sensitivity to differences compared with their NH-age-equivalent peers, but their mean discrimination thresholds of 5.92 semitones (st) for F0 and 4.10 st for VTL indicated higher sensitivity than postlingually deaf CI adults with mean thresholds of 9.19 st for F0 and 7.19 st for VTL. Furthermore, CI children's perceptual weighting of F0 and VTL cues for voice gender categorization closely resembled that of their NH-age-equivalent peers, in contrast with CI adults. 
Finally, CI children had more difficulties in perceiving speech in competing speech than their NH-age-equivalent peers, but they performed better than CI adults. Unlike CI adults, CI children showed a benefit from target-masker voice differences in F0 and VTL, similar to NH children. CONCLUSION Although CI children's F0 and VTL voice discrimination scores were overall lower than those of NH children, their weighting of F0 and VTL cues for voice gender categorization and their benefit from target-masker differences in F0 and VTL resembled that of NH children. Together, these results suggest that prelingually deaf implanted CI children can effectively utilize spectrotemporally degraded F0 and VTL cues for voice and speech perception, generally outperforming postlingually deaf CI adults in comparable tasks. These findings underscore the presence of F0 and VTL cues in the CI signal to a certain degree and suggest other factors contributing to the perception challenges faced by CI adults.
Affiliation(s)
- Leanne Nagels
- Center for Language and Cognition Groningen (CLCG), University of Groningen, Groningen, The Netherlands
- Department of Otorhinolaryngology/Head and Neck Surgery, University Medical Center Groningen, University of Groningen, Groningen, The Netherlands
- Research School of Behavioural and Cognitive Neurosciences, University of Groningen, Groningen, The Netherlands
- Etienne Gaudrain
- Department of Otorhinolaryngology/Head and Neck Surgery, University Medical Center Groningen, University of Groningen, Groningen, The Netherlands
- Research School of Behavioural and Cognitive Neurosciences, University of Groningen, Groningen, The Netherlands
- CNRS UMR 5292, Lyon Neuroscience Research Center, Auditory Cognition and Psychoacoustics, Inserm UMRS 1028, Université Claude Bernard Lyon 1, Université de Lyon, Lyon, France
- Deborah Vickers
- Cambridge Hearing Group, Sound Lab, Clinical Neurosciences Department, University of Cambridge, Cambridge, United Kingdom
- Petra Hendriks
- Center for Language and Cognition Groningen (CLCG), University of Groningen, Groningen, The Netherlands
- Research School of Behavioural and Cognitive Neurosciences, University of Groningen, Groningen, The Netherlands
- Deniz Başkent
- Department of Otorhinolaryngology/Head and Neck Surgery, University Medical Center Groningen, University of Groningen, Groningen, The Netherlands
- Research School of Behavioural and Cognitive Neurosciences, University of Groningen, Groningen, The Netherlands
- W.J. Kolff Institute for Biomedical Engineering and Materials Science, University Medical Center Groningen, University of Groningen, Groningen, The Netherlands
2
Alfakhri M, Campbell N, Lineton B, Verschuur C. Integrated bimodal fitting and binaural streaming technology outcomes for unilateral cochlear implant users. Int J Audiol 2024:1-10. [PMID: 38701176 DOI: 10.1080/14992027.2024.2341954] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/12/2023] [Accepted: 04/05/2024] [Indexed: 05/05/2024]
Abstract
OBJECTIVE Adults typically receive only one cochlear implant (CI) due to cost constraints, with a contralateral hearing aid recommended when there is aidable hearing. Standard hearing aids differ from a CI in terms of processing strategy and function as a separate entity, requiring the user to integrate the disparate signals. Integrated bimodal technology has recently been introduced to address this challenge. The aim of the study was to investigate the performance of unilateral CI users with and without an integrated bimodal fitting and determine whether binaural streaming technology offers additional benefit. STUDY SAMPLE Twenty-six CI users using integrated bimodal technology. DESIGN Repeated measures where outcomes and user experience were assessed using a functional test battery more representative of real life listening (speech perception in noise tests, localisation test, tracking test) and the speech, spatial and qualities-of-hearing scale (SSQ). RESULTS Bimodal outcomes were significantly better than for CI alone. Speech perception in noise improvements ranged from 1.4 dB to 3.5 dB depending on the location of speech and noise. The localisation and tracking tests, and the SSQ also showed significant improvements. Binaural streaming offered additional improvement (1.2 dB to 6.1 dB on the different speech tests). CONCLUSIONS Integrated bimodal and binaural streaming technology improved the performance of unilateral CI users.
Affiliation(s)
- Manal Alfakhri
- Institute of Sound and Vibration Research, Faculty of Engineering and Physical Sciences, University of Southampton, Southampton, UK
- Auditory Implant Service, University of Southampton, Southampton, UK
- Health Rehabilitation Department, College of Applied Medical Science, King Saud University, Riyadh, Saudi Arabia
- Nicole Campbell
- Institute of Sound and Vibration Research, Faculty of Engineering and Physical Sciences, University of Southampton, Southampton, UK
- Auditory Implant Service, University of Southampton, Southampton, UK
- Ben Lineton
- Institute of Sound and Vibration Research, Faculty of Engineering and Physical Sciences, University of Southampton, Southampton, UK
- Carl Verschuur
- Institute of Sound and Vibration Research, Faculty of Engineering and Physical Sciences, University of Southampton, Southampton, UK
- Auditory Implant Service, University of Southampton, Southampton, UK
3
Hey M, Mewes A, Hocke T. Speech comprehension in noise-considerations for ecologically valid assessment of communication skills ability with cochlear implants. HNO 2023; 71:26-34. [PMID: 36480047 PMCID: PMC10409840 DOI: 10.1007/s00106-022-01232-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Accepted: 10/10/2022] [Indexed: 12/13/2022]
Abstract
BACKGROUND Nowadays, cochlear implant (CI) patients mostly show good to very good speech comprehension in quiet, but there are known problems with communication in everyday noisy situations. There is thus a need for ecologically valid measurements of speech comprehension in real-life listening situations for hearing-impaired patients. The additional methodological effort must be balanced with clinical human and spatial resources. This study investigates possible simplifications of a complex measurement setup. METHODS The study included 20 adults from long-term follow-up after CI fitting with postlingual onset of hearing impairment. The complexity of the investigated listening situations was influenced by changing the spatiality of the noise sources and the temporal characteristics of the noise. To compare different measurement setups, speech reception thresholds (SRT) were measured unilaterally with different CI processors and settings. Ten normal-hearing subjects served as reference. RESULTS In a complex listening situation with four loudspeakers, differences in SRT from CI subjects to the control group of up to 8 dB were found. For CI subjects, this SRT correlated with the situation with frontal speech signal and fluctuating interference signal from the side with R² = 0.69. For conditions with stationary interfering signals, R² values < 0.2 were found. CONCLUSION There is no universal solution for all audiometric questions with respect to the spatiality and temporal characteristics of noise sources. In the investigated context, simplification of the complex spatial audiometric setting while using fluctuating competing signals was possible.
Affiliation(s)
- Matthias Hey
- Department of Otorhinolaryngology, Head and Neck Surgery, Audiology, UKSH, Campus Kiel, Arnold-Heller-Straße 14, 24105, Kiel, Germany.
- Alexander Mewes
- Department of Otorhinolaryngology, Head and Neck Surgery, Audiology, UKSH, Campus Kiel, Arnold-Heller-Straße 14, 24105, Kiel, Germany
4
Hey M, Mewes A, Hocke T. [Speech comprehension in noise-considerations for ecologically valid assessment of communication skills ability with cochlear implants. German version]. HNO 2022; 70:861-869. [PMID: 36301326 PMCID: PMC9691490 DOI: 10.1007/s00106-022-01234-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Accepted: 09/01/2022] [Indexed: 11/25/2022]
Abstract
BACKGROUND Nowadays, cochlear implant (CI) patients mostly show good to very good speech comprehension in quiet, but there are known problems with communication in everyday noisy situations. There is thus a need for ecologically valid measurements of speech comprehension in real-life listening situations for hearing-impaired patients. The additional methodological effort must be balanced with clinical human and spatial resources. This study investigates possible simplifications of a complex measurement setup. METHODS The study included 20 adults from long-term follow-up after CI fitting with postlingual onset of hearing impairment. The complexity of the investigated listening situations was influenced by changing the spatiality of the noise sources and the temporal characteristics of the noise. To compare different measurement setups, speech reception thresholds (SRT) were measured unilaterally with different CI processors and settings. Ten normal-hearing subjects served as reference. RESULTS In a complex listening situation with four loudspeakers, differences in SRT from CI subjects to the control group of up to 8 dB were found. For CI subjects, this SRT correlated with the situation with frontal speech signal and fluctuating interference signal from the side with R² = 0.69. For conditions with stationary interfering signals, R² values < 0.2 were found. CONCLUSION There is no universal solution for all audiometric questions with respect to the spatiality and temporal characteristics of noise sources. In the investigated context, simplification of the complex spatial audiometric setting while using fluctuating competing signals was possible.
Affiliation(s)
- Matthias Hey
- Department of Otorhinolaryngology, Head and Neck Surgery, Audiology, UKSH, Campus Kiel, Arnold-Heller-Straße 14, 24105, Kiel, Germany
- Alexander Mewes
- Department of Otorhinolaryngology, Head and Neck Surgery, Audiology, UKSH, Campus Kiel, Arnold-Heller-Straße 14, 24105, Kiel, Germany
5
Huet MP, Micheyl C, Parizet E, Gaudrain E. Behavioral Account of Attended Stream Enhances Neural Tracking. Front Neurosci 2021; 15:674112. [PMID: 34966252 PMCID: PMC8710602 DOI: 10.3389/fnins.2021.674112] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 02/28/2021] [Accepted: 10/11/2021] [Indexed: 11/13/2022] Open
Abstract
During the past decade, several studies have identified electroencephalographic (EEG) correlates of selective auditory attention to speech. In these studies, typically, listeners are instructed to focus on one of two concurrent speech streams (the "target"), while ignoring the other (the "masker"). EEG signals are recorded while participants are performing this task, and subsequently analyzed to recover the attended stream. An assumption often made in these studies is that the participant's attention can remain focused on the target throughout the test. To check this assumption, and assess when a participant's attention in a concurrent speech listening task was directed toward the target, the masker, or neither, we designed a behavioral listen-then-recall task (the Long-SWoRD test). After listening to two simultaneous short stories, participants had to identify keywords from the target story, randomly interspersed among words from the masker story and words from neither story, on a computer screen. To modulate task difficulty, and hence, the likelihood of attentional switches, masker stories were originally uttered by the same talker as the target stories. The masker voice parameters were then manipulated to parametrically control the similarity of the two streams, from clearly dissimilar to almost identical. While participants listened to the stories, EEG signals were measured and subsequently, analyzed using a temporal response function (TRF) model to reconstruct the speech stimuli. Responses in the behavioral recall task were used to infer, retrospectively, when attention was directed toward the target, the masker, or neither. 
During the model-training phase, the results of these behavioral-data-driven inferences were used as inputs to the model in addition to the EEG signals, to determine if this additional information would improve stimulus reconstruction accuracy, relative to performance of models trained under the assumption that the listener's attention was unwaveringly focused on the target. Results from 21 participants show that information regarding the actual - as opposed to, assumed - attentional focus can be used advantageously during model training, to enhance subsequent (test phase) accuracy of auditory stimulus-reconstruction based on EEG signals. This is the case, especially, in challenging listening situations, where the participants' attention is less likely to remain focused entirely on the target talker. In situations where the two competing voices are clearly distinct and easily separated perceptually, the assumption that listeners are able to stay focused on the target is reasonable. The behavioral recall protocol introduced here provides experimenters with a means to behaviorally track fluctuations in auditory selective attention, including, in combined behavioral/neurophysiological studies.
Affiliation(s)
- Moïra-Phoebé Huet
- Laboratoire Vibrations Acoustique, Institut National des Sciences Appliquées de Lyon, Université de Lyon, Villeurbanne, France
- CNRS UMR 5292, INSERM U1028, Auditory Cognition and Psychoacoustics Team, Lyon Neuroscience Research Center, Lyon, France
- Etienne Parizet
- Laboratoire Vibrations Acoustique, Institut National des Sciences Appliquées de Lyon, Université de Lyon, Villeurbanne, France
- Etienne Gaudrain
- CNRS UMR 5292, INSERM U1028, Auditory Cognition and Psychoacoustics Team, Lyon Neuroscience Research Center, Lyon, France
- Department of Otorhinolaryngology, University Medical Center Groningen, University of Groningen, Groningen, Netherlands
6
Müller V, Lang-Roth R. Speech Recognition With Informational and Energetic Maskers in Patients With Single-Sided Deafness After Cochlear Implantation. Journal of Speech, Language, and Hearing Research (JSLHR) 2021; 64:3343-3356. [PMID: 34310192 DOI: 10.1044/2021_jslhr-20-00677] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Indexed: 06/13/2023]
Abstract
Purpose The aim of the study was to assess the susceptibility to energetic and informational masking in patients with single-sided deafness (SSD) with one normal-hearing (NH) ear and a cochlear implant (CI) in the contralateral ear, understand the effect on speech recognition when spatially separating noise and speech maskers, and investigate the influence of the CI in situations with energetic and informational masking. Method Speech recognition was measured in the presence of either a modulated speech-shaped noise or one of two competing speech maskers in 11 SSD-CI listeners. The speech maskers were manipulated with respect to fundamental frequency to consider the effect of different voices. Measurements were conducted in the unaided (NH) and aided (NHCI) conditions. Spatial release from masking (SRM) was calculated for each masker type and both listening conditions (NH and NHCI) by subtracting scores of the colocated target and masker condition (S0N0) from the spatially separated target and masker conditions (S0N≠0). Results Speech recognition was highly variable depending on the type of masker. SRM occurred in the unaided (NH) and aided (NHCI) conditions when the speech masker had the same gender as the target talker. Adding the CI improved speech recognition when this speech masker was ipsilateral to the NH ear. Conclusions The amount of informational masking is substantial in SSD-CI listeners with both colocated and spatially separated target and masker signals. The contribution of SRM to better speech recognition largely depends on the masker and is considerable when no differences in voices between the target and the competing talker occur. There is only a slight improvement in speech recognition by adding the CI.
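The SRM calculation described in the Method above can be sketched in a few lines. This is only an illustration and not the authors' analysis code: the function name, the dictionary keys, and every score value are hypothetical, and the sketch assumes that higher scores mean better recognition, so that a positive SRM indicates a benefit from spatial separation.

```python
def spatial_release_from_masking(colocated: float, separated: float) -> float:
    """Return SRM: the spatially separated score minus the colocated (S0N0) score."""
    return separated - colocated


# Hypothetical scores (arbitrary units, higher = better); not data from the study.
scores = {
    "NH": {"colocated": 40.0, "separated": 55.0},
    "NHCI": {"colocated": 42.0, "separated": 60.0},
}

# One SRM value per listening condition, as in the abstract's description.
srm = {
    condition: spatial_release_from_masking(s["colocated"], s["separated"])
    for condition, s in scores.items()
}

for condition, value in srm.items():
    print(f"{condition}: SRM = {value:+.1f}")
```

If performance were instead expressed as a speech reception threshold in dB, where lower values are better, the sign of the difference would need to be flipped to keep "positive means benefit".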
Affiliation(s)
- Verena Müller
- Department of Otorhinolaryngology, Head and Neck Surgery, Faculty of Medicine, University of Cologne, Germany
- Ruth Lang-Roth
- Department of Otorhinolaryngology, Head and Neck Surgery, Faculty of Medicine, University of Cologne, Germany
7
Nogueira W, Boghdady NE, Langner F, Gaudrain E, Başkent D. Effect of Channel Interaction on Vocal Cue Perception in Cochlear Implant Users. Trends Hear 2021; 25:23312165211030166. [PMID: 34461780 PMCID: PMC8411629 DOI: 10.1177/23312165211030166] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Received: 03/12/2020] [Revised: 06/14/2021] [Accepted: 06/16/2021] [Indexed: 11/16/2022] Open
Abstract
Speech intelligibility in multitalker settings is challenging for most cochlear implant (CI) users. One possibility for this limitation is the suboptimal representation of vocal cues in implant processing, such as the fundamental frequency (F0), and the vocal tract length (VTL). Previous studies suggested that while F0 perception depends on spectrotemporal cues, VTL perception relies largely on spectral cues. To investigate how spectral smearing in CIs affects vocal cue perception in speech-on-speech (SoS) settings, adjacent electrodes were simultaneously stimulated using current steering in 12 Advanced Bionics users to simulate channel interaction. In current steering, two adjacent electrodes are simultaneously stimulated forming a channel of parallel stimulation. Three such stimulation patterns were used: Sequential (one current steering channel), Paired (two channels), and Triplet stimulation (three channels). F0 and VTL just-noticeable differences (JNDs; Task 1), in addition to SoS intelligibility (Task 2) and comprehension (Task 3), were measured for each stimulation strategy. In Tasks 2 and 3, four maskers were used: the same female talker, a male voice obtained by manipulating both F0 and VTL (F0+VTL) of the original female speaker, a voice where only F0 was manipulated, and a voice where only VTL was manipulated. JNDs were measured relative to the original voice for the F0, VTL, and F0+VTL manipulations. When spectral smearing was increased from Sequential to Triplet, a significant deterioration in performance was observed for Tasks 1 and 2, with no differences between Sequential and Paired stimulation. Data from Task 3 were inconclusive. These results imply that CI users may tolerate certain amounts of channel interaction without significant reduction in performance on tasks relying on voice perception. This points to possibilities for using parallel stimulation in CIs for reducing power consumption.
Affiliation(s)
- Waldo Nogueira
- Department of Otolaryngology, Medical University Hannover and Cluster of Excellence Hearing4all, Hanover, Germany
- Nawal El Boghdady
- Department of Otorhinolaryngology, University Medical Center Groningen, University of Groningen, Groningen, Netherlands
- Research School of Behavioral and Cognitive Neurosciences, University of Groningen, Groningen, Netherlands
- Florian Langner
- Department of Otolaryngology, Medical University Hannover and Cluster of Excellence Hearing4all, Hanover, Germany
- Etienne Gaudrain
- Department of Otorhinolaryngology, University Medical Center Groningen, University of Groningen, Groningen, Netherlands
- Research School of Behavioral and Cognitive Neurosciences, University of Groningen, Groningen, Netherlands
- Lyon Neuroscience Research Center, CNRS UMR 5292, INSERM U1028, University Lyon 1, Lyon, France
- Deniz Başkent
- Department of Otorhinolaryngology, University Medical Center Groningen, University of Groningen, Groningen, Netherlands
- Research School of Behavioral and Cognitive Neurosciences, University of Groningen, Groningen, Netherlands
8
Speech Segregation in Active Middle Ear Stimulation: Masking Release With Changing Fundamental Frequency. Ear Hear 2020; 42:709-717. [PMID: 33369941 DOI: 10.1097/aud.0000000000000973] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/26/2022]
Abstract
OBJECTIVES Temporal fine structure information such as low-frequency sounds including the fundamental frequency (F0) is important to separate different talkers in noisy environments. Speech perception in noise is negatively affected by reduced temporal fine structure resolution in cochlear hearing loss. It has been shown that normal-hearing (NH) people as well as cochlear implant patients with preserved acoustic low-frequency hearing benefit from different F0 between concurrent talkers. Though patients with an active middle ear implant (AMEI) report better sound quality compared with hearing aids, they often struggle when listening in noise. The primary objective was to evaluate whether or not patients with a Vibrant Soundbridge AMEI were able to benefit from F0 differences in a concurrent talker situation and if the effect was comparable to NH individuals. DESIGN A total of 13 AMEI listeners and 13 NH individuals were included. A modified variant of the Oldenburg sentence test was used to emulate a concurrent talker scenario. One sentence from the test corpus served as the masker and the remaining sentences as target speech. The F0 of the masker sentence was shifted upward by 4, 8, and 12 semitones. The target and masker sentences were presented simultaneously to the study subjects and the speech reception threshold was assessed by adaptively varying the masker level. To evaluate any impact of the occlusion effect on speech perception, AMEI listeners were tested in two configurations: with a plugged ear-canal contralateral to the implant side, indicated as AMEIcontra, or with both ears plugged, indicated as AMEIboth. RESULTS In both study groups, speech perception improved when the F0 difference between target and masker increased. This was significant when the difference was at least 8 semitones; the F0-based release from masking was 3.0 dB in AMEIcontra (p = 0.009) and 2.9 dB in AMEIboth (p = 0.015), compared with 5.6 dB in NH listeners (p < 0.001). 
A difference of 12 semitones revealed a F0-based release from masking of 3.5 dB in the AMEIcontra (p = 0.002) and 3.4 dB in the AMEIboth (p = 0.003) condition, compared with 5.0 dB in NH individuals (p < 0.001). CONCLUSIONS Though AMEI users deal with problems resulting from cochlear damage, hearing amplification with the implant enables a masking release based on F0 differences when F0 between a target and masker sentence was at least 8 semitones. Additional occlusion of the ear canal on the implant side did not affect speech performance. The current results complement the knowledge about the benefit of F0 within the acoustic low-frequency hearing.
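The upward F0 shifts of 4, 8, and 12 semitones mentioned in the design can be related to frequency ratios with a short sketch. It assumes the standard equal-tempered definition of a semitone (12 per octave); the function name and the 100 Hz reference F0 are hypothetical and are not taken from the study.

```python
def shift_f0(f0_hz: float, semitones: float) -> float:
    """Shift a frequency by a number of equal-tempered semitones (12 per octave)."""
    return f0_hz * 2.0 ** (semitones / 12.0)


# Hypothetical reference F0; the abstract does not report the talker's F0.
reference_f0_hz = 100.0

for st in (4, 8, 12):
    shifted = shift_f0(reference_f0_hz, st)
    print(f"+{st} st: {shifted:.1f} Hz (ratio {shifted / reference_f0_hz:.3f})")
```

Under this definition, a 12-semitone shift corresponds to one octave, that is, a doubling of F0.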
9
Effect of Spectral Contrast Enhancement on Speech-on-Speech Intelligibility and Voice Cue Sensitivity in Cochlear Implant Users. Ear Hear 2020; 42:271-289. [PMID: 32925307 DOI: 10.1097/aud.0000000000000936] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Indexed: 12/20/2022]
Abstract
OBJECTIVES Speech intelligibility in the presence of a competing talker (speech-on-speech; SoS) presents more difficulties for cochlear implant (CI) users compared with normal-hearing listeners. A recent study implied that these difficulties may be related to CI users' low sensitivity to two fundamental voice cues, namely, the fundamental frequency (F0) and the vocal tract length (VTL) of the speaker. Because of the limited spectral resolution in the implant, important spectral cues carrying F0 and VTL information are expected to be distorted. This study aims to address two questions: (1) whether spectral contrast enhancement (SCE), previously shown to enhance CI users' speech intelligibility in the presence of steady state background noise, could also improve CI users' SoS intelligibility, and (2) whether such improvements in SoS from SCE processing are due to enhancements in CI users' sensitivity to F0 and VTL differences between the competing talkers. DESIGN The effect of SCE on SoS intelligibility and comprehension was measured in two separate tasks in a sample of 14 CI users with Cochlear devices. In the first task, the CI users were asked to repeat the sentence spoken by the target speaker in the presence of a single competing talker. The competing talker was the same target speaker whose F0 and VTL were parametrically manipulated to obtain the different experimental conditions. SoS intelligibility, in terms of the percentage of correctly repeated words from the target sentence, was assessed using the standard advanced combination encoder (ACE) strategy and SCE for each voice condition. In the second task, SoS comprehension accuracy and response times were measured using the same experimental setup as in the first task, but with a different corpus. In the final task, CI users' sensitivity to F0 and VTL differences were measured for the ACE and SCE strategies. 
The benefit in F0 and VTL discrimination from SCE processing was evaluated with respect to the improvement in SoS perception from SCE. RESULTS While SCE demonstrated the potential of improving SoS intelligibility in CI users, this effect appeared to stem from SCE improving the overall signal to noise ratio in SoS rather than improving the sensitivity to the underlying F0 and VTL differences. A second key finding of this study was that, contrary to what has been observed in a previous study for childlike voice manipulations, F0 and VTL manipulations of a reference female speaker (target speaker) toward male-like voices provided a small but significant release from masking for the CI users tested. CONCLUSIONS The present findings, together with those previously reported in the literature, indicate that SCE could serve as a possible background-noise-reduction strategy in commercial CI speech processors that could enhance speech intelligibility especially in the presence of background talkers that have longer VTLs compared with the target speaker.
10
Warren SE, Noelle Dunbar M, Bosworth C, Agrawal S. Evaluation of a novel bimodal fitting formula in Advanced Bionics cochlear implant recipients. Cochlear Implants Int 2020; 21:323-337. [PMID: 32664814 DOI: 10.1080/14670100.2020.1787622] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Indexed: 10/23/2022]
Abstract
Purpose: The study's objectives were to (1) evaluate benefit from a novel bimodal fitting formula (Adaptive Phonak Digital Bimodal Fitting Formula [APDB]), and (2) compare outcomes with APDB and a traditional fitting formula (NAL-NL2). Methods: This prospective study evaluated outcomes in ten adults with unilateral Advanced Bionics (AB) cochlear implants (CI). Participants were tested bimodally with NAL-NL2 and APDB programmed on Naída Link UP HAs. Measures of speech perception, sound quality, and preference were obtained with two bimodal configurations (CI + HA(NAL-NL2) and CI + HA(APDB)). Participants used the CI + HA(APDB) configuration for an acclimation period, after which measures were repeated. Results: Significant bimodal benefit was measured from both HA fitting formulae for speech perception in noise compared to the CI-only condition. Improved individual outcomes with the APDB were observed, but group differences were not statistically significant. Participants reported subjective benefit from APDB on blind comparisons of preference and sound quality. Conclusions: Significant benefit was found with both bimodal conditions compared to the CI-only condition; however, bimodal speech perception results were not significantly different. Users reported benefit from the APDB formula over the NAL-NL2 formula. Due to individual improved speech perception and overall subjective preference for APDB, clinicians should consider APDB with AB CI recipients.
Affiliation(s)
- Sarah E Warren
- School of Communication Sciences and Disorders, University of Memphis, Memphis, TN, USA; Arkansas Children's Hospital, Little Rock, AR, USA
- M Noelle Dunbar
- Columbia University Irving Medical Center, New York, NY, USA
|
11
|
Factors Affecting Bimodal Benefit in Pediatric Mandarin-Speaking Chinese Cochlear Implant Users. Ear Hear 2020; 40:1316-1327. [PMID: 30882534 DOI: 10.1097/aud.0000000000000712] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Indexed: 11/26/2022]
Abstract
OBJECTIVES While fundamental frequency (F0) cues are important to both lexical tone perception and multitalker segregation, F0 cues are poorly perceived by cochlear implant (CI) users. Adding low-frequency acoustic hearing via a hearing aid in the contralateral ear may improve CI users' F0 perception. For English-speaking CI users, contralateral acoustic hearing has been shown to improve perception of target speech in noise and in competing talkers. For tonal languages such as Mandarin Chinese, F0 information is lexically meaningful. Given competing F0 information from multiple talkers and lexical tones, contralateral acoustic hearing may be especially beneficial for Mandarin-speaking CI users' perception of competing speech. DESIGN Bimodal benefit (CI+hearing aid - CI-only) was evaluated in 11 pediatric Mandarin-speaking Chinese CI users. In experiment 1, speech recognition thresholds (SRTs) were adaptively measured using a modified coordinated response measure test; subjects were required to correctly identify 2 keywords from among 10 choices in each category. SRTs were measured with CI-only or bimodal listening in the presence of steady state noise (SSN) or competing speech with the same (M+M) or different voice gender (M+F). Unaided thresholds in the non-CI ear and demographic factors were compared with speech performance. In experiment 2, SRTs were adaptively measured in SSN for recognition of 5 keywords, a more difficult listening task than the 2-keyword recognition task in experiment 1. RESULTS In experiment 1, SRTs were significantly lower for SSN than for competing speech in both the CI-only and bimodal listening conditions. There was no significant difference between CI-only and bimodal listening for SSN and M+F (p > 0.05); SRTs were significantly lower for CI-only than for bimodal listening for M+M (p < 0.05), suggesting bimodal interference. 
Subjects were able to make use of voice gender differences for bimodal listening (p < 0.05) but not for CI-only listening (p > 0.05). Unaided thresholds in the non-CI ear were positively correlated with bimodal SRTs for M+M (p < 0.006) but not for SSN or M+F. No significant correlations were observed between any demographic variables and SRTs (p > 0.05 in all cases). In experiment 2, SRTs were significantly lower with two than with five keywords (p < 0.05). A significant bimodal benefit was observed only for the 5-keyword condition (p < 0.05). CONCLUSIONS With the CI alone, subjects experienced greater interference with competing speech than with SSN and were unable to use voice gender difference to segregate talkers. For the coordinated response measure task, subjects experienced no bimodal benefit and even bimodal interference when competing talkers were the same voice gender. A bimodal benefit in SSN was observed for the five-keyword condition but not for the two-keyword condition, suggesting that bimodal listening may be more beneficial as the difficulty of the listening task increased. The present data suggest that bimodal benefit may depend on the type of masker and/or the difficulty of the listening task.
|
12
|
Meister H, Walger M, Lang-Roth R, Müller V. Voice fundamental frequency differences and speech recognition with noise and speech maskers in cochlear implant recipients. The Journal of the Acoustical Society of America 2020; 147:EL19. [PMID: 32007021 DOI: 10.1121/10.0000499] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Received: 09/18/2019] [Accepted: 12/11/2019] [Indexed: 06/10/2023]
Abstract
Cochlear implant (CI) recipients are limited in their perception of voice cues, such as the fundamental frequency (F0). This has important consequences for speech recognition when several talkers speak simultaneously. This examination considered the comparison of clear speech and noise-vocoded sentences as maskers. For the speech maskers it could be shown that good CI performers are able to benefit from F0 differences between target and masker. This was because an F0 difference of 80 Hz significantly reduced target-masker confusions, an effect that was slightly more pronounced in bimodal than in bilateral users.
Affiliation(s)
- Hartmut Meister
- Jean-Uhrmacher-Institute for Clinical ENT-Research, University of Cologne, Geibelstrasse 29-31, D-50931 Cologne, Germany
- Martin Walger
- Department of Otorhinolaryngology, Head and Neck Surgery, Medical Faculty, University of Cologne, Kerpenerstrasse 62, 50937 Cologne
- Ruth Lang-Roth
- Department of Otorhinolaryngology, Head and Neck Surgery, Medical Faculty, University of Cologne, Kerpenerstrasse 62, 50937 Cologne
- Verena Müller
- Department of Otorhinolaryngology, Head and Neck Surgery, Medical Faculty, University of Cologne, Kerpenerstrasse 62, 50937 Cologne
|
13
|
Gaudrain E, Başkent D. Discrimination of Voice Pitch and Vocal-Tract Length in Cochlear Implant Users. Ear Hear 2019; 39:226-237. [PMID: 28799983 PMCID: PMC5839701 DOI: 10.1097/aud.0000000000000480] [Citation(s) in RCA: 72] [Impact Index Per Article: 14.4] [Received: 01/09/2017] [Accepted: 06/29/2017] [Indexed: 12/02/2022]
Abstract
OBJECTIVES When listening to two competing speakers, normal-hearing (NH) listeners can take advantage of voice differences between the speakers. Users of cochlear implants (CIs) have difficulty in perceiving speech on speech. Previous literature has indicated sensitivity to voice pitch (related to fundamental frequency, F0) to be poor among implant users, while sensitivity to vocal-tract length (VTL; related to the height of the speaker and formant frequencies), the other principal voice characteristic, has not been directly investigated in CIs. A few recent studies evaluated F0 and VTL perception indirectly, through voice gender categorization, which relies on perception of both voice cues. These studies revealed that, contrary to prior literature, CI users seem to rely exclusively on F0 while not utilizing VTL to perform this task. The objective of the present study was to directly and systematically assess raw sensitivity to F0 and VTL differences in CI users to define the extent of the deficit in voice perception. DESIGN The just-noticeable differences (JNDs) for F0 and VTL were measured in 11 CI listeners using triplets of consonant-vowel syllables in an adaptive three-alternative forced choice method. RESULTS The results showed that while NH listeners had average JNDs of 1.95 and 1.73 semitones (st) for F0 and VTL, respectively, CI listeners showed JNDs of 9.19 and 7.19 st. These JNDs correspond to differences of 70% in F0 and 52% in VTL. For comparison to the natural range of voices in the population, the F0 JND in CIs remains smaller than the typical male-female F0 difference. However, the average VTL JND in CIs is about twice as large as the typical male-female VTL difference. CONCLUSIONS These findings, thus, directly confirm that CI listeners do not seem to have sufficient access to VTL cues, likely as a result of limited spectral resolution, and, hence, that CI listeners' voice perception deficit goes beyond poor perception of F0. 
These results provide a potential common explanation not only for a number of deficits observed in CI listeners, such as voice identification and gender categorization, but also for competing speech perception.
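The abstract above reports JNDs in semitones alongside percentage differences. A minimal sketch of that conversion follows, assuming the standard equal-tempered relation in which 12 semitones correspond to a doubling (ratio = 2^(st/12)). The function name `semitones_to_percent` is hypothetical and is not taken from the study. Under this assumption the output should agree with the reported 70% (F0) and 52% (VTL) only to within rounding, since the published JNDs are themselves rounded.

```python
def semitones_to_percent(st: float) -> float:
    """Percent change that corresponds to an interval of `st` semitones.

    Assumes 12 semitones equal a factor of 2 (one octave).
    """
    ratio = 2.0 ** (st / 12.0)
    return (ratio - 1.0) * 100.0


if __name__ == "__main__":
    # Mean JNDs (in semitones) quoted in the abstract for CI and NH listeners.
    reported_jnds = {
        "CI F0": 9.19,
        "CI VTL": 7.19,
        "NH F0": 1.95,
        "NH VTL": 1.73,
    }
    for label, jnd_st in reported_jnds.items():
        print(f"{label}: {jnd_st} st is a {semitones_to_percent(jnd_st):.1f}% difference")
```

The same relation can be inverted, st = 12 · log2(ratio), to express a percentage difference in semitones.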
Affiliation(s)
- Etienne Gaudrain
- University of Groningen, University Medical Center Groningen, Department of Otorhinolaryngology-Head and Neck Surgery, Groningen, The Netherlands; CNRS UMR 5292, Lyon Neuroscience Research Center, Auditory Cognition and Psychoacoustics, Université Lyon, Lyon, France; and Research School of Behavioral and Cognitive Neurosciences, Graduate School of Medical Sciences, University of Groningen, Groningen, The Netherlands
- Deniz Başkent
- University of Groningen, University Medical Center Groningen, Department of Otorhinolaryngology-Head and Neck Surgery, Groningen, The Netherlands; CNRS UMR 5292, Lyon Neuroscience Research Center, Auditory Cognition and Psychoacoustics, Université Lyon, Lyon, France; and Research School of Behavioral and Cognitive Neurosciences, Graduate School of Medical Sciences, University of Groningen, Groningen, The Netherlands
|
14
|
El Boghdady N, Gaudrain E, Başkent D. Does good perception of vocal characteristics relate to better speech-on-speech intelligibility for cochlear implant users? The Journal of the Acoustical Society of America 2019; 145:417. [PMID: 30710943 DOI: 10.1121/1.5087693] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Received: 08/03/2018] [Accepted: 12/21/2018] [Indexed: 06/09/2023]
Abstract
Differences in voice pitch (F0) and vocal tract length (VTL) improve intelligibility of speech masked by a background talker (speech-on-speech; SoS) for normal-hearing (NH) listeners. Cochlear implant (CI) users, who are less sensitive to these two voice cues compared to NH listeners, experience difficulties in SoS perception. Three research questions were addressed: (1) whether increasing the F0 and VTL difference (ΔF0; ΔVTL) between two competing talkers benefits CI users in SoS intelligibility and comprehension, (2) whether this benefit is related to their F0 and VTL sensitivity, and (3) whether their overall SoS intelligibility and comprehension are related to their F0 and VTL sensitivity. Results showed: (1) CI users did not benefit in SoS perception from increasing ΔF0 and ΔVTL; increasing ΔVTL had a slightly detrimental effect on SoS intelligibility and comprehension. Results also showed: (2) the effect from increasing ΔF0 on SoS intelligibility was correlated with F0 sensitivity, while the effect from increasing ΔVTL on SoS comprehension was correlated with VTL sensitivity. Finally, (3) the sensitivity to both F0 and VTL, and not only one of them, was found to be correlated with overall SoS performance, elucidating important aspects of voice perception that should be optimized through future coding strategies.
Affiliation(s)
- Nawal El Boghdady
- Department of Otorhinolaryngology/Head and Neck Surgery, University Medical Center Groningen, University of Groningen, Groningen, The Netherlands
- Etienne Gaudrain
- Department of Otorhinolaryngology/Head and Neck Surgery, University Medical Center Groningen, University of Groningen, Groningen, The Netherlands
- Deniz Başkent
- Department of Otorhinolaryngology/Head and Neck Surgery, University Medical Center Groningen, University of Groningen, Groningen, The Netherlands
|
15
|
Warren SE, Dunbar MN. Bimodal Hearing in Individuals with Severe-to-Profound Hearing Loss: Benefits, Challenges, and Management. Semin Hear 2018; 39:405-413. [PMID: 30374211 DOI: 10.1055/s-0038-1670706] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Indexed: 10/28/2022]
Abstract
Binaural hearing offers numerous advantages over monaural hearing. While bilateral implants are a successful treatment option for some patients, many individuals choose to achieve binaural hearing by using a cochlear implant with a contralateral hearing aid. Compared with monaural hearing, benefits of bimodal hearing include improved speech perception in quiet and in noise, improved localization, and more natural sound quality. Despite the advantages, there exist disadvantages to bimodal hearing, primarily related to binaural integration. Management of these devices can be challenging in that the hearing aid and cochlear implant may be managed by different clinicians. When fitting devices, strategies are recommended to optimize the integration of input from both devices. In managing bimodal devices, recommended outcomes measures include those that would reflect bimodal benefit, such as speech understanding in noise and spatial sound quality perception.
Affiliation(s)
- Sarah E Warren
- School of Communication Sciences and Disorders, University of Memphis, Memphis, Tennessee; Department of Audiology and Speech Pathology, Arkansas Children's Hospital, Little Rock, Arkansas
- M Noelle Dunbar
- Department of Otolaryngology/Head and Neck Surgery, Columbia University Medical Center, New York, New York
|
16
|
El Boghdady N, Başkent D, Gaudrain E. Effect of frequency mismatch and band partitioning on vocal tract length perception in vocoder simulations of cochlear implant processing. The Journal of the Acoustical Society of America 2018; 143:3505. [PMID: 29960490 DOI: 10.1121/1.5041261] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Indexed: 06/08/2023]
Abstract
The vocal tract length (VTL) of a speaker is an important voice cue that aids speech intelligibility in multi-talker situations. However, cochlear implant (CI) users demonstrate poor VTL sensitivity. This may be partially caused by the mismatch between frequencies received by the implant and those corresponding to places of stimulation along the cochlea. This mismatch can distort formant spacing, where VTL cues are encoded. In this study, the effects of frequency mismatch and band partitioning on VTL sensitivity were investigated in normal hearing listeners with vocoder simulations of CI processing. The hypotheses were that VTL sensitivity may be reduced by increased frequency mismatch and insufficient spectral resolution in how the frequency range is partitioned, specifically where formants lie. Moreover, optimal band partitioning might mitigate the detrimental effects of frequency mismatch on VTL sensitivity. Results showed that VTL sensitivity decreased with increased frequency mismatch and reduced spectral resolution near the low frequencies of the band partitioning map. Band partitioning was independent of mismatch, indicating that if a given partitioning is suboptimal, a better partitioning might improve VTL sensitivity despite the degree of mismatch. These findings suggest that customizing the frequency partitioning map may enhance VTL perception in individual CI users.
Affiliation(s)
- Nawal El Boghdady
- University of Groningen, University Medical Center Groningen, Department of Otorhinolaryngology/Head and Neck Surgery, Groningen, The Netherlands
- Deniz Başkent
- University of Groningen, University Medical Center Groningen, Department of Otorhinolaryngology/Head and Neck Surgery, Groningen, The Netherlands
- Etienne Gaudrain
- University of Groningen, University Medical Center Groningen, Department of Otorhinolaryngology/Head and Neck Surgery, Groningen, The Netherlands
|
17
|
Masking release with changing fundamental frequency: Electric acoustic stimulation resembles normal hearing subjects. Hear Res 2017; 350:226-234. [DOI: 10.1016/j.heares.2017.05.004] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Received: 11/01/2016] [Revised: 03/04/2017] [Accepted: 05/08/2017] [Indexed: 11/20/2022]
|
18
|
Shirvani S, Jafari Z, Motasaddi Zarandi M, Jalaie S, Mohagheghi H, Tale MR. Emotional Perception of Music in Children With Bimodal Fitting and Unilateral Cochlear Implant. Ann Otol Rhinol Laryngol 2015; 125:470-7. [PMID: 26681623 DOI: 10.1177/0003489415619943] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Indexed: 11/16/2022]
Abstract
OBJECTIVE Biological, structural, and acoustical constraints faced by cochlear implant (CI) users can alter the perception of music. Bimodal fitting not only provides bilateral hearing but can also improve auditory skills. This study was conducted to assess the impact of this amplification style on the emotional perception of music among children with hearing loss (HL). METHODS Twenty-five children with congenital severe to profound HL and unilateral CIs, 20 children with bimodal fitting, and 30 children with normal hearing participated in this study. Their emotional perceptions of music were measured using a method where children indicated happy or sad feelings induced by music by pointing to pictures of faces showing these emotions. RESULTS Children with bimodal fitting obtained significantly higher mean scores than children with unilateral CIs for both happy and sad music items and in overall test scores (P < .001). Both groups with HL obtained significantly lower scores than children with normal hearing (P < .001). CONCLUSIONS Bimodal fitting results in a better emotional perception of music compared to unilateral CI. Given the influence of music in neurological and linguistic development and social interactions, it is important to evaluate the possible benefits of bimodal fitting prescriptions for individuals with unilateral CIs.
Affiliation(s)
- Sareh Shirvani
- Department of Audiology, School of Rehabilitation, Tehran University of Medical Sciences (TUMS), Tehran, Iran
- Zahra Jafari
- Department of Basic Sciences in Rehabilitation, School of Rehabilitation Sciences, Iran University of Medical Sciences (IUMS), Tehran, Iran; Canadian Center for Behavioral Neuroscience (CCBN), Lethbridge University, Lethbridge, Alberta, Canada
- Masoud Motasaddi Zarandi
- Cochlear Implant Research Center, AmirAlam Hospital, School of Medicine, Tehran University of Medical Sciences, Tehran, Iran
- Shohre Jalaie
- Department of Physiotherapy, School of Rehabilitation, Tehran University of Medical Sciences (TUMS), Tehran, Iran
- Hamed Mohagheghi
- Department of Audiology, School of Rehabilitation Sciences, Shahid Beheshti University of Medical Sciences, Tehran, Iran
|
19
|
Pyschny V, Landwehr M, Hahn M, Lang-Roth R, Walger M, Meister H. Head shadow, squelch, and summation effects with an energetic or informational masker in bilateral and bimodal CI users. Journal of Speech, Language, and Hearing Research: JSLHR 2014; 57:1942-1960. [PMID: 24825129 DOI: 10.1044/2014_jslhr-h-13-0144] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Received: 06/04/2013] [Accepted: 03/26/2014] [Indexed: 06/03/2023]
Abstract
PURPOSE The objective of the study was to investigate the influence of noise (energetic) and speech (energetic plus informational) maskers on the head shadow (HS), squelch (SQ), and binaural summation (SU) effect in bilateral and bimodal cochlear implant (CI) users. METHOD Speech recognition was measured in the presence of either a competing talker or modulated speech-shaped noise in 10 bimodal and 10 bilateral adult CI users. HS, SQ, and SU effects were calculated. The interfering signals were manipulated with respect to F0 to consider the influence of different speaker voices. RESULTS The effects HS, SQ, and SU differed depending on the type of masker. A detailed analysis of errors was used to dissociate energetic and informational masking effects. The analysis showed a release from energetic than from informational masking. CONCLUSION Noise interferers are not sufficient to reflect difficulties experienced with speech understanding in noise for bilateral and bimodal CI users.
|
20
|
Visram AS, Kluk K, McKay CM. Voice gender differences and separation of simultaneous talkers in cochlear implant users with residual hearing. The Journal of the Acoustical Society of America 2012; 132:EL135-EL141. [PMID: 22894312 DOI: 10.1121/1.4737137] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Indexed: 05/26/2023]
Abstract
Perception of a target voice in the presence of a competing talker, of same or different gender as the target, was investigated in cochlear implant users, in implant-alone and bimodal (acoustic hearing in the non-implanted ear) conditions. Recordings of two male and two female talkers acted as targets and maskers, to investigate whether bimodal benefit increased for different compared to same gender target/maskers due to increased ability to perceive and utilize fundamental frequency and spectral-shape differences. In both listening conditions participants showed benefit of target/masker gender difference. There was an overall bimodal benefit, which was independent of target/masker gender difference.
Affiliation(s)
- Anisa S Visram
- School of Psychological Sciences, University of Manchester, Manchester M13 9PL, United Kingdom
|