1
|
Kovačić D, James CJ. Stimulation Rate and Voice Pitch Perception in Cochlear Implants. J Assoc Res Otolaryngol 2022; 23:665-680. [PMID: 35918501 PMCID: PMC9613839 DOI: 10.1007/s10162-022-00854-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2020] [Accepted: 06/05/2022] [Indexed: 06/15/2023] Open
Abstract
The stimulation rate in cochlear implant (CI) sound coding, or the "carrier" rate in pulses per second (pps), is known to influence pitch perception, as well as loudness perception and sound quality. Our main objective was to investigate the effects of reduced carrier rate on the loudness and pitch of coded speech samples. We describe two experiments with 16 Nucleus® CI users, where we controlled modulation characteristics and carrier rate using Spectral and Temporal Enhanced Processing (STEP), a novel experimental multichannel sound coder. We used a fixed set of threshold and comfortable stimulation levels for each subject, obtained from clinical MAPs. In the first experiment, we determined equivalence for voice pitch ranking and voice gender categorization between the Advanced Combination Encoder (ACE), a widely used clinical strategy in Nucleus® recipients, and STEP for fundamental frequencies (F0) 120-250 Hz. In the second experiment, loudness was determined as a function of the input amplitude of speech samples for carrier rates of 1000, 500, and 250 pps per channel. Then, using equally loud sound coder programs, we evaluated the effect of carrier rate on voice pitch perception. Although nearly all subjects could categorize voice gender significantly above chance, pitch ranking varied across subjects. Overall, carrier rate did not substantially affect voice pitch ranking or voice gender categorization: as long as the carrier rate was at least twice the fundamental frequency, or when stimulation pulses for the lowest, 250 pps carrier were aligned to F0 peaks. These results indicate that carrier rates as low as 250 pps per channel are sufficient to support functional voice pitch perception for those CI users sensitive to temporal pitch cues; at least when temporal modulations and pulse timings in the coder output are well controlled by novel strategies such as STEP.
Collapse
Affiliation(s)
- Damir Kovačić
- Department of Physics, Faculty of Science, University of Split, Ruđera Boškovića 33, 21000 Split, Croatia
| | - Chris J. James
- Cochlear France SAS, 135 Route de Saint Simon, 31100 Toulouse, France
| |
Collapse
|
2
|
Monaghan JJM, Carlyon RP, Deeks JM. Modulation Depth Discrimination by Cochlear Implant Users. J Assoc Res Otolaryngol 2022; 23:285-299. [PMID: 35080684 PMCID: PMC8964891 DOI: 10.1007/s10162-022-00834-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2021] [Accepted: 12/30/2021] [Indexed: 11/29/2022] Open
Abstract
Cochlear implants (CIs) convey the amplitude envelope of speech by modulating high-rate pulse trains. However, not all of the envelope may be necessary to perceive amplitude modulations (AMs); the effective envelope depth may be limited by forward and backward masking from the envelope peaks. Three experiments used modulated pulse trains to measure which portions of the envelope can be effectively processed by CI users as a function of AM frequency. Experiment 1 used a three-interval forced-choice task to test the ability of CI users to discriminate less-modulated pulse trains from a fully modulated standard, without controlling for loudness. The stimuli in experiment 2 were identical, but a two-interval task was used in which participants were required to choose the less-modulated interval, ignoring loudness. Catch trials, in which judgements based on level or modulation depth would give opposing answers, were included. Experiment 3 employed novel stimuli whose modulation envelope could be modified below a variable point in the dynamic range, without changing the loudness of the stimulus. Overall, results showed that substantial portions of the envelope are not accurately encoded by CI users. In experiment 1, where loudness cues were available, participants on average were insensitive to changes in the bottom 30% of their dynamic range. In experiment 2, where loudness was controlled, participants appeared insensitive to changes in the bottom 50% of the dynamic range. In experiment 3, participants were insensitive to changes in the bottom 80% of the dynamic range. We discuss potential reasons for this insensitivity and implications for CI speech-processing strategies.
Collapse
Affiliation(s)
- Jessica J M Monaghan
- Macquarie University, The Australian Hearing Hub, NSW, 2109, Sydney, Australia.
- National Acoustic Laboratories, The Australian Hearing Hub, Sydney, NSW, 2109, Australia.
| | - Robert P Carlyon
- Cambridge Hearing Group, Medical Research Council Cognition and Brain Sciences Unit, University of Cambridge, 15 Chaucer Road, Cambridge, CB2 7EF, UK
| | - John M Deeks
- Cambridge Hearing Group, Medical Research Council Cognition and Brain Sciences Unit, University of Cambridge, 15 Chaucer Road, Cambridge, CB2 7EF, UK
| |
Collapse
|
3
|
Relationship Between Peripheral and Psychophysical Measures of Amplitude Modulation Detection in Cochlear Implant Users. Ear Hear 2018; 38:e268-e284. [PMID: 28207576 DOI: 10.1097/aud.0000000000000417] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
OBJECTIVE This study investigates the relationship between electrophysiological and psychophysical measures of amplitude modulation (AM) detection. Prior studies have reported both measures of AM detection recorded separately from cochlear implant (CI) users and acutely deafened animals, but no study has made both measures in the same CI users. Animal studies suggest a progressive loss of high-frequency encoding as one ascends the auditory pathway from the auditory nerve to the cortex. Because the CI speech processor uses the envelope of an ongoing acoustic signal to modulate pulse trains that are subsequently delivered to the intracochlear electrodes, it is of interest to explore auditory nerve responses to modulated stimuli. In addition, psychophysical AM detection abilities have been correlated with speech perception outcomes. Thus, the goal was to explore how the auditory nerve responds to AM stimuli and to relate those physiologic measures to perception. DESIGN Eight patients using Cochlear Ltd. Implants participated in this study. Electrically evoked compound action potentials (ECAPs) were recorded using a 4000 pps pulse train that was sinusoidally amplitude modulated at 125, 250, 500, and 1000 Hz rates. Responses were measured for each pulse over at least one modulation cycle for an apical, medial, and basal electrode. Psychophysical modulation detection thresholds (MDTs) were also measured via a three-alternative forced choice, two-down, one-up adaptive procedure using the same modulation frequencies and electrodes. RESULTS ECAPs were recorded from individual pulses in the AM pulse train. ECAP amplitudes varied sinusoidally, reflecting the sinusoidal variation in the stimulus. A modulated response amplitude (MRA) metric was calculated as the difference in the maximal and minimum ECAP amplitudes over the modulation cycles. MRA increased as modulation frequency increased, with no apparent cutoff (up to 1000 Hz). In contrast, MDTs increased as the modulation frequency increased. This trend is inconsistent with the physiologic measures. For a fixed modulation frequency, correlations were observed between MDTs and MRAs; this trend was evident at all frequencies except 1000 Hz (although only statistically significant for 250 and 500 Hz AM rates), possibly an indication of central limitations in processing of high modulation frequencies. Finally, peripheral responses were larger and psychophysical thresholds were lower in the apical electrodes relative to basal and medial electrodes, which may reflect better cochlear health and neural survival evidenced by lower preoperative low-frequency audiometric thresholds and steeper growth of neural responses in ECAP amplitude growth functions for apical electrodes. CONCLUSIONS Robust ECAPs were recorded for all modulation frequencies tested. ECAP amplitudes varied sinusoidally, reflecting the periodicity of the modulated stimuli. MRAs increased as the modulation frequency increased, a trend we attribute to neural adaptation. For low modulation frequencies, there are multiple current steps between the peak and valley of the modulation cycle, which means successive stimuli are more similar to one another and neural responses are more likely to adapt. Higher MRAs were correlated with lower psychophysical thresholds at low modulation frequencies but not at 1000 Hz, implying a central limitation to processing of modulated stimuli.
Collapse
|
4
|
Brochier T, McKay C, McDermott H. Rate modulation detection thresholds for cochlear implant users. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2018; 143:1214. [PMID: 29495682 DOI: 10.1121/1.5025048] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
The perception of temporal amplitude modulations is critical for speech understanding by cochlear implant (CI) users. The present study compared the ability of CI users to detect sinusoidal modulations of the electrical stimulation rate and current level, at different presentation levels (80% and 40% of the dynamic range) and modulation frequencies (10 and 100 Hz). Rate modulation detection thresholds (RMDTs) and amplitude modulation detection thresholds (AMDTs) were measured and compared to assess whether there was a perceptual advantage to either modulation method. Both RMDTs and AMDTs improved with increasing presentation level and decreasing modulation frequency. RMDTs and AMDTs were correlated, indicating that a common processing mechanism may underlie the perception of rate modulation and amplitude modulation, or that some subject-dependent factors affect both types of modulation detection.
Collapse
Affiliation(s)
- Tim Brochier
- Department of Medical Bionics, University of Melbourne, 384-388 Albert Street, East Melbourne, Victoria 3002, Australia
| | - Colette McKay
- The Bionics Institute, 384-388 Albert Street, East Melbourne, Victoria 3002, Australia
| | - Hugh McDermott
- The Bionics Institute, 384-388 Albert Street, East Melbourne, Victoria 3002, Australia
| |
Collapse
|
5
|
Sheft S, Cheng MY, Shafiro V. Discrimination of Stochastic Frequency Modulation by Cochlear Implant Users. J Am Acad Audiol 2018; 26:572-81. [PMID: 26134724 DOI: 10.3766/jaaa.14067] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
BACKGROUND Past work has shown that low-rate frequency modulation (FM) may help preserve signal coherence, aid segmentation at word and syllable boundaries, and benefit speech intelligibility in the presence of a masker. PURPOSE This study evaluated whether difficulties in speech perception by cochlear implant (CI) users relate to a deficit in the ability to discriminate among stochastic low-rate patterns of FM. RESEARCH DESIGN RESEARCH DESIGN This is a correlational study assessing the association between the ability to discriminate stochastic patterns of low-rate FM and the intelligibility of speech in noise. STUDY SAMPLE Thirteen postlingually deafened adult CI users participated in this study. DATA COLLECTION AND ANALYSIS Using modulators derived from 5-Hz lowpass noise applied to a 1-kHz carrier, thresholds were measured in terms of frequency excursion both in quiet and with a speech-babble masker present, stimulus duration, and signal-to-noise ratio in the presence of a speech-babble masker. Speech perception ability was assessed in the presence of the same speech-babble masker. Relationships were evaluated with Pearson product-moment correlation analysis with correction for family-wise error, and commonality analysis to determine the unique and common contributions across psychoacoustic variables to the association with speech ability. RESULTS Significant correlations were obtained between masked speech intelligibility and three metrics of FM discrimination involving either signal-to-noise ratio or stimulus duration, with shared variance among the three measures accounting for much of the effect. Compared to past results from young normal-hearing adults and older adults with either normal hearing or a mild-to-moderate hearing loss, mean FM discrimination thresholds obtained from CI users were higher in all conditions. CONCLUSIONS The ability to process the pattern of frequency excursions of stochastic FM may, in part, have a common basis with speech perception in noise. Discrimination of differences in the temporally distributed place coding of the stimulus could serve as this common basis for CI users.
Collapse
Affiliation(s)
- Stanley Sheft
- Department of Communication Disorders and Sciences, Rush University Medical Center, Chicago, IL
| | - Min-Yu Cheng
- Department of Communication Disorders and Sciences, Rush University Medical Center, Chicago, IL
| | - Valeriy Shafiro
- Department of Communication Disorders and Sciences, Rush University Medical Center, Chicago, IL
| |
Collapse
|
6
|
Brochier T, McDermott HJ, McKay CM. The effect of presentation level and stimulation rate on speech perception and modulation detection for cochlear implant users. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2017; 141:4097. [PMID: 28618807 PMCID: PMC5457292 DOI: 10.1121/1.4983658] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/06/2016] [Revised: 04/26/2017] [Accepted: 05/03/2017] [Indexed: 06/07/2023]
Abstract
In order to improve speech understanding for cochlear implant users, it is important to maximize the transmission of temporal information. The combined effects of stimulation rate and presentation level on temporal information transfer and speech understanding remain unclear. The present study systematically varied presentation level (60, 50, and 40 dBA) and stimulation rate [500 and 2400 pulses per second per electrode (pps)] in order to observe how the effect of rate on speech understanding changes for different presentation levels. Speech recognition in quiet and noise, and acoustic amplitude modulation detection thresholds (AMDTs) were measured with acoustic stimuli presented to speech processors via direct audio input (DAI). With the 500 pps processor, results showed significantly better performance for consonant-vowel nucleus-consonant words in quiet, and a reduced effect of noise on sentence recognition. However, no rate or level effect was found for AMDTs, perhaps partly because of amplitude compression in the sound processor. AMDTs were found to be strongly correlated with the effect of noise on sentence perception at low levels. These results indicate that AMDTs, at least when measured with the CP910 Freedom speech processor via DAI, explain between-subject variance of speech understanding, but do not explain within-subject variance for different rates and levels.
Collapse
Affiliation(s)
- Tim Brochier
- Department of Medical Bionics, University of Melbourne, Parkville, Victoria 3010, Australia
| | - Hugh J McDermott
- The Bionics Institute, 384-388 Albert Street, East Melbourne, Victoria 3002, Australia
| | - Colette M McKay
- The Bionics Institute, 384-388 Albert Street, East Melbourne, Victoria 3002, Australia
| |
Collapse
|
7
|
Galvin JJ, Oba SI, Başkent D, Chatterjee M, Fu QJ. Envelope Interactions in Multi-Channel Amplitude Modulation Frequency Discrimination by Cochlear Implant Users. PLoS One 2015; 10:e0139546. [PMID: 26431043 PMCID: PMC4592255 DOI: 10.1371/journal.pone.0139546] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2015] [Accepted: 09/15/2015] [Indexed: 11/29/2022] Open
Abstract
Rationale Previous cochlear implant (CI) studies have shown that single-channel amplitude modulation frequency discrimination (AMFD) can be improved when coherent modulation is delivered to additional channels. It is unclear whether the multi-channel advantage is due to increased loudness, multiple envelope representations, or to component channels with better temporal processing. Measuring envelope interference may shed light on how modulated channels can be combined. Methods In this study, multi-channel AMFD was measured in CI subjects using a 3-alternative forced-choice, non-adaptive procedure (“which interval is different?”). For the reference stimulus, the reference AM (100 Hz) was delivered to all 3 channels. For the probe stimulus, the target AM (101, 102, 104, 108, 116, 132, 164, 228, or 256 Hz) was delivered to 1 of 3 channels, and the reference AM (100 Hz) delivered to the other 2 channels. The spacing between electrodes was varied to be wide or narrow to test different degrees of channel interaction. Results Results showed that CI subjects were highly sensitive to interactions between the reference and target envelopes. However, performance was non-monotonic as a function of target AM frequency. For the wide spacing, there was significantly less envelope interaction when the target AM was delivered to the basal channel. For the narrow spacing, there was no effect of target AM channel. The present data were also compared to a related previous study in which the target AM was delivered to a single channel or to all 3 channels. AMFD was much better with multiple than with single channels whether the target AM was delivered to 1 of 3 or to all 3 channels. For very small differences between the reference and target AM frequencies (2–4 Hz), there was often greater sensitivity when the target AM was delivered to 1 of 3 channels versus all 3 channels, especially for narrowly spaced electrodes. Conclusions Besides the increased loudness, the present results also suggest that multiple envelope representations may contribute to the multi-channel advantage observed in previous AMFD studies. The different patterns of results for the wide and narrow spacing suggest a peripheral contribution to multi-channel temporal processing. Because the effect of target AM frequency was non-monotonic in this study, adaptive procedures may not be suitable to measure AMFD thresholds with interfering envelopes. Envelope interactions among multiple channels may be quite complex, depending on the envelope information presented to each channel and the relative independence of the stimulated channels.
Collapse
Affiliation(s)
- John J. Galvin
- Department of Head and Neck Surgery, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, California, United States of America
- Department of Otorhinolaryngology, Head and Neck Surgery, University Medical Center Groningen, University of Groningen, Groningen, The Netherlands
- Research School of Behavioral and Cognitive Neurosciences, Graduate School of Medical Sciences, University of Groningen, Groningen, The Netherlands
- * E-mail: (JG)
| | - Sandra I. Oba
- Department of Head and Neck Surgery, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, California, United States of America
| | - Deniz Başkent
- Department of Otorhinolaryngology, Head and Neck Surgery, University Medical Center Groningen, University of Groningen, Groningen, The Netherlands
- Research School of Behavioral and Cognitive Neurosciences, Graduate School of Medical Sciences, University of Groningen, Groningen, The Netherlands
| | - Monita Chatterjee
- Auditory Prostheses & Perception Lab, Boys Town National Research Hospital, Omaha, Nebraska, United States of America
| | - Qian-Jie Fu
- Department of Head and Neck Surgery, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, California, United States of America
| |
Collapse
|
8
|
Amplitude Modulation Detection and Speech Recognition in Late-Implanted Prelingually and Postlingually Deafened Cochlear Implant Users. Ear Hear 2015; 36:557-66. [DOI: 10.1097/aud.0000000000000162] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
9
|
Galvin JJ, Oba S, Fu QJ, Başkent D. Single- and multi-channel modulation detection in cochlear implant users. PLoS One 2014; 9:e99338. [PMID: 24918605 PMCID: PMC4053447 DOI: 10.1371/journal.pone.0099338] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2013] [Accepted: 05/14/2014] [Indexed: 11/18/2022] Open
Abstract
Single-channel modulation detection thresholds (MDTs) have been shown to predict cochlear implant (CI) users' speech performance. However, little is known about multi-channel modulation sensitivity. Two factors likely contribute to multichannel modulation sensitivity: multichannel loudness summation and the across-site variance in single-channel MDTs. In this study, single- and multi-channel MDTs were measured in 9 CI users at relatively low and high presentation levels and modulation frequencies. Single-channel MDTs were measured at widely spaced electrode locations, and these same channels were used for the multichannel stimuli. Multichannel MDTs were measured twice, with and without adjustment for multichannel loudness summation (i.e., at the same loudness as for the single-channel MDTs or louder). Results showed that the effect of presentation level and modulation frequency were similar for single- and multi-channel MDTs. Multichannel MDTs were significantly poorer than single-channel MDTs when the current levels of the multichannel stimuli were reduced to match the loudness of the single-channel stimuli. This suggests that, at equal loudness, single-channel measures may over-estimate CI users' multichannel modulation sensitivity. At equal loudness, there was no significant correlation between the amount of multichannel loudness summation and the deficit in multichannel MDTs, relative to the average single-channel MDT. With no loudness compensation, multichannel MDTs were significantly better than the best single-channel MDT. The across-site variance in single-channel MDTs varied substantially across subjects. However, the across-site variance was not correlated with the multichannel advantage over the best single channel. This suggests that CI listeners combined envelope information across channels instead of attending to the best channel.
Collapse
Affiliation(s)
- John J. Galvin
- Division of Communication and Auditory Neuroscience, House Research Institute, Los Angeles, California, United States of America
- Department of Head and Neck Surgery, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, California, United States of America
- Department of Otorhinolaryngology, Head and Neck Surgery, University Medical Center Groningen, University of Groningen, Groningen, The Netherlands
- Research School of Behavioral and Cognitive Neurosciences, Graduate School of Medical Sciences, University of Groningen, Groningen, The Netherlands
- * E-mail:
| | - Sandy Oba
- Division of Communication and Auditory Neuroscience, House Research Institute, Los Angeles, California, United States of America
- Department of Head and Neck Surgery, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, California, United States of America
| | - Qian-Jie Fu
- Division of Communication and Auditory Neuroscience, House Research Institute, Los Angeles, California, United States of America
- Department of Head and Neck Surgery, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, California, United States of America
| | - Deniz Başkent
- Department of Otorhinolaryngology, Head and Neck Surgery, University Medical Center Groningen, University of Groningen, Groningen, The Netherlands
- Research School of Behavioral and Cognitive Neurosciences, Graduate School of Medical Sciences, University of Groningen, Groningen, The Netherlands
| |
Collapse
|