701
|
Iglehart F. Speech Perception by Students With Cochlear Implants Using Sound-Field Systems in Classrooms. Am J Audiol 2004; 13:62-72. [PMID: 15248805 DOI: 10.1044/1059-0889(2004/009)] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022] Open
Abstract
Cochlear implants support deaf students' language development through the improved use of audition in the classroom. Unfortunately, the acoustics of typical classrooms greatly reduce auditory speech perception by these students. Sound-field systems can increase speech-to-noise ratios in classrooms and thus improve use of audition. These systems are used by 80% of students with cochlear implants who use an FM system in the classroom. The present study compares speech perception by 14 school-age cochlear implant recipients via 2 classroom sound-field systems, 1 wall-mounted and the other a personal, or desktop, system. Testing was conducted in 2 classroom environments, 1 noisy and reverberant (typical of many classrooms) and the other ideally quiet with reverberation of short duration. In the quiet room with low reverberation, both sound-field systems produced improved phoneme recognition, but there was no difference between the 2. In the noisy room with high reverberation, the sound-field benefits were greater, and the desktop systems provided more benefit than the wall-mounted systems.
Collapse
|
702
|
Fu QJ, Chinchilla S, Galvin JJ. The role of spectral and temporal cues in voice gender discrimination by normal-hearing listeners and cochlear implant users. J Assoc Res Otolaryngol 2004; 5:253-60. [PMID: 15492884 PMCID: PMC2504551 DOI: 10.1007/s10162-004-4046-1] [Citation(s) in RCA: 87] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2003] [Accepted: 03/08/2004] [Indexed: 10/26/2022] Open
Abstract
The present study investigated the relative importance of temporal and spectral cues in voice gender discrimination and vowel recognition by normal-hearing subjects listening to an acoustic simulation of cochlear implant speech processing and by cochlear implant users. In the simulation, the number of speech processing channels ranged from 4 to 32, thereby varying the spectral resolution; the cutoff frequencies of the channels' envelope filters ranged from 20 to 320 Hz, thereby manipulating the available temporal cues. For normal-hearing subjects, results showed that both voice gender discrimination and vowel recognition scores improved as the number of spectral channels was increased. When only 4 spectral channels were available, voice gender discrimination significantly improved as the envelope filter cutoff frequency was increased from 20 to 320 Hz. For all spectral conditions, increasing the amount of temporal information had no significant effect on vowel recognition. Both voice gender discrimination and vowel recognition scores were highly variable among implant users. The performance of cochlear implant listeners was similar to that of normal-hearing subjects listening to comparable speech processing (4-8 spectral channels). The results suggest that both spectral and temporal cues contribute to voice gender discrimination and that temporal cues are especially important for cochlear implant users to identify the voice gender when there is reduced spectral resolution.
Collapse
Affiliation(s)
- Qian-Jie Fu
- Department of Auditory Implants and Perceptions, House Ear Institute, Los Angeles, CA 90057, USA.
| | | | | |
Collapse
|
703
|
Turner CW, Gantz BJ, Vidal C, Behrens A, Henry BA. Speech recognition in noise for cochlear implant listeners: benefits of residual acoustic hearing. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2004; 115:1729-35. [PMID: 15101651 DOI: 10.1121/1.1687425] [Citation(s) in RCA: 278] [Impact Index Per Article: 13.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/08/2023]
Abstract
The purpose of this study was to explore the potential advantages, both theoretical and applied, of preserving low-frequency acoustic hearing in cochlear implant patients. Several hypotheses are presented that predict that residual low-frequency acoustic hearing along with electric stimulation for high frequencies will provide an advantage over traditional long-electrode cochlear implants for the recognition of speech in competing backgrounds. A simulation experiment in normal-hearing subjects demonstrated a clear advantage for preserving low-frequency residual acoustic hearing for speech recognition in a background of other talkers, but not in steady noise. Three subjects with an implanted "short-electrode" cochlear implant and preserved low-frequency acoustic hearing were also tested on speech recognition in the same competing backgrounds and compared to a larger group of traditional cochlear implant users. Each of the three short-electrode subjects performed better than any of the traditional long-electrode implant subjects for speech recognition in a background of other talkers, but not in steady noise, in general agreement with the simulation studies. When compared to a subgroup of traditional implant users matched according to speech recognition ability in quiet, the short-electrode patients showed a 9-dB advantage in the multitalker background. These experiments provide strong preliminary support for retaining residual low-frequency acoustic hearing in cochlear implant patients. The results are consistent with the idea that better perception of voice pitch, which can aid in separating voices in a background of other talkers, was responsible for this advantage.
Collapse
Affiliation(s)
- Christopher W Turner
- Dept. of Speech Pathology and Audiology, Dept. of Otolaryngology Head and Neck Surgery, University of Iowa, Iowa City, Iowa 52242, USA
| | | | | | | | | |
Collapse
|
704
|
Abstract
OBJECTIVE The first specific aim of the present study is to compare the ability of normal-hearing and cochlear implant listeners to use temporal cues in three music perception tasks: tempo discrimination, rhythmic pattern identification, and melody identification. The second aim is to identify the relative contribution of temporal and spectral cues to melody recognition in acoustic and electric hearing. DESIGN Both normal-hearing and cochlear implant listeners participated in the experiments. Tempo discrimination was measured in a two-interval forced-choice procedure in which subjects were asked to choose the faster tempo at four standard tempo conditions (60, 80, 100, and 120 beats per minute). For rhythmic pattern identification, seven different rhythmic patterns were created and subjects were asked to read and choose the musical notation displayed on the screen that corresponded to the rhythmic pattern presented. Melody identification was evaluated with two sets of 12 familiar melodies. One set contained both rhythm and melody information (rhythm condition), whereas the other set contained only melody information (no-rhythm condition). Melody stimuli were also processed to extract the slowly varying temporal envelope from 1, 2, 4, 8, 16, 32, and 64 frequency bands, to create cochlear implant simulations. Subjects listened to a melody and had to respond by choosing one of the 12 names corresponding to the melodies displayed on a computer screen. RESULTS In tempo discrimination, the cochlear implant listeners performed similarly to the normal-hearing listeners with rate discrimination difference limens obtained at 4-6 beats per minute. In rhythmic pattern identification, the cochlear implant listeners performed 5-25 percentage points poorer than the normal-hearing listeners. The normal-hearing listeners achieved perfect scores in melody identification with and without the rhythmic cues. However, the cochlear implant listeners performed significantly poorer than the normal-hearing listeners in both rhythm and no-rhythm conditions. The simulation results from normal-hearing listeners showed a relatively high level of performance for all numbers of frequency bands in the rhythm condition but required as many as 32 bands in the no-rhythm condition. CONCLUSIONS Cochlear-implant listeners performed normally in tempo discrimination, but significantly poorer than normal-hearing listeners in rhythmic pattern identification and melody recognition. While both temporal (rhythmic) and spectral (pitch) cues contribute to melody recognition, cochlear-implant listeners mostly relied on the rhythmic cues for melody recognition. Without the rhythmic cues, high spectral resolution with as many as 32 bands was needed for melody recognition for normal-hearing listeners. This result indicates that the present cochlear implants provide sufficient spectral cues to support speech recognition in quiet, but they are not adequate to support music perception. Increasing the number of functional channels and improved encoding of the fine structure information are necessary to improve music perception for cochlear implant listeners.
Collapse
Affiliation(s)
- Ying-Yee Kong
- Department of Cognitive Sciences, University of California, Irvine, 92697, USA.
| | | | | | | |
Collapse
|
705
|
Deeks JM, Carlyon RP. Simulations of cochlear implant hearing using filtered harmonic complexes: implications for concurrent sound segregation. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2004; 115:1736-1746. [PMID: 15101652 DOI: 10.1121/1.1675814] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
Two experiments used simulations of cochlear implant hearing to investigate the use of temporal codes in speech segregation. Sentences were filtered into six bands, and their envelopes used to modulate filtered alternating-phase harmonic complexes with rates of 80 or 140 pps. Experiment 1 showed that identification of single sentences was better for the higher rate. In experiment 2, maskers (time-reversed concatenated sentences) were scaled by -9 dB relative to a target sentence, which was added with an offset of 1.2 s. When the target and masker were each processed on all six channels, and then summed, processing the masker on a different rate to the target improved performance only when the target rate was 140 pps. When the target sentence was processed on the odd-numbered channels and the masker on the even-numbered channels, or vice versa, performance was worse overall, but showed similar effects of pulse rate. The results, combined with recent psychophysical evidence, suggest that differences in pulse rate are unlikely to prove useful for concurrent sound segregation.
Collapse
Affiliation(s)
- John M Deeks
- MRC Cognition and Brain Sciences Unit, 15 Chaucer Road, Cambridge CB2 2EF, United Kingdom
| | | |
Collapse
|
706
|
Abstract
The history of cochlear implants is marked by large improvements in performance, especially over the past two decades and especially due to the development of ever-better processing strategies. Although the progress to date has been substantial, present devices still do not restore normal speech reception, even for top performers and particularly for listening to speech in competition with noise or other talkers. In addition, a wide range of outcomes persists, with some patients receiving little benefit using the same devices that support high levels of speech reception for others. The purpose of this review is to describe some likely possibilities for further improvement, including (a) combined electric and acoustic stimulation of the auditory system for patients with significant residual hearing, (b) use of bilateral implants, (c) a closer replication with implants of the processing steps in the normal cochlea, and (d) applications of knowledge about factors that are correlated with outcomes to help patients presently at the low end of the performance scale.
Collapse
Affiliation(s)
- Blake S Wilson
- Center for Auditory Prosthesis Research, Research Triangle Institute, Research Triangle Park, North Carolina 27709, USA.
| | | | | | | | | |
Collapse
|
707
|
Kuchta J. Neuroprosthetic Hearing with Auditory Brainstem Implants / Wiederherstellung des Hörens durch auditorische Hirnstammimplantate. BIOMED ENG-BIOMED TE 2004; 49:83-7. [PMID: 15171587 DOI: 10.1515/bmt.2004.017] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
The auditory brainstem implant (ABI) does provide auditory sensations, recognition of environmental sounds and aid in spoken communication in about 300 patients worldwide. It is no more an investigative device but widely accepted for the treatment of patients who have lost hearing due to bilateral tumors of the hearing nerve who transmits the acoustic information from the cochlea to the brain. Most of the implanted patients are completely deaf when the implant is switched off. In contrast to cochlear implants, only few of the implanted patients achieve open-set speech recognition without the help of visual cues. On average, the ABI improves communicative functions like speech recognition at about 30% when compared to lip-reading only. The task for the next years is to improve the outcome of ABI further by developing new less invasive operative approaches as well as new hardware and software for the ABI device.
Collapse
Affiliation(s)
- J Kuchta
- Department of Neurosurgery, Cologne University Hospital, Germany.
| |
Collapse
|
708
|
Speech Perception with Cochlear Implants. COCHLEAR IMPLANTS: AUDITORY PROSTHESES AND ELECTRIC HEARING 2004. [DOI: 10.1007/978-0-387-22585-2_8] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
|
709
|
Koch DB, Osberger MJ, Segel P, Kessler D. HiResolution TM and Conventional Sound Processing in the HiResolution TM Bionic Ear: Using Appropriate Outcome Measures to Assess Speech Recognition Ability. Audiol Neurootol 2004; 9:214-23. [PMID: 15205549 DOI: 10.1159/000078391] [Citation(s) in RCA: 68] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2003] [Accepted: 01/15/2004] [Indexed: 11/19/2022] Open
Abstract
OBJECTIVE This study compared speech perception benefits in adults implanted with the HiResolution (HiRes) Bionic Ear who used both conventional and HiRes sound processing. A battery of speech tests was used to determine which formats were most appropriate for documenting the wide range of benefit experienced by cochlear-implant users. STUDY DESIGN A repeated-measures design was used to assess postimplantation speech perception in adults who received the HiResolution Bionic Ear in a recent clinical trial. Patients were fit first with conventional strategies and assessed after 3 months of use. Patients were then switched to HiRes sound processing and assessed again after 3 months of use. To assess the immediate effect of HiRes sound processing on speech perception performance, consonant recognition testing was performed in a subset of patients after 3 days of HiRes use and compared with their 3-month performance with conventional processing. SETTING Subjects were implanted and evaluated at 19 cochlear implant programs in the USA and Canada affiliated primarily with tertiary medical centers. PATIENTS Patients were 51 postlinguistically deafened adults. MAIN OUTCOME MEASURES Speech perception was assessed using CNC monosyllabic words, CID sentences and HINT sentences in quiet and noise. Consonant recognition testing was also administered to a subset of patients (n = 30) using the Iowa Consonant Test presented in quiet and noise. All patients completed a strategy preference questionnaire after 6 months of device use. RESULTS Consonant identification in quiet and noise improved significantly after only 3 days of HiRes use. The mean improvement from conventional to HiRes processing was significant on all speech perception tests. The largest differences occurred for the HINT sentences in noise. Ninety-six percent of the patients preferred HiRes to conventional sound processing. Ceiling effects occurred for both sentence tests in quiet. CONCLUSIONS Although most patients improved after switching to HiRes sound processing, the greatest differences were seen in the 'poor' performers because 'good' performers often reached ceiling performance, especially on tests in quiet. Future evaluations of cochlear-implant benefit should make use of more difficult measures, especially for 'good' users. Nonetheless, a range of difficulty must remain in test materials to document benefit in the entire population of implant recipients.
Collapse
|
710
|
Kuchta J, Otto SR, Shannon RV, Hitselberger WE, Brackmann DE. The multichannel auditory brainstem implant: how many electrodes make sense? J Neurosurg 2004; 100:16-23. [PMID: 14743907 DOI: 10.3171/jns.2004.100.1.0016] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
Object. Development of multichannel auditory brainstem implant (ABI) systems has been based in part on the assumption that audiological outcome can be optimized by increasing the number of available electrodes. In this paper the authors critically analyze this assumption on the basis of a retrospective clinical study performed using the Nucleus 22 ABI surface electrode array.
Methods. The perceptual performances of 61 patients with neurofibromatosis Type 2 were tested approximately 6 weeks after an eight-electrode ABI had been implanted. Of eight implanted electrodes 5.57 ± 2.57 (mean ± standard deviation [SD] provided auditory sensations when stimulated. Electrodes were deactivated when stimulation resulted in significant nonauditory side effects or no auditory sensation at all, and also when they failed to provide distinctive pitch sensations. The mean (± SD) scores for patients with ABIs were the following: sound-only consonant recognition, 20.4 ± 14.3 (range 0–65%); vowel recognition, 28.8 ± 18% (range 0–67%); Monosyllable Trochee Spondee (MTS) word recognition 41.1 ± 25.3% (range 0–100%); and sentence recognition, 5.3 ± 11.4% (range 0–64%). Performance in patients in whom between one and three electrodes provided auditory sensation was significantly poorer than that in patients with between four and eight functional electrodes in the vowel, MTS word, and City University of New York (CUNY) sentence recognition tests. The correlation between performance and electrode number did not reach the 0.05 level of significance with respect to the sound effect, consonant, and MTS stress-pattern recognition tests, probably because a satisfactory performance in these tests can be obtained only with temporal cues, that is, without any information about the frequency of the sounds. In the MTS word and the CUNY sentence recognition tests, performance was optimal in the patients with eight functional electrodes. Although all top performers had more than three functional auditory electrodes, no further improvement (asymptotic performance) was seen in those with five or more active electrodes in the consonant, vowel, and sound effect recognition tests.
Conclusions. A minimum of three spectral channels, programmed in the appropriate individual tonotopic order seem to be required for satisfactory speech recognition in most patients with ABI. Due to the limited access to the tonotopic frequency gradient of the cochlear nucleus with surface stimulation, patients with ABI do not receive a wide range of spectral cues (frequency information) with multielectrode (> 5) surface arrays.
Collapse
Affiliation(s)
- Johannes Kuchta
- Department of Neurosurgery, Cologne University, Cologne, Germany.
| | | | | | | | | |
Collapse
|
711
|
Xu L, Pfingst BE. Relative importance of temporal envelope and fine structure in lexical-tone perception. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2003; 114:3024-7. [PMID: 14714781 PMCID: PMC1283139 DOI: 10.1121/1.1623786] [Citation(s) in RCA: 100] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/20/2023]
Abstract
The relative importance of temporal envelope and fine structure in speech and music perception was investigated by Smith et al. [Nature (London) 416, 87–90 (2002)] using "auditory chimera" in which the envelope from one sound was paired with the fine structure of another. Smith et al. found that, when 4 to 16 frequency bands were used, recognition of English speech was dominated by the envelope, whereas recognition of melody was dominated by the fine structure. In the present study, Mandarin Chinese monosyllables were divided into 4, 8, or 16 frequency bands and the fine structure and envelope of one tone pattern were exchanged with those of another tone pattern of the same monosyllable. Five normal-hearing native Mandarin Chinese speakers completed a four-alternative forced-choice tone-identification task. In the vast majority of trials, subjects based their identification of the monosyllables on the fine structure rather than the envelope. Thus, the relative importance of envelope and fine structure for lexical-tone perception resembled that for melody recognition rather than that for English speech recognition. Delivering fine-structure information in cochlear implant stimulation could be particularly beneficial for lexical-tone perception.
Collapse
|
712
|
Pfingst BE, Xu L. Across-site variation in detection thresholds and maximum comfortable loudness levels for cochlear implants. J Assoc Res Otolaryngol 2003; 5:11-24. [PMID: 14605920 PMCID: PMC2538372 DOI: 10.1007/s10162-003-3051-0] [Citation(s) in RCA: 48] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2002] [Accepted: 07/14/2003] [Indexed: 10/26/2022] Open
Abstract
In cochlear implants, variation across stimulation sites in psychophysical detection thresholds (T levels) and maximum comfortable loudness levels (C levels) can be large when narrow-bipolar (BP) stimulation is used. This across-site variation is typically smaller when monopolar (MP) stimulation is used. At least two models can account for across-site variation and the effects of electrode configuration on the magnitude of the variation. According to one model, across-site variation reflects site-to-site differences in the distances between the stimulating electrodes and the sites of action-potential initiation. Under this model, the lower across-site variation with MP stimulation is due to shallower current versus distance gradients. An alternative model assumes that T and C levels depend on integration of activity across the whole population of neurons and that MP stimulation activates neurons over a larger spatial extent than does BP stimulation. If T and C levels are determined by integration of activity across large overlapping populations of neurons, then their values at adjacent sites should be more similar than if these levels result from integration across smaller, more independent populations. We tested the models by examining the effects on across-site variation of three variables believed to affect the spatial extent of activation: electrode configuration, stimulus level within the dynamic range, and electrode-array design. T levels and C levels were measured in 13 subjects with Nucleus CI24M (straight array) and 9 subjects with Nucleus CI24R(CS) (Contour) cochlear implants using bipolar (BP) and monopolar (MP) electrode configurations. Site-to-site variation in T and C levels for BP stimulation was 2.1-3.3 times larger than that for MP stimulation. Contrary to the across-neuron integration hypothesis, no significant differences were found between across-site variation for T levels and that for C levels for the BP configuration. There was considerable overlap in site-to-site variation values for the two types of implants but mean site-to-site variation in C levels for CI24M implants was significantly lower than that for CI24R(CS) implants. Control studies suggested that these results were not an artifact of the scale, and not due to differences in inherent variability of the psychophysical measures, or to the method of quantifying across-site variation.
Collapse
Affiliation(s)
- Bryan E Pfingst
- Kresge Hearing Research Institute, Department of Otolaryngology, University of Michigan, Ann Arbor, MI 48109-0506, USA.
| | | |
Collapse
|
713
|
Rubinstein JT, Hong R. Signal coding in cochlear implants: exploiting stochastic effects of electrical stimulation. Ann Otol Rhinol Laryngol 2003; 191:14-9. [PMID: 14533839 DOI: 10.1177/00034894031120s904] [Citation(s) in RCA: 54] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Speech perception in quiet with cochlear implants has increased substantially over the past 17 years. If current trends continue, average monosyllabic word scores will be nearly 80% by 2010. These improvements are due to enhancements in speech processing strategies, to the implantation of patients with more residual hearing and shorter durations of deafness, and to unknown causes. Despite these improvements, speech perception in noise and music perception are still poor in most implant patients. These deficits may be partly due to poor representation of temporal fine structure by current speech processing strategies. It may be possible to improve both this representation and the dynamic range of electrical stimulation through the exploitation of stochastic effects produced by high-rate (eg, 5-kilopulse-per-second) pulse trains. Both the loudness growth and the dynamic range of low-frequency sinusoids have been enhanced via this technique. A laboratory speech processor using this strategy is under development. Although the clinical programming for such an algorithm is likely to be complex, some guidelines for the psychophysical and electrophysiological techniques necessary can be described now.
Collapse
Affiliation(s)
- Jay T Rubinstein
- Department of Otolaryngology, University of Iowa, Iowa City, Iowa 52242, USA
| | | |
Collapse
|
714
|
Bierer JA, Middlebrooks JC. Cortical responses to cochlear implant stimulation: channel interactions. J Assoc Res Otolaryngol 2003; 5:32-48. [PMID: 14564662 PMCID: PMC2538368 DOI: 10.1007/s10162-003-3057-7] [Citation(s) in RCA: 54] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2002] [Accepted: 07/28/2003] [Indexed: 11/25/2022] Open
Abstract
This study examined the interactions between electrical stimuli presented through two channels of a cochlear implant. Experiments were conducted in anesthetized guinea pigs. Multiunit spike activity recorded from the auditory cortex reflected the cumulative effects of electric field interactions in the cochlea as well as any neural interactions along the ascending auditory pathway. The cochlea was stimulated electrically through a 6-electrode intracochlear array. The stimulus on each channel was a single 80- micro s/phase biphasic pulse. Channel interactions were quantified as changes in the thresholds for elevation of cortical spike rates. Experimental parameters were interchannel temporal offset (0 to +/-2000 micro s), interelectrode cochlear spacing (1.5 or 2.25 mm), electrode configuration (monopolar, bipolar, or tripolar), and relative polarity between channels (same or inverted). In most conditions, presentation of a subthreshold pulse on one channel reduced the threshold for a pulse on a second channel. Threshold shifts were greatest for simultaneous pulses, but appreciable threshold reductions could persist for temporal offsets up to 640 micro s. Channel interactions varied strongly with electrode configuration: threshold shifts increased in magnitude in the order tripolar, bipolar, monopolar. Channel interactions were greater for closer electrode spacing. The results have implications for design of speech processors for cochlear implants.
Collapse
Affiliation(s)
- Julie Arenberg Bierer
- Kresge Hearing Research Institute, Department of Otorhinolaryngology, University of Michigan, Ann Arbor, MI 48109-0506, USA
| | - John C. Middlebrooks
- Kresge Hearing Research Institute, Department of Otorhinolaryngology, University of Michigan, Ann Arbor, MI 48109-0506, USA
| |
Collapse
|
715
|
Boëx C, de Balthasar C, Kós MI, Pelizzone M. Electrical field interactions in different cochlear implant systems. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2003; 114:2049-57. [PMID: 14587604 DOI: 10.1121/1.1610451] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/10/2023]
Abstract
The goal of this study was to evaluate electrical field interactions produced by the stimulation of different types of intracochlear electrodes in 12 adult subjects (three Ineraid, four Clarion S-Series, three S-Series with the electrode positioning system-EPS and two Clarion HiFocus-I with the EPS). Psychophysical measurements were conducted with biphasic stimuli (813 pulse per second, 153.8 micros/phase). "Perturbation" signals (300 ms) were applied to one electrode chosen at the middle of the array and their effects on detection thresholds of "probe" signals (30 ms) were measured on the neighbor basal electrode. Perturbation levels were set below the detection threshold of the perturbation electrode (-2 dB re threshold). Measurements were first conducted for simultaneous stimulation of the probe and of the perturbation electrodes, for monopolar for all subjects and for bipolar stimulus configurations for both Clarion HiFocus-I subjects. The tested Clarion electrodes did not present lower monopolar interactions than the Ineraid electrodes. Nevertheless, considering the shorter distance between electrodes for the Clarion than for the Ineraid, the tested Clarion electrodes might be more selective than the Ineraid. We did not find any significant monopolar electrical field-interaction differences between subjects who received the S-Series array with and without the EPS. We did not find lower interactions for both subjects who received the HiFocus-I array than for subjects who received the S-Series. Electrical field interactions were lower for bipolar than for monopolar configurations for both HiFocus-I subjects. A second set of measurements was conducted for nonsimultaneous stimulation similar to the one used in continuous interleaved sampling sound strategy. These measurements showed that interactions evaluated for simultaneous biphasic stimuli were larger than for nonsimultaneous stimulation.
Collapse
Affiliation(s)
- Colette Boëx
- Centre Romand d'Implants Cochléaires, University Hospital, CH-1211 Geneva 14, Switzerland.
| | | | | | | |
Collapse
|
716
|
Stone MA, Moore BCJ. Effect of the speed of a single-channel dynamic range compressor on intelligibility in a competing speech task. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2003; 114:1023-1034. [PMID: 12942981 DOI: 10.1121/1.1592160] [Citation(s) in RCA: 39] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
Using a "noise-vocoder" cochlear implant simulator [Shannon et al., Science 270, 303-304 (1995)], the effect of the speed of dynamic range compression on speech intelligibility was assessed, using normal-hearing subjects. The target speech had a level 5 dB above that of the competing speech. Initially, baseline performance was measured with no compression active, using between 4 and 16 processing channels. Then, performance was measured using a fast-acting compressor and a slow-acting compressor, each operating prior to the vocoder simulation. The fast system produced significant gain variation over syllabic timescales. The slow system produced significant gain variation only over the timescale of sentences. With no compression active, about six channels were necessary to achieve 50% correct identification of words in sentences. Sixteen channels produced near-maximum performance. Slow-acting compression produced no significant degradation relative to the baseline. However, fast-acting compression consistently reduced performance relative to that for the baseline, over a wide range of performance levels. It is suggested that fast-acting compression degrades performance for two reasons: (1) because it introduces correlated fluctuations in amplitude in different frequency bands, which tends to produce perceptual fusion of the target and background sounds and (2) because it reduces amplitude modulation depth and intensity contrasts.
Collapse
Affiliation(s)
- Michael A Stone
- Department of Experimental Psychology, University of Cambridge, Downing Street, Cambridge CB2 3EB, England.
| | | |
Collapse
|
717
|
Qin MK, Oxenham AJ. Effects of simulated cochlear-implant processing on speech reception in fluctuating maskers. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2003; 114:446-54. [PMID: 12880055 DOI: 10.1121/1.1579009] [Citation(s) in RCA: 279] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/22/2023]
Abstract
This study investigated the effects of simulated cochlear-implant processing on speech reception in a variety of complex masking situations. Speech recognition was measured as a function of target-to-masker ratio, processing condition (4, 8, 24 channels, and unprocessed) and masker type (speech-shaped noise, amplitude-modulated speech-shaped noise, single male talker, and single female talker). The results showed that simulated implant processing was more detrimental to speech reception in fluctuating interference than in steady-state noise. Performance in the 24-channel processing condition was substantially poorer than in the unprocessed condition, despite the comparable representation of the spectral envelope. The detrimental effects of simulated implant processing in fluctuating maskers, even with large numbers of channels, may be due to the reduction in the pitch cues used in sound source segregation, which are normally carried by the peripherally resolved low-frequency harmonics and the temporal fine structure. The results suggest that using steady-state noise to test speech intelligibility may underestimate the difficulties experienced by cochlear-implant users in fluctuating acoustic backgrounds.
Collapse
Affiliation(s)
- Michael K Qin
- Research Laboratory of Electronics, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA.
| | | |
Collapse
|
718
|
Hamzavi J, Baumgartner WD, Pok SM, Franz P, Gstoettner W. Variables affecting speech perception in postlingually deaf adults following cochlear implantation. Acta Otolaryngol 2003; 123:493-8. [PMID: 12797584 DOI: 10.1080/0036554021000028120] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
Abstract
OBJECTIVE To evaluate the time span over which there is greatest improvement in postlingually deaf adults undergoing cochlear implantation. Additionally, to quantify potential patient and device factors that may predict the postoperative results. MATERIAL AND METHODS A longitudinal study was conducted. Numbers, monosyllables and sentence test results were collected for 66 cochlear implant subjects [Combi 40/40 +, n = 60; Clarion HF2, n = 2; Nucleus 24m/k, n = 4] at regular intervals for up to 6 years following cochlear implantation. RESULTS All patients showed a steady improvement over time on all tests. Progress during the first 12 months was statistically significant, with further improvements being recorded after the 12-month testing period. The duration of deafness and the number of electrodes (8 for the Combi 40, 12 for the Combi 40 + ) appeared to be weakly correlated with postoperative performance. Re-implantation after device failure had no negative effect on speech reception. Subjects who were "upgraded" from an analogue to a digital cochlear implant improved their test results almost twofold. CONCLUSION All the patients in our study gained substantial benefit from their cochlear implants. It is encouraging to note that the factors examined were not deemed to be relevant predictors of performance. Even long-term deaf subjects and re-implantees are able to achieve an excellent level of speech perception.
Collapse
Affiliation(s)
- Jafar Hamzavi
- Department of Otorhinolaryngology, University of Vienna, Medical School, Vienna, Austria.
| | | | | | | | | |
Collapse
|
719
|
Henry BA, Turner CW. The resolution of complex spectral patterns by cochlear implant and normal-hearing listeners. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2003; 113:2861-2873. [PMID: 12765402 DOI: 10.1121/1.1561900] [Citation(s) in RCA: 150] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
The differences in spectral shape resolution abilities among cochlear implant (CI) listeners, and between CI and normal-hearing (NH) listeners, when listening with the same number of channels (12), was investigated. In addition, the effect of the number of channels on spectral shape resolution was examined. The stimuli were rippled noise signals with various ripple frequency-spacings. An adaptive 41FC procedure was used to determine the threshold for resolvable ripple spacing, which was the spacing at which an interchange in peak and valley positions could be discriminated. The results showed poorer spectral shape resolution in CI compared to NH listeners (average thresholds of approximately 3000 and 400 Hz, respectively), and wide variability among CI listeners (range of approximately 800 to 8000 Hz). There was a significant relationship between spectral shape resolution and vowel recognition. The spectral shape resolution thresholds of NH listeners increased as the number of channels increased from 1 to 16, while the CI listeners showed a performance plateau at 4-6 channels, which is consistent with previous results using speech recognition measures. These results indicate that this test may provide a measure of CI performance which is time efficient and non-linguistic, and therefore, if verified, may provide a useful contribution to the prediction of speech perception in adults and children who use CIs.
Collapse
Affiliation(s)
- Belinda A Henry
- Department of Speech Pathology and Audiology, The University of Iowa, Iowa City, Iowa 52242, USA.
| | | |
Collapse
|
720
|
Moore BCJ. Coding of sounds in the auditory system and its relevance to signal processing and coding in cochlear implants. Otol Neurotol 2003; 24:243-54. [PMID: 12621339 DOI: 10.1097/00129492-200303000-00019] [Citation(s) in RCA: 123] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
OBJECTIVE To review how the properties of sounds are "coded" in the normal auditory system and to discuss the extent to which cochlear implants can and do represent these codes. DATA SOURCES Data are taken from published studies of the response of the cochlea and auditory nerve to simple and complex stimuli, in both the normal and the electrically stimulated ear. REVIEW CONTENT: The review describes: 1) the coding in the normal auditory system of overall level (which partly determines perceived loudness), spectral shape (which partly determines perceived timbre and the identity of speech sounds), periodicity (which partly determines pitch), and sound location; 2) the role of the active mechanism in the cochlea, and particularly the fast-acting compression associated with that mechanism; 3) the neural response patterns evoked by cochlear implants; and 4) how the response patterns evoked by implants differ from those observed in the normal auditory system in response to sound. A series of specific issues is then discussed, including: 1) how to compensate for the loss of cochlear compression; 2) the effective number of independent channels in a normal ear and in cochlear implantees; 3) the importance of independence of responses across neurons; 4) the stochastic nature of normal neural responses; 5) the possible role of across-channel coincidence detection; and 6) potential benefits of binaural implantation. CONCLUSIONS Current cochlear implants do not adequately reproduce several aspects of the neural coding of sound in the normal auditory system. Improved electrode arrays and coding systems may lead to improved coding and, it is hoped, to better performance.
Collapse
Affiliation(s)
- Brian C J Moore
- Department of Experimental Psychology, University of Cambridge, Downing Street, Cambridge CB2 3EB, England.
| |
Collapse
|
721
|
Frijns JHM, Klop WMC, Bonnet RM, Briaire JJ. Optimizing the number of electrodes with high-rate stimulation of the clarion CII cochlear implant. Acta Otolaryngol 2003; 123:138-42. [PMID: 12701728 DOI: 10.1080/0036554021000028126] [Citation(s) in RCA: 39] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
Abstract
OBJECTIVE This blind crossover study evaluates the effect of the number of electrodes of the Clarion CII cochlear implant on speech perception in silence and in noise using a "high-rate" continuous interleaved sampling (CIS) strategy. MATERIAL AND METHODS Nine users of this implant with 3-11 months of experience of an 8-channel CIS strategy [833 pulses per second (pps)/channel, 75 micros/phase] were fitted in a random order with 8-, 12- and 16-channel CIS strategies (+/- 1,400 pps/channel, 21 micros/phase). After 1 month of exclusive use of each strategy the performance was tested with consonant-vowel-consonant words in silence (sound only) and in speech-shaped background noise with signal-to-noise ratios (SNRs) of + 10, + 5, 0 and -5 dB. RESULTS With "high-rate" strategies most patients' speech understanding in noise improved, although the optimum number of electrodes was highly variable. Generally, faster performers benefited from more active electrodes, whilst slower performers deteriorated. If each patient's optimal strategy was determined by a weighted sum of the test results at +10, + 5 and 0 dB SNR, the average phoneme score improved from 57% to 72% at a SNR of + 5 dB, and from 46% to 56% at a SNR of 0 dB. The average phoneme score in silence was approximately 85% for all strategies. CONCLUSION We conclude that speech perception (especially in noise) can improve significantly with "high-rate" speech processing strategies, provided that the optimum number of electrodes is determined for each patient individually.
Collapse
Affiliation(s)
- Johan H M Frijns
- ENT Department, Leiden University Medical Centre, Leiden, The Netherlands.
| | | | | | | |
Collapse
|
722
|
Garnham C, O'Driscoll M, Ramsden And R, Saeed S. Speech understanding in noise with a Med-El COMBI 40+ cochlear implant using reduced channel sets. Ear Hear 2002; 23:540-52. [PMID: 12476091 DOI: 10.1097/00003446-200212000-00005] [Citation(s) in RCA: 60] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
OBJECTIVE The objective of the investigation described in this paper was the determination of the number of (widely spaced) active electrodes needed for users of a COMBI 40+ cochlear implant to achieve asymptotic performance in the recognition of speech against a background of wideband noise. DESIGN This study measured the performance in speech tests of patients using the Med-El implementation of continuous interleaved sampling with widely spaced electrode pair subsets of 2, 3, 4, 6, 8, and 10 out of a possible maximum of 12. An eight-vowel test, a 16-consonant test, and BKB sentences were presented against a background of pink noise. Additionally, AB monosyllabic words were presented both in quiet and in noise to processors with 6, 8, and 11 widely spaced electrodes. 11 subjects participated in the study. RESULTS Using moderate signal-to-noise ratios, for these patients the curve relating percentage score to increasing numbers of active channels approached an asymptote before the 10-channel data point was reached. Asymptotic performance was achieved using four channels for consonants, and eight channels for sentences. Understanding of monosyllabic words reached a maximum value at a similar number of channels for both quiet conditions and against a background of pink noise, and the mean increase in test score between 6 and 11 channels was only 7%. CONCLUSIONS These results are similar to those of previous experiments carried out in quiet listening conditions. The data suggest that 12 frequency channels (the number implemented by the COMBI 40+ cochlear implant) are more than adequate for users to achieve asymptotic performance levels in clinical speech tests applied in the presence of wideband noise at moderate signal-to-noise ratios.
Collapse
|
723
|
Shannon RV. The relative importance of amplitude, temporal, and spectral cues for cochlear implant processor design. Am J Audiol 2002; 11:124-7. [PMID: 12691223 DOI: 10.1044/1059-0889(2002/013)] [Citation(s) in RCA: 38] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022] Open
Abstract
Speech understanding with cochlear implants has improved steadily over the last 25 years, and the success of implants has provided a powerful tool for understanding speech recognition in general. Comparing speech recognition in normal-hearing listeners and in cochlear-implant listeners has revealed many important lessons about the types of information necessary for good speech recognition--and some of the lessons are surprising. This paper presents a summary of speech perception research over the last 25 years with cochlear-implant and normal-hearing listeners. As long as the speech is audible, even the relatively severe amplitude distortion has only a mild effect on intelligibility. Temporal cues appear to be useful for speech intelligibility only up to about 20 Hz. Whereas temporal information above 20 Hz may contribute to improved quality, it contributes little to speech understanding. In contrast, the quantity and quality of spectral information appear to be critical for speech understanding. Only four spectral "channels" of information can produce good speech understanding, but more channels are required for difficult listening situations. Speech understanding is sensitive to the placement of spectral information along the cochlea. In prosthetic devices, in which the spectral information can be delivered to any cochlear location, it is critical to present spectral information to the normal acoustic tonotopic location for that information. If there is a shift or distortion of 2 to 3 mm between frequency and cochlear place, speech recognition is decreased dramatically.
Collapse
Affiliation(s)
- Robert V Shannon
- Department of Auditory Implants and Perception, House Ear Institute, Los Angeles, CA 90057, USA.
| |
Collapse
|
724
|
Green T, Faulkner A, Rosen S. Spectral and temporal cues to pitch in noise-excited vocoder simulations of continuous-interleaved-sampling cochlear implants. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2002; 112:2155-2164. [PMID: 12430827 DOI: 10.1121/1.1506688] [Citation(s) in RCA: 47] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
Four-band and single-band noise-excited vocoders were used in acoustic simulations to investigate spectral and temporal cues to melodic pitch in the output of a cochlear implant speech processor. Noise carriers were modulated by amplitude envelopes extracted by half-wave rectification and low-pass filtering at 32 or 400 Hz. The four-band, but not the single-band processors, may preserve spectral correlates of fundamental frequency (F0). Envelope smoothing at 400 Hz preserves temporal correlates of F0, which are eliminated with 32-Hz smoothing. Inputs to the processors were sawtooth frequency glides, in which spectral variation is completely determined by F0, or synthetic diphthongal vowel glides, whose spectral shape is dominated by varying formant resonances. Normal listeners labeled the direction of pitch movement of the processed stimuli. For processed sawtooth waves, purely temporal cues led to decreasing performance with increasing F0. With purely spectral cues, performance was above chance despite the limited spectral resolution of the processors. For processed diphthongs, performance with purely spectral cues was at chance, showing that spectral envelope changes due to formant movement obscured spectral cues to F0. Performance with temporal cues was poorer for diphthongs than for sawtooths, with very limited discrimination at higher F0. These data suggest that, for speech signals through a typical cochlear implant processor, spectral cues to pitch are likely to have limited utility, while temporal envelope cues may be useful only at low F0.
Collapse
Affiliation(s)
- Tim Green
- Department of Phonetics and Linguistics, University College London, United Kingdom.
| | | | | |
Collapse
|
725
|
Neural encoding: making sense of sound. Curr Opin Otolaryngol Head Neck Surg 2002. [DOI: 10.1097/00020840-200210000-00005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
726
|
Xu L, Tsai Y, Pfingst BE. Features of stimulation affecting tonal-speech perception: implications for cochlear prostheses. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2002; 112:247-58. [PMID: 12141350 PMCID: PMC1414789 DOI: 10.1121/1.1487843] [Citation(s) in RCA: 112] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/18/2023]
Abstract
Tone languages differ from English in that the pitch pattern of a single-syllable word conveys lexical meaning. In the present study, dependence of tonal-speech perception on features of the stimulation was examined using an acoustic simulation of a CIS-type speech-processing strategy for cochlear prostheses. Contributions of spectral features of the speech signals were assessed by varying the number of filter bands, while contributions of temporal envelope features were assessed by varying the low-pass cutoff frequency used for extracting the amplitude envelopes. Ten normal-hearing native Mandarin Chinese speakers were tested. When the low-pass cutoff frequency was fixed at 512 Hz, consonant, vowel, and sentence recognition improved as a function of the number of channels and reached plateau at 4 to 6 channels. Subjective judgments of sound quality continued to improve as the number of channels increased to 12, the highest number tested. Tone recognition, i.e., recognition of the four Mandarin tone patterns, depended on both the number of channels and the low-pass cutoff frequency. The trade-off between the temporal and spectral cues for tone recognition indicates that temporal cues can compensate for diminished spectral cues for tone recognition and vice versa. An additional tone recognition experiment using syllables of equal duration showed a marked decrease in performance, indicating that duration cues contribute to tone recognition. A third experiment showed that recognition of processed FM patterns that mimic Mandarin tone patterns was poor when temporal envelope and duration cues were removed.
Collapse
Affiliation(s)
- Li Xu
- Kresge Hearing Research Institute, Department of Otolaryngology, University of Michigan, Ann Arbor 48109, USA.
| | | | | |
Collapse
|
727
|
McKay CM, Henshall KR. Frequency-to-electrode allocation and speech perception with cochlear implants. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2002; 111:1036-1044. [PMID: 11863160 DOI: 10.1121/1.1436073] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Abstract
The hypothesis was investigated that selectively increasing the discrimination of low-frequency information (below 2600 Hz) by altering the frequency-to-electrode allocation would improve speech perception by cochlear implantees. Two experimental conditions were compared, both utilizing ten electrode positions selected based on maximal discrimination. A fixed frequency range (200-10513 Hz) was allocated either relatively evenly across the ten electrodes, or so that nine of the ten positions were allocated to the frequencies up to 2600 Hz. Two additional conditions utilizing all available electrode positions (15-18 electrodes) were assessed: one with each subject's usual frequency-to-electrode allocation; and the other using the same analysis filters as the other experimental conditions. Seven users of the Nucleus CI22 implant wore processors mapped with each experimental condition for 2-week periods away from the laboratory, followed by assessment of perception of words in quiet and sentences in noise. Performance with both ten-electrode maps was significantly poorer than with both full-electrode maps on at least one measure. Performance with the map allocating nine out of ten electrodes to low frequencies was equivalent to that with the full-electrode maps for vowel perception and sentences in noise, but was worse for consonant perception. Performance with the evenly allocated ten-electrode map was equivalent to that with the full-electrode maps for consonant perception, but worse for vowel perception and sentences in noise. Comparison of the two full-electrode maps showed that subjects could fully adapt to frequency shifts up to ratio changes of 1.3, given 2 weeks' experience. Future research is needed to investigate whether speech perception may be improved by the manipulation of frequency-to-electrode allocation in maps which have a full complement of electrodes in Nucleus implants.
Collapse
Affiliation(s)
- Colette M McKay
- The University of Melbourne, Department of Otolaryngology, Parkville, Australia.
| | | |
Collapse
|