1
|
Cychosz M, Winn MB, Goupell MJ. How to vocode: Using channel vocoders for cochlear-implant research. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2024; 155:2407-2437. [PMID: 38568143 PMCID: PMC10994674 DOI: 10.1121/10.0025274] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/19/2023] [Accepted: 02/23/2024] [Indexed: 04/05/2024]
Abstract
The channel vocoder has become a useful tool to understand the impact of specific forms of auditory degradation-particularly the spectral and temporal degradation that reflect cochlear-implant processing. Vocoders have many parameters that allow researchers to answer questions about cochlear-implant processing in ways that overcome some logistical complications of controlling for factors in individual cochlear implant users. However, there is such a large variety in the implementation of vocoders that the term "vocoder" is not specific enough to describe the signal processing used in these experiments. Misunderstanding vocoder parameters can result in experimental confounds or unexpected stimulus distortions. This paper highlights the signal processing parameters that should be specified when describing vocoder construction. The paper also provides guidance on how to determine vocoder parameters within perception experiments, given the experimenter's goals and research questions, to avoid common signal processing mistakes. Throughout, we will assume that experimenters are interested in vocoders with the specific goal of better understanding cochlear implants.
Collapse
Affiliation(s)
- Margaret Cychosz
- Department of Linguistics, University of California, Los Angeles, Los Angeles, California 90095, USA
| | - Matthew B Winn
- Department of Speech-Language-Hearing Sciences, University of Minnesota, Minneapolis, Minnesota 55455, USA
| | - Matthew J Goupell
- Department of Hearing and Speech Sciences, University of Maryland, College Park, College Park, Maryland 20742, USA
| |
Collapse
|
2
|
Creff G, Lambert C, Coudert P, Pean V, Laurent S, Godey B. Comparison of Tonotopic and Default Frequency Fitting for Speech Understanding in Noise in New Cochlear Implantees: A Prospective, Randomized, Double-Blind, Cross-Over Study. Ear Hear 2024; 45:35-52. [PMID: 37823850 DOI: 10.1097/aud.0000000000001423] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/13/2023]
Abstract
OBJECTIVES While cochlear implants (CIs) have provided benefits for speech recognition in quiet for subjects with severe-to-profound hearing loss, speech recognition in noise remains challenging. A body of evidence suggests that reducing frequency-to-place mismatch may positively affect speech perception. Thus, a fitting method based on a tonotopic map may improve speech perception results in quiet and noise. The aim of our study was to assess the impact of a tonotopic map on speech perception in noise and quiet in new CI users. DESIGN A prospective, randomized, double-blind, two-period cross-over study in 26 new CI users was performed over a 6-month period. New CI users older than 18 years with bilateral severe-to-profound sensorineural hearing loss or complete hearing loss for less than 5 years were selected in the University Hospital Centre of Rennes in France. An anatomical tonotopic map was created using postoperative flat-panel computed tomography and a reconstruction software based on the Greenwood function. Each participant was randomized to receive a conventional map followed by a tonotopic map or vice versa. Each setting was maintained for 6 weeks, at the end of which participants performed speech perception tasks. The primary outcome measure was speech recognition in noise. Participants were allocated to sequences by block randomization of size two with a ratio 1:1 (CONSORT Guidelines). Participants and those assessing the outcomes were blinded to the intervention. RESULTS Thirteen participants were randomized to each sequence. Two of the 26 participants recruited (one in each sequence) had to be excluded due to the COVID-19 pandemic. Twenty-four participants were analyzed. Speech recognition in noise was significantly better with the tonotopic fitting at all signal-to-noise ratio (SNR) levels tested [SNR = +9 dB, p = 0.002, mean effect (ME) = 12.1%, 95% confidence interval (95% CI) = 4.9 to 19.2, standardized effect size (SES) = 0.71; SNR = +6 dB, p < 0.001, ME = 16.3%, 95% CI = 9.8 to 22.7, SES = 1.07; SNR = +3 dB, p < 0.001 ME = 13.8%, 95% CI = 6.9 to 20.6, SES = 0.84; SNR = 0 dB, p = 0.003, ME = 10.8%, 95% CI = 4.1 to 17.6, SES = 0.68]. Neither period nor interaction effects were observed for any signal level. Speech recognition in quiet ( p = 0.66) and tonal audiometry ( p = 0.203) did not significantly differ between the two settings. 92% of the participants kept the tonotopy-based map after the study period. No correlation was found between speech-in-noise perception and age, duration of hearing deprivation, angular insertion depth, or position or width of the frequency filters allocated to the electrodes. CONCLUSION For new CI users, tonotopic fitting appears to be more efficient than the default frequency fitting because it allows for better speech recognition in noise without compromising understanding in quiet.
Collapse
Affiliation(s)
- Gwenaelle Creff
- Department of Otolaryngology-Head and Neck Surgery (HNS), University Hospital, Rennes, France
- MediCIS, LTSI (Image and Signal Processing Laboratory), INSERM, U1099, Rennes, France
| | - Cassandre Lambert
- Department of Otolaryngology-Head and Neck Surgery (HNS), University Hospital, Rennes, France
| | - Paul Coudert
- Department of Otolaryngology-Head and Neck Surgery (HNS), University Hospital, Rennes, France
| | | | | | - Benoit Godey
- Department of Otolaryngology-Head and Neck Surgery (HNS), University Hospital, Rennes, France
- MediCIS, LTSI (Image and Signal Processing Laboratory), INSERM, U1099, Rennes, France
- Hearing Aid Academy, Javene, France
| |
Collapse
|
3
|
Dillon MT, Helpard L, Brown KD, Selleck AM, Richter ME, Rooth MA, Thompson NJ, Dedmon MM, Ladak HM, Agrawal S. Influence of the Frequency-to-Place Function on Recognition with Place-Based Cochlear Implant Maps. Laryngoscope 2023; 133:3540-3547. [PMID: 37078508 DOI: 10.1002/lary.30710] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2022] [Revised: 04/07/2023] [Accepted: 04/11/2023] [Indexed: 04/21/2023]
Abstract
OBJECTIVE Comparison of acute speech recognition for cochlear implant (CI) alone and electric-acoustic stimulation (EAS) users listening with default maps or place-based maps using either a spiral ganglion (SG) or a new Synchrotron Radiation-Artificial Intelligence (SR-AI) frequency-to-place function. METHODS Thirteen adult CI-alone or EAS users completed a task of speech recognition at initial device activation with maps that differed in the electric filter frequency assignments. The three map conditions were: (1) maps with the default filter settings (default map), (2) place-based maps with filters aligned to cochlear SG tonotopicity using the SG function (SG place-based map), and (3) place-based maps with filters aligned to cochlear Organ of Corti (OC) tonotopicity using the SR-AI function (SR-AI place-based map). Speech recognition was evaluated using a vowel recognition task. Performance was scored as the percent correct for formant 1 recognition due to the rationale that the maps would deviate the most in the estimated cochlear place frequency for low frequencies. RESULTS On average, participants had better performance with the OC SR-AI place-based map as compared to the SG place-based map and the default map. A larger performance benefit was observed for EAS users than for CI-alone users. CONCLUSION These pilot data suggest that EAS and CI-alone users may experience better performance with a patient-centered mapping approach that accounts for the variability in cochlear morphology (OC SR-AI frequency-to-place function) in the individualization of the electric filter frequencies (place-based mapping procedure). LEVEL OF EVIDENCE 3 Laryngoscope, 133:3540-3547, 2023.
Collapse
Affiliation(s)
- Margaret T Dillon
- Department of Otolaryngology/Head & Neck Surgery, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| | - Luke Helpard
- School of Biomedical Engineering, Western University, London, Ontario, Canada
| | - Kevin D Brown
- Department of Otolaryngology/Head & Neck Surgery, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| | - A Morgan Selleck
- Department of Otolaryngology/Head & Neck Surgery, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| | - Margaret E Richter
- Department of Otolaryngology/Head & Neck Surgery, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| | - Meredith A Rooth
- Department of Otolaryngology/Head & Neck Surgery, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| | - Nicholas J Thompson
- Department of Otolaryngology/Head & Neck Surgery, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| | - Matthew M Dedmon
- Department of Otolaryngology/Head & Neck Surgery, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| | - Hanif M Ladak
- School of Biomedical Engineering, Western University, London, Ontario, Canada
- Department of Electrical and Computer Engineering, Western University, London, Ontario, Canada
- Department of Medical Biophysics, Western University, London, Ontario, Canada
- Department of Otolaryngology - Head & Neck Surgery, Western University, London, Ontario, Canada
| | - Sumit Agrawal
- Department of Otolaryngology - Head & Neck Surgery, Western University, London, Ontario, Canada
| |
Collapse
|
4
|
Dillon MT, Buss E, Johnson AD, Canfarotta MW, O’Connell BP. Comparison of Two Place-Based Mapping Procedures on Masked Sentence Recognition as a Function of Electrode Array Angular Insertion Depth and Presence of Acoustic Low-Frequency Information: A Simulation Study. Audiol Neurootol 2023; 28:478-487. [PMID: 37482054 PMCID: PMC10948008 DOI: 10.1159/000531262] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2022] [Accepted: 05/23/2023] [Indexed: 07/25/2023] Open
Abstract
INTRODUCTION Cochlear implant (CI) and electric-acoustic stimulation (EAS) users may experience better performance with maps that align the electric filter frequencies to the cochlear place frequencies, known as place-based maps, than with maps that present spectrally shifted information. Individual place-based mapping procedures differ in the frequency content that is aligned to cochlear tonotopicity versus discarded or spectrally shifted. The performance benefit with different place-based maps may vary due to individual differences in angular insertion depth (AID) of the electrode array and whether functional acoustic low-frequency information is available in the implanted ear. The present study compared masked speech recognition with two types of place-based maps as a function of AID and presence of acoustic low-frequency information. METHODS Sixty adults with normal hearing listened acutely to CI or EAS simulations of two types of place-based maps for one of three cases of electrode arrays at shallow AIDs. The strict place-based (Strict-PB) map aligned the low- and mid-frequency information to cochlear tonotopicity and discarded information below the frequency associated with the most apical electrode contact. The alternative place-based map (LFshift-PB) aligned the mid-frequency information to cochlear tonotopicity and provided more of the speech spectrum by compressing low-frequency information on the apical electrode contacts (i.e., <1 kHz). Three actual cases of a 12-channel, 24-mm electrode array were simulated by assigning the carrier frequency for an individual channel as the cochlear place frequency of the associated electrode contact. The AID and cochlear place frequency for the most apical electrode contact were 460° and 498 Hz for case 1, 389° and 728 Hz for case 2, and 335° and 987 Hz for case 3, respectively. RESULTS Generally, better performance was observed with the Strict-PB maps for cases 1 and 2, where mismatches were 2-4 octaves for the most apical channel with the LFshift-PB map. Similar performance was observed between maps for case 3. For the CI simulations, performance with the Strict-PB map declined with decreases in AID, while performance with the LFshift-PB map remained stable across cases. For the EAS simulations, performance with the Strict-PB map remained stable across cases, while performance with the LFshift-PB map improved with decreases in AID. CONCLUSIONS Listeners demonstrated differences with the Strict-PB versus LFshift-PB maps as a function of AID and whether acoustic low-frequency information was available (CI vs. EAS). These data support the use of the Strict-PB mapping procedure for AIDs ≥335°, though further study including time for acclimatization in CI and EAS users is warranted.
Collapse
Affiliation(s)
- Margaret T. Dillon
- Department of Otolaryngology/Head & Neck Surgery, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
- Division of Speech and Hearing Sciences, Department of Allied Health Sciences, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Emily Buss
- Department of Otolaryngology/Head & Neck Surgery, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Alec D. Johnson
- Department of Otolaryngology/Head & Neck Surgery, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Michael W. Canfarotta
- Department of Otolaryngology/Head & Neck Surgery, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Brendan P. O’Connell
- Department of Otolaryngology/Head & Neck Surgery, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
- Charlotte Eye Ear Nose & Throat Associates, P.A., Charlotte, NC, USA
| |
Collapse
|
5
|
Dillon MT, Canfarotta MW, Buss E, Rooth MA, Richter ME, Overton AB, Roth NE, Dillon SM, Raymond JH, Young A, Pearson AC, Davis AG, Dedmon MM, Brown KD, O'Connell BP. Influence of Electric Frequency-to-Place Mismatches on the Early Speech Recognition Outcomes for Electric-Acoustic Stimulation Users. Am J Audiol 2023; 32:251-260. [PMID: 36800505 PMCID: PMC10166189 DOI: 10.1044/2022_aja-21-00254] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2021] [Revised: 05/21/2022] [Accepted: 11/28/2022] [Indexed: 02/19/2023] Open
Abstract
PURPOSE Cochlear implant (CI) recipients with hearing preservation experience significant improvements in speech recognition with electric-acoustic stimulation (EAS) as compared to with a CI alone, although outcomes across EAS users vary. The individual differences in performance may be due in part to default mapping procedures, which result in electric frequency-to-place mismatches for the majority of EAS users. This study assessed the influence of electric mismatches on the early speech recognition for EAS users. METHOD Twenty-one participants were randomized at EAS activation to listen exclusively with a default or place-based map. For both groups, the unaided thresholds determined the acoustic cutoff frequency (i.e., > 65 dB HL). For default maps, the electric filter frequencies were assigned to avoid spectral gaps in frequency information but created varying magnitudes of mismatches. For place-based maps, the electric filter frequencies were assigned to avoid frequency-to-place mismatches. Recognition of consonant-nucleus-consonant words and vowels was assessed at activation and 1, 3, and 6 months postactivation. RESULTS For participants with default maps, electric mismatch at 1500 Hz ranged from 2 to -12.0 semitones (Mdn = -5 semitones). Poorer performance was observed for those with larger magnitudes of electric mismatch. This effect was observed through 6 months of EAS listening experience. CONCLUSIONS The present sample of EAS users experienced better initial performance when electric mismatches were small or eliminated. These data suggest the utility of methods that reduce electric mismatches, such as place-based mapping procedures. Investigation is ongoing to determine whether these differences persist with long-term EAS use. SUPPLEMENTAL MATERIAL https://doi.org/10.23641/asha.22096523.
Collapse
Affiliation(s)
- Margaret T. Dillon
- Department of Otolaryngology/Head & Neck Surgery, School of Medicine, The University of North Carolina at Chapel Hill
- Division of Speech and Hearing Sciences, Department of Allied Health Sciences, The University of North Carolina at Chapel Hill
| | - Michael W. Canfarotta
- Department of Otolaryngology/Head & Neck Surgery, School of Medicine, The University of North Carolina at Chapel Hill
| | - Emily Buss
- Department of Otolaryngology/Head & Neck Surgery, School of Medicine, The University of North Carolina at Chapel Hill
| | - Meredith A. Rooth
- Department of Otolaryngology/Head & Neck Surgery, School of Medicine, The University of North Carolina at Chapel Hill
| | - Margaret E. Richter
- Department of Otolaryngology/Head & Neck Surgery, School of Medicine, The University of North Carolina at Chapel Hill
- Division of Speech and Hearing Sciences, Department of Allied Health Sciences, The University of North Carolina at Chapel Hill
| | | | | | | | | | - Allison Young
- Department of Audiology, UNC Health, Chapel Hill, NC
| | | | - Amanda G. Davis
- Department of Otolaryngology/Head & Neck Surgery, School of Medicine, The University of North Carolina at Chapel Hill
- Division of Speech and Hearing Sciences, Department of Allied Health Sciences, The University of North Carolina at Chapel Hill
| | - Matthew M. Dedmon
- Department of Otolaryngology/Head & Neck Surgery, School of Medicine, The University of North Carolina at Chapel Hill
| | - Kevin D. Brown
- Department of Otolaryngology/Head & Neck Surgery, School of Medicine, The University of North Carolina at Chapel Hill
| | - Brendan P. O'Connell
- Department of Otolaryngology/Head & Neck Surgery, School of Medicine, The University of North Carolina at Chapel Hill
| |
Collapse
|
6
|
Association between lateral wall electrode array insertion parameters and audiological outcomes in bilateral cochlear implantation. Eur Arch Otorhinolaryngol 2022; 280:2707-2714. [PMID: 36436080 PMCID: PMC10175364 DOI: 10.1007/s00405-022-07756-2] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2022] [Accepted: 11/15/2022] [Indexed: 11/28/2022]
Abstract
Abstract
Purpose
The aims of this study were to compare speech recognition at different postoperative times for both ears in bilaterally implanted patients and to assess the influence of the time of deafness, frequency-to-place mismatch, angular insertion depth (AID) and angular separation between neighbouring electrode contacts on audiometric outcomes.
Methods
This study was performed at an academic tertiary referral centre. A total of 19 adult patients (6 men, 13 women), who received sequential bilateral implantation with lateral wall electrode arrays, were analysed in retrospective. Statistical analysis was performed using two-sided t test, Wilcoxon test, median test, and Spearman’s correlation.
Results
Postlingually deafened patients (deafness after the age of 10) had a significantly better speech perception (WRS65[CI]) than the perilingually deafened subjects (deafness at the age of 1–10 years) (p < 0.001). Comparison of cochlear duct length between peri- and postlingually deafened subjects showed a slightly significantly smaller cochleae in perilingual patients (p = 0.045). No association between frequency-to-place mismatch as well as angular separation and speech perception could be detected. There was even no significant difference between the both ears in the intraindividual comparison, even if insertion parameters differed.
Conclusion
The exact electrode position seems to have less influence on the speech comprehension of CI patients than already established parameters as preoperative speech recognition or duration of deafness.
Collapse
|
7
|
American Cochlear Implant Alliance Task Force Guidelines for Clinical Assessment and Management of Adult Cochlear Implantation for Single-Sided Deafness. Ear Hear 2022; 43:1605-1619. [PMID: 35994570 PMCID: PMC9592177 DOI: 10.1097/aud.0000000000001260] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
The indications for cochlear implantation have expanded to include individuals with profound sensorineural hearing loss in the impaired ear and normal hearing (NH) in the contralateral ear, known as single-sided deafness (SSD). There are additional considerations for the clinical assessment and management of adult cochlear implant candidates and recipients with SSD as compared to conventional cochlear implant candidates with bilateral moderate to profound sensorineural hearing loss. The present report reviews the current evidence relevant to the assessment and management of adults with SSD. A systematic review was also conducted on published studies that investigated outcomes of cochlear implant use on measures of speech recognition in quiet and noise, sound source localization, tinnitus perception, and quality of life for this patient population. Expert consensus and systematic review of the current literature were combined to provide guidance for the clinical assessment and management of adults with SSD.
Collapse
|
8
|
Dillon MT, O'Connell BP, Canfarotta MW, Buss E, Hopfinger J. Effect of Place-Based Versus Default Mapping Procedures on Masked Speech Recognition: Simulations of Cochlear Implant Alone and Electric-Acoustic Stimulation. Am J Audiol 2022; 31:322-337. [PMID: 35394798 DOI: 10.1044/2022_aja-21-00123] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022] Open
Abstract
PURPOSE Cochlear implant (CI) recipients demonstrate variable speech recognition when listening with a CI-alone or electric-acoustic stimulation (EAS) device, which may be due in part to electric frequency-to-place mismatches created by the default mapping procedures. Performance may be improved if the filter frequencies are aligned with the cochlear place frequencies, known as place-based mapping. Performance with default maps versus an experimental place-based map was compared for participants with normal hearing when listening to CI-alone or EAS simulations to observe potential outcomes prior to initiating an investigation with CI recipients. METHOD A noise vocoder simulated CI-alone and EAS devices, mapped with default or place-based procedures. The simulations were based on an actual 24-mm electrode array recipient, whose insertion angles for each electrode contact were used to estimate the respective cochlear place frequency. The default maps used the filter frequencies assigned by the clinical software. The filter frequencies for the place-based maps aligned with the cochlear place frequencies for individual contacts in the low- to mid-frequency cochlear region. For the EAS simulations, low-frequency acoustic information was filtered to simulate aided low-frequency audibility. Performance was evaluated for the AzBio sentences presented in a 10-talker masker at +5 dB signal-to-noise ratio (SNR), +10 dB SNR, and asymptote. RESULTS Performance was better with the place-based maps as compared with the default maps for both CI-alone and EAS simulations. For instance, median performance at +10 dB SNR for the CI-alone simulation was 57% correct for the place-based map and 20% for the default map. For the EAS simulation, those values were 59% and 37% correct. Adding acoustic low-frequency information resulted in a similar benefit for both maps. CONCLUSIONS Reducing frequency-to-place mismatches, such as with the experimental place-based mapping procedure, produces a greater benefit in speech recognition than maximizing bandwidth for CI-alone and EAS simulations. Ongoing work is evaluating the initial and long-term performance benefits in CI-alone and EAS users. SUPPLEMENTAL MATERIAL https://doi.org/10.23641/asha.19529053.
Collapse
Affiliation(s)
- Margaret T. Dillon
- Department of Otolaryngology/Head & Neck Surgery, University of North Carolina at Chapel Hill
- Division of Speech and Hearing Sciences, Department of Allied Health Sciences, University of North Carolina at Chapel Hill
| | - Brendan P. O'Connell
- Department of Otolaryngology/Head & Neck Surgery, University of North Carolina at Chapel Hill
| | - Michael W. Canfarotta
- Department of Otolaryngology/Head & Neck Surgery, University of North Carolina at Chapel Hill
| | - Emily Buss
- Department of Otolaryngology/Head & Neck Surgery, University of North Carolina at Chapel Hill
| | - Joseph Hopfinger
- Department of Psychology and Neuroscience, University of North Carolina at Chapel Hill
| |
Collapse
|
9
|
Avivi-Reich M, Sran RK, Schneider BA. Do Age and Linguistic Status Alter the Effect of Sound Source Diffuseness on Speech Recognition in Noise? Front Psychol 2022; 13:838576. [PMID: 35369266 PMCID: PMC8965325 DOI: 10.3389/fpsyg.2022.838576] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2021] [Accepted: 02/14/2022] [Indexed: 11/13/2022] Open
Abstract
One aspect of auditory scenes that has received very little attention is the level of diffuseness of sound sources. This aspect has increasing importance due to growing use of amplification systems. When an auditory stimulus is amplified and presented over multiple, spatially-separated loudspeakers, the signal's timbre is altered due to comb filtering. In a previous study we examined how increasing the diffuseness of the sound sources might affect listeners' ability to recognize speech presented in different types of background noise. Listeners performed similarly when both the target and the masker were presented via a similar number of loudspeakers. However, performance improved when the target was presented using a single speaker (compact) and the masker from three spatially separate speakers (diffuse) but worsened when the target was diffuse, and the masker was compact. In the current study, we extended our research to examine whether the effects of timbre changes with age and linguistic experience. Twenty-four older adults whose first language was English (Old-EFLs) and 24 younger adults whose second language was English (Young-ESLs) were asked to repeat non-sense sentences masked by either Noise, Babble, or Speech and their results were compared with those of the Young-EFLs previously tested. Participants were divided into two experimental groups: (1) A Compact-Target group where the target sentences were presented over a single loudspeaker, while the masker was either presented over three loudspeakers or over a single loudspeaker; (2) A Diffuse-Target group, where the target sentences were diffuse while the masker was either compact or diffuse. The results indicate that the Target Timbre has a negligible effect on thresholds when the timbre of the target matches the timbre of the masker in all three groups. When there is a timbre contrast between target and masker, thresholds are significantly lower when the target is compact than when it is diffuse for all three listening groups in a Noise background. However, while this difference is maintained for the Young and Old-EFLs when the masker is Babble or Speech, speech reception thresholds in the Young-ESL group tend to be equivalent for all four combinations of target and masker timbre.
Collapse
Affiliation(s)
- Meital Avivi-Reich
- Department of Communication Arts, Sciences and Disorders, Brooklyn College, City University of New York, Brooklyn, NY, United States
| | - Rupinder Kaur Sran
- Human Communication Lab, Department of Psychology, University of Toronto Mississauga, Toronto, ON, Canada
- Department of Speech-Language Pathology, University of Toronto, Toronto, ON, Canada
| | - Bruce A. Schneider
- Human Communication Lab, Department of Psychology, University of Toronto Mississauga, Toronto, ON, Canada
| |
Collapse
|
10
|
Canfarotta MW, Dillon MT, Brown KD, Pillsbury HC, Dedmon MM, O'Connell BP. Insertion Depth and Cochlear Implant Speech Recognition Outcomes: A Comparative Study of 28- and 31.5-mm Lateral Wall Arrays. Otol Neurotol 2022; 43:183-189. [PMID: 34772886 PMCID: PMC8752482 DOI: 10.1097/mao.0000000000003416] [Citation(s) in RCA: 21] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023]
Abstract
OBJECTIVES 1) To compare speech recognition outcomes between cochlear implant (CI) recipients of 28- and 31.5-mm lateral wall electrode arrays, and 2) to characterize the relationship between angular insertion depth (AID) and speech recognition. STUDY DESIGN Retrospective review. SETTING Tertiary academic referral center. PATIENTS Seventy-five adult CI recipients of fully inserted 28-mm (n = 28) or 31.5-mm (n = 47) lateral wall arrays listening with a CI-alone device. INTERVENTIONS Cochlear implantation with postoperative computed tomography. MAIN OUTCOME MEASURES Consonant-nucleus-consonant (CNC) word recognition assessed with the CI-alone at 12 months postactivation. RESULTS The mean AID of the most apical electrode contact for the 31.5-mm array recipients was significantly deeper than the 28-mm array recipients (628° vs 571°, p < 0.001). Following 12 months of listening experience, mean CNC word scores were significantly better for recipients of 31.5-mm arrays compared with those implanted with 28-mm arrays (59.5% vs 48.3%, p = 0.004; Cohen's d = 0.70; 95% CI [0.22, 1.18]). There was a significant positive correlation between AID and CNC word scores (r = 0.372, p = 0.001), with a plateau in performance observed around 600°. CONCLUSIONS Cochlear implant recipients implanted with a 31.5-mm array experienced better speech recognition than those with a 28-mm array at 12 months postactivation. Deeper insertion of a lateral wall array appears to confer speech recognition benefit up to ∼600°, with a plateau in performance observed thereafter. These data provide preliminary evidence of the insertion depth necessary to optimize speech recognition outcomes for lateral wall electrode arrays among CI-alone users.
Collapse
Affiliation(s)
- Michael W Canfarotta
- Department of Otolaryngology/Head and Neck Surgery, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina
| | | | | | | | | | | |
Collapse
|
11
|
Relationship Between Electrocochleography, Angular Insertion Depth, and Cochlear Implant Speech Perception Outcomes. Ear Hear 2021; 42:941-948. [PMID: 33369942 PMCID: PMC8217403 DOI: 10.1097/aud.0000000000000985] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
OBJECTIVES Electrocochleography (ECochG), obtained before the insertion of a cochlear implant (CI) array, provides a measure of residual cochlear function that accounts for a substantial portion of variability in postoperative speech perception outcomes in adults. It is postulated that subsequent surgical factors represent independent sources of variance in outcomes. Prior work has demonstrated a positive correlation between angular insertion depth (AID) of straight arrays and speech perception under the CI-alone condition, with an inverse relationship observed for precurved arrays. The purpose of the present study was to determine the combined effects of ECochG, AID, and array design on speech perception outcomes. DESIGN Participants were 50 postlingually deafened adult CI recipients who received one of three straight arrays (MED-EL Flex24, MED-EL Flex28, and MED-EL Standard) and two precurved arrays (Cochlear Contour Advance and Advanced Bionics HiFocus Mid-Scala). Residual cochlear function was determined by the intraoperative ECochG total response (TR) measured before array insertion, which is the sum of magnitudes of spectral components in response to tones of different stimulus frequencies across the speech spectrum. The AID was then determined with postoperative imaging. Multiple linear regression was used to predict consonant-nucleus-consonant (CNC) word recognition in the CI-alone condition at 6 months postactivation based on AID, TR, and array design. RESULTS Forty-one participants received a straight array and nine received a precurved array. The AID of the most apical electrode contact ranged from 341° to 696°. The TR measured by ECochG accounted for 43% of variance in speech perception outcomes (p < 0.001). A regression model predicting CNC word scores with the TR tended to underestimate the performance for precurved arrays and deeply inserted straight arrays, and to overestimate the performance for straight arrays with shallower insertions. When combined in a multivariate linear regression, the TR, AID, and array design accounted for 72% of variability in speech perception outcomes (p < 0.001). CONCLUSIONS A model of speech perception outcomes that incorporates TR, AID, and array design represents an improvement over a model based on TR alone. The success of this model shows that peripheral factors including cochlear health and electrode placement may play a predominant role in speech perception with CIs.
Collapse
|
12
|
The smaller the frequency-to-place mismatch the better the hearing outcomes in cochlear implant recipients? Eur Arch Otorhinolaryngol 2021; 279:1875-1883. [PMID: 34131770 DOI: 10.1007/s00405-021-06899-y] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2020] [Accepted: 05/20/2021] [Indexed: 10/21/2022]
Abstract
OBJECTIVE To investigate the effect of frequency-to-place mismatch, i.e. the mismatch between the tonotopic frequency map in the cochlea and the frequency band that is assigned to an electrode contact of a cochlear implant (CI) at the same cochlear location on speech perception outcomes, using postoperative CT images. STUDY DESIGN Retrospective observational single-centre study. METHODS Retrospective pre- and postoperative clinical CT data of 39 CI recipients with normal cochlear anatomy were analysed in an otological surgical planning software. The tonotopic frequency at each electrode position was estimated using the Greenwood function. For each patient, frequency-to-place mismatch between the tonotopic frequency and the fitted centre frequency for each electrode contact was calculated. The influence of frequency-to-place mismatch on speech perception in noise at 6 and 12 months after CI activation was studied. RESULTS A significant linear correlation was found between the frequency-to-place mismatch and speech perception in noise 6 months after cochlear implantation (p < 0.05). The smaller the frequency-to-place mismatch, the better the initial speech perception in noise results of the CI recipients. The significant effect disappeared after 12 months CI experience. CONCLUSION The study findings support the idea of minimizing the frequency-to-place mismatch in CI recipients in order to pursue better initial speech perception in noise. Further research is needed to investigate the prospect of tonotopic fitting strategies based upon postoperative CT images of the exact locations of the electrode contacts.
Collapse
|
13
|
Yoon YS, Mills I, Toliver B, Park C, Whitaker G, Drew C. Comparisons in Frequency Difference Limens Between Sequential and Simultaneous Listening Conditions in Normal-Hearing Listeners. Am J Audiol 2021; 30:266-274. [PMID: 33769845 DOI: 10.1044/2021_aja-20-00134] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022] Open
Abstract
Purpose We compared frequency difference limens (FDLs) in normal-hearing listeners under two listening conditions: sequential and simultaneous. Method Eighteen adult listeners participated in three experiments. FDL was measured using a method of limits for comparison frequency. In the sequential listening condition, the tones were presented with a half-second time interval in between, but for the simultaneous listening condition, the tones were presented simultaneously. For the first experiment, one of four reference tones (125, 250, 500, or 750 Hz), which was presented to the left ear, was paired with one of four starting comparison tones (250, 500, 750, or 1000 Hz), which was presented to the right ear. The second and third experiments had the same testing conditions as the first experiment except with two- and three-tone complexes, comparison tones. The subjects were asked if the tones sounded the same or different. When a subject chose "different," the comparison frequency decreased by 10% of the frequency difference between the reference and comparison tones. FDLs were determined when the subjects chose "same" 3 times in a row. Results FDLs were significantly broader (worse) with simultaneous listening than with sequential listening for the two- and three-tone complex conditions but not for the single-tone condition. The FDLs were narrowest (best) with the three-tone complex under both listening conditions. FDLs broadened as the testing frequencies increased for the single tone and the two-tone complex. The FDLs were not broadened at frequencies > 250 Hz for the three-tone complex. Conclusion The results suggest that sequential and simultaneous frequency discriminations are mediated by different processes at different stages in the auditory pathway for complex tones, but not for pure tones.
Collapse
Affiliation(s)
- Yang-Soo Yoon
- Department of Communication Sciences and Disorders, Baylor University, Waco, TX
| | - Ivy Mills
- Department of Communication Sciences and Disorders, Baylor University, Waco, TX
| | - BaileyAnn Toliver
- Department of Communication Sciences and Disorders, Baylor University, Waco, TX
| | - Christine Park
- Department of Communication Sciences and Disorders, Baylor University, Waco, TX
| | - George Whitaker
- Division of Otolaryngology, Baylor Scott & White Medical Center, Temple, TX
| | - Carrie Drew
- Department of Communication Sciences and Disorders, Baylor University, Waco, TX
| |
Collapse
|
14
|
Dillon MT, Canfarotta MW, Buss E, O'Connell BP. Comparison of Speech Recognition With an Organ of Corti Versus Spiral Ganglion Frequency-to-Place Function in Place-Based Mapping of Cochlear Implant and Electric-Acoustic Stimulation Devices. Otol Neurotol 2021; 42:721-725. [PMID: 33625196 PMCID: PMC8935664 DOI: 10.1097/mao.0000000000003070] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
OBJECTIVE To compare acute speech recognition with a cochlear implant (CI) alone or electric-acoustic stimulation (EAS) device for place-based maps calculated with an organ of Corti (OC) versus a spiral ganglion (SG) frequency-to-place function. PATIENTS Eleven adult CI recipients of a lateral wall electrode array. INTERVENTION Postoperative imaging was used to derive place-based maps calculated with an OC versus SG function. MAIN OUTCOME MEASURE Phoneme recognition was evaluated at initial activation with consonant-nucleus-consonant (CNC) words presented using an OC versus a SG place-based map. RESULTS For the 9 CI-alone users, there was a nonsignificant trend for better acute phoneme recognition with the SG map (mean 18 RAUs) than the OC map (mean 9 RAUs; p = 0.071, 95% CI [≤-1.2]). When including the 2 EAS users in the analysis, performance was significantly better with the SG map (mean 21 RAUs) than the OC map (mean 7 RAUs; p = 0.019, 95% CI [≤-6.2]). CONCLUSIONS Better phoneme recognition with the SG frequency-to-place function could indicate more natural tonotopic alignment of information compared with the OC place-based map.A prospective, randomized investigation is currently underway to assess longitudinal outcomes with place-based mapping in CI-alone and EAS devices using the SG frequency-to-place function.
Collapse
Affiliation(s)
- Margaret T Dillon
- Department of Otolaryngology/Head and Neck Surgery, School of Medicine
- Division of Speech & Hearing, Department of Allied Health Sciences, University of North Carolina at Chapel Hill, North Carolina
| | | | - Emily Buss
- Department of Otolaryngology/Head and Neck Surgery, School of Medicine
| | | |
Collapse
|
15
|
Individual Variability in Recalibrating to Spectrally Shifted Speech: Implications for Cochlear Implants. Ear Hear 2021; 42:1412-1427. [PMID: 33795617 DOI: 10.1097/aud.0000000000001043] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
OBJECTIVES Cochlear implant (CI) recipients are at a severe disadvantage compared with normal-hearing listeners in distinguishing consonants that differ by place of articulation because the key relevant spectral differences are degraded by the implant. One component of that degradation is the upward shifting of spectral energy that occurs with a shallow insertion depth of a CI. The present study aimed to systematically measure the effects of spectral shifting on word recognition and phoneme categorization by specifically controlling the amount of shifting and using stimuli whose identification specifically depends on perceiving frequency cues. We hypothesized that listeners would be biased toward perceiving phonemes that contain higher-frequency components because of the upward frequency shift and that intelligibility would decrease as spectral shifting increased. DESIGN Normal-hearing listeners (n = 15) heard sine wave-vocoded speech with simulated upward frequency shifts of 0, 2, 4, and 6 mm of cochlear space to simulate shallow CI insertion depth. Stimuli included monosyllabic words and /b/-/d/ and /∫/-/s/ continua that varied systematically by formant frequency transitions or frication noise spectral peaks, respectively. Recalibration to spectral shifting was operationally defined as shifting perceptual acoustic-phonetic mapping commensurate with the spectral shift. In other words, adjusting frequency expectations for both phonemes upward so that there is still a perceptual distinction, rather than hearing all upward-shifted phonemes as the higher-frequency member of the pair. RESULTS For moderate amounts of spectral shifting, group data suggested a general "halfway" recalibration to spectral shifting, but individual data suggested a notably different conclusion: half of the listeners were able to recalibrate fully, while the other halves of the listeners were utterly unable to categorize shifted speech with any reliability. There were no participants who demonstrated a pattern intermediate to these two extremes. Intelligibility of words decreased with greater amounts of spectral shifting, also showing loose clusters of better- and poorer-performing listeners. Phonetic analysis of word errors revealed certain cues were more susceptible to being compromised due to a frequency shift (place and manner of articulation), while voicing was robust to spectral shifting. CONCLUSIONS Shifting the frequency spectrum of speech has systematic effects that are in line with known properties of speech acoustics, but the ensuing difficulties cannot be predicted based on tonotopic mismatch alone. Difficulties are subject to substantial individual differences in the capacity to adjust acoustic-phonetic mapping. These results help to explain why speech recognition in CI listeners cannot be fully predicted by peripheral factors like electrode placement and spectral resolution; even among listeners with functionally equivalent auditory input, there is an additional factor of simply being able or unable to flexibly adjust acoustic-phonetic mapping. This individual variability could motivate precise treatment approaches guided by an individual's relative reliance on wideband frequency representation (even if it is mismatched) or limited frequency coverage whose tonotopy is preserved.
Collapse
|
16
|
Canfarotta MW, Dillon MT, Brown KD, Pillsbury HC, Dedmon MM, O'Connell BP. Incidence of Complete Insertion in Cochlear Implant Recipients of Long Lateral Wall Arrays. Otolaryngol Head Neck Surg 2021; 165:571-577. [PMID: 33588627 DOI: 10.1177/0194599820987456] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
OBJECTIVE High rates of partial insertion have been reported for cochlear implant (CI) recipients of long lateral wall electrode arrays, presumably caused by resistance encountered during insertion due to cochlear morphology. With recent advances in long-electrode array design, we sought to investigate (1) the incidence of complete insertions among patients implanted with 31.5-mm flexible arrays and (2) whether complete insertion is limited by cochlear duct length (CDL). STUDY DESIGN Retrospective review. SETTING Tertiary referral center. METHODS Fifty-one adult CI recipients implanted with 31.5-mm flexible lateral wall arrays underwent postoperative computed tomography to determine the rate of complete insertion, defined as all contacts being intracochlear. CDL and angular insertion depth (AID) were compared between complete and partial insertion cohorts. RESULTS Most cases had a complete insertion (96.1%, n = 49). Among the complete insertion cohort, the median CDL was 33.6 mm (range, 30.3-37.9 mm), and median AID was 641° (range, 533-751°). Two cases of partial insertion had relatively short CDL (31.8 mm and 32.3 mm) and shallow AID (542° and 575°). Relatively shallow AID for the 2 cases of partial insertion fails to support the idea that CDL alone prevents a complete insertion. CONCLUSION Complete insertion of a 31.5-mm flexible array is feasible in most cases and does not appear to be limited by the range of CDL observed in this cohort. Future studies are needed to estimate other variations in cochlear morphology that could predict resistance and failure to achieve complete insertion with long arrays.
Collapse
Affiliation(s)
- Michael W Canfarotta
- Department of Otolaryngology/Head and Neck Surgery, University of North Carolina at Chapel Hill, North Carolina, USA
| | - Margaret T Dillon
- Department of Otolaryngology/Head and Neck Surgery, University of North Carolina at Chapel Hill, North Carolina, USA
| | - Kevin D Brown
- Department of Otolaryngology/Head and Neck Surgery, University of North Carolina at Chapel Hill, North Carolina, USA
| | - Harold C Pillsbury
- Department of Otolaryngology/Head and Neck Surgery, University of North Carolina at Chapel Hill, North Carolina, USA
| | - Matthew M Dedmon
- Department of Otolaryngology/Head and Neck Surgery, University of North Carolina at Chapel Hill, North Carolina, USA
| | - Brendan P O'Connell
- Department of Otolaryngology/Head and Neck Surgery, University of North Carolina at Chapel Hill, North Carolina, USA
| |
Collapse
|
17
|
Dillon MT, Canfarotta MW, Buss E, Hopfinger J, O'Connell BP. Effectiveness of Place-based Mapping in Electric-Acoustic Stimulation Devices. Otol Neurotol 2021; 42:197-202. [PMID: 33885267 PMCID: PMC8787166 DOI: 10.1097/mao.0000000000002965] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
BACKGROUND The default mapping procedure for electric-acoustic stimulation (EAS) devices uses the cochlear implant recipient's unaided detection thresholds in the implanted ear to derive the acoustic settings and assign the lowest frequency filter of electric stimulation. Individual differences for speech recognition with EAS may be due to discrepancies between the electric frequency filters of individual electrode contacts and the cochlear place of stimulation, known as a frequency-to-place mismatch. Frequency-to-place mismatch of greater than 1/2 octave has been demonstrated in up to 60% of EAS users. Aligning the electric frequency filters via a place-based mapping procedure using postoperative imaging may improve speech recognition with EAS. METHODS Masked sentence recognition was evaluated for normal-hearing subjects (n = 17) listening with vocoder simulations of EAS, using a place-based map and a default map. Simulation parameters were based on audiometric and imaging data from a representative 24-mm electrode array recipient and EAS user. The place-based map aligned electric frequency filters with the cochlear place frequency, which introduced a gap between the simulated acoustic and electric output. The default map settings were derived from the clinical programming software and provided the full speech frequency range. RESULTS Masked sentence recognition was significantly better for simulated EAS with the place-based map as compared with the default map. CONCLUSION The simulated EAS place-based map supported better performance than the simulated EAS default map. This indicates that individualizing maps may improve performance in EAS users by helping them achieve better asymptotic performance earlier and mitigate the need for acclimatization.
Collapse
Affiliation(s)
- Margaret T Dillon
- Department of Otolaryngology/Head and Neck Surgery, School of Medicine
- Division of Speech & Hearing, Department of Allied Health Sciences
| | | | - Emily Buss
- Department of Otolaryngology/Head and Neck Surgery, School of Medicine
| | - Joseph Hopfinger
- Department of Psychology and Neuroscience, University of North Carolina at Chapel Hill, North Carolina
| | | |
Collapse
|
18
|
Zanetti D, Conte G, Di Berardino F, Lo Russo F, Cavicchiolo S, Triulzi F. Assessment of Frequency-Place Mismatch by Flat-Panel CT and Correlation With Cochlear Implant Performance. Otol Neurotol 2021; 42:165-173. [PMID: 33885263 DOI: 10.1097/mao.0000000000002967] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
OBJECTIVE To calculate the frequency allocation mismatch in a group of very selected cochlear implant (CI) recipients and to contrast it with the speech perception performances. STUDY DESIGN Cross-sectional observational prospective study. SETTINGS Tertiary Audiological Department, University hospital. PATIENTS Fifteen adults receiving the same CI array by the same surgeon through a posterior tympanotomy, round window approach. MAIN OUTCOME MEASURES 1) High definition flat panel computed tomography (FPCT) control of the intracochlear position of each electrode contact, and computation of the relative frequency allocation mismatch; 2) analysis of speech perception outcomes in relation with the mismatch. RESULTS Despite a consistent and reproducible surgical procedure with the same intracochlear array, significant deviations from the frequency allocation tables (FAT) assigned by default by the manufacturer were observed in this study.Their influences on speech perception performances were negligible in the simple tasks of words or sentences recognition in quiet (and, to a lesser extent also in noise). The greatest effect of a significant mismatch was observed for the vocal-consonant-vocal (VCV) sequences recognition under noise masking, the emotional and the linguistic prosody recognition, and the phonemes discrimination of the Auditory Speech Sound Evaluation (A§E) test. CONCLUSIONS The greatest frequency-to-place occurred at the high frequencies. The effect was rather irrelevant on simple words and sentences recognition, while it negatively impacted on the more complex perceptual tasks.
Collapse
Affiliation(s)
- Diego Zanetti
- Audiology Unit, Department of Clinical Sciences and Community Health, University of Milan and Department of Specialistic Surgical Sciences, Fondazione IRCCS Ca' Granda Ospedale Maggiore Policlinico, Milan, Italy
| | - Giorgio Conte
- Department of Neuroradiology, Fondazione IRCCS Ca' Granda Ospedale Maggiore Policlinico
| | - Federica Di Berardino
- Audiology Unit, Department of Clinical Sciences and Community Health, University of Milan and Department of Specialistic Surgical Sciences, Fondazione IRCCS Ca' Granda Ospedale Maggiore Policlinico, Milan, Italy
| | - Francesco Lo Russo
- Postgraduation School of Radiodiagnostics, Università degli Studi di Milano, Milan, Italy
| | - Sara Cavicchiolo
- Audiology Unit, Department of Clinical Sciences and Community Health, University of Milan and Department of Specialistic Surgical Sciences, Fondazione IRCCS Ca' Granda Ospedale Maggiore Policlinico, Milan, Italy
| | - Fabio Triulzi
- Department of Neuroradiology, Fondazione IRCCS Ca' Granda Ospedale Maggiore Policlinico
| |
Collapse
|
19
|
Karoui C, James C, Barone P, Bakhos D, Marx M, Macherey O. Searching for the Sound of a Cochlear Implant: Evaluation of Different Vocoder Parameters by Cochlear Implant Users With Single-Sided Deafness. Trends Hear 2020; 23:2331216519866029. [PMID: 31533581 PMCID: PMC6753516 DOI: 10.1177/2331216519866029] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
Cochlear implantation in subjects with single-sided deafness (SSD) offers a unique opportunity to directly compare the percepts evoked by a cochlear implant (CI) with those evoked acoustically. Here, nine SSD-CI users performed a forced-choice task evaluating the similarity of speech processed by their CI with speech processed by several vocoders presented to their healthy ear. In each trial, subjects heard two intervals: their CI followed by a certain vocoder in Interval 1 and their CI followed by a different vocoder in Interval 2. The vocoders differed either (i) in carrier type-(sinusoidal [SINE], bandfiltered noise [NOISE], and pulse-spreading harmonic complex) or (ii) in frequency mismatch between the analysis and synthesis frequency ranges-(no mismatch, and two frequency-mismatched conditions of 2 and 4 equivalent rectangular bandwidths [ERBs]). Subjects had to state in which of the two intervals the CI and vocoder sounds were more similar. Despite a large intersubject variability, the PSHC vocoder was judged significantly more similar to the CI than SINE or NOISE vocoders. Furthermore, the No-mismatch and 2-ERB mismatch vocoders were judged significantly more similar to the CI than the 4-ERB mismatch vocoder. The mismatch data were also interpreted by comparing spiral ganglion characteristic frequencies with electrode contact positions determined from postoperative computed tomography scans. Only one subject demonstrated a pattern of preference consistent with adaptation to the CI sound processor frequency-to-electrode allocation table and two subjects showed possible partial adaptation. Those subjects with adaptation patterns presented overall small and consistent frequency mismatches across their electrode arrays.
Collapse
Affiliation(s)
- Chadlia Karoui
- Centre de Recherche Cerveau et Cognition, Toulouse, France.,Cochlear France SAS, Toulouse, France
| | - Chris James
- Cochlear France SAS, Toulouse, France.,Department of Otology-Neurotology and Skull Base Surgery, Purpan University Hospital, Toulouse, France
| | - Pascal Barone
- Centre de Recherche Cerveau et Cognition, Toulouse, France
| | - David Bakhos
- Université François-Rabelais de Tours, CHRU de Tours, France.,Ear Nose and Throat department, CHUR de Tours, Tours, France
| | - Mathieu Marx
- Centre de Recherche Cerveau et Cognition, Toulouse, France.,Department of Otology-Neurotology and Skull Base Surgery, Purpan University Hospital, Toulouse, France
| | - Olivier Macherey
- Aix Marseille University, CNRS, Centrale Marseille, LMA, Marseille, France
| |
Collapse
|
20
|
Hay-McCutcheon MJ, Peterson NR, Pisoni DB, Kirk KI, Yang X, Parton J. Performance variability on perceptual discrimination tasks in profoundly deaf adults with cochlear implants. JOURNAL OF COMMUNICATION DISORDERS 2018; 72:122-135. [PMID: 29395103 PMCID: PMC5860985 DOI: 10.1016/j.jcomdis.2018.01.005] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/28/2017] [Revised: 01/09/2018] [Accepted: 01/22/2018] [Indexed: 06/07/2023]
Abstract
OBJECTIVES The purpose of this study was to evaluate performance on two challenging listening tasks, talker and regional accent discrimination, and to assess variables that could have affected the outcomes. STUDY DESIGN A prospective study using 35 adults with one cochlear implant (CI) or a CI and a contralateral hearing aid (bimodal hearing) was conducted. Adults completed talker and regional accent discrimination tasks. METHODS Two-alternative forced-choice tasks were used to assess talker and accent discrimination in a group of adults who ranged in age from 30 years old to 81 years old. RESULTS A large amount of performance variability was observed across listeners for both discrimination tasks. Three listeners successfully discriminated between talkers for both listening tasks, 14 participants successfully completed one discrimination task and 18 participants were not able to discriminate between talkers for either listening task. Some adults who used bimodal hearing benefitted from the addition of acoustic cues provided through a HA but for others the HA did not help with discrimination abilities. Acoustic speech feature analysis of the test signals indicated that both the talker speaking rate and the fundamental frequency (F0) helped with talker discrimination. For accent discrimination, findings suggested that access to more salient spectral cues was important for better discrimination performance. CONCLUSIONS The ability to perform challenging discrimination tasks successfully likely involves a number of complex interactions between auditory and non-auditory pre- and post-implant factors. To understand why some adults with CIs perform similarly to adults with normal hearing and others experience difficulty discriminating between talkers, further research will be required with larger populations of adults who use unilateral CIs, bilateral CIs and bimodal hearing.
Collapse
Affiliation(s)
- Marcia J Hay-McCutcheon
- The University of Alabama, Department of Communicative Disorders, Speech and Hearing Center, Box 870242, United States.
| | | | - David B Pisoni
- Indiana University, Department of Psychological and Brain Sciences, United States
| | - Karen Iler Kirk
- The University of Illinois at Urbana-Champaign, Department of Communication Sciences and Disorders, United States
| | - Xin Yang
- The University of Alabama, Information Systems, Statistics, Management Sciences, United States
| | - Jason Parton
- The University of Alabama, Information Systems, Statistics, Management Sciences, United States
| |
Collapse
|
21
|
Iyaniwura JE, Elfarnawany M, Ladak HM, Agrawal SK. An automated A-value measurement tool for accurate cochlear duct length estimation. J Otolaryngol Head Neck Surg 2018; 47:5. [PMID: 29357924 PMCID: PMC5778705 DOI: 10.1186/s40463-018-0253-3] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2017] [Accepted: 01/08/2018] [Indexed: 01/14/2023] Open
Abstract
BACKGROUND There has been renewed interest in the cochlear duct length (CDL) for preoperative cochlear implant electrode selection and postoperative generation of patient-specific frequency maps. The CDL can be estimated by measuring the A-value, which is defined as the length between the round window and the furthest point on the basal turn. Unfortunately, there is significant intra- and inter-observer variability when these measurements are made clinically. The objective of this study was to develop an automated A-value measurement algorithm to improve accuracy and eliminate observer variability. METHOD Clinical and micro-CT images of 20 cadaveric cochleae specimens were acquired. The micro-CT of one sample was chosen as the atlas, and A-value fiducials were placed onto that image. Image registration (rigid affine and non-rigid B-spline) was applied between the atlas and the 19 remaining clinical CT images. The registration transform was applied to the A-value fiducials, and the A-value was then automatically calculated for each specimen. High resolution micro-CT images of the same 19 specimens were used to measure the gold standard A-values for comparison against the manual and automated methods. RESULTS The registration algorithm had excellent qualitative overlap between the atlas and target images. The automated method eliminated the observer variability and the systematic underestimation by experts. Manual measurement of the A-value on clinical CT had a mean error of 9.5 ± 4.3% compared to micro-CT, and this improved to an error of 2.7 ± 2.1% using the automated algorithm. Both the automated and manual methods correlated significantly with the gold standard micro-CT A-values (r = 0.70, p < 0.01 and r = 0.69, p < 0.01, respectively). CONCLUSION An automated A-value measurement tool using atlas-based registration methods was successfully developed and validated. The automated method eliminated the observer variability and improved accuracy as compared to manual measurements by experts. This open-source tool has the potential to benefit cochlear implant recipients in the future.
Collapse
Affiliation(s)
- John E Iyaniwura
- Biomedical Engineering Graduate Program, Western University, 1151 Richmond Street, London, ON, N6A 3K7, Canada.
| | - Mai Elfarnawany
- Department of Otolaryngology-Head and Neck Surgery, Western University, London, ON, Canada
| | - Hanif M Ladak
- Biomedical Engineering Graduate Program, Western University, 1151 Richmond Street, London, ON, N6A 3K7, Canada.,Department of Otolaryngology-Head and Neck Surgery, Western University, London, ON, Canada.,Department of Medical Biophysics, Western University, London, ON, Canada.,Department of Electrical and Computer Engineering, Western University, London, ON, Canada
| | - Sumit K Agrawal
- Biomedical Engineering Graduate Program, Western University, 1151 Richmond Street, London, ON, N6A 3K7, Canada.,Department of Otolaryngology-Head and Neck Surgery, Western University, London, ON, Canada.,Department of Electrical and Computer Engineering, Western University, London, ON, Canada.,London Health Science Centre, Room B1-333, University Hospital, 339 Windermere Rd., London, ON, Canada
| |
Collapse
|
22
|
Stelmach J, Landsberger DM, Padilla M, Aronoff JM. Determining the minimum number of electrodes that need to be pitch matched to accurately estimate pitch matches across the array. Int J Audiol 2017; 56:894-899. [PMID: 28697658 DOI: 10.1080/14992027.2017.1346302] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
Abstract
OBJECTIVE With bilateral cochlear implant (CI) users there is typically a place mismatch between the locations stimulated by the left and right electrode arrays. This mismatch can affect performance, potentially limiting binaural benefits. One way to address this is by perceptually realigning the arrays such that a given frequency in the input stimulates perceptually matched locations in the two ears. A clinically feasible technique is needed that can determine the appropriate perceptual alignment. A pitch matching task can potentially be used for this, but only if it can be performed in a clinically feasible amount of time. The objective of this study was to determine the minimal number of electrodes that need to be pitch matched to accurately determine pitch matches across the entire array. DESIGN A retrospective analysis of pitch matching data was conducted. Subsets of pitch matches were selected and the predicted pitch matching across the array was compared to that predicted by the full dataset. STUDY SAMPLE 16 bilateral CI users. RESULTS The results indicated that nine pitch matches are sufficient, which can typically be obtained in approximately 7 min. CONCLUSION The results reveal a clinically feasible method for determining pitch matches across the array.
Collapse
Affiliation(s)
- Julia Stelmach
- a Department of Speech and Hearing Science , University of Illinois at Urbana-Champaign , Champaign , IL , USA
| | - David M Landsberger
- b Department of Otolaryngology , New York University , New York , NY , USA.,c Communication and Neuroscience Division , House Ear Institute , Los Angeles , CA , USA
| | - Monica Padilla
- b Department of Otolaryngology , New York University , New York , NY , USA.,c Communication and Neuroscience Division , House Ear Institute , Los Angeles , CA , USA.,d Department of Otolaryngology , University of Southern California , Los Angeles , CA , USA , and
| | - Justin M Aronoff
- a Department of Speech and Hearing Science , University of Illinois at Urbana-Champaign , Champaign , IL , USA.,c Communication and Neuroscience Division , House Ear Institute , Los Angeles , CA , USA.,e Department of Otolaryngology - Head and Neck Surgery , University of Illinois at Chicago , Chicago , IL , USA
| |
Collapse
|
23
|
|
24
|
Landsberger DM, Svrakic M, Roland JT, Svirsky M. The Relationship Between Insertion Angles, Default Frequency Allocations, and Spiral Ganglion Place Pitch in Cochlear Implants. Ear Hear 2016; 36:e207-13. [PMID: 25860624 DOI: 10.1097/aud.0000000000000163] [Citation(s) in RCA: 171] [Impact Index Per Article: 21.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
OBJECTIVES Commercially available cochlear implant systems attempt to deliver frequency information going down to a few hundred Hertz, but the electrode arrays are not designed to reach the most apical regions of the cochlea, which correspond to these low frequencies. This may cause a mismatch between the frequencies presented by a cochlear implant electrode array and the frequencies represented at the corresponding location in a normal-hearing cochlea. In the following study, the mismatch between the frequency presented at a given cochlear angle and the frequency expected by an acoustic hearing ear at the corresponding angle is examined for the cochlear implant systems that are most commonly used in the United States. DESIGN The angular insertion of each of the electrodes on four different electrode arrays (MED-EL Standard, MED-EL Flex28, Advanced Bionics HiFocus 1J, and Cochlear Contour Advance) was estimated from X-ray. For the angular location of each electrode on each electrode array, the predicted spiral ganglion frequency was estimated. The predicted spiral ganglion frequency was compared with the center frequency provided by the corresponding electrode using the manufacturer's default frequency-to-electrode allocation. RESULTS Differences across devices were observed for the place of stimulation for frequencies below 650 Hz. Longer electrode arrays (i.e., the MED-EL Standard and Flex28) demonstrated smaller deviations from the spiral ganglion map than the other electrode arrays. For insertion angles up to approximately 270°, the frequencies presented at a given location were typically approximately an octave below what would be expected by a spiral ganglion frequency map, while the deviations were larger for angles deeper than 270°. For frequencies above 650 Hz, the frequency to angle relationship was consistent across all four electrode models. CONCLUSIONS A mismatch was observed between the predicted frequency and the default frequency provided by every electrode on all electrode arrays. The mismatch can be reduced by changing the default frequency allocations, inserting electrodes deeper into the cochlea, or allowing cochlear implant users to adapt to the mismatch. Further studies are required to fully assess the clinical significance of the frequency mismatch.
Collapse
Affiliation(s)
- David M Landsberger
- Department of Otolaryngology, New York University School of Medicine, New York, USA
| | | | | | | |
Collapse
|
25
|
Guediche S, Holt LL, Laurent P, Lim SJ, Fiez JA. Evidence for Cerebellar Contributions to Adaptive Plasticity in Speech Perception. Cereb Cortex 2015; 25:1867-77. [PMID: 24451660 PMCID: PMC4481605 DOI: 10.1093/cercor/bht428] [Citation(s) in RCA: 36] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023] Open
Abstract
Human speech perception rapidly adapts to maintain comprehension under adverse listening conditions. For example, with exposure listeners can adapt to heavily accented speech produced by a non-native speaker. Outside the domain of speech perception, adaptive changes in sensory and motor processing have been attributed to cerebellar functions. The present functional magnetic resonance imaging study investigates whether adaptation in speech perception also involves the cerebellum. Acoustic stimuli were distorted using a vocoding plus spectral-shift manipulation and presented in a word recognition task. Regions in the cerebellum that showed differences before versus after adaptation were identified, and the relationship between activity during adaptation and subsequent behavioral improvements was examined. These analyses implicated the right Crus I region of the cerebellum in adaptive changes in speech perception. A functional correlation analysis with the right Crus I as a seed region probed for cerebral cortical regions with covarying hemodynamic responses during the adaptation period. The results provided evidence of a functional network between the cerebellum and language-related regions in the temporal and parietal lobes of the cerebral cortex. Consistent with known cerebellar contributions to sensorimotor adaptation, cerebro-cerebellar interactions may support supervised learning mechanisms that rely on sensory prediction error signals in speech perception.
Collapse
Affiliation(s)
- Sara Guediche
- Center for Neuroscience at the University of Pittsburgh, Pittsburgh, PA, USA
- Center for the Neural Basis of Cognition, Pittsburgh, PA, USA
- Current address: Department of Cognitive, Linguistic, and Psychological Sciences, Brown University, Providence, RI, USA
| | - Lori L. Holt
- Center for Neuroscience at the University of Pittsburgh, Pittsburgh, PA, USA
- Center for the Neural Basis of Cognition, Pittsburgh, PA, USA
- Department of Psychology, Carnegie Mellon University, Pittsburgh, PA, USA
| | - Patryk Laurent
- Center for Neuroscience at the University of Pittsburgh, Pittsburgh, PA, USA
- Center for the Neural Basis of Cognition, Pittsburgh, PA, USA
- Current address: Brain Corporation, San Diego, CA, USA
| | - Sung-Joo Lim
- Center for the Neural Basis of Cognition, Pittsburgh, PA, USA
- Department of Psychology, Carnegie Mellon University, Pittsburgh, PA, USA
| | - Julie A. Fiez
- Center for Neuroscience at the University of Pittsburgh, Pittsburgh, PA, USA
- Center for the Neural Basis of Cognition, Pittsburgh, PA, USA
- Department of Psychology, University of Pittsburgh, Pittsburgh, PA, USA
| |
Collapse
|
26
|
Whitmal NA, DeMaio D, Lin R. Effects of envelope bandwidth on importance functions for cochlear implant simulations. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2015; 137:733-744. [PMID: 25698008 DOI: 10.1121/1.4906260] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/28/2023]
Abstract
Frequency-importance functions (FIFs) quantify intelligibility contributions of spectral regions of speech. In previous work, FIFs were considered as instruments for characterizing intelligibility contributions of individual cochlear implant electrode channels. Comparisons of FIFs for natural speech and vocoder-simulated implant processed speech showed that vocoding shifted peak importance regions downward in frequency by 0.5 octaves. These shifts were attributed to voicing cue changes, and may reflect increased reliance on low-frequency information (apart from periodicity cues) for correct voicing perception. The purpose of this study was to determine whether increasing channel envelope bandwidth would reverse these shifts by improving access to voicing and pitch cues. Importance functions were measured for 48 subjects with normal hearing, who listened to vowel-consonant-vowel tokens either as recorded or as output from five different vocoders that simulated implant processing. Envelopes were constructed using filters that either included or excluded pitch information. Results indicate that vocoding-based shifts are only partially counteracted by including pitch information; moreover, a substantial baseline shift is present even for vocoders with high spectral resolution. The results also suggest that vocoded speech intelligibility is most sensitive to a loss of spectral resolution in high-importance regions, a finding with possible implications for cochlear implant electrode mapping.
Collapse
Affiliation(s)
- Nathaniel A Whitmal
- Department of Communication Disorders, University of Massachusetts, Amherst, Massachusetts 01003
| | - Decia DeMaio
- Department of Communication Disorders, University of Massachusetts, Amherst, Massachusetts 01003
| | - Rongheng Lin
- Division of Biostatistics and Epidemiology, School of Public Health and Health Sciences, University of Massachusetts, Amherst, Massachusetts 01003
| |
Collapse
|
27
|
Place pitch versus electrode location in a realistic computational model of the implanted human cochlea. Hear Res 2014; 315:10-24. [PMID: 24975087 DOI: 10.1016/j.heares.2014.06.003] [Citation(s) in RCA: 66] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/17/2014] [Revised: 06/06/2014] [Accepted: 06/15/2014] [Indexed: 11/23/2022]
Abstract
Place pitch was investigated in a computational model of the implanted human cochlea containing nerve fibres with realistic trajectories that take the variable distance between the organ of Corti and spiral ganglion into account. The model was further updated from previous studies by including fluid compartments in the modiolus and updating the electrical conductivity values of (temporal) bone and the modiolus, based on clinical data. Four different cochlear geometries are used, modelled with both lateral and perimodiolar implants, and their neural excitation patterns were examined for nerve fibres modelled with and without peripheral processes. Additionally, equations were derived from the model geometries that describe Greenwood's frequency map as a function of cochlear angle at the basilar membrane as well as at the spiral ganglion. The main findings are: (I) in the first (basal) turn of the cochlea, cochlear implant induced pitch can be predicted fairly well using the Greenwood function. (II) Beyond the first turn this pitch becomes increasingly unpredictable, greatly dependent on stimulus level, state of the cochlear neurons and the electrode's distance from the modiolus. (III) After the first turn cochlear implant induced pitch decreases as stimulus level increases, but the pitch does not reach values expected from direct spiral ganglion stimulation unless the peripheral processes are missing. (IV) Electrode contacts near the end of the spiral ganglion or deeper elicit very unpredictable pitch, with broad frequency ranges that strongly overlap with those of neighbouring contacts. (V) The characteristic place pitch for stimulation at either the organ of Corti or the spiral ganglion can be described as a function of cochlear angle by the equations presented in this paper.
Collapse
|
28
|
Lee T, Yu S, Yuan M, Wong TKC, Kong YY. The effect of enhancing temporal periodicity cues on Cantonese tone recognition by cochlear implantees. Int J Audiol 2014; 53:546-57. [PMID: 24694089 DOI: 10.3109/14992027.2014.893374] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
OBJECTIVES This study investigates the efficacy of a cochlear implant (CI) processing method that enhances temporal periodicity cues of speech. DESIGN Subjects participated in word and tone identification tasks. Two processing conditions - the conventional advanced combination encoder (ACE) and tone-enhanced ACE were tested. Test materials were Cantonese disyllabic words recorded from one male and one female speaker. Speech-shaped noise was added to clean speech. The fundamental frequency information for periodicity enhancement was extracted from the clean speech. Electrical stimuli generated from the noisy speech with and without periodicity enhancement were presented via direct stimulation using a Laura 34 research processor. Subjects were asked to identify the presented word. STUDY SAMPLE Seven post-lingually deafened native Cantonese-speaking CI users. RESULTS Percent correct word, segmental structure, and tone identification scores were calculated. While word and segmental structure identification accuracy remained similar between the two processing conditions, tone identification in noise was better with tone-enhanced ACE than with conventional ACE. Significant improvement on tone perception was found only for the female voice. CONCLUSIONS Temporal periodicity cues are important to tone perception in noise. Pitch and tone perception by CI users could be improved when listeners received enhanced temporal periodicity cues.
Collapse
Affiliation(s)
- Tan Lee
- * Department of Electronic Engineering, The Chinese University of Hong Kong , China
| | | | | | | | | |
Collapse
|
29
|
Guediche S, Blumstein SE, Fiez JA, Holt LL. Speech perception under adverse conditions: insights from behavioral, computational, and neuroscience research. Front Syst Neurosci 2014; 7:126. [PMID: 24427119 PMCID: PMC3879477 DOI: 10.3389/fnsys.2013.00126] [Citation(s) in RCA: 47] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2013] [Accepted: 12/16/2013] [Indexed: 01/06/2023] Open
Abstract
Adult speech perception reflects the long-term regularities of the native language, but it is also flexible such that it accommodates and adapts to adverse listening conditions and short-term deviations from native-language norms. The purpose of this article is to examine how the broader neuroscience literature can inform and advance research efforts in understanding the neural basis of flexibility and adaptive plasticity in speech perception. Specifically, we highlight the potential role of learning algorithms that rely on prediction error signals and discuss specific neural structures that are likely to contribute to such learning. To this end, we review behavioral studies, computational accounts, and neuroimaging findings related to adaptive plasticity in speech perception. Already, a few studies have alluded to a potential role of these mechanisms in adaptive plasticity in speech perception. Furthermore, we consider research topics in neuroscience that offer insight into how perception can be adaptively tuned to short-term deviations while balancing the need to maintain stability in the perception of learned long-term regularities. Consideration of the application and limitations of these algorithms in characterizing flexible speech perception under adverse conditions promises to inform theoretical models of speech.
Collapse
Affiliation(s)
- Sara Guediche
- Department of Cognitive, Linguistic, and Psychological Sciences, Brown UniversityProvidence, RI, USA
| | - Sheila E. Blumstein
- Department of Cognitive, Linguistic, and Psychological Sciences, Brown UniversityProvidence, RI, USA
- Department of Cognitive, Linguistic, and Psychological Sciences, Brain Institute, Brown UniversityProvidence, RI, USA
| | - Julie A. Fiez
- Department of Neuroscience, Center for Neuroscience at the University of Pittsburgh, University of PittsburghPittsburgh, PA, USA
- Department of Psychology, University of PittsburghPittsburgh, PA, USA
- Department of Psychology at Carnegie Mellon University and Department of Neuroscience at the University of Pittsburgh, Center for the Neural Basis of CognitionPittsburgh, PA, USA
| | - Lori L. Holt
- Department of Neuroscience, Center for Neuroscience at the University of Pittsburgh, University of PittsburghPittsburgh, PA, USA
- Department of Psychology at Carnegie Mellon University and Department of Neuroscience at the University of Pittsburgh, Center for the Neural Basis of CognitionPittsburgh, PA, USA
- Department of Psychology, Carnegie Mellon UniversityPittsburgh, PA, USA
| |
Collapse
|
30
|
van Besouw RM, Forrester L, Crowe ND, Rowan D. Simulating the effect of interaural mismatch in the insertion depth of bilateral cochlear implants on speech perception. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2013; 134:1348-1357. [PMID: 23927131 DOI: 10.1121/1.4812272] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/02/2023]
Abstract
A bilateral advantage for diotically presented stimuli has been observed for cochlear implant (CI) users and is suggested to be dependent on symmetrical implant performance. Studies using CI simulations have not shown a true "bilateral" advantage, but a "better ear" effect and have demonstrated that performance decreases with increasing basalward shift in insertion depth. This study aimed to determine whether there is a bilateral advantage for CI simulations with interaurally matched insertions and the extent to which performance is affected by interaural insertion depth mismatch. Speech perception in noise and self-reported ease of listening were measured using matched bilateral, mismatched bilateral and unilateral CI simulations over four insertion depths for seventeen normal hearing listeners. Speech scores and ease of listening reduced with increasing basalward shift in (interaurally matched) insertion depth. A bilateral advantage for speech perception was only observed when the insertion depths were interaurally matched and deep. No advantage was observed for small to moderate interaural insertion-depth mismatches, consistent with a better ear effect. Finally, both measures were poorer than expected for a better ear effect for large mismatches, suggesting that misalignment of the electrode arrays may prevent a bilateral advantage and detrimentally affect perception of diotically presented speech.
Collapse
Affiliation(s)
- Rachel M van Besouw
- Institute of Sound and Vibration Research, University of Southampton, Southampton, Hampshire, SO17 1BJ, United Kingdom.
| | | | | | | |
Collapse
|
31
|
Wilkinson EP, Abdel-Hamid O, Galvin JJ, Jiang H, Fu QJ. Voice conversion in cochlear implantation. Laryngoscope 2013; 123 Suppl 3:S29-43. [PMID: 23299859 DOI: 10.1002/lary.23744] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2012] [Revised: 07/01/2012] [Accepted: 08/22/2012] [Indexed: 11/09/2022]
Abstract
OBJECTIVES/HYPOTHESIS Voice conversion algorithms may benefit cochlear implant (CI) users who better understand speech produced by one talker than by another. It is unclear how the source or target talker's fundamental frequency (F0) information may contribute to perception of converted speech. This study evaluated voice conversion algorithms for CI users in which the source or target talker's F0 was included in the converted speech. STUDY DESIGN Development and evaluation of computerized voice conversion algorithms in CI patients. METHODS A series of cepstral analysis-based algorithms were developed and evaluated in six CI users. The algorithms converted talker voice gender (male-to-female, or female-to-male); either the source or target talker F0 was included in the converted speech. The voice conversion algorithms were evaluated in terms of recognition of IEEE sentences, speech quality, and voice gender discrimination. RESULTS Voice gender recognition performance showed that listeners strongly cued to the F0 that was included within the converted speech. For both IEEE sentence recognition and voice quality ratings, performance was poorer with the voice conversion algorithms than with original speech. Performance on female-to-male conversion was superior to male-to-female conversion. CONCLUSION The strong cueing to F0 within the voice conversion algorithms suggests that CI users are able to utilize temporal periodicity information for some pitch-related tasks. Limitations on spectral channel information experienced by CI users may result in poorer performance with voice conversion algorithms due to distortion of speech formant information and degradation of the spectral envelope.
Collapse
|
32
|
Yoon YS, Liu A, Fu QJ. Binaural benefit for speech recognition with spectral mismatch across ears in simulated electric hearing. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2011; 130:EL94-100. [PMID: 21877777 PMCID: PMC3155583 DOI: 10.1121/1.3606460] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/10/2023]
Abstract
The present study investigated the effects of binaural spectral mismatch on binaural benefits in the context of bilateral cochlear implants using acoustic simulations. Binaural spectral mismatch was systematically manipulated by simulating changes in the relative insertion depths across ears. Sentence recognition, presented unilaterally and bilaterally, were measured in normal-hearing listeners in quiet and noise at +5 dB signal-to-noise ratio. Significant binaural benefits were observed when the interaural difference in insertion depth was 1 mm or less. This result suggests a dependence of the binaural benefit on redundant speech information, rather than on similarity in performance across ears.
Collapse
Affiliation(s)
- Yang-soo Yoon
- Communication and Neuroscience Division, House Ear Institute, Los Angeles, CA 90057, USA.
| | | | | |
Collapse
|