1
|
Borjigin A, Bharadwaj HM. Individual Differences Elucidate the Perceptual Benefits Associated with Robust Temporal Fine-Structure Processing. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.09.20.558670. [PMID: 37790457 PMCID: PMC10542537 DOI: 10.1101/2023.09.20.558670] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/05/2023]
Abstract
The auditory system is unique among sensory systems in its ability to phase lock to and precisely follow very fast cycle-by-cycle fluctuations in the phase of sound-driven cochlear vibrations. Yet, the perceptual role of this temporal fine structure (TFS) code is debated. This fundamental gap is attributable to our inability to experimentally manipulate TFS cues without altering other perceptually relevant cues. Here, we circumnavigated this limitation by leveraging individual differences across 200 participants to systematically compare variations in TFS sensitivity to performance in a range of speech perception tasks. TFS sensitivity was assessed through detection of interaural time/phase differences, while speech perception was evaluated by word identification under noise interference. Results suggest that greater TFS sensitivity is not associated with greater masking release from fundamental-frequency or spatial cues, but appears to contribute to resilience against the effects of reverberation. We also found that greater TFS sensitivity is associated with faster response times, indicating reduced listening effort. These findings highlight the perceptual significance of TFS coding for everyday hearing.
Collapse
Affiliation(s)
- Agudemu Borjigin
- Weldon School of Biomedical Engineering, Purdue University, West Lafayette, IN 47907, USA
- Waisman Center, University of Wisconsin - Madison, Madison, WI 53705, USA
| | - Hari M. Bharadwaj
- Weldon School of Biomedical Engineering, Purdue University, West Lafayette, IN 47907, USA
- Department of Speech, Language, and Hearing Sciences, Purdue University, West Lafayette, IN 47907, USA
- Department of Communication Science and Disorders, University of Pittsburgh, Pittsburgh, PA 15213, USA
| |
Collapse
|
2
|
Devolder P, Keppler H, Keshishzadeh S, Taghon B, Dhooge I, Verhulst S. The role of hidden hearing loss in tinnitus: Insights from early markers of peripheral hearing damage. Hear Res 2024; 450:109050. [PMID: 38852534 DOI: 10.1016/j.heares.2024.109050] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/01/2024] [Revised: 05/24/2024] [Accepted: 05/28/2024] [Indexed: 06/11/2024]
Abstract
Since the presence of tinnitus is not always associated with audiometric hearing loss, it has been hypothesized that hidden hearing loss may act as a potential trigger for increased central gain along the neural pathway leading to tinnitus perception. In recent years, the study of hidden hearing loss has improved with the discovery of cochlear synaptopathy and several objective diagnostic markers. This study investigated three potential markers of peripheral hidden hearing loss in subjects with tinnitus: extended high-frequency audiometric thresholds, the auditory brainstem response, and the envelope following response. In addition, speech intelligibility was measured as a functional outcome measurement of hidden hearing loss. To account for age-related hidden hearing loss, participants were grouped according to age, presence of tinnitus, and audiometric thresholds. Group comparisons were conducted to differentiate between age- and tinnitus-related effects of hidden hearing loss. All three markers revealed age-related differences, whereas no differences were observed between the tinnitus and non-tinnitus groups. However, the older tinnitus group showed improved performance on low-pass filtered speech in noise tests compared to the older non-tinnitus group. These low-pass speech in noise scores were significantly correlated with tinnitus distress, as indicated using questionnaires, and could be related to the presence of hyperacusis. Based on our observations, cochlear synaptopathy does not appear to be the underlying cause of tinnitus. The improvement in low-pass speech-in-noise could be explained by enhanced temporal fine structure encoding or hyperacusis. Therefore, we recommend that future tinnitus research takes into account age-related factors, explores low-frequency encoding, and thoroughly assesses hyperacusis.
Collapse
Affiliation(s)
- Pauline Devolder
- Hearing Technology @ WAVES, Department of Information Technology, Ghent University, Zwijnaarde, Belgium; Department of Rehabilitation Sciences, Ghent University, Ghent, Belgium.
| | - Hannah Keppler
- Department of Rehabilitation Sciences, Ghent University, Ghent, Belgium; Department of Ear, Nose and Throat, Ghent University Hospital, Ghent, Belgium
| | - Sarineh Keshishzadeh
- Hearing Technology @ WAVES, Department of Information Technology, Ghent University, Zwijnaarde, Belgium
| | - Baziel Taghon
- Hearing Technology @ WAVES, Department of Information Technology, Ghent University, Zwijnaarde, Belgium
| | - Ingeborg Dhooge
- Department of Ear, Nose and Throat, Ghent University Hospital, Ghent, Belgium; Department of Head and Skin, Ghent University, Ghent, Belgium
| | - Sarah Verhulst
- Hearing Technology @ WAVES, Department of Information Technology, Ghent University, Zwijnaarde, Belgium
| |
Collapse
|
3
|
Goupell MJ, Stecker GC, Williams BT, Bilokon A, Tollin DJ. The Rapid Decline in Interaural-Time-Difference Sensitivity for Pure Tones Can Be Explained by Peripheral Filtering. J Assoc Res Otolaryngol 2024:10.1007/s10162-024-00949-y. [PMID: 38769250 DOI: 10.1007/s10162-024-00949-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2023] [Accepted: 04/28/2024] [Indexed: 05/22/2024] Open
Abstract
PURPOSE The interaural time difference (ITD) is a primary horizontal-plane sound localization cue computed in the auditory brainstem. ITDs are accessible in the temporal fine structure of pure tones with a frequency of no higher than about 1400 Hz. How listeners' ITD sensitivity transitions from very best sensitivity near 700 Hz to impossible to detect within 1 octave currently lacks a fully compelling physiological explanation. Here, it was hypothesized that the rapid decline in ITD sensitivity is dictated not by a central neural limitation but by initial peripheral sound encoding, specifically, the low-frequency (apical) portion of the cochlear excitation pattern produced by a pure tone. METHODS ITD sensitivity was measured in 16 normal-hearing listeners as a joint function of frequency (900-1500 Hz) and level (10-50 dB sensation level). RESULTS Performance decreased with increasing frequency and decreasing sound level. The slope of performance decline was 90 dB/octave, consistent with the low-frequency slope of the cochlear excitation pattern. CONCLUSION Fine-structure ITD sensitivity near 1400 Hz may be conveyed primarily by "off-frequency" activation of neurons tuned to lower frequencies near 700 Hz. Physiologically, this could be realized by having neurons sensitive to fine-structure ITD up to only about 700 Hz. A more extreme model would have only a single narrow channel near 700 Hz that conveys fine-structure ITDs. Such a model is a major simplification and departure from the classic formulation of the binaural display, which consists of a matrix of neurons tuned to a wide range of relevant frequencies and ITDs.
Collapse
Affiliation(s)
- Matthew J Goupell
- Department of Hearing and Speech Sciences, University of Maryland, College Park, MD, 20742, USA.
- Neuroscience and Cognitive Science Program, University of Maryland, College Park, MD, 20742, USA.
| | - G Christopher Stecker
- Center for Hearing Research, Boys Town National Research Hospital, 555 N 30th St, Omaha, NE, 68131, USA
| | - Brittany T Williams
- Center for Hearing Research, Boys Town National Research Hospital, 555 N 30th St, Omaha, NE, 68131, USA
| | - Anhelina Bilokon
- Department of Hearing and Speech Sciences, University of Maryland, College Park, MD, 20742, USA
| | - Daniel J Tollin
- Department of Physiology & Biophysics, University of Colorado School of Medicine, Aurora, CO, 80045, USA
| |
Collapse
|
4
|
Schirmer J, Wolpert S, Dapper K, Rühle M, Wertz J, Wouters M, Eldh T, Bader K, Singer W, Gaudrain E, Başkent D, Verhulst S, Braun C, Rüttiger L, Munk MHJ, Dalhoff E, Knipper M. Neural Adaptation at Stimulus Onset and Speed of Neural Processing as Critical Contributors to Speech Comprehension Independent of Hearing Threshold or Age. J Clin Med 2024; 13:2725. [PMID: 38731254 PMCID: PMC11084258 DOI: 10.3390/jcm13092725] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2024] [Revised: 04/24/2024] [Accepted: 04/26/2024] [Indexed: 05/13/2024] Open
Abstract
Background: It is assumed that speech comprehension deficits in background noise are caused by age-related or acquired hearing loss. Methods: We examined young, middle-aged, and older individuals with and without hearing threshold loss using pure-tone (PT) audiometry, short-pulsed distortion-product otoacoustic emissions (pDPOAEs), auditory brainstem responses (ABRs), auditory steady-state responses (ASSRs), speech comprehension (OLSA), and syllable discrimination in quiet and noise. Results: A noticeable decline of hearing sensitivity in extended high-frequency regions and its influence on low-frequency-induced ABRs was striking. When testing for differences in OLSA thresholds normalized for PT thresholds (PTTs), marked differences in speech comprehension ability exist not only in noise, but also in quiet, and they exist throughout the whole age range investigated. Listeners with poor speech comprehension in quiet exhibited a relatively lower pDPOAE and, thus, cochlear amplifier performance independent of PTT, smaller and delayed ABRs, and lower performance in vowel-phoneme discrimination below phase-locking limits (/o/-/u/). When OLSA was tested in noise, listeners with poor speech comprehension independent of PTT had larger pDPOAEs and, thus, cochlear amplifier performance, larger ASSR amplitudes, and higher uncomfortable loudness levels, all linked with lower performance of vowel-phoneme discrimination above the phase-locking limit (/i/-/y/). Conslusions: This study indicates that listening in noise in humans has a sizable disadvantage in envelope coding when basilar-membrane compression is compromised. Clearly, and in contrast to previous assumptions, both good and poor speech comprehension can exist independently of differences in PTTs and age, a phenomenon that urgently requires improved techniques to diagnose sound processing at stimulus onset in the clinical routine.
Collapse
Affiliation(s)
- Jakob Schirmer
- Department of Otolaryngology, Head and Neck Surgery, University of Tübingen, Elfriede-Aulhorn-Str. 5, 72076 Tübingen, Germany; (J.S.); (S.W.); (K.D.); (M.R.); (J.W.); (T.E.); (K.B.); (W.S.); (L.R.)
| | - Stephan Wolpert
- Department of Otolaryngology, Head and Neck Surgery, University of Tübingen, Elfriede-Aulhorn-Str. 5, 72076 Tübingen, Germany; (J.S.); (S.W.); (K.D.); (M.R.); (J.W.); (T.E.); (K.B.); (W.S.); (L.R.)
| | - Konrad Dapper
- Department of Otolaryngology, Head and Neck Surgery, University of Tübingen, Elfriede-Aulhorn-Str. 5, 72076 Tübingen, Germany; (J.S.); (S.W.); (K.D.); (M.R.); (J.W.); (T.E.); (K.B.); (W.S.); (L.R.)
- Department of Biology, Technical University Darmstadt, 64287 Darmstadt, Germany
| | - Moritz Rühle
- Department of Otolaryngology, Head and Neck Surgery, University of Tübingen, Elfriede-Aulhorn-Str. 5, 72076 Tübingen, Germany; (J.S.); (S.W.); (K.D.); (M.R.); (J.W.); (T.E.); (K.B.); (W.S.); (L.R.)
| | - Jakob Wertz
- Department of Otolaryngology, Head and Neck Surgery, University of Tübingen, Elfriede-Aulhorn-Str. 5, 72076 Tübingen, Germany; (J.S.); (S.W.); (K.D.); (M.R.); (J.W.); (T.E.); (K.B.); (W.S.); (L.R.)
| | - Marjoleen Wouters
- Department of Information Technology, Ghent University, Technologiepark 126, 9052 Zwijnaarde, Belgium; (M.W.); (S.V.)
| | - Therese Eldh
- Department of Otolaryngology, Head and Neck Surgery, University of Tübingen, Elfriede-Aulhorn-Str. 5, 72076 Tübingen, Germany; (J.S.); (S.W.); (K.D.); (M.R.); (J.W.); (T.E.); (K.B.); (W.S.); (L.R.)
| | - Katharina Bader
- Department of Otolaryngology, Head and Neck Surgery, University of Tübingen, Elfriede-Aulhorn-Str. 5, 72076 Tübingen, Germany; (J.S.); (S.W.); (K.D.); (M.R.); (J.W.); (T.E.); (K.B.); (W.S.); (L.R.)
| | - Wibke Singer
- Department of Otolaryngology, Head and Neck Surgery, University of Tübingen, Elfriede-Aulhorn-Str. 5, 72076 Tübingen, Germany; (J.S.); (S.W.); (K.D.); (M.R.); (J.W.); (T.E.); (K.B.); (W.S.); (L.R.)
| | - Etienne Gaudrain
- Lyon Neuroscience Research Center, Centre National de la Recherche Scientifique UMR5292, Inserm U1028, Université Lyon 1, Centre Hospitalier Le Vinatier-Bâtiment 462–Neurocampus, 95 Boulevard Pinel, 69675 Bron CEDEX, France;
- Department of Otorhinolaryngology, University Medical Center Groningen (UMCG), Hanzeplein 1, BB21, 9700 RB Groningen, The Netherlands;
| | - Deniz Başkent
- Department of Otorhinolaryngology, University Medical Center Groningen (UMCG), Hanzeplein 1, BB21, 9700 RB Groningen, The Netherlands;
| | - Sarah Verhulst
- Department of Information Technology, Ghent University, Technologiepark 126, 9052 Zwijnaarde, Belgium; (M.W.); (S.V.)
| | - Christoph Braun
- Magnetoencephalography-Centre and Hertie Institute for Clinical Brain Research, University of Tübingen, Otfried-Müller-Straße 27, 72076 Tübingen, Germany;
- Center for Mind and Brain Research, University of Trento, Palazzo Fedrigotti-corso Bettini 31, 38068 Rovereto, Italy
| | - Lukas Rüttiger
- Department of Otolaryngology, Head and Neck Surgery, University of Tübingen, Elfriede-Aulhorn-Str. 5, 72076 Tübingen, Germany; (J.S.); (S.W.); (K.D.); (M.R.); (J.W.); (T.E.); (K.B.); (W.S.); (L.R.)
| | - Matthias H. J. Munk
- Department of Biology, Technical University Darmstadt, 64287 Darmstadt, Germany
- Department of Psychiatry & Psychotherapy, University of Tübingen, Calwerstraße 14, 72076 Tübingen, Germany
| | - Ernst Dalhoff
- Department of Otolaryngology, Head and Neck Surgery, University of Tübingen, Elfriede-Aulhorn-Str. 5, 72076 Tübingen, Germany; (J.S.); (S.W.); (K.D.); (M.R.); (J.W.); (T.E.); (K.B.); (W.S.); (L.R.)
| | - Marlies Knipper
- Department of Otolaryngology, Head and Neck Surgery, University of Tübingen, Elfriede-Aulhorn-Str. 5, 72076 Tübingen, Germany; (J.S.); (S.W.); (K.D.); (M.R.); (J.W.); (T.E.); (K.B.); (W.S.); (L.R.)
| |
Collapse
|
5
|
Saddler MR, McDermott JH. Models optimized for real-world tasks reveal the necessity of precise temporal coding in hearing. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.04.21.590435. [PMID: 38712054 PMCID: PMC11071365 DOI: 10.1101/2024.04.21.590435] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/08/2024]
Abstract
Neurons encode information in the timing of their spikes in addition to their firing rates. Spike timing is particularly precise in the auditory nerve, where action potentials phase lock to sound with sub-millisecond precision, but its behavioral relevance is uncertain. To investigate the role of this temporal coding, we optimized machine learning models to perform real-world hearing tasks with simulated cochlear input. We asked how precise auditory nerve spike timing needed to be to reproduce human behavior. Models with high-fidelity phase locking exhibited more human-like sound localization and speech perception than models without, consistent with an essential role in human hearing. Degrading phase locking produced task-dependent effects, revealing how the use of fine-grained temporal information reflects both ecological task demands and neural implementation constraints. The results link neural coding to perception and clarify conditions in which prostheses that fail to restore high-fidelity temporal coding could in principle restore near-normal hearing.
Collapse
Affiliation(s)
- Mark R Saddler
- Department of Brain and Cognitive Sciences, MIT, Cambridge, MA, USA
- McGovern Institute for Brain Research, MIT, Cambridge, MA, USA
- Center for Brains, Minds, and Machines, MIT, Cambridge, MA, USA
| | - Josh H McDermott
- Department of Brain and Cognitive Sciences, MIT, Cambridge, MA, USA
- McGovern Institute for Brain Research, MIT, Cambridge, MA, USA
- Center for Brains, Minds, and Machines, MIT, Cambridge, MA, USA
- Program in Speech and Hearing Biosciences and Technology, Harvard, Cambridge, MA, USA
| |
Collapse
|
6
|
Hu H, Ewert SD, Kollmeier B, Vickers D. Rate dependent neural responses of interaural-time-difference cues in fine-structure and envelope. PeerJ 2024; 12:e17104. [PMID: 38680894 PMCID: PMC11055513 DOI: 10.7717/peerj.17104] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2023] [Accepted: 02/22/2024] [Indexed: 05/01/2024] Open
Abstract
Advancements in cochlear implants (CIs) have led to a significant increase in bilateral CI users, especially among children. Yet, most bilateral CI users do not fully achieve the intended binaural benefit due to potential limitations in signal processing and/or surgical implant positioning. One crucial auditory cue that normal hearing (NH) listeners can benefit from is the interaural time difference (ITD), i.e., the time difference between the arrival of a sound at two ears. The ITD sensitivity is thought to be heavily relying on the effective utilization of temporal fine structure (very rapid oscillations in sound). Unfortunately, most current CIs do not transmit such true fine structure. Nevertheless, bilateral CI users have demonstrated sensitivity to ITD cues delivered through envelope or interaural pulse time differences, i.e., the time gap between the pulses delivered to the two implants. However, their ITD sensitivity is significantly poorer compared to NH individuals, and it further degrades at higher CI stimulation rates, especially when the rate exceeds 300 pulse per second. The overall purpose of this research thread is to improve spatial hearing abilities in bilateral CI users. This study aims to develop electroencephalography (EEG) paradigms that can be used with clinical settings to assess and optimize the delivery of ITD cues, which are crucial for spatial hearing in everyday life. The research objective of this article was to determine the effect of CI stimulation pulse rate on the ITD sensitivity, and to characterize the rate-dependent degradation in ITD perception using EEG measures. To develop protocols for bilateral CI studies, EEG responses were obtained from NH listeners using sinusoidal-amplitude-modulated (SAM) tones and filtered clicks with changes in either fine structure ITD (ITDFS) or envelope ITD (ITDENV). Multiple EEG responses were analyzed, which included the subcortical auditory steady-state responses (ASSRs) and cortical auditory evoked potentials (CAEPs) elicited by stimuli onset, offset, and changes. Results indicated that acoustic change complex (ACC) responses elicited by ITDENV changes were significantly smaller or absent compared to those elicited by ITDFS changes. The ACC morphologies evoked by ITDFS changes were similar to onset and offset CAEPs, although the peak latencies were longest for ACC responses and shortest for offset CAEPs. The high-frequency stimuli clearly elicited subcortical ASSRs, but smaller than those evoked by lower carrier frequency SAM tones. The 40-Hz ASSRs decreased with increasing carrier frequencies. Filtered clicks elicited larger ASSRs compared to high-frequency SAM tones, with the order being 40 > 160 > 80> 320 Hz ASSR for both stimulus types. Wavelet analysis revealed a clear interaction between detectable transient CAEPs and 40-Hz ASSRs in the time-frequency domain for SAM tones with a low carrier frequency.
Collapse
Affiliation(s)
- Hongmei Hu
- SOUND Lab, Cambridge Hearing Group, Department of Clinical Neuroscience, Cambridge University, Cambridge, United Kingdom
- Department of Medical Physics and Acoustics, Carl von Ossietzky University of Oldenburg, Oldenburg, Germany
| | - Stephan D. Ewert
- Department of Medical Physics and Acoustics, Carl von Ossietzky University of Oldenburg, Oldenburg, Germany
| | - Birger Kollmeier
- Department of Medical Physics and Acoustics, Carl von Ossietzky University of Oldenburg, Oldenburg, Germany
| | - Deborah Vickers
- SOUND Lab, Cambridge Hearing Group, Department of Clinical Neuroscience, Cambridge University, Cambridge, United Kingdom
| |
Collapse
|
7
|
Cychosz M, Winn MB, Goupell MJ. How to vocode: Using channel vocoders for cochlear-implant research. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2024; 155:2407-2437. [PMID: 38568143 PMCID: PMC10994674 DOI: 10.1121/10.0025274] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/19/2023] [Accepted: 02/23/2024] [Indexed: 04/05/2024]
Abstract
The channel vocoder has become a useful tool to understand the impact of specific forms of auditory degradation-particularly the spectral and temporal degradation that reflect cochlear-implant processing. Vocoders have many parameters that allow researchers to answer questions about cochlear-implant processing in ways that overcome some logistical complications of controlling for factors in individual cochlear implant users. However, there is such a large variety in the implementation of vocoders that the term "vocoder" is not specific enough to describe the signal processing used in these experiments. Misunderstanding vocoder parameters can result in experimental confounds or unexpected stimulus distortions. This paper highlights the signal processing parameters that should be specified when describing vocoder construction. The paper also provides guidance on how to determine vocoder parameters within perception experiments, given the experimenter's goals and research questions, to avoid common signal processing mistakes. Throughout, we will assume that experimenters are interested in vocoders with the specific goal of better understanding cochlear implants.
Collapse
Affiliation(s)
- Margaret Cychosz
- Department of Linguistics, University of California, Los Angeles, Los Angeles, California 90095, USA
| | - Matthew B Winn
- Department of Speech-Language-Hearing Sciences, University of Minnesota, Minneapolis, Minnesota 55455, USA
| | - Matthew J Goupell
- Department of Hearing and Speech Sciences, University of Maryland, College Park, College Park, Maryland 20742, USA
| |
Collapse
|
8
|
Goupell MJ, Stecker GC, Williams BT, Bilokon A, Tollin DJ. The rapid decline in interaural-time-difference sensitivity for pure tones can be explained by peripheral filtering. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.08.04.551950. [PMID: 37577552 PMCID: PMC10418179 DOI: 10.1101/2023.08.04.551950] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/15/2023]
Abstract
Purpose The interaural time difference (ITD) is a primary horizontal-plane sound localization cue computed in the auditory brainstem. ITDs are accessible in the temporal fine structure of pure tones with a frequency of no higher than about 1400 Hz. Explaining how listeners' ITD sensitivity transitions from very best sensitivity near 700 Hz to impossible to detect within 1 octave currently lacks a fully compelling physiological explanation. Here, it was hypothesized that the rapid decline in ITD sensitivity is dictated not by a central neural limitation but by initial peripheral sound encoding, specifically, the low-frequency (apical) edge of the cochlear excitation pattern produced by a pure tone. Methods ITD sensitivity was measured in 16 normal-hearing listeners as a joint function of frequency (900-1500 Hz) and level (10-50 dB sensation level). Results Performance decreased with increasing frequency and decreasing sound level. The slope of performance decline was 90 dB/octave, consistent with the low-frequency slope of the cochlear excitation pattern. Conclusion Fine-structure ITD sensitivity near 1400 Hz may be conveyed primarily by "off-frequency" activation of neurons tuned to lower frequencies near 700 Hz. Physiologically, this could be realized by having neurons sensitive to fine-structure ITD up to only about 700 Hz. A more extreme model would have only a single narrow channel near 700 Hz that conveys fine-structure ITDs. Such a model is a major simplification and departure from the classic formulation of the binaural display, which consists of a matrix of neurons tuned to a wide range of relevant frequencies and ITDs.
Collapse
|
9
|
Li G, Gao Y, Wu H, Zhao T. Gentamicin administration leads to synaptic dysfunction in inner hair cells. Toxicol Lett 2024; 391:86-99. [PMID: 38101494 DOI: 10.1016/j.toxlet.2023.12.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2023] [Revised: 11/17/2023] [Accepted: 12/11/2023] [Indexed: 12/17/2023]
Abstract
Ototoxicity is a major side effect of aminoglycosides, which can cause irreversible hearing loss. Previous studies on aminoglycoside-induced ototoxicity have primarily focused on the loss of sensory hair cells. Recent investigations have revealed that aminoglycosides can also lead to the loss of ribbon synapses in inner hair cells (IHCs). However, the functional implications of ribbon synapse loss and the underlying mechanisms remain unclear. In this study, we intraperitoneally injected C57BL/6 J mice with 300 mg/kg gentamicin once daily for 3, 10, and 20 days. Then, we performed immunofluorescence staining, patch-clamp recording, proteomics analysis and western blotting to characterize the changes in ribbon synapses in IHCs and the associated mechanisms. After gentamicin treatment, the auditory brainstem response (ABR) threshold was elevated, and the ABR wave I amplitude was decreased. We also observed loss of ribbon synapses in IHCs. Interestingly, ribbon synapse loss occurred on both the modiolar and pillar sides of IHCs. Whole-cell patch-clamp recordings in IHCs revealed a reduction in the calcium current amplitude, along with a shifted half-activation voltage and altered calcium voltage dependency. Moreover, exocytosis of IHCs was reduced, consistent with the reduction in the ABR wave I amplitude. Through proteomic analysis, western blotting, and immunofluorescence staining, we found that gentamicin treatment resulted in downregulation of myosin VI, a protein crucial for synaptic vesicle recycling and replenishment in IHCs. Furthermore, we evaluated the kinetics of endocytosis and found a significant reduction in IHC exocytosis, possibly reflecting the impact of myosin VI downregulation on synaptic vesicle recycling. In summary, our findings demonstrate that gentamicin treatment leads to synaptic dysfunction in IHCs, highlighting the important role of myosin VI downregulation in gentamicin-induced synaptic damage.
Collapse
Affiliation(s)
- Gen Li
- Department of Otolaryngology-Head and Neck Surgery, Shanghai Ninth People's Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China; Ear Institute, Shanghai Jiao Tong University School of Medicine, Shanghai, China; Shanghai Key Laboratory of Translational Medicine on Ear and Nose Diseases, Shanghai, China
| | - Yunge Gao
- Department of Otolaryngology-Head and Neck Surgery, Shanghai Ninth People's Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China; Ear Institute, Shanghai Jiao Tong University School of Medicine, Shanghai, China; Shanghai Key Laboratory of Translational Medicine on Ear and Nose Diseases, Shanghai, China
| | - Hao Wu
- Department of Otolaryngology-Head and Neck Surgery, Shanghai Ninth People's Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China; Ear Institute, Shanghai Jiao Tong University School of Medicine, Shanghai, China; Shanghai Key Laboratory of Translational Medicine on Ear and Nose Diseases, Shanghai, China.
| | - Ting Zhao
- Department of Otolaryngology-Head and Neck Surgery, Shanghai Ninth People's Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, China; Ear Institute, Shanghai Jiao Tong University School of Medicine, Shanghai, China; Shanghai Key Laboratory of Translational Medicine on Ear and Nose Diseases, Shanghai, China.
| |
Collapse
|
10
|
Singh R, Bharadwaj HM. Cortical temporal integration can account for limits of temporal perception: investigations in the binaural system. Commun Biol 2023; 6:981. [PMID: 37752215 PMCID: PMC10522716 DOI: 10.1038/s42003-023-05361-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2022] [Accepted: 09/15/2023] [Indexed: 09/28/2023] Open
Abstract
The auditory system has exquisite temporal coding in the periphery which is transformed into a rate-based code in central auditory structures, like auditory cortex. However, the cortex is still able to synchronize, albeit at lower modulation rates, to acoustic fluctuations. The perceptual significance of this cortical synchronization is unknown. We estimated physiological synchronization limits of cortex (in humans with electroencephalography) and brainstem neurons (in chinchillas) to dynamic binaural cues using a novel system-identification technique, along with parallel perceptual measurements. We find that cortex can synchronize to dynamic binaural cues up to approximately 10 Hz, which aligns well with our measured limits of perceiving dynamic spatial information and utilizing dynamic binaural cues for spatial unmasking, i.e. measures of binaural sluggishness. We also find that the tracking limit for frequency modulation (FM) is similar to the limit for spatial tracking, demonstrating that this sluggish tracking is a more general perceptual limit that can be accounted for by cortical temporal integration limits.
Collapse
Affiliation(s)
- Ravinderjit Singh
- Weldon School of Biomedical Engineering, Purdue University, West Lafayette, IN, USA
| | - Hari M Bharadwaj
- Weldon School of Biomedical Engineering, Purdue University, West Lafayette, IN, USA.
- Department of Speech, Language, and Hearing Sciences, Purdue University, West Lafayette, IN, USA.
- Department of Communication Science and Disorders, University of Pittsburgh, Pittsburgh, PA, USA.
| |
Collapse
|
11
|
Vinay, Moore BCJ. Exploiting individual differences to assess the role of place and phase locking cues in auditory frequency discrimination at 2 kHz. Sci Rep 2023; 13:13801. [PMID: 37612303 PMCID: PMC10447419 DOI: 10.1038/s41598-023-40571-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2023] [Accepted: 08/13/2023] [Indexed: 08/25/2023] Open
Abstract
The relative role of place and temporal mechanisms in auditory frequency discrimination was assessed for a centre frequency of 2 kHz. Four measures of frequency discrimination were obtained for 63 normal-hearing participants: detection of frequency modulation using modulation rates of 2 Hz (FM2) and 20 Hz (FM20); detection of a change in frequency across successive pure tones (difference limen for frequency, DLF); and detection of changes in the temporal fine structure of bandpass filtered complex tones centred at 2 kHz (TFS). Previous work has suggested that: FM2 depends on the use of both temporal and place cues; FM20 depends primarily on the use of place cues because the temporal mechanism cannot track rapid changes in frequency; DLF depends primarily on temporal cues; TFS depends exclusively on temporal cues. This led to the following predicted patterns of the correlations of scores across participants: DLF and TFS should be highly correlated; FM2 should be correlated with DLF and TFS; FM20 should not be correlated with DLF or TFS. The results were broadly consistent with these predictions and with the idea that frequency discrimination at 2 kHz depends partly or primarily on temporal cues except for frequency modulation detection at a high rate.
Collapse
Affiliation(s)
- Vinay
- Audiology Group, Department of Neuromedicine and Movement Science, Faculty of Medicine and Health Sciences, Norwegian University of Science and Technology (NTNU), Tungasletta 2, 7491, Trondheim, Norway.
| | - Brian C J Moore
- Cambridge Hearing Group, Department of Psychology, University of Cambridge, Cambridge, UK
| |
Collapse
|
12
|
Gockel HE, Carlyon RP. Effect of diotic versus dichotic presentation on the pitch perception of tone complexes at medium and very high frequencies. Sci Rep 2023; 13:13247. [PMID: 37582928 PMCID: PMC10427668 DOI: 10.1038/s41598-023-40122-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2022] [Accepted: 08/04/2023] [Indexed: 08/17/2023] Open
Abstract
Difference limens for fundamental frequency (F0), F0DLs, are usually small for complex tones containing low harmonics that are resolved in the auditory periphery, but worsen when the rank of the lowest harmonic increases above about 6-8 and harmonics become less resolved. The traditional explanation for this, in terms of resolvability, has been challenged and an alternative explanation in terms of harmonic rank was suggested. Here, to disentangle the effects of resolvability and harmonic rank the complex tones were presented either diotically (all harmonics to both ears) or dichotically (even and odd harmonics to opposite ears); the latter increases resolvability but does not affect harmonic rank. F0DLs were measured for 14 listeners for complex tones containing harmonics 6-10 with F0s of 280 and 1400 Hz, presented diotically or dichotically. For the low F0, F0DLs were significantly lower for the dichotic than for the diotic condition. This is consistent with a benefit of increased resolvability of harmonics for F0 discrimination and extends previous results to harmonics as low as the sixth. In contrast, for the high F0, F0DLs were similar for the two presentation modes, adding to evidence for differences in pitch perception between tones with low-to-medium and very-high frequency content.
Collapse
Affiliation(s)
- Hedwig E Gockel
- MRC Cognition and Brain Sciences Unit, Cambridge Hearing Group, University of Cambridge, 15 Chaucer Road, Cambridge, CB2 7EF, UK.
| | - Robert P Carlyon
- MRC Cognition and Brain Sciences Unit, Cambridge Hearing Group, University of Cambridge, 15 Chaucer Road, Cambridge, CB2 7EF, UK
| |
Collapse
|
13
|
Kuo CY, Liu JW, Wang CH, Juan CH, Hsieh IH. The role of carrier spectral composition in the perception of musical pitch. Atten Percept Psychophys 2023; 85:2083-2099. [PMID: 37479873 DOI: 10.3758/s13414-023-02761-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/07/2023] [Indexed: 07/23/2023]
Abstract
Temporal envelope fluctuations of natural sounds convey critical information to speech and music processing. In particular, musical pitch perception is assumed to be primarily underlined by temporal envelope encoding. While increasing evidence demonstrates the importance of carrier fine structure to complex pitch perception, how carrier spectral information affects musical pitch perception is less clear. Here, transposed tones designed to convey identical envelope information across different carriers were used to assess the effects of carrier spectral composition to pitch discrimination and musical-interval and melody identifications. Results showed that pitch discrimination thresholds became lower (better) with increasing carrier frequencies from 1k to 10k Hz, with performance comparable to that of pure sinusoids. Musical interval and melody defined by the periodicity of sine- or harmonic complex envelopes across carriers were identified with greater than 85% accuracy even on a 10k-Hz carrier. Moreover, enhanced interval and melody identification performance was observed with increasing carrier frequency up to 6k Hz. Findings suggest a perceptual enhancement of temporal envelope information with increasing carrier spectral region in musical pitch processing, at least for frequencies up to 6k Hz. For carriers in the extended high-frequency region (8-20k Hz), the use of temporal envelope information to music pitch processing may vary depending on task requirement. Collectively, these results implicate the fidelity of temporal envelope information to musical pitch perception is more pronounced than previously considered, with ecological implications.
Collapse
Affiliation(s)
- Chao-Yin Kuo
- Institute of Cognitive Neuroscience, National Central University, No. 300, Zhongda Rd., Zhongli District, Taoyuan City, 320317, Taiwan
- Department of Otolaryngology-Head and Neck Surgery, Tri-Service General Hospital, National Defense Medical Center, Taipei City, Taiwan
| | - Jia-Wei Liu
- Institute of Cognitive Neuroscience, National Central University, No. 300, Zhongda Rd., Zhongli District, Taoyuan City, 320317, Taiwan
| | - Chih-Hung Wang
- Department of Otolaryngology-Head and Neck Surgery, Tri-Service General Hospital, National Defense Medical Center, Taipei City, Taiwan
| | - Chi-Hung Juan
- Institute of Cognitive Neuroscience, National Central University, No. 300, Zhongda Rd., Zhongli District, Taoyuan City, 320317, Taiwan
- Cognitive Intelligence and Precision Healthcare Center, National Central University, No. 300, Zhongda Rd., Zhongli District, Taoyuan City, 320317, Taiwan
| | - I-Hui Hsieh
- Institute of Cognitive Neuroscience, National Central University, No. 300, Zhongda Rd., Zhongli District, Taoyuan City, 320317, Taiwan.
- Cognitive Intelligence and Precision Healthcare Center, National Central University, No. 300, Zhongda Rd., Zhongli District, Taoyuan City, 320317, Taiwan.
| |
Collapse
|
14
|
Moore BCJ, Vinay. Assessing mechanisms of frequency discrimination by comparison of different measures over a wide frequency range. Sci Rep 2023; 13:11379. [PMID: 37452119 PMCID: PMC10349105 DOI: 10.1038/s41598-023-38600-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2023] [Accepted: 07/11/2023] [Indexed: 07/18/2023] Open
Abstract
It has been hypothesized that auditory detection of frequency modulation (FM) for low FM rates depends on the use of both temporal (phase locking) and place cues, depending on the carrier frequency, while detection of FM at high rates depends primarily on the use of place cues. To test this, FM detection for 2 and 20 Hz rates was measured over a wide frequency range, 1-10 kHz, including high frequencies for which temporal cues are assumed to be very weak. Performance was measured over the same frequency range for a task involving detection of changes in the temporal fine structure (TFS) of bandpass filtered complex tones, for which performance is assumed to depend primarily on the use of temporal cues. FM thresholds were better for the 2- than for the 20-Hz rate for center frequencies up to 4 kHz, while the reverse was true for higher center frequencies. For both FM rates, the thresholds, expressed as a proportion of the center frequency, were roughly constant for center frequencies from 6 to 10 Hz, consistent with the use of place cues. For the TFS task, thresholds worsened progressively with increasing frequency above 4 kHz, consistent with the weakening of temporal cues.
Collapse
Affiliation(s)
- Brian C J Moore
- Cambridge Hearing Group, Department of Psychology, University of Cambridge, Cambridge, UK.
| | - Vinay
- Audiology Group, Department of Neuromedicine and Movement Science, Faculty of Medicine and Health Sciences, Norwegian University of Science and Technology (NTNU), Tungasletta 2, 7491, Trondheim, Norway
| |
Collapse
|
15
|
Haggerty RA, Hutson KA, Riggs WJ, Brown KD, Pillsbury HC, Adunka OF, Buchman CA, Fitzpatrick DC. Assessment of cochlear synaptopathy by electrocochleography to low frequencies in a preclinical model and human subjects. Front Neurol 2023; 14:1104574. [PMID: 37483448 PMCID: PMC10361575 DOI: 10.3389/fneur.2023.1104574] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2022] [Accepted: 06/19/2023] [Indexed: 07/25/2023] Open
Abstract
Cochlear synaptopathy is the loss of synapses between the inner hair cells and the auditory nerve despite survival of sensory hair cells. The findings of extensive cochlear synaptopathy in animals after moderate noise exposures challenged the long-held view that hair cells are the cochlear elements most sensitive to insults that lead to hearing loss. However, cochlear synaptopathy has been difficult to identify in humans. We applied novel algorithms to determine hair cell and neural contributions to electrocochleographic (ECochG) recordings from the round window of animal and human subjects. Gerbils with normal hearing provided training and test sets for a deep learning algorithm to detect the presence of neural responses to low frequency sounds, and an analytic model was used to quantify the proportion of neural and hair cell contributions to the ECochG response. The capacity to detect cochlear synaptopathy was validated in normal hearing and noise-exposed animals by using neurotoxins to reduce or eliminate the neural contributions. When the analytical methods were applied to human surgical subjects with access to the round window, the neural contribution resembled the partial cochlear synaptopathy present after neurotoxin application in animals. This result demonstrates the presence of viable hair cells not connected to auditory nerve fibers in human subjects with substantial hearing loss and indicates that efforts to regenerate nerve fibers may find a ready cochlear substrate for innervation and resumption of function.
Collapse
Affiliation(s)
- Raymond A. Haggerty
- Department of Otolaryngology, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States
| | - Kendall A. Hutson
- Department of Otolaryngology, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States
| | - William J. Riggs
- Department of Otolaryngology, The Ohio State University, Columbus, OH, United States
| | - Kevin D. Brown
- Department of Otolaryngology, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States
- University of North Carolina School of Medicine, Chapel Hill, NC, United States
| | - Harold C. Pillsbury
- Department of Otolaryngology, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States
- University of North Carolina School of Medicine, Chapel Hill, NC, United States
| | - Oliver F. Adunka
- Department of Otolaryngology, The Ohio State University, Columbus, OH, United States
| | - Craig A. Buchman
- Department of Otolaryngology, Washington University in St. Louis, St. Louis, MO, United States
| | - Douglas C. Fitzpatrick
- Department of Otolaryngology, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States
| |
Collapse
|
16
|
Klug J, Encke J, Dietz M. Characterization of the decline in auditory nerve phase locking at high frequencies. JASA EXPRESS LETTERS 2023; 3:074403. [PMID: 37477582 DOI: 10.1121/10.0020267] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/30/2023] [Accepted: 07/04/2023] [Indexed: 07/22/2023]
Abstract
The frequency dependence of phase locking in the auditory nerve influences various auditory coding mechanisms. The decline of phase locking with increasing frequency is commonly described by a low-pass filter. This study compares fitted low-pass filter parameters with the actual rate of phase locking decline. The decline is similar across studies and only 40 dB per decade, corresponding to the asymptotic decline of a second order filter.
Collapse
Affiliation(s)
- Jonas Klug
- Department of Medical Physics and Acoustics, University of Oldenburg, 26129 Oldenburg, , ,
| | - Jörg Encke
- Department of Medical Physics and Acoustics, University of Oldenburg, 26129 Oldenburg, , ,
| | - Mathias Dietz
- Department of Medical Physics and Acoustics, University of Oldenburg, 26129 Oldenburg, , ,
| |
Collapse
|
17
|
Whiteford KL, Oxenham AJ. Sensitivity to Frequency Modulation is Limited Centrally. J Neurosci 2023; 43:3687-3695. [PMID: 37028932 PMCID: PMC10198444 DOI: 10.1523/jneurosci.0995-22.2023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2022] [Revised: 03/23/2023] [Accepted: 03/31/2023] [Indexed: 04/09/2023] Open
Abstract
Modulations in both amplitude and frequency are prevalent in natural sounds and are critical in defining their properties. Humans are exquisitely sensitive to frequency modulation (FM) at the slow modulation rates and low carrier frequencies that are common in speech and music. This enhanced sensitivity to slow-rate and low-frequency FM has been widely believed to reflect precise, stimulus-driven phase locking to temporal fine structure in the auditory nerve. At faster modulation rates and/or higher carrier frequencies, FM is instead thought to be coded by coarser frequency-to-place mapping, where FM is converted to amplitude modulation (AM) via cochlear filtering. Here, we show that patterns of human FM perception that have classically been explained by limits in peripheral temporal coding are instead better accounted for by constraints in the central processing of fundamental frequency (F0) or pitch. We measured FM detection in male and female humans using harmonic complex tones with an F0 within the range of musical pitch but with resolved harmonic components that were all above the putative limits of temporal phase locking (>8 kHz). Listeners were more sensitive to slow than fast FM rates, even though all components were beyond the limits of phase locking. In contrast, AM sensitivity remained better at faster than slower rates, regardless of carrier frequency. These findings demonstrate that classic trends in human FM sensitivity, previously attributed to auditory nerve phase locking, may instead reflect the constraints of a unitary code that operates at a more central level of processing.SIGNIFICANCE STATEMENT Natural sounds involve dynamic frequency and amplitude fluctuations. Humans are particularly sensitive to frequency modulation (FM) at slow rates and low carrier frequencies, which are prevalent in speech and music. This sensitivity has been ascribed to encoding of stimulus temporal fine structure (TFS) via phase-locked auditory nerve activity. To test this long-standing theory, we measured FM sensitivity using complex tones with a low F0 but only high-frequency harmonics beyond the limits of phase locking. Dissociating the F0 from TFS showed that FM sensitivity is limited not by peripheral encoding of TFS but rather by central processing of F0, or pitch. The results suggest a unitary code for FM detection limited by more central constraints.
Collapse
Affiliation(s)
- Kelly L Whiteford
- Department of Psychology, University of Minnesota, Minneapolis, Minnesota 55455
| | - Andrew J Oxenham
- Department of Psychology, University of Minnesota, Minneapolis, Minnesota 55455
| |
Collapse
|
18
|
Mishra SK, Fu QJ, Galvin JJ, Galindo A. Suprathreshold auditory processes in listeners with normal audiograms but extended high-frequency hearing lossa). THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2023; 153:2745. [PMID: 37133816 DOI: 10.1121/10.0019337] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/31/2022] [Accepted: 04/17/2023] [Indexed: 05/04/2023]
Abstract
Hearing loss in the extended high-frequency (EHF) range (>8 kHz) is widespread among young normal-hearing adults and could have perceptual consequences such as difficulty understanding speech in noise. However, it is unclear how EHF hearing loss might affect basic psychoacoustic processes. The hypothesis that EHF hearing loss is associated with poorer auditory resolution in the standard frequencies was tested. Temporal resolution was characterized by amplitude modulation detection thresholds (AMDTs), and spectral resolution was characterized by frequency change detection thresholds (FCDTs). AMDTs and FCDTs were measured in adults with or without EHF loss but with normal clinical audiograms. AMDTs were measured with 0.5- and 4-kHz carrier frequencies; similarly, FCDTs were measured for 0.5- and 4-kHz base frequencies. AMDTs were significantly higher with the 4 kHz than the 0.5 kHz carrier, but there was no significant effect of EHF loss. There was no significant effect of EHF loss on FCDTs at 0.5 kHz; however, FCDTs were significantly higher at 4 kHz for listeners with than without EHF loss. This suggests that some aspects of auditory resolution in the standard audiometric frequency range may be compromised in listeners with EHF hearing loss despite having a normal audiogram.
Collapse
Affiliation(s)
- Srikanta K Mishra
- Department of Speech, Language and Hearing Sciences, The University of Texas at Austin, Austin, Texas 78712, USA
| | - Qian-Jie Fu
- Department of Head and Neck Surgery, David Geffen School of Medicine, University of California at Los Angeles (UCLA), Los Angeles, California 90095, USA
| | - John J Galvin
- House Institute Foundation, Los Angeles, California 90075, USA
| | - Andrea Galindo
- Department of Communication Sciences and Disorders, The University of Texas Rio Grande Valley, Edinburg, Texas 78539, USA
| |
Collapse
|
19
|
Tahmasebi S, Segovia-Martinez M, Nogueira W. Optimization of Sound Coding Strategies to Make Singing Music More Accessible for Cochlear Implant Users. Trends Hear 2023; 27:23312165221148022. [PMID: 36628453 PMCID: PMC9837293 DOI: 10.1177/23312165221148022] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023] Open
Abstract
Cochlear implants (CIs) are implantable medical devices that can partially restore hearing to people suffering from profound sensorineural hearing loss. While these devices provide good speech understanding in quiet, many CI users face difficulties when listening to music. Reasons include poor spatial specificity of electric stimulation, limited transmission of spectral and temporal fine structure of acoustic signals, and restrictions in the dynamic range that can be conveyed via electric stimulation of the auditory nerve. The coding strategies currently used in CIs are typically designed for speech rather than music. This work investigates the optimization of CI coding strategies to make singing music more accessible to CI users. The aim is to reduce the spectral complexity of music by selecting fewer bands for stimulation, attenuating the background instruments by strengthening a noise reduction algorithm, and optimizing the electric dynamic range through a back-end compressor. The optimizations were evaluated through both objective and perceptual measures of speech understanding and melody identification of singing voice with and without background instruments, as well as music appreciation questionnaires. Consistent with the objective measures, results gathered from the perceptual evaluations indicated that reducing the number of selected bands and optimizing the electric dynamic range significantly improved speech understanding in music. Moreover, results obtained from questionnaires show that the new music back-end compressor significantly improved music enjoyment. These results have potential as a new CI program for improved singing music perception.
Collapse
Affiliation(s)
- Sina Tahmasebi
- Department of Otolaryngology, Hannover Medical School, Hannover, Germany
- Cluster of Excellence Hearing4all, Hannover, Germany
- Sina Tahmasebi, Karl-Wiechert-Allee 3, 30625 Hannover, Germany.
Waldo Nogueira, Karl-Wiechert-Allee 3, 30625 Hannover, Germany.
| | | | - Waldo Nogueira
- Department of Otolaryngology, Hannover Medical School, Hannover, Germany
- Cluster of Excellence Hearing4all, Hannover, Germany
- Sina Tahmasebi, Karl-Wiechert-Allee 3, 30625 Hannover, Germany.
Waldo Nogueira, Karl-Wiechert-Allee 3, 30625 Hannover, Germany.
| |
Collapse
|
20
|
Klug J, Dietz M. Frequency dependence of sensitivity to interaural phase differences in pure tones. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2022; 152:3130. [PMID: 36586867 DOI: 10.1121/10.0015246] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/07/2022] [Accepted: 11/03/2022] [Indexed: 06/17/2023]
Abstract
It is well established that in normal-hearing humans, the threshold of interaural time differences for pure tones increases dramatically above about 1300 Hz, only to become unmeasurable above 1400 Hz. However, physiological data and auditory models suggest that the actual decline in sensitivity is more gradual and only appears to be abrupt because the maximum of the psychometric function dips below the threshold proportion correct, e.g., 0.794. Published data only report thresholds at certain proportions correct but not the decline of proportions correct or of the sensitivity index d' with increasing frequencies. Here, we present pure-tone behavioral data obtained with a constant stimulus procedure. Seven of nine subjects showed proportions correct above 0.9 at 1300 Hz and virtually no sensitivity at 1500 Hz (proportion correct within 0.07 of chance level). This corresponds to a sensitivity decline of 46-78 dB/oct, much steeper than predicted by existing models or by the decline of phase locking of the auditory nerve fibers in animal data.
Collapse
Affiliation(s)
- Jonas Klug
- Department of Medical Physics and Acoustics, University of Oldenburg, 26129, Oldenburg, Germany
| | - Mathias Dietz
- Department of Medical Physics and Acoustics, University of Oldenburg, 26129, Oldenburg, Germany
| |
Collapse
|
21
|
Racca JM, Delgado RE, Gifford RH, Ramachandran R, Hood LJ. The Effects of Middle-ear Stiffness on the Auditory Brainstem Neural Encoding of Phase. J Assoc Res Otolaryngol 2022; 23:859-873. [PMID: 36214911 PMCID: PMC9549819 DOI: 10.1007/s10162-022-00872-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2022] [Accepted: 09/07/2022] [Indexed: 01/06/2023] Open
Abstract
The middle-ear system relies on a balance of mass and stiffness characteristics for transmitting sound from the external environment to the cochlea and auditory neural pathway. Phase is one aspect of sound that, when transmitted and encoded by both ears, contributes to binaural cue sensitivity and spatial hearing. The study aims were (i) to investigate the effects of middle-ear stiffness on the auditory brainstem neural encoding of phase in human adults with normal pure-tone thresholds and (ii) to investigate the relationships between middle-ear stiffness-induced changes in wideband acoustic immittance and neural encoding of phase. The auditory brainstem neural encoding of phase was measured using the auditory steady-state response (ASSR) with and without middle-ear stiffness elicited via contralateral activation of the middle-ear muscle reflex (MEMR). Middle-ear stiffness was quantified using a wideband acoustic immittance assay of acoustic absorbance. Statistical analyses demonstrated decreased ASSR phase lag and decreased acoustic absorbance with contralateral activation of the MEMR, consistent with increased middle-ear stiffness changing the auditory brainstem neural encoding of phase. There were no statistically significant correlations between stiffness-induced changes in wideband acoustic absorbance and ASSR phase. The findings of this study may have important implications for understanding binaural cue sensitivity and horizontal plane sound localization in audiologic and otologic clinical populations that demonstrate changes in middle-ear stiffness, including cochlear implant recipients who use combined electric and binaural acoustic hearing and otosclerosis patients.
Collapse
Affiliation(s)
- Jordan M Racca
- Department of Hearing and Speech Sciences, Vanderbilt University Medical Center, Nashville, TN, USA.
- Current Affiliation: Collaborative for STEM Education and Outreach, Peabody College of Education, Vanderbilt University, Nashville, TN, USA.
| | | | - René H Gifford
- Department of Hearing and Speech Sciences, Vanderbilt University Medical Center, Nashville, TN, USA
| | - Ramnarayan Ramachandran
- Department of Hearing and Speech Sciences, Vanderbilt University Medical Center, Nashville, TN, USA
| | - Linda J Hood
- Department of Hearing and Speech Sciences, Vanderbilt University Medical Center, Nashville, TN, USA
| |
Collapse
|
22
|
Bissmeyer SRS, Goldsworthy RL. Combining Place and Rate of Stimulation Improves Frequency Discrimination in Cochlear Implant Users. Hear Res 2022; 424:108583. [PMID: 35930901 PMCID: PMC10849775 DOI: 10.1016/j.heares.2022.108583] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/25/2022] [Revised: 06/20/2022] [Accepted: 07/21/2022] [Indexed: 11/04/2022]
Abstract
In the auditory system, frequency is represented as tonotopic and temporal response properties of the auditory nerve. While these response properties are inextricably linked in normal hearing, cochlear implants can separately excite tonotopic location and temporal synchrony using different electrodes and stimulation rates, respectively. This separation allows for the investigation of the contributions of tonotopic and temporal cues for frequency discrimination. The present study examines frequency discrimination in adult cochlear implant users as conveyed by electrode position and stimulation rate, separately and combined. The working hypothesis is that frequency discrimination is better provided by place and rate cues combined compared to either cue alone. This hypothesis was tested in two experiments. In the first experiment, frequency discrimination needed for melodic contour identification was measured for frequencies near 100, 200, and 400 Hz using frequency allocation modeled after clinical processors. In the second experiment, frequency discrimination for pitch ranking was measured for frequencies between 100 and 1600 Hz using an experimental frequency allocation designed to provide better access to place cues. The results of both experiments indicate that frequency discrimination is better with place and rate cues combined than with either cue alone. These results clarify how signal processing for cochlear implants could better encode frequency into place and rate of electrical stimulation. Further, the results provide insight into the contributions of place and rate cues for pitch.
Collapse
Affiliation(s)
- Susan R S Bissmeyer
- Department of Biomedical Engineering, Viterbi School of Engineering, University of Southern California, Los Angeles, CA, United States; Auditory Research Center, Health Research Association, Caruso Department of Otolaryngology, Keck School of Medicine, University of Southern California, 1640 Marengo Street Suite 326, Los Angeles, CA 90033, United States.
| | - Raymond L Goldsworthy
- Department of Biomedical Engineering, Viterbi School of Engineering, University of Southern California, Los Angeles, CA, United States; Auditory Research Center, Health Research Association, Caruso Department of Otolaryngology, Keck School of Medicine, University of Southern California, 1640 Marengo Street Suite 326, Los Angeles, CA 90033, United States
| |
Collapse
|
23
|
Burwood G, Hakizimana P, Nuttall AL, Fridberger A. Best frequencies and temporal delays are similar across the low-frequency regions of the guinea pig cochlea. SCIENCE ADVANCES 2022; 8:eabq2773. [PMID: 36149949 PMCID: PMC9506724 DOI: 10.1126/sciadv.abq2773] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 05/05/2023]
Abstract
The cochlea maps tones with different frequencies to distinct anatomical locations. For instance, a faint 5000-hertz tone produces brisk responses at a place approximately 8 millimeters into the 18-millimeter-long guinea pig cochlea, but little response elsewhere. This place code pervades the auditory pathways, where neurons have "best frequencies" determined by their connections to the sensory cells in the hearing organ. However, frequency selectivity in cochlear regions encoding low-frequency sounds has not been systematically studied. Here, we show that low-frequency hearing works according to a unique principle that does not involve a place code. Instead, sound-evoked responses and temporal delays are similar across the low-frequency regions of the cochlea. These findings are a break from theories considered proven for 100 years and have broad implications for understanding information processing in the brainstem and cortex and for optimizing the stimulus delivery in auditory implants.
Collapse
Affiliation(s)
- George Burwood
- Oregon Hearing Research Center, Department of Otolaryngology–Head and Neck Surgery, Oregon Health & Science University, Portland, OR 97239, USA
| | - Pierre Hakizimana
- Department of Biomedical and Clinical Sciences, Linköping University, SE-581 83 Linköping, Sweden
| | - Alfred L Nuttall
- Oregon Hearing Research Center, Department of Otolaryngology–Head and Neck Surgery, Oregon Health & Science University, Portland, OR 97239, USA
- Corresponding author. (A.L.N.); (A.F.)
| | - Anders Fridberger
- Oregon Hearing Research Center, Department of Otolaryngology–Head and Neck Surgery, Oregon Health & Science University, Portland, OR 97239, USA
- Department of Biomedical and Clinical Sciences, Linköping University, SE-581 83 Linköping, Sweden
- Corresponding author. (A.L.N.); (A.F.)
| |
Collapse
|
24
|
Jaatinen J, Pätynen J. Effect of inharmonicity on pitch perception and subjective tuning of piano tones. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2022; 152:1146. [PMID: 36050167 DOI: 10.1121/10.0013572] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/20/2021] [Accepted: 07/29/2022] [Indexed: 06/15/2023]
Abstract
The consensus in piano tuning philosophy explains the stretched tuning scale by the inharmonicity of piano strings. This study aimed to examine how variable inharmonicity influences the result of the piano tuning process, compare the tuning curves of aurally tuned pianos with the curves derived from subjective octave enlargement experiments, and evaluate whether the pitches of inharmonic or harmonic versions of the same tone are perceived differently. In addition, the influence of strings of other piano keys on the measured inharmonicity of a single piano string was investigated. The inharmonicity of all individual strings was measured on a Steinway D grand piano. Variable inharmonicity was implemented by additive synthesis with frequency-adjusted sinusoidal partials. Fifteen piano tuners and 18 orchestra musicians participated in the experiments. The results indicate that the inharmonic piano tones produced a keyboard tuning curve similar to the Railsback curve and differed significantly from the harmonic counterpart. The inharmonic tuning curve was reminiscent of the subjective octave enlargement curve. Inharmonic tone pitches were perceived to be higher than harmonic tones up to C ♯ 7. The covibrating strings of the other keys did not exhibit any meaningful effect on the measured inharmonicity of a single string of the played key.
Collapse
Affiliation(s)
- Jussi Jaatinen
- Faculty of Arts, Musicology, University of Helsinki, P.O. Box 24, 00014 Helsingin yliopisto, Finland
| | | |
Collapse
|
25
|
Mehta AH, Oxenham AJ. Role of perceptual integration in pitch discrimination at high frequenciesa). JASA EXPRESS LETTERS 2022; 2:084402. [PMID: 37311192 PMCID: PMC10264831 DOI: 10.1121/10.0013429] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/27/2022] [Accepted: 07/26/2022] [Indexed: 06/15/2023]
Abstract
At very high frequencies, fundamental-frequency difference limens (F0DLs) for five-component harmonic complex tones can be better than predicted by optimal integration of information, assuming performance is limited by noise at the peripheral level, but are in line with predictions based on more central sources of noise. This study investigates whether there is a minimum number of harmonic components needed for such super-optimal integration effects and if harmonic range or inharmonicity affects this super-optimal integration. Results show super-optimal integration, even with two harmonic components and for most combinations of consecutive harmonic, but not inharmonic, components.
Collapse
Affiliation(s)
- Anahita H Mehta
- Department of Psychology, University of Minnesota, Minneapolis, Minnesota 55455, USA ,
| | - Andrew J Oxenham
- Department of Psychology, University of Minnesota, Minneapolis, Minnesota 55455, USA ,
| |
Collapse
|
26
|
Richardson ML, Guérit F, Gransier R, Wouters J, Carlyon RP, Middlebrooks JC. Temporal Pitch Sensitivity in an Animal Model: Psychophysics and Scalp Recordings : Temporal Pitch Sensitivity in Cat. J Assoc Res Otolaryngol 2022; 23:491-512. [PMID: 35668206 PMCID: PMC9437162 DOI: 10.1007/s10162-022-00849-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2021] [Accepted: 04/11/2022] [Indexed: 01/28/2023] Open
Abstract
Cochlear implant (CI) users show limited sensitivity to the temporal pitch conveyed by electric stimulation, contributing to impaired perception of music and of speech in noise. Neurophysiological studies in cats suggest that this limitation is due, in part, to poor transmission of the temporal fine structure (TFS) by the brainstem pathways that are activated by electrical cochlear stimulation. It remains unknown, however, how that neural limit might influence perception in the same animal model. For that reason, we developed non-invasive psychophysical and electrophysiological measures of temporal (i.e., non-spectral) pitch processing in the cat. Normal-hearing (NH) cats were presented with acoustic pulse trains consisting of band-limited harmonic complexes that simulated CI stimulation of the basal cochlea while removing cochlear place-of-excitation cues. In the psychophysical procedure, trained cats detected changes from a base pulse rate to a higher pulse rate. In the scalp-recording procedure, the cortical-evoked acoustic change complex (ACC) and brainstem-generated frequency following response (FFR) were recorded simultaneously in sedated cats for pulse trains that alternated between the base and higher rates. The range of perceptual sensitivity to temporal pitch broadly resembled that of humans but was shifted to somewhat higher rates. The ACC largely paralleled these perceptual patterns, validating its use as an objective measure of temporal pitch sensitivity. The phase-locked FFR, in contrast, showed strong brainstem encoding for all tested pulse rates. These measures demonstrate the cat's perceptual sensitivity to pitch in the absence of cochlear-place cues and may be valuable for evaluating neural mechanisms of temporal pitch perception in the feline animal model of stimulation by a CI or novel auditory prostheses.
Collapse
Affiliation(s)
- Matthew L Richardson
- Department of Otolaryngology, Center for Hearing Research, University of California at Irvine, Irvine, CA, USA.
| | - François Guérit
- Cambridge Hearing Group, MRC Cognition and Brain Sciences Unit, University of Cambridge, Cambridge, UK
| | - Robin Gransier
- Department of Neurosciences, ExpORL, KU Leuven, Leuven, Belgium
| | - Jan Wouters
- Department of Neurosciences, ExpORL, KU Leuven, Leuven, Belgium
| | - Robert P Carlyon
- Cambridge Hearing Group, MRC Cognition and Brain Sciences Unit, University of Cambridge, Cambridge, UK
| | - John C Middlebrooks
- Department of Otolaryngology, Center for Hearing Research, University of California at Irvine, Irvine, CA, USA
- Departments of Neurobiology & Behavior, Biomedical Engineering, Cognitive Sciences, University of California at Irvine, Irvine, CA, USA
| |
Collapse
|
27
|
Joris PX. In praise of adventitious sounds. Hear Res 2022; 425:108592. [DOI: 10.1016/j.heares.2022.108592] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/04/2021] [Revised: 07/13/2022] [Accepted: 07/26/2022] [Indexed: 11/04/2022]
|
28
|
Gockel HE, Carlyon RP. On mistuning detection and beat perception for harmonic complex tones at low and very high frequencies. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2022; 152:226. [PMID: 35931513 DOI: 10.1121/10.0012351] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/11/2022] [Accepted: 06/19/2022] [Indexed: 06/15/2023]
Abstract
This study assessed the detection of mistuning of a single harmonic in complex tones (CTs) containing either low-frequency harmonics or very high-frequency harmonics, for which phase locking to the temporal fine structure is weak or absent. CTs had F0s of either 280 or 1400 Hz and contained harmonics 6-10, the 8th of which could be mistuned. Harmonics were presented either diotically or dichotically (odd and even harmonics to different ears). In the diotic condition, mistuning-detection thresholds were very low for both F0s and consistent with detection of temporal interactions (beats) produced by peripheral interactions of components. In the dichotic condition, for which the components in each ear were more widely spaced and beats were not reported, the mistuned component was perceptually segregated from the complex for the low F0, but subjects reported no "popping out" for the high F0 and performance was close to chance. This is consistent with the idea that phase locking is required for perceptual segregation to occur. For diotic presentation, the perceived beat rate corresponded to the amount of mistuning (in Hz). It is argued that the beat percept cannot be explained solely by interactions between the mistuned component and its two closest harmonic neighbours.
Collapse
Affiliation(s)
- Hedwig E Gockel
- Cambridge Hearing Group, MRC Cognition and Brain Sciences Unit, University of Cambridge, 15 Chaucer Road, Cambridge CB2 7EF, United Kingdom
| | - Robert P Carlyon
- Cambridge Hearing Group, MRC Cognition and Brain Sciences Unit, University of Cambridge, 15 Chaucer Road, Cambridge CB2 7EF, United Kingdom
| |
Collapse
|
29
|
Goldsworthy RL. Computational Modeling of Synchrony in the Auditory Nerve in Response to Acoustic and Electric Stimulation. Front Comput Neurosci 2022; 16:889992. [PMID: 35782089 PMCID: PMC9249013 DOI: 10.3389/fncom.2022.889992] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2022] [Accepted: 05/25/2022] [Indexed: 11/13/2022] Open
Abstract
Cochlear implants are medical devices that provide hearing to nearly one million people around the world. Outcomes are impressive with most recipients learning to understand speech through this new way of hearing. Music perception and speech reception in noise, however, are notably poor. These aspects of hearing critically depend on sensitivity to pitch, whether the musical pitch of an instrument or the vocal pitch of speech. The present article examines cues for pitch perception in the auditory nerve based on computational models. Modeled neural synchrony for pure and complex tones is examined for three different electric stimulation strategies including Continuous Interleaved Sampling (CIS), High-Fidelity CIS (HDCIS), and Peak-Derived Timing (PDT). Computational modeling of current spread and neuronal response are used to predict neural activity to electric and acoustic stimulation. It is shown that CIS does not provide neural synchrony to the frequency of pure tones nor to the fundamental component of complex tones. The newer HDCIS and PDT strategies restore synchrony to both the frequency of pure tones and to the fundamental component of complex tones. Current spread reduces spatial specificity of excitation as well as the temporal fidelity of neural synchrony, but modeled neural excitation restores precision of these cues. Overall, modeled neural excitation to electric stimulation that incorporates temporal fine structure (e.g., HDCIS and PDT) indicates neural synchrony comparable to that provided by acoustic stimulation. Discussion considers the importance of stimulation rate and long-term rehabilitation to provide temporal cues for pitch perception.
Collapse
|
30
|
Signatures of cochlear processing in neuronal coding of auditory information. Mol Cell Neurosci 2022; 120:103732. [PMID: 35489636 DOI: 10.1016/j.mcn.2022.103732] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2021] [Revised: 04/19/2022] [Accepted: 04/21/2022] [Indexed: 11/22/2022] Open
Abstract
The vertebrate ear is endowed with remarkable perceptual capabilities. The faintest sounds produce vibrations of magnitudes comparable to those generated by thermal noise and can nonetheless be detected through efficient amplification of small acoustic stimuli. Two mechanisms have been proposed to underlie such sound amplification in the mammalian cochlea: somatic electromotility and active hair-bundle motility. These biomechanical mechanisms may work in concert to tune auditory sensitivity. In addition to amplitude sensitivity, the hearing system shows exceptional frequency discrimination allowing mammals to distinguish complex sounds with great accuracy. For instance, although the wide hearing range of humans encompasses frequencies from 20 Hz to 20 kHz, our frequency resolution extends to one-thirtieth of the interval between successive keys on a piano. In this article, we review the different cochlear mechanisms underlying sound encoding in the auditory system, with a particular focus on the frequency decomposition of sounds. The relation between peak frequency of activation and location along the cochlea - known as tonotopy - arises from multiple gradients in biophysical properties of the sensory epithelium. Tonotopic mapping represents a major organizational principle both in the peripheral hearing system and in higher processing levels and permits the spectral decomposition of complex tones. The ribbon synapses connecting sensory hair cells to auditory afferents and the downstream spiral ganglion neurons are also tuned to process periodic stimuli according to their preferred frequency. Though sensory hair cells and neurons necessarily filter signals beyond a few kHz, many animals can hear well beyond this range. We finally describe how the cochlear structure shapes the neural code for further processing in order to send meaningful information to the brain. Both the phase-locked response of auditory nerve fibers and tonotopy are key to decode sound frequency information and place specific constraints on the downstream neuronal network.
Collapse
|
31
|
DeVries L, Anderson S, Goupell MJ, Smith E, Gordon-Salant S. Effects of aging and hearing loss on perceptual and electrophysiological measures of pulse-rate discrimination. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2022; 151:1639. [PMID: 35364956 PMCID: PMC8916844 DOI: 10.1121/10.0009399] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/13/2021] [Revised: 01/10/2022] [Accepted: 01/14/2022] [Indexed: 06/14/2023]
Abstract
Auditory temporal processing declines with age, leading to potential deleterious effects on communication. In young normal-hearing listeners, perceptual rate discrimination is rate limited around 300 Hz. It is not known whether this rate limitation is similar in older listeners with hearing loss. The purpose of this study was to investigate age- and hearing-loss-related rate limitations on perceptual rate discrimination, and age- and hearing-loss-related effects on neural representation of these stimuli. Younger normal-hearing, older normal-hearing, and older hearing-impaired listeners performed a pulse-rate discrimination task at rates of 100, 200, 300, and 400 Hz. Neural phase locking was assessed using the auditory steady-state response. Finally, a battery of non-auditory cognitive tests was administered. Younger listeners had better rate discrimination, higher phase locking, and higher cognitive scores compared to both groups of older listeners. Aging, but not hearing loss, diminished neural-rate encoding and perceptual performance; however, there was no relationship between the perceptual and neural measures. Higher cognitive scores were correlated with improved perceptual performance, but not with neural phase locking. This study shows that aging, rather than hearing loss, may be a stronger contributor to poorer temporal processing, and cognitive factors such as processing speed and inhibitory control may be related to these declines.
Collapse
Affiliation(s)
- Lindsay DeVries
- Department of Hearing and Speech Sciences, University of Maryland, College Park, Maryland 20742, USA
| | - Samira Anderson
- Department of Hearing and Speech Sciences, University of Maryland, College Park, Maryland 20742, USA
| | - Matthew J Goupell
- Department of Hearing and Speech Sciences, University of Maryland, College Park, Maryland 20742, USA
| | - Ed Smith
- Department of Hearing and Speech Sciences, University of Maryland, College Park, Maryland 20742, USA
| | - Sandra Gordon-Salant
- Department of Hearing and Speech Sciences, University of Maryland, College Park, Maryland 20742, USA
| |
Collapse
|
32
|
Guest DR, Oxenham AJ. Human discrimination and modeling of high-frequency complex tones shed light on the neural codes for pitch. PLoS Comput Biol 2022; 18:e1009889. [PMID: 35239639 PMCID: PMC8923464 DOI: 10.1371/journal.pcbi.1009889] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2021] [Revised: 03/15/2022] [Accepted: 02/02/2022] [Indexed: 11/24/2022] Open
Abstract
Accurate pitch perception of harmonic complex tones is widely believed to rely on temporal fine structure information conveyed by the precise phase-locked responses of auditory-nerve fibers. However, accurate pitch perception remains possible even when spectrally resolved harmonics are presented at frequencies beyond the putative limits of neural phase locking, and it is unclear whether residual temporal information, or a coarser rate-place code, underlies this ability. We addressed this question by measuring human pitch discrimination at low and high frequencies for harmonic complex tones, presented either in isolation or in the presence of concurrent complex-tone maskers. We found that concurrent complex-tone maskers impaired performance at both low and high frequencies, although the impairment introduced by adding maskers at high frequencies relative to low frequencies differed between the tested masker types. We then combined simulated auditory-nerve responses to our stimuli with ideal-observer analysis to quantify the extent to which performance was limited by peripheral factors. We found that the worsening of both frequency discrimination and F0 discrimination at high frequencies could be well accounted for (in relative terms) by optimal decoding of all available information at the level of the auditory nerve. A Python package is provided to reproduce these results, and to simulate responses to acoustic stimuli from the three previously published models of the human auditory nerve used in our analyses.
Collapse
Affiliation(s)
- Daniel R. Guest
- Department of Psychology, University of Minnesota, Minneapolis, Minnesota, United States of America
| | - Andrew J. Oxenham
- Department of Psychology, University of Minnesota, Minneapolis, Minnesota, United States of America
| |
Collapse
|
33
|
Individualized Assays of Temporal Coding in the Ascending Human Auditory System. eNeuro 2022; 9:ENEURO.0378-21.2022. [PMID: 35193890 PMCID: PMC8925652 DOI: 10.1523/eneuro.0378-21.2022] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2021] [Revised: 01/12/2022] [Accepted: 02/08/2022] [Indexed: 11/21/2022] Open
Abstract
Neural phase-locking to temporal fluctuations is a fundamental and unique mechanism by which acoustic information is encoded by the auditory system. The perceptual role of this metabolically expensive mechanism, the neural phase-locking to temporal fine structure (TFS) in particular, is debated. Although hypothesized, it is unclear whether auditory perceptual deficits in certain clinical populations are attributable to deficits in TFS coding. Efforts to uncover the role of TFS have been impeded by the fact that there are no established assays for quantifying the fidelity of TFS coding at the individual level. While many candidates have been proposed, for an assay to be useful, it should not only intrinsically depend on TFS coding, but should also have the property that individual differences in the assay reflect TFS coding per se over and beyond other sources of variance. Here, we evaluate a range of behavioral and electroencephalogram (EEG)-based measures as candidate individualized measures of TFS sensitivity. Our comparisons of behavioral and EEG-based metrics suggest that extraneous variables dominate both behavioral scores and EEG amplitude metrics, rendering them ineffective. After adjusting behavioral scores using lapse rates, and extracting latency or percent-growth metrics from EEG, interaural timing sensitivity measures exhibit robust behavior-EEG correlations. Together with the fact that unambiguous theoretical links can be made relating binaural measures and phase-locking to TFS, our results suggest that these "adjusted" binaural assays may be well suited for quantifying individual TFS processing.
Collapse
|
34
|
Chauvette L, Fournier P, Sharp A. The frequency-following response to assess the neural representation of spectral speech cues in older adults. Hear Res 2022; 418:108486. [DOI: 10.1016/j.heares.2022.108486] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/01/2021] [Revised: 03/12/2022] [Accepted: 03/15/2022] [Indexed: 11/04/2022]
|
35
|
Cabrera L, Lau BK. The development of auditory temporal processing during the first year of life. HEARING, BALANCE AND COMMUNICATION 2022; 20:155-165. [PMID: 36111124 PMCID: PMC9473293 DOI: 10.1080/21695717.2022.2029092] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
OBJECTIVES The processing of auditory temporal information is important for the extraction of voice pitch, linguistic information, as well as the overall temporal structure of speech. However, many aspects of its early development remain poorly understood. This paper reviews the development of auditory temporal processing during the first year of life when infants are acquiring their native language. METHODS First, potential mechanisms of neural immaturity are discussed in the context of neurophysiological studies. Next, what is known about infant auditory capabilities is considered with a focus on psychophysical studies involving non-speech stimuli to investigate the perception of temporal fine structure and envelope cues. This is followed by a review of studies involving speech stimuli, including those that present vocoded signals as a method of degrading the spectro-temporal information available to infant listeners. RESULTS/CONCLUSION This review suggests that temporal resolution may be well developed in the first postnatal months, but that the ability to use and process the temporal information in an efficient way along the entire auditory pathway is longer to develop. Those findings have crucial implications for the development of language abilities, especially for infants with hearing impairment who are using cochlear implants.
Collapse
Affiliation(s)
- Laurianne Cabrera
- Université de Paris, INCC UMR 8002, CNRS, 45 rue des saints-pères, F-75006 Paris, France
| | - Bonnie K Lau
- Department of Otolaryngology - Head & Neck Surgery, University of Washington, 1701 NE Columbia Rd, Box 257923, Seattle, WA 98195
| |
Collapse
|
36
|
Goldsworthy RL, Bissmeyer SRS, Camarena A. Advantages of Pulse Rate Compared to Modulation Frequency for Temporal Pitch Perception in Cochlear Implant Users. J Assoc Res Otolaryngol 2022; 23:137-150. [PMID: 34981263 PMCID: PMC8782986 DOI: 10.1007/s10162-021-00828-w] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2021] [Accepted: 12/01/2021] [Indexed: 02/03/2023] Open
Abstract
Most cochlear implants encode the fundamental frequency of periodic sounds by amplitude modulation of constant-rate pulsatile stimulation. Pitch perception provided by such stimulation strategies is markedly poor. Two experiments are reported here that consider potential advantages of pulse rate compared to modulation frequency for providing stimulation timing cues for pitch. The first experiment examines beat frequency distortion that occurs when modulating constant-rate pulsatile stimulation. This distortion has been reported on previously, but the results presented here indicate that distortion occurs for higher stimulation rates than previously reported. The second experiment examines pitch resolution as provided by pulse rate compared to modulation frequency. The results indicate that pitch discrimination is better with pulse rate than with modulation frequency. The advantage was large for rates near what has been suggested as the upper limit of temporal pitch perception conveyed by cochlear implants. The results are relevant to sound processing design for cochlear implants particularly for algorithms that encode fundamental frequency into deep envelope modulations or into precisely timed pulsatile stimulation.
Collapse
Affiliation(s)
- Raymond L Goldsworthy
- Auditory Research Center, Caruso Department of Otolaryngology, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA.
| | - Susan R S Bissmeyer
- Auditory Research Center, Caruso Department of Otolaryngology, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
- Department of Biomedical Engineering, Viterbi School of Engineering, University of Southern California, Los Angeles, CA, USA
| | - Andres Camarena
- Auditory Research Center, Caruso Department of Otolaryngology, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
- Neuroscience Graduate Program, University of Southern California, Los Angeles, CA, USA
| |
Collapse
|
37
|
Saddler MR, Gonzalez R, McDermott JH. Deep neural network models reveal interplay of peripheral coding and stimulus statistics in pitch perception. Nat Commun 2021; 12:7278. [PMID: 34907158 PMCID: PMC8671597 DOI: 10.1038/s41467-021-27366-6] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2020] [Accepted: 11/12/2021] [Indexed: 11/15/2022] Open
Abstract
Perception is thought to be shaped by the environments for which organisms are optimized. These influences are difficult to test in biological organisms but may be revealed by machine perceptual systems optimized under different conditions. We investigated environmental and physiological influences on pitch perception, whose properties are commonly linked to peripheral neural coding limits. We first trained artificial neural networks to estimate fundamental frequency from biologically faithful cochlear representations of natural sounds. The best-performing networks replicated many characteristics of human pitch judgments. To probe the origins of these characteristics, we then optimized networks given altered cochleae or sound statistics. Human-like behavior emerged only when cochleae had high temporal fidelity and when models were optimized for naturalistic sounds. The results suggest pitch perception is critically shaped by the constraints of natural environments in addition to those of the cochlea, illustrating the use of artificial neural networks to reveal underpinnings of behavior.
Collapse
Affiliation(s)
- Mark R Saddler
- Department of Brain and Cognitive Sciences, MIT, Cambridge, MA, USA.
- McGovern Institute for Brain Research, MIT, Cambridge, MA, USA.
- Center for Brains, Minds and Machines, MIT, Cambridge, MA, USA.
| | - Ray Gonzalez
- Department of Brain and Cognitive Sciences, MIT, Cambridge, MA, USA
- McGovern Institute for Brain Research, MIT, Cambridge, MA, USA
- Center for Brains, Minds and Machines, MIT, Cambridge, MA, USA
| | - Josh H McDermott
- Department of Brain and Cognitive Sciences, MIT, Cambridge, MA, USA.
- McGovern Institute for Brain Research, MIT, Cambridge, MA, USA.
- Center for Brains, Minds and Machines, MIT, Cambridge, MA, USA.
- Program in Speech and Hearing Biosciences and Technology, Harvard University, Cambridge, MA, USA.
| |
Collapse
|
38
|
Patro C, Kreft HA, Wojtczak M. The search for correlates of age-related cochlear synaptopathy: Measures of temporal envelope processing and spatial release from speech-on-speech masking. Hear Res 2021; 409:108333. [PMID: 34425347 PMCID: PMC8424701 DOI: 10.1016/j.heares.2021.108333] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/02/2020] [Revised: 07/17/2021] [Accepted: 08/04/2021] [Indexed: 01/13/2023]
Abstract
Older adults often experience difficulties understanding speech in adverse listening conditions. It has been suggested that for listeners with normal and near-normal audiograms, these difficulties may, at least in part, arise from age-related cochlear synaptopathy. The aim of this study was to assess if performance on auditory tasks relying on temporal envelope processing reveal age-related deficits consistent with those expected from cochlear synaptopathy. Listeners aged 20 to 66 years were tested using a series of psychophysical, electrophysiological, and speech-perception measures using stimulus configurations that promote coding by medium- and low-spontaneous-rate auditory-nerve fibers. Cognitive measures of executive function were obtained to control for age-related cognitive decline. Results from the different tests were not significantly correlated with each other despite a presumed reliance on common mechanisms involved in temporal envelope processing. Only gap-detection thresholds for a tone in noise and spatial release from speech-on-speech masking were significantly correlated with age. Increasing age was related to impaired cognitive executive function. Multivariate regression analyses showed that individual differences in hearing sensitivity, envelope-based measures, and scores from nonauditory cognitive tests did not significantly contribute to the variability in spatial release from speech-on-speech masking for small target/masker spatial separation, while age was a significant contributor.
Collapse
Affiliation(s)
- Chhayakanta Patro
- Department of Psychology, University of Minnesota, N640 Elliott Hall, 75 East River Parkway, Minneapolis, MN 55455, USA.
| | - Heather A Kreft
- Department of Psychology, University of Minnesota, N640 Elliott Hall, 75 East River Parkway, Minneapolis, MN 55455, USA
| | - Magdalena Wojtczak
- Department of Psychology, University of Minnesota, N640 Elliott Hall, 75 East River Parkway, Minneapolis, MN 55455, USA
| |
Collapse
|
39
|
Temporal Correlates to Monaural Edge Pitch in the Distribution of Interspike Interval Statistics in the Auditory Nerve. eNeuro 2021; 8:ENEURO.0292-21.2021. [PMID: 34281977 PMCID: PMC8387151 DOI: 10.1523/eneuro.0292-21.2021] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2021] [Accepted: 07/07/2021] [Indexed: 12/02/2022] Open
Abstract
Pitch is a perceptual attribute enabling perception of melody. There is no consensus regarding the fundamental nature of pitch and its underlying neural code. A stimulus which has received much interest in psychophysical and computational studies is noise with a sharp spectral edge. High-pass (HP) or low-pass (LP) noise gives rise to a pitch near the edge frequency (monaural edge pitch; MEP). The simplicity of this stimulus, combined with its spectral and autocorrelation properties, make it an interesting stimulus to examine spectral versus temporal cues that could underly its pitch. We recorded responses of single auditory nerve (AN) fibers in chinchilla to MEP-stimuli varying in edge frequency. Temporal cues were examined with shuffled autocorrelogram (SAC) analysis. Correspondence between the population’s dominant interspike interval and reported pitch estimates was poor. A fuller analysis of the population interspike interval distribution, which incorporates not only the dominant but all intervals, results in good matches with behavioral results, but not for the entire range of edge frequencies that generates pitch. Finally, we also examined temporal structure over a slower time scale, intermediate between average firing rate and interspike intervals, by studying the SAC envelope. We found that, in response to a given MEP stimulus, this feature also systematically varies with edge frequency, across fibers with different characteristic frequency (CF). Because neural mechanisms to extract envelope cues are well established, and because this cue is not limited by coding of stimulus fine-structure, this newly identified slower temporal cue is a more plausible basis for pitch than cues based on fine-structure.
Collapse
|
40
|
Fischer T, Schmid C, Kompis M, Mantokoudis G, Caversaccio M, Wimmer W. Effects of temporal fine structure preservation on spatial hearing in bilateral cochlear implant users. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2021; 150:673. [PMID: 34470279 DOI: 10.1121/10.0005732] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/19/2020] [Accepted: 07/02/2021] [Indexed: 06/13/2023]
Abstract
Typically, the coding strategies of cochlear implant audio processors discard acoustic temporal fine structure information (TFS), which may be related to the poor perception of interaural time differences (ITDs) and the resulting reduced spatial hearing capabilities compared to normal-hearing individuals. This study aimed to investigate to what extent bilateral cochlear implant (BiCI) recipients can exploit ITD cues provided by a TFS preserving coding strategy (FS4) in a series of sound field spatial hearing tests. As a baseline, we assessed the sensitivity to ITDs and binaural beats of 12 BiCI subjects with a coding strategy disregarding fine structure (HDCIS) and the FS4 strategy. For 250 Hz pure-tone stimuli but not for broadband noise, the BiCI users had significantly improved ITD discrimination using the FS4 strategy. In the binaural beat detection task and the broadband sound localization, spatial discrimination, and tracking tasks, no significant differences between the two tested coding strategies were observed. These results suggest that ITD sensitivity did not generalize to broadband stimuli or sound field spatial hearing tests, suggesting that it would not be useful for real-world listening.
Collapse
Affiliation(s)
- T Fischer
- Department of ENT, Head and Neck Surgery, Inselspital, Bern University Hospital, University of Bern, Freiburgstrasse, 3010 Bern, Switzerland
| | - C Schmid
- Department of ENT, Head and Neck Surgery, Inselspital, Bern University Hospital, University of Bern, Freiburgstrasse, 3010 Bern, Switzerland
| | - M Kompis
- Department of ENT, Head and Neck Surgery, Inselspital, Bern University Hospital, University of Bern, Freiburgstrasse, 3010 Bern, Switzerland
| | - G Mantokoudis
- Department of ENT, Head and Neck Surgery, Inselspital, Bern University Hospital, University of Bern, Freiburgstrasse, 3010 Bern, Switzerland
| | - M Caversaccio
- Department of ENT, Head and Neck Surgery, Inselspital, Bern University Hospital, University of Bern, Freiburgstrasse, 3010 Bern, Switzerland
| | - W Wimmer
- Department of ENT, Head and Neck Surgery, Inselspital, Bern University Hospital, University of Bern, Freiburgstrasse, 3010 Bern, Switzerland
| |
Collapse
|
41
|
Ellis RJ, Rönnberg J. Temporal fine structure: associations with cognition and speech-in-noise recognition in adults with normal hearing or hearing impairment. Int J Audiol 2021; 61:778-786. [PMID: 34292115 DOI: 10.1080/14992027.2021.1948119] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
Abstract
OBJECTIVES To investigate associations between sensitivity to temporal fine structure (TFS) and performance in cognitive and speech-in-noise recognition tests. DESIGN A binaural test of TFS sensitivity (the TFS-LF) was used. Measures of cognition included the reading span, Raven's, and text-reception threshold tests. Measures of speech recognition included the Hearing in noise (HINT) and the Hagerman matrix sentence tests in three signal processing conditions. STUDY SAMPLE Analyses are based on the performance of 324/317 adults with and without hearing impairment. RESULTS Sensitivity to TFS was significantly correlated with both the reading span test and the recognition of speech-in-noise processed using noise reduction, the latter only when limited to participants with hearing impairment. Neither association was significant when the effects of age were partialled out. CONCLUSIONS The findings are consistent with previous research in finding no evidence of a link between sensitivity to TFS and working memory once the effects of age had been partialled out. The results provide some evidence of an influence of signal processing strategy on the association between TFS sensitivity and speech-in-noise recognition. However, further research is necessary to assess the generalisability of the findings before any claims can be made regarding any clinical implications of these findings.
Collapse
Affiliation(s)
- Rachel J Ellis
- Institute for Behavioural Sciences and Learning, Linköping University, Linköping, Sweden.,Swedish Institute for Disability Research, Linköping, Sweden
| | - Jerker Rönnberg
- Institute for Behavioural Sciences and Learning, Linköping University, Linköping, Sweden.,Swedish Institute for Disability Research, Linköping, Sweden
| |
Collapse
|
42
|
Brown AD, Tollin DJ. Effects of interaural decoherence on sensitivity to interaural level differences across frequency. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2021; 149:4630. [PMID: 34241434 PMCID: PMC8249038 DOI: 10.1121/10.0005123] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/22/2020] [Revised: 04/27/2021] [Accepted: 05/10/2021] [Indexed: 05/19/2023]
Abstract
The interaural level difference (ILD) is a robust indicator of sound source azimuth, and human ILD sensitivity persists under conditions that degrade normally-dominant interaural time difference (ITD) cues. Nonetheless, ILD sensitivity varies somewhat with both stimulus frequency and interaural correlation (coherence). To further investigate the combined binaural perceptual influence of these variables, the present study assessed ILD sensitivity at frequencies 250-4000 Hz using stimuli of varied interaural correlation. In the first of two experiments, ILD discrimination thresholds were modestly elevated, and subjective lateralization slightly reduced, for both half-correlated and uncorrelated narrowband noise tokens relative to correlated tokens. Different from thresholds in the correlated condition, which were worst at 1000 Hz [Grantham, D.W. (1984). J. Acoust. Soc. Am. 75, 1191-1194], thresholds in the decorrelated conditions were independent of frequency. However, intrinsic envelope fluctuations in narrowband stimuli caused moment-to-moment variation of the nominal ILD, complicating interpretation of measured thresholds. Thus, a second experiment employed low-fluctuation noise tokens, revealing a clear effect of interaural decoherence per se that was strongly frequency-dependent, decreasing in magnitude from low to high frequencies. Measurements are consistent with known integration times in ILD-sensitive neurons and also suggest persistent influences of covert ITD cues in putative "ILD" tasks.
Collapse
Affiliation(s)
- Andrew D Brown
- Department of Speech and Hearing Sciences, University of Washington, 1417 Northeast 42nd Street, Seattle, Washington 98105, USA
| | - Daniel J Tollin
- Department of Physiology and Biophysics, University of Colorado School of Medicine, Aurora, Colorado 80045, USA
| |
Collapse
|
43
|
Unilateral auditory deprivation in humans: Effects on frequency discrimination and auditory memory span in the normal ear. Hear Res 2021; 405:108245. [PMID: 33887536 DOI: 10.1016/j.heares.2021.108245] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/20/2020] [Revised: 03/12/2021] [Accepted: 03/31/2021] [Indexed: 11/22/2022]
Abstract
Hearing with one ear is associated with auditory deprivation leading to cortical neuronal reorganization. Despite evidence for substantial effects of unilateral input on cortical and sub-cortical structures, the functional consequences of such alterations on human hearing is underexplored. Unilateral hearing impairment offers a unique model to study the perceptual consequences of cortical reorganization. The present study provides evidence for larger (poorer) difference limens for frequency for sounds heard by the normal ear of listeners with unilateral hearing loss relative to bilaterally normal-hearing controls. This difference in frequency discrimination ability was observed for the low (250 Hz), but not for the high-frequency tone (4000 Hz). Besides auditory perceptual effects, we also found reduced working memory capacity as revealed by forward and backward digit span measures. Contrary to the expectation, there was no significant association between frequency discrimination and working memory capacity in listeners with unilateral hearing loss. Auditory deprivation associated with unilateral hearing impairment affects low-frequency (pitch) discrimination and working memory capacity despite normal hearing in the intact ear. Such deficits in basic auditory processes and memory span for sounds heard by the normal ear may contribute to the hearing and communication difficulties experienced by listeners with unilateral or single-sided deafness.
Collapse
|
44
|
Gockel HE, Carlyon RP. On musical interval perception for complex tones at very high frequencies. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2021; 149:2644. [PMID: 33940917 PMCID: PMC7612123 DOI: 10.1121/10.0004222] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/16/2020] [Accepted: 03/17/2021] [Indexed: 06/12/2023]
Abstract
Listeners appear able to extract a residue pitch from high-frequency harmonics for which phase locking to the temporal fine structure is weak or absent. The present study investigated musical interval perception for high-frequency harmonic complex tones using the same stimuli as Lau, Mehta, and Oxenham [J. Neurosci. 37, 9013-9021 (2017)]. Nine young musically trained listeners with especially good high-frequency hearing adjusted various musical intervals using harmonic complex tones containing harmonics 6-10. The reference notes had fundamental frequencies (F0s) of 280 or 1400 Hz. Interval matches were possible, albeit markedly worse, even when all harmonic frequencies were above the presumed limit of phase locking. Matches showed significantly larger systematic errors and higher variability, and subjects required more trials to finish a match for the high than for the low F0. Additional absolute pitch judgments from one subject with absolute pitch, for complex tones containing harmonics 1-5 or 6-10 with a wide range of F0s, were perfect when the lowest frequency component was below about 7 kHz, but at least 50% of responses were incorrect when it was 8 kHz or higher. The results are discussed in terms of the possible effects of phase-locking information and familiarity with high-frequency stimuli on pitch.
Collapse
|
45
|
Abstract
Signal processing algorithms are the hidden components in the audio processor that converts the received acoustic signal into electrical impulses while maintaining as much relevant information as possible. Signal processing algorithms should be smart enough to mimic the functionality of external, middle and the inner-ear to provide the cochlear implant (CI) user with a hearing experience as natural as possible. Modern sound processing strategies are based on the continuous interleaved sampling (CIS) strategy proposed by B. Wilson in 1991, which provided envelope information over several intracochlear electrodes. The CIS strategy brought significant gains in speech perception. Translational research activities of MED-EL resulted in further improvements in speech understanding in noisy environments as well as enjoyment of music by not only coding CIS-based envelope information, but by also representing temporal fine structure information in the stimulation patterns of the apical channels. Further developments include "complete cochlear coverage" made possible by deep insertion of the intracochlear electrode, elaborate front end processing, anatomy based fitting (ABF), triphasic pulse stimulation instrumental in the suppression of facial nerve stimulation, and bimodal delay compensation allowing unilateral CI users to experience hearing with hearing aids on the contralateral ear. The large number of hardware developments might be exemplified by the RONDO, the world's first single unit audio processor in 2013. This article covers the milestones of translational research around the signal processing and audio processor topic that took place in association with MED-EL.
Collapse
Affiliation(s)
| | - Ingeborg Hochmair
- MED-EL Elektromedizinische Geraete Gesellschaft m.b.H., Innsbruck, Austria
| |
Collapse
|
46
|
Demany L, Monteiro G, Semal C, Shamma S, Carlyon RP. The perception of octave pitch affinity and harmonic fusion have a common origin. Hear Res 2021; 404:108213. [PMID: 33662686 PMCID: PMC7614450 DOI: 10.1016/j.heares.2021.108213] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/20/2020] [Revised: 02/05/2021] [Accepted: 02/10/2021] [Indexed: 02/06/2023]
Abstract
Musicians say that the pitches of tones with a frequency ratio of 2:1 (one octave) have a distinctive affinity, even if the tones do not have common spectral components. It has been suggested, however, that this affinity judgment has no biological basis and originates instead from an acculturation process ‒ the learning of musical rules unrelated to auditory physiology. We measured, in young amateur musicians, the perceptual detectability of octave mistunings for tones presented alternately (melodic condition) or simultaneously (harmonic condition). In the melodic condition, mistuning was detectable only by means of explicit pitch comparisons. In the harmonic condition, listeners could use a different and more efficient perceptual cue: in the absence of mistuning, the tones fused into a single sound percept; mistunings decreased fusion. Performance was globally better in the harmonic condition, in line with the hypothesis that listeners used a fusion cue in this condition; this hypothesis was also supported by results showing that an illusory simultaneity of the tones was much less advantageous than a real simultaneity. In the two conditions, mistuning detection was generally better for octave compressions than for octave stretchings. This asymmetry varied across listeners, but crucially the listener-specific asymmetries observed in the two conditions were highly correlated. Thus, the perception of the melodic octave appeared to be closely linked to the phenomenon of harmonic fusion. As harmonic fusion is thought to be determined by biological factors rather than factors related to musical culture or training, we argue that octave pitch affinity also has, at least in part, a biological basis.
Collapse
Affiliation(s)
- Laurent Demany
- Institut de Neurosciences Cognitives et Intégratives d'Aquitaine, CNRS, EPHE, and Université de Bordeaux, Bordeaux, France.
| | - Guilherme Monteiro
- Institut de Neurosciences Cognitives et Intégratives d'Aquitaine, CNRS, EPHE, and Université de Bordeaux, Bordeaux, France
| | - Catherine Semal
- Institut de Neurosciences Cognitives et Intégratives d'Aquitaine, CNRS, EPHE, and Université de Bordeaux, Bordeaux, France; Bordeaux INP, Bordeaux, France.
| | - Shihab Shamma
- Institute for Systems Research, University of Maryland, College Park, MD, United States; Département d'Etudes Cognitives, Ecole Normale Supérieure, Paris, France.
| | - Robert P Carlyon
- Cambridge Hearing Group, MRC Cognition and Brain Sciences Unit, Cambridge, United Kingdom.
| |
Collapse
|
47
|
de Cheveigné A. Harmonic Cancellation-A Fundamental of Auditory Scene Analysis. Trends Hear 2021; 25:23312165211041422. [PMID: 34698574 PMCID: PMC8552394 DOI: 10.1177/23312165211041422] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2021] [Revised: 07/23/2021] [Accepted: 07/09/2021] [Indexed: 11/16/2022] Open
Abstract
This paper reviews the hypothesis of harmonic cancellation according to which an interfering sound is suppressed or canceled on the basis of its harmonicity (or periodicity in the time domain) for the purpose of Auditory Scene Analysis. It defines the concept, discusses theoretical arguments in its favor, and reviews experimental results that support it, or not. If correct, the hypothesis may draw on time-domain processing of temporally accurate neural representations within the brainstem, as required also by the classic equalization-cancellation model of binaural unmasking. The hypothesis predicts that a target sound corrupted by interference will be easier to hear if the interference is harmonic than inharmonic, all else being equal. This prediction is borne out in a number of behavioral studies, but not all. The paper reviews those results, with the aim to understand the inconsistencies and come up with a reliable conclusion for, or against, the hypothesis of harmonic cancellation within the auditory system.
Collapse
Affiliation(s)
- Alain de Cheveigné
- Laboratoire des systèmes perceptifs, CNRS, Paris, France
- Département d’études cognitives, École normale supérieure, PSL
University, Paris, France
- UCL Ear Institute, London, UK
| |
Collapse
|
48
|
Bissmeyer SRS, Hossain S, Goldsworthy RL. Perceptual learning of pitch provided by cochlear implant stimulation rate. PLoS One 2020; 15:e0242842. [PMID: 33270735 PMCID: PMC7714175 DOI: 10.1371/journal.pone.0242842] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2019] [Accepted: 11/10/2020] [Indexed: 11/19/2022] Open
Abstract
Cochlear implant users hear pitch evoked by stimulation rate, but discrimination diminishes for rates above 300 Hz. This upper limit on rate pitch is surprising given the remarkable and specialized ability of the auditory nerve to respond synchronously to stimulation rates at least as high as 3 kHz and arguably as high as 10 kHz. Sensitivity to stimulation rate as a pitch cue varies widely across cochlear implant users and can be improved with training. The present study examines individual differences and perceptual learning of stimulation rate as a cue for pitch ranking. Adult cochlear implant users participated in electrode psychophysics that involved testing once per week for three weeks. Stimulation pulse rate discrimination was measured in bipolar and monopolar configurations for apical and basal electrodes. Base stimulation rates between 100 and 800 Hz were examined. Individual differences were quantified using psychophysically derived metrics of spatial tuning and temporal integration. This study examined distribution of measures across subjects, predictive power of psychophysically derived metrics of spatial tuning and temporal integration, and the effect of training on rate discrimination thresholds. Psychophysical metrics of spatial tuning and temporal integration were not predictive of stimulation rate discrimination, but discrimination thresholds improved at lower frequencies with training. Since most clinical devices do not use variable stimulation rates, it is unknown to what extent recipients may learn to use stimulation rate cues if provided in a clear and consistent manner.
Collapse
Affiliation(s)
- Susan R. S. Bissmeyer
- Department of Biomedical Engineering, Viterbi School of Engineering, University of Southern California, Los Angeles, California, United States of America
- Auditory Research Center, Caruso Department of Otolaryngology, Keck School of Medicine, University of Southern California, Los Angeles, California, United States of America
| | - Shaikat Hossain
- Auditory Research Center, Caruso Department of Otolaryngology, Keck School of Medicine, University of Southern California, Los Angeles, California, United States of America
| | - Raymond L. Goldsworthy
- Department of Biomedical Engineering, Viterbi School of Engineering, University of Southern California, Los Angeles, California, United States of America
- Auditory Research Center, Caruso Department of Otolaryngology, Keck School of Medicine, University of Southern California, Los Angeles, California, United States of America
| |
Collapse
|
49
|
Binaural Frequency Modulation Detection in School-Age Children, Young Adults, and Older Adults: Effects of Interaural Modulator Phase. Ear Hear 2020; 42:691-699. [PMID: 33306546 DOI: 10.1097/aud.0000000000000975] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
OBJECTIVES The purpose of this study was to measure low-rate binaural frequency modulation (FM) detection across the lifespan as a gauge of temporal fine structure processing. Children and older adults were expected to perform more poorly than young adults but for different reasons. DESIGN Detection of 2-Hz FM carried by a 500-Hz pure tone was measured for modulators that were either in-phase or out-of-phase across ears. Thresholds were measured in quiet and in noise. Participants were school-age children (n = 44), young adults (n = 11), and older adults (n = 17) with normal or near-normal hearing. RESULTS Thresholds were lower for out-of-phase than in-phase modulators among all listening groups. Detection thresholds improved with child age, with larger effects of age for dichotic than diotic FM. Introduction of masking noise tended to elevate thresholds; this effect was larger for the dichotic condition than the diotic condition, and larger for older adults than young adults. In noise, young adults received the greatest dichotic benefit, followed by older adults, then young children. The relative effects of noise on dichotic benefit did not differ for young adults compared to young children and older adults; however, young children saw greater reduction in benefit due to noise than older adults. CONCLUSION The difference in dichotic benefit between children and young adults is consistent with maturation of central auditory processing. Differences in the effect of noise on dichotic benefit in young children and older adults support the idea that different factors or combinations of factors limit performance in these two groups. Although dichotic FM detection appears to be more sensitive to the effects of development and aging than diotic FM detection, the positive correlation between diotic and dichotic FM detection thresholds for all listeners suggests contribution of one or more factors common to both conditions.
Collapse
|
50
|
Mesik J, Wojtczak M. Effects of noise precursors on the detection of amplitude and frequency modulation for tones in noise. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2020; 148:3581. [PMID: 33379905 PMCID: PMC8097715 DOI: 10.1121/10.0002879] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/01/2020] [Revised: 11/05/2020] [Accepted: 11/16/2020] [Indexed: 06/12/2023]
Abstract
Recent studies on amplitude modulation (AM) detection for tones in noise reported that AM-detection thresholds improve when the AM stimulus is preceded by a noise precursor. The physiological mechanisms underlying this AM unmasking are unknown. One possibility is that adaptation to the level of the noise precursor facilitates AM encoding by causing a shift in neural rate-level functions to optimize level encoding around the precursor level. The aims of this study were to investigate whether such a dynamic-range adaptation is a plausible mechanism for the AM unmasking and whether frequency modulation (FM), thought to be encoded via AM, also exhibits the unmasking effect. Detection thresholds for AM and FM of tones in noise were measured with and without a fixed-level precursor. Listeners showing the unmasking effect were then tested with the precursor level roved over a wide range to modulate the effect of adaptation to the precursor level on the detection of the subsequent AM. It was found that FM detection benefits from a precursor and the magnitude of FM unmasking correlates with that of AM unmasking. Moreover, consistent with dynamic-range adaptation, the unmasking magnitude weakens as the level difference between the precursor and simultaneous masker of the tone increases.
Collapse
Affiliation(s)
- Juraj Mesik
- Department of Psychology, University of Minnesota, Minneapolis, Minnesota 55455, USA
| | - Magdalena Wojtczak
- Department of Psychology, University of Minnesota, Minneapolis, Minnesota 55455, USA
| |
Collapse
|