1
|
de Hoz L, McAlpine D. Noises on-How the Brain Deals with Acoustic Noise. BIOLOGY 2024; 13:501. [PMID: 39056695 PMCID: PMC11274191 DOI: 10.3390/biology13070501] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/17/2024] [Revised: 07/01/2024] [Accepted: 07/01/2024] [Indexed: 07/28/2024]
Abstract
What is noise? When does a sound form part of the acoustic background and when might it come to our attention as part of the foreground? Our brain seems to filter out irrelevant sounds in a seemingly effortless process, but how this is achieved remains opaque and, to date, unparalleled by any algorithm. In this review, we discuss how noise can be both background and foreground, depending on what a listener/brain is trying to achieve. We do so by addressing questions concerning the brain's potential bias to interpret certain sounds as part of the background, the extent to which the interpretation of sounds depends on the context in which they are heard, as well as their ethological relevance, task-dependence, and a listener's overall mental state. We explore these questions with specific regard to the implicit, or statistical, learning of sounds and the role of feedback loops between cortical and subcortical auditory structures.
Collapse
Affiliation(s)
- Livia de Hoz
- Neuroscience Research Center, Charité—Universitätsmedizin Berlin, 10117 Berlin, Germany
- Bernstein Center for Computational Neuroscience, 10115 Berlin, Germany
| | - David McAlpine
- Neuroscience Research Center, Charité—Universitätsmedizin Berlin, 10117 Berlin, Germany
- Department of Linguistics, Macquarie University Hearing, Australian Hearing Hub, Sydney, NSW 2109, Australia
| |
Collapse
|
2
|
Willmore BDB, King AJ. Adaptation in auditory processing. Physiol Rev 2023; 103:1025-1058. [PMID: 36049112 PMCID: PMC9829473 DOI: 10.1152/physrev.00011.2022] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023] Open
Abstract
Adaptation is an essential feature of auditory neurons, which reduces their responses to unchanging and recurring sounds and allows their response properties to be matched to the constantly changing statistics of sounds that reach the ears. As a consequence, processing in the auditory system highlights novel or unpredictable sounds and produces an efficient representation of the vast range of sounds that animals can perceive by continually adjusting the sensitivity and, to a lesser extent, the tuning properties of neurons to the most commonly encountered stimulus values. Together with attentional modulation, adaptation to sound statistics also helps to generate neural representations of sound that are tolerant to background noise and therefore plays a vital role in auditory scene analysis. In this review, we consider the diverse forms of adaptation that are found in the auditory system in terms of the processing levels at which they arise, the underlying neural mechanisms, and their impact on neural coding and perception. We also ask what the dynamics of adaptation, which can occur over multiple timescales, reveal about the statistical properties of the environment. Finally, we examine how adaptation to sound statistics is influenced by learning and experience and changes as a result of aging and hearing loss.
Collapse
Affiliation(s)
- Ben D. B. Willmore
- Department of Physiology, Anatomy and Genetics, University of Oxford, Oxford, United Kingdom
| | - Andrew J. King
- Department of Physiology, Anatomy and Genetics, University of Oxford, Oxford, United Kingdom
| |
Collapse
|
3
|
Palandrani KN, Hoover EC, Stavropoulos T, Seitz AR, Isarangura S, Gallun FJ, Eddins DA. Temporal integration of monaural and dichotic frequency modulation. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2021; 150:745. [PMID: 34470296 PMCID: PMC8337085 DOI: 10.1121/10.0005729] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/16/2020] [Revised: 06/17/2021] [Accepted: 07/02/2021] [Indexed: 05/06/2023]
Abstract
Frequency modulation (FM) detection at low modulation frequencies is commonly used as an index of temporal fine-structure processing. The present study evaluated the rate of improvement in monaural and dichotic FM across a range of test parameters. In experiment I, dichotic and monaural FM detection was measured as a function of duration and modulator starting phase. Dichotic FM thresholds were lower than monaural FM thresholds and the modulator starting phase had no effect on detection. Experiment II measured monaural FM detection for signals that differed in modulation rate and duration such that the improvement with duration in seconds (carrier) or cycles (modulator) was compared. Monaural FM detection improved monotonically with the number of modulation cycles, suggesting that the modulator is extracted prior to detection. Experiment III measured dichotic FM detection for shorter signal durations to test the hypothesis that dichotic FM relies primarily on the signal onset. The rate of improvement decreased as duration increased, which is consistent with the use of primarily onset cues for the detection of dichotic FM. These results establish that improvement with duration occurs as a function of the modulation cycles at a rate consistent with the independent-samples model for monaural FM, but later cycles contribute less to detection in dichotic FM.
Collapse
Affiliation(s)
- Katherine N Palandrani
- Department of Communication Sciences and Disorders, University of Maryland, College Park, Maryland 20742, USA
| | - Eric C Hoover
- Department of Communication Sciences and Disorders, University of Maryland, College Park, Maryland 20742, USA
| | - Trevor Stavropoulos
- Brain Game Center, University of California Riverside, Riverside, California 92521, USA
| | - Aaron R Seitz
- Department of Psychology, University of California Riverside, Riverside, California 92521, USA
| | - Sittiprapa Isarangura
- Department of Communication Sciences and Disorders, Mahidol University, Phaya Thai, Bangkok 10400, Thailand
| | - Frederick J Gallun
- Oregon Hearing Research Center, Oregon Health and Science University, Portland, Oregon 97239, USA
| | - David A Eddins
- Department of Communication Sciences and Disorders, University of South Florida, Tampa, Florida 33620, USA
| |
Collapse
|
4
|
Bondy BJ, Haimes DB, Golding NL. Physiological Diversity Influences Detection of Stimulus Envelope and Fine Structure in Neurons of the Medial Superior Olive. J Neurosci 2021; 41:6234-6245. [PMID: 34083255 PMCID: PMC8287997 DOI: 10.1523/jneurosci.2354-20.2021] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2020] [Revised: 05/03/2021] [Accepted: 05/05/2021] [Indexed: 01/10/2023] Open
Abstract
The neurons of the medial superior olive (MSO) of mammals extract azimuthal information from the delays between sounds reaching the two ears [interaural time differences (ITDs)]. Traditionally, all models of sound localization have assumed that MSO neurons represent a single population of cells with specialized and homogeneous intrinsic and synaptic properties that enable the detection of synaptic coincidence on a timescale of tens to hundreds of microseconds. Here, using patch-clamp recordings from large populations of anatomically labeled neurons in brainstem slices from male and female Mongolian gerbils (Meriones unguiculatus), we show that MSO neurons are far more physiologically diverse than previously appreciated, with properties that depend regionally on cell position along the topographic map of frequency. Despite exhibiting a similar morphology, neurons in the MSO exhibit subthreshold oscillations of differing magnitudes that drive action potentials at rates between 100 and 800 Hz. These oscillations are driven primarily by voltage-gated sodium channels and are distinct from resonant properties derived from other active membrane properties. We show that graded differences in these and other physiological properties across the MSO neuron population enable the MSO to duplex the encoding of ITD information in both fast, submillisecond time-varying signals as well as in slower envelopes.SIGNIFICANCE STATEMENT Neurons in the medial superior olive (MSO) encode sound localization cues by detecting microsecond differences in the arrival times of inputs from the left and right ears, and it has been assumed that this computation is made possible by highly stereotyped structural and physiological specializations. Here we report using a large (>400) sample size in which MSO neurons show a strikingly large continuum of functional properties despite exhibiting similar morphologies. We demonstrate that subthreshold oscillations mediated by voltage-gated Na+ channels play a key role in conferring graded differences in firing properties. This functional diversity likely confers capabilities of processing both fast, submillisecond-scale synaptic activity (acoustic "fine structure"), and slow-rising envelope information that is found in amplitude-modulated sounds and speech patterns.
Collapse
Affiliation(s)
- Brian J Bondy
- Department of Neuroscience, University of Texas at Austin, Austin, Texas 78712
- Center for Learning and Memory, University of Texas at Austin, Austin, Texas 78712
| | - David B Haimes
- Department of Neuroscience, University of Texas at Austin, Austin, Texas 78712
- Center for Learning and Memory, University of Texas at Austin, Austin, Texas 78712
| | - Nace L Golding
- Department of Neuroscience, University of Texas at Austin, Austin, Texas 78712
- Center for Learning and Memory, University of Texas at Austin, Austin, Texas 78712
| |
Collapse
|
5
|
Haywood NR, Undurraga JA, McAlpine D. The influence of envelope shape on the lateralization of amplitude-modulated, low-frequency sound. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2021; 149:3133. [PMID: 34241105 DOI: 10.1121/10.0004788] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/18/2020] [Accepted: 04/06/2021] [Indexed: 06/13/2023]
Abstract
For abruptly gated sound, interaural time difference (ITD) cues at onset carry greater perceptual weight than those following. This research explored how envelope shape influences such carrier ITD weighting. Experiment 1 assessed the perceived lateralization of a tonal binaural beat that transitioned through ITD (diotic envelope, mean carrier frequency of 500 Hz). Listeners' left/right lateralization judgments were compared to those for static-ITD tones. For an 8 Hz sinusoidally amplitude-modulated envelope, ITD cues 24 ms after onset well-predicted reported sidedness. For an equivalent-duration "abrupt" envelope, which was unmodulated besides 20-ms onset/offset ramps, reported sidedness corresponded to ITDs near onset (e.g., 6 ms). However, unlike for sinusoidal amplitude modulation, ITDs toward offset seemingly also influenced perceived sidedness. Experiment 2 adjusted the duration of the offset ramp (25-75 ms) and found evidence for such offset weighting only for the most abrupt ramp tested. In experiment 3, an ITD was imposed on a brief segment of otherwise diotic filtered noise. Listeners discriminated right- from left-leading ITDs. In sinusoidal amplitude modulation, thresholds were lowest when the ITD segment occurred during rising amplitude. For the abrupt envelope, the lowest thresholds were observed when the segment occurred at either onset or offset. These experiments demonstrate the influence of envelope profile on carrier ITD sensitivity.
Collapse
Affiliation(s)
- Nicholas R Haywood
- Department of Linguistics, Faculty of Medicine, Health and Human Sciences, Macquarie Hearing, Macquarie University, Sydney, New South Wales 2109, Australia
| | - Jaime A Undurraga
- Department of Linguistics, Faculty of Medicine, Health and Human Sciences, Macquarie Hearing, Macquarie University, Sydney, New South Wales 2109, Australia
| | - David McAlpine
- Department of Linguistics, Faculty of Medicine, Health and Human Sciences, Macquarie Hearing, Macquarie University, Sydney, New South Wales 2109, Australia
| |
Collapse
|
6
|
Auditory Brainstem Models: Adapting Cochlear Nuclei Improve Spatial Encoding by the Medial Superior Olive in Reverberation. J Assoc Res Otolaryngol 2021; 22:289-318. [PMID: 33861395 DOI: 10.1007/s10162-021-00797-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2020] [Accepted: 03/22/2021] [Indexed: 10/21/2022] Open
Abstract
Listeners typically perceive a sound as originating from the direction of its source, even as direct sound is followed milliseconds later by reflected sound from multiple different directions. Early-arriving sound is emphasised in the ascending auditory pathway, including the medial superior olive (MSO) where binaural neurons encode the interaural-time-difference (ITD) cue for spatial location. Perceptually, weighting of ITD conveyed during rising sound energy is stronger at 600 Hz than at 200 Hz, consistent with the minimum stimulus rate for binaural adaptation, and with the longer reverberation times at 600 Hz, compared with 200 Hz, in many natural outdoor environments. Here, we computationally explore the combined efficacy of adaptation prior to the binaural encoding of ITD cues, and excitatory binaural coincidence detection within MSO neurons, in emphasising ITDs conveyed in early-arriving sound. With excitatory inputs from adapting, nonlinear model spherical bushy cells (SBCs) of the bilateral cochlear nuclei, a nonlinear model MSO neuron with low-threshold potassium channels reproduces the rate-dependent emphasis of rising vs. peak sound energy in ITD encoding; adaptation is equally effective in the model MSO. Maintaining adaptation in model SBCs, and adjusting membrane speed in model MSO neurons, 'left' and 'right' populations of computationally efficient, linear model SBCs and MSO neurons reproduce this stronger weighting of ITD conveyed during rising sound energy at 600 Hz compared to 200 Hz. This hemispheric population model demonstrates a link between strong weighting of spatial information during rising sound energy, and correct unambiguous lateralisation of a speech source in reverberation.
Collapse
|
7
|
Haywood NR, McAlpine D. Estimating the perceptual weighting of interaural time difference cues in amplitude modulated binaural beats. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2020; 148:EL185. [PMID: 32872987 DOI: 10.1121/10.0001747] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/03/2020] [Accepted: 07/27/2020] [Indexed: 06/11/2023]
Abstract
For an abruptly gated sound, perceived lateralization is determined primarily by binaural cues at onset. Relatively less is known about the temporal weighing of binaural cues-such as interaural time difference (ITD)-during more naturalistic modulation profiles. Here, an experiment measured the lateralization of a tonal binaural beat modulated by a diotic, 8-Hz sinusoidal amplitude modulation. Binaural beat lateralization (left/right, two alternatives) was compared to that for tones with static ITDs. Across three mean carrier frequencies (200, 500, and 800 Hz), ITDs occurring during early rising amplitude (e.g., 20-25 ms after onset) predicted the perceived lateralization of the binaural beat signals well.
Collapse
Affiliation(s)
- Nicholas R Haywood
- Department of Linguistics, Faculty of Medicine, Health and Human Sciences, Macquarie Hearing, Macquarie University, Sydney, 2109, ,
| | - David McAlpine
- Department of Linguistics, Faculty of Medicine, Health and Human Sciences, Macquarie Hearing, Macquarie University, Sydney, 2109, ,
| |
Collapse
|
8
|
Zuk NJ, Delgutte B. Neural coding and perception of auditory motion direction based on interaural time differences. J Neurophysiol 2019; 122:1821-1842. [PMID: 31461376 DOI: 10.1152/jn.00081.2019] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open
Abstract
While motion is important for parsing a complex auditory scene into perceptual objects, how it is encoded in the auditory system is unclear. Perceptual studies suggest that the ability to identify the direction of motion is limited by the duration of the moving sound, yet we can detect changes in interaural differences at even shorter durations. To understand the source of these distinct temporal limits, we recorded from single units in the inferior colliculus (IC) of unanesthetized rabbits in response to noise stimuli containing a brief segment with linearly time-varying interaural time difference ("ITD sweep") temporally embedded in interaurally uncorrelated noise. We also tested the ability of human listeners to either detect the ITD sweeps or identify the motion direction. Using a point-process model to separate the contributions of stimulus dependence and spiking history to single-neuron responses, we found that the neurons respond primarily by following the instantaneous ITD rather than exhibiting true direction selectivity. Furthermore, using an optimal classifier to decode the single-neuron responses, we found that neural threshold durations of ITD sweeps for both direction identification and detection overlapped with human threshold durations even though the average response of the neurons could track the instantaneous ITD beyond psychophysical limits. Our results suggest that the IC does not explicitly encode motion direction, but internal neural noise may limit the speed at which we can identify the direction of motion.NEW & NOTEWORTHY Recognizing motion and identifying an object's trajectory are important for parsing a complex auditory scene, but how we do so is unclear. We show that neurons in the auditory midbrain do not exhibit direction selectivity as found in the visual system but instead follow the trajectory of the motion in their temporal firing patterns. Our results suggest that the inherent variability in neural firings may limit our ability to identify motion direction at short durations.
Collapse
Affiliation(s)
- Nathaniel J Zuk
- Eaton-Peabody Laboratories, Massachusetts Eye and Ear, Boston, Massachusetts
| | - Bertrand Delgutte
- Eaton-Peabody Laboratories, Massachusetts Eye and Ear, Boston, Massachusetts.,Department of Otolaryngology, Harvard Medical School, Boston, Massachusetts
| |
Collapse
|
9
|
Pastore MT, Braasch J. The impact of peripheral mechanisms on the precedence effect. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2019; 146:425. [PMID: 31370612 PMCID: PMC6658214 DOI: 10.1121/1.5116680] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/29/2018] [Revised: 06/20/2019] [Accepted: 06/25/2019] [Indexed: 06/10/2023]
Abstract
When two similar sounds are presented from different locations, with one (the lead) preceding the other (the lag) by a small delay, listeners typically report hearing one sound near the location of the lead sound source-this is called the precedence effect (PE). Several questions about the underlying mechanisms that produce the PE are asked. (1) How might listeners' relative weighting of cues at onset versus ongoing stimulus portions affect perceived lateral position of long-duration lead/lag noise stimuli? (2) What are the factors that influence this weighting? (3) Are the mechanisms invoked to explain the PE for transient stimuli applicable to long-duration stimuli? To answer these questions, lead/lag noise stimuli are presented with a range of durations, onset slopes, and lag-to-lead level ratios over headphones. Monaural, peripheral mechanisms, and binaural cue extraction are modeled to estimate the cues available for determination of perceived laterality. Results showed that all three stimulus manipulations affect the relative weighting of onset and ongoing cues and that mechanisms invoked to explain the PE for transient stimuli are also applicable to the PE, in terms of both onset and ongoing segments of long-duration, lead/lag stimuli.
Collapse
Affiliation(s)
- M Torben Pastore
- Spatial Hearing Laboratory, College of Health Solutions, Arizona State University, Tempe, Arizona 85287, USA
| | - Jonas Braasch
- School of Architecture & Cognitive and Immersive Systems Laboratory (CISL), Rensselaer Polytechnic Institute, Troy, New York 12180, USA
| |
Collapse
|
10
|
Moore BCJ. Effects of age on sensitivity to interaural time differences in envelope and fine structure, individually and in combination. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2018; 143:1287. [PMID: 29604696 PMCID: PMC5834318 DOI: 10.1121/1.5025845] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/09/2017] [Revised: 02/08/2018] [Accepted: 02/10/2018] [Indexed: 06/01/2023]
Abstract
Sensitivity to interaural time differences (ITDs) in envelope and temporal fine structure (TFS) of amplitude-modulated (AM) tones was assessed for young and older subjects, all with clinically normal hearing at the carrier frequencies of 250 and 500 Hz. Some subjects had hearing loss at higher frequencies. In experiment 1, thresholds for detecting changes in ITD were measured when the ITD was present in the TFS alone (ITDTFS), the envelope alone (ITDENV), or both (ITDTFS/ENV). Thresholds tended to be higher for the older than for the young subjects. ITDENV thresholds were much higher than ITDTFS thresholds, while ITDTFS/ENV thresholds were similar to ITDTFS thresholds. ITDTFS thresholds were lower than ITD thresholds obtained with an unmodulated pure tone, indicating that uninformative AM can improve ITDTFS discrimination. In experiment 2, equally detectable values of ITDTFS and ITDENV were combined so as to give consistent or inconsistent lateralization. There were large individual differences, but several subjects gave scores that were much higher than would be expected from the optimal combination of independent sources of information, even for the inconsistent condition. It is suggested that ITDTFS and ITDENV cues are processed partly independently, but that both cues influence lateralization judgments, even when one cue is uninformative.
Collapse
|
11
|
Freyman RL, Morse-Fortier C, Griffin AM, Zurek PM. Can monaural temporal masking explain the ongoing precedence effect? THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2018; 143:EL133. [PMID: 29495692 PMCID: PMC5826740 DOI: 10.1121/1.5024687] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/22/2017] [Revised: 01/24/2018] [Accepted: 02/01/2018] [Indexed: 06/08/2023]
Abstract
The precedence effect for transient sounds has been proposed to be based primarily on monaural processes, manifested by asymmetric temporal masking. This study explored the potential for monaural explanations with longer ("ongoing") sounds exhibiting the precedence effect. Transient stimuli were single lead-lag noise burst pairs; ongoing stimuli were trains of 63 burst pairs. Unlike with transients, monaural masking data for ongoing sounds showed no advantage for the lead, and are inconsistent with asymmetric audibility as an explanation for ongoing precedence. This result, along with supplementary measurements of interaural time discrimination, suggests different explanations for transient and ongoing precedence.
Collapse
Affiliation(s)
- Richard L Freyman
- Department of Communication Disorders, University of Massachusetts, 358 North Pleasant Street, Amherst, Massachusetts 01003, USA , ,
| | - Charlotte Morse-Fortier
- Department of Communication Disorders, University of Massachusetts, 358 North Pleasant Street, Amherst, Massachusetts 01003, USA , ,
| | - Amanda M Griffin
- Department of Communication Disorders, University of Massachusetts, 358 North Pleasant Street, Amherst, Massachusetts 01003, USA , ,
| | - Patrick M Zurek
- Sensimetrics Corporation, 14 Summer Street, Malden, Massachusetts 02148, USA
| |
Collapse
|
12
|
Dietz M, Lestang JH, Majdak P, Stern RM, Marquardt T, Ewert SD, Hartmann WM, Goodman DFM. A framework for testing and comparing binaural models. Hear Res 2017; 360:92-106. [PMID: 29208336 DOI: 10.1016/j.heares.2017.11.010] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/30/2017] [Revised: 11/03/2017] [Accepted: 11/24/2017] [Indexed: 11/19/2022]
Abstract
Auditory research has a rich history of combining experimental evidence with computational simulations of auditory processing in order to deepen our theoretical understanding of how sound is processed in the ears and in the brain. Despite significant progress in the amount of detail and breadth covered by auditory models, for many components of the auditory pathway there are still different model approaches that are often not equivalent but rather in conflict with each other. Similarly, some experimental studies yield conflicting results which has led to controversies. This can be best resolved by a systematic comparison of multiple experimental data sets and model approaches. Binaural processing is a prominent example of how the development of quantitative theories can advance our understanding of the phenomena, but there remain several unresolved questions for which competing model approaches exist. This article discusses a number of current unresolved or disputed issues in binaural modelling, as well as some of the significant challenges in comparing binaural models with each other and with the experimental data. We introduce an auditory model framework, which we believe can become a useful infrastructure for resolving some of the current controversies. It operates models over the same paradigms that are used experimentally. The core of the proposed framework is an interface that connects three components irrespective of their underlying programming language: The experiment software, an auditory pathway model, and task-dependent decision stages called artificial observers that provide the same output format as the test subject.
Collapse
Affiliation(s)
- Mathias Dietz
- National Centre for Audiology, Western University, London, ON, Canada.
| | - Jean-Hugues Lestang
- Department of Electrical and Electronic Engineering, Imperial College London, London, United Kingdom
| | - Piotr Majdak
- Institut für Schallforschung, Österreichische Akademie der Wissenschaften, Wien, Austria
| | | | | | - Stephan D Ewert
- Medizinische Physik, Universität Oldenburg, Oldenburg, Germany
| | | | - Dan F M Goodman
- Department of Electrical and Electronic Engineering, Imperial College London, London, United Kingdom
| |
Collapse
|
13
|
Greenberg D, Monaghan JJM, Dietz M, Marquardt T, McAlpine D. Influence of envelope waveform on ITD sensitivity of neurons in the auditory midbrain. J Neurophysiol 2017; 118:2358-2370. [PMID: 28701550 PMCID: PMC5646199 DOI: 10.1152/jn.01048.2015] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2015] [Revised: 07/10/2017] [Accepted: 07/11/2017] [Indexed: 12/04/2022] Open
Abstract
Using single-neuron electrophysiology, we show that the precise shape of a sound’s “energy envelope” is a critical factor in determining how well midbrain neurons are able to convey information about auditory spatial cues. Consistent with human behavioral performance, sounds with rapidly rising energy and relatively long intervals between energy bursts are best at conveying spatial information. The data suggest specific sound energy patterns that might best be applied to hearing devices to aid spatial listening. Interaural time differences (ITDs) conveyed by the modulated envelopes of high-frequency sounds can serve as a cue for localizing a sound source. Klein-Hennig et al. (J Acoust Soc Am 129: 3856, 2011) demonstrated the envelope attack (the rate at which stimulus energy in the envelope increases) and the duration of the pause (the interval between successive envelope pulses) as important factors affecting sensitivity to envelope ITDs in human listeners. Modulated sounds with rapid attacks and long pauses produce the lowest ITD discrimination thresholds. The duration of the envelope’s sustained component (sustain) and the rate at which stimulus energy falls at the offset of the envelope (decay) are only minor factors. We assessed the responses of 71 single neurons, recorded from the midbrains of 15 urethane-anesthetized tri-colored guinea pigs, to envelope shapes in which the four envelope components, i.e., attack, sustain, decay, and pause, were systematically varied. We confirmed the importance of the attack and pause components in generating ITD-sensitive responses. Analysis of neural firing rates demonstrated more neurons (49/71) show ITD sensitivity in response to “damped” stimuli (fast attack and slow decay) compared with “ramped” stimuli (slow attack and fast decay) (14/71). Furthermore, the lowest threshold for the damped stimulus (91 μs) was lower by a factor of 4 than that for the temporally reversed ramped envelope shape (407 μs). The data confirm the importance of fast attacks and optimal pause durations in generating sensitivity to ITDs conveyed in the modulated envelopes of high-frequency sounds and are incompatible with models of ITD processing based on the integration of sound energy over time. NEW & NOTEWORTHY Using single-neuron electrophysiology, we show that the precise shape of a sound’s “energy envelope” is a critical factor in determining how well midbrain neurons are able to convey information about auditory spatial cues. Consistent with human behavioral performance, sounds with rapidly rising energy and relatively long intervals between energy bursts are best at conveying spatial information. The data suggest specific sound energy patterns that might best be applied to hearing devices to aid spatial listening.
Collapse
Affiliation(s)
| | - Jessica J M Monaghan
- Department of Linguistics, Australian Hearing Hub, Macquarie University, Sydney, New South Wales, Australia; and
| | - Mathias Dietz
- Medizinische Physik and Cluster of Excellence Hearing4all, Universität Oldenburg, Oldenburg, Germany
| | | | - David McAlpine
- UCL Ear Institute, London, United Kingdom.,Department of Linguistics, Australian Hearing Hub, Macquarie University, Sydney, New South Wales, Australia; and
| |
Collapse
|
14
|
Zuk N, Delgutte B. Neural coding of time-varying interaural time differences and time-varying amplitude in the inferior colliculus. J Neurophysiol 2017; 118:544-563. [PMID: 28381487 DOI: 10.1152/jn.00797.2016] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2016] [Revised: 03/29/2017] [Accepted: 03/31/2017] [Indexed: 11/22/2022] Open
Abstract
Binaural cues occurring in natural environments are frequently time varying, either from the motion of a sound source or through interactions between the cues produced by multiple sources. Yet, a broad understanding of how the auditory system processes dynamic binaural cues is still lacking. In the current study, we directly compared neural responses in the inferior colliculus (IC) of unanesthetized rabbits to broadband noise with time-varying interaural time differences (ITD) with responses to noise with sinusoidal amplitude modulation (SAM) over a wide range of modulation frequencies. On the basis of prior research, we hypothesized that the IC, one of the first stages to exhibit tuning of firing rate to modulation frequency, might use a common mechanism to encode time-varying information in general. Instead, we found weaker temporal coding for dynamic ITD compared with amplitude modulation and stronger effects of adaptation for amplitude modulation. The differences in temporal coding of dynamic ITD compared with SAM at the single-neuron level could be a neural correlate of "binaural sluggishness," the inability to perceive fluctuations in time-varying binaural cues at high modulation frequencies, for which a physiological explanation has so far remained elusive. At ITD-variation frequencies of 64 Hz and above, where a temporal code was less effective, noise with a dynamic ITD could still be distinguished from noise with a constant ITD through differences in average firing rate in many neurons, suggesting a frequency-dependent tradeoff between rate and temporal coding of time-varying binaural information.NEW & NOTEWORTHY Humans use time-varying binaural cues to parse auditory scenes comprising multiple sound sources and reverberation. However, the neural mechanisms for doing so are poorly understood. Our results demonstrate a potential neural correlate for the reduced detectability of fluctuations in time-varying binaural information at high speeds, as occurs in reverberation. The results also suggest that the neural mechanisms for processing time-varying binaural and monaural cues are largely distinct.
Collapse
Affiliation(s)
- Nathaniel Zuk
- Eaton-Peabody Laboratories, Massachusetts Eye and Ear, Boston, Massachusetts.,Speech and Hearing Bioscience and Technology Program, Harvard-MIT Division of Health Sciences and Technology, Cambridge, Massachusetts; and
| | - Bertrand Delgutte
- Eaton-Peabody Laboratories, Massachusetts Eye and Ear, Boston, Massachusetts; .,Speech and Hearing Bioscience and Technology Program, Harvard-MIT Division of Health Sciences and Technology, Cambridge, Massachusetts; and.,Department of Otolaryngology, Harvard Medical School, Boston, Massachusetts
| |
Collapse
|
15
|
Input timing for spatial processing is precisely tuned via constant synaptic delays and myelination patterns in the auditory brainstem. Proc Natl Acad Sci U S A 2017; 114:E4851-E4858. [PMID: 28559325 DOI: 10.1073/pnas.1702290114] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
Precise timing of synaptic inputs is a fundamental principle of neural circuit processing. The temporal precision of postsynaptic input integration is known to vary with the computational requirements of a circuit, yet how the timing of action potentials is tuned presynaptically to match these processing demands is not well understood. In particular, action potential timing is shaped by the axonal conduction velocity and the duration of synaptic transmission delays within a pathway. However, it is not known to what extent these factors are adapted to the functional constraints of the respective circuit. Here, we report the finding of activity-invariant synaptic transmission delays as a functional adaptation for input timing adjustment in a brainstem sound localization circuit. We compared axonal and synaptic properties of the same pathway between two species with dissimilar timing requirements (gerbil and mouse): In gerbils (like humans), neuronal processing of sound source location requires exceptionally high input precision in the range of microseconds, but not in mice. Activity-invariant synaptic transmission and conduction delays were present exclusively in fast conducting axons of gerbils that also exhibited unusual structural adaptations in axon myelination for increased conduction velocity. In contrast, synaptic transmission delays in mice varied depending on activity levels, and axonal myelination and conduction velocity exhibited no adaptations. Thus, the specializations in gerbils and their absence in mice suggest an optimization of axonal and synaptic properties to the specific demands of sound localization. These findings significantly advance our understanding of structural and functional adaptations for circuit processing.
Collapse
|
16
|
Tolnai S, Beutelmann R, Klump GM. Exploring binaural hearing in gerbils (Meriones unguiculatus) using virtual headphones. PLoS One 2017; 12:e0175142. [PMID: 28394906 PMCID: PMC5386270 DOI: 10.1371/journal.pone.0175142] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2016] [Accepted: 03/21/2017] [Indexed: 11/19/2022] Open
Abstract
The Mongolian gerbil (Meriones unguiculatus) has become a key species in investigations of the neural processing of sound localization cues in mammals. While its sound localization has been tested extensively under free-field stimulation, many neurophysiological studies use headphones to present signals with binaural localization cues. The gerbil's behavioral sensitivity to binaural cues, however, is unknown for the lack of appropriate stimulation paradigms in awake behaving gerbils. We close this gap in knowledge by mimicking a headphone stimulation; we use free-field loudspeakers and apply cross-talk cancellation techniques to present pure tones with binaural cues via “virtual headphones” to gerbils trained in a sound localization task. All gerbils were able to lateralize sounds depending on the interaural time or level difference (ITD and ILD, respectively). For ITD stimuli, reliable responses were seen for frequencies ≤2.9 kHz, the highest frequency tested with ITD stimuli. ITD sensitivity was frequency-dependent with the highest sensitivity observed at 1 kHz. For stimuli with ITD outside the gerbil's physiological range, responses were cyclic indicating the use of phase information when lateralizing narrow-band sounds. For ILD stimuli, reliable responses were obtained for frequencies ≥2 kHz. The comparison of ITD and ILD thresholds with ITD and ILD thresholds derived from gerbils’ free-field performance suggests that ongoing ITD information is the main cue for sound localization at frequencies <2 kHz. At 2 kHz, ITD and ILD cues are likely used in a complementary way. Verification of the use of the virtual headphones suggests that they can serve as a suitable substitute for conventional headphones particularly at frequencies ≤2 kHz.
Collapse
Affiliation(s)
- Sandra Tolnai
- Cluster of Excellence “Hearing4all”, Animal Physiology and Behavior Group, Department of Neuroscience, School of Medicine and Health Sciences, Carl von Ossietzky University of Oldenburg, Oldenburg, Germany
- * E-mail:
| | - Rainer Beutelmann
- Cluster of Excellence “Hearing4all”, Animal Physiology and Behavior Group, Department of Neuroscience, School of Medicine and Health Sciences, Carl von Ossietzky University of Oldenburg, Oldenburg, Germany
| | - Georg M. Klump
- Cluster of Excellence “Hearing4all”, Animal Physiology and Behavior Group, Department of Neuroscience, School of Medicine and Health Sciences, Carl von Ossietzky University of Oldenburg, Oldenburg, Germany
| |
Collapse
|
17
|
Hu H, Ewert SD, McAlpine D, Dietz M. Differences in the temporal course of interaural time difference sensitivity between acoustic and electric hearing in amplitude modulated stimuli. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2017; 141:1862. [PMID: 28372072 DOI: 10.1121/1.4977014] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]
Abstract
Previous studies have shown that normal-hearing (NH) listeners' spatial perception of non-stationary interaural time differences (ITDs) is dominated by the carrier ITD during rising amplitude segments. Here, ITD sensitivity throughout the amplitude-modulation cycle in NH listeners and bilateral cochlear implant (CI) subjects is compared, the latter by means of direct stimulation of a single electrode pair. The data indicate that, while NH listeners are most sensitive to ITDs applied toward the beginning of a modulation cycle at 600 Hz, NH listeners at 200 Hz and especially bilateral CI subjects at 200 pulses per second (pps) are more sensitive to ITDs applied to the modulation maximum. This has implications for spatial-hearing in complex environments: NH listeners' dominant 600-Hz ITD information from the rising amplitude segments comprises direct sound information. The 200-pps low rate required to get ITD sensitivity in CI users results in a higher weight of pulses later in the modulation cycle where the source ITDs are more likely corrupted by reflections. This indirectly indicates that even if future binaural CI processors are able to provide perceptually exploitable ITD information, CI users will likely not get the full benefit from such pulse-based ITD cues in reverberant and other complex environments.
Collapse
Affiliation(s)
- Hongmei Hu
- Medizinische Physik and Cluster of Excellence "Hearing4all," Universität Oldenburg, D-26111 Oldenburg, Germany
| | - Stephan D Ewert
- Medizinische Physik and Cluster of Excellence "Hearing4all," Universität Oldenburg, D-26111 Oldenburg, Germany
| | - David McAlpine
- Department of Linguistics, Australian Hearing Hub, Macquarie University, New South Wales 2109, Australia
| | - Mathias Dietz
- Medizinische Physik and Cluster of Excellence "Hearing4all," Universität Oldenburg, D-26111 Oldenburg, Germany
| |
Collapse
|
18
|
Moore BCJ, Kolarik A, Stone MA, Lee YW. Evaluation of a method for enhancing interaural level differences at low frequencies. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2016; 140:2817. [PMID: 27794295 DOI: 10.1121/1.4965299] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
A method (called binaural enhancement) for enhancing interaural level differences at low frequencies, based on estimates of interaural time differences, was developed and evaluated. Five conditions were compared, all using simulated hearing-aid processing: (1) Linear amplification with frequency-response shaping; (2) binaural enhancement combined with linear amplification and frequency-response shaping; (3) slow-acting four-channel amplitude compression with independent compression at the two ears (AGC4CH); (4) binaural enhancement combined with four-channel compression (BE-AGC4CH); and (5) four-channel compression but with the compression gains synchronized across ears. Ten hearing-impaired listeners were tested, and gains and compression ratios for each listener were set to match targets prescribed by the CAM2 fitting method. Stimuli were presented via headphones, using virtualization methods to simulate listening in a moderately reverberant room. The intelligibility of speech at ±60° azimuth in the presence of competing speech on the opposite side of the head at ±60° azimuth was not affected by the binaural enhancement processing. Sound localization was significantly better for condition BE-AGC4CH than for condition AGC4CH for a sentence, but not for broadband noise, lowpass noise, or lowpass amplitude-modulated noise. The results suggest that the binaural enhancement processing can improve localization for sounds with distinct envelope fluctuations.
Collapse
Affiliation(s)
- Brian C J Moore
- Department of Experimental Psychology, University of Cambridge, Downing Street, Cambridge CB2 3EB, England
| | - Andrew Kolarik
- Department of Experimental Psychology, University of Cambridge, Downing Street, Cambridge CB2 3EB, England
| | - Michael A Stone
- Department of Experimental Psychology, University of Cambridge, Downing Street, Cambridge CB2 3EB, England
| | - Young-Woo Lee
- Samsung Electronics Co., Ltd., Maetan dong 129, Samsung-ro, Yeongtong-gu, Suwon-si, Gyeonggi-do, Korea
| |
Collapse
|
19
|
Fischl MJ, Burger RM, Schmidt-Pauly M, Alexandrova O, Sinclair JL, Grothe B, Forsythe ID, Kopp-Scheinpflug C. Physiology and anatomy of neurons in the medial superior olive of the mouse. J Neurophysiol 2016; 116:2676-2688. [PMID: 27655966 PMCID: PMC5133312 DOI: 10.1152/jn.00523.2016] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2016] [Accepted: 09/19/2016] [Indexed: 12/16/2022] Open
Abstract
In mammals with good low-frequency hearing, the medial superior olive (MSO) computes sound location by comparing differences in the arrival time of a sound at each ear, called interaural time disparities (ITDs). Low-frequency sounds are not reflected by the head, and therefore level differences and spectral cues are minimal or absent, leaving ITDs as the only cue for sound localization. Although mammals with high-frequency hearing and small heads (e.g., bats, mice) barely experience ITDs, the MSO is still present in these animals. Yet, aside from studies in specialized bats, in which the MSO appears to serve functions other than ITD processing, it has not been studied in small mammals that do not hear low frequencies. Here we describe neurons in the mouse brain stem that share prominent anatomical, morphological, and physiological properties with the MSO in species known to use ITDs for sound localization. However, these neurons also deviate in some important aspects from the typical MSO, including a less refined arrangement of cell bodies, dendrites, and synaptic inputs. In vitro, the vast majority of neurons exhibited a single, onset action potential in response to suprathreshold depolarization. This spiking pattern is typical of MSO neurons in other species and is generated from a complement of Kv1, Kv3, and IH currents. In vivo, mouse MSO neurons show bilateral excitatory and inhibitory tuning as well as an improvement in temporal acuity of spiking during bilateral acoustic stimulation. The combination of classical MSO features like those observed in gerbils with more unique features similar to those observed in bats and opossums make the mouse MSO an interesting model for exploiting genetic tools to test hypotheses about the molecular mechanisms and evolution of ITD processing.
Collapse
Affiliation(s)
- Matthew J Fischl
- Division of Neurobiology, Department of Biology II, Ludwig Maximilian University Munich, Planegg-Martinsried, Germany
| | - R Michael Burger
- Department of Biological Sciences, Lehigh University, Bethlehem, Pennsylvania; and
| | - Myriam Schmidt-Pauly
- Division of Neurobiology, Department of Biology II, Ludwig Maximilian University Munich, Planegg-Martinsried, Germany
| | - Olga Alexandrova
- Division of Neurobiology, Department of Biology II, Ludwig Maximilian University Munich, Planegg-Martinsried, Germany
| | - James L Sinclair
- Division of Neurobiology, Department of Biology II, Ludwig Maximilian University Munich, Planegg-Martinsried, Germany
| | - Benedikt Grothe
- Division of Neurobiology, Department of Biology II, Ludwig Maximilian University Munich, Planegg-Martinsried, Germany
| | - Ian D Forsythe
- Department of Neuroscience, Psychology, and Behaviour, University of Leicester, Leicester, United Kingdom
| | - Conny Kopp-Scheinpflug
- Division of Neurobiology, Department of Biology II, Ludwig Maximilian University Munich, Planegg-Martinsried, Germany;
| |
Collapse
|
20
|
Abstract
In an increasing number of countries, the standard treatment for deaf individuals is moving toward the implantation of two cochlear implants. Today's device technology and fitting procedure, however, appears as if the two implants would serve two independent ears and brains. Many experimental studies have demonstrated that after careful matching and balancing of left and right stimulation in controlled laboratory studies most patients have almost normal sensitivity to interaural level differences and some sensitivity to interaural time differences (ITDs). Mechanisms underlying the limited ITD sensitivity are still poorly understood and many different aspects may contribute. Recent pioneering computational approaches identified some of the functional implications the electric input imposes on the neural brainstem circuits. Simultaneously these studies have raised new questions and certainly demonstrated that further refinement of the model stages is necessary. They join the experimental study's conclusions that binaural device technology, binaural fitting, specific speech coding strategies, and binaural signal processing algorithms are obviously missing components to maximize the benefit of bilateral implantation. Within this review, the existing models of the electrically stimulated binaural system are explained, compared, and discussed from a viewpoint of a "CI device with auditory system" and from that of neurophysiological research.
Collapse
Affiliation(s)
- Mathias Dietz
- a Canada Research Chair in Binaural Hearing, National Centre for Audiology, Faculty of Health Sciences , Western University , London , Ontario , Canada
| |
Collapse
|
21
|
Dietz M, Wang L, Greenberg D, McAlpine D. Sensitivity to Interaural Time Differences Conveyed in the Stimulus Envelope: Estimating Inputs of Binaural Neurons Through the Temporal Analysis of Spike Trains. J Assoc Res Otolaryngol 2016; 17:313-30. [PMID: 27294694 DOI: 10.1007/s10162-016-0573-9] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2015] [Accepted: 05/30/2016] [Indexed: 01/03/2023] Open
Abstract
Sound-source localization in the horizontal plane relies on detecting small differences in the timing and level of the sound at the two ears, including differences in the timing of the modulated envelopes of high-frequency sounds (envelope interaural time differences (ITDs)). We investigated responses of single neurons in the inferior colliculus (IC) to a wide range of envelope ITDs and stimulus envelope shapes. By a novel means of visualizing neural activity relative to different portions of the periodic stimulus envelope at each ear, we demonstrate the role of neuron-specific excitatory and inhibitory inputs in creating ITD sensitivity (or the lack of it) depending on the specific shape of the stimulus envelope. The underlying binaural brain circuitry and synaptic parameters were modeled individually for each neuron to account for neuron-specific activity patterns. The model explains the effects of envelope shapes on sensitivity to envelope ITDs observed in both normal-hearing listeners and in neural data, and has consequences for understanding how ITD information in stimulus envelopes might be maximized in users of bilateral cochlear implants-for whom ITDs conveyed in the stimulus envelope are the only ITD cues available.
Collapse
Affiliation(s)
- Mathias Dietz
- Medizinische Physik and Cluster of Excellence Hearing4all, Universität Oldenburg, 26111, Oldenburg, Germany. .,UCL Ear Institute, 332 Gray's Inn Road, London, WC1X 8EE, UK. .,National Centre for Audiology, Faculty of Health Sciences, Western University, London, N6G 1H1, Ontario, Canada.
| | - Le Wang
- Center for Computational Neuroscience and Neural Technology, Boston University, Boston, MA, 02215, USA
| | - David Greenberg
- UCL Ear Institute, 332 Gray's Inn Road, London, WC1X 8EE, UK
| | - David McAlpine
- UCL Ear Institute, 332 Gray's Inn Road, London, WC1X 8EE, UK.,Dept. of Lingustics, Australian Hearing Hub, Macquarie University, Sydney, NSW, 2109, Australia
| |
Collapse
|
22
|
Reed DK, Dietz M, Josupeit A, van de Par S. Lateralization of stimuli with alternating interaural time differences: The role of monaural envelope cues. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2016; 139:30-40. [PMID: 26827002 DOI: 10.1121/1.4938018] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
A temporally acute binaural system can help to resolve inherent fluctuations in binaural information that are often present in complex auditory scenes. Using a broadband noise stimulus that rapidly alternates between two different values of interaural time difference (ITD), the ability of the binaural system to hear the lateral position resulting from one of the ITD values was investigated. Results show that listeners are able to accurately lateralize brief noise tokens of only 3-7 ms in duration. In two subsequent experiments, the role of an amplitude modulation (AM) imposed on the ITD-switching stimulus used in the first experiment was tested. For wideband stimuli, the temporal position of the ITD target relative to the phase of the AM did not influence absolute lateralization or detection performance. When the stimuli were narrowband, however, detection of the ITD target was best when temporally positioned in the rising portion of the AM. These experiments illustrate that the auditory system is capable of making accurate lateral estimates of very brief moments of ITD information. Furthermore, for these instantaneous changes in ITD information, the stimulus bandwidth can influence the role of envelope cues for the readout of binaural information.
Collapse
Affiliation(s)
- Darrin K Reed
- Acoustics Group, Forschungszentrum Neurosensorik, Cluster of Excellence Hearing4all Universität Oldenburg, 26111 Oldenburg, Germany
| | - Mathias Dietz
- Medizinische Physik, Cluster of Excellence Hearing4all Universität Oldenburg, 26111 Oldenburg, Germany
| | - Angela Josupeit
- Medizinische Physik, Cluster of Excellence Hearing4all Universität Oldenburg, 26111 Oldenburg, Germany
| | - Steven van de Par
- Acoustics Group, Forschungszentrum Neurosensorik, Cluster of Excellence Hearing4all Universität Oldenburg, 26111 Oldenburg, Germany
| |
Collapse
|
23
|
Monaghan JJM, Bleeck S, McAlpine D. Sensitivity to Envelope Interaural Time Differences at High Modulation Rates. Trends Hear 2015; 19:2331216515619331. [PMID: 26721926 PMCID: PMC4871209 DOI: 10.1177/2331216515619331] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/03/2022] Open
Abstract
Sensitivity to interaural time differences (ITDs) conveyed in the temporal fine structure of low-frequency tones and the modulated envelopes of high-frequency sounds are considered comparable, particularly for envelopes shaped to transmit similar fidelity of temporal information normally present for low-frequency sounds. Nevertheless, discrimination performance for envelope modulation rates above a few hundred Hertz is reported to be poor-to the point of discrimination thresholds being unattainable-compared with the much higher (>1,000 Hz) limit for low-frequency ITD sensitivity, suggesting the presence of a low-pass filter in the envelope domain. Further, performance for identical modulation rates appears to decline with increasing carrier frequency, supporting the view that the low-pass characteristics observed for envelope ITD processing is carrier-frequency dependent. Here, we assessed listeners' sensitivity to ITDs conveyed in pure tones and in the modulated envelopes of high-frequency tones. ITD discrimination for the modulated high-frequency tones was measured as a function of both modulation rate and carrier frequency. Some well-trained listeners appear able to discriminate ITDs extremely well, even at modulation rates well beyond 500 Hz, for 4-kHz carriers. For one listener, thresholds were even obtained for a modulation rate of 800 Hz. The highest modulation rate for which thresholds could be obtained declined with increasing carrier frequency for all listeners. At 10 kHz, the highest modulation rate at which thresholds could be obtained was 600 Hz. The upper limit of sensitivity to ITDs conveyed in the envelope of high-frequency modulated sounds appears to be higher than previously considered.
Collapse
Affiliation(s)
| | - Stefan Bleeck
- Institute of Sound and Vibration Research, University of Southampton, UK
| | | |
Collapse
|
24
|
A Neural Model of Auditory Space Compatible with Human Perception under Simulated Echoic Conditions. PLoS One 2015; 10:e0137900. [PMID: 26355676 PMCID: PMC4565656 DOI: 10.1371/journal.pone.0137900] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2015] [Accepted: 08/22/2015] [Indexed: 11/19/2022] Open
Abstract
In a typical auditory scene, sounds from different sources and reflective surfaces summate in the ears, causing spatial cues to fluctuate. Prevailing hypotheses of how spatial locations may be encoded and represented across auditory neurons generally disregard these fluctuations and must therefore invoke additional mechanisms for detecting and representing them. Here, we consider a different hypothesis in which spatial perception corresponds to an intermediate or sub-maximal firing probability across spatially selective neurons within each hemisphere. The precedence or Haas effect presents an ideal opportunity for examining this hypothesis, since the temporal superposition of an acoustical reflection with sounds arriving directly from a source can cause otherwise stable cues to fluctuate. Our findings suggest that subjects’ experiences may simply reflect the spatial cues that momentarily arise under various acoustical conditions and how these cues are represented. We further suggest that auditory objects may acquire “edges” under conditions when interaural time differences are broadly distributed.
Collapse
|
25
|
Diedesch AC, Stecker GC. Temporal weighting of binaural information at low frequencies: Discrimination of dynamic interaural time and level differences. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2015; 138:125-133. [PMID: 26233013 PMCID: PMC4499054 DOI: 10.1121/1.4922327] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/17/2014] [Revised: 05/15/2015] [Accepted: 05/27/2015] [Indexed: 05/29/2023]
Abstract
The importance of sound onsets in binaural hearing has been addressed in many studies, particularly at high frequencies, where the onset of the envelope may carry much of the useful binaural information. Some studies suggest that sound onsets might play a similar role in the processing of binaural cues [e.g., fine-structure interaural time differences (ITD)] at low frequencies. This study measured listeners' sensitivity to ITD and interaural level differences (ILD) present in early (i.e., onset) and late parts of 80-ms pure tones of 250-, 500-, and 1000-Hz frequency. Following previous studies, tones carried static interaural cues or dynamic cues that peaked at sound onset and diminished to zero at sound offset or vice versa. Although better thresholds were observed in static than dynamic conditions overall, ITD discrimination was especially impaired, regardless of frequency, when cues were not available at sound onset. Results for ILD followed a similar pattern at 1000 Hz; at lower frequencies, ILD thresholds did not differ significantly between dynamic-cue conditions. The results support the "onset" hypothesis of Houtgast and Plomp [(1968). J. Acoust. Soc. Am. 44, 807-812] for ITD discrimination, but not necessarily ILD discrimination, in low-frequency pure tones.
Collapse
Affiliation(s)
- Anna C Diedesch
- Department of Hearing and Speech Sciences, Vanderbilt University School of Medicine, 1215 21st Avenue South, Nashville, Tennessee 37232, USA
| | - G Christopher Stecker
- Department of Hearing and Speech Sciences, Vanderbilt University School of Medicine, 1215 21st Avenue South, Nashville, Tennessee 37232, USA
| |
Collapse
|
26
|
Dietz M, Klein-Hennig M, Hohmann V. The influence of pause, attack, and decay duration of the ongoing envelope on sound lateralization. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2015; 137:EL137-43. [PMID: 25698041 DOI: 10.1121/1.4905891] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/16/2023]
Abstract
Klein-Hennig et al. [J. Acoust. Soc. Am. 129, 3856-3872 (2011)] introduced a class of high-frequency stimuli for which the envelope shape can be altered by independently varying the attack, hold, decay, and pause durations. These stimuli, originally employed for testing the shape dependence of human listeners' sensitivity to interaural temporal differences (ITDs) in the ongoing envelope, were used to measure the lateralization produced by fixed interaural disparities. Consistent with the threshold ITD data, a steep attack and a non-zero pause facilitate strong ITD-based lateralization. In contrast, those conditions resulted in the smallest interaural level-based lateralization.
Collapse
Affiliation(s)
- Mathias Dietz
- Medizinische Physik and Cluster of Excellence Hearing4all, Universität Oldenburg, D-26111 Oldenburg, Germany , ,
| | - Martin Klein-Hennig
- Medizinische Physik and Cluster of Excellence Hearing4all, Universität Oldenburg, D-26111 Oldenburg, Germany , ,
| | - Volker Hohmann
- Medizinische Physik and Cluster of Excellence Hearing4all, Universität Oldenburg, D-26111 Oldenburg, Germany , ,
| |
Collapse
|
27
|
Stecker GC. Temporal weighting functions for interaural time and level differences. IV. Effects of carrier frequency. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2014; 136:3221. [PMID: 25480069 PMCID: PMC4257961 DOI: 10.1121/1.4900827] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/24/2014] [Revised: 10/10/2014] [Accepted: 10/20/2014] [Indexed: 05/29/2023]
Abstract
Temporal variation in listeners' sensitivity to interaural time and level differences (ITD and ILD, respectively) was measured for sounds of different carrier frequency using the temporal weighting function (TWF) paradigm [Stecker and Hafter (2002) J. Acoust. Soc. Am. 112,1046-1057]. Listeners made lateralization judgments following brief trains of filtered impulses (Gabor clicks) presented over headphones with overall ITD and/or ILD ranging from ±500 μs ITD and/or ±5 dB ILD across trials. Individual clicks within each train varied by an additional ±100 μs ITD or ±2 dB ILD to allow TWF calculation by multiple regression. In separate conditions, TWFs were measured for carrier frequencies of 1, 2, 4, and 8 kHz. Consistent with past studies, TWFs demonstrated high weight on the first click for stimuli with short interclick interval (ICI = 2 ms), but flatter weighting for longer ICI (5-10 ms). Some conditions additionally demonstrated greater weight for clicks near the offset than near the middle of the train. Results support a primary role of the auditory periphery in emphasizing onset and offset cues in rapidly modulated low-frequency sounds. For slower modulations, sensitivity to ongoing high-frequency ILD and low-frequency ITD cues appears subject to recency effects consistent with the effects of leaky temporal integration of binaural information.
Collapse
Affiliation(s)
- G Christopher Stecker
- Department of Hearing and Speech Sciences, Vanderbilt University Medical Center, 1215 21st Avenue South, Nashville, Tennessee 37232
| |
Collapse
|
28
|
Grothe B, Pecka M. The natural history of sound localization in mammals--a story of neuronal inhibition. Front Neural Circuits 2014; 8:116. [PMID: 25324726 PMCID: PMC4181121 DOI: 10.3389/fncir.2014.00116] [Citation(s) in RCA: 101] [Impact Index Per Article: 10.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2014] [Accepted: 09/01/2014] [Indexed: 12/14/2022] Open
Abstract
Our concepts of sound localization in the vertebrate brain are widely based on the general assumption that both the ability to detect air-borne sounds and the neuronal processing are homologous in archosaurs (present day crocodiles and birds) and mammals. Yet studies repeatedly report conflicting results on the neuronal circuits and mechanisms, in particular the role of inhibition, as well as the coding strategies between avian and mammalian model systems. Here we argue that mammalian and avian phylogeny of spatial hearing is characterized by a convergent evolution of hearing air-borne sounds rather than by homology. In particular, the different evolutionary origins of tympanic ears and the different availability of binaural cues in early mammals and archosaurs imposed distinct constraints on the respective binaural processing mechanisms. The role of synaptic inhibition in generating binaural spatial sensitivity in mammals is highlighted, as it reveals a unifying principle of mammalian circuit design for encoding sound position. Together, we combine evolutionary, anatomical and physiological arguments for making a clear distinction between mammalian processing mechanisms and coding strategies and those of archosaurs. We emphasize that a consideration of the convergent nature of neuronal mechanisms will significantly increase the explanatory power of studies of spatial processing in both mammals and birds.
Collapse
Affiliation(s)
- Benedikt Grothe
- Division of Neurobiology, Department of Biology II, Ludwig Maximilians University Munich Munich, Germany
| | - Michael Pecka
- Division of Neurobiology, Department of Biology II, Ludwig Maximilians University Munich Munich, Germany
| |
Collapse
|