1
|
Carney LH. Neural Fluctuation Contrast as a Code for Complex Sounds: The Role and Control of Peripheral Nonlinearities. Hear Res 2024; 443:108966. [PMID: 38310710 PMCID: PMC10923127 DOI: 10.1016/j.heares.2024.108966] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/30/2023] [Revised: 01/14/2024] [Accepted: 01/26/2024] [Indexed: 02/06/2024]
Abstract
The nonlinearities of the inner ear are often considered to be obstacles that the central nervous system has to overcome to decode neural responses to sounds. This review describes how peripheral nonlinearities, such as saturation of the inner-hair-cell response and of the IHC-auditory-nerve synapse, are instead beneficial to the neural encoding of complex sounds such as speech. These nonlinearities set up contrast in the depth of neural-fluctuations in auditory-nerve responses along the tonotopic axis, referred to here as neural fluctuation contrast (NFC). Physiological support for the NFC coding hypothesis is reviewed, and predictions of several psychophysical phenomena, including masked detection and speech intelligibility, are presented. Lastly, a framework based on the NFC code for understanding how the medial olivocochlear (MOC) efferent system contributes to the coding of complex sounds is presented. By modulating cochlear gain control in response to both sound energy and fluctuations in neural responses, the MOC system is hypothesized to function not as a simple feedback gain-control device, but rather as a mechanism for enhancing NFC along the tonotopic axis, enabling robust encoding of complex sounds across a wide range of sound levels and in the presence of background noise. Effects of sensorineural hearing loss on the NFC code and on the MOC feedback system are presented and discussed.
Collapse
Affiliation(s)
- Laurel H Carney
- Depts. of Biomedical Engineering, Neuroscience, and Electrical & Computer Engineering University of Rochester, Rochester, NY, USA.
| |
Collapse
|
2
|
Otsuka S, Furukawa S. Conversion of amplitude modulation to phase modulation in the human cochlea. Hear Res 2021; 408:108274. [PMID: 34237495 DOI: 10.1016/j.heares.2021.108274] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/14/2020] [Revised: 04/26/2021] [Accepted: 05/12/2021] [Indexed: 11/17/2022]
Abstract
When an amplitude modulated signal with a constant-frequency carrier is fed into a generic nonlinear amplifier, the phase of the carrier of the output signal is also modulated. This phenomenon is referred to as amplitude-modulation-to-phase-modulation (AM-to-PM) conversion and regarded as an unwanted signal distortion in the field of electro-communication engineering. Herein, we offer evidence that AM-to-PM conversion also occurs in the human cochlea and that listeners can use the PM information effectively to process the AM of sounds. We recorded otoacoustic emissions (OAEs) evoked by AM signals. The results showed that the OAE phase was modulated at the same rate as the stimulus modulation. The magnitude of the AM-induced PM of the OAE peaked generally around the stimulus level corresponding to the compression point of individual cochlear input-output functions, as estimated using a psychoacoustic method. A computational cochlear model incorporating a nonlinear active process replicates the abovementioned key features of the AM-induced PM observed in OAEs. These results indicate that AM-induced PM occurring at the cochlear partition can be estimated by measuring OAEs. Psychophysical experiments further revealed that, for individuals with higher sensitivity to PM, the PM magnitude is correlated with AM-detection performance. This result implies that the AM-induced PM information cannot be a dominant cue for AM detection, but listeners with higher sensitivity may partly rely on the AM-induced PM cue.
Collapse
Affiliation(s)
- Sho Otsuka
- Center for Frontier Medical Engineering, Chiba Univ. 1-33 Yayoicho, Inageku, Chiba 263-8522, Japan; NTT Communication Science Laboratories, NTT Corporation, 3-1, Morinosato-Wakamiya, Atsugi-shi, 243-01 Japan.
| | - Shigeto Furukawa
- NTT Communication Science Laboratories, NTT Corporation, 3-1, Morinosato-Wakamiya, Atsugi-shi, 243-01 Japan
| |
Collapse
|
3
|
Jennings SG. The role of the medial olivocochlear reflex in psychophysical masking and intensity resolution in humans: a review. J Neurophysiol 2021; 125:2279-2308. [PMID: 33909513 PMCID: PMC8285664 DOI: 10.1152/jn.00672.2020] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2020] [Revised: 03/16/2021] [Accepted: 04/02/2021] [Indexed: 02/01/2023] Open
Abstract
This review addresses the putative role of the medial olivocochlear (MOC) reflex in psychophysical masking and intensity resolution in humans. A framework for interpreting psychophysical results in terms of the expected influence of the MOC reflex is introduced. This framework is used to review the effects of a precursor or contralateral acoustic stimulation on 1) simultaneous masking of brief tones, 2) behavioral estimates of cochlear gain and frequency resolution in forward masking, 3) the buildup and decay of forward masking, and 4) measures of intensity resolution. Support, or lack thereof, for a role of the MOC reflex in psychophysical perception is discussed in terms of studies on estimates of MOC strength from otoacoustic emissions and the effects of resection of the olivocochlear bundle in patients with vestibular neurectomy. Novel, innovative approaches are needed to resolve the dissatisfying conclusion that current results are unable to definitively confirm or refute the role of the MOC reflex in masking and intensity resolution.
Collapse
Affiliation(s)
- Skyler G Jennings
- Department of Communication Sciences and Disorders, The University of Utah, Salt Lake City, Utah
| |
Collapse
|
4
|
Phase Locking of Auditory-Nerve Fibers Reveals Stereotyped Distortions and an Exponential Transfer Function with a Level-Dependent Slope. J Neurosci 2019; 39:4077-4099. [PMID: 30867259 DOI: 10.1523/jneurosci.1801-18.2019] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2018] [Revised: 02/28/2019] [Accepted: 03/07/2019] [Indexed: 12/16/2022] Open
Abstract
Phase locking of auditory-nerve-fiber (ANF) responses to the fine structure of acoustic stimuli is a hallmark of the auditory system's temporal precision and is important for many aspects of hearing. Period histograms from phase-locked ANF responses to low-frequency tones exhibit spike-rate and temporal asymmetries, but otherwise retain an approximately sinusoidal shape as stimulus level increases, even beyond the level at which the mean spike rate saturates. This is intriguing because apical cochlear mechanical vibrations show little compression, and mechanoelectrical transduction in the receptor cells is thought to obey a static sigmoidal nonlinearity, which might be expected to produce peak clipping at moderate and high stimulus levels. Here we analyze phase-locked responses of ANFs from cats of both sexes. We show that the lack of peak clipping is due neither to ANF refractoriness nor to spike-rate adaptation on time scales longer than the stimulus period. We demonstrate that the relationship between instantaneous pressure and instantaneous rate is well described by an exponential function whose slope decreases with increasing stimulus level. Relatively stereotyped harmonic distortions in the input to the exponential can account for the temporal asymmetry of the period histograms, including peak splitting. We show that the model accounts for published membrane-potential waveforms when assuming a power-of-three, but not a power-of-one, relationship to exocytosis. Finally, we demonstrate the relationship between the exponential transfer functions and the sigmoidal pseudotransducer functions obtained in the literature by plotting the maxima and minima of the voltage responses against the maxima and minima of the stimuli.SIGNIFICANCE STATEMENT Phase locking of auditory-nerve-fiber responses to the temporal fine structure of acoustic stimuli is important for many aspects of hearing, but the mechanisms underlying phase locking are not fully understood. Intriguingly, period histograms retain an approximately sinusoidal shape across sound levels, even when the mean rate has saturated. We find that neither refractoriness nor spike-rate adaptation is responsible for this behavior. Instead, the peripheral auditory system operates as though it contains an exponential transfer function whose slope changes with stimulus level. The underlying mechanism is distinct from the comparatively weak cochlear mechanical compression in the cochlear apex, and likely resides in the receptor cells.
Collapse
|
5
|
Carney LH. Supra-Threshold Hearing and Fluctuation Profiles: Implications for Sensorineural and Hidden Hearing Loss. J Assoc Res Otolaryngol 2018; 19:331-352. [PMID: 29744729 PMCID: PMC6081887 DOI: 10.1007/s10162-018-0669-5] [Citation(s) in RCA: 93] [Impact Index Per Article: 15.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2017] [Accepted: 04/19/2018] [Indexed: 12/22/2022] Open
Abstract
An important topic in contemporary auditory science is supra-threshold hearing. Difficulty hearing at conversational speech levels in background noise has long been recognized as a problem of sensorineural hearing loss, including that associated with aging (presbyacusis). Such difficulty in listeners with normal thresholds has received more attention recently, especially associated with descriptions of synaptopathy, the loss of auditory nerve (AN) fibers as a result of noise exposure or aging. Synaptopathy has been reported to cause a disproportionate loss of low- and medium-spontaneous rate (L/MSR) AN fibers. Several studies of synaptopathy have assumed that the wide dynamic ranges of L/MSR AN fiber rates are critical for coding supra-threshold sounds. First, this review will present data from the literature that argues against a direct role for average discharge rates of L/MSR AN fibers in coding sounds at moderate to high sound levels. Second, the encoding of sounds at supra-threshold levels is examined. A key assumption in many studies is that saturation of AN fiber discharge rates limits neural encoding, even though the majority of AN fibers, high-spontaneous rate (HSR) fibers, have saturated average rates at conversational sound levels. It is argued here that the cross-frequency profile of low-frequency neural fluctuation amplitudes, not average rates, encodes complex sounds. As described below, this fluctuation-profile coding mechanism benefits from both saturation of inner hair cell (IHC) transduction and average rate saturation associated with the IHC-AN synapse. Third, the role of the auditory efferent system, which receives inputs from L/MSR fibers, is revisited in the context of fluctuation-profile coding. The auditory efferent system is hypothesized to maintain and enhance neural fluctuation profiles. Lastly, central mechanisms sensitive to neural fluctuations are reviewed. Low-frequency fluctuations in AN responses are accentuated by cochlear nucleus neurons which, either directly or via other brainstem nuclei, relay fluctuation profiles to the inferior colliculus (IC). IC neurons are sensitive to the frequency and amplitude of low-frequency fluctuations and convert fluctuation profiles from the periphery into a phase-locked rate profile that is robust across a wide range of sound levels and in background noise. The descending projection from the midbrain (IC) to the efferent system completes a functional loop that, combined with inputs from the L/MSR pathway, is hypothesized to maintain "sharp" supra-threshold hearing, reminiscent of visual mechanisms that regulate optical accommodation. Examples from speech coding and detection in noise are reviewed. Implications for the effects of synaptopathy on control mechanisms hypothesized to influence supra-threshold hearing are discussed. This framework for understanding neural coding and control mechanisms for supra-threshold hearing suggests strategies for the design of novel hearing aid signal-processing and electrical stimulation patterns for cochlear implants.
Collapse
Affiliation(s)
- Laurel H Carney
- Departments of Biomedical Engineering, Neuroscience, and Electrical & Computer Engineering, Del Monte Institute for Neuroscience, University of Rochester, 601 Elmwood Ave., Box 603, Rochester, NY, 14642, USA.
| |
Collapse
|
6
|
Jennings SG, Chen J, Fultz SE, Ahlstrom JB, Dubno JR. Amplitude modulation detection with a short-duration carrier: Effects of a precursor and hearing loss. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2018; 143:2232. [PMID: 29716275 PMCID: PMC5908713 DOI: 10.1121/1.5031122] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/19/2017] [Revised: 03/26/2018] [Accepted: 03/27/2018] [Indexed: 05/28/2023]
Abstract
This study tests the hypothesis that amplitude modulation (AM) detection will be better under conditions where basilar membrane (BM) response growth is expected to be linear rather than compressive. This hypothesis was tested by (1) comparing AM detection for a tonal carrier as a function of carrier level for subjects with and without cochlear hearing impairment (HI), and by (2) comparing AM detection for carriers presented with and without an ipsilateral notched-noise precursor, under the assumption that the precursor linearizes BM responses. Average AM detection thresholds were approximately 5 dB better for subjects with HI than for subjects with normal hearing (NH) at moderate-level carriers. Average AM detection for low-to-moderate level carriers was approximately 2 dB better with the precursor than without the precursor for subjects with NH, whereas precursor effects were absent or smaller for subjects with HI. Although effect sizes were small and individual differences were noted, group differences are consistent with better AM detection for conditions where BM responses are less compressive due to cochlear hearing loss or due to a reduction in cochlear gain. These findings suggest the auditory system may quickly adjust to the local soundscape to increase effective AM depth and improve signal-to-noise ratios.
Collapse
Affiliation(s)
- Skyler G Jennings
- Department of Communication Sciences and Disorders, The University of Utah, 390 South, 1530 East, BEHS 1201, Salt Lake City, Utah 84112, USA
| | - Jessica Chen
- Department of Communication Sciences and Disorders, The University of Utah, 390 South, 1530 East, BEHS 1201, Salt Lake City, Utah 84112, USA
| | - Sara E Fultz
- Department of Otolaryngology-Head and Neck Surgery, Medical University of South Carolina, 135 Rutledge Avenue, MSC 550, Charleston, South Carolina 29425-5500, USA
| | - Jayne B Ahlstrom
- Department of Otolaryngology-Head and Neck Surgery, Medical University of South Carolina, 135 Rutledge Avenue, MSC 550, Charleston, South Carolina 29425-5500, USA
| | - Judy R Dubno
- Department of Otolaryngology-Head and Neck Surgery, Medical University of South Carolina, 135 Rutledge Avenue, MSC 550, Charleston, South Carolina 29425-5500, USA
| |
Collapse
|
7
|
Ashida G, Tollin DJ, Kretzberg J. Physiological models of the lateral superior olive. PLoS Comput Biol 2017; 13:e1005903. [PMID: 29281618 PMCID: PMC5744914 DOI: 10.1371/journal.pcbi.1005903] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2017] [Accepted: 11/28/2017] [Indexed: 01/09/2023] Open
Abstract
In computational biology, modeling is a fundamental tool for formulating, analyzing and predicting complex phenomena. Most neuron models, however, are designed to reproduce certain small sets of empirical data. Hence their outcome is usually not compatible or comparable with other models or datasets, making it unclear how widely applicable such models are. In this study, we investigate these aspects of modeling, namely credibility and generalizability, with a specific focus on auditory neurons involved in the localization of sound sources. The primary cues for binaural sound localization are comprised of interaural time and level differences (ITD/ILD), which are the timing and intensity differences of the sound waves arriving at the two ears. The lateral superior olive (LSO) in the auditory brainstem is one of the locations where such acoustic information is first computed. An LSO neuron receives temporally structured excitatory and inhibitory synaptic inputs that are driven by ipsi- and contralateral sound stimuli, respectively, and changes its spike rate according to binaural acoustic differences. Here we examine seven contemporary models of LSO neurons with different levels of biophysical complexity, from predominantly functional ones (‘shot-noise’ models) to those with more detailed physiological components (variations of integrate-and-fire and Hodgkin-Huxley-type). These models, calibrated to reproduce known monaural and binaural characteristics of LSO, generate largely similar results to each other in simulating ITD and ILD coding. Our comparisons of physiological detail, computational efficiency, predictive performances, and further expandability of the models demonstrate (1) that the simplistic, functional LSO models are suitable for applications where low computational costs and mathematical transparency are needed, (2) that more complex models with detailed membrane potential dynamics are necessary for simulation studies where sub-neuronal nonlinear processes play important roles, and (3) that, for general purposes, intermediate models might be a reasonable compromise between simplicity and biological plausibility. Computational models help our understanding of complex biological systems, by identifying their key elements and revealing their operational principles. Close comparisons between model predictions and empirical observations ensure our confidence in a model as a building block for further applications. Most current neuronal models, however, are constructed to replicate only a small specific set of experimental data. Thus, it is usually unclear how these models can be generalized to different datasets and how they compare with each other. In this paper, seven neuronal models are examined that are designed to reproduce known physiological characteristics of auditory neurons involved in the detection of sound source location. Despite their different levels of complexity, the models generate largely similar results when their parameters are tuned with common criteria. Comparisons show that simple models are computationally more efficient and theoretically transparent, and therefore suitable for rigorous mathematical analyses and engineering applications including real-time simulations. In contrast, complex models are necessary for investigating the relationship between underlying biophysical processes and sub- and suprathreshold spiking properties, although they have a large number of unconstrained, unverified parameters. Having identified their advantages and drawbacks, these auditory neuron models may readily be used for future studies and applications.
Collapse
Affiliation(s)
- Go Ashida
- Cluster of Excellence "Hearing4all", Department of Neuroscience, University of Oldenburg, Oldenburg, Germany
| | - Daniel J Tollin
- Department of Physiology and Biophysics, University of Colorado School of Medicine, Aurora, Colorado, United States of America
| | - Jutta Kretzberg
- Cluster of Excellence "Hearing4all", Department of Neuroscience, University of Oldenburg, Oldenburg, Germany
| |
Collapse
|
8
|
Maturation of Spontaneous Firing Properties after Hearing Onset in Rat Auditory Nerve Fibers: Spontaneous Rates, Refractoriness, and Interfiber Correlations. J Neurosci 2017; 36:10584-10597. [PMID: 27733610 DOI: 10.1523/jneurosci.1187-16.2016] [Citation(s) in RCA: 39] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2016] [Accepted: 08/23/2016] [Indexed: 12/14/2022] Open
Abstract
Auditory nerve fibers (ANFs) exhibit a range of spontaneous firing rates (SRs) that are inversely correlated with threshold for sounds. To probe the underlying mechanisms and time course of SR differentiation during cochlear maturation, loose-patch extracellular recordings were made from ANF dendrites using acutely excised rat cochlear preparations of different ages after hearing onset. Diversification of SRs occurred mostly between the second and the third postnatal week. Statistical properties of ANF spike trains showed developmental changes that approach adult-like features in older preparations. Comparison with intracellularly recorded EPSCs revealed that most properties of ANF spike trains derive from the characteristics of presynaptic transmitter release. Pharmacological tests and waveform analysis showed that endogenous firing produces some fraction of ANF spikes, accounting for their unusual properties; the endogenous firing diminishes gradually during maturation. Paired recordings showed that ANFs contacting the same inner hair cell could have different SRs, with no correlation in their spike timing. SIGNIFICANCE STATEMENT The inner hair cell (IHC)/auditory nerve fiber (ANF) synapse is the first synapse of the auditory pathway. Remarkably, each IHC is the sole partner of 10-30 ANFs with a range of spontaneous firing rates (SRs). Low and high SR ANFs respond to sound differently, and both are important for encoding sound information across varying acoustical environments. Here we demonstrate SR diversification after hearing onset by afferent recordings in acutely excised rat cochlear preparations. We describe developmental changes in spike train statistics and endogenous firing in immature ANFs. Dual afferent recordings provide the first direct evidence that fibers with different SRs contact the same IHCs and do not show correlated spike timing at rest. These results lay the groundwork for understanding the differential sensitivity of ANFs to acoustic trauma.
Collapse
|
9
|
Almishaal A, Bidelman GM, Jennings SG. Notched-noise precursors improve detection of low-frequency amplitude modulation. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2017; 141:324. [PMID: 28147582 PMCID: PMC5392086 DOI: 10.1121/1.4973912] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/07/2016] [Revised: 12/21/2016] [Accepted: 12/22/2016] [Indexed: 05/19/2023]
Abstract
Amplitude modulation (AM) detection was measured with a short (50 ms), high-frequency carrier as a function of carrier level (Experiment I) and modulation frequency (Experiment II) for conditions with or without a notched-noise precursor. A longer carrier (500 ms) was also included in Experiment I. When the carrier was preceded by silence (no precursor condition) AM detection thresholds worsened for moderate-level carriers compared to lower- or higher-level carriers, resulting in a "mid-level hump." AM detection thresholds with a precursor were better than those without a precursor, primarily for moderate-to-high level carriers, thus eliminating the mid-level hump in AM detection. When the carrier was 500 ms, AM thresholds improved by a constant (across all levels) relative to AM thresholds with a precursor, consistent with the longer carrier providing more "looks" to detect the AM signal. Experiment II revealed that improved AM detection with compared to without a precursor is limited to low-modulation frequencies (<60 Hz). These results are consistent with (1) a reduction in cochlear gain over the course of the precursor perhaps via the medial olivocochlear reflex or (2) a form of perceptual enhancement which may be mediated by adaptation of inhibition.
Collapse
Affiliation(s)
- Ali Almishaal
- Department of Communication Sciences and Disorders, The University of Utah, 390 South, 1530 East, Behavioral Sciences Building 1201, Salt Lake City, Utah 84112, USA
| | - Gavin M Bidelman
- School of Communication Sciences and Disorders and Institute for Intelligent Systems, University of Memphis, 4055 North Park Loop, Memphis, Tennessee 38152, USA
| | - Skyler G Jennings
- Department of Communication Sciences and Disorders, The University of Utah, 390 South, 1530 East, BEHS 1201, Salt Lake City, Utah 84112, USA
| |
Collapse
|
10
|
Otsuka S, Furukawa S, Yamagishi S, Hirota K, Kashino M. Relation Between Cochlear Mechanics and Performance of Temporal Fine Structure-Based Tasks. J Assoc Res Otolaryngol 2016; 17:541-557. [PMID: 27631508 PMCID: PMC5112215 DOI: 10.1007/s10162-016-0581-9] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2015] [Accepted: 08/09/2016] [Indexed: 12/01/2022] Open
Abstract
This study examined whether the mechanical characteristics of the cochlea could influence individual variation in the ability to use temporal fine structure (TFS) information. Cochlear mechanical functioning was evaluated by swept-tone evoked otoacoustic emissions (OAEs), which are thought to comprise linear reflection by micromechanical impedance perturbations, such as spatial variations in the number or geometry of outer hair cells, on the basilar membrane (BM). Low-rate (2 Hz) frequency modulation detection limens (FMDLs) were measured for carrier frequency of 1000 Hz and interaural phase difference (IPD) thresholds as indices of TFS sensitivity and high-rate (16 Hz) FMDLs and amplitude modulation detection limens (AMDLs) as indices of sensitivity to non-TFS cues. Significant correlations were found among low-rate FMDLs, low-rate AMDLs, and IPD thresholds (R = 0.47-0.59). A principal component analysis was used to show a common factor that could account for 81.1, 74.1, and 62.9 % of the variance in low-rate FMDLs, low-rate AMDLs, and IPD thresholds, respectively. An OAE feature, specifically a characteristic dip around 2-2.5 kHz in OAE spectra, showed a significant correlation with the common factor (R = 0.54). High-rate FMDLs and AMDLs were correlated with each other (R = 0.56) but not with the other measures. The results can be interpreted as indicating that (1) the low-rate AMDLs, as well as the IPD thresholds and low-rate FMDLs, depend on the use of TFS information coded in neural phase locking and (2) the use of TFS information is influenced by a particular aspect of cochlear mechanics, such as mechanical irregularity along the BM.
Collapse
Affiliation(s)
- Sho Otsuka
- Department of Human and Engineered Environmental Studies, Graduate School of Frontier Sciences, The University of Tokyo, Kashiwanoha, Kashiwa, Chiba 277-8563 Japan
- NTT Communication Science Laboratories, NTT Corporation, Morinosato Wakamiya, Atsugi, Kanagawa 243-0198 Japan
| | - Shigeto Furukawa
- NTT Communication Science Laboratories, NTT Corporation, Morinosato Wakamiya, Atsugi, Kanagawa 243-0198 Japan
| | - Shimpei Yamagishi
- Department of Information Processing, Interdisciplinary Graduate School of Science and Engineering, Tokyo Institute of Technology, Nagatsuta-cho, Midori-ku, Yokohama, Kanagawa 226-8503 Japan
| | - Koich Hirota
- Interfaculty Initiative in Information Studies/Graduate School of Interdisciplinary Information Studies, The University of Tokyo, Kashiwanoha, Kashiwa, Chiba 277-8563 Japan
| | - Makio Kashino
- NTT Communication Science Laboratories, NTT Corporation, Morinosato Wakamiya, Atsugi, Kanagawa 243-0198 Japan
- Department of Information Processing, Interdisciplinary Graduate School of Science and Engineering, Tokyo Institute of Technology, Nagatsuta-cho, Midori-ku, Yokohama, Kanagawa 226-8503 Japan
| |
Collapse
|
11
|
Roles for Coincidence Detection in Coding Amplitude-Modulated Sounds. PLoS Comput Biol 2016; 12:e1004997. [PMID: 27322612 PMCID: PMC4920552 DOI: 10.1371/journal.pcbi.1004997] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2016] [Accepted: 05/25/2016] [Indexed: 12/30/2022] Open
Abstract
Many sensory neurons encode temporal information by detecting coincident arrivals of synaptic inputs. In the mammalian auditory brainstem, binaural neurons of the medial superior olive (MSO) are known to act as coincidence detectors, whereas in the lateral superior olive (LSO) roles of coincidence detection have remained unclear. LSO neurons receive excitatory and inhibitory inputs driven by ipsilateral and contralateral acoustic stimuli, respectively, and vary their output spike rates according to interaural level differences. In addition, LSO neurons are also sensitive to binaural phase differences of low-frequency tones and envelopes of amplitude-modulated (AM) sounds. Previous physiological recordings in vivo found considerable variations in monaural AM-tuning across neurons. To investigate the underlying mechanisms of the observed temporal tuning properties of LSO and their sources of variability, we used a simple coincidence counting model and examined how specific parameters of coincidence detection affect monaural and binaural AM coding. Spike rates and phase-locking of evoked excitatory and spontaneous inhibitory inputs had only minor effects on LSO output to monaural AM inputs. In contrast, the coincidence threshold of the model neuron affected both the overall spike rates and the half-peak positions of the AM-tuning curve, whereas the width of the coincidence window merely influenced the output spike rates. The duration of the refractory period affected only the low-frequency portion of the monaural AM-tuning curve. Unlike monaural AM coding, temporal factors, such as the coincidence window and the effective duration of inhibition, played a major role in determining the trough positions of simulated binaural phase-response curves. In addition, empirically-observed level-dependence of binaural phase-coding was reproduced in the framework of our minimalistic coincidence counting model. These modeling results suggest that coincidence detection of excitatory and inhibitory synaptic inputs is essential for LSO neurons to encode both monaural and binaural AM sounds. Detecting coincident arrivals of synaptic inputs is a shared fundamental property of many sensory neurons. Such 'coincidence detection' usually refers to the detection of synchronized excitatory inputs only. Experimental evidence, however, indicated that some auditory neurons are also sensitive to the relative timing of excitatory and inhibitory synaptic inputs. This type of sensitivity is suggested to be important for coding temporal information of amplitude-modulated sounds, such as speech and other naturalistic sounds. In this study, we used a minimal model of coincidence detection to identify the key elements for temporal information processing. Our series of simulations demonstrated that (1) the threshold and time window for coincidence detection were the major factors for determining the response properties to excitatory inputs, and that (2) timed interactions between excitatory and inhibitory synaptic inputs are responsible for determining the temporal tuning properties of the neuron. These results suggest that coincidence detection is an essential function of neurons that detect the 'anti-coincidence' of excitatory and inhibitory inputs to encode temporal information of sounds.
Collapse
|
12
|
Roverud E, Strickland EA. The effects of ipsilateral, contralateral, and bilateral broadband noise on the mid-level hump in intensity discrimination. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2015; 138:3245-3261. [PMID: 26627798 PMCID: PMC4662679 DOI: 10.1121/1.4935515] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/05/2015] [Revised: 10/21/2015] [Accepted: 10/28/2015] [Indexed: 05/29/2023]
Abstract
Previous psychoacoustical and physiological studies indicate that the medial olivocochlear reflex (MOCR), a bilateral, sound-evoked reflex, may lead to improved sound intensity discrimination in background noise. The MOCR can decrease the range of basilar-membrane compression and can counteract effects of neural adaptation from background noise. However, the contribution of these processes to intensity discrimination is not well understood. This study examined the effect of ipsilateral, contralateral, and bilateral noise on the "mid-level hump." The mid-level hump refers to intensity discrimination Weber fractions (WFs) measured for short-duration, high-frequency tones which are poorer at mid levels than at lower or higher levels. The mid-level hump WFs may reflect a limitation due to basilar-membrane compression, and thus may be decreased by the MOCR. The noise was either short (50 ms) or long (150 ms), with the long noise intended to elicit the sluggish MOCR. For a tone in quiet, mid-level hump WFs improved with ipsilateral noise for most listeners, but not with contralateral noise. For a tone in ipsilateral noise, WFs improved with contralateral noise for most listeners, but only when both noises were long. These results are consistent with MOCR-induced WF improvements, possibly via decreases in effects of compression and neural adaptation.
Collapse
Affiliation(s)
- Elin Roverud
- Department of Speech, Language, and Hearing Sciences, Purdue University, 715 Clinic Drive, West Lafayette, Indiana 47907, USA
| | - Elizabeth A Strickland
- Department of Speech, Language, and Hearing Sciences, Purdue University, 715 Clinic Drive, West Lafayette, Indiana 47907, USA
| |
Collapse
|
13
|
Roverud E, Strickland EA. Exploring the source of the mid-level hump for intensity discrimination in quiet and the effects of noise. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2015; 137:1318-35. [PMID: 25786945 PMCID: PMC4368585 DOI: 10.1121/1.4908243] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/27/2014] [Revised: 01/28/2015] [Accepted: 01/30/2015] [Indexed: 05/29/2023]
Abstract
Intensity discrimination Weber fractions (WFs) measured for short, high-frequency tones in quiet are larger at mid levels than at lower or higher levels. The source of this "mid-level hump" is a matter of debate. One theory is that the mid-level hump reflects basilar-membrane compression, and that WFs decrease at higher levels due to spread-of-excitation cues. To test this theory, Experiment 1 measured the mid-level hump and growth-of-masking functions to estimate the basilar membrane input/output (I/O) function in the same listeners. Results showed the initial rise in WFs could be accounted for by the change in I/O function slope, but there was additional unexplained variability in WFs. Previously, Plack [(1998). J. Acoust. Soc. Am. 103(5), 2530-2538] showed that long-duration notched noise (NN) presented with the tone reduced the mid-level hump even with a temporal gap in the NN. Plack concluded the results were consistent with central profile analysis. However, simultaneous, forward, and backward NN were not examined separately, which may independently test peripheral and central mechanisms of the NN. Experiment 2 measured WFs at the mid-level hump in the presence of NN and narrowband noise of different durations and temporal positions relative to the tone. Results varied across subjects, but were consistent with more peripheral mechanisms.
Collapse
Affiliation(s)
- Elin Roverud
- Department of Speech, Language, and Hearing Sciences, Purdue University, West Lafayette, Indiana 47907
| | - Elizabeth A Strickland
- Department of Speech, Language, and Hearing Sciences, Purdue University, West Lafayette, Indiana 47907
| |
Collapse
|
14
|
Henry KS, Heinz MG. Effects of sensorineural hearing loss on temporal coding of narrowband and broadband signals in the auditory periphery. Hear Res 2013; 303:39-47. [PMID: 23376018 DOI: 10.1016/j.heares.2013.01.014] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/14/2012] [Revised: 12/11/2012] [Accepted: 01/15/2013] [Indexed: 10/27/2022]
Abstract
People with sensorineural hearing loss have substantial difficulty understanding speech under degraded listening conditions. Behavioral studies suggest that this difficulty may be caused by changes in auditory processing of the rapidly-varying temporal fine structure (TFS) of acoustic signals. In this paper, we review the presently known effects of sensorineural hearing loss on processing of TFS and slower envelope modulations in the peripheral auditory system of mammals. Cochlear damage has relatively subtle effects on phase locking by auditory-nerve fibers to the temporal structure of narrowband signals under quiet conditions. In background noise, however, sensorineural loss does substantially reduce phase locking to the TFS of pure-tone stimuli. For auditory processing of broadband stimuli, sensorineural hearing loss has been shown to severely alter the neural representation of temporal information along the tonotopic axis of the cochlea. Notably, auditory-nerve fibers innervating the high-frequency part of the cochlea grow increasingly responsive to low-frequency TFS information and less responsive to temporal information near their characteristic frequency (CF). Cochlear damage also increases the correlation of the response to TFS across fibers of varying CF, decreases the traveling-wave delay between TFS responses of fibers with different CFs, and can increase the range of temporal modulation frequencies encoded in the periphery for broadband sounds. Weaker neural coding of temporal structure in background noise and degraded coding of broadband signals along the tonotopic axis of the cochlea are expected to contribute considerably to speech perception problems in people with sensorineural hearing loss. This article is part of a Special Issue entitled "Annual Reviews 2013".
Collapse
Affiliation(s)
- Kenneth S Henry
- Department of Speech, Language, and Hearing Sciences, Purdue University, 500 Oval Drive, West Lafayette, IN 47907, USA
| | | |
Collapse
|
15
|
Oberfeld D, Stahn P. Sequential grouping modulates the effect of non-simultaneous masking on auditory intensity resolution. PLoS One 2012; 7:e48054. [PMID: 23110174 PMCID: PMC3480468 DOI: 10.1371/journal.pone.0048054] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2011] [Accepted: 09/26/2012] [Indexed: 11/22/2022] Open
Abstract
The presence of non-simultaneous maskers can result in strong impairment in auditory intensity resolution relative to a condition without maskers, and causes a complex pattern of effects that is difficult to explain on the basis of peripheral processing. We suggest that the failure of selective attention to the target tones is a useful framework for understanding these effects. Two experiments tested the hypothesis that the sequential grouping of the targets and the maskers into separate auditory objects facilitates selective attention and therefore reduces the masker-induced impairment in intensity resolution. In Experiment 1, a condition favoring the processing of the maskers and the targets as two separate auditory objects due to grouping by temporal proximity was contrasted with the usual forward masking setting where the masker and the target presented within each observation interval of the two-interval task can be expected to be grouped together. As expected, the former condition resulted in a significantly smaller masker-induced elevation of the intensity difference limens (DLs). In Experiment 2, embedding the targets in an isochronous sequence of maskers led to a significantly smaller DL-elevation than control conditions not favoring the perception of the maskers as a separate auditory stream. The observed effects of grouping are compatible with the assumption that a precise representation of target intensity is available at the decision stage, but that this information is used only in a suboptimal fashion due to limitations of selective attention. The data can be explained within a framework of object-based attention. The results impose constraints on physiological models of intensity discrimination. We discuss candidate structures for physiological correlates of the psychophysical data.
Collapse
Affiliation(s)
- Daniel Oberfeld
- Department of Psychology, Section Experimental Psychology, Johannes Gutenberg-Universität Mainz, Mainz, Germany.
| | | |
Collapse
|
16
|
Across-channel timing differences as a potential code for the frequency of pure tones. J Assoc Res Otolaryngol 2011; 13:159-171. [PMID: 22160791 PMCID: PMC3298616 DOI: 10.1007/s10162-011-0305-0] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2011] [Accepted: 11/07/2011] [Indexed: 11/06/2022] Open
Abstract
When a pure tone or low-numbered harmonic is presented to a listener, the resulting travelling wave in the cochlea slows down at the portion of the basilar membrane (BM) tuned to the input frequency due to the filtering properties of the BM. This slowing is reflected in the phase of the response of neurons across the auditory nerve (AN) array. It has been suggested that the auditory system exploits these across-channel timing differences to encode the pitch of both pure tones and resolved harmonics in complex tones. Here, we report a quantitative analysis of previously published data on the response of guinea pig AN fibres, of a range of characteristic frequencies, to pure tones of different frequencies and levels. We conclude that although the use of across-channel timing cues provides an a priori attractive and plausible means of encoding pitch, many of the most obvious metrics for using that cue produce pitch estimates that are strongly influenced by the overall level and therefore are unlikely to provide a straightforward means for encoding the pitch of pure tones.
Collapse
|
17
|
Swaminathan J, Heinz MG. Predicted effects of sensorineural hearing loss on across-fiber envelope coding in the auditory nerve. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2011; 129:4001-13. [PMID: 21682421 PMCID: PMC3135152 DOI: 10.1121/1.3583502] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/27/2010] [Revised: 12/23/2010] [Accepted: 03/30/2011] [Indexed: 05/19/2023]
Abstract
Cross-channel envelope correlations are hypothesized to influence speech intelligibility, particularly in adverse conditions. Acoustic analyses suggest speech envelope correlations differ for syllabic and phonemic ranges of modulation frequency. The influence of cochlear filtering was examined here by predicting cross-channel envelope correlations in different speech modulation ranges for normal and impaired auditory-nerve (AN) responses. Neural cross-correlation coefficients quantified across-fiber envelope coding in syllabic (0-5 Hz), phonemic (5-64 Hz), and periodicity (64-300 Hz) modulation ranges. Spike trains were generated from a physiologically based AN model. Correlations were also computed using the model with selective hair-cell damage. Neural predictions revealed that envelope cross-correlation decreased with increased characteristic-frequency separation for all modulation ranges (with greater syllabic-envelope correlation than phonemic or periodicity). Syllabic envelope was highly correlated across many spectral channels, whereas phonemic and periodicity envelopes were correlated mainly between adjacent channels. Outer-hair-cell impairment increased the degree of cross-channel correlation for phonemic and periodicity ranges for speech in quiet and in noise, thereby reducing the number of independent neural information channels for envelope coding. In contrast, outer-hair-cell impairment was predicted to decrease cross-channel correlation for syllabic envelopes in noise, which may partially account for the reduced ability of hearing-impaired listeners to segregate speech in complex backgrounds.
Collapse
Affiliation(s)
- Jayaganesh Swaminathan
- Department of Speech, Language, and Hearing Sciences, Purdue University, West Lafayette, Indiana 47907-2038, USA.
| | | |
Collapse
|
18
|
Jennings SG, Heinz MG, Strickland EA. Evaluating adaptation and olivocochlear efferent feedback as potential explanations of psychophysical overshoot. J Assoc Res Otolaryngol 2011; 12:345-60. [PMID: 21267622 DOI: 10.1007/s10162-011-0256-5] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2010] [Accepted: 01/10/2011] [Indexed: 11/24/2022] Open
Abstract
Masked detection threshold for a short tone in noise improves as the tone's onset is delayed from the masker's onset. This improvement, known as "overshoot," is maximal at mid-masker levels and is reduced by temporary and permanent cochlear hearing loss. Computational modeling was used in the present study to evaluate proposed physiological mechanisms of overshoot, including classic firing rate adaptation and medial olivocochlear (MOC) feedback, for both normal hearing and cochlear hearing loss conditions. These theories were tested using an established model of the auditory periphery and signal detection theory techniques. The influence of several analysis variables on predicted tone-pip detection in broadband noise was evaluated, including: auditory nerve fiber spontaneous-rate (SR) pooling, range of characteristic frequencies, number of synapses per characteristic frequency, analysis window duration, and detection rule. The results revealed that overshoot similar to perceptual data in terms of both magnitude and level dependence could be predicted when the effects of MOC efferent feedback were included in the auditory nerve model. Conversely, simulations without MOC feedback effects never produced overshoot despite the model's ability to account for classic firing rate adaptation and dynamic range adaptation in auditory nerve responses. Cochlear hearing loss was predicted to reduce the size of overshoot only for model versions that included the effects of MOC efferent feedback. These findings suggest that overshoot in normal and hearing-impaired listeners is mediated by some form of dynamic range adaptation other than what is observed in the auditory nerve of anesthetized animals. Mechanisms for this adaptation may occur at several levels along the auditory pathway. Among these mechanisms, the MOC reflex may play a leading role.
Collapse
Affiliation(s)
- Skyler G Jennings
- Department of Speech, Language, and Hearing Sciences, Purdue University, 500 Oval Drive, West Lafayette, IN 47907, USA.
| | | | | |
Collapse
|
19
|
Jepsen ML, Dau T. Characterizing auditory processing and perception in individual listeners with sensorineural hearing loss. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2011; 129:262-81. [PMID: 21303008 DOI: 10.1121/1.3518768] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/26/2023]
Abstract
This study considered consequences of sensorineural hearing loss in ten listeners. The characterization of individual hearing loss was based on psychoacoustic data addressing audiometric pure-tone sensitivity, cochlear compression, frequency selectivity, temporal resolution, and intensity discrimination. In the experiments it was found that listeners with comparable audiograms can show very different results in the supra-threshold measures. In an attempt to account for the observed individual data, a model of auditory signal processing and perception [Jepsen et al., J. Acoust. Soc. Am. 124, 422-438 (2008)] was used as a framework. The parameters of the cochlear processing stage of the model were adjusted to account for behaviorally estimated individual basilar-membrane input-output functions and the audiogram, from which the amounts of inner hair-cell and outer hair-cell losses were estimated as a function of frequency. All other model parameters were left unchanged. The predictions showed a reasonably good agreement with the measured individual data in the frequency selectivity and forward masking conditions while the variation of intensity discrimination thresholds across listeners was underestimated by the model. The model and the associated parameters for individual hearing-impaired listeners might be useful for investigating effects of individual hearing impairment in more complex conditions, such as speech intelligibility in noise.
Collapse
Affiliation(s)
- Morten L Jepsen
- Centre for Applied Hearing Research, Department of Electrical Engineering, Technical University of Denmark, DK-2800 Kgs. Lyngby, Denmark.
| | | |
Collapse
|
20
|
Quantifying envelope and fine-structure coding in auditory nerve responses to chimaeric speech. J Assoc Res Otolaryngol 2009; 10:407-23. [PMID: 19365691 DOI: 10.1007/s10162-009-0169-8] [Citation(s) in RCA: 54] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2008] [Accepted: 03/13/2009] [Indexed: 10/20/2022] Open
Abstract
Any sound can be separated mathematically into a slowly varying envelope and rapidly varying fine-structure component. This property has motivated numerous perceptual studies to understand the relative importance of each component for speech and music perception. Specialized acoustic stimuli, such as auditory chimaeras with the envelope of one sound and fine structure of another have been used to separate the perceptual roles for envelope and fine structure. Cochlear narrowband filtering limits the ability to isolate fine structure from envelope; however, envelope recovery from fine structure has been difficult to evaluate physiologically. To evaluate envelope recovery at the output of the cochlea, neural cross-correlation coefficients were developed that quantify the similarity between two sets of spike-train responses. Shuffled auto- and cross-correlogram analyses were used to compute separate correlations for responses to envelope and fine structure based on both model and recorded spike trains from auditory nerve fibers. Previous correlogram analyses were extended to isolate envelope coding more effectively in auditory nerve fibers with low center frequencies, which are particularly important for speech coding. Recovered speech envelopes were present in both model and recorded responses to one- and 16-band speech fine-structure chimaeras and were significantly greater for the one-band case, consistent with perceptual studies. Model predictions suggest that cochlear recovered envelopes are reduced following sensorineural hearing loss due to broadened tuning associated with outer-hair cell dysfunction. In addition to the within-fiber cross-stimulus cases considered here, these neural cross-correlation coefficients can also be used to evaluate spatiotemporal coding by applying them to cross-fiber within-stimulus conditions. Thus, these neural metrics can be used to quantitatively evaluate a wide range of perceptually significant temporal coding issues relevant to normal and impaired hearing.
Collapse
|
21
|
Variation in the phase of response to low-frequency pure tones in the guinea pig auditory nerve as functions of stimulus level and frequency. J Assoc Res Otolaryngol 2008; 10:233-50. [PMID: 19093151 PMCID: PMC2674197 DOI: 10.1007/s10162-008-0151-x] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2008] [Accepted: 11/14/2008] [Indexed: 11/02/2022] Open
Abstract
The directionality of hair cell stimulation combined with the vibration of the basilar membrane causes the auditory nerve fiber action potentials, in response to low-frequency stimuli, to occur at a particular phase of the stimulus waveform. Because direct mechanical measurements at the cochlear apex are difficult, such phase locking has often been used to indirectly infer the basilar membrane motion. Here, we confirm and extend earlier data from mammals using sine wave stimulation over a wide range of sound levels (up to 90 dB sound pressure level). We recorded phase-locked responses to pure tones over a wide range of frequencies and sound levels of a large population of auditory nerve fibers in the anesthetized guinea pig. The results indicate that, for a constant frequency of stimulation, the phase lag decreases with increases in the characteristic frequency (CF) of the nerve fiber. The phase lag decreases up to a CF above the stimulation frequency, beyond which it decreases at a much slower rate. Such phase changes are consistent with known basal cochlear mechanics. Measurements from individual fibers showed smaller but systematic variations in phase with sound level, confirming previous reports. We found a "null" stimulation frequency at which little variation in phase occurred with sound level. This null frequency was often not at the CF. At stimulation frequencies below the null, there was a progressive lag with sound level and a progressive lead for stimulation frequencies above the null. This was maximally 0.2 cycles.
Collapse
|
22
|
Jepsen ML, Ewert SD, Dau T. A computational model of human auditory signal processing and perception. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2008; 124:422-438. [PMID: 18646987 DOI: 10.1121/1.2924135] [Citation(s) in RCA: 84] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/26/2023]
Abstract
A model of computational auditory signal-processing and perception that accounts for various aspects of simultaneous and nonsimultaneous masking in human listeners is presented. The model is based on the modulation filterbank model described by Dau et al. [J. Acoust. Soc. Am. 102, 2892 (1997)] but includes major changes at the peripheral and more central stages of processing. The model contains outer- and middle-ear transformations, a nonlinear basilar-membrane processing stage, a hair-cell transduction stage, a squaring expansion, an adaptation stage, a 150-Hz lowpass modulation filter, a bandpass modulation filterbank, a constant-variance internal noise, and an optimal detector stage. The model was evaluated in experimental conditions that reflect, to a different degree, effects of compression as well as spectral and temporal resolution in auditory processing. The experiments include intensity discrimination with pure tones and broadband noise, tone-in-noise detection, spectral masking with narrow-band signals and maskers, forward masking with tone signals and tone or noise maskers, and amplitude-modulation detection with narrow- and wideband noise carriers. The model can account for most of the key properties of the data and is more powerful than the original model. The model might be useful as a front end in technical applications.
Collapse
Affiliation(s)
- Morten L Jepsen
- Centre for Applied Hearing Research, Acoustic Technology, Department of Electrical Engineering, Technical University of Denmark, DK-2800 Kgs. Lyngby, Denmark
| | | | | |
Collapse
|
23
|
Joris PX, Michelet P, Franken TP, McLaughlin M. Variations on a Dexterous theme: peripheral time-intensity trading. Hear Res 2007; 238:49-57. [PMID: 18187277 DOI: 10.1016/j.heares.2007.11.011] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/22/2007] [Revised: 11/28/2007] [Accepted: 11/30/2007] [Indexed: 10/22/2022]
Abstract
Sound pressure level changes can affect the timing of spiketrains. Timing of spiketrains is critical for sensitivity to interaural timing differences (ITDs). Interaural level differences (ILDs) can therefore affect the ITD cue. It has been hypothesized that ILDs may be coded indirectly through a peripheral conversion of level to time (but it should be cautioned that the changes in phase with SPL in low-CF AN fibers of the cat are more complicated) (Jeffress, L.A., 1948. A place theory of sound localization. J. Comp. Physiol. Psychol. 41, 35-39). We tested this conversion by recording from auditory nerve fibers to broadband noise at different SPLs. For each fiber, correlograms were constructed to compare timing to fine-structure across SPLs. We find generally a decrease in the time delay between spikes and the stimulus with increasing SPL. However, the magnitudes of the shift in time are surprisingly small, and dependent on characteristic frequency (CF): the largest shifts are approximately 10 micros/dB and occur at the lowest CFs. Nevertheless, the effects of level on spike timing are systematic and of a magnitude to which the binaural system is sensitive. Thus, even though the results indicate that ILD is not traded for ITD in a simple way, the possibility that low-frequency ILDs affect the binaural percept via a peripheral level-to-time conversion cannot be excluded.
Collapse
Affiliation(s)
- Philip X Joris
- Laboratory of Auditory Neurophysiology, K.U. Leuven Medical School, Campus Gasthuisberg O&N2, Herestraat 49 bus 1021, B-3000 Leuven, Belgium.
| | | | | | | |
Collapse
|
24
|
Chintanpalli A, Heinz MG. Effect of auditory-nerve response variability on estimates of tuning curves. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2007; 122:EL203-EL209. [PMID: 18247642 PMCID: PMC2911360 DOI: 10.1121/1.2794880] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/25/2023]
Abstract
Near-Poisson variability in auditory-nerve (AN) responses limits the accuracy of automated tuning-curve algorithms. Here, a typical adaptive tuning-curve algorithm was used with a physiologically realistic AN model with and without the inclusion of neural randomness. Response randomness produced variability in Q(10) estimates that was nearly as large as in AN data. Results suggest that it is sufficient for AN models to specify frequency selectivity based on mean Q(10) values at each characteristic frequency (CF). Errors in estimates of CF, which decreased from +/-0.2 octaves at low frequencies to +/-0.05 octaves at high frequencies, are significant for studies of spatiotemporal coding.
Collapse
Affiliation(s)
- Ananthakrishna Chintanpalli
- Weldon School of Biomedical Engineering, Purdue University, 206 S. Martin Jischke Drive, West Lafayette, Indiana 47907,
| | - Michael G. Heinz
- Department of Speech, Language and Hearing Sciences and Weldon School of Biomedical Engineering, Purdue University, 500 Oval Drive, West Lafayette, Indiana 47907,
| |
Collapse
|
25
|
Tan Q, Carney LH. Predictions of formant-frequency discrimination in noise based on model auditory-nerve responses. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2006; 120:1435-45. [PMID: 17004467 PMCID: PMC2572872 DOI: 10.1121/1.2225858] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/12/2023]
Abstract
To better understand how the auditory system extracts speech signals in the presence of noise, discrimination thresholds for the second formant frequency were predicted with simulations of auditory-nerve responses. These predictions employed either average-rate information or combined rate and timing information, and either populations of model fibers tuned across a wide range of frequencies or a subset of fibers tuned to a restricted frequency range. In general, combined temporal and rate information for a small population of model fibers tuned near the formant frequency was most successful in replicating the trends reported in behavioral data for formant-frequency discrimination. To explore the nature of the temporal information that contributed to these results, predictions based on model auditory-nerve responses were compared to predictions based on the average rates of a population of cross-frequency coincidence detectors. These comparisons suggested that average response rate (count) of cross-frequency coincidence detectors did not effectively extract important temporal information from the auditory-nerve population response. Thus, the relative timing of action potentials across auditory-nerve fibers tuned to different frequencies was not the aspect of the temporal information that produced the trends in formant-frequency discrimination thresholds.
Collapse
Affiliation(s)
- Qing Tan
- Boston University Hearing Research Center, Department of Biomedical Engineering, Boston University, 44 Cummington Street, Boston, Massachusetts 02215, USA
| | | |
Collapse
|
26
|
de Cheveigné A, Pressnitzer D. The case of the missing delay lines: synthetic delays obtained by cross-channel phase interaction. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2006; 119:3908-18. [PMID: 16838534 DOI: 10.1121/1.2195291] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/10/2023]
Abstract
Temporal models of pitch and harmonic segregation call for delays of up to 30 ms to cover the full range of existence of musical pitch. To date there is little anatomical or physiological evidence for delays that long. We propose a mechanism by which delays may be synthesized from cross-channel phase interaction. Phases of adjacent cochlear filter channels are shifted by an amount proportional to frequency and then combined as a weighted sum to approximate a delay. Synthetic delays may be used by pitch perception models such as autocorrelation, segregation models such as harmonic cancellation, and binaural processing models to explain sensitivity to large interaural delays. The maximum duration of synthetic delays is limited by the duration of the impulse responses of cochlear filters, itself inversely proportional to cochlear filter bandwidth. Maximum delay is thus frequency dependent. This may explain the fact, puzzling for temporal pitch models such as autocorrelation, that pitch is more salient and easy to discriminate for complex tones that contain resolved partials.
Collapse
Affiliation(s)
- Alain de Cheveigné
- Equipe Audition, FRE 2929, CNRS, Université Paris 5, ENS, 29 Rue d'Ulm, F-75230 Paris 05, France
| | | |
Collapse
|
27
|
Joris PX, Louage DH, Cardoen L, van der Heijden M. Correlation index: a new metric to quantify temporal coding. Hear Res 2006; 216-217:19-30. [PMID: 16644160 DOI: 10.1016/j.heares.2006.03.010] [Citation(s) in RCA: 77] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/10/2005] [Revised: 03/01/2006] [Accepted: 03/07/2006] [Indexed: 12/01/2022]
Abstract
The standard procedure to study temporal encoding of sound waveforms in the auditory system has been Fourier analysis of responses to periodic stimuli. We introduce a new metric--correlation index (CI)--which is based on a simple counting of spike coincidences. It can be used for responses to aperiodic stimuli and does not require knowledge of the stimulus. Moreover, the basic procedure of comparing spiketimes in spiketrains is more physiological than currently used methods for temporal analysis. The CI is the peak value of the normalized shuffled autocorrelogram (SAC), which provides a quantitative summary of temporal structure in the neural response to arbitrary stimuli. We illustrate the CI and SACs by comparing temporal coding in the auditory nerve and output fibers of the cochlear nucleus.
Collapse
Affiliation(s)
- Philip X Joris
- Laboratory of Auditory Neurophysiology, K.U. Leuven, Medical School, Campus Gasthuisberg O&N2 bus 1021, Herestraat 49, B-3000 Leuven, Belgium.
| | | | | | | |
Collapse
|
28
|
Davidson SA, Gilkey RH, Colburn HS, Carney LH. Binaural detection with narrowband and wideband reproducible noise maskers. III. Monaural and diotic detection and model results. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2006; 119:2258-75. [PMID: 16642840 DOI: 10.1121/1.2177583] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/08/2023]
Abstract
A single-interval, yes-no, tone-in-noise detection experiment was conducted to measure the proportion of "tone present" responses to each of 25 reproducible noise-alone and tone-plus-noise waveforms under narrowband (100 Hz), wideband (2900 Hz), monotic, and diotic stimulus conditions. Proportions of "tone present" responses (estimates of the probabilities of hits and false alarms) were correlated across masker bandwidths and across monotic and diotic conditions. Two categories of models were considered; one based on stimulus energy or neural counts, and another based on temporal structure of the stimulus envelope or neural patterns. Both categories gave significant correlation between decision variables and data. A model based on a weighted combination of energy in multiple critical bands performed best, predicting up to 90% of the variance in the reproducible-noise data. However, since energy-based models are unable to successfully explain detection under a roving-level paradigm without substantial modification, it is argued that other variations of detection models must be considered for future study. Temporal models are resistant to changes in threshold under roving-level conditions, but explained at most only 67% of the variance in the reproducible-noise data.
Collapse
Affiliation(s)
- Sean A Davidson
- Department of Biomedical and Chemical Engineering, Institute for Sensory Research, 621 Skytop Road, Syracuse University, Syracuse, New York 13244, USA
| | | | | | | |
Collapse
|
29
|
Joris PX, van de Sande B, Recio-Spinoso A, van der Heijden M. Auditory midbrain and nerve responses to sinusoidal variations in interaural correlation. J Neurosci 2006; 26:279-89. [PMID: 16399698 PMCID: PMC6674331 DOI: 10.1523/jneurosci.2285-05.2006] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
The human sensitivity to interaural temporal differences in the acoustic waveforms to the two ears shows remarkable acuity but is also very sluggish. Fast changes in binaural parameters are not detectable, and this inability contrasts sharply with the excellent temporal resolution of the monaural auditory system. We studied the response of binaural neurons in the inferior colliculus of the cat to sinusoidal changes in the interaural correlation of broadband noise. Responses to the same waveforms were also obtained from auditory nerve fibers and further analyzed with a coincidence analysis. Overall, the auditory nerve and inferior colliculus showed a similar ability to code changes in interaural correlation. This ability extended to modulation frequencies an order of magnitude higher than the highest frequencies detected binaurally in humans. We conclude that binaural sluggishness is not caused by a lack of temporal encoding of fast binaural changes at the level of the midbrain. We hypothesize that there is no neural substrate at the level of the midbrain or higher to read out this temporal code and that this constitutes a low-pass "sluggishness" filter.
Collapse
Affiliation(s)
- Philip X Joris
- Laboratory of Auditory Neurophysiology, Medical School, Campus Gasthuisberg, K. U. Leuven, B-3000 Leuven, Belgium.
| | | | | | | |
Collapse
|
30
|
Nizami L. Intensity-difference limens predicted from the click-evoked peripheral N1: The mid-level hump and its implications for intensity encoding. Math Biosci 2005; 197:15-34. [PMID: 16023149 DOI: 10.1016/j.mbs.2005.04.006] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2004] [Accepted: 04/14/2005] [Indexed: 10/25/2022]
Abstract
The intensity-difference limen (DL) for an acoustic click rises at moderate click levels, a feature called the 'mid-level hump'. It has long been hypothesized that, because a click does not evoke sustained firing in any primary afferent, the DL must therefore originate from the initial burst of synchronized spikes in the eighth nerve. That burst causes the N1 component of the peripheral compound action potential (CAP). It should therefore be possible to predict click DLs from N1 potentials. Here, a Signal Detection model, using a series expansion, was used to derive equations in N1 for the level-dependence of the DL. The first-order equation predicts a dependence on the standard deviation of N1, and an inverse dependence on the rate-of-growth of the mean N1. The second-order equation is more complicated. Both approximations were applied to N1s from the cat. Both produced a mid-level hump; at its peak, the DLs from the second-order approximation were the smaller ones, and were of the same order of magnitude as the empirical DLs. Overall, the computations show that the rate-of-growth of the mean N1, not the standard deviation of N1, determines the hump in the empirical DL.
Collapse
Affiliation(s)
- Lance Nizami
- Center for Hearing Research, Boys Town National Research Hospital, 555 North 30th Street, Omaha, NE 68131, United States.
| |
Collapse
|
31
|
Cedolin L, Delgutte B. Pitch of complex tones: rate-place and interspike interval representations in the auditory nerve. J Neurophysiol 2005; 94:347-62. [PMID: 15788522 PMCID: PMC2094528 DOI: 10.1152/jn.01114.2004] [Citation(s) in RCA: 68] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
Harmonic complex tones elicit a pitch sensation at their fundamental frequency (F0), even when their spectrum contains no energy at F0, a phenomenon known as "pitch of the missing fundamental." The strength of this pitch percept depends upon the degree to which individual harmonics are spaced sufficiently apart to be "resolved" by the mechanical frequency analysis in the cochlea. We investigated the resolvability of harmonics of missing-fundamental complex tones in the auditory nerve (AN) of anesthetized cats at low and moderate stimulus levels and compared the effectiveness of two representations of pitch over a much wider range of F0s (110-3,520 Hz) than in previous studies. We found that individual harmonics are increasingly well resolved in rate responses of AN fibers as the characteristic frequency (CF) increases. We obtained rate-based estimates of pitch dependent upon harmonic resolvability by matching harmonic templates to profiles of average discharge rate against CF. These estimates were most accurate for F0s above 400-500 Hz, where harmonics were sufficiently resolved. We also derived pitch estimates from all-order interspike-interval distributions, pooled over our entire sample of fibers. Such interval-based pitch estimates, which are dependent on phase-locking to the harmonics, were accurate for F0s below 1,300 Hz, consistent with the upper limit of the pitch of the missing fundamental in humans. The two pitch representations are complementary with respect to the F0 range over which they are effective; however, neither is entirely satisfactory in accounting for human psychophysical data.
Collapse
Affiliation(s)
- Leonardo Cedolin
- Eaton-Peabody Laboratory, Massachusetts Eye and Ear Infirmary, 243 Charles St., Boston, Massachusetts 02114.
| | | |
Collapse
|
32
|
Heinz MG, Issa JB, Young ED. Auditory-nerve rate responses are inconsistent with common hypotheses for the neural correlates of loudness recruitment. J Assoc Res Otolaryngol 2005; 6:91-105. [PMID: 15952047 PMCID: PMC2538332 DOI: 10.1007/s10162-004-5043-0] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2004] [Accepted: 11/22/2004] [Indexed: 10/25/2022] Open
Abstract
A number of perceptual phenomena related to normal and impaired level coding can be accounted for by the degree of compression in the basilar-membrane (BM) magnitude response. However, the narrow dynamic ranges of auditory-nerve (AN) fibers complicate these arguments. Because the AN serves as an information bottleneck, an improved understanding of the neural coding of level may clarify some of the limitations of current hearing aids. Here three hypotheses for the neural correlate of loudness recruitment were evaluated based on AN responses from normal-hearing cats and from cats with a noise-induced hearing loss (NIHL). Auditory-nerve fiber rate-level functions for tones were analyzed to test the following hypotheses: Loudness recruitment results from steeper AN rate functions after impairment. This hypothesis was not supported; AN rate functions were not steeper than normal following NIHL, despite steeper estimated BM responses based on the AN data. Loudness is based on the total AN discharge count, and recruitment results from an abnormally rapid spread of excitation after impairment. Whereas abnormal spread of excitation can be observed, steeper growth of total AN rate is not seen over the range of sound levels where recruitment is observed in human listeners. Loudness of a narrowband stimulus is based on AN responses in a narrow BF region, and recruitment results from compression of the AN-fiber threshold distribution after impairment. This hypothesis was not supported because there was no evidence that impaired AN threshold distributions were compressed and the growth of AN activity summed across BFs near the stimulus frequency was shallower than normal.Overall, these results suggest that loudness recruitment cannot be accounted for based on summed AN rate responses and may depend on neural mechanisms involved in the central representation of intensity.
Collapse
Affiliation(s)
- Michael G Heinz
- Center for Hearing Sciences and Department of Biomedical Engineering, Johns Hopkins University School of Medicine, Baltimore, MD 21205, USA.
| | | | | |
Collapse
|
33
|
Tan Q, Carney LH. Encoding of vowel-like sounds in the auditory nerve: model predictions of discrimination performance. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2005; 117:1210-1222. [PMID: 15807010 PMCID: PMC1404504 DOI: 10.1121/1.1856391] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
The sensitivity of listeners to changes in the center frequency of vowel-like harmonic complexes as a function of the center frequency of the complex cannot be explained by changes in the level of the stimulus [Lyzenga and Horst, J. Acoust. Soc. Am. 98, 1943-1955 (1995)]. Rather, a complex pattern of sensitivity is seen; for a spectrum with a triangular envelope, the greatest sensitivity occurs when the center frequency falls between harmonics, whereas for a spectrum with a trapezoidal envelope, greatest sensitivity occurs when the center frequency is aligned with a harmonic. In this study, the thresholds of a population model of auditory-nerve (AN) fibers were quantitatively compared to these trends in psychophysical thresholds. Single-fiber and population model responses were evaluated in terms of both average discharge rate and the combination of rate and timing information. Results indicate that phase-locked responses of AN fibers encode phase transitions associated with minima in these amplitude-modulated stimuli. The temporal response properties of a single AN fiber, tuned to a frequency slightly above the center frequency of the harmonic complex, were able to explain the trends in thresholds for both triangular- and trapezoidal-shaped spectra.
Collapse
Affiliation(s)
- Qing Tan
- Boston University Hearing Research Center, Department of Biomedical Engineering, Boston University, 44 Cummington Street, Boston, Massachusetts 02215
| | - Laurel H. Carney
- Author to whom correspondence should be addressed. Electronic mail:
| |
Collapse
|
34
|
Abstract
Level-invariant detection refers to findings that thresholds in tone-in-noise detection are unaffected by roving-level procedures that degrade energy cues. Such data are inconsistent with ideas that detection is based on the energy passed by an auditory filter. A hypothesis that detection is based on a level-invariant temporal cue is advanced. Simulations of a leaky-integrator model, consisting of a bandpass filter, half-wave rectification, and a lowpass filter, account for thresholds in band-widening experiments. The decision variable is calculated from the discrete Fourier transform of the leaky-integrator output. A counterintuitive finding is the apparent disassociation of the phenomenon of critical bands estimated from band-widening experiments and the theory of auditory filters. Physiological plausibility is demonstrated by showing that a leaky integrator describes the discharge cadence of primary afferents for tone-in-noise stimuli as well as for complex periodic sounds.
Collapse
Affiliation(s)
- Bruce G Berg
- Department of Cognitive Sciences, University of California, Irvine, Irvine, CA 92697, USA.
| |
Collapse
|
35
|
Nelson PC, Carney LH. A phenomenological model of peripheral and central neural responses to amplitude-modulated tones. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2004; 116:2173-86. [PMID: 15532650 PMCID: PMC1379629 DOI: 10.1121/1.1784442] [Citation(s) in RCA: 112] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/09/2023]
Abstract
A phenomenological model with time-varying excitation and inhibition was developed to study possible neural mechanisms underlying changes in the representation of temporal envelopes along the auditory pathway. A modified version of an existing auditory-nerve model [Zhang et al., J. Acoust. Soc. Am. 109, 648-670 (2001)] was used to provide inputs to higher hypothetical processing centers. Model responses were compared directly to published physiological data at three levels: the auditory nerve, ventral cochlear nucleus, and inferior colliculus. Trends and absolute values of both average firing rate and synchrony to the modulation period were accurately predicted at each level for a wide range of stimulus modulation depths and modulation frequencies. The diversity of central physiological responses was accounted for with realistic variations of model parameters. Specifically, enhanced synchrony in the cochlear nucleus and rate-tuning to modulation frequency in the inferior colliculus were predicted by choosing appropriate relative strengths and time courses of excitatory and inhibitory inputs to postsynaptic model cells. The proposed model is fundamentally different than others that have been used to explain the representation of envelopes in the mammalian midbrain, and it provides a computational tool for testing hypothesized relationships between physiology and psychophysics.
Collapse
Affiliation(s)
| | - Laurel H. Carney
- Author to whom correspondence should be addressed. Electronic mail:
| |
Collapse
|
36
|
Louage DHG, van der Heijden M, Joris PX. Temporal properties of responses to broadband noise in the auditory nerve. J Neurophysiol 2004; 91:2051-65. [PMID: 15069097 DOI: 10.1152/jn.00816.2003] [Citation(s) in RCA: 93] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
Temporal information in the responses of auditory neurons to sustained sounds has been studied mostly with periodic stimuli, using measures that are based on Fourier analysis. Less information is available on temporal aspects of responses to nonperiodic wideband sounds. We recorded responses to a reference Gaussian noise and its polarity-inverted version in the auditory nerve of barbiturate-anesthetized cats and used shuffled autocorrelograms (SACs) to quantify spike timing. Two metrics were extracted from the central peak of autocorrelograms: the peak-height and the width at halfheight. Temporal information related to stimulus fine-structure was isolated from that to envelope by subtracting or adding responses to the reference and inverted noise. Peak-height and halfwidth generally behaved as expected from the existing body of data on phase-locking to pure tones and sinusoidally amplitude-modulated tones but showed some surprises as well. Compared with synchronization to low-frequency tones, SACs reveal large differences in temporal behavior between the different classes of nerve fibers (based on spontaneous rate) as well as a strong dependence on characteristic frequency (CF) throughout the phase-locking range. SACs also reveal a larger temporal consistency (i.e., tendency to discharge at the same point in time on repeated presentation of the same stimulus) in the responses to the stochastic noise stimulus than in the responses to periodic tones. Responses at high CFs reflect envelope phase-locking and are consistent with previous reports using sinusoidal AM. We conclude that the combined use of broadband noise and SAC analysis allow a more general characterization of temporal behavior than periodic stimuli and Fourier analysis.
Collapse
Affiliation(s)
- Dries H G Louage
- Laboratory of Auditory Neurophysiology, Medical School, K.U.Leuven, B-3000 Leuven, Belgium
| | | | | |
Collapse
|
37
|
Colburn HS, Carney LH, Heinz MG. Quantifying the information in auditory-nerve responses for level discrimination. J Assoc Res Otolaryngol 2004; 4:294-311. [PMID: 14690049 PMCID: PMC3202725 DOI: 10.1007/s10162-002-1090-6] [Citation(s) in RCA: 47] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022] Open
Abstract
An analytical approach for quantifying the information in auditory-nerve (AN) fiber responses for the task of level discrimination is described. A simple analytical model for ANT responses is extended to include temporal response properties, including the nonlinear-phase effects of the cochlear amplifier. Use of simple analytical models for AN discharge patterns allows quantification of the contributions of level-dependent aspects of the patterns to level discrimination. Specifically, the individual and combined contributions of the information contained in discharge rate, synchrony, and relative phase cues are explicitly examined for level discrimination of tonal stimuli. It is shown that the rate information provided by individual AN fibers is more constrained by increases in variance with increases in rate than by saturation. As noted in previous studies, there is sufficient average-rate information within a narrow-CF region to account for robust behavioral performance over a wide dynamic range; however, there is no model based on a simple limitation or use of AN information consistent with parametric variations in performance. This issue is explored in the current study through analysis of performance based on different aspects of AN patterns. For example, we show that performance predicted from use of all rate information degrades significantly as level increases above low-medium levels, inconsistent with Weber's Law. At low frequencies, synchrony information extends the range over which behavioral performance can be explained by 10-15 dB, but only at low levels. In contrast to rate and synchrony, nonlinear-phase cues are shown to provide robust information at medium and high levels in near-CF fibers for low-frequency stimuli. The level dependence of the discharge rate and phase properties of AN fibers are influenced by the compressive nonlinearity of the inner ear. Evaluating the role of the compressive nonlinearity in level coding is important for understanding neural encoding mechanisms and because of its association with the cochlear amplifier, which is a fragile aspect of the ear believed to be affected in common forms of hearing impairment.
Collapse
Affiliation(s)
- H Steven Colburn
- Hearing Research Center, Department of Biomedical Engineering, Boston University, Boston, MA 02215, USA.
| | | | | |
Collapse
|
38
|
Huettel LG, Collins LM. Predicting Auditory Tone-in-Noise Detection Performance: The Effects of Neural Variability. IEEE Trans Biomed Eng 2004; 51:282-93. [PMID: 14765701 DOI: 10.1109/tbme.2003.820395] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
Collecting and analyzing psychophysical data is a fundamental mechanism for the study of auditory processing. However, because this approach relies on human listening experiments, it can be costly in terms of time and money spent gathering the data. The development of a theoretical, model-based procedure capable of accurately predicting psychophysical behavior could alleviate these issues by enabling researchers to rapidly evaluate hypotheses prior to conducting experiments. This approach may also provide additional insight into auditory processing by establishing a link between psychophysical behavior and physiology. Signal detection theory has previously been combined with an auditory model to generate theoretical predictions of psychophysical behavior. Commonly, the ideal processor outperforms human subjects. In order for this model-based technique to enhance the study of auditory processing, discrepancies must be eliminated or explained. In this paper, we investigate the possibility that neural variability, which results from the randomness inherent in auditory nerve fiber responses, may explain some of the previously observed discrepancies. In addition, we study the impact of combining information across nerve fibers and investigate several models of multiple-fiber signal processing. Our findings suggest that neural variability can account for much, but not all, of the discrepancy between theoretical and experimental data.
Collapse
Affiliation(s)
- Lisa G Huettel
- Department of Electrical and Computer Engineering, Box 90291, Duke University, Durham, NC 27708, USA.
| | | |
Collapse
|
39
|
Heinz MG, Young ED. Response growth with sound level in auditory-nerve fibers after noise-induced hearing loss. J Neurophysiol 2003; 91:784-95. [PMID: 14534289 PMCID: PMC2921373 DOI: 10.1152/jn.00776.2003] [Citation(s) in RCA: 86] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
People with sensorineural hearing loss are often constrained by a reduced acoustic dynamic range associated with loudness recruitment; however, the neural correlates of loudness and recruitment are still not well understood. The growth of auditory-nerve (AN) activity with sound level was compared in normal-hearing cats and in cats with a noise-induced hearing loss to test the hypothesis that AN-fiber rate-level functions are steeper in impaired ears. Stimuli included best-frequency and fixed-frequency tones, broadband noise, and a brief speech token. Three types of impaired responses were observed. 1) Fibers with rate-level functions that were similar across all stimuli typically had broad tuning, consistent with outer-hair-cell (OHC) damage. 2) Fibers with a wide dynamic range and shallow slope above threshold often retained sharp tuning, consistent with primarily inner-hair-cell (IHC) damage. 3) Fibers with very steep rate-level functions for all stimuli had thresholds above approximately 80 dB SPL and very broad tuning, consistent with severe IHC and OHC damage. Impaired rate-level slopes were on average shallower than normal for tones, and were steeper in only limited conditions. There was less variation in rate-level slopes across stimuli in impaired fibers, presumably attributable to the lack of suppression-induced reductions in slopes for complex stimuli relative to BF-tone slopes. Sloping saturation was observed less often in impaired fibers. These results illustrate that AN fibers do not provide a simple representation of the basilar-membrane I/O function and suggest that both OHC and IHC damage can affect AN response growth.
Collapse
Affiliation(s)
- Michael G Heinz
- Center for Hearing Sciences and Department of Biomedical Engineering, Johns Hopkins University School of Medicine, Baltimore, Maryland 21205, USA.
| | | |
Collapse
|
40
|
Tan Q, Carney LH. A phenomenological model for the responses of auditory-nerve fibers. II. Nonlinear tuning with a frequency glide. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2003; 114:2007-2020. [PMID: 14587601 DOI: 10.1121/1.1608963] [Citation(s) in RCA: 49] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
A computational model was developed to simulate the responses of auditory-nerve (AN) fibers in cat. The model's signal path consisted of a time-varying bandpass filter; the bandwidth and gain of the signal path were controlled by a nonlinear feed-forward control path. This model produced realistic response features to several stimuli, including pure tones, two-tone combinations, wideband noise, and clicks. Instantaneous frequency glides in the reverse-correlation (revcor) function of the model's response to broadband noise were achieved by carefully restricting the locations of the poles and zeros of the bandpass filter. The pole locations were continuously varied as a function of time by the control signal to change the gain and bandwidth of the signal path, but the instantaneous frequency profile in the revcor function was independent of sound pressure level, consistent with physiological data. In addition, this model has other important properties, such as nonlinear compression, two-tone suppression, and reasonable Q10 values for tuning curves. The incorporation of both the level-independent frequency glide and the level-dependent compressive nonlinearity into a phenomenological model for the AN was the primary focus of this work. The ability of this model to process arbitrary sound inputs makes it a useful tool for studying peripheral auditory processing.
Collapse
Affiliation(s)
- Qing Tan
- Boston University Hearing Research Center, Department of Biomedical Engineering, Boston University, 44 Cummington Street, Boston, Massachusetts 02215, USA
| | | |
Collapse
|
41
|
Litvak LM, Delgutte B, Eddington DK. Improved temporal coding of sinusoids in electric stimulation of the auditory nerve using desynchronizing pulse trains. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2003; 114:2079-98. [PMID: 14587607 PMCID: PMC2270483 DOI: 10.1121/1.1612493] [Citation(s) in RCA: 40] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/12/2023]
Abstract
Rubinstein et al. [Hearing Res. 127, 108-118 (1999)] suggested that the representation of electric stimulus waveforms in the temporal discharge patterns of auditory-nerve fiber (ANF) might be improved by introducing an ongoing, high-rate, desynchronizing pulse train (DPT). To test this hypothesis, activity of ANFs was studied in acutely deafened, anesthetized cats in response to 10-min-long, 5-kpps electric pulse trains that were sinusoidally modulated for 400 ms every second. Two classes of responses to sinusoidal modulations of the DPT were observed. Fibers that only responded transiently to the unmodulated DPT showed hyper synchronization and narrow dynamic ranges to sinusoidal modulators, much as responses to electric sinusoids presented without a DPT. In contrast, fibers that exhibited sustained responses to the DPT were sensitive to modulation depths as low as 0.25% for a modulation frequency of 417 Hz. Over a 20-dB range of modulation depths, responses of these fibers resembled responses to tones in a healthy ear in both discharge rate and synchronization index. This range is much wider than the dynamic range typically found with electrical stimulation without a DPT, and comparable to the dynamic range for acoustic stimulation. These results suggest that a stimulation strategy that uses small signals superimposed upon a large DPT to encode sounds may evoke temporal discharge patterns in some ANFs that resemble responses to sound in a healthy ear.
Collapse
Affiliation(s)
- Leonid M Litvak
- Eaton-Peabody Laboratory, Massachusetts Eye and Ear Infirmary, 243 Charles Street, Boston, Massachusetts 02114, USA.
| | | | | |
Collapse
|
42
|
Doucet JR, Ryugo DK. Axonal pathways to the lateral superior olive labeled with biotinylated dextran amine injections in the dorsal cochlear nucleus of rats. J Comp Neurol 2003; 461:452-65. [PMID: 12746862 DOI: 10.1002/cne.10722] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
The lateral superior olive (LSO) contains cells that are sensitive to intensity differences between the two ears, a feature used by the brain to localize sounds in space. This report describes a source of input to the LSO that complements bushy cell projections from the ventral cochlear nucleus (VCN). Injections of biotinylated dextran amine (BDA) into the dorsal cochlear nucleus (DCN) of the rat label axons and swellings in several brainstem structures, including the ipsilateral LSO. Labeling in the ipsilateral LSO was confined to a thin band that extended throughout the length of the structure such that it resembled an LSO isofrequency lamina. The source of this labeled pathway was not obvious, because DCN neurons do not project to the LSO, and VCN bushy cells were not filled by these injections. Filled neurons in several brainstem structures emerged as possible sources. Three observations suggest that most of the axonal labeling in the LSO derives from a single source. First, the number of labeled VCN planar multipolar cells and the amount of labeling in the LSO were consistent and robust across animals. In contrast, the number of labeled cells in most other structures was small and highly variable. Second, the locations of planar cells and filled axons in the LSO were related topographically to the position of the DCN injection site. Third, labeled terminal arborizations in the LSO arose from collaterals of axons in the trapezoid body (output tract of planar cells). We infer that planar multipolar cells, in addition to bushy cells, are a source of ascending input from the cochlear nucleus to the LSO.
Collapse
Affiliation(s)
- John R Doucet
- Department of Otolaryngology-Head and Neck Surgery, Center for Hearing Sciences, Johns Hopkins University School of Medicine, Baltimore, Maryland 21205, USA.
| | | |
Collapse
|
43
|
Moore BCJ. Coding of sounds in the auditory system and its relevance to signal processing and coding in cochlear implants. Otol Neurotol 2003; 24:243-54. [PMID: 12621339 DOI: 10.1097/00129492-200303000-00019] [Citation(s) in RCA: 123] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
OBJECTIVE To review how the properties of sounds are "coded" in the normal auditory system and to discuss the extent to which cochlear implants can and do represent these codes. DATA SOURCES Data are taken from published studies of the response of the cochlea and auditory nerve to simple and complex stimuli, in both the normal and the electrically stimulated ear. REVIEW CONTENT: The review describes: 1) the coding in the normal auditory system of overall level (which partly determines perceived loudness), spectral shape (which partly determines perceived timbre and the identity of speech sounds), periodicity (which partly determines pitch), and sound location; 2) the role of the active mechanism in the cochlea, and particularly the fast-acting compression associated with that mechanism; 3) the neural response patterns evoked by cochlear implants; and 4) how the response patterns evoked by implants differ from those observed in the normal auditory system in response to sound. A series of specific issues is then discussed, including: 1) how to compensate for the loss of cochlear compression; 2) the effective number of independent channels in a normal ear and in cochlear implantees; 3) the importance of independence of responses across neurons; 4) the stochastic nature of normal neural responses; 5) the possible role of across-channel coincidence detection; and 6) potential benefits of binaural implantation. CONCLUSIONS Current cochlear implants do not adequately reproduce several aspects of the neural coding of sound in the normal auditory system. Improved electrode arrays and coding systems may lead to improved coding and, it is hoped, to better performance.
Collapse
Affiliation(s)
- Brian C J Moore
- Department of Experimental Psychology, University of Cambridge, Downing Street, Cambridge CB2 3EB, England.
| |
Collapse
|
44
|
Heinz MG, Colburn HS, Carney LH. Quantifying the implications of nonlinear cochlear tuning for auditory-filter estimates. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2002; 111:996-1011. [PMID: 11863202 DOI: 10.1121/1.1436071] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Abstract
The relation between auditory filters estimated from psychophysical methods and peripheral tuning was evaluated using a computational auditory-nerve (AN) model that included many of the response properties associated with nonlinear cochlear tuning. The phenomenological AN model included the effects of dynamic level-dependent tuning, compression, and suppression on the responses of high-, medium-, and low-spontaneous-rate AN fibers. Signal detection theory was used to evaluate psychophysical performance limits imposed by the random nature of AN discharges and by random-noise stimuli. The power-spectrum model of masking was used to estimate psychophysical auditory filters from predicted AN-model detection thresholds for a tone signal in fixed-level notched-noise maskers. Results demonstrate that the role of suppression in broadening peripheral tuning in response to the noise masker has implications for the interpretation of psychophysical auditory-filter estimates. Specifically, the estimated psychophysical auditory-filter equivalent-rectangular bandwidths (ERBs) that were derived from the nonlinear AN model with suppression always overestimated the ERBs of the low-level peripheral model filters. Further, this effect was larger for an 8-kHz signal than for a 2-kHz signal, suggesting a potential characteristic-frequency (CF) dependent bias in psychophysical estimates of auditory filters due to the increase in strength of cochlear nonlinearity with increases in CF.
Collapse
Affiliation(s)
- Michael G Heinz
- Speech and Hearing Sciences Program, Division of Health Sciences and Technology, Massachusetts Institute of Technology, Cambridge 02139, USA.
| | | | | |
Collapse
|