1
|
Irino T, Yokota K, Patterson RD. Improving Auditory Filter Estimation by Incorporating Absolute Threshold and a Level-dependent Internal Noise. Trends Hear 2023; 27:23312165231209750. [PMID: 37905400 PMCID: PMC10619342 DOI: 10.1177/23312165231209750] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2022] [Accepted: 10/07/2023] [Indexed: 11/02/2023] Open
Abstract
Auditory filter (AF) shape has traditionally been estimated with a combination of a notched-noise (NN) masking experiment and a power spectrum model (PSM) of masking. However, there are several challenges that remain in both the simultaneous and forward masking paradigms. We hypothesized that AF shape estimation would be improved if absolute threshold (AT) and a level-dependent internal noise were explicitly represented in the PSM. To document the interaction between NN threshold and AT in normal hearing (NH) listeners, a large set of NN thresholds was measured at four center frequencies (500, 1000, 2000, and 4000 Hz) with the emphasis on low-level maskers. The proposed PSM, consisting of the compressive gammachirp (cGC) filter and three nonfilter parameters, allowed AF estimation over a wide range of frequencies and levels with fewer coefficients and less error than previous models. The results also provided new insights into the nonfilter parameters. The detector signal-to-noise ratio (K ) was found to be constant across signal frequencies, suggesting that no frequency dependence hypothesis is required in the postfiltering process. The ANSI standard "Hearing Level-0dB" function, i.e., AT of NH listeners, could be applied to the frequency distribution of the noise floor for the best AF estimation. The introduction of a level-dependent internal noise could mitigate the nonlinear effects that occur in the simultaneous NN masking paradigm. The new PSM improves the applicability of the model, particularly when the sound pressure level of the NN threshold is close to AT.
Collapse
Affiliation(s)
- Toshio Irino
- Faculty of Systems Engineering, Wakayama University, Japan
| | - Kenji Yokota
- Faculty of Systems Engineering, Wakayama University, Japan
| | - Roy D. Patterson
- Department of Physiology, Development and Neuroscience, University
of Cambridge, UK
| |
Collapse
|
2
|
He W, Burwood G, Fridberger A, Nuttall AL, Ren T. An outer hair cell-powered global hydromechanical mechanism for cochlear amplification. Hear Res 2022; 423:108407. [PMID: 34922772 PMCID: PMC9156726 DOI: 10.1016/j.heares.2021.108407] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/06/2021] [Revised: 11/23/2021] [Accepted: 11/30/2021] [Indexed: 11/04/2022]
Abstract
It is a common belief that the mammalian cochlea achieves its exquisite sensitivity, frequency selectivity, and dynamic range through an outer hair cell-based active process, or cochlear amplification. As a sound-induced traveling wave propagates from the cochlear base toward the apex, outer hair cells at a narrow region amplify the low level sound-induced vibration through a local feedback mechanism. This widely accepted theory has been tested by measuring sound-induced sub-nanometer vibrations within the organ of Corti in the sensitive living cochleae using heterodyne low-coherence interferometry and optical coherence tomography. The aim of this short review is to summarize experimental findings on the cochlear active process by the authors' group. Our data show that outer hair cells are able to generate substantial forces for driving the cochlear partition at all audible frequencies in vivo. The acoustically induced reticular lamina vibration is larger and more broadly tuned than the basilar membrane vibration. The reticular lamina and basilar membrane vibrate approximately in opposite directions at low frequencies and in the same direction at the best frequency. The group delay of the reticular lamina is larger than that of the basilar membrane. The magnitude and phase differences between the reticular lamina and basilar membrane vibration are physiologically vulnerable. These results contradict predictions based on the local feedback mechanism but suggest a global hydromechanical mechanism for cochlear amplification. This article is part of the Special Issue Outer hair cell Edited by Joseph Santos-Sacchi and Kumar Navaratnam.
Collapse
Affiliation(s)
- Wenxuan He
- Department of Otolaryngology, Head and Neck Surgery, Oregon Health & Science University, Portland OR, United States
| | - George Burwood
- Department of Otolaryngology, Head and Neck Surgery, Oregon Health & Science University, Portland OR, United States
| | - Anders Fridberger
- Department of Biomedical and Clinical Sciences, Linköping University, Linköping, Sweden
| | - Alfred L Nuttall
- Department of Otolaryngology, Head and Neck Surgery, Oregon Health & Science University, Portland OR, United States
| | - Tianying Ren
- Department of Otolaryngology, Head and Neck Surgery, Oregon Health & Science University, Portland OR, United States.
| |
Collapse
|
3
|
Leschke J, Orellana GR, Shera CA, Oxenham AJ. Auditory filter shapes derived from forward and simultaneous masking at low frequencies: Implications for human cochlear tuning. Hear Res 2022; 420:108500. [PMID: 35405591 PMCID: PMC9167757 DOI: 10.1016/j.heares.2022.108500] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/17/2021] [Revised: 03/08/2022] [Accepted: 03/28/2022] [Indexed: 01/04/2023]
Abstract
Behavioral forward-masking thresholds with a spectrally notched-noise masker and a fixed low-level probe tone have been shown to provide accurate estimates of cochlear tuning. Estimates using simultaneous masking are similar but generally broader, presumably due to nonlinear cochlear suppression effects. So far, estimates with forward masking have been limited to frequencies of 1 kHz and above. This study used spectrally notched noise under forward and simultaneous masking to estimate frequency selectivity between 200 and 1000 Hz for young adult listeners with normal hearing. Estimates of filter tuning at 1000 Hz were in agreement with previous studies. Estimated tuning broadened below 1000 Hz, with the filter quality factor based on the equivalent rectangular bandwidth (QERB) decreasing more rapidly with decreasing frequency than predicted by previous equations, in line with earlier predictions based on otoacoustic-emission latencies. Estimates from simultaneous masking remained broader than those from forward masking by approximately the same ratio. The new data provide a way to compare human cochlear tuning estimates with auditory-nerve tuning curves from other species across most of the auditory frequency range.
Collapse
Affiliation(s)
| | | | - Christopher A. Shera
- Auditory Research Center, Caruso Department of Otolaryngology, University of Southern California, Los Angeles, CA 90033
| | - Andrew J. Oxenham
- Department of Psychology, University of Minnesota, Minneapolis, MN 55455
| |
Collapse
|
4
|
Jurado C, Chow MYP, Leung KML, Larrea M, Vizuete J, de Cheveigné A, Marquardt T. The Spectral Extent of Phasic Suppression of Loudness and Distortion-Product Otoacoustic Emissions by Infrasound and Low-Frequency Tones. J Assoc Res Otolaryngol 2022; 23:167-181. [PMID: 35132510 PMCID: PMC8964881 DOI: 10.1007/s10162-021-00830-2] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2020] [Accepted: 12/14/2021] [Indexed: 01/13/2023] Open
Abstract
We investigated the effect of a biasing tone close to 5, 15, or 30 Hz on the response to higher-frequency probe tones, behaviorally, and by measuring distortion-product otoacoustic emissions (DPOAEs). The amplitude of the biasing tone was adjusted for criterion suppression of cubic DPOAE elicited by probe tones presented between 0.7 and 8 kHz, or criterion loudness suppression of a train of tone-pip probes in the range 0.125-8 kHz. For DPOAEs, the biasing-tone level for criterion suppression increased with probe-tone frequency by 8-9 dB/octave, consistent with an apex-to-base gradient of biasing-tone-induced basilar membrane displacement, as we verified by computational simulation. In contrast, the biasing-tone level for criterion loudness suppression increased with probe frequency by only 1-3 dB/octave, reminiscent of previously published data on low-side suppression of auditory nerve responses to characteristic frequency tones. These slopes were independent of biasing-tone frequency, but the biasing-tone sensation level required for criterion suppression was ~ 10 dB lower for the two infrasound biasing tones than for the 30-Hz biasing tone. On average, biasing-tone sensation levels as low as 5 dB were sufficient to modulate the perception of higher frequency sounds. Our results are relevant for recent debates on perceptual effects of environmental noise with very low-frequency content and might offer insight into the mechanism underlying low-side suppression.
Collapse
Affiliation(s)
- Carlos Jurado
- UCL Ear Institute, London, WC1X8EE, UK
- Escuela de Ingeniería en Sonido y Acústica, Universidad de Las Américas, Quito, EC170125, Ecuador
| | | | | | - Marcelo Larrea
- Escuela de Ingeniería en Sonido y Acústica, Universidad de Las Américas, Quito, EC170125, Ecuador
| | - Juan Vizuete
- Escuela de Ingeniería en Sonido y Acústica, Universidad de Las Américas, Quito, EC170125, Ecuador
| | - Alain de Cheveigné
- UCL Ear Institute, London, WC1X8EE, UK
- Laboratoire Des Systemes Perceptifs, CNRS UMR 8248, Paris, France
- Departement d'Études Cognitives, École Normale Supérieure, PSL University, Paris, France
| | | |
Collapse
|
5
|
Marrufo-Pérez MI, Lopez-Poveda EA. Adaptation to noise in normal and impaired hearing. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2022; 151:1741. [PMID: 35364964 DOI: 10.1121/10.0009802] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/06/2021] [Accepted: 02/26/2022] [Indexed: 06/14/2023]
Abstract
Many aspects of hearing function are negatively affected by background noise. Listeners, however, have some ability to adapt to background noise. For instance, the detection of pure tones and the recognition of isolated words embedded in noise can improve gradually as tones and words are delayed a few hundred milliseconds in the noise. While some evidence suggests that adaptation to noise could be mediated by the medial olivocochlear reflex, adaptation can occur for people who do not have a functional reflex. Since adaptation can facilitate hearing in noise, and hearing in noise is often harder for hearing-impaired than for normal-hearing listeners, it is conceivable that adaptation is impaired with hearing loss. It remains unclear, however, if and to what extent this is the case, or whether impaired adaptation contributes to the greater difficulties experienced by hearing-impaired listeners understanding speech in noise. Here, we review adaptation to noise, the mechanisms potentially contributing to this adaptation, and factors that might reduce the ability to adapt to background noise, including cochlear hearing loss, cochlear synaptopathy, aging, and noise exposure. The review highlights few knowns and many unknowns about adaptation to noise, and thus paves the way for further research on this topic.
Collapse
Affiliation(s)
- Miriam I Marrufo-Pérez
- Instituto de Neurociencias de Castilla y León, Universidad de Salamanca, Calle Pintor Fernando Gallego 1, 37007 Salamanca, Spain
| | - Enrique A Lopez-Poveda
- Instituto de Neurociencias de Castilla y León, Universidad de Salamanca, Calle Pintor Fernando Gallego 1, 37007 Salamanca, Spain
| |
Collapse
|
6
|
Search for Electrophysiological Indices of Hidden Hearing Loss in Humans: Click Auditory Brainstem Response Across Sound Levels and in Background Noise. Ear Hear 2020; 42:53-67. [PMID: 32675590 DOI: 10.1097/aud.0000000000000905] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
OBJECTIVES Recent studies in animals indicate that even moderate levels of exposure to noise can damage synaptic ribbons between the inner hair cells and auditory nerve fibers without affecting audiometric thresholds, giving rise to the use of the term "hidden hearing loss" (HHL). Despite evidence across several animal species, there is little consistent evidence for HHL in humans. The aim of the study is to evaluate potential electrophysiological changes specific to individuals at risk for HHL. DESIGN Participants forming the high-risk experimental group consisted of 28 young normal-hearing adults who participated in marching band for at least 5 years. Twenty-eight age-matched normal-hearing adults who were not part of the marching band and had little or no history of recreational or occupational exposure to loud sounds formed the low-risk control group. Measurements included pure tone audiometry of conventional and high frequencies, distortion product otoacoustic emissions, and electrophysiological measures of auditory nerve and brainstem function as reflected in the click-evoked auditory brainstem response (ABR). In experiment 1, ABRs were recorded in a quiet background across stimulus levels (30-90 dB nHL) presented in 10 dB steps. In experiment 2, the ABR was elicited by a 70 dB nHL click stimulus presented in a quiet background, and in the presence of simultaneous ipsilateral continuous broadband noise presented at 50, 60, and 70 dB SPL using an insert earphone (Etymotic, ER2). RESULTS There were no differences between the low- and high-risk groups in audiometric thresholds or distortion product otoacoustic emission amplitude. Experiment 1 demonstrated smaller wave-I amplitudes at moderate and high sound levels for high-risk compared to low-risk group with similar wave III and wave V amplitude. Enhanced amplitude ratio V/I, particularly at moderate sound level (60 dB nHL), suggesting central compensation for reduced input from the periphery for high-risk group. The results of experiment 2 show that the decrease in wave I amplitude with increasing background noise level was relatively smaller for the high-risk compared to the low-risk group. However, wave V amplitude reduction was essentially similar for both groups. These results suggest that masking induced wave I amplitude reduction is smaller in individuals at high risk for cochlear synaptopathy. Unlike previous studies, we did not observe a difference in the noise-induced wave V latency shift between low- and high-risk groups. CONCLUSIONS Results of experiment 1 are consistent with findings in both animal studies (that suggest cochlear synaptopathy involving selective damage of low-spontaneous rate and medium-spontaneous rate fibers), and in several human studies that show changes in a range of ABR metrics that suggest the presence of cochlear synaptopathy. However, without postmortem examination by harvesting human temporal bone (the gold standard for identifying synaptopathy) with different noise exposure background, no direct inferences can be derived for the presence/extent of cochlear synaptopathy in high-risk group with high sound over-exposure history. Results of experiment 2 demonstrate that to the extent response amplitude reflects both the number of neural elements responding and the neural synchrony of the responding elements, the relatively smaller change in response amplitude for the high-risk group would suggest a reduced susceptibility to masking. One plausible mechanism would be that suppressive effects that kick in at moderate to high levels are different in these two groups, particularly at moderate levels of the masking noise. Altogether, a larger scale dataset with different noise exposure background, longitudinal measurements (changes due to recreational over-exposure by studying middle-school to high-school students enrolled in marching band) with an array of behavioral and electrophysiological tests are needed to understand the complex pathogenesis of sound over-exposure damage in normal-hearing individuals.
Collapse
|
7
|
Rasetshwane DM, Bosen EC, Kopun JG, Neely ST. Comparison of distortion-product otoacoustic emission and stimulus-frequency otoacoustic emission two-tone suppression in humans. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2019; 146:4481. [PMID: 31893726 PMCID: PMC6930138 DOI: 10.1121/1.5139660] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/15/2019] [Revised: 11/19/2019] [Accepted: 11/20/2019] [Indexed: 06/10/2023]
Abstract
Distortion-product otoacoustic emission (DPOAE) and stimulus-frequency otoacoustic emission (SFOAE) are two types of acoustic signals emitted by the inner ear in response to tonal stimuli. The levels of both emission types may be reduced by the inclusion of additional (suppressor) tones with the stimulus. Comparison of two-tone suppression properties across emission type addresses a clinically relevant question of whether these two types of emission provide similar information about cochlear status. The purpose of this study was to compare DPOAE suppression to SFOAE suppression from the same ear in a group of participants with normal hearing. Probe frequency was approximately 1000 Hz, and the suppressor frequency varied from -1.5 to 0.5 octaves relative to the probe frequency. DPOAE and SFOAE suppression were compared in terms of (1) suppression growth rate (SGR), (2) superimposed suppression tuning curves (STCs), and (3) STC-derived metrics, such as high-frequency slope, cochlear amplifier gain, and QERB (ERB, equivalent rectangular bandwidth). Below the probe frequency, the SGR was slightly greater than one for SFOAEs and slightly less than two for DPOAEs. There were no differences in STC metrics across emission types. These observations may provide useful constraints on physiology-based models of otoacoustic emission suppression.
Collapse
Affiliation(s)
- Daniel M Rasetshwane
- Center for Hearing Research, Boys Town National Research Hospital, 555 North 30th Street, Omaha, Nebraska 68131, USA
| | - Emily C Bosen
- Center for Hearing Research, Boys Town National Research Hospital, 555 North 30th Street, Omaha, Nebraska 68131, USA
| | - Judy G Kopun
- Center for Hearing Research, Boys Town National Research Hospital, 555 North 30th Street, Omaha, Nebraska 68131, USA
| | - Stephen T Neely
- Center for Hearing Research, Boys Town National Research Hospital, 555 North 30th Street, Omaha, Nebraska 68131, USA
| |
Collapse
|
8
|
Kreft HA, Oxenham AJ. Auditory enhancement under forward masking in normal-hearing and hearing-impaired listeners. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2019; 146:3448. [PMID: 31795651 PMCID: PMC6872462 DOI: 10.1121/1.5133629] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/01/2019] [Revised: 10/10/2019] [Accepted: 10/22/2019] [Indexed: 06/10/2023]
Abstract
A target within a spectrally notched masker can be enhanced by a preceding copy of the masker. Enhancement can also increase the effectiveness of the target as a forward masker. Enhancement has been reported in hearing-impaired listeners under simultaneous but not forward masking. However, previous studies of enhancement under forward masking did not fully assess the potential effect of differences in sensation level or spectral resolution between the normal-hearing and hearing-impaired listeners. This study measured enhancement via forward masking in hearing-impaired and age-matched normal-hearing listeners with different spectral notches in the masker, to account for potential differences in frequency selectivity, and with levels equated by adding a background masking noise to equate both sensation level and sound pressure level or by reducing the sound pressure level of the stimuli to equate sensation level. Hearing-impaired listeners showed no significant enhancement, regardless of spectral notch width. Normal-hearing listeners showed enhancement at high levels, but showed less enhancement when sensation levels were reduced to match those of the hearing-impaired group, either by reducing sound levels or by adding a masking noise. The results confirm a lack of forward-masked enhancement in hearing-impaired listeners but suggest this may be partly due to reduced sensation level.
Collapse
Affiliation(s)
- Heather A Kreft
- Department of Psychology, University of Minnesota, Elliott Hall, 75 East River Parkway, Minneapolis, Minnesota 55455, USA
| | - Andrew J Oxenham
- Department of Psychology, University of Minnesota, Elliott Hall, 75 East River Parkway, Minneapolis, Minnesota 55455, USA
| |
Collapse
|
9
|
Probing hair cell's mechano-transduction using two-tone suppression measurements. Sci Rep 2019; 9:4626. [PMID: 30874606 PMCID: PMC6420497 DOI: 10.1038/s41598-019-41112-5] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2018] [Accepted: 03/01/2019] [Indexed: 11/27/2022] Open
Abstract
When two sound tones are delivered to the cochlea simultaneously, they interact with each other in a suppressive way, a phenomenon referred to as two-tone suppression (2TS). This nonlinear response is ascribed to the saturation of the outer hair cell’s mechano-transduction. Thus, 2TS can be used as a non-invasive probe to investigate the fundamental properties of cochlear mechano-transduction. We developed a nonlinear cochlear model in the time domain to interpret 2TS data. The multi-scale model incorporates cochlear fluid dynamics, organ of Corti (OoC) mechanics and outer hair cell electrophysiology. The model simulations of 2TS show that the threshold amplitudes and rates of low-side suppression are dependent on mechano-transduction properties. By comparing model responses to existing 2TS measurement data, we estimate intrinsic characteristics of mechano-transduction such as sensitivity and adaptation. For mechano-transduction sensitivity at the basal location (characteristic frequency of 17 kHz) at 0.06 nm−1, the simulation results agree with 2TS measurements of basilar membrane responses. This estimate is an order of magnitude higher than the values observed in experiments on isolated outer hair cells. The model also demonstrates how the outer hair cell’s adaptation alters the temporal pattern of 2TS by modulating mechano-electrical gain and phase.
Collapse
|
10
|
He W, Kemp D, Ren T. Timing of the reticular lamina and basilar membrane vibration in living gerbil cochleae. eLife 2018; 7:37625. [PMID: 30183615 PMCID: PMC6125122 DOI: 10.7554/elife.37625] [Citation(s) in RCA: 44] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2018] [Accepted: 08/14/2018] [Indexed: 12/22/2022] Open
Abstract
Auditory sensory outer hair cells are thought to amplify sound-induced basilar membrane vibration through a feedback mechanism to enhance hearing sensitivity. For optimal amplification, the outer hair cell-generated force must act on the basilar membrane at an appropriate time at every cycle. However, the temporal relationship between the outer hair cell-driven reticular lamina vibration and the basilar membrane vibration remains unclear. By measuring sub-nanometer vibrations directly from outer hair cells using a custom-built heterodyne low-coherence interferometer, we demonstrate in living gerbil cochleae that the reticular lamina vibration occurs after, not before, the basilar membrane vibration. Both tone- and click-induced responses indicate that the reticular lamina and basilar membrane vibrate in opposite directions at the cochlear base and they oscillate in phase near the best-frequency location. Our results suggest that outer hair cells enhance hearing sensitivity through a global hydromechanical mechanism, rather than through a local mechanical feedback as commonly supposed. What is the quietest sound the ear can detect? All sounds begin as vibrating air molecules, which enter the ear and cause the eardrum to vibrate. We can detect vibrations that move the eardrum by a distance of less than one picometer. That’s one thousandth of a nanometer, or about 100 times smaller than a hydrogen atom. But how does the ear achieve this level of sensitivity? Vibrations of the eardrum cause three small bones within the middle ear to vibrate. The vibrations then spread to the cochlea, a fluid-filled spiral structure in the inner ear. Tiny hair cells lining the cochlea move as a result of the vibrations. There are two types of hair cells: inner and outer. Outer hair cells amplify the vibrations. It is this amplification that enables us to detect such small movements of the eardrum. Inner hair cells then convert the amplified vibrations into electrical signals, which travel via the auditory nerve to the brain. The bases of outer hair cells are connected to a structure called the basilar membrane, while their tops are anchored to a structure called the reticular lamina. It was generally assumed that outer hair cells amplify vibrations of the basilar membrane via a local positive feedback mechanism that requires the hair cells to vibrate first. But by comparing the timing of reticular lamina and basilar membrane vibrations in gerbils, He et al. show that this is not the case. Outer hair cells vibrate after the basilar membrane, not before. This indicates that outer hair cells use a mechanism other than commonly assumed local feedback to amplify sounds. The results presented by He et al. change our understanding of how the cochlea works, and may help bioengineers to design better hearing aids and cochlea implants. Millions of patients worldwide who suffer from hearing loss may ultimately stand to benefit.
Collapse
Affiliation(s)
- Wenxuan He
- Oregon Hearing Research Center, Department of Otolaryngology, Oregon Health & Science University, Portland, United States
| | - David Kemp
- University College London Ear Institute, University College London, London, United Kingdom
| | - Tianying Ren
- Oregon Hearing Research Center, Department of Otolaryngology, Oregon Health & Science University, Portland, United States
| |
Collapse
|
11
|
The Interplay Between Spike-Time and Spike-Rate Modes in the Auditory Nerve Encodes Tone-In-Noise Threshold. J Neurosci 2018; 38:5727-5738. [PMID: 29793977 DOI: 10.1523/jneurosci.3103-17.2018] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2017] [Revised: 05/04/2018] [Accepted: 05/10/2018] [Indexed: 11/21/2022] Open
Abstract
Auditory nerve fibers (ANFs) encode pure tones through two modes of coding, spike time and spike rate, depending on the tone frequency. In response to a low-frequency tone, ANF firing is phase locked to the sinusoidal waveform. Because time coding vanishes with an increase in the tone frequency, high-frequency tone coding relies on the spike rate of the ANFs. Adding a continuous broadband noise to a tone compresses the rate intensity function of ANFs and shifts its dynamic range toward higher intensities. Therefore, the ANFs with high-threshold/low-spontaneous rate (SR) are thought to contribute to behavioral tone detection in noise. However, this theory relies on the discharge rate of the ANFs. The direct comparison with the masking threshold through spike timing, irrespective of the spontaneous rate, has not so far been investigated. Taking advantage of a unique proxy to quantify the spike synchrony (i.e., the shuffle autocorrelogram), we show in female gerbils that high-SR ANFs are more adapted to encode low-frequency thresholds through temporal code, giving them a strong robustness in noise. By comparing behavioral thresholds measured using prepulse inhibition of the acoustical startle reflex with population thresholds calculated from ANFs pooled per octave band, we show that threshold-based spike timing provides a better estimate of behavioral thresholds in the low-frequency range, whereas the high-frequency behavioral thresholds rely on the spiking rate, particularly in noise. This emphasizes the complementarity of temporal and rate modes to code tone-in-noise thresholds over a large range of frequencies.SIGNIFICANCE STATEMENT There is a general agreement that high-threshold/low-spontaneous rate (SR) auditory nerve fibers (ANFs) are of prime importance for tone detection in noise. However, this theory is based on the discharge rate of the fibers. Comparing the behavioral thresholds and single ANF thresholds shows that this is only true in the high-frequency range of tone stimulations. In the low-frequency range of tones (up to 2.7 kHz in the gerbil), the most sensitive ANFs (high-SR fibers) carry neural information through a spike-timing mode, even for noise in which tones do not induce a noticeable increment in the spike rate. This emphasizes the interplay between spike-time and spike-rate modes in the auditory nerve to encode tone-in-noise threshold over a large range of tone frequencies.
Collapse
|
12
|
Jurado C, Gallegos P, Gordillo D, Moore BCJ. The detailed shapes of equal-loudness-level contours at low frequencies. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2017; 142:3821. [PMID: 29289096 DOI: 10.1121/1.5018428] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]
Abstract
High-resolution equal-loudness-level contours (ELCs) were measured over the frequency range 10-250 Hz using 19 normal-hearing subjects. Three levels of the 50-Hz reference sound were used, corresponding to the levels at 50 Hz of the 30-, 50-, and 70-phon standardized ELCs given in ISO-226:2003. The dynamic range of the contours generally decreased with increasing reference level, and the slope was shallow between 10 and 20 Hz, consistent with previous studies. For the lowest level, the ELCs were sometimes but not always smooth and on average followed the standardized 30-phon contour for frequencies above 40 Hz. For the two higher levels, the individual ELCs showed a distinct non-monotonic feature in a "transition region" between about 40 and 100 Hz, where the slope could reach near-zero or even positive values. The pattern of the non-monotonic feature was similar across levels for the subjects for whom it was observed, but the pattern varied across subjects. Below 40 Hz, the slopes of the ELCs increased markedly for all loudness levels, and the levels exceeded those of the standardized ELCs. Systematic deviations from the standardized ELCs were largest for frequencies below 40 Hz for all levels and within the transition region for the two higher levels.
Collapse
Affiliation(s)
- Carlos Jurado
- Escuela de Ingeniería en Sonido y Acústica, Universidad de Las Américas, Avenue Granados and Colimes, Quito EC170125, Ecuador
| | - Pablo Gallegos
- Escuela de Ingeniería en Sonido y Acústica, Universidad de Las Américas, Avenue Granados and Colimes, Quito EC170125, Ecuador
| | - Darío Gordillo
- Escuela de Ingeniería en Sonido y Acústica, Universidad de Las Américas, Avenue Granados and Colimes, Quito EC170125, Ecuador
| | - Brian C J Moore
- Department of Psychology, University of Cambridge, Downing Street, Cambridge CB2 3EB, United Kingdom
| |
Collapse
|
13
|
Jurado C, Marquardt T. The effect of the helicotrema on low-frequency loudness perception. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2016; 140:3799. [PMID: 27908034 DOI: 10.1121/1.4967295] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
Below approximately 40 Hz, the cochlear travelling wave reaches the apex, and differential pressure is shunted through the helicotrema, reducing hearing sensitivity. Just above this corner frequency, a resonance feature is often observed in objectively measured middle-ear-transfer functions (METFs). This study inquires whether overall and fine structure characteristics of the METF are also perceptually evident. Equal-loudness-level contours (ELCs) were measured between 20 and 160 Hz for 14 subjects in a purpose-built test chamber. In addition, the inverse shapes of their METFs were obtained by adjusting the intensity of a low-frequency suppressor tone to maintain an equal suppression depth of otoacoustic emissions for various suppressor tone frequencies (20-250 Hz). For 11 subjects, the METFs showed a resonance. Six of them had coinciding features in both ears, and also in their ELC. For two subjects only the right-ear METF was obtainable, and in one case it was consistent with the ELC. One other subject showed a consistent lack of the feature in their ELC and in both METFs. Although three subjects displayed clear inconsistencies between both measures, the similarity between inverse METF and ELC for most subjects shows that the helicotrema has a marked impact on low-frequency sound perception.
Collapse
Affiliation(s)
- Carlos Jurado
- Section of Acoustics, Department of Electronic Systems, Aalborg University, Fredrik Bajersvej 7-A, Denmark
| | - Torsten Marquardt
- UCL Ear Institute, University College London, 332 Grays Inn Road, London, WC1X 8EE, United Kingdom
| |
Collapse
|
14
|
Sieck NE, Rasetshwane DM, Kopun JG, Jesteadt W, Gorga MP, Neely ST. Multi-tone suppression of distortion-product otoacoustic emissions in humans. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2016; 139:2299. [PMID: 27250125 PMCID: PMC4859829 DOI: 10.1121/1.4946989] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/21/2015] [Revised: 03/30/2016] [Accepted: 04/02/2016] [Indexed: 06/05/2023]
Abstract
The purpose of this study was to investigate the combined effect of multiple suppressors. Distortion-product otoacoustic emission (DPOAE) measurements were made in normal-hearing participants. Primary tones had fixed frequencies (f2 = 4000 Hz; f1 / f2 = 1.22) and a range of levels. Suppressor tones were at three frequencies (fs = 2828, 4100, 4300 Hz) and range of levels. Decrement was defined as the attenuation in DPOAE level due to the presence of a suppressor. A measure of suppression called suppressive intensity was calculated by an equation previously shown to fit DPOAE suppression data. Suppressor pairs, which were the combination of two different frequencies, were presented at levels selected to have equal single-suppressor decrements. A hybrid model that represents a continuum between additive intensity and additive attenuation best described the results. The suppressor pair with the smallest frequency ratio produced decrements that were more consistent with additive intensity. The suppressor pair with the largest frequency ratio produced decrements at the highest level that were consistent with additive attenuation. Other suppressor-pair conditions produced decrements that were intermediate between these two alternative models. The hybrid model provides a useful framework for representing the observed range of interaction when two suppressors are combined.
Collapse
Affiliation(s)
- Nicole E Sieck
- Boys Town National Research Hospital, 555 North 30th Street, Omaha, Nebraska 68131, USA
| | - Daniel M Rasetshwane
- Boys Town National Research Hospital, 555 North 30th Street, Omaha, Nebraska 68131, USA
| | - Judy G Kopun
- Boys Town National Research Hospital, 555 North 30th Street, Omaha, Nebraska 68131, USA
| | - Walt Jesteadt
- Boys Town National Research Hospital, 555 North 30th Street, Omaha, Nebraska 68131, USA
| | - Michael P Gorga
- Boys Town National Research Hospital, 555 North 30th Street, Omaha, Nebraska 68131, USA
| | - Stephen T Neely
- Boys Town National Research Hospital, 555 North 30th Street, Omaha, Nebraska 68131, USA
| |
Collapse
|
15
|
Suppression Measured from Chinchilla Auditory-Nerve-Fiber Responses Following Noise-Induced Hearing Loss: Adaptive-Tracking and Systems-Identification Approaches. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2016; 894:285-295. [PMID: 27080669 PMCID: PMC5069700 DOI: 10.1007/978-3-319-25474-6_30] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
The compressive nonlinearity of cochlear signal transduction, reflecting outer-hair-cell function, manifests as suppressive spectral interactions; e.g., two-tone suppression. Moreover, for broadband sounds, there are multiple interactions between frequency components. These frequency-dependent nonlinearities are important for neural coding of complex sounds, such as speech. Acoustic-trauma-induced outer-hair-cell damage is associated with loss of nonlinearity, which auditory prostheses attempt to restore with, e.g., "multi-channel dynamic compression" algorithms.Neurophysiological data on suppression in hearing-impaired (HI) mammals are limited. We present data on firing-rate suppression measured in auditory-nerve-fiber responses in a chinchilla model of noise-induced hearing loss, and in normal-hearing (NH) controls at equal sensation level. Hearing-impaired (HI) animals had elevated single-fiber excitatory thresholds (by ~ 20-40 dB), broadened frequency tuning, and reduced-magnitude distortion-product otoacoustic emissions; consistent with mixed inner- and outer-hair-cell pathology. We characterized suppression using two approaches: adaptive tracking of two-tone-suppression threshold (62 NH, and 35 HI fibers), and Wiener-kernel analyses of responses to broadband noise (91 NH, and 148 HI fibers). Suppression-threshold tuning curves showed sensitive low-side suppression for NH and HI animals. High-side suppression thresholds were elevated in HI animals, to the same extent as excitatory thresholds. We factored second-order Wiener-kernels into excitatory and suppressive sub-kernels to quantify the relative strength of suppression. We found a small decrease in suppression in HI fibers, which correlated with broadened tuning. These data will help guide novel amplification strategies, particularly for complex listening situations (e.g., speech in noise), in which current hearing aids struggle to restore intelligibility.
Collapse
|
16
|
Clemens J, Rau F, Hennig RM, Hildebrandt KJ. Context-dependent coding and gain control in the auditory system of crickets. Eur J Neurosci 2015; 42:2390-406. [PMID: 26179973 DOI: 10.1111/ejn.13019] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2014] [Revised: 07/07/2015] [Accepted: 07/08/2015] [Indexed: 11/29/2022]
Abstract
Sensory systems process stimuli that greatly vary in intensity and complexity. To maintain efficient information transmission, neural systems need to adjust their properties to these different sensory contexts, yielding adaptive or stimulus-dependent codes. Here, we demonstrated adaptive spectrotemporal tuning in a small neural network, i.e. the peripheral auditory system of the cricket. We found that tuning of cricket auditory neurons was sharper for complex multi-band than for simple single-band stimuli. Information theoretical considerations revealed that this sharpening improved information transmission by separating the neural representations of individual stimulus components. A network model inspired by the structure of the cricket auditory system suggested two putative mechanisms underlying this adaptive tuning: a saturating peripheral nonlinearity could change the spectral tuning, whereas broad feed-forward inhibition was able to reproduce the observed adaptive sharpening of temporal tuning. Our study revealed a surprisingly dynamic code usually found in more complex nervous systems and suggested that stimulus-dependent codes could be implemented using common neural computations.
Collapse
Affiliation(s)
- Jan Clemens
- Behavioral Physiology Group, Department of Biology, Humboldt-Universität zu Berlin, Berlin, Germany.,Bernstein Center for Computational Neuroscience Berlin, Berlin, Germany.,Princeton Neuroscience Institute, Princeton University, Washington Road, Princeton, NJ 08540, USA
| | - Florian Rau
- Behavioral Physiology Group, Department of Biology, Humboldt-Universität zu Berlin, Berlin, Germany
| | - R Matthias Hennig
- Behavioral Physiology Group, Department of Biology, Humboldt-Universität zu Berlin, Berlin, Germany
| | - K Jannis Hildebrandt
- Cluster of Excellence 'Hearing4all', Department for Neuroscience, University of Oldenburg, Oldenburg, Germany.,Research Center Neurosensory Science, University of Oldenburg, Oldenburg, Germany
| |
Collapse
|
17
|
Rasetshwane DM, Trevino AC, Gombert JN, Liebig-Trehearn L, Kopun JG, Jesteadt W, Neely ST, Gorga MP. Categorical loudness scaling and equal-loudness contours in listeners with normal hearing and hearing loss. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2015; 137:1899-913. [PMID: 25920842 PMCID: PMC4417023 DOI: 10.1121/1.4916605] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/10/2023]
Abstract
This study describes procedures for constructing equal-loudness contours (ELCs) in units of phons from categorical loudness scaling (CLS) data and characterizes the impact of hearing loss on these estimates of loudness. Additionally, this study developed a metric, level-dependent loudness loss, which uses CLS data to specify the deviation from normal loudness perception at various loudness levels and as function of frequency for an individual listener with hearing loss. CLS measurements were made in 87 participants with hearing loss and 61 participants with normal hearing. An assessment of the reliability of CLS measurements was conducted on a subset of the data. CLS measurements were reliable. There was a systematic increase in the slope of the low-level segment of the CLS functions with increase in the degree of hearing loss. ELCs derived from CLS measurements were similar to standardized ELCs (International Organization for Standardization, ISO 226:2003). The presence of hearing loss decreased the vertical spacing of the ELCs, reflecting loudness recruitment and reduced cochlear compression. Representing CLS data in phons may lead to wider acceptance of CLS measurements. Like the audiogram that specifies hearing loss at threshold, level-dependent loudness loss describes deficit for suprathreshold sounds. Such information may have implications for the fitting of hearing aids.
Collapse
Affiliation(s)
- Daniel M Rasetshwane
- Center for Hearing Research, Boys Town National Research Hospital, 555 North 30th Street, Omaha, Nebraska 68131
| | - Andrea C Trevino
- Center for Hearing Research, Boys Town National Research Hospital, 555 North 30th Street, Omaha, Nebraska 68131
| | - Jessa N Gombert
- Center for Hearing Research, Boys Town National Research Hospital, 555 North 30th Street, Omaha, Nebraska 68131
| | - Lauren Liebig-Trehearn
- Center for Hearing Research, Boys Town National Research Hospital, 555 North 30th Street, Omaha, Nebraska 68131
| | - Judy G Kopun
- Center for Hearing Research, Boys Town National Research Hospital, 555 North 30th Street, Omaha, Nebraska 68131
| | - Walt Jesteadt
- Center for Hearing Research, Boys Town National Research Hospital, 555 North 30th Street, Omaha, Nebraska 68131
| | - Stephen T Neely
- Center for Hearing Research, Boys Town National Research Hospital, 555 North 30th Street, Omaha, Nebraska 68131
| | - Michael P Gorga
- Center for Hearing Research, Boys Town National Research Hospital, 555 North 30th Street, Omaha, Nebraska 68131
| |
Collapse
|
18
|
Fletcher M, de Boer J, Krumbholz K. Is off-frequency overshoot caused by adaptation of suppression? J Assoc Res Otolaryngol 2014; 16:241-53. [PMID: 25468405 PMCID: PMC4368652 DOI: 10.1007/s10162-014-0498-0] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2014] [Accepted: 10/29/2014] [Indexed: 11/25/2022] Open
Abstract
This study is concerned with the mechanism of off-frequency overshoot. Overshoot refers to the phenomenon whereby a brief signal presented at the onset of a masker is easier to detect when the masker is preceded by a “precursor” sound (which is often the same as the masker). Overshoot is most prominent when the masker and precursor have a different frequency than the signal (henceforth referred to as “off-frequency overshoot”). It has been suggested that off-frequency overshoot is based on a similar mechanism as “enhancement,” which refers to the perceptual pop-out of a signal after presentation of a precursor that contains a spectral notch at the signal frequency; both have been proposed to be caused by a reduction in the suppressive masking of the signal as a result of the adaptive effect of the precursor (“adaptation of suppression”). In this study, we measured overshoot, suppression, and adaptation of suppression for a 4-kHz sinusoidal signal and a 4.75-kHz sinusoidal masker and precursor, using the same set of participants. We show that, while the precursor yielded strong overshoot and the masker produced strong suppression, the precursor did not appear to cause any reduction (adaptation) of suppression. Predictions based on an established model of the cochlear input–output function indicate that our failure to obtain any adaptation of suppression is unlikely to represent a false negative outcome. Our results indicate that off-frequency overshoot and enhancement are likely caused by different mechanisms. We argue that overshoot may be due to higher-order perceptual factors such as transient masking or attentional diversion, whereas enhancement may be based on mechanisms similar to those that generate the Zwicker tone.
Collapse
|
19
|
Lewis JD, Goodman SS. Basal contributions to short-latency transient-evoked otoacoustic emission components. J Assoc Res Otolaryngol 2014; 16:29-45. [PMID: 25303881 DOI: 10.1007/s10162-014-0493-5] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2014] [Accepted: 09/24/2014] [Indexed: 10/24/2022] Open
Abstract
The presence of short-latency (SL), less compressive-growing components in bandpass-filtered transient-evoked otoacoustic emission (TEOAE) waveforms may implicate contributions from cochlear regions basal to the tonotopic place. Recent empirical work suggests a region of SL generation between ∼1/5 and 1/10-octave basal to the TEOAE frequency's tonotopic place. However, this estimate may be biased to regions closer to the tonotopic place as the TEOAE extraction technique precluded measurement of components with latencies shorter than ∼5 ms. Using a variant of the non-linear, double-evoked extraction paradigm that permitted extraction of components with latencies as early as 1 ms, the current study empirically estimated the spatial-extent of the cochlear region contributing to 2 kHz SL TEOAE components. TEOAEs were evoked during simultaneous presentation of a suppressor stimulus, in order to suppress contributions to the TEOAE from different places along the cochlear partition. Three or four different-latency components of similar frequency content (∼2 kHz) were identified for most subjects. Component latencies ranged from 1.4 to 9.6 ms; latency was predictive of the component's growth rate and the suppressor frequency to which the component's magnitude was most sensitive to change. As component latency decreased, growth became less compressive and suppressor-frequency sensitivity shifted to higher frequencies. The shortest-latency components were most sensitive to suppressors approximately 3/5-octave higher than their nominal frequency of 2 kHz. These results are consistent with a distributed region of generation extending to approximately 3/5-octave basal to the TEOAE frequency's tonotopic place. The empirical estimates of TEOAE generation are similar to model-based estimates where generation of the different-latency components occurs through linear reflection from impedance discontinuities distributed across the cochlear partition.
Collapse
Affiliation(s)
- James D Lewis
- Boys Town National Research Hospital, 555 North 30th Street, Omaha, NE, 68131, USA,
| | | |
Collapse
|
20
|
Charaziak KK, Souza P, Siegel JH. Stimulus-frequency otoacoustic emission suppression tuning in humans: comparison to behavioral tuning. J Assoc Res Otolaryngol 2013; 14:843-62. [PMID: 24013802 DOI: 10.1007/s10162-013-0412-1] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2012] [Accepted: 08/11/2013] [Indexed: 11/30/2022] Open
Abstract
As shown by the work of Kemp and Chum in 1980, stimulus-frequency otoacoustic emission suppression tuning curves (SFOAE STCs) have potential to objectively estimate behaviorally measured tuning curves. To date, this potential has not been tested. This study aims to do so by comparing SFOAE STCs and behavioral measures of tuning (simultaneous masking psychophysical tuning curves, PTCs) in 10 normal-hearing listeners for frequency ranges centered around 1,000 and 4,000 Hz at low probe levels. Additionally, SFOAE STCs were collected for varying conditions (probe level and suppression criterion) to identify the optimal parameters for comparison with behavioral data and to evaluate how these conditions affect the features of SFOAE STCs. SFOAE STCs qualitatively resembled PTCs: they demonstrated band-pass characteristics and asymmetric shapes with steeper high-frequency sides than low, but unlike PTCs they were consistently tuned to frequencies just above the probe frequency. When averaged across subjects the shapes of SFOAE STCs and PTCs showed agreement for most recording conditions, suggesting that PTCs are predominantly shaped by the frequency-selective filtering and suppressive effects of the cochlea. Individual SFOAE STCs often demonstrated irregular shapes (e.g., "double-tips"), particularly for the 1,000-Hz probe, which were not observed for the same subject's PTC. These results show the limited utility of SFOAE STCs to assess tuning in an individual. The irregularly shaped SFOAE STCs may be attributed to contributions from SFOAE sources distributed over a region of the basilar membrane extending beyond the probe characteristic place, as suggested by a repeatable pattern of SFOAE residual phase shifts observed in individual data.
Collapse
Affiliation(s)
- Karolina K Charaziak
- Department of Communication Sciences and Disorders, Northwestern University, School of Communication, 2240 Campus Drive, Evanston, IL, 602080-2952, USA,
| | | | | |
Collapse
|
21
|
Explaining the high voice superiority effect in polyphonic music: evidence from cortical evoked potentials and peripheral auditory models. Hear Res 2013; 308:60-70. [PMID: 23916754 DOI: 10.1016/j.heares.2013.07.014] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/19/2013] [Revised: 07/12/2013] [Accepted: 07/25/2013] [Indexed: 11/23/2022]
Abstract
Natural auditory environments contain multiple simultaneously-sounding objects and the auditory system must parse the incoming complex sound wave they collectively create into parts that represent each of these individual objects. Music often similarly requires processing of more than one voice or stream at the same time, and behavioral studies demonstrate that human listeners show a systematic perceptual bias in processing the highest voice in multi-voiced music. Here, we review studies utilizing event-related brain potentials (ERPs), which support the notions that (1) separate memory traces are formed for two simultaneous voices (even without conscious awareness) in auditory cortex and (2) adults show more robust encoding (i.e., larger ERP responses) to deviant pitches in the higher than in the lower voice, indicating better encoding of the former. Furthermore, infants also show this high-voice superiority effect, suggesting that the perceptual dominance observed across studies might result from neurophysiological characteristics of the peripheral auditory system. Although musically untrained adults show smaller responses in general than musically trained adults, both groups similarly show a more robust cortical representation of the higher than of the lower voice. Finally, years of experience playing a bass-range instrument reduces but does not reverse the high voice superiority effect, indicating that although it can be modified, it is not highly neuroplastic. Results of new modeling experiments examined the possibility that characteristics of middle-ear filtering and cochlear dynamics (e.g., suppression) reflected in auditory nerve firing patterns might account for the higher-voice superiority effect. Simulations show that both place and temporal AN coding schemes well-predict a high-voice superiority across a wide range of interval spacings and registers. Collectively, we infer an innate, peripheral origin for the higher-voice superiority observed in human ERP and psychophysical music listening studies.
Collapse
|
22
|
Rasetshwane DM, Gorga MP, Neely ST. Signal-processing strategy for restoration of cross-channel suppression in hearing-impaired listeners. IEEE Trans Biomed Eng 2013; 61:64-75. [PMID: 23925364 DOI: 10.1109/tbme.2013.2276351] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Because frequency components interact nonlinearly with each other inside the cochlea, the loudness growth of tones is relatively simple in comparison to the loudness growth of complex sounds. The term suppression refers to a reduction in the response growth of one tone in the presence of a second tone. Suppression is a salient feature of normal cochlear processing and contributes to psychophysical masking. Suppression is evident in many measurements of cochlear function in subjects with normal hearing, including distortion-product otoacoustic emissions (DPOAEs). Suppression is also evident, to a lesser extent, in subjects with mild-to-moderate hearing loss. This paper describes a hearing-aid signal-processing strategy that aims to restore both loudness growth and two-tone suppression in hearing-impaired listeners. The prescription of gain for this strategy is based on measurements of loudness by a method known as categorical loudness scaling. The proposed signal-processing strategy reproduces measured DPOAE suppression tuning curves and generalizes to any number of frequency components. The restoration of both normal suppression and normal loudness has the potential to improve hearing-aid performance and user satisfaction.
Collapse
|
23
|
Laback B, Necciari T, Balazs P, Savel S, Ystad S. Simultaneous masking additivity for short Gaussian-shaped tones: spectral effects. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2013; 134:1160-1171. [PMID: 23927115 DOI: 10.1121/1.4812773] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/02/2023]
Abstract
Laback et al. [(2011). J. Acoust. Soc. Am. 129, 888-897] investigated the additivity of nonsimultaneous masking using short Gaussian-shaped tones as maskers and target. The present study involved Gaussian stimuli to measure the additivity of simultaneous masking for combinations of up to four spectrally separated maskers. According to most basilar membrane measurements, the maskers should be processed linearly at the characteristic frequency (CF) of the target. Assuming also compression of the target, all masker combinations should produce excess masking (exceeding linear additivity). The results for a pair of maskers flanking the target indeed showed excess masking. The amount of excess masking could be predicted by a model assuming summation of masker-evoked excitations in intensity units at the target CF and compression of the target, using compressive input/output functions derived from the nonsimultaneous masking study. However, the combinations of lower-frequency maskers showed much less excess masking than predicted by the model. This cannot easily be attributed to factors like off-frequency listening, combination tone perception, or between-masker suppression. It was better predicted, however, by assuming weighted intensity summation of masker excitations. The optimum weights for the lower-frequency maskers were smaller than one, consistent with partial masker compression as indicated by recent psychoacoustic data.
Collapse
Affiliation(s)
- Bernhard Laback
- Acoustics Research Institute, Austrian Academy of Sciences, Wohllebengasse 12-14, A-1040 Vienna, Austria.
| | | | | | | | | |
Collapse
|
24
|
Masking effects in patients with auditory neuropathy-possible involvement of suppression mechanism caused by normal outer hair cell function. Otol Neurotol 2013; 34:868-76. [PMID: 23739557 DOI: 10.1097/mao.0b013e318292fcb3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
OBJECTIVE Variations in the effects of masking noise were evaluated in different pathologies of sensorineural hearing loss. STUDY DESIGN Retrospective chart review. SETTING Tertiary referral center. PATIENTS Fifty-five ears of 30 patients with sensorineural hearing loss who underwent noise audiometry in the Department of Otolaryngology-Head and Neck Surgery, Tohoku University Hospital, since 2010, because of complaints of hearing difficulty in noisy environments. MAIN OUTCOME MEASURES Masked threshold for narrow band and white noise. RESULTS AND DISCUSSION Masking effects in patients with auditory neuropathy were significantly larger than those in patients with other types of hearing losses. Masking effects of broad band white noise were greater than those of narrow band noise. Masking effects could be observed for white noise even in the elevated unmasked threshold region, where little contribution of excitatory masking effect would be expected. The present findings support the idea that the suppression mechanism caused by normal outer hair cell function is important in the masking phenomenon in patients with auditory neuropathy.
Collapse
|
25
|
Birkholz C, Gruhlke A, Neely ST, Kopun J, Tan H, Jesteadt W, Schmid KK, Gorga MP. Growth of suppression using distortion-product otoacoustic emission measurements in hearing-impaired humans. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2012; 132:3305-3318. [PMID: 23145614 PMCID: PMC3505206 DOI: 10.1121/1.4754526] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/14/2012] [Revised: 08/21/2012] [Accepted: 08/28/2012] [Indexed: 05/31/2023]
Abstract
Growth of distortion-product otoacoustic emission suppression was measured in 65 subjects with mild-to-moderate sensorineural hearing loss (HI). Measurements were made at four probe frequencies (f(2)) and up to five L(2) levels. Eleven suppressor frequencies (f(3)) were used for each f(2), L(2) combination. These data were compared to data from normal-hearing (NH) subjects (Gorga et al., 2011a). In both NH and HI subjects, growth of suppression depended on the relation between f(2) and f(3), such that the slope was close to one when f(3) ≈ f(2), steeper than one when f(3) < f(2), and shallower than one when f(3) > f(2). Differences in growth of suppression between NH and HI subjects were not observed for fixed f(2), L(2) combinations, however large differences were observed in suppressor "threshold" when compared at the same probe sensation level (dB SL). Smaller group differences were observed when compared at the same probe sound-pressure level (dB SPL). Therefore, the extent of these differences depended on how probe level (L(2)) was specified. When the results from NH and HI subjects are compared with each other and with psychophysical studies of masking, differences are observed that have implications for the remediation of mild-to-moderate hearing loss.
Collapse
Affiliation(s)
- Cori Birkholz
- Boys Town National Research Hospital, 555 North 30th Street, Omaha, Nebraska 68131, USA
| | | | | | | | | | | | | | | |
Collapse
|
26
|
|
27
|
Abdala C, Keefe DH. Morphological and Functional Ear Development. HUMAN AUDITORY DEVELOPMENT 2012. [DOI: 10.1007/978-1-4614-1421-6_2] [Citation(s) in RCA: 58] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/12/2023]
|
28
|
Rodriguez J, Neely ST. Temporal aspects of suppression in distortion-product otoacoustic emissions. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2011; 129:3082-3089. [PMID: 21568411 PMCID: PMC3108389 DOI: 10.1121/1.3575553] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/28/2011] [Revised: 03/11/2011] [Accepted: 03/15/2011] [Indexed: 05/30/2023]
Abstract
This study examined the time course of cochlear suppression using a tone-burst suppressor to measure decrement of distortion-product otoacoustic emissions (DPOAEs). Seven normal-hearing subjects with ages ranging from 19 to 28 yr participated in the study. Each subject had audiometric thresholds ≤ 15 dB HL [re ANSI (2004) Specifications for Audiometers] for standard octave and inter-octave frequencies from 0.25 to 8 kHz. DPOAEs were elicited by primary tones with f(2) = 4.0 kHz and f(1) = 3.333 kHz (f(2)/f(1) = 1.2). For the f(2), L(2) combination, suppression was measured for three suppressor frequencies: One suppressor below f(2) (3.834 kHz) and two above f(2) (4.166 and 4.282 kHz) at three levels (55, 60, and 65 dB SPL). DPOAE decrement as a function of L(3) for the tone-burst suppressor was similar to decrements obtained with longer duration suppressors. Onset- and setoff- latencies were ≤ 4 ms, in agreement with previous physiological findings in auditory-nerve fiber studies that suggest suppression results from a nearly instantaneous compression of the waveform. Persistence of suppression was absent for the below-frequency suppressor (f(3) = 3.834 kHz) and was ≤ 3 ms for the two above-frequency suppressors (f(3) = 4.166 and 4.282 kHz).
Collapse
Affiliation(s)
- Joyce Rodriguez
- Starkey Hearing Research Center, 2150 Shattuck Avenue, Suite 408, Berkeley, California 94704, USA.
| | | |
Collapse
|
29
|
Gorga MP, Neely ST, Kopun J, Tan H. Growth of suppression in humans based on distortion-product otoacoustic emission measurements. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2011; 129:801-6. [PMID: 21361439 PMCID: PMC3070999 DOI: 10.1121/1.3523287] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/27/2010] [Revised: 11/01/2010] [Accepted: 11/04/2010] [Indexed: 05/30/2023]
Abstract
Distortion-product otoacoustic emissions (DPOAEs) were used to describe suppression growth in normal-hearing humans. Data were collected at eight f(2) frequencies ranging from 0.5 to 8 kHz for L(2) levels ranging from 10 to 60 dB sensation level. For each f(2) and L(2) combination, suppression was measured for nine or eleven suppressor frequencies (f(3)) whose levels varied from -20 to 85 dB sound pressure level (SPL). Suppression grew nearly linearly when f(3) ≈ f(2), grew more rapidly for f(3) < f(2), and grew more slowly for f(3) > f(2). These results are consistent with physiological and mechanical data from lower animals, as well as previous DPOAE data from humans, although no previous DPOAE study has described suppression growth for as wide a range of frequencies and levels. These trends were evident for all f(2) and L(2) combinations; however, some exceptions were noted. Specifically, suppression growth rate was less steep as a function of f(3) for f(2) frequencies ≤ 1 kHz. Thus, despite the qualitative similarities across frequency, there were quantitative differences related to f(2), suggesting that there may be subtle differences in suppression for frequencies above 1 kHz compared to frequencies below 1 kHz.
Collapse
Affiliation(s)
- Michael P Gorga
- Boys Town National Research Hospital, 555 North 30th Street, Omaha, Nebraska 68131, USA.
| | | | | | | |
Collapse
|
30
|
Power-law dynamics in an auditory-nerve model can account for neural adaptation to sound-level statistics. J Neurosci 2010; 30:10380-90. [PMID: 20685981 DOI: 10.1523/jneurosci.0647-10.2010] [Citation(s) in RCA: 51] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
Neurons in the auditory system respond to recent stimulus-level history by adapting their response functions according to the statistics of the stimulus, partially alleviating the so-called "dynamic-range problem." However, the mechanism and source of this adaptation along the auditory pathway remain unknown. Inclusion of power-law dynamics in a phenomenological model of the inner hair cell (IHC)-auditory nerve (AN) synapse successfully explained neural adaptation to sound-level statistics, including the time course of adaptation of the mean firing rate and changes in the dynamic range observed in AN responses. A direct comparison between model responses to a dynamic stimulus and to an "inversely gated" static background suggested that AN dynamic-range adaptation largely results from the adaptation produced by the response history. These results support the hypothesis that the potential mechanism underlying the dynamic-range adaptation observed at the level of the auditory nerve is located peripheral to the spike generation mechanism and central to the IHC receptor potential.
Collapse
|
31
|
Rodríguez J, Neely ST, Patra H, Kopun J, Jesteadt W, Tan H, Gorga MP. The role of suppression in psychophysical tone-on-tone masking. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2010; 127:361-9. [PMID: 20058983 PMCID: PMC2821167 DOI: 10.1121/1.3257224] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/25/2009] [Revised: 09/25/2009] [Accepted: 09/29/2009] [Indexed: 05/28/2023]
Abstract
This study tested the hypothesis that suppression contributes to the difference between simultaneous masking (SM) and forward masking (FM). To obtain an alternative estimate of suppression, distortion-product otoacoustic emissions (DPOAEs) were measured in the presence of a suppressor tone. Psychophysical-masking and DPOAE-suppression measurements were made in 22 normal-hearing subjects for a 4000-Hz signal/f(2) and two masker/suppressor frequencies: 2141 and 4281 Hz. Differences between SM and FM at the same masker level were used to provide a psychophysical estimate of suppression. The increase in L(2) to maintain a constant output (L(d)) provided a DPOAE estimate of suppression for a range of suppressor levels. The similarity of the psychophysical and DPOAE estimates for the two masker/suppressor frequencies suggests that the difference in amount of masking between SM and FM is at least partially due to suppression.
Collapse
Affiliation(s)
- Joyce Rodríguez
- Boys Town National Research Hospital, 555 North 30th Street, Omaha, Nebraska 68131, USA.
| | | | | | | | | | | | | |
Collapse
|
32
|
Abstract
The auditory system operates over a vast range of sound pressure levels (100-120 dB) with nearly constant discrimination ability across most of the range, well exceeding the dynamic range of most auditory neurons (20-40 dB). Dean et al. (2005) have reported that the dynamic range of midbrain auditory neurons adapts to the distribution of sound levels in a continuous, dynamic stimulus by shifting toward the most frequently occurring level. Here, we show that dynamic range adaptation, distinct from classic firing rate adaptation, also occurs in primary auditory neurons in anesthetized cats for tone and noise stimuli. Specifically, the range of sound levels over which firing rates of auditory nerve (AN) fibers grows rapidly with level shifts nearly linearly with the most probable levels in a dynamic sound stimulus. This dynamic range adaptation was observed for fibers with all characteristic frequencies and spontaneous discharge rates. As in the midbrain, dynamic range adaptation improved the precision of level coding by the AN fiber population for the prevailing sound levels in the stimulus. However, dynamic range adaptation in the AN was weaker than in the midbrain and not sufficient (0.25 dB/dB, on average, for broadband noise) to prevent a significant degradation of the precision of level coding by the AN population above 60 dB SPL. These findings suggest that adaptive processing of sound levels first occurs in the auditory periphery and is enhanced along the auditory pathway.
Collapse
|
33
|
Lin CT, Chiu TT, Huang TY, Chao CF, Liang WC, Hsu SH, Ko LW. Assessing effectiveness of various auditory warning signals in maintaining drivers' attention in virtual reality-based driving environments. Percept Mot Skills 2009; 108:825-35. [PMID: 19725318 DOI: 10.2466/pms.108.3.825-835] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Drivers' fatigue contributes to traffic accidents, so drivers must maintain adequate alertness. The effectiveness of audio alarms in maintaining driving performance and characteristics of alarms was studied in a virtural reality-based driving environment. Response time to the car's drifting was measured under seven conditions: with no warnings and with continuous warning tones (500 Hz, 1750 Hz, and 3000 Hz), and with tone bursts at 500 Hz, 1750 Hz, and 3000 Hz. Analyses showed the audio warning signals significantly improved driving. Further, the tones' spectral characteristics significantly influenced the effectiveness of the warning.
Collapse
Affiliation(s)
- Chin-Teng Lin
- Brain Research Center, Department of Electrical and Control Engineering, National Chiao-Tung University, Hsinchu 30013, Taiwan
| | | | | | | | | | | | | |
Collapse
|
34
|
Gans D, Sheykholeslami K, Peterson DC, Wenstrup J. Temporal features of spectral integration in the inferior colliculus: effects of stimulus duration and rise time. J Neurophysiol 2009; 102:167-80. [PMID: 19403742 DOI: 10.1152/jn.91300.2008] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
This report examines temporal features of facilitation and suppression that underlie spectrally integrative responses to complex vocal signals. Auditory responses were recorded from 160 neurons in the inferior colliculus (IC) of awake mustached bats. Sixty-two neurons showed combination-sensitive facilitation: responses to best frequency (BF) signals were facilitated by well-timed signals at least an octave lower in frequency, in the range 16-31 kHz. Temporal features and strength of facilitation were generally unaffected by changes in duration of facilitating signals from 4 to 31 ms. Changes in stimulus rise time from 0.5 to 5.0 ms had little effect on facilitatory strength. These results suggest that low frequency facilitating inputs to high BF neurons have phasic-on temporal patterns and are responsive to stimulus rise times over the tested range. We also recorded from 98 neurons showing low-frequency (11-32 kHz) suppression of higher BF responses. Effects of changing duration were related to the frequency of suppressive signals. Signals<23 kHz usually evoked suppression sustained throughout signal duration. This and other features of such suppression are consistent with a cochlear origin that results in masking of responses to higher, near-BF signal frequencies. Signals in the 23- to 30-kHz range-frequencies in the first sonar harmonic-generally evoked phasic suppression of BF responses. This may result from neural inhibitory interactions within and below IC. In many neurons, we observed two or more forms of the spectral interactions described here. Thus IC neurons display temporally and spectrally complex responses to sound that result from multiple spectral interactions at different levels of the ascending auditory pathway.
Collapse
Affiliation(s)
- Donald Gans
- Department of Anatomy and Neurobiology, Northeastern Ohio University College of Medicine, 4209 State Route 44, PO Box 95, Rootstown, OH 44272, USA
| | | | | | | |
Collapse
|
35
|
Rebound depolarization in single units of the ventral cochlear nucleus: a contribution to grouping by common onset? Neuroscience 2008; 154:139-46. [PMID: 18479835 DOI: 10.1016/j.neuroscience.2008.03.020] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2007] [Revised: 03/10/2008] [Accepted: 03/10/2008] [Indexed: 11/21/2022]
Abstract
Simultaneous grouping by common onset time is believed to be a powerful cue in auditory perception; components that start or stop roughly at the same time are judged as far more likely to have originated from the same source. Here we report a simple experiment designed to simulate a complex psychophysical paradigm first described by Darwin and Sutherland [(1984) Grouping frequency components of vowels. When is a harmonic not a harmonic? Quarterly J of Experimental Psychology: Hum Exp Psychol 36(A):193-208]. It is possible to change the perception of the vowel /I/ to /epsilon/ by manipulating the harmonics around the first formant (F1). Increasing the amplitude of one harmonic around F1 caused the perception of the vowel to change from /I/ to /epsilon/. Extending the increased component before the vowel could, however, greatly reduce this change. The role of neural adaptation in this effect was questioned by repeating the experiment but this time using a 'captor' tone which was switched on with the asynchronous harmonic and off when the vowel started. This time the vowel percept did change in a fashion analogous to the effect of an increase in the amplitude of the fourth harmonic (which is close to F1). This effect was explained by assuming that the captor had grouped with the leading portion of the asynchronous component enabling the remainder of the asynchronous component to be grouped with the remainder of the components. We propose a relatively low-level neuronal explanation for this grouping effect: the captor reduces the neural response to the leading segment of the asynchronous component by activating across-frequency suppression, either from the cochlea, or acting via a wideband inhibitor in the ventral cochlear nucleus. The reduction in neural response results in a release from adaptation with the offset of the captor terminating the inhibition, such that the response to the continuation of that component is now enhanced. Using a simplified paradigm we show that both primary-like and chopper units in the ventral cochlear nucleus of the anesthetized guinea pig may show a rebound in excitation when a captor is positioned so as to stimulate the suppressive sidebands in its receptive field. The strength of the rebound was positively correlated with the strength of the suppression. These and other results are consistent with the view that low-level mechanisms underlie the psychophysical captor effect.
Collapse
|
36
|
Tolnai S, Englitz B, Kopp-Scheinpflug C, Dehmel S, Jost J, Rbsamen R. Dynamic coupling of excitatory and inhibitory responses in the medial nucleus of the trapezoid body. Eur J Neurosci 2008; 27:3191-204. [DOI: 10.1111/j.1460-9568.2008.06292.x] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
37
|
Young ED. Neural representation of spectral and temporal information in speech. Philos Trans R Soc Lond B Biol Sci 2008; 363:923-45. [PMID: 17827107 PMCID: PMC2606788 DOI: 10.1098/rstb.2007.2151] [Citation(s) in RCA: 49] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Speech is the most interesting and one of the most complex sounds dealt with by the auditory system. The neural representation of speech needs to capture those features of the signal on which the brain depends in language communication. Here we describe the representation of speech in the auditory nerve and in a few sites in the central nervous system from the perspective of the neural coding of important aspects of the signal. The representation is tonotopic, meaning that the speech signal is decomposed by frequency and different frequency components are represented in different populations of neurons. Essential to the representation are the properties of frequency tuning and nonlinear suppression. Tuning creates the decomposition of the signal by frequency, and nonlinear suppression is essential for maintaining the representation across sound levels. The representation changes in central auditory neurons by becoming more robust against changes in stimulus intensity and more transient. However, it is probable that the form of the representation at the auditory cortex is fundamentally different from that at lower levels, in that stimulus features other than the distribution of energy across frequency are analysed.
Collapse
Affiliation(s)
- Eric D Young
- Department of Biomedical Engineering, Centre for Hearing and Balance, Johns Hopkins University, 720 Rutland Avenue, Baltimore, MD 21205, USA.
| |
Collapse
|
38
|
Keefe DH, Ellison JC, Fitzpatrick DF, Gorga MP. Two-tone suppression of stimulus frequency otoacoustic emissions. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2008; 123:1479-94. [PMID: 18345837 PMCID: PMC2517244 DOI: 10.1121/1.2828209] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
Stimulus frequency otoacoustic emissions (SFOAEs) measured using a suppressor tone in human ears are analogous to two-tone suppression responses measured mechanically and neurally in mammalian cochleae. SFOAE suppression was measured in 24 normal-hearing adults at octave frequencies (f(p)=0.5-8.0 kHz) over a 40 dB range of probe levels (L(p)). Suppressor frequencies (f(s)) ranged from -2.0 to 0.7 octaves re: f(p), and suppressor levels ranged from just detectable suppression to full suppression. The lowest suppression thresholds occurred for "best" f(s) slightly higher than f(p). SFOAE growth of suppression (GOS) had slopes close to one at frequencies much lower than best f(s), and shallow slopes near best f(s), which indicated compressive growth close to 0.3 dBdB. Suppression tuning curves constructed from GOS functions were well defined at 1, 2, and 4 kHz, but less so at 0.5 and 8.0 kHz. Tuning was sharper at lower L(p) with an equivalent rectangular bandwidth similar to that reported behaviorally for simultaneous masking. The tip-to-tail difference assessed cochlear gain, increasing with decreasing L(p) and increasing f(p) at the lowest L(p) from 32 to 45 dB for f(p) from 1 to 4 kHz. SFOAE suppression provides a noninvasive measure of the saturating nonlinearities associated with cochlear amplification on the basilar membrane.
Collapse
Affiliation(s)
- Douglas H Keefe
- Boys Town National Research Hospital, 555 North 30th Street, Omaha, Nebraska 68131, USA.
| | | | | | | |
Collapse
|
39
|
Yasin I, Plack CJ. The effects of low- and high-frequency suppressors on psychophysical estimates of basilar-membrane compression and gain. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2007; 121:2832-41. [PMID: 17550182 DOI: 10.1121/1.2713675] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/15/2023]
Abstract
Physiological studies suggest that the increase in suppression as a function of suppressor level is greater for a suppressor below than above the signal frequency. This study investigated the pattern of gain reduction underlying this increase in suppression. Temporal masking curves (TMCs) were obtained by measuring the level of a 2.2-kHz sinusoidal off-frequency masker or 4-kHz on-frequency sinusoidal masker required to mask a brief 4-kHz sinusoidal signal at 10 dB SL, for masker-signal intervals of 20-100 ms. TMCs were also obtained in the presence of a 3- or 4.75-kHz sinusoidal suppressor gated with the 4-kHz masker, for suppressor levels of 40-70 dB SPL. The decrease in gain (increase in suppression) as a function of suppressor level was greater with a 3-kHz suppressor than with a 4.75-kHz suppressor, in line with previous findings. Basilar membrane input-output (I/O) functions derived from the TMCs showed a shift to higher input (4-kHz masker) levels of the low-level (linear) portion of the I/O function with the addition of a suppressor, with partial linearization of the function, but no reduction in maximum compression.
Collapse
Affiliation(s)
- Ifat Yasin
- Department of Experimental Psychology, University of Oxford, South Parks Road, Oxford OX1 3UD, United Kingdom.
| | | |
Collapse
|
40
|
Rhode WS. Mutual suppression in the 6 kHz region of sensitive chinchilla cochleae. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2007; 121:2805-18. [PMID: 17550179 DOI: 10.1121/1.2718398] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/15/2023]
Abstract
Basilar membrane (BM) vibration was measured using a displacement measuring interferometer for single-tone and two-tone suppression (2TS) paradigms in the 6-9 kHz region of sensitive chinchilla cochleae that had gains near or better than 60 dB. Based on prior studies of basilar membrane vibration, three significant differences remain between BM and auditory nerve (AN) 2TS responses: (1) suppression thresholds in the tail of tuning curves were much higher in BM than the auditory nerve (AN); (2) rates of suppression were significantly higher in AN than BM; and (3) the amplitude of vibration with low-frequency suppressors was always greater than the single-tone displacement rendering it impossible to explain 2TS rate suppression in the AN. The first two differences are eliminated by the results of the present study while the third remains. Suppression amplitudes greater than 40 dB and rates of suppression larger than 2.5 dB/dB were found for low-frequency suppressors. A correlation between both the gain and nonlinearity of the cochlea and 2TS properties indicates that when sensitive cochleae are studied. The third difference between BM and AN behavior could be strictly a function of the high-pass filter characteristic of the inner hair cells.
Collapse
Affiliation(s)
- William S Rhode
- Department of Physiology, University of Wisconsin, Madison, Wisconsin 53706, USA.
| |
Collapse
|
41
|
Oxenham AJ, Simonson AM, Turicchia L, Sarpeshkar R. Evaluation of companding-based spectral enhancement using simulated cochlear-implant processing. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2007; 121:1709-16. [PMID: 17407907 DOI: 10.1121/1.2434757] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/14/2023]
Abstract
This study tested a time-domain spectral enhancement algorithm that was recently proposed by Turicchia and Sarpeshkar [IEEE Trans. Speech Audio Proc. 13, 243-253 (2005)]. The algorithm uses a filter bank, with each filter channel comprising broadly tuned amplitude compression, followed by more narrowly tuned expansion (companding). Normal-hearing listeners were tested in their ability to recognize sentences processed through a noise-excited envelope vocoder that simulates aspects of cochlear-implant processing. The sentences were presented in a steady background noise at signal-to-noise ratios of 0, 3, and 6 dB and were either passed directly through an envelope vocoder, or were first processed by the companding algorithm. Using an eight-channel envelope vocoder, companding produced small but significant improvements in speech reception. Parametric variations of the companding algorithm showed that the improvement in intelligibility was robust to changes in filter tuning, whereas decreases in the time constants resulted in a decrease in intelligibility. Companding continued to provide a benefit when the number of vocoder frequency channels was increased to sixteen. When integrated within a sixteen-channel cochlear-implant simulator, companding also led to significant improvements in sentence recognition. Thus, companding may represent a readily implementable way to provide some speech recognition benefits to current cochlear-implant users.
Collapse
Affiliation(s)
- Andrew J Oxenham
- Research Laboratory of Electronics, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA.
| | | | | | | |
Collapse
|
42
|
Zilany MSA, Bruce IC. Modeling auditory-nerve responses for high sound pressure levels in the normal and impaired auditory periphery. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2006; 120:1446-66. [PMID: 17004468 DOI: 10.1121/1.2225512] [Citation(s) in RCA: 118] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/09/2023]
Abstract
This paper presents a computational model to simulate normal and impaired auditory-nerve (AN) fiber responses in cats. The model responses match physiological data over a wider dynamic range than previous auditory models. This is achieved by providing two modes of basilar membrane excitation to the inner hair cell (IHC) rather than one. The two modes are generated by two parallel filters, component 1 (C1) and component 2 (C2), and the outputs are subsequently transduced by two separate functions. The responses are then added and passed through the IHC low-pass filter followed by the IHC-AN synapse model and discharge generator. The C1 filter is a narrow-band, chirp filter with the gain and bandwidth controlled by a nonlinear feed-forward control path. This filter is responsible for low and moderate level responses. A linear, static, and broadly tuned C2 filter followed by a nonlinear, inverted and nonrectifying C2 transduction function is critical for producing transition region and high-level effects. Consistent with Kiang's two-factor cancellation hypothesis, the interaction between the two paths produces effects such as the C1/C2 transition and peak splitting in the period histogram. The model responses are consistent with a wide range of physiological data from both normal and impaired ears for stimuli presented at levels spanning the dynamic range of hearing.
Collapse
Affiliation(s)
- Muhammad S A Zilany
- Department of Electrical and Computer Engineering, McMaster University, Hamilton, Ontario L8S 4K1, Canada
| | | |
Collapse
|
43
|
Yasin I, Plack CJ. The role of suppression in the upward spread of masking. J Assoc Res Otolaryngol 2006; 6:368-77. [PMID: 16261268 PMCID: PMC2504625 DOI: 10.1007/s10162-005-0014-7] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2005] [Accepted: 08/11/2005] [Indexed: 11/29/2022] Open
Abstract
The upward spread of masking refers to the higher growth rate of masking for maskers lower in frequency than the signal, compared to maskers at the signal frequency (Wegel RL, Lane CE. The auditory masking of one pure tone by another and its possible relation to the dynamics of the inner ear. Physics Rev. 23:266-285, 1924; Egan JP, Hake HW. On the masking pattern of a simple auditory stimulus. J. Acoust. Soc. Am. 22:622-630, 1950; Delgutte B. Physiological mechanisms of psychophysical masking: Observations from auditory-nerve fibres. J. Acoust. Soc. Am. 87:791-809, 1990a, Delgutte B. Two-tone rate suppression in auditory-nerve fibres: Dependence on suppressor frequency and level. Hear Res. 49:225-246, 1990b). The upward spread of simultaneous masking may arise from a combination of excitatory and suppressive effects. In this study, growth of masking functions were obtained for a 4-kHz signal masked by an on-frequency (4 kHz) or off-frequency (2.4 kHz), simultaneous or forward masker, in the presence of a notched noise with a center frequency of 4 kHz presented to restrict off-frequency listening. Compression was estimated from the slopes of the off-frequency growth of masking functions. Suppression was estimated by comparing the off-frequency simultaneous- and forward-masked growth of masking functions. Results showed that, for midlevel signals (35-60 dB SPL), the compression exponent estimated from simultaneous and forward masking averaged 0.31 and 0.26, respectively. The maximum amount of suppression (defined as the decrease in the basilar-membrane response to the signal) was variable, ranging from about 6 to 17 dB across subjects. Despite the substantial reduction in the response to the signal, the results suggest that suppression has a minimal effect on the slope of the masking function at mid levels. Rather, upward spread of masking seems to be mainly determined by the compressive basilar-membrane response to the signal in relation to the linear response to the lower-frequency masker.
Collapse
Affiliation(s)
- Ifat Yasin
- Department of Experimental Psychology, University of Oxford, South Parks Road, Oxford, UK.
| | | |
Collapse
|
44
|
Oxenham AJ, Simonson AM. Level dependence of auditory filters in nonsimultaneous masking as a function of frequency. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2006; 119:444-53. [PMID: 16454299 PMCID: PMC1752201 DOI: 10.1121/1.2141359] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/06/2023]
Abstract
Auditory filter bandwidths were measured using nonsimultaneous masking, as a function of signal level between 10 and 35 dB SL for signal frequencies of 1, 2, 4, and 6 kHz. The brief sinusoidal signal was presented in a temporal gap within a spectrally notched noise. Two groups of normal-hearing subjects were tested, one using a fixed masker level and adaptively varying signal level, the other using a fixed signal level and adaptively varying masker level. In both cases, auditory filters were derived by assuming a constant filter shape for a given signal level. The filter parameters derived from the two paradigms were not significantly different. At 1 kHz, the equivalent rectangular bandwidth (ERB) decreased as the signal level increased from 10 to 20 dB SL, after which it remained roughly constant. In contrast, at 6 kHz, the ERB increased consistently with signal levels from 10 to 35 dB SL. The results at 2 and 4 kHz were intermediate, showing no consistent change in ERB with signal level. Overall, the results suggest changes in the level dependence of the auditory filters at frequencies above 1 kHz that are not currently incorporated in models of human auditory filter tuning.
Collapse
Affiliation(s)
- Andrew J Oxenham
- Research Laboratory of Electronics, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA.
| | | |
Collapse
|
45
|
Johnson TA, Neely ST, Garner CA, Gorga MP. Influence of primary-level and primary-frequency ratios on human distortion product otoacoustic emissions. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2006; 119:418-28. [PMID: 16454296 PMCID: PMC2440916 DOI: 10.1121/1.2133714] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/06/2023]
Abstract
The combined influence of primary-level differences (L1-L2) and primary-frequency ratio (f2/f1) on distortion product otoacoustic emission (DPOAE) level was investigated in 20 normal-hearing subjects. DPOAEs were recorded with continuously varying stimulus levels [Neely et al. J. Acoust. Soc. Am. 117, 1248-1259 (2005)] for the following stimulus conditions: f2= 1, 2, 4, and 8 kHz and f2/f1=1.05 to 1.4; various L1-L2, including one individually optimized to produce the largest DPOAE. For broadly spaced primary frequencies at low L2 levels, the largest DPOAEs were recorded when L1 was much higher than L2, with L1 remaining relatively constant as L2 increased. As f2/fl decreased, the largest DPOAEs were observed when L1 was closer to L2 and increased as L2 increased. Optimal values for L1-L2 and f2 f1 were derived from these data. In general, average DPOAE levels for the new L1-L2 and f2/f1 were equivalent to or larger than those observed for other stimulus combinations, including the L1-L2 described by Kummer et al. [J. Acoust. Soc. Am. 103, 3431-3444 (1998)] and those defined by Neely et al. in which L1-L2 was evaluated, but f2/f1 was fixed at 1.2.
Collapse
|
46
|
Young ED, Calhoun BM. Nonlinear Modeling of Auditory-Nerve Rate Responses to Wideband Stimuli. J Neurophysiol 2005; 94:4441-54. [PMID: 16162837 DOI: 10.1152/jn.00261.2005] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
The spectral selectivity of auditory nerve fibers was characterized by a method based on responses to random-spectrum-shape stimuli. The method models the average discharge rate of fibers for steady stimuli and is based on responses to ≈100 noise-like stimuli with pseudorandom spectral levels in 1/8- or 1/16-octave frequency bins. The model assumes that rate is determined by a linear weighting of the spectrum plus a second-order weighting of all pairs of spectrum values within a certain frequency range of best frequency. The method allows prediction of rate responses to stimuli with arbitrary wideband spectral shapes, thus providing a direct test of the degree of linearity of spectral processing Auditory-nerve fibers are shown to rely mainly on linear weighting of the stimulus spectrum; however, significant second-order terms are present and are important in predicting responses to random-spectrum shape stimuli, although not for predicting responses to noise filtered with cat head-related transfer functions. The second-order terms weight the products of levels at identical frequencies positively and the products of different frequencies negatively. As such, they model both curvature in the rate versus level function and suppressive interactions between different frequency components. The first- and second-order characterizations derived in this method provide a measure of higher-order nonlinearities in neurons, albeit without providing information about temporal characteristics.
Collapse
Affiliation(s)
- Eric D Young
- Center for Hearing and Balance, Department of Biomedical Engineering, Johns Hopkins School of Medicine, Baltimore, MD 21205, USA.
| | | |
Collapse
|
47
|
Valentine PA, Eggermont JJ. Stimulus dependence of spectro-temporal receptive fields in cat primary auditory cortex. Hear Res 2005; 196:119-33. [PMID: 15464309 DOI: 10.1016/j.heares.2004.05.011] [Citation(s) in RCA: 58] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/24/2004] [Accepted: 05/24/2004] [Indexed: 10/26/2022]
Abstract
The frequency-tuning curve is a static representation of the neuron's sensitivity to stimulus frequency. The temporal aspects of the frequency sensitivity can be captured in the spectro-temporal receptive field (STRF), often presented as the average spectrogram of the stimulus preceding a spike but also as the average frequency-dependent post-stimulus time histogram (PSTH). The temporal envelope of the stimulus produces considerable smoothing, and as a consequence the PSTH representation is finer-grained than the spectrogram representation. Here we compare STRFs for 1/s and 20/s single-frequency stimuli with 120/s steady-state multi-frequency stimuli for 87 recording sites in primary auditory cortex of cats. For the 672 estimated STRFs, which for multi-frequency stimuli were mostly obtained at 55 dB SPL, we found lateral inhibition in 17% of the cases, in 32% post-activation suppression, and in 51% only excitation. In 35% of the recordings the excitatory frequency-tuning curves were very similar for single and multi-frequency stimuli, in the remaining 65% the common finding was the emergence of an intensity independent bandwidth for the multi-frequency stimuli. Comparison of the 20/s and 120/s stimuli showed that the resulting increase in inhibition was strongest in the center of the STRF.
Collapse
Affiliation(s)
- Pamela A Valentine
- Department of Physiology and Biophysics, University of Calgary, Calgary, Alberta, Canada
| | | |
Collapse
|
48
|
Abstract
The nonlinear cochlear phenomenon of two-tone suppression is known to be very fast, but precisely how fast is unknown. We studied the timing of low-side suppression in the auditory nerve of the cat using multitone complexes as auditory stimuli. An evalution of the group delays of the responses to these complexes allowed us to measure the timing of the responses with sub-millisecond accuracy for a large number of fibers with characteristic frequencies (CFs) between 2 and 40 kHz. In particular, we measured the delays with which the same below-CF tone complexes affected the response either as an excitor (when presented alone) or as a suppressor (when combined with a CF probe). For CFs <10 kHz, we found that the delay of suppression was larger than the delay of excitation by several hundred microseconds. The difference between the delay of suppression and that of excitation decreased with increasing CF, becoming negligible for CFs >15 kHz. The results are analyzed in terms of traveling-wave delays and a purported cochlear gain control. The data suggest that suppression originates from a gain-control mechanism with an integration time in the order of two cycles of CF.
Collapse
Affiliation(s)
- Marcel van der Heijden
- Laboratory of Auditory, Neurophysiology, O. and N. Campus Gasthuisberg, Herestraat 49 - bus 801, B-3000 Leuven, Belgium.
| | | |
Collapse
|
49
|
Young ED, Yu JJ, Reiss LAJ. Non‐Linearities and the Representation of Auditory Spectra. INTERNATIONAL REVIEW OF NEUROBIOLOGY 2005; 70:135-68. [PMID: 16472634 DOI: 10.1016/s0074-7742(05)70005-2] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Affiliation(s)
- Eric D Young
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, Maryland 21205, USA
| | | | | |
Collapse
|
50
|
Lopez-Poveda EA. Spectral processing by the peripheral auditory system: facts and models. INTERNATIONAL REVIEW OF NEUROBIOLOGY 2005; 70:7-48. [PMID: 16472630 DOI: 10.1016/s0074-7742(05)70001-5] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/02/2022]
Affiliation(s)
- Enrique A Lopez-Poveda
- Instituto de Neurociencias de Castilla y León, Universidad de Salamanca, Salamanca 37007, Spain
| |
Collapse
|