1
van Bentum GC, van Wanrooij MM, van Opstal AJ. Spatiotemporal factors influence sound-source segregation in localization behavior. J Neurophysiol 2020; 125:556-567. [PMID: 33378250 DOI: 10.1152/jn.00184.2020] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Indexed: 11/22/2022]
Abstract
To program a goal-directed response in the presence of acoustic reflections, the audio-motor system should suppress the detection of time-delayed sources. We examined the effects of spatial separation and interstimulus delay on the ability of human listeners to localize a pair of broadband sounds in the horizontal plane. Participants indicated how many sounds were heard and where these were perceived by making one or two head-orienting localization responses. Results suggest that perceptual fusion of the two sounds depends on delay and spatial separation. Leading and lagging stimuli in close spatial proximity required longer stimulus delays to be perceptually separated than those further apart. Whenever participants heard one sound, their localization responses for synchronous sounds were oriented to a weighted average of both source locations. For short delays, responses were directed toward the leading stimulus location. Increasing spatial separation enhanced this effect. For longer delays, responses were again directed toward a weighted average. When participants perceived two sounds, the first and the second response were directed to either of the leading and lagging source locations. Perceived locations were interchanged often in their temporal order (in ∼40% of trials). We show that the percept of two sounds occurring requires sufficient spatiotemporal separation, after which localization can be performed with high accuracy. We propose that the percept of temporal order of two concurrent sounds results from a different process than localization and discuss how dynamic lateral excitatory-inhibitory interactions within a spatial sensorimotor map could explain the findings.
NEW & NOTEWORTHY: Sound localization requires spectral and temporal processing of implicit acoustic cues, and is seriously challenged when multiple sources coincide closely in space and time. We systematically varied spatial-temporal disparities for two sounds and instructed listeners to generate goal-directed head movements. We found that even when the auditory system has accurate representations of both sources, it still has trouble to decide whether the scene contained one or two sounds, and in which order they appeared.
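The "weighted average of both source locations" mentioned in this abstract can be illustrated with a minimal sketch. The function name and the `lead_weight` parameter are hypothetical and are not taken from the paper; the abstract does not state how the weights were estimated.

```python
def weighted_average_response(lead_deg, lag_deg, lead_weight):
    """Predicted single-response azimuth (deg) as a weighted mean of two sources.

    lead_weight is a hypothetical parameter in [0, 1]:
    0.5 gives the midpoint between the sources,
    1.0 gives full dominance of the leading source.
    """
    if not 0.0 <= lead_weight <= 1.0:
        raise ValueError("lead_weight must lie in [0, 1]")
    return lead_weight * lead_deg + (1.0 - lead_weight) * lag_deg


# Illustrative values only: sources at -20 and +20 deg azimuth.
midpoint = weighted_average_response(-20.0, 20.0, 0.5)
lead_dominant = weighted_average_response(-20.0, 20.0, 1.0)
```

Under this reading, equal weights describe a response aimed midway between the two sources, and a weight near 1 describes a response aimed at the leading source.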
Affiliation(s)
- Guus C van Bentum
  - Department of Biophysics, Donders Center for Neuroscience, Donders Institute for Brain, Cognition, and Behaviour, Radboud University Nijmegen, Nijmegen, The Netherlands
- Marc M van Wanrooij
  - Department of Biophysics, Donders Center for Neuroscience, Donders Institute for Brain, Cognition, and Behaviour, Radboud University Nijmegen, Nijmegen, The Netherlands
- A John van Opstal
  - Department of Biophysics, Donders Center for Neuroscience, Donders Institute for Brain, Cognition, and Behaviour, Radboud University Nijmegen, Nijmegen, The Netherlands
2
Anbuhl KL, Benichoux V, Greene NT, Brown AD, Tollin DJ. Development of the head, pinnae, and acoustical cues to sound location in a precocial species, the guinea pig (Cavia porcellus). Hear Res 2017; 356:35-50. [PMID: 29128159 PMCID: PMC5705338 DOI: 10.1016/j.heares.2017.10.015] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Received: 05/19/2017] [Revised: 09/23/2017] [Accepted: 10/30/2017] [Indexed: 11/26/2022]
Abstract
The morphology of the head and pinna shape the spatial and frequency dependence of sound propagation that give rise to the acoustic cues to sound source location. During early development, the physical dimensions of the head and pinna increase rapidly. Thus, the binaural (interaural time and level differences, ITD and ILD) and monaural (spectral shape) cues are also hypothesized to change rapidly. Complex interactions between the size and shape of the head and pinna limit the accuracy of simple acoustical models (e.g. spherical) and necessitate empirical measurements. Here, we measured the cues to location in the developing guinea pig, a precocial species commonly used for studies of the auditory system. We measured directional transfer functions (DTFs) and the dimensions of the head and pinna in guinea pigs from birth (P0) through adulthood. Dimensions of the head and pinna increased by 87% and 48%, respectively, reaching adult values by ∼8 weeks (P56). The monaural acoustic gain produced by the head and pinna increased with frequency and age, with maximum gains at higher frequencies (>8 kHz) reaching values of 10-21 dB for all ages. The center frequency of monaural spectral notches also decreased with age, from higher frequencies (∼17 kHz) at P0 to lower frequencies (∼12 kHz) in adults. In all animals, ILDs and ITDs were dependent on both frequency and spatial location. Over development, the maximum ILD magnitude increased from ∼15 dB at P0 to ∼30 dB in adults (at frequencies >8 kHz), while the maximum low frequency ITDs increased from ∼185 μs at P0 to ∼300 μs in adults. These results demonstrate that the changes in the acoustical cues are directly related to changes in head and pinna morphology.
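As an illustration of the kind of "simple acoustical model (e.g. spherical)" that this abstract contrasts with empirical measurement, the sketch below uses the classic Woodworth rigid-sphere approximation, ITD = (a/c)(θ + sin θ). This is a swapped-in textbook formula, not the authors' method. The head radius in the example is a hypothetical value, not a measurement from the paper. The formula is a high-frequency approximation, so its output should not be compared directly with the low-frequency ITDs reported above.

```python
import math

SPEED_OF_SOUND_M_S = 343.0  # assumed speed of sound in air at about 20 °C


def woodworth_itd_us(head_radius_m, azimuth_deg):
    """Woodworth spherical-head ITD in microseconds.

    Valid for source azimuths between 0 and 90 degrees from the midline.
    """
    if not 0.0 <= azimuth_deg <= 90.0:
        raise ValueError("azimuth_deg must lie in [0, 90]")
    theta = math.radians(azimuth_deg)
    return 1e6 * head_radius_m / SPEED_OF_SOUND_M_S * (theta + math.sin(theta))


# Hypothetical 2 cm head radius, source directly to one side.
itd_side_us = woodworth_itd_us(0.02, 90.0)
```

In this model the ITD scales linearly with head radius, which is the basic reason a growing head is expected to produce larger ITDs.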
Affiliation(s)
- Kelsey L Anbuhl
  - Neuroscience Training Program, University of Colorado Anschutz Medical Campus, Aurora, CO 80045, USA
- Victor Benichoux
  - Department of Physiology & Biophysics, University of Colorado Anschutz Medical Campus, Aurora, CO 80045, USA
- Nathaniel T Greene
  - Department of Physiology & Biophysics, University of Colorado Anschutz Medical Campus, Aurora, CO 80045, USA; Department of Otolaryngology, University of Colorado School of Medicine, Aurora, CO 80045, USA
- Andrew D Brown
  - Department of Physiology & Biophysics, University of Colorado Anschutz Medical Campus, Aurora, CO 80045, USA
- Daniel J Tollin
  - Neuroscience Training Program, University of Colorado Anschutz Medical Campus, Aurora, CO 80045, USA; Department of Physiology & Biophysics, University of Colorado Anschutz Medical Campus, Aurora, CO 80045, USA; Department of Otolaryngology, University of Colorado School of Medicine, Aurora, CO 80045, USA
3
Gai Y, Ruhland JL, Yin TCT. Behavior and modeling of two-dimensional precedence effect in head-unrestrained cats. J Neurophysiol 2015; 114:1272-85. [PMID: 26133795 DOI: 10.1152/jn.00214.2015] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Received: 03/03/2015] [Accepted: 06/29/2015] [Indexed: 11/22/2022]
Abstract
The precedence effect (PE) is an auditory illusion that occurs when listeners localize nearly coincident and similar sounds from different spatial locations, such as a direct sound and its echo. It has mostly been studied in humans and animals with immobile heads in the horizontal plane; speaker pairs were often symmetrically located in the frontal hemifield. The present study examined the PE in head-unrestrained cats for a variety of paired-sound conditions along the horizontal, vertical, and diagonal axes. Cats were trained with operant conditioning to direct their gaze to the perceived sound location. Stereotypical PE-like behaviors were observed for speaker pairs placed in azimuth or diagonally in the frontal hemifield as the interstimulus delay was varied. For speaker pairs in the median sagittal plane, no clear PE-like behavior occurred. Interestingly, when speakers were placed diagonally in front of the cat, certain PE-like behavior emerged along the vertical dimension. However, PE-like behavior was not observed when both speakers were located in the left hemifield. A Hodgkin-Huxley model was used to simulate responses of neurons in the medial superior olive (MSO) to sound pairs in azimuth. The novel simulation incorporated a low-threshold potassium current and frequency mismatches to generate internal delays. The model exhibited distinct PE-like behavior, such as summing localization and localization dominance. The simulation indicated that certain encoding of the PE could have occurred before information reaches the inferior colliculus, and MSO neurons with binaural inputs having mismatched characteristic frequencies may play an important role.
Affiliation(s)
- Yan Gai
  - Department of Neuroscience, University of Wisconsin, Madison, Wisconsin; and Department of Biomedical Engineering, Saint Louis University, St. Louis, Missouri
- Janet L Ruhland
  - Department of Neuroscience, University of Wisconsin, Madison, Wisconsin
- Tom C T Yin
  - Department of Neuroscience, University of Wisconsin, Madison, Wisconsin
4
Brown AD, Stecker GC, Tollin DJ. The precedence effect in sound localization. J Assoc Res Otolaryngol 2015; 16:1-28. [PMID: 25479823 PMCID: PMC4310855 DOI: 10.1007/s10162-014-0496-2] [Citation(s) in RCA: 64] [Impact Index Per Article: 7.1] [Received: 01/24/2014] [Accepted: 10/13/2014] [Indexed: 11/29/2022]
Abstract
In ordinary listening environments, acoustic signals reaching the ears directly from real sound sources are followed after a few milliseconds by early reflections arriving from nearby surfaces. Early reflections are spectrotemporally similar to their source signals but commonly carry spatial acoustic cues unrelated to the source location. Humans and many other animals, including nonmammalian and even invertebrate animals, are nonetheless able to effectively localize sound sources in such environments, even in the absence of disambiguating visual cues. Robust source localization despite concurrent or nearly concurrent spurious spatial acoustic information is commonly attributed to an assortment of perceptual phenomena collectively termed "the precedence effect," characterizing the perceptual dominance of spatial information carried by the first-arriving signal. Here, we highlight recent progress and changes in the understanding of the precedence effect and related phenomena.
Affiliation(s)
- Andrew D. Brown
  - Department of Physiology and Biophysics, University of Colorado School of Medicine, Aurora, CO 80045 USA
- G. Christopher Stecker
  - Department of Hearing and Speech Sciences, Vanderbilt University Medical Center, Nashville, TN 37232 USA
- Daniel J. Tollin
  - Department of Physiology and Biophysics, University of Colorado School of Medicine, Aurora, CO 80045 USA
5
Gai Y, Ruhland JL, Yin TCT. Localization of click trains and speech by cats: the negative level effect. J Assoc Res Otolaryngol 2014; 15:789-800. [PMID: 24942705 DOI: 10.1007/s10162-014-0469-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/14/2014] [Accepted: 05/28/2014] [Indexed: 11/30/2022]
Abstract
Although localization of sound in elevation is believed to depend on spectral cues, it has been shown with human listeners that the temporal features of sound can also greatly affect localization performance. Of particular interest is a phenomenon known as the negative level effect, which describes the deterioration of localization ability in elevation with increasing sound level and is observed only with impulsive or short-duration sound. The present study uses the gaze positions of domestic cats as measures of perceived locations of sound targets varying in azimuth and elevation. The effects of sound level on localization in terms of accuracy, precision, and response latency were tested for sound with different temporal features, such as a click train, a single click, a continuous sound that had the same frequency spectrum of the click train, and speech segments. In agreement with previous human studies, negative level effects were only observed with click-like stimuli and only in elevation. In fact, localization of speech sounds in elevation benefited significantly when the sound level increased. Our findings indicate that the temporal continuity of a sound can affect the frequency analysis performed by the auditory system, and the variation in the frequency spectrum contained in speech sound does not interfere much with the spectral coding for its location in elevation.
Affiliation(s)
- Yan Gai
  - Department of Neuroscience, University of Wisconsin-Madison, Medical Sciences Building, 251 SMI, 1300 University Avenue, Madison, WI, 53706, USA
6
Tolnai S, Litovsky RY, King AJ. The precedence effect and its buildup and breakdown in ferrets and humans. J Acoust Soc Am 2014; 135:1406-1418. [PMID: 24606278 PMCID: PMC4153988 DOI: 10.1121/1.4864486] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Received: 07/11/2013] [Revised: 01/22/2014] [Accepted: 01/27/2014] [Indexed: 06/03/2023]
Abstract
Although many studies have examined the precedence effect (PE), few have tested whether it shows a buildup and breakdown in nonhuman animals comparable to that seen in humans. These processes are thought to reflect the ability of the auditory system to adjust to a listener's acoustic environment, and their mechanisms are still poorly understood. In this study, ferrets were trained on a two-alternative forced-choice task to discriminate the azimuthal direction of brief sounds. In one experiment, pairs of noise bursts were presented from two loudspeakers at different interstimulus delays (ISDs). Results showed that localization performance changed as a function of ISD in a manner consistent with the PE being operative. A second experiment investigated buildup and breakdown of the PE by measuring the ability of ferrets to discriminate the direction of a click pair following presentation of a conditioning train. Human listeners were also tested using this paradigm. In both species, performance was better when the test clicks and conditioning train had the same ISD but deteriorated following a switch in the direction of the leading and lagging sounds between the conditioning train and test clicks. These results suggest that ferrets, like humans, experience a buildup and breakdown of the PE.
Affiliation(s)
- Sandra Tolnai
  - Department of Physiology, Anatomy and Genetics, Sherrington Building, Parks Road, University of Oxford, Oxfordshire, United Kingdom
- Ruth Y Litovsky
  - University of Wisconsin, Waisman Center, 1500 Highland Avenue, Madison, Wisconsin 53705
- Andrew J King
  - Department of Physiology, Anatomy and Genetics, Sherrington Building, Parks Road, University of Oxford, Oxfordshire, United Kingdom
7
Gai Y, Ruhland JL, Yin TCT, Tollin DJ. Behavioral and modeling studies of sound localization in cats: effects of stimulus level and duration. J Neurophysiol 2013; 110:607-20. [PMID: 23657278 DOI: 10.1152/jn.01019.2012] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Indexed: 11/22/2022]
Abstract
Sound localization accuracy in elevation can be affected by sound spectrum alteration. Correspondingly, any stimulus manipulation that causes a change in the peripheral representation of the spectrum may degrade localization ability in elevation. The present study examined the influence of sound duration and level on localization performance in cats with the head unrestrained. Two cats were trained using operant conditioning to indicate the apparent location of a sound via gaze shift, which was measured with a search-coil technique. Overall, neither sound level nor duration had a notable effect on localization accuracy in azimuth, except at near-threshold levels. In contrast, localization accuracy in elevation improved as sound duration increased, and sound level also had a large effect on localization in elevation. For short-duration noise, the performance peaked at intermediate levels and deteriorated at low and high levels; for long-duration noise, this "negative level effect" at high levels was not observed. Simulations based on an auditory nerve model were used to explain the above observations and to test several hypotheses. Our results indicated that neither the flatness of sound spectrum (before the sound reaches the inner ear) nor the peripheral adaptation influences spectral coding at the periphery for localization in elevation, whereas neural computation that relies on "multiple looks" of the spectral analysis is critical in explaining the effect of sound duration, but not level. The release of negative level effect observed for long-duration sound could not be explained at the periphery and, therefore, is likely a result of processing at higher centers.
Affiliation(s)
- Yan Gai
  - Department of Neuroscience, University of Wisconsin, Madison, WI 53706, USA
8
Tollin DJ, Ruhland JL, Yin TCT. The role of spectral composition of sounds on the localization of sound sources by cats. J Neurophysiol 2012; 109:1658-68. [PMID: 23274314 DOI: 10.1152/jn.00358.2012] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.3] [Indexed: 11/22/2022]
Abstract
Sound localization along the azimuthal dimension depends on interaural time and level disparities, whereas localization in elevation depends on broadband power spectra resulting from the filtering properties of the head and pinnae. We trained cats with their heads unrestrained, using operant conditioning to indicate the apparent locations of sounds via gaze shift. Targets consisted of broadband (BB), high-pass (HP), or low-pass (LP) noise, tones from 0.5 to 14 kHz, and 1/6 octave narrow-band (NB) noise with center frequencies ranging from 6 to 16 kHz. For each sound type, localization performance was summarized by the slope of the regression relating actual gaze shift to desired gaze shift. Overall localization accuracy for BB noise was comparable in azimuth and in elevation but was markedly better in azimuth than in elevation for sounds with limited spectra. Gaze shifts to targets in azimuth were most accurate to BB, less accurate for HP, LP, and NB sounds, and considerably less accurate for tones. In elevation, cats were most accurate in localizing BB, somewhat less accurate to HP, and less yet to LP noise (although still with slopes ∼0.60), but they localized NB noise much worse and were unable to localize tones. Deterioration of localization as bandwidth narrows is consistent with the hypothesis that spectral information is critical for sound localization in elevation. For NB noise or tones in elevation, unlike humans, most cats did not have unique responses at different frequencies, and some appeared to respond with a "default" location at all frequencies.
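The summary measure described in this abstract, the slope of the regression relating actual gaze shift to desired gaze shift, can be sketched as an ordinary least-squares fit. The function name and the sample values below are hypothetical and are not data from the study; the abstract does not specify the exact fitting procedure used.

```python
def regression_slope(desired_deg, actual_deg):
    """Least-squares slope of actual gaze shift regressed on desired gaze shift.

    A slope near 1 indicates accurate localization; a slope near 0 indicates
    responses that do not vary with target position.
    """
    n = len(desired_deg)
    if n != len(actual_deg) or n < 2:
        raise ValueError("need two equal-length sequences with at least 2 points")
    mean_x = sum(desired_deg) / n
    mean_y = sum(actual_deg) / n
    sxx = sum((x - mean_x) ** 2 for x in desired_deg)
    if sxx == 0:
        raise ValueError("desired gaze shifts must not all be identical")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(desired_deg, actual_deg))
    return sxy / sxx


# Made-up example: responses that undershoot every target by 40 percent.
targets = [-20.0, -10.0, 0.0, 10.0, 20.0]
responses = [0.6 * t for t in targets]
slope = regression_slope(targets, responses)
```

With these made-up responses the fitted slope is about 0.6, comparable in form to the "slopes ∼0.60" quoted for low-pass noise in elevation.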
Affiliation(s)
- Daniel J Tollin
  - Dept. of Physiology and Biophysics, Univ. of Colorado School of Medicine, Aurora, CO 80045, USA