1
|
Abstract
Identification of speech from a "target" talker was measured in a speech-on-speech
masking task with two simultaneous "masker" talkers. The overall level of each talker was
either fixed or randomized throughout each stimulus presentation to investigate the
effectiveness of level as a cue for segregating competing talkers and attending to the
target. Experimental manipulations included varying the level difference between talkers
and imposing three types of target level uncertainty: 1) fixed target level across trials,
2) random target level across trials, or 3) random target levels on a word-by-word basis
within a trial. When the target level was predictable performance was better than
corresponding conditions when the target level was uncertain. Masker confusions were
consistent with a high degree of informational masking (IM). Furthermore, evidence was
found for "tuning" in level and a level "release" from IM. These findings suggest that
conforming to listener expectation about relative level, in addition to cues signaling
talker identity, facilitates segregation of, and maintaining focus of attention on, a
specific talker in multiple-talker communication situations.
Collapse
Affiliation(s)
- Andrew J Byrne
- Department of Speech, Language, & Hearing Sciences, 1846Boston University, MA, USA
| | - Christopher Conroy
- Department of Speech, Language, & Hearing Sciences, 1846Boston University, MA, USA
| | - Gerald Kidd
- Department of Speech, Language, & Hearing Sciences, 1846Boston University, MA, USA.,Department of Otolaryngology, Head-Neck Surgery, Medical University of South Carolina, Charleston, SC, USA
| |
Collapse
|
2
|
Cabrera L, Lorenzini I, Rosen S, Varnet L, Lorenzi C. Temporal integration for amplitude modulation in childhood: Interaction between internal noise and memory. Hear Res 2021; 415:108403. [PMID: 34879987 DOI: 10.1016/j.heares.2021.108403] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/27/2021] [Revised: 11/17/2021] [Accepted: 11/25/2021] [Indexed: 11/25/2022]
Abstract
It is still unclear whether the gradual improvement in amplitude-modulation (AM) sensitivity typically found in children up to 10 years of age reflects an improvement in "processing efficiency" (the central ability to use information extracted by sensory mechanisms). This hypothesis was tested by evaluating temporal integration for AM, a capacity relying on memory and decision factors. This was achieved by measuring the effect of increasing the number of AM cycles (2 vs 8) on AM-detection thresholds for three groups of children aged from 5 to 11 years and a group of young adults. AM-detection thresholds were measured using a forced-choice procedure and sinusoidal AM (4 or 32 Hz rate) applied to a 1024-Hz pure-tone carrier. All age groups demonstrated temporal integration for AM at both rates; that is, significant improvements in AM sensitivity with a higher number of AM cycles. However, an effect of age is observed as both 5-6 year olds and adults exhibited more temporal integration compared to 7-8 and 10-11 year olds at both rates. This difference is due to: (i) the 5-6 year olds displaying the worst thresholds with 2 AM cycles, but similar thresholds with 8 cycles compared to the 7-8 and 10-11 year olds, and, (ii) adults showing the best thresholds with 8 AM cycles but similar thresholds with 2 cycles compared to the 7-8 and 10-11 year olds. Computational modelling indicated that higher levels of internal noise combined with poorer short-term memory capacities in children accounted for the developmental trends. Improvement in processing efficiency may therefore account for the development of AM detection in childhood. This article is part of the Special Issue Outer hair cell Edited by Joseph Santos-Sacchi and Kumar Navaratnam.
Collapse
Affiliation(s)
- Laurianne Cabrera
- Université de Paris, CNRS, Integrative Neuroscience and Cognition Center, F-75006 Paris, France; Speech, Hearing and Phonetic Sciences, UCL, United Kingdom.
| | - Irene Lorenzini
- Université de Paris, CNRS, Integrative Neuroscience and Cognition Center, F-75006 Paris, France
| | - Stuart Rosen
- Speech, Hearing and Phonetic Sciences, UCL, United Kingdom
| | - Léo Varnet
- Laboratoire des Systèmes Perceptifs (UMR 8248), CNRS, Ecole normale supérieure, Université Paris Sciences & Lettres (PSL), Paris, France
| | - Christian Lorenzi
- Laboratoire des Systèmes Perceptifs (UMR 8248), CNRS, Ecole normale supérieure, Université Paris Sciences & Lettres (PSL), Paris, France
| |
Collapse
|
3
|
Attia S, King A, Varnet L, Ponsot E, Lorenzi C. Double-pass consistency for amplitude- and frequency-modulation detection in normal-hearing listeners. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2021; 150:3631. [PMID: 34852611 DOI: 10.1121/10.0006811] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/08/2020] [Accepted: 10/05/2021] [Indexed: 06/13/2023]
Abstract
Amplitude modulation (AM) and frequency modulation (FM) provide crucial auditory information. If FM is encoded as AM, it should be possible to give a unified account of AM and FM perception both in terms of response consistency and performance. These two aspects of behavior were estimated for normal-hearing participants using a constant-stimuli, forced-choice detection task repeated twice with the same stimuli (double pass). Sinusoidal AM or FM with rates of 2 or 20 Hz were applied to a 500-Hz pure-tone carrier and presented at detection threshold. All stimuli were masked by a modulation noise. Percent agreement of responses across passes and percent-correct detection for the two passes were used to estimate consistency and performance, respectively. These data were simulated using a model implementing peripheral processes, a central modulation filterbank, an additive internal noise, and a template-matching device. Different levels of internal noise were required to reproduce AM and FM data, but a single level could account for the 2- and 20-Hz AM data. As for FM, two levels of internal noise were needed to account for detection at slow and fast rates. Finally, the level of internal noise yielding best predictions increased with the level of the modulation-noise masker. Overall, these results suggest that different sources of internal variability are involved for AM and FM detection at low audio frequencies.
Collapse
Affiliation(s)
- Sarah Attia
- Laboratoire des systèmes perceptifs (CNRS 8248), Département d'études cognitives, Ecole normale supérieure, Université Paris Sciences et Lettres, 29 rue d'Ulm, 75005 Paris, France
| | - Andrew King
- Laboratoire des systèmes perceptifs (CNRS 8248), Département d'études cognitives, Ecole normale supérieure, Université Paris Sciences et Lettres, 29 rue d'Ulm, 75005 Paris, France
| | - Léo Varnet
- Laboratoire des systèmes perceptifs (CNRS 8248), Département d'études cognitives, Ecole normale supérieure, Université Paris Sciences et Lettres, 29 rue d'Ulm, 75005 Paris, France
| | - Emmanuel Ponsot
- Laboratoire des systèmes perceptifs (CNRS 8248), Département d'études cognitives, Ecole normale supérieure, Université Paris Sciences et Lettres, 29 rue d'Ulm, 75005 Paris, France
| | - Christian Lorenzi
- Laboratoire des systèmes perceptifs (CNRS 8248), Département d'études cognitives, Ecole normale supérieure, Université Paris Sciences et Lettres, 29 rue d'Ulm, 75005 Paris, France
| |
Collapse
|
4
|
Buss E, Lorenzi C, Cabrera L, Leibold LJ, Grose JH. Amplitude modulation detection and modulation masking in school-age children and adults. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2019; 145:2565. [PMID: 31046373 PMCID: PMC6909994 DOI: 10.1121/1.5098950] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/10/2019] [Revised: 04/02/2019] [Accepted: 04/03/2019] [Indexed: 05/30/2023]
Abstract
Two experiments were performed to better understand on- and off-frequency modulation masking in normal-hearing school-age children and adults. Experiment 1 estimated thresholds for detecting 16-, 64- or 256-Hz sinusoidal amplitude modulation (AM) imposed on a 4300-Hz pure tone. Thresholds tended to improve with age, with larger developmental effects for 64- and 256-Hz AM than 16-Hz AM. Detection of 16-Hz AM was also measured with a 1000-Hz off-frequency masker tone carrying 16-Hz AM. Off-frequency modulation masking was larger for younger than older children and adults when the masker was gated with the target, but not when the masker was continuous. Experiment 2 measured detection of 16- or 64-Hz sinusoidal AM carried on a bandpass noise with and without additional on-frequency masker AM. Children and adults demonstrated modulation masking with similar tuning to modulation rate. Rate-dependent age effects for AM detection on a pure-tone carrier are consistent with maturation of temporal resolution, an effect that may be obscured by modulation masking for noise carriers. Children were more susceptible than adults to off-frequency modulation masking for gated stimuli, consistent with maturation in the ability to listen selectively in frequency, but the children were not more susceptible to on-frequency modulation masking than adults.
Collapse
Affiliation(s)
- Emily Buss
- Department of Otolaryngology/Head and Neck Surgery, School of Medicine, University of North Carolina, Chapel Hill, North Carolina 27599-7070, USA
| | - Christian Lorenzi
- Laboratoire des Systèmes Perceptifs, Département d'Études Cognitives, Ecole Normale Supérieure, Universite Paris Sciences et Lettres, Centre National de la Recherche Scientifique, Paris, France
| | - Laurianne Cabrera
- Laboratoire de Psychologie de la Perception, Université Paris Descartes, Centre National de la Recherche Scientifique, Paris, France
| | - Lori J Leibold
- Center for Hearing Research, Boys Town National Research Hospital, Omaha, Nebraska 68131, USA
| | - John H Grose
- Department of Otolaryngology/Head and Neck Surgery, School of Medicine, University of North Carolina, Chapel Hill, North Carolina 27599-7070, USA
| |
Collapse
|
5
|
Joosten ERM, Shamma SA, Lorenzi C, Neri P. Dynamic Reweighting of Auditory Modulation Filters. PLoS Comput Biol 2016; 12:e1005019. [PMID: 27398600 PMCID: PMC4939963 DOI: 10.1371/journal.pcbi.1005019] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2015] [Accepted: 06/13/2016] [Indexed: 11/22/2022] Open
Abstract
Sound waveforms convey information largely via amplitude modulations (AM). A large body of experimental evidence has provided support for a modulation (bandpass) filterbank. Details of this model have varied over time partly reflecting different experimental conditions and diverse datasets from distinct task strategies, contributing uncertainty to the bandwidth measurements and leaving important issues unresolved. We adopt here a solely data-driven measurement approach in which we first demonstrate how different models can be subsumed within a common 'cascade' framework, and then proceed to characterize the cascade via system identification analysis using a single stimulus/task specification and hence stable task rules largely unconstrained by any model or parameters. Observers were required to detect a brief change in level superimposed onto random level changes that served as AM noise; the relationship between trial-by-trial noisy fluctuations and corresponding human responses enables targeted identification of distinct cascade elements. The resulting measurements exhibit a dynamic complex picture in which human perception of auditory modulations appears adaptive in nature, evolving from an initial lowpass to bandpass modes (with broad tuning, Q∼1) following repeated stimulus exposure.
Collapse
Affiliation(s)
- Eva R. M. Joosten
- Laboratoire Psychologie de la Perception (CNRS UMR 8242) and Université Paris Descartes, Sorbonne Paris Cité, Paris, France
| | - Shihab A. Shamma
- Laboratoire des Systèmes Perceptifs (CNRS UMR 8248) and Département d’études cognitives, Ecole Normale Supérieure, PSL Research University, Paris, France
- Department of Electrical and Computer Engineering, Institute for Systems Research, University of Maryland, College Park, Maryland, United States of America
| | - Christian Lorenzi
- Laboratoire des Systèmes Perceptifs (CNRS UMR 8248) and Département d’études cognitives, Ecole Normale Supérieure, PSL Research University, Paris, France
| | - Peter Neri
- Laboratoire des Systèmes Perceptifs (CNRS UMR 8248) and Département d’études cognitives, Ecole Normale Supérieure, PSL Research University, Paris, France
| |
Collapse
|
6
|
Perceptual learning evidence for tuning to spectrotemporal modulation in the human auditory system. J Neurosci 2012; 32:6542-9. [PMID: 22573676 DOI: 10.1523/jneurosci.5732-11.2012] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
Natural sounds are characterized by complex patterns of sound intensity distributed across both frequency (spectral modulation) and time (temporal modulation). Perception of these patterns has been proposed to depend on a bank of modulation filters, each tuned to a unique combination of a spectral and a temporal modulation frequency. There is considerable physiological evidence for such combined spectrotemporal tuning. However, direct behavioral evidence is lacking. Here we examined the processing of spectrotemporal modulation behaviorally using a perceptual-learning paradigm. We trained human listeners for ∼1 h/d for 7 d to discriminate the depth of spectral (0.5 cyc/oct; 0 Hz), temporal (0 cyc/oct; 32 Hz), or upward spectrotemporal (0.5 cyc/oct; 32 Hz) modulation. Each trained group learned more on their respective trained condition than did controls who received no training. Critically, this depth-discrimination learning did not generalize to the trained stimuli of the other groups or to downward spectrotemporal (0.5 cyc/oct; -32 Hz) modulation. Learning on discrimination also led to worsening on modulation detection, but only when the same spectrotemporal modulation was used for both tasks. Thus, these influences of training were specific to the trained combination of spectral and temporal modulation frequencies, even when the trained and untrained stimuli had one modulation frequency in common. This specificity indicates that training modified circuitry that had combined spectrotemporal tuning, and therefore that circuits with such tuning can influence perception. These results are consistent with the possibility that the auditory system analyzes sounds through filters tuned to combined spectrotemporal modulation.
Collapse
|
7
|
Marrone N, Mason CR, Kidd G. Tuning in the spatial dimension: evidence from a masked speech identification task. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2008; 124:1146-58. [PMID: 18681603 PMCID: PMC2809679 DOI: 10.1121/1.2945710] [Citation(s) in RCA: 61] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/28/2007] [Revised: 05/23/2008] [Accepted: 05/28/2008] [Indexed: 05/23/2023]
Abstract
Spatial release from masking was studied in a three-talker soundfield listening experiment. The target talker was presented at 0 degrees azimuth and the maskers were either colocated or symmetrically positioned around the target, with a different masker talker on each side. The symmetric placement greatly reduced any "better ear" listening advantage. When the maskers were separated from the target by +/-15 degrees , the average spatial release from masking was 8 dB. Wider separations increased the release to more than 12 dB. This large effect was eliminated when binaural cues and perceived spatial separation were degraded by covering one ear with an earplug and earmuff. Increasing reverberation in the room increased the target-to-masker ratio (TM) for the separated, but not colocated, conditions reducing the release from masking, although a significant advantage of spatial separation remained. Time reversing the masker speech improved performance in both the colocated and spatially separated cases but lowered TM the most for the colocated condition, also resulting in a reduction in the spatial release from masking. Overall, the spatial tuning observed appears to depend on the presence of interaural differences that improve the perceptual segregation of sources and facilitate the focus of attention at a point in space.
Collapse
Affiliation(s)
- Nicole Marrone
- Department of Speech, Language, and Hearing Sciences and the Hearing Research Center, Boston University, 635 Commonwealth Avenue, Boston, Massachusetts 02215, USA.
| | | | | |
Collapse
|
8
|
Crum PAC, Hafter ER. Predicting the path of a changing sound: velocity tracking and auditory continuity. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2008; 124:1116-1129. [PMID: 18681601 PMCID: PMC2809678 DOI: 10.1121/1.2945117] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/06/2007] [Revised: 05/06/2008] [Accepted: 05/16/2008] [Indexed: 05/26/2023]
Abstract
Three studies demonstrate listeners' ability to use the rate of a sound's frequency change (velocity) to predict how the spectral path of the sound is likely to evolve, even in the event of an occlusion. Experiments 1 and 2 use a modified probe-signal method to measure attentional filters and demonstrate increased detection to sounds falling along implied paths of constant-linear velocity. Experiment 3 shows listeners perceive a suprathreshold tone as falling along a trajectory of constant velocity when the frequency is near to the region of greatest detection as measured in Experiments 1 and 2. Further, results show greater accuracy and decreased bias in the use of velocity information with increased exposure to a constant-velocity sound. As the duration of occlusion lengthens, results also show a downward shift (relative to a trajectory of constant velocity) in the frequency at which listeners' detection and experience of a continuous trajectory are greatest. A preliminary model of velocity processing is proposed to account for this downward shift. Results show listeners' use of velocity in extrapolating sounds with dynamically changing spectral and temporal properties and provide evidence for its role in perceptual auditory continuity within a noisy acoustic environment.
Collapse
Affiliation(s)
- Poppy A C Crum
- University of California Berkeley, Berkeley, CA 94720, USA
| | | |
Collapse
|
9
|
Gutschalk A, Micheyl C, Oxenham AJ. The pulse-train auditory aftereffect and the perception of rapid amplitude modulations. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2008; 123:935-945. [PMID: 18247896 DOI: 10.1121/1.2828057] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/25/2023]
Abstract
Prolonged listening to a pulse train with repetition rates around 100 Hz induces a striking aftereffect, whereby subsequently presented sounds are heard with an unusually "metallic" timbre [Rosenblith et al., Science 106, 333-335 (1947)]. The mechanisms responsible for this auditory aftereffect are currently unknown. Whether the aftereffect is related to an alteration of the perception of temporal envelope fluctuations was evaluated. Detection thresholds for sinusoidal amplitude modulation (AM) imposed onto noise-burst carriers were measured for different AM frequencies (50-500 Hz), following the continuous presentation of a periodic pulse train, a temporally jittered pulse train, or an unmodulated noise. AM detection thresholds for AM frequencies of 100 Hz and above were significantly elevated compared to thresholds in quiet, following the presentation of the pulse-train inducers, and both induced a subjective auditory aftereffect. Unmodulated noise, which produced no audible aftereffect, left AM detection thresholds unchanged. Additional experiments revealed that, like the Rosenblith et al. aftereffect, the effect on AM thresholds does not transfer across ears, is not eliminated by protracted training, and can last several tens of seconds. The results suggest that the Rosenblith et al. aftereffect is related to a temporary alteration in the perception of fast temporal envelope fluctuations in sounds.
Collapse
Affiliation(s)
- Alexander Gutschalk
- Department of Neurology, University of Heidelberg, 69120 Heidelberg, Germany.
| | | | | |
Collapse
|
10
|
Fitzgerald MB, Wright BA. A perceptual learning investigation of the pitch elicited by amplitude-modulated noise. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2005; 118:3794-803. [PMID: 16419824 DOI: 10.1121/1.2074687] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/06/2023]
Abstract
Noise that is amplitude modulated at rates ranging from 40 to 850 Hz can elicit a sensation of pitch. Here, the processing of this temporally based pitch was investigated using a perceptual-learning paradigm. Nine listeners were trained (1 hour per day for 6-8 days) to discriminate a standard rate of sinusoidal amplitude modulation (SAM) from a faster rate in a single condition (150 Hz SAM rate, 5 kHz low-pass carrier). All trained listeners improved significantly on that condition. These trained listeners subsequently showed no more improvement than nine untrained controls on pure-tone and rippled-noise discrimination with the same pitch, and on SAM-rate discrimination with a 30 Hz rate, although they did show some improvement with a 300 Hz rate. In addition, most trained, but not control, listeners were worse at detecting SAM at 150 Hz after, compared to before training. These results indicate that listeners can learn to improve their ability to discriminate SAM rate with multiple-hour training and that the mechanism that is modified by learning encodes (1) the pitch of SAM noise but not that of pure tones and rippled noise, (2) different SAM rates separately, and (3) differences in SAM rate more effectively than cues for SAM detection.
Collapse
Affiliation(s)
- Matthew B Fitzgerald
- Department of Communication Sciences and Disorders and Northwestern University Institute for Neuroscience, Northwestern University, 2240 Campus Drive, Evanston, Illinois 60208-3550, USA.
| | | |
Collapse
|
11
|
Kidd G, Arbogast TL, Mason CR, Gallun FJ. The advantage of knowing where to listen. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2005; 118:3804-15. [PMID: 16419825 DOI: 10.1121/1.2109187] [Citation(s) in RCA: 173] [Impact Index Per Article: 9.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/06/2023]
Abstract
This study examined the role of focused attention along the spatial (azimuthal) dimension in a highly uncertain multitalker listening situation. The task of the listener was to identify key words from a target talker in the presence of two other talkers simultaneously uttering similar sentences. When the listener had no a priori knowledge about target location, or which of the three sentences was the target sentence, performance was relatively poor-near the value expected simply from choosing to focus attention on only one of the three locations. When the target sentence was cued before the trial, but location was uncertain, performance improved significantly relative to the uncued case. When spatial location information was provided before the trial, performance improved significantly for both cued and uncued conditions. If the location of the target was certain, proportion correct identification performance was higher than 0.9 independent of whether the target was cued beforehand. In contrast to studies in which known versus unknown spatial locations were compared for relatively simple stimuli and tasks, the results of the current experiments suggest that the focus of attention along the spatial dimension can play a very significant role in solving the "cocktail party" problem.
Collapse
Affiliation(s)
- Gerald Kidd
- Department of Speech, Language and Hearing Sciences and Hearing Research Center, Boston University, 635 Commonwealth Avenue, Boston, Massachusetts 02215, USA.
| | | | | | | |
Collapse
|
12
|
Wojtczak M, Viemeister NF. Forward masking of amplitude modulation: basic characteristics. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2005; 118:3198-210. [PMID: 16334900 DOI: 10.1121/1.2042970] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/05/2023]
Abstract
In this study we demonstrate an effect for amplitude modulation (AM) that is analogous to forward making of audio frequencies, i.e., the modulation threshold for detection of AM (signal) is raised by preceding AM (masker). In the study we focused on the basic characteristics of the forward-masking effect. Functions representing recovery from AM forward masking measured with a 150- ms 40- Hz masker AM and a 50- ms signal AM of the same rate imposed on the same broadband-noise carrier, showed an exponential decay of forward masking with increasing delay from masker offset. Thresholds remained elevated by more than 2 dB over an interval of at least 150 ms following the masker. Masked-threshold patterns, measured with a fixed signal rate (20, 40, and 80 Hz) and a variable masker rate, showed tuning of the AM forward-masking effect. The tuning was approximately constant across signal modulation rates used and consistent with the idea of modulation-rate selective channels. Combining two equally effective forward maskers of different frequencies did not lead to an increase in forward masking relative to that produced by either component alone. Overall, the results are consistent with modulation-rate selective neural channels that adapt and recover from the adaptation relatively quickly.
Collapse
Affiliation(s)
- Magdalena Wojtczak
- Department of Psychology, University of Minnesota, 75 East River Rd., Minneapolis, Minnesota 55455, USA
| | | |
Collapse
|
13
|
Hammond GR, Seth Y, Ison JR. Concurrent measurement of the detectability of tone bursts and their effect on the excitability of the human blink reflex using a probe-signal method. Hear Res 2005; 202:28-34. [PMID: 15811696 DOI: 10.1016/j.heares.2004.07.018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/09/2004] [Accepted: 07/23/2004] [Indexed: 11/20/2022]
Abstract
The probe-signal method has shown that auditory signals that are either presented more often in a series of trials or that are immediately preceded by cues of the same frequency on a single trial are detected more readily than signals of other frequencies. The frequency range in which detection is favored defines an attentional band, which is thought to result from an effective attenuation of deviant frequencies in the cochlea, possibly by activation of the olivocochlear bundle. In a 2IFC procedure in which the first observation interval was preceded by a 1300-Hz cue, subjects detected cued probe tones (at 1300 Hz) but not uncued probe tones (at 1000 Hz or 1600 Hz) at better than chance levels. Concurrent elicitation of a blink reflex by presentation of an air puff in the first observation interval on a random half of the trials showed that cued probes, but not uncued probes, inhibited the size of the blink reflex. These data show that uncued probes do not enter into the low-level sensory processing in the brainstem which is responsible for reflex modification. This finding is consistent with the view that stimuli whose frequency falls outside an attentional band are excluded at the auditory periphery.
Collapse
Affiliation(s)
- Geoffrey R Hammond
- School of Psychology, The University of Western Australia, Crawley, Australia.
| | | | | |
Collapse
|
14
|
Wright BA. Combined representations for frequency and duration in detection templates for expected signals. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2005; 117:1299-1304. [PMID: 15807018 DOI: 10.1121/1.1855771] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
When trying to detect a tonal signal in a continuous broadband noise, listeners attend selectively to both the frequency and the duration of the expected signal. However, it is not known whether they monitor separate or combined representations of these two attributes. To investigate this question, a probe-signal method was used to measure the detectability of signals of expected and unexpected durations at two expected frequencies. The four listeners expected only one of two signals to be presented at random: a brief tone at one frequency or a long tone at another frequency. For each signal frequency, the detectability of the signals of unexpected duration decreased to near chance as the difference between the expected and unexpected duration, at that frequency, increased. The frequency specificity of this duration tuning indicates that both the frequency and the duration of an expected stimulus are represented in a single template.
Collapse
Affiliation(s)
- Beverly A Wright
- Department of Communication Sciences and Disorders and Northwestern University Institute for Neuroscience, Northwestern University, Evanston, Illinois 60208-3550, USA.
| |
Collapse
|
15
|
Wright BA, Fitzgerald MB. The time course of attention in a simple auditory detection task. ACTA ACUST UNITED AC 2004; 66:508-16. [PMID: 15283074 DOI: 10.3758/bf03194897] [Citation(s) in RCA: 46] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
What is the time course of human attention in a simple auditory detection task? To investigate this question, we determined the detectability of a 20-msec, 1000-Hz tone presented at expected and unexpected times. Twelve listeners who expected the tone to occur at a specific time after a 300-msec narrowband noise rarely detected signals presented 150-375 msec before or 100-200 msec after that expected time. The shape of this temporal-attention window depended on the expected presentation time of the tone and the temporal markers available in the trials. Further, though expecting the signal to occur in silence, listeners often detected signals presented at unexpected times during the noise. Combined with previous data, these results further clarify the listening strategy humans use when trying to detect an expected sound: Humans seem to listen specifically for that sound, while ignoring the background in which it is presented, around the time when the sound is expected to occur.
Collapse
Affiliation(s)
- Beverly A Wright
- Department of Communication Sciences and Disorders, Northwestern University, Evanston, Illinois 60208-3550, USA.
| | | |
Collapse
|
16
|
Arbogast TL, Kidd G. Evidence for spatial tuning in informational masking using the probe-signal method. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2000; 108:1803-1810. [PMID: 11051506 DOI: 10.1121/1.1289366] [Citation(s) in RCA: 21] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Abstract
Auditory spatial attention is one mechanism that may contribute to the ability to identify one sound source in a multi-source environment. The role of auditory spatial attention in a multi-source environment was investigated using the probe-signal method. The experiment took place in a quiet room with seven speakers arranged in a semi-circle in front of the listener. The speakers were placed at 30-degree intervals at a distance of 5 ft from the listener. The signal was comprised of eight contiguous, 60-ms pure-tone bursts arranged in either a rising or falling frequency pattern. Masker components were also comprised of eight contiguous pure-tone bursts but with durations that varied randomly from 20 to 100 ms. The six maskers were played with the signal and were constructed in order to result in informational rather than energetic masking. The frequency of each masker component was chosen randomly on each burst from a narrow frequency band, independent from the signal frequency band. The task was 1I-2AFC fixed-level identification with response time measurement. The listener was instructed to focus attention on a specified speaker (expected location) for a block of trials. Accuracy and response time were compared across two conditions: (1) signal presented at the expected location and (2) signal presented at an unexpected location. Results indicate a significant increase in accuracy and faster response time when the signal was presented at the expected location as compared to an unexpected location. These results suggest that auditory spatial attention plays an important role in multi-source listening, especially when the listening environment is complex and uncertain.
Collapse
Affiliation(s)
- T L Arbogast
- Department of Communication Disorders and The Hearing Research Center, Boston University, Massachusetts 02215, USA.
| | | |
Collapse
|
17
|
Stellmack MA, Viemeister NF. Observer weighting of monaural level information in a pair of tone pulses. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2000; 107:3382-3393. [PMID: 10875383 DOI: 10.1121/1.429409] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Abstract
A correlational analysis was used to assess the relative weight given to the levels of two monaurally presented tone pulses for interpulse intervals (IPIs) ranging from 2-256 ms. In three different experimental conditions, listeners were instructed to discriminate the level of the first pulse, the level of the second pulse, or the difference between the levels of the two pulses. The level of the target pulse was chosen randomly and independently from trial to trial from a Gaussian distribution. The level of the nontarget pulse was either fixed at 75 dB SPL or varied in the same manner as the level of the target. In the tasks in which one pulse was to be ignored, listeners gave increasing weight to the nontarget component as IPI decreased. Listeners weighted the level information in the pulses appropriately only when the IPI approached 256 ms. When the listeners were instructed to compare the pulse levels to one another, two of three listeners weighted the levels optimally at all IPIs, while the third listener did so only at the longest IPI. For the two listeners who weighted the pulses optimally, a minimum in performance was achieved at IPIs around 16-32 ms. Intensity discrimination thresholds were also measured for one pulse in the presence of a second fixed pulse for IPIs of 2-256 ms. Thresholds were higher in all the two-pulse conditions relative to a one-pulse condition, and were dependent on the level of the nontarget pulse but not on IPI. The results indicate that level information is integrated to some extent over fairly long durations, but not in a manner that is consistent with simple temporal integration.
Collapse
Affiliation(s)
- M A Stellmack
- Department of Psychology, University of Minnesota, Minneapolis 55455, USA
| | | |
Collapse
|