1
|
Conroy C, Buss E, Kidd G. Cues to reduce modulation informational masking. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2023; 153:274. [PMID: 36732267 PMCID: PMC9848649 DOI: 10.1121/10.0016867] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/28/2022] [Revised: 12/23/2022] [Accepted: 12/28/2022] [Indexed: 06/18/2023]
Abstract
The detectability of target amplitude modulation (AM) can be reduced by masker AM in the same carrier-frequency region. It can be reduced even further, however, if the masker-AM rate is uncertain [Conroy and Kidd, J. Acoust. Soc. Am. 149, 3665-3673 (2021)]. This study examined the effectiveness of contextual cues in reducing this latter, uncertainty-related effect (modulation informational masking). Observers were tasked with detecting fixed-rate target sinusoidal amplitude modulation (SAM) in the presence of masker SAM applied simultaneously to the same broadband-noise carrier. A single-interval, two-alternative forced-choice detection procedure was used to measure sensitivity for the target SAM; masker-AM-rate uncertainty was created by randomly selecting the AM rate of the masker SAM on each trial. Relative to an uncued condition, a pretrial cue to the masker SAM significantly improved sensitivity for the target SAM; a cue to the target SAM, however, did not. The delay between the cue-interval offset and trial-interval onset did not affect the size of the masker-cue benefit, suggesting that adaptation of the masker SAM was not responsible. A simple model of within-AM-channel masking captured important trends in the psychophysical data, suggesting that reduced masker-AM-rate uncertainty may have played a relatively minor role in the masker-cue benefit.
Collapse
Affiliation(s)
- Christopher Conroy
- Department of Speech, Language and Hearing Sciences and Hearing Research Center, Boston University, Boston, Massachusetts 02215, USA
| | - Emily Buss
- Department of Otolaryngology/Head and Neck Surgery, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, USA
| | - Gerald Kidd
- Department of Speech, Language and Hearing Sciences and Hearing Research Center, Boston University, Boston, Massachusetts 02215, USA
| |
Collapse
|
2
|
Wright BA, Dai H. Discrimination thresholds for interaural-time differences and interaural-level differences in naïve listeners: Sex differences and learning. Hear Res 2022; 424:108599. [DOI: 10.1016/j.heares.2022.108599] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/25/2022] [Revised: 08/11/2022] [Accepted: 08/22/2022] [Indexed: 11/04/2022]
|
3
|
Conroy C, Kidd G. Informational masking in the modulation domain. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2021; 149:3665. [PMID: 34241144 PMCID: PMC8163511 DOI: 10.1121/10.0005038] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 05/15/2023]
Abstract
Uncertainty regarding the frequency spectrum of a masker can have an adverse effect on the ability to focus selective attention on a target frequency channel, yielding informational masking (IM). This study sought to determine if uncertainty regarding the modulation spectrum of a masker can have an analogous adverse effect on the ability to focus selective attention on a target modulation channel, yielding IM in the modulation domain, or "modulation IM." A single-interval, two-alternative forced-choice (yes-no) procedure was used. The task was to detect 32-Hz target sinusoidal amplitude modulation (SAM) imposed on a broadband-noise carrier in the presence of masker SAM imposed on the same carrier. Six maskers, spanning the range from 8 to 128 Hz in half-octave steps, were tested, excluding those that fell within a two-octave protected zone surrounding the target. Psychometric functions (d'-vs-target modulation depth) were measured for each masker under two conditions: a fixed (low-uncertainty/low-IM) condition, in which the masker was the same on all trials within a block, and a random (high-uncertainty/high-IM) condition, in which it varied randomly from presentation-to-presentation. Thresholds and slopes extracted from the psychometric functions differed markedly between the conditions. These results are consistent with the idea that IM occurs in the modulation domain.
Collapse
Affiliation(s)
- Christopher Conroy
- Department of Speech, Language & Hearing Sciences and Hearing Research Center, Boston University, 635 Commonwealth Avenue, Boston, Massachusetts 02215, USA
| | - Gerald Kidd
- Department of Speech, Language & Hearing Sciences and Hearing Research Center, Boston University, 635 Commonwealth Avenue, Boston, Massachusetts 02215, USA
| |
Collapse
|
4
|
Conroy C, Mason CR, Kidd G. Informational masking of negative masking. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2020; 147:798. [PMID: 32113297 PMCID: PMC7004829 DOI: 10.1121/10.0000652] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/25/2019] [Revised: 01/10/2020] [Accepted: 01/10/2020] [Indexed: 06/10/2023]
Abstract
Negative masking (NM) is a ubiquitous finding in near-"threshold" psychophysics in which the detectability of a near-threshold signal improves when added to a copy of itself, i.e., a pedestal or masker. One interpretation of NM suggests that the pedestal acts as an informative cue, thereby reducing uncertainty and improving performance relative to detection in its absence. The purpose of this study was to test this hypothesis. Intensity discrimination thresholds were measured for 100-ms, 1000-Hz near-threshold tones. In the reference condition, thresholds were measured in quiet (no masker other than the pedestal). In comparison conditions, thresholds were measured in the presence of one of two additional maskers: a notched-noise masker or a random-frequency multitone masker. The additional maskers were intended to cause different amounts of uncertainty and, in turn, to differentially influence NM. The results were generally consistent with an uncertainty-based interpretation of NM: NM was found both in quiet and in notched-noise, yet it was eliminated by the multitone masker. A competing interpretation of NM based on nonlinear transduction does not account for all of the results. Profile analysis may have been a factor in performance and this suggests that NM may be attributable to, or influenced by, multiple mechanisms.
Collapse
Affiliation(s)
- Christopher Conroy
- Department of Speech, Language & Hearing Sciences and Hearing Research Center, Boston University, 635 Commonwealth Avenue, Boston, Massachusetts 02215, USA
| | - Christine R Mason
- Department of Speech, Language & Hearing Sciences and Hearing Research Center, Boston University, 635 Commonwealth Avenue, Boston, Massachusetts 02215, USA
| | - Gerald Kidd
- Department of Speech, Language & Hearing Sciences and Hearing Research Center, Boston University, 635 Commonwealth Avenue, Boston, Massachusetts 02215, USA
| |
Collapse
|
5
|
Gray L, Miller BS, Evans SW. Training children with ADHD to minimize impulsivity in auditory contralateral masking. Int J Pediatr Otorhinolaryngol 2012; 76:483-7. [PMID: 22297209 DOI: 10.1016/j.ijporl.2012.01.003] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/27/2011] [Revised: 12/31/2011] [Accepted: 01/02/2012] [Indexed: 10/14/2022]
Abstract
OBJECTIVE Impulsivity and distractibility are among the important symptoms of attention deficit hyperactivity disorder (ADHD). In this study, impulsivity is operationally measured using false-alarm rates in an auditory, contralateral-masking task. Intensive auditory training was attempted to decrease false alarm rates. METHODS In contralateral masking there is a distracting noise in one ear on every trial and a threshold-level tone in the other ear on half of those trials. Participants indicated whether the tone was present or not and received immediate feedback. The intensity of the masked tone was adaptively varied to track threshold. False alarms are the error of commission, saying that a stimulus is present when it is not. Seven school-aged children with ADHD (ages 10-16) and four adults without ADHD were trained on this task for 900 trials per day over four consecutive days. RESULTS False alarms from the children with ADHD decreased over the four days of training, beginning at the high level and ending at the low level expected from previous studies. There was no generalization to a different masking task. Results from the four adults were unexpected: soon after the training began they behaved no differently than the children with ADHD. CONCLUSION Children with ADHD can be trained to become less impulsive in an auditory detection task.
Collapse
Affiliation(s)
- Lincoln Gray
- Department of Communication Sciences and Disorders, James Madison University, Harrisonburg, VA 22801, USA.
| | | | | |
Collapse
|
6
|
Leibold LJ. Development of Auditory Scene Analysis and Auditory Attention. HUMAN AUDITORY DEVELOPMENT 2012. [DOI: 10.1007/978-1-4614-1421-6_5] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/03/2022]
|
7
|
Leibold LJ, Hitchens JJ, Buss E, Neff DL. Excitation-based and informational masking of a tonal signal in a four-tone masker. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2010; 127:2441-50. [PMID: 20370027 PMCID: PMC2865701 DOI: 10.1121/1.3298588] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/22/2009] [Revised: 12/31/2009] [Accepted: 01/05/2010] [Indexed: 05/29/2023]
Abstract
This study examined contributions of peripheral excitation and informational masking to the variability in masking effectiveness observed across samples of multi-tonal maskers. Detection thresholds were measured for a 1000-Hz signal presented simultaneously with each of 25, four-tone masker samples. Using a two-interval, forced-choice adaptive task, thresholds were measured with each sample fixed throughout trial blocks for ten listeners. Average thresholds differed by as much as 26 dB across samples. An excitation-based model of partial loudness [Moore, B. C. J. et al. (1997). J. Audio Eng. Soc. 45, 224-237] was used to predict thresholds. These predictions accounted for a significant portion of variance in the data of several listeners, but no relation between the model and data was observed for many listeners. Moreover, substantial individual differences, on the order of 41 dB, were observed for some maskers. The largest individual differences were found for maskers predicted to produce minimal excitation-based masking. In subsequent conditions, one of five maskers was randomly presented in each interval. The difference in performance for samples with low versus high predicted thresholds was reduced in random compared to fixed conditions. These findings are consistent with a trading relation whereby informational masking is largest for conditions in which excitation-based masking is smallest.
Collapse
Affiliation(s)
- Lori J Leibold
- Department of Allied Health Sciences, The University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA.
| | | | | | | |
Collapse
|
8
|
Balakrishnan U, Freyman RL. Speech detection in spatial and nonspatial speech maskers. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2008; 123:2680-91. [PMID: 18529187 PMCID: PMC2811546 DOI: 10.1121/1.2902176] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/08/2023]
Abstract
The effect of perceived spatial differences on masking release was examined using a 4AFC speech detection paradigm. Targets were 20 words produced by a female talker. Maskers were recordings of continuous streams of nonsense sentences spoken by two female talkers and mixed into each of two channels (two talker, and the same masker time reversed). Two masker spatial conditions were employed: "RF" with a 4 ms time lead to the loudspeaker 60 degrees horizontally to the right, and "FR" with the time lead to the front (0 degrees ) loudspeaker. The reference nonspatial "F" masker was presented from the front loudspeaker only. Target presentation was always from the front loudspeaker. In Experiment 1, target detection threshold for both natural and time-reversed spatial maskers was 17-20 dB lower than that for the nonspatial masker, suggesting that significant release from informational masking occurs with spatial speech maskers regardless of masker understandability. In Experiment 2, the effectiveness of the FR and RF maskers was evaluated as the right loudspeaker output was attenuated until the two-source maskers were indistinguishable from the F masker, as measured independently in a discrimination task. Results indicated that spatial release from masking can be observed with barely noticeable target-masker spatial differences.
Collapse
Affiliation(s)
- Uma Balakrishnan
- Department of Communication Disorders, University of Massachusetts, 358 N. Pleasant Street, Amherst, Massachusetts 01003, USA.
| | | |
Collapse
|
9
|
|
10
|
Leibold LJ, Neff DL. Effects of masker-spectral variability and masker fringes in children and adults. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2007; 121:3666-76. [PMID: 17552718 DOI: 10.1121/1.2723664] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/15/2023]
Abstract
This study examined the degree to which masker-spectral variability contributes to children's susceptibility to informational masking. Listeners were younger children (5-7 years), older children (8-10 years), and adults (19-34 years). Masked thresholds were measured using a 2IFC, adaptive procedure for a 300-ms, 1000-Hz signal presented simultaneously with (1) broadband noise, (2) a random-frequency ten-tone complex, or (3) a fixed-frequency ten-tone complex. Maskers were presented at an overall level of 60 dB SPL. Thresholds were similar across age for the noise condition. Thresholds for most children were higher than for most adults, however, for both ten-tone conditions. The average difference in threshold between random and fixed ten-tone conditions was comparable across age, suggesting a similar effect of reducing masker-spectral variability in children and adults. Children appear more likely to be susceptible to informational masking than adults, however, both with and in the absence of masker-spectral variability. The addition of a masker fringe (delayed onset of signal relative to masker) provided a release from masking for fixed and random ten-tone conditions in all age groups, suggesting at least part of the masking observed for both ten-tone maskers was informational.
Collapse
Affiliation(s)
- Lori J Leibold
- Boys Town National Research Hospital, Omaha, Nebraska 68131, USA.
| | | |
Collapse
|
11
|
Best V, Ozmeral EJ, Shinn-Cunningham BG. Visually-guided attention enhances target identification in a complex auditory scene. J Assoc Res Otolaryngol 2007; 8:294-304. [PMID: 17453308 PMCID: PMC2538357 DOI: 10.1007/s10162-007-0073-z] [Citation(s) in RCA: 67] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2006] [Accepted: 01/17/2007] [Indexed: 10/23/2022] Open
Abstract
In auditory scenes containing many similar sound sources, sorting of acoustic information into streams becomes difficult, which can lead to disruptions in the identification of behaviorally relevant targets. This study investigated the benefit of providing simple visual cues for when and/or where a target would occur in a complex acoustic mixture. Importantly, the visual cues provided no information about the target content. In separate experiments, human subjects either identified learned birdsongs in the presence of a chorus of unlearned songs or recalled strings of spoken digits in the presence of speech maskers. A visual cue indicating which loudspeaker (from an array of five) would contain the target improved accuracy for both kinds of stimuli. A cue indicating which time segment (out of a possible five) would contain the target also improved accuracy, but much more for birdsong than for speech. These results suggest that in real world situations, information about where a target of interest is located can enhance its identification, while information about when to listen can also be helpful when targets are unfamiliar or extremely similar to their competitors.
Collapse
Affiliation(s)
- Virginia Best
- Hearing Research Center, Boston University, Boston, MA 02215, USA.
| | | | | |
Collapse
|
12
|
Freyman RL, Helfer KS, Balakrishnan U. Variability and uncertainty in masking by competing speech. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2007; 121:1040-6. [PMID: 17348526 DOI: 10.1121/1.2427117] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/08/2023]
Abstract
This study investigated the role of uncertainty in masking of speech by interfering speech. Target stimuli were nonsense sentences recorded by a female talker. Masking sentences were recorded from ten female talkers and combined into pairs. Listeners' recognition performance was measured with both target and masker presented from a front loudspeaker (nonspatial condition) or with a masker presented from two loudspeakers, with the right leading the front by 4 ms (spatial condition). In Experiment 1, the sentences were presented in blocks in which the masking talkers, spatial configuration, and signal-to-noise (S-N) ratio were fixed. Listeners' recognition performance varied widely among the masking talkers in the nonspatial condition, much less so in the spatial condition. This result was attributed to variation in effectiveness of informational masking in the nonspatial condition. The second experiment increased uncertainty by randomizing masking talkers and S-N ratios across trials in some conditions, and reduced uncertainty by presenting the same token of masker across trials in other conditions. These variations in masker uncertainty had relatively small effects on speech recognition.
Collapse
Affiliation(s)
- Richard L Freyman
- Department of Communication Disorders, University of Massachusetts, Amherst, Massachusetts 01003, USA.
| | | | | |
Collapse
|
13
|
Wightman FL, Kistler DJ. Informational masking of speech in children: effects of ipsilateral and contralateral distracters. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2005; 118:3164-76. [PMID: 16334898 PMCID: PMC2819474 DOI: 10.1121/1.2082567] [Citation(s) in RCA: 130] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/05/2023]
Abstract
Using a closed-set speech recognition paradigm thought to be heavily influenced by informational masking, auditory selective attention was measured in 38 children (ages 4-16 years) and 8 adults (ages 20-30 years). The task required attention to a monaural target speech message that was presented with a time-synchronized distracter message in the same ear. In some conditions a second distracter message or a speech-shaped noise was presented to the other ear. Compared to adults, children required higher target/distracter ratios to reach comparable performance levels, reflecting more informational masking in these listeners. Informational masking in most conditions was confirmed by the fact that a large proportion of the errors made by the listeners were contained in the distracter message(s). There was a monotonic age effect, such that even the children in the oldest age group (13.6-16 years) demonstrated poorer performance than adults. For both children and adults, presentation of an additional distracter in the contralateral ear significantly reduced performance, even when the distracter messages were produced by a talker of different sex than the target talker. The results are consistent with earlier reports from pure-tone masking studies that informational masking effects are much larger in children than in adults.
Collapse
Affiliation(s)
- Frederic L Wightman
- Heuser Hearing Institute, Department of Psychological and Brain Sciences, University of Louisville, Louisville, Kentucky 40292, USA.
| | | |
Collapse
|
14
|
Durlach NI, Mason CR, Gallun FJ, Shinn-Cunningham B, Colburn HS, Kidd G. Informational masking for simultaneous nonspeech stimuli: psychometric functions for fixed and randomly mixed maskers. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2005; 118:2482-97. [PMID: 16266169 DOI: 10.1121/1.2032748] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/05/2023]
Abstract
Sensitivity d' and response bias beta were measured as a function of target level for the detection of a 1000-Hz tone in multitone maskers using a one interval, two-alternative forced-choice (1I-2AFC) paradigm. Ten such maskers, each with eight randomly selected components in the region 200-5000 Hz, with 800-1250 Hz excluded to form a protected zone, were presented under two conditions: the fixed condition, in which the same eight-component masker is used throughout an experimental run, and the random condition, in which an eight-component masker is chosen randomly trial-to-trial from the given set of ten such maskers. Differences between the results obtained with these two conditions help characterize the listener's susceptibility to informational masking (IM). The d' results show great intersubject variability, but can be reasonably well fit by simple energy-detector models in which internal noise and filter bandwidth are used as fitting parameters. In contrast, the beta results are not well fit by these models. In addition to presentation of new data and its relation to energy-detector models, this paper provides comments on a variety of issues, problems, and research needs in the IM area.
Collapse
Affiliation(s)
- Nathaniel I Durlach
- Hearing Research Center Boston University, 635 Commonwealth Avenue, Boston, Massachusetts 02215, USA.
| | | | | | | | | | | |
Collapse
|
15
|
Kidd G, Mason CR, Gallun FJ. Combining energetic and informational masking for speech identification. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2005; 118:982-92. [PMID: 16158654 DOI: 10.1121/1.1953167] [Citation(s) in RCA: 53] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
This study examined combinations of energetic and informational maskers in speech identification. Speech targets and maskers (speech or noise) were processed and filtered into sets of 15 narrow frequency bands. The target was the sum of eight randomly selected bands. More masking occurred for speech maskers than for spectrally matched noise maskers regardless of whether the masker bands overlapped the target bands. The greater effect of the speech maskers was interpreted as due to informational masking. When the masker was comprised of nonoverlapping bands of speech, the addition of bands of noise overlapping the speech masker, but not the speech target, reduced the overall amount of masking. Surprisingly, presenting the noise to the ear contralateral to the target and masker produced an even greater release from masking. The contralateral noise was apparently sufficient to cause a slight change in the image of the ipsilateral speech masker, possibly pulling it away from the target enough to allow the focus of attention on the target. This finding is consistent with the interpretation that in some conditions small binaural differences may be sufficient to cause, or significantly strengthen, the perceptual segregation of sounds.
Collapse
Affiliation(s)
- Gerald Kidd
- Department of Speech, Language and Hearing Sciences and Hearing Research Center, Sargent College, Boston University, 635 Commonwealth Avenue, Boston, Massachusetts 02215, USA.
| | | | | |
Collapse
|
16
|
Lutfi RA, Alexander JM. Effects of informational maskers within and across trials. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2005; 118:322-4. [PMID: 16119352 DOI: 10.1121/1.1923348] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
The across-trial effect of maskers in conditions of informational masking was evaluated from performance on occasional trials in which the signal was presented alone. For 6 of 12 listeners participating in the study, a significant number of errors were obtained on signal-alone trials; in some cases equivalent to that signal+ masker trials. On immediately preceding trial blocks for which there were no intervening maskers, performance for these signals was perfect. The results indicate that informational maskers can have a significant effect on signal threshold, both within and across trials.
Collapse
Affiliation(s)
- Robert A Lutfi
- Department of Communicative Disorders, and Waisman Center, University of Wisconsin, Madison, Wisconsin 53705, USA.
| | | |
Collapse
|
17
|
Dye RH, Stellmack MA, Jurcin NF. Observer weighting strategies in interaural time-difference discrimination and monaural level discrimination for a multi-tone complex. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2005; 117:3079-90. [PMID: 15957776 DOI: 10.1121/1.1861832] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/03/2023]
Abstract
Two experiments measured listeners' abilities to weight information from different components in a complex of 553, 753, and 953 Hz. The goal was to determine whether or not the ability to adjust perceptual weights generalized across tasks. Weights were measured by binary logistic regression between stimulus values that were sampled from Gaussian distributions and listeners' responses. The first task was interaural time discrimination in which listeners judged the laterality of the target component. The second task was monaural level discrimination in which listeners indicated whether the level of the target component decreased or increased across two intervals. For both experiments, each of the three components served as the target. Ten listeners participated in both experiments. The results showed that those individuals who adjusted perceptual weights in the interaural time experiment could also do so in the monaural level discrimination task. The fact that the same individuals appeared to be analytic in both tasks is an indication that the weights measure the ability to attend to a particular region of the spectrum while ignoring other spectral regions.
Collapse
Affiliation(s)
- Raymond H Dye
- Parmly Hearing Institute, Loyola University of Chicago, Chicago, Illinois 60626, USA
| | | | | |
Collapse
|
18
|
Joliveau E, Smith J, Wolfe J. Vocal tract resonances in singing: the soprano voice. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2004; 116:2234-47. [PMID: 15532674 DOI: 10.1121/1.1784437] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/14/2023]
Abstract
The vocal tract resonances of trained soprano singers were measured while they sang a range of vowels softly at different pitches. The measurements were made by broad band acoustic excitation at the mouth, which allowed the resonances of the tract to be measured simultaneously with and independently from the harmonics of the voice. At low pitch, when the lowest resonance frequency R1 exceeded f0, the values of the first two resonances R1 and R2 varied little with frequency and had values consistent with normal speech. At higher pitches, however, when fo exceeded the value of R1 observed at low pitch, R1 increased with f0 so that R1 was approximately equal to f0. R2 also increased over this high pitch range, probably as an incidental consequence of the tuning of R1. R3 increased slightly but systematically, across the whole pitch range measured. There was no evidence that any resonances are tuned close to harmonics of the pitch frequency except for R1 at high pitch. The variations in R1 and R2 at high pitch mean that vowels move, converge, and overlap their positions on the vocal plane (R2,R1) to an extent that implies loss of intelligibility.
Collapse
Affiliation(s)
- Elodie Joliveau
- School of Physics, University of New South Wales, Sydney NSW 2052, Australia
| | | | | |
Collapse
|
19
|
Wright BA, Fitzgerald MB. The time course of attention in a simple auditory detection task. ACTA ACUST UNITED AC 2004; 66:508-16. [PMID: 15283074 DOI: 10.3758/bf03194897] [Citation(s) in RCA: 46] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
What is the time course of human attention in a simple auditory detection task? To investigate this question, we determined the detectability of a 20-msec, 1000-Hz tone presented at expected and unexpected times. Twelve listeners who expected the tone to occur at a specific time after a 300-msec narrowband noise rarely detected signals presented 150-375 msec before or 100-200 msec after that expected time. The shape of this temporal-attention window depended on the expected presentation time of the tone and the temporal markers available in the trials. Further, though expecting the signal to occur in silence, listeners often detected signals presented at unexpected times during the noise. Combined with previous data, these results further clarify the listening strategy humans use when trying to detect an expected sound: Humans seem to listen specifically for that sound, while ignoring the background in which it is presented, around the time when the sound is expected to occur.
Collapse
Affiliation(s)
- Beverly A Wright
- Department of Communication Sciences and Disorders, Northwestern University, Evanston, Illinois 60208-3550, USA.
| | | |
Collapse
|
20
|
Brungart DS, Simpson BD. Within-ear and across-ear interference in a dichotic cocktail party listening task: effects of masker uncertainty. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2004; 115:301-310. [PMID: 14759023 DOI: 10.1121/1.1628683] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
Increases in masker variability have been shown to increase the effects of informational masking in non-speech listening tasks, but relatively little is known about the influence that masker uncertainty has on the informational components of speech-on-speech masking. In this experiment, listeners were asked to extract information from a target phrase that was presented in their right ear while ignoring masking phrases that were presented in the same ear as the target phrase and in the ear opposite the target phrase. The level of masker uncertainty was varied by holding constant or "freezing" the talkers speaking the masking phrases, the semantic content used in the masking phrases, or both the talkers and the semantic content in the masking phrases within each block of 120 trials. The results showed that freezing the semantic content of the masking phrase in the target ear was the only reduction in masker uncertainty that ever resulted in a significant improvement in performance. Providing feedback after each trial improved performance overall, but did not prevent the listeners from making incorrect responses that matched the content of the frozen target-ear masking phrase. However, removing the target-ear contents corresponding to the masking phrase from the response set resulted in a dramatic improvement in performance. This suggests that the listeners were generally able to understand both of the phrases presented to the target ear, and that their incorrect responses in the task were almost entirely a result of their inability to determine which words were spoken by the target talker.
Collapse
Affiliation(s)
- Douglas S Brungart
- Air Force Research Laboratory, 2610 Seventh Street, Wright-Patterson AFB, Ohio 45433-7901, USA.
| | | |
Collapse
|
21
|
Richards VM, Neff DL. Cuing effects for informational masking. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2004; 115:289-300. [PMID: 14759022 DOI: 10.1121/1.1631942] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
The detection of a tone added to a random-frequency, multitone masker can be very poor even when the maskers have little energy in the frequency region of the signal. This paper examines the effects of adding a pretrial cue to reduce uncertainty for the masker or the signal. The first two experiments examined the effect of cuing a fixed-frequency signal as the number of masker components and presentation methods were manipulated. Cue effectiveness varied across observers, but could reduce thresholds by as much as 20 dB. Procedural comparisons indicated observers benefited more from having two masker samples to compare, with or without a signal cue, than having a single interval with one masker sample and a signal cue. The third experiment used random-frequency signals and compared no-cue, signal-cue, and masker-cue conditions, and also systematically varied the time interval between cue offset and trial onset. Thresholds with a cued random-frequency signal remained higher than for a cued fixed-frequency signal. For time intervals between the cue and trial of 50 ms or longer, thresholds were approximately the same with a signal or a masker cue and lower than when there was no cue. Without a cue or with a masker cue, analyses of possible decision strategies suggested observers attended to the potential signal frequencies, particularly the highest signal frequency. With a signal cue, observers appeared to attend to the frequency of the subsequent signal.
Collapse
Affiliation(s)
- Virginia M Richards
- Department of Psychology, 3815 Walnut Street, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA.
| | | |
Collapse
|
22
|
Lutfi RA, Kistler DJ, Callahan MR, Wightman FL. Psychometric functions for informational masking. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2003; 114:3273-82. [PMID: 14714808 PMCID: PMC2858973 DOI: 10.1121/1.1629303] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/20/2023]
Abstract
The term informational masking has traditionally been used to refer to elevations in signal threshold resulting from masker uncertainty. In the present study, the method of constant stimuli was used to obtain complete psychometric functions (PFs) from 44 normal-hearing listeners in conditions known to produce varying amounts of informational masking. The listener's task was to detect a pure-tone signal in the presence of a broadband noise masker (low masker uncertainty) and in the presence of multitone maskers with frequencies and amplitudes that varied at random from one presentation to the next (high masker uncertainty). Relative to the broadband noise condition, significant reductions were observed in both the slope and the upper asymptote of the PF for multitone maskers producing large amounts of informational masking. Slope was affected more for some listeners and conditions while asymptote was affected more for others; consequently, neither parameter alone was highly predictive of individual thresholds or the amount of informational masking. Mean slopes and asymptotes varied nonmonotonically with the number of masker components in a manner similar to mean thresholds, particularly when the estimated effect of energetic masking on thresholds was subtracted out. As in past studies, the threshold data were well described by a model in which trial-by-trial judgments are based on a weighted sum of levels in dB at the output of independent auditory filters. The psychometric data, however, complicated the model's interpretation in two ways: First, they suggested that, depending on the listener and condition, the weights can either reflect a fixed influence of masker components on each trial or the effect of occasionally mistaking a masker component for the signal from trial to trial. Second, they indicated that in either case the variance of the underlying decision variable as estimated from PF slope is not by itself great enough to account for the observed changes in informational masking.
Collapse
Affiliation(s)
- Robert A Lutfi
- Waisman Center, University of Wisconsin, Madison, Wisconsin 53705, USA
| | | | | | | |
Collapse
|
23
|
Kidd G, Mason CR, Richards VM. Multiple bursts, multiple looks, and stream coherence in the release from informational masking. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2003; 114:2835-2845. [PMID: 14650018 DOI: 10.1121/1.1621864] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
In the simultaneous multitone masking paradigm introduced by Neff and Green [Percept. Psychophys. 41, 409-415 (1987)] the masker typically is a small number of tones having frequencies and levels that are randomly drawn on every presentation. Large amounts of masking for a pure-tone signal often occur that are thought to reflect central, rather than peripheral, limitations on processing. Previous work from this laboratory has indicated that playing a rapid succession of randomly drawn multitone maskers in each observation interval dramatically reduces the amount of masking that is observed relative to a single burst (SB). In this multiple-bursts-different (MBD) procedure, the signal tone is the only constant frequency component during the sequence of bursts and tends to perceptually segregate from the masker. In this study, the number of masker bursts and the interburst interval (IBI) were varied. The goals were to determine how the release from masking relative to the SB condition depends on the number of bursts and to examine whether increasing the IBI would cause each burst to be processed independently. If the latter were true, it might disrupt the perception of signal stream coherence, thereby diminishing the MBD advantage. However, multiple independent looks could also lead to an improvement in performance. For those subjects showing large amounts of informational masking in the SB condition, substantial reduction in masked thresholds occurred as the number of masker bursts increased, while masking increased as IBI lengthened. The results were not consistent with a simple version of a multiple-look model in which the information from each burst was combined optimally, but instead appear to be attributable to mechanisms involved in the perceptual organization of sounds.
Collapse
Affiliation(s)
- Gerald Kidd
- Hearing Research Center and Programs in Communication Disorders, Boston University, 635 Commonwealth Avenue, Boston, Massachusetts 02215, USA.
| | | | | |
Collapse
|
24
|
Kidd G, Mason CR, Brughera A, Chiu CYP. Discriminating harmonicity. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2003; 114:967-977. [PMID: 12942976 DOI: 10.1121/1.1587734] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
Simultaneous tones that are harmonically related tend to be grouped perceptually to form a unitary auditory image. A partial that is mistuned stands out from the other tones, and harmonic complexes with different fundamental frequencies can readily be perceived as separate auditory objects. These phenomena are evidence for the strong role of harmonicity in perceptual grouping and segregation of sounds. This study measured the discriminability of harmonicity directly. In a two interval, two alternative forced-choice (2I2AFC) paradigm, the listener chose which of two sounds, signal or foil, was composed of tones that more closely matched an exact harmonic relationship. In one experiment, the signal was varied from perfectly harmonic to highly inharmonic by adding frequency perturbation to each component. The foil always had 100% perturbation. Group mean performance decreased from greater than 90% correct for 0% signal perturbation to near chance for 80% signal perturbation. In the second experiment, adding a masker presented simultaneously with the signals and foils disrupted harmonicity. Both monaural and dichotic conditions were tested. Signal level was varied relative to masker level to obtain psychometric functions from which slopes and midpoints were estimated. Dichotic presentation of these audible stimuli improved performance by 3-10 dB, due primarily to a release from "informational masking" by the perceptual segregation of the signal from the masker.
Collapse
Affiliation(s)
- Gerald Kidd
- Hearing Research Center Sargent College, Boston University, 635 Commonwealth Avenue, Boston, Massachusetts 02215, USA.
| | | | | | | |
Collapse
|
25
|
Durlach NI, Mason CR, Shinn-Cunningham BG, Arbogast TL, Colburn HS, Kidd G. Informational masking: counteracting the effects of stimulus uncertainty by decreasing target-masker similarity. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2003; 114:368-379. [PMID: 12880048 DOI: 10.1121/1.1577562] [Citation(s) in RCA: 128] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
Previous work has indicated that target-masker similarity, as well as stimulus uncertainty, influences the amount of informational masking that occurs in detection, discrimination, and recognition tasks. In each of five experiments reported in this paper, the detection threshold for a tonal target in random multitone maskers presented simultaneously with the target tone was measured for two conditions using the same set of five listeners. In one condition, the target was constructed to be "similar" (S) to the masker; in the other condition, it was constructed to be "dissimilar" (D) to the masker. The specific masker varied across experiments, but was constant for the two conditions. Target-masker similarity varied in dimensions such as duration, perceived location, direction of frequency glide, and spectro-temporal coherence. Group-mean results show large decreases in the amount of masking for the D condition relative to the S condition. In addition, individual differences (a hallmark of informational masking) are found to be much greater in the S condition than in the D condition. Furthermore, listener vulnerability to informational masking is found to be consistent to at least a moderate degree across experiments.
Collapse
Affiliation(s)
- Nathaniel I Durlach
- Hearing Research Center, Boston University, Boston, Massachusetts 02215, USA.
| | | | | | | | | | | |
Collapse
|
26
|
Tang Z, Richards VM. Examination of a linear model in an informational masking study. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2003; 114:361-367. [PMID: 12880047 DOI: 10.1121/1.1579006] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
When multitone maskers are used in a two-interval, forced choice experiment, the amount of masking is larger when the masker is randomly chosen on each presentation interval compared to on each trial (the same masker in the two listening intervals). These conditions are referred to as having within- versus between-trial randomization. If it is assumed that an observer's ultimate detection decision depends on a single decision variable (DV), it is probable that the DV's variance will be substantially larger in the within-trial randomization condition compared to the between-trial randomization condition. The goal of the current experiment is to evaluate the degree to which this stimulus-based change in DV variance can account for the difference in thresholds in the within-versus between-trial randomization conditions. Thresholds are measured for the detection of a tone added to a six-component masker in between- and within-trial randomization conditions. The slopes of the psychometric functions provide an estimate of the variance in the DV for the between- and within-trial randomization conditions. Additionally, a channel model is fitted to the psychophysical results in the within-trial randomization condition. The resulting model is then used to predict the value of the DV for each trial, and ultimately to estimate the proportion of the total variance in the within-trial randomization condition that is attributable to changes in maskers across intervals. The variance of the DV in the between-trial randomization condition accounted for approximately 65% of the total variance in the DV in the within-trial randomization condition. Stimulus-based interval-by-interval masker randomization accounted for approximately 20% of the total variance of the within-trial randomization DV. The remaining 15% of the DV variance in the within-trial randomization condition remained unaccounted for. This result is fairly stable whether the maskers are drawn from a small versus large pool of potential maskers.
Collapse
Affiliation(s)
- Zhongzhou Tang
- Department of Psychology, University of Pennsylvania, 3815 Walnut Street, Philadelphia, Pennsylvania 19104, USA
| | | |
Collapse
|
27
|
Durlach NI, Mason CR, Kidd G, Arbogast TL, Colburn HS, Shinn-Cunningham BG. Note on informational masking. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2003; 113:2984-7. [PMID: 12822768 DOI: 10.1121/1.1570435] [Citation(s) in RCA: 246] [Impact Index Per Article: 11.7] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/20/2023]
|
28
|
Kidd G, Mason CR, Arbogast TL, Brungart DS, Simpson BD. Informational masking caused by contralateral stimulation. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2003; 113:1594-1603. [PMID: 12656394 DOI: 10.1121/1.1547440] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
Although informational masking is thought to reflect central mechanisms, the effects are generally much stronger when the target and masker are presented to the same ear than when they are presented to different ears. However, the results of a recent study by Brungart and Simpson [J. Acoust. Soc. Am. 112, 2985-2995 (2002)] indicated that a speech masker that is presented contralateral to a speech signal can produce substantial amounts of informational masking when a second speech masker is played simultaneously in the same ear as the signal. In this study, we conducted a series of experiments that paralleled those of Brungart and Simpson but used a pure-tone signal and multitone informational maskers in a detection task. Both the signal and the maskers were played as sequences of short bursts in each observation interval. The maskers were arranged in two types of spectrotemporal patterns. One type of pattern, called "multiple-bursts same" (MBS), has previously been shown to produce very large amounts of informational masking while the other type of pattern, called "multiple-bursts different" (MBD), has been shown to produce very small amounts of informational masking. Several conditions of ipsilateral, contralateral, and combined presentation of these maskers were tested. The results showed that presentation of the MBS masker in the contralateral ear produced a substantial amount of informational masking when the MBD masker was simultaneously presented to the ipsilateral ear. The results supported the earlier findings of Brungart and Simpson indicating that listeners are unable to selectively focus their attention on a single ear in some complex dichotic listening conditions. These results suggest that this contralateral masking effect is not restricted to speech and may reflect more general limitations on processing capacity. Further, it was concluded that the magnitude of the contralateral masking effect was related both to the informational masking value of the contralateral masker and the complexity of the stimulus and/or task in the ear in which the signal was presented.
Collapse
Affiliation(s)
- Gerald Kidd
- Hearing Research Center, Sargent College, Boston University, 635 Commonwealth Avenue, Boston, Massachusetts 02132, USA.
| | | | | | | | | |
Collapse
|
29
|
Richards VM, Tang Z, Kidd GD. Informational masking with small set sizes. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2002; 111:1359-1366. [PMID: 11931313 DOI: 10.1121/1.1445790] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Abstract
Informational masking refers to interference in the detectability of a sound, or discrimination of some property of a sound, beyond that which can be attributed to interactions at the auditory periphery. In the current experiments the signal to be detected was a tone added to a 6-tone masker, and informational masking was introduced by randomly choosing the frequencies of the tones that comprise the masker. The primary question was whether small numbers of maskers could replace randomly drawn maskers without sacrificing the underlying detection schemes adopted by observers. Similar to the method used by Wright and Saberi [J. Acoust. Soc. Am. 105, 1765-1775 (1999)], detection thresholds were measured for different masker set sizes, where set size refers to the number of 6-tone maskers from which any one masker was drawn. Set sizes of 3, 6, 12, and 24 were tested as well as conditions in which the maskers were chosen at random. In addition, observers' memory for maskers was coarsely evaluated. Large differences in thresholds were found across observers and across different masker sets. Even for set sizes of 24, the memory test suggests some recognition of maskers for some observers. Post hoc analysis of the data included an evaluation of the relative contribution of different frequencies using a single linear model. As a base for comparison, a linear model fitted to each condition was also evaluated. Although the data were fitted better using many rather than one linear model, the reduction in quality of fit was modest. This result suggests substantial consistency in decision strategies regardless of masker set size.
Collapse
Affiliation(s)
- Virginia M Richards
- Department of Psychology, University of Pennsylvania, Philadelphia 19104, USA
| | | | | |
Collapse
|