1
|
Veyrié A, Noreña A, Sarrazin JC, Pezard L. Investigating the influence of masker and target properties on the dynamics of perceptual awareness under informational masking. PLoS One 2023; 18:e0282885. [PMID: 36928693 PMCID: PMC10019711 DOI: 10.1371/journal.pone.0282885] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2022] [Accepted: 02/27/2023] [Indexed: 03/18/2023] Open
Abstract
Informational masking has been investigated using the detection of an auditory target embedded in a random multi-tone masker. The build-up of the target percept is influenced by the masker and target properties. Most studies dealing with discrimination performance neglect the dynamics of perceptual awareness. This study aims at investigating the dynamics of perceptual awareness using multi-level survival models in an informational masking paradigm by manipulating masker uncertainty, masker-target similarity and target repetition rate. Consistent with previous studies, it shows that high target repetition rates, low masker-target similarity and low masker uncertainty facilitate target detection. In the context of evidence accumulation models, these results can be interpreted by changes in the accumulation parameters. The probabilistic description of perceptual awareness provides a benchmark for the choice of target and masker parameters in order to examine the underlying cognitive and neural dynamics of perceptual awareness.
Collapse
Affiliation(s)
- Alexandre Veyrié
- Aix-Marseille Université, LNC, CNRS UMR 7291, Marseille, France
- ONERA, The French Aerospace Lab, Salon de Provence, France
| | - Arnaud Noreña
- Aix-Marseille Université, LNC, CNRS UMR 7291, Marseille, France
| | | | - Laurent Pezard
- Aix-Marseille Université, LNC, CNRS UMR 7291, Marseille, France
- * E-mail:
| |
Collapse
|
2
|
Richards VM, Tisby MK, Suzuki-Gill EN, Shen Y. Sub-optimal construction of an auditory profile from temporally distributed spectral information. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2021; 149:1567. [PMID: 33765831 PMCID: PMC7943247 DOI: 10.1121/10.0003646] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/13/2020] [Revised: 02/12/2021] [Accepted: 02/16/2021] [Indexed: 06/12/2023]
Abstract
When spectral components of a complex sound are presented not simultaneously but distributed over time, human listeners can still, to a degree, perceptually recover the spectral profile of the sound. This capability of integrating spectral information over time was investigated using a cued informational masking paradigm. Listeners detected a 1-kHz pure tone in a simultaneous masker composed of six random-frequency tones drawn on every trial. The spectral profile of the masker was cued using a precursor sound that consisted of a sequence of 50-ms bursts, separated by inter-burst intervals of 100 ms. Each burst in the precursor consisted of pure tones at the masker frequencies with tones appearing at each of the masker frequencies at different presentation probabilities. As the presentation probability increased in different conditions, the detectability of the target improved, indicating reliable precursor cuing regarding the spectral content of the masker. For many listeners, performance did not significantly improve as the number of precursor bursts increased from 2 to 16, indicating inefficient integration of information beyond 2 bursts. Additional analyses suggest that when intensity of the bursts is relatively constant, the contribution of the precursor is dominated by information in the initial burst.
Collapse
Affiliation(s)
- Virginia M Richards
- Department of Cognitive Sciences, University of California, Irvine, California 92687, USA
| | - Mariel Kazuko Tisby
- Department of Cognitive Sciences, University of California, Irvine, California 92687, USA
| | - Eli N Suzuki-Gill
- Department of Cognitive Sciences, University of California, Irvine, California 92687, USA
| | - Yi Shen
- Department of Speech and Hearing Sciences, University of Washington, Seattle, Washington 98105, USA
| |
Collapse
|
3
|
Roverud E, Dubno JR, Kidd G. Hearing-Impaired Listeners Show Reduced Attention to High-Frequency Information in the Presence of Low-Frequency Information. Trends Hear 2020; 24:2331216520945516. [PMID: 32853117 PMCID: PMC7557677 DOI: 10.1177/2331216520945516] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2019] [Revised: 06/30/2020] [Accepted: 07/07/2020] [Indexed: 11/29/2022] Open
Abstract
Many listeners with sensorineural hearing loss have uneven hearing sensitivity across frequencies. This study addressed whether this uneven hearing loss leads to a biasing of attention to different frequency regions. Normal-hearing (NH) and hearing-impaired (HI) listeners performed a pattern discrimination task at two distant center frequencies (CFs): 750 and 3500 Hz. The patterns were sequences of pure tones in which each successive tonal element was randomly selected from one of two possible frequencies surrounding a CF. The stimuli were presented at equal sensation levels to ensure equal audibility. In addition, the frequency separation of the tonal elements within a pattern was adjusted for each listener so that equal pattern discrimination performance was obtained for each CF in quiet. After these adjustments, the pattern discrimination task was performed under conditions in which independent patterns were presented at both CFs simultaneously. The listeners were instructed to attend to the low or high CF before the stimulus (assessing selective attention to frequency with instruction) or after the stimulus (divided attention, assessing inherent frequency biases). NH listeners demonstrated approximately equal performance decrements (re: quiet) between the two CFs. HI listeners demonstrated much larger performance decrements at the 3500 Hz CF than at the 750 Hz CF in combined-presentation conditions for both selective and divided attention conditions, indicating a low-frequency attentional bias that is apparently not under subject control. Surprisingly, the magnitude of this frequency bias was not related to the degree of asymmetry in thresholds at the two CFs.
Collapse
Affiliation(s)
- Elin Roverud
- Department of Speech, Language & Hearing Sciences, Boston University
| | - Judy R. Dubno
- Department of Otolaryngology-Head and Neck Surgery, Medical University of South Carolina
| | - Gerald Kidd
- Department of Speech, Language & Hearing Sciences, Boston University
| |
Collapse
|
4
|
Guan J, Liu C. Speech Perception in Noise With Formant Enhancement for Older Listeners. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2019; 62:3290-3301. [PMID: 31479380 DOI: 10.1044/2019_jslhr-s-18-0089] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Purpose Degraded speech intelligibility in background noise is a common complaint of listeners with hearing loss. The purpose of the current study is to explore whether 2nd formant (F2) enhancement improves speech perception in noise for older listeners with hearing impairment (HI) and normal hearing (NH). Method Target words (e.g., color and digit) were selected and presented based on the paradigm of the coordinate response measure corpus. Speech recognition thresholds with original and F2-enhanced speech in 2- and 6-talker babble were examined for older listeners with NH and HI. Results The thresholds for both the NH and HI groups improved for enhanced speech signals primarily in 2-talker babble, but not in 6-talker babble. The F2 enhancement benefits did not correlate significantly with listeners' age and their average hearing thresholds in most listening conditions. However, speech intelligibility index values increased significantly with F2 enhancement in babble for listeners with HI, but not for NH listeners. Conclusions Speech sounds with F2 enhancement may improve listeners' speech perception in 2-talker babble, possibly due to a greater amount of speech information available in temporally modulated noise or a better capacity to separate speech signals from background babble.
Collapse
Affiliation(s)
- Jingjing Guan
- Department of Communication Sciences and Disorders, University of Texas at Austin
| | - Chang Liu
- Department of Communication Sciences and Disorders, University of Texas at Austin
| |
Collapse
|
5
|
McFadden D, Pasanen EG, Maloney MM, Leshikar EM, Pho MH. Differences in common psychoacoustical tasks by sex, menstrual cycle, and race. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2018; 143:2338. [PMID: 29716303 PMCID: PMC5915329 DOI: 10.1121/1.5030998] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/16/2017] [Revised: 03/13/2018] [Accepted: 03/24/2018] [Indexed: 06/08/2023]
Abstract
The psychoacoustical literature contains multiple reports about small differences in performance depending upon the sex and phase of the menstrual cycle of the subjects. In an attempt to verify these past reports, a large-scale study was implemented. After extensive training, the performance of about 75 listeners was measured on seven common psychoacoustical tasks. For most tasks, the signal was a 3.0-kHz tone. The initial data analyses failed to confirm some past outcomes. Additional analyses, incorporating the limited information available about the racial background of the listeners, did confirm some of the past reports, with the direction and magnitude of the differences often diverging for the White and Non-White listeners. Sex differences and race differences interacted for six of the seven tasks studied. These interactions suggest that racial background needs to be considered when making generalizations about human auditory performance, and when considering failures of reproducibility across studies. Menstrual differences were small, but generally larger for Whites than Non-Whites. Hormonal effects may be responsible for the sex and cycle differences that do exist, and differences in intra-cochlear melanocytes may account for the race differences.
Collapse
Affiliation(s)
- Dennis McFadden
- Department of Psychology and Center for Perceptual Systems, University of Texas, 108 East Dean Keeton, A8000, Austin, Texas 78712-1043, USA
| | - Edward G Pasanen
- Department of Psychology and Center for Perceptual Systems, University of Texas, 108 East Dean Keeton, A8000, Austin, Texas 78712-1043, USA
| | - Mindy M Maloney
- Department of Psychology and Center for Perceptual Systems, University of Texas, 108 East Dean Keeton, A8000, Austin, Texas 78712-1043, USA
| | - Erin M Leshikar
- Department of Communication Sciences and Disorders, University of Texas, 2504-A Whitis Avenue, A1100, Austin, Texas 78712-0114, USA
| | - Michelle H Pho
- Department of Communication Sciences and Disorders, University of Texas, 2504-A Whitis Avenue, A1100, Austin, Texas 78712-0114, USA
| |
Collapse
|
6
|
Oberfeld D, Hots J, Verhey JL. Temporal weights in the perception of sound intensity: Effects of sound duration and number of temporal segments. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2018; 143:943. [PMID: 29495718 DOI: 10.1121/1.5023686] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
Loudness is a fundamental aspect of auditory perception that is closely related to the physical level of the sound. However, it has been demonstrated that, in contrast to a sound level meter, human listeners do not weight all temporal segments of a sound equally. Instead, the beginning of a sound is more important for loudness estimation than later temporal portions. The present study investigates the mechanism underlying this primacy effect by varying the number of equal-duration temporal segments (5 and 20) and the total duration of the sound (1.0 to 10.0 s) in a factorial design. Pronounced primacy effects were observed for all 20-segment sounds. The temporal weights for the five-segment sounds are similar to those for the 20-segment sounds when the weights of the segments covering the same temporal range as a segment of the five-segment sounds are averaged. The primacy effect can be described by an exponential decay function with a time constant of about 200 ms. Thus, the temporal weight assigned to a specific temporal portion of a sound is determined by the time delay between sound onset and segment onset rather than by the number of segments or the total duration of the sound.
Collapse
Affiliation(s)
- Daniel Oberfeld
- Institute of Psychology, Section Experimental Psychology, Johannes Gutenberg-Universität Mainz, Wallstrasse 3, 55122 Mainz, Germany
| | - Jan Hots
- Department of Experimental Audiology, Otto von Guericke University Magdeburg, Leipziger Street 44, 39120 Magdeburg, Germany
| | - Jesko L Verhey
- Department of Experimental Audiology, Otto von Guericke University Magdeburg, Leipziger Street 44, 39120 Magdeburg, Germany
| |
Collapse
|
7
|
Individual Differences in Behavioural Decision Weights Related to Irregularities in Cochlear Mechanics. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2016. [PMID: 27080687 DOI: 10.1007/978-3-319-25474-6_48] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register]
Abstract
An unexpected finding of previous psychophysical studies is that listeners show highly replicable, individualistic patterns of decision weights on frequencies affecting their performance in spectral discrimination tasks--what has been referred to as individual listening styles. We, like many other researchers, have attributed these listening styles to peculiarities in how listeners attend to sounds, but we now believe they partially reflect irregularities in cochlear micromechanics modifying what listeners hear. The most striking evidence for cochlear irregularities is the presence of low-level spontaneous otoacoustic emissions (SOAEs) measured in the ear canal and the systematic variation in stimulus frequency otoacoustic emissions (SFOAEs), both of which result from back-propagation of waves in the cochlea. SOAEs and SFOAEs vary greatly across individual ears and have been shown to affect behavioural thresholds, behavioural frequency selectivity and judged loudness for tones. The present paper reports pilot data providing evidence that SOAEs and SFOAEs are also predictive of the relative decision weight listeners give to a pair of tones in a level discrimination task. In one condition the frequency of one tone was selected to be near that of an SOAE and the frequency of the other was selected to be in a frequency region for which there was no detectable SOAE. In a second condition the frequency of one tone was selected to correspond to an SFOAE maximum, the frequency of the other tone, an SFOAE minimum. In both conditions a statistically significant correlation was found between the average relative decision weight on the two tones and the difference in OAE levels.
Collapse
|
8
|
Roverud E, Best V, Mason CR, Swaminathan J, Kidd G. Informational Masking in Normal-Hearing and Hearing-Impaired Listeners Measured in a Nonspeech Pattern Identification Task. Trends Hear 2016; 20:2331216516638516. [PMID: 27059627 PMCID: PMC4871212 DOI: 10.1177/2331216516638516] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2015] [Revised: 01/26/2016] [Accepted: 02/16/2016] [Indexed: 11/16/2022] Open
Abstract
Individuals with sensorineural hearing loss (SNHL) often experience more difficulty with listening in multisource environments than do normal-hearing (NH) listeners. While the peripheral effects of sensorineural hearing loss certainly contribute to this difficulty, differences in central processing of auditory information may also contribute. To explore this issue, it is important to account for peripheral differences between NH and these hearing-impaired (HI) listeners so that central effects in multisource listening can be examined. In the present study, NH and HI listeners performed a tonal pattern identification task at two distant center frequencies (CFs), 850 and 3500 Hz. In an attempt to control for differences in the peripheral representations of the stimuli, the patterns were presented at the same sensation level (15 dB SL), and the frequency deviation of the tones comprising the patterns was adjusted to obtain equal quiet pattern identification performance across all listeners at both CFs. Tonal sequences were then presented at both CFs simultaneously (informational masking conditions), and listeners were asked either to selectively attend to a source (CF) or to divide attention between CFs and identify the pattern at a CF designated after each trial. There were large differences between groups in the frequency deviations necessary to perform the pattern identification task. After compensating for these differences, there were small differences between NH and HI listeners in the informational masking conditions. HI listeners showed slightly greater performance asymmetry between the low and high CFs than did NH listeners, possibly due to central differences in frequency weighting between groups.
Collapse
|
9
|
Kilman L, Zekveld A, Hällgren M, Rönnberg J. Native and Non-native Speech Perception by Hearing-Impaired Listeners in Noise- and Speech Maskers. Trends Hear 2015; 19:19/0/2331216515579127. [PMID: 25910504 PMCID: PMC4409938 DOI: 10.1177/2331216515579127] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
This study evaluated how hearing-impaired listeners perceive native (Swedish) and nonnative (English) speech in the presence of noise- and speech maskers. Speech reception thresholds were measured for four different masker types for each target language. The maskers consisted of stationary and fluctuating noise and two-talker babble in Swedish and English. Twenty-three hearing-impaired native Swedish listeners participated, aged between 28 and 65 years. The participants also performed cognitive tests of working memory capacity in Swedish and English, nonverbal reasoning, and an English proficiency test. Results indicated that the speech maskers were more interfering than the noise maskers in both target languages. The larger need for phonetic and semantic cues in a nonnative language makes a stationary masker relatively more challenging than a fluctuating-noise masker. Better hearing acuity (pure tone average) was associated with better perception of the target speech in Swedish, and better English proficiency was associated with better speech perception in English. Larger working memory and better pure tone averages were related to the better perception of speech masked with fluctuating noise in the nonnative language. This suggests that both are relevant in highly taxing conditions. A large variance in performance between the listeners was observed, especially for speech perception in the nonnative language.
Collapse
Affiliation(s)
- Lisa Kilman
- Department of Behavioral Sciences and Learning, Linköping University, Linköping, Sweden Linnaeus Centre HEAD, The Swedish Institute for Disability Research, Linköping University, Linköping, Sweden
| | - Adriana Zekveld
- Department of Behavioral Sciences and Learning, Linköping University, Linköping, Sweden Linnaeus Centre HEAD, The Swedish Institute for Disability Research, Linköping University, Linköping, Sweden Audiology/ENT & EMGO+ Institute for Health and Care Research, VU University Medical Center, Amsterdam, the Netherlands
| | - Mathias Hällgren
- Linnaeus Centre HEAD, The Swedish Institute for Disability Research, Linköping University, Linköping, Sweden Department of Otorhinolaryngology/Section of Audiology, Linköping University Hospital, Linköping, Sweden
| | - Jerker Rönnberg
- Department of Behavioral Sciences and Learning, Linköping University, Linköping, Sweden Linnaeus Centre HEAD, The Swedish Institute for Disability Research, Linköping University, Linköping, Sweden
| |
Collapse
|
10
|
Jones PR, Moore DR, Amitay S. Development of auditory selective attention: why children struggle to hear in noisy environments. Dev Psychol 2015; 51:353-69. [PMID: 25706591 PMCID: PMC4337492 DOI: 10.1037/a0038570] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2014] [Revised: 11/11/2014] [Accepted: 11/17/2014] [Indexed: 11/29/2022]
Abstract
Children's hearing deteriorates markedly in the presence of unpredictable noise. To explore why, 187 school-age children (4-11 years) and 15 adults performed a tone-in-noise detection task, in which the masking noise varied randomly between every presentation. Selective attention was evaluated by measuring the degree to which listeners were influenced by (i.e., gave weight to) each spectral region of the stimulus. Psychometric fits were also used to estimate levels of internal noise and bias. Levels of masking were found to decrease with age, becoming adult-like by 9-11 years. This change was explained by improvements in selective attention alone, with older listeners better able to ignore noise similar in frequency to the target. Consistent with this, age-related differences in masking were abolished when the noise was made more distant in frequency to the target. This work offers novel evidence that improvements in selective attention are critical for the normal development of auditory judgments.
Collapse
|
11
|
Advantages of binaural amplification to acceptable noise level of directional hearing aid users. Clin Exp Otorhinolaryngol 2014; 7:94-101. [PMID: 24917904 PMCID: PMC4050094 DOI: 10.3342/ceo.2014.7.2.94] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2013] [Revised: 03/25/2013] [Accepted: 03/30/2013] [Indexed: 12/02/2022] Open
Abstract
Objectives The goal of the present study was to examine whether Acceptable Noise Levels (ANLs) would be lower (greater acceptance of noise) in binaural listening than in monaural listening condition and also whether meaningfulness of background speech noise would affect ANLs for directional microphone hearing aid users. In addition, any relationships between the individual binaural benefits on ANLs and the individuals' demographic information were investigated. Methods Fourteen hearing aid users (mean age, 64 years) participated for experimental testing. For the ANL calculation, listeners' most comfortable listening levels and background noise level were measured. Using Korean ANL material, ANLs of all participants were evaluated under monaural and binaural amplification with a counterbalanced order. The ANLs were also compared across five types of competing speech noises, consisting of 1- through 8-talker background speech maskers. Seven young normal-hearing listeners (mean age, 27 years) participated for the same measurements as a pilot testing. Results The results demonstrated that directional hearing aid users accepted more noise (lower ANLs) with binaural amplification than with monaural amplification, regardless of the type of competing speech. When the background speech noise became more meaningful, hearing-impaired listeners accepted less amount of noise (higher ANLs), revealing that ANL is dependent on the intelligibility of the competing speech. The individuals' binaural advantages in ANLs were significantly greater for the listeners with longer experience of hearing aids, yet not related to their age or hearing thresholds. Conclusion Binaural directional microphone processing allowed hearing aid users to accept a greater amount of background noise, which may in turn improve listeners' hearing aid success. Informational masking substantially influenced background noise acceptance. Given a significant association between ANLs and duration of hearing aid usage, ANL measurement can be useful for clinical counseling of binaural hearing aid candidates or unsuccessful users.
Collapse
|
12
|
Jones PR, Moore DR, Shub DE, Amitay S. Learning to detect a tone in unpredictable noise. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2014; 135:EL128-EL133. [PMID: 24606305 DOI: 10.1121/1.4865267] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]
Abstract
Eight normal-hearing listeners practiced a tone-detection task in which a 1-kHz target was masked by a spectrally unpredictable multitone complex. Consistent learning was observed, with mean masking decreasing by 6.4 dB over five sessions (4500 trials). Reverse-correlation was used to estimate how listeners weighted each spectral region. Weight-vectors approximated the ideal more closely after practice, indicating that listeners were learning to attend selectively to the task relevant information. Once changes in weights were accounted for, no changes in internal noise (psychometric slope) were observed. It is concluded that this task elicits robust learning, which can be understood primarily as improved selective attention.
Collapse
Affiliation(s)
- Pete R Jones
- MRC Institute of Hearing Research, University Park, Nottingham NG7 2RD, United Kingdom ,
| | - David R Moore
- MRC Institute of Hearing Research, University Park, Nottingham NG7 2RD, United Kingdom ,
| | - Daniel E Shub
- School of Psychology, University of Nottingham, University Park, Nottingham NG7 2RD, United Kingdom
| | - Sygal Amitay
- MRC Institute of Hearing Research, University Park, Nottingham NG7 2RD, United Kingdom
| |
Collapse
|
13
|
An objective measure of auditory stream segregation based on molecular psychophysics. Atten Percept Psychophys 2014; 76:829-51. [DOI: 10.3758/s13414-013-0613-z] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
14
|
Oberfeld D, Kuta M, Jesteadt W. Factors limiting performance in a multitone intensity-discrimination task: disentangling non-optimal decision weights and increased internal noise. PLoS One 2013; 8:e79830. [PMID: 24278190 PMCID: PMC3835917 DOI: 10.1371/journal.pone.0079830] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2013] [Accepted: 10/04/2013] [Indexed: 12/18/2022] Open
Abstract
To identify factors limiting performance in multitone intensity discrimination, we presented sequences of five pure tones alternating in level between loud (85 dB SPL) and soft (30, 55, or 80 dB SPL). In the “overall-intensity task”, listeners detected a level increment on all of the five tones. In the “masking task”, the level increment was imposed only on the soft tones, rendering the soft tones targets and loud tones task-irrelevant maskers. Decision weights quantifying the importance of the five tone levels for the decision were estimated using methods of molecular psychophysics. Compatible with previous studies, listeners placed higher weights on the loud tones than on the soft tones in the overall-intensity condition. In the masking task, the decisions were systematically influenced by the to-be-ignored loud tones (maskers). Using a maximum-likelihood technique, we estimated the internal noise variance and tested whether the internal noise was higher in the alternating-level five-tone sequences than in sequences presenting only the soft or only the loud tones. For the overall-intensity task, we found no evidence for increased internal noise, but listeners applied suboptimal decision weights. These results are compatible with the hypothesis that the presence of the loud tones does not impair the precision of the representation of the intensity of the soft tones available at the decision stage, but that this information is not used in an optimal fashion due to a difficulty in attending to the soft tones. For the masking task, in some cases our data indicated an increase in internal noise. Additionally, listeners applied suboptimal decision weights. The maximum-likelihood analyses we developed should also be useful for other tasks or other sensory modalities.
Collapse
Affiliation(s)
- Daniel Oberfeld
- Department of Psychology, Johannes Gutenberg-Universität Mainz, Mainz, Germany
- * E-mail:
| | - Martha Kuta
- Department of Psychology, Johannes Gutenberg-Universität Mainz, Mainz, Germany
| | - Walt Jesteadt
- Psychoacoustics Laboratory, Boys Town National Research Hospital, Omaha, Nebraska, United States of America
| |
Collapse
|
15
|
Lutfi RA, Gilbertson L, Heo I, Chang AC, Stamas J. The information-divergence hypothesis of informational masking. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2013; 134:2160-70. [PMID: 23967946 PMCID: PMC3765281 DOI: 10.1121/1.4817875] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/08/2013] [Revised: 07/15/2013] [Accepted: 07/19/2013] [Indexed: 06/01/2023]
Abstract
In recent years there has been growing interest in masking that cannot be attributed to interactions in the cochlea-so--called informational masking (IM). Similarity in the acoustic properties of target and masker and uncertainty regarding the masker are the two major factors identified with IM. These factors involve quite different manipulations of signals and are believed to entail fundamentally different processes resulting in IM. Here, however, evidence is presented that these factors affect IM through their mutual influence on a single factor-the information divergence of target and masker given by Simpson-Fitter's da [Lutfi et al. (2012). J. Acoust. Soc. Am. 132, EL109-113]. Four experiments are described involving multitone pattern discrimination, multi-talker word recognition, sound-source identification, and sound localization. In each case standard manipulations of masker uncertainty and target-masker similarity (including the covariation of target-masker frequencies) are found to have the same effect on performance provided they produce the same change in da. The function relating d(') performance to da, moreover, appears to be linear with constant slope across listeners. The overriding dependence of IM on da is taken to reflect a general principle of perception that exploits differences in the statistical structure of signals to separate figure from ground.
Collapse
Affiliation(s)
- Robert A Lutfi
- Auditory Behavioral Research Lab, Department of Communication Sciences and Disorders, University of Wisconsin, Madison, Wisconsin 53706, USA.
| | | | | | | | | |
Collapse
|
16
|
Jones PR, Moore DR, Amitay S, Shub DE. Reduction of internal noise in auditory perceptual learning. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2013; 133:970-981. [PMID: 23363114 DOI: 10.1121/1.4773864] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/01/2023]
Abstract
This paper examines what mechanisms underlie auditory perceptual learning. Fifteen normal hearing adults performed two-alternative, forced choice, pure tone frequency discrimination for four sessions. External variability was introduced by adding a zero-mean Gaussian random variable to the frequency of each tone. Measures of internal noise, encoding efficiency, bias, and inattentiveness were derived using four methods (model fit, classification boundary, psychometric function, and double-pass consistency). The four methods gave convergent estimates of internal noise, which was found to decrease from 4.52 to 2.93 Hz with practice. No group-mean changes in encoding efficiency, bias, or inattentiveness were observed. It is concluded that learned improvements in frequency discrimination primarily reflect a reduction in internal noise. Data from highly experienced listeners and neural networks performing the same task are also reported. These results also indicated that auditory learning represents internal noise reduction, potentially through the re-weighting of frequency-specific channels.
Collapse
Affiliation(s)
- Pete R Jones
- MRC Insitute of Hearing Research, University Park, Nottingham NG7 2RD, United Kingdom.
| | | | | | | |
Collapse
|
17
|
Oberfeld D, Heeren W, Rennies J, Verhey J. Spectro-temporal weighting of loudness. PLoS One 2012; 7:e50184. [PMID: 23209670 PMCID: PMC3509144 DOI: 10.1371/journal.pone.0050184] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2012] [Accepted: 10/18/2012] [Indexed: 11/18/2022] Open
Abstract
Real-world sounds like speech or traffic noise typically exhibit spectro-temporal variability because the energy in different spectral regions evolves differently as a sound unfolds in time. However, it is currently not well understood how the energy in different spectral and temporal portions contributes to loudness. This study investigated how listeners weight different temporal and spectral components of a sound when judging its overall loudness. Spectral weights were measured for the combination of three loudness-matched narrowband noises with different center frequencies. To measure temporal weights, 1,020-ms stimuli were presented, which randomly changed in level every 100 ms. Temporal weights were measured for each narrowband noise separately, and for a broadband noise containing the combination of the three noise bands. Finally, spectro-temporal weights were measured with stimuli where the level of the three narrowband noises randomly and independently changed every 100 ms. The data consistently showed that (i) the first 300 ms of the sounds had a greater influence on overall loudness perception than later temporal portions (primacy effect), and (ii) the lowest noise band contributed significantly more to overall loudness than the higher bands. The temporal weights did not differ between the three frequency bands. Notably, the spectral weights and temporal weights estimated from the conditions with only spectral or only temporal variability were very similar to the corresponding weights estimated in the spectro-temporal condition. The results indicate that the temporal and the spectral weighting of the loudness of a time-varying sound are independent processes. The spectral weights remain constant across time, and the temporal weights do not change across frequency. The results are discussed in the context of current loudness models.
Collapse
Affiliation(s)
- Daniel Oberfeld
- Department of Psychology, Section Experimental Psychology, Johannes Gutenberg-Universität Mainz, Mainz, Germany.
| | | | | | | |
Collapse
|
18
|
Schönfelder VH, Wichmann FA. Sparse regularized regression identifies behaviorally-relevant stimulus features from psychophysical data. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2012; 131:3953-3969. [PMID: 22559369 DOI: 10.1121/1.3701832] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
As a prerequisite to quantitative psychophysical models of sensory processing it is necessary to learn to what extent decisions in behavioral tasks depend on specific stimulus features, the perceptual cues. Based on relative linear combination weights, this study demonstrates how stimulus-response data can be analyzed in this regard relying on an L(1)-regularized multiple logistic regression, a modern statistical procedure developed in machine learning. This method prevents complex models from over-fitting to noisy data. In addition, it enforces "sparse" solutions, a computational approximation to the postulate that a good model should contain the minimal set of predictors necessary to explain the data. In simulations, behavioral data from a classical auditory tone-in-noise detection task were generated. The proposed method is shown to precisely identify observer cues from a large set of covarying, interdependent stimulus features--a setting where standard correlational and regression methods fail. The proposed method succeeds for a wide range of signal-to-noise ratios and for deterministic as well as probabilistic observers. Furthermore, the detailed decision rules of the simulated observers were reconstructed from the estimated linear model weights allowing predictions of responses on the basis of individual stimuli.
Collapse
Affiliation(s)
- Vinzenz H Schönfelder
- Department for Modeling of Cognitive Processes, Technical University Berlin, FR 6-4, Franklinstr 28/29, 10587 Berlin, Germany.
| | | |
Collapse
|
19
|
Leibold LJ. Development of Auditory Scene Analysis and Auditory Attention. HUMAN AUDITORY DEVELOPMENT 2012. [DOI: 10.1007/978-1-4614-1421-6_5] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/03/2022]
|
20
|
Abstract
How well can observers detect the presence of a change in luminance distributions? Performance was measured in three experiments. Observers viewed pairs of grayscale images on a calibrated CRT display. Each image was a checkerboard. All luminances in one image of each pair consisted of random draws from a single probability distribution. For the other image, some patch luminances consisted of random draws from that same distribution, while the rest of the patch luminances (test patches) consisted of random draws from a second distribution. The observers' task was to pick the image with luminances drawn from two distributions. The parameters of the second distribution that led to 75% correct performance were determined across manipulations of (1) the number of test patches, (2) the observers' certainty about test patch location, and (3) the geometric structure of the images. Performance improved with number of test patches and location certainty. The geometric manipulations did not affect performance. An ideal observer model with high efficiency fit the data well and a classification image analysis showed a similar use of information by the ideal and human observers, indicating that observers can make effective use of photometric information in our distribution discrimination task.
Collapse
Affiliation(s)
- Thomas Y Lee
- Department of Psychology, University of Pennsylvania, USA.
| | | |
Collapse
|
21
|
Abstract
In four experiments, we studied the influence of the level profile of time-varying sounds on temporal perceptual weights for loudness. The sounds consisted of contiguous wideband noise segments on which independent random-level perturbations were imposed. Experiment 1 showed that in sounds with a flat level profile, the first segment receives the highest weight (primacy effect). If, however, a gradual increase in level (fade-in) was imposed on the first few segments, the temporal weights showed a delayed primacy effect: The first unattenuated segment received the highest weight, while the fade-in segments were virtually ignored. This pattern argues against a capture of attention to the onset as the origin of the primacy effect. Experiment 2 demonstrated that listeners adjust their temporal weights to the level profile on a trial-by-trial basis. Experiment 3 ruled out potentially inferior intensity resolution at lower levels as the cause of the delayed primacy effect. Experiment 4 showed that the weighting patterns cannot be explained by perceptual segmentation of the sounds into a variable and a stable part. The results are interpreted in terms of memory and attention processes. We demonstrate that the prediction of loudness can be improved significantly by allowing for nonuniform temporal weights.
Collapse
|
22
|
Dai H, Micheyl C. Psychophysical reverse correlation with multiple response alternatives. J Exp Psychol Hum Percept Perform 2010; 36:976-93. [PMID: 20695712 PMCID: PMC3158580 DOI: 10.1037/a0017171] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
Abstract
Psychophysical reverse-correlation methods such as the "classification image" technique provide a unique tool to uncover the internal representations and decision strategies of individual participants in perceptual tasks. Over the past 30 years, these techniques have gained increasing popularity among both visual and auditory psychophysicists. However, thus far, principled applications of the psychophysical reverse-correlation approach have been almost exclusively limited to two-alternative decision (detection or discrimination) tasks. Whether and how reverse-correlation methods can be applied to uncover perceptual templates and decision strategies in situations involving more than just two response alternatives remain largely unclear. Here, the authors consider the problem of estimating perceptual templates and decision strategies in stimulus identification tasks with multiple response alternatives. They describe a modified correlational approach, which can be used to solve this problem. The approach is evaluated under a variety of simulated conditions, including different ratios of internal-to-external noise, different degrees of correlations between the sensory observations, and various statistical distributions of stimulus perturbations. The results indicate that the proposed approach is reasonably robust, suggesting that it could be used in future empirical studies.
Collapse
Affiliation(s)
- Huanping Dai
- Department of Speech, Language, and Hearing Sciences, University of Arizona, Tucson, AZ 85721, USA.
| | | |
Collapse
|
23
|
Leibold LJ, Hitchens JJ, Buss E, Neff DL. Excitation-based and informational masking of a tonal signal in a four-tone masker. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2010; 127:2441-50. [PMID: 20370027 PMCID: PMC2865701 DOI: 10.1121/1.3298588] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/22/2009] [Revised: 12/31/2009] [Accepted: 01/05/2010] [Indexed: 05/29/2023]
Abstract
This study examined contributions of peripheral excitation and informational masking to the variability in masking effectiveness observed across samples of multi-tonal maskers. Detection thresholds were measured for a 1000-Hz signal presented simultaneously with each of 25, four-tone masker samples. Using a two-interval, forced-choice adaptive task, thresholds were measured with each sample fixed throughout trial blocks for ten listeners. Average thresholds differed by as much as 26 dB across samples. An excitation-based model of partial loudness [Moore, B. C. J. et al. (1997). J. Audio Eng. Soc. 45, 224-237] was used to predict thresholds. These predictions accounted for a significant portion of variance in the data of several listeners, but no relation between the model and data was observed for many listeners. Moreover, substantial individual differences, on the order of 41 dB, were observed for some maskers. The largest individual differences were found for maskers predicted to produce minimal excitation-based masking. In subsequent conditions, one of five maskers was randomly presented in each interval. The difference in performance for samples with low versus high predicted thresholds was reduced in random compared to fixed conditions. These findings are consistent with a trading relation whereby informational masking is largest for conditions in which excitation-based masking is smallest.
Collapse
Affiliation(s)
- Lori J Leibold
- Department of Allied Health Sciences, The University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA.
| | | | | | | |
Collapse
|
24
|
Qian J, Richards VM. The effect of onset asynchrony on relative weights in profile analysis. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2010; 127:2461-2465. [PMID: 20370029 PMCID: PMC2865702 DOI: 10.1121/1.3314251] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/09/2009] [Revised: 01/17/2010] [Accepted: 01/20/2010] [Indexed: 05/29/2023]
Abstract
Decision weights were estimated in a profile analysis task to determine whether onset asynchronies between the signal component and the nonsignal components encourage the segregation of the signal relative to the other components. The signal component onset was either synchronous or asynchronous with respect to the nonsignal components. In the asynchronous conditions, thresholds were higher and the decision weights were less efficient than in the synchronous conditions. These data are largely consistent with a segregation hypothesis: onset asynchrony encourages subjects to shift strategies from one of spectral shape discrimination toward one of intensity discrimination.
Collapse
Affiliation(s)
- Jinyu Qian
- Department of Psychology, University of Pennsylvania, 3401 Walnut Street, Suite 302C, Philadelphia, Pennsylvania 19104, USA.
| | | |
Collapse
|
25
|
Shub DE, Richards VM. Psychophysical spectro-temporal receptive fields in an auditory task. Hear Res 2009; 251:1-9. [PMID: 19249339 PMCID: PMC2692227 DOI: 10.1016/j.heares.2009.02.007] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/24/2008] [Revised: 02/12/2009] [Accepted: 02/13/2009] [Indexed: 11/19/2022]
Abstract
Psychophysical relative weighting functions, which provide information about the importance of different regions of a stimulus in forming decisions, are traditionally estimated using trial-based procedures, where a single stimulus is presented and a single response is recorded. Everyday listening is much more "free-running" in that we often must detect randomly occurring signals in the presence of a continuous background. Psychophysical relative weighting functions have not been measured with free-running paradigms. Here, we combine a free-running paradigm with the reverse correlation technique used to estimate physiological spectro-temporal receptive fields (STRFs) to generate psychophysical relative weighting functions that are analogous to physiological STRFs. The psychophysical task required the detection of a fixed target signal (a sequence of spectro-temporally coherent tone pips with a known frequency) in the presence of a continuously presented informational masker (spectro-temporally random tone pips). A comparison of psychophysical relative weighting functions estimated with the current free-running paradigm and trial-based paradigms, suggests that in informational-masking tasks subjects' decision strategies are similar in both free-running and trial-based paradigms. For more cognitively challenging tasks there may be differences in the decision strategies with free-running and trial-based paradigms.
Collapse
Affiliation(s)
- Daniel E Shub
- Department of Psychology, University of Pennsylvania, 3451 Walnut St., Philadelphia, PA 19104, USA.
| | | |
Collapse
|
26
|
Oberfeld D. The decision process in forward-masked intensity discrimination: evidence from molecular analyses. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2009; 125:294-303. [PMID: 19173416 DOI: 10.1121/1.3021296] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/27/2023]
Abstract
In a two-interval forced-choice intensity discrimination task presenting a fixed increment, the level of the forward masker in interval 1 and interval 2 was sampled independently from the same normal distribution on each trial. Mean and standard deviation of the distribution were varied. Correlational analyses of the trial-by-trial data revealed different decision strategies depending on the relation between mean masker level and standard level. If the two levels were identical, listeners tended to select the interval containing the higher-level masker, behaving like an energy detector at the output of a temporal window of integration. For mean masker level higher than the standard level, most listeners showed a negative correlation between the masker level in a given interval and the probability of selecting this interval, indicating a strategy of comparing the masker loudness and the target loudness in each of the two observation intervals, and voting for the interval where the loudness difference was smaller. Implications for models of forward-masked intensity discrimination and differences from decision strategies reported for forward-masked detection tasks [Jesteadt et al., (2005). "Effect of variability in level on forward masking and on increment detection," J. Acoust. Soc. Am. 118, 325-337] are discussed.
Collapse
Affiliation(s)
- Daniel Oberfeld
- Department of Psychology, Johannes Gutenberg-Universitat Mainz, Mainz, Germany.
| |
Collapse
|
27
|
Leibold LJ, Tan H, Jesteadt W. Spectral weights for sample discrimination as a function of overall level. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2009; 125:339-346. [PMID: 19173421 PMCID: PMC2659502 DOI: 10.1121/1.3033741] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/28/2007] [Revised: 10/23/2008] [Accepted: 10/27/2008] [Indexed: 05/27/2023]
Abstract
Doherty and Lutfi [(1996). "Spectral weights for overall level discrimination in listeners with sensorineural hearing loss," J. Acoust. Soc. Am. 99, 1053-1058] examined the weights assigned to individual components of a six-tone complex during a sample discrimination task and reported that hearing-impaired subjects gave the most weight to components in the region of their high-frequency hearing loss. In contrast, weighting patterns varied for normal-hearing subjects. In the current study, the same six-tone complex, comprised of the octave frequencies from 0.25 to 8 kHz, was presented to three subjects with normal hearing in high-pass noise, in low-pass noise, and in quiet at two overall levels. Consistent with Doherty and Lutfi, subjects assigned more weight to the 4-kHz component in the high-pass noise condition, but roughly equal weight to all components in the lower-level quiet condition. Weights in the low-pass noise and higher-level quiet conditions, however, were similar to those in the high-pass noise condition. A second experiment compared weights for seven subjects in quiet at four different mean levels. Weights for the highest-frequency components increased as the overall level of the complexes was increased. These results suggest that overall level, rather than degree of hearing loss or sensation level, was the primary cause of the effect that Doherty and Lutfi observed.
Collapse
Affiliation(s)
- Lori J Leibold
- Boys Town National Research Hospital, Omaha, Nebraska 68131, USA.
| | | | | |
Collapse
|
28
|
Huang R, Richards VM. Estimates of internal templates for the detection of sequential tonal patterns. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2008; 124:3831-40. [PMID: 19206809 PMCID: PMC2654203 DOI: 10.1121/1.2967827] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/01/2007] [Revised: 06/18/2007] [Accepted: 06/24/2008] [Indexed: 05/25/2023]
Abstract
In this experiment, listeners detected sequential tonal patterns embedded in multitone multiburst random maskers. The maskers consisted of eight 30 ms bursts of random-frequency tones. The signal, when present, occupied the central six bursts and was centered at 1000 Hz. The six sequential signal tones formed several spectro-temporal patterns: an equal-frequency pattern, three ascending patterns with frequency ranges spanning 0.5-, 1-, and 2-equivalent rectangular bandwidths (ERBs), and a random pattern with frequencies drawn at random from the range of 925-1075 Hz. The total number of tones in each burst, m, was varied to determine detection threshold. The detectability of the signal pattern declined as the frequency range of the signal pattern increased, and when the signal was random. Relative weights as a function of time and frequency, interpreted as listeners' internal templates, depended systematically on the properties of the signal pattern tested. The templates indicated that when sensitivity was poor, listeners integrated increasingly broad spectro-temporal regions around the signal frequencies, and sometimes integrated energy from the final burst even though the signal tones never occupied the final burst.
Collapse
Affiliation(s)
- Rong Huang
- Department of Psychology, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA.
| | | |
Collapse
|
29
|
Titze IR. Nonlinear source-filter coupling in phonation: theory. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2008; 123:2733-49. [PMID: 18529191 PMCID: PMC2811547 DOI: 10.1121/1.2832337] [Citation(s) in RCA: 225] [Impact Index Per Article: 14.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/05/2023]
Abstract
A theory of interaction between the source of sound in phonation and the vocal tract filter is developed. The degree of interaction is controlled by the cross-sectional area of the laryngeal vestibule (epilarynx tube), which raises the inertive reactance of the supraglottal vocal tract. Both subglottal and supraglottal reactances can enhance the driving pressures of the vocal folds and the glottal flow, thereby increasing the energy level at the source. The theory predicts that instabilities in vibration modes may occur when harmonics pass through formants during pitch or vowel changes. Unlike in most musical instruments (e.g., woodwinds and brasses), a stable harmonic source spectrum is not obtained by tuning harmonics to vocal tract resonances, but rather by placing harmonics into favorable reactance regions. This allows for positive reinforcement of the harmonics by supraglottal inertive reactance (and to a lesser degree by subglottal compliant reactance) without the risk of instability. The traditional linear source-filter theory is encumbered with possible inconsistencies in the glottal flow spectrum, which is shown to be influenced by interaction. In addition, the linear theory does not predict bifurcations in the dynamical behavior of vocal fold vibration due to acoustic loading by the vocal tract.
Collapse
Affiliation(s)
- Ingo R Titze
- Department of Speech Pathology and Audiology, The University of Iowa, Iowa City, Iowa 52242, USA.
| |
Collapse
|
30
|
Alexander JM, Lutfi RA. Sample discrimination of frequency by hearing-impaired and normal-hearing listeners. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2008; 123:241-253. [PMID: 18177154 DOI: 10.1121/1.2816415] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/25/2023]
Abstract
In a multiple observation, sample discrimination experiment normal-hearing (NH) and hearing-impaired (HI) listeners heard two multitone complexes each consisting of six simultaneous tones with nominal frequencies spaced evenly on an ERB(N) logarithmic scale between 257 and 6930 Hz. On every trial, the frequency of each tone was sampled from a normal distribution centered near its nominal frequency. In one interval of a 2IFC task, all tones were sampled from distributions lower in mean frequency and in the other interval from distributions higher in mean frequency. Listeners had to identify the latter interval. Decision weights were obtained from multiple regression analysis of the between- interval frequency differences for each tone and listeners' responses. Frequency difference limens (an index of sensorineural resolution) and decision weights for each tone were used to predict the sensitivity of different decision-theoretic models. Results indicate that low-frequency tones were given much greater perceptual weight than high-frequency tones by both groups of listeners. This tendency increased as hearing loss increased and as sensorineural resolution decreased, resulting in significantly less efficient weighting strategies for the HI listeners. Overall, results indicate that HI listeners integrated frequency information less optimally than NH listeners, even after accounting for differences in sensorineural resolution.
Collapse
Affiliation(s)
- Joshua M Alexander
- Department of Psychology, University of Wisconsin, Madison, Wisconsin 53706, USA.
| | | |
Collapse
|
31
|
|
32
|
Leibold LJ, Neff DL. Effects of masker-spectral variability and masker fringes in children and adults. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2007; 121:3666-76. [PMID: 17552718 DOI: 10.1121/1.2723664] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/15/2023]
Abstract
This study examined the degree to which masker-spectral variability contributes to children's susceptibility to informational masking. Listeners were younger children (5-7 years), older children (8-10 years), and adults (19-34 years). Masked thresholds were measured using a 2IFC, adaptive procedure for a 300-ms, 1000-Hz signal presented simultaneously with (1) broadband noise, (2) a random-frequency ten-tone complex, or (3) a fixed-frequency ten-tone complex. Maskers were presented at an overall level of 60 dB SPL. Thresholds were similar across age for the noise condition. Thresholds for most children were higher than for most adults, however, for both ten-tone conditions. The average difference in threshold between random and fixed ten-tone conditions was comparable across age, suggesting a similar effect of reducing masker-spectral variability in children and adults. Children appear more likely to be susceptible to informational masking than adults, however, both with and in the absence of masker-spectral variability. The addition of a masker fringe (delayed onset of signal relative to masker) provided a release from masking for fixed and random ten-tone conditions in all age groups, suggesting at least part of the masking observed for both ten-tone maskers was informational.
Collapse
Affiliation(s)
- Lori J Leibold
- Boys Town National Research Hospital, Omaha, Nebraska 68131, USA.
| | | |
Collapse
|
33
|
Best V, Gallun FJ, Carlile S, Shinn-Cunningham BG. Binaural interference and auditory grouping. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2007; 121:1070-6. [PMID: 17348529 DOI: 10.1121/1.2407738] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/14/2023]
Abstract
The phenomenon of binaural interference, where binaural judgments of a high-frequency target stimulus are disrupted by the presence of a simultaneous low-frequency interferer, can largely be explained using principles of auditory grouping and segregation. Evidence for this relationship comes from a number of previous studies showing that the manipulation of simultaneous grouping cues such as harmonicity and onset synchrony can influence the strength of the phenomenon. In this study, it is shown that sequential grouping cues can also influence whether binaural interference occurs. Subjects indicated the lateral position of a high-frequency sinusoidally amplitude-modulated (SAM) tone containing an interaural time difference. Perceived lateral positions were reduced by the presence of a simultaneous diotic low-frequency SAM tone, but were largely restored when the interferer was "captured" in a stream of identical tones. A control condition confirmed that the effect was not due to peripheral adaptation. The data lend further support to the idea that binaural interference is affected by processes related to the perceptual organization of auditory information. Modifications to existing grouping-based models are proposed that may help account for binaural interference effects more successfully.
Collapse
Affiliation(s)
- Virginia Best
- Hearing Research Center Boston University, Boston, Massachusetts 02215, USA
| | | | | | | |
Collapse
|
34
|
Lutfi RA, Jesteadt W. Molecular analysis of the effect of relative tone level on multitone pattern discrimination. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2006; 120:3853-60. [PMID: 17225412 DOI: 10.1121/1.2361184] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/13/2023]
Abstract
Molecular psychophysics attempts to model the observer's response to stimuli as they vary from trial to trial. The approach has gained popularity in multitone pattern discrimination studies as a means of estimating the relative reliance or decision weight listeners give to different tones in the pattern. Various factors affecting decision weights have been examined, but one largely ignored is the relative level of tones in the pattern. In the present study listeners detected a level-increment in a sequence of 5, 100-ms, 2.0-kHz tone bursts alternating in level between 40 and 80 dB SPL. The level increment was made largest on the 40-dB tones, yet despite this all four highly-practiced listeners gave near exclusive weight to the 80-dB tones. The effect was the same when the tones were replaced by bursts of broadband Gaussian noise alternating in level. It was reduced only when the level differences were made <10 dB, and it was entirely reversed only when the low-level tones alternated with louder bursts of Gaussian noise. The results are discussed in terms of the effects of both sensory and perceptual factors on estimates of decision weights.
Collapse
Affiliation(s)
- Robert A Lutfi
- Department of Communicative Disorders and Waisman Center University of Wisconsin, Madison, Wisconsin 53706, USA
| | | |
Collapse
|
35
|
Leibold LJ, Werner LA. Effect of masker-frequency variability on the detection performance of infants and adults. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2006; 119:3960-70. [PMID: 16838539 DOI: 10.1121/1.2200150] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/10/2023]
Abstract
The effect of masker-frequency variability on the detection performance of 7-9 month-old infants and adults was examined. Listeners detected a 300-ms 1000-Hz pure tone masked by: (1) A random-frequency two-tone complex; (2) a fixed-frequency two-tone complex; or (3) a broadband noise. Maskers repeated at 300-ms intervals throughout testing at 60 dB SPL. The signal was presented simultaneously with one presentation of the masker. Thresholds were determined adaptively using an observer-based method. Infants' thresholds were higher than adults' in all conditions, but infants' and adults' thresholds changed with masker condition in qualitatively similar ways. The fixed two-tone complex produced masking for both age groups, but more masking for infants than for adults. For infants and adults, the random two-tone complex produced more masking than broadband noise, but the difference was greater for infants than for adults. For infants and adults, the random two-tone complex produced more masking than the fixed two-tone complex, and the difference between these conditions was similar for both age groups. These results suggest that infants are more susceptible to informational masking than adults in the absence of spectral variability. Whether infants are more susceptible to the effects of masker-frequency variability than adults remains to be clarified.
Collapse
Affiliation(s)
- Lori J Leibold
- Department of Speech and Hearing Sciences, University of Washington, Seattle, Washington 98195, USA.
| | | |
Collapse
|
36
|
Richards VM, Tang Z. Estimates of effective frequency selectivity based on the detection of a tone added to complex maskers. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2006; 119:1574-84. [PMID: 16583902 DOI: 10.1121/1.2165001] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/08/2023]
Abstract
In Experiment 1, the validity of parameters associated with the roex(p, r) auditory filter shape was examined for three different types of maskers: (a) A noise masker, (b) a random 12-tone masker whose frequencies varied on a burst-by-burst basis [multiple-burst different (MBD)], and (c) a random 12-tone masker whose frequencies were the same across bursts [multiple-burst same (MBS)]. First, the power spectrum model of masking was used to estimate auditory filter shapes for four observers. Second, the resulting auditory filter shapes were used in a computer simulation that provided an estimate of internal noise for each observer. Third, relative weights across frequency were estimated for each observer and each masker type. For the noise masker, these analyses provided predictions and relative weights that were consistent across the three analyses. For the MBD and MBS maskers, there was little consistency; neither the estimated internal noise nor the estimated relative weights reliably supported a single-filter model of detection. In Experiment 2, the time course for the detection of a tone added to an MBD masker was evaluated by estimating relative weights jointly in time and frequency. The relative weights at the signal frequency formed a rough inverse "U" across time.
Collapse
Affiliation(s)
- Virginia M Richards
- Department of Psychology, University of Pennsylvania, 3401 Walnut Street, Suite 302C, Philadelphia, Pennsylvania 19104, USA.
| | | |
Collapse
|
37
|
Dye RH, Stellmack MA, Jurcin NF. Observer weighting strategies in interaural time-difference discrimination and monaural level discrimination for a multi-tone complex. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2005; 117:3079-90. [PMID: 15957776 DOI: 10.1121/1.1861832] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/03/2023]
Abstract
Two experiments measured listeners' abilities to weight information from different components in a complex of 553, 753, and 953 Hz. The goal was to determine whether or not the ability to adjust perceptual weights generalized across tasks. Weights were measured by binary logistic regression between stimulus values that were sampled from Gaussian distributions and listeners' responses. The first task was interaural time discrimination in which listeners judged the laterality of the target component. The second task was monaural level discrimination in which listeners indicated whether the level of the target component decreased or increased across two intervals. For both experiments, each of the three components served as the target. Ten listeners participated in both experiments. The results showed that those individuals who adjusted perceptual weights in the interaural time experiment could also do so in the monaural level discrimination task. The fact that the same individuals appeared to be analytic in both tasks is an indication that the weights measure the ability to attend to a particular region of the spectrum while ignoring other spectral regions.
Collapse
Affiliation(s)
- Raymond H Dye
- Parmly Hearing Institute, Loyola University of Chicago, Chicago, Illinois 60626, USA
| | | | | |
Collapse
|