1
|
Leibold LJ, Buss E, Miller MK, Cowan T, McCreery RW, Oleson J, Rodriguez B, Calandruccio L. Development of the Children's English and Spanish Speech Recognition Test: Psychometric Properties, Feasibility, Reliability, and Normative Data. Ear Hear 2024; 45:860-877. [PMID: 38334698 PMCID: PMC11178473 DOI: 10.1097/aud.0000000000001480] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/10/2024]
Abstract
OBJECTIVES The Children's English and Spanish Speech Recognition (ChEgSS) test is a computer-based tool for assessing closed-set word recognition in English and in Spanish, with a masker that is either speech-shaped noise or competing speech. The present study was conducted to (1) characterize the psychometric properties of the ChEgSS test, (2) evaluate feasibility and reliability for a large cohort of Spanish/English bilingual children with normal hearing, and (3) establish normative data. DESIGN Three experiments were conducted to evaluate speech perception in children (4-17 years) and adults (19-40 years) with normal hearing using the ChEgSS test. In Experiment 1, data were collected from Spanish/English bilingual and English monolingual adults at multiple, fixed signal-to-noise ratios. Psychometric functions were fitted to the word-level data to characterize variability across target words in each language and in each masker condition. In Experiment 2, Spanish/English bilingual adults were tested using an adaptive tracking procedure to evaluate the influence of different target-word normalization approaches on the reliability of estimates of masked-speech recognition thresholds corresponding to 70.7% correct word recognition and to determine the optimal number of reversals needed to obtain reliable estimates. In Experiment 3, Spanish/English bilingual and English monolingual children completed speech perception testing using the ChEgSS test to (1) characterize feasibility across age and language group, (2) evaluate test-retest reliability, and (3) establish normative data. RESULTS Experiments 1 and 2 yielded data that are essential for stimulus normalization, optimizing threshold estimation procedures, and interpreting threshold data across test language and masker type. Findings obtained from Spanish/English bilingual and English monolingual children with normal hearing in Experiment 3 support feasibility and demonstrate reliability for use with children as young as 4 years of age. Equivalent results for testing in English and Spanish were observed for Spanish/English bilingual children, contingent on adequate proficiency in the target language. Regression-based threshold norms were established for Spanish/English bilingual and English monolingual children between 4 and 17 years of age. CONCLUSIONS The present findings indicate the ChEgSS test is appropriate for testing a wide age range of children with normal hearing in either Spanish, English, or both languages. The ChEgSS test is currently being evaluated in a large cohort of patients with hearing loss at pediatric audiology clinics across the United States. Results will be compared with normative data established in the present study and with established clinical measures used to evaluate English- and Spanish-speaking children. Questionnaire data from parents and clinician feedback will be used to further improve test procedures.
Collapse
Affiliation(s)
- Lori J Leibold
- Boys Town National Research Hospital, Center for Hearing Research, Omaha, Nebraska, USA
| | - Emily Buss
- Department of Otolaryngology/HNS, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| | - Margaret K Miller
- Boys Town National Research Hospital, Center for Hearing Research, Omaha, Nebraska, USA
| | - Tiana Cowan
- Boys Town National Research Hospital, Center for Hearing Research, Omaha, Nebraska, USA
| | - Ryan W McCreery
- Boys Town National Research Hospital, Center for Hearing Research, Omaha, Nebraska, USA
| | - Jacob Oleson
- Department of Biostatistics, University of Iowa, Iowa City, Iowa, USA
| | - Barbara Rodriguez
- Department of Speech and Hearing Sciences, University of New Mexico, Albuquerque, New Mexico, USA
| | - Lauren Calandruccio
- Department of Psychological Sciences, Case Western Reserve University, Cleveland, Ohio, USA
| |
Collapse
|
2
|
Buss E, Dai H. Steep temporal integration for tone detection in a multi-tone, random-frequency masker. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2023; 154:3429-3437. [PMID: 38015027 PMCID: PMC10769565 DOI: 10.1121/10.0022512] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/07/2023] [Revised: 10/11/2023] [Accepted: 11/04/2023] [Indexed: 11/29/2023]
Abstract
In the detection of a tone burst, masking by several tones with random frequencies can produce steep temporal integration. This feature was evaluated for nine normal-hearing adults for 1000-Hz tone bursts presented in a continuous train of four-tone masker bursts. Masker frequencies were randomly selected (250-4000 Hz) for each burst, with the proviso that all tones were separated by ≥0.2 oct. Bursts were 80-ms in duration; when present, signal bursts were gated synchronously with masker bursts. The observed mean temporal-integration function was exceptionally steep-thresholds improved by 26 dB as signal duration increased from 1 to 8 bursts. The results also showed that the individual differences were large, and that the mean psychometric function was exceptionally shallow, spanning a range of 35 dB between 0.6 and 0.9 proportion correct responses, consistent with previous reports. These findings were interpreted in the context of three signal-detection models, one based on the absolute-level cue, and two based on the relative-level cue via template matching; all cues were derived from the excitation patterns of the stimuli. Template-matching models were able to predict the shallow psychometric functions as observed, but all models fall short in the steepness of the observed temporal integration.
Collapse
Affiliation(s)
- Emily Buss
- Department of Otolaryngology/Head and Neck Surgery, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, USA
| | - Huanping Dai
- Department of Speech, Language, and Hearing Sciences, University of Arizona, Tucson, Arizona 85721, USA
| |
Collapse
|
3
|
Auditory and somatosensory feedback mechanisms of laryngeal and articulatory speech motor control. Exp Brain Res 2022; 240:2155-2173. [PMID: 35736994 DOI: 10.1007/s00221-022-06395-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2022] [Accepted: 06/02/2022] [Indexed: 11/04/2022]
Abstract
PURPOSE Speech production is a complex motor task involving multiple subsystems. The relationships between these subsystems need to be comprehensively investigated to understand the underlying mechanisms of speech production. The goal of this paper is to examine the differential contributions of 1) auditory and somatosensory feedback control mechanisms, and 2) laryngeal and articulatory speech production subsystems on speech motor control at an individual speaker level using altered auditory and somatosensory feedback paradigms. METHODS Twenty young adults completed speaking tasks in which sudden and unpredictable auditory and physical perturbations were applied to the laryngeal and articulatory speech production subsystems. Auditory perturbations were applied to laryngeal or articulatory acoustic features of speech. Physical perturbations were applied to the larynx and the jaw. Pearson-product moment correlation coefficients were calculated between 1) auditory and somatosensory reflexive responses to investigate relationships between auditory and somatosensory feedback control mechanisms, and 2) laryngeal and articulatory reflexive responses as well as acuity measures to investigate the relationship between auditory-motor features of laryngeal and articulatory subsystems. RESULTS No statistically significant correlations were found concerning the relationships between auditory and somatosensory feedback. No statistically significant correlations were found between auditory-motor features in the laryngeal and articulatory control subsystems. CONCLUSION Results suggest that the laryngeal and articulatory speech production subsystems operate with differential auditory and somatosensory feedback control mechanisms. The outcomes suggest that current models of speech motor control should consider decoupling laryngeal and articulatory domains to better model speech motor control processes.
Collapse
|
4
|
Mathematical framework for place coding in the auditory system. PLoS Comput Biol 2021; 17:e1009251. [PMID: 34339409 PMCID: PMC8360601 DOI: 10.1371/journal.pcbi.1009251] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2020] [Revised: 08/12/2021] [Accepted: 07/06/2021] [Indexed: 11/18/2022] Open
Abstract
In the auditory system, tonotopy is postulated to be the substrate for a place code, where sound frequency is encoded by the location of the neurons that fire during the stimulus. Though conceptually simple, the computations that allow for the representation of intensity and complex sounds are poorly understood. Here, a mathematical framework is developed in order to define clearly the conditions that support a place code. To accommodate both frequency and intensity information, the neural network is described as a space with elements that represent individual neurons and clusters of neurons. A mapping is then constructed from acoustic space to neural space so that frequency and intensity are encoded, respectively, by the location and size of the clusters. Algebraic operations -addition and multiplication- are derived to elucidate the rules for representing, assembling, and modulating multi-frequency sound in networks. The resulting outcomes of these operations are consistent with network simulations as well as with electrophysiological and psychophysical data. The analyses show how both frequency and intensity can be encoded with a purely place code, without the need for rate or temporal coding schemes. The algebraic operations are used to describe loudness summation and suggest a mechanism for the critical band. The mathematical approach complements experimental and computational approaches and provides a foundation for interpreting data and constructing models.
Collapse
|
5
|
Masked Sentence Recognition in Children, Young Adults, and Older Adults: Age-Dependent Effects of Semantic Context and Masker Type. Ear Hear 2020; 40:1117-1126. [PMID: 30601213 DOI: 10.1097/aud.0000000000000692] [Citation(s) in RCA: 33] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
OBJECTIVES Masked speech recognition in normal-hearing listeners depends in part on masker type and semantic context of the target. Children and older adults are more susceptible to masking than young adults, particularly when the masker is speech. Semantic context has been shown to facilitate noise-masked sentence recognition in all age groups, but it is not known whether age affects a listener's ability to use context with a speech masker. The purpose of the present study was to evaluate the effect of masker type and semantic context of the target as a function of listener age. DESIGN Listeners were children (5 to 16 years), young adults (19 to 30 years), and older adults (67 to 81 years), all with normal or near-normal hearing. Maskers were either speech-shaped noise or two-talker speech, and targets were either semantically correct (high context) sentences or semantically anomalous (low context) sentences. RESULTS As predicted, speech reception thresholds were lower for young adults than either children or older adults. Age effects were larger for the two-talker masker than the speech-shaped noise masker, and the effect of masker type was larger in children than older adults. Performance tended to be better for targets with high than low semantic context, but this benefit depended on age group and masker type. In contrast to adults, children benefitted less from context in the two-talker speech masker than the speech-shaped noise masker. Context effects were small compared with differences across age and masker type. CONCLUSIONS Different effects of masker type and target context are observed at different points across the lifespan. While the two-talker masker is particularly challenging for children and older adults, the speech masker may limit the use of semantic context in children but not adults.
Collapse
|
6
|
Leibold LJ, Buss E. Yes/no and two-interval forced-choice tasks with listener-based vs observer-based responses. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2020; 147:1588. [PMID: 32237812 PMCID: PMC7067614 DOI: 10.1121/10.0000894] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/07/2019] [Revised: 02/20/2020] [Accepted: 02/24/2020] [Indexed: 06/11/2023]
Abstract
Observer-based procedures are used to assess auditory behavior in infants, often incorporating adaptive tracking algorithms. These procedures are reliable, but effects of modifications made to accommodate infant testing are not fully understood. One modification is that observation intervals are undefined for the listener, introducing signal-temporal uncertainty and increasing the likelihood that listener response bias will influence estimates of performance. The effect of these factors was evaluated by comparing threshold estimates obtained from adults using two tasks: (1) single-interval, yes/no and (2) two-interval, forced-choice. Detection thresholds were estimated adaptively for a 1000-Hz FM tone in quiet and for a word presented in two-talker speech masking. Trials were initiated and judged by the observer (observer-based) or the listener (listener-based). Thus, listening intervals were temporally uncertain in observer-based procedures and temporally defined in listener-based procedures. Thresholds were higher for observer-based relative to corresponding listener-based procedures. The magnitude of this difference was similar across the yes/no and two-interval tasks, and was larger for masked word detection than tone detection in quiet. Listeners adopted a conservative criterion when tested using the observer-based, yes/no procedure, but modeling results suggest that signal-temporal uncertainty accounts for the largest portion of the threshold difference between observer-based and listener-based procedures.
Collapse
Affiliation(s)
- Lori J Leibold
- Center for Hearing Research, Boys Town National Research Hospital, Omaha, Nebraska 68131, USA
| | - Emily Buss
- Departement of Otolaryngology/Head and Neck Surgery, The University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, USA
| |
Collapse
|
7
|
Wang Y, Lu Z, Yang X, Liu C. Measuring Mandarin Speech Recognition Thresholds Using the Method of Adaptive Tracking. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2019; 62:2009-2017. [PMID: 31112665 DOI: 10.1044/2019_jslhr-h-18-0162] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
Purpose The purpose of this study was to examine (a) the psychometric functions of Mandarin speech-on-speech masking with a comparison to the psychometric properties of the English version and (b) the feasibility of using the method of adaptive tracking (MAT) to measure Mandarin speech recognition thresholds in different masker conditions using a Chinese coordinate response measure (C-CRM) speech corpus. Method The study followed the example of the English coordinate response measure (E-CRM) speech corpus and designed a Chinese version. Sixteen listeners were recruited for a monaural Mandarin speech recognition test using C-CRM materials in 5 competing speech masker conditions: 1-talker different-gender, 1-talker same-gender, and 2-, 4-, and 8-talker maskers. Both the MAT and the method of constant stimuli (MCS) were used and then compared. Results Results of the MCS revealed that Mandarin listeners' psychometric functions varied depending upon masker characteristics (e.g., the number and gender of talkers), because significant differences were suggested among the 5 masker conditions (mainly between 1-talker and multitalker), as measured in the slope b and the 50% threshold. Meanwhile, speech identification improved monotonically with increasing signal-to-noise ratios for all 5 masker conditions, except that a plateau was observed with signal-to-noise ratios from -10 to 0 dB in the 1-talker maskers. The properties of C-CRM psychometric functions largely agreed with E-CRM findings, and yet lower speech recognition thresholds were found in C-CRM. Meanwhile, the comparisons of the thresholds estimated in the MAT and the MCS showed no significant differences for all 5 masker conditions. Conclusion The results suggest that masker characteristics greatly affected Mandarin psychometric functions and monotonicity was present in all 5 C-CRM maskers, echoing the E-CRM literature in a great extent. Interestingly, target words of C-CRM were found easier to recognize in multitalker babbles than those of E-CRM. Moreover, comparisons of the MAT and the MCS suggest that the MAT provided a feasible method for measuring Mandarin speech recognition thresholds with the C-CRM speech maskers.
Collapse
Affiliation(s)
- Yuxia Wang
- School of Foreign Languages, Shanghai Jiao Tong University, China
| | - Zhaoyu Lu
- School of Foreign Languages, Shanghai Jiao Tong University, China
| | - Xiaohu Yang
- School of Foreign Languages, Tongji University, China
| | - Chang Liu
- Department of Communication Sciences and Disorders, The University of Texas at Austin
| |
Collapse
|
8
|
Sliva DD, Black CJ, Bowary P, Agrawal U, Santoyo JF, Philip NS, Greenberg BD, Moore CI, Jones SR. A Prospective Study of the Impact of Transcranial Alternating Current Stimulation on EEG Correlates of Somatosensory Perception. Front Psychol 2018; 9:2117. [PMID: 30515114 PMCID: PMC6255923 DOI: 10.3389/fpsyg.2018.02117] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2018] [Accepted: 10/15/2018] [Indexed: 01/30/2023] Open
Abstract
The (8-12 Hz) neocortical alpha rhythm is associated with shifts in attention across sensory systems, and is thought to represent a sensory gating mechanism for the inhibitory control of cortical processing. The present preliminary study sought to explore whether alpha frequency transcranial alternating current stimulation (tACS) could modulate endogenous alpha power in the somatosensory system, and whether the hypothesized modulation would causally impact perception of tactile stimuli at perceptual threshold. We combined electroencephalography (EEG) with simultaneous brief and intermittent tACS applied over primary somatosensory cortex at individuals' endogenous alpha frequency during a tactile detection task (n = 12 for EEG, n = 20 for behavior). EEG-measured pre-stimulus alpha power was higher on non-perceived than perceived trials, and analogous perceptual correlates emerged in early components of the tactile evoked response. Further, baseline normalized tactile detection performance was significantly lower during alpha than sham tACS, but the effect did not last into the post-tACS time period. Pre- to post-tACS changes in alpha power were linearly dependent upon baseline state, such that alpha power tended to increase when pre-tACS alpha power was low, and decrease when it was high. However, these observations were comparable in both groups, and not associated with evidence of tACS-induced alpha power modulation. Nevertheless, the tactile stimulus evoked response potential (ERP) revealed a potentially lasting impact of alpha tACS on circuit dynamics. The post-tACS ERP was marked by the emergence of a prominent peak ∼70 ms post-stimulus, which was not discernible post-sham, or in either pre-stimulation condition. Computational neural modeling designed to simulate macroscale EEG signals supported the hypothesis that the emergence of this peak could reflect synaptic plasticity mechanisms induced by tACS. The primary lesson learned in this study, which commanded a small sample size, was that while our experimental paradigm provided some evidence of an influence of tACS on behavior and circuit dynamics, it was not sufficient to induce observable causal effects of tACS on EEG-measured alpha oscillations. We discuss limitations and suggest improvements that may help further delineate a causal influence of tACS on cortical dynamics and perception in future studies.
Collapse
Affiliation(s)
- Danielle D. Sliva
- Department of Neuroscience, Brown University, Providence, RI, United States
| | - Christopher J. Black
- Department of Biomedical Engineering, School of Engineering, Brown University, Providence, RI, United States
| | - Paul Bowary
- Department of Psychiatry and Human Behavior, Brown University Medical School, Providence, RI, United States
- Center for Neurorestoration and Neurotechnology, Providence VA Medical Center, Providence, RI, United States
- Butler Hospital, Providence, RI, United States
| | - Uday Agrawal
- Harvard Medical School, Boston, MA, United States
| | - Juan F. Santoyo
- Department of Neuroscience, Brown University, Providence, RI, United States
| | - Noah S. Philip
- Department of Psychiatry and Human Behavior, Brown University Medical School, Providence, RI, United States
- Center for Neurorestoration and Neurotechnology, Providence VA Medical Center, Providence, RI, United States
- Butler Hospital, Providence, RI, United States
| | - Benjamin D. Greenberg
- Department of Psychiatry and Human Behavior, Brown University Medical School, Providence, RI, United States
- Center for Neurorestoration and Neurotechnology, Providence VA Medical Center, Providence, RI, United States
- Butler Hospital, Providence, RI, United States
| | | | - Stephanie R. Jones
- Department of Neuroscience, Brown University, Providence, RI, United States
- Center for Neurorestoration and Neurotechnology, Providence VA Medical Center, Providence, RI, United States
| |
Collapse
|
9
|
Zaltz Y, Roth DAE, Kishon-Rabin L. Is the Role of External Feedback in Auditory Skill Learning Age Dependent? JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2017; 60:3656-3666. [PMID: 29230476 DOI: 10.1044/2017_jslhr-h-16-0446] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/07/2016] [Accepted: 08/21/2017] [Indexed: 06/07/2023]
Abstract
PURPOSE The purpose of this study is to investigate the role of external feedback in auditory perceptual learning of school-age children as compared with that of adults. METHOD Forty-eight children (7-9 years of age) and 64 adults (20-35 years of age) conducted a training session using an auditory frequency discrimination (difference limen for frequency) task, with external feedback (EF) provided for half of them. RESULTS Data supported the following findings: (a) Children learned the difference limen for frequency task only when EF was provided. (b) The ability of the children to benefit from EF was associated with better cognitive skills. (c) Adults showed significant learning whether EF was provided or not. (d) In children, within-session learning following training was dependent on the provision of feedback, whereas between-sessions learning occurred irrespective of feedback. CONCLUSIONS EF was found beneficial for auditory skill learning of 7-9-year-old children but not for young adults. The data support the supervised Hebbian model for auditory skill learning, suggesting combined bottom-up internal neural feedback controlled by top-down monitoring. In the case of immature executive functions, EF enhanced auditory skill learning. This study has implications for the design of training protocols in the auditory modality for different age groups, as well as for special populations.
Collapse
Affiliation(s)
- Yael Zaltz
- Department of Communication Disorders, The Stanley Steyer School of Health Professions, Sackler Faculty of Medicine, Tel Aviv University, Israel
| | - Daphne Ari-Even Roth
- Department of Communication Disorders, The Stanley Steyer School of Health Professions, Sackler Faculty of Medicine, Tel Aviv University, Israel
| | - Liat Kishon-Rabin
- Department of Communication Disorders, The Stanley Steyer School of Health Professions, Sackler Faculty of Medicine, Tel Aviv University, Israel
| |
Collapse
|
10
|
Dai H. On averaging multiple estimates of decision weights within or across listeners. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2017; 142:3781. [PMID: 29289074 DOI: 10.1121/1.5017717] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]
Abstract
As a practical guide, a formula is provided for averaging multiple estimates of decision weights, for the purpose of improving the accuracy and reliability of the final estimate of decision weights. The averaging over multiple weight estimates can take place either within or across listeners.
Collapse
Affiliation(s)
- Huanping Dai
- Department of Speech, Language, and Hearing Sciences, College of Science, Cognitive Sciences Program, University of Arizona, Tucson, Arizona 85721, USA
| |
Collapse
|
11
|
Shin H, Law R, Tsutsui S, Moore CI, Jones SR. The rate of transient beta frequency events predicts behavior across tasks and species. eLife 2017; 6:29086. [PMID: 29106374 PMCID: PMC5683757 DOI: 10.7554/elife.29086] [Citation(s) in RCA: 155] [Impact Index Per Article: 22.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2017] [Accepted: 11/03/2017] [Indexed: 01/08/2023] Open
Abstract
Beta oscillations (15-29Hz) are among the most prominent signatures of brain activity. Beta power is predictive of healthy and abnormal behaviors, including perception, attention and motor action. In non-averaged signals, beta can emerge as transient high-power 'events'. As such, functionally relevant differences in averaged power across time and trials can reflect changes in event number, power, duration, and/or frequency span. We show that functionally relevant differences in averaged beta power in primary somatosensory neocortex reflect a difference in the number of high-power beta events per trial, i.e. event rate. Further, beta events occurring close to the stimulus were more likely to impair perception. These results are consistent across detection and attention tasks in human magnetoencephalography, and in local field potentials from mice performing a detection task. These results imply that an increased propensity of beta events predicts the failure to effectively transmit information through specific neocortical representations.
Collapse
Affiliation(s)
- Hyeyoung Shin
- Department of Neuroscience, Brown University, Providence, United States
| | - Robert Law
- Department of Neuroscience, Brown University, Providence, United States.,Center for Neurorestoration and Neurotechnology, Providence VA Medical Center, Providence, United States
| | - Shawn Tsutsui
- Department of Neuroscience, Brown University, Providence, United States
| | | | - Stephanie R Jones
- Department of Neuroscience, Brown University, Providence, United States.,Center for Neurorestoration and Neurotechnology, Providence VA Medical Center, Providence, United States
| |
Collapse
|
12
|
Arzounian D, de Kerangal M, de Cheveigné A. Sequential dependencies in pitch judgments. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2017; 142:3047. [PMID: 29195443 DOI: 10.1121/1.5009938] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]
Abstract
Studies that measure pitch discrimination relate a subject's response on each trial to the stimuli presented on that trial, but there is evidence that behavior depends also on earlier stimulation. Here, listeners heard a sequence of tones and reported after each tone whether it was higher or lower in pitch than the previous tone. Frequencies were determined by an adaptive staircase targeting 75% correct, with interleaved tracks to ensure independence between consecutive frequency changes. Responses for this specific task were predicted by a model that took into account the frequency interval on the current trial, as well as the interval and response on the previous trial. This model was superior to simpler models. The dependence on the previous interval was positive (assimilative) for all subjects, consistent with persistence of the sensory trace. The dependence on the previous response was either positive or negative, depending on the subject, consistent with a subject-specific suboptimal response strategy. It is argued that a full stimulus + response model is necessary to account for effects of stimulus history and obtain an accurate estimate of sensory noise.
Collapse
Affiliation(s)
- Dorothée Arzounian
- Laboratoire des Systèmes Perceptifs, Département d'Etudes Cognitives, Ecole normale supérieure, PSL Research University, CNRS, 29 rue d'Ulm, Paris, 75005, France
| | - Mathilde de Kerangal
- The Ear Institute, University College London, 332 Grays Inn Road, Kings Cross, London, WC1X 8EE, United Kingdom
| | - Alain de Cheveigné
- Laboratoire des Systèmes Perceptifs, Département d'Etudes Cognitives, Ecole normale supérieure, PSL Research University, CNRS, 29 rue d'Ulm, Paris, 75005, France
| |
Collapse
|
13
|
García-Pérez MA, Alcalá-Quintana R. The Indecision Model of Psychophysical Performance in Dual-Presentation Tasks: Parameter Estimation and Comparative Analysis of Response Formats. Front Psychol 2017; 8:1142. [PMID: 28747893 PMCID: PMC5506217 DOI: 10.3389/fpsyg.2017.01142] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2017] [Accepted: 06/22/2017] [Indexed: 11/13/2022] Open
Abstract
Psychophysical data from dual-presentation tasks are often collected with the two-alternative forced-choice (2AFC) response format, asking observers to guess when uncertain. For an analytical description of performance, psychometric functions are then fitted to data aggregated across the two orders/positions in which stimuli were presented. Yet, order effects make aggregated data uninterpretable, and the bias with which observers guess when uncertain precludes separating sensory from decisional components of performance. A ternary response format in which observers are also allowed to report indecision should fix these problems, but a comparative analysis with the 2AFC format has never been conducted. In addition, fitting ternary data separated by presentation order poses serious challenges. To address these issues, we extended the indecision model of psychophysical performance to accommodate the ternary, 2AFC, and same-different response formats in detection and discrimination tasks. Relevant issues for parameter estimation are also discussed along with simulation results that document the superiority of the ternary format. These advantages are demonstrated by fitting the indecision model to published detection and discrimination data collected with the ternary, 2AFC, or same-different formats, which had been analyzed differently in the sources. These examples also show that 2AFC data are unsuitable for testing certain types of hypotheses. matlab and R routines written for our purposes are available as Supplementary Material, which should help spread the use of the ternary format for dependable collection and interpretation of psychophysical data.
Collapse
Affiliation(s)
- Miguel A García-Pérez
- Departamento de Metodología, Facultad de Psicología, Universidad ComplutenseMadrid, Spain
| | - Rocío Alcalá-Quintana
- Departamento de Metodología, Facultad de Psicología, Universidad ComplutenseMadrid, Spain
| |
Collapse
|
14
|
|
15
|
Bonino AY, Leibold LJ, Buss E. Effect of signal-temporal uncertainty in children and adults: tone detection in noise or a random-frequency masker. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2013; 134:4446. [PMID: 25669256 PMCID: PMC3874056 DOI: 10.1121/1.4828828] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/07/2012] [Revised: 10/12/2013] [Accepted: 10/18/2013] [Indexed: 06/04/2023]
Abstract
A cue indicating when in time to listen can improve adults' tone detection thresholds, particularly for conditions that produce substantial informational masking. The purpose of this study was to determine if 5- to 13-yr-old children likewise benefit from a light cue indicating when in time to listen for a masked pure-tone signal. Each listener was tested in one of two continuous maskers: Broadband noise (low informational masking) or a random-frequency, two-tone masker (high informational masking). Using a single-interval method of constant stimuli, detection thresholds were measured for two temporal conditions: (1) Temporally-defined, with the listening interval defined by a light cue, and (2) temporally-uncertain, with no light cue. Thresholds estimated from psychometric functions fitted to the data indicated that children and adults benefited to the same degree from the visual cue. Across listeners, the average benefit of a defined listening interval was 1.8 dB in the broadband noise and 8.6 dB in the random-frequency, two-tone masker. Thus, the benefit of knowing when in time to listen was more robust for conditions believed to be dominated by informational masking. An unexpected finding of this study was that children's thresholds were comparable to adults' in the random-frequency, two-tone masker.
Collapse
Affiliation(s)
- Angela Yarnell Bonino
- Department of Allied Health Sciences, The University of North Carolina at Chapel Hill, CB 7190, Chapel Hill, North Carolina 27599
| | - Lori J Leibold
- Department of Allied Health Sciences, The University of North Carolina at Chapel Hill, CB 7190, Chapel Hill, North Carolina 27599
| | - Emily Buss
- Department of Otolaryngology/Head and Neck Surgery, The University of North Carolina at Chapel Hill, CB 7070, Chapel Hill, North Carolina 27599
| |
Collapse
|
16
|
Adaptive psychophysical methods for nonmonotonic psychometric functions. Atten Percept Psychophys 2013; 76:621-41. [DOI: 10.3758/s13414-013-0574-2] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
17
|
Cone B, Whitaker R. Dynamics of infant cortical auditory evoked potentials (CAEPs) for tone and speech tokens. Int J Pediatr Otorhinolaryngol 2013; 77:1162-73. [PMID: 23722003 PMCID: PMC3700622 DOI: 10.1016/j.ijporl.2013.04.030] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/22/2013] [Revised: 04/19/2013] [Accepted: 04/20/2013] [Indexed: 11/24/2022]
Abstract
OBJECTIVES Cortical auditory evoked potentials (CAEPs) to tones and speech sounds were obtained in infants to: (1) further knowledge of auditory development above the level of the brainstem during the first year of life; (2) establish CAEP input-output functions for tonal and speech stimuli as a function of stimulus level and (3) elaborate the data-base that establishes CAEP in infants tested while awake using clinically relevant stimuli, thus providing methodology that would have translation to pediatric audiological assessment. Hypotheses concerning CAEP development were that the latency and amplitude input-output functions would reflect immaturity in encoding stimulus level. In a second experiment, infants were tested with the same stimuli used to evoke the CAEPs. Thresholds for these stimuli were determined using observer-based psychophysical techniques. The hypothesis was that the behavioral thresholds would be correlated with CAEP input-output functions because of shared cortical response areas known to be active in sound detection. DESIGN 36 infants, between the ages of 4 and 12 months (mean=8 months, s.d.=1.8 months) and 9 young adults (mean age 21 years) with normal hearing were tested. First, CAEPs amplitude and latency input-output functions were obtained for 4 tone bursts and 7 speech tokens. The tone bursts stimuli were 50 ms tokens of pure tones at 0.5, 1.0, 2.0 and 4.0 kHz. The speech sound tokens, /a/, /i/, /o/, /u/, /m/, /s/, and /∫/, were created from natural speech samples and were also 50 ms in duration. CAEPs were obtained for tone burst and speech token stimuli at 10 dB level decrements in descending order from 70 dB SPL. All CAEP tests were completed while the infants were awake and engaged in quiet play. For the second experiment, observer-based psychophysical methods were used to establish perceptual threshold for the same speech sound and tone tokens. RESULTS Infant CAEP component latencies were prolonged by 100-150 ms in comparison to adults. CAEP latency-intensity input output functions were steeper in infants compared to adults. CAEP amplitude growth functions with respect to stimulus SPL are adult-like at this age, particularly for the earliest component, P1-N1. Infant perceptual thresholds were elevated with respect to those found in adults. Furthermore, perceptual thresholds were higher, on average, than levels at which CAEPs could be obtained. When CAEP amplitudes were plotted with respect to perceptual threshold (dB SL), the infant CAEP amplitude growth slopes were steeper than in adults. CONCLUSIONS Although CAEP latencies indicate immaturity in neural transmission at the level of the cortex, amplitude growth with respect to stimulus SPL is adult-like at this age, particularly for the earliest component, P1-N1. The latency and amplitude input-output functions may provide additional information as to how infants perceive stimulus level. The reasons for the discrepancy between electrophysiologic and perceptual threshold may be due to immaturity in perceptual temporal resolution abilities and the broad-band listening strategy employed by infants. The findings from the current study can be translated to the clinical setting. It is possible to use tonal or speech sound tokens to evoke CAEPs in an awake, passively alert infant, and thus determine whether these sounds activate the auditory cortex. This could be beneficial in the verification of hearing aid or cochlear implant benefit.
Collapse
Affiliation(s)
- Barbara Cone
- University of Arizona, Department of Speech, Language and Hearing Sciences, PO Box 210071, Tucson, AZ 85721, United States.
| | - Richard Whitaker
- Hearing Science of Rancho Cucamonga 6283 Grove Avenue Suite 104 Rancho Cucamonga, CA 91730 909-920-9906
| |
Collapse
|
18
|
Liu C, Eddins DA. Measurement of stop consonant identification using adaptive tracking procedures. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2012; 132:EL250-EL256. [PMID: 22979840 DOI: 10.1121/1.4747826] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/01/2023]
Abstract
The purpose of this study was to establish the potential utility of adaptive tracking procedures, relative to the method of constant stimuli, when examining consonant identification in cafeteria noise or four-talker babble. Thresholds for consonant identification were comparable for the two methods, while psychometric function slopes derived from the two methods were statistically different. To obtain accurate slope estimates from adaptive procedures, the target percent correct and change in variability with signal-to-noise ratio should be considered carefully. However, adaptive procedures are accurate and more efficient than the method of constant stimuli when the purpose is to estimate phoneme identification threshold.
Collapse
Affiliation(s)
- Chang Liu
- Department of Communication Sciences and Disorders, 1 University Station A1100, The University of Texas at Austin Austin, Texas 78712, USA.
| | | |
Collapse
|
19
|
Ihlefeld A, Shinn-Cunningham BG, Carlyon RP. Comodulation masking release in speech identification with real and simulated cochlear-implant hearing. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2012; 131:1315-1324. [PMID: 22352505 PMCID: PMC9014238 DOI: 10.1121/1.3676701] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/13/2011] [Revised: 12/21/2011] [Accepted: 12/22/2011] [Indexed: 05/31/2023]
Abstract
For normal-hearing (NH) listeners, masker energy outside the spectral region of a target signal can improve target detection and identification, a phenomenon referred to as comodulation masking release (CMR). This study examined whether, for cochlear implant (CI) listeners and for NH listeners presented with a "noise vocoded" CI simulation, speech identification in modulated noise is improved by a co-modulated flanking band. In Experiment 1, NH listeners identified noise-vocoded speech in a background of on-target noise with or without a flanking narrow band of noise outside the spectral region of the target. The on-target noise and flanker were either 16-Hz square-wave modulated with the same phase or were unmodulated; the speech was taken from a closed-set corpus. Performance was better in modulated than in unmodulated noise, and this difference was slightly greater when the comodulated flanker was present, consistent with a small CMR of about 1.7 dB for noise-vocoded speech. Experiment 2, which tested CI listeners using the same speech materials, found no advantage for modulated versus unmodulated maskers and no CMR. Thus although NH listeners can benefit from CMR even for speech signals with reduced spectro-temporal detail, no CMR was observed for CI users.
Collapse
Affiliation(s)
- Antje Ihlefeld
- MRC Cognition and Brain Sciences Unit, 15 Chaucer Road, Cambridge CB2 7EF, United Kingdom.
| | | | | |
Collapse
|
20
|
García-Pérez MA. A cautionary note on the use of the adaptive up-down method. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2011; 130:2098-107. [PMID: 21973364 DOI: 10.1121/1.3628334] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/12/2023]
Abstract
Up-down staircases with equal sizes for the steps up and down are widely used to estimate detection and discrimination thresholds in psychoacoustics, but the conventional average-of-reversals estimator does not converge on its presumed percent point in Yes-No tasks or in two-alternative forced-choice detection tasks. The particular percent point of convergence is partly determined by the relative size of the steps with respect to the spread (inverse of slope) of the underlying psychometric function. In particular, threshold is increasingly underestimated as the spread of the psychometric function decreases. This characteristic may have serious consequences when thresholds estimated via up-down staircases are compared across conditions in which the spread of the psychometric function varies, because then these thresholds do not represent comparable measures of performance. This paper documents the misbehavior of the average-of-reversals estimator under up-down rules and types of forced-choice task that are in common use in psychoacoustics but which have not been studied before in simulations. It is also shown that a relatively simple modification of the up-down design (namely, using steps up and down of different size and in a certain ratio depending only on the task and the up-down rule being used) stabilizes the performance of these staircases.
Collapse
Affiliation(s)
- Miguel A García-Pérez
- Departamento de Metodología, Facultad de Psicología, Universidad Complutense, Campus de Somosaguas, 28223 Madrid, Spain.
| |
Collapse
|
21
|
Wan Q, Kerr C, Pritchett D, Hämäläinen M, Moore C, Jones S. Dynamics of dynamics within a single data acquisition session: variation in neocortical alpha oscillations in human MEG. PLoS One 2011; 6:e24941. [PMID: 21966388 PMCID: PMC3178572 DOI: 10.1371/journal.pone.0024941] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2011] [Accepted: 08/24/2011] [Indexed: 11/18/2022] Open
Abstract
BACKGROUND Behavioral paradigms applied during human recordings in electro- and magneto- encephalography (EEG and MEG) typically require 1-2 hours of data collection. Over this time scale, the natural fluctuations in brain state or rapid learning effects could impact measured signals, but are seldom analyzed. METHODS AND FINDINGS We investigated within-session dynamics of neocortical alpha (7-14 Hz) rhythms and their allocation with cued-attention using MEG recorded from primary somatosensory neocortex (SI) in humans. We found that there were significant and systematic changes across a single ~1 hour recording session in several dimensions, including increased alpha power, increased differentiation in attention-induced alpha allocation, increased distinction in immediate time-locked post-cue evoked responses in SI to different visual cues, and enhanced power in the immediate cue-locked alpha band frequency response. Further, comparison of two commonly used baseline methods showed that conclusions on the evolution of alpha dynamics across a session were dependent on the normalization method used. CONCLUSIONS These findings are important not only as they relate to studies of oscillations in SI, they also provide a robust example of the type of dynamic changes in brain measures within a single session that are overlooked in most human brain imaging/recording studies.
Collapse
Affiliation(s)
- Qian Wan
- McGovern Institute for Brain Research, Massachusetts Institute of Technology (MIT), Cambridge, Massachusetts, United States of America
- Harvard Osher Research Center, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Catherine Kerr
- Harvard Osher Research Center, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Dominique Pritchett
- McGovern Institute for Brain Research, Massachusetts Institute of Technology (MIT), Cambridge, Massachusetts, United States of America
| | - Matti Hämäläinen
- Athinoula A. Martinos Center for Biomedical Imaging, Massachusetts General Hospital, Charlestown, Massachusetts, United States of America
| | - Christopher Moore
- McGovern Institute for Brain Research, Massachusetts Institute of Technology (MIT), Cambridge, Massachusetts, United States of America
| | - Stephanie Jones
- Athinoula A. Martinos Center for Biomedical Imaging, Massachusetts General Hospital, Charlestown, Massachusetts, United States of America
- * E-mail:
| |
Collapse
|
22
|
Dai H, Micheyl C. Psychometric functions for pure-tone frequency discrimination. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2011; 130:263-72. [PMID: 21786896 PMCID: PMC3155586 DOI: 10.1121/1.3598448] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/16/2023]
Abstract
The form of the psychometric function (PF) for auditory frequency discrimination is of theoretical interest and practical importance. In this study, PFs for pure-tone frequency discrimination were measured for several standard frequencies (200-8000 Hz) and levels [35-85 dB sound pressure level (SPL)] in normal-hearing listeners. The proportion-correct data were fitted using a cumulative-Gaussian function of the sensitivity index, d', computed as a power transformation of the frequency difference, Δf. The exponent of the power function corresponded to the slope of the PF on log(d')-log(Δf) coordinates. The influence of attentional lapses on PF-slope estimates was investigated. When attentional lapses were not taken into account, the estimated PF slopes on log(d')-log(Δf) coordinates were found to be significantly lower than 1, suggesting a nonlinear relationship between d' and Δf. However, when lapse rate was included as a free parameter in the fits, PF slopes were found not to differ significantly from 1, consistent with a linear relationship between d' and Δf. This was the case across the wide ranges of frequencies and levels tested in this study. Therefore, spectral and temporal models of frequency discrimination must account for a linear relationship between d' and Δf across a wide range of frequencies and levels.
Collapse
Affiliation(s)
- Huanping Dai
- Department of Speech, Language, and Hearing Sciences, University of Arizona, 1131 East 2nd Street, Tucson, Arizona, 85721, USA.
| | | |
Collapse
|
23
|
Valente DL, Patra H, Jesteadt W. Relative effects of increment and pedestal duration on the detection of intensity increments. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2011; 129:2095-103. [PMID: 21476665 PMCID: PMC3087391 DOI: 10.1121/1.3557043] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/19/2010] [Revised: 01/20/2011] [Accepted: 01/24/2011] [Indexed: 05/30/2023]
Abstract
The detection of a brief increment in the intensity of a longer duration pedestal is commonly used as a measure of intensity-resolution. Increment detection is known to improve with increasing duration of the increment and also with increasing duration of the pedestal, but the relative effects of these two parameters have not been explored in the same study. In several past studies of the effects of increment duration, pedestal duration was increased as increment duration increased. In the present study, increment and pedestal duration were independently manipulated. Increment-detection thresholds were determined for four subjects with normal-hearing using a 500- or 4000-Hz pedestal presented at 60 dB sound pressure level (SPL). Increment durations were 10, 20, 40, 80, 160, and 320 ms. Pedestal durations were 20, 40, 80, 160, and 320 ms. Each increment duration was combined with all pedestals of equal or greater duration. Multiple-regression analyses indicate that increment detection under these conditions is determined primarily by pedestal duration. Follow-up experiments ruled out effects of off-frequency listening or overshoot. The results suggest that effects of increment duration have been confounded by effects of pedestal duration in studies that co-varied increment and pedestal duration. Implications for models of temporal integration are discussed.
Collapse
Affiliation(s)
- Daniel L Valente
- Center for Hearing Research, Boys Town National Research Hospital, 555 North 30th Street, Omaha, Nebraska 68131, USA.
| | | | | |
Collapse
|
24
|
On the theoretical error bound for estimating psychometric functions. Atten Percept Psychophys 2011; 73:919-26. [DOI: 10.3758/s13414-010-0050-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
25
|
Rodríguez J, Neely ST, Jesteadt W, Tan H, Gorga MP. Comparison of distortion-product otoacoustic emission growth rates and slopes of forward-masked psychometric functions. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2011; 129:864-875. [PMID: 21361444 PMCID: PMC3070994 DOI: 10.1121/1.3523340] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/02/2010] [Revised: 11/09/2010] [Accepted: 11/09/2010] [Indexed: 05/30/2023]
Abstract
Slopes of forward-masked psychometric functions (FM PFs) were compared with distortion-product otoacoustic emission (DPOAE) input/output (I/O) parameters at 1 and 6 kHz to test the hypothesis that these measures provide similar estimates of cochlear compression. Implicit in this hypothesis is the assumption that both DPOAE I/O and FM PF slopes are functionally related to basilar-membrane (BM) response growth. FM PF-slope decreased with signal level, but this effect was reduced or reversed with increasing hearing loss; there was a trend of decreasing psychometric function (PF) slope with increasing frequency, consistent with greater compression at higher frequencies. DPOAE I/O functions at 6 kHz exhibited an increase in the breakpoint of a two-segment slope as a function of hearing loss with a concomitant decrease in the level of the distortion product (L(d)). Results of the comparison between FM PF and DPOAE I/O parameters revealed only a weak correlation, suggesting that one or both of these measures may provide unreliable information about BM compression.
Collapse
Affiliation(s)
- Joyce Rodríguez
- Starkey Hearing Research Center, 2150 Shattuck Avenue, Suite 408, Berkeley, California 94704-1345, USA.
| | | | | | | | | |
Collapse
|
26
|
Cued spatial attention drives functionally relevant modulation of the mu rhythm in primary somatosensory cortex. J Neurosci 2010; 30:13760-5. [PMID: 20943916 DOI: 10.1523/jneurosci.2969-10.2010] [Citation(s) in RCA: 210] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
Cued spatial attention modulates functionally relevant alpha rhythms in visual cortices in humans. Here, we present evidence for analogous phenomena in primary somatosensory neocortex (SI). Using magnetoencephalography, we measured changes in the SI mu rhythm containing mu-alpha (7-14 Hz) and mu-beta (15-29 Hz) components. We found that cued attention impacted mu-alpha in the somatopically localized hand representation in SI, showing decreased power after attention was cued to the hand and increased power after attention was cued to the foot, with significant differences observed 500-1100 ms after cue. Mu-beta showed differences in a time window 800-850 ms after cue. The visual cue also drove an early evoked response beginning ∼70 ms after cue with distinct peaks modulated with cued attention. Distinct components of the tactile stimulus-evoked response were also modulated with cued attention. Analysis of a second dataset showed that, on a trial-by-trial basis, tactile detection probabilities decreased linearly with prestimulus mu-alpha and mu-beta power. These results support the growing consensus that cue-induced alpha modulation is a functionally relevant sensory gating mechanism deployed by attention. Further, while cued attention had a weaker effect on the allocation of mu-beta, oscillations in this band also predicted tactile detection.
Collapse
|
27
|
|
28
|
Lutfi RA, Stoelinga CNJ. Sensory constraints on auditory identification of the material and geometric properties of struck bars. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2010; 127:350-360. [PMID: 20058982 PMCID: PMC2821150 DOI: 10.1121/1.3263606] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/28/2009] [Revised: 10/08/2009] [Accepted: 10/12/2009] [Indexed: 05/28/2023]
Abstract
A computational formula is derived for estimating the constraints limited auditory sensitivity imposes on auditory identification of the material and geometric properties of struck bars. The formula combines a model of the transverse motion of the bar with empirical psychometric functions to map out "null" regions in the bar's physical parameter space where changes in the frequency, amplitude, and decay of partials are likely below threshold for detection. Parameters of the physical space include bar density, Young's modulus, fluid and viscoelastic damping factors, bar length, and bar cross-sectional area (as related to bar shape and hollowness). The formula is used to estimate the possible effect of limited sensitivity in past studies on the auditory identification of bar attributes. The results suggest that sensitivity may, indeed, have played a role in some studies, and that apparent discrepancies in results may be understood based on whether the predominant source of damping was internal or external to the bar. The formula identifies conditions representing an expected bound on identification performance and thereby may be used to aid in the design of future studies for which the struck bar is the sound source of choice.
Collapse
Affiliation(s)
- Robert A Lutfi
- Department of Communicative Disorders, Auditory Behavioral Research Laboratory, University of Wisconsin, Madison, Wisconsin 53706, USA
| | | |
Collapse
|
29
|
Jones SR, Pritchett DL, Sikora MA, Stufflebeam SM, Hämäläinen M, Moore CI. Quantitative analysis and biophysically realistic neural modeling of the MEG mu rhythm: rhythmogenesis and modulation of sensory-evoked responses. J Neurophysiol 2009; 102:3554-72. [PMID: 19812290 DOI: 10.1152/jn.00535.2009] [Citation(s) in RCA: 146] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
Variations in cortical oscillations in the alpha (7-14 Hz) and beta (15-29 Hz) range have been correlated with attention, working memory, and stimulus detection. The mu rhythm recorded with magnetoencephalography (MEG) is a prominent oscillation generated by Rolandic cortex containing alpha and beta bands. Despite its prominence, the neural mechanisms regulating mu are unknown. We characterized the ongoing MEG mu rhythm from a localized source in the finger representation of primary somatosensory (SI) cortex. Subjects showed variation in the relative expression of mu-alpha or mu-beta, which were nonoverlapping for roughly 50% of their respective durations on single trials. To delineate the origins of this rhythm, a biophysically principled computational neural model of SI was developed, with distinct laminae, inhibitory and excitatory neurons, and feedforward (FF, representative of lemniscal thalamic drive) and feedback (FB, representative of higher-order cortical drive or input from nonlemniscal thalamic nuclei) inputs defined by the laminar location of their postsynaptic effects. The mu-alpha component was accurately modeled by rhythmic FF input at approximately 10-Hz. The mu-beta component was accurately modeled by the addition of approximately 10-Hz FB input that was nearly synchronous with the FF input. The relative dominance of these two frequencies depended on the delay between FF and FB drives, their relative input strengths, and stochastic changes in these variables. The model also reproduced key features of the impact of high prestimulus mu power on peaks in SI-evoked activity. For stimuli presented during high mu power, the model predicted enhancement in an initial evoked peak and decreased subsequent deflections. In agreement, the MEG-evoked responses showed an enhanced initial peak and a trend to smaller subsequent peaks. These data provide new information on the dynamics of the mu rhythm in humans and the model provides a novel mechanistic interpretation of this rhythm and its functional significance.
Collapse
Affiliation(s)
- Stephanie R Jones
- Massachusetts General Hospital, Athinoula A Martinos Center for Biomedical Imaging, Charlestown, MA 02129, USA.
| | | | | | | | | | | |
Collapse
|
30
|
Kumpik D, Ting J, Campbell RAA, Schnupp JWH, King AJ. Specificity of binaural perceptual learning for amplitude modulated tones: a comparison of two training methods. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2009; 125:2221-32. [PMID: 19354398 PMCID: PMC3428918 DOI: 10.1121/1.3082102] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/26/2023]
Abstract
The specificity of auditory perceptual learning has been taken as an indicator of the likely locus within the brain at which underlying neuronal changes occur. This study examined interaural level difference (ILD) discrimination learning with sinusoidally amplitude modulated (SAM) tones and whether training-induced threshold improvements generalize from one side of auditory space to the other and to an untrained carrier frequency. A novel, dual-staircase adaptive method was adopted that was designed to prevent participants from identifying the nature of the adaptive track. ILD thresholds obtained with this method were compared with a constant-stimulus technique using otherwise identical stimuli. Adaptive thresholds derived from psychometric functions were found to be biased compared to those obtained from reversals. Although adaptive and constant-stimulus procedures appeared to yield different temporal patterns of learning, no global differences were found between them in terms of training outcomes. These data show that ILD discrimination learning with SAM tones does generalize to an untrained carrier frequency but does not generalize across the midline. This implies that the neural substrate for binaural plasticity is found at a relatively high level of the auditory pathway where information is combined across frequency and where each side of auditory space is represented separately.
Collapse
Affiliation(s)
- Daniel Kumpik
- Department of Physiology, Anatomy and Genetics, University of Oxford, Sherrington Building, Oxford, United Kingdom.
| | | | | | | | | |
Collapse
|
31
|
Bonino AY, Leibold LJ. The effect of signal-temporal uncertainty on detection in bursts of noise or a random-frequency complex. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2008; 124:EL321-EL327. [PMID: 19045685 PMCID: PMC2677353 DOI: 10.1121/1.2993745] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/16/2008] [Revised: 08/29/2008] [Accepted: 09/04/2008] [Indexed: 05/27/2023]
Abstract
This study examined the effect of signal-temporal uncertainty on detection of a 120-ms, 1-kHz tone in the presence of a continuous sequence of 120-ms bursts of either a broadband noise or a random-frequency, two-tone complex. Using the method of constant stimuli, signal-temporal uncertainty was defined as the difference in threshold across temporally uncertain and temporally defined listening conditions. Results indicted an average effect of signal-temporal uncertainty of 2 dB for the noise masker compared to 9 dB for the random-frequency, two-tone masker. These results suggest that signal-temporal uncertainty may be more detrimental for conditions in which informational masking dominates performance.
Collapse
|
32
|
Schairer KS, Messersmith J, Jesteadt W. Use of psychometric-function slopes for forward-masked tones to investigate cochlear nonlinearity. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2008; 124:2196-215. [PMID: 19062859 PMCID: PMC2600619 DOI: 10.1121/1.2968686] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/13/2007] [Revised: 07/08/2008] [Accepted: 07/11/2008] [Indexed: 05/24/2023]
Abstract
Schairer et al. [(2003). "Effects of peripheral nonlinearity on psychometric functions for forward-masked tones," J. Acoust. Soc. Am. 133, 1560-1573] demonstrated that cochlear nonlinearity is reflected in psychometric-function (PF) slopes for 4 kHz forward-masked tones. The goals of the current study were to use PF slopes to compare the degree of compression between signal frequencies of 0.25 and 4 kHz in listeners with normal hearing (LNH), and between LNH and listeners with cochlear hearing loss (LHL). Forward-masked thresholds were estimated in LNH and LHL using on- and off-frequency maskers and 0.25 and 4 kHz signals in three experiments. PFs were reconstructed from adaptive-procedure data for each subject in each condition. Trends in PF slopes across conditions suggest comparable compression at 0.25 and 4 kHz, and potentially a wider bandwidth of compression in relative frequency at 0.25 kHz. This is consistent with other recent behavioral studies that revise earlier estimates of less compression at lower frequencies. The preliminary results in LHL demonstrate that PF slopes are abnormally steep at frequencies with HL, but are similar to those for LNH at frequencies with NH. Overall, the results are consistent with the notion that PF slopes reflect degree of cochlear nonlinearity and can be used as an additional measure of compression across frequency.
Collapse
Affiliation(s)
- Kim S Schairer
- Center for Hearing Research, Boys Town National Research Hospital, 555 North 30th Street, Omaha, Nebraska 68131, USA.
| | | | | |
Collapse
|
33
|
Jones SR, Pritchett DL, Stufflebeam SM, Hämäläinen M, Moore CI. Neural correlates of tactile detection: a combined magnetoencephalography and biophysically based computational modeling study. J Neurosci 2007; 27:10751-64. [PMID: 17913909 PMCID: PMC2867095 DOI: 10.1523/jneurosci.0482-07.2007] [Citation(s) in RCA: 111] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2007] [Revised: 08/16/2007] [Accepted: 08/19/2007] [Indexed: 11/21/2022] Open
Abstract
Previous reports conflict as to the role of primary somatosensory neocortex (SI) in tactile detection. We addressed this question in normal human subjects using whole-head magnetoencephalography (MEG) recording. We found that the evoked signal (0-175 ms) showed a prominent equivalent current dipole that localized to the anterior bank of the postcentral gyrus, area 3b of SI. The magnitude and timing of peaks in the SI waveform were stimulus amplitude dependent and predicted perception beginning at approximately 70 ms after stimulus. To make a direct and principled connection between the SI waveform and underlying neural dynamics, we developed a biophysically realistic computational SI model that contained excitatory and inhibitory neurons in supragranular and infragranular layers. The SI evoked response was successfully reproduced from the intracellular currents in pyramidal neurons driven by a sequence of lamina-specific excitatory input, consisting of output from the granular layer (approximately 25 ms), exogenous input to the supragranular layers (approximately 70 ms), and a second wave of granular output (approximately 135 ms). The model also predicted that SI correlates of perception reflect stronger and shorter-latency supragranular and late granular drive during perceived trials. These findings strongly support the view that signatures of tactile detection are present in human SI and are mediated by local neural dynamics induced by lamina-specific synaptic drive. Furthermore, our model provides a biophysically realistic solution to the MEG signal and can predict the electrophysiological correlates of human perception.
Collapse
Affiliation(s)
- Stephanie R Jones
- Athinoula A. Martinos Center for Biomedical Imaging, Massachusetts General Hospital, Charlestown, Massachusetts 02129, USA.
| | | | | | | | | |
Collapse
|
34
|
Nelson PC, Carney LH. Cues for masked amplitude-modulation detection. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2006; 120:978-90. [PMID: 16938985 PMCID: PMC2572864 DOI: 10.1121/1.2213573] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/11/2023]
Abstract
The ability of psychoacoustic models to predict listeners' performance depends on two key stages: preprocessing and the generation of a decision variable. The goal of the current study was to determine the perceptually relevant decision variables in masked amplitude-modulation detection tasks in which the modulation depth of the masker was systematically varied. Potential cues were made unreliable by roving the overall modulation depth from trial to trial or were reduced in salience by equalizing the envelope energy of the standard and target after the signal was added. Listeners' performance was significantly degraded in both paradigms compared to the baseline (fixed-level modulation masker) condition, which was similar to those used in previous studies of masking in the envelope-frequency domain. Although this observation was broadly consistent with a simple long-term envelope power-spectrum model, there were several aspects of the data that were not. For example, the steep rate of change in threshold with masker depth and the fact that an optimal amount of envelope noise could enhance performance were not predicted by decision variables calculated directly from the stimulus envelope. A physiologically based processing model suggested a realistic nonlinear mechanism that could give rise to these second-order features of the data.
Collapse
Affiliation(s)
- Paul C Nelson
- Department of Biomedical and Chemical Engineering, Institute for Sensory Research, Syracuse University, Syracuse, New York 13244, USA
| | | |
Collapse
|
35
|
Abstract
Tone detection thresholds for a 10-kHz tone in NMRI mice were determined in psychoacoustic experiments using both a constant-stimuli procedure and a two-down/one-up adaptive-tracking procedure in the same subjects and applying identical threshold criteria (70.7% response probability). Constant-stimuli thresholds were on average 24 dB lower than adaptive-tracking thresholds, and there was a trend indicating that constant-stimuli thresholds were less variable than adaptive-tracking thresholds. Furthermore, in the constant-stimuli procedure the number of trials constituting the psychometric function could be reduced from 100 to 50 trials without a large loss of accuracy of threshold determination. In the constant-stimuli procedure, the threshold value was affected by the threshold criteria. The lowest and least variable constant-stimuli thresholds were obtained by applying signal detection theory and a criterion of d' = 1. Thus, the constant-stimuli procedure in combination with signal detection theory appears to be better suited than the adaptive-tracking procedure to determine auditory sensory thresholds.
Collapse
Affiliation(s)
- Karin B Klink
- Carl von Ossietzky Universität Oldenburg, Oldenburg, Germany
| | | | | |
Collapse
|
36
|
Jesteadt W. The variance of d' estimates obtained in yes-no and two-interval forced choice procedures. ACTA ACUST UNITED AC 2005; 67:72-80. [PMID: 15912873 DOI: 10.3758/bf03195013] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
In studies of detection and discrimination, data are often obtained in the form of a 2 x 2 matrix and then converted to an estimate of d' based on the assumptions that the underlying decision distributions are Gaussian and equal in variance. The statistical properties of the estimate of d', d' are well understood for data obtained using the yes-no procedure, but less effort has been devoted to the more commonly used two-interval forced choice (2IFC) procedure. The variance associated with d' is a function of true d' in both procedures, but for small values of true d' the variance of d' obtained using the 2IFC procedure is predicted to be less than the variance of d' obtained using yes-no; for large values of true d', the variance of d' obtained using the 2IFC procedure is predicted to be greater than the variance of d' from yes-no. These results follow from standard assumptions about the relationship between the two procedures. The present paper reviews the statistical properties of d' obtained using the two standard procedures and compares estimates of the variance of d' as a function of true d' with the variance observed in values of d' obtained with a 2IFC procedure.
Collapse
Affiliation(s)
- Walt Jesteadt
- Boys Town National Research Hospital, Omaha, Nebraska 68131, USA
| |
Collapse
|
37
|
Richards VM, Neff DL. Cuing effects for informational masking. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2004; 115:289-300. [PMID: 14759022 DOI: 10.1121/1.1631942] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
The detection of a tone added to a random-frequency, multitone masker can be very poor even when the maskers have little energy in the frequency region of the signal. This paper examines the effects of adding a pretrial cue to reduce uncertainty for the masker or the signal. The first two experiments examined the effect of cuing a fixed-frequency signal as the number of masker components and presentation methods were manipulated. Cue effectiveness varied across observers, but could reduce thresholds by as much as 20 dB. Procedural comparisons indicated observers benefited more from having two masker samples to compare, with or without a signal cue, than having a single interval with one masker sample and a signal cue. The third experiment used random-frequency signals and compared no-cue, signal-cue, and masker-cue conditions, and also systematically varied the time interval between cue offset and trial onset. Thresholds with a cued random-frequency signal remained higher than for a cued fixed-frequency signal. For time intervals between the cue and trial of 50 ms or longer, thresholds were approximately the same with a signal or a masker cue and lower than when there was no cue. Without a cue or with a masker cue, analyses of possible decision strategies suggested observers attended to the potential signal frequencies, particularly the highest signal frequency. With a signal cue, observers appeared to attend to the frequency of the subsequent signal.
Collapse
Affiliation(s)
- Virginia M Richards
- Department of Psychology, 3815 Walnut Street, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA.
| | | |
Collapse
|
38
|
Jesteadt W, Nizami L, Schairer KS. A measure of internal noise based on sample discrimination. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2003; 114:2147-57. [PMID: 14587612 DOI: 10.1121/1.1610456] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
Internal noise is often inferred from the difference between observed performance and optimum performance in detection and discrimination tasks. It can be measured directly in some cases by observing the extent to which a change in external variability impacts performance. In the studies reported here, external variability was added to an intensity discrimination task by adding a Gaussian random variable with zero mean to the overall level presented in each interval of a two-interval forced-choice task. The standard deviation of the random variable was set to half the mean difference between the levels in the two intervals, resulting in d'(ideal) = 2. As the mean difference and the corresponding standard deviation of the random variable decreased in size, performance was increasingly limited by internal noise, permitting a reliable estimate of internal noise to be obtained. This can be viewed as a sample discrimination task, with one component per sample. In the first study, performance was measured using 2-kHz tones presented at an average level of 70 dB SPL, with mean differences between distributions ranging from 0.1 to 2.2 dB in steps of 0.3 dB. The distributions were either Gaussian in level or in power. Conditions with no external variability were used to obtain a psychometric function. In the second study, performance was measured using 2-kHz tones presented at average levels of 50 and 90 dB SPL, with mean differences ranging from 0.4 to 2.2 dB in steps of 0.6 dB. In both studies, the measure of internal noise was highly reliable and in good agreement with the intensity difference limen (DL) estimated from the psychometric function. Analyses suggest that this measure could be used to estimate the mean difference between the decision distributions as well as the amount of internal noise in cases where the mean difference between the distributions is unknown.
Collapse
Affiliation(s)
- Walt Jesteadt
- Center for Hearing Research, Boys Town National Research Hospital, 555 North 30th Street, Omaha, Nebraska 68131, USA.
| | | | | |
Collapse
|
39
|
Schairer KS, Nizami L, Reimer JF, Jesteadt W. Effects of peripheral nonlinearity on psychometric functions for forward-masked tones. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2003; 113:1560-1573. [PMID: 12656391 DOI: 10.1121/1.1543933] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
Psychometric functions (PFs) for forward-masked tones were obtained for conditions in which signal level was varied to estimate threshold at several masker levels (variable-signal condition), and in which masker level was varied to estimate threshold at several signal levels (variable-masker condition). The changes in PF slope across combinations of masker frequency, masker level, and signal delay were explored in three experiments. In experiment 1, a 2-kHz, 10-ms tone was masked by a 50, 70 or 90 dB SPL, 20-ms on-frequency forward masker, with signal delays of 2, 20, or 40 ms, in a variable-signal condition. PF slopes decreased in conditions where signal threshold was high. In experiments 2 and 3, the signal was a 4-kHz, 10-ms tone, and the masker was either a 4- or 2.4-kHz, 200-ms tone. In experiment 2, on-frequency maskers were presented at 30 to 90 dB SPL in 10-dB steps and off-frequency maskers were presented at 60 to 90 dB SPL in 10-dB steps, with signal delays of 0, 10, or 30 ms, in a variable-signal condition. PF slopes decreased as signal level increased, and this trend was similar for on- and off-frequency maskers. In experiment 3, variable-masker conditions with on- and off-frequency maskers and 0-ms signal delay were presented. In general, the results were consistent with the hypothesis that peripheral nonlinearity is reflected in the PF slopes. The data also indicate that masker level plays a role independent of signal level, an effect that could be accounted for by assuming greater internal noise at higher stimulus levels.
Collapse
Affiliation(s)
- Kim S Schairer
- Center for Hearing Research, Boys Town National Research Hospital, 555 North 30th Street, Omaha, Nebraska 68131, USA.
| | | | | | | |
Collapse
|
40
|
Richards VM, Tang Z, Kidd GD. Informational masking with small set sizes. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2002; 111:1359-1366. [PMID: 11931313 DOI: 10.1121/1.1445790] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Abstract
Informational masking refers to interference in the detectability of a sound, or discrimination of some property of a sound, beyond that which can be attributed to interactions at the auditory periphery. In the current experiments the signal to be detected was a tone added to a 6-tone masker, and informational masking was introduced by randomly choosing the frequencies of the tones that comprise the masker. The primary question was whether small numbers of maskers could replace randomly drawn maskers without sacrificing the underlying detection schemes adopted by observers. Similar to the method used by Wright and Saberi [J. Acoust. Soc. Am. 105, 1765-1775 (1999)], detection thresholds were measured for different masker set sizes, where set size refers to the number of 6-tone maskers from which any one masker was drawn. Set sizes of 3, 6, 12, and 24 were tested as well as conditions in which the maskers were chosen at random. In addition, observers' memory for maskers was coarsely evaluated. Large differences in thresholds were found across observers and across different masker sets. Even for set sizes of 24, the memory test suggests some recognition of maskers for some observers. Post hoc analysis of the data included an evaluation of the relative contribution of different frequencies using a single linear model. As a base for comparison, a linear model fitted to each condition was also evaluated. Although the data were fitted better using many rather than one linear model, the reduction in quality of fit was modest. This result suggests substantial consistency in decision strategies regardless of masker set size.
Collapse
Affiliation(s)
- Virginia M Richards
- Department of Psychology, University of Pennsylvania, Philadelphia 19104, USA
| | | | | |
Collapse
|
41
|
Grimault N, Bacon SP, Micheyl C. Auditory stream segregation on the basis of amplitude-modulation rate. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2002; 111:1340-1348. [PMID: 11931311 DOI: 10.1121/1.1452740] [Citation(s) in RCA: 86] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Abstract
In this study, auditory stream segregation based on differences in the rate of envelope fluctuations--in the absence of spectral and temporal fine structure cues--was tested. The temporal sequences to segregate were composed of fully amplitude-modulated (AM) bursts of broadband noises A and B. All sequences were built by the reiteration of a ABA triplet where A modulation rate was fixed at 100 Hz and B modulation rate was variable. The first experiment was devoted to measuring the threshold difference in AM rate leading subjects to perceive the sequence as two streams as opposed to just one. The results of this first experiment revealed that subjects generally perceived the sequences as a single perceptual stream when the difference in AM rate between the A and B noises was smaller than 0.75 oct, and as two streams when the difference was larger than about 1.00 oct. These streaming thresholds were found to be substantially larger than, and not related to, the subjects' modulation-rate discrimination thresholds. The results of a second experiment demonstrated that AM-rate-based streaming was adversely affected by decreases in AM depth, but that segregation remained possible as long as the AM of either the A or B noises was above the subject's AM-detection threshold. The results of a third experiment indicated that AM-rate-based streaming effects were still observed when the modulations applied to the A and B noises were set individually, either at a constant level in dB above AM-detection threshold, or at levels at which they were of the same perceived strength. This finding suggests that AM-rate-based streaming is not necessarily mediated by perceived differences in AM depth. Altogether, the results of this study indicate that sequential sounds can be segregated on the sole basis of differences in the rate of their temporal fluctuations in the absence of other temporal or spectral cues.
Collapse
Affiliation(s)
- Nicolas Grimault
- Department of Speech and Hearing Science, Arizona State University, Tempe 85287-1908, USA.
| | | | | |
Collapse
|
42
|
Early SJ, Mason CR, Zheng L, Evilsizer M, Idrobo F, Harrison JM, Carney LH. Studies of binaural detection in the rabbit (Oryctolagus cuniculus) with Pavlovian conditioning. Behav Neurosci 2001; 115:650-60. [PMID: 11439454 DOI: 10.1037/0735-7044.115.3.650] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
A Pavlovian conditioned eyeblink response in rabbits (Oryctolagus cuniculus) was used to study psychoacoustical phenomena previously demonstrated in human listeners and other animals. This article contains the results of a tone-in-noise detection study to examine 2 psychoacoustical phenomena in rabbit and in human listeners: (a) the binaural masking level difference (BMLD) and (b) differential performance across reproducible noise masker waveforms. The rabbits demonstrated a BMLD comparable in size to other species. Significant differences in performance across reproducible noise masker waveforms were seen in the rabbits. This performance was compared with the performance of human listeners using the same set of waveforms.
Collapse
Affiliation(s)
- S J Early
- Department of Biomedical Engineering and Hearing Research Center, Boston University, Massachusetts 02215, USA
| | | | | | | | | | | | | |
Collapse
|
43
|
Kortekaas RW, Stelmachowicz PG. Bandwidth effects on children's perception of the inflectional morpheme /s/: acoustical measurements, auditory detection, and clarity rating. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2000; 43:645-660. [PMID: 10877435 DOI: 10.1044/jslhr.4303.645] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Abstract
The goal of this study was to examine developmental effects in auditory perception of word-final /s/ in inflectional morpheme contexts as a function of high-frequency (HF) bandwidth. Such developmental effects may support the use of hearing aids with extended bandwidths in young children with impaired hearing. The first part of the study consisted of acoustical measurements on word-initial /s/ and inflectional morpheme /s/ in sentences recorded from a male speaker. For this speaker, recordings of inflectional morpheme /s/ on average were approximately 50 ms shorter and about 5 dB lower in level than word-initial /s/ sounds. They also had a lower spectral center of gravity, but not a higher coefficient of kurtosis. The second part consisted of measurements of psychometric functions relating detection of the inflectional morpheme /s/ sounds to HF bandwidth in normally hearing 5-, 7-, and 10-year-old children and adults. In speech-shaped noise, significant main effects of age were found for detection as a function of bandwidth for both the 30- and 10-dB signal-to-noise ratio (SNR) condition, although there was no interaction between age and SNR condition. The third part consisted of subjective clarity rating as a function of HF bandwidth for the same speech stimuli used in the second part. No differences were found between age groups in the shape of the clarity rating functions, but differences were found in the rating variance. No systematic effects of the spectral composition of inflectional morpheme /s/ sounds on either detection or clarity rating were found. The higher detection thresholds and larger clarity rating variances for the youngest participants support the use of extended high-frequency bandwidths for young children with impaired hearing. The extent to which the potential benefit of extended bandwidths is affected by hearing impairment in this population, however, deserves further investigation.
Collapse
Affiliation(s)
- R W Kortekaas
- Boys Town National Research Hospital, Omaha, NE, USA.
| | | |
Collapse
|
44
|
Lam CF, Dubno JR, Mills JH. Determination of optimal data placement for psychometric function estimation: a computer simulation. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 1999; 106:1969-1976. [PMID: 10530021 DOI: 10.1121/1.427944] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Abstract
Psychometric functions are used to relate the responses of a subject to physical stimuli in a variety of psychophysical tasks. However, it is time consuming to obtain data to determine a psychometric function if many stimulus levels and many trials are required. A computer simulation was conducted to determine the minimum number of data points needed for such a determination. The computer simulation also determined the optimal placements of the stimuli and the number of trials per datum point for psychometric function determinations. Results indicate that a 2-point sampling method with 30-50 trials per point at optimal locations can produce a psychometric function with accurate spread and threshold estimates in a yes-no paradigm. However, the 4-point sampling method yields statistically smaller variances of the estimates. For the 2-alternative forced-choice paradigm, at least 120 trials per point are needed for the 2-point sampling method's estimated parameters to differ from the known parameter values by less than 5%. The simulation results suggest that 3-alternative or 4-alternative forced-choice is preferable to 2-alternative. Furthermore, when a criterion-free paradigm is not required, the yes-no paradigm is a better procedure than m-alternative forced-choice for obtaining the corresponding psychometric function because of smaller standard deviation of the estimates and smaller number of trials/point required.
Collapse
Affiliation(s)
- C F Lam
- Department of Biometry and Epidemiology, Medical University of South Carolina, Charleston 29425, USA.
| | | | | |
Collapse
|
45
|
Wright BA, Saberi K. Strategies used to detect auditory signals in small sets of random maskers. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 1999; 105:1765-1775. [PMID: 10089600 DOI: 10.1121/1.426714] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Abstract
Detection performance for a masked auditory signal of fixed frequency can be substantially degraded if there is uncertainty about the frequency content of the masker. A quasimolecular psychophysical approach was used to examine response strategies in masker-uncertainty conditions, and to investigate the influence of uncertainty when the number of different masker samples was limited to ten or fewer. The task of the four listeners was to detect a 1000-Hz signal that was presented simultaneously with one of ten ten-tone masker samples. The masker sample was either fixed throughout a block of two-interval forced-choice trials or was randomized across or within trials. The primary results showed that: (1) When the signal level was low and the masker sample differed between the two intervals of a trial, most listeners based their responses more on the presence of specific masker samples than on the signal. (2) The detrimental effect of masker uncertainty was clearly evident when only four maskers were randomly presented, and grew as the size of the masker set was increased from two to ten. (3) The slopes of psychometric functions measured with the same masker samples differed among the fixed and two random-masker conditions. (4) There were large differences in the influence of masker uncertainty across masker samples and listeners. These data demonstrate the great susceptibility of human listeners to the influence of masker uncertainty and the ability of quasimolecular investigations to reveal important aspects of behavior in uncertainty condition.
Collapse
Affiliation(s)
- B A Wright
- Audiology and Hearing Sciences Program, Northwestern University, Evanston, Illinois 60208-3550, USA.
| | | |
Collapse
|
46
|
Wright BA, Dai H. Detection of sinusoidal amplitude modulation at unexpected rates. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 1998; 104:2991-2996. [PMID: 9821343 DOI: 10.1121/1.423881] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/22/2023]
Abstract
The detectability of sinusoidal amplitude modulation at unexpected modulation rates was assessed using a probe-signal method. With this method, three listeners were led to expect a target modulation rate (4, 32, or 256 Hz) by presenting the signal most often at that rate, and sensitivity to modulation at six other unexpected rates between 4 and 256 Hz was measured via occasionally presented probe modulation rates. The modulation phase was random on each two-interval forced-choice trial and the overall level of the 500-ms broadband carrier was randomly varied between 55 and 75 dB SPL across intervals. The modulation depth at each rate was set so that the modulation was detected on about 90% of the trials when only that rate was presented. Performance at the unexpected rates depended upon the target rate. For the 4-Hz target, modulation at all rates was detected on about 80% of the trials. For the 32- and 256-Hz targets, unexpected modulation rates of 16 Hz and above were detected on 80%-90% of the trials, but modulation rates below 16 Hz were detected nearly at chance. The influence of expectation of modulation rate on the detection of sinusoidal amplitude modulation is not readily predicted by current models of modulation detection.
Collapse
Affiliation(s)
- B A Wright
- Audiology and Hearing Sciences Program, Northwestern University, Evanston, Illinois 60208-3550, USA.
| | | |
Collapse
|
47
|
Lee J, Bacon SP. Amplitude modulation depth discrimination of a sinusoidal carrier: effect of stimulus duration. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 1997; 101:3688-93. [PMID: 9193056 PMCID: PMC3282190 DOI: 10.1121/1.418329] [Citation(s) in RCA: 19] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/18/2023]
Abstract
Discrimination of the change in depth of sinusoidal amplitude modulation (AM) was investigated as a function of stimulus duration. The carrier frequency was 4000 Hz, the standard modulation depth (m) was either 0.1, 0.18, or 0.3, and the modulation rate was either 10, 20, 40, or 80 Hz. For all standard depths and modulation rates, threshold (delta m) decreased by more than a factor o two as stimulus duration doubled from the shortest duration used up to a certain duration (critical duration), beyond which the threshold decreased only slightly or remained constant. The critical duration corresponded to about four cycles of modulation. Psychometric functions were measured for different stimulus durations to examine the extent to which a multiple-looks model could explain the present data. This model provided a reasonable prediction of the change in AM depth discrimination threshold as a function of stimulus duration.
Collapse
Affiliation(s)
- J Lee
- Department of Speech and Hearing Science, Arizona State University, Tempe 85287-1908, USA.
| | | |
Collapse
|