1
|
Calcus A. Development of auditory scene analysis: a mini-review. Front Hum Neurosci 2024; 18:1352247. [PMID: 38532788 PMCID: PMC10963424 DOI: 10.3389/fnhum.2024.1352247] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2023] [Accepted: 02/22/2024] [Indexed: 03/28/2024] Open
Abstract
Most auditory environments contain multiple sound waves that are mixed before reaching the ears. In such situations, listeners must disentangle individual sounds from the mixture, performing the auditory scene analysis. Analyzing complex auditory scenes relies on listeners ability to segregate acoustic events into different streams, and to selectively attend to the stream of interest. Both segregation and selective attention are known to be challenging for adults with normal hearing, and seem to be even more difficult for children. Here, we review the recent literature on the development of auditory scene analysis, presenting behavioral and neurophysiological results. In short, cognitive and neural mechanisms supporting stream segregation are functional from birth but keep developing until adolescence. Similarly, from 6 months of age, infants can orient their attention toward a target in the presence of distractors. However, selective auditory attention in the presence of interfering streams only reaches maturity in late childhood at the earliest. Methodological limitations are discussed, and a new paradigm is proposed to clarify the relationship between auditory scene analysis and speech perception in noise throughout development.
Collapse
Affiliation(s)
- Axelle Calcus
- Center for Research in Cognitive Neuroscience (CRCN), ULB Neuroscience Institute (UNI), Université Libre de Bruxelles, Brussels, Belgium
| |
Collapse
|
2
|
Saberi K, Hickok G. Confirming an antiphasic bicyclic pattern of forward entrainment in signal detection: A reanalysis of Sun et al. (2021). Eur J Neurosci 2022; 56:5274-5286. [PMID: 36057434 PMCID: PMC9826078 DOI: 10.1111/ejn.15816] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2022] [Revised: 08/27/2022] [Accepted: 08/31/2022] [Indexed: 01/11/2023]
Abstract
Forward entrainment refers to that part of the entrainment process that persists after termination of an entraining stimulus. Hickok et al. (2015) reported forward entrainment in signal detection that lasted for two post-stimulus cycles. In a recent paper, Sun et al. (2021) reported new data which suggested an absence of entrainment effects (Eur. J. Neurosci, 1-18, doi.org/10.1111/ejn.15367). Here we show that when Sun et al.'s data are analysed using unbiased detection-theoretic measures, a clear antiphasic bicyclic pattern of entrainment is observed. We further show that the measure of entrainment strength used by Sun et al., the normalized Fourier transform of performance curves, is not only erroneously calculated but is also unreliable in estimating entrainment strength due to signal-processing artifacts.
Collapse
Affiliation(s)
- Kourosh Saberi
- Department of Cognitive SciencesUniversity of California, IrvineIrvineCaliforniaUSA
| | - Gregory Hickok
- Department of Cognitive SciencesUniversity of California, IrvineIrvineCaliforniaUSA,Department of Language ScienceUniversity of California, IrvineIrvineCaliforniaUSA
| |
Collapse
|
3
|
Oster MM, Werner LA. Infants use onset asynchrony cues in auditory scene analysis. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2018; 144:2052. [PMID: 30404496 PMCID: PMC6181648 DOI: 10.1121/1.5058397] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/23/2018] [Revised: 09/01/2018] [Accepted: 09/17/2018] [Indexed: 06/08/2023]
Abstract
This experiment investigated the effect of onset asynchrony on the segregation of concurrent vowels in infants and adults. Two vowels, randomly chosen from seven American-English vowels, were superimposed. Each vowel pair contained one vowel by a male and one by a female talker. A train of such vowel pairs was presented to listeners, who were trained to respond to the male target vowel /i:/ or /u:/. The ability to identify the target vowel was compared among three conditions: synchronous onset, 100-, and 200-ms onset asynchrony. Experiment 1 measured performance, in d', in 7-month-old infants and adults. Infants and adults performed better with asynchronous than synchronous vowel onset, regardless of asynchrony duration. Experiment 2 compared the proportion of 3-month-old infants achieving an 80% correct criterion with and without onset asynchrony. Significantly more infants reached criterion with asynchronous than with synchronous vowel onset. Asynchrony duration did not influence performance. These experiments show that infants, as young as 3 months old, benefit from onset asynchrony.
Collapse
Affiliation(s)
- Monika-Maria Oster
- Department of Speech and Hearing Sciences, University of Washington, 1417 Northeast 42nd Street, Seattle, Washington 98105, USA
| | - Lynne A Werner
- Department of Speech and Hearing Sciences, University of Washington, 1417 Northeast 42nd Street, Seattle, Washington 98105, USA
| |
Collapse
|
4
|
Johnson KA, Bryan M, Polonowita K, Decroupet D, Coull JT. Isochronous Sequential Presentation Helps Children Orient Their Attention in Time. Front Psychol 2016; 7:1417. [PMID: 27713713 PMCID: PMC5032676 DOI: 10.3389/fpsyg.2016.01417] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2016] [Accepted: 09/05/2016] [Indexed: 11/13/2022] Open
Abstract
Knowing when an event is likely to occur allows attentional resources to be oriented toward that moment in time, enhancing processing of the event. We previously found that children (mean age 11 years) are unable to use endogenous temporal cues to orient attention in time, despite being able to use endogenous spatial cues (arrows) to orient attention in space. Arrow cues, however, may have proved beneficial by engaging exogenous (automatic), as well as endogenous (voluntary), orienting mechanisms. We therefore conducted two studies in which the exogenous properties of visual temporal cues were increased, to examine whether this helped children orient their attention in time. In the first study, the location of an imperative target was predicted by the direction of a left or right spatial arrow cue while its onset was predicted by the relative duration of a short or long temporal cue. To minimize the influence of rhythmic entrainment in the temporal condition, the foreperiod (500 ms/1100 ms) was deliberately chosen so as not to precisely match the duration of the temporal cue (100 ms/400 ms). Targets appeared either at cued locations/onset times (valid trials) or at unexpected locations/onset times (invalid trials). Adults’ response times were significantly slower for invalid versus valid trials, in both spatial and temporal domains. Despite being slowed by invalid spatial cues, children (mean age 10.7 years) were unperturbed by invalid temporal cues, suggesting that these duration-based temporal cues did not help them orient attention in time. In the second study, we enhanced the exogenous properties of temporal cues further, by presenting multiple temporal cues in an isochronous (rhythmic) sequence. Again, to minimize automatic entrainment, target onset did not match the isochronous interval. Children (mean age 11.4 years), as well as adults, were now significantly slowed by invalid cues in both the temporal and spatial dimension. The sequential, as opposed to single, presentation of temporal cues therefore helped children to orient their attention in time. We suggest that the exogenous properties of sequential presentation provide a temporal scaffold that supports the additional attentional and mnemonic requirements of temporal, as compared to spatial, processing.
Collapse
Affiliation(s)
- Katherine A Johnson
- School of Psychological Sciences, University of Melbourne, Parkville VIC, Australia
| | - Marita Bryan
- School of Psychological Sciences, University of Melbourne, Parkville VIC, Australia
| | - Kira Polonowita
- School of Psychological Sciences, University of Melbourne, Parkville VIC, Australia
| | - Delia Decroupet
- School of Psychological Sciences, University of Melbourne, Parkville VIC, Australia
| | - Jennifer T Coull
- Laboratoire des Neurosciences Cognitives, Aix-Marseille Université, CNRS Marseille, France
| |
Collapse
|
5
|
Mento G, Tarantino V. Developmental Trajectories of Internally and Externally Driven Temporal Prediction. PLoS One 2015; 10:e0135098. [PMID: 26262878 PMCID: PMC4532408 DOI: 10.1371/journal.pone.0135098] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2015] [Accepted: 07/16/2015] [Indexed: 11/30/2022] Open
Abstract
The ability to generate temporal prediction (TP) is fundamental to our survival since it allows us to selectively orient our attention in time in order to prioritize relevant environmental information. Studies on adult participants showed that externally and internally driven mechanisms can be engaged to establish TP, both resulting in better behavioural performance. However, few studies on children have investigated the ability to engage internally and externally driven TP, especially in relation to how these mechanisms change across development. In this study, 111 participants (88 children between six and eleven years of age, and 23 adults) were tested by means of a simple reaction time paradigm, in which temporal cueing and neutral conditions were orthogonally manipulated to induce externally and internally driven TP mechanisms, as well as an interaction between the two. Sequential effects (SEs) relative to both tasks were also investigated. Results showed that all children participating in the study were able to implement both external and internal TP in an independent fashion. However, children younger than eight years were not able to combine both strategies. Furthermore, in the temporal cueing blocks they did not show the typically-observed asymmetric SE pattern. These results suggest that children can flexibly use both external and internal TP mechanisms to optimise their behaviour, although their successful combined use develops only after eight years of age.
Collapse
Affiliation(s)
- Giovanni Mento
- Department of General Psychology, University of Padua, Via Venezia 8, 35131, Padua, Italy
- * E-mail:
| | - Vincenza Tarantino
- Department of General Psychology, University of Padua, Via Venezia 8, 35131, Padua, Italy
- Department of Neurosciences: SNPSRR, University of Padua, Via Giustiniani 5, 35128, Padova (PD), Italy
| |
Collapse
|
6
|
Sharma M, Dhamani I, Leung J, Carlile S. Attention, memory, and auditory processing in 10- to 15-year-old children with listening difficulties. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2014; 57:2308-2321. [PMID: 25198800 DOI: 10.1044/2014_jslhr-h-13-0226] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/03/2013] [Accepted: 08/20/2014] [Indexed: 06/03/2023]
Abstract
PURPOSE The aim of this study was to examine attention, memory, and auditory processing in children with reported listening difficulty in noise (LDN) despite having clinically normal hearing. METHOD Twenty-one children with LDN and 15 children with no listening concerns (controls) participated. The clinically normed auditory processing tests included the Frequency/Pitch Pattern Test (FPT; Musiek, 2002), the Dichotic Digits Test (Musiek, 1983), the Listening in Spatialized Noise-Sentences (LiSN-S) test (Dillon, Cameron, Glyde, Wilson, & Tomlin, 2012), gap detection in noise (Baker, Jayewardene, Sayle, & Saeed, 2008), and masking level difference (MLD; Wilson, Moncrieff, Townsend, & Pillion, 2003). Also included were research-based psychoacoustic tasks, such as auditory stream segregation, localization, sinusoidal amplitude modulation (SAM), and fine structure perception. All were also evaluated on attention and memory test batteries. RESULTS The LDN group was significantly slower switching their auditory attention and had poorer inhibitory control. Additionally, the group mean results showed significantly poorer performance on FPT, MLD, 4-Hz SAM, and memory tests. Close inspection of the individual data revealed that only 5 participants (out of 21) in the LDN group showed significantly poor performance on FPT compared with clinical norms. Further testing revealed the frequency discrimination of these 5 children to be significantly impaired. CONCLUSION Thus, the LDN group showed deficits in attention switching and inhibitory control, whereas only a subset of these participants demonstrated an additional frequency resolution deficit.
Collapse
|
7
|
Bonino AY, Leibold LJ, Buss E. Effect of signal-temporal uncertainty in children and adults: tone detection in noise or a random-frequency masker. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2013; 134:4446. [PMID: 25669256 PMCID: PMC3874056 DOI: 10.1121/1.4828828] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/07/2012] [Revised: 10/12/2013] [Accepted: 10/18/2013] [Indexed: 06/04/2023]
Abstract
A cue indicating when in time to listen can improve adults' tone detection thresholds, particularly for conditions that produce substantial informational masking. The purpose of this study was to determine if 5- to 13-yr-old children likewise benefit from a light cue indicating when in time to listen for a masked pure-tone signal. Each listener was tested in one of two continuous maskers: Broadband noise (low informational masking) or a random-frequency, two-tone masker (high informational masking). Using a single-interval method of constant stimuli, detection thresholds were measured for two temporal conditions: (1) Temporally-defined, with the listening interval defined by a light cue, and (2) temporally-uncertain, with no light cue. Thresholds estimated from psychometric functions fitted to the data indicated that children and adults benefited to the same degree from the visual cue. Across listeners, the average benefit of a defined listening interval was 1.8 dB in the broadband noise and 8.6 dB in the random-frequency, two-tone masker. Thus, the benefit of knowing when in time to listen was more robust for conditions believed to be dominated by informational masking. An unexpected finding of this study was that children's thresholds were comparable to adults' in the random-frequency, two-tone masker.
Collapse
Affiliation(s)
- Angela Yarnell Bonino
- Department of Allied Health Sciences, The University of North Carolina at Chapel Hill, CB 7190, Chapel Hill, North Carolina 27599
| | - Lori J Leibold
- Department of Allied Health Sciences, The University of North Carolina at Chapel Hill, CB 7190, Chapel Hill, North Carolina 27599
| | - Emily Buss
- Department of Otolaryngology/Head and Neck Surgery, The University of North Carolina at Chapel Hill, CB 7070, Chapel Hill, North Carolina 27599
| |
Collapse
|
8
|
Dhamani I, Leung J, Carlile S, Sharma M. Switch attention to listen. Sci Rep 2013; 3:1297. [PMID: 23416613 PMCID: PMC3575018 DOI: 10.1038/srep01297] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2012] [Accepted: 02/01/2013] [Indexed: 11/09/2022] Open
Abstract
The aim of this research was to evaluate the ability to switch attention and selectively attend to relevant information in children (10-15 years) with persistent listening difficulties in noisy environments. A wide battery of clinical tests indicated that children with complaints of listening difficulties had otherwise normal hearing sensitivity and auditory processing skills. Here we show that these children are markedly slower to switch their attention compared to their age-matched peers. The results suggest poor attention switching, lack of response inhibition and/or poor listening effort consistent with a predominantly top-down (central) information processing deficit. A deficit in the ability to switch attention across talkers would provide the basis for this otherwise hidden listening disability, especially in noisy environments involving multiple talkers such as classrooms.
Collapse
Affiliation(s)
- Imran Dhamani
- Audiology Section, Macquarie University and The Hearing CRC.
| | | | | | | |
Collapse
|
9
|
Werner LA. Infants' detection and discrimination of sounds in modulated maskers. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2013; 133:4156-4167. [PMID: 23742367 PMCID: PMC3689834 DOI: 10.1121/1.4803903] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/10/2012] [Revised: 04/15/2013] [Accepted: 04/19/2013] [Indexed: 06/02/2023]
Abstract
Adults and 7-month-old infants were compared in detection and discrimination of sounds in modulated maskers. In two experiments, the level of a target sound was varied to equate listeners' performance in unmodulated noise, and performance was assessed at that level in a noise modulated with the envelope of single-talker speech. While adults' vowel discrimination and tone detection were better in the modulated than in the unmodulated masker, infants' vowel discrimination was poorer in the modulated than in the unmodulated masker. Infants' tone detection was the same in the two maskers. In two additional experiments, each age group was tested at one level with order of testing in modulated and unmodulated maskers counterbalanced across subjects. Both infants and adults discriminated between vowels better in single-talker modulated and sinusoidally amplitude modulated (SAM) maskers than in an unmodulated masker, but infants' modulated-unmodulated difference was smaller than than that of adults. Increasing the modulation depth of the SAM masker did not affect the size of infants' modulated-unmodulated difference. However, infants' asymptotic performance in a modulated masker limits the extent to which their performance could improve. Infants can make use of information in masker dips, but masker modulation may also interfere with their ability to process the target.
Collapse
Affiliation(s)
- Lynne A Werner
- Department of Speech and Hearing Sciences, University of Washington, Seattle, Washington 98105-6246, USA
| |
Collapse
|
10
|
Tarr E, Nittrouer S. Explaining coherence in coherence masking protection for adults and children. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2013; 133:4218-4231. [PMID: 23742373 PMCID: PMC3689784 DOI: 10.1121/1.4802638] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/30/2012] [Revised: 03/21/2013] [Accepted: 03/22/2013] [Indexed: 06/01/2023]
Abstract
Coherence masking protection (CMP) is the phenomenon in which a low-frequency target (typically a first formant) is labeled accurately in poorer signal-to-noise levels when combined with a high-frequency cosignal, rather than presented alone. An earlier study by the authors revealed greater CMP for children than adults, with more resistance to disruptions in harmonicity across spectral components [Nittrouer and Tarr (2011). Atten. Percept. Psychophys. 73, 2606-2623]. That finding was interpreted as demonstrating that children are obliged to process speech signals as broad spectral patterns, regardless of the harmonic structure of the spectral components. The current study tested three alternative, auditory explanations for the observed coherence of target + cosignal: (1) unique spectral shapes of target + cosignal support labeling, (2) periodicity of target + cosignal promotes coherence, and (3) temporal synchrony across target + cosignal reinforces temporal expectancies. Adults, eight-year-olds, and five-year-olds labeled stimuli in five conditions: F1 only and F1 + a constant cosignal (both used previously) were benchmarks for comparing thresholds for F1 + 3 new cosignals. Children again showed greater CMP than adults, but none of the three hypotheses could explain their CMP. It was again concluded that children are obliged to recognize speech signals as broad spectral patterns.
Collapse
Affiliation(s)
- Eric Tarr
- Department of Otolaryngology, The Ohio State University, 915 Olentangy River Road, Suite 4000, Columbus, Ohio 43212, USA.
| | | |
Collapse
|
11
|
Buss E, Hall JW, Grose JH. Development of Auditory Coding as Reflected in Psychophysical Performance. HUMAN AUDITORY DEVELOPMENT 2012. [DOI: 10.1007/978-1-4614-1421-6_4] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
|
12
|
Dai H, Micheyl C. Psychometric functions for pure-tone frequency discrimination. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2011; 130:263-72. [PMID: 21786896 PMCID: PMC3155586 DOI: 10.1121/1.3598448] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/16/2023]
Abstract
The form of the psychometric function (PF) for auditory frequency discrimination is of theoretical interest and practical importance. In this study, PFs for pure-tone frequency discrimination were measured for several standard frequencies (200-8000 Hz) and levels [35-85 dB sound pressure level (SPL)] in normal-hearing listeners. The proportion-correct data were fitted using a cumulative-Gaussian function of the sensitivity index, d', computed as a power transformation of the frequency difference, Δf. The exponent of the power function corresponded to the slope of the PF on log(d')-log(Δf) coordinates. The influence of attentional lapses on PF-slope estimates was investigated. When attentional lapses were not taken into account, the estimated PF slopes on log(d')-log(Δf) coordinates were found to be significantly lower than 1, suggesting a nonlinear relationship between d' and Δf. However, when lapse rate was included as a free parameter in the fits, PF slopes were found not to differ significantly from 1, consistent with a linear relationship between d' and Δf. This was the case across the wide ranges of frequencies and levels tested in this study. Therefore, spectral and temporal models of frequency discrimination must account for a linear relationship between d' and Δf across a wide range of frequencies and levels.
Collapse
Affiliation(s)
- Huanping Dai
- Department of Speech, Language, and Hearing Sciences, University of Arizona, 1131 East 2nd Street, Tucson, Arizona, 85721, USA.
| | | |
Collapse
|
13
|
He S, Buss E, Hall JW. Monaural temporal integration and temporally selective listening in children and adults. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2010; 127:3643-3653. [PMID: 20550263 PMCID: PMC2896408 DOI: 10.1121/1.3397464] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/28/2009] [Revised: 03/12/2010] [Accepted: 03/23/2010] [Indexed: 05/29/2023]
Abstract
This study used two paradigms to investigate the development of temporal integration and temporally selective listening. Experiment 1 measured detection as a function of duration for a pure tone at 1625 or 6500 Hz. At both frequencies thresholds of children younger than 7 years old were higher than those for older children and adults. The pattern of temporal integration was similar across groups for the 6500-Hz signal, but younger children showed relatively more temporal integration for the 1625-Hz signal due to high thresholds for the briefest 1625-Hz signal. Experiment 2 measured detection thresholds for one or for three brief tone pips presented in a noise masker. In one set of conditions, the noise masker consisted of 100-ms steady bursts interleaved with 10-ms temporal gaps. In other conditions, the level of the central 50 ms of the 100-ms masking noise bursts was adjusted by either +6 or -6 dB. Children showed higher thresholds but similar temporal integration compared with adults. Overall, these data suggest that children are less efficient than adults in weighting the output of the monaural temporal window at 1625 but not 6500 Hz. Children are efficient in combining energy from brief temporal epochs that are separated by noise.
Collapse
Affiliation(s)
- Shuman He
- Department of Otolaryngology/Head and Neck Surgery, University of North Carolina School of Medicine, Chapel Hill, North Carolina 27599, USA.
| | | | | |
Collapse
|