1
|
DeFreese A, Camarata S, Sunderhaus L, Holder J, Berg K, Lighterink M, Gifford R. The impact of spectral and temporal processing on speech recognition in children with cochlear implants. Sci Rep 2024; 14:14094. [PMID: 38890428 PMCID: PMC11189542 DOI: 10.1038/s41598-024-63932-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2024] [Accepted: 06/03/2024] [Indexed: 06/20/2024] Open
Abstract
While the relationships between spectral resolution, temporal resolution, and speech recognition are well defined in adults with cochlear implants (CIs), they are not well defined for prelingually deafened children with CIs, for whom language development is ongoing. This cross-sectional study aimed to better characterize these relationships in a large cohort of prelingually deafened children with CIs (N = 47; mean age = 8.33 years) by comprehensively measuring spectral resolution thresholds (measured via spectral modulation detection), temporal resolution thresholds (measured via sinusoidal amplitude modulation detection), and speech recognition (measured via monosyllabic word recognition, vowel recognition, and sentence recognition in noise via both fixed signal-to-noise ratio (SNR) and adaptively varied SNR). Results indicated that neither spectral or temporal resolution were significantly correlated with speech recognition in quiet or noise for children with CIs. Both age and CI experience had a moderate effect on spectral resolution, with significant effects for spectral modulation detection at a modulation rate of 0.5 cyc/oct, suggesting spectral resolution may improve with maturation. Thus, it is possible we may see an emerging relationship between spectral resolution and speech perception over time for children with CIs. While further investigation into this relationship is warranted, these findings demonstrate the need for new investigations to uncover ways of improving spectral resolution for children with CIs.
Collapse
Affiliation(s)
- Andrea DeFreese
- Department of Hearing and Speech Sciences, Vanderbilt University Medical Center, 1215 21st Avenue South, Nashville, TN, 37232, USA.
| | - Stephen Camarata
- Department of Hearing and Speech Sciences, Vanderbilt University Medical Center, 1215 21st Avenue South, Nashville, TN, 37232, USA
| | - Linsey Sunderhaus
- Department of Hearing and Speech Sciences, Vanderbilt University Medical Center, 1215 21st Avenue South, Nashville, TN, 37232, USA
| | - Jourdan Holder
- Department of Hearing and Speech Sciences, Vanderbilt University Medical Center, 1215 21st Avenue South, Nashville, TN, 37232, USA
| | - Katelyn Berg
- Department of Hearing and Speech Sciences, Vanderbilt University Medical Center, 1215 21st Avenue South, Nashville, TN, 37232, USA
| | - Mackenzie Lighterink
- Department of Hearing and Speech Sciences, Vanderbilt University Medical Center, 1215 21st Avenue South, Nashville, TN, 37232, USA
| | - René Gifford
- Department of Hearing and Speech Sciences, Vanderbilt University Medical Center, 1215 21st Avenue South, Nashville, TN, 37232, USA
| |
Collapse
|
2
|
de la Cruz-Pavía I, Hegde M, Cabrera L, Nazzi T. Infants' abilities to segment word forms from spectrally degraded speech in the first year of life. Dev Sci 2024:e13533. [PMID: 38853379 DOI: 10.1111/desc.13533] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2023] [Revised: 04/22/2024] [Accepted: 05/15/2024] [Indexed: 06/11/2024]
Abstract
Infants begin to segment word forms from fluent speech-a crucial task in lexical processing-between 4 and 7 months of age. Prior work has established that infants rely on a variety of cues available in the speech signal (i.e., prosodic, statistical, acoustic-segmental, and lexical) to accomplish this task. In two experiments with French-learning 6- and 10-month-olds, we use a psychoacoustic approach to examine if and how degradation of the two fundamental acoustic components extracted from speech by the auditory system, namely, temporal (both frequency and amplitude modulation) and spectral information, impact word form segmentation. Infants were familiarized with passages containing target words, in which frequency modulation (FM) information was replaced with pure tones using a vocoder, while amplitude modulation (AM) was preserved in either 8 or 16 spectral bands. Infants were then tested on their recognition of the target versus novel control words. While the 6-month-olds were unable to segment in either condition, the 10-month-olds succeeded, although only in the 16 spectral band condition. These findings suggest that 6-month-olds need FM temporal cues for speech segmentation while 10-month-olds do not, although they need the AM cues to be presented in enough spectral bands (i.e., 16). This developmental change observed in infants' sensitivity to spectrotemporal cues likely results from an increase in the range of available segmentation procedures, and/or shift from a vowel to a consonant bias in lexical processing between the two ages, as vowels are more affected by our acoustic manipulations. RESEARCH HIGHLIGHTS: Although segmenting speech into word forms is crucial for lexical acquisition, the acoustic information that infants' auditory system extracts to process continuous speech remains unknown. We examined infants' sensitivity to spectrotemporal cues in speech segmentation using vocoded speech, and revealed a developmental change between 6 and 10 months of age. We showed that FM information, that is, the fast temporal modulations of speech, is necessary for 6- but not 10-month-old infants to segment word forms. Moreover, reducing the number of spectral bands impacts 10-month-olds' segmentation abilities, who succeed when 16 bands are preserved, but fail with 8 bands.
Collapse
Affiliation(s)
- Irene de la Cruz-Pavía
- Faculty of Social and Human Sciences, Universidad de Deusto, Bilbao, Spain
- Basque Foundation for Science Ikerbasque, Bilbao, Spain
| | - Monica Hegde
- INCC UMR 8002, CNRS, F-75006, Université Paris Cité, Paris, France
| | | | - Thierry Nazzi
- INCC UMR 8002, CNRS, F-75006, Université Paris Cité, Paris, France
| |
Collapse
|
3
|
Lorenzini I, Lorenzi C, Varnet L, Cabrera L. Reduced processing efficiency impacts auditory detection of amplitude modulation in children: Evidence from an experimental and modeling study. Hear Res 2024; 445:108982. [PMID: 38484447 DOI: 10.1016/j.heares.2024.108982] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/11/2023] [Revised: 02/06/2024] [Accepted: 02/20/2024] [Indexed: 04/07/2024]
Abstract
Auditory detection of the Amplitude Modulation (AM) of sounds, crucial for speech perception, improves until 10 years of age. This protracted development may not only be explained by sensory maturation, but also by improvements in processing efficiency: the ability to make efficient use of available sensory information. This hypothesis was tested behaviorally on 86 6-to-9-year-olds and 15 adults using AM-detection tasks assessing absolute sensitivity, masking, and response consistency in the AM domain. Absolute sensitivity was estimated by the detection thresholds of a sinusoidal AM applied to a pure-tone carrier; AM masking was estimated as the elevation of AM-detection thresholds produced when replacing the pure-tone carrier by a narrowband noise; response consistency was estimated using a double-pass paradigm where the same set of stimuli was presented twice. Results showed that AM sensitivity improved from childhood to adulthood, but did not change between 6 and 9 years. AM masking did not change with age, suggesting that the selectivity of perceptual AM filters was adult-like by 6 years. However, response consistency increased developmentally, supporting the hypothesis of reduced processing efficiency in early childhood. At the group level, double-pass data of children and adults were well simulated by a model of the human auditory system assuming a higher level of internal noise for children. At the individual level, for both children and adults, double-pass data were better simulated when assuming a sub-optimal decision strategy in addition to differences in internal noise. In conclusion, processing efficiency for AM detection is reduced in childhood. Moreover, worse AM detection was linked to both systematic and stochastic inefficiencies, in both children and adults.
Collapse
Affiliation(s)
- Irene Lorenzini
- Integrative Neuroscience and Cognition Center, CNRS and Université Paris Cité, 45 rue des Saints-Pères, 75006 Paris, France; Laboratoire Ethologie, Cognition, Développement, Université Paris Nanterre, 200 av. de la République, 9200, Nanterre, France.
| | - Christian Lorenzi
- Laboratoire des systèmes perceptifs, Département d'études cognitives, École normale supérieure, Université Paris Sciences et Lettres (PSL University), 29 rue d'Ulm, 75005 Paris, France
| | - Léo Varnet
- Laboratoire des systèmes perceptifs, Département d'études cognitives, École normale supérieure, Université Paris Sciences et Lettres (PSL University), 29 rue d'Ulm, 75005 Paris, France
| | - Laurianne Cabrera
- Integrative Neuroscience and Cognition Center, CNRS and Université Paris Cité, 45 rue des Saints-Pères, 75006 Paris, France
| |
Collapse
|
4
|
Petley L, Blankenship C, Hunter LL, Stewart HJ, Lin L, Moore DR. Amplitude Modulation Perception and Cortical Evoked Potentials in Children With Listening Difficulties and Their Typically Developing Peers. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2024; 67:633-656. [PMID: 38241680 PMCID: PMC11000788 DOI: 10.1044/2023_jslhr-23-00317] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/18/2023] [Revised: 09/01/2023] [Accepted: 11/09/2023] [Indexed: 01/21/2024]
Abstract
PURPOSE Amplitude modulations (AMs) are important for speech intelligibility, and deficits in speech intelligibility are a leading source of impairment in childhood listening difficulties (LiD). The present study aimed to explore the relationships between AM perception and speech-in-noise (SiN) comprehension in children and to determine whether deficits in AM processing contribute to childhood LiD. Evoked responses were used to parse the neural origins of AM processing. METHOD Forty-one children with LiD and 44 typically developing children, ages 8-16 years, participated in the study. Behavioral AM depth thresholds were measured at 4 and 40 Hz. SiN tasks included the Listening in Spatialized Noise-Sentences Test (LiSN-S) and a coordinate response measure (CRM)-based task. Evoked responses were obtained during an AM change detection task using alternations between 4 and 40 Hz, including the N1 of the acoustic change complex, auditory steady-state response (ASSR), P300, and a late positive response (late potential [LP]). Maturational effects were explored via age correlations. RESULTS Age correlated with 4-Hz AM thresholds, CRM separated talker scores, and N1 amplitude. Age-normed LiSN-S scores obtained without spatial or talker cues correlated with age-corrected 4-Hz AM thresholds and area under the LP curve. CRM separated talker scores correlated with AM thresholds and area under the LP curve. Most behavioral measures of AM perception correlated with the signal-to-noise ratio and phase coherence of the 40-Hz ASSR. AM change response time also correlated with area under the LP curve. Children with LiD exhibited deficits with respect to 4-Hz thresholds, AM change accuracy, and area under the LP curve. CONCLUSIONS The observed relationships between AM perception and SiN performance extend the evidence that modulation perception is important for understanding SiN in childhood. In line with this finding, children with LiD demonstrated poorer performance on some measures of AM perception, but their evoked responses implicated a primarily cognitive deficit. SUPPLEMENTAL MATERIAL https://doi.org/10.23641/asha.25009103.
Collapse
Affiliation(s)
- Lauren Petley
- Communication Sciences Research Center, Cincinnati Children's Hospital Medical Center, OH
- Patient Services Research, Cincinnati Children's Hospital Medical Center, OH
- Department of Psychology, Clarkson University, Potsdam, NY
| | - Chelsea Blankenship
- Communication Sciences Research Center, Cincinnati Children's Hospital Medical Center, OH
- Patient Services Research, Cincinnati Children's Hospital Medical Center, OH
| | - Lisa L. Hunter
- Communication Sciences Research Center, Cincinnati Children's Hospital Medical Center, OH
- Patient Services Research, Cincinnati Children's Hospital Medical Center, OH
- Department of Otolaryngology, College of Medicine, University of Cincinnati, OH
- Department of Communication Sciences and Disorders, College of Allied Health Sciences, University of Cincinnati, OH
| | | | - Li Lin
- Communication Sciences Research Center, Cincinnati Children's Hospital Medical Center, OH
- Patient Services Research, Cincinnati Children's Hospital Medical Center, OH
| | - David R. Moore
- Communication Sciences Research Center, Cincinnati Children's Hospital Medical Center, OH
- Patient Services Research, Cincinnati Children's Hospital Medical Center, OH
- Department of Otolaryngology, College of Medicine, University of Cincinnati, OH
- Manchester Centre for Audiology and Deafness, The University of Manchester, United Kingdom
| |
Collapse
|
5
|
Peng ZE, Easwar V. Development of amplitude modulation, voice onset time, and consonant identification in noise and reverberation. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2024; 155:1071-1085. [PMID: 38341737 DOI: 10.1121/10.0024461] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/23/2023] [Accepted: 01/02/2024] [Indexed: 02/13/2024]
Abstract
Children's speech understanding is vulnerable to indoor noise and reverberation: e.g., from classrooms. It is unknown how they develop the ability to use temporal acoustic cues, specifically amplitude modulation (AM) and voice onset time (VOT), which are important for perceiving distorted speech. Through three experiments, we investigated the typical development of AM depth detection in vowels (experiment I), categorical perception of VOT (experiment II), and consonant identification (experiment III) in quiet and in speech-shaped noise (SSN) and mild reverberation in 6- to 14-year-old children. Our findings suggested that AM depth detection using a naturally produced vowel at the rate of the fundamental frequency was particularly difficult for children and with acoustic distortions. While the VOT cue salience was monotonically attenuated with increasing signal-to-noise ratio of SSN, its utility for consonant discrimination was completely removed even under mild reverberation. The reverberant energy decay in distorting critical temporal cues provided further evidence that may explain the error patterns observed in consonant identification. By 11-14 years of age, children approached adult-like performance in consonant discrimination and identification under adverse acoustics, emphasizing the need for good acoustics for younger children as they develop auditory skills to process distorted speech in everyday listening environments.
Collapse
Affiliation(s)
- Z Ellen Peng
- Waisman Center, University of Wisconsin-Madison, Madison, Wisconsin 53705, USA
| | | |
Collapse
|
6
|
Petley L, Blankenship C, Hunter LL, Stewart HJ, Lin L, Moore DR. Amplitude modulation perception and cortical evoked potentials in children with listening difficulties and their typically-developing peers. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.10.26.23297523. [PMID: 37961469 PMCID: PMC10635202 DOI: 10.1101/2023.10.26.23297523] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/15/2023]
Abstract
Purpose Amplitude modulations (AM) are important for speech intelligibility, and deficits in speech intelligibility are a leading source of impairment in childhood listening difficulties (LiD). The present study aimed to explore the relationships between AM perception and speech-in-noise (SiN) comprehension in children and to determine whether deficits in AM processing contribute to childhood LiD. Evoked responses were used to parse the neural origin of AM processing. Method Forty-one children with LiD and forty-four typically-developing children, ages 8-16 y.o., participated in the study. Behavioral AM depth thresholds were measured at 4 and 40 Hz. SiN tasks included the LiSN-S and a Coordinate Response Measure (CRM)-based task. Evoked responses were obtained during an AM Change detection task using alternations between 4 and 40 Hz, including the N1 of the acoustic change complex, auditory steady-state response (ASSR), P300, and a late positive response (LP). Maturational effects were explored via age correlations. Results Age correlated with 4 Hz AM thresholds, CRM Separated Talker scores, and N1 amplitude. Age-normed LiSN-S scores obtained without spatial or talker cues correlated with age-corrected 4 Hz AM thresholds and area under the LP curve. CRM Separated Talker scores correlated with AM thresholds and area under the LP curve. Most behavioral measures of AM perception correlated with the SNR and phase coherence of the 40 Hz ASSR. AM Change RT also correlated with area under the LP curve. Children with LiD exhibited deficits with respect to 4 Hz thresholds, AM Change accuracy, and area under the LP curve. Conclusions The observed relationships between AM perception and SiN performance extend the evidence that modulation perception is important for understanding SiN in childhood. In line with this finding, children with LiD demonstrated poorer performance on some measures of AM perception, but their evoked responses implicated a primarily cognitive deficit.
Collapse
|
7
|
Cabrera L, Lau BK. The development of auditory temporal processing during the first year of life. HEARING, BALANCE AND COMMUNICATION 2022; 20:155-165. [PMID: 36111124 PMCID: PMC9473293 DOI: 10.1080/21695717.2022.2029092] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
OBJECTIVES The processing of auditory temporal information is important for the extraction of voice pitch, linguistic information, as well as the overall temporal structure of speech. However, many aspects of its early development remain poorly understood. This paper reviews the development of auditory temporal processing during the first year of life when infants are acquiring their native language. METHODS First, potential mechanisms of neural immaturity are discussed in the context of neurophysiological studies. Next, what is known about infant auditory capabilities is considered with a focus on psychophysical studies involving non-speech stimuli to investigate the perception of temporal fine structure and envelope cues. This is followed by a review of studies involving speech stimuli, including those that present vocoded signals as a method of degrading the spectro-temporal information available to infant listeners. RESULTS/CONCLUSION This review suggests that temporal resolution may be well developed in the first postnatal months, but that the ability to use and process the temporal information in an efficient way along the entire auditory pathway is longer to develop. Those findings have crucial implications for the development of language abilities, especially for infants with hearing impairment who are using cochlear implants.
Collapse
Affiliation(s)
- Laurianne Cabrera
- Université de Paris, INCC UMR 8002, CNRS, 45 rue des saints-pères, F-75006 Paris, France
| | - Bonnie K Lau
- Department of Otolaryngology - Head & Neck Surgery, University of Washington, 1701 NE Columbia Rd, Box 257923, Seattle, WA 98195
| |
Collapse
|
8
|
Varnet L, Lorenzi C. Probing temporal modulation detection in white noise using intrinsic envelope fluctuations: A reverse-correlation study. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2022; 151:1353. [PMID: 35232105 DOI: 10.1121/10.0009629] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/06/2021] [Accepted: 02/03/2022] [Indexed: 06/14/2023]
Abstract
Part of the detrimental effect caused by a stationary noise on sound perception results from the masking of relevant amplitude modulations (AM) in the signal by random intrinsic envelope fluctuations arising from the filtering of noise by cochlear channels. This study capitalizes on this phenomenon to probe AM detection strategies for human listeners using a reverse correlation analysis. Eight normal-hearing listeners were asked to detect the presence of a 4-Hz sinusoidal AM target applied to a 1-kHz tone carrier using a yes-no task with 3000 trials/participant. All stimuli were embedded in a white-noise masker. A reverse-correlation analysis was then carried on the data to compute "psychophysical kernels" showing which aspects of the stimulus' temporal envelope influenced the listener's responses. These results were compared to data simulated with different implementations of a modulation-filterbank model. Psychophysical kernels revealed that human listeners were able to track the position of AM peaks in the target, similar to the models. However, they also showed a marked temporal decay and a consistent phase shift compared to the ideal template. In light of the simulated data, this was interpreted as an evidence for the presence of phase uncertainty in the processing of intrinsic envelope fluctuations.
Collapse
Affiliation(s)
- Léo Varnet
- Laboratoire des Systèmes Perceptifs, Département d'Études Cognitives, École Normale Supérieure, Université Paris Sciences & Lettres, Centre National de la Recherche Scientifique, 75005 Paris, France
| | - Christian Lorenzi
- Laboratoire des Systèmes Perceptifs, Département d'Études Cognitives, École Normale Supérieure, Université Paris Sciences & Lettres, Centre National de la Recherche Scientifique, 75005 Paris, France
| |
Collapse
|
9
|
Cabrera L, Lorenzini I, Rosen S, Varnet L, Lorenzi C. Temporal integration for amplitude modulation in childhood: Interaction between internal noise and memory. Hear Res 2021; 415:108403. [PMID: 34879987 DOI: 10.1016/j.heares.2021.108403] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/27/2021] [Revised: 11/17/2021] [Accepted: 11/25/2021] [Indexed: 11/25/2022]
Abstract
It is still unclear whether the gradual improvement in amplitude-modulation (AM) sensitivity typically found in children up to 10 years of age reflects an improvement in "processing efficiency" (the central ability to use information extracted by sensory mechanisms). This hypothesis was tested by evaluating temporal integration for AM, a capacity relying on memory and decision factors. This was achieved by measuring the effect of increasing the number of AM cycles (2 vs 8) on AM-detection thresholds for three groups of children aged from 5 to 11 years and a group of young adults. AM-detection thresholds were measured using a forced-choice procedure and sinusoidal AM (4 or 32 Hz rate) applied to a 1024-Hz pure-tone carrier. All age groups demonstrated temporal integration for AM at both rates; that is, significant improvements in AM sensitivity with a higher number of AM cycles. However, an effect of age is observed as both 5-6 year olds and adults exhibited more temporal integration compared to 7-8 and 10-11 year olds at both rates. This difference is due to: (i) the 5-6 year olds displaying the worst thresholds with 2 AM cycles, but similar thresholds with 8 cycles compared to the 7-8 and 10-11 year olds, and, (ii) adults showing the best thresholds with 8 AM cycles but similar thresholds with 2 cycles compared to the 7-8 and 10-11 year olds. Computational modelling indicated that higher levels of internal noise combined with poorer short-term memory capacities in children accounted for the developmental trends. Improvement in processing efficiency may therefore account for the development of AM detection in childhood. This article is part of the Special Issue Outer hair cell Edited by Joseph Santos-Sacchi and Kumar Navaratnam.
Collapse
Affiliation(s)
- Laurianne Cabrera
- Université de Paris, CNRS, Integrative Neuroscience and Cognition Center, F-75006 Paris, France; Speech, Hearing and Phonetic Sciences, UCL, United Kingdom.
| | - Irene Lorenzini
- Université de Paris, CNRS, Integrative Neuroscience and Cognition Center, F-75006 Paris, France
| | - Stuart Rosen
- Speech, Hearing and Phonetic Sciences, UCL, United Kingdom
| | - Léo Varnet
- Laboratoire des Systèmes Perceptifs (UMR 8248), CNRS, Ecole normale supérieure, Université Paris Sciences & Lettres (PSL), Paris, France
| | - Christian Lorenzi
- Laboratoire des Systèmes Perceptifs (UMR 8248), CNRS, Ecole normale supérieure, Université Paris Sciences & Lettres (PSL), Paris, France
| |
Collapse
|
10
|
Attia S, King A, Varnet L, Ponsot E, Lorenzi C. Double-pass consistency for amplitude- and frequency-modulation detection in normal-hearing listeners. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2021; 150:3631. [PMID: 34852611 DOI: 10.1121/10.0006811] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/08/2020] [Accepted: 10/05/2021] [Indexed: 06/13/2023]
Abstract
Amplitude modulation (AM) and frequency modulation (FM) provide crucial auditory information. If FM is encoded as AM, it should be possible to give a unified account of AM and FM perception both in terms of response consistency and performance. These two aspects of behavior were estimated for normal-hearing participants using a constant-stimuli, forced-choice detection task repeated twice with the same stimuli (double pass). Sinusoidal AM or FM with rates of 2 or 20 Hz were applied to a 500-Hz pure-tone carrier and presented at detection threshold. All stimuli were masked by a modulation noise. Percent agreement of responses across passes and percent-correct detection for the two passes were used to estimate consistency and performance, respectively. These data were simulated using a model implementing peripheral processes, a central modulation filterbank, an additive internal noise, and a template-matching device. Different levels of internal noise were required to reproduce AM and FM data, but a single level could account for the 2- and 20-Hz AM data. As for FM, two levels of internal noise were needed to account for detection at slow and fast rates. Finally, the level of internal noise yielding best predictions increased with the level of the modulation-noise masker. Overall, these results suggest that different sources of internal variability are involved for AM and FM detection at low audio frequencies.
Collapse
Affiliation(s)
- Sarah Attia
- Laboratoire des systèmes perceptifs (CNRS 8248), Département d'études cognitives, Ecole normale supérieure, Université Paris Sciences et Lettres, 29 rue d'Ulm, 75005 Paris, France
| | - Andrew King
- Laboratoire des systèmes perceptifs (CNRS 8248), Département d'études cognitives, Ecole normale supérieure, Université Paris Sciences et Lettres, 29 rue d'Ulm, 75005 Paris, France
| | - Léo Varnet
- Laboratoire des systèmes perceptifs (CNRS 8248), Département d'études cognitives, Ecole normale supérieure, Université Paris Sciences et Lettres, 29 rue d'Ulm, 75005 Paris, France
| | - Emmanuel Ponsot
- Laboratoire des systèmes perceptifs (CNRS 8248), Département d'études cognitives, Ecole normale supérieure, Université Paris Sciences et Lettres, 29 rue d'Ulm, 75005 Paris, France
| | - Christian Lorenzi
- Laboratoire des systèmes perceptifs (CNRS 8248), Département d'études cognitives, Ecole normale supérieure, Université Paris Sciences et Lettres, 29 rue d'Ulm, 75005 Paris, France
| |
Collapse
|
11
|
Peng ZE, Pausch F, Fels J. Spatial release from masking in reverberation for school-age children. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2021; 150:3263. [PMID: 34852617 PMCID: PMC8730369 DOI: 10.1121/10.0006752] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/15/2021] [Revised: 09/29/2021] [Accepted: 09/29/2021] [Indexed: 05/06/2023]
Abstract
Understanding speech in noisy environments, such as classrooms, is a challenge for children. When a spatial separation is introduced between the target and masker, as compared to when both are co-located, children demonstrate intelligibility improvement of the target speech. Such intelligibility improvement is known as spatial release from masking (SRM). In most reverberant environments, binaural cues associated with the spatial separation are distorted; the extent to which such distortion will affect children's SRM is unknown. Two virtual acoustic environments with reverberation times between 0.4 s and 1.1 s were compared. SRM was measured using a spatial separation with symmetrically displaced maskers to maximize access to binaural cues. The role of informational masking in modulating SRM was investigated through voice similarity between the target and masker. Results showed that, contradictory to previous developmental findings on free-field SRM, children's SRM in reverberation has not yet reached maturity in the 7-12 years age range. When reducing reverberation, an SRM improvement was seen in adults but not in children. Our findings suggest that, even though school-age children have access to binaural cues that are distorted in reverberation, they demonstrate immature use of such cues for speech-in-noise perception, even in mild reverberation.
Collapse
Affiliation(s)
- Z Ellen Peng
- Institute for Hearing Technology and Acoustics, RWTH Aachen University, Kopernikusstrasse 5, 52074 Aachen, Germany
| | - Florian Pausch
- Institute for Hearing Technology and Acoustics, RWTH Aachen University, Kopernikusstrasse 5, 52074 Aachen, Germany
| | - Janina Fels
- Institute for Hearing Technology and Acoustics, RWTH Aachen University, Kopernikusstrasse 5, 52074 Aachen, Germany
| |
Collapse
|
12
|
Flanagan SA, Moore BCJ, Wilson AM, Gabrielczyk FC, MacFarlane A, Mandke K, Goswami U. Development of binaural temporal fine structure sensitivity in children. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2021; 150:2967. [PMID: 34717481 DOI: 10.1121/10.0006665] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/19/2021] [Accepted: 09/22/2021] [Indexed: 06/13/2023]
Abstract
The highest frequency for which the temporal fine structure (TFS) of a sinewave can be compared across ears varies between listeners with an upper limit of about 1400 Hz for young normal-hearing adults (YNHA). In this study, binaural TFS sensitivity was investigated for 63 typically developing children, aged 5 years, 6 months to 9 years, 4 months using the temporal fine structure-adaptive frequency (TFS-AF) test of Füllgrabe, Harland, Sęk, and Moore [Int. J. Audiol. 56, 926-935 (2017)]. The test assesses the highest frequency at which an interaural phase difference (IPD) of ϕ° can be distinguished from an IPD of 0°. The values of ϕ were 30° and 180°. The starting frequency was 200 Hz. The thresholds for the children were significantly lower (worse) than the thresholds reported by Füllgrabe, Harland, Sęk, and Moore [Int. J. Audiol. 56, 926-935 (2017)] for YNHA. For both values of ϕ, the median age at which children performed above chance level was significantly higher (p < 0.001) than for those who performed at chance. For the subgroup of 40 children who performed above chance for ϕ = 180°, the linear regression analyses showed that the thresholds for ϕ = 180° increased (improved) significantly with increasing age (p < 0.001) with adult-like thresholds predicted to be reached at 10 years, 2 months of age. The implications for spatial release from masking are discussed.
Collapse
Affiliation(s)
- Sheila A Flanagan
- Department of Psychology, Centre for Neuroscience in Education, University of Cambridge, Downing Street, Cambridge CB2 3EB, United Kingdom
| | - Brian C J Moore
- Department of Psychology, Centre for Neuroscience in Education, University of Cambridge, Downing Street, Cambridge CB2 3EB, United Kingdom
| | - Angela M Wilson
- Department of Psychology, Centre for Neuroscience in Education, University of Cambridge, Downing Street, Cambridge CB2 3EB, United Kingdom
| | - Fiona C Gabrielczyk
- Department of Psychology, Centre for Neuroscience in Education, University of Cambridge, Downing Street, Cambridge CB2 3EB, United Kingdom
| | - Annabel MacFarlane
- Department of Psychology, Centre for Neuroscience in Education, University of Cambridge, Downing Street, Cambridge CB2 3EB, United Kingdom
| | - Kanad Mandke
- Department of Psychology, Centre for Neuroscience in Education, University of Cambridge, Downing Street, Cambridge CB2 3EB, United Kingdom
| | - Usha Goswami
- Department of Psychology, Centre for Neuroscience in Education, University of Cambridge, Downing Street, Cambridge CB2 3EB, United Kingdom
| |
Collapse
|
13
|
Ponsot E, Varnet L, Wallaert N, Daoud E, Shamma SA, Lorenzi C, Neri P. Mechanisms of Spectrotemporal Modulation Detection for Normal- and Hearing-Impaired Listeners. Trends Hear 2021; 25:2331216520978029. [PMID: 33620023 PMCID: PMC7905488 DOI: 10.1177/2331216520978029] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2019] [Revised: 10/26/2020] [Accepted: 11/06/2020] [Indexed: 11/20/2022] Open
Abstract
Spectrotemporal modulations (STM) are essential features of speech signals that make them intelligible. While their encoding has been widely investigated in neurophysiology, we still lack a full understanding of how STMs are processed at the behavioral level and how cochlear hearing loss impacts this processing. Here, we introduce a novel methodological framework based on psychophysical reverse correlation deployed in the modulation space to characterize the mechanisms underlying STM detection in noise. We derive perceptual filters for young normal-hearing and older hearing-impaired individuals performing a detection task of an elementary target STM (a given product of temporal and spectral modulations) embedded in other masking STMs. Analyzed with computational tools, our data show that both groups rely on a comparable linear (band-pass)-nonlinear processing cascade, which can be well accounted for by a temporal modulation filter bank model combined with cross-correlation against the target representation. Our results also suggest that the modulation mistuning observed for the hearing-impaired group results primarily from broader cochlear filters. Yet, we find idiosyncratic behaviors that cannot be captured by cochlear tuning alone, highlighting the need to consider variability originating from additional mechanisms. Overall, this integrated experimental-computational approach offers a principled way to assess suprathreshold processing distortions in each individual and could thus be used to further investigate interindividual differences in speech intelligibility.
Collapse
Affiliation(s)
- Emmanuel Ponsot
- Laboratoire des systèmes perceptifs, Département
d′études cognitives, École normale supérieure, Université PSL, CNRS,
Paris, France
- Hearing Technology @ WAVES, Department of Information
Technology, Ghent University, Ghent, Belgium
| | - Léo Varnet
- Laboratoire des systèmes perceptifs, Département
d′études cognitives, École normale supérieure, Université PSL, CNRS,
Paris, France
| | - Nicolas Wallaert
- Laboratoire des systèmes perceptifs, Département
d′études cognitives, École normale supérieure, Université PSL, CNRS,
Paris, France
| | - Elza Daoud
- Aix-Marseille Université, UMR CNRS 7260, Laboratoire
Neurosciences Intégratives et Adaptatives, Centre Saint-Charles,
Marseille, France
| | - Shihab A. Shamma
- Laboratoire des systèmes perceptifs, Département
d′études cognitives, École normale supérieure, Université PSL, CNRS,
Paris, France
| | - Christian Lorenzi
- Laboratoire des systèmes perceptifs, Département
d′études cognitives, École normale supérieure, Université PSL, CNRS,
Paris, France
| | - Peter Neri
- Laboratoire des systèmes perceptifs, Département
d′études cognitives, École normale supérieure, Université PSL, CNRS,
Paris, France
| |
Collapse
|
14
|
Yao JD, Gimoto J, Constantinople CM, Sanes DH. Parietal Cortex Is Required for the Integration of Acoustic Evidence. Curr Biol 2020; 30:3293-3303.e4. [PMID: 32619478 DOI: 10.1016/j.cub.2020.06.017] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2020] [Revised: 05/12/2020] [Accepted: 06/04/2020] [Indexed: 01/31/2023]
Abstract
Sensory-driven decisions are formed by accumulating information over time. Although parietal cortex activity is thought to represent accumulated evidence for sensory-based decisions, recent perturbation studies in rodents and non-human primates have challenged the hypothesis that these representations actually influence behavior. Here, we asked whether the parietal cortex integrates acoustic features from auditory cortical inputs during a perceptual decision-making task. If so, we predicted that selective inactivation of this projection should impair subjects' ability to accumulate sensory evidence. We trained gerbils to perform an auditory discrimination task and obtained measures of integration time as a readout of evidence accumulation capability. Minimum integration time was calculated behaviorally as the shortest stimulus duration for which subjects could discriminate the acoustic signals. Direct pharmacological inactivation of parietal cortex increased minimum integration times, suggesting its role in the behavior. To determine the specific impact of sensory evidence, we chemogenetically inactivated the excitatory projections from auditory cortex to parietal cortex and found this was sufficient to increase minimum behavioral integration times. Our signal-detection-theory-based model accurately replicated behavioral outcomes and indicated that the deficits in task performance were plausibly explained by elevated sensory noise. Together, our findings provide causal evidence that parietal cortex plays a role in the network that integrates auditory features for perceptual judgments.
Collapse
Affiliation(s)
- Justin D Yao
- Center for Neural Science, New York University, New York, NY 10003, USA.
| | - Justin Gimoto
- Center for Neural Science, New York University, New York, NY 10003, USA
| | - Christine M Constantinople
- Center for Neural Science, New York University, New York, NY 10003, USA; Neuroscience Institute, NYU Langone Medical Center, New York University, New York, NY 10016, USA
| | - Dan H Sanes
- Center for Neural Science, New York University, New York, NY 10003, USA; Department of Psychology, New York University, New York, NY 10003, USA; Department of Biology, New York University, New York, NY 10003, USA; Neuroscience Institute, NYU Langone Medical Center, New York University, New York, NY 10016, USA
| |
Collapse
|
15
|
Peng ZE, Kan A, Litovsky RY. Development of Binaural Sensitivity: Eye Gaze as a Measure of Real-time Processing. Front Syst Neurosci 2020; 14:39. [PMID: 32733212 PMCID: PMC7360356 DOI: 10.3389/fnsys.2020.00039] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2020] [Accepted: 05/27/2020] [Indexed: 11/13/2022] Open
Abstract
Children localize sounds using binaural cues when navigating everyday auditory environments. While sensitivity to binaural cues reaches maturity by 8-10 years of age, large individual variability has been observed in the just-noticeable-difference (JND) thresholds for interaural time difference (ITD) among children in this age range. To understand the development of binaural sensitivity beyond JND thresholds, the "looking-while-listening" paradigm was adapted in this study to reveal the real-time decision-making behavior during ITD processing. Children ages 8-14 years with normal hearing (NH) and a group of young NH adults were tested. This novel paradigm combined eye gaze tracking with behavioral psychoacoustics to estimate ITD JNDs in a two-alternative forced-choice discrimination task. Results from simultaneous eye gaze recordings during ITD processing suggested that children had adult-like ITD JNDs, but they demonstrated immature decision-making strategies. While the time course of arriving at the initial fixation and final decision in providing a judgment of the ITD direction was similar, children exhibited more uncertainty than adults during decision-making. Specifically, children made more fixation changes, particularly when tested using small ITD magnitudes, between the target and non-target response options prior to finalizing a judgment. These findings suggest that, while children may exhibit adult-like sensitivity to ITDs, their eye gaze behavior reveals that the processing of this binaural cue is still developing through late childhood.
Collapse
Affiliation(s)
- Z. Ellen Peng
- Waisman Center, University of Wisconsin-Madison, Madison, WI, United States
| | - Alan Kan
- Waisman Center, University of Wisconsin-Madison, Madison, WI, United States
- School of Engineering, Macquarie University, Sydney, NSW, Australia
| | - Ruth Y. Litovsky
- Waisman Center, University of Wisconsin-Madison, Madison, WI, United States
| |
Collapse
|
16
|
Gay JD, Rosen MJ, Huyck JJ. Effects of Gap Position on Perceptual Gap Detection Across Late Childhood and Adolescence. J Assoc Res Otolaryngol 2020; 21:243-258. [PMID: 32488537 DOI: 10.1007/s10162-020-00756-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2019] [Accepted: 04/28/2020] [Indexed: 11/27/2022] Open
Abstract
The ability to detect a silent gap within a sound is critical for accurate speech perception, and gap detection has been shown to have an extended developmental trajectory. In certain conditions, the detectability of the gap decreases as the gap is placed closer to the beginning of the signal. Early in development, the detection of gaps shortly after signal onset may be especially difficult due to immaturities in the encoding and perception of rapidly changing sounds. The present study explored the development of gap detection from age 8 to 19 years, specifically when the temporal placement of the gap varied. Performance improved with age for all temporal placements of the gap, demonstrating a gradual maturation of gap detection abilities throughout adolescence. Younger adolescents did not benefit from increasing gap onset times, while older adolescents' thresholds gradually improved as gap onset time lengthened. Regardless of age, listeners learned between the two testing days but did not improve within days. Younger adolescents had poorer thresholds for the last block of testing on the second day, returning to baseline performance despite learning between days. These data support earlier studies showing that gaps are harder to detect near stimulus onset and confirm that gap detection abilities continue to mature into adolescence. The data also suggest that younger adolescents do not receive the same benefit of increasing gap onset time and respond differently to repeated testing than older adolescents and young adults.
Collapse
Affiliation(s)
- Jennifer D Gay
- Department of Anatomy & Neurobiology, Northeast Ohio Medical University, 4209 State Route 44, Rootstown, OH, 44272, USA.,Biomedical Sciences Program, Kent State University, 800 East Summit St, Kent, OH, 44242, USA
| | - Merri J Rosen
- Department of Anatomy & Neurobiology, Northeast Ohio Medical University, 4209 State Route 44, Rootstown, OH, 44272, USA.,Kent State Brain Health Research Institute, Kent State University, 251M Integrated Sciences Building, 1175 Lefton Esplanade, Kent, OH, 44242, USA
| | - Julia Jones Huyck
- Kent State Brain Health Research Institute, Kent State University, 251M Integrated Sciences Building, 1175 Lefton Esplanade, Kent, OH, 44242, USA. .,Speech Pathology and Audiology Program, Kent State University, 1325 Theatre Drive, Kent, OH, 44242, USA.
| |
Collapse
|