1
|
Xi X, Li JN, Yuen KCP, Chen AT, Li SQ, Hong MD, Wang Q, Ji F, Dillon H, Ching TYC. List Equivalency and Critical Differences of a Mandarin Bamford-Kowal-Bench Sentence in Babble Noise Test for Adults and Preschool Children With Normal Hearing. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2023; 66:5061-5070. [PMID: 37889230 DOI: 10.1044/2023_jslhr-23-00025] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/28/2023]
Abstract
PURPOSE The purpose of this study was to determine the speech recognition equivalence of Mandarin Bamford-Kowal-Bench (BKB) sentence lists with adults and children with normal hearing. METHOD A total of 32 lists, each of nine sentences, were compiled from a corpus of BKB-like sentences with paired babble in Mandarin. Interlist equivalence, critical differences, and sensitivity of performance to signal-to-noise ratio (SNR) were examined. Experiment 1 included 64 native Mandarin-speaking adults with normal hearing. Experiment 2 included 54 native Mandarin-speaking children with normal hearing aged 4-6 years. RESULTS Among the 32 sentence lists, 28 lists were confirmed to be equivalent in adults, with a mean SNR required for 50% correct (SNR50) of -5.9 ± 0.1 dB, a mean slope of 22.3%/dB ± 1.5%/dB, and a grand 95% critical difference subsequently calculated as 27.2% for score. From the 28 equivalent lists, 27 lists were selected and observed to be equivalent in children, with a mean SNR50 threshold of -2.0 ± 0.2 dB, a mean slope of 15.8%/dB ± 1.1%/dB, and a grand 95% critical difference of 24.6% for score. CONCLUSIONS The Mandarin BKB sentences in babble noise test offers an opportunity for clinicians and researchers to assess speech understanding in adults and preschool children in an efficient manner. For comparisons of performance in different test conditions, 28 equivalent lists are available for adults and 27 equivalent lists for preschool children. The 95% critical difference values can be used for total percentage correct or SNR for 50% performance. Future work will examine the clinical utility for school-age children and children who are deaf and hard of hearing. SUPPLEMENTAL MATERIAL https://doi.org/10.23641/asha.24400066.
Collapse
Affiliation(s)
- Xin Xi
- Department of Otolaryngology-Head & Neck Surgery, the Sixth Medical Center of PLA General Hospital, Beijing, China
- National Clinical Research Center for Otolaryngologic Diseases, Beijing, China
| | - Jia-Nan Li
- Department of Otolaryngology-Head & Neck Surgery, the Sixth Medical Center of PLA General Hospital, Beijing, China
- National Clinical Research Center for Otolaryngologic Diseases, Beijing, China
| | - Kevin C P Yuen
- Department of Special Education and Counselling, The Education University of Hong Kong, HKSAR, China
| | - Ai-Ting Chen
- Department of Otolaryngology-Head & Neck Surgery, the Sixth Medical Center of PLA General Hospital, Beijing, China
- National Clinical Research Center for Otolaryngologic Diseases, Beijing, China
| | - Si-Qi Li
- School of Communication Science, Beijing Language and Culture University, China
| | - Meng-Di Hong
- Department of Otolaryngology-Head & Neck Surgery, the Sixth Medical Center of PLA General Hospital, Beijing, China
- National Clinical Research Center for Otolaryngologic Diseases, Beijing, China
| | - Qian Wang
- Department of Otolaryngology-Head & Neck Surgery, the Sixth Medical Center of PLA General Hospital, Beijing, China
- National Clinical Research Center for Otolaryngologic Diseases, Beijing, China
| | - Fei Ji
- Department of Otolaryngology-Head & Neck Surgery, the Sixth Medical Center of PLA General Hospital, Beijing, China
- National Clinical Research Center for Otolaryngologic Diseases, Beijing, China
| | - Harvey Dillon
- Department of Linguistics, Macquarie University, Sydney, New South Wales, Australia
- Manchester Centre for Audiology and Deafness, School of Health Sciences, University of Manchester, United Kingdom
| | - Teresa Y C Ching
- Macquarie School of Education, Macquarie University, Sydney, New South Wales, Australia
- NextSense Institute, Sydney, New South Wales, Australia
- School of Health and Rehabilitation Sciences, University of Queensland, Brisbane, Australia
| |
Collapse
|
2
|
Byrne AJ, Conroy C, Kidd G. Individual differences in speech-on-speech masking are correlated with cognitive and visual task performance. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2023; 154:2137-2153. [PMID: 37800988 PMCID: PMC10631817 DOI: 10.1121/10.0021301] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/22/2023] [Revised: 07/19/2023] [Accepted: 09/17/2023] [Indexed: 10/07/2023]
Abstract
Individual differences in spatial tuning for masked target speech identification were determined using maskers that varied in type and proximity to the target source. The maskers were chosen to produce three strengths of informational masking (IM): high [same-gender, speech-on-speech (SOS) masking], intermediate (the same masker speech time-reversed), and low (speech-shaped, speech-envelope-modulated noise). Typical for this task, individual differences increased as IM increased, while overall performance decreased. To determine the extent to which auditory performance might generalize to another sensory modality, a comparison visual task was also implemented. Visual search time was measured for identifying a cued object among "clouds" of distractors that were varied symmetrically in proximity to the target. The visual maskers also were chosen to produce three strengths of an analog of IM based on feature similarities between the target and maskers. Significant correlations were found for overall auditory and visual task performance, and both of these measures were correlated with an index of general cognitive reasoning. Overall, the findings provide qualified support for the proposition that the ability of an individual to solve IM-dominated tasks depends on cognitive mechanisms that operate in common across sensory modalities.
Collapse
Affiliation(s)
- Andrew J Byrne
- Department of Speech, Language and Hearing Sciences and Hearing Research Center, Boston University, Boston, Massachusetts 02215, USA
| | - Christopher Conroy
- Department of Biological and Vision Sciences, State University of New York College of Optometry, New York, New York 10036, USA
| | - Gerald Kidd
- Department of Speech, Language and Hearing Sciences and Hearing Research Center, Boston University, Boston, Massachusetts 02215, USA
| |
Collapse
|
3
|
Development of Masked Speech Detection Thresholds in 2- to 15-year-old Children: Speech-Shaped Noise and Two-Talker Speech Maskers. Ear Hear 2021; 42:1712-1726. [PMID: 33928913 DOI: 10.1097/aud.0000000000001062] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
OBJECTIVES On the basis of the data from school-aged children, there is consistent evidence that there is a prolonged course of auditory development for perceiving speech embedded in competing background sounds. Furthermore, age-related differences are prolonged and pronounced for a two-talker speech masker compared to a speech-shaped noise masker. However, little is known about the course of development during the toddler and preschool years because it is difficult to collect reliable behavioral data from this age range. The goal of this study was to extend our lower age limit to include toddlers and preschoolers to characterize the developmental trajectory for masked speech detection thresholds across childhood. DESIGN Participants were 2- to 15-year-old children (n = 67) and adults (n = 17), all with normal hearing. Thresholds (71%) were measured for detecting a two-syllable word embedded in one of two maskers: speech-shaped noise or two-talker speech. The masker was presented at 55 dB SPL throughout testing. Stimuli were presented to the left ear via a lightweight headphone. Data were collected using an observer-based testing method in which the participant's behavior was judged by an experimenter using a two-interval, two-alternative testing paradigm. The participant's response to the stimulus was shaped by training him/her to perform a conditioned play-based response to the sound. For children, receptive vocabulary and working memory were measured. Data were fitted with a linear regression model to establish the course of development for each masker condition. Appropriateness of the test method was also evaluated by determining if there were age-related differences in training data, inter-rater reliability, or slope or upper asymptote estimates from pooled psychometric functions across different age groups. RESULTS Child and adult speech detection thresholds were poorer in the two-talker masker than in the speech-shaped noise masker, but different developmental trajectories were seen for the two masker conditions. For the speech-shaped noise masker, threshold improved by about 5 dB across the age span tested, with adult-like performance being reached around 10 years of age. For the two-talker masker condition, thresholds improved by about 7 dB between 2.5 and 15 years. However, the linear fit for this condition failed to achieve adult-like performance because of limited data from teenagers. No significant age-related differences were seen in training data, probe hit rate, or inter-rater reliability. Furthermore, slope and upper asymptote estimates from pooled psychometric functions were similar across different child age groups. CONCLUSIONS Different developmental patterns were seen across the two maskers, with more pronounced child-adult differences and prolonged immaturity during childhood for the two-talker masker relative to the speech-shaped noise masker. Our data do not support the idea that there is rapid improvement of masked speech detection thresholds between 2.5 and 5 years of age. This study also highlights that our observer-based method can be used to collect reliable behavioral data from toddlers and preschoolers-a time period where we know little about auditory development.
Collapse
|
4
|
Wang X, Xu L. Speech perception in noise: Masking and unmasking. J Otol 2021; 16:109-119. [PMID: 33777124 PMCID: PMC7985001 DOI: 10.1016/j.joto.2020.12.001] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2020] [Revised: 12/03/2020] [Accepted: 12/06/2020] [Indexed: 11/23/2022] Open
Abstract
Speech perception is essential for daily communication. Background noise or concurrent talkers, on the other hand, can make it challenging for listeners to track the target speech (i.e., cocktail party problem). The present study reviews and compares existing findings on speech perception and unmasking in cocktail party listening environments in English and Mandarin Chinese. The review starts with an introduction section followed by related concepts of auditory masking. The next two sections review factors that release speech perception from masking in English and Mandarin Chinese, respectively. The last section presents an overall summary of the findings with comparisons between the two languages. Future research directions with respect to the difference in literature on the reviewed topic between the two languages are also discussed.
Collapse
Affiliation(s)
- Xianhui Wang
- Communication Sciences and Disorders, Ohio University, Athens, OH, 45701, USA
| | - Li Xu
- Communication Sciences and Disorders, Ohio University, Athens, OH, 45701, USA
| |
Collapse
|
5
|
Huang W, Wong LLN, Chen F, Liu H, Liang W. Effects of Fundamental Frequency Contours on Sentence Recognition in Mandarin-Speaking Children With Cochlear Implants. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2020; 63:3855-3864. [PMID: 33022190 DOI: 10.1044/2020_jslhr-20-00033] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Purpose Fundamental frequency (F0) is the primary acoustic cue for lexical tone perception in tonal languages but is processed in a limited way in cochlear implant (CI) systems. The aim of this study was to evaluate the importance of F0 contours in sentence recognition in Mandarin-speaking children with CIs and find out whether it is similar to/different from that in age-matched normal-hearing (NH) peers. Method Age-appropriate sentences, with F0 contours manipulated to be either natural or flattened, were randomly presented to preschool children with CIs and their age-matched peers with NH under three test conditions: in quiet, in white noise, and with competing sentences at 0 dB signal-to-noise ratio. Results The neutralization of F0 contours resulted in a significant reduction in sentence recognition. While this was seen only in noise conditions among NH children, it was observed throughout all test conditions among children with CIs. Moreover, the F0 contour-induced accuracy reduction ratios (i.e., the reduction in sentence recognition resulting from the neutralization of F0 contours compared to the normal F0 condition) were significantly greater in children with CIs than in NH children in all test conditions. Conclusions F0 contours play a major role in sentence recognition in both quiet and noise among pediatric implantees, and the contribution of the F0 contour is even more salient than that in age-matched NH children. These results also suggest that there may be differences between children with CIs and NH children in how F0 contours are processed.
Collapse
Affiliation(s)
- Wanting Huang
- Unit of Human Communication, Development, and Information Sciences, Faculty of Education, The University of Hong Kong, China
| | - Lena L N Wong
- Unit of Human Communication, Development, and Information Sciences, Faculty of Education, The University of Hong Kong, China
| | - Fei Chen
- Department of Electrical and Electronic Engineering, Southern University of Science and Technology, Shenzhen, China
| | - Haihong Liu
- Beijing Key Laboratory of Pediatric Diseases of Otolaryngology, Head and Neck Surgery, Beijing Children's Hospital, China
| | - Wei Liang
- China Rehabilitation Research Center for Hearing and Speech Impairment, Beijing, China
| |
Collapse
|
6
|
Salorio-Corbetto M, Baer T, Stone MA, Moore BCJ. Effect of the number of amplitude-compression channels and compression speed on speech recognition by listeners with mild to moderate sensorineural hearing loss. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2020; 147:1344. [PMID: 32237835 DOI: 10.1121/10.0000804] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/08/2019] [Accepted: 02/09/2020] [Indexed: 06/11/2023]
Abstract
The use of a large number of amplitude-compression channels in hearing aids has potential advantages, such as the ability to compensate for variations in loudness recruitment across frequency and provide appropriate frequency-response shaping. However, sound quality and speech intelligibility could be adversely affected due to reduction of spectro-temporal contrast and distortion, especially when fast-acting compression is used. This study assessed the effect of the number of channels and compression speed on speech recognition when the multichannel processing was used solely to implement amplitude compression, and not for frequency-response shaping. Computer-simulated hearing aids were used. The frequency-dependent insertion gains for speech with a level of 65 dB sound pressure level were applied using a single filter before the signal was filtered into compression channels. Fast-acting (attack, 10 ms; release, 100 ms) or slow-acting (attack, 50 ms; release, 3000 ms) compression using 3, 6, 12, and 22 channels was applied subsequently. Using a sentence recognition task with speech in two- and eight-talker babble at three different signal-to-babble ratios (SBRs), 20 adults with sensorineural hearing loss were tested. The number of channels and compression speed had no significant effect on speech recognition, regardless of babble type or SBR.
Collapse
Affiliation(s)
- Marina Salorio-Corbetto
- Department of Experimental Psychology, University of Cambridge, Downing Street, Cambridge CB2 3EB, United Kingdom
| | - Thomas Baer
- Department of Experimental Psychology, University of Cambridge, Downing Street, Cambridge CB2 3EB, United Kingdom
| | - Michael A Stone
- Division of Human Communication, Development and Hearing, University of Manchester, Oxford Road, Manchester M13 9PL, United Kingdom
| | - Brian C J Moore
- Department of Experimental Psychology, University of Cambridge, Downing Street, Cambridge CB2 3EB, United Kingdom
| |
Collapse
|
7
|
Summers RJ, Roberts B. Informational masking of speech by acoustically similar intelligible and unintelligible interferers. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2020; 147:1113. [PMID: 32113320 DOI: 10.1121/10.0000688] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/17/2019] [Accepted: 01/19/2020] [Indexed: 06/10/2023]
Abstract
Masking experienced when target speech is accompanied by a single interfering voice is often primarily informational masking (IM). IM is generally greater when the interferer is intelligible than when it is not (e.g., speech from an unfamiliar language), but the relative contributions of acoustic-phonetic and linguistic interference are often difficult to assess owing to acoustic differences between interferers (e.g., different talkers). Three-formant analogues (F1+F2+F3) of natural sentences were used as targets and interferers. Targets were presented monaurally either alone or accompanied contralaterally by interferers from another sentence (F0 = 4 semitones higher); a target-to-masker ratio (TMR) between ears of 0, 6, or 12 dB was used. Interferers were either intelligible or rendered unintelligible by delaying F2 and advancing F3 by 150 ms relative to F1, a manipulation designed to minimize spectro-temporal differences between corresponding interferers. Target-sentence intelligibility (keywords correct) was 67% when presented alone, but fell considerably when an unintelligible interferer was present (49%) and significantly further when the interferer was intelligible (41%). Changes in TMR produced neither a significant main effect nor an interaction with interferer type. Interference with acoustic-phonetic processing of the target can explain much of the impact on intelligibility, but linguistic factors-particularly interferer intrusions-also make an important contribution to IM.
Collapse
Affiliation(s)
- Robert J Summers
- Psychology, School of Life and Health Sciences, Aston University, Birmingham B4 7ET, United Kingdom
| | - Brian Roberts
- Psychology, School of Life and Health Sciences, Aston University, Birmingham B4 7ET, United Kingdom
| |
Collapse
|
8
|
Jarollahi F, Amiri M, Jalaie S, Sameni SJ. The effects of auditory spatial training on informational masking release in elderly listeners: a study protocol for a randomized clinical trial. F1000Res 2019; 8:420. [PMID: 31354946 PMCID: PMC6652096 DOI: 10.12688/f1000research.18602.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 04/03/2019] [Indexed: 10/14/2023] Open
Abstract
Background: Regarding the strong auditory spatial plasticity capability of the central auditory system and the effect of short-term and long-term rehabilitation programs in elderly people, it seems that an auditory spatial training can help this population in informational masking release and better track speech in noisy environments. The main purposes of this study are developing an informational masking measurement test and an auditory spatial training program. Protocol: This study will be conducted in two parts. Part 1: develop and determine the validity of an informational masking measurement test by recruiting two groups of young (n=50) and old (n=50) participants with normal hearing who have no difficulty in understanding speech in noisy environments. Part 2 (clinical trial): two groups of 60-75-year-olds with normal hearing, who complain about difficulty in speech perception in noisy environments, will participate as control and intervention groups to examine the effect of auditory spatial training. Intervention: 8 sessions of auditory spatial training. The informational masking measurement test and Speech, Spatial and Qualities of Hearing Scale will be compared before intervention, immediately after intervention, and one month after intervention between the two groups. Discussion: Since auditory training programs do not deal with informational masking release, an auditory spatial training will be designed, aiming to improve hearing in noisy environments for elderly populations. Trial registration: Iranian Registry of Clinical Trials ( IRCT20190118042404N1) on 25 th February 2019.
Collapse
Affiliation(s)
- Farnoush Jarollahi
- Department of Audiology, School of Rehabilitation Sciences, Iran University of Medical Sciences, Tehran, Iran
| | - Marzieh Amiri
- Department of Audiology, School of Rehabilitation Sciences, Iran University of Medical Sciences, Tehran, Iran
| | - Shohreh Jalaie
- Department of Physiotherapy, School of Rehabilitation Sciences, Tehran University of Medical Sciences, Tehran, Iran
| | - Seyyed Jalal Sameni
- Department of Audiology, School of Rehabilitation Sciences, Iran University of Medical Sciences, Tehran, Iran
| |
Collapse
|
9
|
Jarollahi F, Amiri M, Jalaie S, Sameni SJ. The effects of auditory spatial training on informational masking release in elderly listeners: a study protocol for a randomized clinical trial. F1000Res 2019; 8:420. [PMID: 31354946 PMCID: PMC6652096 DOI: 10.12688/f1000research.18602.2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 06/26/2019] [Indexed: 11/20/2022] Open
Abstract
Background: Regarding the strong auditory spatial plasticity capability of the central auditory system and the effect of short-term and long-term rehabilitation programs in elderly people, it seems that an auditory spatial training can help this population in informational masking release and better track speech in noisy environments. The main purposes of this study are developing an informational masking measurement test and an auditory spatial training program. Protocol: This study will be conducted in two parts. Part 1: develop and determine the validity of an informational masking measurement test by recruiting two groups of young (n=50) and old (n=50) participants with normal hearing who have no difficulty in understanding speech in noisy environments. Part 2 (clinical trial): two groups of 60-75-year-olds with normal hearing, who complain about difficulty in speech perception in noisy environments, will participate as control and intervention groups to examine the effect of auditory spatial training. Intervention: 15 sessions of auditory spatial training. The informational masking measurement test and Speech, Spatial and Qualities of Hearing Scale will be compared before intervention, immediately after intervention, and five weeks after intervention between the two groups. Discussion: Since auditory training programs do not deal with informational masking release, an auditory spatial training will be designed, aiming to improve hearing in noisy environments for elderly populations. Trial registration: Iranian Registry of Clinical Trials ( IRCT20190118042404N1) on 25 th February 2019.
Collapse
Affiliation(s)
- Farnoush Jarollahi
- Department of Audiology, School of Rehabilitation Sciences, Iran University of Medical Sciences, Tehran, Iran
| | - Marzieh Amiri
- Department of Audiology, School of Rehabilitation Sciences, Iran University of Medical Sciences, Tehran, Iran
| | - Shohreh Jalaie
- Department of Physiotherapy, School of Rehabilitation Sciences, Tehran University of Medical Sciences, Tehran, Iran
| | - Seyyed Jalal Sameni
- Department of Audiology, School of Rehabilitation Sciences, Iran University of Medical Sciences, Tehran, Iran
| |
Collapse
|
10
|
Calandruccio L, Buss E, Bencheck P, Jett B. Does the semantic content or syntactic regularity of masker speech affect speech-on-speech recognition? THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2018; 144:3289. [PMID: 30599661 PMCID: PMC6786886 DOI: 10.1121/1.5081679] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/25/2018] [Revised: 11/07/2018] [Accepted: 11/09/2018] [Indexed: 05/30/2023]
Abstract
Speech-on-speech recognition differs substantially across stimuli, but it is unclear what role linguistic features of the masker play in this variability. The linguistic similarity hypothesis suggests similarity between sentence-level semantic content of the target and masker speech increases masking. Sentence recognition in a two-talker masker was evaluated with respect to semantic content and syntactic structure of the masker (experiment 1) and linguistic similarity of the target and masker (experiment 2). Target and masker sentences were semantically meaningful or anomalous. Masker syntax was varied or the same across sentences. When other linguistic features of the masker were controlled, variability in syntactic structure across masker tokens was only relevant when the masker was played continuously (as opposed to gated); when played continuously, sentence-recognition thresholds were poorer with variable than consistent masker syntax, but this effect was small (0.5 dB). When the syntactic structure of the masker was held constant, semantic meaningfulness of the masker did not increase masking, and at times performance was better for the meaningful than the anomalous masker. These data indicate that sentence-level semantic content of the masker speech does not influence speech-on-speech masking. Further, no evidence that similarities between target/masker sentence-level semantic content increases masking was found.
Collapse
Affiliation(s)
- Lauren Calandruccio
- Department of Psychological Sciences, Case Western Reserve University, Cleveland, Ohio 44106, USA
| | - Emily Buss
- Department of Head/Neck Surgery and Otolaryngology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599, USA
| | - Penelope Bencheck
- Department of Population and Quantitative Health Sciences, Case Western Reserve University, Cleveland, Ohio 44106, USA
| | - Brandi Jett
- Department of Psychological Sciences, Case Western Reserve University, Cleveland, Ohio 44106, USA
| |
Collapse
|
11
|
Leibold LJ. Speech Perception in Complex Acoustic Environments: Developmental Effects. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2017; 60:3001-3008. [PMID: 29049600 PMCID: PMC5945069 DOI: 10.1044/2017_jslhr-h-17-0070] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/20/2017] [Accepted: 06/19/2017] [Indexed: 05/06/2023]
Abstract
PURPOSE The ability to hear and understand speech in complex acoustic environments follows a prolonged time course of development. The purpose of this article is to provide a general overview of the literature describing age effects in susceptibility to auditory masking in the context of speech recognition, including a summary of findings related to the maturation of processes thought to facilitate segregation of target from competing speech. METHOD Data from published and ongoing studies are discussed, with a focus on synthesizing results from studies that address age-related changes in the ability to perceive speech in the presence of a small number of competing talkers. CONCLUSIONS This review provides a summary of the current state of knowledge that is valuable for researchers and clinicians. It highlights the importance of considering listener factors, such as age and hearing status, as well as stimulus factors, such as masker type, when interpreting masked speech recognition data. PRESENTATION VIDEO http://cred.pubs.asha.org/article.aspx?articleid=2601620.
Collapse
Affiliation(s)
- Lori J. Leibold
- Center for Hearing Research, Boys Town National Research Hospital, Omaha, NE
| |
Collapse
|
12
|
Kidd G, Colburn HS. Informational Masking in Speech Recognition. SPRINGER HANDBOOK OF AUDITORY RESEARCH 2017. [DOI: 10.1007/978-3-319-51662-2_4] [Citation(s) in RCA: 52] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/04/2022]
|
13
|
Kidd G, Mason CR, Swaminathan J, Roverud E, Clayton KK, Best V. Determining the energetic and informational components of speech-on-speech masking. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2016; 140:132. [PMID: 27475139 PMCID: PMC5392100 DOI: 10.1121/1.4954748] [Citation(s) in RCA: 68] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/13/2023]
Abstract
Identification of target speech was studied under masked conditions consisting of two or four independent speech maskers. In the reference conditions, the maskers were colocated with the target, the masker talkers were the same sex as the target, and the masker speech was intelligible. The comparison conditions, intended to provide release from masking, included different-sex target and masker talkers, time-reversal of the masker speech, and spatial separation of the maskers from the target. Significant release from masking was found for all comparison conditions. To determine whether these reductions in masking could be attributed to differences in energetic masking, ideal time-frequency segregation (ITFS) processing was applied so that the time-frequency units where the masker energy dominated the target energy were removed. The remaining target-dominated "glimpses" were reassembled as the stimulus. Speech reception thresholds measured using these resynthesized ITFS-processed stimuli were the same for the reference and comparison conditions supporting the conclusion that the amount of energetic masking across conditions was the same. These results indicated that the large release from masking found under all comparison conditions was due primarily to a reduction in informational masking. Furthermore, the large individual differences observed generally were correlated across the three masking release conditions.
Collapse
Affiliation(s)
- Gerald Kidd
- Department of Speech, Language and Hearing Sciences and Hearing Research Center, Boston University, 635 Commonwealth Avenue, Boston, Massachusetts 02215, USA
| | - Christine R Mason
- Department of Speech, Language and Hearing Sciences and Hearing Research Center, Boston University, 635 Commonwealth Avenue, Boston, Massachusetts 02215, USA
| | - Jayaganesh Swaminathan
- Department of Speech, Language and Hearing Sciences and Hearing Research Center, Boston University, 635 Commonwealth Avenue, Boston, Massachusetts 02215, USA
| | - Elin Roverud
- Department of Speech, Language and Hearing Sciences and Hearing Research Center, Boston University, 635 Commonwealth Avenue, Boston, Massachusetts 02215, USA
| | - Kameron K Clayton
- Department of Speech, Language and Hearing Sciences and Hearing Research Center, Boston University, 635 Commonwealth Avenue, Boston, Massachusetts 02215, USA
| | - Virginia Best
- Department of Speech, Language and Hearing Sciences and Hearing Research Center, Boston University, 635 Commonwealth Avenue, Boston, Massachusetts 02215, USA
| |
Collapse
|