1
|
Netto AFA, Zanotelli T, Felix LB. Multi-channel and multi-harmonic analysis of Auditory Steady-State Response detection. Comput Methods Biomech Biomed Engin 2024; 27:276-284. [PMID: 36803329 DOI: 10.1080/10255842.2023.2181041] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2022] [Revised: 01/17/2023] [Accepted: 02/09/2023] [Indexed: 02/22/2023]
Abstract
The Auditory Steady-State Response (ASSR) is a type of auditory evoked potential (AEP) generated in the auditory system that can be automatically detected by means of objective response detectors (ORDs). ASSRs are usually registered on the scalp using electroencephalography (EEG). ORD are univariate techniques, i.e. only uses one data channel. However, techniques involving more than one channel - multi-channel objective response detectors (MORDs) - have been showing higher detection rate (DR) when compared to ORD techniques. When ASSR is evoked by amplitude stimuli, the responses could be detected by analyzing the modulation frequencies and their harmonics. Despite this, ORD techniques are traditionally applied only in its first harmonic. This approach is known as one-sample test. The q-sample tests, however, considers harmonics beyond the first. Thus, this work proposes and evaluates the use of q-sample tests using a combination of multiple EEG channels and multiple harmonics of the stimulation frequencies and compare them with traditional one-sample tests. The database used consists of EEG channels from 24 volunteers with normal auditory threshold collected following a binaural stimulation protocol by amplitude modulated (AM) tone with modulating frequencies near 80 Hz. The best q-sample MORD result showed an increase in DR of 45.25% when compared with the best one-sample ORD test. Thus, it is recommended to use multiple channels and multiple harmonics, whenever available.
Collapse
Affiliation(s)
| | - Tiago Zanotelli
- Federal Institute of Education Science and Technology of Espírito Santo-São Mateus, São Mateus, ES, Brazil
| | - Leonardo Bonato Felix
- Department of Electrical Engineering, Federal University of Viçosa, Viçosa, MG, Brazil
| |
Collapse
|
2
|
Perugia E, Marmel F, Kluk K. Feasibility of Diagnosing Dead Regions Using Auditory Steady-State Responses to an Exponentially Amplitude Modulated Tone in Threshold Equalizing Notched Noise, Assessed Using Normal-Hearing Participants. Trends Hear 2023; 27:23312165231173234. [PMID: 37384583 PMCID: PMC10336760 DOI: 10.1177/23312165231173234] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2021] [Revised: 04/11/2023] [Accepted: 04/14/2023] [Indexed: 07/01/2023] Open
Abstract
The aim of this study was to assess feasibility of using electrophysiological auditory steady-state response (ASSR) masking for detecting dead regions (DRs). Fifteen normally hearing adults were tested using behavioral and electrophysiological tasks. In the electrophysiological task, ASSRs were recorded to a 2 kHz exponentially amplitude-modulated tone (AM2) presented within a notched threshold equalizing noise (TEN) whose center frequency (CFNOTCH) varied. We hypothesized that, in the absence of DRs, ASSR amplitudes would be largest for CFNOTCH at/or near the signal frequency. In the presence of a DR at the signal frequency, the largest ASSR amplitude would occur at a frequency (fmax) far away from the signal frequency. The AM2 and the TEN were presented at 60 and 75 dB SPL, respectively. In the behavioral task, for the same maskers as above, the masker level at which an AM and a pure tone could just be distinguished, denoted AM2ML, was determined, for low (10 dB above absolute AM2 threshold) and high (60 dB SPL) signal levels. We also hypothesized that the value of fmax would be similar for both techniques. The ASSR fmax values obtained from grand average ASSR amplitudes, but not from individual amplitudes, were consistent with our hypotheses. The agreement between the behavioral fmax and ASSR fmax was poor. The within-session ASSR-amplitude repeatability was good for AM2 alone, but poor for AM2 in notched TEN. The ASSR-amplitude variability between and within participants seems to be a major roadblock to developing our approach into an effective DR detection method.
Collapse
Affiliation(s)
- Emanuele Perugia
- Manchester Centre for Audiology and Deafness, School of Health Sciences, Faculty of Biology, Medicine and Health, University of Manchester, Manchester, UK
| | - Frederic Marmel
- Manchester Centre for Audiology and Deafness, School of Health Sciences, Faculty of Biology, Medicine and Health, University of Manchester, Manchester, UK
| | - Karolina Kluk
- Manchester Centre for Audiology and Deafness, School of Health Sciences, Faculty of Biology, Medicine and Health, University of Manchester, Manchester, UK
| |
Collapse
|
3
|
Easwar V, Purcell D, Wright T. Predicting Hearing aid Benefit Using Speech-Evoked Envelope Following Responses in Children With Hearing Loss. Trends Hear 2023; 27:23312165231151468. [PMID: 36946195 PMCID: PMC10034298 DOI: 10.1177/23312165231151468] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2022] [Revised: 12/24/2022] [Accepted: 12/30/2022] [Indexed: 03/23/2023] Open
Abstract
Electroencephalography could serve as an objective tool to evaluate hearing aid benefit in infants who are developmentally unable to participate in hearing tests. We investigated whether speech-evoked envelope following responses (EFRs), a type of electroencephalography-based measure, could predict improved audibility with the use of a hearing aid in children with mild-to-severe permanent, mainly sensorineural, hearing loss. In 18 children, EFRs were elicited by six male-spoken band-limited phonemic stimuli--the first formants of /u/ and /i/, the second and higher formants of /u/ and /i/, and the fricatives /s/ and /∫/--presented together as /su∫i/. EFRs were recorded between the vertex and nape, when /su∫i/ was presented at 55, 65, and 75 dB SPL using insert earphones in unaided conditions and individually fit hearing aids in aided conditions. EFR amplitude and detectability improved with the use of a hearing aid, and the degree of improvement in EFR amplitude was dependent on the extent of change in behavioral thresholds between unaided and aided conditions. EFR detectability was primarily influenced by audibility; higher sensation level stimuli had an increased probability of detection. Overall EFR sensitivity in predicting audibility was significantly higher in aided (82.1%) than unaided conditions (66.5%) and did not vary as a function of stimulus or frequency. EFR specificity in ascertaining inaudibility was 90.8%. Aided improvement in EFR detectability was a significant predictor of hearing aid-facilitated change in speech discrimination accuracy. Results suggest that speech-evoked EFRs could be a useful objective tool in predicting hearing aid benefit in children with hearing loss.
Collapse
Affiliation(s)
- Vijayalakshmi Easwar
- Department of Communication Sciences and Disorders & Waisman
Center, University of
Wisconsin–Madison, Madison, USA
- National
Acoustic Laboratories, Macquarie
University, Sydney, New South Wales, Australia
| | - David Purcell
- School of Communication Sciences and Disorders,
Western
University, London, Canada
- National Centre for Audiology, Western
University, London, Canada
| | - Trevor Wright
- Department of Communication Sciences and Disorders & Waisman
Center, University of
Wisconsin–Madison, Madison, USA
| |
Collapse
|
4
|
Easwar V, Aiken S, Beh K, McGrath E, Galloy M, Scollie S, Purcell D. Variability in the Estimated Amplitude of Vowel-Evoked Envelope Following Responses Caused by Assumed Neurophysiologic Processing Delays. J Assoc Res Otolaryngol 2022; 23:759-769. [PMID: 36002663 PMCID: PMC9789223 DOI: 10.1007/s10162-022-00855-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2021] [Accepted: 06/16/2022] [Indexed: 01/06/2023] Open
Abstract
Vowel-evoked envelope following responses (EFRs) reflect neural encoding of the fundamental frequency of voice (f0). Accurate analysis of EFRs elicited by natural vowels requires the use of methods like the Fourier analyzer (FA) to consider the production-related f0 changes. The FA's accuracy in estimating EFRs is, however, dependent on the assumed neurophysiological processing delay needed to time-align the f0 time course and the recorded electroencephalogram (EEG). For male-spoken vowels (f0 ~ 100 Hz), a constant 10-ms delay correction is often assumed. Since processing delays vary with stimulus and physiological factors, we quantified (i) the delay-related variability that would occur in EFR estimation, and (ii) the influence of stimulus frequency, non-f0 related neural activity, and the listener's age on such variability. EFRs were elicited by the low-frequency first formant, and mid-frequency second and higher formants of /u/, /a/, and /i/ in young adults and 6- to 17-year-old children. To time-align with the f0 time course, EEG was shifted by delays between 5 and 25 ms to encompass plausible response latencies. The delay-dependent range in EFR amplitude did not vary by stimulus frequency or age and was significantly smaller when interference from low-frequency activity was reduced. On average, the delay-dependent range was < 22% of the maximum variability in EFR amplitude that could be expected by noise. Results suggest that using a constant EEG delay correction in FA analysis does not substantially alter EFR amplitude estimation. In the present study, the lack of substantial variability was likely facilitated by using vowels with small f0 ranges.
Collapse
Affiliation(s)
- Vijayalakshmi Easwar
- Department of Communication Sciences and Disorders & Waisman Center, University of Wisconsin-Madison, Madison, WI, USA.
- National Acoustic Laboratories, Sydney, Australia.
| | - Steven Aiken
- School of Communication Sciences and Disorders, Dalhousie University, Nova Scotia, Canada
| | - Krystal Beh
- Department of Communication Sciences and Disorders & National Centre for Audiology, Western University, London, ON, Canada
| | - Emma McGrath
- Department of Communication Sciences and Disorders & Waisman Center, University of Wisconsin-Madison, Madison, WI, USA
| | - Mary Galloy
- Department of Communication Sciences and Disorders & Waisman Center, University of Wisconsin-Madison, Madison, WI, USA
| | - Susan Scollie
- Department of Communication Sciences and Disorders & National Centre for Audiology, Western University, London, ON, Canada
| | - David Purcell
- Department of Communication Sciences and Disorders & National Centre for Audiology, Western University, London, ON, Canada
| |
Collapse
|
5
|
Performance of Statistical Indicators in the Objective Detection of Speech-Evoked Envelope Following Responses. Ear Hear 2022; 43:1669-1677. [PMID: 35499293 DOI: 10.1097/aud.0000000000001232] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
OBJECTIVES To assess the sensitivity of statistical indicators used for the objective detection of speech-evoked envelope following responses (EFRs) in infants and adults. DESIGN Twenty-three adults and 21 infants with normal hearing participated in this study. A modified/susa∫i/speech token was presented at 65 dB SPL monoaurally. Presentation level in infants was corrected using in-ear measurements. EFRs were recorded between high forehead and ipsilateral mastoid. Statistical post-processing was completed using F -test, Magnitude-Square Coherence, Rayleigh test, Rayleigh-Moore test, and Hotelling's T 2 test. Logistic regression models assessed the sensitivity of each statistical indicator in both infants and adults as a function of testing duration. RESULTS The Rayleigh-Moore and Rayleigh tests were the most sensitive statistical indicators for speech-evoked EFR detection in infants. Comparatively, Magnitude-Square Coherence and Hotelling's T 2 also provide clinical benefit for infants in all conditions after ~30 minutes of testing, whereas the F -test failed to detect responses to EFRs elicited by vowels with accuracy greater than chance. In contrast, the F-test was the most sensitive for vowel-elicited response detection for adults in short tests (<10 minute) and performed comparatively with the Rayleigh-Moore and Rayleigh test during longer test durations. Decreased sensitivity was observed in infants relative to adults across all testing durations and statistical indicators, but the effects were largest in low frequency stimuli and seemed to be mostly, but not wholly, caused by differences in response amplitude. CONCLUSIONS The choice of statistical indicator significantly impacts the sensitivity of speech-evoked EFR detection. In both groups and for all stimuli, the Rayleigh test and Rayleigh-Moore tests have high sensitivity. Differences in EFR detection are present between infants and adults regardless of statistical indicator; however, these effects are largest for low-frequency EFR stimuli and for amplitude-based statistical indicators.
Collapse
|
6
|
Lu H, Mehta AH, Oxenham AJ. Methodological considerations when measuring and analyzing auditory steady-state responses with multi-channel EEG. CURRENT RESEARCH IN NEUROBIOLOGY 2022; 3:100061. [PMID: 36386860 PMCID: PMC9647176 DOI: 10.1016/j.crneur.2022.100061] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2021] [Revised: 07/11/2022] [Accepted: 10/19/2022] [Indexed: 11/06/2022] Open
Abstract
The auditory steady-state response (ASSR) has been traditionally recorded with few electrodes and is often measured as the voltage difference between mastoid and vertex electrodes (vertical montage). As high-density EEG recording systems have gained popularity, multi-channel analysis methods have been developed to integrate the ASSR signal across channels. The phases of ASSR across electrodes can be affected by factors including the stimulus modulation rate and re-referencing strategy, which will in turn affect the estimated ASSR strength. To explore the relationship between the classical vertical-montage ASSR and whole-scalp ASSR, we applied these two techniques to the same data to estimate the strength of ASSRs evoked by tones with sinusoidal amplitude modulation rates of around 40, 100, and 200 Hz. The whole-scalp methods evaluated in our study, with either linked-mastoid or common-average reference, included ones that assume equal phase across all channels, as well as ones that allow for different phase relationships. The performance of simple averaging was compared to that of more complex methods involving principal component analysis. Overall, the root-mean-square of the phase locking values (PLVs) across all channels provided the most efficient method to detect ASSR across the range of modulation rates tested here.
Collapse
Affiliation(s)
- Hao Lu
- Department of Psychology, University of Minnesota, 75 East River Parkway, Minneapolis, MN, 55455, USA
| | - Anahita H. Mehta
- Department of Psychology, University of Minnesota, 75 East River Parkway, Minneapolis, MN, 55455, USA
| | - Andrew J. Oxenham
- Department of Psychology, University of Minnesota, 75 East River Parkway, Minneapolis, MN, 55455, USA
| |
Collapse
|
7
|
Easwar V, Purcell D, Lasarev M, McGrath E, Galloy M. Speech-Evoked Envelope Following Responses in Children and Adults. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2022; 65:4009-4023. [PMID: 36129844 DOI: 10.1044/2022_jslhr-22-00156] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
PURPOSE Envelope following responses (EFRs) could be useful for objectively evaluating audibility of speech in children who are unable to participate in routine clinical tests. However, relative to adults, the characteristics of EFRs elicited by frequency-specific speech and their utility in predicting audibility in children are unknown. METHOD EFRs were elicited by the first (F1) and second and higher formants (F2+) of male-spoken vowels /u/ and /i/ and by fricatives /ʃ/ and /s/ in the token /suʃi/ presented at 15, 35, 55, 65, and 75 dB SPL. The F1, F2+, and fricatives were low-, mid-, and high-frequency dominant, respectively. EFRs were recorded between the vertex and the nape from twenty-three 6- to 17-year-old children and 21 young adults with normal hearing. Sensation levels of stimuli were estimated based on behavioral thresholds. RESULTS In children, amplitude decreased with age for /ʃ/-elicited EFRs but remained stable for low- and mid-frequency stimuli. As a group, EFR amplitude and phase coherence did not differ from that of adults. EFR sensitivity (proportion of audible stimuli detected) and specificity (proportion of inaudible stimuli not detected) did not vary between children and adults. Consistent with previous work, EFR sensitivity increased with stimulus frequency and level. The type of statistical indicator used for EFR detection did not influence accuracy in children. CONCLUSIONS Adultlike EFRs in 6- to 17-year-old typically developing children suggest mature envelope encoding for low- and mid-frequency stimuli. EFR sensitivity and specificity in children, when considering a wide range of stimulus levels and audibility, are ~77% and ~92%, respectively. SUPPLEMENTAL MATERIAL https://doi.org/10.23641/asha.21136171.
Collapse
Affiliation(s)
- Vijayalakshmi Easwar
- Department of Communication Sciences and Disorders and Waisman Center, University of Wisconsin-Madison
- National Acoustic Laboratories, Sydney, New South Wales, Australia
| | - David Purcell
- School of Communication Sciences and Disorders, Western University, London, Ontario, Canada
| | - Michael Lasarev
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison
| | - Emma McGrath
- Department of Communication Sciences and Disorders and Waisman Center, University of Wisconsin-Madison
| | - Mary Galloy
- Department of Communication Sciences and Disorders and Waisman Center, University of Wisconsin-Madison
| |
Collapse
|
8
|
Jeng FC, Jeng YS. Implementation of Machine Learning on Human Frequency-Following Responses: A Tutorial. Semin Hear 2022; 43:251-274. [PMID: 36313046 PMCID: PMC9605809 DOI: 10.1055/s-0042-1756219] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022] Open
Abstract
The frequency-following response (FFR) provides enriched information on how acoustic stimuli are processed in the human brain. Based on recent studies, machine learning techniques have demonstrated great utility in modeling human FFRs. This tutorial focuses on the fundamental principles, algorithmic designs, and custom implementations of several supervised models (linear regression, logistic regression, k -nearest neighbors, support vector machines) and an unsupervised model ( k -means clustering). Other useful machine learning tools (Markov chains, dimensionality reduction, principal components analysis, nonnegative matrix factorization, and neural networks) are discussed as well. Each model's applicability and its pros and cons are explained. The choice of a suitable model is highly dependent on the research question, FFR recordings, target variables, extracted features, and their data types. To promote understanding, an example project implemented in Python is provided, which demonstrates practical usage of several of the discussed models on a sample dataset of six FFR features and a target response label.
Collapse
Affiliation(s)
- Fuh-Cherng Jeng
- Communication Sciences and Disorders, Ohio University, Athens, Ohio
| | - Yu-Shiang Jeng
- Computer Science and Engineering, Ohio State University, Columbus, Ohio
| |
Collapse
|
9
|
Sensitivity of Vowel-Evoked Envelope Following Responses to Spectra and Level of Preceding Phoneme Context. Ear Hear 2022; 43:1327-1335. [DOI: 10.1097/aud.0000000000001190] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
10
|
Cheng FY, Smith S. Objective Detection of the Speech Frequency Following Response (sFFR): A Comparison of Two Methods. Audiol Res 2022; 12:89-94. [PMID: 35200259 PMCID: PMC8869319 DOI: 10.3390/audiolres12010010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2021] [Revised: 01/22/2022] [Accepted: 01/24/2022] [Indexed: 02/01/2023] Open
Abstract
Speech frequency following responses (sFFRs) are increasingly used in translational auditory research. Statistically-based automated sFFR detection could aid response identification and provide a basis for stopping rules when recording responses in clinical and/or research applications. In this brief report, sFFRs were measured from 18 normal hearing adult listeners in quiet and speech-shaped noise. Two statistically-based automated response detection methods, the F-test and Hotelling’s T2 (HT2) test, were compared based on detection accuracy and test time. Similar detection accuracy across statistical tests and conditions was observed, although the HT2 test time was less variable. These findings suggest that automated sFFR detection is robust for responses recorded in quiet and speech-shaped noise using either the F-test or HT2 test. Future studies evaluating test performance with different stimuli and maskers are warranted to determine if the interchangeability of test performance extends to these conditions.
Collapse
|
11
|
Chesnaye MA, Bell SL, Harte JM, Simpson DM. Controlling test specificity for auditory evoked response detection using a frequency domain bootstrap. J Neurosci Methods 2021; 363:109352. [PMID: 34508784 DOI: 10.1016/j.jneumeth.2021.109352] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2020] [Revised: 08/29/2021] [Accepted: 08/31/2021] [Indexed: 10/20/2022]
Abstract
BACKGROUND Statistical detection methods are routinely used to automate auditory evoked response (AER) detection and assist clinicians with AER measurements. However, many of these methods are built around statistical assumptions that can be violated for AER data, potentially resulting in reduced or unpredictable test performances. This study explores a frequency domain bootstrap (FDB) and some FDB modifications to preserve test performance in serially correlated non-stationary data. METHOD The FDB aims to generate many surrogate recordings, all with similar serial correlation as the original recording being analysed. Analysing the surrogates with the detection method then gives a distribution of values that can be used for inference. A potential limitation of the conventional FDB is the assumption of stationary data with a smooth power spectral density (PSD) function, which is addressed through two modifications. COMPARISONS WITH EXISTING METHODS The FDB was compared to a conventional parametric approach and two modified FDB approaches that aim to account for heteroskedasticity and non-smooth PSD functions. Hotelling's T2(HT2) test applied to auditory brainstem responses was the test case. RESULTS When using conventional HT2, false-positive rates deviated significantly from the nominal alpha-levels due to serial correlation. The false-positive rates of the modified FDB were consistently closer to the nominal alpha-levels, especially when data was strongly heteroskedastic or the underlying PSD function was not smooth due to e.g. power lines noise. CONCLUSION The FDB and its modifications provide accurate, recording-dependent approximations of null distributions, and an improved control of false-positive rates relative to parametric inference for auditory brainstem response detection.
Collapse
Affiliation(s)
- M A Chesnaye
- Institute of Sound and Vibration Research, Faculty of Engineering and the Environment, University of Southampton, UK.
| | - S L Bell
- Institute of Sound and Vibration Research, Faculty of Engineering and the Environment, University of Southampton, UK.
| | - J M Harte
- Eriksholm Research Centre, Snekkersten, Denmark.
| | - D M Simpson
- Institute of Sound and Vibration Research, Faculty of Engineering and the Environment, University of Southampton, UK.
| |
Collapse
|
12
|
Easwar V, Birstler J, Harrison A, Scollie S, Purcell D. The Accuracy of Envelope Following Responses in Predicting Speech Audibility. Ear Hear 2021; 41:1732-1746. [PMID: 33136646 PMCID: PMC8132745 DOI: 10.1097/aud.0000000000000892] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
OBJECTIVES The present study aimed to (1) evaluate the accuracy of envelope following responses (EFRs) in predicting speech audibility as a function of the statistical indicator used for objective response detection, stimulus phoneme, frequency, and level, and (2) quantify the minimum sensation level (SL; stimulus level above behavioral threshold) needed for detecting EFRs. DESIGN In 21 participants with normal hearing, EFRs were elicited by 8 band-limited phonemes in the male-spoken token /susa∫i/ (2.05 sec) presented between 20 and 65 dB SPL in 15 dB increments. Vowels in /susa∫i/ were modified to elicit two EFRs simultaneously by selectively lowering the fundamental frequency (f0) in the first formant (F1) region. The modified vowels elicited one EFR from the low-frequency F1 and another from the mid-frequency second and higher formants (F2+). Fricatives were amplitude-modulated at the average f0. EFRs were extracted from single-channel EEG recorded between the vertex (Cz) and the nape of the neck when /susa∫i/ was presented monaurally for 450 sweeps. The performance of the three statistical indicators, F-test, Hotelling's T, and phase coherence, was compared against behaviorally determined audibility (estimated SL, SL ≥0 dB = audible) using area under the receiver operating characteristics (AUROC) curve, sensitivity (the proportion of audible speech with a detectable EFR [true positive rate]), and specificity (the proportion of inaudible speech with an undetectable EFR [true negative rate]). The influence of stimulus phoneme, frequency, and level on the accuracy of EFRs in predicting speech audibility was assessed by comparing sensitivity, specificity, positive predictive value (PPV; the proportion of detected EFRs elicited by audible stimuli) and negative predictive value (NPV; the proportion of undetected EFRs elicited by inaudible stimuli). The minimum SL needed for detection was evaluated using a linear mixed-effects model with the predictor variables stimulus and EFR detection p value. RESULTS of the 3 statistical indicators were similar; however, at the type I error rate of 5%, the sensitivities of Hotelling's T (68.4%) and phase coherence (68.8%) were significantly higher than the F-test (59.5%). In contrast, the specificity of the F-test (97.3%) was significantly higher than the Hotelling's T (88.4%). When analyzed using Hotelling's T as a function of stimulus, fricatives offered higher sensitivity (88.6 to 90.6%) and NPV (57.9 to 76.0%) compared with most vowel stimuli (51.9 to 71.4% and 11.6 to 51.3%, respectively). When analyzed as a function of frequency band (F1, F2+, and fricatives aggregated as low-, mid- and high-frequencies, respectively), high-frequency stimuli offered the highest sensitivity (96.9%) and NPV (88.9%). When analyzed as a function of test level, sensitivity improved with increases in stimulus level (99.4% at 65 dB SPL). The minimum SL for EFR detection ranged between 13.4 and 21.7 dB for F1 stimuli, 7.8 to 12.2 dB for F2+ stimuli, and 2.3 to 3.9 dB for fricative stimuli. CONCLUSIONS EFR-based inference of speech audibility requires consideration of the statistical indicator used, phoneme, stimulus frequency, and stimulus level.
Collapse
Affiliation(s)
- Vijayalakshmi Easwar
- Department of Communication Sciences and Disorders & Waisman Center, University of Wisconsin-Madison, USA
- National Centre for Audiology, Western University, Canada
| | - Jen Birstler
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, USA
| | - Adrienne Harrison
- Health and Rehabilitation Sciences, Western University, Canada
- School of Communication Sciences and Disorders, Western University, Canada
| | - Susan Scollie
- National Centre for Audiology, Western University, Canada
- School of Communication Sciences and Disorders, Western University, Canada
| | - David Purcell
- National Centre for Audiology, Western University, Canada
- School of Communication Sciences and Disorders, Western University, Canada
| |
Collapse
|
13
|
Montage-related Variability in the Characteristics of Envelope Following Responses. Ear Hear 2021; 42:1436-1440. [PMID: 33900208 DOI: 10.1097/aud.0000000000001018] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
OBJECTIVES The study aimed to compare two electrode montages commonly used for recording speech-evoked envelope following responses. DESIGN Twenty-three normal-hearing adults participated in this study. EFRs were elicited by a naturally spoken, modified /susa∫i/ stimulus presented at 65 dB SPL monaurally. EFRs were recorded using two single-channel electrode montages: Cz-nape and Fz-ipsilateral mastoid, where the noninverting and inverting sites were the vertex and nape, and the high forehead and ipsilateral mastoid, respectively. Montage order was counterbalanced across participants. RESULTS Envelope following responses amplitude and phase coherence were significantly higher overall in the Cz-nape montage with no significant differences in noise amplitude. Post hoc testing on montage effects in response amplitude and phase coherence was not significant for individual stimuli. The Cz-nape montage also resulted in a greater number of detections and analyzed using the Hotelling's T2. CONCLUSIONS Electrode montage influences the estimated characteristics of speech-evoked EFRs.
Collapse
|
14
|
Efficient Detection of Cortical Auditory Evoked Potentials in Adults Using Bootstrapped Methods. Ear Hear 2020; 42:574-583. [PMID: 33259446 DOI: 10.1097/aud.0000000000000959] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
BACKGROUND Statistical detection methods are useful tools for assisting clinicians with cortical auditory evoked potential (CAEP) detection, and can help improve the overall efficiency and reliability of the test. However, many of these detection methods rely on parametric distributions when evaluating test significance, and thus make various assumptions regarding the electroencephalogram (EEG) data. When these assumptions are violated, reduced test sensitivities and/or increased or decreased false-positive rates can be expected. As an alternative to the parametric approach, test significance can be evaluated using a bootstrap, which does not require some of the aforementioned assumptions. Bootstrapping also permits a large amount of freedom when choosing or designing the statistical test for response detection, as the distributions underlying the test statistic no longer need to be known prior to the test. OBJECTIVES To improve the reliability and efficiency of CAEP-related applications by improving the specificity and sensitivity of objective CAEP detection methods. DESIGN The methods included in the assessment were Hotelling's T2 test, the Fmp, four modified q-sample statistics, and various template-based detection methods (calculated between the ensemble coherent average and some predefined template), including the correlation coefficient, covariance, and dynamic time-warping (DTW). The assessment was carried out using both simulations and a CAEP threshold series collected from 23 adults with normal hearing. RESULTS The most sensitive method was DTW, evaluated using the bootstrap, with maximum increases in test sensitivity (relative to the conventional Hotelling's T2 test) of up to 30%. An important factor underlying the performance of DTW is that the template adopted for the analysis correlates well with the subjects' CAEP. CONCLUSION When subjects' CAEP morphology is approximately known before the test, then the DTW algorithm provides a highly sensitive method for CAEP detection.
Collapse
|
15
|
Abstract
OBJECTIVES There is increasing interest in using the frequency following response (FFR) to describe the effects of varying different aspects of hearing aid signal processing on brainstem neural representation of speech. To this end, recent studies have examined the effects of filtering on brainstem neural representation of the speech fundamental frequency (f0) in listeners with normal hearing sensitivity by measuring FFRs to low- and high-pass filtered signals. However, the stimuli used in these studies do not reflect the entire range of typical cutoff frequencies used in frequency-specific gain adjustments during hearing aid fitting. Further, there has been limited discussion on the effect of filtering on brainstem neural representation of formant-related harmonics. Here, the effects of filtering on brainstem neural representation of speech fundamental frequency (f0) and harmonics related to first formant frequency (F1) were assessed by recording envelope and spectral FFRs to a vowel low-, high-, and band-pass filtered at cutoff frequencies ranging from 0.125 to 8 kHz. DESIGN FFRs were measured to a synthetically generated vowel stimulus /u/ presented in a full bandwidth and low-pass (experiment 1), high-pass (experiment 2), and band-pass (experiment 3) filtered conditions. In experiment 1, FFRs were measured to a synthetically generated vowel stimulus /u/ presented in a full bandwidth condition as well as 11 low-pass filtered conditions (low-pass cutoff frequencies: 0.125, 0.25, 0.5, 0.75, 1, 1.5, 2, 3, 4, 6, and 8 kHz) in 19 adult listeners with normal hearing sensitivity. In experiment 2, FFRs were measured to the same synthetically generated vowel stimulus /u/ presented in a full bandwidth condition as well as 10 high-pass filtered conditions (high-pass cutoff frequencies: 0.125, 0.25, 0.5, 0.75, 1, 1.5, 2, 3, 4, and 6 kHz) in 7 adult listeners with normal hearing sensitivity. In experiment 3, in addition to the full bandwidth condition, FFRs were measured to vowel /u/ low-pass filtered at 2 kHz, band-pass filtered between 2-4 kHz and 4-6 kHz in 10 adult listeners with normal hearing sensitivity. A Fast Fourier Transform analysis was conducted to measure the strength of f0 and the F1-related harmonic relative to the noise floor in the brainstem neural responses obtained to the full bandwidth and filtered stimulus conditions. RESULTS Brainstem neural representation of f0 was reduced when the low-pass filter cutoff frequency was between 0.25 and 0.5 kHz; no differences in f0 strength were noted between conditions when the low-pass filter cutoff condition was at or greater than 0.75 kHz. While envelope FFR f0 strength was reduced when the stimulus was high-pass filtered at 6 kHz, there was no effect of high-pass filtering on brainstem neural representation of f0 when the high-pass filter cutoff frequency ranged from 0.125 to 4 kHz. There was a weakly significant global effect of band-pass filtering on brainstem neural phase-locking to f0. A trends analysis indicated that mean f0 magnitude in the brainstem neural response was greater when the stimulus was band-pass filtered between 2 and 4 kHz as compared to when the stimulus was band-pass filtered between 4 and 6 kHz, low-pass filtered at 2 kHz or presented in the full bandwidth condition. Last, neural phase-locking to f0 was reduced or absent in envelope FFRs measured to filtered stimuli that lacked spectral energy above 0.125 kHz or below 6 kHz. Similarly, little to no energy was seen at F1 in spectral FFRs obtained to low-, high-, or band-pass filtered stimuli that did not contain energy in the F1 region. For stimulus conditions that contained energy at F1, the strength of the peak at F1 in the spectral FFR varied little with low-, high-, or band-pass filtering. CONCLUSIONS Energy at f0 in envelope FFRs may arise due to neural phase-locking to low-, mid-, or high-frequency stimulus components, provided the stimulus envelope is modulated by at least two interacting harmonics. Stronger neural responses at f0 are measured when filtering results in stimulus bandwidths that preserve stimulus energy at F1 and F2. In addition, results suggest that unresolved harmonics may favorably influence f0 strength in the neural response. Lastly, brainstem neural representation of the F1-related harmonic measured in spectral FFRs obtained to filtered stimuli is related to the presence or absence of stimulus energy at F1. These findings add to the existing literature exploring the viability of the FFR as an objective technique to evaluate hearing aid fitting where stimulus bandwidth is altered by design due to frequency-specific gain applied by amplification algorithms.
Collapse
|
16
|
Richard C, Neel ML, Jeanvoine A, Connell SM, Gehred A, Maitre NL. Characteristics of the Frequency-Following Response to Speech in Neonates and Potential Applicability in Clinical Practice: A Systematic Review. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2020; 63:1618-1635. [PMID: 32407639 DOI: 10.1044/2020_jslhr-19-00322] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Purpose We sought to critically analyze and evaluate published evidence regarding feasibility and clinical potential for predicting neurodevelopmental outcomes of the frequency-following responses (FFRs) to speech recordings in neonates (birth to 28 days). Method A systematic search of MeSH terms in the Cumulative Index to Nursing and Allied HealthLiterature, Embase, Google Scholar, Ovid Medline (R) and E-Pub Ahead of Print, In-Process & Other Non-Indexed Citations and Daily, Web of Science, SCOPUS, COCHRANE Library, and ClinicalTrials.gov was performed. Manual review of all items identified in the search was performed by two independent reviewers. Articles were evaluated based on the level of methodological quality and evidence according to the RTI item bank. Results Seven articles met inclusion criteria. None of the included studies reported neurodevelopmental outcomes past 3 months of age. Quality of the evidence ranged from moderate to high. Protocol variations were frequent. Conclusions Based on this systematic review, the FFR to speech can capture both temporal and spectral acoustic features in neonates. It can accurately be recorded in a fast and easy manner at the infant's bedside. However, at this time, further studies are needed to identify and validate which FFR features could be incorporated as an addition to standard evaluation of infant sound processing evaluation in subcortico-cortical networks. This review identifies the need for further research focused on identifying specific features of the neonatal FFRs, those with predictive value for early childhood outcomes to help guide targeted early speech and hearing interventions.
Collapse
Affiliation(s)
- Céline Richard
- Center for Perinatal Research and Department of Pediatrics, Nationwide Children's Hospital, Columbus, OH
- Laboratory for Investigative Neurophysiology, Department of Radiology and Department of Clinical Neurosciences, University Hospital Center and University of Lausanne, Switzerland
| | - Mary Lauren Neel
- Center for Perinatal Research and Department of Pediatrics, Nationwide Children's Hospital, Columbus, OH
| | - Arnaud Jeanvoine
- Center for Perinatal Research and Department of Pediatrics, Nationwide Children's Hospital, Columbus, OH
| | - Sharon Mc Connell
- Center for Perinatal Research and Department of Pediatrics, Nationwide Children's Hospital, Columbus, OH
| | - Alison Gehred
- Medical Library Division, Nationwide Children's Hospital, Columbus, OH
| | - Nathalie L Maitre
- Center for Perinatal Research and Department of Pediatrics, Nationwide Children's Hospital, Columbus, OH
- Department of Hearing and Speech Sciences, Vanderbilt University Medical Center, Nashville, TN
| |
Collapse
|
17
|
Zanotelli T, Antunes F, Simpson DM, Mazoni Andrade Marçal Mendes E, Felix LB. Faster automatic ASSR detection using sequential tests. Int J Audiol 2020; 59:631-639. [PMID: 32091286 DOI: 10.1080/14992027.2020.1728402] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
Abstract
Objective: Objective Response Detection (ORD) can be used for auditory steady-state response (ASSR) detection. In conventional ORD methods, the statistical tests are applied at the end of data collection ('single-shot tests'). In sequential ORD methods, statistical tests are applied repeatedly, while data is being collected. However, repeated testing can increase False Positive (FP) rates. One solution is to infer that response is present only after the test remains significant for a predefined number of consecutive detections (NCD). Thus, this paper describes a new method for finding the required NCD that control the FP rate for ASSR detection.Design: NCD values are estimated using Monte Carlo simulations.Study sample: ASSR signals were recorded from 8 normal-hearing subjects.Results: The exam time was reduced by up to 38.9% compared to the single-shot test with loss of approximately 5% in detection rate. Alternatively, lower gains in time were achieved for a smaller (non-significant) loss in detection rate. The FP rates at the end of the test were kept at the nominal level expected (1%).Conclusion: The sequential test strategy with NCD as the stopping criterion can improve the speed of ASSR detection and prevent higher than expected FP rates.
Collapse
Affiliation(s)
- Tiago Zanotelli
- Department of Electrical Engineering, Graduate Program in Electrical Engineering, Federal University of Minas Gerais, Belo Horizonte, Brazil.,Federal Institute of Education Science and Technology of Espírito Santo-São Mateus, São Mateus, Brazil
| | - Felipe Antunes
- Department of Electrical Engineering, Federal Institute of Education Science and Technology of Minas Gerais-Ipatinga Advanced Campus, Ipatinga, Brazil
| | - David Martin Simpson
- Institute of Sound and Vibration and Research, University of Southampton, Southampton, United Kingdom
| | | | - Leonardo Bonato Felix
- Institute of Sound and Vibration and Research, University of Southampton, Southampton, United Kingdom.,Department of Electrical Engineering, Federal University of Viçosa, Viçosa, Brazil
| |
Collapse
|
18
|
Easwar V, Scollie S, Purcell D. Investigating potential interactions between envelope following responses elicited simultaneously by different vowel formants. Hear Res 2019; 380:35-45. [DOI: 10.1016/j.heares.2019.05.005] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/07/2019] [Revised: 05/16/2019] [Accepted: 05/21/2019] [Indexed: 10/26/2022]
|
19
|
Improving the power of objective response detection of evoked responses in noise by using average and product of magnitude-squared coherence of two different signals. Med Biol Eng Comput 2019; 57:2203-2214. [PMID: 31399896 DOI: 10.1007/s11517-019-02020-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2019] [Accepted: 07/24/2019] [Indexed: 10/26/2022]
Abstract
Objective response detection (ORD) techniques such as the magnitude-squared coherence (MSC) are mathematical methods tailored to detect potentials evoked by an external periodic stimulation. The performance of the MSC is directly proportional to the signal-to-noise ratio (SNR) of the recorded signal and the time spent for collecting data. An alternative to increasing the performance of detection techniques without increasing data recording time is to use the information from more than one signal simultaneously. In this context, this work proposes two new detection techniques based on the average and on the product of MSCs of two different signals. The critical values and detection probabilities were obtained theoretically and using a Monte Carlo simulation. The performances of the new detectors were evaluated using synthetic data and electroencephalogram (EEG) signals during photo and auditory stimulation. For the synthetic signals, the two proposed detectors exhibited a higher detection rate when compared to the rate of the traditional MSC technique. When applied to EEG signals, these detectors resulted in an increase of the mean detection rate in relation to MSC for visual and auditory stimulation of at least 25% and 13.21%, respectively. The proposed detectors may be considered as promising tools for clinical applications. Graphical Abstract.
Collapse
|
20
|
Vanheusden FJ, Chesnaye MA, Simpson DM, Bell SL. Envelope frequency following responses are stronger for high-pass than low-pass filtered vowels. Int J Audiol 2019; 58:355-362. [DOI: 10.1080/14992027.2018.1562243] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Affiliation(s)
- Frederique J. Vanheusden
- Institute of Sound and Vibration Research, University of Southampton, Southampton, UK
- School of Science and Technology, Nottingham Trent University, Clifton Campus, Nottingham, UK
| | - Michael A. Chesnaye
- Institute of Sound and Vibration Research, University of Southampton, Southampton, UK
| | - David M. Simpson
- Institute of Sound and Vibration Research, University of Southampton, Southampton, UK
| | - Steven L. Bell
- Institute of Sound and Vibration Research, University of Southampton, Southampton, UK
| |
Collapse
|