26
|
Anand S, Kopf LM, Shrivastav R, Eddins DA. Objective Indices of Perceived Vocal Strain. J Voice 2019; 33:838-845. [DOI: 10.1016/j.jvoice.2018.06.005] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2018] [Revised: 06/06/2018] [Accepted: 06/07/2018] [Indexed: 10/28/2022]
|
27
|
Isarangura S, Eddins AC, Ozmeral EJ, Eddins DA. The Effects of Duration and Level on Spectral Modulation Perception. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2019; 62:3876-3886. [PMID: 31638883 PMCID: PMC7838824 DOI: 10.1044/2019_jslhr-h-18-0449] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/09/2018] [Revised: 04/02/2019] [Accepted: 07/23/2019] [Indexed: 05/30/2023]
Abstract
Purpose Spectral modulation detection is an increasingly common assay of suprathreshold auditory perception and has been correlated with speech perception performance. Here, the potential effects of stimulus duration and stimulus presentation level on spectral modulation detection were investigated. Method Spectral modulation detection thresholds were measured as a function of modulation frequency in young, normal-hearing listeners. The standard stimulus was a bandpass noise, and signal stimuli were created by superimposing sinusoidal spectral modulation on the bandpass noise carrier. The modulation was sinusoidal on a log2 frequency axis and a log10 (dB) amplitude scale with a random starting phase (0-2π radians). In 1 experiment, stimulus durations were 50, 100, 200, or 400 ms (at fixed level 81 dB SPL). In a 2nd experiment, stimuli were presented at sensation levels of 10, 20, 30, 40, and 60 dB SL (fixed at a duration of 400 ms). Results Spectral modulation detection thresholds were similarly low for the 400- and 200-ms durations, increased slightly for the 100-ms duration, and increased markedly for the 50-ms duration. Thresholds were lowest for 40 dB SL; increased slightly for 20, 30, and 60 dB SL; and markedly higher for the 10-dB SL condition. Conclusions The increase in thresholds for the shortest durations and lowest sensational levels is consistent with previous investigations of auditory spectral profile analysis. The effects of presentation level and stimulus duration are important considerations in the context of understanding potential relationships between the perception of spectral cues and speech perception, when designing investigations and interpreting data related to spectral envelope perception, and in the context of models of auditory perception. As examples, 2 simple models based on auditory nerve output that have been used to explain spectrotemporal modulation in previous investigations produced an output inconsistent with the present results. Plain language summary Intensity variations across audio frequency lead to spectral shapes that are essential and sometimes signature features of various sounds in the environment, including speech. Here, we show how laboratory measures of spectral shape perception depend on presentation level and stimulus duration.
Collapse
|
28
|
Anand S, Skowronski MD, Shrivastav R, Eddins DA. Perceptual and Quantitative Assessment of Dysphonia Across Vowel Categories. J Voice 2019; 33:473-481. [DOI: 10.1016/j.jvoice.2017.12.018] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2017] [Accepted: 12/21/2017] [Indexed: 10/16/2022]
|
29
|
Ozmeral EJ, Eddins DA, Eddins AC. Electrophysiological responses to lateral shifts are not consistent with opponent-channel processing of interaural level differences. J Neurophysiol 2019; 122:737-748. [PMID: 31242052 DOI: 10.1152/jn.00090.2019] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
Cortical encoding of auditory space relies on two major peripheral cues, interaural time difference (ITD) and interaural level difference (ILD) of the sounds arriving at a listener's ears. In much of the precortical auditory pathway, ITD and ILD cues are processed independently, and it is assumed that cue integration is a higher order process. However, there remains debate on how ITDs and ILDs are encoded in the cortex and whether they share a common mechanism. The present study used electroencephalography (EEG) to measure evoked cortical potentials from narrowband noise stimuli with imposed binaural cue changes. Previous studies have similarly tested ITD shifts to demonstrate that neural populations broadly favor one spatial hemifield over the other, which is consistent with an opponent-channel model that computes the relative activity between broadly tuned neural populations. However, it is still a matter of debate whether the same coding scheme applies to ILDs and, if so, whether processing the two binaural cues is distributed across similar regions of the cortex. The results indicate that ITD and ILD cues have similar neural signatures with respect to the monotonic responses to shift magnitude; however, the direction of the shift did not elicit responses equally across cues. Specifically, ITD shifts evoked greater responses for outward than inward shifts, independently of the spatial hemifield of the shift, whereas ILD-shift responses were dependent on the hemifield in which the shift occurred. Active cortical structures showed only minor overlap between responses to cues, suggesting the two are not represented by the same pathway.NEW & NOTEWORTHY Interaural time differences (ITDs) and interaural level differences (ILDs) are critical to locating auditory sources in the horizontal plane. The higher order perceptual feature of auditory space is thought to be encoded together by these binaural differences, yet evidence of their integration in cortex remains elusive. Although present results show some common effects between the two cues, key differences were observed that are not consistent with an ITD-like opponent-channel process for ILD encoding.
Collapse
|
30
|
Hoover EC, Kinney BN, Bell KL, Gallun FJ, Eddins DA. A Comparison of Behavioral Methods for Indexing the Auditory Processing of Temporal Fine Structure Cues. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2019; 62:2018-2034. [PMID: 31145649 PMCID: PMC6808371 DOI: 10.1044/2019_jslhr-h-18-0217] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/30/2018] [Revised: 12/20/2018] [Accepted: 02/13/2019] [Indexed: 05/12/2023]
Abstract
Purpose Growing evidence supports the inclusion of perceptual tests that quantify the processing of temporal fine structure (TFS) in clinical hearing assessment. Many tasks have been used to evaluate TFS in the laboratory that vary greatly in the stimuli used and whether the judgments require monaural or binaural comparisons of TFS. The purpose of this study was to compare laboratory measures of TFS for inclusion in a battery of suprathreshold auditory tests. A subset of available TFS tasks were selected on the basis of potential clinical utility and were evaluated using metrics that focus on characteristics important for clinical use. Method TFS measures were implemented in replication of studies that demonstrated clinical utility. Monaural, diotic, and dichotic measures were evaluated in 11 young listeners with normal hearing. Measures included frequency modulation (FM) tasks, harmonic frequency shift detection, interaural phase difference (TFS-low frequency), interaural time difference (ITD), monaural gap duration discrimination, and tone detection in noise with and without a difference in interaural phase (N0S0, N0Sπ). Data were compared with published results and evaluated with metrics of consistency and efficiency. Results Thresholds obtained were consistent with published data. There was no evidence of predictive relationships among the measures consistent with a homogenous group. The most stable tasks across repeated testing were TFS-low frequency, diotic and dichotic FM, and N0Sπ. Monaural and diotic FM had the lowest normalized variance and were the most efficient accounting for differences in total test duration, followed by ITD. Conclusions Despite a long stimulus duration, FM tasks dominated comparisons of consistency and efficiency. Small differences separated the dichotic tasks FM, ITD, and N0Sπ. Future comparisons following procedural optimization of the tasks will evaluate clinical efficiency in populations with impairment.
Collapse
|
31
|
Eddins AC, Ozmeral EJ, Eddins DA. How aging impacts the encoding of binaural cues and the perception of auditory space. Hear Res 2018; 369:79-89. [PMID: 29759684 PMCID: PMC6196106 DOI: 10.1016/j.heares.2018.05.001] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/15/2017] [Revised: 04/23/2018] [Accepted: 05/02/2018] [Indexed: 10/17/2022]
Abstract
Over the years, the effect of aging on auditory function has been investigated in animal models and humans in an effort to characterize age-related changes in both perception and physiology. Here, we review how aging may impact neural encoding and processing of binaural and spatial cues in human listeners with a focus on recent work by the authors as well as others. Age-related declines in monaural temporal processing, as estimated from measures of gap detection and temporal fine structure discrimination, have been associated with poorer performance on binaural tasks that require precise temporal processing. In lateralization and localization tasks, as well as in the detection of signals in noise, marked age-related changes have been demonstrated in both behavioral and electrophysiological measures and have been attributed to declines in neural synchrony and reduced central inhibition with advancing age. Evidence for such mechanisms, however, are influenced by the task (passive vs. attending) and the stimulus paradigm (e.g., static vs. continuous with dynamic change). That is, cortical auditory evoked potentials (CAEP) measured in response to static interaural time differences (ITDs) are larger in older versus younger listeners, consistent with reduced inhibition, while continuous stimuli with dynamic ITD changes lead to smaller responses in older compared to younger adults, suggestive of poorer neural synchrony. Additionally, the distribution of cortical activity is broader and less asymmetric in older than younger adults, consistent with the hemispheric asymmetry reduction in older adults model of cognitive aging. When older listeners attend to selected target locations in the free field, their CAEP components (N1, P2, P3) are again consistently smaller relative to younger listeners, and the reduced asymmetry in the distribution of cortical activity is maintained. As this research matures, proper neural biomarkers for changes in spatial hearing can provide objective evidence of impairment and targets for remediation. Future research should focus on the development and evaluation of effective approaches for remediating these spatial processing deficits associated with aging and hearing loss.
Collapse
|
32
|
Ozmeral EJ, Eddins AC, Eddins DA. How Do Age and Hearing Loss Impact Spectral Envelope Perception? JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2018; 61:2376-2385. [PMID: 30178062 PMCID: PMC6195040 DOI: 10.1044/2018_jslhr-h-18-0056] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/13/2018] [Revised: 05/09/2018] [Accepted: 05/16/2018] [Indexed: 06/01/2023]
Abstract
Purpose The goal was to evaluate the potential effects of increasing hearing loss and advancing age on spectral envelope perception. Method Spectral modulation detection was measured as a function of spectral modulation frequency from 0.5 to 8.0 cycles/octave. The spectral modulation task involved discrimination of a noise carrier (3 octaves wide from 400 to 3200 Hz) with a flat spectral envelope from a noise having a sinusoidal spectral envelope across a logarithmic audio frequency scale. Spectral modulation transfer functions (SMTFs; modulation threshold vs. modulation frequency) were computed and compared 4 listener groups: young normal hearing, older normal hearing, older with mild hearing loss, and older with moderate hearing loss. Estimates of the internal spectral contrast were obtained by computing excitation patterns. Results SMTFs for young listeners with normal hearing were bandpass with a minimum modulation detection threshold at 2 cycles/octave, and older listeners with normal hearing were remarkably similar to those of the young listeners. SMTFs for older listeners with mild and moderate hearing loss had a low-pass rather than a bandpass shape. Excitation patterns revealed that limited spectral resolution dictated modulation detection thresholds at high but not low spectral modulation frequencies. Even when factoring out (presumed) differences in frequency resolution among groups, the spectral envelope perception was worse for the group with moderate hearing loss than the other 3 groups. Conclusions The spectral envelope perception as measured by spectral modulation detection thresholds is compromised by hearing loss at higher spectral modulation frequencies, consistent with predictions of reduced spectral resolution known to accompany sensorineural hearing loss. Spectral envelope perception is not negatively impacted by advancing age at any spectral modulation frequency between 0.5 and 8.0 cycles/octave.
Collapse
|
33
|
Hoover EC, Eddins AC, Eddins DA. Distribution of spectral modulation transfer functions in a young, normal-hearing population. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2018; 143:306. [PMID: 29390785 PMCID: PMC5777922 DOI: 10.1121/1.5020787] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/12/2023]
Abstract
Spectral modulation transfer functions (SMTFs) were measured in 49 young (18-35 years of age) normal-hearing listeners. Noise carriers spanned six octaves from 200 to 12 800 Hz. Sinusoidal (on a log-amplitude scale) spectral modulation with random starting phase was superimposed on the carrier at spectral modulation frequencies of 0.25, 0.5, 1.0, 2.0, 4.0, and 8.0 cycles/octave. Modulation detection thresholds (in dB) yielded SMTFs that were bandpass in nature, consistent with previous investigations reporting data for only a few subjects. Thresholds were notably consistent across subjects despite minimal practice. Population statistics are reported that may serve as reference data for future studies.
Collapse
|
34
|
Kopf LM, Skowronski MD, Anand S, Eddins DA, Shrivastav R. The Perception of Breathiness in the Voices of Pediatric Speakers. J Voice 2017; 33:204-213. [PMID: 29162356 DOI: 10.1016/j.jvoice.2017.09.024] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2017] [Revised: 09/27/2017] [Accepted: 09/28/2017] [Indexed: 10/18/2022]
Abstract
BACKGROUND The perception of pediatric voice quality has been investigated using clinical protocols developed for adult voices and acoustic analyses designed to identify important physical parameters associated with normal and dysphonic pediatric voices. Laboratory investigations of adult dysphonia have included sophisticated methods, including a psychoacoustic approach that involves a single-variable matching task (SVMT), characterized by high inter- and intra-listener reliability, and analyses that include bio-inspired models of auditory perception that have provided valuable information regarding adult voice quality. OBJECTIVES To establish the utility of a psychoacoustic approach to the investigation of voice quality perception in the context of pediatric voices? METHODS Six listeners judged the breathiness of 20 synthetic vowel stimuli using an SVMT. To support comparisons with previous data, stimuli were modeled after four pediatric speakers and synthesized using Klatt with five parameter settings that influence the perception of breathiness. The population average breathiness judgments were modeled with acoustic measures of loudness ratio, pitch strength, and cepstral peak. RESULTS Listeners reliably judged the perceived breathiness of pediatric voices, as with previous investigations of breathiness in adult dysphonic voices. Breathiness judgments were accurately modeled by loudness ratio (r2 = 0.93), pitch strength (r2 = 0.91), and cepstral peak (r2 = 0.82). Model accuracy was not affected significantly by including stimulus fundamental frequency and was slightly higher for pediatric than for adult voices. CONCLUSIONS The SVMT proved robust for pediatric voices spanning a wide range of breathiness. The data indicate that this is a promising approach for future investigation of pediatric voice quality.
Collapse
|
35
|
Ozmeral EJ, Eddins DA, Eddins AC. Reduced temporal processing in older, normal-hearing listeners evident from electrophysiological responses to shifts in interaural time difference. J Neurophysiol 2016; 116:2720-2729. [PMID: 27683889 PMCID: PMC5133308 DOI: 10.1152/jn.00560.2016] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2016] [Accepted: 09/24/2016] [Indexed: 11/22/2022] Open
Abstract
Previous electrophysiological studies of interaural time difference (ITD) processing have demonstrated that ITDs are represented by a nontopographic population rate code. Rather than narrow tuning to ITDs, neural channels have broad tuning to ITDs in either the left or right auditory hemifield, and the relative activity between the channels determines the perceived lateralization of the sound. With advancing age, spatial perception weakens and poor temporal processing contributes to declining spatial acuity. At present, it is unclear whether age-related temporal processing deficits are due to poor inhibitory controls in the auditory system or degraded neural synchrony at the periphery. Cortical processing of spatial cues based on a hemifield code are susceptible to potential age-related physiological changes. We consider two distinct predictions of age-related changes to ITD sensitivity: declines in inhibitory mechanisms would lead to increased excitation and medial shifts to rate-azimuth functions, whereas a general reduction in neural synchrony would lead to reduced excitation and shallower slopes in the rate-azimuth function. The current study tested these possibilities by measuring an evoked response to ITD shifts in a narrow-band noise. Results were more in line with the latter outcome, both from measured latencies and amplitudes of the global field potentials and source-localized waveforms in the left and right auditory cortices. The measured responses for older listeners also tended to have reduced asymmetric distribution of activity in response to ITD shifts, which is consistent with other sensory and cognitive processing models of aging.
Collapse
|
36
|
Eddins DA, Kopf LM, Shrivastav R. The psychophysics of roughness applied to dysphonic voice. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2015; 138:3820-5. [PMID: 26723336 PMCID: PMC4691258 DOI: 10.1121/1.4937753] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/01/2015] [Revised: 11/19/2015] [Accepted: 11/25/2015] [Indexed: 05/19/2023]
Abstract
Roughness is a sound quality that has been related to the amplitude modulation characteristics of the acoustic stimulus. Roughness also is considered one of the primary elements of voice quality associated with natural variations across normal voices and is a salient feature of many dysphonic voices. It is known that the roughness of tonal stimuli is dependent on the frequency and depth of amplitude modulation and on the carrier frequency. Here, it is determined if similar dependencies exist for voiced speech stimuli. Knowledge of such dependencies can lead to a better understanding of the acoustic characteristics of vocal roughness along the continuum of normal to dysphonic and may facilitate computational estimates of vocal roughness. Synthetic vowel stimuli were modeled after talkers selected from the Satloff/Heman-Ackah disordered voice database. To parametrically control amplitude modulation frequency and depth, synthesized stimuli had minimal amplitude fluctuations, and amplitude modulation was superimposed with the desired frequency and depth. Perceptual roughness judgments depended on amplitude modulation frequency and depth in a manner that closely matched data from tonal carriers. The dependence of perceived roughness on amplitude modulation frequency and depth closely matched the roughness of sinusoidal carriers as reported by Fastl and Zwicker [(2007) Psychoacoustics: Facts and Models, 3rd ed. (Springer, New York)].
Collapse
|
37
|
Kelly MR, Amuso VJ, Eddins DA, Borkholder DA. The focal index as a singular metric for beamforming effectiveness. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2014; 136:2654-2664. [PMID: 25373966 DOI: 10.1121/1.4896465] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/04/2023]
Abstract
Hearing-impaired individuals frequently cite intelligibility problems in multi-talker environments. Microphone arrays performing time-delay beamforming address conditions of poor signal-to-noise ratio by spatially filtering incoming sound. Existing beam pattern metrics including peak side lobe level, integrated side lobe level, beamwidth, and planar directivity index fail to quantitatively capture all elements essential for improving speech intelligibility in multi-talker situations. The focal index (FI) was developed to address these deficiencies. Simulations were performed to exemplify the robust nature of the FI and to demonstrate the utility of this metric for driving array parameter selection. Beam patterns were generated and the metrics were calculated and evaluated against the strict unidirectional requirements for the array. Array performance was assessed by human subjects in a speech recognition task that incorporated competing speech from multiple locations. Simulations of array output were presented under conditions differing in array sparsity. The resulting human subject data were used to demonstrate the linear relationship (R(2) > 0.975) between speech-intelligibility-weighted FI (SII-FI) and the signal-to-noise ratio thresholds for 20% and 80% correct responses. Data indicate that the FI and SII-FI are robust singular metrics for determining the effectiveness of the array.
Collapse
|
38
|
|
39
|
Eddins DA. Hearing Aids. Second Edition, Harvey Dillon. Ear Hear 2013. [DOI: 10.1097/01.aud.0000436254.15629.5b] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
40
|
Eddins DA, Shrivastav R. Psychometric properties associated with perceived vocal roughness using a matching task. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2013; 134:EL294-300. [PMID: 24116533 PMCID: PMC3779263 DOI: 10.1121/1.4819183] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/02/2013] [Accepted: 08/09/2013] [Indexed: 05/19/2023]
Abstract
A psychophysical matching paradigm has been used to better quantify voice quality under laboratory conditions. The goals of this study were to establish which of two candidate comparison stimuli would best ensure that the range of perceived vocal roughness could be adequately bracketed using a matching task and to provide a general solution to the problem of estimating vocal roughness. Psychometric functions for roughness matching indicated that a speech-like sawtooth-plus-noise complex (20 dB signal-to-noise ratio) amplitude modulated by a sinusoidal function raised to the 4th power yielded a comparison stimulus with a perceptual dynamic range well suited for roughness matching.
Collapse
|
41
|
Zecker SG, Hoffman HJ, Frisina R, Dubno JR, Dhar S, Wallhagen M, Kraus N, Griffith JW, Walton JP, Eddins DA, Newman C, Victorson D, Warrier CM, Wilson RH. Audition assessment using the NIH Toolbox. Neurology 2013; 80:S45-8. [PMID: 23479544 DOI: 10.1212/wnl.0b013e3182872dd2] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
The NIH Toolbox project has assembled measurement tools to assess a wide range of human perception and ability across the lifespan. As part of this initiative, a small but comprehensive battery of auditory tests has been assembled. The main tool of this battery, pure-tone thresholds, measures the ability of people to hear at specific frequencies. Pure-tone thresholds have long been considered the "gold standard" of auditory testing, and are normally obtained in a clinical setting by highly trained audiologists. For the purposes of the Toolbox project, an automated procedure (NIH Toolbox Threshold Hearing Test) was developed that allows nonspecialists to administer the test reliably. Three supplemental auditory tests are also included in the Toolbox auditory test battery: assessment of middle-ear function (tympanometry), speech perception in noise (the NIH Toolbox Words-in-Noise Test), and self-assessment of hearing impairment (the NIH Toolbox Hearing Handicap Inventory Ages 18-64 and the NIH Toolbox Hearing Handicap Inventory Ages 64+). Tympanometry can help differentiate conductive from sensorineural pathology. The NIH Toolbox Words-in-Noise Test measures a listener's ability to perceive words in noisy situations. This ability is not necessarily predicted by a person's pure-tone thresholds; some people with normal hearing have difficulty extracting meaning from speech sounds heard in a noisy context. The NIH Toolbox Hearing Handicap Inventory focuses on how a person's perceived hearing status affects daily life. The test was constructed to include emotional and social/situational subscales, with specific questions about how hearing impairment may affect one's emotional state or limit participation in specific activities. The 4 auditory tests included in the Toolbox auditory test battery cover a range of auditory abilities and provide a snapshot of a participant's auditory capacity.
Collapse
|
42
|
Sabin AT, Clark CA, Eddins DA, Wright BA. Different patterns of perceptual learning on spectral modulation detection between older hearing-impaired and younger normal-hearing adults. J Assoc Res Otolaryngol 2012; 14:283-94. [PMID: 23229719 DOI: 10.1007/s10162-012-0363-y] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2012] [Accepted: 11/14/2012] [Indexed: 10/27/2022] Open
Abstract
Young adults with normal hearing (YNH) can improve their sensitivity to basic acoustic features with practice. However, it is not known to what extent the influence of the same training regimen differs between YNH listeners and older listeners with hearing impairment (OHI)--the largest population seeking treatment in audiology clinics. To examine this issue, we trained OHI listeners on a basic auditory task (spectral modulation detection) using a training regimen previously administered to YNH listeners (≈ 1 h/session for seven sessions on a single condition). For the trained conditions on which pretraining performance was not already at asymptote, the YNH listeners who received training learned more than matched controls who received none, but that learning did not generalize to any untrained spectral modulation frequency. In contrast, the OHI-trained listeners and controls learned similar amounts on the trained condition, implying no effect of the training itself. However, surprisingly the OHI-trained listeners improved over the training phase and on an untrained spectral modulation frequency. These population differences suggest that learning consolidated more slowly, and that training modified an aspect of processing that had broader tuning to spectral modulation frequency, in OHI than YNH listeners. More generally, these results demonstrate that conclusions about perceptual learning that come from examination of one population do not necessarily apply to another.
Collapse
|
43
|
Patel S, Shrivastav R, Eddins DA. Identifying a comparison for matching rough voice quality. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2012; 55:1407-22. [PMID: 22361106 PMCID: PMC3616748 DOI: 10.1044/1092-4388(2012/11-0160)] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/19/2023]
Abstract
PURPOSE Perceptual estimates of voice quality obtained using rating scales are subject to contextual biases that influence how individuals assign numbers to estimate the magnitude of vocal quality. Because rating scales are commonly used in clinical settings, assessments of voice quality are also subject to the limitations of these scales. Instead, a matching task can be used to obtain objective measures of voice quality, thereby facilitating model development and tools for clinical use. METHOD Twenty-seven individuals participated in a rating task or at least 1 of 3 matching tests (named after their modulation functions: SINE, SQUARE, POWER) to quantify the degree of roughness in dysphonic voice stimuli. Participants evaluated the roughness of 34 voice samples using an amplitude-modulated complex carrier. RESULTS The matching thresholds were highly correlated with the ratings estimates. Reliability of thresholds did not significantly differ across tasks, but linear regressions showed that the POWER test resulted in larger perceptual distances. CONCLUSIONS A matching task can be used to obtain reliable estimates of roughness in dysphonic voices. The POWER comparison is recommended because the variability in matching thresholds across the range of roughness was evenly distributed, and the perceptual distances between stimuli were maximized.
Collapse
|
44
|
Liu C, Eddins DA. Measurement of stop consonant identification using adaptive tracking procedures. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2012; 132:EL250-EL256. [PMID: 22979840 DOI: 10.1121/1.4747826] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/01/2023]
Abstract
The purpose of this study was to establish the potential utility of adaptive tracking procedures, relative to the method of constant stimuli, when examining consonant identification in cafeteria noise or four-talker babble. Thresholds for consonant identification were comparable for the two methods, while psychometric function slopes derived from the two methods were statistically different. To obtain accurate slope estimates from adaptive procedures, the target percent correct and change in variability with signal-to-noise ratio should be considered carefully. However, adaptive procedures are accurate and more efficient than the method of constant stimuli when the purpose is to estimate phoneme identification threshold.
Collapse
|
45
|
Patel S, Shrivastav R, Eddins DA. Developing a single comparison stimulus for matching breathy voice quality. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2012; 55:639-47. [PMID: 22215034 PMCID: PMC3612287 DOI: 10.1044/1092-4388(2011/10-0337)] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/19/2023]
Abstract
PURPOSE In this experiment, a single comparison stimulus was developed as a reference in a perceptual matching task for the quantification of breathy voice quality. Perceptual judgments of a set of synthetic voice samples were compared to previous data obtained using multiple comparison stimuli "customized" for different voices (Patel, Shrivastav, & Eddins, 2010). METHOD Five male and 5 female samples of the vowel /a/ were selected from the Kay Elemetrics Disordered Voice Database and resynthesized using a Klatt synthesizer. Eleven samples were created for each base voice by manipulating the aspiration noise level. Five samples from each continuum were evaluated in a perceptual matching task in which a single sawtooth and noise comparison stimulus was used to obtain breathiness judgments. Linear regression was used to compare measurements obtained using the new comparison stimulus against the customized comparison stimuli. RESULTS Results indicated that the noncustomized sawtooth comparison provides reliability and perceptual distances between stimuli similar to those obtained using customized comparison stimuli. CONCLUSION A single-variable matching task using a single comparison stimulus can be used to obtain perceptual estimates of breathiness across voices and experiments in a laboratory setting. This technique will help develop models of voice-quality perception.
Collapse
|
46
|
Shrivastav R, Eddins DA, Anand S. Pitch strength of normal and dysphonic voices. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2012; 131:2261-9. [PMID: 22423721 PMCID: PMC3316683 DOI: 10.1121/1.3681937] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/16/2023]
Abstract
Two sounds with the same pitch may vary from each other based on saliency of their pitch sensation. This perceptual attribute is called "pitch strength." The study of voice pitch strength may be important in quantifying of normal and pathological qualities. The present study investigated how pitch strength varies across normal and dysphonic voices. A set of voices (vowel /a/) selected from the Kay Elemetrics Disordered Voice Database served as the stimuli. These stimuli demonstrated a wide range of voice quality. Ten listeners judged the pitch strength of these stimuli in an anchored magnitude estimation task. On a given trial, listeners heard three different stimuli. The first stimulus represented very low pitch strength (wide-band noise), the second stimulus consisted of the target voice and the third stimulus represented very high pitch strength (pure tone). Listeners estimated pitch strength of the target voice by positioning a continuous slider labeled with values between 0 and 1, reflecting the two anchor stimuli. Results revealed that listeners can judge pitch strength reliably in dysphonic voices. Moderate to high correlations with perceptual judgments of voice quality suggest that pitch strength may contribute to voice quality judgments.
Collapse
|
47
|
Eddins DA, Liu C. Psychometric properties of the coordinate response measure corpus with various types of background interference. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2012; 131:EL177-EL183. [PMID: 22352619 PMCID: PMC3277602 DOI: 10.1121/1.3678680] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/16/2011] [Accepted: 12/29/2011] [Indexed: 05/29/2023]
Abstract
The coordinate response measure (CRM) corpus has gained broad acceptance as a research tool for investigating speech intelligibility in background competition and has been widely used in studies of informational masking. The purpose of this study is to establish the psychometric characteristics of CRM target-word identification in various backgrounds with the goal of being able to determine when it is appropriate or not to use adaptive threshold procedures with the CRM corpus. Target-word identification performance based on adaptive tracking mapped directly onto the monotonic psychometric functions obtained for two-talker, four-talker, and cafeteria noise interferers.
Collapse
|
48
|
Shrivastav R, Camacho A, Patel S, Eddins DA. A model for the prediction of breathiness in vowels. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2011; 129:1605-15. [PMID: 21428523 PMCID: PMC3077964 DOI: 10.1121/1.3543993] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/02/2010] [Revised: 10/29/2010] [Accepted: 12/29/2010] [Indexed: 05/19/2023]
Abstract
The perception of breathiness in vowels is cued by multiple acoustic cues, including changes in aspiration noise (AH) and the open quotient (OQ) [Klatt and Klatt, J. Acoust. Soc. Am. 87(2), 820-857 (1990)]. A loudness model can be used to determine the extent to which AH masks the harmonic components in voice. The resulting "partial loudness" (PL) and loudness of AH ["noise loudness" (NL)] have been shown to be good predictors of perceived breathiness [Shrivastav and Sapienza, J. Acoust. Soc. Am. 114(1), 2217-2224 (2003)]. The levels of AH and OQ were systematically manipulated for ten synthetic vowels. Perceptual judgments of breathiness were obtained and regression functions to predict breathiness from the ratio of NL to PL (η) were derived. Results show that breathiness can be modeled as a power function of η. The power parameter of this function appears to be affected by the fundamental frequency of the vowel. A second experiment was conducted to determine if the resulting power function could estimate breathiness in a different set of voices. The breathiness of these stimuli, both natural and synthetic, was determined in a listening test. The model estimates of breathiness were highly correlated with perceptual data but the absolute predicted values showed some discrepancies.
Collapse
|
49
|
Allen PD, Eddins DA. Presbycusis phenotypes form a heterogeneous continuum when ordered by degree and configuration of hearing loss. Hear Res 2010; 264:10-20. [PMID: 20144701 DOI: 10.1016/j.heares.2010.02.001] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/22/2009] [Revised: 02/01/2010] [Accepted: 02/01/2010] [Indexed: 10/19/2022]
Abstract
Many reports have documented age-by-frequency increases in average auditory thresholds in various human populations. Despite this, the prevalence of different patterns of hearing loss in presbycusis remains uncertain. We examined 'presbycusis phenotypes' in a database of 960 subjects (552 female, 408 male, 18-92 years) that each had 30 measures of peripheral hearing sensitivity: pure tone audiograms for left and right ears from 0.25 to 8 kHz and DPOAE for each ear with F(mean)=1-6.4 kHz. Surprisingly, the hearing phenotypes did not naturally separate into discrete classes of presbycusis. Principal component (PC) analysis revealed that two principal components account for 74% of the variance among the 30 measures of hearing. The two components represent the overall degree (PC1) and configuration of loss (Flat vs. Sloping; PC2) and the phenotypes form a continuum when plotted against them. A heuristic partitioning of this continuum produced classes of presbycusis that vary in their degree of Sloping or Flat hearing loss, suggesting that the previously reported sub-types of presbycusis arise from the categorical segregation of a continuous and heterogeneous distribution. Further, most phenotypes lie intermediate to the extremes of either Flat or Sloping loss, indicating that if audiometric configuration does predict presbycusis etiology, then a mixed origin is the most prevalent.
Collapse
|
50
|
Eddins DA, Hall JW. Binaural Processing and Auditory Asymmetries. THE AGING AUDITORY SYSTEM 2010. [DOI: 10.1007/978-1-4419-0993-0_6] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
|