151
Jackson HM, Moore BCJ. The dominant region for the pitch of complex tones with low fundamental frequencies. The Journal of the Acoustical Society of America 2013; 134:1193-1204. [PMID: 23927118 DOI: 10.1121/1.4812754] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Indexed: 06/02/2023]
Abstract
The dominant region for pitch for complex tones with low fundamental frequency (F0) was investigated. Thresholds for detection of a change in F0 (F0DLs) were measured for a group of harmonics (group B) embedded in a group of fixed non-overlapping harmonics (group A) with the same mean F0. It was assumed that F0DLs would be smallest when the harmonics in group B fell in the dominant region. The rank of the lowest harmonic in group B, N, was varied from 1 to 15. When all components had the same level, F0DLs increased with increasing N, but the increase started at a lower value of N for F0 = 200 Hz than for F0 = 50 or 100 Hz, the opposite of what would be expected if the dominant region corresponds to resolved harmonics. When the component levels followed an equal-loudness contour, F0DLs for F0 = 50 Hz were lowest for N = 1, but overall performance was much worse than for equal-level components, suggesting that the lowest harmonics were masking information from the higher harmonics.
152
Moore BCJ, Popelka GR. Preliminary comparison of bone-anchored hearing instruments and a dental device as treatments for unilateral hearing loss. Int J Audiol 2013; 52:678-86. [PMID: 23859058 DOI: 10.3109/14992027.2013.809483] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Indexed: 11/13/2022]
Abstract
OBJECTIVE: To compare the effectiveness of two types of treatment for unilateral hearing loss (UHL), bone-anchored hearing instruments (BAHI) and a dental device (SoundBite). DESIGN: Either BAHI or SoundBite was worn for 30 days, and then the devices were swapped and the second device was worn for 30 days. Measures included unaided and aided sound-field thresholds, sound localization, and perception of speech in babble. The APHAB questionnaire was administered for each trial period. STUDY SAMPLE: Nine adult BAHI wearers with UHL. RESULTS: Mid-frequency aided thresholds were lower for SoundBite than for BAHI. Both devices gave benefits for localization after 30 days, but there was no difference between devices. Speech perception was better for both devices than for unaided listening when the target speech came from the poorer-hearing side or in front, and the interfering babble came from the better-hearing side. There was no consistent difference between devices. APHAB scores were better for SoundBite than for BAHI. CONCLUSIONS: Speech perception and sound localization were similar for the two types of device, but the SoundBite led to lower aided thresholds and better APHAB scores than the BAHI.
153
Margolis RH, Eikelboom RH, Johnson C, Ginter SM, Swanepoel DW, Moore BCJ. False air-bone gaps at 4 kHz in listeners with normal hearing and sensorineural hearing loss. Int J Audiol 2013; 52:526-32. [DOI: 10.3109/14992027.2013.792437] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.5] [Indexed: 11/13/2022]
154
Chen J, Baer T, Moore BCJ. Effect of spectral change enhancement for the hearing impaired using parameter values selected with a genetic algorithm. The Journal of the Acoustical Society of America 2013; 133:2910-2920. [PMID: 23654396 DOI: 10.1121/1.4799807] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Indexed: 06/02/2023]
Abstract
Chen et al. [J. Acoust. Soc. Am. 131, 2987-2998 (2012)] evaluated the effectiveness of an algorithm for enhancing spectral changes over time in improving the intelligibility of speech in background sounds for hearing-impaired subjects. The processing improved intelligibility for speech in steady speech-spectrum noise (SSN) but tended to impair intelligibility in a background of two-talker speech (TTS). Large individual differences were found. The present study assessed whether the effectiveness of the algorithm was improved when the parameters that controlled the degree and type of enhancement were chosen individually for each subject, using a genetic algorithm based on subjective preferences for speech clarity. The parameter values selected by the genetic algorithm varied markedly across subjects. Speech intelligibility was measured for unprocessed stimuli and stimuli processed using the selected parameters, with SSN and TTS maskers and two signal-to-masker ratios (SMRs) for each subject. The intelligibility of speech in the SSN masker at the lower SMR was improved about 14 percentage points by the processing. The overall improvement produced by the processing was significantly larger than the improvement observed in the previous study when the parameter values were fixed across subjects, indicating that use of the genetic algorithm was beneficial.
155
Jackson HM, Moore BCJ. Contribution of temporal fine structure information and fundamental frequency separation to intelligibility in a competing-speaker paradigm. The Journal of the Acoustical Society of America 2013; 133:2421-2430. [PMID: 23556607 DOI: 10.1121/1.4792153] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Indexed: 06/02/2023]
Abstract
The speech reception threshold (SRT) for identifying a target speaker in a background speaker was measured as a function of the difference (F0sep) in fundamental frequency (F0) between the two speakers. The amount of original temporal fine structure (TFS) information in the mixed signals was manipulated by tone vocoding channels above a certain cutoff channel (CO). When the natural variations in F0 of both speakers were preserved, the SRT did not decrease with increasing F0sep, indicating that short-term differences in F0 can allow perceptual segregation of two speakers even when their F0s cross. When F0 variations were removed from both speakers, increasing F0sep led to decreased (better) SRTs. The decrease was greater for unprocessed signals than for fully tone-vocoded signals. However, the decrease was similar for unprocessed signals and for signals with original TFS below 1600 Hz, suggesting that most of the benefit from increasing F0 difference depends on the use of TFS information at lower frequencies. Adding original TFS information to channels centered above 1600 Hz produced roughly the same decrease in SRT as adding original TFS information to channels centered below 1600 Hz, suggesting a benefit from original TFS information apart from that related to differences in F0.
156
Léger AC, Moore BCJ, Lorenzi C. Abnormal speech processing in frequency regions where absolute thresholds are normal for listeners with high-frequency hearing loss. Hear Res 2012; 294:95-103. [PMID: 23104012 DOI: 10.1016/j.heares.2012.10.002] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.1] [Received: 07/04/2012] [Revised: 09/28/2012] [Accepted: 10/02/2012] [Indexed: 10/27/2022]
Abstract
The ability to understand speech in quiet and in a steady noise was measured for 26 listeners with audiometric thresholds below 30 dB HL for frequencies up to 3 kHz and covering a wide range (0-80 dB HL) between 3 and 8 kHz. The stimulus components were restricted to the low (≤1.5 kHz) and middle (1-3 kHz) frequency regions, where audiometric thresholds were classified clinically as normal or near-normal. Sensitivity to inter-aural phase was measured at 0.5 and 0.75 kHz, and otoacoustic emission and brainstem responses were also measured. For each frequency region, about half of the listeners with high-frequency hearing loss showed extremely poor intelligibility for speech in quiet and in noise. These deficits could not be accounted for by reduced audibility. Scores for speech in quiet were correlated with age, audiometric thresholds at low and at high frequencies, and the amplitude of transient otoacoustic emissions in the mid-frequency region, but not with inter-aural phase discrimination. The results suggest that large speech deficits may be observed in regions of normal or near-normal hearing for hearing-impaired listeners. They also suggest that speech deficits may result from suprathreshold auditory deficits caused by outer hair-cell damage and by factors associated with aging.
157
Moore BCJ, Vickers DA, Mehta A. The effects of age on temporal fine structure sensitivity in monaural and binaural conditions. Int J Audiol 2012; 51:715-21. [DOI: 10.3109/14992027.2012.690079] [Citation(s) in RCA: 53] [Impact Index Per Article: 4.4] [Indexed: 11/13/2022]
158
Moore BCJ, Ernst SMA. Frequency difference limens at high frequencies: evidence for a transition from a temporal to a place code. The Journal of the Acoustical Society of America 2012; 132:1542-7. [PMID: 22978883 DOI: 10.1121/1.4739444] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.8] [Indexed: 05/26/2023]
Abstract
It is commonly believed that difference limens for frequency (DLFs) for pure tones depend on a temporal mechanism (phase locking) for frequencies up to 4-5 kHz and a place mechanism at higher frequencies. The DLFs predicted from a place mechanism, expressed as a proportion of center frequency (Δf/f), should be approximately invariant with frequency at medium to high frequencies. If there is a transition from a temporal to a place mechanism, Δf/f should increase with increasing center frequency until the transition occurs, and then reach a plateau. Published data do not show such an effect. In this study, DLFs were measured for center frequencies from 2 to 14 kHz, using earphones designed to produce a flat response at the eardrum. The level of every tone was varied over a range of ±4 dB, to reduce loudness cues. The value of Δf/f increased progressively from 2 to 8 kHz, but did not change significantly for frequencies from 8 to 14 kHz. The results are consistent with the idea that there is a transition from a temporal to a place mechanism at about 8 kHz, rather than at 4-5 kHz, as is commonly assumed.
159
Moore BCJ. Effects of bandwidth, compression speed, and gain at high frequencies on preferences for amplified music. Trends Amplif 2012; 16:159-72. [PMID: 23172008 PMCID: PMC4040859 DOI: 10.1177/1084713812465494] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.8] [Indexed: 11/16/2022]
Abstract
This article reviews a series of studies on the factors influencing sound quality preferences, mostly for jazz and classical music stimuli. The data were obtained using ratings of individual stimuli or using the method of paired comparisons. For normal-hearing participants, the highest ratings of sound quality were obtained when the reproduction bandwidth was wide (55 to 16000 Hz) and ripples in the frequency response were small (less than ±5 dB). For hearing-impaired participants listening via a simulated five-channel compression hearing aid with gains set using the CAM2 fitting method, preferences for upper cutoff frequency varied across participants: Some preferred a 7.5- or 10-kHz upper cutoff frequency over a 5-kHz cutoff frequency, and some showed the opposite preference. Preferences for a higher upper cutoff frequency were associated with a shallow high-frequency slope of the audiogram. A subsequent study comparing the CAM2 and NAL-NL2 fitting methods, with gains slightly reduced for participants who were not experienced hearing aid users, showed a consistent preference for CAM2. Since the two methods differ mainly in the gain applied for frequencies above 4 kHz (CAM2 recommending higher gain than NAL-NL2), these results suggest that extending the upper cutoff frequency is beneficial. A system for reducing "overshoot" effects produced by compression gave small but significant benefits for sound quality of a percussion instrument (xylophone). For a high-input level (80 dB SPL), slow compression was preferred over fast compression.
160
Hopkins K, King A, Moore BCJ. The effect of compression speed on intelligibility: simulated hearing-aid processing with and without original temporal fine structure information. The Journal of the Acoustical Society of America 2012; 132:1592-1601. [PMID: 22978888 DOI: 10.1121/1.4742719] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Indexed: 06/01/2023]
Abstract
Hearing aids use amplitude compression to compensate for the effects of loudness recruitment. The compression speed that gives the best speech intelligibility varies among individuals. Moore [(2008). Trends Amplif. 12, 300-315] suggested that an individual's sensitivity to temporal fine structure (TFS) information may affect which compression speed gives most benefit. This hypothesis was tested using normal-hearing listeners with a simulated hearing loss. Sentences in a competing talker background were processed using multi-channel fast or slow compression followed by a simulation of threshold elevation and loudness recruitment. Signals were either tone vocoded with 1-ERB(N)-wide channels (where ERB(N) is the bandwidth of normal auditory filters) to remove the original TFS information, or not processed further. In a second experiment, signals were vocoded with either 1- or 2-ERB(N)-wide channels, to test whether the available spectral detail affects the optimal compression speed. Intelligibility was significantly better for fast than slow compression regardless of vocoder channel bandwidth. The results suggest that the availability of original TFS or detailed spectral information does not affect the optimal compression speed. This conclusion is tentative, since while the vocoder processing removed the original TFS information, listeners may have used the altered TFS in the vocoded signals.
161
Stone MA, Anton K, Moore BCJ. Use of high-rate envelope speech cues and their perceptually relevant dynamic range for the hearing impaired. The Journal of the Acoustical Society of America 2012; 132:1141-1151. [PMID: 22894233 DOI: 10.1121/1.4733543] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Indexed: 06/01/2023]
Abstract
The ability of hearing-impaired (HI) listeners to use high-rate envelope information in a competing-talker situation was assessed. In experiment 1, signals were tone vocoded and the cutoff frequency (f(c)) of the envelope extraction filter was either 50 Hz (E filter) or 200 Hz (P filter). The channels for which the P or E filter was used were varied. Intelligibility was higher with the P filter regardless of whether it was used for low or high center frequencies. Performance was best when the P filter was used for all channels. Experiment 2 explored the dynamic range over which HI listeners made use of high-rate cues. In each channel of a vocoder, the envelope extracted using f(c) = 16 Hz was replaced by the envelope extracted using f(c) = 300 Hz, either at the peaks or valleys, with a parametrically varied "switching threshold." For a target-to-background ratio of +5 dB, changes in speech intelligibility occurred mainly when the switching threshold was between -8 and +8 dB relative to the channel root-mean-square level. This range is similar in width to, but about 3 dB higher in absolute level than, that found for normal-hearing listeners, despite the reduced dynamic range of the HI listeners.
162
Aazh H, Moore BCJ, Prasher D. The accuracy of matching target insertion gains with open-fit hearing aids. Am J Audiol 2012; 21:175-80. [PMID: 22846638 DOI: 10.1044/1059-0889(2012/11-0008)] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.8] [Indexed: 11/09/2022] Open
Abstract
PURPOSE: To assess the accuracy with which target insertion gains were matched for a single type of open-fit hearing aid, both on initial fitting and after adjustment. METHOD: The hearing aids were fitted using the first-fit setting in the programming software and the target formula was selected as NAL-NL1. The difference between the real ear insertion gain (REIG) and the NAL-NL1 target REIG was recorded. The initial fitting was considered acceptable if the difference was less than 10 dB at all frequencies. If an initial fitting was not acceptable, the frequency-gain response was modified. The difference between the final REIG and the NAL-NL1 target REIG was recorded as final target mismatch. RESULTS: Of the 51 initial fittings, 36 (71%) failed to achieve a match within ±10 dB of the NAL-NL1 insertion gain target at 1 or more frequencies between 0.25 and 4 kHz. After the authors adjusted the frequency-gain response of the hearing aids, only 9 fittings (18%) failed to achieve a match. CONCLUSION: These outcomes suggest that target insertion gains for the open-fit hearing aids used here are rarely achieved with a first fitting but can usually be achieved through adjustments based on REIG measurements.
163
Füllgrabe C, Moore BCJ. Objective and subjective measures of pure-tone stream segregation based on interaural time differences. Hear Res 2012; 291:24-33. [PMID: 22771780 DOI: 10.1016/j.heares.2012.06.006] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.0] [Received: 01/21/2012] [Revised: 05/24/2012] [Accepted: 06/23/2012] [Indexed: 11/18/2022]
Abstract
The effect of interaural time differences (ITDs) on stream segregation for successive tone bursts was investigated. Obligatory stream segregation was inferred from the threshold for detecting a rhythmic irregularity in an otherwise isochronous sequence of interleaved "A" and "B" tones (task 1). Subjective stream segregation was evaluated by requiring listeners to indicate whether they heard one or two streams during presentation of a 30-s long sequence (task 2). The A and B tones had equal but opposite ITDs and had the same or different frequencies of 500 and/or 707 Hz. The ITDs ranged from 0 to 2 ms in study 1, and from 0 to 0.5 ms in study 2. Sensitivity on task 1 was poor in both studies when A and B had different frequencies, and was little affected by ITD. Thresholds for the same-frequency conditions worsened somewhat with increasing ITD up to 0.5 ms and then (for study 1) flattened off. There was a small increase in subjective streaming as the ITD was increased up to 0.5 ms, but little streaming for larger ITDs (study 1). We conclude that ITD, at most, has weak effects in producing obligatory and subjective stream segregation.
164
Wang J, Baer T, Glasberg BR, Stone MA, Ye D, Moore BCJ. Pitch perception of concurrent harmonic tones with overlapping spectra. The Journal of the Acoustical Society of America 2012; 132:339-356. [PMID: 22779482 DOI: 10.1121/1.4728165] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Indexed: 06/01/2023]
Abstract
Fundamental frequency difference limens (F0DLs) were measured for a target harmonic complex tone with nominal fundamental frequency (F0) of 200 Hz, in the presence and absence of a harmonic masker with overlapping spectrum. The F0 of the masker was 0, ±3, or ±6 semitones relative to 200 Hz. The stimuli were bandpass filtered into three regions: 0-1000 Hz (low, L), 1600-2400 Hz (medium, M), and 2800-3600 Hz (high, H), and a background noise was used to mask combination tones and to limit the audibility of components falling on the filter skirts. The components of the target or masker started either in cosine or random phase. Generally, the effect of F0 difference between target and masker was small. For the target alone, F0DLs were larger for random than cosine phase for region H. For the target plus masker, F0DLs were larger when the target had random phase than cosine phase for regions M and H. F0DLs increased with increasing center frequency of the bandpass filter. Modeling using excitation patterns and "summary autocorrelation" and "stabilized auditory image" models suggested that use of temporal fine structure information can account for the small F0DLs obtained when harmonics are barely, if at all, resolved.
165
Goldman SA, Baer T, Moore BCJ. Comodulation masking release: effects of training and experimental design on use of within- and across-channel cues. The Journal of the Acoustical Society of America 2012; 132:303-316. [PMID: 22779479 DOI: 10.1121/1.4726074] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Indexed: 06/01/2023]
Abstract
The effects of training and experimental design on comodulation masking release (CMR) were assessed. The study of Dau et al. [(2009), J. Acoust. Soc. Am. 125, 2182-2188], which used auditory-grouping manipulations to distinguish the use of within- and across-channel cues to CMR, was replicated in Experiment One but using naive subjects and an experimental design that minimized familiarization with the cues. Subjects made effective use of within- but not across-channel cues. Experiment Two examined training effects over more testing sessions, across four experimental designs (to minimize or maximize repeated exposure to the cues) and using an auditory grouping manipulation ("postcursors") to distinguish the use of within- and across-channel cues. Naive subjects were tested with either two or four flanking bands (FBs), to determine if training effects varied with the amount of FB information. Within-channel cues could be used from the outset, but effective use of across-channel cues required training when they were less salient. Increased repeated exposure enhanced the effects of training. Experiment Three tested naive subjects using two FBs, but with noise presented continuously and a different auditory grouping manipulation, after Grose et al. [(2009), J. Acoust. Soc. Am. 125, 282-293]. CMR was large from the outset.
166
Stone MA, Füllgrabe C, Moore BCJ. Notionally steady background noise acts primarily as a modulation masker of speech. The Journal of the Acoustical Society of America 2012; 132:317-26. [PMID: 22779480 DOI: 10.1121/1.4725766] [Citation(s) in RCA: 55] [Impact Index Per Article: 4.6] [Indexed: 05/16/2023]
Abstract
Stone et al. [J. Acoust. Soc. Am. 130, 2874-2881 (2011)], using vocoder processing, showed that the envelope modulations of a notionally steady noise were more effective than the envelope energy as a masker of speech. Here the same effect is demonstrated using non-vocoded signals. Speech was filtered into 28 channels. A masker centered on each channel was added to the channel signal at a target-to-background ratio of -5 or -10 dB. Maskers were sinusoids or noise bands with bandwidth 1/3 or 1 ERB(N) (ERB(N) being the bandwidth of "normal" auditory filters), synthesized with Gaussian (GN) or low-noise (LNN) statistics. To minimize peripheral interactions between maskers, odd-numbered channels were presented to one ear and even-numbered channels to the other. Speech intelligibility was assessed in the presence of each "steady" masker and that masker 100% sinusoidally amplitude modulated (SAM) at 8 Hz. Intelligibility decreased with increasing envelope fluctuation of the maskers. Masking release, the difference in intelligibility between the SAM masker and its "steady" counterpart, increased with bandwidth from near-zero to around 50 percentage points for the 1-ERB(N) GN. It is concluded that the sinusoidal and GN maskers behaved primarily as energetic and modulation maskers, respectively.
167
Schwartz JL, Grimault N, Hupé JM, Moore BCJ, Pressnitzer D. Multistability in perception: binding sensory modalities, an overview. Philos Trans R Soc Lond B Biol Sci 2012; 367:896-905. [PMID: 22371612 DOI: 10.1098/rstb.2011.0254] [Citation(s) in RCA: 70] [Impact Index Per Article: 5.8] [Indexed: 11/12/2022] Open
Abstract
This special issue presents research concerning multistable perception in different sensory modalities. Multistability occurs when a single physical stimulus produces alternations between different subjective percepts. Multistability was first described for vision, where it occurs, for example, when different stimuli are presented to the two eyes or for certain ambiguous figures. It has since been described for other sensory modalities, including audition, touch and olfaction. The key features of multistability are: (i) stimuli have more than one plausible perceptual organization; (ii) these organizations are not compatible with each other. We argue here that most if not all cases of multistability are based on competition in selecting and binding stimulus information. Binding refers to the process whereby the different attributes of objects in the environment, as represented in the sensory array, are bound together within our perceptual systems, to provide a coherent interpretation of the world around us. We argue that multistability can be used as a method for studying binding processes within and across sensory modalities. We emphasize this theme while presenting an outline of the papers in this issue. We end with some thoughts about open directions and avenues for further research.
168
Ernst SMA, Moore BCJ. The role of time and place cues in the detection of frequency modulation by hearing-impaired listeners. The Journal of the Acoustical Society of America 2012; 131:4722-31. [PMID: 22712945 DOI: 10.1121/1.3699233] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.5] [Indexed: 05/22/2023]
Abstract
Frequency modulation detection limens (FMDLs) were measured for five hearing-impaired (HI) subjects for carrier frequencies f(c) = 1000, 4000, and 6000 Hz, using modulation frequencies f(m) = 2 and 10 Hz and levels of 20 dB sensation level and 90 dB SPL. FMDLs were smaller for f(m) = 10 than for f(m) = 2 Hz for the two higher f(c), but not for f(c) = 1000 Hz. FMDLs were also determined with additional random amplitude modulation (AM), to disrupt excitation-pattern cues. The disruptive effect was larger for f(m) = 10 than for f(m) = 2 Hz. The smallest disruption occurred for f(m) = 2 Hz and f(c) = 1000 Hz. AM detection thresholds for normal-hearing and HI subjects were measured for the same f(c) and f(m) values. Performance was better for the HI subjects for both f(m). AM detection was much better for f(m) = 10 than for f(m) = 2 Hz. Additional tests showed that most HI subjects could discriminate temporal fine structure (TFS) at 800 Hz. The results are consistent with the idea that, for f(m) = 2 Hz and f(c) = 1000 Hz, frequency modulation (FM) detection was partly based on the use of TFS information. For higher carrier frequencies and for all carrier frequencies with f(m) = 10 Hz, FM detection was probably based on place cues.
169
Moore BCJ. Contributions of von Békésy to psychoacoustics. Hear Res 2012; 293:51-7. [PMID: 22561280 DOI: 10.1016/j.heares.2012.04.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/31/2012] [Revised: 03/30/2012] [Accepted: 04/12/2012] [Indexed: 10/28/2022]
Abstract
This paper reviews the contributions of von Békésy to psychoacoustics, comparing his findings and interpretations to those that have emerged since his work. The areas covered include the perception of pitch for pure tones and complex tones, the effect of frequency on the apparent location of pure tones, estimation of the velocity of the traveling wave on the basilar membrane using judgments of lateralization, and the relative loudness of monaural and diotic sounds. While subsequent research has failed to replicate some of his findings, other findings have stood the test of time. There is no doubt that von Békésy made very substantial contributions to psychoacoustic research.
170
Léger AC, Moore BCJ, Gnansia D, Lorenzi C. Effects of spectral smearing on the identification of speech in noise filtered into low- and mid-frequency regions. The Journal of the Acoustical Society of America 2012; 131:4114-4123. [PMID: 22559383 DOI: 10.1121/1.3699265] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Indexed: 05/28/2023]
Abstract
Léger et al. [J. Acoust. Soc. Am. 131, 1502-1514 (2012)] reported deficits in the identification of consonants in noise by hearing-impaired listeners using stimuli filtered into low- or mid-frequency regions in which audiometric thresholds were normal or near-normal. The deficits could not be fully explained in terms of reduced audibility or temporal-envelope processing. However, previous studies indicate that the listeners may have had reduced frequency selectivity, with auditory filters broadened by a factor of about 1.3, despite having normal or near-normal audiometric thresholds in the tested regions. The present study aimed to determine whether the speech-perception deficits could be explained by such a small reduction of frequency selectivity. Consonant identification was measured for normal-hearing listeners in quiet and in unmodulated and modulated noises using the same method as Léger et al. The signal-to-noise ratio was set to -3 dB for the masked conditions. Various amounts of reduced frequency selectivity were simulated using a spectral-smearing algorithm. Performance was reduced only for spectral-smearing factors greater than 1.7. For all conditions, identification scores for hearing-impaired listeners could not be explained by a mild reduction of frequency selectivity.
171
Abstract
A sequence of sounds may be heard as coming from a single source (called fusion or coherence) or from two or more sources (called fission or stream segregation). Each perceived source is called a 'stream'. When the differences between successive sounds are very large, fission nearly always occurs, whereas when the differences are very small, fusion nearly always occurs. When the differences are intermediate in size, the percept often 'flips' between one stream and multiple streams, a property called 'bistability'. The flips do not generally occur regularly in time. The tendency to hear two streams builds up over time, but can be partially or completely reset by a sudden change in the properties of the sequence or by switches in attention. Stream formation depends partly on the extent to which successive sounds excite different 'channels' in the peripheral auditory system. However, other factors can play a strong role; multiple streams may be heard when successive sounds are presented to the same ear and have essentially identical excitation patterns in the cochlea. Differences between successive sounds in temporal envelope, fundamental frequency, phase spectrum and lateralization can all induce a percept of multiple streams. Regularities in the temporal pattern of elements within a stream can help in stabilizing that stream.
|
172
|
Chen J, Li H, Li L, Wu X, Moore BCJ. Informational masking of speech produced by speech-like sounds without linguistic content. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2012; 131:2914-26. [PMID: 22501069 DOI: 10.1121/1.3688510] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Indexed: 05/22/2023]
Abstract
This study investigated whether speech-like maskers without linguistic content produce informational masking of speech. The target stimuli were nonsense Chinese Mandarin sentences. In experiment I, the masker contained harmonics the fundamental frequency (F0) of which was sinusoidally modulated and the mean F0 of which was varied. The magnitude of informational masking was evaluated by measuring the change in intelligibility (releasing effect) produced by inducing a perceived spatial separation of the target speech and masker via the precedence effect. The releasing effect was small and was only clear when the target and masker had the same mean F0, suggesting that informational masking was small. Performance with the harmonic maskers was better than with a steady speech-shaped noise (SSN) masker. In experiments II and III, the maskers were speech-like synthesized signals, alternating between segments with harmonic structure and segments composed of SSN. Performance was much worse than for experiment I, and worse than when an SSN masker was used, suggesting that substantial informational masking occurred. The similarity of the F0 contours of the target and masker had little effect. The informational masking effect was not influenced by whether or not the noise-like segments of the masker were synchronous with the unvoiced segments of the target speech.
|
173
|
Chen J, Baer T, Moore BCJ. Effect of enhancement of spectral changes on speech intelligibility and clarity preferences for the hearing impaired. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2012; 131:2987-2998. [PMID: 22501075 DOI: 10.1121/1.3689556] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Indexed: 05/31/2023]
Abstract
Most information in speech is carried in spectral changes over time, rather than in static spectral shape per se. A form of signal processing aimed at enhancing spectral changes over time was developed and evaluated using hearing-impaired listeners. The signal processing was based on the overlap-add method, and the degree and type of enhancement could be manipulated via four parameters. Two experiments were conducted to assess speech intelligibility and clarity preferences. Three sets of parameter values (one corresponding to a control condition), two types of masker (steady speech-spectrum noise and two-talker speech) and two signal-to-masker ratios (SMRs) were used for each masker type. Generally, the effects of the processing were small, although intelligibility was improved by about 8 percentage points relative to the control condition for one set of parameter values using the steady noise masker at -6 dB SMR. The processed signals were not preferred over those for the control condition, except for the steady noise masker at -6 dB SMR. Further work is needed to determine whether tailoring the processing to the characteristics of the individual hearing-impaired listener is beneficial.
|
174
|
Neher T, Lunner T, Hopkins K, Moore BCJ. Binaural temporal fine structure sensitivity, cognitive function, and spatial speech recognition of hearing-impaired listeners (L). THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2012; 131:2561-4. [PMID: 22501036 DOI: 10.1121/1.3689850] [Citation(s) in RCA: 60] [Impact Index Per Article: 5.0] [Indexed: 05/26/2023]
Abstract
The relationships between spatial speech recognition (SSR; the ability to understand speech in complex spatial environments), binaural temporal fine structure (TFS) sensitivity, and three cognitive tasks were assessed for 17 hearing-impaired listeners. Correlations were observed between SSR, TFS sensitivity, and two of the three cognitive tasks, which became non-significant when age effects were controlled for, suggesting that reduced TFS sensitivity and certain cognitive deficits may share a common age-related cause. The third cognitive measure was also significantly correlated with SSR, but not with TFS sensitivity or age, suggesting an independent non-age-related cause.
|
175
|
Moore BCJ, Glasberg BR, Oxenham AJ. Effects of pulsing of a target tone on the ability to hear it out in different types of complex sounds. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2012; 131:2927-2937. [PMID: 22501070 PMCID: PMC3543369 DOI: 10.1121/1.3692243] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Received: 08/18/2011] [Revised: 02/09/2012] [Accepted: 02/14/2012] [Indexed: 05/28/2023]
Abstract
Judgments of whether a sinusoidal probe is higher or lower in frequency than the closest partial ("target") in a multi-partial complex are improved when the target is pulsed on and off. These experiments explored the contribution of reduction in perceptual confusion and recovery from adaptation to this effect. In experiment 1, all partials except the target were replaced by noise to reduce perceptual confusion. Performance was much better than when the background was composed of multiple partials. When the level of the target was reduced to avoid ceiling effects, no effect of pulsing the target occurred. In experiment 2, the target and background partials were irregularly and independently amplitude modulated. This gave a large effect of pulsing the target, suggesting that if recovery from adaptation contributes to the effect, amplitude fluctuations do not prevent this. In experiment 3, the background was composed of multiple steady partials, but the target was irregularly amplitude modulated. This gave better performance than when the target was unmodulated and a moderate effect of pulsing the target. It is argued that when the target and background are steady tones, pulsing the target may result both in reduction of perceptual confusion and recovery from adaptation.
|