Cited by Other Article(s)

Saba JN, Hansen JHL. The effects of Lombard perturbation on speech intelligibility in noise for normal hearing and cochlear implant listeners. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2022;151:1007. [PMID: 35232065 PMCID: PMC8849642 DOI: 10.1121/10.0009377] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 09/13/2021] [Revised: 01/09/2022] [Accepted: 01/09/2022] [Indexed: 06/02/2023]

Kelly F, Hansen JHL. Analysis and Calibration of Lombard Effect and Whisper for Speaker Recognition. IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING 2021;29:927-942. [PMID: 35783572 PMCID: PMC9245507 DOI: 10.1109/taslp.2021.3053388] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Indexed: 06/02/2023]

Conversation in small groups: Speaking and listening strategies depend on the complexities of the environment and group. Psychon Bull Rev 2020;28:632-640. [PMID: 33051825 PMCID: PMC8062389 DOI: 10.3758/s13423-020-01821-9] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Accepted: 09/22/2020] [Indexed: 11/29/2022]

Abstract

Many conversations in our day-to-day lives are held in noisy environments – impeding comprehension, and in groups – taxing auditory attention-switching processes. These situations are particularly challenging for older adults in cognitive and sensory decline. In noisy environments, a variety of extra-linguistic strategies are available to speakers and listeners to facilitate communication, but while models of language account for the impact of context on word choice, there has been little consideration of the impact of context on extra-linguistic behaviour. To address this issue, we investigate how the complexity of the acoustic environment and interaction situation impacts extra-linguistic conversation behaviour of older adults during face-to-face conversations. Specifically, we test whether the use of intelligibility-optimising strategies increases with complexity of the background noise (from quiet to loud, and in speech-shaped vs. babble noise), and with complexity of the conversing group (dyad vs. triad). While some communication strategies are enhanced in more complex background noise, with listeners orienting to talkers more optimally and moving closer to their partner in babble than speech-shaped noise, this is not the case with all strategies, as we find greater vocal level increases in the less complex speech-shaped noise condition. Other behaviours are enhanced in the more complex interaction situation, with listeners using more optimal head orientations, and taking longer turns when gaining the floor in triads compared to dyads. This study elucidates how different features of the conversation context impact individuals’ communication strategies, which is necessary to both develop a comprehensive cognitive model of multimodal conversation behaviour, and effectively support individuals that struggle conversing.

Understanding Lombard speech: a review of compensation techniques towards improving speech based recognition systems. Artif Intell Rev 2020. [DOI: 10.1007/s10462-020-09907-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Indexed: 10/23/2022]

Hansen JHL, Bokshi M, Khorram S. Speech variability: A cross-language study on acoustic variations of speaking versus untrained singing. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2020;148:829. [PMID: 32873043 PMCID: PMC7438159 DOI: 10.1121/10.0001526] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/25/2019] [Revised: 06/17/2020] [Accepted: 06/19/2020] [Indexed: 06/11/2023]

Abstract

Speech production variability introduces significant challenges for existing speech technologies such as speaker identification (SID), speaker diarization, speech recognition, and language identification (ID). There has been limited research analyzing changes in acoustic characteristics for speech produced by untrained singing versus speaking. To better understand changes in speech production of the untrained singing voice, this study presents the first cross-language comparison between normal speaking and untrained karaoke singing of the same text content. Previous studies comparing professional singing versus speaking have shown deviations in both prosodic and spectral features. Some investigations also considered assigning the intrinsic activity of the singing. Motivated by these studies, a series of experiments to investigate both prosodic and spectral variations of untrained karaoke singers for three languages, American English, Hindi, and Farsi, are considered. A comprehensive comparison on common prosodic features, including phoneme duration, mean fundamental frequency (F0), and formant center frequencies of vowels was performed. Collective changes in the corresponding overall acoustic spaces based on the Kullback-Leibler distance using Gaussian probability distribution models trained on spectral features were analyzed. Finally, these models were used in a Gaussian mixture model with universal background model SID evaluation to quantify speaker changes between speaking and singing when the audio text content is the same. The experiments showed that many acoustic characteristics of untrained singing are considerably different from speaking when the text content is the same. It is suggested that these results would help advance automatic speech production normalization/compensation to improve performance of speech processing applications (e.g., speaker ID, speech recognition, and language ID).

Whittico TH, Ortiz AJ, Marks KL, Toles LE, Van Stan JH, Hillman RE, Mehta DD. Ambulatory monitoring of Lombard-related vocal characteristics in vocally healthy female speakers. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2020;147:EL552. [PMID: 32611177 PMCID: PMC7316514 DOI: 10.1121/10.0001446] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Indexed: 05/12/2023]

Hansen JHL, Lee J, Ali H, Saba JN. A speech perturbation strategy based on "Lombard effect" for enhanced intelligibility for cochlear implant listeners. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2020;147:1418. [PMID: 32237802 PMCID: PMC7054124 DOI: 10.1121/10.0000690] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Received: 07/02/2019] [Revised: 12/09/2019] [Accepted: 01/21/2020] [Indexed: 06/02/2023]

Chennupati N, Kadiri SR, B. Y. Spectral and temporal manipulations of SFF envelopes for enhancement of speech intelligibility in noise. COMPUT SPEECH LANG 2019. [DOI: 10.1016/j.csl.2018.09.002] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Indexed: 10/28/2022]

Lee J, Ali H, Ziaei A, Tobey EA, Hansen JHL. The Lombard effect observed in speech produced by cochlear implant users in noisy environments: A naturalistic study. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2017;141:2788. [PMID: 28464686 PMCID: PMC5398925 DOI: 10.1121/1.4979927] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Received: 08/28/2015] [Revised: 03/25/2017] [Accepted: 03/27/2017] [Indexed: 06/02/2023]

Hansen JHL, Nandwana MK, Shokouhi N. Analysis of human scream and its impact on text-independent speaker verification. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2017;141:2957. [PMID: 28464689 DOI: 10.1121/1.4979337] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Indexed: 06/07/2023]

Bouserhal RE, Macdonald EN, Falk TH, Voix J. Variations in voice level and fundamental frequency with changing background noise level and talker-to-listener distance while wearing hearing protectors: A pilot study. Int J Audiol 2016;55 Suppl 1:S13-20. [DOI: 10.3109/14992027.2015.1122240] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Indexed: 11/13/2022]

Šimko J, Beňuš Š, Vainio M. Hyperarticulation in Lombard speech: Global coordination of the jaw, lips and the tongue. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2016;139:151-62. [PMID: 26827013 DOI: 10.1121/1.4939495] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Indexed: 05/15/2023]

Poblete V, Espic F, King S, Stern RM, Huenupán F, Fredes J, Yoma NB. A perceptually-motivated low-complexity instantaneous linear channel normalization technique applied to speaker verification. COMPUT SPEECH LANG 2015. [DOI: 10.1016/j.csl.2014.10.006] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Indexed: 11/28/2022]

An adaptive post-filtering method producing an artificial Lombard-like effect for intelligibility enhancement of narrowband telephone speech. COMPUT SPEECH LANG 2014. [DOI: 10.1016/j.csl.2013.03.005] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Indexed: 11/22/2022]

Pohjalainen J, Raitio T, Yrttiaho S, Alku P. Detection of shouted speech in noise: human and machine. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2013;133:2377-2389. [PMID: 23556603 DOI: 10.1121/1.4794394] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Indexed: 06/02/2023]

Cooke M, Lu Y. Spectral and temporal changes to speech produced in the presence of energetic and informational maskers. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2010;128:2059-2069. [PMID: 20968376 DOI: 10.1121/1.3478775] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.5] [Indexed: 05/30/2023]

Boril H, Hansen JHL. Unsupervised Equalization of Lombard Effect for Speech Recognition in Noisy Adverse Environments. IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING 2010. [DOI: 10.1109/tasl.2009.2034770] [Citation(s) in RCA: 49] [Impact Index Per Article: 3.5] [Indexed: 11/07/2022]