1
|
Gutz SE, Maffei MF, Green JR. Feedback From Automatic Speech Recognition to Elicit Clear Speech in Healthy Speakers. AMERICAN JOURNAL OF SPEECH-LANGUAGE PATHOLOGY 2023; 32:2940-2959. [PMID: 37824377 PMCID: PMC10721250 DOI: 10.1044/2023_ajslp-23-00030] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/25/2023] [Revised: 04/10/2023] [Accepted: 08/01/2023] [Indexed: 10/14/2023]
Abstract
PURPOSE This study assessed the effectiveness of feedback generated by automatic speech recognition (ASR) for eliciting clear speech from young, healthy individuals. As a preliminary step toward exploring a novel method for eliciting clear speech in patients with dysarthria, we investigated the effects of ASR feedback in healthy controls. If successful, ASR feedback has the potential to facilitate independent, at-home clear speech practice. METHOD Twenty-three healthy control speakers (ages 23-40 years) read sentences aloud in three speaking modes: Habitual, Clear (over-enunciated), and in response to ASR feedback (ASR). In the ASR condition, we used Mozilla DeepSpeech to transcribe speech samples and provide participants with a value indicating the accuracy of the ASR's transcription. For speakers who achieved sufficiently high ASR accuracy, noise was added to their speech at a participant-specific signal-to-noise ratio to ensure that each participant had to over-enunciate to achieve high ASR accuracy. RESULTS Compared to habitual speech, speech produced in the ASR and Clear conditions was clearer, as rated by speech-language pathologists, and more intelligible, per speech-language pathologist transcriptions. Speech in the Clear and ASR conditions aligned on several acoustic measures, particularly those associated with increased vowel distinctiveness and decreased speaking rate. However, ASR accuracy, intelligibility, and clarity were each correlated with different speech features, which may have implications for how people change their speech for ASR feedback. CONCLUSIONS ASR successfully elicited outcomes similar to clear speech in healthy speakers. Future work should investigate its efficacy in eliciting clear speech in people with dysarthria.
Collapse
Affiliation(s)
- Sarah E. Gutz
- Department of Communication Sciences and Disorders, MGH Institute of Health Professions, Boston, MA
- Program in Speech and Hearing Bioscience and Technology, Harvard University, Cambridge, MA
| | - Marc F. Maffei
- Department of Communication Sciences and Disorders, MGH Institute of Health Professions, Boston, MA
| | - Jordan R. Green
- Department of Communication Sciences and Disorders, MGH Institute of Health Professions, Boston, MA
- Program in Speech and Hearing Bioscience and Technology, Harvard University, Cambridge, MA
| |
Collapse
|
2
|
Pascual-Soriano A, Cervera Mérida JF. Tratamiento de la nasalidad transpalatal en un caso complejo con trastorno fonológico. REVISTA DE INVESTIGACIÓN EN LOGOPEDIA 2022. [DOI: 10.5209/rlog.72880] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Se presenta el proceso de evaluación e intervención logopédica en una niña de 9 años con hipernasalidad y dificultades fonológicas perceptivas y expresivas. Se evalúo el nivel fonético y fonológico del lenguaje, el timbre nasal mediante nasometría, la integridad funcional de las estructuras velopalatofaríngeas y la audición. La paciente pronunciaba los sonidos [d], [l] y [t] muy distorsionados y con características de [n], al tiempo que no era capaz de distinguirlos de la consonante nasal. La configuración de su cavidad oral y su base articulatoria idiosincrática favorecía el fenómeno de nasalidad transpalatal: presencia excesiva de resonancia nasal durante la emisión de sonidos no nasales en ausencia de disfunción velofaríngea. Para el abordaje de la hipernasalidad se aplicó un tratamiento intensivo con la herramienta de biofeedback visual Nasometer II, como recomienda la literatura existente para los casos de nasalidad de origen funcional. El tratamiento fue eficaz para el control voluntario de la reducción de la hiperrinolalia, pero no fue efectivo debido a que la paciente no fue capaz de reducir el timbre nasal en el uso espontáneo del habla.
Collapse
|
3
|
Pozos Radillo BE, Horta Botello E, Plascencia Campos AR, Preciado Serrano MDL. Efecto del biofeedback en la satisfacción laboral en trabajadores de la industria joyera en México. Rev Salud Publica (Bogota) 2021. [DOI: 10.15446/rsap.v23n5.87557] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022] Open
Abstract
Objetivo Determinar cuál es el efecto que tiene la técnica de biofeedback de la respiración en la satisfacción laboral de los trabajadores de una industria joyera en Guadalajara, México.
Materiales y Métodos La investigación fue un estudio cuasiexperimental pre y postest en grupos aleatorizados en tres fases: diagnóstico, intervención y evaluación. Para los resultados se utilizó el ANOVA, durante el año 2020.
Resultados Los resultados muestran las diferencias significativas en el aumento de la motivación entre el grupo experimental y control en el postest, no así para la dimensión de satisfacción laboral y la dimensión de jerarquía.
Conclusión La aplicación de la técnica del biofeedback de la respiración mostró solo ser eficiente en la dimensión de la motivación de la satisfacción laboral, aunque con un efecto bajo en el personal de una empresa joyera, por lo que es necesario continuar buscando herramientas que puedan aumentar la satisfacción laboral en todas sus dimensiones.
Collapse
|
4
|
Yang X, Pratama GB, Choi Y, You H, Tâm NPM, Kim GW, Jo YJ, Ko MH. Measurement of Nasalance Scores Without Touching the Philtrum for Better Comfort During Speech Assessment and Therapy: A Preliminary Study. Cleft Palate Craniofac J 2020; 58:446-454. [PMID: 32924600 DOI: 10.1177/1055665620953340] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
OBJECTIVE The Kay Pentax nasometer uses a separator plate that touches the philtrum of a patient to separate the nasal and oral sound energies for nasalance measurement. However, the separator plate can restrict the natural movement of the patient's upper lip and generate unpleasant pressure on the patient's philtrum. The present study was intended to measure nasalance scores without touching the philtrum for better comfort during speech assessment and therapy. METHODS Nasalance scores of 10 males and 10 females having no speech disorders were measured under 4 levels (0, 5, 10, and 15 mm) of the gap between the plate and the philtrum (denoted as plate-to-philtrum gap) using Nasometer II 6450 for nasal (Nasal Sentences) and oral (Zoo Passage) stimuli. Regression formulas were established to examine the relationships between nasalance score and plate-to-philtrum gap for the stimuli. To provide nasalance scores equivalent to those measured for the contact condition, compensation factors for the 5 mm plate-to-philtrum gap measurement condition were identified for the stimuli. RESULTS The nasalance scores were significantly different between the 4 different plate-to-philtrum gaps for the stimuli. Compensation factors for the Nasal Sentences and the Zoo Passage were identified as 1.17 and 0.71, respectively. CONCLUSIONS The 5 mm plate-to-philtrum gap condition after multiplying the compensation factors can provide equivalent nasalance scores to the conventional contact measurement condition which may provide better comfort in speech assessment and therapy.
Collapse
Affiliation(s)
- Xiaopeng Yang
- School of Artificial Intelligence and Computer Science, 66374Jiangnan University, Wuxi, PR China
| | - Gradiyan Budi Pratama
- Department of Industrial Engineering, 89224Bandung Institute of Technology, Bandung, Indonesia
| | - Younggeun Choi
- Department of Industrial and Management Engineering, 34995Pohang University of Science and Technology, Pohang, Republic of Korea
| | - Heecheon You
- Department of Industrial and Management Engineering, 34995Pohang University of Science and Technology, Pohang, Republic of Korea
| | - Nguyễn Phu'ò'c Minh Tâm
- Department of Medicine, 90158Chonbuk National University Medical School, Jeonju, Republic of Korea
| | - Gi-Wook Kim
- Department of Physical Medicine and Rehabilitation, 90158Chonbuk National University Medical School, Jeonju, Republic of Korea.,Research Institute of Clinical Medicine of Chonbuk National University-Biomedical Research Institute of 65377Chonbuk National University Hospital, Jeonju, Republic of Korea
| | - Yun-Ju Jo
- Research Institute of Clinical Medicine of Chonbuk National University-Biomedical Research Institute of 65377Chonbuk National University Hospital, Jeonju, Republic of Korea
| | - Myoung-Hwan Ko
- Department of Physical Medicine and Rehabilitation, 90158Chonbuk National University Medical School, Jeonju, Republic of Korea.,Research Institute of Clinical Medicine of Chonbuk National University-Biomedical Research Institute of 65377Chonbuk National University Hospital, Jeonju, Republic of Korea
| |
Collapse
|
5
|
Srinivas V, Bressmann T. Effects of Knowledge of Task on Control of Oral-Nasal Balance in Speech. Folia Phoniatr Logop 2019; 73:15-21. [DOI: 10.1159/000503861] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2019] [Accepted: 10/01/2019] [Indexed: 11/19/2022] Open
Abstract
<b><i>Introduction:</i></b> Previous research has shown that altering the nasal signal level auditory feedback changed the control of oral-nasal balance in normal speakers. The present study investigated whether knowledge of the task and the instruction not to compensate would change the participants’ response to the manipulation. <b><i>Methods:</i></b> Twenty participants (10 females) in 2 groups continuously repeated a sentence while their nasal signal level was increased or decreased and fed back to them via headphones, so the speakers heard themselves as more or less nasal, respectively. After the first recording session, participants were debriefed about the true nature of the experiment. They were instructed not to compensate in the second recording session. The outcome measures were the percentage changes of nasalance scores from the first baseline. <b><i>Results:</i></b> Statistical analysis using a repeated measures analysis of variance showed an effect of the nasal signal level, <i>F</i>(5,80) = 2.51, <i>p</i> = 0.049, and a nasal signal level by knowledge of task interaction effect, <i>F</i>(5,80) = 3.25, <i>p</i> = 0.019. Post hoc tests showed that the maximum nasal signal level auditory feedback resulted in a significant decrease of nasality from the initial baseline. <b><i>Conclusion:</i></b> Despite knowledge of the task, speakers were unable to resist compensating. As found in previous research, there was a numerically higher compensation response at the maximum than at the minimum nasal signal level auditory feedback condition.
Collapse
|
6
|
Lo ESC, Wong AWK, Tse ACY, Ma EPM, Whitehill TL, Masters R. Effects of Error Experience on Learning to Lower Speech Nasalance Level. AMERICAN JOURNAL OF SPEECH-LANGUAGE PATHOLOGY 2019; 28:448-455. [PMID: 31136230 DOI: 10.1044/2018_ajslp-18-0033] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
Purpose This research aims to examine the effects of error experience when learning to speak with lowered nasalance level. Method A total of 45 typical speakers were instructed to learn to lower speech nasalance level in either an errorless (restricted possibility for committing errors) or an errorful (unrestricted possibility for committing errors) learning condition. The nasality level of the participants' speech was measured by a nasometer and quantified by nasalance scores (in percent). Errorless learners practiced producing speech with lowered nasalance level with a threshold nasalance score of 50% (the easiest target) at the beginning, which gradually decreased to a threshold of 10% (the most difficult target) at the end. The same set of threshold targets was presented to errorful learners, but in reverse order. Errors were defined by the proportion of speech, with a nasalance score exceeding the threshold. Retention and transfer tests were administered. Results Errorless learners displayed fewer errors and lower mean nasalance scores than errorful learners during the acquisition phase. Furthermore, errorless learners achieved lower mean nasalance scores than errorful learners in the retention and transfer tests. Conclusion These results suggest that errorless learning is more effective than errorful learning and that error experience has a detrimental effect on the acquisition of a novel speech motor task that requires minimization of the nasality level. Errorless learning may be a useful paradigm for the intervention and management of hypernasality in clinical settings where behavioral treatments are needed.
Collapse
Affiliation(s)
- Eric Siu-Chung Lo
- Nam Shan Psychology Laboratory, Department of Social and Behavioural Sciences, City University of Hong Kong, Hong Kong SAR, China
| | - Andus Wing-Kuen Wong
- Nam Shan Psychology Laboratory, Department of Social and Behavioural Sciences, City University of Hong Kong, Hong Kong SAR, China
| | - Andy Choi-Yeung Tse
- Department of Health and Physical Education, The Education University of Hong Kong, Hong Kong SAR, China
| | - Estella Pui-Man Ma
- Division of Speech and Hearing Sciences, University of Hong Kong, Hong Kong SAR, China
| | - Tara L Whitehill
- Division of Speech and Hearing Sciences, University of Hong Kong, Hong Kong SAR, China
| | - Rich Masters
- Te Huataki Waiora Faculty of Health, Sport and Human Performance, University of Waikato, Hamilton, New Zealand
| |
Collapse
|
7
|
Cler GJ, Mittelman T, Braden MN, Woodnorth GH, Stepp CE. Video Game Rehabilitation of Velopharyngeal Dysfunction: A Case Series. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2017; 60:1800-1809. [PMID: 28655049 PMCID: PMC5544406 DOI: 10.1044/2017_jslhr-s-16-0231] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/14/2016] [Revised: 10/12/2016] [Accepted: 11/22/2016] [Indexed: 06/07/2023]
Abstract
PURPOSE Video games provide a promising platform for rehabilitation of speech disorders. Although video games have been used to train speech perception in foreign language learners and have been proposed for aural rehabilitation, their use in speech therapy has been limited thus far. We present feasibility results from at-home use in a case series of children with velopharyngeal dysfunction (VPD) using an interactive video game that provided real-time biofeedback to facilitate appropriate nasalization. METHOD Five participants were recruited across a range of ages, VPD severities, and VPD etiologies. Participants completed multiple weeks of individual game play with a video game that provides feedback on nasalization measured via nasal accelerometry. Nasalization was assessed before and after training by using nasometry, aerodynamic measures, and expert perceptual judgments. RESULTS Four participants used the game at home or school, with the remaining participant unwilling to have the nasal accelerometer secured to his nasal skin, perhaps due to his young age. The remaining participants showed a tendency toward decreased nasalization after training, particularly for the words explicitly trained in the video game. CONCLUSION Results suggest that video game-based systems may provide a useful rehabilitation platform for providing real-time feedback of speech nasalization in VPD. SUPPLEMENTAL MATERIAL https://doi.org/10.23641/asha.5116828.
Collapse
Affiliation(s)
- Gabriel J. Cler
- Graduate Program for Neuroscience–Computational Neuroscience, Boston University, MA
- Department of Speech, Language, and Hearing Sciences, Boston University, MA
| | - Talia Mittelman
- Department of Speech, Language, and Hearing Sciences, Boston University, MA
- Department of Biomedical Engineering, Boston University, MA
| | - Maia N. Braden
- Department of Surgery, Division of Otolaryngology, Voice and Swallow Clinics, University of Wisconsin, Madison
| | - Geralyn Harvey Woodnorth
- Speech-Language Pathology Program, Otolaryngology and Communication Enhancement, Boston Children's Hospital, MA
| | - Cara E. Stepp
- Graduate Program for Neuroscience–Computational Neuroscience, Boston University, MA
- Department of Speech, Language, and Hearing Sciences, Boston University, MA
- Department of Biomedical Engineering, Boston University, MA
- Department of Otolaryngology–Head and Neck Surgery, Boston University School of Medicine, MA
| |
Collapse
|