1
|
Lee G, Connor CW. "Alexa, Cycle The Blood Pressure": A Voice Control Interface Method for Anesthesia Monitoring. Anesth Analg 2024; 139:639-646. [PMID: 39008420 PMCID: PMC11329363 DOI: 10.1213/ane.0000000000007003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/17/2024]
Abstract
BACKGROUND Anesthesia monitors and devices are usually controlled with some combination of dials, keypads, a keyboard, or a touch screen. Thus, anesthesiologists can operate their monitors only when they are physically close to them, and not otherwise task-loaded with sterile procedures such as line or block placement. Voice recognition technology has become commonplace and may offer advantages in anesthesia practice such as reducing surface contamination rates and allowing anesthesiologists to effect changes in monitoring and therapy when they would otherwise presently be unable to do so. We hypothesized that this technology is practicable and that anesthesiologists would consider it useful. METHODS A novel voice-driven prototype controller was designed for the GE Solar 8000M anesthesia patient monitor. The apparatus was implemented using a Raspberry Pi 4 single-board computer, an external conference audio device, a Google Cloud Speech-to-Text platform, and a modified Solar controller to effect commands. Fifty anesthesia providers tested the prototype. Evaluations and surveys were completed in a nonclinical environment to avoid any ethical or safety concerns regarding the use of the device in direct patient care. All anesthesiologists sampled were fluent English speakers; many with inflections from their first language or national origin, reflecting diversity in the population of practicing anesthesiologists. RESULTS The prototype was uniformly well-received by anesthesiologists. Ease-of-use, usefulness, and effectiveness were assessed on a Likert scale with means of 9.96, 7.22, and 8.48 of 10, respectively. No population cofactors were associated with these results. Advancing level of training (eg, nonattending versus attending) was not correlated with any preference. Accent of country or region was not correlated with any preference. Vocal pitch register did not correlate with any preference. Statistical analyses were performed with analysis of variance and the unpaired t -test. CONCLUSIONS The use of voice recognition to control operating room monitors was well-received anesthesia providers. Additional commands are easily implemented on the prototype controller. No adverse relationship was found between acceptability and level of anesthesia experience, pitch of voice, or presence of accent. Voice recognition is a promising method of controlling anesthesia monitors and devices that could potentially increase usability and situational awareness in circumstances where the anesthesiologist is otherwise out-of-position or task-loaded.
Collapse
Affiliation(s)
- Grace Lee
- From the Department of Anesthesiology, Perioperative and Pain Medicine, Brigham and Women's Hospital-Harvard Medical School, Boston, Massachusetts
| | - Christopher W Connor
- From the Department of Anesthesiology, Perioperative and Pain Medicine, Brigham and Women's Hospital-Harvard Medical School, Boston, Massachusetts
- Department of Physiology
- Department of Biophysics and Biomedical Engineering, Boston University, Boston, Massachusetts
- Department of Cardiac Anesthesiology and Intensive Care Medicine, Deutsches Herzzentrum der Charité, Charité Universitätsmedizin Berlin, Berlin, Germany
| |
Collapse
|
2
|
Jelassi M, Jemai O, Demongeot J. Revolutionizing Radiological Analysis: The Future of French Language Automatic Speech Recognition in Healthcare. Diagnostics (Basel) 2024; 14:895. [PMID: 38732310 PMCID: PMC11083196 DOI: 10.3390/diagnostics14090895] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2024] [Revised: 04/09/2024] [Accepted: 04/24/2024] [Indexed: 05/13/2024] Open
Abstract
This study introduces a specialized Automatic Speech Recognition (ASR) system, leveraging the Whisper Large-v2 model, specifically adapted for radiological applications in the French language. The methodology focused on adapting the model to accurately transcribe medical terminology and diverse accents within the French language context, achieving a notable Word Error Rate (WER) of 17.121%. This research involved extensive data collection and preprocessing, utilizing a wide range of French medical audio content. The results demonstrate the system's effectiveness in transcribing complex radiological data, underscoring its potential to enhance medical documentation efficiency in French-speaking clinical settings. The discussion extends to the broader implications of this technology in healthcare, including its potential integration with electronic health records (EHRs) and its utility in medical education. This study also explores future research directions, such as tailoring ASR systems to specific medical specialties and languages. Overall, this research contributes significantly to the field of medical ASR systems, presenting a robust tool for radiological transcription in the French language and paving the way for advanced technology-enhanced healthcare solutions.
Collapse
Affiliation(s)
- Mariem Jelassi
- RIADI Laboratory, Ecole Nationale des Sciences de l’Informatique (ENSI), Manouba University, La Manouba 2010, Tunisia;
- Health Tech Innovation Systems Inc., ENSI Innovation Hub, La Manouba 2010, Tunisia;
| | - Oumaima Jemai
- Health Tech Innovation Systems Inc., ENSI Innovation Hub, La Manouba 2010, Tunisia;
- Ecole Supérieure des Communications de Tunis (SUP’COM), Carthage University, Ariana 2083, Tunisia
| | - Jacques Demongeot
- AGEIS Laboratory, Faculté de Médecine, Université Grenoble Alpes (UGA), 38700 La Tronche, France
| |
Collapse
|
3
|
Lambert SI, Madi M, Sopka S, Lenes A, Stange H, Buszello CP, Stephan A. An integrative review on the acceptance of artificial intelligence among healthcare professionals in hospitals. NPJ Digit Med 2023; 6:111. [PMID: 37301946 DOI: 10.1038/s41746-023-00852-5] [Citation(s) in RCA: 32] [Impact Index Per Article: 32.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2022] [Accepted: 05/25/2023] [Indexed: 06/12/2023] Open
Abstract
Artificial intelligence (AI) in the domain of healthcare is increasing in prominence. Acceptance is an indispensable prerequisite for the widespread implementation of AI. The aim of this integrative review is to explore barriers and facilitators influencing healthcare professionals' acceptance of AI in the hospital setting. Forty-two articles met the inclusion criteria for this review. Pertinent elements to the study such as the type of AI, factors influencing acceptance, and the participants' profession were extracted from the included studies, and the studies were appraised for their quality. The data extraction and results were presented according to the Unified Theory of Acceptance and Use of Technology (UTAUT) model. The included studies revealed a variety of facilitating and hindering factors for AI acceptance in the hospital setting. Clinical decision support systems (CDSS) were the AI form included in most studies (n = 21). Heterogeneous results with regard to the perceptions of the effects of AI on error occurrence, alert sensitivity and timely resources were reported. In contrast, fear of a loss of (professional) autonomy and difficulties in integrating AI into clinical workflows were unanimously reported to be hindering factors. On the other hand, training for the use of AI facilitated acceptance. Heterogeneous results may be explained by differences in the application and functioning of the different AI systems as well as inter-professional and interdisciplinary disparities. To conclude, in order to facilitate acceptance of AI among healthcare professionals it is advisable to integrate end-users in the early stages of AI development as well as to offer needs-adjusted training for the use of AI in healthcare and providing adequate infrastructure.
Collapse
Affiliation(s)
- Sophie Isabelle Lambert
- AIXTRA-Competence Center for Training and Patient Safety, Medical Faculty, RWTH Aachen University, Pauwelsstraße 30, 52074, Aachen, Germany.
- Department of Anesthesiology, Uniklinik RWTH Aachen, Pauwelsstraße 30, 52074, Aachen, Germany.
| | - Murielle Madi
- Department of Nursing Science, Uniklinik RWTH Aachen, Pauwelsstraße 30, 52074, Aachen, Germany.
| | - Saša Sopka
- AIXTRA-Competence Center for Training and Patient Safety, Medical Faculty, RWTH Aachen University, Pauwelsstraße 30, 52074, Aachen, Germany
- Department of Anesthesiology, Uniklinik RWTH Aachen, Pauwelsstraße 30, 52074, Aachen, Germany
| | - Andrea Lenes
- AIXTRA-Competence Center for Training and Patient Safety, Medical Faculty, RWTH Aachen University, Pauwelsstraße 30, 52074, Aachen, Germany
| | - Hendrik Stange
- Fraunhofer Society for the Advancement of Applied Research. Fraunhofer-Institute for Intelligent Analysis and Information Systems IAIS, Schloss Birlinghoven 1, 53757, Sankt Augustin, Bonn, Germany
| | - Claus-Peter Buszello
- Fraunhofer Society for the Advancement of Applied Research. Fraunhofer-Institute for Intelligent Analysis and Information Systems IAIS, Schloss Birlinghoven 1, 53757, Sankt Augustin, Bonn, Germany
| | - Astrid Stephan
- Department of Nursing Science, Uniklinik RWTH Aachen, Pauwelsstraße 30, 52074, Aachen, Germany
- Fliedner University of Applied Sciences, Geschwister-Aufricht-Straße, 940489, Düsseldorf, Germany
| |
Collapse
|
4
|
Zhang J, Wu J, Qiu Y, Song A, Li W, Li X, Liu Y. Intelligent speech technologies for transcription, disease diagnosis, and medical equipment interactive control in smart hospitals: A review. Comput Biol Med 2023; 153:106517. [PMID: 36623438 PMCID: PMC9814440 DOI: 10.1016/j.compbiomed.2022.106517] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2022] [Revised: 12/23/2022] [Accepted: 12/31/2022] [Indexed: 01/07/2023]
Abstract
The growing and aging of the world population have driven the shortage of medical resources in recent years, especially during the COVID-19 pandemic. Fortunately, the rapid development of robotics and artificial intelligence technologies help to adapt to the challenges in the healthcare field. Among them, intelligent speech technology (IST) has served doctors and patients to improve the efficiency of medical behavior and alleviate the medical burden. However, problems like noise interference in complex medical scenarios and pronunciation differences between patients and healthy people hamper the broad application of IST in hospitals. In recent years, technologies such as machine learning have developed rapidly in intelligent speech recognition, which is expected to solve these problems. This paper first introduces IST's procedure and system architecture and analyzes its application in medical scenarios. Secondly, we review existing IST applications in smart hospitals in detail, including electronic medical documentation, disease diagnosis and evaluation, and human-medical equipment interaction. In addition, we elaborate on an application case of IST in the early recognition, diagnosis, rehabilitation training, evaluation, and daily care of stroke patients. Finally, we discuss IST's limitations, challenges, and future directions in the medical field. Furthermore, we propose a novel medical voice analysis system architecture that employs active hardware, active software, and human-computer interaction to realize intelligent and evolvable speech recognition. This comprehensive review and the proposed architecture offer directions for future studies on IST and its applications in smart hospitals.
Collapse
Affiliation(s)
- Jun Zhang
- The State Key Laboratory of Bioelectronics, School of Instrument Science and Engineering, Southeast University, Nanjing, 210096, China,Corresponding author
| | - Jingyue Wu
- The State Key Laboratory of Bioelectronics, School of Instrument Science and Engineering, Southeast University, Nanjing, 210096, China
| | - Yiyi Qiu
- The State Key Laboratory of Bioelectronics, School of Instrument Science and Engineering, Southeast University, Nanjing, 210096, China
| | - Aiguo Song
- The State Key Laboratory of Bioelectronics, School of Instrument Science and Engineering, Southeast University, Nanjing, 210096, China
| | - Weifeng Li
- Department of Emergency Medicine, Guangdong Provincial People's Hospital, Guangdong Academy of Medical Sciences, Guangzhou, 510080, China
| | - Xin Li
- Department of Emergency Medicine, Guangdong Provincial People's Hospital, Guangdong Academy of Medical Sciences, Guangzhou, 510080, China
| | - Yecheng Liu
- Emergency Department, State Key Laboratory of Complex Severe and Rare Diseases, Peking Union Medical College Hospital, Chinese Academy of Medical Science and Peking Union Medical College, Beijing, 100730, China
| |
Collapse
|
5
|
M FR, N R, P M, B O, L R B, J CP, E L, J L V. Feasibility Study on Automatic Surgical Phase Identification based on Speech Recognition for Laparoscopic Prostatectomy. ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. ANNUAL INTERNATIONAL CONFERENCE 2022; 2022:4411-4414. [PMID: 36086038 DOI: 10.1109/embc48229.2022.9870990] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
An efficient capacity management of the Operation Room (OR) is crucial for the optimization of resources and consequent cost minimization. However, the duration of each surgery is hard to predict and often prone to variability and unforeseen events that may delay or accelerate each procedure. Automatic surgical phase identification strategies can lead to a more accurate and automated estimation of the surgery duration, improving OR schedule optimization. One possible contribution for these methods is the introduction of speech recognition systems. The described work aims at the implementation of a speech recognition (SR) engine for surgical phase identification, specifically optimized for Laparoscopic Radical Prostatectomy (LRP). For this application the performance of 3 engines was tested under different background noise and distance circumstances: Microsoft Speech SDK; Google Speech Recognition API; and an optimization of the first, with the introduction of specific vocabulary. Action/target binomials were used to identify the Laparoscopic Prostatectomy surgical phases. 15 participants were selected to perform the tests and Word Error Rate (WER) was calculated as the main comparison metric. The values for the total WER indicate that AzureGrammar (specific vocabulary inserted) has superior performance, reaching a lower error rate of 29.25%, when compared to Azure and Google with 67.91% and 67.48%, respectively. Phase Accuracy Ratio (PAR), considering only the AzureGrammar SR engine, are 84% and 83%, for 1 m and 2 m, without background noise; 61% and 51%, for 1 m and 2 m, with background noise. This study demonstrates that using a conventional SR engine, with specific vocabulary incorporated, has the potential to achieve an acceptable performance with minimal setup. Clinical Relevance- Speech based, surgical phase identification, can be achieved with minimal setup to existing SR engines, showing potential to be implemented in Operating Room, with further optimization.
Collapse
|
6
|
Sato H. Development of Clinical Pharmaceutical Services <i>via</i> Artificial Intelligence Adaptation. YAKUGAKU ZASSHI 2022; 142:337-340. [DOI: 10.1248/yakushi.21-00178-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Affiliation(s)
- Hiroyasu Sato
- Department of Pharmacy, Obihiro Kosei General Hospital
| |
Collapse
|