1
|
Fiedler AK, Zhang K, Lal TS, Jiang X, Fraser SM. Generative Pre-trained Transformer for Pediatric Stroke Research: A Pilot Study. Pediatr Neurol 2024; 160:54-59. [PMID: 39191085 DOI: 10.1016/j.pediatrneurol.2024.07.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/06/2024] [Revised: 06/25/2024] [Accepted: 07/02/2024] [Indexed: 08/29/2024]
Abstract
BACKGROUND Pediatric stroke is an important cause of morbidity in children. Although research can be challenging, large amounts of data have been captured through collaborative efforts in the International Pediatric Stroke Study (IPSS). This study explores the use of an advanced artificial intelligence program, the Generative Pre-trained Transformer (GPT), to enter pediatric stroke data into the IPSS. METHODS The most recent 50 clinical notes of patients with ischemic stroke or cerebral venous sinus thrombosis at the UTHealth Pediatric Stroke Clinic were deidentified. Domain-specific prompts were engineered for an offline artificial intelligence program (GPT) to answer IPSS questions. Responses from GPT were compared with the human rater. Percent agreement was assessed across 50 patients for each of the 114 queries developed from the IPSS database outcome questionnaire. RESULTS GPT demonstrated strong performance on several questions but showed variability overall. In its early iterations it was able to match human judgment occasionally with an accuracy score of 1.00 (n = 20, 17.5%), but it scored as low as 0.26 in some patients. Prompts were adjusted in four subsequent iterations to increase accuracy. In its fourth iteration, agreement was 93.6%, with a maximum agreement of 100% and minimum of 62%. Of 2400 individual items assessed, our model entered 2247 (93.6%) correctly and 153 (6.4%) incorrectly. CONCLUSIONS Although our tailored generative model with domain-specific prompt engineering and ontological guidance shows promise for research applications, further refinement is needed to enhance its accuracy. It cannot enter data entirely independently, but it can be employed in tandem with human oversight contributing to a collaborative approach that reduces overall effort.
Collapse
Affiliation(s)
- Anna K Fiedler
- Division of Child Neurology, Department of Pediatrics, The University of Texas Health Science Center at Houston, Houston, Texas
| | - Kai Zhang
- Department of Health Data Science and Artificial Intelligence, McWilliams School of Biomedical Informatics at UTHealth Houston, Houston, Texas; UTHealth Houston Institute of Stroke and Cerebrovascular Diseases, Houston, Texas
| | - Tia S Lal
- UTHealth Houston Institute of Stroke and Cerebrovascular Diseases, Houston, Texas
| | - Xiaoqian Jiang
- Department of Health Data Science and Artificial Intelligence, McWilliams School of Biomedical Informatics at UTHealth Houston, Houston, Texas; UTHealth Houston Institute of Stroke and Cerebrovascular Diseases, Houston, Texas
| | - Stuart M Fraser
- Division of Child Neurology, Department of Pediatrics, The University of Texas Health Science Center at Houston, Houston, Texas; UTHealth Houston Institute of Stroke and Cerebrovascular Diseases, Houston, Texas.
| |
Collapse
|
2
|
Cao D, Chan MK. Enhancing chemical synthesis research with NLP: Word embeddings for chemical reagent identification-A case study on nano-FeCu. iScience 2024; 27:110780. [PMID: 39319268 PMCID: PMC11417335 DOI: 10.1016/j.isci.2024.110780] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2024] [Revised: 03/11/2024] [Accepted: 08/16/2024] [Indexed: 09/26/2024] Open
Abstract
Nanoparticle synthesis is complex, influenced by multiple variables including reagent selection. This study introduces a specialized corpus focused on "Fe, Cu, synthesis" to train a domain-specific word embedding model using natural language processing (NLP) in an unsupervised environment. Evaluation metrics included average cosine similarity, visual analysis via t-distributed stochastic neighbor embedding (t-SNE), synonym analysis, and analogy reasoning analysis. Results indicate a strong correlation between learning rate and cosine similarity, with enhanced chemical specificity in the tailored model compared to general models. The framework facilitates rapid identification of potential reagents for nano-FeCu synthesis, enhancing precision in nanomaterial research. This innovative approach offers a data-driven pathway for chemical material synthesis, demonstrating significant interdisciplinary applications.
Collapse
Affiliation(s)
- Dingding Cao
- Centre for Water Research, Faculty of Engineering, Built Environment and Information Technology, SEGi University. Jalan Teknologi, Kota Damansara, Petaling Jaya 47810, Selangor Darul Ehsan, Malaysia
- Department of Electrical and Electronic Engineering, Guangdong Technology College, Zhaoqing 526100, China
| | - Mieow Kee Chan
- Centre for Water Research, Faculty of Engineering, Built Environment and Information Technology, SEGi University. Jalan Teknologi, Kota Damansara, Petaling Jaya 47810, Selangor Darul Ehsan, Malaysia
| |
Collapse
|
3
|
Lyons RA, Gabbe BJ, Vallmuur K. Potential for advances in data linkage and data science to support injury prevention research. Inj Prev 2024:ip-2024-045367. [PMID: 39362751 DOI: 10.1136/ip-2024-045367] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2024] [Accepted: 09/14/2024] [Indexed: 10/05/2024]
Abstract
The recent COVID-19 pandemic stimulated unprecedented linkage of datasets worldwide, and while injury is endemic rather than pandemic, there is much to be learned by the injury prevention community from the data science approaches taken to respond to the pandemic to support research into the primary, secondary and tertiary prevention of injuries. The use of routinely collected data to produce real-world evidence, as an alternative to clinical trials, has been gaining in popularity as the availability and quality of digital health platforms grow and the linkage landscape, and the analytics required to make best use of linked and unstructured data, is rapidly evolving. Capitalising on existing data sources, innovative linkage and advanced analytic approaches provides the opportunity to undertake novel injury prevention research and generate new knowledge, while avoiding data waste and additional burden to participants. We provide a tangible, but not exhaustive, list of examples showing the breadth and value of data linkage, along with the emerging capabilities of natural language processing techniques to enhance injury research. To optimise data science approaches to injury prevention, injury researchers in this area need to share methods, code, models and tools to improve consistence and efficiencies in this field. Increased collaboration between injury prevention researchers and data scientists working on population data linkage systems has much to offer this field of research.
Collapse
Affiliation(s)
- Ronan A Lyons
- Population Data Science, Swansea University, Swansea, Swansea, UK
- School of Public Health and Preventive Medicine, Monash University, Melbourne, Victoria, Australia
- Administrative Data Research Wales, Swansea University Medical School, Swansea University, Swansea, UK
| | - Belinda J Gabbe
- Population Data Science, Swansea University, Swansea, Swansea, UK
- School of Public Health and Preventive Medicine, Monash University, Melbourne, Victoria, Australia
| | - Kirsten Vallmuur
- Australian Centre for Health Services Innovation (AusHSI), Queensland University of Technology (QUT), Brisbane, Queensland, Australia
- Jamieson Trauma Institute, Royal Brisbane & Women's Hospital (RBWH), Brisbane, Queensland, Australia
| |
Collapse
|
4
|
Stavropoulos A, Crone DL, Grossmann I. Shadows of wisdom: Classifying meta-cognitive and morally grounded narrative content via large language models. Behav Res Methods 2024; 56:7632-7646. [PMID: 38811519 DOI: 10.3758/s13428-024-02441-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/14/2024] [Indexed: 05/31/2024]
Abstract
We investigated large language models' (LLMs) efficacy in classifying complex psychological constructs like intellectual humility, perspective-taking, open-mindedness, and search for a compromise in narratives of 347 Canadian and American adults reflecting on a workplace conflict. Using state-of-the-art models like GPT-4 across few-shot and zero-shot paradigms and RoB-ELoC (RoBERTa -fine-tuned-on-Emotion-with-Logistic-Regression-Classifier), we compared their performance with expert human coders. Results showed robust classification by LLMs, with over 80% agreement and F1 scores above 0.85, and high human-model reliability (Cohen's κ Md across top models = .80). RoB-ELoC and few-shot GPT-4 were standout classifiers, although somewhat less effective in categorizing intellectual humility. We offer example workflows for easy integration into research. Our proof-of-concept findings indicate the viability of both open-source and commercial LLMs in automating the coding of complex constructs, potentially transforming social science research.
Collapse
Affiliation(s)
| | | | - Igor Grossmann
- Department of Psychology, University of Waterloo, Waterloo, N2L 3G1, Canada.
| |
Collapse
|
5
|
Kumari K, Pahuja SK, Kumar S. A Comprehensive Examination of ChatGPT's Contribution to the Healthcare Sector and Hepatology. Dig Dis Sci 2024:10.1007/s10620-024-08659-4. [PMID: 39354272 DOI: 10.1007/s10620-024-08659-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/02/2024] [Accepted: 09/20/2024] [Indexed: 10/03/2024]
Abstract
Artificial Intelligence and Natural Language Processing technology have demonstrated significant promise across several domains within the medical and healthcare sectors. This technique has numerous uses in the field of healthcare. One of the primary challenges in implementing ChatGPT in healthcare is the requirement for precise and up-to-date data. In the case of the involvement of sensitive medical information, it is imperative to carefully address concerns regarding privacy and security when using GPT in the healthcare sector. This paper outlines ChatGPT and its relevance in the healthcare industry. It discusses the important aspects of ChatGPT's workflow and highlights the usual features of ChatGPT specifically designed for the healthcare domain. The present review uses the ChatGPT model within the research domain to investigate disorders associated with the hepatic system. This review demonstrates the possible use of ChatGPT in supporting researchers and clinicians in analyzing and interpreting liver-related data, thereby improving disease diagnosis, prognosis, and patient care.
Collapse
Affiliation(s)
- Kabita Kumari
- Department of Instrumentation and Control Engineering, Dr B. R. Ambedkar National Institute of Technology, Jalandhar, Punjab, 144011, India.
| | - Sharvan Kumar Pahuja
- Department of Instrumentation and Control Engineering, Dr B. R. Ambedkar National Institute of Technology, Jalandhar, Punjab, 144011, India
| | - Sanjeev Kumar
- Biomedical Instrumentation Unit, CSIR-Central Scientific Instruments Organisation (CSIR-CSIO), Chandigarh, India
| |
Collapse
|
6
|
Bereczki Z, Benczik B, Balogh OM, Marton S, Puhl E, Pétervári M, Váczy-Földi M, Papp ZT, Makkos A, Glass K, Locquet F, Euler G, Schulz R, Ferdinandy P, Ágg B. Mitigating off-target effects of small RNAs: conventional approaches, network theory and artificial intelligence. Br J Pharmacol 2024. [PMID: 39293936 DOI: 10.1111/bph.17302] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2023] [Revised: 05/07/2024] [Accepted: 06/17/2024] [Indexed: 09/20/2024] Open
Abstract
Three types of highly promising small RNA therapeutics, namely, small interfering RNAs (siRNAs), microRNAs (miRNAs) and the RNA subtype of antisense oligonucleotides (ASOs), offer advantages over small-molecule drugs. These small RNAs can target any gene product, opening up new avenues of effective and safe therapeutic approaches for a wide range of diseases. In preclinical research, synthetic small RNAs play an essential role in the investigation of physiological and pathological pathways as silencers of specific genes, facilitating discovery and validation of drug targets in different conditions. Off-target effects of small RNAs, however, could make it difficult to interpret experimental results in the preclinical phase and may contribute to adverse events of small RNA therapeutics. Out of the two major types of off-target effects we focused on the hybridization-dependent, especially on the miRNA-like off-target effects. Our main aim was to discuss several approaches, including sequence design, chemical modifications and target prediction, to reduce hybridization-dependent off-target effects that should be considered even at the early development phase of small RNA therapy. Because there is no standard way of predicting hybridization-dependent off-target effects, this review provides an overview of all major state-of-the-art computational methods and proposes new approaches, such as the possible inclusion of network theory and artificial intelligence (AI) in the prediction workflows. Case studies and a concise survey of experimental methods for validating in silico predictions are also presented. These methods could contribute to interpret experimental results, to minimize off-target effects and hopefully to avoid off-target-related adverse events of small RNA therapeutics.
Collapse
Affiliation(s)
- Zoltán Bereczki
- Department of Pharmacology and Pharmacotherapy, Semmelweis University, Budapest, Hungary
- Center for Pharmacology and Drug Research & Development, Semmelweis University, Budapest, Hungary
- HUN-REN-SU System Pharmacology Research Group, Department of Pharmacology and Pharmacotherapy, Semmelweis University, Budapest, Hungary
| | - Bettina Benczik
- Department of Pharmacology and Pharmacotherapy, Semmelweis University, Budapest, Hungary
- Center for Pharmacology and Drug Research & Development, Semmelweis University, Budapest, Hungary
- HUN-REN-SU System Pharmacology Research Group, Department of Pharmacology and Pharmacotherapy, Semmelweis University, Budapest, Hungary
- Pharmahungary Group, Szeged, Hungary
| | - Olivér M Balogh
- Department of Pharmacology and Pharmacotherapy, Semmelweis University, Budapest, Hungary
- Center for Pharmacology and Drug Research & Development, Semmelweis University, Budapest, Hungary
- HUN-REN-SU System Pharmacology Research Group, Department of Pharmacology and Pharmacotherapy, Semmelweis University, Budapest, Hungary
| | - Szandra Marton
- Department of Pharmacology and Pharmacotherapy, Semmelweis University, Budapest, Hungary
- Center for Pharmacology and Drug Research & Development, Semmelweis University, Budapest, Hungary
| | - Eszter Puhl
- Department of Pharmacology and Pharmacotherapy, Semmelweis University, Budapest, Hungary
- Center for Pharmacology and Drug Research & Development, Semmelweis University, Budapest, Hungary
| | - Mátyás Pétervári
- Department of Pharmacology and Pharmacotherapy, Semmelweis University, Budapest, Hungary
- Center for Pharmacology and Drug Research & Development, Semmelweis University, Budapest, Hungary
- HUN-REN-SU System Pharmacology Research Group, Department of Pharmacology and Pharmacotherapy, Semmelweis University, Budapest, Hungary
- Sanovigado Kft, Budapest, Hungary
| | - Máté Váczy-Földi
- Department of Pharmacology and Pharmacotherapy, Semmelweis University, Budapest, Hungary
- Center for Pharmacology and Drug Research & Development, Semmelweis University, Budapest, Hungary
- HUN-REN-SU System Pharmacology Research Group, Department of Pharmacology and Pharmacotherapy, Semmelweis University, Budapest, Hungary
| | - Zsolt Tamás Papp
- Department of Pharmacology and Pharmacotherapy, Semmelweis University, Budapest, Hungary
- Center for Pharmacology and Drug Research & Development, Semmelweis University, Budapest, Hungary
- HUN-REN-SU System Pharmacology Research Group, Department of Pharmacology and Pharmacotherapy, Semmelweis University, Budapest, Hungary
| | - András Makkos
- Department of Pharmacology and Pharmacotherapy, Semmelweis University, Budapest, Hungary
- Center for Pharmacology and Drug Research & Development, Semmelweis University, Budapest, Hungary
- HUN-REN-SU System Pharmacology Research Group, Department of Pharmacology and Pharmacotherapy, Semmelweis University, Budapest, Hungary
- Pharmahungary Group, Szeged, Hungary
| | - Kimberly Glass
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, USA
| | - Fabian Locquet
- Physiologisches Institut, Justus-Liebig-Universität Gießen, Giessen, Germany
| | - Gerhild Euler
- Physiologisches Institut, Justus-Liebig-Universität Gießen, Giessen, Germany
| | - Rainer Schulz
- Physiologisches Institut, Justus-Liebig-Universität Gießen, Giessen, Germany
| | - Péter Ferdinandy
- Department of Pharmacology and Pharmacotherapy, Semmelweis University, Budapest, Hungary
- Center for Pharmacology and Drug Research & Development, Semmelweis University, Budapest, Hungary
- HUN-REN-SU System Pharmacology Research Group, Department of Pharmacology and Pharmacotherapy, Semmelweis University, Budapest, Hungary
- Pharmahungary Group, Szeged, Hungary
| | - Bence Ágg
- Department of Pharmacology and Pharmacotherapy, Semmelweis University, Budapest, Hungary
- Center for Pharmacology and Drug Research & Development, Semmelweis University, Budapest, Hungary
- HUN-REN-SU System Pharmacology Research Group, Department of Pharmacology and Pharmacotherapy, Semmelweis University, Budapest, Hungary
- Pharmahungary Group, Szeged, Hungary
| |
Collapse
|
7
|
Affengruber L, van der Maten MM, Spiero I, Nussbaumer-Streit B, Mahmić-Kaknjo M, Ellen ME, Goossen K, Kantorova L, Hooft L, Riva N, Poulentzas G, Lalagkas PN, Silva AG, Sassano M, Sfetcu R, Marqués ME, Friessova T, Baladia E, Pezzullo AM, Martinez P, Gartlehner G, Spijker R. An exploration of available methods and tools to improve the efficiency of systematic review production: a scoping review. BMC Med Res Methodol 2024; 24:210. [PMID: 39294580 PMCID: PMC11409535 DOI: 10.1186/s12874-024-02320-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2024] [Accepted: 08/26/2024] [Indexed: 09/20/2024] Open
Abstract
BACKGROUND Systematic reviews (SRs) are time-consuming and labor-intensive to perform. With the growing number of scientific publications, the SR development process becomes even more laborious. This is problematic because timely SR evidence is essential for decision-making in evidence-based healthcare and policymaking. Numerous methods and tools that accelerate SR development have recently emerged. To date, no scoping review has been conducted to provide a comprehensive summary of methods and ready-to-use tools to improve efficiency in SR production. OBJECTIVE To present an overview of primary studies that evaluated the use of ready-to-use applications of tools or review methods to improve efficiency in the review process. METHODS We conducted a scoping review. An information specialist performed a systematic literature search in four databases, supplemented with citation-based and grey literature searching. We included studies reporting the performance of methods and ready-to-use tools for improving efficiency when producing or updating a SR in the health field. We performed dual, independent title and abstract screening, full-text selection, and data extraction. The results were analyzed descriptively and presented narratively. RESULTS We included 103 studies: 51 studies reported on methods, 54 studies on tools, and 2 studies reported on both methods and tools to make SR production more efficient. A total of 72 studies evaluated the validity (n = 69) or usability (n = 3) of one method (n = 33) or tool (n = 39), and 31 studies performed comparative analyses of different methods (n = 15) or tools (n = 16). 20 studies conducted prospective evaluations in real-time workflows. Most studies evaluated methods or tools that aimed at screening titles and abstracts (n = 42) and literature searching (n = 24), while for other steps of the SR process, only a few studies were found. Regarding the outcomes included, most studies reported on validity outcomes (n = 84), while outcomes such as impact on results (n = 23), time-saving (n = 24), usability (n = 13), and cost-saving (n = 3) were less often evaluated. CONCLUSION For title and abstract screening and literature searching, various evaluated methods and tools are available that aim at improving the efficiency of SR production. However, only few studies have addressed the influence of these methods and tools in real-world workflows. Few studies exist that evaluate methods or tools supporting the remaining tasks. Additionally, while validity outcomes are frequently reported, there is a lack of evaluation regarding other outcomes.
Collapse
Affiliation(s)
- Lisa Affengruber
- Cochrane Austria, Department for Evidence-Based Medicine and Clinical Epidemiology, University for Continuing Education Krems, Krems an der Donau, Austria.
- School for Public Health and Primary Care (CAPHRI), Maastricht University, Maastricht, the Netherlands.
| | - Miriam M van der Maten
- Knowledge Institute of Federation of Medical Specialists, Utrecht, The Netherlands
- Cochrane Netherlands, Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, the Netherlands
| | - Isa Spiero
- Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, the Netherlands
| | - Barbara Nussbaumer-Streit
- Cochrane Austria, Department for Evidence-Based Medicine and Clinical Epidemiology, University for Continuing Education Krems, Krems an der Donau, Austria
| | - Mersiha Mahmić-Kaknjo
- Zenica Cantonal Hospital, Department for Clinical Pharmacology, Zenica, Bosnia and Herzegovina
| | - Moriah E Ellen
- Department of Health Policy and Management, Guilford Glazer Faculty of Business and Management and Faculty of Health Sciences, Ben-Gurion University of the Negev, Beer-Sheva, Israel
- Institute of Health Policy Management and Evaluation, Dalla Lana School Of Public Health, University of Toronto, Toronto, Canada
- McMaster Health Forum, McMaster University, Hamilton, Canada
| | - Käthe Goossen
- Witten/Herdecke University, Institute for Research in Operative Medicine (IFOM), Cologne, Germany
| | - Lucia Kantorova
- Czech National Centre for Evidence-Based Healthcare and Knowledge Translation (Cochrane Czech Republic, Czech CEBHC: JBI Centre of Excellence, Masaryk University GRADE Centre), Institute of Biostatistics and Analyses, Faculty of Medicine, Masaryk University, Brno, Czech Republic
| | - Lotty Hooft
- Cochrane Netherlands, Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, the Netherlands
| | - Nicoletta Riva
- Department of Pathology, Faculty of Medicine and Surgery, University of Malta, Msida, Malta
| | - Georgios Poulentzas
- Laboratory of Hygiene and Environmental Protection, Department of Medicine, Democritus University of Thrace, Alexandroupolis, Greece
| | - Panagiotis Nikolaos Lalagkas
- Laboratory of Hygiene and Environmental Protection, Department of Medicine, Democritus University of Thrace, Alexandroupolis, Greece
| | - Anabela G Silva
- CINTESIS.RISE@UA, University of Aveiro, Campus Universitário de Santiago, Aveiro, Portugal
| | - Michele Sassano
- Section of Hygiene, University Department of Life Sciences and Public Health, Università Cattolica del Sacro Cuore, Rome, Italy
- Department of Medical and Surgical Sciences, University of Bologna, Bologna, Italy
| | - Raluca Sfetcu
- National Institute for Health Services Management, Bucharest, Romania
- Spiru Haret University, Faculty of Psychology and Educational Sciences, Bucharest, Romania
| | - María E Marqués
- Red de Nutrición Basada en La Evidencia, Academia Española de Nutrición y Dietética, Pamplona, Spain
| | - Tereza Friessova
- Department of Health Sciences, Faculty of Medicine, Masaryk University, Brno, Czech Republic
| | - Eduard Baladia
- Red de Nutrición Basada en La Evidencia, Academia Española de Nutrición y Dietética, Pamplona, Spain
| | - Angelo Maria Pezzullo
- Section of Hygiene, University Department of Life Sciences and Public Health, Università Cattolica del Sacro Cuore, Rome, Italy
| | - Patricia Martinez
- Red de Nutrición Basada en La Evidencia, Academia Española de Nutrición y Dietética, Pamplona, Spain
- Techné Research Group, Department of Knowledge Engineering of the Faculty of Science, University of Granada, Granada, Spain
| | - Gerald Gartlehner
- Cochrane Austria, Department for Evidence-Based Medicine and Clinical Epidemiology, University for Continuing Education Krems, Krems an der Donau, Austria
- RTI International, Center for Public Health Methods, Research Triangle Park, Durham, NC, USA
| | - René Spijker
- Cochrane Netherlands, Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Utrecht, the Netherlands
- Amsterdam UMC, University of Amsterdam, Medical Library, Amsterdam Public Health, Amsterdam, the Netherlands
| |
Collapse
|
8
|
Rakhimzhanova T, Kuzdeuov A, Varol HA. AnyFace++: Deep Multi-Task, Multi-Domain Learning for Efficient Face AI. SENSORS (BASEL, SWITZERLAND) 2024; 24:5993. [PMID: 39338738 PMCID: PMC11436022 DOI: 10.3390/s24185993] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/19/2024] [Revised: 09/11/2024] [Accepted: 09/13/2024] [Indexed: 09/30/2024]
Abstract
Accurate face detection and subsequent localization of facial landmarks are mandatory steps in many computer vision applications, such as emotion recognition, age estimation, and gender identification. Thanks to advancements in deep learning, numerous facial applications have been developed for human faces. However, most have to employ multiple models to accomplish several tasks simultaneously. As a result, they require more memory usage and increased inference time. Also, less attention is paid to other domains, such as animals and cartoon characters. To address these challenges, we propose an input-agnostic face model, AnyFace++, to perform multiple face-related tasks concurrently. The tasks are face detection and prediction of facial landmarks for human, animal, and cartoon faces, including age estimation, gender classification, and emotion recognition for human faces. We trained the model using deep multi-task, multi-domain learning with a heterogeneous cost function. The experimental results demonstrate that AnyFace++ generates outcomes comparable to cutting-edge models designed for specific domains.
Collapse
Affiliation(s)
| | | | - Huseyin Atakan Varol
- Institute of Smart Systems and Artificial Intelligence, Nazarbayev University, Astana 010000, Kazakhstan; (T.R.); (A.K.)
| |
Collapse
|
9
|
Wiest IC, Wolf F, Leßmann ME, van Treeck M, Ferber D, Zhu J, Boehme H, Bressem KK, Ulrich H, Ebert MP, Kather JN. LLM-AIx: An open source pipeline for Information Extraction from unstructured medical text based on privacy preserving Large Language Models. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2024:2024.09.02.24312917. [PMID: 39281753 PMCID: PMC11398444 DOI: 10.1101/2024.09.02.24312917] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 09/18/2024]
Abstract
In clinical science and practice, text data, such as clinical letters or procedure reports, is stored in an unstructured way. This type of data is not a quantifiable resource for any kind of quantitative investigations and any manual review or structured information retrieval is time-consuming and costly. The capabilities of Large Language Models (LLMs) mark a paradigm shift in natural language processing and offer new possibilities for structured Information Extraction (IE) from medical free text. This protocol describes a workflow for LLM based information extraction (LLM-AIx), enabling extraction of predefined entities from unstructured text using privacy preserving LLMs. By converting unstructured clinical text into structured data, LLM-AIx addresses a critical barrier in clinical research and practice, where the efficient extraction of information is essential for improving clinical decision-making, enhancing patient outcomes, and facilitating large-scale data analysis. The protocol consists of four main processing steps: 1) Problem definition and data preparation, 2) data preprocessing, 3) LLM-based IE and 4) output evaluation. LLM-AIx allows integration on local hospital hardware without the need of transferring any patient data to external servers. As example tasks, we applied LLM-AIx for the anonymization of fictitious clinical letters from patients with pulmonary embolism. Additionally, we extracted symptoms and laterality of the pulmonary embolism of these fictitious letters. We demonstrate troubleshooting for potential problems within the pipeline with an IE on a real-world dataset, 100 pathology reports from the Cancer Genome Atlas Program (TCGA), for TNM stage extraction. LLM-AIx can be executed without any programming knowledge via an easy-to-use interface and in no more than a few minutes or hours, depending on the LLM model selected.
Collapse
Affiliation(s)
- Isabella Catharina Wiest
- Department of Medicine II, Medical Faculty Mannheim, Heidelberg University, Mannheim, Germany
- Else Kroener Fresenius Center for Digital Health, Faculty of Medicine and University Hospital Carl Gustav Carus, TUD Dresden University of Technology, 01307 Dresden, Germany
| | - Fabian Wolf
- Else Kroener Fresenius Center for Digital Health, Faculty of Medicine and University Hospital Carl Gustav Carus, TUD Dresden University of Technology, 01307 Dresden, Germany
| | - Marie-Elisabeth Leßmann
- Else Kroener Fresenius Center for Digital Health, Faculty of Medicine and University Hospital Carl Gustav Carus, TUD Dresden University of Technology, 01307 Dresden, Germany
- Department of Medicine I, Faculty of Medicine and University Hospital Carl Gustav Carus, TUD Dresden University of Technology, 01307 Dresden, Germany
| | - Marko van Treeck
- Else Kroener Fresenius Center for Digital Health, Faculty of Medicine and University Hospital Carl Gustav Carus, TUD Dresden University of Technology, 01307 Dresden, Germany
| | - Dyke Ferber
- Else Kroener Fresenius Center for Digital Health, Faculty of Medicine and University Hospital Carl Gustav Carus, TUD Dresden University of Technology, 01307 Dresden, Germany
- Medical Oncology, National Center for Tumor Diseases (NCT), University Hospital Heidelberg, Heidelberg, Germany
| | - Jiefu Zhu
- Else Kroener Fresenius Center for Digital Health, Faculty of Medicine and University Hospital Carl Gustav Carus, TUD Dresden University of Technology, 01307 Dresden, Germany
| | - Heiko Boehme
- National Center for Tumor Diseases (NCT/UCC), Dresden, Germany: German Cancer Research Center (DKFZ), Heidelberg, Germany; Medical Faculty and University Hospital Carl Gustav Carus, TUD Dresden University of Technology, Dresden, Germany; Helmholtz-Zentrum Dresden-Rossendorf (HZDR), Dresden, Germany
| | - Keno K Bressem
- Department of Cardiovascular Radiology and Nuclear Medicine, Technical University of Munich, School of Medicine and Health, German Heart Center, TUM University Hospital, Lazarethstr. 36, 80636, Munich, Germany
| | - Hannes Ulrich
- Institute for Medical Informatics and Statistics, Kiel University and University Hospital Schleswig-Holstein, Campus Kiel, Kiel and Lübeck, Schleswig-Holstein, Germany
| | - Matthias P Ebert
- Department of Medicine II, Medical Faculty Mannheim, Heidelberg University, Mannheim, Germany
- DKFZ Hector Cancer Institute at the University Medical Center, Mannheim, Germany
| | - Jakob Nikolas Kather
- Else Kroener Fresenius Center for Digital Health, Faculty of Medicine and University Hospital Carl Gustav Carus, TUD Dresden University of Technology, 01307 Dresden, Germany
- Department of Medicine I, Faculty of Medicine and University Hospital Carl Gustav Carus, TUD Dresden University of Technology, 01307 Dresden, Germany
- Medical Oncology, National Center for Tumor Diseases (NCT), University Hospital Heidelberg, Heidelberg, Germany
| |
Collapse
|
10
|
Li B, Beaton D, Lee DS, Aljabri B, Al-Omran L, Wijeysundera DN, Hussain MA, Rotstein OD, de Mestral C, Mamdani M, Al-Omran M. Comprehensive review of virtual assistants in vascular surgery. Semin Vasc Surg 2024; 37:342-349. [PMID: 39277351 DOI: 10.1053/j.semvascsurg.2024.07.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2024] [Revised: 06/15/2024] [Accepted: 07/02/2024] [Indexed: 09/17/2024]
Abstract
Virtual assistants, broadly defined as digital services designed to simulate human conversation and provide personalized responses based on user input, have the potential to improve health care by supporting clinicians and patients in terms of diagnosing and managing disease, performing administrative tasks, and supporting medical research and education. These tasks are particularly helpful in vascular surgery, where the clinical and administrative burden is high due to the rising incidence of vascular disease, the medical complexity of the patients, and the potential for innovation and care advancement. The rapid development of artificial intelligence, machine learning, and natural language processing techniques have facilitated the training of large language models, such as GPT-4 (OpenAI), which can support the development of increasingly powerful virtual assistants. These tools may support holistic, multidisciplinary, and high-quality vascular care delivery throughout the pre-, intra-, and postoperative stages. Importantly, it is critical to consider the design, safety, and challenges related to virtual assistants, including data security, ethical, and equity concerns. By combining the perspectives of patients, clinicians, data scientists, and other stakeholders when developing, implementing, and monitoring virtual assistants, there is potential to harness the power of this technology to care for vascular surgery patients more effectively. In this comprehensive review article, we introduce the concept of virtual assistants, describe potential applications of virtual assistants in vascular surgery for clinicians and patients, highlight the benefits and drawbacks of large language models, such as GPT-4, and discuss considerations around the design, safety, and challenges associated with virtual assistants in vascular surgery.
Collapse
Affiliation(s)
- Ben Li
- Department of Surgery, University of Toronto, Toronto, ON, Canada; Division of Vascular Surgery, St. Michael's Hospital, Unity Health Toronto, 30 Bond Street, Suite 7-074, Bond Wing, Toronto, ON, Canada, M5B 1W8; Institute of Medical Science, University of Toronto, Toronto, ON, Canada; Temerty Centre for Artificial Intelligence Research and Education in Medicine, University of Toronto, Toronto, ON, Canada
| | - Derek Beaton
- Data Science and Advanced Analytics, Unity Health Toronto, University of Toronto, Toronto, ON, Canada
| | - Douglas S Lee
- Division of Cardiology, Peter Munk Cardiac Centre, University Health Network, Toronto, ON, Canada; Institute of Health Policy, Management and Evaluation, University of Toronto, Toronto, ON, Canada; Institute for Clinical Evaluative Sciences, University of Toronto, Toronto, ON, Canada
| | - Badr Aljabri
- Department of Surgery, King Saud University, Saudi Arabia
| | - Leen Al-Omran
- School of Medicine, Alfaisal University, Saudi Arabia
| | - Duminda N Wijeysundera
- Institute of Health Policy, Management and Evaluation, University of Toronto, Toronto, ON, Canada; Institute for Clinical Evaluative Sciences, University of Toronto, Toronto, ON, Canada; Department of Anesthesia, St. Michael's Hospital, Unity Health Toronto, Toronto, ON, Canada; Li Ka Shing Knowledge Institute, St. Michael's Hospital, Unity Health Toronto, Toronto, ON, Canada
| | - Mohamad A Hussain
- Division of Vascular and Endovascular Surgery and the Center for Surgery and Public Health, Brigham and Women's Hospital, Harvard Medical School, Boston, MA
| | - Ori D Rotstein
- Department of Surgery, University of Toronto, Toronto, ON, Canada; Institute of Medical Science, University of Toronto, Toronto, ON, Canada; Li Ka Shing Knowledge Institute, St. Michael's Hospital, Unity Health Toronto, Toronto, ON, Canada; Division of General Surgery, St. Michael's Hospital, Unity Health Toronto, Toronto, ON, Canada
| | - Charles de Mestral
- Department of Surgery, University of Toronto, Toronto, ON, Canada; Division of Vascular Surgery, St. Michael's Hospital, Unity Health Toronto, 30 Bond Street, Suite 7-074, Bond Wing, Toronto, ON, Canada, M5B 1W8; Institute of Health Policy, Management and Evaluation, University of Toronto, Toronto, ON, Canada; Institute for Clinical Evaluative Sciences, University of Toronto, Toronto, ON, Canada; Li Ka Shing Knowledge Institute, St. Michael's Hospital, Unity Health Toronto, Toronto, ON, Canada
| | - Muhammad Mamdani
- Institute of Medical Science, University of Toronto, Toronto, ON, Canada; Temerty Centre for Artificial Intelligence Research and Education in Medicine, University of Toronto, Toronto, ON, Canada; Data Science and Advanced Analytics, Unity Health Toronto, University of Toronto, Toronto, ON, Canada; Institute of Health Policy, Management and Evaluation, University of Toronto, Toronto, ON, Canada; Institute for Clinical Evaluative Sciences, University of Toronto, Toronto, ON, Canada; Li Ka Shing Knowledge Institute, St. Michael's Hospital, Unity Health Toronto, Toronto, ON, Canada; Leslie Dan Faculty of Pharmacy, University of Toronto, Toronto, ON, Canada
| | - Mohammed Al-Omran
- Department of Surgery, University of Toronto, Toronto, ON, Canada; Division of Vascular Surgery, St. Michael's Hospital, Unity Health Toronto, 30 Bond Street, Suite 7-074, Bond Wing, Toronto, ON, Canada, M5B 1W8; Institute of Medical Science, University of Toronto, Toronto, ON, Canada; Temerty Centre for Artificial Intelligence Research and Education in Medicine, University of Toronto, Toronto, ON, Canada; Li Ka Shing Knowledge Institute, St. Michael's Hospital, Unity Health Toronto, Toronto, ON, Canada; Department of Surgery, King Faisal Specialist Hospital and Research Center, Saudi Arabia.
| |
Collapse
|
11
|
Madrid-García A, Merino-Barbancho B, Freites-Núñez D, Rodríguez-Rodríguez L, Menasalvas-Ruíz E, Rodríguez-González A, Peñas A. From Web to RheumaLpack: Creating a Linguistic Corpus for Exploitation and Knowledge Discovery in Rheumatology. Comput Biol Med 2024; 179:108920. [PMID: 39047506 DOI: 10.1016/j.compbiomed.2024.108920] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2024] [Revised: 06/30/2024] [Accepted: 07/16/2024] [Indexed: 07/27/2024]
Abstract
This study introduces RheumaLinguisticpack (RheumaLpack), the first specialised linguistic web corpus designed for the field of musculoskeletal disorders. By combining web mining (i.e., web scraping) and natural language processing (NLP) techniques, as well as clinical expertise, RheumaLpack systematically captures and curates structured and unstructured data across a spectrum of web sources including clinical trials registers (i.e., ClinicalTrials.gov), bibliographic databases (i.e., PubMed), medical agencies (i.e. European Medicines Agency), social media (i.e., Reddit), and accredited health websites (i.e., MedlinePlus, Harvard Health Publishing, and Cleveland Clinic). Given the complexity of rheumatic and musculoskeletal diseases (RMDs) and their significant impact on quality of life, this resource can be proposed as a useful tool to train algorithms that could mitigate the diseases' effects. Therefore, the corpus aims to improve the training of artificial intelligence (AI) algorithms and facilitate knowledge discovery in RMDs. The development of RheumaLpack involved a systematic six-step methodology covering data identification, characterisation, selection, collection, processing, and corpus description. The result is a non-annotated, monolingual, and dynamic corpus, featuring almost 3 million records spanning from 2000 to 2023. RheumaLpack represents a pioneering contribution to rheumatology research, providing a useful resource for the development of advanced AI and NLP applications. This corpus highlights the value of web data to address the challenges posed by musculoskeletal diseases, illustrating the corpus's potential to improve research and treatment paradigms in rheumatology. Finally, the methodology shown can be replicated to obtain data from other medical specialities. The code and details on how to build RheumaLpack are also provided to facilitate the dissemination of such resource.
Collapse
Affiliation(s)
- Alfredo Madrid-García
- Grupo de Patología Musculoesquelética, Hospital Clínico San Carlos, Instituto de Investigación Sanitaria San Carlos (IdISSC), Prof. Martin Lagos s/n, Madrid, 28040, Spain.
| | - Beatriz Merino-Barbancho
- Escuela Técnica Superior de Ingenieros de Telecomunicación Universidad Politécnica de Madrid, Avenida Complutense, 30, Madrid, 28040, Spain
| | - Dalifer Freites-Núñez
- Grupo de Patología Musculoesquelética, Hospital Clínico San Carlos, Instituto de Investigación Sanitaria San Carlos (IdISSC), Prof. Martin Lagos s/n, Madrid, 28040, Spain
| | - Luis Rodríguez-Rodríguez
- Grupo de Patología Musculoesquelética, Hospital Clínico San Carlos, Instituto de Investigación Sanitaria San Carlos (IdISSC), Prof. Martin Lagos s/n, Madrid, 28040, Spain
| | - Ernestina Menasalvas-Ruíz
- Centro de Tecnología Biomédica, Universidad Politécnica de Madrid, Pozuelo de Alarcón, Madrid, 28223, Spain; Escuela Técnica Superior de Ingenieros Informáticos, Universidad Politécnica de Madrid, Boadilla del Monte, Madrid, 28660, Spain
| | - Alejandro Rodríguez-González
- Centro de Tecnología Biomédica, Universidad Politécnica de Madrid, Pozuelo de Alarcón, Madrid, 28223, Spain; Escuela Técnica Superior de Ingenieros Informáticos, Universidad Politécnica de Madrid, Boadilla del Monte, Madrid, 28660, Spain
| | - Anselmo Peñas
- UNED NLP & IR Group Universidad Nacional de Educación a Distancia, Juan del Rosal 16, 28040, Madrid, Spain
| |
Collapse
|
12
|
Jannesar N, Akbarzadeh-Sherbaf K, Safari S, Vahabie AH. SSTE: Syllable-Specific Temporal Encoding to FORCE-learn audio sequences with an associative memory approach. Neural Netw 2024; 177:106368. [PMID: 38761415 DOI: 10.1016/j.neunet.2024.106368] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2023] [Revised: 03/28/2024] [Accepted: 05/05/2024] [Indexed: 05/20/2024]
Abstract
The circuitry and pathways in the brains of humans and other species have long inspired researchers and system designers to develop accurate and efficient systems capable of solving real-world problems and responding in real-time. We propose the Syllable-Specific Temporal Encoding (SSTE) to learn vocal sequences in a reservoir of Izhikevich neurons, by forming associations between exclusive input activities and their corresponding syllables in the sequence. Our model converts the audio signals to cochleograms using the CAR-FAC model to simulate a brain-like auditory learning and memorization process. The reservoir is trained using a hardware-friendly approach to FORCE learning. Reservoir computing could yield associative memory dynamics with far less computational complexity compared to RNNs. The SSTE-based learning enables competent accuracy and stable recall of spatiotemporal sequences with fewer reservoir inputs compared with existing encodings in the literature for similar purpose, offering resource savings. The encoding points to syllable onsets and allows recalling from a desired point in the sequence, making it particularly suitable for recalling subsets of long vocal sequences. The SSTE demonstrates the capability of learning new signals without forgetting previously memorized sequences and displays robustness against occasional noise, a characteristic of real-world scenarios. The components of this model are configured to improve resource consumption and computational intensity, addressing some of the cost-efficiency issues that might arise in future implementations aiming for compactness and real-time, low-power operation. Overall, this model proposes a brain-inspired pattern generation network for vocal sequences that can be extended with other bio-inspired computations to explore their potentials for brain-like auditory perception. Future designs could inspire from this model to implement embedded devices that learn vocal sequences and recall them as needed in real-time. Such systems could acquire language and speech, operate as artificial assistants, and transcribe text to speech, in the presence of natural noise and corruption on audio data.
Collapse
Affiliation(s)
- Nastaran Jannesar
- High Performance Embedded Architecture Lab., School of Electrical and Computer Engineering, College of Engineering, University of Tehran, Tehran, Iran.
| | | | - Saeed Safari
- High Performance Embedded Architecture Lab., School of Electrical and Computer Engineering, College of Engineering, University of Tehran, Tehran, Iran.
| | - Abdol-Hossein Vahabie
- Department of Psychology, Faculty of Psychology and Education, University of Tehran, Tehran, Iran; Cognitive Systems Laboratory, Control and Intelligent Processing Center of Excellence (CIPCE), School of Electrical and Computer Engineering, College of Engineering, University of Tehran, Tehran, Iran.
| |
Collapse
|
13
|
Urbina JT, Vu PD, Nguyen MV. Disability Ethics and Education in the Age of Artificial Intelligence: Identifying Ability Bias in ChatGPT and Gemini. Arch Phys Med Rehabil 2024:S0003-9993(24)01191-2. [PMID: 39216786 DOI: 10.1016/j.apmr.2024.08.014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2024] [Revised: 08/17/2024] [Accepted: 08/19/2024] [Indexed: 09/04/2024]
Abstract
OBJECTIVE To identify and quantify ability bias in generative artificial intelligence large language model chatbots, specifically OpenAI's ChatGPT and Google's Gemini. DESIGN Observational study of language usage in generative artificial intelligence models. SETTING Investigation-only browser profile restricted to ChatGPT and Gemini. PARTICIPANTS Each chatbot generated 60 descriptions of people prompted without specified functional status, 30 descriptions of people with a disability, 30 descriptions of patients with a disability, and 30 descriptions of athletes with a disability (N=300). INTERVENTIONS Not applicable. MAIN OUTCOME MEASURES Generated descriptions produced by the models were parsed into words that were linguistically analyzed into favorable qualities or limiting qualities. RESULTS Both large language models significantly underestimated disability in a population of people, and linguistic analysis showed that descriptions of people, patients, and athletes with a disability were generated as having significantly fewer favorable qualities and significantly more limitations than people without a disability in both ChatGPT and Gemini. CONCLUSIONS Generative artificial intelligence chatbots demonstrate quantifiable ability bias and often exclude people with disabilities in their responses. Ethical use of these generative large language model chatbots in medical systems should recognize this limitation, and further consideration should be taken in developing equitable artificial intelligence technologies.
Collapse
Affiliation(s)
- Jacob T Urbina
- Department of Physical Medicine and Rehabilitation, McGovern Medical School, UTHealth Houston, Houston, TX.
| | - Peter D Vu
- Department of Physical Medicine and Rehabilitation, McGovern Medical School, UTHealth Houston, Houston, TX
| | - Michael V Nguyen
- Department of Physical Medicine and Rehabilitation, McGovern Medical School, UTHealth Houston, Houston, TX
| |
Collapse
|
14
|
Reichenpfader D, Müller H, Denecke K. A scoping review of large language model based approaches for information extraction from radiology reports. NPJ Digit Med 2024; 7:222. [PMID: 39182008 PMCID: PMC11344824 DOI: 10.1038/s41746-024-01219-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2024] [Accepted: 08/09/2024] [Indexed: 08/27/2024] Open
Abstract
Radiological imaging is a globally prevalent diagnostic method, yet the free text contained in radiology reports is not frequently used for secondary purposes. Natural Language Processing can provide structured data retrieved from these reports. This paper provides a summary of the current state of research on Large Language Model (LLM) based approaches for information extraction (IE) from radiology reports. We conduct a scoping review that follows the PRISMA-ScR guideline. Queries of five databases were conducted on August 1st 2023. Among the 34 studies that met inclusion criteria, only pre-transformer and encoder-based models are described. External validation shows a general performance decrease, although LLMs might improve generalizability of IE approaches. Reports related to CT and MRI examinations, as well as thoracic reports, prevail. Most common challenges reported are missing validation on external data and augmentation of the described methods. Different reporting granularities affect the comparability and transparency of approaches.
Collapse
Affiliation(s)
- Daniel Reichenpfader
- Institute for Patient-Centered Digital Health, Bern University of Applied Sciences, Biel/Bienne, Switzerland.
- Faculty of Medicine, University of Geneva, Geneva, Switzerland.
| | - Henning Müller
- Department of Radiology and Medical Informatics, University of Geneva, Geneva, Switzerland
- Informatics Institute, HES-SO Valais-Wallis, Sierre, Switzerland
| | - Kerstin Denecke
- Institute for Patient-Centered Digital Health, Bern University of Applied Sciences, Biel/Bienne, Switzerland
| |
Collapse
|
15
|
Sato T, Masuda K, Sano C, Matsumoto K, Numata H, Munetoh S, Kasama T, Miyake R. Democratizing Microreactor Technology for Accelerated Discoveries in Chemistry and Materials Research. MICROMACHINES 2024; 15:1064. [PMID: 39337724 PMCID: PMC11434323 DOI: 10.3390/mi15091064] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/25/2024] [Revised: 08/11/2024] [Accepted: 08/14/2024] [Indexed: 09/30/2024]
Abstract
Microreactor technologies have emerged as versatile platforms with the potential to revolutionize chemistry and materials research, offering sustainable solutions to global challenges in environmental and health domains. This survey paper provides an in-depth review of recent advancements in microreactor technologies, focusing on their role in facilitating accelerated discoveries in chemistry and materials. Specifically, we examine the convergence of microfluidics with machine intelligence and automation, enabling the exploitation of the cyber-physical environment as a highly integrated experimentation platform for rapid scientific discovery and process development. We investigate the applicability and limitations of microreactor-enabled discovery accelerators in various chemistry and materials contexts. Despite their tremendous potential, the integration of machine intelligence and automation into microreactor-based experiments presents challenges in establishing fully integrated, automated, and intelligent systems. These challenges can hinder the broader adoption of microreactor technologies within the research community. To address this, we review emerging technologies that can help lower barriers and facilitate the implementation of microreactor-enabled discovery accelerators. Lastly, we provide our perspective on future research directions for democratizing microreactor technologies, with the aim of accelerating scientific discoveries and promoting widespread adoption of these transformative platforms.
Collapse
Affiliation(s)
- Tomomi Sato
- Graduate School of Engineering, The University of Tokyo, Kawasaki 212-0032, Japan; (T.K.); (R.M.)
| | - Koji Masuda
- Department of Physics and Astronomy, University of Exeter, Exeter EX4 4QL, UK;
| | - Chikako Sano
- IBM Semiconductors, IBM Research–Tokyo, Kawasaki 212-0032, Japan; (C.S.); (K.M.); (H.N.); (S.M.)
| | - Keiji Matsumoto
- IBM Semiconductors, IBM Research–Tokyo, Kawasaki 212-0032, Japan; (C.S.); (K.M.); (H.N.); (S.M.)
| | - Hidetoshi Numata
- IBM Semiconductors, IBM Research–Tokyo, Kawasaki 212-0032, Japan; (C.S.); (K.M.); (H.N.); (S.M.)
| | - Seiji Munetoh
- IBM Semiconductors, IBM Research–Tokyo, Kawasaki 212-0032, Japan; (C.S.); (K.M.); (H.N.); (S.M.)
| | - Toshihiro Kasama
- Graduate School of Engineering, The University of Tokyo, Kawasaki 212-0032, Japan; (T.K.); (R.M.)
| | - Ryo Miyake
- Graduate School of Engineering, The University of Tokyo, Kawasaki 212-0032, Japan; (T.K.); (R.M.)
| |
Collapse
|
16
|
Kalupahana D, Kahatapitiya NS, Silva BN, Kim J, Jeon M, Wijenayake U, Wijesinghe RE. Dense Convolutional Neural Network-Based Deep Learning Pipeline for Pre-Identification of Circular Leaf Spot Disease of Diospyros kaki Leaves Using Optical Coherence Tomography. SENSORS (BASEL, SWITZERLAND) 2024; 24:5398. [PMID: 39205092 PMCID: PMC11359294 DOI: 10.3390/s24165398] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/21/2024] [Revised: 07/30/2024] [Accepted: 08/05/2024] [Indexed: 09/04/2024]
Abstract
Circular leaf spot (CLS) disease poses a significant threat to persimmon cultivation, leading to substantial harvest reductions. Existing visual and destructive inspection methods suffer from subjectivity, limited accuracy, and considerable time consumption. This study presents an automated pre-identification method of the disease through a deep learning (DL) based pipeline integrated with optical coherence tomography (OCT), thereby addressing the highlighted issues with the existing methods. The investigation yielded promising outcomes by employing transfer learning with pre-trained DL models, specifically DenseNet-121 and VGG-16. The DenseNet-121 model excels in differentiating among three stages of CLS disease (healthy (H), apparently healthy (or healthy-infected (HI)), and infected (I)). The model achieved precision values of 0.7823 for class-H, 0.9005 for class-HI, and 0.7027 for class-I, supported by recall values of 0.8953 for class-HI and 0.8387 for class-I. Moreover, the performance of CLS detection was enhanced by a supplemental quality inspection model utilizing VGG-16, which attained an accuracy of 98.99% in discriminating between low-detail and high-detail images. Moreover, this study employed a combination of LAMP and A-scan for the dataset labeling process, significantly enhancing the accuracy of the models. Overall, this study underscores the potential of DL techniques integrated with OCT to enhance disease identification processes in agricultural settings, particularly in persimmon cultivation, by offering efficient and objective pre-identification of CLS and enabling early intervention and management strategies.
Collapse
Affiliation(s)
- Deshan Kalupahana
- Department of Computer Engineering, Faculty of Engineering, University of Sri Jayewardenepura, Nugegoda 10250, Sri Lanka; (D.K.); (N.S.K.)
| | - Nipun Shantha Kahatapitiya
- Department of Computer Engineering, Faculty of Engineering, University of Sri Jayewardenepura, Nugegoda 10250, Sri Lanka; (D.K.); (N.S.K.)
| | - Bhagya Nathali Silva
- Department of Information Technology, Faculty of Computing, Sri Lanka Institute of Information Technology, Malabe 10115, Sri Lanka;
- Center for Excellence in Informatics, Electronics & Transmission (CIET), Sri Lanka Institute of Information Technology, Malabe 10115, Sri Lanka
| | - Jeehyun Kim
- School of Electronic and Electrical Engineering, College of IT Engineering, Kyungpook National University, 80 Daehak-ro, Buk-gu, Daegu 41566, Republic of Korea; (J.K.); (M.J.)
| | - Mansik Jeon
- School of Electronic and Electrical Engineering, College of IT Engineering, Kyungpook National University, 80 Daehak-ro, Buk-gu, Daegu 41566, Republic of Korea; (J.K.); (M.J.)
| | - Udaya Wijenayake
- Department of Computer Engineering, Faculty of Engineering, University of Sri Jayewardenepura, Nugegoda 10250, Sri Lanka; (D.K.); (N.S.K.)
| | - Ruchire Eranga Wijesinghe
- Center for Excellence in Informatics, Electronics & Transmission (CIET), Sri Lanka Institute of Information Technology, Malabe 10115, Sri Lanka
- Department of Electrical and Electronic Engineering, Faculty of Engineering, Sri Lanka Institute of Information Technology, Malabe 10115, Sri Lanka
| |
Collapse
|
17
|
Tucudean G, Bucos M, Dragulescu B, Caleanu CD. Natural language processing with transformers: a review. PeerJ Comput Sci 2024; 10:e2222. [PMID: 39145251 PMCID: PMC11322986 DOI: 10.7717/peerj-cs.2222] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2023] [Accepted: 07/08/2024] [Indexed: 08/16/2024]
Abstract
Natural language processing (NLP) tasks can be addressed with several deep learning architectures, and many different approaches have proven to be efficient. This study aims to briefly summarize the use cases for NLP tasks along with the main architectures. This research presents transformer-based solutions for NLP tasks such as Bidirectional Encoder Representations from Transformers (BERT), and Generative Pre-Training (GPT) architectures. To achieve that, we conducted a step-by-step process in the review strategy: identify the recent studies that include Transformers, apply filters to extract the most consistent studies, identify and define inclusion and exclusion criteria, assess the strategy proposed in each study, and finally discuss the methods and architectures presented in the resulting articles. These steps facilitated the systematic summarization and comparative analysis of NLP applications based on Transformer architectures. The primary focus is the current state of the NLP domain, particularly regarding its applications, language models, and data set types. The results provide insights into the challenges encountered in this research domain.
Collapse
Affiliation(s)
- Georgiana Tucudean
- Communications Department, Politehnica University Timișoara, Timișoara, Timiș, România
| | - Marian Bucos
- Communications Department, Politehnica University Timișoara, Timișoara, Timiș, România
| | - Bogdan Dragulescu
- Communications Department, Politehnica University Timișoara, Timișoara, Timiș, România
| | - Catalin Daniel Caleanu
- Applied Electronics Department, Politehnica University Timișoara, Timișoara, Timiș, România
| |
Collapse
|
18
|
Gencer G, Gencer K. A Comparative Analysis of ChatGPT and Medical Faculty Graduates in Medical Specialization Exams: Uncovering the Potential of Artificial Intelligence in Medical Education. Cureus 2024; 16:e66517. [PMID: 39246999 PMCID: PMC11380914 DOI: 10.7759/cureus.66517] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/09/2024] [Indexed: 09/10/2024] Open
Abstract
Background This study aims to evaluate the performance of ChatGPT in the medical specialization exam (MSE) that medical graduates take when choosing their postgraduate specialization and to reveal how artificial intelligence-supported education can increase the quality and academic success of medical education. The research aims to explore the potential applications and advantages of artificial intelligence in medical education and examine ways in which this technology can contribute to student learning and exam preparation. Methodology A total of 240 MSE questions were posed to ChatGPT, 120 of which were basic medical sciences questions and 120 were clinical medical sciences questions. A total of 18,481 people participated in the exam. The performance of medical school graduates was compared with ChatGPT-3.5 in terms of answering these questions correctly. The average score for ChatGPT-3.5 was calculated by averaging the minimum and maximum scores. Calculations were done using the R.4.0.2 environment. Results The general average score of graduates was a minimum of 7.51 in basic sciences and a maximum of 81.46, while in clinical sciences, the average was a minimum of 12.51 and a maximum of 80.78. ChatGPT, on the other hand, had an average of at least 60.00 in basic sciences and a maximum of 72.00, with an average of at least 66.25 and a maximum of 77.00 in clinical sciences. The rate of correct answers in basic medical sciences for graduates was 43.03%, while for ChatGPT was 60.00%. In clinical medical sciences, the rate of correct answers for graduates was 53.29%, while for ChatGPT was 64.16%. ChatGPT performed best with a 91.66% correct answer rate in Obstetrics and Gynecology and an 86.36% correct answer rate in Medical Microbiology. The least successful area for ChatGPT was Anatomy, with a 28.00% correct answer rate, a subfield of basic medical sciences. Graduates outperformed ChatGPT in the Anatomy and Physiology subfields. Significant differences were found in all comparisons between ChatGPT and graduates. Conclusions This study shows that artificial intelligence models such as ChatGPT can provide significant advantages to graduates, as they score higher than medical school graduates. In terms of these benefits, recommended applications include interactive support, private lessons, learning material production, personalized learning plans, self-assessment, motivation boosting, and 24/7 access, among a variety of benefits. As a result, artificial intelligence-supported education can play an important role in improving the quality of medical education and increasing student success.
Collapse
Affiliation(s)
- Gülcan Gencer
- Department of Biostatistics and Medical Informatics, Faculty of Medicine, Afyonkarahisar Health Sciences University, Afyonkarahisar, TUR
| | - Kerem Gencer
- Department of Computer Engineering, Faculty of Engineering, Afyon Kocatepe University, Afyonkarahisar, TUR
| |
Collapse
|
19
|
Masua B, Masasi N. In the heart of Swahili: An exploration of data collection methods and corpus curation for natural language processing. Data Brief 2024; 55:110751. [PMID: 39234059 PMCID: PMC11372376 DOI: 10.1016/j.dib.2024.110751] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2024] [Revised: 06/14/2024] [Accepted: 07/10/2024] [Indexed: 09/06/2024] Open
Abstract
Swahili corpus is a dataset generated by collecting written Kiswahili sentences from different sectors that deals with Kiswahili documents. Corpus of intended language is needed in Natural Language Processing (NLP) task to fit algorithm in order to understand that language before training the model. Swahili corpus dataset generated contained 1,693,228 sentences with 39,639,824 words and 871,452 unique words. Corpus exported in text file format with storage size of 168 MB. These sentences collected from different sources in different categories as follows: - Health (AFYA), Business and Industries (BIASHARA), Parliament (BUNGE), Religion (DINI), Education (ELIMU), News (HABARI), Agriculture (KILIMO), Social Media (MITANDAO), Non-Governmental Organizations (MASHIRIKA YA KIRAIA), Government (SERIKALI), Laws (SHERIA) and Politics (SIASA). This abstract outlines the systematic data collection process employed for the creation of a Swahili corpus derived from multiple public websites and reports. The compilation of this corpus involves a meticulous and comprehensive approach to ensure the representation of diverse linguistic contexts and topics relevant to the Swahili language. The data collection process commenced with the identification of suitable sources across various domains, including news articles, health publications, online forums, and Governmental public reports. Websites and platforms with publicly available Swahili content were systematically crawled and archived to capture a broad spectrum of linguistic expressions. Furthermore, special attention was given to reputable sources to maintain the authenticity of the corpus and linguistic richness. The inclusion of diverse sources ensures that the corpus reflects the linguistic nuances inherent in different contexts and registers within the Swahili language. Additionally, efforts were made to incorporate variations in domain dialects, acknowledging the linguistic diversity present in Swahili. The potential for reusing this Swahili corpus is vast. Researchers, linguists, and language enthusiasts can leverage the diverse and extensive dataset for a multitude of applications, including NLP tasks such as sentiment analysis, textual data clustering, classifications tasks and machine translation. The Corpus can serve as training data for developing and evaluating NLP algorithms, including part-of-speech tagging, and named entity recognition. Also, text mining techniques can be applied to corpus and enable researchers to extract valuable insights, identify patterns, and discover knowledge from large textual datasets.
Collapse
Affiliation(s)
- Bernard Masua
- College of Information and Communication Technologies (CoICT), University of Dar Es Salaam, Ali Hassan Mwinyi Road, Kijitonyama campus, Dar Es Salaam TZ 33335, Tanzania
| | - Noel Masasi
- College of Information and Communication Technologies (CoICT), University of Dar Es Salaam, Ali Hassan Mwinyi Road, Kijitonyama campus, Dar Es Salaam TZ 33335, Tanzania
| |
Collapse
|
20
|
Indiran AP, Fatima H, Chattopadhyay S, Ramadoss S, Radhakrishnan Y. UmamiPreDL: Deep learning model for umami taste prediction of peptides using BERT and CNN. Comput Biol Chem 2024; 111:108116. [PMID: 38823360 DOI: 10.1016/j.compbiolchem.2024.108116] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2023] [Revised: 04/24/2024] [Accepted: 05/28/2024] [Indexed: 06/03/2024]
Abstract
Taste is crucial in driving food choice and preference. Umami is one of the basic tastes defined by characteristic deliciousness and mouthfulness that it imparts to foods. Identification of ingredients to enhance umami taste is of significant value to food industry. Various models have been shown to predict umami taste using feature encodings derived from traditional molecular descriptors such as amphiphilic pseudo-amino acid composition, dipeptide composition, and composition-transition-distribution. Highest reported accuracy of 90.5 % was recently achieved through novel model architecture. Here, we propose use of biological sequence transformers such as ProtBert and ESM2, trained on the Uniref databases, as the feature encoders block. With combination of 2 encoders and 2 classifiers, 4 model architectures were developed. Among the 4 models, ProtBert-CNN model outperformed other models with accuracy of 95 % on 5-fold cross validation data and 94 % on independent data.
Collapse
Affiliation(s)
- Arun Pandiyan Indiran
- ITC Life Sciences and Technology Centre, Peenya Industrial Area, 1st Phase, Bengaluru 560058, India
| | - Humaira Fatima
- ITC Life Sciences and Technology Centre, Peenya Industrial Area, 1st Phase, Bengaluru 560058, India
| | | | - Sureshkumar Ramadoss
- ITC Life Sciences and Technology Centre, Peenya Industrial Area, 1st Phase, Bengaluru 560058, India; ITC Infotech India Limited, Bengaluru 560005, India
| | - Yashwanth Radhakrishnan
- ITC Life Sciences and Technology Centre, Peenya Industrial Area, 1st Phase, Bengaluru 560058, India.
| |
Collapse
|
21
|
Bektaş M, Pereira JK, Daams F, van der Peet DL. ChatGPT in surgery: a revolutionary innovation? Surg Today 2024; 54:964-971. [PMID: 38421439 PMCID: PMC11266448 DOI: 10.1007/s00595-024-02800-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2023] [Accepted: 12/13/2023] [Indexed: 03/02/2024]
Abstract
ChatGPT has brought about a new era of digital health, as this model has become prominent and been rapidly developing since its release. ChatGPT may be able to facilitate improvements in surgery as well; however, the influence of ChatGPT on surgery is largely unknown at present. Therefore, the present study reports on the current applications of ChatGPT in the field of surgery, evaluating its workflow, practical implementations, limitations, and future perspectives. A literature search was performed using the PubMed and Embase databases. The initial search was performed from its inception until July 2023. This study revealed that ChatGPT has promising capabilities in areas of surgical research, education, training, and practice. In daily practice, surgeons and surgical residents can be aided in performing logistics and administrative tasks, and patients can be more efficiently informed about the details of their condition. However, priority should be given to establishing proper policies and protocols to ensure the safe and reliable use of this model.
Collapse
Affiliation(s)
- Mustafa Bektaş
- Amsterdam UMC Location Vrije Universiteit Amsterdam, Surgery, De Boelelaan 1117, Amsterdam, The Netherlands.
| | - Jaime Ken Pereira
- Department of Computer Science, Vrije Universiteit Amsterdam, De Boelelaan 1105, Amsterdam, The Netherlands
| | - Freek Daams
- Amsterdam UMC Location Vrije Universiteit Amsterdam, Surgery, De Boelelaan 1117, Amsterdam, The Netherlands
| | - Donald L van der Peet
- Amsterdam UMC Location Vrije Universiteit Amsterdam, Surgery, De Boelelaan 1117, Amsterdam, The Netherlands
| |
Collapse
|
22
|
Li C, Sutherland D, Richter A, Coombe L, Yanai A, Warren RL, Kotkoff M, Hof F, Hoang LMN, Helbing CC, Birol I. De novo synthetic antimicrobial peptide design with a recurrent neural network. Protein Sci 2024; 33:e5088. [PMID: 38988311 PMCID: PMC11237553 DOI: 10.1002/pro.5088] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2023] [Revised: 05/16/2024] [Accepted: 06/10/2024] [Indexed: 07/12/2024]
Abstract
Antibiotic resistance is recognized as an imminent and growing global health threat. New antimicrobial drugs are urgently needed due to the decreasing effectiveness of conventional small-molecule antibiotics. Antimicrobial peptides (AMPs), a class of host defense peptides, are emerging as promising candidates to address this need. The potential sequence space of amino acids is combinatorially vast, making it possible to extend the current arsenal of antimicrobial agents with a practically infinite number of new peptide-based candidates. However, mining naturally occurring AMPs, whether directly by wet lab screening methods or aided by bioinformatics prediction tools, has its theoretical limit regarding the number of samples or genomic/transcriptomic resources researchers have access to. Further, manually designing novel synthetic AMPs requires prior field knowledge, restricting its throughput. In silico sequence generation methods are gaining interest as a high-throughput solution to the problem. Here, we introduce AMPd-Up, a recurrent neural network based tool for de novo AMP design, and demonstrate its utility over existing methods. Validation of candidates designed by AMPd-Up through antimicrobial susceptibility testing revealed that 40 of the 58 generated sequences possessed antimicrobial activity against Escherichia coli and/or Staphylococcus aureus. These results illustrate that AMPd-Up can be used to design novel synthetic AMPs with potent activities.
Collapse
Affiliation(s)
- Chenkai Li
- Canada's Michael Smith Genome Sciences CentreBC Cancer AgencyVancouverBritish ColumbiaCanada
- Bioinformatics Graduate ProgramUniversity of British ColumbiaVancouverBritish ColumbiaCanada
| | - Darcy Sutherland
- Canada's Michael Smith Genome Sciences CentreBC Cancer AgencyVancouverBritish ColumbiaCanada
- Public Health LaboratoryBritish Columbia Centre for Disease ControlVancouverBritish ColumbiaCanada
- Department of Pathology and Laboratory MedicineUniversity of British ColumbiaVancouverBritish ColumbiaCanada
| | - Amelia Richter
- Canada's Michael Smith Genome Sciences CentreBC Cancer AgencyVancouverBritish ColumbiaCanada
- Public Health LaboratoryBritish Columbia Centre for Disease ControlVancouverBritish ColumbiaCanada
| | - Lauren Coombe
- Canada's Michael Smith Genome Sciences CentreBC Cancer AgencyVancouverBritish ColumbiaCanada
| | - Anat Yanai
- Canada's Michael Smith Genome Sciences CentreBC Cancer AgencyVancouverBritish ColumbiaCanada
- Public Health LaboratoryBritish Columbia Centre for Disease ControlVancouverBritish ColumbiaCanada
| | - René L. Warren
- Canada's Michael Smith Genome Sciences CentreBC Cancer AgencyVancouverBritish ColumbiaCanada
| | - Monica Kotkoff
- Canada's Michael Smith Genome Sciences CentreBC Cancer AgencyVancouverBritish ColumbiaCanada
| | - Fraser Hof
- Department of Chemistry and the Centre for Advanced Materials and Related TechnologyUniversity of VictoriaVictoriaBritish ColumbiaCanada
| | - Linda M. N. Hoang
- Public Health LaboratoryBritish Columbia Centre for Disease ControlVancouverBritish ColumbiaCanada
- Department of Pathology and Laboratory MedicineUniversity of British ColumbiaVancouverBritish ColumbiaCanada
| | - Caren C. Helbing
- Department of Biochemistry and MicrobiologyUniversity of VictoriaVictoriaBritish ColumbiaCanada
| | - Inanc Birol
- Canada's Michael Smith Genome Sciences CentreBC Cancer AgencyVancouverBritish ColumbiaCanada
- Public Health LaboratoryBritish Columbia Centre for Disease ControlVancouverBritish ColumbiaCanada
- Department of Pathology and Laboratory MedicineUniversity of British ColumbiaVancouverBritish ColumbiaCanada
- Department of Medical GeneticsUniversity of British ColumbiaVancouverBritish ColumbiaCanada
| |
Collapse
|
23
|
Hadi A, Tran E, Nagarajan B, Kirpalani A. Evaluation of ChatGPT as a diagnostic tool for medical learners and clinicians. PLoS One 2024; 19:e0307383. [PMID: 39083523 PMCID: PMC11290643 DOI: 10.1371/journal.pone.0307383] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2023] [Accepted: 07/03/2024] [Indexed: 08/02/2024] Open
Abstract
BACKGROUND ChatGPT is a large language model (LLM) trained on over 400 billion words from books, articles, and websites. Its extensive training draws from a large database of information, making it valuable as a diagnostic aid. Moreover, its capacity to comprehend and generate human language allows medical trainees to interact with it, enhancing its appeal as an educational resource. This study aims to investigate ChatGPT's diagnostic accuracy and utility in medical education. METHODS 150 Medscape case challenges (September 2021 to January 2023) were inputted into ChatGPT. The primary outcome was the number (%) of cases for which the answer given was correct. Secondary outcomes included diagnostic accuracy, cognitive load, and quality of medical information. A qualitative content analysis was also conducted to assess its responses. RESULTS ChatGPT answered 49% (74/150) cases correctly. It had an overall accuracy of 74%, a precision of 48.67%, sensitivity of 48.67%, specificity of 82.89%, and an AUC of 0.66. Most answers were considered low cognitive load 51% (77/150) and most answers were complete and relevant 52% (78/150). DISCUSSION ChatGPT in its current form is not accurate as a diagnostic tool. ChatGPT does not necessarily give factual correctness, despite the vast amount of information it was trained on. Based on our qualitative analysis, ChatGPT struggles with the interpretation of laboratory values, imaging results, and may overlook key information relevant to the diagnosis. However, it still offers utility as an educational tool. ChatGPT was generally correct in ruling out a specific differential diagnosis and providing reasonable next diagnostic steps. Additionally, answers were easy to understand, showcasing a potential benefit in simplifying complex concepts for medical learners. Our results should guide future research into harnessing ChatGPT's potential educational benefits, such as simplifying medical concepts and offering guidance on differential diagnoses and next steps.
Collapse
Affiliation(s)
- Ali Hadi
- Department of Paediatrics, Schulich School of Medicine and Dentistry, Western University, London, Ontario, Canada
| | - Edward Tran
- Department of Paediatrics, Schulich School of Medicine and Dentistry, Western University, London, Ontario, Canada
| | - Branavan Nagarajan
- Department of Paediatrics, Schulich School of Medicine and Dentistry, Western University, London, Ontario, Canada
| | - Amrit Kirpalani
- Department of Paediatrics, Schulich School of Medicine and Dentistry, Western University, London, Ontario, Canada
- Division of Nephrology, Children’s Hospital, London Health Sciences Centre, London, Ontario, Canada
| |
Collapse
|
24
|
Kittrell HD, Shaikh A, Adintori PA, McCarthy P, Kohli-Seth R, Nadkarni GN, Sakhuja A. Role of artificial intelligence in critical care nutrition support and research. Nutr Clin Pract 2024. [PMID: 39073166 DOI: 10.1002/ncp.11194] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2024] [Revised: 06/06/2024] [Accepted: 06/28/2024] [Indexed: 07/30/2024] Open
Abstract
Nutrition plays a key role in the comprehensive care of critically ill patients. Determining optimal nutrition strategy, however, remains a subject of intense debate. Artificial intelligence (AI) applications are becoming increasingly common in medicine, and specifically in critical care, driven by the data-rich environment of intensive care units. In this review, we will examine the evidence regarding the application of AI in critical care nutrition. As of now, the use of AI in critical care nutrition is relatively limited, with its primary emphasis on malnutrition screening and tolerance of enteral nutrition. Despite the current scarcity of evidence, the potential for AI for more personalized nutrition management for critically ill patients is substantial. This stems from the ability of AI to integrate multiple data streams reflecting patients' changing needs while addressing inherent heterogeneity. The application of AI in critical care nutrition holds promise for optimizing patient outcomes through tailored and adaptive nutrition interventions. A successful implementation of AI, however, necessitates a multidisciplinary approach, coupled with careful consideration of challenges related to data management, financial aspects, and patient privacy.
Collapse
Affiliation(s)
- Hannah D Kittrell
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, New York, USA
- Mount Sinai Clinical Intelligence Center, Icahn School of Medicine at Mount Sinai, New York, New York, USA
- Division of Data Driven and Digital Medicine, Icahn School of Medicine at Mount Sinai, New York, New York, USA
| | - Ahmed Shaikh
- Institute for Critical Care Medicine, Icahn School of Medicine at Mount Sinai, New York, New York, USA
| | - Peter A Adintori
- Food and Nutrition Services Department, Memorial Sloan Kettering Cancer Center, New York, New York, USA
- Program in Rehabilitation Sciences, New York University Steinhardt, New York, New York, USA
| | - Paul McCarthy
- Department of Cardiovascular and Thoracic Surgery, Division of Cardiovascular Critical Care, West Virginia University, Morgantown, West Virginia, USA
| | - Roopa Kohli-Seth
- Institute for Critical Care Medicine, Icahn School of Medicine at Mount Sinai, New York, New York, USA
| | - Girish N Nadkarni
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, New York, USA
- Mount Sinai Clinical Intelligence Center, Icahn School of Medicine at Mount Sinai, New York, New York, USA
- Division of Data Driven and Digital Medicine, Icahn School of Medicine at Mount Sinai, New York, New York, USA
- Department of Medicine, Division of Nephrology, Icahn School of Medicine at Mount Sinai, New York, New York, USA
| | - Ankit Sakhuja
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, New York, USA
- Division of Data Driven and Digital Medicine, Icahn School of Medicine at Mount Sinai, New York, New York, USA
- Institute for Critical Care Medicine, Icahn School of Medicine at Mount Sinai, New York, New York, USA
| |
Collapse
|
25
|
Affengruber L, Nussbaumer-Streit B, Hamel C, Van der Maten M, Thomas J, Mavergames C, Spijker R, Gartlehner G. Rapid review methods series: Guidance on the use of supportive software. BMJ Evid Based Med 2024; 29:264-271. [PMID: 38242566 PMCID: PMC11287527 DOI: 10.1136/bmjebm-2023-112530] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 12/22/2023] [Indexed: 01/21/2024]
Abstract
This paper is part of a series of methodological guidance from the Cochrane Rapid Reviews Methods Group. Rapid reviews (RRs) use modified systematic review methods to accelerate the review process while maintaining systematic, transparent and reproducible methods. This paper guides how to use supportive software for RRs.We strongly encourage the use of supportive software throughout RR production. Specifically, we recommend (1) using collaborative online platforms that enable working in parallel, allow for real-time project management and centralise review details; (2) using automation software to support, but not entirely replace a human reviewer and human judgement and (3) being transparent in reporting the methodology and potential risk for bias due to the use of supportive software.
Collapse
Affiliation(s)
- Lisa Affengruber
- Department for Evidence-based Medicine and Evaluation, Cochrane Austria, University for Continuing Education Krems, Krems, Austria
- Department of Family Medicine, Maastricht University, Maastricht, The Netherlands
| | - Barbara Nussbaumer-Streit
- Department for Evidence-based Medicine and Evaluation, Cochrane Austria, University for Continuing Education Krems, Krems, Austria
| | - Candyce Hamel
- Canadian Association of Radiologists, Ottawa, Ontario, Canada
- School of Epidemiology and Public Health, Faculty of Medicine, University of Ottawa, Ottawa, Ontario, Canada
| | - Miriam Van der Maten
- Knowledge Institute, Dutch Association of Medical Specialists, Utrecht, The Netherlands
| | - James Thomas
- University College London, UCL Social Research Institute, London, UK
| | | | - Rene Spijker
- Cochrane Netherlands, Julius Center for Health Sciences and Primary Care, UMC Utrecht, Utrecht University, Utrecht, The Netherlands
| | - Gerald Gartlehner
- Department for Evidence-based Medicine and Evaluation, Cochrane Austria, University for Continuing Education Krems, Krems, Austria
- Center for Public Health Methods, RTI International, Research Triangle Park, North Carolina, USA
| |
Collapse
|
26
|
Deneault A, Dumais A, Désilets M, Hudon A. Natural Language Processing and Schizophrenia: A Scoping Review of Uses and Challenges. J Pers Med 2024; 14:744. [PMID: 39063998 PMCID: PMC11278236 DOI: 10.3390/jpm14070744] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2024] [Revised: 07/04/2024] [Accepted: 07/10/2024] [Indexed: 07/28/2024] Open
Abstract
(1) Background: Approximately 1% of the global population is affected by schizophrenia, a disorder marked by cognitive deficits, delusions, hallucinations, and language issues. It is associated with genetic, neurological, and environmental factors, and linked to dopaminergic hyperactivity and neurotransmitter imbalances. Recent research reveals that patients exhibit significant language impairments, such as reduced verbal output and fluency. Advances in machine learning and natural language processing show potential for early diagnosis and personalized treatments, but additional research is required for the practical application and interpretation of such technology. The objective of this study is to explore the applications of natural language processing in patients diagnosed with schizophrenia. (2) Methods: A scoping review was conducted across multiple electronic databases, including Medline, PubMed, Embase, and PsycInfo. The search strategy utilized a combination of text words and subject headings, focusing on schizophrenia and natural language processing. Systematically extracted information included authors, population, primary uses of the natural language processing algorithms, main outcomes, and limitations. The quality of the identified studies was assessed. (3) Results: A total of 516 eligible articles were identified, from which 478 studies were excluded based on the first analysis of titles and abstracts. Of the remaining 38 studies, 18 were selected as part of this scoping review. The following six main uses of natural language processing were identified: diagnostic and predictive modeling, followed by specific linguistic phenomena, speech and communication analysis, social media and online content analysis, clinical and cognitive assessment, and linguistic feature analysis. (4) Conclusions: This review highlights the main uses of natural language processing in the field of schizophrenia and the need for more studies to validate the effectiveness of natural language processing in diagnosing and treating schizophrenia.
Collapse
Affiliation(s)
- Antoine Deneault
- Department of Psychiatry and Addictology, Faculty of Medicine, Université de Montréal, Montreal, QC H3T 1J4, Canada;
| | - Alexandre Dumais
- Department of Psychiatry, Institut Universitaire en santé Mentale de Montréal, Montreal, QC H1N 3M5, Canada; (A.D.); (M.D.)
| | - Marie Désilets
- Department of Psychiatry, Institut Universitaire en santé Mentale de Montréal, Montreal, QC H1N 3M5, Canada; (A.D.); (M.D.)
| | - Alexandre Hudon
- Department of Psychiatry, Institut Universitaire en santé Mentale de Montréal, Montreal, QC H1N 3M5, Canada; (A.D.); (M.D.)
| |
Collapse
|
27
|
Kwok WH, Zhang Y, Wang G. Artificial intelligence in perinatal mental health research: A scoping review. Comput Biol Med 2024; 177:108685. [PMID: 38838557 DOI: 10.1016/j.compbiomed.2024.108685] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2023] [Revised: 04/28/2024] [Accepted: 06/01/2024] [Indexed: 06/07/2024]
Abstract
The intersection of Artificial Intelligence (AI) and perinatal mental health research presents promising avenues, yet uncovers significant challenges for innovation. This review explicitly focuses on this multidisciplinary field and undertakes a comprehensive exploration of existing research therein. Through a scoping review guided by the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) framework, we searched relevant literature spanning a decade (2013-2023) and selected fourteen studies for our analysis. We first provide an overview of the main AI techniques and their development, including traditional methods across different categories, as well as recent emerging methods in the field. Then, through our analysis of the literature, we summarize the predominant AI and ML techniques adopted and their applications in perinatal mental health studies, such as identifying risk factors, predicting perinatal mental health disorders, voice assistants, and Q&A chatbots. We also discuss existing limitations and potential challenges that hinder AI technologies from improving perinatal mental health outcomes, and suggest several promising directions for future research to meet real needs in the field and facilitate the translation of research into clinical settings.
Collapse
Affiliation(s)
- Wai Hang Kwok
- School of Nursing and Midwifery, Edith Cowan University, WA, Australia
| | - Yuanpeng Zhang
- Department of Medical Informatics, Nantong University, Nantong, 226001, China
| | - Guanjin Wang
- School of Information Technology, Murdoch University, Murdoch, WA, Australia.
| |
Collapse
|
28
|
Khan M, Banerjee S, Muskawad S, Maity R, Chowdhury SR, Ejaz R, Kuuzie E, Satnarine T. The Impact of Artificial Intelligence on Allergy Diagnosis and Treatment. Curr Allergy Asthma Rep 2024; 24:361-372. [PMID: 38954325 DOI: 10.1007/s11882-024-01152-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/19/2024] [Indexed: 07/04/2024]
Abstract
PURPOSE OF REVIEW Artificial intelligence (AI), be it neuronal networks, machine learning or deep learning, has numerous beneficial effects on healthcare systems; however, its potential applications and diagnostic capabilities for immunologic diseases have yet to be explored. Understanding AI systems can help healthcare workers better assimilate artificial intelligence into their practice and unravel its potential in diagnostics, clinical research, and disease management. RECENT FINDINGS We reviewed recent advancements in AI systems and their integration in healthcare systems, along with their potential benefits in the diagnosis and management of diseases. We explored machine learning as employed in allergy diagnosis and its learning patterns from patient datasets, as well as the possible advantages of using AI in the field of research related to allergic reactions and even remote monitoring. Considering the ethical challenges and privacy concerns raised by clinicians and patients with regard to integrating AI in healthcare, we explored the new guidelines adapted by regulatory bodies. Despite these challenges, AI appears to have been successfully incorporated into various healthcare systems and is providing patient-centered solutions while simultaneously assisting healthcare workers. Artificial intelligence offers new hope in the field of immunologic disease diagnosis, monitoring, and management and thus has the potential to revolutionize healthcare systems.
Collapse
Affiliation(s)
- Maham Khan
- Fatima Jinnah Medical University, Lahore, Pakistan.
| | | | | | - Rick Maity
- Institute of Post Graduate Medical Education and Research, Kolkata, West Bengal, India
| | | | - Rida Ejaz
- Shifa College of Medicine, Islamabad, Pakistan
| | | | | |
Collapse
|
29
|
Lam BD, Chrysafi P, Chiasakul T, Khosla H, Karagkouni D, McNichol M, Adamski A, Reyes N, Abe K, Mantha S, Vlachos IS, Zwicker JI, Patell R. Machine learning natural language processing for identifying venous thromboembolism: systematic review and meta-analysis. Blood Adv 2024; 8:2991-3000. [PMID: 38522096 PMCID: PMC11215191 DOI: 10.1182/bloodadvances.2023012200] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2023] [Revised: 02/22/2024] [Accepted: 02/22/2024] [Indexed: 03/26/2024] Open
Abstract
ABSTRACT Venous thromboembolism (VTE) is a leading cause of preventable in-hospital mortality. Monitoring VTE cases is limited by the challenges of manual medical record review and diagnosis code interpretation. Natural language processing (NLP) can automate the process. Rule-based NLP methods are effective but time consuming. Machine learning (ML)-NLP methods present a promising solution. We conducted a systematic review and meta-analysis of studies published before May 2023 that use ML-NLP to identify VTE diagnoses in the electronic health records. Four reviewers screened all manuscripts, excluding studies that only used a rule-based method. A meta-analysis evaluated the pooled performance of each study's best performing model that evaluated for pulmonary embolism and/or deep vein thrombosis. Pooled sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) with confidence interval (CI) were calculated by DerSimonian and Laird method using a random-effects model. Study quality was assessed using an adapted TRIPOD (Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis) tool. Thirteen studies were included in the systematic review and 8 had data available for meta-analysis. Pooled sensitivity was 0.931 (95% CI, 0.881-0.962), specificity 0.984 (95% CI, 0.967-0.992), PPV 0.910 (95% CI, 0.865-0.941) and NPV 0.985 (95% CI, 0.977-0.990). All studies met at least 13 of the 21 NLP-modified TRIPOD items, demonstrating fair quality. The highest performing models used vectorization rather than bag-of-words and deep-learning techniques such as convolutional neural networks. There was significant heterogeneity in the studies, and only 4 validated their model on an external data set. Further standardization of ML studies can help progress this novel technology toward real-world implementation.
Collapse
Affiliation(s)
- Barbara D. Lam
- Division of Hematology, Department of Medicine, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA
- Division of Clinical Informatics, Department of Medicine, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA
| | - Pavlina Chrysafi
- Department of Medicine, Mount Auburn Hospital, Harvard Medical School, Boston, MA
| | - Thita Chiasakul
- Center of Excellence in Translational Hematology, Division of Hematology, Department of Medicine, Faculty of Medicine, Chulalongkorn University and King Chulalongkorn Memorial Hospital, Thai Red Cross Society, Bangkok, Thailand
| | - Harshit Khosla
- Department of Medicine, Saint Vincent Hospital, Worcester, MA
| | - Dimitra Karagkouni
- Department of Pathology, Cancer Research Institute, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA
| | - Megan McNichol
- Library Sciences, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA
| | - Alys Adamski
- Division of Blood Disorders, National Center on Birth Defects and Developmental Disabilities, Centers for Disease Control and Prevention, Atlanta, GA
| | - Nimia Reyes
- Division of Blood Disorders, National Center on Birth Defects and Developmental Disabilities, Centers for Disease Control and Prevention, Atlanta, GA
| | - Karon Abe
- Division of Blood Disorders, National Center on Birth Defects and Developmental Disabilities, Centers for Disease Control and Prevention, Atlanta, GA
| | - Simon Mantha
- Division of Hematology, Department of Medicine, Memorial Sloan Kettering Cancer Center, New York, NY
| | - Ioannis S. Vlachos
- Department of Pathology, Cancer Research Institute, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA
| | - Jeffrey I. Zwicker
- Division of Hematology, Department of Medicine, Memorial Sloan Kettering Cancer Center, New York, NY
| | - Rushad Patell
- Division of Hematology, Department of Medicine, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA
| |
Collapse
|
30
|
Abbasi OR, Alesheikh AA, Lotfata A. Semantic similarity is not enough: A novel NLP-based semantic similarity measure in geospatial context. iScience 2024; 27:109883. [PMID: 38974474 PMCID: PMC11225810 DOI: 10.1016/j.isci.2024.109883] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2023] [Revised: 01/20/2024] [Accepted: 04/30/2024] [Indexed: 07/09/2024] Open
Abstract
In this study, we addressed two primary challenges: firstly, the issue of domain shift, which pertains to changes in data characteristics or context that can impact model performance, and secondly, the discrepancy between semantic similarity and geographical distance. We employed topic modeling in conjunction with the BERT architecture. Our model was crafted to enhance similarity computations applied to geospatial text, aiming to integrate both semantic similarity and geographical proximity. We tested the model on two datasets, Persian Wikipedia articles and rental property advertisements. The findings demonstrate that the model effectively improved the correlation between semantic similarity and geographical distance. Furthermore, evaluation by real-world users within a recommender system context revealed a notable increase in user satisfaction by approximately 22% for Wikipedia articles and 56% for advertisements.
Collapse
Affiliation(s)
- Omid Reza Abbasi
- Department of Geospatial Information Systems, K. N. Toosi University of Technology, Tehran, Iran
| | - Ali Asghar Alesheikh
- Department of Geospatial Information Systems, K. N. Toosi University of Technology, Tehran, Iran
| | - Aynaz Lotfata
- Department of Pathology, Microbiology, and Immunology, School of Veterinary Medicine, University of California, Davis, Davis, CA, USA
| |
Collapse
|
31
|
Liu J. ChatGPT: perspectives from human-computer interaction and psychology. Front Artif Intell 2024; 7:1418869. [PMID: 38957452 PMCID: PMC11217544 DOI: 10.3389/frai.2024.1418869] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2024] [Accepted: 06/04/2024] [Indexed: 07/04/2024] Open
Abstract
The release of GPT-4 has garnered widespread attention across various fields, signaling the impending widespread adoption and application of Large Language Models (LLMs). However, previous research has predominantly focused on the technical principles of ChatGPT and its social impact, overlooking its effects on human-computer interaction and user psychology. This paper explores the multifaceted impacts of ChatGPT on human-computer interaction, psychology, and society through a literature review. The author investigates ChatGPT's technical foundation, including its Transformer architecture and RLHF (Reinforcement Learning from Human Feedback) process, enabling it to generate human-like responses. In terms of human-computer interaction, the author studies the significant improvements GPT models bring to conversational interfaces. The analysis extends to psychological impacts, weighing the potential of ChatGPT to mimic human empathy and support learning against the risks of reduced interpersonal connections. In the commercial and social domains, the paper discusses the applications of ChatGPT in customer service and social services, highlighting the improvements in efficiency and challenges such as privacy issues. Finally, the author offers predictions and recommendations for ChatGPT's future development directions and its impact on social relationships.
Collapse
Affiliation(s)
- Jiaxi Liu
- Wee Kim Wee School of Communication and Information, Nanyang Technological University, Singapore, Singapore
| |
Collapse
|
32
|
Bonnechère B. Unlocking the Black Box? A Comprehensive Exploration of Large Language Models in Rehabilitation. Am J Phys Med Rehabil 2024; 103:532-537. [PMID: 38261757 DOI: 10.1097/phm.0000000000002440] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2024]
Abstract
ABSTRACT Rehabilitation is a vital component of health care, aiming to restore function and improve the well-being of individuals with disabilities or injuries. Nevertheless, the rehabilitation process is often likened to a " black box ," with complexities that pose challenges for comprehensive analysis and optimization. The emergence of large language models offers promising solutions to better understand this " black box ." Large language models excel at comprehending and generating human-like text, making them valuable in the healthcare sector. In rehabilitation, healthcare professionals must integrate a wide range of data to create effective treatment plans, akin to selecting the best ingredients for the " black box. " Large language models enhance data integration, communication, assessment, and prediction.This article delves into the ground-breaking use of large language models as a tool to further understand the rehabilitation process. Large language models address current rehabilitation issues, including data bias, contextual comprehension, and ethical concerns. Collaboration with healthcare experts and rigorous validation is crucial when deploying large language models. Integrating large language models into rehabilitation yields insights into this intricate process, enhancing data-driven decision making, refining clinical practices, and predicting rehabilitation outcomes. Although challenges persist, large language models represent a significant stride in rehabilitation, underscoring the importance of ethical use and collaboration.
Collapse
Affiliation(s)
- Bruno Bonnechère
- From the REVAL Rehabilitation Research Center, Faculty of Rehabilitation Sciences, Hasselt University, Diepenbeek, Belgium; Technology-Supported and Data-Driven Rehabilitation, Data Sciences Institute, Hasselt University, Diepenbeek, Belgium; and Department of PXL-Healthcare, PXL University of Applied Sciences and Arts, Hasselt, Belgium
| |
Collapse
|
33
|
Rodler S, Ganjavi C, De Backer P, Magoulianitis V, Ramacciotti LS, De Castro Abreu AL, Gill IS, Cacciamani GE. Generative artificial intelligence in surgery. Surgery 2024; 175:1496-1502. [PMID: 38582732 DOI: 10.1016/j.surg.2024.02.019] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2023] [Revised: 02/18/2024] [Accepted: 02/23/2024] [Indexed: 04/08/2024]
Abstract
Generative artificial intelligence is able to collect, extract, digest, and generate information in an understandable way for humans. As the first surgical applications of generative artificial intelligence are applied, this perspective paper aims to provide a comprehensive overview of current applications and future perspectives for the application of generative artificial intelligence in surgery, from preoperative planning to training. Generative artificial intelligence can be used before surgery for planning and decision support by extracting patient information and providing patients with information and simulation regarding the procedure. Intraoperatively, generative artificial intelligence can document data that is normally not captured as intraoperative adverse events or provide information to help decision-making. Postoperatively, GAIs can help with patient discharge and follow-up. The ability to provide real-time feedback and store it for later review is an important capability of GAIs. GAI applications are emerging as highly specialized, task-specific tools for tasks such as data extraction, synthesis, presentation, and communication within the realm of surgery. GAIs have the potential to play a pivotal role in facilitating interaction between surgeons and artificial intelligence.
Collapse
Affiliation(s)
- Severin Rodler
- USC Institute of Urology and Catherine and Joseph Aresty Department of Urology, Keck School of Medicine, University of Southern California, Los Angeles, CA; Artificial Intelligence Center at USC Urology, USC Institute of Urology, University of Southern California, Los Angeles, CA; Department of Urology, University Hospital of LMU Munich, Germany; Young Academic Working Group in Urologic Technology of the European Association of Urology, Arnhem, The Netherlands
| | - Conner Ganjavi
- USC Institute of Urology and Catherine and Joseph Aresty Department of Urology, Keck School of Medicine, University of Southern California, Los Angeles, CA; Artificial Intelligence Center at USC Urology, USC Institute of Urology, University of Southern California, Los Angeles, CA
| | - Pieter De Backer
- Young Academic Working Group in Urologic Technology of the European Association of Urology, Arnhem, The Netherlands; Department of Urology, Onze-Lieve-Vrouwziekenhuis Hospital, Aalst, Belgium; ORSI Academy, Ghent, Belgium
| | - Vasileios Magoulianitis
- Ming Hsieh Department of Electrical and Computer Engineering, University of Southern California, Los Angeles, CA
| | - Lorenzo Storino Ramacciotti
- USC Institute of Urology and Catherine and Joseph Aresty Department of Urology, Keck School of Medicine, University of Southern California, Los Angeles, CA; Artificial Intelligence Center at USC Urology, USC Institute of Urology, University of Southern California, Los Angeles, CA
| | - Andre Luis De Castro Abreu
- USC Institute of Urology and Catherine and Joseph Aresty Department of Urology, Keck School of Medicine, University of Southern California, Los Angeles, CA; Artificial Intelligence Center at USC Urology, USC Institute of Urology, University of Southern California, Los Angeles, CA
| | - Inderbir S Gill
- USC Institute of Urology and Catherine and Joseph Aresty Department of Urology, Keck School of Medicine, University of Southern California, Los Angeles, CA; Artificial Intelligence Center at USC Urology, USC Institute of Urology, University of Southern California, Los Angeles, CA
| | - Giovanni E Cacciamani
- USC Institute of Urology and Catherine and Joseph Aresty Department of Urology, Keck School of Medicine, University of Southern California, Los Angeles, CA; Artificial Intelligence Center at USC Urology, USC Institute of Urology, University of Southern California, Los Angeles, CA; Young Academic Working Group in Urologic Technology of the European Association of Urology, Arnhem, The Netherlands.
| |
Collapse
|
34
|
Iscoe M, Socrates V, Gilson A, Chi L, Li H, Huang T, Kearns T, Perkins R, Khandjian L, Taylor RA. Identifying signs and symptoms of urinary tract infection from emergency department clinical notes using large language models. Acad Emerg Med 2024; 31:599-610. [PMID: 38567658 DOI: 10.1111/acem.14883] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2023] [Revised: 01/24/2024] [Accepted: 01/24/2024] [Indexed: 04/04/2024]
Abstract
BACKGROUND Natural language processing (NLP) tools including recently developed large language models (LLMs) have myriad potential applications in medical care and research, including the efficient labeling and classification of unstructured text such as electronic health record (EHR) notes. This opens the door to large-scale projects that rely on variables that are not typically recorded in a structured form, such as patient signs and symptoms. OBJECTIVES This study is designed to acquaint the emergency medicine research community with the foundational elements of NLP, highlighting essential terminology, annotation methodologies, and the intricacies involved in training and evaluating NLP models. Symptom characterization is critical to urinary tract infection (UTI) diagnosis, but identification of symptoms from the EHR has historically been challenging, limiting large-scale research, public health surveillance, and EHR-based clinical decision support. We therefore developed and compared two NLP models to identify UTI symptoms from unstructured emergency department (ED) notes. METHODS The study population consisted of patients aged ≥ 18 who presented to an ED in a northeastern U.S. health system between June 2013 and August 2021 and had a urinalysis performed. We annotated a random subset of 1250 ED clinician notes from these visits for a list of 17 UTI symptoms. We then developed two task-specific LLMs to perform the task of named entity recognition: a convolutional neural network-based model (SpaCy) and a transformer-based model designed to process longer documents (Clinical Longformer). Models were trained on 1000 notes and tested on a holdout set of 250 notes. We compared model performance (precision, recall, F1 measure) at identifying the presence or absence of UTI symptoms at the note level. RESULTS A total of 8135 entities were identified in 1250 notes; 83.6% of notes included at least one entity. Overall F1 measure for note-level symptom identification weighted by entity frequency was 0.84 for the SpaCy model and 0.88 for the Longformer model. F1 measure for identifying presence or absence of any UTI symptom in a clinical note was 0.96 (232/250 correctly classified) for the SpaCy model and 0.98 (240/250 correctly classified) for the Longformer model. CONCLUSIONS The study demonstrated the utility of LLMs and transformer-based models in particular for extracting UTI symptoms from unstructured ED clinical notes; models were highly accurate for detecting the presence or absence of any UTI symptom on the note level, with variable performance for individual symptoms.
Collapse
Affiliation(s)
- Mark Iscoe
- Department of Emergency Medicine, Yale School of Medicine, New Haven, Connecticut, USA
- Section for Biomedical Informatics and Data Science, Yale University School of Medicine, New Haven, Connecticut, USA
| | - Vimig Socrates
- Section for Biomedical Informatics and Data Science, Yale University School of Medicine, New Haven, Connecticut, USA
- Program of Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut, USA
| | - Aidan Gilson
- Yale School of Medicine, New Haven, Connecticut, USA
| | - Ling Chi
- Department of Biostatistics, Yale School of Public Health, New Haven, Connecticut, USA
| | - Huan Li
- Program of Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut, USA
| | - Thomas Huang
- Yale School of Medicine, New Haven, Connecticut, USA
| | - Thomas Kearns
- Department of Emergency Medicine, Yale School of Medicine, New Haven, Connecticut, USA
| | - Rachelle Perkins
- Department of Emergency Medicine, Yale School of Medicine, New Haven, Connecticut, USA
| | - Laura Khandjian
- Department of Emergency Medicine, Yale School of Medicine, New Haven, Connecticut, USA
| | - R Andrew Taylor
- Department of Emergency Medicine, Yale School of Medicine, New Haven, Connecticut, USA
- Section for Biomedical Informatics and Data Science, Yale University School of Medicine, New Haven, Connecticut, USA
| |
Collapse
|
35
|
Azadi A, Gorjinejad F, Mohammad-Rahimi H, Tabrizi R, Alam M, Golkar M. Evaluation of AI-generated responses by different artificial intelligence chatbots to the clinical decision-making case-based questions in oral and maxillofacial surgery. Oral Surg Oral Med Oral Pathol Oral Radiol 2024; 137:587-593. [PMID: 38570273 DOI: 10.1016/j.oooo.2024.02.018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2024] [Revised: 02/24/2024] [Accepted: 02/26/2024] [Indexed: 04/05/2024]
Abstract
OBJECTIVES This study aims to evaluate the correctness of the generated answers by Google Bard, GPT-3.5, GPT-4, Claude-Instant, and Bing chatbots to decision-making clinical questions in the oral and maxillofacial surgery (OMFS) area. STUDY DESIGN A group of 3 board-certified oral and maxillofacial surgeons designed a questionnaire with 50 case-based questions in multiple-choice and open-ended formats. Answers of chatbots to multiple-choice questions were examined against the chosen option by 3 referees. The chatbots' answers to the open-ended questions were evaluated based on the modified global quality scale. A P-value under .05 was considered significant. RESULTS Bard, GPT-3.5, GPT-4, Claude-Instant, and Bing answered 34%, 36%, 38%, 38%, and 26% of the questions correctly, respectively. In open-ended questions, GPT-4 scored the most answers evaluated as grades "4" or "5," and Bing scored the most answers evaluated as grades "1" or "2." There were no statistically significant differences between the 5 chatbots in responding to the open-ended (P = .275) and multiple-choice (P = .699) questions. CONCLUSION Considering the major inaccuracies in the responses of chatbots, despite their relatively good performance in answering open-ended questions, this technology yet cannot be trusted as a consultant for clinicians in decision-making situations.
Collapse
Affiliation(s)
- Ali Azadi
- Dentofacial Deformities Research Center, Research Institute for Dental Sciences, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Fatemeh Gorjinejad
- Islamic Azad University of Medical Sciences, Faculty of Dentistry, Tehran, Iran
| | - Hossein Mohammad-Rahimi
- Dentofacial Deformities Research Center, Research Institute for Dental Sciences, Shahid Beheshti University of Medical Sciences, Tehran, Iran; Topic Group Dental Diagnostics and Digital Dentistry, ITU/WHO Focus Group AI on Health, Berlin, Germany
| | - Reza Tabrizi
- Department of Oral and Maxillofacial Surgery, School of Dentistry, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Mostafa Alam
- Department of Oral and Maxillofacial Surgery, School of Dentistry, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Mohsen Golkar
- Department of Oral and Maxillofacial Surgery, School of Dentistry, Shahid Beheshti University of Medical Sciences, Tehran, Iran.
| |
Collapse
|
36
|
Murcia VM, Aggarwal V, Pesaladinne N, Thammineni R, Do N, Alterovitz G, Fricks RB. Automating Clinical Trial Matches Via Natural Language Processing of Synthetic Electronic Health Records and Clinical Trial Eligibility Criteria. AMIA JOINT SUMMITS ON TRANSLATIONAL SCIENCE PROCEEDINGS. AMIA JOINT SUMMITS ON TRANSLATIONAL SCIENCE 2024; 2024:125-134. [PMID: 38827083 PMCID: PMC11141802] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Subscribe] [Scholar Register] [Indexed: 06/04/2024]
Abstract
Clinical trials are critical to many medical advances; however, recruiting patients remains a persistent obstacle. Automated clinical trial matching could expedite recruitment across all trial phases. We detail our initial efforts towards automating the matching process by linking realistic synthetic electronic health records to clinical trial eligibility criteria using natural language processing methods. We also demonstrate how the Sørensen-Dice Index can be adapted to quantify match quality between a patient and a clinical trial.
Collapse
Affiliation(s)
- Victor M Murcia
- VA Massachusetts Veterans Epidemiology Research and Information Center, Boston, MA
- VA National Artificial Intelligence Institute, Washington, D.C
| | - Vinod Aggarwal
- VHA Office of Healthcare Innovation and Learning, VA Central Office, Washington DC
- MDClone, Be'er Sheva, Israel
| | | | - Ram Thammineni
- CTS Group, Girls Computing League, Nonprofit Organization, Herndon, VA
| | - Nhan Do
- VA Massachusetts Veterans Epidemiology Research and Information Center, Boston, MA
| | - Gil Alterovitz
- VA National Artificial Intelligence Institute, Washington, D.C
| | - Rafael B Fricks
- VA Massachusetts Veterans Epidemiology Research and Information Center, Boston, MA
- VA National Artificial Intelligence Institute, Washington, D.C
| |
Collapse
|
37
|
Cortial L, Montero V, Tourlet S, Del Bano J, Blin O. Artificial intelligence in drug repurposing for rare diseases: a mini-review. Front Med (Lausanne) 2024; 11:1404338. [PMID: 38841574 PMCID: PMC11150798 DOI: 10.3389/fmed.2024.1404338] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2024] [Accepted: 04/29/2024] [Indexed: 06/07/2024] Open
Abstract
Drug repurposing, the process of identifying new uses for existing drugs beyond their original indications, offers significant advantages in terms of reduced development time and costs, particularly in addressing unmet medical needs in rare diseases. Artificial intelligence (AI) has emerged as a transformative force in healthcare, and by leveraging AI technologies, researchers aim to overcome some of the challenges associated with rare diseases. This review presents concrete case studies, as well as pre-existing platforms, initiatives, and companies that demonstrate the application of AI for drug repurposing in rare diseases. Despite representing a modest part of the literature compared to other diseases such as COVID-19 or cancer, the growing interest, and investment in AI for drug repurposing in rare diseases underscore its potential to accelerate treatment availability for patients with unmet medical needs.
Collapse
Affiliation(s)
- Lucas Cortial
- OrphanDEV FCRIN Reference Network, Aix Marseille Univ, APHM, INSERM, Inst Neurosci Syst, CHU Timone, Marseille, France
- Thelonius Mind, Marseille, France
| | - Vincent Montero
- OrphanDEV FCRIN Reference Network, Aix Marseille Univ, APHM, INSERM, Inst Neurosci Syst, CHU Timone, Marseille, France
- Thelonius Mind, Marseille, France
| | | | | | - Olivier Blin
- OrphanDEV FCRIN Reference Network, Aix Marseille Univ, APHM, INSERM, Inst Neurosci Syst, CHU Timone, Marseille, France
| |
Collapse
|
38
|
Le KDR, Tay SBP, Choy KT, Verjans J, Sasanelli N, Kong JCH. Applications of natural language processing tools in the surgical journey. Front Surg 2024; 11:1403540. [PMID: 38826809 PMCID: PMC11140056 DOI: 10.3389/fsurg.2024.1403540] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2024] [Accepted: 05/07/2024] [Indexed: 06/04/2024] Open
Abstract
Background Natural language processing tools are becoming increasingly adopted in multiple industries worldwide. They have shown promising results however their use in the field of surgery is under-recognised. Many trials have assessed these benefits in small settings with promising results before large scale adoption can be considered in surgery. This study aims to review the current research and insights into the potential for implementation of natural language processing tools into surgery. Methods A narrative review was conducted following a computer-assisted literature search on Medline, EMBASE and Google Scholar databases. Papers related to natural language processing tools and consideration into their use for surgery were considered. Results Current applications of natural language processing tools within surgery are limited. From the literature, there is evidence of potential improvement in surgical capability and service delivery, such as through the use of these technologies to streamline processes including surgical triaging, data collection and auditing, surgical communication and documentation. Additionally, there is potential to extend these capabilities to surgical academia to improve processes in surgical research and allow innovation in the development of educational resources. Despite these outcomes, the evidence to support these findings are challenged by small sample sizes with limited applicability to broader settings. Conclusion With the increasing adoption of natural language processing technology, such as in popular forms like ChatGPT, there has been increasing research in the use of these tools within surgery to improve surgical workflow and efficiency. This review highlights multifaceted applications of natural language processing within surgery, albeit with clear limitations due to the infancy of the infrastructure available to leverage these technologies. There remains room for more rigorous research into broader capability of natural language processing technology within the field of surgery and the need for cross-sectoral collaboration to understand the ways in which these algorithms can best be integrated.
Collapse
Affiliation(s)
- Khang Duy Ricky Le
- Department of General Surgical Specialties, The Royal Melbourne Hospital, Melbourne, VIC, Australia
- Department of Surgical Oncology, Peter MacCallum Cancer Centre, Melbourne, VIC, Australia
- Geelong Clinical School, Deakin University, Geelong, VIC, Australia
- Department of Medical Education, The University of Melbourne, Melbourne, VIC, Australia
| | - Samuel Boon Ping Tay
- Department of Anaesthesia and Pain Medicine, Eastern Health, Box Hill, VIC, Australia
| | - Kay Tai Choy
- Department of Surgery, Austin Health, Melbourne, VIC, Australia
| | - Johan Verjans
- Australian Institute for Machine Learning (AIML), University of Adelaide, Adelaide, SA, Australia
- Lifelong Health Theme (Platform AI), South Australian Health and Medical Research Institute, Adelaide, SA, Australia
| | - Nicola Sasanelli
- Division of Information Technology, Engineering and the Environment, University of South Australia, Adelaide, SA, Australia
- Department of Operations (Strategic and International Partnerships), SmartSAT Cooperative Research Centre, Adelaide, SA, Australia
- Agora High Tech, Adelaide, SA, Australia
| | - Joseph C. H. Kong
- Department of Surgical Oncology, Peter MacCallum Cancer Centre, Melbourne, VIC, Australia
- Monash University Department of Surgery, Alfred Hospital, Melbourne, VIC, Australia
- Department of Colorectal Surgery, Alfred Hospital, Melbourne, VIC, Australia
- Sir Peter MacCallum Department of Oncology, The University of Melbourne, Melbourne, VIC, Australia
| |
Collapse
|
39
|
Xu Z, Fang Q, Huang Y, Xie M. The public attitude towards ChatGPT on reddit: A study based on unsupervised learning from sentiment analysis and topic modeling. PLoS One 2024; 19:e0302502. [PMID: 38743773 PMCID: PMC11093324 DOI: 10.1371/journal.pone.0302502] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2023] [Accepted: 04/07/2024] [Indexed: 05/16/2024] Open
Abstract
ChatGPT has demonstrated impressive abilities and impacted various aspects of human society since its creation, gaining widespread attention from different social spheres. This study aims to comprehensively assess public perception of ChatGPT on Reddit. The dataset was collected via Reddit, a social media platform, and includes 23,733 posts and comments related to ChatGPT. Firstly, to examine public attitudes, this study conducts content analysis utilizing topic modeling with the Latent Dirichlet Allocation (LDA) algorithm to extract pertinent topics. Furthermore, sentiment analysis categorizes user posts and comments as positive, negative, or neutral using Textblob and Vader in natural language processing. The result of topic modeling shows that seven topics regarding ChatGPT are identified, which can be grouped into three themes: user perception, technical methods, and impacts on society. Results from the sentiment analysis show that 61.6% of the posts and comments hold favorable opinions on ChatGPT. They emphasize ChatGPT's ability to prompt and engage in natural conversations with users, without relying on complex natural language processing. It provides suggestions for ChatGPT developers to enhance its usability design and functionality. Meanwhile, stakeholders, including users, should comprehend the advantages and disadvantages of ChatGPT in human society to promote ethical and regulated implementation of the system.
Collapse
Affiliation(s)
- Zhaoxiang Xu
- Department of Data Science, School of Computer Science and Engineering, Guangzhou Institute of Science and Technology, Guangzhou, Guangdong, China
| | - Qingguo Fang
- Department of Management, School of Business, Macau University of Science and Technology, Macao, China
| | - Yanbo Huang
- Data Science Research Center, Faculty of Innovation Engineering, Macau University of Science and Technology, Macao, China
| | - Mingjian Xie
- Department of Decision Sciences, School of Business, Macau University of Science and Technology, Macao, China
| |
Collapse
|
40
|
Treder MS, Lee S, Tsvetanov KA. Introduction to Large Language Models (LLMs) for dementia care and research. FRONTIERS IN DEMENTIA 2024; 3:1385303. [PMID: 39081594 PMCID: PMC11285660 DOI: 10.3389/frdem.2024.1385303] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/12/2024] [Accepted: 04/23/2024] [Indexed: 08/02/2024]
Abstract
Introduction Dementia is a progressive neurodegenerative disorder that affects cognitive abilities including memory, reasoning, and communication skills, leading to gradual decline in daily activities and social engagement. In light of the recent advent of Large Language Models (LLMs) such as ChatGPT, this paper aims to thoroughly analyse their potential applications and usefulness in dementia care and research. Method To this end, we offer an introduction into LLMs, outlining the key features, capabilities, limitations, potential risks, and practical considerations for deployment as easy-to-use software (e.g., smartphone apps). We then explore various domains related to dementia, identifying opportunities for LLMs to enhance understanding, diagnostics, and treatment, with a broader emphasis on improving patient care. For each domain, the specific contributions of LLMs are examined, such as their ability to engage users in meaningful conversations, deliver personalized support, and offer cognitive enrichment. Potential benefits encompass improved social interaction, enhanced cognitive functioning, increased emotional well-being, and reduced caregiver burden. The deployment of LLMs in caregiving frameworks also raises a number of concerns and considerations. These include privacy and safety concerns, the need for empirical validation, user-centered design, adaptation to the user's unique needs, and the integration of multimodal inputs to create more immersive and personalized experiences. Additionally, ethical guidelines and privacy protocols must be established to ensure responsible and ethical deployment of LLMs. Results We report the results on a questionnaire filled in by people with dementia (PwD) and their supporters wherein we surveyed the usefulness of different application scenarios of LLMs as well as the features that LLM-powered apps should have. Both PwD and supporters were largely positive regarding the prospect of LLMs in care, although concerns were raised regarding bias, data privacy and transparency. Discussion Overall, this review corroborates the promising utilization of LLMs to positively impact dementia care by boosting cognitive abilities, enriching social interaction, and supporting caregivers. The findings underscore the importance of further research and development in this field to fully harness the benefits of LLMs and maximize their potential for improving the lives of individuals living with dementia.
Collapse
Affiliation(s)
- Matthias S. Treder
- School of Computer Science & Informatics, Cardiff University, Cardiff, United Kingdom
| | - Sojin Lee
- Olive AI Limited, London, United Kingdom
| | - Kamen A. Tsvetanov
- Department of Clinical Neurosciences, University of Cambridge, Cambridge, United Kingdom
- Department of Psychology, University of Cambridge, Cambridge, United Kingdom
| |
Collapse
|
41
|
Pedone G, Váncza J, Szaller Á. Exploring hidden pathways to sustainable manufacturing for cyber-physical production systems. Heliyon 2024; 10:e29004. [PMID: 38638957 PMCID: PMC11024542 DOI: 10.1016/j.heliyon.2024.e29004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2024] [Revised: 03/26/2024] [Accepted: 03/28/2024] [Indexed: 04/20/2024] Open
Abstract
Future manufacturing scenarios will likely be built around cyber-physical production systems. To succeed, this new manufacturing paradigm will also have to comply with the golden rule of sustainability. However, the concept of sustainability as defined in a number of high-level policy documents and recommendations requires disambiguation. The paper introduces HITECS, a novel, context-independent text analytics methodology for hidden correlation analysis in documents. HITECS is based on the assumption that there is a strong link between a concept and the words implicitly chosen to explain it. The analysis is based on the combination of bare words frequency and cosine similarity, excluding trivial, first-level terms (titles, keywords, and definitions). Processing a corpus of generally accepted documents related to various definitions and requirements of sustainability unfolded their hidden correlations and some common key concepts. These results indicate that terms like access, inclusion, global, change, together with others like resource, share, and integration, are among leading concepts in the high-level documents discussing the requirements of sustainability. A similar analysis in the domain of cyber-physical production systems shows strong conceptual overlaps but also gaps indicating pathways for future research and actions.
Collapse
Affiliation(s)
- Gianfranco Pedone
- Research Laboratory on Engineering & Management Intelligence, Centre of Excellence in Production Informatics and Control, Institute for Computer Science and Control, Kende u. 13-17, Budapest, 1111, Hungary
| | - József Váncza
- Research Laboratory on Engineering & Management Intelligence, Centre of Excellence in Production Informatics and Control, Institute for Computer Science and Control, Kende u. 13-17, Budapest, 1111, Hungary
| | - Ádám Szaller
- Research Laboratory on Engineering & Management Intelligence, Centre of Excellence in Production Informatics and Control, Institute for Computer Science and Control, Kende u. 13-17, Budapest, 1111, Hungary
| |
Collapse
|
42
|
Cabello-Collado C, Rodriguez-Juan J, Ortiz-Perez D, Garcia-Rodriguez J, Tomás D, Vizcaya-Moreno MF. Automated Generation of Clinical Reports Using Sensing Technologies with Deep Learning Techniques. SENSORS (BASEL, SWITZERLAND) 2024; 24:2751. [PMID: 38732857 PMCID: PMC11086159 DOI: 10.3390/s24092751] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/22/2024] [Revised: 04/14/2024] [Accepted: 04/23/2024] [Indexed: 05/13/2024]
Abstract
This study presents a pioneering approach that leverages advanced sensing technologies and data processing techniques to enhance the process of clinical documentation generation during medical consultations. By employing sophisticated sensors to capture and interpret various cues such as speech patterns, intonations, or pauses, the system aims to accurately perceive and understand patient-doctor interactions in real time. This sensing capability allows for the automation of transcription and summarization tasks, facilitating the creation of concise and informative clinical documents. Through the integration of automatic speech recognition sensors, spoken dialogue is seamlessly converted into text, enabling efficient data capture. Additionally, deep models such as Transformer models are utilized to extract and analyze crucial information from the dialogue, ensuring that the generated summaries encapsulate the essence of the consultations accurately. Despite encountering challenges during development, experimentation with these sensing technologies has yielded promising results. The system achieved a maximum ROUGE-1 metric score of 0.57, demonstrating its effectiveness in summarizing complex medical discussions. This sensor-based approach aims to alleviate the administrative burden on healthcare professionals by automating documentation tasks and safeguarding important patient information. Ultimately, by enhancing the efficiency and reliability of clinical documentation, this innovative method contributes to improving overall healthcare outcomes.
Collapse
Affiliation(s)
- Celia Cabello-Collado
- Department of Computer Technology, University of Alicante, 03080 Alicante, Spain; (C.C.-C.); (J.R.-J.); (D.O.-P.)
| | - Javier Rodriguez-Juan
- Department of Computer Technology, University of Alicante, 03080 Alicante, Spain; (C.C.-C.); (J.R.-J.); (D.O.-P.)
| | - David Ortiz-Perez
- Department of Computer Technology, University of Alicante, 03080 Alicante, Spain; (C.C.-C.); (J.R.-J.); (D.O.-P.)
| | - Jose Garcia-Rodriguez
- Department of Computer Technology, University of Alicante, 03080 Alicante, Spain; (C.C.-C.); (J.R.-J.); (D.O.-P.)
| | - David Tomás
- Department of Computer Languages, University of Alicante, 03080 Alicante, Spain;
| | - Maria Flores Vizcaya-Moreno
- Unit of Clinical Nursing Research, Faculty of Health Sciences, University of Alicante, 03080 Alicante, Spain;
| |
Collapse
|
43
|
Lau-Min KS, Marini J, Shah NK, Pucci D, Blauch AN, Cambareri C, Mooney B, Agarwal P, Johnston C, Schumacher RP, White K, Gabriel PE, Rosin R, Jacobs LA, Shulman LN. Pilot Study of a Mobile Phone Chatbot for Medication Adherence and Toxicity Management Among Patients With GI Cancers on Capecitabine. JCO Oncol Pract 2024; 20:483-490. [PMID: 38237102 DOI: 10.1200/op.23.00365] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2023] [Revised: 10/11/2023] [Accepted: 12/04/2023] [Indexed: 04/12/2024] Open
Abstract
PURPOSE Capecitabine is an oral chemotherapy used to treat many gastrointestinal cancers. Its complex dosing and narrow therapeutic index make medication adherence and toxicity management crucial for quality care. METHODS We conducted a pilot study of PENNY-GI, a mobile phone text messaging-based chatbot that leverages algorithmic surveys and natural language processing to promote medication adherence and toxicity management among patients with gastrointestinal cancers on capecitabine. Eligibility initially included all capecitabine-containing regimens but was subsequently restricted to capecitabine monotherapy because of challenges in integrating PENNY-GI with radiation and intravenous chemotherapy schedules. We used design thinking principles and real-time data on safety, accuracy, and usefulness to make iterative refinements to PENNY-GI with the goal of minimizing the proportion of text messaging exchanges with incorrect medication or symptom management recommendations. All patients were invited to participate in structured exit interviews to provide feedback on PENNY-GI. RESULTS We enrolled 40 patients (median age 64.5 years, 52.5% male, 62.5% White, 55.0% with colorectal cancer, 50.0% on capecitabine monotherapy). We identified 284 of 3,895 (7.3%) medication-related and 13 of 527 (2.5%) symptom-related text messaging exchanges with incorrect recommendations. In exit interviews with 24 patients, participants reported finding the medication reminders reliable and user-friendly, but the symptom management tool was too simplistic to be helpful. CONCLUSION Although PENNY-GI provided accurate recommendations in >90% of text messaging exchanges, we identified multiple limitations with respect to the intervention's generalizability, usefulness, and scalability. Lessons from this pilot study should inform future efforts to develop and implement digital health interventions in oncology.
Collapse
Affiliation(s)
- Kelsey S Lau-Min
- Division of Hematology/Oncology, Department of Medicine, Massachusetts General Hospital and Harvard Medical School, Boston, MA
| | - Jessica Marini
- Hospital of the University of Pennsylvania, Penn Medicine, Philadelphia, PA
| | - Nishant K Shah
- Department of Radiation Oncology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA
| | - Donna Pucci
- Division of Hematology/Oncology, Department of Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA
| | - Abigail N Blauch
- Division of Hematology/Oncology, Department of Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA
| | | | - Bethany Mooney
- Division of Hematology/Oncology, Department of Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA
| | - Parul Agarwal
- Division of Hematology/Oncology, Department of Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA
| | | | | | | | - Peter E Gabriel
- Department of Radiation Oncology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA
| | - Roy Rosin
- Center for Health Care Innovation, Penn Medicine, Philadelphia, PA
| | - Linda A Jacobs
- Division of Hematology/Oncology, Department of Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA
| | - Lawrence N Shulman
- Division of Hematology/Oncology, Department of Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA
| |
Collapse
|
44
|
Haar M, Sonntagbauer M, Kluge S. [Significance of natural language processing and chat-based generative language models]. Med Klin Intensivmed Notfmed 2024; 119:181-188. [PMID: 38108880 DOI: 10.1007/s00063-023-01098-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2023] [Accepted: 10/30/2023] [Indexed: 12/19/2023]
Abstract
BACKGROUND Natural language processing (NLP) has experienced significant growth in recent years and shows potential for broad impacts in scientific research and clinical practice. OBJECTIVE This study comprises an exploration of the role of NLP in scientific research and its subsequent effects on traditional publication practices, as well as an evaluation of the opportunities and challenges offered by large language models (LLM) and a reflection on necessary paradigm shifts in research culture. MATERIALS AND METHODS Current LLMs, such as ChatGPT, and their potential applications were compared and assessed. An analysis of the literature and case studies on the integration of LLMs into scientific and clinical practice was conducted. RESULTS AND CONCLUSION LLMs provide enhanced access to and processing capabilities of text-based information and represent a vast potential for (medical) research as well as daily clinical practice. Chat-based LLMs enable efficient completion of often time-consuming tasks, but due to their tendency for hallucinations, have a significant limitation. Current developments require critical examination and a paradigm shift to fully exploit the benefits of LLMs and minimize potential risks.
Collapse
Affiliation(s)
- Markus Haar
- Klinik für Intensivmedizin, Universitätsklinikum Hamburg-Eppendorf, Martinistr. 52, 20251, Hamburg, Deutschland.
| | - Michael Sonntagbauer
- Klinik für Intensivmedizin, Universitätsklinikum Hamburg-Eppendorf, Martinistr. 52, 20251, Hamburg, Deutschland
| | - Stefan Kluge
- Klinik für Intensivmedizin, Universitätsklinikum Hamburg-Eppendorf, Martinistr. 52, 20251, Hamburg, Deutschland
| |
Collapse
|
45
|
Ranjan S, Odegaard B. Reality monitoring and metacognitive judgments in a false-memory paradigm. Neurosci Res 2024; 201:3-17. [PMID: 38007192 DOI: 10.1016/j.neures.2023.11.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2023] [Revised: 10/19/2023] [Accepted: 11/15/2023] [Indexed: 11/27/2023]
Abstract
How well do we distinguish between different memory sources when the information from imagination and perception is similar? And how do metacognitive (confidence) judgments differ across different sources of experiences? To study these questions, we developed a reality monitoring task using semantically related words from the Deese-Roediger-McDermott (DRM) paradigm of false memories. In an orientation phase, participants either perceived word pairs or had to voluntarily imagine the second word of a word pair. In a test phase, participants viewed words and had to judge whether the paired word was previously perceived, imagined, or new. Results revealed an interaction between memory source and judgment type on both response rates and confidence judgments: reality monitoring was better for new and perceived (compared to imagined) sources, and participants often incorrectly reported imagined experiences to be perceived. Individuals exhibited similar confidence between correct imagined source judgments and incorrect imagined sources reported to be perceived. Modeling results indicated that the observed judgments were likely due to an externalizing bias (i.e., a bias to judge the memory source as perceived). Additionally, we found that overall metacognitive ability was best in the perceived source. Together, these results reveal a source-dependent effect on response rates and confidence ratings, and provide evidence that observers are surprisingly prone to externalizing biases when monitoring their own memories.
Collapse
|
46
|
Abbasian M, Khatibi E, Azimi I, Oniani D, Shakeri Hossein Abad Z, Thieme A, Sriram R, Yang Z, Wang Y, Lin B, Gevaert O, Li LJ, Jain R, Rahmani AM. Foundation metrics for evaluating effectiveness of healthcare conversations powered by generative AI. NPJ Digit Med 2024; 7:82. [PMID: 38553625 PMCID: PMC10980701 DOI: 10.1038/s41746-024-01074-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2023] [Accepted: 03/07/2024] [Indexed: 04/02/2024] Open
Abstract
Generative Artificial Intelligence is set to revolutionize healthcare delivery by transforming traditional patient care into a more personalized, efficient, and proactive process. Chatbots, serving as interactive conversational models, will probably drive this patient-centered transformation in healthcare. Through the provision of various services, including diagnosis, personalized lifestyle recommendations, dynamic scheduling of follow-ups, and mental health support, the objective is to substantially augment patient health outcomes, all the while mitigating the workload burden on healthcare providers. The life-critical nature of healthcare applications necessitates establishing a unified and comprehensive set of evaluation metrics for conversational models. Existing evaluation metrics proposed for various generic large language models (LLMs) demonstrate a lack of comprehension regarding medical and health concepts and their significance in promoting patients' well-being. Moreover, these metrics neglect pivotal user-centered aspects, including trust-building, ethics, personalization, empathy, user comprehension, and emotional support. The purpose of this paper is to explore state-of-the-art LLM-based evaluation metrics that are specifically applicable to the assessment of interactive conversational models in healthcare. Subsequently, we present a comprehensive set of evaluation metrics designed to thoroughly assess the performance of healthcare chatbots from an end-user perspective. These metrics encompass an evaluation of language processing abilities, impact on real-world clinical tasks, and effectiveness in user-interactive conversations. Finally, we engage in a discussion concerning the challenges associated with defining and implementing these metrics, with particular emphasis on confounding factors such as the target audience, evaluation methods, and prompt techniques involved in the evaluation process.
Collapse
Affiliation(s)
- Mahyar Abbasian
- University of California, Irvine, CA, USA.
- HealthUnity, Palo Alto, CA, USA.
| | - Elahe Khatibi
- University of California, Irvine, CA, USA.
- HealthUnity, Palo Alto, CA, USA.
| | - Iman Azimi
- University of California, Irvine, CA, USA
- HealthUnity, Palo Alto, CA, USA
| | - David Oniani
- HealthUnity, Palo Alto, CA, USA
- University of Pittsburgh, Pittsburgh, PA, USA
| | | | | | - Ram Sriram
- National Institute of Standards and Technology (NIST), Gaithersburg, MD, USA
| | | | - Yanshan Wang
- HealthUnity, Palo Alto, CA, USA
- University of Pittsburgh, Pittsburgh, PA, USA
| | - Bryant Lin
- HealthUnity, Palo Alto, CA, USA
- Stanford University, Stanford, CA, USA
| | | | | | - Ramesh Jain
- University of California, Irvine, CA, USA
- HealthUnity, Palo Alto, CA, USA
| | - Amir M Rahmani
- University of California, Irvine, CA, USA
- HealthUnity, Palo Alto, CA, USA
| |
Collapse
|
47
|
Kumar N, Srivastava R. Deep learning in structural bioinformatics: current applications and future perspectives. Brief Bioinform 2024; 25:bbae042. [PMID: 38701422 PMCID: PMC11066934 DOI: 10.1093/bib/bbae042] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2023] [Revised: 01/05/2024] [Accepted: 01/18/2024] [Indexed: 05/05/2024] Open
Abstract
In this review article, we explore the transformative impact of deep learning (DL) on structural bioinformatics, emphasizing its pivotal role in a scientific revolution driven by extensive data, accessible toolkits and robust computing resources. As big data continue to advance, DL is poised to become an integral component in healthcare and biology, revolutionizing analytical processes. Our comprehensive review provides detailed insights into DL, featuring specific demonstrations of its notable applications in bioinformatics. We address challenges tailored for DL, spotlight recent successes in structural bioinformatics and present a clear exposition of DL-from basic shallow neural networks to advanced models such as convolution, recurrent, artificial and transformer neural networks. This paper discusses the emerging use of DL for understanding biomolecular structures, anticipating ongoing developments and applications in the realm of structural bioinformatics.
Collapse
Affiliation(s)
- Niranjan Kumar
- School of Computational and Integrative Sciences, Jawaharlal Nehru University, New Delhi, India
| | - Rakesh Srivastava
- Center for Computational Natural Sciences and Bioinformatics, International Institute of Information Technology, Hyderabad, India
| |
Collapse
|
48
|
Yang Z, Lin Z, Yang Y, Li J. Dual-Path Graph Neural Network with Adaptive Auxiliary Module for Link Prediction. BIG DATA 2024. [PMID: 38527254 DOI: 10.1089/big.2023.0130] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/27/2024]
Abstract
Link prediction, which has important applications in many fields, predicts the possibility of the link between two nodes in a graph. Link prediction based on Graph Neural Network (GNN) obtains node representation and graph structure through GNN, which has attracted a growing amount of attention recently. However, the existing GNN-based link prediction approaches possess some shortcomings. On the one hand, because a graph contains different types of nodes, it leads to a great challenge for aggregating information and learning node representation from its neighbor nodes. On the other hand, the attention mechanism has been an effect instrument for enhancing the link prediction performance. However, the traditional attention mechanism is always monotonic for query nodes, which limits its influence on link prediction. To address these two problems, a Dual-Path Graph Neural Network (DPGNN) for link prediction is proposed in this study. First, we propose a novel Local Random Features Augmentation for Graph Convolution Network as a baseline of one path. Meanwhile, Graph Attention Network version 2 based on dynamic attention mechanism is adopted as a baseline of the other path. And then, we capture more meaningful node representation and more accurate link features by concatenating the information of these two paths. In addition, we propose an adaptive auxiliary module for better balancing the weight of auxiliary tasks, which brings more benefit to link prediction. Finally, extensive experiments verify the effectiveness and superiority of our proposed DPGNN for link prediction.
Collapse
Affiliation(s)
- Zhenzhen Yang
- Key Laboratory of Ministry of Education in Broadband Wireless Communication and Sensor Network Technology, Nanjing University of Posts and Telecommunications, Nanjing, China
| | - Zelong Lin
- Key Laboratory of Ministry of Education in Broadband Wireless Communication and Sensor Network Technology, Nanjing University of Posts and Telecommunications, Nanjing, China
| | - Yongpeng Yang
- Key Laboratory of Ministry of Education in Broadband Wireless Communication and Sensor Network Technology, Nanjing University of Posts and Telecommunications, Nanjing, China
- School of Network and Communication, Nanjing Vocational College of Information Technology, Nanjing, China
| | - Jiaqi Li
- Key Laboratory of Ministry of Education in Broadband Wireless Communication and Sensor Network Technology, Nanjing University of Posts and Telecommunications, Nanjing, China
| |
Collapse
|
49
|
Michael-Pitschaze T, Cohen N, Ofer D, Hoshen Y, Linial M. Detecting anomalous proteins using deep representations. NAR Genom Bioinform 2024; 6:lqae021. [PMID: 38486884 PMCID: PMC10939404 DOI: 10.1093/nargab/lqae021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2023] [Revised: 11/17/2023] [Accepted: 02/23/2024] [Indexed: 03/17/2024] Open
Abstract
Many advances in biomedicine can be attributed to identifying unusual proteins and genes. Many of these proteins' unique properties were discovered by manual inspection, which is becoming infeasible at the scale of modern protein datasets. Here, we propose to tackle this challenge using anomaly detection methods that automatically identify unexpected properties. We adopt a state-of-the-art anomaly detection paradigm from computer vision, to highlight unusual proteins. We generate meaningful representations without labeled inputs, using pretrained deep neural network models. We apply these protein language models (pLM) to detect anomalies in function, phylogenetic families, and segmentation tasks. We compute protein anomaly scores to highlight human prion-like proteins, distinguish viral proteins from their host proteome, and mark non-classical ion/metal binding proteins and enzymes. Other tasks concern segmentation of protein sequences into folded and unstructured regions. We provide candidates for rare functionality (e.g. prion proteins). Additionally, we show the anomaly score is useful in 3D folding-related segmentation. Our novel method shows improved performance over strong baselines and has objectively high performance across a variety of tasks. We conclude that the combination of pLM and anomaly detection techniques is a valid method for discovering a range of global and local protein characteristics.
Collapse
Affiliation(s)
- Tomer Michael-Pitschaze
- The Rachel and Selim Benin School of Computer Science and Engineering, The Hebrew University of Jerusalem, Jerusalem, Israel
| | - Niv Cohen
- The Rachel and Selim Benin School of Computer Science and Engineering, The Hebrew University of Jerusalem, Jerusalem, Israel
| | - Dan Ofer
- Department of Biological Chemistry, Institute of Life Sciences, The Hebrew University of Jerusalem, Jerusalem, Israel
| | - Yedid Hoshen
- The Rachel and Selim Benin School of Computer Science and Engineering, The Hebrew University of Jerusalem, Jerusalem, Israel
| | - Michal Linial
- Department of Biological Chemistry, Institute of Life Sciences, The Hebrew University of Jerusalem, Jerusalem, Israel
| |
Collapse
|
50
|
Seckel E, Stephens BY, Rodriguez F. Ten simple rules to leverage large language models for getting grants. PLoS Comput Biol 2024; 20:e1011863. [PMID: 38427611 PMCID: PMC10906892 DOI: 10.1371/journal.pcbi.1011863] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/03/2024] Open
Affiliation(s)
- Elizabeth Seckel
- Division of Cardiovascular Medicine, Stanford University School of Medicine, Stanford, California, United States of America
| | - Brandi Y. Stephens
- Division of Cardiovascular Medicine, Stanford University School of Medicine, Stanford, California, United States of America
| | - Fatima Rodriguez
- Division of Cardiovascular Medicine, Stanford University School of Medicine, Stanford, California, United States of America
| |
Collapse
|