1
|
Wang H, Alanis N, Haygood L, Swoboda TK, Hoot N, Phillips D, Knowles H, Stinson SA, Mehta P, Sambamoorthi U. Using natural language processing in emergency medicine health service research: A systematic review and meta-analysis. Acad Emerg Med 2024; 31:696-706. [PMID: 38757352 PMCID: PMC11246236 DOI: 10.1111/acem.14937] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2024] [Revised: 04/15/2024] [Accepted: 04/17/2024] [Indexed: 05/18/2024]
Abstract
OBJECTIVES Natural language processing (NLP) represents one of the adjunct technologies within artificial intelligence and machine learning, creating structure out of unstructured data. This study aims to assess the performance of employing NLP to identify and categorize unstructured data within the emergency medicine (EM) setting. METHODS We systematically searched publications related to EM research and NLP across databases including MEDLINE, Embase, Scopus, CENTRAL, and ProQuest Dissertations & Theses Global. Independent reviewers screened, reviewed, and evaluated article quality and bias. NLP usage was categorized into syndromic surveillance, radiologic interpretation, and identification of specific diseases/events/syndromes, with respective sensitivity analysis reported. Performance metrics for NLP usage were calculated and the overall area under the summary of receiver operating characteristic curve (SROC) was determined. RESULTS A total of 27 studies underwent meta-analysis. Findings indicated an overall mean sensitivity (recall) of 82%-87%, specificity of 95%, with the area under the SROC at 0.96 (95% CI 0.94-0.98). Optimal performance using NLP was observed in radiologic interpretation, demonstrating an overall mean sensitivity of 93% and specificity of 96%. CONCLUSIONS Our analysis revealed a generally favorable performance accuracy in using NLP within EM research, particularly in the realm of radiologic interpretation. Consequently, we advocate for the adoption of NLP-based research to augment EM health care management.
Collapse
Affiliation(s)
- Hao Wang
- Department of Emergency Medicine, JPS Health Network, 1500 S. Main St., Fort Worth, TX 76104
| | - Naomi Alanis
- Department of Emergency Medicine, JPS Health Network, 1500 S. Main St., Fort Worth, TX 76104
| | - Laura Haygood
- Health Sciences Librarian for Public Health, Brown University, 69 Brown St., Providence, RI 02912
| | - Thomas K. Swoboda
- Department of Emergency Medicine, The Valley Health System, Touro University Nevada School of Osteopathic Medicine, 657 N. Town Center Drive, Las Vegas, NV 89144
| | - Nathan Hoot
- Department of Emergency Medicine, JPS Health Network, 1500 S. Main St., Fort Worth, TX 76104
| | - Daniel Phillips
- Department of Emergency Medicine, JPS Health Network, 1500 S. Main St., Fort Worth, TX 76104
| | - Heidi Knowles
- Department of Emergency Medicine, JPS Health Network, 1500 S. Main St., Fort Worth, TX 76104
| | - Sara Ann Stinson
- Mary Couts Burnett Library, Burnett School of Medicine at Texas Christian University, 2800 S. University Dr., Fort Worth, TX 76109
| | - Prachi Mehta
- Department of Emergency Medicine, JPS Health Network, 1500 S. Main St., Fort Worth, TX 76104
| | - Usha Sambamoorthi
- College of Pharmacy, University of North Texas Health Science Center, 3500 Camp Bowie Blvd, Fort Worth, TX 76107
| |
Collapse
|
2
|
Clark EC, Neumann S, Hopkins S, Kostopoulos A, Hagerman L, Dobbins M. Changes to Public Health Surveillance Methods Due to the COVID-19 Pandemic: Scoping Review. JMIR Public Health Surveill 2024; 10:e49185. [PMID: 38241067 PMCID: PMC10837764 DOI: 10.2196/49185] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2023] [Revised: 09/06/2023] [Accepted: 12/07/2023] [Indexed: 01/23/2024] Open
Abstract
BACKGROUND Public health surveillance plays a vital role in informing public health decision-making. The onset of the COVID-19 pandemic in early 2020 caused a widespread shift in public health priorities. Global efforts focused on COVID-19 monitoring and contact tracing. Existing public health programs were interrupted due to physical distancing measures and reallocation of resources. The onset of the COVID-19 pandemic intersected with advancements in technologies that have the potential to support public health surveillance efforts. OBJECTIVE This scoping review aims to explore emergent public health surveillance methods during the early COVID-19 pandemic to characterize the impact of the pandemic on surveillance methods. METHODS A scoping search was conducted in multiple databases and by scanning key government and public health organization websites from March 2020 to January 2022. Published papers and gray literature that described the application of new or revised approaches to public health surveillance were included. Papers that discussed the implications of novel public health surveillance approaches from ethical, legal, security, and equity perspectives were also included. The surveillance subject, method, location, and setting were extracted from each paper to identify trends in surveillance practices. Two public health epidemiologists were invited to provide their perspectives as peer reviewers. RESULTS Of the 14,238 unique papers, a total of 241 papers describing novel surveillance methods and changes to surveillance methods are included. Eighty papers were review papers and 161 were single studies. Overall, the literature heavily featured papers detailing surveillance of COVID-19 transmission (n=187). Surveillance of other infectious diseases was also described, including other pathogens (n=12). Other public health topics included vaccines (n=9), mental health (n=11), substance use (n=4), healthy nutrition (n=1), maternal and child health (n=3), antimicrobial resistance (n=2), and misinformation (n=6). The literature was dominated by applications of digital surveillance, for example, by using big data through mobility tracking and infodemiology (n=163). Wastewater surveillance was also heavily represented (n=48). Other papers described adaptations to programs or methods that existed prior to the COVID-19 pandemic (n=9). The scoping search also found 109 papers that discuss the ethical, legal, security, and equity implications of emerging surveillance methods. The peer reviewer public health epidemiologists noted that additional changes likely exist, beyond what has been reported and available for evidence syntheses. CONCLUSIONS The COVID-19 pandemic accelerated advancements in surveillance and the adoption of new technologies, especially for digital and wastewater surveillance methods. Given the investments in these systems, further applications for public health surveillance are likely. The literature for surveillance methods was dominated by surveillance of infectious diseases, particularly COVID-19. A substantial amount of literature on the ethical, legal, security, and equity implications of these emerging surveillance methods also points to a need for cautious consideration of potential harm.
Collapse
Affiliation(s)
- Emily C Clark
- National Collaborating Centre for Methods and Tools, Hamilton, ON, Canada
| | - Sophie Neumann
- National Collaborating Centre for Methods and Tools, Hamilton, ON, Canada
| | - Stephanie Hopkins
- National Collaborating Centre for Methods and Tools, Hamilton, ON, Canada
| | - Alyssa Kostopoulos
- National Collaborating Centre for Methods and Tools, Hamilton, ON, Canada
| | - Leah Hagerman
- National Collaborating Centre for Methods and Tools, Hamilton, ON, Canada
| | - Maureen Dobbins
- National Collaborating Centre for Methods and Tools, Hamilton, ON, Canada
- School of Nursing, McMaster University, Hamilton, ON, Canada
| |
Collapse
|
3
|
Eysenbach G, Kleib M, Norris C, O'Rourke HM, Montgomery C, Douma M. The Use and Structure of Emergency Nurses' Triage Narrative Data: Scoping Review. JMIR Nurs 2023; 6:e41331. [PMID: 36637881 PMCID: PMC9883744 DOI: 10.2196/41331] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2022] [Revised: 11/24/2022] [Accepted: 11/28/2022] [Indexed: 11/30/2022] Open
Abstract
BACKGROUND Emergency departments use triage to ensure that patients with the highest level of acuity receive care quickly and safely. Triage is typically a nursing process that is documented as structured and unstructured (free text) data. Free-text triage narratives have been studied for specific conditions but never reviewed in a comprehensive manner. OBJECTIVE The objective of this paper was to identify and map the academic literature that examines triage narratives. The paper described the types of research conducted, identified gaps in the research, and determined where additional review may be warranted. METHODS We conducted a scoping review of unstructured triage narratives. We mapped the literature, described the use of triage narrative data, examined the information available on the form and structure of narratives, highlighted similarities among publications, and identified opportunities for future research. RESULTS We screened 18,074 studies published between 1990 and 2022 in CINAHL, MEDLINE, Embase, Cochrane, and ProQuest Central. We identified 0.53% (96/18,074) of studies that directly examined the use of triage nurses' narratives. More than 12 million visits were made to 2438 emergency departments included in the review. In total, 82% (79/96) of these studies were conducted in the United States (43/96, 45%), Australia (31/96, 32%), or Canada (5/96, 5%). Triage narratives were used for research and case identification, as input variables for predictive modeling, and for quality improvement. Overall, 31% (30/96) of the studies offered a description of the triage narrative, including a list of the keywords used (27/96, 28%) or more fulsome descriptions (such as word counts, character counts, abbreviation, etc; 7/96, 7%). We found limited use of reporting guidelines (8/96, 8%). CONCLUSIONS The breadth of the identified studies suggests that there is widespread routine collection and research use of triage narrative data. Despite the use of triage narratives as a source of data in studies, the narratives and nurses who generate them are poorly described in the literature, and data reporting is inconsistent. Additional research is needed to describe the structure of triage narratives, determine the best use of triage narratives, and improve the consistent use of triage-specific data reporting guidelines. INTERNATIONAL REGISTERED REPORT IDENTIFIER (IRRID) RR2-10.1136/bmjopen-2021-055132.
Collapse
Affiliation(s)
| | - Manal Kleib
- Faculty of Nursing, University of Alberta, Edmonton, AB, Canada
| | - Colleen Norris
- Faculty of Nursing, University of Alberta, Edmonton, AB, Canada
| | | | | | - Matthew Douma
- School of Nursing, Midwifery and Health Systems, University College Dublin, Dublin, Ireland
| |
Collapse
|
4
|
Mavragani A, Sanchez T, Ackerson BK, Hong V, Skarbinski J, Yau V, Qian L, Fischer H, Shaw SF, Caparosa S, Xie F. Natural Language Processing for Improved Characterization of COVID-19 Symptoms: Observational Study of 350,000 Patients in a Large Integrated Health Care System. JMIR Public Health Surveill 2022; 8:e41529. [PMID: 36446133 PMCID: PMC9822566 DOI: 10.2196/41529] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2022] [Revised: 11/07/2022] [Accepted: 11/29/2022] [Indexed: 12/05/2022] Open
Abstract
BACKGROUND Natural language processing (NLP) of unstructured text from electronic medical records (EMR) can improve the characterization of COVID-19 signs and symptoms, but large-scale studies demonstrating the real-world application and validation of NLP for this purpose are limited. OBJECTIVE The aim of this paper is to assess the contribution of NLP when identifying COVID-19 signs and symptoms from EMR. METHODS This study was conducted in Kaiser Permanente Southern California, a large integrated health care system using data from all patients with positive SARS-CoV-2 laboratory tests from March 2020 to May 2021. An NLP algorithm was developed to extract free text from EMR on 12 established signs and symptoms of COVID-19, including fever, cough, headache, fatigue, dyspnea, chills, sore throat, myalgia, anosmia, diarrhea, vomiting or nausea, and abdominal pain. The proportion of patients reporting each symptom and the corresponding onset dates were described before and after supplementing structured EMR data with NLP-extracted signs and symptoms. A random sample of 100 chart-reviewed and adjudicated SARS-CoV-2-positive cases were used to validate the algorithm performance. RESULTS A total of 359,938 patients (mean age 40.4 [SD 19.2] years; 191,630/359,938, 53% female) with confirmed SARS-CoV-2 infection were identified over the study period. The most common signs and symptoms identified through NLP-supplemented analyses were cough (220,631/359,938, 61%), fever (185,618/359,938, 52%), myalgia (153,042/359,938, 43%), and headache (144,705/359,938, 40%). The NLP algorithm identified an additional 55,568 (15%) symptomatic cases that were previously defined as asymptomatic using structured data alone. The proportion of additional cases with each selected symptom identified in NLP-supplemented analysis varied across the selected symptoms, from 29% (63,742/220,631) of all records for cough to 64% (38,884/60,865) of all records with nausea or vomiting. Of the 295,305 symptomatic patients, the median time from symptom onset to testing was 3 days using structured data alone, whereas the NLP algorithm identified signs or symptoms approximately 1 day earlier. When validated against chart-reviewed cases, the NLP algorithm successfully identified signs and symptoms with consistently high sensitivity (ranging from 87% to 100%) and specificity (94% to 100%). CONCLUSIONS These findings demonstrate that NLP can identify and characterize a broad set of COVID-19 signs and symptoms from unstructured EMR data with enhanced detail and timeliness compared with structured data alone.
Collapse
Affiliation(s)
| | | | - Bradley K Ackerson
- Southern California Permanente Medical Group, Harbor City, CA, United States
| | - Vennis Hong
- Department of Research & Evaluation, Kaiser Permanente Southern California, Pasadena, CA, United States
| | - Jacek Skarbinski
- The Permanente Medical Group, Kaiser Permanente Northern California, Oakland, CA, United States.,Division of Research, Kaiser Permanente Northern California, Oakland, CA, United States
| | - Vincent Yau
- Genentech, a Member of the Roche Group, San Francisco, CA, United States
| | - Lei Qian
- Department of Research & Evaluation, Kaiser Permanente Southern California, Pasadena, CA, United States
| | - Heidi Fischer
- Department of Research & Evaluation, Kaiser Permanente Southern California, Pasadena, CA, United States
| | - Sally F Shaw
- Department of Research & Evaluation, Kaiser Permanente Southern California, Pasadena, CA, United States
| | - Susan Caparosa
- Department of Research & Evaluation, Kaiser Permanente Southern California, Pasadena, CA, United States
| | - Fagen Xie
- Department of Research & Evaluation, Kaiser Permanente Southern California, Pasadena, CA, United States
| |
Collapse
|
5
|
Thiam MM, Simac L, Fougère E, Forgeot C, Meurice L, Naud J, Le Strat Y, Caserio-Schönemann C. Expert consultation using the on-line Delphi method for the revision of syndromic groups compiled from emergency data (SOS Médecins and OSCOUR®) in France. BMC Public Health 2022; 22:1791. [PMID: 36131273 PMCID: PMC9494916 DOI: 10.1186/s12889-022-14157-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2022] [Accepted: 09/08/2022] [Indexed: 11/10/2022] Open
Abstract
Background Consultation data from emergency general practitioners known as SOS Médecins and emergency departments (ED) from OSCOUR® network to the French syndromic surveillance system SurSaUD® (Surveillance sanitaire des urgences et décès). These data are aggregated and monitored on a daily basis through groupings of one or more medical symptoms or diagnoses (“syndromic groups” (SG)). The objective of this study was to evaluate, revise and enrich the composition of SGs through a consensus of experts who contributed or have experience in syndromic surveillance. Methods Three rounds of a Delphi survey were organised, involving 15 volunteers from SOS Médecins and 64 ED physicians in the OSCOUR® network as well as 8 international epidemiologists. Thirty-four SOS Médecins and 40 OSCOUR® SGs covering major medical specialities were put to the experts, along with their diagnostic codes and their surveillance objectives. In each round, the experts could retain or reject the codes according to the surveillance objective. The panel could also put forward new diagnostic codes in the 1st round, included in subsequent rounds. Consensus was reached for a code if 80% of participants had chosen to keep it, or less than 20% to reject it. Results A total of 12 SOS Médecins doctors (80%), 30 ED doctors (47%) and 4 international experts (50%) participated in the three rounds. All of the SGs presented to the panel included 102 initial diagnostic codes and 73 additional codes for SOS Médecins, 272 initial diagnostic codes and 204 additional codes for OSCOUR®. At the end of the 3 rounds, 14 SOS Médecins (40%) and 11 OSCOUR® (28%) SGs achieved a consensus to maintain all of their diagnostic codes. Among these, indicators of winter seasonal surveillance (bronchiolitis and gastroenteritis) were included. Conclusion This study involved a panel of national experts with international representation and a good level of involvement throughout the survey. In the absence of a standard definition, the Delphi method has been shown to be useful in defining and validating syndromic surveillance indicators. Supplementary Information The online version contains supplementary material available at 10.1186/s12889-022-14157-x.
Collapse
Affiliation(s)
| | - Leslie Simac
- Regional Division, Santé Publique France, Saint-Maurice, France.
| | - Erica Fougère
- Regional Division, Santé Publique France, Saint-Maurice, France
| | - Cécile Forgeot
- Data Science Division, Santé Publique France, Saint-Maurice, France
| | - Laure Meurice
- Regional Division, Santé Publique France, Saint-Maurice, France
| | - Jérôme Naud
- Data Science Division, Santé Publique France, Saint-Maurice, France
| | - Yann Le Strat
- Data Science Division, Santé Publique France, Saint-Maurice, France
| | | |
Collapse
|
6
|
Rapp M, Kulessa M, Loza Mencía E, Fürnkranz J. Correlation-Based Discovery of Disease Patterns for Syndromic Surveillance. Front Big Data 2022; 4:784159. [PMID: 35098113 PMCID: PMC8793623 DOI: 10.3389/fdata.2021.784159] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2021] [Accepted: 12/21/2021] [Indexed: 11/23/2022] Open
Abstract
Early outbreak detection is a key aspect in the containment of infectious diseases, as it enables the identification and isolation of infected individuals before the disease can spread to a larger population. Instead of detecting unexpected increases of infections by monitoring confirmed cases, syndromic surveillance aims at the detection of cases with early symptoms, which allows a more timely disclosure of outbreaks. However, the definition of these disease patterns is often challenging, as early symptoms are usually shared among many diseases and a particular disease can have several clinical pictures in the early phase of an infection. As a first step toward the goal to support epidemiologists in the process of defining reliable disease patterns, we present a novel, data-driven approach to discover such patterns in historic data. The key idea is to take into account the correlation between indicators in a health-related data source and the reported number of infections in the respective geographic region. In an preliminary experimental study, we use data from several emergency departments to discover disease patterns for three infectious diseases. Our results show the potential of the proposed approach to find patterns that correlate with the reported infections and to identify indicators that are related to the respective diseases. It also motivates the need for additional measures to overcome practical limitations, such as the requirement to deal with noisy and unbalanced data, and demonstrates the importance of incorporating feedback of domain experts into the learning procedure.
Collapse
Affiliation(s)
- Michael Rapp
- Knowledge Engineering Group, Technical University of Darmstadt, Darmstadt, Germany
| | - Moritz Kulessa
- Knowledge Engineering Group, Technical University of Darmstadt, Darmstadt, Germany
| | - Eneldo Loza Mencía
- Knowledge Engineering Group, Technical University of Darmstadt, Darmstadt, Germany
| | - Johannes Fürnkranz
- Computational Data Analysis Group, Johannes Kepler University Linz, Linz, Austria
| |
Collapse
|