1
|
Huang G, Li Y, Jameel S, Long Y, Papanastasiou G. From explainable to interpretable deep learning for natural language processing in healthcare: How far from reality? Comput Struct Biotechnol J 2024; 24:362-373. [PMID: 38800693 PMCID: PMC11126530 DOI: 10.1016/j.csbj.2024.05.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2023] [Revised: 05/03/2024] [Accepted: 05/03/2024] [Indexed: 05/29/2024] Open
Abstract
Deep learning (DL) has substantially enhanced natural language processing (NLP) in healthcare research. However, the increasing complexity of DL-based NLP necessitates transparent model interpretability, or at least explainability, for reliable decision-making. This work presents a thorough scoping review of explainable and interpretable DL in healthcare NLP. The term "eXplainable and Interpretable Artificial Intelligence" (XIAI) is introduced to distinguish XAI from IAI. Different models are further categorized based on their functionality (model-, input-, output-based) and scope (local, global). Our analysis shows that attention mechanisms are the most prevalent emerging IAI technique. The use of IAI is growing, distinguishing it from XAI. The major challenges identified are that most XIAI does not explore "global" modelling processes, the lack of best practices, and the lack of systematic evaluation and benchmarks. One important opportunity is to use attention mechanisms to enhance multi-modal XIAI for personalized medicine. Additionally, combining DL with causal logic holds promise. Our discussion encourages the integration of XIAI in Large Language Models (LLMs) and domain-specific smaller models. In conclusion, XIAI adoption in healthcare requires dedicated in-house expertise. Collaboration with domain experts, end-users, and policymakers can lead to ready-to-use XIAI methods across NLP and medical tasks. While challenges exist, XIAI techniques offer a valuable foundation for interpretable NLP algorithms in healthcare.
Collapse
Affiliation(s)
- Guangming Huang
- School of Computer Science and Electronic Engineering, University of Essex, Colchester, CO4 3SQ, United Kingdom
| | - Yingya Li
- Harvard Medical School and Boston Children's Hospital, Boston, 02115, United States
| | - Shoaib Jameel
- Electronics and Computer Science, University of Southampton, Southampton, SO17 1BJ, United Kingdom
| | - Yunfei Long
- School of Computer Science and Electronic Engineering, University of Essex, Colchester, CO4 3SQ, United Kingdom
| | | |
Collapse
|
2
|
Bandyopadhyay A, Albashayreh A, Zeinali N, Fan W, Gilbertson-White S. Using real-world electronic health record data to predict the development of 12 cancer-related symptoms in the context of multimorbidity. JAMIA Open 2024; 7:ooae082. [PMID: 39282082 PMCID: PMC11397936 DOI: 10.1093/jamiaopen/ooae082] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2024] [Revised: 08/09/2024] [Accepted: 09/05/2024] [Indexed: 09/18/2024] Open
Abstract
Objective This study uses electronic health record (EHR) data to predict 12 common cancer symptoms, assessing the efficacy of machine learning (ML) models in identifying symptom influencers. Materials and Methods We analyzed EHR data of 8156 adults diagnosed with cancer who underwent cancer treatment from 2017 to 2020. Structured and unstructured EHR data were sourced from the Enterprise Data Warehouse for Research at the University of Iowa Hospital and Clinics. Several predictive models, including logistic regression, random forest (RF), and XGBoost, were employed to forecast symptom development. The performances of the models were evaluated by F1-score and area under the curve (AUC) on the testing set. The SHapley Additive exPlanations framework was used to interpret these models and identify the predictive risk factors associated with fatigue as an exemplar. Results The RF model exhibited superior performance with a macro average AUC of 0.755 and an F1-score of 0.729 in predicting a range of cancer-related symptoms. For instance, the RF model achieved an AUC of 0.954 and an F1-score of 0.914 for pain prediction. Key predictive factors identified included clinical history, cancer characteristics, treatment modalities, and patient demographics depending on the symptom. For example, the odds ratio (OR) for fatigue was significantly influenced by allergy (OR = 2.3, 95% CI: 1.8-2.9) and colitis (OR = 1.9, 95% CI: 1.5-2.4). Discussion Our research emphasizes the critical integration of multimorbidity and patient characteristics in modeling cancer symptoms, revealing the considerable influence of chronic conditions beyond cancer itself. Conclusion We highlight the potential of ML for predicting cancer symptoms, suggesting a pathway for integrating such models into clinical systems to enhance personalized care and symptom management.
Collapse
Affiliation(s)
- Anindita Bandyopadhyay
- Department of Business Analytics, University of Iowa, Iowa City, IA 52242, United States
| | - Alaa Albashayreh
- College of Nursing, University of Iowa, Iowa City, IA 52242, United States
| | - Nahid Zeinali
- Department of Informatics, University of Iowa, Iowa City, IA 52242, United States
| | - Weiguo Fan
- Department of Business Analytics, University of Iowa, Iowa City, IA 52242, United States
| | | |
Collapse
|
3
|
Mathis WS, Zhao S, Pratt N, Weleff J, De Paoli S. Inductive thematic analysis of healthcare qualitative interviews using open-source large language models: How does it compare to traditional methods? COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2024; 255:108356. [PMID: 39067136 DOI: 10.1016/j.cmpb.2024.108356] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/28/2024] [Revised: 07/13/2024] [Accepted: 07/23/2024] [Indexed: 07/30/2024]
Abstract
BACKGROUND Large language models (LLMs) are generative artificial intelligence that have ignited much interest and discussion about their utility in clinical and research settings. Despite this interest there is sparse analysis of their use in qualitative thematic analysis comparing their current ability to that of human coding and analysis. In addition, there has been no published analysis of their use in real-world, protected health information. OBJECTIVE Here we fill that gap in the literature by comparing an LLM to standard human thematic analysis in real-world, semi-structured interviews of both patients and clinicians within a psychiatric setting. METHODS Using a 70 billion parameter open-source LLM running on local hardware and advanced prompt engineering techniques, we produced themes that summarized a full corpus of interviews in minutes. Subsequently we used three different evaluation methods for quantifying similarity between themes produced by the LLM and those produced by humans. RESULTS These revealed similarities ranging from moderate to substantial (Jaccard similarity coefficients 0.44-0.69), which are promising preliminary results. CONCLUSION Our study demonstrates that open-source LLMs can effectively generate robust themes from qualitative data, achieving substantial similarity to human-generated themes. The validation of LLMs in thematic analysis, coupled with evaluation methodologies, highlights their potential to enhance and democratize qualitative research across diverse fields.
Collapse
Affiliation(s)
- Walter S Mathis
- Department of Psychiatry, Yale University School of Medicine, New Haven, CT, USA.
| | - Sophia Zhao
- Department of Psychiatry, Yale University School of Medicine, New Haven, CT, USA
| | - Nicholas Pratt
- Department of Psychiatry, Yale University School of Medicine, New Haven, CT, USA
| | - Jeremy Weleff
- Department of Psychiatry, Yale University School of Medicine, New Haven, CT, USA
| | - Stefano De Paoli
- Division of Sociology, School of Business, Law and Social Sciences, Abertay University, Dundee, Scotland, United Kingdom
| |
Collapse
|
4
|
Hohenschurz-Schmidt D, Cherkin D, Rice ASC, Dworkin RH, Turk DC, McDermott MP, Bair MJ, DeBar LL, Edwards RR, Evans SR, Farrar JT, Kerns RD, Rowbotham MC, Wasan AD, Cowan P, Ferguson M, Freeman R, Gewandter JS, Gilron I, Grol-Prokopczyk H, Iyengar S, Kamp C, Karp BI, Kleykamp BA, Loeser JD, Mackey S, Malamut R, McNicol E, Patel KV, Schmader K, Simon L, Steiner DJ, Veasley C, Vollert J. Methods for pragmatic randomized clinical trials of pain therapies: IMMPACT statement. Pain 2024; 165:2165-2183. [PMID: 38723171 PMCID: PMC11404339 DOI: 10.1097/j.pain.0000000000003249] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2023] [Accepted: 03/08/2024] [Indexed: 09/18/2024]
Abstract
ABSTRACT Pragmatic, randomized, controlled trials hold the potential to directly inform clinical decision making and health policy regarding the treatment of people experiencing pain. Pragmatic trials are designed to replicate or are embedded within routine clinical care and are increasingly valued to bridge the gap between trial research and clinical practice, especially in multidimensional conditions, such as pain and in nonpharmacological intervention research. To maximize the potential of pragmatic trials in pain research, the careful consideration of each methodological decision is required. Trials aligned with routine practice pose several challenges, such as determining and enrolling appropriate study participants, deciding on the appropriate level of flexibility in treatment delivery, integrating information on concomitant treatments and adherence, and choosing comparator conditions and outcome measures. Ensuring data quality in real-world clinical settings is another challenging goal. Furthermore, current trials in the field would benefit from analysis methods that allow for a differentiated understanding of effects across patient subgroups and improved reporting of methods and context, which is required to assess the generalizability of findings. At the same time, a range of novel methodological approaches provide opportunities for enhanced efficiency and relevance of pragmatic trials to stakeholders and clinical decision making. In this study, best-practice considerations for these and other concerns in pragmatic trials of pain treatments are offered and a number of promising solutions discussed. The basis of these recommendations was an Initiative on Methods, Measurement, and Pain Assessment in Clinical Trials (IMMPACT) meeting organized by the Analgesic, Anesthetic, and Addiction Clinical Trial Translations, Innovations, Opportunities, and Networks.
Collapse
Affiliation(s)
- David Hohenschurz-Schmidt
- Pain Research, Department of Surgery & Cancer, Faculty of Medicine, Imperial College London, United Kingdom
- Research Department, University College of Osteopathy, London, United Kingdom
| | - Dan Cherkin
- Osher Center for Integrative Health, Department of Family Medicine, University of Washington, Seattle, WA, United States
| | - Andrew S C Rice
- Pain Research, Department of Surgery & Cancer, Faculty of Medicine, Imperial College London, United Kingdom
| | - Robert H Dworkin
- Department of Anesthesiology and Perioperative Medicine, University of Rochester Medical Center, Rochester, NY, United States
| | - Dennis C Turk
- Anesthesiology and Pain Medicine, University of Washington, Seattle, WA, United States
| | - Michael P McDermott
- Department of Biostatistics and Computational Biology, University of Rochester, Rochester, NY, United States
| | - Matthew J Bair
- VA Center for Health Information and Communication, Regenstrief Institute, and Indiana University School of Medicine, Indianapolis, IN, United States
| | - Lynn L DeBar
- Kaiser Permanente Washington Health Research Institute, Seattle, WA, United States
| | | | - Scott R Evans
- Biostatistics Center and the Department of Biostatistics and Bioinformatics, Milken Institute School of Public Health, George Washington University, Rockville, MD, United States
| | - John T Farrar
- Department of Biostatistics, Epidemiology, and Informatics, University of Pennsylvania, Philadelphia, PA, United States
| | - Robert D Kerns
- Department of Psychiatry, Yale School of Medicine, New Haven, CT, United States
| | - Michael C Rowbotham
- Department of Anesthesia, University of California San Francisco School of Medicine, San Francisco, CA, United States
| | - Ajay D Wasan
- Departments of Anesthesiology & Perioperative Medicine, and Psychiatry, University of Pittsburgh School of Medicine, Pittsburgh, PA, United States
| | - Penney Cowan
- American Chronic Pain Association, Rocklin, CA, United States
| | - McKenzie Ferguson
- Department of Pharmacy Practice, Southern Illinois University Edwardsville, Edwardsville, IL, United States
| | - Roy Freeman
- Department of Neurology, Harvard Medical School, Boston, MA, United States
| | - Jennifer S Gewandter
- Department of Anesthesiology and Perioperative, University of Rochester, Rochester, NY, United States
| | - Ian Gilron
- Departments of Anesthesiology & Perioperative Medicine, Biomedical & Molecular Sciences, Centre for Neuroscience Studies, and School of Policy Studies, Queen's University, Kingston Health Sciences Centre, Kingston, ON, Canada
| | - Hanna Grol-Prokopczyk
- Department of Sociology, University at Buffalo, State University of New York, Buffalo, NY, United States
| | | | - Cornelia Kamp
- Center for Health and Technology (CHeT), Clinical Materials Services Unit (CMSU), University of Rochester Medical Center, Rochester, NY, United States
| | - Barbara I Karp
- National Institutes of Health, Bethesda, MD, United States
| | - Bethea A Kleykamp
- University of Maryland, School of Medicine, Baltimore, MD, United States
| | - John D Loeser
- Departments of Neurological Surgery and Anesthesia and Pain Medicine, University of Washington, Seattle, WA, United States
| | - Sean Mackey
- Stanford University School of Medicine, Department of Anesthesiology, Perioperative, and Pain Medicine, Neurosciences and Neurology, Palo Alto, CA, United States
| | | | - Ewan McNicol
- Department of Pharmacy Practice, Massachusetts College of Pharmacy and Health Sciences University, Boston, MA, United States
| | - Kushang V Patel
- Department of Anesthesiology and Pain Medicine, University of Washington, Seattle, WA, United States
| | - Kenneth Schmader
- Department of Medicine-Geriatrics, Center for the Study of Aging, Duke University Medical Center, and Geriatrics Research Education and Clinical Center, Durham VA Medical Center, Durham, NC, United States
| | - Lee Simon
- SDG, LLC, Cambridge, MA, United States
| | | | | | - Jan Vollert
- Department of Clinical and Biomedical Sciences, Faculty of Health and Life Sciences, University of Exeter, Exeter, United Kingdom
| |
Collapse
|
5
|
Hughes JA, Wu Y, Jones L, Douglas C, Brown N, Hazelwood S, Lyrstedt AL, Jarugula R, Chu K, Nguyen A. Analyzing pain patterns in the emergency department: Leveraging clinical text deep learning models for real-world insights. Int J Med Inform 2024; 190:105544. [PMID: 39003790 DOI: 10.1016/j.ijmedinf.2024.105544] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2024] [Revised: 06/09/2024] [Accepted: 07/06/2024] [Indexed: 07/16/2024]
Abstract
OBJECTIVE To determine the incidence of patients presenting in pain to a large Australian inner-city emergency department (ED) using a clinical text deep learning algorithm. MATERIALS AND METHODS A fine-tuned, domain-specific, transformer-based clinical text deep learning model was used to interpret free-text nursing assessments in the electronic medical records of 235,789 adult presentations to the ED over a three-year period. The model classified presentations according to whether the patient had pain on arrival at the ED. Interrupted time series analysis was used to determine the incidence of pain in patients on arrival over time. We described the changes in the population characteristics and incidence of patients with pain on arrival occurring with the start of the Covid-19 pandemic. RESULTS 55.16% (95%CI 54.95%-55.36%) of all patients presenting to this ED had pain on arrival. There were differences in demographics and arrival and departure patterns between patients with and without pain. The Covid-19 pandemic initially precipitated a decrease followed by a sharp, sustained rise in pain on arrival, with concurrent changes to the population arriving in pain and their treatment. DISCUSSION Applying a clinical text deep learning model has successfully identified the incidence of pain on arrival. It represents an automated, reproducible mechanism to identify pain from routinely collected medical records. The description of this population and their treatment forms the basis of intervention to improve care for patients with pain. The combination of the clinical text deep learning models and interrupted time series analysis has reported on the effects of the Covid-19 pandemic on pain care in the ED, outlining a methodology to assess the impact of significant events or interventions on pain care in the ED. CONCLUSION Applying a novel deep learning approach to identifying pain guides methodological approaches to evaluating pain care interventions in the ED, giving previously unavailable population-level insights.
Collapse
Affiliation(s)
- James A Hughes
- School of Nursing, Queensland University of Technology, Brisbane, Australia; Emergency and Trauma Centre, Royal Brisbane and Women's Hospital, Brisbane, Australia.
| | - Yutong Wu
- Australian e-Health Research Centre, CSIRO, Brisbane, Australia
| | - Lee Jones
- QIMR-Berghoffer Research Institute, Brisbane, Australia
| | - Clint Douglas
- School of Nursing, Queensland University of Technology, Brisbane, Australia; Metro North Health, Queensland, Australia
| | - Nathan Brown
- Emergency and Trauma Centre, Royal Brisbane and Women's Hospital, Brisbane, Australia; Faculty of Medicine, University of Queensland, Brisbane, Australia
| | - Sarah Hazelwood
- Emergency Department, The Prince Charles Hospital, Queensland, Australia
| | - Anna-Lisa Lyrstedt
- School of Nursing, Queensland University of Technology, Brisbane, Australia; Emergency and Trauma Centre, Royal Brisbane and Women's Hospital, Brisbane, Australia
| | - Rajeev Jarugula
- Emergency Department, The Prince Charles Hospital, Queensland, Australia
| | - Kevin Chu
- Emergency and Trauma Centre, Royal Brisbane and Women's Hospital, Brisbane, Australia; Faculty of Medicine, University of Queensland, Brisbane, Australia
| | - Anthony Nguyen
- Australian e-Health Research Centre, CSIRO, Brisbane, Australia
| |
Collapse
|
6
|
Payton EM, Graber ML, Bachiashvili V, Mehta T, Dissanayake PI, Berner ES. Impact of clinical note format on diagnostic accuracy and efficiency. HEALTH INF MANAG J 2024; 53:183-188. [PMID: 37129041 DOI: 10.1177/18333583231151979] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/03/2023]
Abstract
BACKGROUND Clinician notes are structured in a variety of ways. This research pilot tested an innovative study design and explored the impact of note formats on diagnostic accuracy and documentation review time. OBJECTIVE To compare two formats for clinical documentation (narrative format vs. list of findings) on clinician diagnostic accuracy and documentation review time. METHOD Participants diagnosed written clinical cases, half in narrative format, and half in list format. Diagnostic accuracy (defined as including correct case diagnosis among top three diagnoses) and time spent processing the case scenario were measured for each format. Generalised linear mixed regression models and bias-corrected bootstrap percentile confidence intervals for mean paired differences were used to analyse the primary research questions. RESULTS Odds of correctly diagnosing list format notes were 26% greater than with narrative notes. However, there is insufficient evidence that this difference is significant (75% CI 0.8-1.99). On average the list format notes required 85.6 more seconds to process and arrive at a diagnosis compared to narrative notes (95% CI -162.3, -2.77). Of cases where participants included the correct diagnosis, on average the list format notes required 94.17 more seconds compared to narrative notes (75% CI -195.9, -8.83). CONCLUSION This study offers note format considerations for those interested in improving clinical documentation and suggests directions for future research. Balancing the priority of clinician preference with value of structured data may be necessary. IMPLICATIONS This study provides a method and suggestive results for further investigation in usability of electronic documentation formats.
Collapse
Affiliation(s)
- Evita M Payton
- University of Alabama at Birmingham, Birmingham, AL, USA
| | - Mark L Graber
- Society to Improve Diagnosis in Medicine, Alpharetta, MD, USA
| | | | - Tapan Mehta
- University of Alabama at Birmingham, Birmingham, AL, USA
| | | | - Eta S Berner
- University of Alabama at Birmingham, Birmingham, AL, USA
| |
Collapse
|
7
|
Cao T, Brady V, Whisenant M, Wang X, Gu Y, Wu H. Toward Reliable Symptom Coding in Electronic Health Records for Symptom Assessment and Research: Identification and Categorization of International Classification of Diseases, Ninth Revision, Clinical Modification Symptom Codes. Comput Inform Nurs 2024; 42:636-647. [PMID: 38968447 PMCID: PMC11377150 DOI: 10.1097/cin.0000000000001146] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/07/2024]
Abstract
To date, symptom documentation has mostly relied on clinical notes in electronic health records or patient-reported outcomes using disease-specific symptom inventories. To provide a common and precise language for symptom recording, assessment, and research, a comprehensive list of symptom codes is needed. The International Classification of Diseases, Ninth Revision or its clinical modification ( International Classification of Diseases, Ninth Revision, Clinical Modification ) has a range of codes designated for symptoms, but it does not contain codes for all possible symptoms, and not all codes in that range are symptom related. This study aimed to identify and categorize the first list of International Classification of Diseases, Ninth Revision, Clinical Modification symptom codes for a general population and demonstrate their use to characterize symptoms of patients with type 2 diabetes mellitus in the Cerner database. A list of potential symptom codes was automatically extracted from the Unified Medical Language System Metathesaurus. Two clinical experts in symptom science and diabetes manually reviewed this list to identify and categorize codes as symptoms. A total of 1888 International Classification of Diseases, Ninth Revision, Clinical Modification symptom codes were identified and categorized into 65 categories. The symptom characterization using the newly obtained symptom codes and categories was found to be more reasonable than that using the previous symptom codes and categories on the same Cerner diabetes cohort.
Collapse
Affiliation(s)
- Tru Cao
- Author Affiliations: UTHealth Houston School of Public Health (Drs Cao, Wang, and Wu and Mr Gu), UTHealth Houston Cizik School of Nursing (Dr Brady), and The University of Texas MD Anderson Cancer Center (Dr Whisenant)
| | | | | | | | | | | |
Collapse
|
8
|
Danna G, Garg R, Buchheit J, Patel R, Zhan T, Ellyn A, Maqbool F, Yala L, Moklyak Y, Frydman J, Kho A, Kong N, Furmanchuk A, Lundberg A, Stey AM. Prediction of intra-abdominal injury using natural language processing of electronic medical record data. Surgery 2024; 176:577-585. [PMID: 38972771 PMCID: PMC11330356 DOI: 10.1016/j.surg.2024.05.042] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2023] [Revised: 05/12/2024] [Accepted: 05/28/2024] [Indexed: 07/09/2024]
Abstract
BACKGROUND This study aimed to use natural language processing to predict the presence of intra-abdominal injury using unstructured data from electronic medical records. METHODS This was a random-sample retrospective observational cohort study leveraging unstructured data from injured patients taken to one of 9 acute care hospitals in an integrated health system between 2015 and 2021. Patients with International Classification of Diseases External Cause of Morbidity codes were identified. History and physical, consult, progress, and radiology report text from the first 8 hours of care were abstracted. Annotator dyads independently annotated encounters' text files to establish ground truth regarding whether intra-abdominal injury occurred. Features were extracted from text using natural language processing techniques, bag of words, and principal component analysis. We tested logistic regression, random forests, and gradient boosting machine to determine accuracy, recall, and precision of natural language processing to predict intra-abdominal injury. RESULTS A random sample of 7,000 patient encounters of 177,127 was annotated. Only 2,951 had sufficient information to determine whether an intra-abdominal injury was present. Among those, 84 (2.9%) had an intra-abdominal injury. The concordance between annotators was 0.989. Logistic regression of features identified with bag of words and principal component analysis had the best predictive ability, with an area under the receiver operating characteristic curve of 0.9, recall of 0.73, and precision of 0.17. Text features with greatest importance included "abdomen," "pelvis," "spleen," and "hematoma." CONCLUSION Natural language processing could be a screening decision support tool, which, if paired with human clinical assessment, can maximize precision of intra-abdominal injury identification.
Collapse
Affiliation(s)
- Giovanna Danna
- Chicago Medical School, Rosalind Franklin University, Chicago, IL
| | - Ravi Garg
- Feinberg School of Medicine, Northwestern University, Chicago, IL
| | - Joanna Buchheit
- Feinberg School of Medicine, Northwestern University, Chicago, IL
| | - Radha Patel
- Chicago Medical School, Rosalind Franklin University, Chicago, IL
| | - Tiannan Zhan
- Feinberg School of Medicine, Northwestern University, Chicago, IL
| | - Alexander Ellyn
- Chicago Medical School, Rosalind Franklin University, Chicago, IL
| | - Farhan Maqbool
- Feinberg School of Medicine, Northwestern University, Chicago, IL
| | - Linda Yala
- Feinberg School of Medicine, Northwestern University, Chicago, IL
| | - Yuriy Moklyak
- Feinberg School of Medicine, Northwestern University, Chicago, IL
| | - James Frydman
- Feinberg School of Medicine, Northwestern University, Chicago, IL
| | - Abel Kho
- Feinberg School of Medicine, Northwestern University, Chicago, IL. https://www.twitter.com/Abelkho
| | - Nan Kong
- Weldon School of Biomedical Engineering, Purdue University, West Lafayette, IN
| | - Alona Furmanchuk
- Feinberg School of Medicine, Northwestern University, Chicago, IL
| | | | - Anne M Stey
- Feinberg School of Medicine, Northwestern University, Chicago, IL.
| |
Collapse
|
9
|
Albashayreh A, Bandyopadhyay A, Zeinali N, Zhang M, Fan W, Gilbertson White S. Natural Language Processing Accurately Differentiates Cancer Symptom Information in Electronic Health Record Narratives. JCO Clin Cancer Inform 2024; 8:e2300235. [PMID: 39116379 DOI: 10.1200/cci.23.00235] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2023] [Revised: 04/29/2024] [Accepted: 05/30/2024] [Indexed: 08/10/2024] Open
Abstract
PURPOSE Identifying cancer symptoms in electronic health record (EHR) narratives is feasible with natural language processing (NLP). However, more efficient NLP systems are needed to detect various symptoms and distinguish observed symptoms from negated symptoms and medication-related side effects. We evaluated the accuracy of NLP in (1) detecting 14 symptom groups (ie, pain, fatigue, swelling, depressed mood, anxiety, nausea/vomiting, pruritus, headache, shortness of breath, constipation, numbness/tingling, decreased appetite, impaired memory, disturbed sleep) and (2) distinguishing observed symptoms in EHR narratives among patients with cancer. METHODS We extracted 902,508 notes for 11,784 unique patients diagnosed with cancer and developed a gold standard corpus of 1,112 notes labeled for presence or absence of 14 symptom groups. We trained an embeddings-augmented NLP system integrating human and machine intelligence and conventional machine learning algorithms. NLP metrics were calculated on a gold standard corpus subset for testing. RESULTS The interannotator agreement for labeling the gold standard corpus was excellent at 92%. The embeddings-augmented NLP model achieved the best performance (F1 score = 0.877). The highest NLP accuracy was observed in pruritus (F1 score = 0.937) while the lowest accuracy was in swelling (F1 score = 0.787). After classifying the entire data set with embeddings-augmented NLP, we found that 41% of the notes included symptom documentation. Pain was the most documented symptom (29% of all notes) while impaired memory was the least documented (0.7% of all notes). CONCLUSION We illustrated the feasibility of detecting 14 symptom groups in EHR narratives and showed that an embeddings-augmented NLP system outperforms conventional machine learning algorithms in detecting symptom information and differentiating observed symptoms from negated symptoms and medication-related side effects.
Collapse
Affiliation(s)
| | | | | | - Min Zhang
- School of Economics and Management, Communication University of China, Beijing, China
| | - Weiguo Fan
- Tippie College of Business, University of Iowa, Iowa City, IA
| | | |
Collapse
|
10
|
Mora S, Giacobbe DR, Bartalucci C, Viglietti G, Mikulska M, Vena A, Ball L, Robba C, Cappello A, Battaglini D, Brunetti I, Pelosi P, Bassetti M, Giacomini M. Towards the automatic calculation of the EQUAL Candida Score: Extraction of CVC-related information from EMRs of critically ill patients with candidemia in Intensive Care Units. J Biomed Inform 2024; 156:104667. [PMID: 38848885 DOI: 10.1016/j.jbi.2024.104667] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2023] [Revised: 06/01/2024] [Accepted: 06/03/2024] [Indexed: 06/09/2024]
Abstract
OBJECTIVES Candidemia is the most frequent invasive fungal disease and the fourth most frequent bloodstream infection in hospitalized patients. Its optimal management is crucial for improving patients' survival. The quality of candidemia management can be assessed with the EQUAL Candida Score. The objective of this work is to support its automatic calculation by extracting central venous catheter-related information from Italian text in clinical notes of electronic medical records. MATERIALS AND METHODS The sample includes 4,787 clinical notes of 108 patients hospitalized between January 2018 to December 2020 in the Intensive Care Units of the IRCCS San Martino Polyclinic Hospital in Genoa (Italy). The devised pipeline exploits natural language processing (NLP) to produce numerical representations of clinical notes used as input of machine learning (ML) algorithms to identify CVC presence and removal. It compares the performances of (i) rule-based method, (ii) count-based method together with a ML algorithm, and (iii) a transformers-based model. RESULTS Results, obtained with three different approaches, were evaluated in terms of weighted F1 Score. The random forest classifier showed the higher performance in both tasks reaching 82.35%. CONCLUSION The present work constitutes a first step towards the automatic calculation of the EQUAL Candida Score from unstructured daily collected data by combining ML and NLP methods. The automatic calculation of the EQUAL Candida Score could provide crucial real-time feedback on the quality of candidemia management, aimed at further improving patients' health.
Collapse
Affiliation(s)
- Sara Mora
- Department of Informatics, Bioengineering, Robotics and System Engineering (DIBRIS), University of Genoa, Genoa, Italy; UO Information and Communication Technologies (ICT), IRCCS Ospedale Policlinico San Martino, Genoa, Italy.
| | - Daniele Roberto Giacobbe
- Department of Health Sciences (DISSAL), University of Genoa, Genoa, Italy; Clinica Malattie Infettive, IRCCS Ospedale Policlinico San Martino, Genoa, Italy
| | - Claudia Bartalucci
- Department of Health Sciences (DISSAL), University of Genoa, Genoa, Italy; Clinica Malattie Infettive, IRCCS Ospedale Policlinico San Martino, Genoa, Italy
| | - Giulia Viglietti
- Clinica Malattie Infettive, IRCCS Ospedale Policlinico San Martino, Genoa, Italy
| | - Malgorzata Mikulska
- Department of Health Sciences (DISSAL), University of Genoa, Genoa, Italy; Clinica Malattie Infettive, IRCCS Ospedale Policlinico San Martino, Genoa, Italy
| | - Antonio Vena
- Department of Health Sciences (DISSAL), University of Genoa, Genoa, Italy; Clinica Malattie Infettive, IRCCS Ospedale Policlinico San Martino, Genoa, Italy
| | - Lorenzo Ball
- Department of Surgical Sciences and Integrated Diagnostics (DISC), University of Genoa, Genoa, Italy; Anesthesia and Intensive Care, IRCCS Ospedale Policlinico San Martino, Genoa, Italy
| | - Chiara Robba
- Department of Surgical Sciences and Integrated Diagnostics (DISC), University of Genoa, Genoa, Italy; Anesthesia and Intensive Care, IRCCS Ospedale Policlinico San Martino, Genoa, Italy
| | - Alice Cappello
- Clinica Malattie Infettive, IRCCS Ospedale Policlinico San Martino, Genoa, Italy
| | - Denise Battaglini
- Anesthesia and Intensive Care, IRCCS Ospedale Policlinico San Martino, Genoa, Italy
| | - Iole Brunetti
- Anesthesia and Intensive Care, IRCCS Ospedale Policlinico San Martino, Genoa, Italy
| | - Paolo Pelosi
- Department of Surgical Sciences and Integrated Diagnostics (DISC), University of Genoa, Genoa, Italy; Anesthesia and Intensive Care, IRCCS Ospedale Policlinico San Martino, Genoa, Italy
| | - Matteo Bassetti
- Department of Health Sciences (DISSAL), University of Genoa, Genoa, Italy; Clinica Malattie Infettive, IRCCS Ospedale Policlinico San Martino, Genoa, Italy
| | - Mauro Giacomini
- Department of Informatics, Bioengineering, Robotics and System Engineering (DIBRIS), University of Genoa, Genoa, Italy
| |
Collapse
|
11
|
Borchert F, Llorca I, Schapranow MP. Improving biomedical entity linking for complex entity mentions with LLM-based text simplification. Database (Oxford) 2024; 2024:baae067. [PMID: 39066514 PMCID: PMC11281847 DOI: 10.1093/database/baae067] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2024] [Revised: 05/08/2024] [Accepted: 07/18/2024] [Indexed: 07/28/2024]
Abstract
Large amounts of important medical information are captured in free-text documents in biomedical research and within healthcare systems, which can be made accessible through natural language processing (NLP). A key component in most biomedical NLP pipelines is entity linking, i.e. grounding textual mentions of named entities to a reference of medical concepts, usually derived from a terminology system, such as the Systematized Nomenclature of Medicine Clinical Terms. However, complex entity mentions, spanning multiple tokens, are notoriously hard to normalize due to the difficulty of finding appropriate candidate concepts. In this work, we propose an approach to preprocess such mentions for candidate generation, building upon recent advances in text simplification with generative large language models. We evaluate the feasibility of our method in the context of the entity linking track of the BioCreative VIII SympTEMIST shared task. We find that instructing the latest Generative Pre-trained Transformer model with a few-shot prompt for text simplification results in mention spans that are easier to normalize. Thus, we can improve recall during candidate generation by 2.9 percentage points compared to our baseline system, which achieved the best score in the original shared task evaluation. Furthermore, we show that this improvement in recall can be fully translated into top-1 accuracy through careful initialization of a subsequent reranking model. Our best system achieves an accuracy of 63.6% on the SympTEMIST test set. The proposed approach has been integrated into the open-source xMEN toolkit, which is available online via https://github.com/hpi-dhc/xmen.
Collapse
Affiliation(s)
- Florian Borchert
- Hasso Plattner Institute for Digital Engineering, University of Potsdam, Prof.-Dr.-Helmert-Straße 2-3, Potsdam 14482, Germany
| | - Ignacio Llorca
- Hasso Plattner Institute for Digital Engineering, University of Potsdam, Prof.-Dr.-Helmert-Straße 2-3, Potsdam 14482, Germany
| | - Matthieu-P Schapranow
- Hasso Plattner Institute for Digital Engineering, University of Potsdam, Prof.-Dr.-Helmert-Straße 2-3, Potsdam 14482, Germany
| |
Collapse
|
12
|
Kim A, Jeon E, Lee H, Heo H, Woo K. Risk factors for prediabetes in community-dwelling adults: A generalized estimating equation logistic regression approach with natural language processing insights. Res Nurs Health 2024. [PMID: 38961672 DOI: 10.1002/nur.22413] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2023] [Revised: 05/11/2024] [Accepted: 06/22/2024] [Indexed: 07/05/2024]
Abstract
The global prevalence of prediabetes is expected to reach 8.3% (587 million people) by 2045, with 70% of people with prediabetes developing diabetes during their lifetimes. We aimed to classify community-dwelling adults with a high risk for prediabetes based on prediabetes-related symptoms and to identify their characteristics, which might be factors associated with prediabetes. We analyzed homecare nursing records (n = 26,840) of 1628 patients aged over 20 years. Using a natural language processing algorithm, we classified each nursing episode as either low-risk or high-risk for prediabetes based on the detected number and category of prediabetes-symptom words. To identify differences between the risk groups, we employed t-tests, chi-square tests, and data visualization. Risk factors for prediabetes were identified using multiple logistic regression models with generalized estimating equations. A total of 3270 episodes (12.18%) were classified as potentially high-risk for prediabetes. There were significant differences in the personal, social, and clinical factors between groups. Results revealed that female sex, age, cancer coverage as part of homecare insurance coverage, and family caregivers were significantly associated with an increased risk of prediabetes. Although prediabetes is not a life-threatening disease, uncontrolled blood glucose can cause unfavorable outcomes for other major diseases. Thus, medical professionals should consider the associated symptoms and risk factors of prediabetes. Moreover, the proposed algorithm may support the detection of individuals at a high risk for prediabetes. Implementing this approach could facilitate proactive monitoring and early intervention, leading to reduced healthcare expenses and better health outcomes for community-dwelling adults.
Collapse
Affiliation(s)
- Aeri Kim
- College of Nursing, Seoul National University, Seoul, South Korea
| | - Eunjoo Jeon
- Technology Research, Samsung SDS, Seoul, South Korea
| | - Hana Lee
- College of Nursing, Seoul National University, Seoul, South Korea
| | - Hyunsook Heo
- Seoul National University Hospital, Seoul, South Korea
| | - Kyungmi Woo
- College of Nursing, Seoul National University, Seoul, South Korea
| |
Collapse
|
13
|
Park JI, Park JW, Zhang K, Kim D. Advancing equity in breast cancer care: natural language processing for analysing treatment outcomes in under-represented populations. BMJ Health Care Inform 2024; 31:e100966. [PMID: 38955389 PMCID: PMC11218025 DOI: 10.1136/bmjhci-2023-100966] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2023] [Accepted: 06/21/2024] [Indexed: 07/04/2024] Open
Abstract
OBJECTIVE The study aimed to develop natural language processing (NLP) algorithms to automate extracting patient-centred breast cancer treatment outcomes from clinical notes in electronic health records (EHRs), particularly for women from under-represented populations. METHODS The study used clinical notes from 2010 to 2021 from a tertiary hospital in the USA. The notes were processed through various NLP techniques, including vectorisation methods (term frequency-inverse document frequency (TF-IDF), Word2Vec, Doc2Vec) and classification models (support vector classification, K-nearest neighbours (KNN), random forest (RF)). Feature selection and optimisation through random search and fivefold cross-validation were also conducted. RESULTS The study annotated 100 out of 1000 clinical notes, using 970 notes to build the text corpus. TF-IDF and Doc2Vec combined with RF showed the highest performance, while Word2Vec was less effective. RF classifier demonstrated the best performance, although with lower recall rates, suggesting more false negatives. KNN showed lower recall due to its sensitivity to data noise. DISCUSSION The study highlights the significance of using NLP in analysing clinical notes to understand breast cancer treatment outcomes in under-represented populations. The TF-IDF and Doc2Vec models were more effective in capturing relevant information than Word2Vec. The study observed lower recall rates in RF models, attributed to the dataset's imbalanced nature and the complexity of clinical notes. CONCLUSION The study developed high-performing NLP pipeline to capture treatment outcomes for breast cancer in under-represented populations, demonstrating the importance of document-level vectorisation and ensemble methods in clinical notes analysis. The findings provide insights for more equitable healthcare strategies and show the potential for broader NLP applications in clinical settings.
Collapse
Affiliation(s)
- Jung In Park
- University of California Irvine, Irvine, California, USA
| | - Jong Won Park
- Yonsei Cancer Center, Yonsei University College of Medicine, Seoul, South Korea
| | - Kexin Zhang
- Donald Bren School of Information & Computer Sciences, University of California Irvine, Irvine, California, USA
| | - Doyop Kim
- Independent Researcher, Irvine, California, USA
| |
Collapse
|
14
|
Gleason KT, Tran A, Fawzy A, Yan L, Farley H, Garibaldi B, Iwashyna TJ. Does nurse use of a standardized flowsheet to document communication with advanced providers provide a mechanism to detect pulse oximetry failures? A retrospective study of electronic health record data. Int J Nurs Stud 2024; 155:104770. [PMID: 38676990 DOI: 10.1016/j.ijnurstu.2024.104770] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2024] [Revised: 03/05/2024] [Accepted: 04/02/2024] [Indexed: 04/29/2024]
Abstract
BACKGROUND Pulse oximetry guides clinical decisions, yet does not uniformly identify hypoxemia. We hypothesized that nursing documentation of notifying providers, facilitated by a standardized flowsheet for documenting communication to providers (physicians, nurse practitioners, and physician assistants), may increase when hypoxemia is present, but undetected by the pulse oximeter, in events termed "occult hypoxemia." OBJECTIVE To compare nurse documentation of provider notification in the 4 h preceding cases of occult hypoxemia, normal oxygenation, and evident hypoxemia confirmed by an arterial blood gas reading. METHODS We conducted a retrospective study using electronic health record data from patients with COVID-19 at five hospitals in a healthcare system with paired SpO2 and SaO2 readings (measurements within 10 min of oxygen saturation levels in arterial blood, SaO2, and by pulse oximetry, SpO2). We applied multivariate logistic regression to assess if having any nursing documentation of provider notification in the 4 h prior to a paired reading confirming occult hypoxemia was more likely compared to a paired reading confirming normal oxygen status, adjusting for characteristics significantly associated with nursing documentation. We applied conditional logistic regression to assess if having any nursing documentation of provider notification was more likely in the 4-hour window preceding a paired reading compared to the 4-hour window 24 h earlier separately for occult hypoxemia, visible hypoxemia, and normal oxygenation. RESULTS There were data from 1910 patients hospitalized with COVID-19 who had 44,972 paired readings and an average of 26.5 (34.5) nursing documentation of provider notification events. The mean age was 63.4 (16.2). Almost half (866/1910, 45.3 %) were White, 701 (36.7 %) were Black, and 239 (12.5 %) were Hispanic. Having any nursing documentation of provider notification was 46 % more common in the 4 h before an occult hypoxemia paired reading compared to a normal oxygen status paired reading (OR 1.46, 95 % CI: 1.28-1.67). Comparing the 4 h immediately before the reading to the 4 h one day preceding the paired reading, there was a higher likelihood of having any nursing documentation of provider notification for both evident (OR 1.45, 95 % CI 1.24-1.68) and occult paired readings (OR 1.26, 95 % CI 1.04-1.53). CONCLUSION This study finds that nursing documentation of provider notification significantly increases prior to confirmed occult hypoxemia, which has potential in proactively identifying occult hypoxemia and other clinical issues. There is potential value to encouraging standardized documentation of nurse concern, including communication to providers, to facilitate its inclusion in clinical decision-making.
Collapse
Affiliation(s)
- Kelly T Gleason
- Johns Hopkins University School of Nursing, Baltimore, MD, USA.
| | | | - Ashraf Fawzy
- Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Li Yan
- Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | | | - Brian Garibaldi
- Johns Hopkins University School of Medicine, Baltimore, MD, USA; Johns Hopkins Hospital, Baltimore, MD, USA
| | - Theodore J Iwashyna
- Johns Hopkins University School of Medicine, Baltimore, MD, USA; Johns Hopkins Hospital, Baltimore, MD, USA; Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| |
Collapse
|
15
|
Jackson AF, Burkom H. A Framework for Developing and Assessing Custom Case Definitions: A Demonstration Applied to Opioid Overdose in Maryland. JOURNAL OF PUBLIC HEALTH MANAGEMENT AND PRACTICE 2024; 30:578-585. [PMID: 38870375 DOI: 10.1097/phh.0000000000001885] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/15/2024]
Abstract
CONTEXT Public health epidemiologists monitor data sources for disease outbreaks and other events of public health concern, but manual review of records to identify cases of interest is slow and labor-intensive and may not reflect evolving data practices. To automatically identify cases from electronic data sources, epidemiologists must use "case definitions" or formal logic that captures the criteria used to identify a record as a case of interest. OBJECTIVE To establish a methodology for development and evaluation of case definitions. A logical evaluation framework to approach case definitions will allow jurisdictions the flexibility to implement a case definition tailored to their goals and available data. DESIGN Case definition development is explained as a process with multiple logical components combining free-text and categorical data fields. The process is illustrated with the development of a case definition to identify emergency medical services (EMS) call records related to opioid overdoses in Maryland. SETTING The Maryland Department of Health (MDH) installation of the Electronic Surveillance System for Early Notification of Community-Based Epidemics (ESSENCE), which began capturing EMS call records in ESSENCE in 2019 to improve statewide coverage of all-hazards health issues. RESULTS We describe a case definition evaluation framework and demonstrate its application through development of an opioid overdose case definition to be used in MDH ESSENCE. We show the iterative process of development, from defining how a case can be identified conceptually to examining each component of the conceptual definition and then exploring how to capture that component using available data. CONCLUSION We present a framework for developing and qualitatively assessing case definitions and demonstrate an application of the framework to identifying opioid overdose incidents from MDH EMS data. We discuss guidelines to support jurisdictions in applying this framework to their own data and public health challenges to improve local surveillance capability.
Collapse
Affiliation(s)
- Alice F Jackson
- Johns Hopkins University Applied Physics Laboratory, Laurel, Maryland
| | | |
Collapse
|
16
|
Wieland-Jorna Y, van Kooten D, Verheij RA, de Man Y, Francke AL, Oosterveld-Vlug MG. Natural language processing systems for extracting information from electronic health records about activities of daily living. A systematic review. JAMIA Open 2024; 7:ooae044. [PMID: 38798774 PMCID: PMC11126158 DOI: 10.1093/jamiaopen/ooae044] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2024] [Revised: 03/21/2024] [Accepted: 05/07/2024] [Indexed: 05/29/2024] Open
Abstract
Objective Natural language processing (NLP) can enhance research on activities of daily living (ADL) by extracting structured information from unstructured electronic health records (EHRs) notes. This review aims to give insight into the state-of-the-art, usability, and performance of NLP systems to extract information on ADL from EHRs. Materials and Methods A systematic review was conducted based on searches in Pubmed, Embase, Cinahl, Web of Science, and Scopus. Studies published between 2017 and 2022 were selected based on predefined eligibility criteria. Results The review identified 22 studies. Most studies (65%) used NLP for classifying unstructured EHR data on 1 or 2 ADL. Deep learning, combined with a ruled-based method or machine learning, was the approach most commonly used. NLP systems varied widely in terms of the pre-processing and algorithms. Common performance evaluation methods were cross-validation and train/test datasets, with F1, precision, and sensitivity as the most frequently reported evaluation metrics. Most studies reported relativity high overall scores on the evaluation metrics. Discussion NLP systems are valuable for the extraction of unstructured EHR data on ADL. However, comparing the performance of NLP systems is difficult due to the diversity of the studies and challenges related to the dataset, including restricted access to EHR data, inadequate documentation, lack of granularity, and small datasets. Conclusion This systematic review indicates that NLP is promising for deriving information on ADL from unstructured EHR notes. However, what the best-performing NLP system is, depends on characteristics of the dataset, research question, and type of ADL.
Collapse
Affiliation(s)
- Yvonne Wieland-Jorna
- Netherlands Institute for Health Services Research (Nivel), Utrecht, Postbus 1568, 3500 BN, The Netherlands
- Tranzo, School of Social Sciences and Behavioural Research, Tilburg University, Tilburg, Postbus 90153, 5000 LE, The Netherlands
| | - Daan van Kooten
- Netherlands Institute for Health Services Research (Nivel), Utrecht, Postbus 1568, 3500 BN, The Netherlands
| | - Robert A Verheij
- Netherlands Institute for Health Services Research (Nivel), Utrecht, Postbus 1568, 3500 BN, The Netherlands
- Tranzo, School of Social Sciences and Behavioural Research, Tilburg University, Tilburg, Postbus 90153, 5000 LE, The Netherlands
| | - Yvonne de Man
- Netherlands Institute for Health Services Research (Nivel), Utrecht, Postbus 1568, 3500 BN, The Netherlands
| | - Anneke L Francke
- Netherlands Institute for Health Services Research (Nivel), Utrecht, Postbus 1568, 3500 BN, The Netherlands
- Department of Public and Occupational Health, Location Vrije Universiteit Amsterdam, Amsterdam UMC, Amsterdam, Postbus 7057, 1007 MB, The Netherlands
| | - Mariska G Oosterveld-Vlug
- Netherlands Institute for Health Services Research (Nivel), Utrecht, Postbus 1568, 3500 BN, The Netherlands
| |
Collapse
|
17
|
Smith SJ, Moorin R, Taylor K, Newton J, Smith S. Collecting routine and timely cancer stage at diagnosis by implementing a cancer staging tiered framework: the Western Australian Cancer Registry experience. BMC Health Serv Res 2024; 24:770. [PMID: 38943091 PMCID: PMC11214229 DOI: 10.1186/s12913-024-11224-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2024] [Accepted: 06/20/2024] [Indexed: 07/01/2024] Open
Abstract
BACKGROUND Current processes collecting cancer stage data in population-based cancer registries (PBCRs) lack standardisation, resulting in difficulty utilising diverse data sources and incomplete, low-quality data. Implementing a cancer staging tiered framework aims to improve stage collection and facilitate inter-PBCR benchmarking. OBJECTIVE Demonstrate the application of a cancer staging tiered framework in the Western Australian Cancer Staging Project to establish a standardised method for collecting cancer stage at diagnosis data in PBCRs. METHODS The tiered framework, developed in collaboration with a Project Advisory Group and applied to breast, colorectal, and melanoma cancers, provides business rules - procedures for stage collection. Tier 1 represents the highest staging level, involving complete American Joint Committee on Cancer (AJCC) tumour-node-metastasis (TNM) data collection and other critical staging information. Tier 2 (registry-derived stage) relies on supplementary data, including hospital admission data, to make assumptions based on data availability. Tier 3 (pathology stage) solely uses pathology reports. FINDINGS The tiered framework promotes flexible utilisation of staging data, recognising various levels of data completeness. Tier 1 is suitable for all purposes, including clinical and epidemiological applications. Tiers 2 and 3 are recommended for epidemiological analysis alone. Lower tiers provide valuable insights into disease patterns, risk factors, and overall disease burden for public health planning and policy decisions. Capture of staging at each tier depends on data availability, with potential shifts to higher tiers as new data sources are acquired. CONCLUSIONS The tiered framework offers a dynamic approach for PBCRs to record stage at diagnosis, promoting consistency in population-level staging data and enabling practical use for benchmarking across jurisdictions, public health planning, policy development, epidemiological analyses, and assessing cancer outcomes. Evolution with staging classifications and data variable changes will futureproof the tiered framework. Its adaptability fosters continuous refinement of data collection processes and encourages improvements in data quality.
Collapse
Affiliation(s)
- Shantelle J Smith
- School of Population Health, Curtin University, Perth, WA, Australia.
- Faculty of Health Sciences, Curtin Health Innovation Research Institute, Curtin University, Bentley, WA, Australia.
| | - Rachael Moorin
- School of Population Health, Curtin University, Perth, WA, Australia
- Faculty of Health Sciences, Curtin Health Innovation Research Institute, Curtin University, Bentley, WA, Australia
- School of Population and Global Health, The University of Western Australia, Crawley, WA, Australia
| | - Karen Taylor
- Cancer Network WA, North Metropolitan Health Service, Perth, WA, Australia
| | - Jade Newton
- School of Population Health, Curtin University, Perth, WA, Australia
- Faculty of Health Sciences, Curtin Health Innovation Research Institute, Curtin University, Bentley, WA, Australia
| | - Stephanie Smith
- School of Population Health, Curtin University, Perth, WA, Australia
- Curtin Medical School, Curtin University, Perth, WA, Australia
| |
Collapse
|
18
|
Mun M, Kim A, Woo K. Natural Language Processing Application in Nursing Research: A Study Using Text Network Analysis and Topic Modeling. Comput Inform Nurs 2024:00024665-990000000-00202. [PMID: 38913983 DOI: 10.1097/cin.0000000000001158] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/26/2024]
Abstract
Although the potential of natural language processing and an increase in its application in nursing research is evident, there is a lack of understanding of the research trends. This study conducts text network analysis and topic modeling to uncover the underlying knowledge structures, research trends, and emergent research themes within nursing literature related to natural language processing. In addition, this study aims to provide a foundation for future scholarly inquiries and enhance the integration of natural language processing in the analysis of nursing research. We analyzed 443 literature abstracts and performed core keyword analysis and topic modeling based on frequency and centrality. The following topics emerged: (1) Term Identification and Communication; (2) Application of Machine Learning; (3) Exploration of Health Outcome Factors; (4) Intervention and Participant Experience; and (5) Disease-Related Algorithms. Nursing meta-paradigm elements were identified within the core keyword analysis, which led to understanding and expanding the meta-paradigm. Although still in its infancy in nursing research with limited topics and research volumes, natural language processing can potentially enhance research efficiency and nursing quality. The findings emphasize the possibility of integrating natural language processing in nursing-related subjects, validating nursing value, and fostering the exploration of essential paradigms in nursing science.
Collapse
Affiliation(s)
- Minji Mun
- Author Affiliations: College of Nursing (Mrs Mun, Mrs Kim, and Dr Woo), and The Research Institute of Nursing Science, College of Nursing (Dr Woo), Seoul National University, South Korea
| | | | | |
Collapse
|
19
|
Dos Santos VL, Sato KS, Maher CG, Vidal RVC, Grande GHD, Costa LOP, Machado GC, Ferreira GE, Buchbinder R, Oliveira CB. Clinical indicators to monitor health care in low back pain: a scoping review. Int J Qual Health Care 2024; 36:mzae044. [PMID: 38814664 DOI: 10.1093/intqhc/mzae044] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2023] [Accepted: 05/27/2024] [Indexed: 05/31/2024] Open
Abstract
Clinical care indicators for low back pain can be used to monitor healthcare practices and consequently be used to evaluate success of strategies to improve care quality. The aim of this study was to identify the clinical care indicators that have been used to measure appropriateness of health care for patients with low back pain. We conducted a systematic search of five electronic databases and Google to identify clinical care indicators that have been used to measure any aspect of care for people with low back pain. Care indicators were narratively described according to their type (i.e. structure, process, or outcomes) and categorized by their purpose (e.g. to measure aspects related to assessment, imaging requests, treatment/prevention, and outcomes). A total of 3562 and 2180 records were retrieved from electronic databases and Google searches, respectively. We identified 280 indicators related to low back pain care from 40 documents and publications. Most quality indicators were process indicators (n = 213, 76%), followed by structure (n = 41, 15%) and outcome indicators (n = 26, 9%). The most common indicators were related to imaging requests (n = 41, 15%), referral to healthcare providers (n = 30, 11%), and shared decision-making (n = 21, 7%). Our review identified a range of clinical care indicators that have been used to measure the quality of health care for people with low back pain. Our findings will support a Delphi study to reach international consensus on what would be the most important and feasible indicators for a minimum dataset to be collected globally.
Collapse
Affiliation(s)
- Vanessa L Dos Santos
- Faculty of Medicine, University of Western São Paulo (UNOESTE), Presidente Prudente, Sao Paulo 19050-920, Brazil
| | - Karen S Sato
- Faculty of Medicine, University of Western São Paulo (UNOESTE), Presidente Prudente, Sao Paulo 19050-920, Brazil
| | - Chris G Maher
- Institute for Musculoskeletal Health, Sydney Local Health District, King George V Building, Missenden Road, Camperdown, Sydney, New South Wales 2050, Australia
- Sydney Musculoskeletal Health, Faculty of Medicine and Health, The University of Sydney, King George V Building, Missenden Road, Camperdown, Sydney, New South Wales 2050, Australia
| | - Rubens V C Vidal
- Faculty of Medicine, University of Western São Paulo (UNOESTE), Presidente Prudente, Sao Paulo 19050-920, Brazil
| | - Guilherme H D Grande
- Faculty of Medicine, University of Western São Paulo (UNOESTE), Presidente Prudente, Sao Paulo 19050-920, Brazil
- Departamento de Educação Física, Faculdade de Ciências e Tecnologia, Universidade Estadual Paulista, Rua Roberto Simonsen, 305, Presidente Prudente, Sao Pualo 19060-900, Brazil
| | - Leonardo O P Costa
- Masters and Doctoral Programs in Physical Therapy, Universidade Cidade de São Paulo, Rua Cesário Galeno, 448, Sao Paulo 03071-000, Brazil
| | - Gustavo C Machado
- Institute for Musculoskeletal Health, Sydney Local Health District, King George V Building, Missenden Road, Camperdown, Sydney, New South Wales 2050, Australia
- Sydney Musculoskeletal Health, Faculty of Medicine and Health, The University of Sydney, King George V Building, Missenden Road, Camperdown, Sydney, New South Wales 2050, Australia
| | - Giovanni E Ferreira
- Institute for Musculoskeletal Health, Sydney Local Health District, King George V Building, Missenden Road, Camperdown, Sydney, New South Wales 2050, Australia
- Sydney Musculoskeletal Health, Faculty of Medicine and Health, The University of Sydney, King George V Building, Missenden Road, Camperdown, Sydney, New South Wales 2050, Australia
| | - Rachelle Buchbinder
- Musculoskeletal Health and Wiser Health Care Units, School of Public Health and Preventive Medicine, Monash University, 4 Drysdale St, Malvern, Melbourne, Victoria 3144, Australia
| | - Crystian B Oliveira
- Faculty of Medicine, University of Western São Paulo (UNOESTE), Presidente Prudente, Sao Paulo 19050-920, Brazil
- Departamento de Educação Física, Faculdade de Ciências e Tecnologia, Universidade Estadual Paulista, Rua Roberto Simonsen, 305, Presidente Prudente, Sao Pualo 19060-900, Brazil
| |
Collapse
|
20
|
Le KDR, Tay SBP, Choy KT, Verjans J, Sasanelli N, Kong JCH. Applications of natural language processing tools in the surgical journey. Front Surg 2024; 11:1403540. [PMID: 38826809 PMCID: PMC11140056 DOI: 10.3389/fsurg.2024.1403540] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2024] [Accepted: 05/07/2024] [Indexed: 06/04/2024] Open
Abstract
Background Natural language processing tools are becoming increasingly adopted in multiple industries worldwide. They have shown promising results however their use in the field of surgery is under-recognised. Many trials have assessed these benefits in small settings with promising results before large scale adoption can be considered in surgery. This study aims to review the current research and insights into the potential for implementation of natural language processing tools into surgery. Methods A narrative review was conducted following a computer-assisted literature search on Medline, EMBASE and Google Scholar databases. Papers related to natural language processing tools and consideration into their use for surgery were considered. Results Current applications of natural language processing tools within surgery are limited. From the literature, there is evidence of potential improvement in surgical capability and service delivery, such as through the use of these technologies to streamline processes including surgical triaging, data collection and auditing, surgical communication and documentation. Additionally, there is potential to extend these capabilities to surgical academia to improve processes in surgical research and allow innovation in the development of educational resources. Despite these outcomes, the evidence to support these findings are challenged by small sample sizes with limited applicability to broader settings. Conclusion With the increasing adoption of natural language processing technology, such as in popular forms like ChatGPT, there has been increasing research in the use of these tools within surgery to improve surgical workflow and efficiency. This review highlights multifaceted applications of natural language processing within surgery, albeit with clear limitations due to the infancy of the infrastructure available to leverage these technologies. There remains room for more rigorous research into broader capability of natural language processing technology within the field of surgery and the need for cross-sectoral collaboration to understand the ways in which these algorithms can best be integrated.
Collapse
Affiliation(s)
- Khang Duy Ricky Le
- Department of General Surgical Specialties, The Royal Melbourne Hospital, Melbourne, VIC, Australia
- Department of Surgical Oncology, Peter MacCallum Cancer Centre, Melbourne, VIC, Australia
- Geelong Clinical School, Deakin University, Geelong, VIC, Australia
- Department of Medical Education, The University of Melbourne, Melbourne, VIC, Australia
| | - Samuel Boon Ping Tay
- Department of Anaesthesia and Pain Medicine, Eastern Health, Box Hill, VIC, Australia
| | - Kay Tai Choy
- Department of Surgery, Austin Health, Melbourne, VIC, Australia
| | - Johan Verjans
- Australian Institute for Machine Learning (AIML), University of Adelaide, Adelaide, SA, Australia
- Lifelong Health Theme (Platform AI), South Australian Health and Medical Research Institute, Adelaide, SA, Australia
| | - Nicola Sasanelli
- Division of Information Technology, Engineering and the Environment, University of South Australia, Adelaide, SA, Australia
- Department of Operations (Strategic and International Partnerships), SmartSAT Cooperative Research Centre, Adelaide, SA, Australia
- Agora High Tech, Adelaide, SA, Australia
| | - Joseph C. H. Kong
- Department of Surgical Oncology, Peter MacCallum Cancer Centre, Melbourne, VIC, Australia
- Monash University Department of Surgery, Alfred Hospital, Melbourne, VIC, Australia
- Department of Colorectal Surgery, Alfred Hospital, Melbourne, VIC, Australia
- Sir Peter MacCallum Department of Oncology, The University of Melbourne, Melbourne, VIC, Australia
| |
Collapse
|
21
|
Roberts K, Chin AT, Loewy K, Pompeii L, Shin H, Rider NL. Natural language processing of clinical notes enables early inborn error of immunity risk ascertainment. THE JOURNAL OF ALLERGY AND CLINICAL IMMUNOLOGY. GLOBAL 2024; 3:100224. [PMID: 38439946 PMCID: PMC10910118 DOI: 10.1016/j.jacig.2024.100224] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/06/2023] [Revised: 12/24/2023] [Accepted: 01/21/2024] [Indexed: 03/06/2024]
Abstract
Background There are now approximately 450 discrete inborn errors of immunity (IEI) described; however, diagnostic rates remain suboptimal. Use of structured health record data has proven useful for patient detection but may be augmented by natural language processing (NLP). Here we present a machine learning model that can distinguish patients from controls significantly in advance of ultimate diagnosis date. Objective We sought to create an NLP machine learning algorithm that could identify IEI patients early during the disease course and shorten the diagnostic odyssey. Methods Our approach involved extracting a large corpus of IEI patient clinical-note text from a major referral center's electronic health record (EHR) system and a matched control corpus for comparison. We built text classifiers with simple machine learning methods and trained them on progressively longer time epochs before date of diagnosis. Results The top performing NLP algorithm effectively distinguished cases from controls robustly 36 months before ultimate clinical diagnosis (area under precision recall curve > 0.95). Corpus analysis demonstrated that statistically enriched, IEI-relevant terms were evident 24+ months before diagnosis, validating that clinical notes can provide a signal for early prediction of IEI. Conclusion Mining EHR notes with NLP holds promise for improving early IEI patient detection.
Collapse
Affiliation(s)
- Kirk Roberts
- McWilliams School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, Tex
| | - Aaron T. Chin
- Division of Immunology, Allergy, and Rheumatology, University of California, Los Angeles, Calif
| | | | - Lisa Pompeii
- Department of Patient Services, Cincinnati Children’s Hospital Medical Center, Cincinnati, Ohio
| | - Harold Shin
- College of Osteopathic Medicine, Liberty University, Lynchburg, Va
| | - Nicholas L. Rider
- Division of Health System & Implementation Science, Virginia Tech Carilion School of Medicine, Roanoke, Va
- Section of Allergy and Immunology, Carilion Clinic, Roanoke, Va
| |
Collapse
|
22
|
Harris C, Tang Y, Birnbaum E, Cherian C, Mendhe D, Chen MH. Digital Neuropsychology beyond Computerized Cognitive Assessment: Applications of Novel Digital Technologies. Arch Clin Neuropsychol 2024; 39:290-304. [PMID: 38520381 DOI: 10.1093/arclin/acae016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2024] [Accepted: 02/16/2024] [Indexed: 03/25/2024] Open
Abstract
Compared with other health disciplines, there is a stagnation in technological innovation in the field of clinical neuropsychology. Traditional paper-and-pencil tests have a number of shortcomings, such as low-frequency data collection and limitations in ecological validity. While computerized cognitive assessment may help overcome some of these issues, current computerized paradigms do not address the majority of these limitations. In this paper, we review recent literature on the applications of novel digital health approaches, including ecological momentary assessment, smartphone-based assessment and sensors, wearable devices, passive driving sensors, smart homes, voice biomarkers, and electronic health record mining, in neurological populations. We describe how each digital tool may be applied to neurologic care and overcome limitations of traditional neuropsychological assessment. Ethical considerations, limitations of current research, as well as our proposed future of neuropsychological practice are also discussed.
Collapse
Affiliation(s)
- Che Harris
- Institute for Health, Health Care Policy and Aging Research, Rutgers University, New Brunswick, NJ, USA
- Department of Neurology, Robert Wood Johnson Medical School, Rutgers University, New Brunswick, NJ, USA
| | - Yingfei Tang
- Institute for Health, Health Care Policy and Aging Research, Rutgers University, New Brunswick, NJ, USA
- Department of Neurology, Robert Wood Johnson Medical School, Rutgers University, New Brunswick, NJ, USA
| | - Eliana Birnbaum
- Institute for Health, Health Care Policy and Aging Research, Rutgers University, New Brunswick, NJ, USA
| | - Christine Cherian
- Institute for Health, Health Care Policy and Aging Research, Rutgers University, New Brunswick, NJ, USA
| | - Dinesh Mendhe
- Institute for Health, Health Care Policy and Aging Research, Rutgers University, New Brunswick, NJ, USA
| | - Michelle H Chen
- Institute for Health, Health Care Policy and Aging Research, Rutgers University, New Brunswick, NJ, USA
- Department of Neurology, Robert Wood Johnson Medical School, Rutgers University, New Brunswick, NJ, USA
| |
Collapse
|
23
|
Reading Turchioe M, Volodarskiy A, Guo W, Taylor B, Hobensack M, Pathak J, Slotwiner D. Characterizing atrial fibrillation symptom improvement following de novo catheter ablation. Eur J Cardiovasc Nurs 2024; 23:241-250. [PMID: 37479225 PMCID: PMC11008952 DOI: 10.1093/eurjcn/zvad068] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/20/2023] [Revised: 07/05/2023] [Accepted: 07/18/2023] [Indexed: 07/23/2023]
Abstract
AIMS Atrial fibrillation (AF) symptom relief is a primary indication for catheter ablation, but AF symptom resolution is not well characterized. The study objective was to describe AF symptom documentation in electronic health records (EHRs) pre- and post-ablation and identify correlates of post-ablation symptoms. METHODS AND RESULTS We conducted a retrospective cohort study using EHRs of patients with AF (n = 1293), undergoing ablation in a large, urban health system from 2010 to 2020. We extracted symptom data from clinical notes using a natural language processing algorithm (F score: 0.81). We used Cochran's Q tests with post-hoc McNemar's tests to determine differences in symptom prevalence pre- and post-ablation. We used logistic regression models to estimate the adjusted odds of symptom resolution by personal or clinical characteristics at 6 and 12 months post-ablation. In fully adjusted models, at 12 months post-ablation patients, patients with heart failure had significantly lower odds of dyspnoea resolution [odds ratio (OR) 0.38, 95% confidence interval (CI) 0.25-0.57], oedema resolution (OR 0.37, 95% CI 0.25-0.56), and fatigue resolution (OR 0.54, 95% CI 0.34-0.85), but higher odds of palpitations resolution (OR 1.90, 95% CI 1.25-2.89) compared with those without heart failure. Age 65 and older, female sex, Black or African American race, smoking history, and antiarrhythmic use were also associated with lower odds of resolution of specific symptoms at 6 and 12 months. CONCLUSION The post-ablation symptom patterns are heterogeneous. Findings warrant confirmation with larger, more representative data sets, which may be informative for patients whose primary goal for undergoing an ablation is symptom relief.
Collapse
Affiliation(s)
| | - Alexander Volodarskiy
- Department of Cardiology, NewYork-Presbyterian Queens Hospital, 56-45 Main St, Queens, NY 11355, USA
- Department of Population Health Sciences, Weill Cornell Medicine, 402 E 67th St, New York, NY 10065, USA
| | - Winston Guo
- Department of Population Health Sciences, Weill Cornell Medicine, 402 E 67th St, New York, NY 10065, USA
| | - Brittany Taylor
- Columbia University School of Nursing, 560 W. 168th Street, New York, NY 10032, USA
| | - Mollie Hobensack
- Columbia University School of Nursing, 560 W. 168th Street, New York, NY 10032, USA
| | - Jyotishman Pathak
- Department of Population Health Sciences, Weill Cornell Medicine, 402 E 67th St, New York, NY 10065, USA
| | - David Slotwiner
- Department of Cardiology, NewYork-Presbyterian Queens Hospital, 56-45 Main St, Queens, NY 11355, USA
- Department of Population Health Sciences, Weill Cornell Medicine, 402 E 67th St, New York, NY 10065, USA
| |
Collapse
|
24
|
Sim JA, Huang X, Horan MR, Baker JN, Huang IC. Using natural language processing to analyze unstructured patient-reported outcomes data derived from electronic health records for cancer populations: a systematic review. Expert Rev Pharmacoecon Outcomes Res 2024; 24:467-475. [PMID: 38383308 PMCID: PMC11001514 DOI: 10.1080/14737167.2024.2322664] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2023] [Accepted: 02/20/2024] [Indexed: 02/23/2024]
Abstract
INTRODUCTION Patient-reported outcomes (PROs; symptoms, functional status, quality-of-life) expressed in the 'free-text' or 'unstructured' format within clinical notes from electronic health records (EHRs) offer valuable insights beyond biological and clinical data for medical decision-making. However, a comprehensive assessment of utilizing natural language processing (NLP) coupled with machine learning (ML) methods to analyze unstructured PROs and their clinical implementation for individuals affected by cancer remains lacking. AREAS COVERED This study aimed to systematically review published studies that used NLP techniques to extract and analyze PROs in clinical narratives from EHRs for cancer populations. We examined the types of NLP (with and without ML) techniques and platforms for data processing, analysis, and clinical applications. EXPERT OPINION Utilizing NLP methods offers a valuable approach for processing and analyzing unstructured PROs among cancer patients and survivors. These techniques encompass a broad range of applications, such as extracting or recognizing PROs, categorizing, characterizing, or grouping PROs, predicting or stratifying risk for unfavorable clinical results, and evaluating connections between PROs and adverse clinical outcomes. The employment of NLP techniques is advantageous in converting substantial volumes of unstructured PRO data within EHRs into practical clinical utilities for individuals with cancer.
Collapse
Affiliation(s)
- Jin-ah Sim
- Department of Epidemiology and Cancer Control, St. Jude Children’s Research Hospital, Memphis, TN, USA
- Department of AI Convergence, Hallym University, Chuncheon, Republic of Korea
| | - Xiaolei Huang
- Department of Computer Science, University of Memphis, Memphis, Tennessee, United States
| | - Madeline R. Horan
- Department of Epidemiology and Cancer Control, St. Jude Children’s Research Hospital, Memphis, TN, USA
| | - Justin N. Baker
- Department of Oncology, St. Jude Children’s Research Hospital, Memphis, TN, USA
| | - I-Chan Huang
- Department of Epidemiology and Cancer Control, St. Jude Children’s Research Hospital, Memphis, TN, USA
| |
Collapse
|
25
|
Leal I, Nogueira V, Matos DB, Araújo J, Berens O, Ribeiro M, Furtado MJ, Liverani M, Silva MI, Guedes M, Cordeiro M, Ribeiro M, José P, Barão R, Nunes Ferreira R, Fonseca S, Mano S, Pina S, Santos MJ, Fonseca JE, Fonseca C, Figueira L. Design and Development of a Web-Based Prospective Nationwide Registry for Ocular Inflammatory Diseases: UVEITE.PT - The Portuguese Ocular Inflammation Registry. Ocul Immunol Inflamm 2024; 32:342-350. [PMID: 36780588 DOI: 10.1080/09273948.2023.2171891] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2022] [Revised: 01/11/2023] [Accepted: 01/18/2023] [Indexed: 02/15/2023]
Abstract
Uveitis is a heterogeneous collection of infrequent diseases, which poses significant challenges to cost-effective research in the field. Medical registries are being increasingly recognized as crucial tools to provide high-quality data, thus enabling prospective clinical research. This paper describes the design and technical structure development of an innovative countrywide electronic medical record for uveitis, Uveite.pt, and gives an overview of the cohort registered since its foundation, March 2020.Uveite.pt is an electronic medical record platform developed by the Portuguese Ocular Inflammation Group (POIG), a scientific committee of the Portuguese Ophthalmology Society. This is a nationwide customized web-based platform for uveitis patients useful for both clinical practice and real-world-based research, working as a central repository and reporting tool for uveitis. This paper describes the technical principles, the design and the development of a web-based interoperable registry for uveitis in Portugal and provides an overview of more than 400 patients registered in the first 18 months since inception.In infrequent diseases, the existence of registries enables to gather evidence and increase research possibilities to clinicians. The adoption of this platform enables standardization and improvement of clinical practice in uveitis. It is useful to apprehend the repercussion of medical and surgical treatments in uveitis and scleritis, supporting clinicians in the strict monitoring of drug adverse reactions and surgical outcomes.
Collapse
Affiliation(s)
- Inês Leal
- Ophthalmology Department, Hospital de Santa Maria, Centro Hospitalar Universitário Lisboa Norte, Centro Académico de Medicina de Lisboa, Lisbon, Portugal
- Centro de Estudos das Ciências da Visão, Faculdade de Medicina, Universidade de Lisboa, Lisbon, Portugal
| | - Vanda Nogueira
- Instituto de Oftalmologia Dr. Gama Pinto, Lisbon, Portugal
| | - Diogo Bernardo Matos
- Ophthalmology Department, Hospital de Santa Maria, Centro Hospitalar Universitário Lisboa Norte, Centro Académico de Medicina de Lisboa, Lisbon, Portugal
- Centro de Estudos das Ciências da Visão, Faculdade de Medicina, Universidade de Lisboa, Lisbon, Portugal
| | - Joana Araújo
- Ophthalmology Department, Centro Hospitalar Universitário São João, Porto, Portugal
- Departamento de Cirurgia e Fisiologia, Faculdade de Medicina, Universidade do Porto, Porto, Portugal
| | - Olga Berens
- Ophthalmology Department, Hospital do Espírito Santo, Évora, Portugal
| | - Margarida Ribeiro
- Ophthalmology Department, Centro Hospitalar Universitário São João, Porto, Portugal
- Department of Biomedicine, Unit of Pharmacology and Therapeutics, Faculdade de Medicina, Universidade do Porto, Porto, Portugal
| | - Maria João Furtado
- Ophthalmology Department, Centro Hospitalar Universitário do Porto, Porto, Portugal
| | - Marco Liverani
- Ophthalmology Department, Hospital de Vila Franca de Xira, Vila Franca de Xira, Portugal
| | - Marta Inês Silva
- Ophthalmology Department, Centro Hospitalar Universitário São João, Porto, Portugal
| | - Marta Guedes
- Ophthalmology Department, Hospital Egas Moniz, Centro Hospitalar de Lisboa Ocidental, Lisbon, Portugal
| | - Miguel Cordeiro
- Ophthalmology Department, Hospital Egas Moniz, Centro Hospitalar de Lisboa Ocidental, Lisbon, Portugal
| | - Miguel Ribeiro
- Ophthalmology Department, Centro Hospitalar Tondela-Viseu, Viseu, Portugal
| | - Patrícia José
- Ophthalmology Department, Hospital de Santa Maria, Centro Hospitalar Universitário Lisboa Norte, Centro Académico de Medicina de Lisboa, Lisbon, Portugal
- Centro de Estudos das Ciências da Visão, Faculdade de Medicina, Universidade de Lisboa, Lisbon, Portugal
| | - Rafael Barão
- Ophthalmology Department, Hospital de Santa Maria, Centro Hospitalar Universitário Lisboa Norte, Centro Académico de Medicina de Lisboa, Lisbon, Portugal
- Centro de Estudos das Ciências da Visão, Faculdade de Medicina, Universidade de Lisboa, Lisbon, Portugal
| | - Rui Nunes Ferreira
- Ophthalmology Department, Hospital de Santa Maria, Centro Hospitalar Universitário Lisboa Norte, Centro Académico de Medicina de Lisboa, Lisbon, Portugal
- Centro de Estudos das Ciências da Visão, Faculdade de Medicina, Universidade de Lisboa, Lisbon, Portugal
| | - Sofia Fonseca
- Ophthalmology Department, Centro Hospitalar de Vila Nova de Gaia/Espinho, Vila Nova de Gaia, Portugal
| | - Sofia Mano
- Ophthalmology Department, Hospital de Santa Maria, Centro Hospitalar Universitário Lisboa Norte, Centro Académico de Medicina de Lisboa, Lisbon, Portugal
- Centro de Estudos das Ciências da Visão, Faculdade de Medicina, Universidade de Lisboa, Lisbon, Portugal
| | - Susana Pina
- Ophthalmology Department, Hospital Beatriz Ângelo, Loures, Portugal
| | - Maria José Santos
- Rheumatology Department, Hospital Garcia de Orta, Almada, Portugal
- Rheumatology Research Unit, Instituto de Medicina Molecular, Faculdade de Medicina, Universidade de Lisboa, Centro Académico de Medicina de Lisboa, Lisbon, Portugal
| | - João Eurico Fonseca
- Rheumatology Research Unit, Instituto de Medicina Molecular, Faculdade de Medicina, Universidade de Lisboa, Centro Académico de Medicina de Lisboa, Lisbon, Portugal
- Rheumatology Department, Hospital de Santa Maria, Centro Hospitalar Universitário Lisboa Norte, Lisbon, Portugal
| | - Cristina Fonseca
- Ophthalmology Department, Centro de Responsabilidade Integrado de Oftalmologia, Centro Hospitalar e Universitário de Coimbra, Coimbra, Portugal
| | - Luís Figueira
- Ophthalmology Department, Centro Hospitalar Universitário São João, Porto, Portugal
- Center for Drug Discovery and Innovative Medicines (MedInUP) of the University of Porto, Porto, Portugal
| |
Collapse
|
26
|
Mashima Y, Tanigawa M, Yokoi H. Information heterogeneity between progress notes by physicians and nurses for inpatients with digestive system diseases. Sci Rep 2024; 14:7656. [PMID: 38561333 PMCID: PMC10984979 DOI: 10.1038/s41598-024-56324-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2023] [Accepted: 03/05/2024] [Indexed: 04/04/2024] Open
Abstract
This study focused on the heterogeneity in progress notes written by physicians or nurses. A total of 806 days of progress notes written by physicians or nurses from 83 randomly selected patients hospitalized in the Gastroenterology Department at Kagawa University Hospital from January to December 2021 were analyzed. We extracted symptoms as the International Classification of Diseases (ICD) Chapter 18 (R00-R99, hereinafter R codes) from each progress note using MedNER-J natural language processing software and counted the days one or more symptoms were extracted to calculate the extraction rate. The R-code extraction rate was significantly higher from progress notes by nurses than by physicians (physicians 68.5% vs. nurses 75.2%; p = 0.00112), regardless of specialty. By contrast, the R-code subcategory R10-R19 for digestive system symptoms (44.2 vs. 37.5%, respectively; p = 0.00299) and many chapters of ICD codes for disease names, as represented by Chapter 11 K00-K93 (68.4 vs. 30.9%, respectively; p < 0.001), were frequently extracted from the progress notes by physicians, reflecting their specialty. We believe that understanding the information heterogeneity of medical documents, which can be the basis of medical artificial intelligence, is crucial, and this study is a pioneering step in that direction.
Collapse
Affiliation(s)
- Yukinori Mashima
- Clinical Research Support Center, Kagawa University Hospital, 1750-1 Ikenobe, Miki-cho, Kita-gun, Kagawa, 761-0793, Japan.
- Department of Medical Informatics, Faculty of Medicine, Kagawa University, Kagawa, Japan.
| | - Masatoshi Tanigawa
- Clinical Research Support Center, Kagawa University Hospital, 1750-1 Ikenobe, Miki-cho, Kita-gun, Kagawa, 761-0793, Japan
| | - Hideto Yokoi
- Clinical Research Support Center, Kagawa University Hospital, 1750-1 Ikenobe, Miki-cho, Kita-gun, Kagawa, 761-0793, Japan
- Department of Medical Informatics, Faculty of Medicine, Kagawa University, Kagawa, Japan
| |
Collapse
|
27
|
Irrera O, Marchesin S, Silvello G. MetaTron: advancing biomedical annotation empowering relation annotation and collaboration. BMC Bioinformatics 2024; 25:112. [PMID: 38486137 PMCID: PMC10941452 DOI: 10.1186/s12859-024-05730-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2023] [Accepted: 03/04/2024] [Indexed: 03/17/2024] Open
Abstract
BACKGROUND The constant growth of biomedical data is accompanied by the need for new methodologies to effectively and efficiently extract machine-readable knowledge for training and testing purposes. A crucial aspect in this regard is creating large, often manually or semi-manually, annotated corpora vital for developing effective and efficient methods for tasks like relation extraction, topic recognition, and entity linking. However, manual annotation is expensive and time-consuming especially if not assisted by interactive, intuitive, and collaborative computer-aided tools. To support healthcare experts in the annotation process and foster annotated corpora creation, we present MetaTron. MetaTron is an open-source and free-to-use web-based annotation tool to annotate biomedical data interactively and collaboratively; it supports both mention-level and document-level annotations also integrating automatic built-in predictions. Moreover, MetaTron enables relation annotation with the support of ontologies, functionalities often overlooked by off-the-shelf annotation tools. RESULTS We conducted a qualitative analysis to compare MetaTron with a set of manual annotation tools including TeamTat, INCEpTION, LightTag, MedTAG, and brat, on three sets of criteria: technical, data, and functional. A quantitative evaluation allowed us to assess MetaTron performances in terms of time and number of clicks to annotate a set of documents. The results indicated that MetaTron fulfills almost all the selected criteria and achieves the best performances. CONCLUSIONS MetaTron stands out as one of the few annotation tools targeting the biomedical domain supporting the annotation of relations, and fully customizable with documents in several formats-PDF included, as well as abstracts retrieved from PubMed, Semantic Scholar, and OpenAIRE. To meet any user need, we released MetaTron both as an online instance and as a Docker image locally deployable.
Collapse
Affiliation(s)
- Ornella Irrera
- Department of Information Engineering, University of Padova, Padua, Italy.
| | - Stefano Marchesin
- Department of Information Engineering, University of Padova, Padua, Italy
| | - Gianmaria Silvello
- Department of Information Engineering, University of Padova, Padua, Italy
| |
Collapse
|
28
|
Hughes JA, Hazelwood S, Lyrstedt AL, Jones L, Brown NJ, Jarugula R, Douglas C, Chu K. Enhancing pain care with the American Pain Society Patient Outcome Questionnaire for use in the emergency department (APS-POQ-RED): validating a patient-reported outcome measure. BMJ Open Qual 2024; 13:e002295. [PMID: 38448040 PMCID: PMC10916172 DOI: 10.1136/bmjoq-2023-002295] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2023] [Accepted: 12/02/2023] [Indexed: 03/08/2024] Open
Abstract
BACKGROUND In general, the quality of pain care in emergency departments (ED) is poor, despite up to 80% of all ED patients presenting with pain. This may be due to the lack of well-validated patient-reported outcome measures (PROMs) of pain care in the ED setting. The American Pain Society-Patient Outcome Questionnaire-Revised Edition (APS-POQ-R), with slight modification for ED patients, is a potentially useful PROM for the adult ED, however it is yet to be completely validated. METHODS Adult patients, who had presented with moderate to severe acute pain, were recruited at two large inner-city EDs in Australia. A modified version of the APS-POQ-R was administered at the completion of their ED care. Responses were randomly split into three groups and underwent multiple rounds of exploratory and confirmatory factor analysis with testing for construct, convergent, divergent validity and internal consistency. RESULTS A total of 646 ED patients (55.6% female), with a median age of 48.3 years, and moderate to severe pain on arrival, completed the ED-modified APS-POQ-R. Psychometric evaluation resulted in a reduced nine-question tool, which measures three constructs (pain relief and satisfaction (α=0.891), affective distress (α=0.823) and pain interference (α=0.908)) and demonstrated construct, convergent, divergent validity, and internal consistency. CONCLUSIONS This new tool, which we refer to as the American Pain Society-Patient Outcome Questionnaire-Revised for the ED (APS-POQ-RED), should form the basis for reporting patient-reported outcomes of ED pain care in future quality improvement and research.
Collapse
Affiliation(s)
- James A Hughes
- School of Nursing, Queensland University of Technology, Kelvin Grove, Queensland, Australia
| | - Sarah Hazelwood
- Emergency Department, The Prince Charles Hospital, Chermside, Queensland, Australia
| | - Anna-Lisa Lyrstedt
- Emergency and Trauma Centre, Royal Brisbane and Women's Hospital, Herston, Queensland, Australia
| | - Lee Jones
- Queensland University of Technology, Brisbane, Queensland, Australia
- QIMR Berghofer Medical Research Institute, Herston, Queensland, Australia
| | - Nathan J Brown
- Emergency and Trauma Centre, Royal Brisbane and Women's Hospital, Herston, Queensland, Australia
- Faculty of Medicine, University of Queensland Faculty of Medicine and Biomedical Sciences, Herston, Queensland, Australia
| | - Rajeev Jarugula
- Emergency Department, The Prince Charles Hospital, Chermside, Queensland, Australia
| | - Clint Douglas
- Faculty of Health, Queensland University of Technology, Brisbane, Queensland, Australia
- Metro North Hospital and Health Service, Herston, Queensland, Australia
| | - Kevin Chu
- Emergency and Trauma Centre, Royal Brisbane and Women's Hospital, Herston, Queensland, Australia
- Faculty of Medicine, University of Queensland Faculty of Medicine and Biomedical Sciences, Herston, Queensland, Australia
| |
Collapse
|
29
|
Margetta J, Sale A. Distinguishing cardiac catheter ablation energy modalities by applying natural language processing to electronic health records. J Comp Eff Res 2024; 13:e230053. [PMID: 38261335 PMCID: PMC10945417 DOI: 10.57264/cer-2023-0053] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2023] [Accepted: 01/03/2024] [Indexed: 01/24/2024] Open
Abstract
Aim: Catheter ablation is used to treat symptomatic atrial fibrillation (AF) and is performed using either cryoballoon (CB) or radiofrequency (RF) ablation. There is limited real world data of CB and RF in the US as healthcare codes are agnostic of energy modality. An alternative method is to analyze patients' electronic health records (EHRs) using Optum's EHR database. Objective: To determine the feasibility of using patients' EHRs with natural language processing (NLP) to distinguish CB versus RF ablation procedures. Data Source: Optum® de-identified EHR dataset, Optum® Cardiac Ablation NLP Table. Methods: This was a retrospective analysis of existing de-identified EHR data. Medical codes were used to create an ablation validation table. Frequency analysis was used to assess ablation procedures and their associated note terms. Two cohorts were created (1) index procedures, (2) multiple procedures. Possible note term combinations included (1) cryoablation (2) radiofrequency (3) ablation, or (4) both. Results: Of the 40,810 validated cardiac ablations, 3777 (9%) index ablation procedures had available and matching NLP note terms. Of these, 22% (n = 844) were classified as ablation, 27% (n = 1016) as cryoablation, 49% (n = 1855) as radiofrequency ablation, and 1.6% (n = 62) as both. In the multiple procedures analysis, 5691 (14%) procedures had matching note terms. 24% (n = 1362) were classified as ablation, 27% as cryoablation, 47% as radiofrequency ablation, and 2% as both. Conclusion: NLP has potential to evaluate the frequency of cardiac ablation by type, however, for this to be a reliable real-world data source, mandatory data entry by providers and standardized electronic health reporting must occur.
Collapse
Affiliation(s)
- Jamie Margetta
- Department of Health Economics & Outcomes Research, Medtronic, Mounds View, MN 55112, USA
| | - Alicia Sale
- Department of Health Economics & Outcomes Research, Medtronic, Mounds View, MN 55112, USA
| |
Collapse
|
30
|
Pankratz N, Cole BR, Beutel KM, Liao KP, Ashe J. Parkinson Disease Genetics Extended to African and Hispanic Ancestries in the VA Million Veteran Program. Neurol Genet 2024; 10:e200110. [PMID: 38130828 PMCID: PMC10732342 DOI: 10.1212/nxg.0000000000200110] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2023] [Accepted: 10/06/2023] [Indexed: 12/23/2023]
Abstract
Background and Objectives Nearly all genetic analyses of Parkinson disease (PD) have been in populations of European ancestry. We sought to test the ability of a machine learning method to extract accurate PD diagnoses from an electronic medical record (EMR) system, to see whether genetic variants identified in European populations generalize to individuals of African and Hispanic ancestries, and to compare the rates of PD across ancestries. Methods A machine learning method using natural language processing was applied to EMRs of US veterans participating in the VA Million Veteran Program (MVP) to identify individuals with PD. These putative cases were vetted via blind chart review by a movement disorder specialist. A polygenic risk score (PRS) of 90 established genetic variants whose genotypes were imputed from a customized Axiom Biobank Array was evaluated in different case groups. Results The EMR prediction scores had a distinct trimodal distribution, with 97% of the high group and only 30% of the middle group having a credible diagnosis of PD. Using the 3,542 cases from the high group matched 4:1 to controls, the PRS was highly predictive in individuals of European ancestry (n = 3,137 cases; OR = 1.82; p = 8.01E-48), and nearly identical effect sizes were seen in individuals of African (n = 184; OR = 2.07; p = 3.4E-4) and Hispanic ancestries (n = 221; OR = 2.13; p = 3.9E-6). The PRS was much less predictive for the 2,757 European ancestry cases who had an ICD code for PD but for whom the machine learning method had a lower confidence in their diagnosis. No novel ancestry-specific genetic variants were identified. Individuals with African ancestry had one-quarter the rate of PD compared with European or Hispanic ancestries aged 60-70 years and one half the rate in the 70-80 years age range. African American cases had a higher proportion of their DNA originating in Europe compared with African American controls. Discussion Machine learning can reliably classify PD using data from a large EMR. Larger studies of non-European populations are required to confirm the generalizability of PD risk variants identified in populations of European ancestry and the increased risk coming from a higher proportion of European DNA in African Americans.
Collapse
Affiliation(s)
- Nathan Pankratz
- From the Department of Laboratory Medicine and Pathology (N.P., B.R.C., K.M.B.), School of Medicine, University of Minnesota, Minneapolis; Division of Rheumatology (K.P.L.), Immunology, and Allergy, Brigham and Women's Hospital; Department of Biomedical Informatics (K.P.L.), Harvard Medical School; Division of Data Sciences (K.P.L.), VA Boston Healthcare System, MA; Department of Neurology (J.A.), University of Minnesota Medical School; and Department of Neurology (J.A.), Minneapolis Veterans Affairs Health Care System, MN
| | - Benjamin R Cole
- From the Department of Laboratory Medicine and Pathology (N.P., B.R.C., K.M.B.), School of Medicine, University of Minnesota, Minneapolis; Division of Rheumatology (K.P.L.), Immunology, and Allergy, Brigham and Women's Hospital; Department of Biomedical Informatics (K.P.L.), Harvard Medical School; Division of Data Sciences (K.P.L.), VA Boston Healthcare System, MA; Department of Neurology (J.A.), University of Minnesota Medical School; and Department of Neurology (J.A.), Minneapolis Veterans Affairs Health Care System, MN
| | - Kathleen M Beutel
- From the Department of Laboratory Medicine and Pathology (N.P., B.R.C., K.M.B.), School of Medicine, University of Minnesota, Minneapolis; Division of Rheumatology (K.P.L.), Immunology, and Allergy, Brigham and Women's Hospital; Department of Biomedical Informatics (K.P.L.), Harvard Medical School; Division of Data Sciences (K.P.L.), VA Boston Healthcare System, MA; Department of Neurology (J.A.), University of Minnesota Medical School; and Department of Neurology (J.A.), Minneapolis Veterans Affairs Health Care System, MN
| | - Katherine P Liao
- From the Department of Laboratory Medicine and Pathology (N.P., B.R.C., K.M.B.), School of Medicine, University of Minnesota, Minneapolis; Division of Rheumatology (K.P.L.), Immunology, and Allergy, Brigham and Women's Hospital; Department of Biomedical Informatics (K.P.L.), Harvard Medical School; Division of Data Sciences (K.P.L.), VA Boston Healthcare System, MA; Department of Neurology (J.A.), University of Minnesota Medical School; and Department of Neurology (J.A.), Minneapolis Veterans Affairs Health Care System, MN
| | - James Ashe
- From the Department of Laboratory Medicine and Pathology (N.P., B.R.C., K.M.B.), School of Medicine, University of Minnesota, Minneapolis; Division of Rheumatology (K.P.L.), Immunology, and Allergy, Brigham and Women's Hospital; Department of Biomedical Informatics (K.P.L.), Harvard Medical School; Division of Data Sciences (K.P.L.), VA Boston Healthcare System, MA; Department of Neurology (J.A.), University of Minnesota Medical School; and Department of Neurology (J.A.), Minneapolis Veterans Affairs Health Care System, MN
| |
Collapse
|
31
|
Abid R, Hussein AA, Guru KA. Artificial Intelligence in Urology: Current Status and Future Perspectives. Urol Clin North Am 2024; 51:117-130. [PMID: 37945097 DOI: 10.1016/j.ucl.2023.06.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2023]
Abstract
Surgical fields, especially urology, have shifted increasingly toward the use of artificial intelligence (AI). Advancements in AI have created massive improvements in diagnostics, outcome predictions, and robotic surgery. For robotic surgery to progress from assisting surgeons to eventually reaching autonomous procedures, there must be advancements in machine learning, natural language processing, and computer vision. Moreover, barriers such as data availability, interpretability of autonomous decision-making, Internet connection and security, and ethical concerns must be overcome.
Collapse
Affiliation(s)
- Rayyan Abid
- Case Western Reserve University, 10900 Euclid Avenue, Cleveland, OH 44106, USA
| | - Ahmed A Hussein
- Department of Urology, Roswell Park Comprehensive Cancer Center
| | - Khurshid A Guru
- Department of Urology, Roswell Park Comprehensive Cancer Center.
| |
Collapse
|
32
|
Abstract
The monitoring of vital signs in patients undergoing anesthesia began with the very first case of anesthesia and has evolved alongside the development of anesthesiology ever since. Patient monitoring started out as a manually performed, intermittent, and qualitative assessment of the patient's general well-being in the operating room. In its evolution, patient monitoring development has responded to the clinical need, for example, when critical incident studies in the 1980s found that many anesthesia adverse events could be prevented by improved monitoring, especially respiratory monitoring. It also facilitated and perhaps even enabled increasingly complex surgeries in increasingly higher-risk patients. For example, it would be very challenging to perform and provide anesthesia care during some of the very complex cardiovascular surgeries that are almost routine today without being able to simultaneously and reliably monitor multiple pressures in a variety of places in the circulatory system. Of course, anesthesia patient monitoring itself is enabled by technological developments in the world outside of the operating room. Throughout its history, anesthesia patient monitoring has taken advantage of advancements in material science (when nonthrombogenic polymers allowed the design of intravascular catheters, for example), in electronics and transducers, in computers, in displays, in information technology, and so forth. Slower product life cycles in medical devices mean that by carefully observing technologies such as consumer electronics, including user interfaces, it is possible to peek ahead and estimate with confidence the foundational technologies that will be used by patient monitors in the near future. Just as the discipline of anesthesiology has, the patient monitoring that accompanies it has come a long way from its beginnings in the mid-19th century. Extrapolating from careful observations of the prevailing trends that have shaped anesthesia patient monitoring historically, patient monitoring in the future will use noncontact technologies, will predict the trajectory of a patient's vital signs, will add regional vital signs to the current systemic ones, and will facilitate directed and supervised anesthesia care over the broader scope that anesthesia will be responsible for.
Collapse
Affiliation(s)
- Kai Kuck
- From the Departments of Anesthesiology and Biomedical Engineering, University of Utah, Salt Lake City, Utah
| | | | | |
Collapse
|
33
|
Hassan E, Abd El-Hafeez T, Shams MY. Optimizing classification of diseases through language model analysis of symptoms. Sci Rep 2024; 14:1507. [PMID: 38233458 PMCID: PMC10794698 DOI: 10.1038/s41598-024-51615-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2023] [Accepted: 01/07/2024] [Indexed: 01/19/2024] Open
Abstract
This paper investigated the use of language models and deep learning techniques for automating disease prediction from symptoms. Specifically, we explored the use of two Medical Concept Normalization-Bidirectional Encoder Representations from Transformers (MCN-BERT) models and a Bidirectional Long Short-Term Memory (BiLSTM) model, each optimized with a different hyperparameter optimization method, to predict diseases from symptom descriptions. In this paper, we utilized two distinct dataset called Dataset-1, and Dataset-2. Dataset-1 consists of 1,200 data points, with each point representing a unique combination of disease labels and symptom descriptions. While, Dataset-2 is designed to identify Adverse Drug Reactions (ADRs) from Twitter data, comprising 23,516 rows categorized as ADR (1) or Non-ADR (0) tweets. The results indicate that the MCN-BERT model optimized with AdamP achieved 99.58% accuracy for Dataset-1 and 96.15% accuracy for Dataset-2. The MCN-BERT model optimized with AdamW performed well with 98.33% accuracy for Dataset-1 and 95.15% for Dataset-2, while the BiLSTM model optimized with Hyperopt achieved 97.08% accuracy for Dataset-1 and 94.15% for Dataset-2. Our findings suggest that language models and deep learning techniques have promise for supporting earlier detection and more prompt treatment of diseases, as well as expanding remote diagnostic capabilities. The MCN-BERT and BiLSTM models demonstrated robust performance in accurately predicting diseases from symptoms, indicating the potential for further related research.
Collapse
Affiliation(s)
- Esraa Hassan
- Faculty of Artificial Intelligence, Kafrelsheikh University, Kafrelsheikh, 33516, Egypt.
| | - Tarek Abd El-Hafeez
- Department of Computer Science, Faculty of Science, Minia University, Minia, 61519, Egypt.
- Computer Science Unit, Deraya University, Minia University, Minia, 61765, Egypt.
| | - Mahmoud Y Shams
- Faculty of Artificial Intelligence, Kafrelsheikh University, Kafrelsheikh, 33516, Egypt.
| |
Collapse
|
34
|
Xie F, Chang J, Luong T, Wu B, Lustigova E, Shrader E, Chen W. Identifying Symptoms Prior to Pancreatic Ductal Adenocarcinoma Diagnosis in Real-World Care Settings: Natural Language Processing Approach. JMIR AI 2024; 3:e51240. [PMID: 38875566 PMCID: PMC11041417 DOI: 10.2196/51240] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/26/2023] [Revised: 12/08/2023] [Accepted: 12/16/2023] [Indexed: 06/16/2024]
Abstract
BACKGROUND Pancreatic cancer is the third leading cause of cancer deaths in the United States. Pancreatic ductal adenocarcinoma (PDAC) is the most common form of pancreatic cancer, accounting for up to 90% of all cases. Patient-reported symptoms are often the triggers of cancer diagnosis and therefore, understanding the PDAC-associated symptoms and the timing of symptom onset could facilitate early detection of PDAC. OBJECTIVE This paper aims to develop a natural language processing (NLP) algorithm to capture symptoms associated with PDAC from clinical notes within a large integrated health care system. METHODS We used unstructured data within 2 years prior to PDAC diagnosis between 2010 and 2019 and among matched patients without PDAC to identify 17 PDAC-related symptoms. Related terms and phrases were first compiled from publicly available resources and then recursively reviewed and enriched with input from clinicians and chart review. A computerized NLP algorithm was iteratively developed and fine-trained via multiple rounds of chart review followed by adjudication. Finally, the developed algorithm was applied to the validation data set to assess performance and to the study implementation notes. RESULTS A total of 408,147 and 709,789 notes were retrieved from 2611 patients with PDAC and 10,085 matched patients without PDAC, respectively. In descending order, the symptom distribution of the study implementation notes ranged from 4.98% for abdominal or epigastric pain to 0.05% for upper extremity deep vein thrombosis in the PDAC group, and from 1.75% for back pain to 0.01% for pale stool in the non-PDAC group. Validation of the NLP algorithm against adjudicated chart review results of 1000 notes showed that precision ranged from 98.9% (jaundice) to 84% (upper extremity deep vein thrombosis), recall ranged from 98.1% (weight loss) to 82.8% (epigastric bloating), and F1-scores ranged from 0.97 (jaundice) to 0.86 (depression). CONCLUSIONS The developed and validated NLP algorithm could be used for the early detection of PDAC.
Collapse
Affiliation(s)
- Fagen Xie
- Department of Research and Evaluation, Kaiser Permanente Southern California, Pasadena, CA, United States
| | - Jenny Chang
- Department of Research and Evaluation, Kaiser Permanente Southern California, Pasadena, CA, United States
| | - Tiffany Luong
- Department of Research and Evaluation, Kaiser Permanente Southern California, Pasadena, CA, United States
| | - Bechien Wu
- Department of Research and Evaluation, Kaiser Permanente Southern California, Pasadena, CA, United States
| | - Eva Lustigova
- Department of Research and Evaluation, Kaiser Permanente Southern California, Pasadena, CA, United States
| | - Eva Shrader
- Pancreatic Cancer Action Network, Manhattan Beach, CA, United States
| | - Wansu Chen
- Department of Research and Evaluation, Kaiser Permanente Southern California, Pasadena, CA, United States
| |
Collapse
|
35
|
Wei WI, Leung CLK, Tang A, McNeil EB, Wong SYS, Kwok KO. Extracting symptoms from free-text responses using ChatGPT among COVID-19 cases in Hong Kong. Clin Microbiol Infect 2024; 30:142.e1-142.e3. [PMID: 37949111 DOI: 10.1016/j.cmi.2023.11.002] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2023] [Revised: 11/01/2023] [Accepted: 11/03/2023] [Indexed: 11/12/2023]
Abstract
OBJECTIVES To investigate the feasibility and performance of Chat Generative Pretrained Transformer (ChatGPT) in converting symptom narratives into structured symptom labels. METHODS We extracted symptoms from 300 deidentified symptom narratives of COVID-19 patients by a computer-based matching algorithm (the standard), and prompt engineering in ChatGPT. Common symptoms were those with a prevalence >10% according to the standard, and similarly less common symptoms were those with a prevalence of 2-10%. The precision of ChatGPT was compared with the standard using sensitivity and specificity with 95% exact binomial CIs (95% binCIs). In ChatGPT, we prompted without examples (zero-shot prompting) and with examples (few-shot prompting). RESULTS In zero-shot prompting, GPT-4 achieved high specificity (0.947 [95% binCI: 0.894-0.978]-1.000 [95% binCI: 0.965-0.988, 1.000]) for all symptoms, high sensitivity for common symptoms (0.853 [95% binCI: 0.689-0.950]-1.000 [95% binCI: 0.951-1.000]), and moderate sensitivity for less common symptoms (0.200 [95% binCI: 0.043-0.481]-1.000 [95% binCI: 0.590-0.815, 1.000]). Few-shot prompting increased the sensitivity and specificity. GPT-4 outperformed GPT-3.5 in response accuracy and consistent labelling. DISCUSSION This work substantiates ChatGPT's role as a research tool in medical fields. Its performance in converting symptom narratives to structured symptom labels was encouraging, saving time and effort in compiling the task-specific training data. It potentially accelerates free-text data compilation and synthesis in future disease outbreaks and improves the accuracy of symptom checkers. Focused prompt training addressing ambiguous descriptions impacts medical research positively.
Collapse
Affiliation(s)
- Wan In Wei
- JC School of Public Health and Primary Care, The Chinese University of Hong Kong, Hong Kong Special Administrative Region, China
| | - Cyrus Lap Kwan Leung
- JC School of Public Health and Primary Care, The Chinese University of Hong Kong, Hong Kong Special Administrative Region, China
| | - Arthur Tang
- Department of Information Technology, School of Science, Engineering and Technology, RMIT University, Vietnam
| | - Edward Braddon McNeil
- JC School of Public Health and Primary Care, The Chinese University of Hong Kong, Hong Kong Special Administrative Region, China
| | - Samuel Yeung Shan Wong
- JC School of Public Health and Primary Care, The Chinese University of Hong Kong, Hong Kong Special Administrative Region, China
| | - Kin On Kwok
- JC School of Public Health and Primary Care, The Chinese University of Hong Kong, Hong Kong Special Administrative Region, China; Stanley Ho Centre for Emerging Infectious Diseases, The Chinese University of Hong Kong, Hong Kong Special Administrative Region, China; Hong Kong Institute of Asia-Pacific Studies, The Chinese University of Hong Kong, Hong Kong Special Administrative Region, China; Department of Infectious Disease Epidemiology, School of Public Health, Imperial College London, London, United Kingdom.
| |
Collapse
|
36
|
Jadhav P, Sears T, Floan G, Joskowitz K, Nienow S, Cruz S, David M, de Cos V, Choi P, Ignacio RC. Application of a Machine Learning Algorithm in Prediction of Abusive Head Trauma in Children. J Pediatr Surg 2024; 59:80-85. [PMID: 37858394 DOI: 10.1016/j.jpedsurg.2023.09.027] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/26/2023] [Accepted: 09/07/2023] [Indexed: 10/21/2023]
Abstract
PURPOSE We explored the application of a machine learning algorithm for the timely detection of potential abusive head trauma (AHT) using the first free-text note of an encounter and demographic information. METHODS First free-text physician notes and demographic information were collected for children under 5 years of age at a Level 1 Trauma Center. The control group, which included patients with head/neck injury, was compared to those with AHT diagnosed by the Child Protective Team. Differential scores accounted for words overrepresented in AHT patient vs. control notes. Sentiment scores were reflective of note positivity/negativity and subjectivity scores accounted for note subjectivity/objectivity. The composite scores reflected the patient's differential score modified by the subjectivity score. Composite, sentiment, and subjectivity scores combined with demographic information trained a Random Forest (RF) machine learning algorithm to predict AHT. RESULTS Final composite scores with demographic information were highly associated with AHT in a test dataset. The control group included 587 patients and the test group included 193 patients. Combining composite scores with demographic information into the RF model improved AHT classification area under the curve (AUC) from 0.68 to 0.78, with an overall accuracy of 84%. Feature importance analysis of our RF model revealed that composite score, sentiment, age, and subjectivity were the most impactful predictors of AHT. The sentiment was not significantly different between control and AHT notes (p = 0.87), while subjectivity trended higher for AHT notes (p = 0.081). CONCLUSION We conclude that a machine learning algorithm can recognize patterns within free-text notes and demographic information that aid in AHT detection in children. LEVEL OF EVIDENCE III.
Collapse
Affiliation(s)
- Priyanka Jadhav
- University of California San Diego School of Medicine, 9500 Gilman Dr, La Jolla, CA, 92093, USA
| | - Timothy Sears
- Department of Bioinformatics and Systems Biology Graduate Program, University of California San Diego, 9500 Gilman Drive, San Diego, CA, 92093, USA
| | - Gretchen Floan
- Department of General Surgery, Naval Medical Center San Diego, 34800 Bob Wilson Dr, San Diego, CA, 92134, USA
| | - Katie Joskowitz
- Rady Children's Hospital San Diego, 3020 Children's Way, San Diego, CA, 92123, USA
| | - Shalon Nienow
- Department of Pediatrics, Division of Child Abuse Pediatrics, University of California-San Diego School of Medicine, 9500 Gilman Dr, La Jolla, CA, 92093, USA; Chadwick Center for Children and Families at Rady Childrens Hospital, 3665 Kearny Villa Road, Suite 500, San Diego, CA, 92123, USA
| | - Sheena Cruz
- University of California San Diego School of Medicine, 9500 Gilman Dr, La Jolla, CA, 92093, USA
| | - Maya David
- Tulane University School of Medicine, 1430 Tulane Ave, New Orleans, LA, 70112, USA
| | - Víctor de Cos
- University of California San Diego School of Medicine, 9500 Gilman Dr, La Jolla, CA, 92093, USA
| | - Pam Choi
- Department of General Surgery, Naval Medical Center San Diego, 34800 Bob Wilson Dr, San Diego, CA, 92134, USA
| | - Romeo C Ignacio
- University of California San Diego School of Medicine, 9500 Gilman Dr, La Jolla, CA, 92093, USA; Division of Pediatric Surgery, Department of Surgery, University of California San Diego School of Medicine, 9500 Gilman Dr, La Jolla, CA, 92093, USA.
| |
Collapse
|
37
|
Scharp D, Hobensack M, Davoudi A, Topaz M. Natural Language Processing Applied to Clinical Documentation in Post-acute Care Settings: A Scoping Review. J Am Med Dir Assoc 2024; 25:69-83. [PMID: 37838000 PMCID: PMC10792659 DOI: 10.1016/j.jamda.2023.09.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2023] [Revised: 09/05/2023] [Accepted: 09/07/2023] [Indexed: 10/16/2023]
Abstract
OBJECTIVES To determine the scope of the application of natural language processing to free-text clinical notes in post-acute care and provide a foundation for future natural language processing-based research in these settings. DESIGN Scoping review; reported according to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses extension for Scoping Reviews guidelines. SETTING AND PARTICIPANTS Post-acute care (ie, home health care, long-term care, skilled nursing facilities, and inpatient rehabilitation facilities). METHODS PubMed, Cumulative Index of Nursing and Allied Health Literature, and Embase were searched in February 2023. Eligible studies had quantitative designs that used natural language processing applied to clinical documentation in post-acute care settings. The quality of each study was appraised. RESULTS Twenty-one studies were included. Almost all studies were conducted in home health care settings. Most studies extracted data from electronic health records to examine the risk for negative outcomes, including acute care utilization, medication errors, and suicide mortality. About half of the studies did not report age, sex, race, or ethnicity data or use standardized terminologies. Only 8 studies included variables from socio-behavioral domains. Most studies fulfilled all quality appraisal indicators. CONCLUSIONS AND IMPLICATIONS The application of natural language processing is nascent in post-acute care settings. Future research should apply natural language processing using standardized terminologies to leverage free-text clinical notes in post-acute care to promote timely, comprehensive, and equitable care. Natural language processing could be integrated with predictive models to help identify patients who are at risk of negative outcomes. Future research should incorporate socio-behavioral determinants and diverse samples to improve health equity in informatics tools.
Collapse
Affiliation(s)
| | | | - Anahita Davoudi
- VNS Health, Center for Home Care Policy & Research, New York, NY, USA
| | - Maxim Topaz
- Columbia University School of Nursing, New York, NY, USA
| |
Collapse
|
38
|
Sim JA, Huang X, Horan MR, Stewart CM, Robison LL, Hudson MM, Baker JN, Huang IC. Natural language processing with machine learning methods to analyze unstructured patient-reported outcomes derived from electronic health records: A systematic review. Artif Intell Med 2023; 146:102701. [PMID: 38042599 PMCID: PMC10693655 DOI: 10.1016/j.artmed.2023.102701] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2023] [Revised: 09/30/2023] [Accepted: 10/29/2023] [Indexed: 12/04/2023]
Abstract
OBJECTIVE Natural language processing (NLP) combined with machine learning (ML) techniques are increasingly used to process unstructured/free-text patient-reported outcome (PRO) data available in electronic health records (EHRs). This systematic review summarizes the literature reporting NLP/ML systems/toolkits for analyzing PROs in clinical narratives of EHRs and discusses the future directions for the application of this modality in clinical care. METHODS We searched PubMed, Scopus, and Web of Science for studies written in English between 1/1/2000 and 12/31/2020. Seventy-nine studies meeting the eligibility criteria were included. We abstracted and summarized information related to the study purpose, patient population, type/source/amount of unstructured PRO data, linguistic features, and NLP systems/toolkits for processing unstructured PROs in EHRs. RESULTS Most of the studies used NLP/ML techniques to extract PROs from clinical narratives (n = 74) and mapped the extracted PROs into specific PRO domains for phenotyping or clustering purposes (n = 26). Some studies used NLP/ML to process PROs for predicting disease progression or onset of adverse events (n = 22) or developing/validating NLP/ML pipelines for analyzing unstructured PROs (n = 19). Studies used different linguistic features, including lexical, syntactic, semantic, and contextual features, to process unstructured PROs. Among the 25 NLP systems/toolkits we identified, 15 used rule-based NLP, 6 used hybrid NLP, and 4 used non-neural ML algorithms embedded in NLP. CONCLUSIONS This study supports the potential utility of different NLP/ML techniques in processing unstructured PROs available in EHRs for clinical care. Though using annotation rules for NLP/ML to analyze unstructured PROs is dominant, deploying novel neural ML-based methods is warranted.
Collapse
Affiliation(s)
- Jin-Ah Sim
- Department of Epidemiology and Cancer Control, St. Jude Children's Research Hospital, Memphis, TN, United States; School of AI Convergence, Hallym University, Chuncheon, Republic of Korea
| | - Xiaolei Huang
- Department of Computer Science, University of Memphis, Memphis, TN, United States
| | - Madeline R Horan
- Department of Epidemiology and Cancer Control, St. Jude Children's Research Hospital, Memphis, TN, United States
| | - Christopher M Stewart
- Institute for Intelligent Systems, University of Memphis, Memphis, TN, United States
| | - Leslie L Robison
- Department of Epidemiology and Cancer Control, St. Jude Children's Research Hospital, Memphis, TN, United States
| | - Melissa M Hudson
- Department of Epidemiology and Cancer Control, St. Jude Children's Research Hospital, Memphis, TN, United States; Department of Oncology, St. Jude Children's Research Hospital, Memphis, TN, United States
| | - Justin N Baker
- Department of Pediatrics, Stanford University, Stanford, CA, United States
| | - I-Chan Huang
- Department of Epidemiology and Cancer Control, St. Jude Children's Research Hospital, Memphis, TN, United States.
| |
Collapse
|
39
|
Keszthelyi D, Gaudet-Blavignac C, Bjelogrlic M, Lovis C. Patient Information Summarization in Clinical Settings: Scoping Review. JMIR Med Inform 2023; 11:e44639. [PMID: 38015588 DOI: 10.2196/44639] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2022] [Revised: 03/15/2023] [Accepted: 07/25/2023] [Indexed: 11/29/2023] Open
Abstract
BACKGROUND Information overflow, a common problem in the present clinical environment, can be mitigated by summarizing clinical data. Although there are several solutions for clinical summarization, there is a lack of a complete overview of the research relevant to this field. OBJECTIVE This study aims to identify state-of-the-art solutions for clinical summarization, to analyze their capabilities, and to identify their properties. METHODS A scoping review of articles published between 2005 and 2022 was conducted. With a clinical focus, PubMed and Web of Science were queried to find an initial set of reports, later extended by articles found through a chain of citations. The included reports were analyzed to answer the questions of where, what, and how medical information is summarized; whether summarization conserves temporality, uncertainty, and medical pertinence; and how the propositions are evaluated and deployed. To answer how information is summarized, methods were compared through a new framework "collect-synthesize-communicate" referring to information gathering from data, its synthesis, and communication to the end user. RESULTS Overall, 128 articles were included, representing various medical fields. Exclusively structured data were used as input in 46.1% (59/128) of papers, text in 41.4% (53/128) of articles, and both in 10.2% (13/128) of papers. Using the proposed framework, 42.2% (54/128) of the records contributed to information collection, 27.3% (35/128) contributed to information synthesis, and 46.1% (59/128) presented solutions for summary communication. Numerous summarization approaches have been presented, including extractive (n=13) and abstractive summarization (n=19); topic modeling (n=5); summary specification (n=11); concept and relation extraction (n=30); visual design considerations (n=59); and complete pipelines (n=7) using information extraction, synthesis, and communication. Graphical displays (n=53), short texts (n=41), static reports (n=7), and problem-oriented views (n=7) were the most common types in terms of summary communication. Although temporality and uncertainty information were usually not conserved in most studies (74/128, 57.8% and 113/128, 88.3%, respectively), some studies presented solutions to treat this information. Overall, 115 (89.8%) articles showed results of an evaluation, and methods included evaluations with human participants (median 15, IQR 24 participants): measurements in experiments with human participants (n=31), real situations (n=8), and usability studies (n=28). Methods without human involvement included intrinsic evaluation (n=24), performance on a proxy (n=10), or domain-specific tasks (n=11). Overall, 11 (8.6%) reports described a system deployed in clinical settings. CONCLUSIONS The scientific literature contains many propositions for summarizing patient information but reports very few comparisons of these proposals. This work proposes to compare these algorithms through how they conserve essential aspects of clinical information and through the "collect-synthesize-communicate" framework. We found that current propositions usually address these 3 steps only partially. Moreover, they conserve and use temporality, uncertainty, and pertinent medical aspects to varying extents, and solutions are often preliminary.
Collapse
Affiliation(s)
- Daniel Keszthelyi
- Division of Medical Information Sciences, University Hospitals of Geneva, Geneva, Switzerland
- Department of Radiology and Medical Informatics, University of Geneva, Geneva, Switzerland
| | - Christophe Gaudet-Blavignac
- Division of Medical Information Sciences, University Hospitals of Geneva, Geneva, Switzerland
- Department of Radiology and Medical Informatics, University of Geneva, Geneva, Switzerland
| | - Mina Bjelogrlic
- Division of Medical Information Sciences, University Hospitals of Geneva, Geneva, Switzerland
- Department of Radiology and Medical Informatics, University of Geneva, Geneva, Switzerland
| | - Christian Lovis
- Division of Medical Information Sciences, University Hospitals of Geneva, Geneva, Switzerland
- Department of Radiology and Medical Informatics, University of Geneva, Geneva, Switzerland
| |
Collapse
|
40
|
Stead WW, Flatley Brennan P. Celebrating Suzanne Bakken, 2023 Morris F. Collen Award winner and pioneer in health equity. J Am Med Inform Assoc 2023; 30:1760-1761. [PMID: 37855452 PMCID: PMC10586030 DOI: 10.1093/jamia/ocad189] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2023] [Accepted: 09/05/2023] [Indexed: 10/20/2023] Open
Affiliation(s)
- William W Stead
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN, United States
| | | |
Collapse
|
41
|
Shah AD, Subramanian A, Lewis J, Dhalla S, Ford E, Haroon S, Kuan V, Nirantharakumar K. Long Covid symptoms and diagnosis in primary care: A cohort study using structured and unstructured data in The Health Improvement Network primary care database. PLoS One 2023; 18:e0290583. [PMID: 37751444 PMCID: PMC10521988 DOI: 10.1371/journal.pone.0290583] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2023] [Accepted: 08/11/2023] [Indexed: 09/28/2023] Open
Abstract
BACKGROUND Long Covid is a widely recognised consequence of COVID-19 infection, but little is known about the burden of symptoms that patients present with in primary care, as these are typically recorded only in free text clinical notes. AIMS To compare symptoms in patients with and without a history of COVID-19, and investigate symptoms associated with a Long Covid diagnosis. METHODS We used primary care electronic health record data until the end of December 2020 from The Health Improvement Network (THIN), a Cegedim database. We included adults registered with participating practices in England, Scotland or Wales. We extracted information about 89 symptoms and 'Long Covid' diagnoses from free text using natural language processing. We calculated hazard ratios (adjusted for age, sex, baseline medical conditions and prior symptoms) for each symptom from 12 weeks after the COVID-19 diagnosis. RESULTS We compared 11,015 patients with confirmed COVID-19 and 18,098 unexposed controls. Only 20% of symptom records were coded, with 80% in free text. A wide range of symptoms were associated with COVID-19 at least 12 weeks post-infection, with strongest associations for fatigue (adjusted hazard ratio (aHR) 3.46, 95% confidence interval (CI) 2.87, 4.17), shortness of breath (aHR 2.89, 95% CI 2.48, 3.36), palpitations (aHR 2.59, 95% CI 1.86, 3.60), and phlegm (aHR 2.43, 95% CI 1.65, 3.59). However, a limited subset of symptoms were recorded within 7 days prior to a Long Covid diagnosis in more than 20% of cases: shortness of breath, chest pain, pain, fatigue, cough, and anxiety / depression. CONCLUSIONS Numerous symptoms are reported to primary care at least 12 weeks after COVID-19 infection, but only a subset are commonly associated with a GP diagnosis of Long Covid.
Collapse
Affiliation(s)
- Anoop D. Shah
- Institute of Health Informatics, University College London, London, United Kingdom
- NIHR University College London Hospitals Biomedical Research Centre, University College London Hospitals NHS Trust, London, United Kingdom
| | - Anuradhaa Subramanian
- Institute of Applied Health Research, University of Birmingham, Birmingham, United Kingdom
| | - Jadene Lewis
- Institute of Health Informatics, University College London, London, United Kingdom
| | - Samir Dhalla
- The Health Improvement Network Ltd., London, United Kingdom
| | - Elizabeth Ford
- Brighton and Sussex Medical School, Brighton, United Kingdom
| | - Shamil Haroon
- Institute of Applied Health Research, University of Birmingham, Birmingham, United Kingdom
| | - Valerie Kuan
- Institute of Health Informatics, University College London, London, United Kingdom
| | | |
Collapse
|
42
|
Nishioka S, Asano M, Yada S, Aramaki E, Yajima H, Yanagisawa Y, Sayama K, Kizaki H, Hori S. Adverse event signal extraction from cancer patients' narratives focusing on impact on their daily-life activities. Sci Rep 2023; 13:15516. [PMID: 37726371 PMCID: PMC10509234 DOI: 10.1038/s41598-023-42496-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2023] [Accepted: 09/11/2023] [Indexed: 09/21/2023] Open
Abstract
Adverse event (AE) management is important to improve anti-cancer treatment outcomes, but it is known that some AE signals can be missed during clinical visits. In particular, AEs that affect patients' activities of daily living (ADL) need careful monitoring as they may require immediate medical intervention. This study aimed to build deep-learning (DL) models for extracting signals of AEs limiting ADL from patients' narratives. The data source was blog posts written in Japanese by breast cancer patients. After pre-processing and annotation for AE signals, three DL models (BERT, ELECTRA, and T5) were trained and tested in three different approaches for AE signal identification. The performances of the trained models were evaluated in terms of precision, recall, and F1 scores. From 2,272 blog posts, 191 and 702 articles were identified as describing AEs limiting ADL or not limiting ADL, respectively. Among tested DL modes and approaches, T5 showed the best F1 scores to identify articles with AE limiting ADL or all AE: 0.557 and 0.811, respectively. The most frequent AE signals were "pain or numbness", "fatigue" and "nausea". Our results suggest that this AE monitoring scheme focusing on patients' ADL has potential to reinforce current AE management provided by medical staff.
Collapse
Affiliation(s)
- Satoshi Nishioka
- Division of Drug Informatics, Keio University Faculty of Pharmacy, Tokyo, Japan
| | - Masaki Asano
- Division of Drug Informatics, Keio University Faculty of Pharmacy, Tokyo, Japan
| | - Shuntaro Yada
- Nara Institute of Science and Technology, Nara, Japan
| | - Eiji Aramaki
- Nara Institute of Science and Technology, Nara, Japan
| | | | - Yuki Yanagisawa
- Division of Drug Informatics, Keio University Faculty of Pharmacy, Tokyo, Japan
| | - Kyoko Sayama
- Division of Drug Informatics, Keio University Faculty of Pharmacy, Tokyo, Japan
| | - Hayato Kizaki
- Division of Drug Informatics, Keio University Faculty of Pharmacy, Tokyo, Japan
| | - Satoko Hori
- Division of Drug Informatics, Keio University Faculty of Pharmacy, Tokyo, Japan.
| |
Collapse
|
43
|
Dhingra LS, Shen M, Mangla A, Khera R. Cardiovascular Care Innovation through Data-Driven Discoveries in the Electronic Health Record. Am J Cardiol 2023; 203:136-148. [PMID: 37499593 PMCID: PMC10865722 DOI: 10.1016/j.amjcard.2023.06.104] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/28/2023] [Revised: 05/24/2023] [Accepted: 06/29/2023] [Indexed: 07/29/2023]
Abstract
The electronic health record (EHR) represents a rich source of patient information, increasingly being leveraged for cardiovascular research. Although its primary use remains the seamless delivery of health care, the various longitudinally aggregated structured and unstructured data elements for each patient within the EHR can define the computational phenotypes of disease and care signatures and their association with outcomes. Although structured data elements, such as demographic characteristics, laboratory measurements, problem lists, and medications, are easily extracted, unstructured data are underused. The latter include free text in clinical narratives, documentation of procedures, and reports of imaging and pathology. Rapid scaling up of data storage and rapid innovation in natural language processing and computer vision can power insights from unstructured data streams. However, despite an array of opportunities for research using the EHR, specific expertise is necessary to adequately address confidentiality, accuracy, completeness, and heterogeneity challenges in EHR-based research. These often require methodological innovation and best practices to design and conduct successful research studies. Our review discusses these challenges and their proposed solutions. In addition, we highlight the ongoing innovations in federated learning in the EHR through a greater focus on common data models and discuss ongoing work that defines such an approach to large-scale, multicenter, federated studies. Such parallel improvements in technology and research methods enable innovative care and optimization of patient outcomes.
Collapse
Affiliation(s)
| | - Miles Shen
- Section of Cardiovascular Medicine, Department of Internal Medicine; Department of Internal Medicine
| | - Anjali Mangla
- Section of Cardiovascular Medicine, Department of Internal Medicine; Department of Neuroscience, Yale School of Medicine, New Haven, Connecticut
| | - Rohan Khera
- Section of Cardiovascular Medicine, Department of Internal Medicine; Center for Outcomes Research and Evaluation (CORE), Yale New Haven Hospital, New Haven, Connecticut; Section of Health Informatics, Department of Biostatistics, Yale School of Public Health, New Haven, Connecticut.; Section of Biomedical Informatics and Data Science, Yale School of Medicine, New Haven, Connecticut.
| |
Collapse
|
44
|
Adamson B, Waskom M, Blarre A, Kelly J, Krismer K, Nemeth S, Gippetti J, Ritten J, Harrison K, Ho G, Linzmayer R, Bansal T, Wilkinson S, Amster G, Estola E, Benedum CM, Fidyk E, Estévez M, Shapiro W, Cohen AB. Approach to machine learning for extraction of real-world data variables from electronic health records. Front Pharmacol 2023; 14:1180962. [PMID: 37781703 PMCID: PMC10541019 DOI: 10.3389/fphar.2023.1180962] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2023] [Accepted: 08/25/2023] [Indexed: 10/03/2023] Open
Abstract
Background: As artificial intelligence (AI) continues to advance with breakthroughs in natural language processing (NLP) and machine learning (ML), such as the development of models like OpenAI's ChatGPT, new opportunities are emerging for efficient curation of electronic health records (EHR) into real-world data (RWD) for evidence generation in oncology. Our objective is to describe the research and development of industry methods to promote transparency and explainability. Methods: We applied NLP with ML techniques to train, validate, and test the extraction of information from unstructured documents (e.g., clinician notes, radiology reports, lab reports, etc.) to output a set of structured variables required for RWD analysis. This research used a nationwide electronic health record (EHR)-derived database. Models were selected based on performance. Variables curated with an approach using ML extraction are those where the value is determined solely based on an ML model (i.e. not confirmed by abstraction), which identifies key information from visit notes and documents. These models do not predict future events or infer missing information. Results: We developed an approach using NLP and ML for extraction of clinically meaningful information from unstructured EHR documents and found high performance of output variables compared with variables curated by manually abstracted data. These extraction methods resulted in research-ready variables including initial cancer diagnosis with date, advanced/metastatic diagnosis with date, disease stage, histology, smoking status, surgery status with date, biomarker test results with dates, and oral treatments with dates. Conclusion: NLP and ML enable the extraction of retrospective clinical data in EHR with speed and scalability to help researchers learn from the experience of every person with cancer.
Collapse
Affiliation(s)
- Blythe Adamson
- Flatiron Health, Inc., New York, NY, United States
- The Comparative Health Outcomes, Policy, and Economics (CHOICE) Institute, Department of Pharmacy, University of Washington, Seattle, WA, United States
| | | | | | | | | | | | | | - John Ritten
- Flatiron Health, Inc., New York, NY, United States
| | | | - George Ho
- Flatiron Health, Inc., New York, NY, United States
| | | | - Tarun Bansal
- Flatiron Health, Inc., New York, NY, United States
| | | | - Guy Amster
- Flatiron Health, Inc., New York, NY, United States
| | - Evan Estola
- Flatiron Health, Inc., New York, NY, United States
| | | | - Erin Fidyk
- Flatiron Health, Inc., New York, NY, United States
| | | | - Will Shapiro
- Flatiron Health, Inc., New York, NY, United States
| | - Aaron B. Cohen
- Flatiron Health, Inc., New York, NY, United States
- Department of Medicine, NYU Grossman School of Medicine, New York, NY, United States
| |
Collapse
|
45
|
Fraile Navarro D, Ijaz K, Rezazadegan D, Rahimi-Ardabili H, Dras M, Coiera E, Berkovsky S. Clinical named entity recognition and relation extraction using natural language processing of medical free text: A systematic review. Int J Med Inform 2023; 177:105122. [PMID: 37295138 DOI: 10.1016/j.ijmedinf.2023.105122] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2022] [Revised: 04/14/2023] [Accepted: 06/03/2023] [Indexed: 06/12/2023]
Abstract
BACKGROUND Natural Language Processing (NLP) applications have developed over the past years in various fields including its application to clinical free text for named entity recognition and relation extraction. However, there has been rapid developments the last few years that there's currently no overview of it. Moreover, it is unclear how these models and tools have been translated into clinical practice. We aim to synthesize and review these developments. METHODS We reviewed literature from 2010 to date, searching PubMed, Scopus, the Association of Computational Linguistics (ACL), and Association of Computer Machinery (ACM) libraries for studies of NLP systems performing general-purpose (i.e., not disease- or treatment-specific) information extraction and relation extraction tasks in unstructured clinical text (e.g., discharge summaries). RESULTS We included in the review 94 studies with 30 studies published in the last three years. Machine learning methods were used in 68 studies, rule-based in 5 studies, and both in 22 studies. 63 studies focused on Named Entity Recognition, 13 on Relation Extraction and 18 performed both. The most frequently extracted entities were "problem", "test" and "treatment". 72 studies used public datasets and 22 studies used proprietary datasets alone. Only 14 studies defined clearly a clinical or information task to be addressed by the system and just three studies reported its use outside the experimental setting. Only 7 studies shared a pre-trained model and only 8 an available software tool. DISCUSSION Machine learning-based methods have dominated the NLP field on information extraction tasks. More recently, Transformer-based language models are taking the lead and showing the strongest performance. However, these developments are mostly based on a few datasets and generic annotations, with very few real-world use cases. This may raise questions about the generalizability of findings, translation into practice and highlights the need for robust clinical evaluation.
Collapse
Affiliation(s)
- David Fraile Navarro
- Centre for Health Informatics, Australian Institute of Health Innovation, Macquarie University, Sydney, Australia.
| | - Kiran Ijaz
- Centre for Health Informatics, Australian Institute of Health Innovation, Macquarie University, Sydney, Australia
| | - Dana Rezazadegan
- Department of Computer Science and Software Engineering. School of Software and Electrical Engineering, Swinburne University of Technology, Melbourne, Australia
| | - Hania Rahimi-Ardabili
- Centre for Health Informatics, Australian Institute of Health Innovation, Macquarie University, Sydney, Australia
| | - Mark Dras
- Department of Computing, Macquarie University, Sydney, Australia
| | - Enrico Coiera
- Centre for Health Informatics, Australian Institute of Health Innovation, Macquarie University, Sydney, Australia
| | - Shlomo Berkovsky
- Centre for Health Informatics, Australian Institute of Health Innovation, Macquarie University, Sydney, Australia
| |
Collapse
|
46
|
Magoc T, Allen KS, McDonnell C, Russo JP, Cummins J, Vest JR, Harle CA. Generalizability and portability of natural language processing system to extract individual social risk factors. Int J Med Inform 2023; 177:105115. [PMID: 37302362 PMCID: PMC11164320 DOI: 10.1016/j.ijmedinf.2023.105115] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2023] [Revised: 05/15/2023] [Accepted: 05/30/2023] [Indexed: 06/13/2023]
Abstract
OBJECTIVE The objective of this study is to validate and report on portability and generalizability of a Natural Language Processing (NLP) method to extract individual social factors from clinical notes, which was originally developed at a different institution. MATERIALS AND METHODS A rule-based deterministic state machine NLP model was developed to extract financial insecurity and housing instability using notes from one institution and was applied on all notes written during 6 months at another institution. 10% of positively-classified notes by NLP and the same number of negatively-classified notes were manually annotated. The NLP model was adjusted to accommodate notes at the new site. Accuracy, positive predictive value, sensitivity, and specificity were calculated. RESULTS More than 6 million notes were processed at the receiving site by the NLP model, which resulted in about 13,000 and 19,000 classified as positive for financial insecurity and housing instability, respectively. The NLP model showed excellent performance on the validation dataset with all measures over 0.87 for both social factors. DISCUSSION Our study illustrated the need to accommodate institution-specific note-writing templates as well as clinical terminology of emergent diseases when applying NLP model for social factors. A state machine is relatively simple to port effectively across institutions. Our study. showed superior performance to similar generalizability studies for extracting social factors. CONCLUSION Rule-based NLP model to extract social factors from clinical notes showed strong portability and generalizability across organizationally and geographically distinct institutions. With only relatively simple modifications, we obtained promising performance from an NLP-based model.
Collapse
Affiliation(s)
- Tanja Magoc
- College of Medicine, University of Florida, Gainesville, FL, USA.
| | - Katie S Allen
- Regenstrief Institute, Inc., Indianapolis, IN, USA; Richard M. Fairbanks School of Public Health, IUPUI, Indianapolis, IN, USA
| | - Cara McDonnell
- College of Medicine, University of Florida, Gainesville, FL, USA
| | - Jean-Paul Russo
- College of Medicine, University of Florida, Gainesville, FL, USA; Miller School of Medicine, University of Miami, Miami, FL, USA
| | | | - Joshua R Vest
- Regenstrief Institute, Inc., Indianapolis, IN, USA; Richard M. Fairbanks School of Public Health, IUPUI, Indianapolis, IN, USA
| | - Christopher A Harle
- Regenstrief Institute, Inc., Indianapolis, IN, USA; Richard M. Fairbanks School of Public Health, IUPUI, Indianapolis, IN, USA
| |
Collapse
|
47
|
Hobensack M, Zhao Y, Scharp D, Volodarskiy A, Slotwiner D, Reading Turchioe M. Characterising symptom clusters in patients with atrial fibrillation undergoing catheter ablation. Open Heart 2023; 10:e002385. [PMID: 37541744 PMCID: PMC10407417 DOI: 10.1136/openhrt-2023-002385] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/09/2023] [Accepted: 07/11/2023] [Indexed: 08/06/2023] Open
Abstract
OBJECTIVE This study aims to leverage natural language processing (NLP) and machine learning clustering analyses to (1) identify co-occurring symptoms of patients undergoing catheter ablation for atrial fibrillation (AF) and (2) describe clinical and sociodemographic correlates of symptom clusters. METHODS We conducted a cross-sectional retrospective analysis using electronic health records data. Adults who underwent AF ablation between 2010 and 2020 were included. Demographic, comorbidity and medication information was extracted using structured queries. Ten AF symptoms were extracted from unstructured clinical notes (n=13 416) using a validated NLP pipeline (F-score=0.81). We used the unsupervised machine learning approach known as Ward's hierarchical agglomerative clustering to characterise and identify subgroups of patients representing different clusters. Fisher's exact tests were used to investigate subgroup differences based on age, gender, race and heart failure (HF) status. RESULTS A total of 1293 patients were included in our analysis (mean age 65.5 years, 35.2% female, 58% white). The most frequently documented symptoms were dyspnoea (64%), oedema (62%) and palpitations (57%). We identified six symptom clusters: generally symptomatic, dyspnoea and oedema, chest pain, anxiety, fatigue and palpitations, and asymptomatic (reference). The asymptomatic cluster had a significantly higher prevalence of male, white and comorbid HF patients. CONCLUSIONS We applied NLP and machine learning to a large dataset to identify symptom clusters, which may signify latent biological underpinnings of symptom experiences and generate implications for clinical care. AF patients' symptom experiences vary widely. Given prior work showing that AF symptoms predict adverse outcomes, future work should investigate associations between symptom clusters and postablation outcomes.
Collapse
Affiliation(s)
- Mollie Hobensack
- Columbia University School of Nursing, New York City, New York, USA
| | - Yihong Zhao
- Columbia University School of Nursing, New York City, New York, USA
| | - Danielle Scharp
- Columbia University School of Nursing, New York City, New York, USA
| | | | - David Slotwiner
- Cardiology, NewYork-Presbyterian Queens Hospital, Flushing, New York, USA
- Department of Population Health Sciences, Weill Cornell Medicine, New York City, New York, USA
| | | |
Collapse
|
48
|
Li H, Gerkin RC, Bakke A, Norel R, Cecchi G, Laudamiel C, Niv MY, Ohla K, Hayes JE, Parma V, Meyer P. Text-based predictions of COVID-19 diagnosis from self-reported chemosensory descriptions. COMMUNICATIONS MEDICINE 2023; 3:104. [PMID: 37500763 PMCID: PMC10374642 DOI: 10.1038/s43856-023-00334-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2023] [Accepted: 07/19/2023] [Indexed: 07/29/2023] Open
Abstract
BACKGROUND There is a prevailing view that humans' capacity to use language to characterize sensations like odors or tastes is poor, providing an unreliable source of information. METHODS Here, we developed a machine learning method based on Natural Language Processing (NLP) using Large Language Models (LLM) to predict COVID-19 diagnosis solely based on text descriptions of acute changes in chemosensation, i.e., smell, taste and chemesthesis, caused by the disease. The dataset of more than 1500 subjects was obtained from survey responses early in the COVID-19 pandemic, in Spring 2020. RESULTS When predicting COVID-19 diagnosis, our NLP model performs comparably (AUC ROC ~ 0.65) to models based on self-reported changes in function collected via quantitative rating scales. Further, our NLP model could attribute importance of words when performing the prediction; sentiment and descriptive words such as "smell", "taste", "sense", had strong contributions to the predictions. In addition, adjectives describing specific tastes or smells such as "salty", "sweet", "spicy", and "sour" also contributed considerably to predictions. CONCLUSIONS Our results show that the description of perceptual symptoms caused by a viral infection can be used to fine-tune an LLM model to correctly predict and interpret the diagnostic status of a subject. In the future, similar models may have utility for patient verbatims from online health portals or electronic health records.
Collapse
Affiliation(s)
- Hongyang Li
- Health Care and Life Sciences, IBM T.J. Watson Research Center, Yorktown Heights, NY, USA
| | - Richard C Gerkin
- School of Life Sciences, Arizona State University, Tempe, AZ, USA
- Osmo, Cambridge, MA, USA
| | - Alyssa Bakke
- Department of Food Science, The Pennsylvania State University, University Park, PA, USA
| | - Raquel Norel
- Health Care and Life Sciences, IBM T.J. Watson Research Center, Yorktown Heights, NY, USA
| | - Guillermo Cecchi
- Health Care and Life Sciences, IBM T.J. Watson Research Center, Yorktown Heights, NY, USA
| | | | - Masha Y Niv
- The Faculty of Agriculture, Food and Environment, The Hebrew University of Jerusalem, Rehovot, Israel
| | - Kathrin Ohla
- Department of Food Science, The Pennsylvania State University, University Park, PA, USA
- Science & Research, dsm-firmenich, Satigny, Switzerland
| | - John E Hayes
- Department of Food Science, The Pennsylvania State University, University Park, PA, USA
| | | | - Pablo Meyer
- Health Care and Life Sciences, IBM T.J. Watson Research Center, Yorktown Heights, NY, USA.
| |
Collapse
|
49
|
Crowson MG, Alsentzer E, Fiskio J, Bates DW. Towards Medical Billing Automation: NLP for Outpatient Clinician Note Classification. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.07.07.23292367. [PMID: 37502975 PMCID: PMC10370228 DOI: 10.1101/2023.07.07.23292367] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/29/2023]
Abstract
Objectives Our primary objective was to develop a natural language processing approach that accurately predicts outpatient Evaluation and Management (E/M) level of service (LoS) codes using clinicians' notes from a health system electronic health record. A secondary objective was to investigate the impact of clinic note de-identification on document classification performance. Methods We used retrospective outpatient office clinic notes from four medical and surgical specialties. Classification models were fine-tuned on the clinic notes datasets and stratified by subspecialty. The success criteria for the classification tasks were the classification accuracy and F1-scores on internal test data. For the secondary objective, the dataset was de-identified using Named Entity Recognition (NER) to remove protected health information (PHI), and models were retrained. Results The models demonstrated similar predictive performance across different specialties, except for internal medicine, which had the lowest classification accuracy across all model architectures. The models trained on the entire note corpus achieved an E/M LoS CPT code classification accuracy of 74.8% (CI 95: 74.1-75.6). However, the de-identified note corpus showed a markedly lower classification accuracy of 48.2% (CI 95: 47.7-48.6) compared to the model trained on the identified notes. Conclusion The study demonstrates the potential of NLP-based document classifiers to accurately predict E/M LoS CPT codes using clinical notes from various medical and procedural specialties. The models' performance suggests that the classification task's complexity merits further investigation. The de-identification experiment demonstrated that de-identification may negatively impact classifier performance. Further research is needed to validate the performance of our NLP classifiers in different healthcare settings and patient populations and to investigate the potential implications of de-identification on model performance.
Collapse
|
50
|
Sun Z, Shi H, Huang Z, Ding N. Learning Representations from Medical Text for Effective Diagnoses and Knowledge Discovery. ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. ANNUAL INTERNATIONAL CONFERENCE 2023; 2023:1-7. [PMID: 38083156 DOI: 10.1109/embc40787.2023.10340797] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/18/2023]
Abstract
Discovering knowledge and effectively predicting target events are two main goals of medical text mining. However, few models can achieve them simultaneously. In this study, we investigated the possibility of discovering knowledge and predicting diagnosis at once via raw medical text. We proposed the Enhanced Neural Topic Model (ENTM), a variant of the neural topic model, to learn interpretable representations. We introduced the auxiliary loss set to improve the effectiveness of learned representations. Then, we used learned representations to train a softmax regression model to predict target events. As each element in representations learned by the ENTM has an explicit semantic meaning, weights in softmax regression represent potential knowledge of whether an element is a significant factor in predicting diagnosis. We adopted two independent medical text datasets to evaluate our ENTM model. Results indicate that our model performed better than the latest pretrained neural language models. Meanwhile, analysis of model parameters indicates that our model has the potential discover knowledge from data.Clinical relevance- This work provides a model that can effectively predict patient diagnosis and has the potential to discover knowledge from medical text.
Collapse
|