1
|
Fonferko-Shadrach B, Strafford H, Jones C, Khan RA, Brown S, Edwards J, Hawken J, Shrimpton LE, White CP, Powell R, Sawhney IMS, Pickrell WO, Lacey AS. Annotation of epilepsy clinic letters for natural language processing. J Biomed Semantics 2024; 15:17. [PMID: 39277770 PMCID: PMC11402197 DOI: 10.1186/s13326-024-00316-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2024] [Accepted: 07/22/2024] [Indexed: 09/17/2024] Open
Abstract
BACKGROUND Natural language processing (NLP) is increasingly being used to extract structured information from unstructured text to assist clinical decision-making and aid healthcare research. The availability of expert-annotated documents for the development and validation of NLP applications is limited. We created synthetic clinical documents to address this, and to validate the Extraction of Epilepsy Clinical Text version 2 (ExECTv2) NLP pipeline. METHODS We created 200 synthetic clinic letters based on hospital outpatient consultations with epilepsy specialists. The letters were double annotated by trained clinicians and researchers according to agreed guidelines. We used the annotation tool, Markup, with an epilepsy concept list based on the Unified Medical Language System ontology. All annotations were reviewed, and a gold standard set of annotations was agreed and used to validate the performance of ExECTv2. RESULTS The overall inter-annotator agreement (IAA) between the two sets of annotations produced a per item F1 score of 0.73. Validating ExECTv2 using the gold standard gave an overall F1 score of 0.87 per item, and 0.90 per letter. CONCLUSION The synthetic letters, annotations, and annotation guidelines have been made freely available. To our knowledge, this is the first publicly available set of annotated epilepsy clinic letters and guidelines that can be used for NLP researchers with minimum epilepsy knowledge. The IAA results show that clinical text annotation tasks are difficult and require a gold standard to be arranged by researcher consensus. The results for ExECTv2, our automated epilepsy NLP pipeline, extracted detailed epilepsy information from unstructured epilepsy letters with more accuracy than human annotators, further confirming the utility of NLP for clinical and research applications.
Collapse
Affiliation(s)
| | - Huw Strafford
- Swansea University Medical School, Swansea University, Swansea, Wales, UK
| | - Carys Jones
- Swansea University Medical School, Swansea University, Swansea, Wales, UK
| | - Russell A Khan
- Swansea University Medical School, Swansea University, Swansea, Wales, UK
| | - Sharon Brown
- Neurology Department, Swansea Bay University Health Board, Swansea, Wales, UK
| | - Jenny Edwards
- Neurology Department, Swansea Bay University Health Board, Swansea, Wales, UK
| | - Jonathan Hawken
- Neurology Department, Swansea Bay University Health Board, Swansea, Wales, UK
| | - Luke E Shrimpton
- Neurology Department, Swansea Bay University Health Board, Swansea, Wales, UK
| | - Catharine P White
- Swansea University Medical School, Swansea University, Swansea, Wales, UK
- Paediatric Neurology Centre, Swansea Bay University Health Board, Swansea, Wales, UK
| | - Robert Powell
- Swansea University Medical School, Swansea University, Swansea, Wales, UK
- Neurology Department, Swansea Bay University Health Board, Swansea, Wales, UK
| | - Inder M S Sawhney
- Swansea University Medical School, Swansea University, Swansea, Wales, UK
- Neurology Department, Swansea Bay University Health Board, Swansea, Wales, UK
| | - William O Pickrell
- Swansea University Medical School, Swansea University, Swansea, Wales, UK
- Neurology Department, Swansea Bay University Health Board, Swansea, Wales, UK
| | - Arron S Lacey
- Swansea University Medical School, Swansea University, Swansea, Wales, UK
| |
Collapse
|
2
|
Li D, Yang Y, Cui J, Meng X, Qu J, Jiang Z, Zhao Y. Joint extraction of Chinese medical entities and relations based on RoBERTa and single-module global pointer. BMC Med Inform Decis Mak 2024; 24:218. [PMID: 39085892 PMCID: PMC11293210 DOI: 10.1186/s12911-024-02577-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2022] [Accepted: 06/13/2024] [Indexed: 08/02/2024] Open
Abstract
BACKGROUND Most Chinese joint entity and relation extraction tasks in medicine involve numerous nested entities, overlapping relations, and other challenging extraction issues. In response to these problems, some traditional methods decompose the joint extraction task into multiple steps or multiple modules, resulting in local dependency in the meantime. METHODS To alleviate this issue, we propose a joint extraction model of Chinese medical entities and relations based on RoBERTa and single-module global pointer, namely RSGP, which formulates joint extraction as a global pointer linking problem. Considering the uniqueness of Chinese language structure, we introduce the RoBERTa-wwm pre-trained language model at the encoding layer to obtain a better embedding representation. Then, we represent the input sentence as a third-order tensor and score each position in the tensor to prepare for the subsequent process of decoding the triples. In the end, we design a novel single-module global pointer decoding approach to alleviate the generation of redundant information. Specifically, we analyze the decoding process of single character entities individually, improving the time and space performance of RSGP to some extent. RESULTS In order to verify the effectiveness of our model in extracting Chinese medical entities and relations, we carry out the experiments on the public dataset, CMeIE. Experimental results show that RSGP performs significantly better on the joint extraction of Chinese medical entities and relations, and achieves state-of-the-art results compared with baseline models. CONCLUSION The proposed RSGP can effectively extract entities and relations from Chinese medical texts and help to realize the structure of Chinese medical texts, so as to provide high-quality data support for the construction of Chinese medical knowledge graphs.
Collapse
Affiliation(s)
- Dongmei Li
- School of Information Science and Technology, Beijing Forestry University, 100083, Beijing, China
- Engineering Research Center for Forestry-Oriented Intelligent Information Processing of National Forestry and Grassland Administration, 100083, Beijing, China
| | - Yu Yang
- School of Information Science and Technology, Beijing Forestry University, 100083, Beijing, China
- Engineering Research Center for Forestry-Oriented Intelligent Information Processing of National Forestry and Grassland Administration, 100083, Beijing, China
| | - Jinman Cui
- School of Information Science and Technology, Beijing Forestry University, 100083, Beijing, China
- Engineering Research Center for Forestry-Oriented Intelligent Information Processing of National Forestry and Grassland Administration, 100083, Beijing, China
| | - Xianghao Meng
- School of Information Science and Technology, Beijing Forestry University, 100083, Beijing, China
- Engineering Research Center for Forestry-Oriented Intelligent Information Processing of National Forestry and Grassland Administration, 100083, Beijing, China
| | - Jintao Qu
- School of Information Science and Technology, Beijing Forestry University, 100083, Beijing, China
- Engineering Research Center for Forestry-Oriented Intelligent Information Processing of National Forestry and Grassland Administration, 100083, Beijing, China
| | - Zhuobin Jiang
- National Data Center of Traditional Chinese Medicine, China Academy of Chinese Medical Sciences, 100700, Beijing, China.
| | - Yufeng Zhao
- National Data Center of Traditional Chinese Medicine, China Academy of Chinese Medical Sciences, 100700, Beijing, China.
| |
Collapse
|
3
|
Mojibian A, Jaskolka J, Ching G, Lee B, Myers R, Devine C, Nicolaou S, Parker W. The Efficacy of a Named Entity Recognition AI Model for Identifying Incidental Pulmonary Nodules in CT Reports. Can Assoc Radiol J 2024:8465371241266785. [PMID: 39066637 DOI: 10.1177/08465371241266785] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/30/2024] Open
Abstract
Purpose: This study evaluates the efficacy of a commercial medical Named Entity Recognition (NER) model combined with a post-processing protocol in identifying incidental pulmonary nodules from CT reports. Methods: We analyzed 9165 anonymized CT reports and classified them into 3 categories: no nodules, nodules present, and nodules >6 mm. For each report, a generic medical NER model annotated entities and their relations, which were then filtered through inclusion/exclusion criteria selected to identify pulmonary nodules. Ground truth was established by manual review. To better understand the relationship between model performance and nodule prevalence, a subset of the data was programmatically balanced to equalize the number of reports in each class category. Results: In the unbalanced subset of the data, the model achieved a sensitivity of 97%, specificity of 99%, and accuracy of 99% in detecting pulmonary nodules mentioned in the reports. For nodules >6 mm, sensitivity was 95%, specificity was 100%, and accuracy was 100%. In the balanced subset of the data, sensitivity was 99%, specificity 96%, and accuracy 97% for nodule detection; for larger nodules, sensitivity was 94%, specificity 99%, and accuracy 98%. Conclusions: The NER model demonstrated high sensitivity and specificity in detecting pulmonary nodules reported in CT scans, including those >6 mm which are potentially clinically significant. The results were consistent across both unbalanced and balanced datasets indicating that the model performance is independent of nodule prevalence. Implementing this technology in hospital systems could automate the identification of at-risk patients, ensuring timely follow-up and potentially reducing missed or late-stage cancer diagnoses.
Collapse
Affiliation(s)
- Alireza Mojibian
- Sapien Machine Learning Corporation (SapienML), Vancouver, BC, Canada
| | - Jeff Jaskolka
- Radiology Department, Brampton Civic Hospital, Brampton, ON, Canada
- Faculty of Medicine - Medical Imaging, University of Toronto, Toronto, ON, Canada
| | - Geoffrey Ching
- Schulich School of Medicine & Dentistry - University of Western Ontario, London, On, Canada
| | - Brian Lee
- Sapien Machine Learning Corporation (SapienML), Vancouver, BC, Canada
| | - Renelle Myers
- Faculty of Medicine, University of British Columbia, Vancouver, BC, Canada
- BC Cancer Agency, Provincial Health Services Authority, Vancouver, BC, Canada
- Respirology, Vancouver General Hospital, Vancouver, BC, Canada
| | - Chloe Devine
- Sapien Machine Learning Corporation (SapienML), Vancouver, BC, Canada
| | - Savvas Nicolaou
- Sapien Machine Learning Corporation (SapienML), Vancouver, BC, Canada
- Faculty of Medicine, University of British Columbia, Vancouver, BC, Canada
- Radiology Department, Vancouver General Hospital, Vancouver, BC, Canada
| | - William Parker
- Sapien Machine Learning Corporation (SapienML), Vancouver, BC, Canada
- Radiology Department, Vancouver General Hospital, Vancouver, BC, Canada
- Radiology Department, Nanaimo Regional General Hospital, Nanaimo, BC, Canada
| |
Collapse
|
4
|
Kuo NIH, Perez-Concha O, Hanly M, Mnatzaganian E, Hao B, Di Sipio M, Yu G, Vanjara J, Valerie IC, de Oliveira Costa J, Churches T, Lujic S, Hegarty J, Jorm L, Barbieri S. Enriching Data Science and Health Care Education: Application and Impact of Synthetic Data Sets Through the Health Gym Project. JMIR MEDICAL EDUCATION 2024; 10:e51388. [PMID: 38227356 PMCID: PMC10828942 DOI: 10.2196/51388] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/30/2023] [Revised: 10/20/2023] [Accepted: 11/08/2023] [Indexed: 01/17/2024]
Abstract
Large-scale medical data sets are vital for hands-on education in health data science but are often inaccessible due to privacy concerns. Addressing this gap, we developed the Health Gym project, a free and open-source platform designed to generate synthetic health data sets applicable to various areas of data science education, including machine learning, data visualization, and traditional statistical models. Initially, we generated 3 synthetic data sets for sepsis, acute hypotension, and antiretroviral therapy for HIV infection. This paper discusses the educational applications of Health Gym's synthetic data sets. We illustrate this through their use in postgraduate health data science courses delivered by the University of New South Wales, Australia, and a Datathon event, involving academics, students, clinicians, and local health district professionals. We also include adaptable worked examples using our synthetic data sets, designed to enrich hands-on tutorial and workshop experiences. Although we highlight the potential of these data sets in advancing data science education and health care artificial intelligence, we also emphasize the need for continued research into the inherent limitations of synthetic data.
Collapse
Affiliation(s)
- Nicholas I-Hsien Kuo
- Centre for Big Data Research in Health, The University of New South Wales, Sydney, Australia
| | - Oscar Perez-Concha
- Centre for Big Data Research in Health, The University of New South Wales, Sydney, Australia
| | - Mark Hanly
- Centre for Big Data Research in Health, The University of New South Wales, Sydney, Australia
| | | | - Brandon Hao
- The University of New South Wales, Sydney, Australia
| | | | - Guolin Yu
- The University of New South Wales, Sydney, Australia
| | - Jash Vanjara
- The University of New South Wales, Sydney, Australia
| | | | - Juliana de Oliveira Costa
- Medicines Intelligence Research Program, School of Population Health, The University of New South Wales, Sydney, Australia
| | - Timothy Churches
- School of Clinical Medicine, University of New South Wales, Sydney, Australia
- Ingham Institute of Applied Medical Research, Liverpool, Sydney, Australia
| | - Sanja Lujic
- Centre for Big Data Research in Health, The University of New South Wales, Sydney, Australia
| | - Jo Hegarty
- Sydney Local Health District, Sydney, Australia
| | - Louisa Jorm
- Centre for Big Data Research in Health, The University of New South Wales, Sydney, Australia
| | - Sebastiano Barbieri
- Centre for Big Data Research in Health, The University of New South Wales, Sydney, Australia
| |
Collapse
|
5
|
Wibaek R, Andersen GS, Dahm CC, Witte DR, Hulman A. Large Language Models for Epidemiological Research via Automated Machine Learning: Case Study Using Data From the British National Child Development Study. JMIR Med Inform 2023; 11:e43638. [PMID: 37787655 PMCID: PMC10547934 DOI: 10.2196/43638] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2022] [Revised: 06/29/2023] [Accepted: 07/22/2023] [Indexed: 10/04/2023] Open
Abstract
Background Large language models have had a huge impact on natural language processing (NLP) in recent years. However, their application in epidemiological research is still limited to the analysis of electronic health records and social media data. objectives To demonstrate the potential of NLP beyond these domains, we aimed to develop prediction models based on texts collected from an epidemiological cohort and compare their performance to classical regression methods. Methods We used data from the British National Child Development Study, where 10,567 children aged 11 years wrote essays about how they imagined themselves as 25-year-olds. Overall, 15% of the data set was set aside as a test set for performance evaluation. Pretrained language models were fine-tuned using AutoTrain (Hugging Face) to predict current reading comprehension score (range: 0-35) and future BMI and physical activity (active vs inactive) at the age of 33 years. We then compared their predictive performance (accuracy or discrimination) with linear and logistic regression models, including demographic and lifestyle factors of the parents and children from birth to the age of 11 years as predictors. Results NLP clearly outperformed linear regression when predicting reading comprehension scores (root mean square error: 3.89, 95% CI 3.74-4.05 for NLP vs 4.14, 95% CI 3.98-4.30 and 5.41, 95% CI 5.23-5.58 for regression models with and without general ability score as a predictor, respectively). Predictive performance for physical activity was similarly poor for the 2 methods (area under the receiver operating characteristic curve: 0.55, 95% CI 0.52-0.60 for both) but was slightly better than random assignment, whereas linear regression clearly outperformed the NLP approach when predicting BMI (root mean square error: 4.38, 95% CI 4.02-4.74 for NLP vs 3.85, 95% CI 3.54-4.16 for regression). The NLP approach did not perform better than simply assigning the mean BMI from the training set as a predictor. Conclusions Our study demonstrated the potential of using large language models on text collected from epidemiological studies. The performance of the approach appeared to depend on how directly the topic of the text was related to the outcome. Open-ended questions specifically designed to capture certain health concepts and lived experiences in combination with NLP methods should receive more attention in future epidemiological studies.
Collapse
Affiliation(s)
| | | | | | - Daniel R Witte
- Department of Public Health, Aarhus University, Aarhus, Denmark
- Steno Diabetes Center Aarhus, Aarhus University Hospital, Aarhus, Denmark
| | - Adam Hulman
- Department of Public Health, Aarhus University, Aarhus, Denmark
- Steno Diabetes Center Aarhus, Aarhus University Hospital, Aarhus, Denmark
| |
Collapse
|
6
|
Peng C, Yang X, Yu Z, Bian J, Hogan WR, Wu Y. Clinical concept and relation extraction using prompt-based machine reading comprehension. J Am Med Inform Assoc 2023; 30:1486-1493. [PMID: 37316988 PMCID: PMC10436141 DOI: 10.1093/jamia/ocad107] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2023] [Revised: 05/08/2023] [Accepted: 06/05/2023] [Indexed: 06/16/2023] Open
Abstract
OBJECTIVE To develop a natural language processing system that solves both clinical concept extraction and relation extraction in a unified prompt-based machine reading comprehension (MRC) architecture with good generalizability for cross-institution applications. METHODS We formulate both clinical concept extraction and relation extraction using a unified prompt-based MRC architecture and explore state-of-the-art transformer models. We compare our MRC models with existing deep learning models for concept extraction and end-to-end relation extraction using 2 benchmark datasets developed by the 2018 National NLP Clinical Challenges (n2c2) challenge (medications and adverse drug events) and the 2022 n2c2 challenge (relations of social determinants of health [SDoH]). We also evaluate the transfer learning ability of the proposed MRC models in a cross-institution setting. We perform error analyses and examine how different prompting strategies affect the performance of MRC models. RESULTS AND CONCLUSION The proposed MRC models achieve state-of-the-art performance for clinical concept and relation extraction on the 2 benchmark datasets, outperforming previous non-MRC transformer models. GatorTron-MRC achieves the best strict and lenient F1-scores for concept extraction, outperforming previous deep learning models on the 2 datasets by 1%-3% and 0.7%-1.3%, respectively. For end-to-end relation extraction, GatorTron-MRC and BERT-MIMIC-MRC achieve the best F1-scores, outperforming previous deep learning models by 0.9%-2.4% and 10%-11%, respectively. For cross-institution evaluation, GatorTron-MRC outperforms traditional GatorTron by 6.4% and 16% for the 2 datasets, respectively. The proposed method is better at handling nested/overlapped concepts, extracting relations, and has good portability for cross-institute applications. Our clinical MRC package is publicly available at https://github.com/uf-hobi-informatics-lab/ClinicalTransformerMRC.
Collapse
Affiliation(s)
- Cheng Peng
- Department of Health Outcomes and Biomedical Informatics, College of Medicine, University of Florida, Gainesville, Florida, USA
| | - Xi Yang
- Department of Health Outcomes and Biomedical Informatics, College of Medicine, University of Florida, Gainesville, Florida, USA
- Cancer Informatics Shared Resource, University of Florida Health Cancer Center, Gainesville, Florida, USA
| | - Zehao Yu
- Department of Health Outcomes and Biomedical Informatics, College of Medicine, University of Florida, Gainesville, Florida, USA
| | - Jiang Bian
- Department of Health Outcomes and Biomedical Informatics, College of Medicine, University of Florida, Gainesville, Florida, USA
- Cancer Informatics Shared Resource, University of Florida Health Cancer Center, Gainesville, Florida, USA
| | - William R Hogan
- Department of Health Outcomes and Biomedical Informatics, College of Medicine, University of Florida, Gainesville, Florida, USA
| | - Yonghui Wu
- Department of Health Outcomes and Biomedical Informatics, College of Medicine, University of Florida, Gainesville, Florida, USA
- Cancer Informatics Shared Resource, University of Florida Health Cancer Center, Gainesville, Florida, USA
| |
Collapse
|
7
|
Ullah Miah MS, Sulaiman J, Sarwar TB, Islam SS, Rahman M, Haque MS. Medical Named Entity Recognition (MedNER): A Deep Learning Model for Recognizing Medical Entities (Drug, Disease) from Scientific Texts. IEEE EUROCON 2023 - 20TH INTERNATIONAL CONFERENCE ON SMART TECHNOLOGIES 2023. [DOI: 10.1109/eurocon56442.2023.10199075] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/01/2023]
Affiliation(s)
| | | | | | | | - Mizanur Rahman
- Faculty of Computing Universiti Malaysia Pahang,Pekan,Malaysia
| | - Md. Samiul Haque
- Institute of Information Technology University of Dhaka,Dhaka,Bangladesh
| |
Collapse
|
8
|
Schäfer H, Idrissi-Yaghir A, Bewersdorff J, Frihat S, Friedrich CM, Zesch T. Medication event extraction in clinical notes: Contribution of the WisPerMed team to the n2c2 2022 challenge. J Biomed Inform 2023; 143:104400. [PMID: 37211196 DOI: 10.1016/j.jbi.2023.104400] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2023] [Revised: 04/21/2023] [Accepted: 05/15/2023] [Indexed: 05/23/2023]
Abstract
In this work, we describe the findings of the 'WisPerMed' team from their participation in Track 1 (Contextualized Medication Event Extraction) of the n2c2 2022 challenge. We tackle two tasks: (i) medication extraction, which involves extracting all mentions of medications from the clinical notes, and (ii) event classification, which involves classifying the medication mentions based on whether a change in the medication has been discussed. To address the long lengths of clinical texts, which often exceed the maximum token length that models based on the transformer-architecture can handle, various approaches, such as the use of ClinicalBERT with a sliding window approach and Longformer-based models, are employed. In addition, domain adaptation through masked language modeling and preprocessing steps such as sentence splitting are utilized to improve model performance. Since both tasks were treated as named entity recognition (NER) problems, a sanity check was performed in the second release to eliminate possible weaknesses in the medication detection itself. This check used the medication spans to remove false positive predictions and replace missed tokens with the highest softmax probability of the disposition types. The effectiveness of these approaches is evaluated through multiple submissions to the tasks, as well as with post-challenge results, with a focus on the DeBERTa v3 model and its disentangled attention mechanism. Results show that the DeBERTa v3 model performs well in both the NER task and the event classification task.
Collapse
Affiliation(s)
- Henning Schäfer
- Department of Computer Science, University of Applied Sciences and Arts Dortmund (FH Dortmund), Emil-Figge-Straße 42, Dortmund, 44227, Germany; Institute for Transfusion Medicine, University Hospital Essen, Hufelandstraße 55, Essen, 45147, Germany.
| | - Ahmad Idrissi-Yaghir
- Department of Computer Science, University of Applied Sciences and Arts Dortmund (FH Dortmund), Emil-Figge-Straße 42, Dortmund, 44227, Germany; Institute for Medical Informatics, Biometry and Epidemiology (IMIBE), University Hospital Essen, Hufelandstraße 55, Essen, 45147, Germany
| | - Jeanette Bewersdorff
- Computational Linguistics, CATALPA - Center for Advanced Technology-Assisted Learning and Predictive Analytics, FernUniversität in Hagen, Germany
| | | | - Christoph M Friedrich
- Department of Computer Science, University of Applied Sciences and Arts Dortmund (FH Dortmund), Emil-Figge-Straße 42, Dortmund, 44227, Germany; Institute for Medical Informatics, Biometry and Epidemiology (IMIBE), University Hospital Essen, Hufelandstraße 55, Essen, 45147, Germany
| | - Torsten Zesch
- Computational Linguistics, CATALPA - Center for Advanced Technology-Assisted Learning and Predictive Analytics, FernUniversität in Hagen, Germany
| |
Collapse
|
9
|
Yang L, Huang X, Wang J, Yang X, Ding L, Li Z, Li J. Identifying stroke-related quantified evidence from electronic health records in real-world studies. Artif Intell Med 2023; 140:102552. [PMID: 37210153 DOI: 10.1016/j.artmed.2023.102552] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2021] [Revised: 02/28/2023] [Accepted: 04/11/2023] [Indexed: 05/22/2023]
Abstract
BACKGROUND Stroke is one of the leading causes of death and disability worldwide. The National Institutes of Health Stroke Scale (NIHSS) scores in electronic health records (EHRs), which quantitatively describe patients' neurological deficits in evidence-based treatment, are crucial in stroke-related clinical investigations. However, the free-text format and lack of standardization inhibit their effective use. Automatically extracting the scale scores from the clinical free text so that its potential value in real-world studies is realized has become an important goal. OBJECTIVE This study aims to develop an automated method to extract scale scores from the free text of EHRs. METHODS We propose a two-step pipeline method to identify NIHSS items and numerical scores and validate its feasibility using a freely accessible critical care database: MIMIC-III (Medical Information Mart for Intensive Care III). First, we utilize MIMIC-III to create an annotated corpus. Then, we investigate possible machine learning methods for two subtasks, NIHSS item and score recognition and item-score relation extraction. In the evaluation, we conduct both task-specific and end-to-end evaluations and compare our method with the rule-based method using precision, recall and F1 scores as evaluation metrics. RESULTS We use all available discharge summaries of stroke cases in MIMIC-III. The annotated NIHSS corpus contains 312 cases, 2929 scale items, 2774 scores and 2733 relations. The results show that the best F1-score of our method was 0.9006, which was attained by combining BERT-BiLSTM-CRF and Random Forest, and it outperformed the rule-based method (F1-score = 0.8098). In the end-to-end task, our method could successfully recognize the item "1b level of consciousness questions", the score "1" and their relation "('1b level of consciousness questions', '1', 'has value')" from the sentence "1b level of consciousness questions: said name = 1", while the rule-based method could not. CONCLUSIONS The two-step pipeline method we propose is an effective approach to identify NIHSS items, scores and their relations. With its help, clinical investigators can easily retrieve and access structured scale data, thereby supporting stroke-related real-world studies.
Collapse
Affiliation(s)
- Lin Yang
- Institute of Medical Information and Library, Chinese Academy of Medical Sciences/Peking Union Medical College, Beijing 100020, China; Key Laboratory of Medical Information Intelligent Technology, Chinese Academy of Medical Sciences, Beijing 100020, China
| | - Xiaoshuo Huang
- Institute of Medical Information and Library, Chinese Academy of Medical Sciences/Peking Union Medical College, Beijing 100020, China; School of Health Care Technology, Dalian Neusoft University of Information, Dalian 116023, China
| | - Jiayang Wang
- Institute of Medical Information and Library, Chinese Academy of Medical Sciences/Peking Union Medical College, Beijing 100020, China
| | - Xin Yang
- China National Clinical Research Center for Neurological Diseases, Beijing Tiantan Hospital, Capital Medical University, Beijing 100070, China; National Center for Healthcare Quality Management in Neurological Diseases, Beijing Tiantan Hospital, Capital Medical University, Beijing 100070, China
| | - Lingling Ding
- China National Clinical Research Center for Neurological Diseases, Beijing Tiantan Hospital, Capital Medical University, Beijing 100070, China; Department of Neurology, Beijing Tiantan Hospital, Capital Medical University, Beijing 100070, China
| | - Zixiao Li
- China National Clinical Research Center for Neurological Diseases, Beijing Tiantan Hospital, Capital Medical University, Beijing 100070, China; National Center for Healthcare Quality Management in Neurological Diseases, Beijing Tiantan Hospital, Capital Medical University, Beijing 100070, China; Department of Neurology, Beijing Tiantan Hospital, Capital Medical University, Beijing 100070, China
| | - Jiao Li
- Institute of Medical Information and Library, Chinese Academy of Medical Sciences/Peking Union Medical College, Beijing 100020, China; Key Laboratory of Medical Information Intelligent Technology, Chinese Academy of Medical Sciences, Beijing 100020, China.
| |
Collapse
|
10
|
Shi B, Fan R, Zhang L, Huang J, Xiong N, Vasilakos A, Wan J, Zhang L. A Joint Extraction System Based on Conditional Layer Normalization for Health Monitoring. SENSORS (BASEL, SWITZERLAND) 2023; 23:4812. [PMID: 37430725 DOI: 10.3390/s23104812] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/16/2023] [Revised: 05/10/2023] [Accepted: 05/11/2023] [Indexed: 07/12/2023]
Abstract
Natural language processing (NLP) technology has played a pivotal role in health monitoring as an important artificial intelligence method. As a key technology in NLP, relation triplet extraction is closely related to the performance of health monitoring. In this paper, a novel model is proposed for joint extraction of entities and relations, combining conditional layer normalization with the talking-head attention mechanism to strengthen the interaction between entity recognition and relation extraction. In addition, the proposed model utilizes position information to enhance the extraction accuracy of overlapping triplets. Experiments on the Baidu2019 and CHIP2020 datasets demonstrate that the proposed model can effectively extract overlapping triplets, which leads to significant performance improvements compared with baselines.
Collapse
Affiliation(s)
- Binbin Shi
- School of Biological and Chemical Engineering, Zhejiang University of Science and Technology, Hangzhou 310023, China
| | - Rongli Fan
- School of Biological and Chemical Engineering, Zhejiang University of Science and Technology, Hangzhou 310023, China
| | - Lijuan Zhang
- School of Information and Electronic Engineering, Zhejiang University of Science and Technology, Hangzhou 310023, China
| | - Jie Huang
- School of Information and Electronic Engineering, Zhejiang University of Science and Technology, Hangzhou 310023, China
| | - Neal Xiong
- Department of Computer Science, Mathematics Sul Ross State University, Alpine, TX 79830, USA
| | | | - Jian Wan
- School of Information and Electronic Engineering, Zhejiang University of Science and Technology, Hangzhou 310023, China
| | - Lei Zhang
- School of Information and Electronic Engineering, Zhejiang University of Science and Technology, Hangzhou 310023, China
| |
Collapse
|
11
|
Vasilakes J, Georgiadis P, Nguyen NT, Miwa M, Ananiadou S. Contextualized medication event extraction with levitated markers. J Biomed Inform 2023; 141:104347. [PMID: 37030658 DOI: 10.1016/j.jbi.2023.104347] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2023] [Accepted: 03/23/2023] [Indexed: 04/09/2023]
Abstract
Automatic extraction of patient medication histories from free-text clinical notes can increase the amount of relevant information to clinicians for developing treatment plans. In addition to detecting medication events, clinical text mining systems must also be able to predict event context, such as negation, uncertainty, and time of occurrence, in order to construct accurate patient timelines. Towards this goal, we introduce Levitated Context Markers (LCMs), a novel transformer-based model for contextualized event extraction. LCMs are an adaptation of levitated markers -originally developed for relation extraction- that allow pretrained transformer models to utilize global input representations while also focusing on event-related subspans using a sparse attention mechanism. In addition to outperforming a strong baseline model on the Contextualized Medication Event Dataset, we show that LCMs' sparse attention can provide interpretable predictions by detecting relevant context cues in an unsupervised manner.
Collapse
|
12
|
Kaplar A, Stošović M, Kaplar A, Brković V, Naumović R, Kovačević A. Evaluation of clinical named entity recognition methods for Serbian electronic health records. Int J Med Inform 2022; 164:104805. [PMID: 35653828 DOI: 10.1016/j.ijmedinf.2022.104805] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2022] [Revised: 05/06/2022] [Accepted: 05/22/2022] [Indexed: 11/25/2022]
Abstract
BACKGROUND AND OBJECTIVES The importance of clinical natural language processing (NLP) has increased with the adoption of electronic health records (EHRs). One of the critical tasks in clinical NLP is named entity recognition (NER). Clinical NER in the Serbian language is a severely under-researched area. The few approaches that have been proposed so far are based on rules or machine-learning models with hand-crafted features, while current state-of-the-art models have not been explored. The objective of this paper is to assess the performance of state-of-the-art NER methods on clinical narratives in the Serbian language. MATERIALS AND METHODS We designed an experimental setup for a comprehensive evaluation of state-of-the-art NER models. The gold standard corpus we used for the evaluation is comprised of discharge summaries from the Clinic for Nephrology at the University Clinical Center of Serbia. The following models were evaluated: conditional random fields (CRF), multilingual transformers (BERT Multilingual and XLM RoBERTa), and long short-term memory (LSTM) recurrent neural networks, and their ensembles. In addition, we investigated the necessity of the pretraining task of transformer based models and the use of pretrained word embeddings with LSTM model. RESULTS Our results show that individually CRF had the best precision, the pretrained BERT Multilingual model had the best recall values, and the LSTM model had the best F1 score. The best performance was achieved by combining the existing models in a majority voting ensemble with an F1 score of 0.892. The presented results are similar to the inter annotator agreement on our gold standard corpus and are comparable to existing state-of-the-art results for clinical NER reported in literature. CONCLUSION Existing state-of-the-art models can provide viable results for clinical named entity recognition when applied to languages with the complexity of the Serbian language without major modifications.
Collapse
Affiliation(s)
- Aleksandar Kaplar
- Faculty of Technical Sciences, University of Novi Sad, Novi Sad, Serbia
| | - Milan Stošović
- Clinic of Nephrology, University Clinical Center of Serbia, Belgrade, Serbia
| | - Aleksandra Kaplar
- Faculty of Technical Sciences, University of Novi Sad, Novi Sad, Serbia
| | - Voin Brković
- Clinic of Nephrology, University Clinical Center of Serbia, Belgrade, Serbia
| | - Radomir Naumović
- Clinic of Nephrology, University Clinical Center of Serbia, Belgrade, Serbia
| | | |
Collapse
|
13
|
Current Approaches and Applications in Natural Language Processing. APPLIED SCIENCES-BASEL 2022. [DOI: 10.3390/app12104859] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Artificial Intelligence has gained a lot of popularity in recent years thanks to the advent of, mainly, Deep Learning techniques [...]
Collapse
|
14
|
Privacy-Preserving Mimic Models for clinical Named Entity Recognition in French. J Biomed Inform 2022; 130:104073. [DOI: 10.1016/j.jbi.2022.104073] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2021] [Revised: 02/09/2022] [Accepted: 04/07/2022] [Indexed: 11/18/2022]
|
15
|
Ebbehoj A, Thunbo MØ, Andersen OE, Glindtvad MV, Hulman A. Transfer learning for non-image data in clinical research: A scoping review. PLOS DIGITAL HEALTH 2022; 1:e0000014. [PMID: 36812540 PMCID: PMC9931256 DOI: 10.1371/journal.pdig.0000014] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/05/2021] [Accepted: 12/15/2021] [Indexed: 01/14/2023]
Abstract
BACKGROUND Transfer learning is a form of machine learning where a pre-trained model trained on a specific task is reused as a starting point and tailored to another task in a different dataset. While transfer learning has garnered considerable attention in medical image analysis, its use for clinical non-image data is not well studied. Therefore, the objective of this scoping review was to explore the use of transfer learning for non-image data in the clinical literature. METHODS AND FINDINGS We systematically searched medical databases (PubMed, EMBASE, CINAHL) for peer-reviewed clinical studies that used transfer learning on human non-image data. We included 83 studies in the review. More than half of the studies (63%) were published within 12 months of the search. Transfer learning was most often applied to time series data (61%), followed by tabular data (18%), audio (12%) and text (8%). Thirty-three (40%) studies applied an image-based model to non-image data after transforming data into images (e.g. spectrograms). Twenty-nine (35%) studies did not have any authors with a health-related affiliation. Many studies used publicly available datasets (66%) and models (49%), but fewer shared their code (27%). CONCLUSIONS In this scoping review, we have described current trends in the use of transfer learning for non-image data in the clinical literature. We found that the use of transfer learning has grown rapidly within the last few years. We have identified studies and demonstrated the potential of transfer learning in clinical research in a wide range of medical specialties. More interdisciplinary collaborations and the wider adaption of reproducible research principles are needed to increase the impact of transfer learning in clinical research.
Collapse
Affiliation(s)
- Andreas Ebbehoj
- Department of Endocrinology and Internal Medicine, Aarhus University Hospital, Denmark
- Department of Clinical Medicine, Aarhus University, Denmark
| | | | | | | | - Adam Hulman
- Steno Diabetes Center Aarhus, Aarhus University Hospital, Denmark
- * E-mail:
| |
Collapse
|
16
|
Zhang X, Gao F, Zhou L, Jing S, Wang Z, Wang Y, Miao S, Zhang X, Guo J, Shan T, Liu Y. Fine-Grained Drug Interaction Extraction Based on Entity Pair Calibration and Pre-Training Model for Chinese Drug Instructions. INT J SEMANT WEB INF 2022. [DOI: 10.4018/ijswis.307908] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Existing pharmaceutical information extraction research often focus on standalone entity or relationship identification tasks over drug instructions. There is a lack of a holistic solution for drug knowledge extraction. Moreover, current methods perform poorly in extracting fine-grained interaction relations from drug instructions. To solve these problems, this paper proposes an information extraction framework for drug instructions. The framework proposes deep learning models with fine-tuned pre-training models for entity recognition and relation extraction, in addition, it incorporates an novel entity pair calibration process to promote the performance for fine-grained relation extraction. The framework experiments on more than 60k Chinese drug description sentences from 4000 drug instructions. Empirical results show that the framework can successfully identify drug related entities (F1 ≥ 0.95) and their relations (F1 ≥ 0.83) from the realistic dataset, and the entity pair calibration plays an important role (~5% F1 score improvement) in extracting fine-grained relations.
Collapse
Affiliation(s)
| | - Feng Gao
- Wuhan University of Science and Technology, China
| | | | | | | | | | | | | | | | - Tao Shan
- Nanjing Medical University, China
| | - Yun Liu
- Nanjing Medical University, China
| |
Collapse
|