1
|
Serhan M, Psihogios A, Kabir N, Bota AB, Mithani SS, Smith DP, Zhu DT, Greyson D, Wilson S, Fell D, Top KA, Bettinger JA, Wilson K. A scoping review of active, participant centred, digital adverse events following immunization (AEFI) surveillance of WHO approved COVID-19 vaccines: A Canadian immunization Research Network study. Hum Vaccin Immunother 2024; 20:2293550. [PMID: 38374618 PMCID: PMC10880498 DOI: 10.1080/21645515.2023.2293550] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2023] [Accepted: 12/08/2023] [Indexed: 02/21/2024] Open
Abstract
This scoping review examines the role of digital solutions in active, participant-centered surveillance of adverse events following initial release of COVID-19 vaccines. The goals of this paper were to examine the existing literature surrounding digital solutions and technology used for active, participant centered, AEFI surveillance of novel COVID-19 vaccines approved by WHO. This paper also aimed to identify gaps in literature surrounding digital, active, participant centered AEFI surveillance systems and to identify and describe the core components of active, participant centered, digital surveillance systems being used for post-market AEFI surveillance of WHO approved COVID-19 vaccines, with a focus on the digital solutions and technology being used, the type of AEFI detected, and the populations under surveillance. The findings highlight the need for customized surveillance systems based on local contexts and the lessons learned to improve future vaccine monitoring and pandemic preparedness.
Collapse
Affiliation(s)
- Mohamed Serhan
- Clinical Epidemiology Program, Ottawa Hospital Research Institute, Ottawa, Canada
| | - Athanasios Psihogios
- Clinical Epidemiology Program, Ottawa Hospital Research Institute, Ottawa, Canada
| | - Nooh Kabir
- Clinical Epidemiology Program, Ottawa Hospital Research Institute, Ottawa, Canada
| | - A. Brianne Bota
- Clinical Epidemiology Program, Ottawa Hospital Research Institute, Ottawa, Canada
| | - Salima S. Mithani
- Clinical Epidemiology Program, Ottawa Hospital Research Institute, Ottawa, Canada
| | - David P. Smith
- Department of Cellular and Molecular Medicine, University of Ottawa, Ottawa, Canada
- Institute of Mental Health Research, University of Ottawa, Ottawa, Canada
| | - David T. Zhu
- Clinical Epidemiology Program, Ottawa Hospital Research Institute, Ottawa, Canada
- Yale School of Public Health, Yale University, New Haven, CT, USA
| | - Devon Greyson
- School of Population and Public Health, University of British Columbia, Vancouver, Canada
| | - Sarah Wilson
- Health Protection, Public Health Ontario, Toronto, ON, Canada
- Dalla Lana School of Public Health, University of Toronto, Toronto, Canada
- Populations & Public Health Research Program, ICES, Toronto, ON, Canada
| | - Deshayne Fell
- Department of Pediatrics, Children’s Hospital of Eastern Ontario (CHEO) Research Institute, Ottawa, Canada
- School of Epidemiology and Public Health, University of Ottawa, Ottawa, Ontario, Canada
| | - Karina A. Top
- Department of Pediatrics, University of Alberta, Edmonton, AB, Canada
- Departments of Pediatrics and Community Health & Epidemiology, Dalhousie University, Halifax, Nova Scotia, Canada
| | - Julie A. Bettinger
- Vaccine Evaluation Center, Department of Pediatrics, BC Children’s Hospital Research Institute, University of British Columbia, Vancouver, Canada
| | - Kumanan Wilson
- Clinical Epidemiology Program, Ottawa Hospital Research Institute, Ottawa, Canada
- School of Epidemiology and Public Health, University of Ottawa, Ottawa, Ontario, Canada
- Department of Medicine, University of Ottawa, Ottawa, Canada
- O’Neill Institute for National and Global Health Law, Georgetown University
- Bruyère Research Institute, Ottawa, Canada
| |
Collapse
|
2
|
van Buchem MM, de Hond AAH, Fanconi C, Shah V, Schuessler M, Kant IMJ, Steyerberg EW, Hernandez-Boussard T. Applying natural language processing to patient messages to identify depression concerns in cancer patients. J Am Med Inform Assoc 2024; 31:2255-2262. [PMID: 39018490 DOI: 10.1093/jamia/ocae188] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2024] [Revised: 06/03/2024] [Accepted: 07/09/2024] [Indexed: 07/19/2024] Open
Abstract
OBJECTIVE This study aims to explore and develop tools for early identification of depression concerns among cancer patients by leveraging the novel data source of messages sent through a secure patient portal. MATERIALS AND METHODS We developed classifiers based on logistic regression (LR), support vector machines (SVMs), and 2 Bidirectional Encoder Representations from Transformers (BERT) models (original and Reddit-pretrained) on 6600 patient messages from a cancer center (2009-2022), annotated by a panel of healthcare professionals. Performance was compared using AUROC scores, and model fairness and explainability were examined. We also examined correlations between model predictions and depression diagnosis and treatment. RESULTS BERT and RedditBERT attained AUROC scores of 0.88 and 0.86, respectively, compared to 0.79 for LR and 0.83 for SVM. BERT showed bigger differences in performance across sex, race, and ethnicity than RedditBERT. Patients who sent messages classified as concerning had a higher chance of receiving a depression diagnosis, a prescription for antidepressants, or a referral to the psycho-oncologist. Explanations from BERT and RedditBERT differed, with no clear preference from annotators. DISCUSSION We show the potential of BERT and RedditBERT in identifying depression concerns in messages from cancer patients. Performance disparities across demographic groups highlight the need for careful consideration of potential biases. Further research is needed to address biases, evaluate real-world impacts, and ensure responsible integration into clinical settings. CONCLUSION This work represents a significant methodological advancement in the early identification of depression concerns among cancer patients. Our work contributes to a route to reduce clinical burden while enhancing overall patient care, leveraging BERT-based models.
Collapse
Affiliation(s)
- Marieke M van Buchem
- Department of Medicine (Biomedical Informatics), Stanford University, Stanford, CA 94305, United States
- Clinical Artificial Intelligence Implementation and Research Lab (CAIRELab), Leiden University Medical Center, Leiden 2333ZN, The Netherlands
| | - Anne A H de Hond
- Department of Medicine (Biomedical Informatics), Stanford University, Stanford, CA 94305, United States
- Julius Centre for Health Sciences and Primary Care, University Medical Center, Utrecht 3584CX, The Netherlands
| | - Claudio Fanconi
- Department of Medicine (Biomedical Informatics), Stanford University, Stanford, CA 94305, United States
- Department of Information Technology and Electrical Engineering, ETH Zürich, Zürich 8092, Switzerland
| | - Vaibhavi Shah
- Department of Medicine (Biomedical Informatics), Stanford University, Stanford, CA 94305, United States
| | - Max Schuessler
- Department of Biomedical Data Science, Stanford University, Stanford, CA 94305, United States
| | - Ilse M J Kant
- Department of Digital Health, University Medical Center Utrecht, Utrecht 3584CX, The Netherlands
| | - Ewout W Steyerberg
- Clinical Artificial Intelligence Implementation and Research Lab (CAIRELab), Leiden University Medical Center, Leiden 2333ZN, The Netherlands
- Department of Biomedical Data Sciences, Leiden University Medical Center, Leiden 2333ZN, The Netherlands
| | - Tina Hernandez-Boussard
- Department of Medicine (Biomedical Informatics), Stanford University, Stanford, CA 94305, United States
- Department of Biomedical Data Science, Stanford University, Stanford, CA 94305, United States
| |
Collapse
|
3
|
Hu Y, Chen Q, Du J, Peng X, Keloth VK, Zuo X, Zhou Y, Li Z, Jiang X, Lu Z, Roberts K, Xu H. Improving large language models for clinical named entity recognition via prompt engineering. J Am Med Inform Assoc 2024; 31:1812-1820. [PMID: 38281112 PMCID: PMC11339492 DOI: 10.1093/jamia/ocad259] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2023] [Revised: 12/15/2023] [Accepted: 12/26/2023] [Indexed: 01/29/2024] Open
Abstract
IMPORTANCE The study highlights the potential of large language models, specifically GPT-3.5 and GPT-4, in processing complex clinical data and extracting meaningful information with minimal training data. By developing and refining prompt-based strategies, we can significantly enhance the models' performance, making them viable tools for clinical NER tasks and possibly reducing the reliance on extensive annotated datasets. OBJECTIVES This study quantifies the capabilities of GPT-3.5 and GPT-4 for clinical named entity recognition (NER) tasks and proposes task-specific prompts to improve their performance. MATERIALS AND METHODS We evaluated these models on 2 clinical NER tasks: (1) to extract medical problems, treatments, and tests from clinical notes in the MTSamples corpus, following the 2010 i2b2 concept extraction shared task, and (2) to identify nervous system disorder-related adverse events from safety reports in the vaccine adverse event reporting system (VAERS). To improve the GPT models' performance, we developed a clinical task-specific prompt framework that includes (1) baseline prompts with task description and format specification, (2) annotation guideline-based prompts, (3) error analysis-based instructions, and (4) annotated samples for few-shot learning. We assessed each prompt's effectiveness and compared the models to BioClinicalBERT. RESULTS Using baseline prompts, GPT-3.5 and GPT-4 achieved relaxed F1 scores of 0.634, 0.804 for MTSamples and 0.301, 0.593 for VAERS. Additional prompt components consistently improved model performance. When all 4 components were used, GPT-3.5 and GPT-4 achieved relaxed F1 socres of 0.794, 0.861 for MTSamples and 0.676, 0.736 for VAERS, demonstrating the effectiveness of our prompt framework. Although these results trail BioClinicalBERT (F1 of 0.901 for the MTSamples dataset and 0.802 for the VAERS), it is very promising considering few training samples are needed. DISCUSSION The study's findings suggest a promising direction in leveraging LLMs for clinical NER tasks. However, while the performance of GPT models improved with task-specific prompts, there's a need for further development and refinement. LLMs like GPT-4 show potential in achieving close performance to state-of-the-art models like BioClinicalBERT, but they still require careful prompt engineering and understanding of task-specific knowledge. The study also underscores the importance of evaluation schemas that accurately reflect the capabilities and performance of LLMs in clinical settings. CONCLUSION While direct application of GPT models to clinical NER tasks falls short of optimal performance, our task-specific prompt framework, incorporating medical knowledge and training samples, significantly enhances GPT models' feasibility for potential clinical applications.
Collapse
Affiliation(s)
- Yan Hu
- McWilliams School of Biomedical Informatics, Houston, TX, United States
| | - Qingyu Chen
- Section of Biomedical Informatics and Data Science, School of Medicine, Yale University, New Haven, CT, United States
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, United States
| | - Jingcheng Du
- McWilliams School of Biomedical Informatics, Houston, TX, United States
| | - Xueqing Peng
- Section of Biomedical Informatics and Data Science, School of Medicine, Yale University, New Haven, CT, United States
| | - Vipina Kuttichi Keloth
- Section of Biomedical Informatics and Data Science, School of Medicine, Yale University, New Haven, CT, United States
| | - Xu Zuo
- McWilliams School of Biomedical Informatics, Houston, TX, United States
| | - Yujia Zhou
- McWilliams School of Biomedical Informatics, Houston, TX, United States
| | - Zehan Li
- McWilliams School of Biomedical Informatics, Houston, TX, United States
| | - Xiaoqian Jiang
- McWilliams School of Biomedical Informatics, Houston, TX, United States
| | - Zhiyong Lu
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, United States
| | - Kirk Roberts
- McWilliams School of Biomedical Informatics, Houston, TX, United States
| | - Hua Xu
- Section of Biomedical Informatics and Data Science, School of Medicine, Yale University, New Haven, CT, United States
| |
Collapse
|
4
|
Lu Z, Peng Y, Cohen T, Ghassemi M, Weng C, Tian S. Large language models in biomedicine and health: current research landscape and future directions. J Am Med Inform Assoc 2024; 31:1801-1811. [PMID: 39169867 PMCID: PMC11339542 DOI: 10.1093/jamia/ocae202] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2024] [Indexed: 08/23/2024] Open
Affiliation(s)
- Zhiyong Lu
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, United States
| | - Yifan Peng
- Department of Population Health Sciences, Weill Cornell Medicine, New York, NY 10065, United States
| | - Trevor Cohen
- Department of Biomedical Informatics and Medical Education, University of Washington, Seattle, WA 98195, United States
| | - Marzyeh Ghassemi
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA 02139, United States
| | - Chunhua Weng
- Department of Biomedical Informatics, Columbia University, New York, NY 10032, United States
| | - Shubo Tian
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, United States
| |
Collapse
|
5
|
Li Y, Li J, He J, Tao C. AE-GPT: Using Large Language Models to extract adverse events from surveillance reports-A use case with influenza vaccine adverse events. PLoS One 2024; 19:e0300919. [PMID: 38512919 PMCID: PMC10956752 DOI: 10.1371/journal.pone.0300919] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2023] [Accepted: 03/06/2024] [Indexed: 03/23/2024] Open
Abstract
Though Vaccines are instrumental in global health, mitigating infectious diseases and pandemic outbreaks, they can occasionally lead to adverse events (AEs). Recently, Large Language Models (LLMs) have shown promise in effectively identifying and cataloging AEs within clinical reports. Utilizing data from the Vaccine Adverse Event Reporting System (VAERS) from 1990 to 2016, this study particularly focuses on AEs to evaluate LLMs' capability for AE extraction. A variety of prevalent LLMs, including GPT-2, GPT-3 variants, GPT-4, and Llama2, were evaluated using Influenza vaccine as a use case. The fine-tuned GPT 3.5 model (AE-GPT) stood out with a 0.704 averaged micro F1 score for strict match and 0.816 for relaxed match. The encouraging performance of the AE-GPT underscores LLMs' potential in processing medical data, indicating a significant stride towards advanced AE detection, thus presumably generalizable to other AE extraction tasks.
Collapse
Affiliation(s)
- Yiming Li
- McWilliams School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, United States of America
| | - Jianfu Li
- Department of Artificial Intelligence and Informatics, Mayo Clinic, Jacksonville, FL, United States of America
| | - Jianping He
- McWilliams School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, United States of America
| | - Cui Tao
- Department of Artificial Intelligence and Informatics, Mayo Clinic, Jacksonville, FL, United States of America
| |
Collapse
|
6
|
Modi S, Kasmiran KA, Mohd Sharef N, Sharum MY. Extracting adverse drug events from clinical Notes: A systematic review of approaches used. J Biomed Inform 2024; 151:104603. [PMID: 38331081 DOI: 10.1016/j.jbi.2024.104603] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2023] [Revised: 01/31/2024] [Accepted: 02/01/2024] [Indexed: 02/10/2024]
Abstract
BACKGROUND An adverse drug event (ADE) is any unfavorable effect that occurs due to the use of a drug. Extracting ADEs from unstructured clinical notes is essential to biomedical text extraction research because it helps with pharmacovigilance and patient medication studies. OBJECTIVE From the considerable amount of clinical narrative text, natural language processing (NLP) researchers have developed methods for extracting ADEs and their related attributes. This work presents a systematic review of current methods. METHODOLOGY Two biomedical databases have been searched from June 2022 until December 2023 for relevant publications regarding this review, namely the databases PubMed and Medline. Similarly, we searched the multi-disciplinary databases IEEE Xplore, Scopus, ScienceDirect, and the ACL Anthology. We adopted the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) 2020 statement guidelines and recommendations for reporting systematic reviews in conducting this review. Initially, we obtained 5,537 articles from the search results from the various databases between 2015 and 2023. Based on predefined inclusion and exclusion criteria for article selection, 100 publications have undergone full-text review, of which we consider 82 for our analysis. RESULTS We determined the general pattern for extracting ADEs from clinical notes, with named entity recognition (NER) and relation extraction (RE) being the dual tasks considered. Researchers that tackled both NER and RE simultaneously have approached ADE extraction as a "pipeline extraction" problem (n = 22), as a "joint task extraction" problem (n = 7), and as a "multi-task learning" problem (n = 6), while others have tackled only NER (n = 27) or RE (n = 20). We further grouped the reviews based on the approaches for data extraction, namely rule-based (n = 8), machine learning (n = 11), deep learning (n = 32), comparison of two or more approaches (n = 11), hybrid (n = 12) and large language models (n = 8). The most used datasets are MADE 1.0, TAC 2017 and n2c2 2018. CONCLUSION Extracting ADEs is crucial, especially for pharmacovigilance studies and patient medications. This survey showcases advances in ADE extraction research, approaches, datasets, and state-of-the-art performance in them. Challenges and future research directions are highlighted. We hope this review will guide researchers in gaining background knowledge and developing more innovative ways to address the challenges.
Collapse
Affiliation(s)
- Salisu Modi
- Faculty of Computer Science and Information Technology, Universiti Putra Malaysia, Selangor, Malaysia; Department of Computer Science, Sokoto State University, Sokoto, Nigeria.
| | - Khairul Azhar Kasmiran
- Faculty of Computer Science and Information Technology, Universiti Putra Malaysia, Selangor, Malaysia.
| | - Nurfadhlina Mohd Sharef
- Faculty of Computer Science and Information Technology, Universiti Putra Malaysia, Selangor, Malaysia.
| | - Mohd Yunus Sharum
- Faculty of Computer Science and Information Technology, Universiti Putra Malaysia, Selangor, Malaysia.
| |
Collapse
|
7
|
Chen D, Zhang R. COVID-19 Vaccine Adverse Event Detection Based on Multi-Label Classification With Various Label Selection Strategies. IEEE J Biomed Health Inform 2023; 27:4192-4203. [PMID: 37418397 DOI: 10.1109/jbhi.2023.3292252] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/09/2023]
Abstract
Analyzing massive VAERS reports without medical context may lead to incorrect conclusions about vaccine adverse events (VAE). Facilitating VAE detection promotes continual safety improvement for new vaccines. This study proposes a multi-label classification method with various term-and topic-based label selection strategies to improve the accuracy and efficiency of VAE detection. Topic modeling methods are first used to generate rule-based label dependencies from Medical Dictionary for Regulatory Activities terms in VAE reports with two hyper-parameters. Multiple label selection strategies, namely one-vs-rest (OvsR), problem transformation (PT), algorithm adaption (AA), and deep learning (DL) methods, are used in multi-label classification to examine the model performance, respectively. Experimental results indicated that the topic-based PT methods improve the accuracy by up to 33.69% using a COVID-19 VAE reporting data set, which improves the robustness and interpretability of our models. In addition, the topic-based OvsR methods achieve an optimal accuracy of up to 98.88%. The accuracy of the AA methods with topic-based labels increased by up to 87.36%. By contrast, the state-of-art LSTM- and BERT-based DL methods have relatively poor performance with accuracy rates of 71.89% and 64.63%, respectively. Our findings reveal that the proposed method effectively improves the model accuracy and strengthens VAE interpretability by using different label selection strategies and domain knowledge in multi-label classification for VAE detection.
Collapse
|
8
|
Kim S, Kang T, Chung TK, Choi Y, Hong Y, Jung K, Lee H. Automatic Extraction of Comprehensive Drug Safety Information from Adverse Drug Event Narratives in the Korea Adverse Event Reporting System Using Natural Language Processing Techniques. Drug Saf 2023; 46:781-795. [PMID: 37330415 PMCID: PMC10344995 DOI: 10.1007/s40264-023-01323-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/22/2023] [Indexed: 06/19/2023]
Abstract
INTRODUCTION Concerns have been raised over the quality of drug safety information, particularly data completeness, collected through spontaneous reporting systems (SRS), although regulatory agencies routinely use SRS data to guide their pharmacovigilance programs. We expected that collecting additional drug safety information from adverse event (ADE) narratives and incorporating it into the SRS database would improve data completeness. OBJECTIVE The aims of this study were to define the extraction of comprehensive drug safety information from ADE narratives reported through the Korea Adverse Event Reporting System (KAERS) as natural language processing (NLP) tasks and to provide baseline models for the defined tasks. METHODS This study used ADE narratives and structured drug safety information from individual case safety reports (ICSRs) reported through KAERS between 1 January 2015 and 31 December 2019. We developed the annotation guideline for the extraction of comprehensive drug safety information from ADE narratives based on the International Conference on Harmonisation (ICH) E2B(R3) guideline and manually annotated 3723 ADE narratives. Then, we developed a domain-specific Korean Bidirectional Encoder Representations from Transformers (KAERS-BERT) model using 1.2 million ADE narratives in KAERS and provided baseline models for the task we defined. In addition, we performed an ablation experiment to investigate whether named entity recognition (NER) models were improved when a training dataset contained more diverse ADE narratives. RESULTS We defined 21 types of word entities, six types of entity labels, and 49 types of relations to formulate the extraction of comprehensive drug safety information as NLP tasks. We obtained a total of 86,750 entities, 81,828 entity labels, and 45,107 relations from manually annotated ADE narratives. The KAERS-BERT model achieved F1-scores of 83.81 and 76.62% on the NER and sentence extraction tasks, respectively, while outperforming other baseline models on all the NLP tasks we defined except the sentence extraction task. Finally, utilizing the NER model for extracting drug safety information from ADE narratives resulted in an average increase of 3.24% in data completeness for KAERS structured data fields. CONCLUSIONS We formulated the extraction of comprehensive drug safety information from ADE narratives as NLP tasks and developed the annotated corpus and strong baseline models for the tasks. The annotated corpus and models for extracting comprehensive drug safety information can improve the data quality of an SRS database.
Collapse
Affiliation(s)
- Siun Kim
- Biomedical Research Institute, Seoul National University Hospital, Seoul, South Korea
| | - Taegwan Kang
- Department of Electrical and Computer Engineering, Seoul National University, Room 1005 Building 301, 1 Gwanak-ro, Gwanak-gu, Seoul, 151-744, Republic of Korea
- LG AI Research, 128, Yeoui-daero, Yeongdeungpo-gu, Seoul, South Korea
| | - Tae Kyu Chung
- Department of Applied Bioengineering, Graduate School of Convergence Science and Technology, Seoul National University, Seoul, South Korea
| | - Yoona Choi
- Department of Applied Bioengineering, Graduate School of Convergence Science and Technology, Seoul National University, Seoul, South Korea
| | - YeSol Hong
- Department of Molecular Medicine and Biopharmaceutical Sciences, Graduate School of Convergence Science and Technology, Seoul National University, Seoul, South Korea
| | - Kyomin Jung
- Department of Electrical and Computer Engineering, Seoul National University, Room 1005 Building 301, 1 Gwanak-ro, Gwanak-gu, Seoul, 151-744, Republic of Korea.
| | - Howard Lee
- Department of Applied Bioengineering, Graduate School of Convergence Science and Technology, Seoul National University, Seoul, South Korea.
- Department of Molecular Medicine and Biopharmaceutical Sciences, Graduate School of Convergence Science and Technology, Seoul National University, Seoul, South Korea.
- Department of Clinical Pharmacology and Therapeutics, Seoul National University College of Medicine and Hospital, 103 Daehak-ro, Jongno-gu, Seoul, 110-799, South Korea.
- Advanced Institutes of Convergence Technology, Suwon, 16229, South Korea.
| |
Collapse
|
9
|
Mo Q, Zhang T, Wu J, Wang L, Luo J. Identification of thrombopoiesis inducer based on a hybrid deep neural network model. Thromb Res 2023; 226:36-50. [PMID: 37119555 DOI: 10.1016/j.thromres.2023.04.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2022] [Revised: 03/13/2023] [Accepted: 04/11/2023] [Indexed: 05/01/2023]
Abstract
Thrombocytopenia is a common haematological problem worldwide. Currently, there are no relatively safe and effective agents for the treatment of thrombocytopenia. To address this challenge, we propose a computational method that enables the discovery of novel drug candidates with haematopoietic activities. Based on different types of molecular representations, three deep learning (DL) algorithms, namely recurrent neural networks (RNNs), deep neural networks (DNNs), and hybrid neural networks (RNNs+DNNs), were used to develop classification models to distinguish between active and inactive compounds. The evaluation results illustrated that the hybrid DL model exhibited the best prediction performance, with an accuracy of 97.8 % and Matthews correlation coefficient of 0.958 on the test dataset. Subsequently, we performed drug discovery screening based on the hybrid DL model and identified a compound from the FDA-approved drug library that was structurally divergent from conventional drugs and showed a potential therapeutic action against thrombocytopenia. The novel drug candidate wedelolactone significantly promoted megakaryocyte differentiation in vitro and increased platelet levels and megakaryocyte differentiation in irradiated mice with no systemic toxicity. Overall, our work demonstrates how artificial intelligence can be used to discover novel drugs against thrombocytopenia.
Collapse
Affiliation(s)
- Qi Mo
- Department of Pharmacology, School of Pharmacy, Southwest Medical University, Luzhou 646000, China
| | - Ting Zhang
- Department of Pharmacology, School of Pharmacy, Southwest Medical University, Luzhou 646000, China
| | - Jianming Wu
- Basic Medical College, Southwest Medical University, Luzhou 646000, China.
| | - Long Wang
- Department of Pharmacology, School of Pharmacy, Southwest Medical University, Luzhou 646000, China.
| | - Jiesi Luo
- Basic Medical College, Southwest Medical University, Luzhou 646000, China; State Key Laboratory of Southwestern Chinese Medicine Resources, Chengdu University of Traditional Chinese Medicine, Chengdu 610075, China.
| |
Collapse
|
10
|
Flora J, Khan W, Jin J, Jin D, Hussain A, Dajani K, Khan B. Usefulness of Vaccine Adverse Event Reporting System for Machine-Learning Based Vaccine Research: A Case Study for COVID-19 Vaccines. Int J Mol Sci 2022; 23:ijms23158235. [PMID: 35897804 PMCID: PMC9368306 DOI: 10.3390/ijms23158235] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2022] [Revised: 07/06/2022] [Accepted: 07/21/2022] [Indexed: 02/04/2023] Open
Abstract
Usefulness of Vaccine-Adverse Event-Reporting System (VAERS) data and protocols required for statistical analyses were pinpointed with a set of recommendations for the application of machine learning modeling or exploratory analyses on VAERS data with a case study of COVID-19 vaccines (Pfizer-BioNTech, Moderna, Janssen). A total of 262,454 duplicate reports (29%) from 905,976 reports were identified, which were merged into a total of 643,522 distinct reports. A customized online survey was also conducted providing 211 reports. A total of 20 highest reported adverse events were first identified. Differences in results after applying various machine learning algorithms (association rule mining, self-organizing maps, hierarchical clustering, bipartite graphs) on VAERS data were noticed. Moderna reports showed injection-site-related AEs of higher frequencies by 15.2%, consistent with the online survey (12% higher reporting rate for pain in the muscle for Moderna compared to Pfizer-BioNTech). AEs {headache, pyrexia, fatigue, chills, pain, dizziness} constituted >50% of the total reports. Chest pain in male children reports was 295% higher than in female children reports. Penicillin and sulfa were of the highest frequencies (22%, and 19%, respectively). Analysis of uncleaned VAERS data demonstrated major differences from the above (7% variations). Spelling/grammatical mistakes in allergies were discovered (e.g., ~14% reports with incorrect spellings for penicillin).
Collapse
Affiliation(s)
- James Flora
- Department of Computer Science and Engineering, California State University San Bernardino, 5500 University Parkway, San Bernardino, CA 92407, USA; (J.F.); (J.J.); (K.D.)
| | - Wasiq Khan
- School of Computer Science and Mathematics, Liverpool John Moores University, Liverpool L3 3AF, UK;
| | - Jennifer Jin
- Department of Computer Science and Engineering, California State University San Bernardino, 5500 University Parkway, San Bernardino, CA 92407, USA; (J.F.); (J.J.); (K.D.)
| | - Daniel Jin
- Division of Vascular & Interventional Radiology, Department of Radiology, Loma Linda University Medical Center, Loma Linda, CA 92354, USA;
| | - Abir Hussain
- Department of Electrical Engineering, University of Sharjah, Sharjah P.O. Box 27272, United Arab Emirates;
| | - Khalil Dajani
- Department of Computer Science and Engineering, California State University San Bernardino, 5500 University Parkway, San Bernardino, CA 92407, USA; (J.F.); (J.J.); (K.D.)
| | - Bilal Khan
- Department of Computer Science and Engineering, California State University San Bernardino, 5500 University Parkway, San Bernardino, CA 92407, USA; (J.F.); (J.J.); (K.D.)
- Institute of the Environment and Sustainability, University of California Los Angeles, Los Angeles, CA 90095, USA
- Correspondence: ; Tel.: +1-(909)-537-5428
| |
Collapse
|
11
|
Zhou S, Wang N, Wang L, Liu H, Zhang R. CancerBERT: a cancer domain-specific language model for extracting breast cancer phenotypes from electronic health records. J Am Med Inform Assoc 2022; 29:1208-1216. [PMID: 35333345 PMCID: PMC9196678 DOI: 10.1093/jamia/ocac040] [Citation(s) in RCA: 26] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2021] [Revised: 03/06/2022] [Accepted: 03/09/2022] [Indexed: 11/16/2022] Open
Abstract
OBJECTIVE Accurate extraction of breast cancer patients' phenotypes is important for clinical decision support and clinical research. This study developed and evaluated cancer domain pretrained CancerBERT models for extracting breast cancer phenotypes from clinical texts. We also investigated the effect of customized cancer-related vocabulary on the performance of CancerBERT models. MATERIALS AND METHODS A cancer-related corpus of breast cancer patients was extracted from the electronic health records of a local hospital. We annotated named entities in 200 pathology reports and 50 clinical notes for 8 cancer phenotypes for fine-tuning and evaluation. We kept pretraining the BlueBERT model on the cancer corpus with expanded vocabularies (using both term frequency-based and manually reviewed methods) to obtain CancerBERT models. The CancerBERT models were evaluated and compared with other baseline models on the cancer phenotype extraction task. RESULTS All CancerBERT models outperformed all other models on the cancer phenotyping NER task. Both CancerBERT models with customized vocabularies outperformed the CancerBERT with the original BERT vocabulary. The CancerBERT model with manually reviewed customized vocabulary achieved the best performance with macro F1 scores equal to 0.876 (95% CI, 0.873-0.879) and 0.904 (95% CI, 0.902-0.906) for exact match and lenient match, respectively. CONCLUSIONS The CancerBERT models were developed to extract the cancer phenotypes in clinical notes and pathology reports. The results validated that using customized vocabulary may further improve the performances of domain specific BERT models in clinical NLP tasks. The CancerBERT models developed in the study would further help clinical decision support.
Collapse
Affiliation(s)
- Sicheng Zhou
- Institute for Health Informatics, University of Minnesota, Minneapolis, Minnesota, USA
| | - Nan Wang
- School of Statistics, University of Minnesota, Minneapolis, Minnesota, USA
| | - Liwei Wang
- Department of AI and Informatics Research, Mayo Clinic, Rochester, Minnesota, USA
| | - Hongfang Liu
- Department of AI and Informatics Research, Mayo Clinic, Rochester, Minnesota, USA
| | - Rui Zhang
- Institute for Health Informatics, University of Minnesota, Minneapolis, Minnesota, USA.,Department of Pharmaceutical Care & Health Systems, University of Minnesota, Minneapolis, Minnesota, USA
| |
Collapse
|
12
|
Shiju A, He Z. Classifying Drug Ratings Using User Reviews with Transformer-Based Language Models. IEEE INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS. IEEE INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS 2022; 2022:163-169. [PMID: 36518748 PMCID: PMC9744636 DOI: 10.1109/ichi54592.2022.00035] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
Drug review websites such as Drugs.com provide users' textual reviews and numeric ratings of drugs. These reviews along with the ratings are used for the consumers for choosing a drug. However, the numeric ratings may not always be consistent with text reviews and purely relying on the rating score for finding positive/negative reviews may not be reliable. Automatic classification of user ratings based on textual review can create a more reliable rating for drugs. In this project, we built classification models to classify drug review ratings using textual reviews with traditional machine learning and deep learning models. Traditional machine learning models including Random Forest and Naive Bayesian classifiers were built using TF-IDF features as input. Also, transformer-based neural network models including BERT, Bio_ClinicalBERT, RoBERTa, XLNet, ELECTRA, and ALBERT were built using the raw text as input. Overall, Bio_ClinicalBERT model outperformed the other models with an overall accuracy of 87%. We further identified concepts of the Unified Medical Language System (UMLS) from the postings and analyzed their semantic types stratified by class types. This research demonstrated that transformer-based models can be used to classify drug reviews based solely on textual reviews.
Collapse
Affiliation(s)
- Akhil Shiju
- Department of Biological Sciences, Florida State University, Tallahassee, Florida, USA
| | - Zhe He
- School of Information, Florida State University, Tallahassee, Florida, USA
| |
Collapse
|
13
|
Kompa B, Hakim JB, Palepu A, Kompa KG, Smith M, Bain PA, Woloszynek S, Painter JL, Bate A, Beam AL. Artificial Intelligence Based on Machine Learning in Pharmacovigilance: A Scoping Review. Drug Saf 2022; 45:477-491. [PMID: 35579812 PMCID: PMC9883349 DOI: 10.1007/s40264-022-01176-1] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/13/2022] [Indexed: 02/07/2023]
Abstract
INTRODUCTION Artificial intelligence based on machine learning has made large advancements in many fields of science and medicine but its impact on pharmacovigilance is yet unclear. OBJECTIVE The present study conducted a scoping review of the use of artificial intelligence based on machine learning to understand how it is used for pharmacovigilance tasks, characterize differences with other fields, and identify opportunities to improve pharmacovigilance through the use of machine learning. DESIGN The PubMed, Embase, Web of Science, and IEEE Xplore databases were searched to identify articles pertaining to the use of machine learning in pharmacovigilance published from the year 2000 to September 2021. After manual screening of 7744 abstracts, a total of 393 papers met the inclusion criteria for further analysis. Extraction of key data on study design, data sources, sample size, and machine learning methodology was performed. Studies with the characteristics of good machine learning practice were defined and manual review focused on identifying studies that fulfilled these criteria and results that showed promise. RESULTS The majority of studies (53%) were focused on detecting safety signals using traditional statistical methods. Of the studies that used more recent machine learning methods, 61% used off-the-shelf techniques with minor modifications. Temporal analysis revealed that newer methods such as deep learning have shown increased use in recent years. We found only 42 studies (10%) that reflect current best practices and trends in machine learning. In the subset of 154 papers that focused on data intake and ingestion, 30 (19%) were found to incorporate the same best practices. CONCLUSION Advances from artificial intelligence have yet to fully penetrate pharmacovigilance, although recent studies show signs that this may be changing.
Collapse
Affiliation(s)
- Benjamin Kompa
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
- CAUSALab, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - Joe B Hakim
- Department of Health Sciences and Technology, Harvard-MIT, Cambridge, MA, USA
| | - Anil Palepu
- Department of Health Sciences and Technology, Harvard-MIT, Cambridge, MA, USA
| | | | - Michael Smith
- Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - Paul A Bain
- Countway Library of Medicine, Harvard Medical School, Boston, MA, USA
| | | | | | - Andrew Bate
- GlaxoSmithKline, Brentford, UK
- Department of Non-Communicable Disease Epidemiology, London School of Hygiene and Tropical Medicine, University of London, London, UK
- Department of Medicine, NYU Grossman School of Medicine, New York, NY, USA
| | - Andrew L Beam
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA.
- CAUSALab, Harvard T.H. Chan School of Public Health, Boston, MA, USA.
- Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA.
| |
Collapse
|
14
|
Gringeri M, Battini V, Cammarata G, Mosini G, Guarnieri G, Leoni C, Pozzi M, Radice S, Clementi E, Carnovale C. Herpes zoster and simplex reactivation following COVID-19 vaccination: new insights from a vaccine adverse event reporting system (VAERS) database analysis. Expert Rev Vaccines 2022; 21:675-684. [PMID: 35191364 DOI: 10.1080/14760584.2022.2044799] [Citation(s) in RCA: 23] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]
Abstract
BACKGROUND A few cases of Herpes Zoster and Simplex reactivation following COVID-19 immunization have been recently described, but the real extent of this suspected adverse event has not been elucidated yet. METHODS We performed a nested case/control study by using the U.S. Vaccine Adverse Event Reporting System database. We carried out a case-level clinical review of all Herpes reactivation cases following the administration of COVID-19 vaccines. For cases and controls, significance was set at P = 0.05, differential risk of reporting was assessed for each vaccine as reporting odds ratio and incidence was estimated based on the total number of vaccine doses administered. RESULTS Of 6,195 cases included in the analysis (5,934 and 273 reporting Herpes Zoster and Herpes Simplex, respectively) over 90% were non-serious. We found a slightly higher risk of reporting both for Zoster (ROR = 1.49) and Simplex (ROR = 1.51) infections following the Pfizer-BioNTech vaccine. The estimated incidence was approximately 0.7/100,000 and 0.03/100,000 for Zoster and Simplex, respectively. CONCLUSIONS The paucity of cases (almost all of non-serious nature) makes the potential occurrence of this adverse effect negligible from clinical standpoints, thus supporting the good safety profile of the COVID-19 vaccination, which remains strongly recommended.
Collapse
Affiliation(s)
- Michele Gringeri
- Unit of Clinical Pharmacology, Department of Biomedical and Clinical Sciences L. Sacco, "Luigi Sacco" University Hospital, University of Milan, Milano, Italy
| | - Vera Battini
- Unit of Clinical Pharmacology, Department of Biomedical and Clinical Sciences L. Sacco, "Luigi Sacco" University Hospital, University of Milan, Milano, Italy
| | - Gianluca Cammarata
- Unit of Clinical Pharmacology, Department of Biomedical and Clinical Sciences L. Sacco, "Luigi Sacco" University Hospital, University of Milan, Milano, Italy
| | - Giulia Mosini
- Unit of Clinical Pharmacology, Department of Biomedical and Clinical Sciences L. Sacco, "Luigi Sacco" University Hospital, University of Milan, Milano, Italy
| | - Greta Guarnieri
- Unit of Clinical Pharmacology, Department of Biomedical and Clinical Sciences L. Sacco, "Luigi Sacco" University Hospital, University of Milan, Milano, Italy
| | - Chiara Leoni
- Unit of Clinical Pharmacology, Department of Biomedical and Clinical Sciences L. Sacco, "Luigi Sacco" University Hospital, University of Milan, Milano, Italy
| | - Marco Pozzi
- Scientific Institute, IRCCS E. Medea, Bosisio Parini, LC, Italy
| | - Sonia Radice
- Unit of Clinical Pharmacology, Department of Biomedical and Clinical Sciences L. Sacco, "Luigi Sacco" University Hospital, University of Milan, Milano, Italy
| | - Emilio Clementi
- Unit of Clinical Pharmacology, Department of Biomedical and Clinical Sciences L. Sacco, "Luigi Sacco" University Hospital, University of Milan, Milano, Italy.,Scientific Institute, IRCCS E. Medea, Bosisio Parini, LC, Italy
| | - Carla Carnovale
- Unit of Clinical Pharmacology, Department of Biomedical and Clinical Sciences L. Sacco, "Luigi Sacco" University Hospital, University of Milan, Milano, Italy
| |
Collapse
|
15
|
Imran M, Bhatti A, King DM, Lerch M, Dietrich J, Doron G, Manlik K. Supervised Machine Learning-Based Decision Support for Signal Validation Classification. Drug Saf 2022; 45:583-596. [PMID: 35579820 PMCID: PMC9114067 DOI: 10.1007/s40264-022-01159-2] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/13/2022] [Indexed: 01/28/2023]
Abstract
INTRODUCTION Signal validation in pharmacovigilance is the process of evaluating data to decide whether evidence is sufficient to justify further assessment of a detected signal. During the signal validation process, safety experts in our organization are required to review signals of disproportionate reporting (SDRs) and classify them into one of six predefined categories. OBJECTIVE This experiment explored the extent to which predictive machine learning (ML) models can support the decision making of safety experts by accurately identifying the most appropriate predefined signal validation category. METHODS We extracted cumulative data for six medicinal products, consisting of historic SDR validations and Individual Case Safety Reports, from the company's safety database for training and testing of the ML model. We implemented a decision tree-based supervised multiclass classifier model termed Gradient Boosted Trees followed by a SHapley Additive exPlanations (SHAP) analysis to mitigate the "black box" effect of the ensemble model by identifying the key predicting features in the model. Following a retrospective analysis, a prospective experiment was conducted to test the model accuracy and user acceptance in a real-life setting. RESULTS The prediction accuracy of our ML model ranged from 83 to 86% over 3 months for the six medicinal products. The applicability of the model was confirmed by the company's safety experts. Additionally, the systematic predictions provided valuable information to the safety experts and assisted them in reviewing the SDRs efficiently and consistently. CONCLUSIONS This experiment demonstrated that it is possible to train a multiclass classification model to accurately predict signal validation categories for SDRs. More importantly, the transparency of the predictions provided by the SHAP analysis led to high acceptance by the safety experts.
Collapse
Affiliation(s)
- Muhammad Imran
- Bayer AG, Digital Transformation and Information Technology Pharma, Decision Science and Advanced Analytics for Medical Affairs, Pharmacovigilance and Regulatory Affairs, Müllerstr. 178, 13353, Berlin, Germany.
| | - Aasia Bhatti
- Bayer US LLC, Pharmaceuticals, Pharmacovigilance, Benefit-Risk Management TA Radiology, Whippany, NJ, USA
| | - David M King
- Bayer US LLC, Digital Transformation and Information Technology Pharma, Adverse Event Management, Morristown, NJ, USA
| | | | - Jürgen Dietrich
- Bayer AG, Pharmaceuticals, Pharmacovigilance, Innovation and Digitalization, Berlin, Germany
| | - Guy Doron
- Bayer AG, Pharmaceuticals, Pharmacovigilance, R&D, Data Sciences, Berlin, Germany
| | - Katrin Manlik
- Bayer AG, Pharmaceuticals, Pharmacovigilance, Data Science and Insight Generation, Berlin, Germany
| |
Collapse
|
16
|
Kalyan KS, Rajasekharan A, Sangeetha S. AMMU: A survey of transformer-based biomedical pretrained language models. J Biomed Inform 2021; 126:103982. [PMID: 34974190 DOI: 10.1016/j.jbi.2021.103982] [Citation(s) in RCA: 28] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2021] [Revised: 12/12/2021] [Accepted: 12/20/2021] [Indexed: 01/04/2023]
Abstract
Transformer-based pretrained language models (PLMs) have started a new era in modern natural language processing (NLP). These models combine the power of transformers, transfer learning, and self-supervised learning (SSL). Following the success of these models in the general domain, the biomedical research community has developed various in-domain PLMs starting from BioBERT to the latest BioELECTRA and BioALBERT models. We strongly believe there is a need for a survey paper that can provide a comprehensive survey of various transformer-based biomedical pretrained language models (BPLMs). In this survey, we start with a brief overview of foundational concepts like self-supervised learning, embedding layer and transformer encoder layers. We discuss core concepts of transformer-based PLMs like pretraining methods, pretraining tasks, fine-tuning methods, and various embedding types specific to biomedical domain. We introduce a taxonomy for transformer-based BPLMs and then discuss all the models. We discuss various challenges and present possible solutions. We conclude by highlighting some of the open issues which will drive the research community to further improve transformer-based BPLMs. The list of all the publicly available transformer-based BPLMs along with their links is provided at https://mr-nlp.github.io/posts/2021/05/transformer-based-biomedical-pretrained-language-models-list/.
Collapse
|
17
|
Chopard D, Treder MS, Corcoran P, Ahmed N, Johnson C, Busse M, Spasic I. Text Mining of Adverse Events in Clinical Trials: Deep Learning Approach. JMIR Med Inform 2021; 9:e28632. [PMID: 34951601 PMCID: PMC8742206 DOI: 10.2196/28632] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2021] [Revised: 08/01/2021] [Accepted: 11/14/2021] [Indexed: 11/28/2022] Open
Abstract
BACKGROUND Pharmacovigilance and safety reporting, which involve processes for monitoring the use of medicines in clinical trials, play a critical role in the identification of previously unrecognized adverse events or changes in the patterns of adverse events. OBJECTIVE This study aims to demonstrate the feasibility of automating the coding of adverse events described in the narrative section of the serious adverse event report forms to enable statistical analysis of the aforementioned patterns. METHODS We used the Unified Medical Language System (UMLS) as the coding scheme, which integrates 217 source vocabularies, thus enabling coding against other relevant terminologies such as the International Classification of Diseases-10th Revision, Medical Dictionary for Regulatory Activities, and Systematized Nomenclature of Medicine). We used MetaMap, a highly configurable dictionary lookup software, to identify the mentions of the UMLS concepts. We trained a binary classifier using Bidirectional Encoder Representations from Transformers (BERT), a transformer-based language model that captures contextual relationships, to differentiate between mentions of the UMLS concepts that represented adverse events and those that did not. RESULTS The model achieved a high F1 score of 0.8080, despite the class imbalance. This is 10.15 percent points lower than human-like performance but also 17.45 percent points higher than that of the baseline approach. CONCLUSIONS These results confirmed that automated coding of adverse events described in the narrative section of serious adverse event reports is feasible. Once coded, adverse events can be statistically analyzed so that any correlations with the trialed medicines can be estimated in a timely fashion.
Collapse
Affiliation(s)
- Daphne Chopard
- School of Computer Science & Informatics, Cardiff University, Cardiff, United Kingdom
| | - Matthias S Treder
- School of Computer Science & Informatics, Cardiff University, Cardiff, United Kingdom
| | - Padraig Corcoran
- School of Computer Science & Informatics, Cardiff University, Cardiff, United Kingdom
| | - Nagheen Ahmed
- Centre for Trials Research, Cardiff University, Cardiff, United Kingdom
| | - Claire Johnson
- Centre for Trials Research, Cardiff University, Cardiff, United Kingdom
| | - Monica Busse
- Centre for Trials Research, Cardiff University, Cardiff, United Kingdom
| | - Irena Spasic
- School of Computer Science & Informatics, Cardiff University, Cardiff, United Kingdom
| |
Collapse
|