1
|
La J, DuMontier C, Hassan H, Abdallah M, Edwards C, Verma K, Ferri G, Dharne M, Yildirim C, Corrigan J, Gaziano JM, Do NV, Brophy MT, Driver JA, Munshi NC, Fillmore NR. Validation of algorithms to select patients with multiple myeloma and patients initiating myeloma treatment in the national Veterans Affairs Healthcare System. Pharmacoepidemiol Drug Saf 2023; 32:558-566. [PMID: 36458420 PMCID: PMC10448707 DOI: 10.1002/pds.5579] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2022] [Revised: 11/13/2022] [Accepted: 11/23/2022] [Indexed: 12/04/2022]
Abstract
BACKGROUND We aimed to evaluate and compare the performance of multiple myeloma (MM) selection algorithms for use in Veterans Affairs (VA) research. METHODS Using the VA Corporate Data Warehouse (CDW), the VA Cancer Registry (VACR), and VA pharmacy data, we randomly selected 500 patients from 01/01/1999 to 06/01/2021 who had (1) either one MM diagnostic code OR were listed in the VACR as having MM AND (2) at least one MM treatment code. A team reviewed oncology notes for each veteran to annotate details regarding MM diagnosis and initial treatment within VA. We evaluated inter-annotator agreement and compared the performance of four published algorithms (two developed and validated external to VA data and two used in VA data). RESULTS A total of 859 patients were reviewed to obtain 500 patients who were annotated as having MM and initiating MM treatment in VA. Agreement was high among annotators for all variables: MM diagnosis (98.3% agreement, Kappa = 0.93); initial treatment in VA (91.8% agreement; Kappa = 0.77); and initial treatment classification (87.6% agreement; Kappa = 0.86). VA Algorithms were more specific and had higher PPVs than non-VA algorithms for both MM diagnosis and initial treatment in VA. We developed the "VA Recommended Algorithm," which had the highest PPV among all algorithms in identifying patients diagnosed with MM (PPV = 0.98, 95% CI = 0.95-0.99) and in identifying patients who initiated their MM treatment in VA (PPV = 0.93, 95% CI = 0.90-0.96). CONCLUSION Our VA Recommended Algorithm optimizes sensitivity and PPV for cohort selection and treatment classification.
Collapse
Affiliation(s)
- Jennifer La
- Massachusetts Veterans Epidemiology Research and Information Center (MAVERIC), Boston, Massachusetts, USA
- VA Boston CSP Center, Boston, Massachusetts, USA
- VA Boston Healthcare System, Boston, Massachusetts, USA
| | - Clark DuMontier
- New England Geriatrics Research, Education and Clinical Center, VA Boston Healthcare System, Boston, Massachusetts, USA
- Division of Aging, Brigham and Women's Hospital, Boston, Massachusetts, USA
- Harvard Medical School, Boston, Massachusetts, USA
| | - Hamza Hassan
- Boston University School of Medicine, Boston, Massachusetts, USA
- Boston Medical Center, Boston, Massachusetts, USA
| | - Maya Abdallah
- Boston University School of Medicine, Boston, Massachusetts, USA
- Boston Medical Center, Boston, Massachusetts, USA
| | - Camille Edwards
- Boston University School of Medicine, Boston, Massachusetts, USA
- Boston Medical Center, Boston, Massachusetts, USA
| | - Karina Verma
- Boston University School of Medicine, Boston, Massachusetts, USA
- Boston Medical Center, Boston, Massachusetts, USA
| | - Grace Ferri
- Boston University School of Medicine, Boston, Massachusetts, USA
- Boston Medical Center, Boston, Massachusetts, USA
| | - Mayuri Dharne
- Massachusetts Veterans Epidemiology Research and Information Center (MAVERIC), Boston, Massachusetts, USA
- VA Boston CSP Center, Boston, Massachusetts, USA
- VA Boston Healthcare System, Boston, Massachusetts, USA
| | - Cenk Yildirim
- Massachusetts Veterans Epidemiology Research and Information Center (MAVERIC), Boston, Massachusetts, USA
- VA Boston CSP Center, Boston, Massachusetts, USA
| | - June Corrigan
- Massachusetts Veterans Epidemiology Research and Information Center (MAVERIC), Boston, Massachusetts, USA
- VA Boston CSP Center, Boston, Massachusetts, USA
- VA Boston Healthcare System, Boston, Massachusetts, USA
| | - J Michael Gaziano
- Massachusetts Veterans Epidemiology Research and Information Center (MAVERIC), Boston, Massachusetts, USA
- VA Boston CSP Center, Boston, Massachusetts, USA
- Division of Aging, Brigham and Women's Hospital, Boston, Massachusetts, USA
- Harvard Medical School, Boston, Massachusetts, USA
| | - Nhan V Do
- Massachusetts Veterans Epidemiology Research and Information Center (MAVERIC), Boston, Massachusetts, USA
- VA Boston CSP Center, Boston, Massachusetts, USA
- VA Boston Healthcare System, Boston, Massachusetts, USA
- Boston University School of Medicine, Boston, Massachusetts, USA
| | - Mary T Brophy
- Massachusetts Veterans Epidemiology Research and Information Center (MAVERIC), Boston, Massachusetts, USA
- VA Boston CSP Center, Boston, Massachusetts, USA
- VA Boston Healthcare System, Boston, Massachusetts, USA
- Boston University School of Medicine, Boston, Massachusetts, USA
| | - Jane A Driver
- New England Geriatrics Research, Education and Clinical Center, VA Boston Healthcare System, Boston, Massachusetts, USA
- Division of Aging, Brigham and Women's Hospital, Boston, Massachusetts, USA
- Harvard Medical School, Boston, Massachusetts, USA
| | - Nikhil C Munshi
- VA Boston CSP Center, Boston, Massachusetts, USA
- Harvard Medical School, Boston, Massachusetts, USA
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, Massachusetts, USA
| | - Nathanael R Fillmore
- Massachusetts Veterans Epidemiology Research and Information Center (MAVERIC), Boston, Massachusetts, USA
- VA Boston CSP Center, Boston, Massachusetts, USA
- VA Boston Healthcare System, Boston, Massachusetts, USA
- Harvard Medical School, Boston, Massachusetts, USA
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, Massachusetts, USA
| |
Collapse
|
2
|
Weiskopf NG, Dorr DA, Jackson C, Lehmann HP, Thompson CA. Healthcare utilization is a collider: an introduction to collider bias in EHR data reuse. J Am Med Inform Assoc 2023; 30:971-977. [PMID: 36752649 PMCID: PMC10114115 DOI: 10.1093/jamia/ocad013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2022] [Revised: 12/19/2022] [Accepted: 01/27/2023] [Indexed: 02/09/2023] Open
Abstract
OBJECTIVES Collider bias is a common threat to internal validity in clinical research but is rarely mentioned in informatics education or literature. Conditioning on a collider, which is a variable that is the shared causal descendant of an exposure and outcome, may result in spurious associations between the exposure and outcome. Our objective is to introduce readers to collider bias and its corollaries in the retrospective analysis of electronic health record (EHR) data. TARGET AUDIENCE Collider bias is likely to arise in the reuse of EHR data, due to data-generating mechanisms and the nature of healthcare access and utilization in the United States. Therefore, this tutorial is aimed at informaticians and other EHR data consumers without a background in epidemiological methods or causal inference. SCOPE We focus specifically on problems that may arise from conditioning on forms of healthcare utilization, a common collider that is an implicit selection criterion when one reuses EHR data. Directed acyclic graphs (DAGs) are introduced as a tool for identifying potential sources of bias during study design and planning. References for additional resources on causal inference and DAG construction are provided.
Collapse
Affiliation(s)
- Nicole G Weiskopf
- Department of Medical Informatics and Clinical Epidemiology, Oregon Health & Science University, Portland, Oregon, USA
| | - David A Dorr
- Department of Medical Informatics and Clinical Epidemiology, Oregon Health & Science University, Portland, Oregon, USA
| | - Christie Jackson
- Department of Medical Informatics and Clinical Epidemiology, Oregon Health & Science University, Portland, Oregon, USA
| | - Harold P Lehmann
- Division of Health Science Informatics, Department of Medicine, Johns Hopkins University School of Medicine, Baltimore, Maryland, USA
| | - Caroline A Thompson
- Department of Epidemiology, Gillings School of Global Public Health, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA.,Division of Cancer Epidemiology, Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| |
Collapse
|
3
|
Chen L, Gu Y, Ji X, Lou C, Sun Z, Li H, Gao Y, Huang Y. Clinical trial cohort selection based on multi-level rule-based natural language processing system. J Am Med Inform Assoc 2021; 26:1218-1226. [PMID: 31300825 DOI: 10.1093/jamia/ocz109] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2019] [Revised: 04/16/2019] [Accepted: 06/07/2019] [Indexed: 01/08/2023] Open
Abstract
OBJECTIVE Identifying patients who meet selection criteria for clinical trials is typically challenging and time-consuming. In this article, we describe our clinical natural language processing (NLP) system to automatically assess patients' eligibility based on their longitudinal medical records. This work was part of the 2018 National NLP Clinical Challenges (n2c2) Shared-Task and Workshop on Cohort Selection for Clinical Trials. MATERIALS AND METHODS The authors developed an integrated rule-based clinical NLP system which employs a generic rule-based framework plugged in with lexical-, syntactic- and meta-level, task-specific knowledge inputs. In addition, the authors also implemented and evaluated a general clinical NLP (cNLP) system which is built with the Unified Medical Language System and Unstructured Information Management Architecture. RESULTS AND DISCUSSION The systems were evaluated as part of the 2018 n2c2-1 challenge, and authors' rule-based system obtained an F-measure of 0.9028, ranking fourth at the challenge and had less than 1% difference from the best system. While the general cNLP system didn't achieve performance as good as the rule-based system, it did establish its own advantages and potential in extracting clinical concepts. CONCLUSION Our results indicate that a well-designed rule-based clinical NLP system is capable of achieving good performance on cohort selection even with a small training data set. In addition, the investigation of a Unified Medical Language System-based general cNLP system suggests that a hybrid system combining these 2 approaches is promising to surpass the state-of-the-art performance.
Collapse
Affiliation(s)
- Long Chen
- Med Data Quest, Inc, La Jolla, California, USA
| | - Yu Gu
- Med Data Quest, Inc, La Jolla, California, USA
| | - Xin Ji
- Med Data Quest, Inc, La Jolla, California, USA
| | - Chao Lou
- Med Data Quest, Inc, La Jolla, California, USA
| | - Zhiyong Sun
- Med Data Quest, Inc, La Jolla, California, USA
| | - Haodan Li
- Med Data Quest, Inc, La Jolla, California, USA
| | - Yuan Gao
- Med Data Quest, Inc, La Jolla, California, USA
| | - Yang Huang
- Med Data Quest, Inc, La Jolla, California, USA
| |
Collapse
|
4
|
Stubbs A, Filannino M, Soysal E, Henry S, Uzuner Ö. Cohort selection for clinical trials: n2c2 2018 shared task track 1. J Am Med Inform Assoc 2021; 26:1163-1171. [PMID: 31562516 DOI: 10.1093/jamia/ocz163] [Citation(s) in RCA: 31] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2019] [Revised: 08/07/2019] [Accepted: 09/18/2019] [Indexed: 01/02/2023] Open
Abstract
OBJECTIVE Track 1 of the 2018 National NLP Clinical Challenges shared tasks focused on identifying which patients in a corpus of longitudinal medical records meet and do not meet identified selection criteria. MATERIALS AND METHODS To address this challenge, we annotated American English clinical narratives for 288 patients according to whether they met these criteria. We chose criteria from existing clinical trials that represented a variety of natural language processing tasks, including concept extraction, temporal reasoning, and inference. RESULTS A total of 47 teams participated in this shared task, with 224 participants in total. The participants represented 18 countries, and the teams submitted 109 total system outputs. The best-performing system achieved a micro F1 score of 0.91 using a rule-based approach. The top 10 teams used rule-based and hybrid systems to approach the problems. DISCUSSION Clinical narratives are open to interpretation, particularly in cases where the selection criterion may be underspecified. This leaves room for annotators to use domain knowledge and intuition in selecting patients, which may lead to error in system outputs. However, teams who consulted medical professionals while building their systems were more likely to have high recall for patients, which is preferable for patient selection systems. CONCLUSIONS There is not yet a 1-size-fits-all solution for natural language processing systems approaching this task. Future research in this area can look to examining criteria requiring even more complex inferences, temporal reasoning, and domain knowledge.
Collapse
Affiliation(s)
- Amber Stubbs
- Department of Mathematics and Computer Science, Simmons University, Boston, Massachusetts, USA
| | - Michele Filannino
- Information Sciences and Technology, George Mason University, Fairfax, Virginia, USA.,Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA
| | - Ergin Soysal
- School of Biomedical Informatics, University of Texas Health Science Center, Houston, Texas, USA
| | - Samuel Henry
- Information Sciences and Technology, George Mason University, Fairfax, Virginia, USA
| | - Özlem Uzuner
- Information Sciences and Technology, George Mason University, Fairfax, Virginia, USA.,Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA.,Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts, USA
| |
Collapse
|
5
|
Miller HN, Gleason KT, Juraschek SP, Plante TB, Lewis-Land C, Woods B, Appel LJ, Ford DE, Dennison Himmelfarb CR. Electronic medical record-based cohort selection and direct-to-patient, targeted recruitment: early efficacy and lessons learned. J Am Med Inform Assoc 2021; 26:1209-1217. [PMID: 31553434 DOI: 10.1093/jamia/ocz168] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2019] [Revised: 08/15/2019] [Accepted: 09/03/2019] [Indexed: 12/18/2022] Open
Abstract
OBJECTIVE The study sought to characterize institution-wide participation in secure messaging (SM) at a large academic health network, describe our experience with electronic medical record (EMR)-based cohort selection, and discuss the potential roles of SM for research recruitment. MATERIALS AND METHODS Study teams defined eligibility criteria to create a computable phenotype, structured EMR data, to identify and recruit participants. Patients with SM accounts matching this phenotype received recruitment messages. We compared demographic characteristics across SM users and the overall health system. We also tabulated SM activation and use, characteristics of individual studies, and efficacy of the recruitment methods. RESULTS Of the 1 308 820 patients in the health network, 40% had active SM accounts. SM users had a greater proportion of white and non-Hispanic patients than nonactive SM users id. Among the studies included (n = 13), 77% recruited participants with a specific disease or condition. All studies used demographic criteria for their phenotype, while 46% (n = 6) used demographic, disease, and healthcare utilization criteria. The average SM response rate was 2.9%, with higher rates among condition-specific (3.4%) vs general health (1.4%) studies. Those studies with a more inclusive comprehensive phenotype had a higher response rate. DISCUSSION Target population and EMR queries (computable phenotypes) affect recruitment efficacy and should be considered when designing an EMR-based recruitment strategy. CONCLUSIONS SM guided by EMR-based cohort selection is a promising approach to identify and enroll research participants. Efforts to increase the number of active SM users and response rate should be implemented to enhance the effectiveness of this recruitment strategy.
Collapse
Affiliation(s)
- Hailey N Miller
- School of Nursing, Johns Hopkins University, Baltimore, Maryland, USA.,Institute for Clinical and Translational Research, Johns Hopkins University, Baltimore, Maryland, USA
| | - Kelly T Gleason
- School of Nursing, Johns Hopkins University, Baltimore, Maryland, USA.,Institute for Clinical and Translational Research, Johns Hopkins University, Baltimore, Maryland, USA
| | - Stephen P Juraschek
- Department of Medicine, Beth Israel Deaconess Medical Center/Harvard Medical School, Boston, Massachusetts, USA
| | - Timothy B Plante
- Department of Medicine, Larner College of Medicine, University of Vermont, Burlington, Vermont, USA
| | - Cassie Lewis-Land
- Institute for Clinical and Translational Research, Johns Hopkins University, Baltimore, Maryland, USA
| | - Bonnie Woods
- Institute for Clinical and Translational Research, Johns Hopkins University, Baltimore, Maryland, USA
| | - Lawrence J Appel
- Welch Center for Prevention, Epidemiology, and Clinical Research, Johns Hopkins University School of Medicine, Baltimore, Maryland, USA
| | - Daniel E Ford
- Welch Center for Prevention, Epidemiology, and Clinical Research, Johns Hopkins University School of Medicine, Baltimore, Maryland, USA
| | - Cheryl R Dennison Himmelfarb
- School of Nursing, Johns Hopkins University, Baltimore, Maryland, USA.,Institute for Clinical and Translational Research, Johns Hopkins University, Baltimore, Maryland, USA
| |
Collapse
|
6
|
Xiong Y, Shi X, Chen S, Jiang D, Tang B, Wang X, Chen Q, Yan J. Cohort selection for clinical trials using hierarchical neural network. J Am Med Inform Assoc 2021; 26:1203-1208. [PMID: 31305921 DOI: 10.1093/jamia/ocz099] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2019] [Revised: 04/28/2019] [Accepted: 06/13/2019] [Indexed: 12/22/2022] Open
Abstract
OBJECTIVE Cohort selection for clinical trials is a key step for clinical research. We proposed a hierarchical neural network to determine whether a patient satisfied selection criteria or not. MATERIALS AND METHODS We designed a hierarchical neural network (denoted as CNN-Highway-LSTM or LSTM-Highway-LSTM) for the track 1 of the national natural language processing (NLP) clinical challenge (n2c2) on cohort selection for clinical trials in 2018. The neural network is composed of 5 components: (1) sentence representation using convolutional neural network (CNN) or long short-term memory (LSTM) network; (2) a highway network to adjust information flow; (3) a self-attention neural network to reweight sentences; (4) document representation using LSTM, which takes sentence representations in chronological order as input; (5) a fully connected neural network to determine whether each criterion is met or not. We compared the proposed method with its variants, including the methods only using the first component to represent documents directly and the fully connected neural network for classification (denoted as CNN-only or LSTM-only) and the methods without using the highway network (denoted as CNN-LSTM or LSTM-LSTM). The performance of all methods was measured by micro-averaged precision, recall, and F1 score. RESULTS The micro-averaged F1 scores of CNN-only, LSTM-only, CNN-LSTM, LSTM-LSTM, CNN-Highway-LSTM, and LSTM-Highway-LSTM were 85.24%, 84.25%, 87.27%, 88.68%, 88.48%, and 90.21%, respectively. The highest micro-averaged F1 score is higher than our submitted 1 of 88.55%, which is 1 of the top-ranked results in the challenge. The results indicate that the proposed method is effective for cohort selection for clinical trials. DISCUSSION Although the proposed method achieved promising results, some mistakes were caused by word ambiguity, negation, number analysis and incomplete dictionary. Moreover, imbalanced data was another challenge that needs to be tackled in the future. CONCLUSION In this article, we proposed a hierarchical neural network for cohort selection. Experimental results show that this method is good at selecting cohort.
Collapse
Affiliation(s)
- Ying Xiong
- Department of Computer Science, Harbin Institute of Technology Shenzhen Graduate School, Shenzhen, China
| | - Xue Shi
- Department of Computer Science, Harbin Institute of Technology Shenzhen Graduate School, Shenzhen, China
| | - Shuai Chen
- Department of Computer Science, Harbin Institute of Technology Shenzhen Graduate School, Shenzhen, China
| | - Dehuan Jiang
- Department of Computer Science, Harbin Institute of Technology Shenzhen Graduate School, Shenzhen, China
| | - Buzhou Tang
- Department of Computer Science, Harbin Institute of Technology Shenzhen Graduate School, Shenzhen, China
| | - Xiaolong Wang
- Department of Computer Science, Harbin Institute of Technology Shenzhen Graduate School, Shenzhen, China
| | - Qingcai Chen
- Department of Computer Science, Harbin Institute of Technology Shenzhen Graduate School, Shenzhen, China
| | - Jun Yan
- Yidu Cloud (Beijing) Technology Co., Ltd, Beijing, China
| |
Collapse
|
7
|
Spengler H, Lang C, Mahapatra T, Gatz I, Kuhn KA, Prasser F. Enabling Agile Clinical and Translational Data Warehousing: Platform Development and Evaluation. JMIR Med Inform 2020; 8:e15918. [PMID: 32706673 PMCID: PMC7404007 DOI: 10.2196/15918] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2019] [Revised: 02/16/2020] [Accepted: 05/06/2020] [Indexed: 01/16/2023] Open
Abstract
Background Modern data-driven medical research provides new insights into the development and course of diseases and enables novel methods of clinical decision support. Clinical and translational data warehouses, such as Informatics for Integrating Biology and the Bedside (i2b2) and tranSMART, are important infrastructure components that provide users with unified access to the large heterogeneous data sets needed to realize this and support use cases such as cohort selection, hypothesis generation, and ad hoc data analysis. Objective Often, different warehousing platforms are needed to support different use cases and different types of data. Moreover, to achieve an optimal data representation within the target systems, specific domain knowledge is needed when designing data-loading processes. Consequently, informaticians need to work closely with clinicians and researchers in short iterations. This is a challenging task as installing and maintaining warehousing platforms can be complex and time consuming. Furthermore, data loading typically requires significant effort in terms of data preprocessing, cleansing, and restructuring. The platform described in this study aims to address these challenges. Methods We formulated system requirements to achieve agility in terms of platform management and data loading. The derived system architecture includes a cloud infrastructure with unified management interfaces for multiple warehouse platforms and a data-loading pipeline with a declarative configuration paradigm and meta-loading approach. The latter compiles data and configuration files into forms required by existing loading tools, thereby automating a wide range of data restructuring and cleansing tasks. We demonstrated the fulfillment of the requirements and the originality of our approach by an experimental evaluation and a comparison with previous work. Results The platform supports both i2b2 and tranSMART with built-in security. Our experiments showed that the loading pipeline accepts input data that cannot be loaded with existing tools without preprocessing. Moreover, it lowered efforts significantly, reducing the size of configuration files required by factors of up to 22 for tranSMART and 1135 for i2b2. The time required to perform the compilation process was roughly equivalent to the time required for actual data loading. Comparison with other tools showed that our solution was the only tool fulfilling all requirements. Conclusions Our platform significantly reduces the efforts required for managing clinical and translational warehouses and for loading data in various formats and structures, such as complex entity-attribute-value structures often found in laboratory data. Moreover, it facilitates the iterative refinement of data representations in the target platforms, as the required configuration files are very compact. The quantitative measurements presented are consistent with our experiences of significantly reduced efforts for building warehousing platforms in close cooperation with medical researchers. Both the cloud-based hosting infrastructure and the data-loading pipeline are available to the community as open source software with comprehensive documentation.
Collapse
Affiliation(s)
- Helmut Spengler
- Institute of Medical Informatics, Statistics and Epidemiology, University Medical Center rechts der Isar, School of Medicine, Technical University of Munich, Munich, Germany
| | - Claudia Lang
- Institute of Medical Informatics, Statistics and Epidemiology, University Medical Center rechts der Isar, School of Medicine, Technical University of Munich, Munich, Germany
| | - Tanmaya Mahapatra
- Institute of Medical Informatics, Statistics and Epidemiology, University Medical Center rechts der Isar, School of Medicine, Technical University of Munich, Munich, Germany
| | - Ingrid Gatz
- Institute of Medical Informatics, Statistics and Epidemiology, University Medical Center rechts der Isar, School of Medicine, Technical University of Munich, Munich, Germany
| | - Klaus A Kuhn
- Institute of Medical Informatics, Statistics and Epidemiology, University Medical Center rechts der Isar, School of Medicine, Technical University of Munich, Munich, Germany
| | - Fabian Prasser
- Charité - Universitätsmedizin Berlin, Berlin, Germany.,Berlin Institute of Health, Berlin, Germany
| |
Collapse
|
8
|
Guo GN, Jonnagaddala J, Farshid S, Huser V, Reich C, Liaw ST. Comparison of the cohort selection performance of Australian Medicines Terminology to Anatomical Therapeutic Chemical mappings. J Am Med Inform Assoc 2019; 26:1237-1246. [PMID: 31545380 PMCID: PMC7647230 DOI: 10.1093/jamia/ocz143] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2019] [Revised: 06/10/2019] [Accepted: 07/22/2019] [Indexed: 12/16/2022] Open
Abstract
OBJECTIVE Electronic health records are increasingly utilized for observational and clinical research. Identification of cohorts using electronic health records is an important step in this process. Previous studies largely focused on the methods of cohort selection, but there is little evidence on the impact of underlying vocabularies and mappings between vocabularies used for cohort selection. We aim to compare the cohort selection performance using Australian Medicines Terminology to Anatomical Therapeutic Chemical (ATC) mappings from 2 different sources. These mappings were taken from the Observational Medical Outcomes Partnership Common Data Model (OMOP-CDM) and the Pharmaceutical Benefits Scheme (PBS) schedule. MATERIALS AND METHODS We retrieved patients from the electronic Practice Based Research Network data repository using 3 ATC classification groups (A10, N02A, N06A). The retrieved patients were further verified manually and pooled to form a reference standard which was used to assess the accuracy of mappings using precision, recall, and F measure metrics. RESULTS The OMOP-CDM mappings identified 2.6%, 15.2%, and 24.4% more drugs than the PBS mappings in the A10, N02A and N06A groups respectively. Despite this, the PBS mappings generally performed the same in cohort selection as OMOP-CDM mappings except for the N02A Opioids group, where a significantly greater number of patients were retrieved. Both mappings exhibited variable recall, but perfect precision, with all drugs found to be correctly identified. CONCLUSION We found that 1 of the 3 ATC groups had a significant difference and this affected cohort selection performance. Our findings highlighted that underlying terminology mappings can greatly impact cohort selection accuracy. Clinical researchers should carefully evaluate vocabulary mapping sources including methodologies used to develop those mappings.
Collapse
Affiliation(s)
- Guan N Guo
- School of Public Health and Community Medicine, University of New South Wales, Sydney, Australia
- WHO Collaborating Centre for eHealth, University of New South Wales, Sydney, Australia
| | - Jitendra Jonnagaddala
- School of Public Health and Community Medicine, University of New South Wales, Sydney, Australia
- WHO Collaborating Centre for eHealth, University of New South Wales, Sydney, Australia
| | - Sanjay Farshid
- School of Public Health and Community Medicine, University of New South Wales, Sydney, Australia
| | - Vojtech Huser
- Lister Hill National Centre for Biomedical Communications, National Library of Medicine National Institutes of Health, Bethesda, Maryland, USA
| | | | - Siaw-Teng Liaw
- School of Public Health and Community Medicine, University of New South Wales, Sydney, Australia
- WHO Collaborating Centre for eHealth, University of New South Wales, Sydney, Australia
| |
Collapse
|
9
|
Segura-Bedmar I, Raez P. Cohort selection for clinical trials using deep learning models. J Am Med Inform Assoc 2019; 26:1181-1188. [PMID: 31532478 PMCID: PMC6798560 DOI: 10.1093/jamia/ocz139] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2019] [Revised: 07/10/2019] [Accepted: 07/22/2019] [Indexed: 01/09/2023] Open
Abstract
OBJECTIVE The goal of the 2018 n2c2 shared task on cohort selection for clinical trials (track 1) is to identify which patients meet the selection criteria for clinical trials. Cohort selection is a particularly demanding task to which natural language processing and deep learning can make a valuable contribution. Our goal is to evaluate several deep learning architectures to deal with this task. MATERIALS AND METHODS Cohort selection can be formulated as a multilabeling problem whose goal is to determine which criteria are met for each patient record. We explore several deep learning architectures such as a simple convolutional neural network (CNN), a deep CNN, a recurrent neural network (RNN), and CNN-RNN hybrid architecture. Although our architectures are similar to those proposed in existing deep learning systems for text classification, our research also studies the impact of using a fully connected feedforward layer on the performance of these architectures. RESULTS The RNN and hybrid models provide the best results, though without statistical significance. The use of the fully connected feedforward layer improves the results for all the architectures, except for the hybrid architecture. CONCLUSIONS Despite the limited size of the dataset, deep learning methods show promising results in learning useful features for the task of cohort selection. Therefore, they can be used as a previous filter for cohort selection for any clinical trial with a minimum of human intervention, thus reducing the cost and time of clinical trials significantly.
Collapse
Affiliation(s)
- Isabel Segura-Bedmar
- Department of Computer Science and Engineering, Universidad Carlos III de Madrid, Leganés, Spain
| | - Pablo Raez
- Department of Computer Science and Engineering, Universidad Carlos III de Madrid, Leganés, Spain
| |
Collapse
|
10
|
Chen CJ, Warikoo N, Chang YC, Chen JH, Hsu WL. Medical knowledge infused convolutional neural networks for cohort selection in clinical trials. J Am Med Inform Assoc 2019; 26:1227-1236. [PMID: 31390470 DOI: 10.1093/jamia/ocz128] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2019] [Revised: 06/18/2019] [Accepted: 07/04/2019] [Indexed: 12/24/2022] Open
Abstract
OBJECTIVE In this era of digitized health records, there has been a marked interest in using de-identified patient records for conducting various health related surveys. To assist in this research effort, we developed a novel clinical data representation model entitled medical knowledge-infused convolutional neural network (MKCNN), which is used for learning the clinical trial criteria eligibility status of patients to participate in cohort studies. MATERIALS AND METHODS In this study, we propose a clinical text representation infused with medical knowledge (MK). First, we isolate the noise from the relevant data using a medically relevant description extractor; then we utilize log-likelihood ratio based weights from selected sentences to highlight "met" and "not-met" knowledge-infused representations in bichannel setting for each instance. The combined medical knowledge-infused representation (MK) from these modules helps identify significant clinical criteria semantics, which in turn renders effective learning when used with a convolutional neural network architecture. RESULTS MKCNN outperforms other Medical Knowledge (MK) relevant learning architectures by approximately 3%; notably SVM and XGBoost implementations developed in this study. MKCNN scored 86.1% on F1metric, a gain of 6% above the average performance assessed from the submissions for n2c2 task. Although pattern/rule-based methods show a higher average performance for the n2c2 clinical data set, MKCNN significantly improves performance of machine learning implementations for clinical datasets. CONCLUSION MKCNN scored 86.1% on the F1 score metric. In contrast to many of the rule-based systems introduced during the n2c2 challenge workshop, our system presents a model that heavily draws on machine-based learning. In addition, the MK representations add more value to clinical comprehension and interpretation of natural texts.
Collapse
Affiliation(s)
- Chi-Jen Chen
- Graduate Institute of Data Science, College of Management, Taipei Medical University, Taipei, Taiwan
| | - Neha Warikoo
- Institute of Biomedical Informatics, National Yang-Ming University, Taipei, Taiwan.,Bioinformatics Program, Taiwan International Graduate Program, Institute of Information Science, Academia Sinica, Taipei, Taiwan
| | - Yung-Chun Chang
- Graduate Institute of Data Science, College of Management, Taipei Medical University, Taipei, Taiwan.,Clinical Big Data Research Center, Taipei Medical University Hospital, Taipei, Taiwan.,Pervasive AI Research Labs, Ministry of Science and Technology, Taipei, Taiwan
| | - Jin-Hua Chen
- Graduate Institute of Data Science, College of Management, Taipei Medical University, Taipei, Taiwan
| | - Wen-Lian Hsu
- Pervasive AI Research Labs, Ministry of Science and Technology, Taipei, Taiwan.,Institute of Information Science, Academia Sinica, Taipei, Taiwan
| |
Collapse
|
11
|
Gomes MGM, King JG, Nunes A, Colegrave N, Hoffmann AA. The effects of individual nonheritable variation on fitness estimation and coexistence. Ecol Evol 2019; 9:8995-9004. [PMID: 31462998 PMCID: PMC6706197 DOI: 10.1002/ece3.5437] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2019] [Accepted: 06/18/2019] [Indexed: 12/17/2022] Open
Abstract
Demographic theory and data have emphasized that nonheritable variation in individual frailty enables selection within cohorts, affecting the dynamics of a population while being invisible to its evolution. Here, we include the component of individual variation in longevity or viability which is nonheritable in simple bacterial growth models and explore its ecological and evolutionary impacts. First, we find that this variation produces consistent trends in longevity differences between bacterial genotypes when measured across stress gradients. Given that direct measurements of longevity are inevitably biased due to the presence of this variation and ongoing selection, we propose the use of the trend itself for obtaining more exact inferences of genotypic fitness. Second, we show how species or strain coexistence can be enabled by nonheritable variation in longevity or viability. These general conclusions are likely to extend beyond bacterial systems.
Collapse
Affiliation(s)
- M. Gabriela M. Gomes
- Liverpool School of Tropical MedicineLiverpoolUK
- CIBIO‐InBIO, Centro de Investigação em Biodiversidade e Recursos GenéticosCMUP, Centro de Matemática da Universidade do PortoPortoPortugal
| | - Jessica G. King
- School of Biological Sciences, Institute of Evolutionary BiologyUniversity of EdinburghEdinburghUK
| | - Ana Nunes
- Departamento de Física, Faculdade de CiênciasBioISI – Biosystems and Integrative Sciences Institute, Universidade de LisboaLisboaPortugal
| | - Nick Colegrave
- School of Biological Sciences, Institute of Evolutionary BiologyUniversity of EdinburghEdinburghUK
| | - Ary A. Hoffmann
- School of BioSciencesBio21 Institute, University of MelbourneMelbourneVic.Australia
| |
Collapse
|
12
|
Harrer S, Shah P, Antony B, Hu J. Artificial Intelligence for Clinical Trial Design. Trends Pharmacol Sci 2019; 40:577-591. [PMID: 31326235 DOI: 10.1016/j.tips.2019.05.005] [Citation(s) in RCA: 170] [Impact Index Per Article: 34.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2018] [Revised: 05/28/2019] [Accepted: 05/28/2019] [Indexed: 12/23/2022]
Abstract
Clinical trials consume the latter half of the 10 to 15 year, 1.5-2.0 billion USD, development cycle for bringing a single new drug to market. Hence, a failed trial sinks not only the investment into the trial itself but also the preclinical development costs, rendering the loss per failed clinical trial at 800 million to 1.4 billion USD. Suboptimal patient cohort selection and recruiting techniques, paired with the inability to monitor patients effectively during trials, are two of the main causes for high trial failure rates: only one of 10 compounds entering a clinical trial reaches the market. We explain how recent advances in artificial intelligence (AI) can be used to reshape key steps of clinical trial design towards increasing trial success rates.
Collapse
Affiliation(s)
- Stefan Harrer
- IBM Research, IBM Research Australia Lab, 3006 Melbourne, VIC, Australia.
| | - Pratik Shah
- Massachusetts Institute of Technology, Media Lab, 02139 Cambridge, MA, USA
| | - Bhavna Antony
- IBM Research, IBM Research Australia Lab, 3006 Melbourne, VIC, Australia
| | - Jianying Hu
- IBM Research, IBM T.J. Watson Research Center, 10598 Yorktown Heights, NY, USA
| |
Collapse
|