Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Li Q, Deleger L, Lingren T, Zhai H, Kaiser M, Stoutenborough L, Jegga AG, Cohen KB, Solti I. Mining FDA drug labels for medical conditions. BMC Med Inform Decis Mak 2013;13:53. [PMID: 23617267 PMCID: PMC3646673 DOI: 10.1186/1472-6947-13-53] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2012] [Accepted: 04/22/2013] [Indexed: 12/03/2022] Open

For:	Li Q, Deleger L, Lingren T, Zhai H, Kaiser M, Stoutenborough L, Jegga AG, Cohen KB, Solti I. Mining FDA drug labels for medical conditions. BMC Med Inform Decis Mak 2013;13:53. [PMID: 23617267 PMCID: PMC3646673 DOI: 10.1186/1472-6947-13-53] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2012] [Accepted: 04/22/2013] [Indexed: 12/03/2022] Open

Number

Cited by Other Article(s)

Exploring Patterns of Transportation-Related CO2 Emissions Using Machine Learning Methods. SUSTAINABILITY 2022. [DOI: 10.3390/su14084588] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

Liu F, Zheng X, Yu H, Tjia J. Neural Multi-Task Learning for Adverse Drug Reaction Extraction. AMIA ... ANNUAL SYMPOSIUM PROCEEDINGS. AMIA SYMPOSIUM 2021;2020:756-762. [PMID: 33936450 PMCID: PMC8075418] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]

Sutphin C, Lee K, Yepes AJ, Uzuner Ö, McInnes BT. Adverse drug event detection using reason assignments in FDA drug labels. J Biomed Inform 2020;110:103552. [PMID: 32890727 DOI: 10.1016/j.jbi.2020.103552] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2020] [Revised: 08/27/2020] [Accepted: 08/29/2020] [Indexed: 10/23/2022]

Malec SA, Boyce RD. Exploring Novel Computable Knowledge in Structured Drug Product Labels. AMIA JOINT SUMMITS ON TRANSLATIONAL SCIENCE PROCEEDINGS. AMIA JOINT SUMMITS ON TRANSLATIONAL SCIENCE 2020;2020:403-412. [PMID: 32477661 PMCID: PMC7233092] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]

Vos RA, Katayama T, Mishima H, Kawano S, Kawashima S, Kim JD, Moriya Y, Tokimatsu T, Yamaguchi A, Yamamoto Y, Wu H, Amstutz P, Antezana E, Aoki NP, Arakawa K, Bolleman JT, Bolton E, Bonnal RJP, Bono H, Burger K, Chiba H, Cohen KB, Deutsch EW, Fernández-Breis JT, Fu G, Fujisawa T, Fukushima A, García A, Goto N, Groza T, Hercus C, Hoehndorf R, Itaya K, Juty N, Kawashima T, Kim JH, Kinjo AR, Kotera M, Kozaki K, Kumagai S, Kushida T, Lütteke T, Matsubara M, Miyamoto J, Mohsen A, Mori H, Naito Y, Nakazato T, Nguyen-Xuan J, Nishida K, Nishida N, Nishide H, Ogishima S, Ohta T, Okuda S, Paten B, Perret JL, Prathipati P, Prins P, Queralt-Rosinach N, Shinmachi D, Suzuki S, Tabata T, Takatsuki T, Taylor K, Thompson M, Uchiyama I, Vieira B, Wei CH, Wilkinson M, Yamada I, Yamanaka R, Yoshitake K, Yoshizawa AC, Dumontier M, Kosaki K, Takagi T. BioHackathon 2015: Semantics of data for life sciences and reproducible research. F1000Res 2020;9:136. [PMID: 32308977 PMCID: PMC7141167 DOI: 10.12688/f1000research.18236.1] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 02/05/2020] [Indexed: 01/08/2023] Open

Affiliation(s)

Rutger A. Vos Institute of Biology Leiden, Leiden University, Leiden, The Netherlands Naturalis Biodiversity Center, Leiden, The Netherlands
Toshiaki Katayama Database Center for Life Science, Tokyo, Japan
Hiroyuki Mishima Department of Human Genetics, Nagasaki University Graduate School of Biomedical Sciences, Nagasaki, Japan
Shin Kawano Database Center for Life Science, Tokyo, Japan
Shuichi Kawashima Database Center for Life Science, Tokyo, Japan
Jin-Dong Kim Database Center for Life Science, Tokyo, Japan
Yuki Moriya Database Center for Life Science, Tokyo, Japan
Toshiaki Tokimatsu DDBJ Center, National Institute of Genetics, Mishima, Japan
Atsuko Yamaguchi Database Center for Life Science, Tokyo, Japan
Yasunori Yamamoto Database Center for Life Science, Tokyo, Japan
Hongyan Wu Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
Peter Amstutz Curoverse, Somerville, USA
Erick Antezana Department of Biology, Norwegian University of Science and Technology, Trondheim, Norway
Nobuyuki P. Aoki Faculty of Science and Engineering, SOKA University, Tokyo, Japan
Kazuharu Arakawa Institute for Advanced Biosciences, Keio University, Tokyo, Japan
Jerven T. Bolleman SIB Swiss Institute of Bioinformatics, Centre Medical Universitaire, Lausanne, Switzerland
Evan Bolton National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, USA
Raoul J. P. Bonnal Istituto Nazionale Genetica Molecolare, Romeo ed Enrica Invernizzi, Milan, Italy
Hidemasa Bono Database Center for Life Science, Tokyo, Japan
Kees Burger Dutch Techcentre for Life Sciences, Utrecht, The Netherlands
Hirokazu Chiba National Institute for Basic Biology, National Institutes of Natural Sciences, Okazaki, Japan
Kevin B. Cohen Computational Bioscience Program, University of Colorado School of Medicine, Denver, USA Université Paris-Saclay, LIMSI, CNRS, Paris, France
Eric W. Deutsch Institute for Systems Biology, Seattle, USA
Jesualdo T. Fernández-Breis Universidad de Murcia, IMIB-Arrixaca, Murcia, Spain
Gang Fu National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, USA
Takatomo Fujisawa National Institute of Genetics, Mishima, Japan
Atsushi Fukushima RIKEN Center for Sustainable Resource Science, Yokohama, Japan
Alexander García Polytechnic University of Madrid, Madrid, Spain
Naohisa Goto Research Institute for Microbial Diseases, Osaka University, Osaka, Japan
Tudor Groza St Vincent's Clinical School, Faculty of Medicine, University of New South Wales, Darlinghurst, Australia Kinghorn Centre for Clinical Genomics, Garvan Institute of Medical Research, Darlinghurst, Australia
Colin Hercus Novocraft Technologies Sdn. Bhd., Selangor, Malaysia
Robert Hoehndorf Computational Bioscience Research Center, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
Kotone Itaya Institute for Advanced Biosciences, Keio University, Tokyo, Japan
Nick Juty European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, UK
Takeshi Kawashima National Institute of Genetics, Mishima, Japan
Jee-Hyub Kim European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, UK
Akira R. Kinjo Institute for Protein Research, Osaka University, Osaka, Japan
Masaaki Kotera School of Life Science and Technology, Tokyo Institute of Technology, Tokyo, Japan
Kouji Kozaki The Institute of Scientific and Industrial Research, Osaka University, Osaka, Japan
Sadahiro Kumagai Hitachi Ltd., Tokyo, Japan
Tatsuya Kushida National Bioscience Database Center, Japan Science and Technology Agency, Tokyo, Japan
Thomas Lütteke Institute of Veterinary Physiology and Biochemistry, Justus-Liebig University Giessen, Giessen, Germany Gesellschaft für innovative Personalwirtschaftssysteme mbH (GIP GmbH), Offenbach, Germany
Masaaki Matsubara The Noguchi Institute, Tokyo, Japan
Joe Miyamoto National Cancer Center Japan, Tokyo, Japan
Attayeb Mohsen National Institutes of Biomedical Innovation, Health and Nutrition, Osaka, Japan
Hiroshi Mori Center for Information Biology, National Institute of Genetics, Mishima, Japan
Yuki Naito Database Center for Life Science, Tokyo, Japan
Takeru Nakazato Database Center for Life Science, Tokyo, Japan
Jeremy Nguyen-Xuan Lawrence Berkeley National Laboratory, Berkeley, USA
Kozo Nishida RIKEN Quantitative Biology Center, Osaka, Japan
Naoki Nishida Department of Systems Science, Osaka University, Osaka, Japan
Hiroyo Nishide National Institute for Basic Biology, National Institutes of Natural Sciences, Okazaki, Japan
Soichi Ogishima Tohoku Medical Megabank Organization, Tohoku University, Sendai, Japan
Tazro Ohta Database Center for Life Science, Tokyo, Japan
Shujiro Okuda Niigata University Graduate School of Medical and Dental Sciences, Niigata, Japan
Benedict Paten UC Santa Cruz Genomics Institute, University of California, Santa Cruz, USA
Jean-Luc Perret INVENesis, Neuchâtel, Switzerland
Philip Prathipati National Institutes of Biomedical Innovation, Health and Nutrition, Osaka, Japan
Pjotr Prins University Medical Center Utrecht, Utrecht, The Netherlands University of Tennessee Health Science Center, Memphis, USA
Núria Queralt-Rosinach Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts, USA
Daisuke Shinmachi Faculty of Science and Engineering, SOKA University, Tokyo, Japan
Shinya Suzuki School of Life Science and Technology, Tokyo Institute of Technology, Tokyo, Japan
Tsuyosi Tabata Graduate School of Pharmaceutical Sciences, Kyoto University, Kyoto, Japan
Terue Takatsuki RIKEN BioResource Center, Ibaraki, Japan
Kieron Taylor European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, UK
Mark Thompson Leiden University Medical Center, Leiden, The Netherlands
Ikuo Uchiyama National Institute for Basic Biology, National Institutes of Natural Sciences, Okazaki, Japan
Bruno Vieira WurmLab, School of Biological & Chemical Sciences, Queen Mary University of London, London, UK
Chih-Hsuan Wei National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, USA
Mark Wilkinson Escuela Técnica Superior de Ingeniería Agronómica, Alimentaria y de Biosistemas, Universidad Politécnica de Madrid, Madrid, Spain
Issaku Yamada The Noguchi Institute, Tokyo, Japan
Ryota Yamanaka Oracle Corporation, Tokyo, Japan
Kazutoshi Yoshitake Graduate School of Agricultural and Life Sciences, The University of Tokyo, Tokyo, Japan
Akiyasu C. Yoshizawa Graduate School of Pharmaceutical Sciences, Kyoto University, Kyoto, Japan
Michel Dumontier Institute of Data Science, Maastricht University, Maastricht, The Netherlands
Kenjiro Kosaki Center for Medical Genetics, Keio University School of Medicine, Tokyo, Japan
Toshihisa Takagi National Bioscience Database Center, Japan Science and Technology Agency, Tokyo, Japan Department of Biological Sciences, Graduate School of Science, The University of Tokyo, Tokyo, Japan

Collapse

Santiso S, Perez A, Casillas A. Exploring Joint AB-LSTM With Embedded Lemmas for Adverse Drug Reaction Discovery. IEEE J Biomed Health Inform 2019;23:2148-2155. [DOI: 10.1109/jbhi.2018.2879744] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]

Jagannatha A, Liu F, Liu W, Yu H. Overview of the First Natural Language Processing Challenge for Extracting Medication, Indication, and Adverse Drug Events from Electronic Health Record Notes (MADE 1.0). Drug Saf 2019;42:99-111. [PMID: 30649735 PMCID: PMC6860017 DOI: 10.1007/s40264-018-0762-z] [Citation(s) in RCA: 71] [Impact Index Per Article: 14.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023]

Abstract

INTRODUCTION

This work describes the Medication and Adverse Drug Events from Electronic Health Records (MADE 1.0) corpus and provides an overview of the MADE 1.0 2018 challenge for extracting medication, indication, and adverse drug events (ADEs) from electronic health record (EHR) notes.

OBJECTIVE

The goal of MADE is to provide a set of common evaluation tasks to assess the state of the art for natural language processing (NLP) systems applied to EHRs supporting drug safety surveillance and pharmacovigilance. We also provide benchmarks on the MADE dataset using the system submissions received in the MADE 2018 challenge.

METHODS

The MADE 1.0 challenge has released an expert-annotated cohort of medication and ADE information comprising 1089 fully de-identified longitudinal EHR notes from 21 randomly selected patients with cancer at the University of Massachusetts Memorial Hospital. Using this cohort as a benchmark, the MADE 1.0 challenge designed three shared NLP tasks. The named entity recognition (NER) task identifies medications and their attributes (dosage, route, duration, and frequency), indications, ADEs, and severity. The relation identification (RI) task identifies relations between the named entities: medication-indication, medication-ADE, and attribute relations. The third shared task (NER-RI) evaluates NLP models that perform the NER and RI tasks jointly. In total, 11 teams from four countries participated in at least one of the three shared tasks, and 41 system submissions were received in total.

RESULTS

The best systems F1 scores for NER, RI, and NER-RI were 0.82, 0.86, and 0.61, respectively. Ensemble classifiers using the team submissions improved the performance further, with an F1 score of 0.85, 0.87, and 0.66 for the three tasks, respectively.

CONCLUSION

MADE results show that recent progress in NLP has led to remarkable improvements in NER and RI tasks for the clinical domain. However, some room for improvement remains, particularly in the NER-RI task.

Collapse

Lamy JB, Berthelot H, Favre M, Ugon A, Duclos C, Venot A. Using visual analytics for presenting comparative information on new drugs. J Biomed Inform 2017;71:58-69. [DOI: 10.1016/j.jbi.2017.04.019] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2017] [Revised: 04/26/2017] [Accepted: 04/27/2017] [Indexed: 10/19/2022]

Moreno I, Boldrini E, Moreda P, Romá-Ferri MT. DrugSemantics: A corpus for Named Entity Recognition in Spanish Summaries of Product Characteristics. J Biomed Inform 2017. [PMID: 28624642 DOI: 10.1016/j.jbi.2017.06.013] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]

Sharp ME. Toward a comprehensive drug ontology: extraction of drug-indication relations from diverse information sources. J Biomed Semantics 2017;8:2. [PMID: 28069052 PMCID: PMC5223332 DOI: 10.1186/s13326-016-0110-0] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2015] [Accepted: 12/16/2016] [Indexed: 01/08/2023] Open

Martínez P, Martínez JL, Segura-Bedmar I, Moreno-Schneider J, Luna A, Revert R. Turning user generated health-related content into actionable knowledge through text analytics services. COMPUT IND 2016. [DOI: 10.1016/j.compind.2015.10.006] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]

Adverse drug reactions in Colombian patients, 2007-2013: Analysis of population databases. BIOMEDICA 2016;36:59-66. [PMID: 27622439 DOI: 10.7705/biomedica.v36i1.2781] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/30/2015] [Revised: 06/23/2015] [Indexed: 11/21/2022]

Rodriguez LM, Fushman DD. Automatic Classification of Structured Product Labels for Pregnancy Risk Drug Categories, a Machine Learning Approach. AMIA ... ANNUAL SYMPOSIUM PROCEEDINGS. AMIA SYMPOSIUM 2015;2015:1093-1102. [PMID: 26958248 PMCID: PMC4765680] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]

On the creation of a clinical gold standard corpus in Spanish: Mining adverse drug reactions. J Biomed Inform 2015;56:318-32. [DOI: 10.1016/j.jbi.2015.06.016] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2014] [Revised: 05/21/2015] [Accepted: 06/23/2015] [Indexed: 11/20/2022]

Segura-Bedmar I, Martínez P, Revert R, Moreno-Schneider J. Exploring Spanish health social media for detecting drug effects. BMC Med Inform Decis Mak 2015;15 Suppl 2:S6. [PMID: 26100267 PMCID: PMC4474583 DOI: 10.1186/1472-6947-15-s2-s6] [Citation(s) in RCA: 37] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

Adverse Drug reactions (ADR) cause a high number of deaths among hospitalized patients in developed countries. Major drug agencies have devoted a great interest in the early detection of ADRs due to their high incidence and increasing health care costs. Reporting systems are available in order for both healthcare professionals and patients to alert about possible ADRs. However, several studies have shown that these adverse events are underestimated. Our hypothesis is that health social networks could be a significant information source for the early detection of ADRs as well as of new drug indications.

METHODS

In this work we present a system for detecting drug effects (which include both adverse drug reactions as well as drug indications) from user posts extracted from a Spanish health forum. Texts were processed using MeaningCloud, a multilingual text analysis engine, to identify drugs and effects. In addition, we developed the first Spanish database storing drugs as well as their effects automatically built from drug package inserts gathered from online websites. We then applied a distant-supervision method using the database on a collection of 84,000 messages in order to extract the relations between drugs and their effects. To classify the relation instances, we used a kernel method based only on shallow linguistic information of the sentences.

RESULTS

Regarding Relation Extraction of drugs and their effects, the distant supervision approach achieved a recall of 0.59 and a precision of 0.48.

CONCLUSIONS

The task of extracting relations between drugs and their effects from social media is a complex challenge due to the characteristics of social media texts. These texts, typically posts or tweets, usually contain many grammatical errors and spelling mistakes. Moreover, patients use lay terminology to refer to diseases, symptoms and indications that is not usually included in lexical resources in languages other than English.

Collapse

Li Q, Spooner SA, Kaiser M, Lingren N, Robbins J, Lingren T, Tang H, Solti I, Ni Y. An end-to-end hybrid algorithm for automated medication discrepancy detection. BMC Med Inform Decis Mak 2015;15:37. [PMID: 25943550 PMCID: PMC4427951 DOI: 10.1186/s12911-015-0160-8] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2014] [Accepted: 04/27/2015] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

In this study we implemented and developed state-of-the-art machine learning (ML) and natural language processing (NLP) technologies and built a computerized algorithm for medication reconciliation. Our specific aims are: (1) to develop a computerized algorithm for medication discrepancy detection between patients' discharge prescriptions (structured data) and medications documented in free-text clinical notes (unstructured data); and (2) to assess the performance of the algorithm on real-world medication reconciliation data.

METHODS

We collected clinical notes and discharge prescription lists for all 271 patients enrolled in the Complex Care Medical Home Program at Cincinnati Children's Hospital Medical Center between 1/1/2010 and 12/31/2013. A double-annotated, gold-standard set of medication reconciliation data was created for this collection. We then developed a hybrid algorithm consisting of three processes: (1) a ML algorithm to identify medication entities from clinical notes, (2) a rule-based method to link medication names with their attributes, and (3) a NLP-based, hybrid approach to match medications with structured prescriptions in order to detect medication discrepancies. The performance was validated on the gold-standard medication reconciliation data, where precision (P), recall (R), F-value (F) and workload were assessed.

RESULTS

The hybrid algorithm achieved 95.0%/91.6%/93.3% of P/R/F on medication entity detection and 98.7%/99.4%/99.1% of P/R/F on attribute linkage. The medication matching achieved 92.4%/90.7%/91.5% (P/R/F) on identifying matched medications in the gold-standard and 88.6%/82.5%/85.5% (P/R/F) on discrepant medications. By combining all processes, the algorithm achieved 92.4%/90.7%/91.5% (P/R/F) and 71.5%/65.2%/68.2% (P/R/F) on identifying the matched and the discrepant medications, respectively. The error analysis on algorithm outputs identified challenges to be addressed in order to improve medication discrepancy detection.

CONCLUSION

By leveraging ML and NLP technologies, an end-to-end, computerized algorithm achieves promising outcome in reconciling medications between clinical notes and discharge prescriptions.

Collapse

Ni Y, Wright J, Perentesis J, Lingren T, Deleger L, Kaiser M, Kohane I, Solti I. Increasing the efficiency of trial-patient matching: automated clinical trial eligibility pre-screening for pediatric oncology patients. BMC Med Inform Decis Mak 2015;15:28. [PMID: 25881112 PMCID: PMC4407835 DOI: 10.1186/s12911-015-0149-3] [Citation(s) in RCA: 66] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2014] [Accepted: 03/24/2015] [Indexed: 11/22/2022] Open

Abstract

Background

Manual eligibility screening (ES) for a clinical trial typically requires a labor-intensive review of patient records that utilizes many resources. Leveraging state-of-the-art natural language processing (NLP) and information extraction (IE) technologies, we sought to improve the efficiency of physician decision-making in clinical trial enrollment. In order to markedly reduce the pool of potential candidates for staff screening, we developed an automated ES algorithm to identify patients who meet core eligibility characteristics of an oncology clinical trial.

Methods

We collected narrative eligibility criteria from ClinicalTrials.gov for 55 clinical trials actively enrolling oncology patients in our institution between 12/01/2009 and 10/31/2011. In parallel, our ES algorithm extracted clinical and demographic information from the Electronic Health Record (EHR) data fields to represent profiles of all 215 oncology patients admitted to cancer treatment during the same period. The automated ES algorithm then matched the trial criteria with the patient profiles to identify potential trial-patient matches. Matching performance was validated on a reference set of 169 historical trial-patient enrollment decisions, and workload, precision, recall, negative predictive value (NPV) and specificity were calculated.

Results

Without automation, an oncologist would need to review 163 patients per trial on average to replicate the historical patient enrollment for each trial. This workload is reduced by 85% to 24 patients when using automated ES (precision/recall/NPV/specificity: 12.6%/100.0%/100.0%/89.9%). Without automation, an oncologist would need to review 42 trials per patient on average to replicate the patient-trial matches that occur in the retrospective data set. With automated ES this workload is reduced by 90% to four trials (precision/recall/NPV/specificity: 35.7%/100.0%/100.0%/95.5%).

Conclusion

By leveraging NLP and IE technologies, automated ES could dramatically increase the trial screening efficiency of oncologists and enable participation of small practices, which are often left out from trial enrollment. The algorithm has the potential to significantly reduce the effort to execute clinical research at a point in time when new initiatives of the cancer care community intend to greatly expand both the access to trials and the number of available trials.

Electronic supplementary material

The online version of this article (doi:10.1186/s12911-015-0149-3) contains supplementary material, which is available to authorized users.

Collapse

Khare R, Wei CH, Lu Z. Automatic extraction of drug indications from FDA drug labels. AMIA ... ANNUAL SYMPOSIUM PROCEEDINGS. AMIA SYMPOSIUM 2014;2014:787-794. [PMID: 25954385 PMCID: PMC4419914] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Subscribe] [Scholar Register] [Indexed: 06/04/2023]

Lehmann HP. From Text Tagging to Decision Support. Med Decis Making 2014;34:414-6. [DOI: 10.1177/0272989x14529847] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]