1
|
Mahaux O, Powell G, Haguinet F, Sobczak P, Saini N, Barry A, Mustafa A, Bate A. Identifying Safety Subgroups at Risk: Assessing the Agreement Between Statistical Alerting and Patient Subgroup Risk. Drug Saf 2023; 46:601-614. [PMID: 37131012 PMCID: PMC10153776 DOI: 10.1007/s40264-023-01306-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/03/2023] [Indexed: 05/04/2023]
Abstract
INTRODUCTION Identifying individual characteristics or underlying conditions linked to adverse drug reactions (ADRs) can help optimise the benefit-risk ratio for individuals. A systematic evaluation of statistical methods to identify subgroups potentially at risk using spontaneous ADR report datasets is lacking. OBJECTIVES In this study, we aimed to assess concordance between subgroup disproportionality scores and European Medicines Agency Pharmacovigilance Risk Assessment Committee (PRAC) discussions of potential subgroup risk. METHODS The subgroup disproportionality method described by Sandberg et al., and variants, were applied to statistically screen for subgroups at potential increased risk of ADRs, using data from the US FDA Adverse Event Reporting System (FAERS) cumulative from 2004 to quarter 2 2021. The reference set used to assess concordance was manually extracted from PRAC minutes from 2015 to 2019. Mentions of subgroups presenting potential differentiated risk and overlapping with the Sandberg method were included. RESULTS Twenty-seven PRAC subgroup examples representing 1719 subgroup drug-event combinations (DECs) in FAERS were included. Using the Sandberg methodology, 2 of the 27 could be detected (one for age and one for sex). No subgroup examples for pregnancy and underlying condition were detected. With a methodological variant, 14 of 27 examples could be detected. CONCLUSIONS We observed low concordance between subgroup disproportionality scores and PRAC discussions of potential subgroup risk. Subgroup analyses performed better for age and sex, while for covariates not well-captured in FAERS, such as underlying condition and pregnancy, additional data sources should be considered.
Collapse
Affiliation(s)
- Olivia Mahaux
- Safety Innovation and Analytics, GSK, Wavre, Belgium.
| | - Greg Powell
- Safety Innovation and Analytics, GSK, Durham, NC, USA
| | | | | | - Namrata Saini
- Safety Evaluation and Risk Management, GSK, Bangalore, India
| | - Allen Barry
- University of North Carolina, Chapel Hill, NC, USA
| | | | - Andrew Bate
- Safety Innovation and Analytics, GSK, London, UK
- London School of Hygiene and Tropical Medicine, University of London, London, UK
| |
Collapse
|
2
|
Kontsioti E, Maskell S, Dutta B, Pirmohamed M. A reference set of clinically relevant adverse drug-drug interactions. Sci Data 2022; 9:72. [PMID: 35246559 PMCID: PMC8897500 DOI: 10.1038/s41597-022-01159-y] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2021] [Accepted: 01/13/2022] [Indexed: 12/03/2022] Open
Abstract
The accurate and timely detection of adverse drug-drug interactions (DDIs) during the postmarketing phase is an important yet complex task with potentially major clinical implications. The development of data mining methodologies that scan healthcare databases for drug safety signals requires appropriate reference sets for performance evaluation. Methodologies for establishing DDI reference sets are limited in the literature, while there is no publicly available resource simultaneously focusing on clinical relevance of DDIs and individual behaviour of interacting drugs. By automatically extracting and aggregating information from multiple clinical resources, we provide a scalable approach for generating a reference set for DDIs that could support research in postmarketing safety surveillance. CRESCENDDI contains 10,286 positive and 4,544 negative controls, covering 454 drugs and 179 adverse events mapped to RxNorm and MedDRA concepts, respectively. It also includes single drug information for the included drugs (i.e., adverse drug reactions, indications, and negative drug-event associations). We demonstrate usability of the resource by scanning a spontaneous reporting system database for signals of DDIs using traditional signal detection algorithms. Measurement(s) | Adverse Event | Technology Type(s) | digital curation | Sample Characteristic - Organism | Homo sapiens |
Machine-accessible metadata file describing the reported data: 10.6084/m9.figshare.16681933
Collapse
Affiliation(s)
- Elpida Kontsioti
- Department of Electrical Engineering and Electronics, University of Liverpool, Liverpool, UK.
| | - Simon Maskell
- Department of Electrical Engineering and Electronics, University of Liverpool, Liverpool, UK
| | - Bhaskar Dutta
- Patient Safety Center of Excellence, Chief Medical Office Organization, AstraZeneca Pharmaceuticals, Gaithersburg, MD, USA
| | - Munir Pirmohamed
- The Wolfson Centre for Personalized Medicine, MRC Centre for Drug Safety Science, Department of Pharmacology and Therapeutics, Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool, United Kingdom
| |
Collapse
|
3
|
Ding X, Mower J, Subramanian D, Cohen T. Augmenting aer2vec: Enriching distributed representations of adverse event report data with orthographic and lexical information. J Biomed Inform 2021; 119:103833. [PMID: 34111555 DOI: 10.1016/j.jbi.2021.103833] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2021] [Revised: 05/10/2021] [Accepted: 06/02/2021] [Indexed: 11/29/2022]
Abstract
Adverse Drug Events (ADEs) are prevalent, costly, and sometimes preventable. Post-marketing drug surveillance aims to monitor ADEs that occur after a drug is released to market. Reports of such ADEs are aggregated by reporting systems, such as the Food and Drug Administration (FDA) Adverse Event Reporting System (FAERS). In this paper, we consider the topic of how best to represent data derived from reports in FAERS for the purpose of detecting post-marketing surveillance signals, in order to inform regulatory decision making. In our previous work, we developed aer2vec, a method for deriving distributed representations (concept embeddings) of drugs and side effects from ADE reports, establishing the utility of distributional information for pharmacovigilance signal detection. In this paper, we advance this line of research further by evaluating the utility of encoding orthographic and lexical information. We do so by adapting two Natural Language Processing methods, subword embedding and vector retrofitting, which were developed to encode such information into word embeddings. Models were compared for their ability to distinguish between positive and negative examples in a set of manually curated drug/ADE relationships, with both aer2vec enhancements offering advantages in performances over baseline models, and best performance obtained when retrofitting and subword embeddings were applied in concert. In addition, this work demonstrates that models leveraging distributed representations do not require extensive manual preprocessing to perform well on this pharmacovigilance signal detection task, and may even benefit from information that would otherwise be lost during the normalization and standardization process.
Collapse
Affiliation(s)
- Xiruo Ding
- Department of Biomedical Informatics & Medical Education, University of Washington, Seattle, WA, USA.
| | - Justin Mower
- Department of Computer Science, Rice University, Houston, TX, USA.
| | | | - Trevor Cohen
- Department of Biomedical Informatics & Medical Education, University of Washington, Seattle, WA, USA.
| |
Collapse
|
4
|
Caster O, Aoki Y, Gattepaille LM, Grundmark B. Disproportionality Analysis for Pharmacovigilance Signal Detection in Small Databases or Subsets: Recommendations for Limiting False-Positive Associations. Drug Saf 2021; 43:479-487. [PMID: 32008183 PMCID: PMC7165139 DOI: 10.1007/s40264-020-00911-w] [Citation(s) in RCA: 63] [Impact Index Per Article: 21.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
Introduction Uncovering safety signals through the collection and assessment of individual case reports remains a core pharmacovigilance activity. Despite the widespread use of disproportionality analysis in signal detection, recommendations are lacking on the minimum size of databases or subsets of databases required to yield robust results. Objective This study aims to investigate the relationship between database size and robustness of disproportionality analysis, with regards to limiting spurious associations. Methods Three types of subsets were created from the global database VigiBase: random subsets (500 replicates each of 11 fixed subset sizes between 250 and 100,000 reports), country-specific subsets (all 131 countries available in the original VigiBase extract) and subsets based on the Anatomical Therapeutic Chemical classification. For each subset, a spuriousness rate was computed as the ratio between the number of drug–event combinations highlighted by disproportionality analysis in a permuted version of the subset and the corresponding number in the original subset. In the permuted data, all true reporting associations between drugs and adverse events were broken. Subsets with fewer than five original associations were excluded. Additionally, the set of disproportionately over-reported drug–event combinations in three specific countries at three different time points were clinically assessed for labelledness. These time points corresponded to database sizes of less than 10,000, 5000 and 1000 reports, respectively. All disproportionality analysis was based on the Information Component (IC), implemented as IC025 > 0. Results Spuriousness rates were below 0.15 for all 110 included countries regardless of subset size, with only seven countries (6%) exceeding the empirical threshold of 0.10 observed for large subsets. All 21 excluded countries had < 500 reports. For random subsets containing 3000–5000 or more reports, the higher end of observed spuriousness rates was close to 0.10. In the clinical assessment, the proportion of labelled or otherwise known drug–event combinations was very high (87–100%) across all countries and time points studied. Conclusions To mitigate the risk of highlighting spurious associations with disproportionality analysis, a minimum size of 500 reports is recommended for national databases. For databases or subsets that are not country-specific, our recommendation is 5000 reports. This study does not consider sensitivity, which is expected to be poor in smaller databases.
Collapse
Affiliation(s)
- Ola Caster
- Uppsala Monitoring Centre, Box 1051, 751 40, Uppsala, Sweden
| | - Yasunori Aoki
- Uppsala Monitoring Centre, Box 1051, 751 40, Uppsala, Sweden.,National Institute of Informatics, Tokyo, Japan
| | | | | |
Collapse
|
5
|
Dijkstra L, Garling M, Foraita R, Pigeot I. Adverse drug reaction or innocent bystander? A systematic comparison of statistical discovery methods for spontaneous reporting systems. Pharmacoepidemiol Drug Saf 2020; 29:396-403. [DOI: 10.1002/pds.4970] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2018] [Revised: 01/23/2020] [Accepted: 01/28/2020] [Indexed: 01/01/2023]
Affiliation(s)
- Louis Dijkstra
- Leibniz Institute for Prevention Research & Epidemiology, BIPS, Achterstraße 30 28359 Bremen Germany
| | - Marco Garling
- Scientific Institute of TK for Benefit & Efficiency in Health Care, WINEG Bramfelder Straße 140, 22305 Hamburg Germany
| | - Ronja Foraita
- Leibniz Institute for Prevention Research & Epidemiology, BIPS, Achterstraße 30 28359 Bremen Germany
| | - Iris Pigeot
- Leibniz Institute for Prevention Research & Epidemiology, BIPS, Achterstraße 30 28359 Bremen Germany
| |
Collapse
|
6
|
Schuemie MJ, Cepeda MS, Suchard MA, Yang J, Tian Y, Schuler A, Ryan PB, Madigan D, Hripcsak G. How Confident Are We about Observational Findings in Healthcare: A Benchmark Study. HARVARD DATA SCIENCE REVIEW 2020; 2. [PMID: 33367288 DOI: 10.1162/99608f92.147cc28e] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
Healthcare professionals increasingly rely on observational healthcare data, such as administrative claims and electronic health records, to estimate the causal effects of interventions. However, limited prior studies raise concerns about the real-world performance of the statistical and epidemiological methods that are used. We present the "OHDSI Methods Benchmark" that aims to evaluate the performance of effect estimation methods on real data. The benchmark comprises a gold standard, a set of metrics, and a set of open source software tools. The gold standard is a collection of real negative controls (drug-outcome pairs where no causal effect appears to exist) and synthetic positive controls (drug-outcome pairs that augment negative controls with simulated causal effects). We apply the benchmark using four large healthcare databases to evaluate methods commonly used in practice: the new-user cohort, self-controlled cohort, case-control, case-crossover, and self-controlled case series designs. The results confirm the concerns about these methods, showing that for most methods the operating characteristics deviate considerably from nominal levels. For example, in most contexts, only half of the 95% confidence intervals we calculated contain the corresponding true effect size. We previously developed an "empirical calibration" procedure to restore these characteristics and we also evaluate this procedure. While no one method dominates, self-controlled methods such as the empirically calibrated self-controlled case series perform well across a wide range of scenarios.
Collapse
Affiliation(s)
- Martijn J Schuemie
- Observational Health Data Sciences and Informatics.,Epidemiology Analytics, Janssen Research and Development.,Department of Biostatistics, University of California, Los Angeles
| | - M Soledad Cepeda
- Observational Health Data Sciences and Informatics.,Epidemiology Analytics, Janssen Research and Development
| | - Marc A Suchard
- Observational Health Data Sciences and Informatics.,Department of Biostatistics, University of California, Los Angeles.,Department of Biomathematics, University of California, Los Angeles.,Department of Human Genetics, University of California, Los Angeles
| | - Jianxiao Yang
- Observational Health Data Sciences and Informatics.,Department of Biomathematics, University of California, Los Angeles
| | - Yuxi Tian
- Observational Health Data Sciences and Informatics.,Department of Biomathematics, University of California, Los Angeles
| | - Alejandro Schuler
- Observational Health Data Sciences and Informatics.,Center for Biomedical Informatics Research, Stanford University
| | - Patrick B Ryan
- Observational Health Data Sciences and Informatics.,Epidemiology Analytics, Janssen Research and Development.,Department of Biomedical Informatics, Columbia University
| | - David Madigan
- Observational Health Data Sciences and Informatics.,Department of Statistics, Columbia University
| | - George Hripcsak
- Observational Health Data Sciences and Informatics.,Department of Biomedical Informatics, Columbia University.,Medical Informatics Services, New York-Presbyterian Hospital
| |
Collapse
|
7
|
Banda JM, Sarraju A, Abbasi F, Parizo J, Pariani M, Ison H, Briskin E, Wand H, Dubois S, Jung K, Myers SA, Rader DJ, Leader JB, Murray MF, Myers KD, Wilemon K, Shah NH, Knowles JW. Finding missed cases of familial hypercholesterolemia in health systems using machine learning. NPJ Digit Med 2019; 2:23. [PMID: 31304370 PMCID: PMC6550268 DOI: 10.1038/s41746-019-0101-5] [Citation(s) in RCA: 49] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2018] [Accepted: 03/13/2019] [Indexed: 01/26/2023] Open
Abstract
Familial hypercholesterolemia (FH) is an underdiagnosed dominant genetic condition affecting approximately 0.4% of the population and has up to a 20-fold increased risk of coronary artery disease if untreated. Simple screening strategies have false positive rates greater than 95%. As part of the FH Foundation's FIND FH initiative, we developed a classifier to identify potential FH patients using electronic health record (EHR) data at Stanford Health Care. We trained a random forest classifier using data from known patients (n = 197) and matched non-cases (n = 6590). Our classifier obtained a positive predictive value (PPV) of 0.88 and sensitivity of 0.75 on a held-out test-set. We evaluated the accuracy of the classifier's predictions by chart review of 100 patients at risk of FH not included in the original dataset. The classifier correctly flagged 84% of patients at the highest probability threshold, with decreasing performance as the threshold lowers. In external validation on 466 FH patients (236 with genetically proven FH) and 5000 matched non-cases from the Geisinger Healthcare System our FH classifier achieved a PPV of 0.85. Our EHR-derived FH classifier is effective in finding candidate patients for further FH screening. Such machine learning guided strategies can lead to effective identification of the highest risk patients for enhanced management strategies.
Collapse
Affiliation(s)
- Juan M. Banda
- Center for Biomedical Informatics Research, Stanford University, Stanford, CA USA
- Department of Computer Science, Georgia State University, Atlanta, GA USA
| | - Ashish Sarraju
- Cardiovascular Medicine and Cardiovascular Institute, Stanford University, Stanford, CA USA
| | - Fahim Abbasi
- Cardiovascular Medicine and Cardiovascular Institute, Stanford University, Stanford, CA USA
| | - Justin Parizo
- Cardiovascular Medicine and Cardiovascular Institute, Stanford University, Stanford, CA USA
| | - Mitchel Pariani
- Cardiovascular Medicine and Cardiovascular Institute, Stanford University, Stanford, CA USA
| | - Hannah Ison
- Cardiovascular Medicine and Cardiovascular Institute, Stanford University, Stanford, CA USA
| | - Elinor Briskin
- Cardiovascular Medicine and Cardiovascular Institute, Stanford University, Stanford, CA USA
| | - Hannah Wand
- Cardiovascular Medicine and Cardiovascular Institute, Stanford University, Stanford, CA USA
| | - Sebastien Dubois
- Center for Biomedical Informatics Research, Stanford University, Stanford, CA USA
| | - Kenneth Jung
- Center for Biomedical Informatics Research, Stanford University, Stanford, CA USA
| | | | - Daniel J. Rader
- Perelman School of Medicine at the University of Pennsylvania, Philadelphia, PA USA
- The FH Foundation, Pasadena, CA USA
| | - Joseph B. Leader
- Geisinger Health System, Genomic Medicine Institute, Forty Fort, PA USA
| | | | - Kelly D. Myers
- Atomo, Inc, Austin, TX USA
- The FH Foundation, Pasadena, CA USA
| | | | - Nigam H. Shah
- Center for Biomedical Informatics Research, Stanford University, Stanford, CA USA
| | - Joshua W. Knowles
- Cardiovascular Medicine and Cardiovascular Institute, Stanford University, Stanford, CA USA
- The FH Foundation, Pasadena, CA USA
- Stanford Diabetes Research Center, Stanford, CA USA
| |
Collapse
|
8
|
Caster O, Dietrich J, Kürzinger ML, Lerch M, Maskell S, Norén GN, Tcherny-Lessenot S, Vroman B, Wisniewski A, van Stekelenborg J. Assessment of the Utility of Social Media for Broad-Ranging Statistical Signal Detection in Pharmacovigilance: Results from the WEB-RADR Project. Drug Saf 2018; 41:1355-1369. [PMID: 30043385 PMCID: PMC6223695 DOI: 10.1007/s40264-018-0699-2] [Citation(s) in RCA: 35] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
INTRODUCTION AND OBJECTIVE Social media has been proposed as a possibly useful data source for pharmacovigilance signal detection. This study primarily aimed to evaluate the performance of established statistical signal detection algorithms in Twitter/Facebook for a broad range of drugs and adverse events. METHODS Performance was assessed using a reference set by Harpaz et al., consisting of 62 US Food and Drug Administration labelling changes, and an internal WEB-RADR reference set consisting of 200 validated safety signals. In total, 75 drugs were studied. Twitter/Facebook posts were retrieved for the period March 2012 to March 2015, and drugs/events were extracted from the posts. We retrieved 4.3 million and 2.0 million posts for the WEB-RADR and Harpaz drugs, respectively. Individual case reports were extracted from VigiBase for the same period. Disproportionality algorithms based on the Information Component or the Proportional Reporting Ratio and crude post/report counting were applied in Twitter/Facebook and VigiBase. Receiver operating characteristic curves were generated, and the relative timing of alerting was analysed. RESULTS Across all algorithms, the area under the receiver operating characteristic curve for Twitter/Facebook varied between 0.47 and 0.53 for the WEB-RADR reference set and between 0.48 and 0.53 for the Harpaz reference set. For VigiBase, the ranges were 0.64-0.69 and 0.55-0.67, respectively. In Twitter/Facebook, at best, 31 (16%) and four (6%) positive controls were detected prior to their index dates in the WEB-RADR and Harpaz references, respectively. In VigiBase, the corresponding numbers were 66 (33%) and 17 (27%). CONCLUSIONS Our results clearly suggest that broad-ranging statistical signal detection in Twitter and Facebook, using currently available methods for adverse event recognition, performs poorly and cannot be recommended at the expense of other pharmacovigilance activities.
Collapse
Affiliation(s)
- Ola Caster
- Uppsala Monitoring Centre, Box 1051, Uppsala, 75140, Sweden.
| | | | | | | | | | - G Niklas Norén
- Uppsala Monitoring Centre, Box 1051, Uppsala, 75140, Sweden
| | | | | | | | | |
Collapse
|
9
|
Mower J, Subramanian D, Cohen T. Learning predictive models of drug side-effect relationships from distributed representations of literature-derived semantic predications. J Am Med Inform Assoc 2018; 25:1339-1350. [PMID: 30010902 PMCID: PMC6454491 DOI: 10.1093/jamia/ocy077] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2018] [Revised: 04/23/2018] [Accepted: 06/05/2018] [Indexed: 02/01/2023] Open
Abstract
Objective The aim of this work is to leverage relational information extracted from biomedical literature using a novel synthesis of unsupervised pretraining, representational composition, and supervised machine learning for drug safety monitoring. Methods Using ≈80 million concept-relationship-concept triples extracted from the literature using the SemRep Natural Language Processing system, distributed vector representations (embeddings) were generated for concepts as functions of their relationships utilizing two unsupervised representational approaches. Embeddings for drugs and side effects of interest from two widely used reference standards were then composed to generate embeddings of drug/side-effect pairs, which were used as input for supervised machine learning. This methodology was developed and evaluated using cross-validation strategies and compared to contemporary approaches. To qualitatively assess generalization, models trained on the Observational Medical Outcomes Partnership (OMOP) drug/side-effect reference set were evaluated against a list of ≈1100 drugs from an online database. Results The employed method improved performance over previous approaches. Cross-validation results advance the state of the art (AUC 0.96; F1 0.90 and AUC 0.95; F1 0.84 across the two sets), outperforming methods utilizing literature and/or spontaneous reporting system data. Examination of predictions for unseen drug/side-effect pairs indicates the ability of these methods to generalize, with over tenfold label support enrichment in the top 100 predictions versus the bottom 100 predictions. Discussion and Conclusion Our methods can assist the pharmacovigilance process using information from the biomedical literature. Unsupervised pretraining generates a rich relationship-based representational foundation for machine learning techniques to classify drugs in the context of a putative side effect, given known examples.
Collapse
Affiliation(s)
- Justin Mower
- Baylor College of Medicine, Quantitative and Computational Biosciences, Houston, Texas, USA
| | | | - Trevor Cohen
- School of Biomedical Informatics, University of Texas Health Science Center Houston, Texas, USA
| |
Collapse
|
10
|
Schuemie MJ, Ryan PB, Hripcsak G, Madigan D, Suchard MA. Improving reproducibility by using high-throughput observational studies with empirical calibration. PHILOSOPHICAL TRANSACTIONS. SERIES A, MATHEMATICAL, PHYSICAL, AND ENGINEERING SCIENCES 2018; 376:20170356. [PMID: 30082302 PMCID: PMC6107542 DOI: 10.1098/rsta.2017.0356] [Citation(s) in RCA: 41] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Accepted: 05/08/2018] [Indexed: 06/08/2023]
Abstract
Concerns over reproducibility in science extend to research using existing healthcare data; many observational studies investigating the same topic produce conflicting results, even when using the same data. To address this problem, we propose a paradigm shift. The current paradigm centres on generating one estimate at a time using a unique study design with unknown reliability and publishing (or not) one estimate at a time. The new paradigm advocates for high-throughput observational studies using consistent and standardized methods, allowing evaluation, calibration and unbiased dissemination to generate a more reliable and complete evidence base. We demonstrate this new paradigm by comparing all depression treatments for a set of outcomes, producing 17 718 hazard ratios, each using methodology on par with current best practice. We furthermore include control hypotheses to evaluate and calibrate our evidence generation process. Results show good transitivity and consistency between databases, and agree with four out of the five findings from clinical trials. The distribution of effect size estimates reported in the literature reveals an absence of small or null effects, with a sharp cut-off at p = 0.05. No such phenomena were observed in our results, suggesting more complete and more reliable evidence.This article is part of a discussion meeting issue 'The growing ubiquity of algorithms in society: implications, impacts and innovations'.
Collapse
Affiliation(s)
- Martijn J Schuemie
- Observational Health Data Sciences and Informatics (OHDSI), New York, NY 10032, USA
- Epidemiology Analytics, Janssen Research and Development, Titusville, NJ 08560, USA
| | - Patrick B Ryan
- Observational Health Data Sciences and Informatics (OHDSI), New York, NY 10032, USA
- Epidemiology Analytics, Janssen Research and Development, Titusville, NJ 08560, USA
- Department of Biomedical Informatics, Columbia University Medical Center, New York, NY 10032, USA
| | - George Hripcsak
- Observational Health Data Sciences and Informatics (OHDSI), New York, NY 10032, USA
- Department of Biomedical Informatics, Columbia University Medical Center, New York, NY 10032, USA
- Medical Informatics Services, New York-Presbyterian Hospital, New York, NY 10032, USA
| | - David Madigan
- Observational Health Data Sciences and Informatics (OHDSI), New York, NY 10032, USA
- Department of Statistics, Columbia University, New York, NY 10027, USA
| | - Marc A Suchard
- Observational Health Data Sciences and Informatics (OHDSI), New York, NY 10032, USA
- Department of Biomathematics, University of California, Los Angeles, CA 90095, USA
- Department of Biostatistics, University of California, Los Angeles, CA 90095, USA
- Department of Human Genetics, University of California, Los Angeles, CA 90095, USA
| |
Collapse
|
11
|
Empirical confidence interval calibration for population-level effect estimation studies in observational healthcare data. Proc Natl Acad Sci U S A 2018. [PMID: 29531023 DOI: 10.1073/pnas.1708282114] [Citation(s) in RCA: 80] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Observational healthcare data, such as electronic health records and administrative claims, offer potential to estimate effects of medical products at scale. Observational studies have often been found to be nonreproducible, however, generating conflicting results even when using the same database to answer the same question. One source of discrepancies is error, both random caused by sampling variability and systematic (for example, because of confounding, selection bias, and measurement error). Only random error is typically quantified but converges to zero as databases become larger, whereas systematic error persists independent from sample size and therefore, increases in relative importance. Negative controls are exposure-outcome pairs, where one believes no causal effect exists; they can be used to detect multiple sources of systematic error, but interpreting their results is not always straightforward. Previously, we have shown that an empirical null distribution can be derived from a sample of negative controls and used to calibrate P values, accounting for both random and systematic error. Here, we extend this work to calibration of confidence intervals (CIs). CIs require positive controls, which we synthesize by modifying negative controls. We show that our CI calibration restores nominal characteristics, such as 95% coverage of the true effect size by the 95% CI. We furthermore show that CI calibration reduces disagreement in replications of two pairs of conflicting observational studies: one related to dabigatran, warfarin, and gastrointestinal bleeding and one related to selective serotonin reuptake inhibitors and upper gastrointestinal bleeding. We recommend CI calibration to improve reproducibility of observational studies.
Collapse
|
12
|
Pacurariu AC, Hoeve CE, Arlett P, Genov G, Slattery J, Sturkenboom MCJM, Straus SMJM. Is patient exposure preapproval and postapproval a determinant of the timing and frequency of occurrence of safety issues? Pharmacoepidemiol Drug Saf 2017; 27:168-173. [PMID: 29278866 DOI: 10.1002/pds.4359] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2017] [Revised: 10/20/2017] [Accepted: 10/25/2017] [Indexed: 11/08/2022]
Abstract
BACKGROUND The amount of drug exposure, pre and post approval, is considered to be a direct determinant of knowledge about the safety of a drug. A larger pre-approval exposed population is supposed to reduce the risk of unanticipated safety issues post-approval. The amount of use in the postapproval population is also expected to influence the occurrence and timing of safety issues. We investigated how the amount of pre and post approval exposure influences the detection of post-approval safety issues. METHODS A cohort of innovative drugs approved in Europe was followed for the period of 2012-2016. The main outcome of interest was a new safety issue in the period. Post-approval exposure was collected at 6 month intervals, and pre-approval exposure was collected at the moment of authorisation. Other characteristics collected for the included drugs were anatomical therapeutical chemical (ATC) class, biological status, orphan status and type of approval. We used Cox proportional hazards regression to investigate the association between exposure and the hazard of having a first safety issue. RESULTS The pre-approval exposure was not associated with the risk of safety issues after adjusting for ATC class, biological status, and treatment duration. Higher post-approval exposure was associated with more new safety issues identified (HR = 2.44 (95% CI = 1.12-5.31)) for drugs with more than 1,000 patient-years of cumulative exposure compared to drugs with less than 1,000 patient years of exposure. CONCLUSION Our results suggest that postapproval exposure influences the detection of safety issues.
Collapse
Affiliation(s)
- Alexandra C Pacurariu
- Medicines Evaluation Board, Utrecht, The Netherlands.,Erasmus University Medical Center, Rotterdam, The Netherlands.,European Medicines Agency, London, UK
| | - Christina E Hoeve
- Medicines Evaluation Board, Utrecht, The Netherlands.,Erasmus University Medical Center, Rotterdam, The Netherlands
| | | | | | | | | | - Sabine M J M Straus
- Medicines Evaluation Board, Utrecht, The Netherlands.,Erasmus University Medical Center, Rotterdam, The Netherlands
| |
Collapse
|
13
|
Drug Safety Monitoring in Children: Performance of Signal Detection Algorithms and Impact of Age Stratification. Drug Saf 2017; 39:873-81. [PMID: 27255487 PMCID: PMC4982893 DOI: 10.1007/s40264-016-0433-x] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Abstract
INTRODUCTION Spontaneous reports of suspected adverse drug reactions (ADRs) can be analyzed to yield additional drug safety evidence for the pediatric population. Signal detection algorithms (SDAs) are required for these analyses; however, the performance of SDAs in the pediatric population specifically is unknown. We tested the performance of two SDAs on pediatric data from the US FDA Adverse Event Reporting System (FAERS) and investigated the impact of age stratification and age adjustment on the performance of SDAs. METHODS We tested the performance of two established SDAs: the proportional reporting ratio (PRR) and the empirical Bayes geometric mean (EBGM) on a pediatric dataset from FAERS (2004-2012). We compared the performance of the SDAs with a published pediatric-specific reference set by calculating diagnostic test-related statistics, including the area under the curve (AUC) of receiver operating characteristics. Impact of age stratification and age-adjustment on the performance of the SDAs was assessed. Age adjustment was performed by pooling (Mantel-Hanszel) stratum-specific estimates. RESULTS A total of 115,674 pediatric reports (patients aged 0-18 years) comprising 893,587 drug-event combinations (DECs) were analysed. Crude values of the AUC were similar for both SDAs: 0.731 (PRR) and 0.745 (EBGM). Stratification unmasked four DECs, e.g., 'ibuprofen and thrombocytopenia'. Age adjustment did not improve performance. CONCLUSION The performance of the two tested SDAs was similar in the pediatric population. Age adjustment does not improve performance and is therefore not recommended to be performed routinely. Stratification can reveal new associations, and therefore is recommended when either drug use is age-specific or when an age-specific risk is suspected.
Collapse
|
14
|
Caster O, Sandberg L, Bergvall T, Watson S, Norén GN. vigiRank for statistical signal detection in pharmacovigilance: First results from prospective real-world use. Pharmacoepidemiol Drug Saf 2017; 26:1006-1010. [PMID: 28653790 PMCID: PMC5575476 DOI: 10.1002/pds.4247] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2016] [Revised: 03/22/2017] [Accepted: 05/25/2017] [Indexed: 11/08/2022]
Abstract
Purpose vigiRank is a data‐driven predictive model for emerging safety signals. In addition to disproportionate reporting patterns, it also accounts for the completeness, recency, and geographic spread of individual case reporting, as well as the availability of case narratives. Previous retrospective analysis suggested that vigiRank performed better than disproportionality analysis alone. The purpose of the present analysis was to evaluate its prospective performance. Methods The evaluation of vigiRank was based on real‐world signal detection in VigiBase. In May 2014, vigiRank scores were computed for pairs of new drugs and WHO Adverse Reaction Terminology critical terms with at most 30 reports from at least 2 countries. Initial manual assessments were performed in order of descending score, selecting a subset of drug‐adverse drug reaction pairs for in‐depth expert assessment. The primary performance metric was the proportion of initial assessments that were decided signals during in‐depth assessment. As comparator, the historical performance for disproportionality‐ guided signal detection in VigiBase was computed from a corresponding cohort of drug‐adverse drug reaction pairs assessed between 2009 and 2013. During this period, the requirement for initial manual assessment was a positive lower endpoint of the 95% credibility interval of the Information Component measure of disproportionality, observed for the first time. Results 194 initial assessments suggested by vigiRank's ordering eventually resulted in 6 (3.1%) signals. Disproportionality analysis yielded 19 signals from 1592 initial assessments (1.2%; P < .05). Conclusions Combining multiple strength‐of‐evidence aspects as in vigiRank significantly outperformed disproportionality analysis alone in real‐world pharmacovigilance signal detection, for VigiBase.
Collapse
Affiliation(s)
- Ola Caster
- Uppsala Monitoring Centre, WHO Collaborating Centre for International Drug Monitoring, Uppsala, Sweden.,Department of Computer and Systems Sciences, Stockholm University, Kista, Sweden
| | - Lovisa Sandberg
- Uppsala Monitoring Centre, WHO Collaborating Centre for International Drug Monitoring, Uppsala, Sweden
| | - Tomas Bergvall
- Uppsala Monitoring Centre, WHO Collaborating Centre for International Drug Monitoring, Uppsala, Sweden
| | - Sarah Watson
- Uppsala Monitoring Centre, WHO Collaborating Centre for International Drug Monitoring, Uppsala, Sweden
| | - G Niklas Norén
- Uppsala Monitoring Centre, WHO Collaborating Centre for International Drug Monitoring, Uppsala, Sweden
| |
Collapse
|
15
|
Hauben M, Hung E, Wood J, Soitkar A, Reshef D. The impact of database restriction on pharmacovigilance signal detection of selected cancer therapies. Ther Adv Drug Saf 2017; 8:145-156. [PMID: 28588760 DOI: 10.1177/2042098616685010] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2016] [Accepted: 11/20/2016] [Indexed: 11/15/2022] Open
Abstract
BACKGROUND The aim of this study was to investigate whether database restriction can improve oncology drug pharmacovigilance signal detection performance. METHODS We used spontaneous adverse event (AE) reports in the United States (US) Food and Drug Administration (FDA) Adverse Event Reporting System (FAERS) database. Positive control (PC) drug medical concept (DMC) pairs were selected from safety information not included in the product's first label but subsequently added as label changes. These medical concepts (MCs) were mapped to the Medical Dictionary for Regulatory Activities (MedDRA) preferred terms (PTs) used in FAERS to code AEs. Negative controls (NC) were MCs with circumscribed PTs not included in the corresponding US package insert (USPI). We calculated shrinkage-adjusted observed-to-expected (O/E) reporting frequencies for the aforementioned drug-PT pairs. We also formulated an adjudication framework to calculate performance at the MC level. Performance metrics [sensitivity, specificity, positive and negative predictive value (PPV, NPV), signal/noise (S/N), F and Matthews correlation coefficient (MCC)] were calculated for each analysis and compared. RESULTS The PC reference set consisted of 11 drugs, 487 PTs, 27 MCs, 37 drug-MC combinations and 638 drug-event combinations (DECs). The NC reference set consisted of 11 drugs, 9 PTs, 5 MCs, 40 drug-MC combinations and 67 DECs. Most drug-event pairs were not highlighted by either analysis. A small percentage of signals of disproportionate reporting were lost, more noise than signal, with no gains. Specificity and PPV improved whereas sensitivity, NPV, F and MCC decreased, but all changes were small relative to the decrease in sensitivity. The overall S/N improved. CONCLUSION This oncology drug restricted analysis improved the S/N ratio, removing proportionately more noise than signal, but with significant credible signal loss. Without broader experience and a calculus of costs and utilities of correct versus incorrect classifications in oncology pharmacovigilance such restricted analyses should be optional rather than a default analysis.
Collapse
Affiliation(s)
- Manfred Hauben
- Department of Medicine, New York University School of Medicine, and Pfizer Inc., Safety Sciences Research, 235 East 42nd Street, New York, NY 10017-5755, USA
| | - Eric Hung
- Pfizer Inc., Safety Sciences Research, New York, NY, USA
| | - Jennifer Wood
- Bristol-Myers Squibb, Global Pharmacovigilance and Epidemiology, Hopewell, NJ, USA
| | - Amit Soitkar
- Bristol-Myers Squibb, Global Pharmacovigilance and Epidemiology, Hopewell, NJ, USA
| | - Daniel Reshef
- Bristol-Myers Squibb, Global Pharmacovigilance and Epidemiology, Hopewell, NJ, USA
| |
Collapse
|
16
|
Harpaz R, Odgers D, Gaskin G, DuMouchel W, Winnenburg R, Bodenreider O, Ripple A, Szarfman A, Sorbello A, Horvitz E, White RW, Shah NH. A time-indexed reference standard of adverse drug reactions. Sci Data 2016; 1:140043. [PMID: 25632348 PMCID: PMC4306188 DOI: 10.1038/sdata.2014.43] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022] Open
Abstract
Undetected adverse drug reactions (ADRs) pose a major burden on the health system. Data mining methodologies designed to identify signals of novel ADRs are of deep importance for drug safety surveillance. The development and evaluation of these methodologies requires proper reference benchmarks. While progress has recently been made in developing such benchmarks, our understanding of the performance characteristics of the data mining methodologies is limited because existing benchmarks do not support prospective performance evaluations. We address this shortcoming by providing a reference standard to support prospective performance evaluations. The reference standard was systematically curated from drug labeling revisions, such as new warnings, which were issued and communicated by the US Food and Drug Administration in 2013. The reference standard includes 62 positive test cases and 75 negative controls, and covers 44 drugs and 38 events. We provide usage guidance and empirical support for the reference standard by applying it to analyze two data sources commonly mined for drug safety surveillance.
Collapse
Affiliation(s)
- Rave Harpaz
- Center for Biomedical Informatics Research, Stanford University, Stanford, California 94305, USA
| | - David Odgers
- Center for Biomedical Informatics Research, Stanford University, Stanford, California 94305, USA
| | - Greg Gaskin
- Center for Biomedical Informatics Research, Stanford University, Stanford, California 94305, USA
| | | | | | | | - Anna Ripple
- National Library of Medicine, NIH, Bethesda, Maryland 20894, USA
| | | | | | - Eric Horvitz
- Microsoft Research, Redmond, Washington 98052, USA
| | - Ryen W White
- Microsoft Research, Redmond, Washington 98052, USA
| | - Nigam H Shah
- Center for Biomedical Informatics Research, Stanford University, Stanford, California 94305, USA
| |
Collapse
|
17
|
Abstract
Background and Objective Spontaneous reporting systems (SRSs) remain the cornerstone of post-marketing drug safety surveillance despite their well-known limitations. Judicious use of other available data sources is essential to enable better detection, strengthening and validation of signals. In this study, we investigated the potential of electronic healthcare records (EHRs) to be used alongside an SRS as an independent system, with the aim of improving signal detection. Methods A signal detection strategy, focused on a limited set of adverse events deemed important in pharmacovigilance, was performed retrospectively in two data sources—(1) the Exploring and Understanding Adverse Drug Reactions (EU-ADR) database network and (2) the EudraVigilance database—using data between 2000 and 2010. Five events were considered for analysis: (1) acute myocardial infarction (AMI); (2) bullous eruption; (3) hip fracture; (4) acute pancreatitis; and (5) upper gastrointestinal bleeding (UGIB). Potential signals identified in each system were verified using the current published literature. The complementarity of the two systems to detect signals was expressed as the percentage of the unilaterally identified signals out of the total number of confirmed signals. As a proxy for the associated costs, the number of signals that needed to be reviewed to detect one true signal (number needed to detect [NND]) was calculated. The relationship between the background frequency of the events and the capability of each system to detect signals was also investigated. Results The contribution of each system to signal detection appeared to be correlated with the background incidence of the events, being directly proportional to the incidence in EU-ADR and inversely proportional in EudraVigilance. EudraVigilance was particularly valuable in identifying bullous eruption and acute pancreatitis (71 and 42 % of signals were correctly identified from the total pool of known associations, respectively), while EU-ADR was most useful in identifying hip fractures (60 %). Both systems contributed reasonably well to identification of signals related to UGIB (45 % in EudraVigilance, 40 % in EU-ADR) but only fairly for signals related to AMI (25 % in EU-ADR, 20 % in EudraVigilance). The costs associated with detection of signals were variable across events; however, it was often more costly to detect safety signals in EU-ADR than in EudraVigilance (median NNDs: 7 versus 5). Conclusion An EHR-based system may have additional value for signal detection, alongside already established systems, especially in the presence of adverse events with a high background incidence. While the SRS appeared to be more cost effective overall, for some events the costs associated with signal detection in the EHR might be justifiable. Electronic supplementary material The online version of this article (doi:10.1007/s40264-015-0341-5) contains supplementary material, which is available to authorized users.
Collapse
|
18
|
Generalized enrichment analysis improves the detection of adverse drug events from the biomedical literature. BMC Bioinformatics 2016; 17:250. [PMID: 27333889 PMCID: PMC4918084 DOI: 10.1186/s12859-016-1080-z] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2015] [Accepted: 05/11/2016] [Indexed: 01/12/2023] Open
Abstract
Background Identification of associations between marketed drugs and adverse events from the biomedical literature assists drug safety monitoring efforts. Assessing the significance of such literature-derived associations and determining the granularity at which they should be captured remains a challenge. Here, we assess how defining a selection of adverse event terms from MeSH, based on information content, can improve the detection of adverse events for drugs and drug classes. Results We analyze a set of 105,354 candidate drug adverse event pairs extracted from article indexes in MEDLINE. First, we harmonize extracted adverse event terms by aggregating them into higher-level MeSH terms based on the terms’ information content. Then, we determine statistical enrichment of adverse events associated with drug and drug classes using a conditional hypergeometric test that adjusts for dependencies among associated terms. We compare our results with methods based on disproportionality analysis (proportional reporting ratio, PRR) and quantify the improvement in signal detection with our generalized enrichment analysis (GEA) approach using a gold standard of drug-adverse event associations spanning 174 drugs and four events. For single drugs, the best GEA method (Precision: .92/Recall: .71/F1-measure: .80) outperforms the best PRR based method (.69/.69/.69) on all four adverse event outcomes in our gold standard. For drug classes, our GEA performs similarly (.85/.69/.74) when increasing the level of abstraction for adverse event terms. Finally, on examining the 1609 individual drugs in our MEDLINE set, which map to chemical substances in ATC, we find signals for 1379 drugs (10,122 unique adverse event associations) on applying GEA with p < 0.005. Conclusions We present an approach based on generalized enrichment analysis that can be used to detect associations between drugs, drug classes and adverse events at a given level of granularity, at the same time correcting for known dependencies among events. Our study demonstrates the use of GEA, and the importance of choosing appropriate abstraction levels to complement current drug safety methods. We provide an R package for exploration of alternative abstraction levels of adverse event terms based on information content. Electronic supplementary material The online version of this article (doi:10.1186/s12859-016-1080-z) contains supplementary material, which is available to authorized users.
Collapse
|
19
|
Abstract
What has been learned about electronic health data as a primary data source for regulatory decisions regarding the harms of drugs? Observational studies with electronic health data for postmarket risk assessment can now be conducted in Europe and the US in patient populations numbering in the tens of millions compared with a few hundred patients in a typical clinical trial. With standard protocols, results can be obtained in a few months; however, extensive research published by scores of investigators has illuminated the many obstacles that prevent obtaining robust, reproducible results that are reliable enough to be a primary source for drug safety decisions involving the health and safety of millions of patients. The most widely used terminology for coding patient interactions with medical providers for payment has proved ill-suited to identifying the adverse effects of drugs. Directly conflicting results were reported in otherwise similar patient health databases, even using identical event definitions and research methods. Evaluation of some accepted statistical methods revealed systematic bias, while others appeared to be unreliable. When electronic health data studies detected no drug risk, there were no robust and accepted standards to judge whether the drug was unlikely to cause the adverse effect or whether the study was incapable of detecting it. Substantial investment and careful thinking is needed to improve the reliability of risk assessments based on electronic health data, and current limitations need to be fully understood.
Collapse
Affiliation(s)
- Thomas J Moore
- Institute for Safe Medication Practices, 101 N. Columbus St, Suite 410, Alexandria, VA, 22214, USA,
| | | |
Collapse
|
20
|
White RW, Wang S, Pant A, Harpaz R, Shukla P, Sun W, DuMouchel W, Horvitz E. Early identification of adverse drug reactions from search log data. J Biomed Inform 2016; 59:42-8. [DOI: 10.1016/j.jbi.2015.11.005] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2015] [Revised: 11/07/2015] [Accepted: 11/12/2015] [Indexed: 01/28/2023]
|
21
|
Seabroke S, Candore G, Juhlin K, Quarcoo N, Wisniewski A, Arani R, Painter J, Tregunno P, Norén GN, Slattery J. Performance of Stratified and Subgrouped Disproportionality Analyses in Spontaneous Databases. Drug Saf 2016; 39:355-64. [DOI: 10.1007/s40264-015-0388-3] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
|
22
|
Li Y, Ryan PB, Wei Y, Friedman C. A Method to Combine Signals from Spontaneous Reporting Systems and Observational Healthcare Data to Detect Adverse Drug Reactions. Drug Saf 2015; 38:895-908. [PMID: 26153397 PMCID: PMC4579260 DOI: 10.1007/s40264-015-0314-8] [Citation(s) in RCA: 42] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
INTRODUCTION Observational healthcare data contain information useful for hastening detection of adverse drug reactions (ADRs) that may be missed by using data in spontaneous reporting systems (SRSs) alone. There are only several papers describing methods that integrate evidence from healthcare databases and SRSs. We propose a methodology that combines ADR signals from these two sources. OBJECTIVES The aim of this study was to investigate whether the proposed method would result in more accurate ADR detection than methods using SRSs or healthcare data alone. RESEARCH DESIGN We applied the method to four clinically serious ADRs, and evaluated it using three experiments that involve combining an SRS with a single facility small-scale electronic health record (EHR), a larger scale network-based EHR, and a much larger scale healthcare claims database. The evaluation used a reference standard comprising 165 positive and 234 negative drug-ADR pairs. MEASURES Area under the receiver operator characteristics curve (AUC) was computed to measure performance. RESULTS There was no improvement in the AUC when the SRS and small-scale HER were combined. The AUC of the combined SRS and large-scale EHR was 0.82 whereas it was 0.76 for each of the individual systems. Similarly, the AUC of the combined SRS and claims system was 0.82 whereas it was 0.76 and 0.78, respectively, for the individual systems. CONCLUSIONS The proposed method resulted in a significant improvement in the accuracy of ADR detection when the resources used for combining had sufficient amounts of data, demonstrating that the method could integrate evidence from multiple sources and serve as a tool in actual pharmacovigilance practice.
Collapse
Affiliation(s)
- Ying Li
- Department of Biomedical Informatics, Columbia University Medical Center, 622 W. 168th Street, Presbyterian Building 20th Floor, New York, NY, 10032, USA.
| | - Patrick B Ryan
- Department of Biomedical Informatics, Columbia University Medical Center, 622 W. 168th Street, Presbyterian Building 20th Floor, New York, NY, 10032, USA
- Janssen Research and Development, 1125 Trenton Harbourton Rd, Titusville, NJ, 08560, USA
- Observational Health Data Sciences and Informatics (OHDSI), New York, NY, 10032, USA
| | - Ying Wei
- Department of Biostatistics, Columbia University, New York, NY, 10032, USA
| | - Carol Friedman
- Department of Biomedical Informatics, Columbia University Medical Center, 622 W. 168th Street, Presbyterian Building 20th Floor, New York, NY, 10032, USA
| |
Collapse
|
23
|
Wang G, Jung K, Winnenburg R, Shah NH. A method for systematic discovery of adverse drug events from clinical notes. J Am Med Inform Assoc 2015; 22:1196-204. [PMID: 26232442 DOI: 10.1093/jamia/ocv102] [Citation(s) in RCA: 42] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2015] [Accepted: 06/16/2015] [Indexed: 01/11/2023] Open
Abstract
OBJECTIVE Adverse drug events (ADEs) are undesired harmful effects resulting from use of a medication, and occur in 30% of hospitalized patients. The authors have developed a data-mining method for systematic, automated detection of ADEs from electronic medical records. MATERIALS AND METHODS This method uses the text from 9.5 million clinical notes, along with prior knowledge of drug usages and known ADEs, as inputs. These inputs are further processed into statistics used by a discriminative classifier which outputs the probability that a given drug-disorder pair represents a valid ADE association. Putative ADEs identified by the classifier are further filtered for positive support in 2 independent, complementary data sources. The authors evaluate this method by assessing support for the predictions in other curated data sources, including a manually curated, time-indexed reference standard of label change events. RESULTS This method uses a classifier that achieves an area under the curve of 0.94 on a held out test set. The classifier is used on 2,362,950 possible drug-disorder pairs comprised of 1602 unique drugs and 1475 unique disorders for which we had data, resulting in 240 high-confidence, well-supported drug-AE associations. Eighty-seven of them (36%) are supported in at least one of the resources that have information that was not available to the classifier. CONCLUSION This method demonstrates the feasibility of systematic post-marketing surveillance for ADEs using electronic medical records, a key component of the learning healthcare system.
Collapse
Affiliation(s)
- Guan Wang
- Stanford University, Center for Biomedical Informatics, Stanford, California, USA
| | - Kenneth Jung
- Stanford University, Center for Biomedical Informatics, Stanford, California, USA
| | - Rainer Winnenburg
- Stanford University, Center for Biomedical Informatics, Stanford, California, USA
| | - Nigam H Shah
- Stanford University, Center for Biomedical Informatics, Stanford, California, USA
| |
Collapse
|
24
|
Norén GN. Pharmacovigilance for a revolving world: prospects of patient-generated data on the internet. Drug Saf 2015; 37:761-4. [PMID: 25096955 DOI: 10.1007/s40264-014-0205-4] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
Affiliation(s)
- G Niklas Norén
- Uppsala Monitoring Centre, WHO Collaborating Centre for International Drug Monitoring, Box 1051, 751 40, Uppsala, Sweden,
| |
Collapse
|
25
|
Comment on: "Zoo or savannah? Choice of training ground for evidence-based pharmacovigilance". Drug Saf 2015; 38:113-4. [PMID: 25432779 DOI: 10.1007/s40264-014-0245-9] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
|
26
|
Authors' reply to Harpaz et al. comment on: "Zoo or savannah? Choice of training ground for evidence-based pharmacovigilance". Drug Saf 2015; 38:115-6. [PMID: 25432780 DOI: 10.1007/s40264-014-0246-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
|
27
|
Osokogu OU, Fregonese F, Ferrajolo C, Verhamme K, de Bie S, 't Jong G, Catapano M, Weibel D, Kaguelidou F, Bramer WM, Hsia Y, Wong ICK, Gazarian M, Bonhoeffer J, Sturkenboom M. Pediatric drug safety signal detection: a new drug-event reference set for performance testing of data-mining methods and systems. Drug Saf 2015; 38:207-17. [PMID: 25663078 PMCID: PMC4328124 DOI: 10.1007/s40264-015-0265-0] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
BACKGROUND Better evidence regarding drug safety in the pediatric population might be generated from existing data sources such as spontaneous reporting systems and electronic healthcare records. The Global Research in Paediatrics (GRiP)-Network of Excellence aims to develop pediatric-specific methods that can be applied to these data sources. A reference set of positive and negative drug-event associations is required. OBJECTIVE The aim of this study was to develop a pediatric-specific reference set of positive and negative drug-event associations. METHODS Considering user patterns and expert opinion, 16 drugs that are used in individuals aged 0-18 years were selected and evaluated against 16 events, regarded as important safety outcomes. A cross-table of unique drug-event pairs was created. Each pair was classified as potential positive or negative control based on information from the drug's Summary of Product Characteristics and Micromedex. If both information sources consistently listed the event as an adverse event, the combination was reviewed as potential positive control. If both did not, the combination was evaluated as potential negative control. Further evaluation was based on published literature. RESULTS Selected drugs include ibuprofen, flucloxacillin, domperidone, methylphenidate, montelukast, quinine, and cyproterone/ethinylestradiol. Selected events include bullous eruption, aplastic anemia, ventricular arrhythmia, sudden death, acute kidney injury, psychosis, and seizure. Altogether, 256 unique combinations were reviewed, yielding 37 positive (17 with evidence from the pediatric population and 20 with evidence from adults only) and 90 negative control pairs, with the remainder being unclassifiable. CONCLUSION We propose a drug-event reference set that can be used to compare different signal detection methods in the pediatric population.
Collapse
Affiliation(s)
- Osemeke U Osokogu
- Department of Medical Informatics, Erasmus University Medical Center, PO Box 2040, 3000 CA, Rotterdam, The Netherlands,
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
28
|
Authors' reply to Hennessy and Leonard's comment on "Desideratum for evidence-based epidemiology". Drug Saf 2014; 38:105-7. [PMID: 25511912 DOI: 10.1007/s40264-014-0254-8] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
|