1
|
Sauzet O, Dyck J, Cornelius V. Optimal Significance Levels and Sample Sizes for Signal Detection Methods Based on Non-constant Hazards. Drug Saf 2024; 47:1149-1156. [PMID: 38982034 DOI: 10.1007/s40264-024-01460-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/11/2024] [Indexed: 07/11/2024]
Abstract
BACKGROUND AND OBJECTIVES Statistical methods for signal detection of adverse drug reactions (ADRs) in electronic health records (EHRs) need information about optimal significance levels and sample sizes to achieve sufficient power. Sauzet and Cornelius proposed tests for signal detection based on the hazard functions of Weibull type distributions (WSP tests) which use the time-to-event information available in EHRs. Optimal significance levels and sample sizes for the application of the WPS tests are derived. METHOD A simulation study was performed with a range of scenarios for sample size, rate of event due (ADRs), and not due to the drug and random time to ADR occurrence. Based on the area under the curve of the receiver operating characteristic graph, we obtain optimal significance levels of the different WSP tests for the implementation in a hypothesis free signal detection setting and approximate sample sizes required to reach a power of 80% or 90%. RESULTS The dWSP-pPWSP (combination of double WSP and power WSP) test with a significance level of 0.004 was recommended. Sample sizes needed for a power of 80% were found to start at 60 events for an ADR rate equal to the background rate of 0.1. The number of events required for a background rate of 0.05 and an ADR rate equal to a 20% increase of the background rate was 900. CONCLUSION Based on this study, it is recommended to use the dWSP-pWSP test combination for signal detection with a significance level of 0.004 when the same test is applied to all adverse events not depending on rates.
Collapse
Affiliation(s)
- Odile Sauzet
- Department of Business Administration and Economics, Bielefeld University, Bielefeld, Germany.
- Department of Epidemiology and International Public Health, Bielefeld School of Public Health (BiSPH), Bielefeld University, Bielefeld, Germany.
- Odile Sauzet Universität Bielefeld, Postfach 10 01 31, 33501, Bielefeld, Germany.
| | - Julia Dyck
- Department of Business Administration and Economics, Bielefeld University, Bielefeld, Germany
| | - Victoria Cornelius
- Imperial Clinical Trials Unit, School of Public Health, Imperial College London, London, UK
| |
Collapse
|
2
|
Taylor LG, Kürzinger ML, Hermans R, Enshaeifar S, Dwan B, Chhikara P, Li X, Thummisetti S, Colas S, Duverne M, Juhaeri J. Considerations for practical use of tree-based scan statistics for signal detection using electronic healthcare data: a case study with insulin glargine. Expert Opin Drug Saf 2024:1-11. [PMID: 39162331 DOI: 10.1080/14740338.2024.2393274] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2024] [Accepted: 07/29/2024] [Indexed: 08/21/2024]
Abstract
BACKGROUND Hypothesis-free signal detection (HFSD) methods such as tree-based scan statistics (TBSS) applied to longitudinal electronic healthcare data (EHD) are increasingly used in safety monitoring. However, challenges may arise in interpreting HFSD results alongside results from disproportionality analysis of spontaneous reporting. RESEARCH DESIGN AND METHODS Using the anti-diabetes drug insulin glargine (Lantus®) we apply two different tree-based scan designs using TreeScan™ software on retrospective EHD and compare the results to one another as well as to results from a disproportionality analysis using SRD. RESULTS The self-controlled tree temporal scan method produced the larger number of alerts relative to propensity-score matched approach; however, far fewer alerts were observed when analyses were limited to EHD in inpatient/emergency room settings only. Very few reference adverse events were observed using TBSS methods on EHD relative to disproportionality methods in SRD. CONCLUSION Differences in detected alerts between TBSS methods and between TBSS and disproportionality analysis of SRD are likely attributable to differences in data, comparator, and study design. Our results suggest that HFDS methods like TBSS applied to EHD may complement more traditional approaches such as disproportionality analysis of SRD to provide a more complete picture of product safety in the post-approval setting.
Collapse
Affiliation(s)
| | | | | | | | | | | | - Xinyu Li
- Sanofi, Epidemiology and Benefit-Risk, PSPV, Bridgewater, MA, USA
| | | | - Sandrine Colas
- Sanofi Epidemiology and Benefit-Risk, PSPV, Toronto, Canada
| | | | - Juhaeri Juhaeri
- Sanofi, Epidemiology and Benefit-Risk, PSPV, Bridgewater, MA, USA
| |
Collapse
|
3
|
Hsieh MHC, Liang HY, Tsai CY, Tseng YT, Chao PH, Huang WI, Chen WW, Lin SJ, Lai ECC. A New Drug Safety Signal Detection and Triage System Integrating Sequence Symmetry Analysis and Tree-Based Scan Statistics with Longitudinal Data. Clin Epidemiol 2023; 15:91-107. [PMID: 36699647 PMCID: PMC9868282 DOI: 10.2147/clep.s395922] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2022] [Accepted: 12/14/2022] [Indexed: 01/19/2023] Open
Abstract
Purpose Development and evaluation of a drug-safety signal detection system integrating data-mining tools in longitudinal data is essential. This study aimed to construct a new triage system using longitudinal data for drug-safety signal detection, integrating data-mining tools, and evaluate adaptability of such system. Patients and Methods Based on relevant guidelines and structural frameworks in Taiwan's pharmacovigilance system, we constructed a triage system integrating sequence symmetry analysis (SSA) and tree-based scan statistics (TreeScan) as data-mining tools for detecting safety signals. We conducted an exploratory analysis utilizing Taiwan's National Health Insurance Database and selecting two drug classes (sodium-glucose co-transporter-2 inhibitors (SGLT2i) and non-fluorinated quinolones (NFQ)) as chronic and episodic treatment respectively, as examples to test feasibility of the system. Results Under the proposed system, either cohort-based or self-controlled mining with SSA and TreeScan was selected, based on whether the screened drug had an appropriate comparator. All detected alerts were further classified as known adverse drug reactions (ADRs), events related to other causes or potential signals from the triage algorithm, building on existing drug labels and clinical judgement. Exploratory analysis revealed greater numbers of signals for NFQ with a relatively low proportion of known ADRs; most were related to indication, patient characteristics or bias. No safety signals were found. By contrast, most SGLT2i signals were known ADRs or events related to patient characteristics. Four were potential signals warranting further investigation. Conclusion The proposed system facilitated active and systematic screening to detect and classify potential safety signals. Countries with real-world longitudinal data could adopt it to streamline drug-safety surveillance.
Collapse
Affiliation(s)
- Miyuki Hsing-Chun Hsieh
- Institute of Clinical Pharmacy and Pharmaceutical Sciences, College of Medicine, National Cheng Kung University, Tainan, Taiwan,Taiwan Drug Relief Foundation (TDRF), Taipei, Taiwan
| | | | | | - Yu-Ting Tseng
- Taiwan Drug Relief Foundation (TDRF), Taipei, Taiwan
| | - Pi-Hui Chao
- Taiwan Drug Relief Foundation (TDRF), Taipei, Taiwan
| | - Wei-I Huang
- Taiwan Drug Relief Foundation (TDRF), Taipei, Taiwan
| | - Wen-Wen Chen
- Taiwan Drug Relief Foundation (TDRF), Taipei, Taiwan
| | - Swu-Jane Lin
- Department of Pharmacy Systems, Outcomes & Policy, College of Pharmacy, University of Illinois at Chicago, Chicago, IL, USA
| | - Edward Chia-Cheng Lai
- Institute of Clinical Pharmacy and Pharmaceutical Sciences, College of Medicine, National Cheng Kung University, Tainan, Taiwan,Correspondence: Edward Chia-Cheng Lai, Email
| |
Collapse
|
4
|
Zhao Y, Yu Y, Wang H, Li Y, Deng Y, Jiang G, Luo Y. Machine Learning in Causal Inference: Application in Pharmacovigilance. Drug Saf 2022; 45:459-476. [PMID: 35579811 PMCID: PMC9114053 DOI: 10.1007/s40264-022-01155-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/09/2022] [Indexed: 01/28/2023]
Abstract
Monitoring adverse drug events or pharmacovigilance has been promoted by the World Health Organization to assure the safety of medicines through a timely and reliable information exchange regarding drug safety issues. We aim to discuss the application of machine learning methods as well as causal inference paradigms in pharmacovigilance. We first reviewed data sources for pharmacovigilance. Then, we examined traditional causal inference paradigms, their applications in pharmacovigilance, and how machine learning methods and causal inference paradigms were integrated to enhance the performance of traditional causal inference paradigms. Finally, we summarized issues with currently mainstream correlation-based machine learning models and how the machine learning community has tried to address these issues by incorporating causal inference paradigms. Our literature search revealed that most existing data sources and tasks for pharmacovigilance were not designed for causal inference. Additionally, pharmacovigilance was lagging in adopting machine learning-causal inference integrated models. We highlight several currently trending directions or gaps to integrate causal inference with machine learning in pharmacovigilance research. Finally, our literature search revealed that the adoption of causal paradigms can mitigate known issues with machine learning models. We foresee that the pharmacovigilance domain can benefit from the progress in the machine learning field.
Collapse
Affiliation(s)
- Yiqing Zhao
- Department of Preventive Medicine, Feinberg School of Medicine, Northwestern University, 750 N Lake Shore Drive, Room 11-189, Chicago, IL, 60611, USA
| | - Yue Yu
- Department of Artificial Intelligence and Informatics, Mayo Clinic, Rochester, MN, 55902, USA
| | - Hanyin Wang
- Department of Preventive Medicine, Feinberg School of Medicine, Northwestern University, 750 N Lake Shore Drive, Room 11-189, Chicago, IL, 60611, USA
| | - Yikuan Li
- Department of Preventive Medicine, Feinberg School of Medicine, Northwestern University, 750 N Lake Shore Drive, Room 11-189, Chicago, IL, 60611, USA
| | - Yu Deng
- Department of Preventive Medicine, Feinberg School of Medicine, Northwestern University, 750 N Lake Shore Drive, Room 11-189, Chicago, IL, 60611, USA
| | - Guoqian Jiang
- Department of Artificial Intelligence and Informatics, Mayo Clinic, Rochester, MN, 55902, USA
| | - Yuan Luo
- Department of Preventive Medicine, Feinberg School of Medicine, Northwestern University, 750 N Lake Shore Drive, Room 11-189, Chicago, IL, 60611, USA.
| |
Collapse
|
7
|
Chen X, Gao W, Li J, You D, Yu Z, Zhang M, Shao F, Wei Y, Zhang R, Lange T, Wang Q, Chen F, Lu X, Zhao Y. A predictive paradigm for COVID-19 prognosis based on the longitudinal measure of biomarkers. Brief Bioinform 2021; 22:6291518. [PMID: 34081102 PMCID: PMC8195146 DOI: 10.1093/bib/bbab206] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2021] [Revised: 04/10/2021] [Accepted: 05/11/2021] [Indexed: 12/30/2022] Open
Abstract
Novel coronavirus disease 2019 (COVID-19) is an emerging, rapidly evolving crisis, and the ability to predict prognosis for individual COVID-19 patient is important for guiding treatment. Laboratory examinations were repeatedly measured during hospitalization for COVID-19 patients, which provide the possibility for the individualized early prediction of prognosis. However, previous studies mainly focused on risk prediction based on laboratory measurements at one time point, ignoring disease progression and changes of biomarkers over time. By using historical regression trees (HTREEs), a novel machine learning method, and joint modeling technique, we modeled the longitudinal trajectories of laboratory biomarkers and made dynamically predictions on individual prognosis for 1997 COVID-19 patients. In the discovery phase, based on 358 COVID-19 patients admitted between 10 January and 18 February 2020 from Tongji Hospital, HTREE model identified a set of important variables including 14 prognostic biomarkers. With the trajectories of those biomarkers through 5-day, 10-day and 15-day, the joint model had a good performance in discriminating the survived and deceased COVID-19 patients (mean AUCs of 88.81, 84.81 and 85.62% for the discovery set). The predictive model was successfully validated in two independent datasets (mean AUCs of 87.61, 87.55 and 87.03% for validation the first dataset including 112 patients, 94.97, 95.78 and 94.63% for the second validation dataset including 1527 patients, respectively). In conclusion, our study identified important biomarkers associated with the prognosis of COVID-19 patients, characterized the time-to-event process and obtained dynamic predictions at the individual level.
Collapse
Affiliation(s)
- Xin Chen
- Department of Biostatistics, School of Public Health, Nanjing Medical University, Nanjing, Jiangsu, China, 211166
| | - Wei Gao
- Department of Geriatrics, Sir Run Run Hospital, Nanjing Medical University, 109 Longmian Avenue, Nanjing, 211166, China
| | - Jie Li
- Research Center for Global Health, School of Public Health, Nanjing Medical University, Nanjing, 211166, Jiangsu, China.,Department of Bioinformatics, Nanjing Medical University, 101 Longmian Avenue, Jiangning District, Nanjing, 211166, Jiangsu, China.,Jiangsu Key Lab of Cancer Biomarkers, Prevention and Treatment, Collaborative Innovation Center for Personalized Cancer Medicine, Nanjing Medical University, Nanjing, 211166, Jiangsu, China
| | - Dongfang You
- Department of Biostatistics, School of Public Health, Nanjing Medical University, Nanjing, Jiangsu, China, 211166
| | - Zhaolei Yu
- Department of Biostatistics, School of Public Health, Nanjing Medical University, Nanjing, Jiangsu, China, 211166
| | - Mingzhi Zhang
- Department of Biostatistics, School of Public Health, Nanjing Medical University, Nanjing, Jiangsu, China, 211166
| | - Fang Shao
- Department of Biostatistics, School of Public Health, Nanjing Medical University, Nanjing, Jiangsu, China, 211166
| | - Yongyue Wei
- Department of Biostatistics, School of Public Health, Nanjing Medical University, Nanjing, Jiangsu, China, 211166.,China International Cooperation Center for Environment and Human Health, Center for Global Health, Nanjing Medical University, Nanjing, Jiangsu, China, 211166.,The Center of Biomedical Big Data and the Laboratory of Biomedical Big Data, Nanjing Medical University, Nanjing, Jiangsu, China, 211166
| | - Ruyang Zhang
- Department of Biostatistics, School of Public Health, Nanjing Medical University, Nanjing, Jiangsu, China, 211166.,China International Cooperation Center for Environment and Human Health, Center for Global Health, Nanjing Medical University, Nanjing, Jiangsu, China, 211166.,The Center of Biomedical Big Data and the Laboratory of Biomedical Big Data, Nanjing Medical University, Nanjing, Jiangsu, China, 211166
| | - Theis Lange
- Section of Biostatistics, Department of Public Health, Faculty of Health and Medical Sciences, University of Copenhagen, Øster Farimagsgade 5, 1353, Copenhagen, Denmark
| | - Qianghu Wang
- Research Center for Global Health, School of Public Health, Nanjing Medical University, Nanjing, 211166, Jiangsu, China.,Department of Bioinformatics, Nanjing Medical University, 101 Longmian Avenue, Jiangning District, Nanjing, 211166, Jiangsu, China.,Jiangsu Key Lab of Cancer Biomarkers, Prevention and Treatment, Collaborative Innovation Center for Personalized Cancer Medicine, Nanjing Medical University, Nanjing, 211166, Jiangsu, China
| | - Feng Chen
- Department of Biostatistics, School of Public Health, Nanjing Medical University, Nanjing, Jiangsu, China, 211166.,Jiangsu Key Lab of Cancer Biomarkers, Prevention and Treatment, Collaborative Innovation Center for Personalized Cancer Medicine, Nanjing Medical University, Nanjing, 211166, Jiangsu, China.,China International Cooperation Center for Environment and Human Health, Center for Global Health, Nanjing Medical University, Nanjing, Jiangsu, China, 211166.,The Center of Biomedical Big Data and the Laboratory of Biomedical Big Data, Nanjing Medical University, Nanjing, Jiangsu, China, 211166
| | - Xiang Lu
- Department of Geriatrics, Sir Run Run Hospital, Nanjing Medical University, 109 Longmian Avenue, Nanjing, 211166, China
| | - Yang Zhao
- Department of Biostatistics, School of Public Health, Nanjing Medical University, Nanjing, Jiangsu, China, 211166.,Jiangsu Key Lab of Cancer Biomarkers, Prevention and Treatment, Collaborative Innovation Center for Personalized Cancer Medicine, Nanjing Medical University, Nanjing, 211166, Jiangsu, China.,China International Cooperation Center for Environment and Human Health, Center for Global Health, Nanjing Medical University, Nanjing, Jiangsu, China, 211166.,The Center of Biomedical Big Data and the Laboratory of Biomedical Big Data, Nanjing Medical University, Nanjing, Jiangsu, China, 211166
| |
Collapse
|