1
|
Randall S, Brown A, Ferrante A, Boyd J, Robinson S. Implementing privacy preserving record linkage: Insights from Australian use cases. Int J Med Inform 2024; 191:105582. [PMID: 39096591 DOI: 10.1016/j.ijmedinf.2024.105582] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2024] [Revised: 07/15/2024] [Accepted: 07/29/2024] [Indexed: 08/05/2024]
Abstract
OBJECTIVE To describe the use of privacy preserving linkage methods operationally in Australia, and to present insights and key learnings from their implementation. METHODS Privacy preserving record linkage (PPRL) utilising Bloom filters provides a unique practical mechanism that allows linkage to occur without the release of personally identifiable information (PII), while still ensuring high accuracy. RESULTS The methodology has received wide uptake within Australia, with four state linkage units with privacy preserving capability. It has enabled access to general practice and private pathology data amongst other, both much sought after datasets previous inaccessible for linkage. CONCLUSION The Australian experience suggests privacy preserving linkage is a practical solution for improving data access for policy, planning and population health research. It is hoped interest in this methodology internationally continues to grow.
Collapse
Affiliation(s)
- Sean Randall
- Deakin Health Economics, Institute for Health Transformation, Deakin University. Burwood Hwy, Burwood, VIC 3125, Australia.
| | - Adrian Brown
- Centre for Data Linkage, School of Population Health, Curtin University. Kent St, Bentley, WA 6102, Australia.
| | - Anna Ferrante
- Centre for Data Linkage, School of Population Health, Curtin University. Kent St, Bentley, WA 6102, Australia.
| | - James Boyd
- Department of Public Health, La Trobe University, Plenty Rd, Bundoora, VIC 3086, Australia.
| | - Suzanne Robinson
- Deakin Health Economics, Institute for Health Transformation, Deakin University. Burwood Hwy, Burwood, VIC 3125, Australia.
| |
Collapse
|
2
|
Pathak A, Serrer L, Zapata D, King R, Mirel LB, Sukalac T, Srinivasan A, Baier P, Bhalla M, David-Ferdon C, Luxenberg S, Gundlapalli AV. Privacy preserving record linkage for public health action: opportunities and challenges. J Am Med Inform Assoc 2024:ocae196. [PMID: 39047294 DOI: 10.1093/jamia/ocae196] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2024] [Revised: 06/27/2024] [Accepted: 07/12/2024] [Indexed: 07/27/2024] Open
Abstract
OBJECTIVES To understand the landscape of privacy preserving record linkage (PPRL) applications in public health, assess estimates of PPRL accuracy and privacy, and evaluate factors for PPRL adoption. MATERIALS AND METHODS A literature scan examined the accuracy, data privacy, and scalability of PPRL in public health. Twelve interviews with subject matter experts were conducted and coded using an inductive approach to identify factors related to PPRL adoption. RESULTS PPRL has a high level of linkage quality and accuracy. PPRL linkage quality was comparable to that of clear text linkage methods (requiring direct personally identifiable information [PII]) for linkage across various settings and research questions. Accuracy of PPRL depended on several components, such as PPRL technique, and the proportion of missingness and errors in underlying data. Strategies to increase adoption include increasing understanding of PPRL, improving data owner buy-in, establishing governance structure and oversight, and developing a public health implementation strategy for PPRL. DISCUSSION PPRL protects privacy by eliminating the need to share PII for linkage, but the accuracy and linkage quality depend on factors including the choice of PPRL technique and specific PII used to create encrypted identifiers. Large-scale implementations of PPRL linking millions of observations-including PCORnet, National Institutes for Health N3C, and the Centers for Disease Control and Prevention COVID-19 project have demonstrated the scalability of PPRL for public health applications. CONCLUSIONS Applications of PPRL in public health have demonstrated their value for the public health community. Although gaps must be addressed before wide implementation, PPRL is a promising solution to data linkage challenges faced by the public health ecosystem.
Collapse
Affiliation(s)
- Aditi Pathak
- American Institutes for Research, Arlington, VA 22202, United States
| | - Laina Serrer
- American Institutes for Research, Arlington, VA 22202, United States
| | - Daniela Zapata
- American Institutes for Research, Arlington, VA 22202, United States
| | - Raymond King
- National Center for Chronic Disease Prevention and Health Promotion, Centers for Disease Control and Prevention, Atlanta, GA 30341, United States
| | - Lisa B Mirel
- National Center for Science and Engineering Statistics, National Science Foundation, Alexandria, VA 22314, United States
| | - Thomas Sukalac
- Center for Forecasting and Outbreak Analytics, Centers for Disease Control and Prevention, Atlanta, GA 30333, United States
| | - Arunkumar Srinivasan
- National Center for Immunization and Respiratory Diseases, Centers for Disease Control and Prevention, Atlanta, GA 30333, United States
| | - Patrick Baier
- American Institutes for Research, Arlington, VA 22202, United States
| | - Meera Bhalla
- American Institutes for Research, Arlington, VA 22202, United States
| | - Corinne David-Ferdon
- Office of Public Health Data, Surveillance, and Technology, Centers for Disease Control and Prevention, Atlanta, GA 30333, United States
| | - Steven Luxenberg
- Office of Public Health Data, Surveillance, and Technology, Centers for Disease Control and Prevention, Atlanta, GA 30333, United States
| | - Adi V Gundlapalli
- Office of Public Health Data, Surveillance, and Technology, Centers for Disease Control and Prevention, Atlanta, GA 30333, United States
| |
Collapse
|
3
|
Campion TR, Craven CK, Dorr DA, Bernstam EV, Knosp BM. Understanding enterprise data warehouses to support clinical and translational research: impact, sustainability, demand management, and accessibility. J Am Med Inform Assoc 2024; 31:1522-1528. [PMID: 38777803 PMCID: PMC11187432 DOI: 10.1093/jamia/ocae111] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2024] [Revised: 04/10/2024] [Accepted: 05/05/2024] [Indexed: 05/25/2024] Open
Abstract
OBJECTIVES Healthcare organizations, including Clinical and Translational Science Awards (CTSA) hubs funded by the National Institutes of Health, seek to enable secondary use of electronic health record (EHR) data through an enterprise data warehouse for research (EDW4R), but optimal approaches are unknown. In this qualitative study, our goal was to understand EDW4R impact, sustainability, demand management, and accessibility. MATERIALS AND METHODS We engaged a convenience sample of informatics leaders from CTSA hubs (n = 21) for semi-structured interviews and completed a directed content analysis of interview transcripts. RESULTS EDW4R have created institutional capacity for single- and multi-center studies, democratized access to EHR data for investigators from multiple disciplines, and enabled the learning health system. Bibliometrics have been challenging due to investigator non-compliance, but one hub's requirement to link all study protocols with funding records enabled quantifying an EDW4R's multi-million dollar impact. Sustainability of EDW4R has relied on multiple funding sources with a general shift away from the CTSA grant toward institutional and industry support. To address EDW4R demand, institutions have expanded staff, used different governance approaches, and provided investigator self-service tools. EDW4R accessibility can benefit from improved tools incorporating user-centered design, increased data literacy among scientists, expansion of informaticians in the workforce, and growth of team science. DISCUSSION As investigator demand for EDW4R has increased, approaches to tracking impact, ensuring sustainability, and improving accessibility of EDW4R resources have varied. CONCLUSION This study adds to understanding of how informatics leaders seek to support investigators using EDW4R across the CTSA consortium and potentially elsewhere.
Collapse
Affiliation(s)
- Thomas R Campion
- Clinical & Translational Science Center, Weill Cornell Medicine, New York, NY 10022, United States
| | - Catherine K Craven
- Division of Clinical Research Informatics, Department of Population Health Sciences, The University of Texas Health San Antonio, San Antonio, TX 78229, United States
| | - David A Dorr
- Department of Medical Informatics and Clinical Epidemiology, Oregon Health & Science University, Portland, OR 97239, United States
- Department of Medicine, Oregon Health & Science University, Portland, OR 97239, United States
| | - Elmer V Bernstam
- D. Bradley McWilliams School of Biomedical Informatics, The University of Texas Health Science Center, Houston, TX 77030, United States
- Division of General Internal Medicine, McGovern Medical School and Center for Clinical and Translational Sciences, The University of Texas Health Science Center, Houston, TX 77030, United States
| | - Boyd M Knosp
- Roy J. and Lucille A. Carver College of Medicine and the Institute for Clinical & Translational Science, University of Iowa, Iowa City, IA 52242, United States
| |
Collapse
|
4
|
Socal MP, Odouard IC, Kharrazi H. Ownership and Interoperability Challenges of Alzheimer Monoclonal Antibody Registries. JAMA Neurol 2024; 81:109-110. [PMID: 38079170 DOI: 10.1001/jamaneurol.2023.4675] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/13/2024]
Abstract
This Viewpoint addresses the challenges that the Centers for Medicare and Medicaid Services faces to collect real-world data on the effectiveness and safety of lecanemab from external registries to achieve its coverage with evidence development objectives.
Collapse
Affiliation(s)
- Mariana P Socal
- Department of Health Policy and Management, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland
| | - Ilina C Odouard
- Department of Health Policy and Management, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland
| | - Hadi Kharrazi
- Department of Health Policy and Management, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland
- Division of General Internal Medicine, Johns Hopkins School of Medicine, Baltimore, Maryland
| |
Collapse
|
5
|
Tachinardi U, Grannis SJ, Michael SG, Misquitta L, Dahlin J, Sheikh U, Kho A, Phua J, Rogovin SS, Amor B, Choudhury M, Sparks P, Mannaa A, Ljazouli S, Saltz J, Prior F, Baghal A, Gersing K, Embi PJ. Privacy-preserving record linkage across disparate institutions and datasets to enable a learning health system: The national COVID cohort collaborative (N3C) experience. Learn Health Syst 2024; 8:e10404. [PMID: 38249841 PMCID: PMC10797567 DOI: 10.1002/lrh2.10404] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2023] [Revised: 12/06/2023] [Accepted: 12/06/2023] [Indexed: 01/23/2024] Open
Abstract
Introduction Research driven by real-world clinical data is increasingly vital to enabling learning health systems, but integrating such data from across disparate health systems is challenging. As part of the NCATS National COVID Cohort Collaborative (N3C), the N3C Data Enclave was established as a centralized repository of deidentified and harmonized COVID-19 patient data from institutions across the US. However, making this data most useful for research requires linking it with information such as mortality data, images, and viral variants. The objective of this project was to establish privacy-preserving record linkage (PPRL) methods to ensure that patient-level EHR data remains secure and private when governance-approved linkages with other datasets occur. Methods Separate agreements and approval processes govern N3C data contribution and data access. The Linkage Honest Broker (LHB), an independent neutral party (the Regenstrief Institute), ensures data linkages are robust and secure by adding an extra layer of separation between protected health information and clinical data. The LHB's PPRL methods (including algorithms, processes, and governance) match patient records using "deidentified tokens," which are hashed combinations of identifier fields that define a match across data repositories without using patients' clear-text identifiers. Results These methods enable three linkage functions: Deduplication, Linking Multiple Datasets, and Cohort Discovery. To date, two external repositories have been cross-linked. As of March 1, 2023, 43 sites have signed the LHB Agreement; 35 sites have sent tokens generated for 9 528 998 patients. In this initial cohort, the LHB identified 135 037 matches and 68 596 duplicates. Conclusion This large-scale linkage study using deidentified datasets of varying characteristics established secure methods for protecting the privacy of N3C patient data when linked for research purposes. This technology has potential for use with registries for other diseases and conditions.
Collapse
Affiliation(s)
- Umberto Tachinardi
- Department of Biomedical InformaticsUniversity of Cincinnati College of MedicineCincinnatiOhioUSA
| | - Shaun J. Grannis
- Center for Biomedical Informatics, Regenstrief InstituteDepartment of Family Medicine, IU School of MedicineRegenstrief Institute, Inc. and Indiana University School of MedicineIndianapolisIndianaUSA
| | - Sam G. Michael
- National Center for Advancing Translational ScienceNIHBethesdaMarylandUSA
| | - Leonie Misquitta
- National Center for Advancing Translational ScienceNIHBethesdaMarylandUSA
| | - Jayme Dahlin
- National Center for Advancing Translational ScienceNIHBethesdaMarylandUSA
| | - Usman Sheikh
- National Center for Advancing Translational ScienceNIHBethesdaMarylandUSA
| | - Abel Kho
- Department of MedicineNorthwestern University, Feinberg School of MedicineChicagoIllinoisUSA
- Public SectorDatavant, IncSan FranciscoCaliforniaUSA
| | - Jasmin Phua
- Public SectorDatavant, IncSan FranciscoCaliforniaUSA
| | | | - Benjamin Amor
- Federal HealthPalantir TechnologiesDenverColoradoUSA
| | | | - Philip Sparks
- Federal HealthPalantir TechnologiesDenverColoradoUSA
| | - Amin Mannaa
- Federal HealthPalantir TechnologiesDenverColoradoUSA
| | - Saad Ljazouli
- Federal HealthPalantir TechnologiesDenverColoradoUSA
| | - Joel Saltz
- School of MedicineStony Brook UniversityStony BrookNew YorkUSA
| | - Fred Prior
- COM Biomedical InformaticsUniversity of Arkansas for Medical SciencesLittle RockArkansasUSA
| | - Ahmen Baghal
- COM Biomedical InformaticsUniversity of Arkansas for Medical SciencesLittle RockArkansasUSA
| | - Kenneth Gersing
- National Center for Advancing Translational ScienceNIHBethesdaMarylandUSA
| | - Peter J. Embi
- Department of Biomedical InformaticsVanderbilt University Medical CenterNashvilleTennesseeUSA
| |
Collapse
|
6
|
Shah K, Patt D, Mullangi S. Use of Tokens to Unlock Greater Data Sharing in Health Care. JAMA 2023; 330:2333-2334. [PMID: 37983066 DOI: 10.1001/jama.2023.23720] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/21/2023]
Abstract
This Viewpoint discusses the use of privacy-preserving record linkage, a token-based record linkage system, as a promising avenue for building a data infrastructure system that bridges isolated data.
Collapse
Affiliation(s)
- Kanan Shah
- Department of Medicine, NYU Langone Medical Center, New York, New York
| | | | | |
Collapse
|