Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

Download

Total Articles

694
(from Reference Citation Analysis)

Article PDFs (35)

Cited by > 0 (185)

Searched Name

Medical Record Linkage/methods

Ranked By

Results Analysis

Year Published Analysis
Article Type Analysis
Publication Title Analysis
Category Analysis

Results Analysis

Indexed Articles

Year Published

Show more Refine

Article Type

Show more Refine

Article Statistics

Refine

MESH Headings

Show more Refine

First Author

Show more Refine

First Author Affiliations

Show more Refine

Authors

Show more Refine

Publication Titles

Show more Refine

Grant Agencies

Show more Refine

Countries/Regions

Show more Refine

Affiliations

Show more Refine

Corresponding Author Affiliations

Show more Refine

Category

Show more Refine

Number

Citation Analysis

Röchner P, Rothlauf F. Using machine learning to link electronic health records in cancer registries: On the tradeoff between linkage quality and manual effort. Int J Med Inform 2024;185:105387. [PMID: 38428200 DOI: 10.1016/j.ijmedinf.2024.105387] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2023] [Revised: 10/05/2023] [Accepted: 02/20/2024] [Indexed: 03/03/2024]

Abstract

BACKGROUND

Cancer registries link a large number of electronic health records reported by medical institutions to already registered records of the matching individual and tumor. Records are automatically linked using deterministic and probabilistic approaches; machine learning is rarely used. Records that cannot be matched automatically with sufficient accuracy are typically processed manually. For application, it is important to know how well record linkage approaches match real-world records and how much manual effort is required to achieve the desired linkage quality. We study the task of linking reported records to the matching registered tumor in cancer registries.

METHODS

We compare the tradeoff between linkage quality and manual effort of five machine learning methods (logistic regression, random forest, gradient boosting, neural network, and a stacked method) to a deterministic baseline. The record linkage methods are compared in a two-class setting (no-match/ match) and a three-class setting (no-match/ undecided/ match). A cancer registry collected and linked the dataset consisting of categorical variables matching 145,755 reported records with 33,289 registered tumors.

RESULTS

In the two-class setting, the gradient boosting, neural network, and stacked models have higher accuracy and F1 score (accuracy: 0.968-0.978, F1 score: 0.983-0.988) than the deterministic baseline (accuracy: 0.964, F1 score: 0.980) when the same records are manually processed (0.89% of all records). In the three-class setting, these three machine learning methods can automatically process all reported records and still have higher accuracy and F1 score than the deterministic baseline. The linkage quality of the machine learning methods studied, except for the neural network, increase as the number of manually processed records increases.

CONCLUSION

Machine learning methods can significantly improve linkage quality and reduce the manual effort required by medical coders to match tumor records in cancer registries compared to a deterministic baseline. Our results help cancer registries estimate how linkage quality increases as more records are manually processed.

Collapse

Kirilov N. Comparison of WebSocket and Hypertext Transfer Protocol for Transfer of Electronic Health Records. Stud Health Technol Inform 2024;313:124-128. [PMID: 38682516 DOI: 10.3233/shti240023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/01/2024]

Kim JW, Choi H, Lim HJ, Oh M, Ahn JJ. Evaluating Linkage Quality of Population-Based Administrative Data for Health Service Research. J Korean Med Sci 2024;39:e127. [PMID: 38622936 PMCID: PMC11018984 DOI: 10.3346/jkms.2024.39.e127] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/31/2023] [Accepted: 03/11/2024] [Indexed: 04/17/2024] Open

Abstract

BACKGROUND

To overcome the limitations of relying on data from a single institution, many researchers have studied data linkage methodologies. Data linkage includes errors owing to legal issues surrounding personal information and technical issues related to data processing. Linkage errors affect selection bias, and external and internal validity. Therefore, quality verification for each connection method with adherence to personal information protection is an important issue. This study evaluated the linkage quality of linked data and analyzed the potential bias resulting from linkage errors.

METHODS

This study analyzed claims data submitted to the Health Insurance Review and Assessment Service (HIRA DATA). The linkage errors of the two deterministic linkage methods were evaluated based on the use of the match key. The first deterministic linkage uses a unique identification number, and the second deterministic linkage uses the name, gender, and date of birth as a set of partial identifiers. The linkage error included in this deterministic linkage method was compared with the absolute standardized difference (ASD) of Cohen's according to the baseline characteristics, and the linkage quality was evaluated through the following indicators: linked rate, false match rate, missed match rate, positive predictive value, sensitivity, specificity, and F1-score.

RESULTS

For the deterministic linkage method that used the name, gender, and date of birth as a set of partial identifiers, the true match rate was 83.5 and the missed match rate was 16.5. Although there was bias in some characteristics of the data, most of the ASD values were less than 0.1, with no case greater than 0.5. Therefore, it is difficult to determine whether linked data constructed with deterministic linkages have substantial differences.

CONCLUSION

This study confirms the possibility of building health and medical data at the national level as the first data linkage quality verification study using big data from the HIRA. Analyzing the quality of linkages is crucial for comprehending linkage errors and generating reliable analytical outcomes. Linkers should increase the reliability of linked data by providing linkage error-related information to researchers. The results of this study will serve as reference data to increase the reliability of multicenter data linkage studies.

Collapse

Lloyd LK, Nicholson C, Strange G, Celermajer DS. The burdensome logistics of data linkage in Australia - the example of a national registry for congenital heart disease. AUST HEALTH REV 2024;48:8-15. [PMID: 38118279 DOI: 10.1071/ah23185] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2023] [Accepted: 11/21/2023] [Indexed: 12/22/2023]

Silverwood RJ, Rajah N, Calderwood L, De Stavola BL, Harron K, Ploubidis GB. Examining the quality and population representativeness of linked survey and administrative data: guidance and illustration using linked 1958 National Child Development Study and Hospital Episode Statistics data. Int J Popul Data Sci 2024;9:2137. [PMID: 38425790 PMCID: PMC10901060 DOI: 10.23889/ijpds.v9i1.2137] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/02/2024] Open

Abstract

Introduction

Recent years have seen an increase in linkages between survey and administrative data. It is important to evaluate the quality of such data linkages to discern the likely reliability of ensuing research. Evaluation of linkage quality and bias can be conducted using different approaches, but many of these are not possible when there is a separation of processes for linkage and analysis to help preserve privacy, as is typically the case in the UK (and elsewhere).

Objectives

We aimed to describe a suite of generalisable methods to evaluate linkage quality and population representativeness of linked survey and administrative data which remain tractable when users of the linked data are not party to the linkage process itself. We emphasise issues particular to longitudinal survey data throughout.

Methods

Our proposed approaches cover several areas: i) Linkage rates, ii) Selection into response, linkage consent and successful linkage, iii) Linkage quality, and iv) Linked data population representativeness. We illustrate these methods using a recent linkage between the 1958 National Child Development Study (NCDS; a cohort following an initial 17,415 people born in Great Britain in a single week of 1958) and Hospital Episode Statistics (HES) databases (containing important information regarding admissions, accident and emergency attendances and outpatient appointments at NHS hospitals in England).

Results

Our illustrative analyses suggest that the linkage quality of the NCDS-HES data is high and that the linked sample maintains an excellent level of population representativeness with respect to the single dimension we assessed.

Conclusions

Through this work we hope to encourage providers and users of linked data resources to undertake and publish thorough evaluations. We further hope that providing illustrative analyses using linked NCDS-HES data will improve the quality and transparency of research using this particular linked data resource.

Collapse

Kamat G, Shan M, Gutman R. Bayesian record linkage with variables in one file. Stat Med 2023;42:4931-4951. [PMID: 37652076 DOI: 10.1002/sim.9894] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2022] [Revised: 06/12/2023] [Accepted: 08/21/2023] [Indexed: 09/02/2023]

Prindle J, Suthar H, Putnam-Hornstein E. An open-source probabilistic record linkage process for records with family-level information: Simulation study and applied analysis. PLoS One 2023;18:e0291581. [PMID: 37862306 PMCID: PMC10588881 DOI: 10.1371/journal.pone.0291581] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2023] [Accepted: 08/31/2023] [Indexed: 10/22/2023] Open

Garcia KKS, de Miranda CB, de Sousa FNEF. Procedures for health data linkage: applications in health surveillance. Epidemiol Serv Saude 2022;31:e20211272. [PMID: 36287481 PMCID: PMC9887966 DOI: 10.1590/s2237-96222022000300004] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2022] [Accepted: 07/08/2022] [Indexed: 12/23/2022] Open

Heng Y, Armknecht F, Chen Y, Schnell R. On the effectiveness of graph matching attacks against privacy-preserving record linkage. PLoS One 2022;17:e0267893. [PMID: 36137086 PMCID: PMC9499274 DOI: 10.1371/journal.pone.0267893] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2021] [Accepted: 04/19/2022] [Indexed: 11/19/2022] Open

Libuy N, Harron K, Gilbert R, Caulton R, Cameron E, Blackburn R. Linking education and hospital data in England: linkage process and quality. Int J Popul Data Sci 2021;6:1671. [PMID: 34568585 PMCID: PMC8445153 DOI: 10.23889/ijpds.v6i1.1671] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022] Open

Abstract

INTRODUCTION

Linkage of administrative data for universal state education and National Health Service (NHS) hospital care would enable research into the inter-relationships between education and health for all children in England.

OBJECTIVES

We aim to describe the linkage process and evaluate the quality of linkage of four one-year birth cohorts within the National Pupil Database (NPD) and Hospital Episode Statistics (HES).

METHODS

We used multi-step deterministic linkage algorithms to link longitudinal records from state schools to the chronology of records in the NHS Personal Demographics Service (PDS; linkage stage 1), and HES (linkage stage 2). We calculated linkage rates and compared pupil characteristics in linked and unlinked samples for each stage of linkage and each cohort (1990/91, 1996/97, 1999/00, and 2004/05).

RESULTS

Of the 2,287,671 pupil records, 2,174,601 (95%) linked to HES. Linkage rates improved over time (92% in 1990/91 to 99% in 2004/05). Ethnic minority pupils and those living in more deprived areas were less likely to be matched to hospital records, but differences in pupil characteristics between linked and unlinked samples were moderate to small.

CONCLUSION

We linked nearly all pupils to at least one hospital record. The high coverage of the linkage represents a unique opportunity for wide-scale analyses across the domains of health and education. However, missed links disproportionately affected ethnic minorities or those living in the poorest neighbourhoods: selection bias could be mitigated by increasing the quality and completeness of identifiers recorded in administrative data or the application of statistical methods that account for missed links.

HIGHLIGHTS

Longitudinal administrative records for all children attending state school and acute hospital services in England have been used for research for more than two decades, but lack of a shared unique identifier has limited scope for linkage between these databases.We applied multi-step deterministic linkage algorithms to 4 one-year cohorts of children born 1 September-31 August in 1990/91, 1996/97, 1999/00 and 2004/05. In stage 1, full names, date of birth, and postcode histories from education data in the National Pupil Database were linked to the NHS Personal Demographic Service. In stage 2, NHS number, postcode, date of birth and sex were linked to hospital records in Hospital Episode Statistics.Between 92% and 99% of school pupils linked to at least one hospital record. Ethnic minority pupils and pupils who were living in the most deprived areas were least likely to link. Ethnic minority pupils were less likely than white children to link at the first step in both algorithms.Bias due to linkage errors could lead to an underestimate of the health needs in disadvantaged groups. Improved data quality, more sensitive linkage algorithms, and/or statistical methods that account for missed links in analyses, should be considered to reduce linkage bias.

Collapse

Aflaki K, Park AL, Nelson C, Luo W, Ray JG. Identifying maternal deaths with the use of hospital data versus death certificates: a retrospective population-based study. CMAJ Open 2021;9:E539-E547. [PMID: 34021011 PMCID: PMC8177910 DOI: 10.9778/cmajo.20200201] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open

Abstract

BACKGROUND

Accurate identification of maternal deaths is paramount for audit and policy purposes. Our aim was to determine the accuracy and completeness of data on maternal deaths in hospital and those recorded on a death certificate, and the level of agreement between the 2 data sources.

METHODS

We conducted a retrospective population-based study using data for Ontario, Canada, from Apr. 1, 2002, to Dec. 31, 2015. We used Canadian Institute for Health Information (CIHI) databases to identify deaths during inpatient, emergency department and same-day surgery encounters. We captured Vital Statistics deaths in the Office of the Registrar General, Deaths (ORGD) data set. Deaths were considered within 42 days and within 365 days after a pregnancy outcome (live birth, miscarriage, ectopic pregnancy or induced abortion) for all multiple and singleton pregnancies. We calculated agreement statistics and 95% confidence intervals (CIs).

RESULTS

Among 1 679 455 live births and stillbirths, 398 pregnancy-related deaths in the ORGD data set were mapped to a birth in CIHI databases, and 77 (16.2%) were not. Among 2 039 849 recognized pregnancies, 534 pregnancy-related deaths in the ORGD data set were linked to CIHI records, and 68 (11.3%) were not. Among live births and stillbirths, after pregnancy-related deaths in the ORGD data set not matched to a maternal death in the CIHI databases were removed, concordance measures between CIHI and ORGD records for maternal death within 42 days after delivery included a κ value of 0.87 (95% CI 0.82-0.91) and positive percent agreement of 0.88 (95% CI 0.83-0.94). The corresponding measures were similar for maternal death within 42 days after the end of a recognized pregnancy. When unlinked pregnancy-related deaths in the ORGD data set were retained, agreement measures declined for death within 42 days after a live birth or stillbirth (κ = 0.68, 95% CI 0.62-0.74). For maternal death within 365 days after a live birth or stillbirth, or after the end of a recognized pregnancy, the concordance statistics were generally favourable when unlinked pregnancy-related deaths in the ORGD data set were removed but were substantially declined when they were retained.

INTERPRETATION

Maternal mortality cannot be ascertained solely with the use of hospital data, including beyond 42 days after the end of pregnancy. To improve linkage, we propose including health insurance numbers on provincial and territorial medical death certificates.

Collapse

Chen Y, Wen H, Griffin R, Roach MJ, Kelly ML. Linking Individual Data From the Spinal Cord Injury Model Systems Center and Local Trauma Registry: Development and Validation of Probabilistic Matching Algorithm. Top Spinal Cord Inj Rehabil 2021;26:221-231. [PMID: 33536727 PMCID: PMC7831288 DOI: 10.46292/sci20-00015] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]

Jewell A, Broadbent M, Hayes RD, Gilbert R, Stewart R, Downs J. Impact of matching error on linked mortality outcome in a data linkage of secondary mental health data with Hospital Episode Statistics (HES) and mortality records in South East London: a cross-sectional study. BMJ Open 2020;10:e035884. [PMID: 32641360 PMCID: PMC7342822 DOI: 10.1136/bmjopen-2019-035884] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open

Abstract

OBJECTIVES

Linkage of electronic health records (EHRs) to Hospital Episode Statistics (HES)-Office for National Statistics (ONS) mortality data has provided compelling evidence for lower life expectancy in people with severe mental illness. However, linkage error may underestimate these estimates. Using a clinical sample (n=265 300) of individuals accessing mental health services, we examined potential biases introduced through missed matching and examined the impact on the association between clinical disorders and mortality.

SETTING

The South London and Maudsley NHS Foundation Trust (SLaM) is a secondary mental healthcare provider in London. A deidentified version of SLaM's EHR was available via the Clinical Record Interactive Search system linked to HES-ONS mortality records.

PARTICIPANTS

Records from SLaM for patients active between January 2006 and December 2016.

OUTCOME MEASURES

Two sources of death data were available for SLaM participants: accurate and contemporaneous date of death via local batch tracing (gold standard) and date of death via linked HES-ONS mortality data. The effect of linkage error on mortality estimates was evaluated by comparing sociodemographic and clinical risk factor analyses using gold standard death data against HES-ONS mortality records.

RESULTS

Of the total sample, 93.74% were successfully matched to HES-ONS records. We found a number of statistically significant administrative, sociodemographic and clinical differences between matched and unmatched records. Of note, schizophrenia diagnosis showed a significant association with higher mortality using gold standard data (OR 1.08; 95% CI 1.01 to 1.15; p=0.02) but not in HES-ONS data (OR 1.05; 95% CI 0.98 to 1.13; p=0.16). Otherwise, little change was found in the strength of associated risk factors and mortality after accounting for missed matching bias.

CONCLUSIONS

Despite significant clinical and sociodemographic differences between matched and unmatched records, changes in mortality estimates were minimal. However, researchers and policy analysts using HES-ONS linked resources should be aware that administrative linkage processes can introduce error.

Collapse

Tapuria A, Kalra D, Curcin V. Feasibility of Using EN 13606 Clinical Archetypes for Defining Computable Phenotypes. Stud Health Technol Inform 2020;270:228-232. [PMID: 32570380 DOI: 10.3233/shti200156] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]

Abstract

INTRODUCTION

Computable phenotypes are gaining importance as structured and reproducible method of using electronic health data to identify people with certain clinical conditions. A formal standard is not available for defining and formally representing phenotyping algorithms. In this paper, we have tried to build a formal representation of such phenotyping algorithm.

METHODS

We built EN 13606 EHR standard for building clinical archetypes to represent the computable phenotyping algorithm for 'diagnosis of cardiac failure'. As part of this work, we created a set of new clinical archetypes for defining 'cardiac failure diagnosis'. The EN13606 editor called Object Dictionary Client was used which was in-house developed by University College London. We evaluated the ability of EN 13606 to provide clinical archetypes to define EHR phenotyping algorithms using the predefined desiderata for the purpose [Mo et al].

RESULTS

EN 13606 archetypes could represent phenotype components grouped and nested based on their logical meaning. It was possible to build the EHR phenotyping algorithm with the clinical elements and their interrelationships along with hierarchical structure and temporal criteria. But the specific mathematical calculation and temporal relations involved in the algorithm was difficult to incorporate. These will need to be coded and integrated within the clinical information system. These archetypes can be mapped for comparison with the openEHR models. Binding to external clinical terminology is fully supported. However, it does not satisfy all the desiderata defined by Mo et al. A possible way could be an approach using phenotype ontologies and its architectural representation integrated with ISO interoperability.

CONCLUSION

The EN13606 archetypes can be used to define the phenotype algorithm that basically identifies patients by a set of clinical characteristics in their records. Phenotype representations defined in EN 13606 do not satisfy all the desiderata proposed by Mo et al. and thus currently has a limited ability to define the computable phenotyping algorithms. Further work is required to make the EN13606 standard to fully support the objective.

Collapse

Lindoerfer D, Mansmann U, Reinhardt I. Incorporation of Multiple Sources into IT - and Data Protection Concepts: Lessons Learned from the FARKOR Project. Stud Health Technol Inform 2020;270:262-266. [PMID: 32570387 DOI: 10.3233/shti200163] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]

Nechuta S, Mukhopadhyay S, Krishnaswami S, Golladay M, McPheeters M. Record Linkage Approaches Using Prescription Drug Monitoring Program and Mortality Data for Public Health Analyses and Epidemiologic Studies. Epidemiology 2020;31:22-31. [PMID: 31592867 PMCID: PMC6889900 DOI: 10.1097/ede.0000000000001110] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2019] [Accepted: 09/25/2019] [Indexed: 11/25/2022]

Fraser C, Muller-Pebody B, Blackburn R, Gray J, Oddie SJ, Gilbert RE, Harron K. Linking surveillance and clinical data for evaluating trends in bloodstream infection rates in neonatal units in England. PLoS One 2019;14:e0226040. [PMID: 31830076 PMCID: PMC6907823 DOI: 10.1371/journal.pone.0226040] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2019] [Accepted: 11/19/2019] [Indexed: 11/19/2022] Open

Abstract

OBJECTIVE

To evaluate variation in trends in bloodstream infection (BSI) rates in neonatal units (NNUs) in England according to the data sources and linkage methods used.

METHODS

We used deterministic and probabilistic methods to link clinical records from 112 NNUs in the National Neonatal Research Database (NNRD) to national laboratory infection surveillance data from Public Health England. We calculated the proportion of babies in NNRD (aged <1 year and admitted between 2010-2017) with a BSI caused by clearly pathogenic organisms between two days after admission and two days after discharge. We used Poisson regression to determine trends in the proportion of babies with BSI based on i) deterministic and probabilistic linkage of NNRD and surveillance data (primary measure), ii) deterministic linkage of NNRD-surveillance data, iii) NNRD records alone, and iv) linked NNRD-surveillance data augmented with clinical records of laboratory-confirmed BSI in NNRD.

RESULTS

Using deterministic and probabilistic linkage, 5,629 of 349,740 babies admitted to a NNU in NNRD linked with 6,660 BSI episodes accounting for 38% of 17,388 BSI records aged <1 year in surveillance data. The proportion of babies with BSI due to clearly pathogenic organisms during their NNU admission was 1.0% using deterministic plus probabilistic linkage (primary measure), compared to 1.0% using deterministic linkage alone, 0.6% using NNRD records alone, and 1.2% using linkage augmented with clinical records of BSI in NNRD. Equivalent proportions for babies born before 32 weeks of gestation were 5.0%, 4.8%, 2.9% and 5.9%. The proportion of babies who linked to a BSI decreased by 7.5% each year (95% confidence interval [CI]: -14.3%, -0.1%) using deterministic and probabilistic linkage but was stable using clinical records of BSI or deterministic linkage alone.

CONCLUSION

Linkage that combines BSI records from national laboratory surveillance and clinical NNU data sources, and use of probabilistic methods, substantially improved ascertainment of BSI and estimates of BSI trends over time, compared with single data sources.

Collapse

Brennan JM, Wruck L, Pencina MJ, Clare RM, Lopes RD, Alexander JH, O'Brien S, Krucoff M, Rao SV, Wang TY, Curtis LH, Newby LK, Granger CB, Patel M, Mahaffey K, Ross JS, Normand SL, Eloff BC, Caños DA, Lokhnygina YV, Roe MT, Califf RM, Marinac-Dabic D, Peterson ED. Claims-based cardiovascular outcome identification for clinical research: Results from 7 large randomized cardiovascular clinical trials. Am Heart J 2019;218:110-122. [PMID: 31726314 DOI: 10.1016/j.ahj.2019.09.002] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/05/2019] [Accepted: 09/05/2019] [Indexed: 12/14/2022]

Abstract

BACKGROUND

Medicare insurance claims may provide an efficient means to ascertain follow-up of older participants in clinical research. We sought to determine the accuracy and completeness of claims- versus site-based follow-up with clinical event committee (+CEC) adjudication of cardiovascular outcomes.

METHODS

We performed a retrospective study using linked Medicare and Duke Database of Clinical Trials data. Medicare claims were linked to clinical data from 7 randomized cardiovascular clinical trials. Of 52,476 trial participants, linking resulted in 5,839 (of 10,497 linkage-eligible) Medicare-linked trial participants with fee-for-service A and B coverage. Death, myocardial infarction (MI), stroke, and revascularization incidences were compared using Medicare inpatient claims only, site-reported events (+CEC) only, or a combination of the 2. Randomized treatment effects were compared as a function of whether claims-based, site-based (+CEC), or a combined system was used for event detection.

RESULTS

Among the 5,839 study participants, the annual event rates were similar between claims- and site-based (+CEC) follow-up: death (overall rate 5.2% vs 5.2%; adjusted κ 0.99), MI (2.2% vs 2.3%; adjusted κ 0.96), stroke (0.7% vs 0.7%; adjusted κ 0.99), and any revascularization (7.4% vs 7.9%; adjusted κ 0.95). Of events detected by claims yet not reported by CEC, a minority were reported by sites but negatively adjudicated by CEC (39% of MIs and 18% of strokes). Differences in individual case concordance led to higher event rates when claims- and site-based (+CEC) systems were combined. Randomized treatment effects were similar among the 3 approaches for each outcome of interest.

CONCLUSIONS

Claims- versus site-based (+CEC) follow-up identified similar overall cardiovascular event rates despite meaningful differences in the events detected. Randomized treatment effects were similar using the 2 methods, suggesting claims data could be used to support clinical research leveraging routinely collected data. This approach may lead to more effective evidence generation, synthesis, and appraisal of medical products and inform the strategic approaches toward the National Evaluation System for Health Technology.

Collapse

Doidge JC, Harron KL. Reflections on modern methods: linkage error bias. Int J Epidemiol 2019;48:2050-2060. [PMID: 31633184 PMCID: PMC7020770 DOI: 10.1093/ije/dyz203] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/13/2019] [Indexed: 11/29/2022] Open

Delmestri A, Prieto-Alhambra D. CPRD GOLD and linked ONS mortality records: Reconciling guidelines. Int J Med Inform 2019;136:104038. [PMID: 32078979 DOI: 10.1016/j.ijmedinf.2019.104038] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2019] [Revised: 10/19/2019] [Accepted: 11/26/2019] [Indexed: 12/17/2022]

Abstract

BACKGROUND

The Clinical Practice Research Datalink (CPRD) GOLD is an extremely influential U.K. primary care dataset for epidemiological research having a number of published papers based on its data much bigger than any other U.K. primary care dataset. The Office for National Statistics (ONS) death data for England can be linked to GOLD at the patient level and are considered the gold standard on mortality. GOLD, which also holds death data, has been recently assessed against ONS linked dataset and the accuracy of its dates of death has been deemed sufficient for the majority of observational studies. However, there is a lack of guidance on how to manage the challenges existing when ONS mortality and GOLD datasets are linked, including linkage coverage period, linkage correctness likelihood, linkage regional limitations and data discrepancy.

OBJECTIVES

Provide reconciling guidelines on how to make maximum and at the same time trustworthy use of mortality information coming from both GOLD and ONS linked datasets with the aim of improving the quality, reproducibility, transparency and comparison of clinical research.

METHOD AND RESULTS

We have developed recommendations on how to manage mortality data coming from both GOLD and linked ONS, taking into account linkage coverage period, linkage correctness likelihood, linkage regional limitations and data discrepancies between these two datasets. We have also implemented these guidelines in an SQL algorithm for researchers to use.

CONCLUSION

We have provided detailed guidelines on the reconciliation of mortality data between GOLD and ONS linked death datasets, taking into account both their strengths and limitations. The consistent application of these guidelines made practical by an SQL algorithm, has the potential to improve clinical research quality, reproducibility, transparency and comparison.

Collapse

Norris KC, Duru OK, Alicic RZ, Daratha KB, Nicholas SB, McPherson SM, Bell DS, Shen JI, Jones CR, Moin T, Waterman AD, Neumiller JJ, Vargas RB, Bui AAT, Mangione CM, Tuttle KR. Rationale and design of a multicenter Chronic Kidney Disease (CKD) and at-risk for CKD electronic health records-based registry: CURE-CKD. BMC Nephrol 2019;20:416. [PMID: 31747918 PMCID: PMC6868861 DOI: 10.1186/s12882-019-1558-9] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2019] [Accepted: 09/12/2019] [Indexed: 12/20/2022] Open

Abstract

BACKGROUND

Chronic kidney disease (CKD) is a global public health problem, exhibiting sharp increases in incidence, prevalence, and attributable morbidity and mortality. There is a critical need to better understand the demographics, clinical characteristics, and key risk factors for CKD; and to develop platforms for testing novel interventions to improve modifiable risk factors, particularly for the CKD patients with a rapid decline in kidney function.

METHODS

We describe a novel collaboration between two large healthcare systems (Providence St. Joseph Health and University of California, Los Angeles Health) supported by leadership from both institutions, which was created to develop harmonized cohorts of patients with CKD or those at increased risk for CKD (hypertension/HTN, diabetes/DM, pre-diabetes) from electronic health record data.

RESULTS

The combined repository of candidate records included more than 3.3 million patients with at least a single qualifying measure for CKD and/or at-risk for CKD. The CURE-CKD registry includes over 2.6 million patients with and/or at-risk for CKD identified by stricter guide-line based criteria using a combination of administrative encounter codes, physical examinations, laboratory values and medication use. Notably, data based on race/ethnicity and geography in part, will enable robust analyses to study traditionally disadvantaged or marginalized patients not typically included in clinical trials.

DISCUSSION

CURE-CKD project is a unique multidisciplinary collaboration between nephrologists, endocrinologists, primary care physicians with health services research skills, health economists, and those with expertise in statistics, bio-informatics and machine learning. The CURE-CKD registry uses curated observations from real-world settings across two large healthcare systems and has great potential to provide important contributions for healthcare and for improving clinical outcomes in patients with and at-risk for CKD.

Collapse

Affiliation(s)

Keith C Norris David Geffen School of Medicine at University of California, Los Angeles, CA, 90095, USA. UCLA Department of Medicine, Division of General Internal Medicine, 1100 Glendon Ave. Suite 900, Los Angeles, CA, 90024, USA.
O Kenrik Duru David Geffen School of Medicine at University of California, Los Angeles, CA, 90095, USA
Radica Z Alicic Providence St. Joseph Health, Providence Medical Research Center, Spokane, Washington, USA University of Washington School of Medicine, Seattle, Washington, USA
Kenn B Daratha Providence St. Joseph Health, Providence Medical Research Center, Spokane, Washington, USA
Susanne B Nicholas David Geffen School of Medicine at University of California, Los Angeles, CA, 90095, USA
Sterling M McPherson Providence St. Joseph Health, Providence Medical Research Center, Spokane, Washington, USA Washington State University Elson S. Floyd College of Medicine, Spokane, Washington, USA
Douglas S Bell David Geffen School of Medicine at University of California, Los Angeles, CA, 90095, USA
Jenny I Shen David Geffen School of Medicine at University of California, Los Angeles, CA, 90095, USA Los Angeles Biomedical Research Institute at Harbor-UCLA Medical Center, Torrance, CA, USA
Cami R Jones Providence St. Joseph Health, Providence Medical Research Center, Spokane, Washington, USA
Tannaz Moin David Geffen School of Medicine at University of California, Los Angeles, CA, 90095, USA VA Greater Los Angeles, Los Angeles, USA
Amy D Waterman David Geffen School of Medicine at University of California, Los Angeles, CA, 90095, USA
Joshua J Neumiller Washington State University College of Pharmacy and Pharmaceutical Sciences, Spokane, USA
Roberto B Vargas Charles R. Drew University of Medicine and Science, Los Angeles, USA RAND Corporation, Santa Monica, CA, USA
Alex A T Bui David Geffen School of Medicine at University of California, Los Angeles, CA, 90095, USA
Carol M Mangione David Geffen School of Medicine at University of California, Los Angeles, CA, 90095, USA
Katherine R Tuttle Providence St. Joseph Health, Providence Medical Research Center, Spokane, Washington, USA University of Washington School of Medicine, Seattle, Washington, USA

Collapse

Choudhary P, de Portu S, Arrieta A, Castañeda J, Campbell FM. Use of sensor-integrated pump therapy to reduce hypoglycaemia in people with Type 1 diabetes: a real-world study in the UK. Diabet Med 2019;36:1100-1108. [PMID: 31134668 DOI: 10.1111/dme.14043] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 05/25/2019] [Indexed: 01/04/2023]

Abstract

AIMS

To assess the efficacy of insulin pumps with automated insulin suspension systems in a real-world setting.

METHODS

We analysed anonymized data uploaded to CareLink^™ by people (n=920) with Type 1 diabetes using the MiniMed Paradigm Veo system and the MiniMed 640G system (Medtronic International Trading Sàrl, Tolochanez, Switzerland) with SmartGuard technology, with or without automated insulin suspension enabled, between February 2016 and June 2018. Users with ≥15 days of sensor data and ≥70% sensor-wear time were classified as sensor-augmented pump alone, sensor-integrated pump with low glucose suspend enabled or sensor-integrated pump with predictive low glucose management enabled.

RESULTS

The median (25^th -75^th percentile) system use was 161 (58-348) days. The median time spent with sensor glucose values ≤3 mmol/l was 0.8 (0.3-1.7)% in the sensor-augmented pump group, 0.3 (0.1-0.7)% in the sensor-integrated pump with low glucose suspend group, and 0.3 (0.1-0.5)% in the sensor-integrated pump with predictive low glucose management group. In individuals switching from sensor-augmented pump to sensor-integrated pump with low glucose suspend (n=31), there were significant reductions in the monthly rate of hypoglycaemic events <3 mmol/l (rate ratio 0.63, 95% CI 0.45-0.89; P=0.009) and in the percentage of time with glucose values ≤3 mmol/l [sensor-augmented pump: 0.63% (95% CI 0.34-1.29), sensor-integrated pump with low glucose suspend: 0.33% (95% CI 0.16-0.64); P=0.001]. The monthly rate of hypoglycaemic events decreased further in individuals (n=139) switching from sensor-integrated pump with low glucose suspend to sensor-integrated pump with predictive low glucose management [rate ratio 0.82 (95% CI 0.69-0.98); P<0.0274]. Similar results were seen for events <3.9 mmol/l. There was no difference in median time spent in target glucose range.

CONCLUSION

Real-world UK data show that increasing automation of insulin suspension reduces hypoglycaemia exposure in people with Type 1 diabetes.

Collapse

Langner I, Ohlmeier C, Zeeb H, Haug U, Riedel O. Individual mortality information in the German Pharmacoepidemiological Research Database (GePaRD): a validation study using a record linkage with a large cancer registry. BMJ Open 2019;9:e028223. [PMID: 31270118 PMCID: PMC6609119 DOI: 10.1136/bmjopen-2018-028223] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] Open

Slotwiner DJ, Tarakji KG, Al-Khatib SM, Passman RS, Saxon LA, Peters NS, McCall D, Turakhia MP, Schaeffer J, Mendenhall GS, Hindricks G, Narayan SM, Davenport EE, Marrouche NF. Transparent sharing of digital health data: A call to action. Heart Rhythm 2019;16:e95-e106. [PMID: 31077802 DOI: 10.1016/j.hrthm.2019.04.042] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/22/2019] [Indexed: 11/18/2022]

Lavery JA, Lipitz-Snyderman A, Li DG, Bach PB, Panageas KS. Identifying Cancer-Directed Surgeries in Medicare Claims: A Validation Study Using SEER-Medicare Data. JCO Clin Cancer Inform 2019;3:1-24. [PMID: 30715928 PMCID: PMC6648680 DOI: 10.1200/cci.18.00093] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/19/2018] [Indexed: 02/06/2023] Open

Rentsch CT, Harron K, Urassa M, Todd J, Reniers G, Zaba B. Impact of linkage quality on inferences drawn from analyses using data with high rates of linkage errors in rural Tanzania. BMC Med Res Methodol 2018;18:165. [PMID: 30526518 PMCID: PMC6288858 DOI: 10.1186/s12874-018-0632-5] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2018] [Accepted: 11/30/2018] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

Studies based on high-quality linked data in developed countries show that even minor linkage errors, which occur when records of two different individuals are erroneously linked or when records belonging to the same individual are not linked, can impact bias and precision of subsequent analyses. We evaluated the impact of linkage quality on inferences drawn from analyses using data with substantial linkage errors in rural Tanzania.

METHODS

Semi-automatic point-of-contact interactive record linkage was used to establish gold standard links between community-based HIV surveillance data and medical records at clinics serving the surveillance population. Automated probabilistic record linkage was used to create analytic datasets at minimum, low, medium, and high match score thresholds. Cox proportional hazards regression models were used to compare HIV care registration rates by testing modality (sero-survey vs. clinic) in each analytic dataset. We assessed linkage quality using three approaches: quantifying linkage errors, comparing characteristics between linked and unlinked data, and evaluating bias and precision of regression estimates.

RESULTS

Between 2014 and 2017, 405 individuals with gold standard links were newly diagnosed with HIV in sero-surveys (n = 263) and clinics (n = 142). Automated probabilistic linkage correctly identified 233 individuals (positive predictive value [PPV] = 65%) at the low threshold and 95 individuals (PPV = 90%) at the high threshold. Significant differences were found between linked and unlinked records in primary exposure and outcome variables and for adjusting covariates at every threshold. As expected, differences attenuated with increasing threshold. Testing modality was significantly associated with time to registration in the gold standard data (adjusted hazard ratio [HR] 4.98 for clinic-based testing, 95% confidence interval [CI] 3.34, 7.42). Increasing false matches weakened the association (HR 2.76 at minimum match score threshold, 95% CI 1.73, 4.41). Increasing missed matches (i.e., increasing match score threshold and positive predictive value of the linkage algorithm) was strongly correlated with a reduction in the precision of coefficient estimate (R2 = 0.97; p = 0.03).

CONCLUSIONS

Similar to studies with more negligible levels of linkage errors, false matches in this setting reduced the magnitude of the association; missed matches reduced precision. Adjusting for these biases could provide more robust results using data with considerable linkage errors.

Collapse

Yan S, Kwan YH, Tan CS, Thumboo J, Low LL. A systematic review of the clinical application of data-driven population segmentation analysis. BMC Med Res Methodol 2018;18:121. [PMID: 30390641 PMCID: PMC6215625 DOI: 10.1186/s12874-018-0584-9] [Citation(s) in RCA: 31] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2018] [Accepted: 10/19/2018] [Indexed: 12/14/2022] Open

Abstract

BACKGROUND

Data-driven population segmentation analysis utilizes data analytics to divide a heterogeneous population into parsimonious and relatively homogenous groups with similar healthcare characteristics. It is a promising patient-centric analysis that enables effective integrated healthcare interventions specific for each segment. Although widely applied, there is no systematic review on the clinical application of data-driven population segmentation analysis.

METHODS

We carried out a systematic literature search using PubMed, Embase and Web of Science following PRISMA criteria. We included English peer-reviewed articles that applied data-driven population segmentation analysis on empirical health data. We summarized the clinical settings in which segmentation analysis was applied, compared and contrasted strengths, limitations, and practical considerations of different segmentation methods, and assessed the segmentation outcome of all included studies. The studies were assessed by two independent reviewers.

RESULTS

We retrieved 14,514 articles and included 216 articles. Data-driven population segmentation analysis was widely used in different clinical contexts. 163 studies examined the general population while 53 focused on specific population with certain diseases or conditions, including psychological, oncological, respiratory, cardiovascular, and gastrointestinal conditions. Variables used for segmentation in the studies are heterogeneous. Most studies (n = 170) utilized secondary data in community settings (n = 185). The most common segmentation method was latent class/profile/transition/growth analysis (n = 96) followed by K-means cluster analysis (n = 60) and hierarchical analysis (n = 50), each having its advantages, disadvantages, and practical considerations. We also identified key criteria to evaluate a segmentation framework: internal validity, external validity, identifiability/interpretability, substantiality, stability, actionability/accessibility, and parsimony.

CONCLUSIONS

Data-driven population segmentation has been widely applied and holds great potential in managing population health. The evaluations of segmentation outcome require the interplay of data analytics and subject matter expertise. The optimal framework for segmentation requires further research.

Collapse

Gilsenan A, Harding A, Kellier-Steele N, Harris D, Midkiff K, Andrews E. The Forteo Patient Registry linkage to multiple state cancer registries: study design and results from the first 8 years. Osteoporos Int 2018;29:2335-2343. [PMID: 29978254 PMCID: PMC6154045 DOI: 10.1007/s00198-018-4604-8] [Citation(s) in RCA: 31] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/02/2018] [Accepted: 06/13/2018] [Indexed: 11/05/2022]

Abstract

UNLABELLED

The Forteo Patient Registry (FPR) aims to estimate the incidence of osteosarcoma in US patients treated with teriparatide. Enrollment began in 2009 and will continue through 2019, with linkage planned through 2024. To date, no incident cases of osteosarcoma have been identified among patients registered in the FPR.

INTRODUCTION

The Forteo Patient Registry (FPR) was established in 2009 to estimate the incidence of osteosarcoma in US patients treated with teriparatide. The objective of this paper is to describe study methods, challenges encountered, and progress to date.

METHODS

The FPR is a prospective US registry designed to link data from participants annually with state cancer registries. Patient enrollment is planned for 10 years (2009-2019) and annual linkage with US state cancer registries for 15 years (2010-2024). All US state cancer registries and DC were invited to participate. Patients are recruited using pre-enrollment materials included in teriparatide device packaging, kits, and brochures distributed by health-care providers; a toll-free number; and a study website. A linkage algorithm is used to match data from enrolled participants with cancer registry data.

RESULTS

For the eighth annual linkage in 2017, information necessary for linkage with 63,270 patients in the FPR was submitted to each of the 42 participating registries. These patients contributed approximately 242,782 person-years of follow-up. A total of 5268 adult osteosarcoma cases diagnosed since January 1, 2009, were available for linkage from participating state cancer registries. To date, no incident cases of osteosarcoma have been identified among patients registered in the FPR.

CONCLUSIONS

Based on the estimated 242,782 person-years of observation as of the eighth annual linkage and projecting current enrollment rate to study end in 2024, it is anticipated that the completed study will be able to detect a fourfold increase in the risk of osteosarcoma if one exists.

Collapse

Martin P, Cortina-Borja M, Newburn M, Harper G, Gibson R, Dodwell M, Dattani N, Macfarlane A. Timing of singleton births by onset of labour and mode of birth in NHS maternity units in England, 2005-2014: A study of linked birth registration, birth notification, and hospital episode data. PLoS One 2018;13:e0198183. [PMID: 29902220 PMCID: PMC6002087 DOI: 10.1371/journal.pone.0198183] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2017] [Accepted: 05/11/2018] [Indexed: 11/18/2022] Open

de Paula AA, Pires DF, Filho PA, de Lemos KRV, Barçante E, Pacheco AG. A comparison of accuracy and computational feasibility of two record linkage algorithms in retrieving vital status information from HIV/AIDS patients registered in Brazilian public databases. Int J Med Inform 2018;114:45-51. [PMID: 29673602 DOI: 10.1016/j.ijmedinf.2018.03.005] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2017] [Revised: 03/19/2018] [Accepted: 03/19/2018] [Indexed: 11/19/2022]

Abstract

BACKGROUND AND OBJECTIVE

While cross-referencing information from people living with HIV/AIDS (PLWHA) to the official mortality database is a critical step in monitoring the HIV/AIDS epidemic in Brazil, the accuracy of the linkage routine may compromise the validity of the final database, yielding to biased epidemiological estimates. We compared the accuracy and the total runtime of two linkage algorithms applied to retrieve vital status information from PLWHA in Brazilian public databases.

METHODS

Nominally identified records from PLWHA were obtained from three distinct government databases. Linkage routines included an algorithm in Python language (PLA) and Reclink software (RlS), a probabilistic software largely utilized in Brazil. Records from PLWHA¹ known to be alive were added to those from patients reported as deceased. Data were then searched into the mortality system. Scenarios where 5% and 50% of patients actually dead were simulated, considering both complete cases and 20% missing maternal names.

RESULTS

When complete information was available both algorithms had comparable accuracies. In the scenario of 20% missing maternal names, PLA² and RlS³ had sensitivities of 94.5% and 94.6% (p > 0.5), respectively; after manual reviewing, PLA sensitivity increased to 98.4% (96.6-100.0) exceeding that for RlS (p < 0.01). PLA had higher positive predictive value in 5% death proportion. Manual reviewing was intrinsically required by RlS in up to 14% register for people actually dead, whereas the corresponding proportion ranged from 1.5% to 2% for PLA. The lack of manual inspection did not alter PLA sensitivity when complete information was available. When incomplete data was available PLA sensitivity increased from 94.5% to 98.4%, thus exceeding that presented by RlS (94.6%, p < 0.05). RlS spanned considerably less processing time compared to PLA.

CONCLUSION

Both linkage algorithms presented interchangeable accuracies in retrieving vital status data from PLWHA. RlS had a considerably lesser runtime but intrinsically required manually reviewing a fastidious proportion of the matched registries. On the other hand, PLA spent quite more runtime but spared manual reviewing at no expense of accuracy.

Collapse

Wirth A. Thing Two and Thing One. Biomed Instrum Technol 2018;52:67-69. [PMID: 29350988 DOI: 10.2345/0899-8205-52.1.67] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [What about the content of this article? (0)] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]

Saranummi N, Ensio A, Laine M, Nykänen P, Itkonen P. National Health IT Services in Finland. Methods Inf Med 2018;46:463-9. [PMID: 17694242 DOI: 10.1160/me9054] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]

Oberaigner W. Errors in Survival Rates Caused by Routinely Used Deterministic Record Linkage Methods. Methods Inf Med 2018;46:420-4. [PMID: 17694235 DOI: 10.1160/me0299] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]

Maojo V, Crespo J, de la Calle G, Barreiro J, Garcia-Remesal M. Using Web Services for Linking Genomic Data to Medical Information Systems. Methods Inf Med 2018;46:484-92. [PMID: 17694245 DOI: 10.1160/me9056] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]

Kimura M, Nakayasu K, Ohshima Y, Fujita N, Nakashima N, Jozaki H, Numano T, Shimizu T, Shimomura M, Sasaki F, Fujiki T, Nakashima T, Toyoda K, Hoshi H, Sakusabe T, Naito Y, Kawaguchi K, Watanabe H, Tani S. SS-MIX: A Ministry Project to Promote Standardized Healthcare Information Exchange. Methods Inf Med 2018;50:131-9. [PMID: 21206962 DOI: 10.3414/me10-01-0015] [Citation(s) in RCA: 46] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2010] [Accepted: 08/29/2010] [Indexed: 11/09/2022]

Abstract Summary Objectives: To promote healthcare information exchange between providers and to allow hospital information systems (HIS) export information in standardized format (HL7 and DICOM) in an environment of widespread legacy systems, which only can export data in proprietary format. Methods: Through the Shizuoka prefecture EMR project in 2004–2005, followed by the ministry’s SS-MIX project, many software products have been provided, which consist of 1) a standardized storage to receive HL7 v2.5 mes sages of patient demographics, prescription orders, laboratory results, and diagnostic disease in ICD-10, 2) a referral letter creation system, 3) a formatted document creation system, 4) a progress note/nursing record system, and 5) an archive/viewer to incorporate incoming healthcare data CD and allow users to view on HIS terminal. Meanwhile, other useful applications have been produced, such as adverse event reporting and clinical information retrieval. To achieve the above-mentioned objectives, these software products were created and propagated, because users can use these software products, provided that their HIS can export the above information to the standardized storage in HL7 v2.5 format. Results: In 20 hospitals of Japan, the standardized storage has been installed and some applications have been used. As major HIS vendors are shipping HIS with HL7 export function since 2007, HIS of 594 hospitals in Japan became capable of exporting data in HL7 v2.5 format (as of March 2010). Conclusions: In high CPOE installation rate (85% in 400+ bed hospitals), though most of them only capable of exporting data in proprietary format, prefecture and ministry projects were effective to promote healthcare information exchange between providers. The standardized storage became an infrastructure for many useful applications, and many hospitals started using them. Ministry designation of proposed healthcare standards was effective so as to allow vendors to conform their products, and users to install them. Collapse

Kilburn LS, Aresu M, Banerji J, Barrett-Lee P, Ellis P, Bliss JM. Can routine data be used to support cancer clinical trials? A historical baseline on which to build: retrospective linkage of data from the TACT (CRUK 01/001) breast cancer trial and the National Cancer Data Repository. Trials 2017;18:561. [PMID: 29179731 PMCID: PMC5702960 DOI: 10.1186/s13063-017-2308-6] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2016] [Accepted: 11/27/2016] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

Randomised clinical trials (RCTs) are the gold standard for evaluating new cancer treatments. They are, however, expensive to conduct, particularly where long-term follow-up of participants is required. Tracking participants via routine datasets could provide a cost-effective alternative for ascertaining follow-up information required to evaluate disease outcomes. This project explores the potential for routine data to inform cancer trials, using, the historical National Cancer Data Repository (NCDR) for English NHS sites and, for validation, mature data available from the TACT trial.

METHODS

Datasets were matched using patients' NHS number, date of birth (dob) and name/initials. Demographics, clinical characteristics and outcomes were assessed for agreement and completeness. Overall survival was compared between NCDR and TACT.

RESULTS

A total of 3151 patients underwent linkage; 3047 (96.7%) of which had matched records. Extensive cleaning was required for some registry data fields, e.g. cause of death, whilst others had large amounts of missing data, e.g. tumour size (22.1%). Other data had high levels of matching such as dob (99.6%) and date of death (89.6%). There was no evidence of differential survival rates (8-year survival: TACT = 75% (95% CI 73, 76); NCDR = 76% (95% CI 74, 77)).

CONCLUSIONS

Data quality and completeness requires improvement before routine data could be used for RCTs. Introduction of new routine datasets, including COSD, is welcomed although reporting of disease-recurrence events remains a concern. Prospective validation of such datasets is required before RCTs can confidently switch patient follow-up to utilise routinely collected NHS-based data.

TACT TRIAL REGISTRATION

Clinicaltrials.gov NCT00033683 , registered on 9 April 2002; ISRCTN79718493 , registered on 1 July 2001.

Collapse

Elysee G, Herrin J, Horwitz LI. An observational study of the relationship between meaningful use-based electronic health information exchange, interoperability, and medication reconciliation capabilities. Medicine (Baltimore) 2017;96:e8274. [PMID: 29019898 PMCID: PMC5662321 DOI: 10.1097/md.0000000000008274] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/25/2022] Open

Goldstein H, Harron K, Cortina-Borja M. A scaling approach to record linkage. Stat Med 2017;36:2514-2521. [PMID: 28303597 PMCID: PMC6205620 DOI: 10.1002/sim.7287] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2016] [Accepted: 03/02/2017] [Indexed: 11/10/2022]

Saugo M, Mastrangelo G, Blengio G, Righetto G. [Extending traceability of malignant testicular tumours using hospital discharge records: an experience in Veneto Region (Northern Italy)]. Epidemiol Prev 2017;41:184-186. [PMID: 28929714 DOI: 10.19191/ep17.3-4.p184.051] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]

Culbertson A, Goel S, Madden MB, Safaeinili N, Jackson KL, Carton T, Waitman R, Liu M, Krishnamurthy A, Hall L, Cappella N, Visweswaran S, Becich MJ, Applegate R, Bernstam E, Rothman R, Matheny M, Lipori G, Bian J, Hogan W, Bell D, Martin A, Grannis S, Klann J, Sutphen R, O'Hara AB, Kho A. The Building Blocks of Interoperability. A Multisite Analysis of Patient Demographic Attributes Available for Matching. Appl Clin Inform 2017;8:322-336. [PMID: 28378025 PMCID: PMC6241737 DOI: 10.4338/aci-2016-11-ra-0196] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2016] [Accepted: 01/21/2017] [Indexed: 11/23/2022] Open

Marx MM, Dulas FM, Schumacher KM. [Improving the visibility of rare diseases in health care systems by specific routine coding]. Bundesgesundheitsblatt Gesundheitsforschung Gesundheitsschutz 2017;60:532-536. [PMID: 28349172 DOI: 10.1007/s00103-017-2534-9] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]

St. Sauver JL, Carr AB, Yawn BP, Grossardt BR, Bock-Goodner CM, Klein LL, Pankratz JJ, Finney Rutten LJ, Rocca WA. Linking medical and dental health record data: a partnership with the Rochester Epidemiology Project. BMJ Open 2017;7:e012528. [PMID: 28360234 PMCID: PMC5372048 DOI: 10.1136/bmjopen-2016-012528] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 02/04/2023] Open

Bohn J, Eddings W, Schneeweiss S. Conducting Privacy-Preserving Multivariable Propensity Score Analysis When Patient Covariate Information Is Stored in Separate Locations. Am J Epidemiol 2017;185:501-510. [PMID: 28399565 DOI: 10.1093/aje/kww155] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2015] [Accepted: 03/24/2016] [Indexed: 11/13/2022] Open

Pettus DC, Vanderveen T, Canfield RL, Schad R. Reliable and Scalable Infusion System Integration with the Electronic Medical Record. Biomed Instrum Technol 2017;51:120-129. [PMID: 28296444 DOI: 10.2345/0899-8205-51.2.120] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]

Ni MY, Li TK, Hui RWH, McDowell I, Leung GM. Requesting a unique personal identifier or providing a souvenir incentive did not affect overall consent to health record linkage: evidence from an RCT nested within a cohort. J Clin Epidemiol 2017;84:142-149. [PMID: 28115256 DOI: 10.1016/j.jclinepi.2017.01.003] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2016] [Revised: 12/15/2016] [Accepted: 01/13/2017] [Indexed: 11/15/2022]

Stausberg J, Waldenburger A, Borgs C, Schnell R. Combining Different Privacy-Preserving Record Linkage Methods for Hospital Admission Data. Stud Health Technol Inform 2017;235:161-165. [PMID: 28423775] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]

Kellogg KM, Fairbanks RJ, Ratwani RM. EHR Usability: Get It Right from the Start. Biomed Instrum Technol 2017;51:197-199. [PMID: 28530885 DOI: 10.2345/0899-8205-51.3.197] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [What about the content of this article? (0)] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]

Holmgren AJ, Patel V, Charles D, Adler-Milstein J. US hospital engagement in core domains of interoperability. Am J Manag Care 2016;22:e395-e402. [PMID: 27982673] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]

Abstract

OBJECTIVES

To assess US hospital engagement in the 4 core domains of interoperability (find, send, receive, integrate) and whether engaging in these domains is associated with electronic availability of clinical data from outside providers.

STUDY DESIGN

Retrospective analysis of survey data.

METHODS

Analysis of the American Hospital Association (AHA) Annual Survey of Hospitals and the American Hospital Association (AHA) Annual Survey of Hospitals - IT Supplement datasets for 2014. Respondents included 3307 US hospitals to the AHA Annual Survey - IT Supplement. We created measures of hospital engagement in 4 core domains of interoperability, as well as access to electronic clinical data from outside providers. Regression analysis was to identify hospital characteristics associated with each measure.

RESULTS

Twenty-one percent of US hospitals engaged in all 4 interoperability domains, and 25% engaged in none. Hospitals engaged in all 4 domains were more likely to have a "basic" (odds ratio [OR], 3.53; P < .01) or "comprehensive" (OR, 5.04; P < .01) electronic health record (EHR) in comparison to a less than "basic" EHR, participate in a Regional Health Information Organization (OR, 4.29; P < .01), use a single EHR vendor (OR, 2.15; P < .01), and have a third-party health information exchange vendor (OR, 2.32; P < .01). They also differed by non-IT characteristics, such as medical home participation (OR, 1.77; P < .01). Hospitals that find (OR, 5.51; P < .01), receive (OR, 2.56; P < .01), or integrate (OR, 2.53; P < .01) information were more likely to report routine clinical information availability from outside providers.

CONCLUSIONS

The one-fifth of US hospitals engaged in key domains of interoperability were more likely to have certain information technology infrastructure and participate in delivery reform. Encouragingly, interoperability engagement was associated with routine clinical information availability. Our results point to the need for ongoing efforts to expand interoperability, with the potential benefit of better information availability for clinicians and better care.

Collapse

Maguire A, Moriarty J, O'Reilly D, McCann M. Education as a predictor of antidepressant and anxiolytic medication use after bereavement: a population-based record linkage study. Qual Life Res 2016;26:1251-1262. [PMID: 27770330 PMCID: PMC5376389 DOI: 10.1007/s11136-016-1440-1] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/15/2016] [Indexed: 11/25/2022]

Abstract

Purpose

Educational attainment has been shown to be positively associated with mental health and a potential buffer to stressful events. One stressful life event likely to affect everyone in their lifetime is bereavement. This paper assesses the effect of educational attainment on mental health post-bereavement.

Methods

By utilising large administrative datasets, linking Census returns to death records and prescribed medication data, we analysed the bereavement exposure of 208,332 individuals aged 25–74 years. Two-level multi-level logistic regression models were constructed to determine the likelihood of antidepressant medication use (a proxy of mental ill health) post-bereavement given level of educational attainment.

Results

Individuals who are bereaved have greater antidepressant use than those who are not bereaved, with over a quarter (26.5 %) of those bereaved by suicide in receipt of antidepressant medication compared to just 12.4 % of those not bereaved. Within individuals bereaved by a sudden death, those with a university degree or higher qualifications are 73 % less likely to be in receipt of antidepressant medication compared to those with no qualifications, after full adjustment for demographic, socio-economic and area factors (OR 0.27, 95 % CI 0.09,0.75). Higher educational attainment and no qualifications have an equivalent effect for those bereaved by suicide.

Conclusions

Education may protect against poor mental health, as measured by the use of antidepressant medication, post-bereavement, except in those bereaved by suicide. This is likely due to the improved cognitive, personal and psychological skills gained from time spent in education.

Electronic supplementary material

The online version of this article (doi:10.1007/s11136-016-1440-1) contains supplementary material, which is available to authorized users.

Collapse

Harron K, Gilbert R, Cromwell D, van der Meulen J. Linking Data for Mothers and Babies in De-Identified Electronic Health Data. PLoS One 2016;11:e0164667. [PMID: 27764135 PMCID: PMC5072610 DOI: 10.1371/journal.pone.0164667] [Citation(s) in RCA: 64] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2016] [Accepted: 09/29/2016] [Indexed: 01/11/2023] Open