Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Sauleau EA, Paumier JP, Buemi A. Medical record linkage in health information systems by approximate string matching and clustering. BMC Med Inform Decis Mak 2005;5:32. [PMID: 16219102 PMCID: PMC1274322 DOI: 10.1186/1472-6947-5-32] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2005] [Accepted: 10/11/2005] [Indexed: 11/21/2022] Open

For:	Sauleau EA, Paumier JP, Buemi A. Medical record linkage in health information systems by approximate string matching and clustering. BMC Med Inform Decis Mak 2005;5:32. [PMID: 16219102 PMCID: PMC1274322 DOI: 10.1186/1472-6947-5-32] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2005] [Accepted: 10/11/2005] [Indexed: 11/21/2022] Open

Number

Cited by Other Article(s)

FIRLA: A Fast Incremental Record Linkage Algorithm. J Biomed Inform 2022;130:104094. [PMID: 35550929 DOI: 10.1016/j.jbi.2022.104094] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2021] [Revised: 05/02/2022] [Accepted: 05/04/2022] [Indexed: 11/23/2022]

Domingues MAP, Camacho R, Rodrigues PP. CMIID: A comprehensive medical information identifier for clinical search harmonization in Data Safe Havens. J Biomed Inform 2020;114:103669. [PMID: 33359111 DOI: 10.1016/j.jbi.2020.103669] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2020] [Revised: 11/28/2020] [Accepted: 12/16/2020] [Indexed: 11/27/2022]

McManus BM, Richardson Z, Schenkman M, Murphy NJ, Everhart RM, Hambidge S, Morrato E. Child characteristics and early intervention referral and receipt of services: a retrospective cohort study. BMC Pediatr 2020;20:84. [PMID: 32087676 PMCID: PMC7036184 DOI: 10.1186/s12887-020-1965-x] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/23/2019] [Accepted: 02/07/2020] [Indexed: 02/06/2023] Open

Abstract

BACKGROUND

Early Intervention (EI) is a federally mandated, state-administered system of care for children with developmental delays and disabilities under the age of three. Gaps exist in the process of accessing EI through pediatric primary care, and low rates of EI access are well documented and disproportionately affect poor and minority children. The aims of this paper are to examine child characteristics associated with gaps in EI (1) referral, (2) access and (3) service use. To our knowledge, this is the first study to leverage linked safety net health system pediatric primary care and EI records data to follow EI-referred children longitudinally to understand EI service use gaps from EI referral to EI service utilization.

METHODS

In a retrospective cohort design (14,710 children with developmental disability or delay), we linked pediatric primary care records between a large, integrated safety net health system in metro Denver and its corresponding EI program (2014-2016). Using adjusted marginal effects [ME, (95% CI)], we estimated gaps in EI referral, access, and service type (i.e., physical [PT], occupational [OT], speech therapy [ST] and developmental intervention [DI]). Analyses accounted for child characteristics including socio-demographics, diagnosis, condition severity, and baseline function.

RESULTS

Only 18.7% of EI-eligible children (N = 2726) received a referral; 26% of those (N = 722) received services for a net enrollment rate of 5% among EI-eligible children. Having the most severe developmental condition was positively associated with EI referral [ME = 0.334 [0.249, 0.420]) and Individualized Family Services Plan (IFSP) receipt [ME = 0.156 [0.088, 0.223]). Children less likely to be EI-referred were Black, non-Hispanic (BNH) [ME = -0.029 (- 0.054, - 0.004)] and had a diagnosed condition ([ME = - 0.046 (- 0.087, - 0.005)]. Children with a diagnosis and those with higher income were more likely to receive PT or OT. Higher baseline cognitive and adaptive skills were associated with lower likelihood of PT [ME = -0.029 (- 0.054, - 0.004)], OT [ME = -0.029 (- 0.054, - 0.004)], and ST [ME = -0.029 (- 0.054, - 0.004)].

CONCLUSIONS

We identified and characterized gaps in EI referral, access, and service use in an urban safety-net population of children with high rates of developmental delay. Interventions are needed to improve integrated systems of care affecting primary care and EI processes and coordination.

Collapse

Lattar H, Salem AB, Ben Ghezala HH. Does data cleaning improve heart disease prediction? ACTA ACUST UNITED AC 2020. [DOI: 10.1016/j.procs.2020.09.109] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]

Chin EL, Simmons G, Bouzid YY, Kan A, Burnett DJ, Tagkopoulos I, Lemay DG. Nutrient Estimation from 24-Hour Food Recalls Using Machine Learning and Database Mapping: A Case Study with Lactose. Nutrients 2019;11:E3045. [PMID: 31847188 PMCID: PMC6950225 DOI: 10.3390/nu11123045] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2019] [Revised: 11/30/2019] [Accepted: 12/06/2019] [Indexed: 01/03/2023] Open

Transfusion Safety: The Nature and Outcomes of Errors in Patient Registration. Transfus Med Rev 2019;33:78-83. [DOI: 10.1016/j.tmrv.2018.11.004] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2018] [Revised: 11/18/2018] [Accepted: 11/28/2018] [Indexed: 11/23/2022]

Agopian AJ, Salemi JL, Tanner JP, Kirby RS. Using birth defects surveillance programs for population-based estimation of sibling recurrence risks. Birth Defects Res 2018;110:1383-1387. [PMID: 30338928 DOI: 10.1002/bdr2.1387] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2018] [Revised: 07/30/2018] [Accepted: 08/02/2018] [Indexed: 11/06/2022]

de Paula AA, Pires DF, Filho PA, de Lemos KRV, Barçante E, Pacheco AG. A comparison of accuracy and computational feasibility of two record linkage algorithms in retrieving vital status information from HIV/AIDS patients registered in Brazilian public databases. Int J Med Inform 2018;114:45-51. [PMID: 29673602 DOI: 10.1016/j.ijmedinf.2018.03.005] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2017] [Revised: 03/19/2018] [Accepted: 03/19/2018] [Indexed: 11/19/2022]

Abstract

BACKGROUND AND OBJECTIVE

While cross-referencing information from people living with HIV/AIDS (PLWHA) to the official mortality database is a critical step in monitoring the HIV/AIDS epidemic in Brazil, the accuracy of the linkage routine may compromise the validity of the final database, yielding to biased epidemiological estimates. We compared the accuracy and the total runtime of two linkage algorithms applied to retrieve vital status information from PLWHA in Brazilian public databases.

METHODS

Nominally identified records from PLWHA were obtained from three distinct government databases. Linkage routines included an algorithm in Python language (PLA) and Reclink software (RlS), a probabilistic software largely utilized in Brazil. Records from PLWHA¹ known to be alive were added to those from patients reported as deceased. Data were then searched into the mortality system. Scenarios where 5% and 50% of patients actually dead were simulated, considering both complete cases and 20% missing maternal names.

RESULTS

When complete information was available both algorithms had comparable accuracies. In the scenario of 20% missing maternal names, PLA² and RlS³ had sensitivities of 94.5% and 94.6% (p > 0.5), respectively; after manual reviewing, PLA sensitivity increased to 98.4% (96.6-100.0) exceeding that for RlS (p < 0.01). PLA had higher positive predictive value in 5% death proportion. Manual reviewing was intrinsically required by RlS in up to 14% register for people actually dead, whereas the corresponding proportion ranged from 1.5% to 2% for PLA. The lack of manual inspection did not alter PLA sensitivity when complete information was available. When incomplete data was available PLA sensitivity increased from 94.5% to 98.4%, thus exceeding that presented by RlS (94.6%, p < 0.05). RlS spanned considerably less processing time compared to PLA.

CONCLUSION

Both linkage algorithms presented interchangeable accuracies in retrieving vital status data from PLWHA. RlS had a considerably lesser runtime but intrinsically required manually reviewing a fastidious proportion of the matched registries. On the other hand, PLA spent quite more runtime but spared manual reviewing at no expense of accuracy.

Collapse

Stausberg J, Nasseh D. Evaluation of a Binary Semi-supervised Classification Technique for Probabilistic Record Linkage. Methods Inf Med 2018;55:136-43. [DOI: 10.3414/me14-01-0087] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2014] [Accepted: 03/25/2015] [Indexed: 11/09/2022]

Abstract SummaryBackground: The process of merging data of different data sources is referred to as record linkage. A medical environment with increased preconditions on privacy protection demands the transformation of clear-text attributes like first name or date of birth into one-way encrypted pseudonyms. When performing an automated or privacy preserving record linkage there might be the need of a binary classification deciding whether two records should be classified as the same entity. The classification is the final of the four main phases of the record linkage process: Preprocessing, indexing, matching and classification. The choice of binary classification techniques in dependence of project specifications in particular data quality has not extensively been studied yet.Objectives: The aim of this work is the introduction and evaluation of an automatable semi-supervised binary classification system applied within the field of record linkage capable of competing or even surpassing advanced automated techniques of the domain of unsupervised classification.Methods: This work describes the rationale leading to the model and the final implementation of an automatable semi-supervised binary classification system and the comparison of its classification performance to an advanced active learning approach out of the domain of unsupervised learning. The performance of both systems has been measured on a broad variety of artificial test sets (n = 400), based on real patient data, with distinct and unique characteristics.Results: While the classification performance for both methods measured as F-measure was relatively close on test sets with maximum defined data quality, 0.996 for semi-supervised classification, 0.993 for unsupervised classification, it incrementally diverged for test sets of worse data quality dropping to 0.964 for semi-supervised classification and 0.803 for unsupervised classification.Conclusions: Aside from supplying a viable model for semi-supervised classification for automated probabilistic record linkage, the tests conducted on a large amount of test sets suggest that semi-supervised techniques might generally be capable of outperforming unsupervised techniques especially on data with lower levels of data quality. Collapse

Corradi JP, Chhabra J, Mather JF, Waszynski CM, Dicks RS. Analysis of multi-dimensional contemporaneous EHR data to refine delirium assessments. Comput Biol Med 2016;75:267-74. [PMID: 27340924 DOI: 10.1016/j.compbiomed.2016.06.013] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2016] [Revised: 06/10/2016] [Accepted: 06/13/2016] [Indexed: 12/16/2022]

Zech J, Husk G, Moore T, Shapiro JS. Measuring the Degree of Unmatched Patient Records in a Health Information Exchange Using Exact Matching. Appl Clin Inform 2016;7:330-40. [PMID: 27437044 DOI: 10.4338/aci-2015-11-ra-0158] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2015] [Accepted: 02/26/2016] [Indexed: 11/23/2022] Open

Abstract

BACKGROUND

Health information exchange (HIE) facilitates the exchange of patient information across different healthcare organizations. To match patient records across sites, HIEs usually rely on a master patient index (MPI), a database responsible for determining which medical records at different healthcare facilities belong to the same patient. A single patient's records may be improperly split across multiple profiles in the MPI.

OBJECTIVES

We investigated the how often two individuals shared the same first name, last name, and date of birth in the Social Security Death Master File (SSDMF), a US government database containing over 85 million individuals, to determine the feasibility of using exact matching as a split record detection tool. We demonstrated how a method based on exact record matching could be used to partially measure the degree of probable split patient records in the MPI of an HIE.

METHODS

We calculated the percentage of individuals who were uniquely identified in the SSDMF using first name, last name, and date of birth. We defined a measure consisting of the average number of unique identifiers associated with a given first name, last name, and date of birth. We calculated a reference value for this measure on a subsample of SSDMF data. We compared this measure value to data from a functioning HIE.

RESULTS

We found that it was unlikely for two individuals to share the same first name, last name, and date of birth in a large US database including over 85 million individuals. 98.81% of individuals were uniquely identified in this dataset using only these three items. We compared the value of our measure on a subsample of Social Security data (1.00089) to that of HIE data (1.1238) and found a significant difference (t-test p-value < 0.001).

CONCLUSIONS

This method may assist HIEs in detecting split patient records.

Collapse

[Completeness assessment of the Breton registry of congenital abnormalities: A checking tool based on hospital discharge data]. Rev Epidemiol Sante Publique 2015;63:223-35. [PMID: 26119557 DOI: 10.1016/j.respe.2015.04.012] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2014] [Revised: 03/18/2015] [Accepted: 04/08/2015] [Indexed: 11/23/2022] Open

Abstract

BACKGROUND

Exhaustiveness is required for registries. In the Breton registry of congenital abnormalities, cases are recorded at the source. We use hospital discharge data in order to verify the completeness of the registry. In this paper, we present a computerized tool for completeness assessment applied to the Breton registry.

METHODS

All the medical information departments were solicited once a year, asking for infant medical stays for newborns alive at one year old and for mother's stays if not. Files were transmitted by secure messaging and data were processed on a secure server. An identity-matching algorithm was applied and a similarity score calculated. When the record was not linked automatically or manually, the medical record had to be consulted. The exhaustiveness rate was assessed using the capture recapture method and the proportion of cases matched manually was used to assess the identity matching algorithm.

RESULTS

The computerized tool bas been used in common practice since June 2012 by the registry investigators. The results presented concerned the years 2011 and 2012. There were 470 potential cases identified from the hospital discharge data in 2011 and 538 in 2012, 35 new cases were detected in 2011 (32 children born alive and 3 stillborn), and 33 in 2012 (children born alive). There were respectively 85 and 137 false-positive cases. The theorical exhaustiveness rate reached 91% for both years. The rate of exact matching amounted to 68%; 6% of the potential cases were linked manually.

CONCLUSION

Hospital discharge databases contribute to the quality of the registry even though reports are made at the source. The implemented tool facilitates the investigator's work. In the future, use of the national identifying number, when allowed, should facilitate linkage between registry data and hospital discharge data.

Collapse

Rudniy A, Song M, Geller J. Mapping biological entities using the longest approximately common prefix method. BMC Bioinformatics 2014;15:187. [PMID: 24928653 PMCID: PMC4086698 DOI: 10.1186/1471-2105-15-187] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2013] [Accepted: 05/29/2014] [Indexed: 11/24/2022] Open

Kum HC, Krishnamurthy A, Machanavajjhala A, Reiter MK, Ahalt S. Privacy preserving interactive record linkage (PPIRL). J Am Med Inform Assoc 2013;21:212-20. [PMID: 24201028 DOI: 10.1136/amiajnl-2013-002165] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022] Open

Cox S, Martin R, Somaia P, Smith K. The development of a data-matching algorithm to define the ‘case patient’. AUST HEALTH REV 2013;37:54-9. [DOI: 10.1071/ah11161] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2012] [Accepted: 07/02/2012] [Indexed: 11/23/2022]

Abstract Objectives. To describe a model that matches electronic patient care records within a given case to one or more patients within that case. Method. This retrospective study included data from all metropolitan Ambulance Victoria electronic patient care records (n = 445 576) for the time period 1 January 2009–31 May 2010. Data were captured via VACIS (Ambulance Victoria, Melbourne, Vic., Australia), an in-field electronic data capture system linked to an integrated data warehouse database. The case patient algorithm included ‘Jaro–Winkler’, ‘Soundex’ and ‘weight matching’ conditions. Results. The case patient matching algorithm has a sensitivity of 99.98%, a specificity of 99.91% and an overall accuracy of 99.98%. Conclusions. The case patient algorithm provides Ambulance Victoria with a sophisticated, efficient and highly accurate method of matching patient records within a given case. This method has applicability to other emergency services where unique identifiers are case based rather than patient based. What is known about the topic? Accurate pre-hospital data that can be linked to patient outcomes is widely accepted as critical to support pre-hospital patient care and system performance. What does this paper add? There is a paucity of literature describing electronic matching of patient care records at the patient level rather than the case level. Ambulance Victoria has developed a complex yet efficient and highly accurate method for electronically matching patient records, in the absence of a patient-specific unique identifier. Linkage of patient information from multiple patient care records to determine if the records are for the same individual defines the ‘case patient’. What are the implications for practitioners? This paper describes a model of record linkage where patients are matched within a given case at the patient level as opposed to the case level. This methodology is applicable to other emergency services where unique identifiers are case based. Collapse

Finney JM, Walker AS, Peto TEA, Wyllie DH. An efficient record linkage scheme using graphical analysis for identifier error detection. BMC Med Inform Decis Mak 2011;11:7. [PMID: 21284874 PMCID: PMC3039555 DOI: 10.1186/1472-6947-11-7] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2010] [Accepted: 02/01/2011] [Indexed: 11/10/2022] Open

Silveira DPD, Artmann E. Accuracy of probabilistic record linkage applied to health databases: systematic review. Rev Saude Publica 2009;43:875-82. [PMID: 19784456 DOI: 10.1590/s0034-89102009005000060] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2008] [Accepted: 04/15/2009] [Indexed: 11/21/2022] Open