Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Swerdel JN, Hripcsak G, Ryan PB. PheValuator: Development and evaluation of a phenotype algorithm evaluator. J Biomed Inform 2019;97:103258. [PMID: 31369862 DOI: 10.1016/j.jbi.2019.103258] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Received: 04/01/2019] [Revised: 07/09/2019] [Accepted: 07/28/2019] [Indexed: 10/26/2022]

Cited by Other Article(s)

Wang L, Golchin N, Klot SV, Salinas CA, Manlik K, Patadia V, Miller MK, Asubonteng J, McDermott R, Barberio J, Gipson G. Adopting a Framework for Rapid Real-World Data Analyses in Safety Signal Assessment. Ther Innov Regul Sci 2024:10.1007/s43441-024-00694-7. [PMID: 39242460 DOI: 10.1007/s43441-024-00694-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/03/2024] [Accepted: 08/23/2024] [Indexed: 09/09/2024]

Cai CX, Nishimura A, Bowring MG, Westlund E, Tran D, Ng JH, Nagy P, Cook M, McLeggon JA, DuVall SL, Matheny ME, Golozar A, Ostropolets A, Minty E, Desai P, Bu F, Toy B, Hribar M, Falconer T, Zhang L, Lawrence-Archer L, Boland MV, Goetz K, Hall N, Shoaibi A, Reps J, Sena AG, Blacketer C, Swerdel J, Jhaveri KD, Lee E, Gilbert Z, Zeger SL, Crews DC, Suchard MA, Hripcsak G, Ryan PB. Similar Risk of Kidney Failure among Patients with Blinding Diseases Who Receive Ranibizumab, Aflibercept, and Bevacizumab: An Observational Health Data Sciences and Informatics Network Study. Ophthalmol Retina 2024;8:733-743. [PMID: 38519026 PMCID: PMC11298306 DOI: 10.1016/j.oret.2024.03.014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/01/2024] [Revised: 03/08/2024] [Accepted: 03/12/2024] [Indexed: 03/24/2024]

Abstract

PURPOSE

To characterize the incidence of kidney failure associated with intravitreal anti-VEGF exposure and to compare the risk of kidney failure in patients treated with ranibizumab, aflibercept, or bevacizumab.

DESIGN

Retrospective cohort study across 12 databases in the Observational Health Data Sciences and Informatics (OHDSI) network.

SUBJECTS

Subjects aged ≥ 18 years with ≥ 3 monthly intravitreal anti-VEGF medications for a blinding disease (diabetic retinopathy, diabetic macular edema, exudative age-related macular degeneration, or retinal vein occlusion).

METHODS

The standardized incidence proportions and rates of kidney failure while on treatment with anti-VEGF were calculated. For each comparison (e.g., aflibercept versus ranibizumab), patients from each group were matched 1:1 using propensity scores. Cox proportional hazards models were used to estimate the risk of kidney failure while on treatment. A random effects meta-analysis was performed to combine each database's hazard ratio (HR) estimate into a single network-wide estimate.

MAIN OUTCOME MEASURES

Incidence of kidney failure while on anti-VEGF treatment, and time from cohort entry to kidney failure.

RESULTS

Of the 6.1 million patients with blinding diseases, 37 189 who received ranibizumab, 39 447 aflibercept, and 163 611 bevacizumab were included; the total treatment exposure time was 161 724 person-years. The average standardized incidence proportion of kidney failure was 678 per 100 000 persons (range, 0-2389), and incidence rate 742 per 100 000 person-years (range, 0-2661). The meta-analysis HR of kidney failure comparing aflibercept with ranibizumab was 1.01 (95% confidence interval [CI], 0.70-1.47; P = 0.45), ranibizumab with bevacizumab 0.95 (95% CI, 0.68-1.32; P = 0.62), and aflibercept with bevacizumab 0.95 (95% CI, 0.65-1.39; P = 0.60).

CONCLUSIONS

There was no substantially different relative risk of kidney failure between those who received ranibizumab, bevacizumab, or aflibercept. Practicing ophthalmologists and nephrologists should be aware of the risk of kidney failure among patients receiving intravitreal anti-VEGF medications and that there is little empirical evidence to preferentially choose among the specific intravitreal anti-VEGF agents.

FINANCIAL DISCLOSURES

Proprietary or commercial disclosure may be found in the Footnotes and Disclosures at the end of this article.
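
The random-effects pooling step described in the METHODS of this abstract can be illustrated with a minimal sketch. It assumes the DerSimonian-Laird estimator, which the abstract does not name, so the study's actual procedure may differ. The function name and the per-database inputs are hypothetical and are not taken from the study.

```python
import math


def pool_random_effects(log_hrs, std_errs):
    """Pool per-database log hazard ratios with a DerSimonian-Laird
    random-effects model (an assumed estimator, for illustration only).

    Returns (pooled_log_hr, pooled_std_err, tau_squared).
    """
    # Fixed-effect (inverse-variance) weights and estimate.
    weights = [1.0 / se ** 2 for se in std_errs]
    sum_w = sum(weights)
    fixed = sum(w * y for w, y in zip(weights, log_hrs)) / sum_w

    # Cochran's Q measures heterogeneity around the fixed-effect estimate.
    q = sum(w * (y - fixed) ** 2 for w, y in zip(weights, log_hrs))
    k = len(log_hrs)
    c = sum_w - sum(w ** 2 for w in weights) / sum_w

    # Between-database variance, truncated at zero.
    tau2 = max(0.0, (q - (k - 1)) / c) if c > 0 else 0.0

    # Random-effects weights add tau^2 to each within-database variance.
    re_weights = [1.0 / (se ** 2 + tau2) for se in std_errs]
    sum_re = sum(re_weights)
    pooled = sum(w * y for w, y in zip(re_weights, log_hrs)) / sum_re
    pooled_se = math.sqrt(1.0 / sum_re)
    return pooled, pooled_se, tau2


# Hypothetical per-database hazard ratios and standard errors of log(HR).
example_hrs = [0.90, 1.10, 1.05]
example_ses = [0.20, 0.25, 0.30]
pooled, se, tau2 = pool_random_effects(
    [math.log(hr) for hr in example_hrs], example_ses
)
low, high = math.exp(pooled - 1.96 * se), math.exp(pooled + 1.96 * se)
print(f"pooled HR {math.exp(pooled):.2f} (95% CI {low:.2f}-{high:.2f})")
```

Pooling on the log scale keeps the confidence interval symmetric around the log hazard ratio before it is transformed back.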

Affiliation(s)

Cindy X Cai Wilmer Eye Institute, Johns Hopkins School of Medicine, Baltimore, Maryland.
Akihiko Nishimura Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland
Mary G Bowring Department of Biomedical Engineering, Johns Hopkins School of Medicine, Baltimore, Maryland
Erik Westlund Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland
Diep Tran Wilmer Eye Institute, Johns Hopkins School of Medicine, Baltimore, Maryland
Jia H Ng Division of Kidney Diseases and Hypertension, Donald and Barbara Zucker School of Medicine at Hofstra/Northwell, New York
Paul Nagy Department of Biomedical Informatics and Data Science, Johns Hopkins School of Medicine, Johns Hopkins University, Baltimore, Maryland
Michael Cook Johns Hopkins University, Baltimore, Maryland
Jody-Ann McLeggon Department of Biomedical Informatics, Columbia University, New York, New York
Scott L DuVall VA Informatics and Computing Infrastructure, US Department of Veterans Affairs, Salt Lake City, Utah; Department of Internal Medicine Division of Epidemiology, University of Utah School of Medicine, Salt Lake City, Utah
Michael E Matheny VA Informatics and Computing Infrastructure, Tennessee Valley Healthcare System, Nashville, Tennessee; Department of Biomedical Informatics, Vanderbilt University, Nashville, Tennessee
Asieh Golozar Odysseus Data Services, Inc., Cambridge, Massachusetts; OHDSI Center at the Roux Institute, Northeastern University, Boston, Massachusetts
Anna Ostropolets Odysseus Data Services, Inc., Cambridge, Massachusetts
Evan Minty O'Brien Center for Public Health, Department of Medicine, University of Calgary, Canada
Priya Desai Technology / Digital Solutions, Stanford Health Care and Stanford University School of Medicine, Palo Alto, California
Fan Bu Department of Biostatistics, University of California - Los Angeles, Los Angeles, California
Brian Toy Roski Eye Institute, Keck School of Medicine, University of Southern California; Los Angeles, California
Michelle Hribar National Eye Institute, National Institutes of Health, Bethesda, Maryland; Casey Eye Institute, Oregon Health & Science University, Portland, Oregon
Thomas Falconer Department of Biomedical Informatics, Columbia University, New York, New York
Linying Zhang Department of Biomedical Informatics, Columbia University, New York, New York
Laurence Lawrence-Archer Odysseus Data Services, Inc., Cambridge, Massachusetts; OHDSI Center at the Roux Institute, Northeastern University, Boston, Massachusetts
Michael V Boland Mass Eye and Ear, and Harvard Medical School, Boston, Massachusetts
Kerry Goetz National Eye Institute, National Institutes of Health, Bethesda, Maryland
Nathan Hall Janssen Research and Development, Titusville, New Jersey
Azza Shoaibi Janssen Research and Development, Titusville, New Jersey
Jenna Reps Janssen Research and Development, Titusville, New Jersey
Anthony G Sena Janssen Research and Development, Titusville, New Jersey; Department of Medical Informatics, Erasmus University Medical Center, Rotterdam, the Netherlands
Clair Blacketer Janssen Research and Development, Titusville, New Jersey
Joel Swerdel Janssen Research and Development, Titusville, New Jersey
Kenar D Jhaveri Glomerular Center at Northwell Health, Division of Kidney Diseases and Hypertension, Donald and Barbara Zucker School of Medicine at Hofstra/Northwell, New York
Edward Lee Roski Eye Institute, Keck School of Medicine, University of Southern California; Los Angeles, California
Zachary Gilbert Roski Eye Institute, Keck School of Medicine, University of Southern California; Los Angeles, California
Scott L Zeger Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland
Deidra C Crews Division of Nephrology, Department of Medicine, Johns Hopkins University School of Medicine, Baltimore, Maryland
Marc A Suchard VA Informatics and Computing Infrastructure, US Department of Veterans Affairs, Salt Lake City, Utah; Department of Biostatistics, University of California - Los Angeles, Los Angeles, California
George Hripcsak Department of Biomedical Informatics, Columbia University, New York, New York
Patrick B Ryan Janssen Research and Development, Titusville, New Jersey

Tang AS, Woldemariam SR, Miramontes S, Norgeot B, Oskotsky TT, Sirota M. Harnessing EHR data for health research. Nat Med 2024;30:1847-1855. [PMID: 38965433 DOI: 10.1038/s41591-024-03074-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/03/2024] [Accepted: 05/17/2024] [Indexed: 07/06/2024]

Bu F, Arshad F, Hripcsak G, Ryan PB, Schuemie MJ, Suchard MA. Authors' Response to Huang et al.'s Comment on "Serially Combining Epidemiological Designs Does Not Improve Overall Signal Detection in Vaccine Safety Surveillance". Drug Saf 2024;47:403-404. [PMID: 38441750 DOI: 10.1007/s40264-024-01411-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Accepted: 02/12/2024] [Indexed: 03/21/2024]

Gao J, Bonzel CL, Hong C, Varghese P, Zakir K, Gronsbell J. Semi-supervised ROC analysis for reliable and streamlined evaluation of phenotyping algorithms. J Am Med Inform Assoc 2024;31:640-650. [PMID: 38128118 PMCID: PMC10873838 DOI: 10.1093/jamia/ocad226] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/03/2023] [Revised: 09/22/2023] [Accepted: 11/20/2023] [Indexed: 12/23/2023]

Swerdel JN, Conover MM. Comparing broad and narrow phenotype algorithms: differences in performance characteristics and immortal time incurred. J Pharm Pharm Sci 2024;26:12095. [PMID: 38235322 PMCID: PMC10791821 DOI: 10.3389/jpps.2023.12095] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/22/2023] [Accepted: 12/15/2023] [Indexed: 01/19/2024]

Abstract

Introduction: When developing phenotype algorithms for observational research, there is usually a trade-off between definitions that are sensitive and definitions that are specific. The objective of this study was to estimate the performance characteristics of phenotype algorithms designed for increasing specificity and to estimate the immortal time associated with each algorithm. Materials and methods: We examined algorithms for 11 chronic health conditions. The analyses used data from five databases. For each health condition, we created five algorithms to examine performance (sensitivity and positive predictive value (PPV)) differences: one broad algorithm using a single code for the health condition and four narrow algorithms where a second diagnosis code was required 1-30 days, 1-90 days, 1-365 days, or 1 to all days in a subject's continuous observation period after the first code. We also examined the proportion of immortal time relative to time-at-risk (TAR) for four outcomes. The TARs were: 0-30 days after the first condition occurrence (the index date), 0-90 days post-index, 0-365 days post-index, and 0-1,095 days post-index. Performance of algorithms for chronic health conditions was estimated using PheValuator (V2.1.4) from the OHDSI toolstack. Immortal time was calculated as the time from the index date until the first of the following: 1) the outcome; 2) the end of the outcome TAR; 3) the occurrence of the second code for the chronic health condition. Results: In the first analysis, the narrow phenotype algorithms, i.e., those requiring a second condition code, produced higher estimates for PPV and lower estimates for sensitivity compared to the single code algorithm. In all conditions, increasing the time to the required second code increased the sensitivity of the algorithm. In the second analysis, the amount of immortal time increased as the window used to identify the second diagnosis code increased. The proportion of TAR that was immortal was higher in the 30-day TAR analyses than in the 1,095-day TAR analyses. Conclusion: Adding a second code to a health condition algorithm is a potentially valid approach to increasing specificity, albeit at the cost of incurring immortal time.
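
The immortal-time rule stated in this abstract (time from the index date until the first of the outcome, the end of the TAR, or the second code) can be sketched as follows. This is only an illustration: days are counted from the index date (day 0), the function names are hypothetical, and the denominator used for the proportion (at-risk time ending at the outcome or the end of the TAR) is an assumption that the abstract does not spell out.

```python
def immortal_time_days(tar_end_day, outcome_day=None, second_code_day=None):
    """Days from index (day 0) until the first of: the outcome, the end of
    the TAR, or the second diagnosis code. None means the event never occurred."""
    stops = [d for d in (tar_end_day, outcome_day, second_code_day) if d is not None]
    return max(0, min(stops))


def immortal_fraction(subjects, tar_end_day):
    """Proportion of time-at-risk that is immortal across subjects.

    `subjects` is a list of (outcome_day, second_code_day) pairs.
    Assumption: each subject's at-risk time ends at the outcome or at the
    end of the TAR window, whichever comes first.
    """
    immortal = 0
    at_risk = 0
    for outcome_day, second_code_day in subjects:
        immortal += immortal_time_days(tar_end_day, outcome_day, second_code_day)
        at_risk += tar_end_day if outcome_day is None else min(tar_end_day, outcome_day)
    return immortal / at_risk if at_risk > 0 else 0.0


# Hypothetical subjects: (outcome_day, second_code_day).
cohort = [(None, 10), (None, 200), (40, 90)]
for window in (30, 365):
    print(window, round(immortal_fraction(cohort, window), 3))
```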

Didden E, Lu D, Hsi A, Brand M, Hedlin H, Zamanian RT. Clinical evaluation of code-based algorithms to identify patients with pulmonary arterial hypertension in healthcare databases. Pulm Circ 2024;14:e12333. [PMID: 38333073 PMCID: PMC10851026 DOI: 10.1002/pul2.12333] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/14/2023] [Revised: 11/24/2023] [Accepted: 12/21/2023] [Indexed: 02/10/2024]

He S, Park S, Fujii Y, Pierce SL, Kraus EM, Wall HK, Therrien NL, Jackson SL. State-Level Hypertension Prevalence and Control Among Adults in the U.S. Am J Prev Med 2024;66:46-54. [PMID: 37877903 PMCID: PMC10898652 DOI: 10.1016/j.amepre.2023.09.010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/16/2023] [Revised: 09/08/2023] [Accepted: 09/08/2023] [Indexed: 10/26/2023]

Abstract

INTRODUCTION

Improving hypertension control is a national priority. Electronic health record data have the potential to augment traditional surveillance systems. This study aimed to assess hypertension prevalence and control at the state level using a previously established electronic health record-based phenotype for hypertension.

METHODS

Adult patients (N=11,031,368) were included from the IQVIA ambulatory electronic medical record-U.S. 2019 data set. IQVIA ambulatory electronic medical record comprises electronic health records from >100,000 providers and includes patients from every U.S. state and Washington DC. Authors compared hypertension prevalence and control estimates against those from the Behavioral Risk Factor Surveillance System 2019. Results were age-standardized and stratified by state and sociodemographic characteristics. Statistical analyses were conducted in 2022-2023.

RESULTS

IQVIA ambulatory electronic medical record-U.S. patients had a median age of 55 years, and 56.7% were women. Overall age-standardized hypertension prevalence was higher in IQVIA ambulatory electronic medical record-U.S. (35.0%) than in the Behavioral Risk Factor Surveillance System (29.7%); however, state-level geographic patterns were similar, with the highest burden in the South and Appalachia. Similar patterns were also observed by sociodemographic characteristics in both data sets: hypertension prevalence was higher in older age groups (than younger), men (than women), and Black patients (than other races). Hypertension control varied widely across states: among states with >1% data coverage, control rates were lowest in Nevada (51.1%), Washington DC (52.0%), and Mississippi (55.2%) and highest in Kansas (73.4%), New Jersey (72.3%), and Iowa (71.9%).

CONCLUSIONS

This study provided the first-ever estimates of hypertension control for all states and Washington DC. Electronic health record-based surveillance could support hypertension prevention and control efforts at the state level.

Ostropolets A, Hripcsak G, Husain SA, Richter LR, Spotnitz M, Elhussein A, Ryan PB. Scalable and interpretable alternative to chart review for phenotype evaluation using standardized structured data from electronic health records. J Am Med Inform Assoc 2023;31:119-129. [PMID: 37847668 PMCID: PMC10746303 DOI: 10.1093/jamia/ocad202] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/23/2023] [Revised: 09/23/2023] [Accepted: 10/02/2023] [Indexed: 10/19/2023]

Sun TY, Bhave SA, Altosaar J, Elhadad N. Assessing Phenotype Definitions for Algorithmic Fairness. AMIA Annu Symp Proc 2023;2022:1032-1041. [PMID: 37128361 PMCID: PMC10148336] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 05/03/2023]

He T, Belouali A, Patricoski J, Lehmann H, Ball R, Anagnostou V, Kreimeyer K, Botsis T. Trends and opportunities in computable clinical phenotyping: A scoping review. J Biomed Inform 2023;140:104335. [PMID: 36933631 DOI: 10.1016/j.jbi.2023.104335] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Received: 11/09/2022] [Revised: 03/07/2023] [Accepted: 03/09/2023] [Indexed: 03/18/2023]

Abstract

Identifying patient cohorts meeting the criteria of specific phenotypes is essential in biomedicine and particularly timely in precision medicine. Many research groups deliver pipelines that automatically retrieve and analyze data elements from one or more sources to automate this task and deliver high-performing computable phenotypes. We applied a systematic approach based on the Preferred Reporting Items for Systematic Reviews and Meta-Analyses guidelines to conduct a thorough scoping review on computable clinical phenotyping. Five databases were searched using a query that combined the concepts of automation, clinical context, and phenotyping. Subsequently, four reviewers screened 7960 records (after removing over 4000 duplicates) and selected 139 that satisfied the inclusion criteria. This dataset was analyzed to extract information on target use cases, data-related topics, phenotyping methodologies, evaluation strategies, and portability of developed solutions. Most studies supported patient cohort selection without discussing the application to specific use cases, such as precision medicine. Electronic Health Records were the primary source in 87.1% (N = 121) of all studies, and International Classification of Diseases codes were heavily used in 55.4% (N = 77) of all studies; however, only 25.9% (N = 36) of the records described compliance with a common data model. In terms of the presented methods, traditional Machine Learning (ML) was the dominant method, often combined with natural language processing and other approaches, while external validation and portability of computable phenotypes were pursued in many cases. These findings revealed that defining target use cases precisely, moving away from sole ML strategies, and evaluating the proposed solutions in the real setting are essential opportunities for future work. There is also momentum and an emerging need for computable phenotyping to support clinical and epidemiological research and precision medicine.

Swerdel JN, Ramcharran D, Hardin J. Using a data-driven approach for the development and evaluation of phenotype algorithms for systemic lupus erythematosus. PLoS One 2023;18:e0281929. [PMID: 36795690 PMCID: PMC9934349 DOI: 10.1371/journal.pone.0281929] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/08/2022] [Accepted: 02/04/2023] [Indexed: 02/17/2023]

Yang S, Varghese P, Stephenson E, Tu K, Gronsbell J. Machine learning approaches for electronic health records phenotyping: a methodical review. J Am Med Inform Assoc 2023;30:367-381. [PMID: 36413056 PMCID: PMC9846699 DOI: 10.1093/jamia/ocac216] [Citation(s) in RCA: 30] [Impact Index Per Article: 30.0] [Received: 07/28/2022] [Revised: 09/27/2022] [Accepted: 10/27/2022] [Indexed: 11/23/2022]

Abstract

OBJECTIVE

Accurate and rapid phenotyping is a prerequisite to leveraging electronic health records for biomedical research. While early phenotyping relied on rule-based algorithms curated by experts, machine learning (ML) approaches have emerged as an alternative to improve scalability across phenotypes and healthcare settings. This study evaluates ML-based phenotyping with respect to (1) the data sources used, (2) the phenotypes considered, (3) the methods applied, and (4) the reporting and evaluation methods used.

MATERIALS AND METHODS

We searched PubMed and Web of Science for articles published between 2018 and 2022. After screening 850 articles, we recorded 37 variables on 100 studies.

RESULTS

Most studies utilized data from a single institution and included information in clinical notes. Although chronic conditions were most commonly considered, ML also enabled the characterization of nuanced phenotypes such as social determinants of health. Supervised deep learning was the most popular ML paradigm, while semi-supervised and weakly supervised learning were applied to expedite algorithm development and unsupervised learning to facilitate phenotype discovery. ML approaches did not uniformly outperform rule-based algorithms, but deep learning offered a marginal improvement over traditional ML for many conditions.

DISCUSSION

Despite the progress in ML-based phenotyping, most articles focused on binary phenotypes and few articles evaluated external validity or used multi-institution data. Study settings were infrequently reported and analytic code was rarely released.

CONCLUSION

Continued research in ML-based phenotyping is warranted, with emphasis on characterizing nuanced phenotypes, establishing reporting and evaluation standards, and developing methods to accommodate misclassified phenotypes due to algorithm errors in downstream applications.

Didden E, Lee E, Wyckmans J, Quinn D, Perchenet L. Time to diagnosis of pulmonary hypertension and diagnostic burden: A retrospective analysis of nationwide US healthcare data. Pulm Circ 2023;13:e12188. [PMID: 36694845 PMCID: PMC9843478 DOI: 10.1002/pul2.12188] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Received: 04/01/2022] [Revised: 12/21/2022] [Accepted: 12/26/2022] [Indexed: 01/09/2023]

Swertz M, van Enckevort E, Oliveira JL, Fortier I, Bergeron J, Thurin NH, Hyde E, Kellmann A, Pahoueshnja R, Sturkenboom M, Cunnington M, Nybo Andersen AM, Marcon Y, Gonçalves G, Gini R. Towards an Interoperable Ecosystem of Research Cohort and Real-world Data Catalogues Enabling Multi-center Studies. Yearb Med Inform 2022;31:262-272. [PMID: 36463884 PMCID: PMC9719789 DOI: 10.1055/s-0042-1742522] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 12/05/2022]

Abstract

OBJECTIVES

Existing individual-level human data cover large populations on many dimensions such as lifestyle, demography, laboratory measures, clinical parameters, etc. Recent years have seen large investments in data catalogues to FAIRify data descriptions to capitalise on this great promise, i.e. make catalogue contents more Findable, Accessible, Interoperable and Reusable. However, their valuable diversity also created heterogeneity, which poses challenges to optimally exploit their richness.

METHODS

In this opinion review, we analyse catalogues for human subject research ranging from cohort studies to surveillance, administrative and healthcare records.

RESULTS

We observe that while these catalogues are heterogeneous, have various scopes, and use different terminologies, still the underlying concepts seem potentially harmonizable. We propose a unified framework to enable catalogue data sharing, with catalogues of multi-center cohorts nested as a special case in catalogues of real-world data sources. Moreover, we list recommendations to create an integrated community of metadata catalogues and an open catalogue ecosystem to sustain these efforts and maximise impact.

CONCLUSIONS

We propose to embrace the autonomy of motivated catalogue teams and invest in their collaboration via minimal standardisation efforts such as clear data licensing, persistent identifiers for linking same records between catalogues, minimal metadata 'common data elements' using shared ontologies, symmetric architectures for data sharing (push/pull) with clear provenance tracks to process updates and acknowledge original contributors. And most importantly, we encourage the creation of environments for collaboration and resource sharing between catalogue developers, building on international networks such as OpenAIRE and research data alliance, as well as domain specific ESFRIs such as BBMRI and ELIXIR.

Hardin J, Murray G, Swerdel J. Phenotype Algorithms to Identify Hidradenitis Suppurativa Using Real-World Data: Development and Validation Study. JMIR Dermatol 2022;5:e38783. [PMID: 37632892 PMCID: PMC10334943 DOI: 10.2196/38783] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/15/2022] [Revised: 11/09/2022] [Accepted: 11/10/2022] [Indexed: 11/12/2022]

Swerdel JN, Schuemie M, Murray G, Ryan PB. PheValuator 2.0: Methodological improvements for the PheValuator approach to semi-automated phenotype algorithm evaluation. J Biomed Inform 2022;135:104177. [PMID: 35995107 DOI: 10.1016/j.jbi.2022.104177] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Received: 04/18/2022] [Revised: 08/11/2022] [Accepted: 08/15/2022] [Indexed: 10/31/2022]

Abstract

PURPOSE

Phenotype algorithms are central to performing analyses using observational data. These algorithms translate the clinical idea of a health condition into an executable set of rules allowing for queries of data elements from a database. PheValuator, a software package in the Observational Health Data Sciences and Informatics (OHDSI) tool stack, provides a method to assess the performance characteristics of these algorithms, namely, sensitivity, specificity, and positive and negative predictive value. It uses machine learning to develop predictive models for determining a probabilistic gold standard of subjects for assessment of cases and non-cases of health conditions. PheValuator was developed to complement or even replace the traditional approach of algorithm validation, i.e., by expert assessment of subject records through chart review. Results in our first PheValuator paper suggest a systematic underestimation of the PPV compared to previous results using chart review. In this paper we evaluate modifications made to the method designed to improve its performance.

METHODS

The major changes to PheValuator included allowing all diagnostic conditions, clinical observations, drug prescriptions, and laboratory measurements to be included as predictors within the modeling process whereas in the prior version there were significant restrictions on the included predictors. We also have allowed for the inclusion of the temporal relationships of the predictors in the model. To evaluate the performance of the new method, we compared the results from the new and original methods against results found from the literature using traditional validation of algorithms for 19 phenotypes. We performed these tests using data from five commercial databases.

RESULTS

In the assessment aggregating all phenotype algorithms, the median difference between the PheValuator estimate and the gold standard estimate for PPV was reduced from -21 (IQR -34, -3) in Version 1.0 to 4 (IQR -3, 15) using Version 2.0. We found a median difference in specificity of 3 (IQR 1, 4.25) for Version 1.0 and 3 (IQR 1, 4) for Version 2.0. The median difference between the two versions of PheValuator and the gold standard for estimates of sensitivity was reduced from -39 (IQR -51, -20) to -16 (IQR -34, -6).

CONCLUSION

PheValuator 2.0 produces estimates for the performance characteristics of phenotype algorithms that are significantly closer to estimates from traditional validation through chart review than those of version 1.0. With this tool in researchers' toolkits, methods such as quantitative bias analysis may now be used to improve the reliability and reproducibility of research studies using observational data.
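
The idea of scoring a phenotype algorithm against a probabilistic gold standard, as described in the PURPOSE of this abstract, can be illustrated with a minimal sketch. It assumes that each subject's predicted probability of being a case is split between the "case" and "non-case" cells of the confusion matrix; the abstract does not give the package's actual computation, and the function name and data below are hypothetical rather than part of the PheValuator API.

```python
import math


def performance_from_probabilities(records):
    """Estimate performance characteristics from (prob_case, flagged) pairs.

    prob_case: model-predicted probability that the subject is a true case.
    flagged:   True if the phenotype algorithm selected the subject.
    Each subject contributes fractional counts to the confusion matrix.
    """
    tp = fp = fn = tn = 0.0
    for prob_case, flagged in records:
        if flagged:
            tp += prob_case          # expected true positives
            fp += 1.0 - prob_case    # expected false positives
        else:
            fn += prob_case          # expected false negatives
            tn += 1.0 - prob_case    # expected true negatives

    def ratio(numerator, denominator):
        return numerator / denominator if denominator > 0 else math.nan

    return {
        "sensitivity": ratio(tp, tp + fn),
        "specificity": ratio(tn, tn + fp),
        "ppv": ratio(tp, tp + fp),
        "npv": ratio(tn, tn + fn),
    }


# Hypothetical evaluation cohort: (probability of being a case, flagged by algorithm).
cohort = [(0.95, True), (0.70, True), (0.40, False), (0.05, False), (0.02, False)]
for name, value in performance_from_probabilities(cohort).items():
    print(f"{name}: {value:.3f}")
```

With probabilities of exactly 0 or 1 this reduces to the usual confusion-matrix formulas, which is a quick way to sanity-check the sketch.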

Fortin SP, Swerdel J, Sarnecki M, Doua J, Colasurdo J, Geurtsen J. Performance characteristics of code-based algorithms to identify urinary tract infections in large United States administrative claims databases. Pharmacoepidemiol Drug Saf 2022;31:953-962. [DOI: 10.1002/pds.5492] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/09/2021] [Revised: 05/23/2022] [Accepted: 06/06/2022] [Indexed: 11/06/2022]

Applying Machine Learning in Distributed Data Networks for Pharmacoepidemiologic and Pharmacovigilance Studies: Opportunities, Challenges, and Considerations. Drug Saf 2022;45:493-510. [PMID: 35579813 PMCID: PMC9112258 DOI: 10.1007/s40264-022-01158-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Accepted: 02/13/2022] [Indexed: 01/28/2023]

Dhruva SS, Jiang G, Doshi AA, Friedman DJ, Brandt E, Chen J, Akar JG, Ross JS, Ervin KR, Collison Farr K, Shah ND, Coplan P, Noseworthy PA, Zhang S, Forsyth T, Schulz WL, Yu Y, Drozda JP Jr. Feasibility of using real-world data in the evaluation of cardiac ablation catheters: a test-case of the National Evaluation System for Health Technology Coordinating Center. BMJ Surg Interv Health Technol 2021;3:e000089. [PMID: 35047806 PMCID: PMC8749235 DOI: 10.1136/bmjsit-2021-000089] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 04/08/2021] [Accepted: 09/24/2021] [Indexed: 11/26/2022]

Affiliation(s)

Sanket S Dhruva Department of Medicine, University of California San Francisco School of Medicine, San Francisco, California, USA
Guoqian Jiang Department of Health Sciences Research, Mayo Clinic, Rochester, Minnesota, USA
Amit A Doshi Mercy Clinic, St. Louis, Missouri, USA
Daniel J Friedman Department of Internal Medicine, Cardiovascular Medicine, Yale School of Medicine, New Haven, Connecticut, USA
Eric Brandt Mercy Research, Chesterfield, Missouri, USA
Jiajing Chen Mercy Research, Chesterfield, Missouri, USA
Joseph G Akar Department of Internal Medicine, Cardiovascular Medicine, Yale School of Medicine, New Haven, Connecticut, USA
Joseph S Ross Department of Internal Medicine, Yale School of Medicine, New Haven, Connecticut, USA; Center for Outcomes Research and Evaluation, Yale-New Haven Hospital, New Haven, Connecticut, USA
Keondae R Ervin National Evaluation System for health Technology Coordinating Center (NESTcc), Medical Device Innovation Consortium, Arlington, Virginia, USA
Kimberly Collison Farr Mercy Research, Chesterfield, Missouri, USA
Nilay D Shah Department of Health Sciences Research, Mayo Clinic, Rochester, Minnesota, USA
Paul Coplan Medical Device Epidemiology and Real-World Data Science, Johnson & Johnson, New Brunswick, New Jersey, USA
Peter A Noseworthy Department of Cardiovascular Medicine, Mayo Clinic, Rochester, Minnesota, USA
Shumin Zhang Medical Device Epidemiology and Real-World Data Science, Johnson & Johnson, New Brunswick, New Jersey, USA
Thomas Forsyth Mercy Research, Chesterfield, Missouri, USA
Wade L Schulz Center for Outcomes Research and Evaluation, Yale-New Haven Hospital, New Haven, Connecticut, USA; Department of Laboratory Medicine, Yale School of Medicine, New Haven, Connecticut, USA
Yue Yu Department of Health Sciences Research, Mayo Clinic, Rochester, Minnesota, USA
Joseph P Drozda, Jr. Mercy Research, Chesterfield, Missouri, USA

Ostropolets A, Zachariah P, Ryan P, Chen R, Hripcsak G. Data Consult Service: Can we use observational data to address immediate clinical needs? J Am Med Inform Assoc 2021;28:2139-2146. [PMID: 34333606 PMCID: PMC8449613 DOI: 10.1093/jamia/ocab122] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2021] [Revised: 03/30/2021] [Accepted: 06/02/2021] [Indexed: 01/08/2023] Open

Chapman M, Mumtaz S, Rasmussen LV, Karwath A, Gkoutos GV, Gao C, Thayer D, Pacheco JA, Parkinson H, Richesson RL, Jefferson E, Denaxas S, Curcin V. Desiderata for the development of next-generation electronic health record phenotype libraries. Gigascience 2021;10:giab059. [PMID: 34508578 PMCID: PMC8434766 DOI: 10.1093/gigascience/giab059] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2021] [Revised: 07/15/2021] [Accepted: 08/18/2021] [Indexed: 11/22/2022] Open

Haendel MA, Chute CG, Bennett TD, Eichmann DA, Guinney J, Kibbe WA, Payne PRO, Pfaff ER, Robinson PN, Saltz JH, Spratt H, Suver C, Wilbanks J, Wilcox AB, Williams AE, Wu C, Blacketer C, Bradford RL, Cimino JJ, Clark M, Colmenares EW, Francis PA, Gabriel D, Graves A, Hemadri R, Hong SS, Hripscak G, Jiao D, Klann JG, Kostka K, Lee AM, Lehmann HP, Lingrey L, Miller RT, Morris M, Murphy SN, Natarajan K, Palchuk MB, Sheikh U, Solbrig H, Visweswaran S, Walden A, Walters KM, Weber GM, Zhang XT, Zhu RL, Amor B, Girvin AT, Manna A, Qureshi N, Kurilla MG, Michael SG, Portilla LM, Rutter JL, Austin CP, Gersing KR. The National COVID Cohort Collaborative (N3C): Rationale, design, infrastructure, and deployment. J Am Med Inform Assoc 2021;28:427-443. [PMID: 32805036 PMCID: PMC7454687 DOI: 10.1093/jamia/ocaa196] [Citation(s) in RCA: 311] [Impact Index Per Article: 103.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2020] [Accepted: 08/14/2020] [Indexed: 01/12/2023] Open

Abstract

Objective

Coronavirus disease 2019 (COVID-19) poses societal challenges that require expeditious data and knowledge sharing. Though organizational clinical data are abundant, these are largely inaccessible to outside researchers. Statistical, machine learning, and causal analyses are most successful with large-scale data beyond what is available in any given organization. Here, we introduce the National COVID Cohort Collaborative (N3C), an open science community focused on analyzing patient-level data from many centers.

Materials and Methods

The Clinical and Translational Science Award Program and scientific community created N3C to overcome technical, regulatory, policy, and governance barriers to sharing and harmonizing individual-level clinical data. We developed solutions to extract, aggregate, and harmonize data across organizations and data models, and created a secure data enclave to enable efficient, transparent, and reproducible collaborative analytics.

Results

Organized in inclusive workstreams, we created legal agreements and governance for organizations and researchers; data extraction scripts to identify and ingest positive, negative, and possible COVID-19 cases; a data quality assurance and harmonization pipeline to create a single harmonized dataset; population of the secure data enclave with data, machine learning, and statistical analytics tools; dissemination mechanisms; and a synthetic data pilot to democratize data access.

Conclusions

The N3C has demonstrated that a multisite collaborative learning health network can overcome barriers to rapidly build a scalable infrastructure incorporating multiorganizational clinical data for COVID-19 analytics. We expect this effort to save lives by enabling rapid collaboration among clinicians, researchers, and data scientists to identify treatments and specialized care and thereby reduce the immediate and long-term impacts of COVID-19.

Affiliation(s)

Melissa A Haendel Oregon Clinical and Translational Research Institute, Oregon Health and Science University, Portland, Oregon, USA; Translational and Integrative Sciences Center, Department of Molecular Toxicology, Oregon State University, Corvallis, Oregon, USA
Christopher G Chute Schools of Medicine, Public Health, and Nursing, Johns Hopkins University, Baltimore, Maryland, USA
Tellen D Bennett Section of Informatics and Data Science, Department of Pediatrics, University of Colorado School of Medicine, University of Colorado, Aurora, Colorado, USA
David A Eichmann School of Library and Information Science, The University of Iowa, Iowa City, Iowa, USA
Justin Guinney Sage Bionetworks, Seattle, Washington, USA
Warren A Kibbe Duke University, Durham, North Carolina, USA
Philip R O Payne Institute for Informatics, Washington University in St. Louis, Saint Louis, Missouri, USA
Emily R Pfaff North Carolina Translational and Clinical Sciences Institute (NC TraCS), University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
Peter N Robinson Jackson Laboratory, Bar Harbor, Maine, USA
Joel H Saltz Department of Biomedical Informatics, Stony Brook University, Stony Brook, New York, USA
Heidi Spratt University of Texas Medical Branch, Galveston, Texas, USA
Christine Suver Sage Bionetworks, Seattle, Washington, USA
John Wilbanks Sage Bionetworks, Seattle, Washington, USA
Adam B Wilcox University of Washington, Seattle, Washington, USA
Andrew E Williams Tufts Medical Center Clinical and Translational Science Institute, Tufts Medical Center, Boston, Massachusetts, USA
Chunlei Wu Department of Integrative Structural and Computational Biology, The Scripps Research Institute, La Jolla, California, USA
Clair Blacketer Janssen Research and Development, LLC, Raritan, New Jersey, USA
Robert L Bradford North Carolina Translational and Clinical Sciences Institute (NC TraCS), University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
James J Cimino University of Alabama-Birmingham, Birmingham, Alabama, USA
Marshall Clark North Carolina Translational and Clinical Sciences Institute (NC TraCS), University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
Evan W Colmenares Department of Pharmaceutical Outcomes and Policy, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
Patricia A Francis Johns Hopkins University School of Medicine, Baltimore, Maryland, USA
Davera Gabriel Johns Hopkins University School of Medicine, Baltimore, Maryland, USA
Alexis Graves University of Iowa Institute for Clinical and Translational Science, The University of Iowa, Iowa City, Iowa, USA
Raju Hemadri National Center for Advancing Translational Science, Bethesda, Maryland, USA
Stephanie S Hong Johns Hopkins University School of Medicine, Baltimore, Maryland, USA
George Hripscak Department of Biomedical Informatics, Columbia University, New York, New York, USA
Dazhi Jiao Johns Hopkins University School of Medicine, Baltimore, Maryland, USA
Jeffrey G Klann Harvard Medical School, Boston, Massachusetts, USA
Kristin Kostka IQVIA, Durham, North Carolina, USA
Adam M Lee University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
Harold P Lehmann Johns Hopkins University School of Medicine, Baltimore, Maryland, USA
Lora Lingrey TriNetX, Cambridge, Massachusetts, USA
Robert T Miller Tufts Clinical and Translational Science Institute, Tufts University, Boston, Massachusetts, USA
Michele Morris Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, Pennsylvania, USA
Shawn N Murphy Mass General Brigham, Boston, Massachusetts, USA
Karthik Natarajan Irving Medical Center, Columbia University, New York, New York, USA
Matvey B Palchuk TriNetX, Cambridge, Massachusetts, USA
Usman Sheikh National Center for Advancing Translational Science, Bethesda, Maryland, USA
Harold Solbrig Johns Hopkins University School of Medicine, Baltimore, Maryland, USA
Shyam Visweswaran Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, Pennsylvania, USA
Anita Walden Oregon Clinical and Translational Research Institute, Oregon Health and Science University, Portland, Oregon, USA; Sage Bionetworks, Seattle, Washington, USA
Kellie M Walters North Carolina Translational and Clinical Sciences Institute (NC TraCS), University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
Griffin M Weber Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts, USA
Xiaohan Tanner Zhang Johns Hopkins University School of Medicine, Baltimore, Maryland, USA
Richard L Zhu Johns Hopkins University School of Medicine, Baltimore, Maryland, USA
Benjamin Amor Palantir Technologies, Palo Alto, California, USA
Andrew T Girvin Palantir Technologies, Palo Alto, California, USA
Amin Manna Palantir Technologies, Palo Alto, California, USA
Nabeel Qureshi Palantir Technologies, Palo Alto, California, USA
Michael G Kurilla Division of Clinical Innovation, National Center for Advancing Translational Science, Bethesda, Maryland, USA
Sam G Michael National Center for Advancing Translational Sciences, National Institutes of Health, Bethesda, Maryland, USA
Lili M Portilla Office of Strategic Alliances, National Center for Advancing Translational Sciences, National Institutes of Health, Bethesda, Maryland, USA
Joni L Rutter Office of the Director, National Center for Advancing Translational Science, Bethesda, Maryland, USA
Christopher P Austin National Center for Advancing Translational Sciences, National Institutes of Health, Bethesda, Maryland, USA
Ken R Gersing National Center for Advancing Translational Science, Bethesda, Maryland, USA

Kashyap M, Seneviratne M, Banda JM, Falconer T, Ryu B, Yoo S, Hripcsak G, Shah NH. Development and validation of phenotype classifiers across multiple sites in the observational health data sciences and informatics network. J Am Med Inform Assoc 2020;27:877-883. [PMID: 32374408 PMCID: PMC7309227 DOI: 10.1093/jamia/ocaa032] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2019] [Revised: 12/17/2019] [Accepted: 03/12/2020] [Indexed: 11/16/2022] Open

Abstract

Objective

Accurate electronic phenotyping is essential to support collaborative observational research. Supervised machine learning methods can be used to train phenotype classifiers in a high-throughput manner using imperfectly labeled data. We developed 10 phenotype classifiers using this approach and evaluated performance across multiple sites within the Observational Health Data Sciences and Informatics (OHDSI) network.

Materials and Methods

We constructed classifiers using the Automated PHenotype Routine for Observational Definition, Identification, Training and Evaluation (APHRODITE) R-package, an open-source framework for learning phenotype classifiers using datasets in the Observational Medical Outcomes Partnership Common Data Model. We labeled training data based on the presence of multiple mentions of disease-specific codes. Performance was evaluated on cohorts derived using rule-based definitions and real-world disease prevalence. Classifiers were developed and evaluated across 3 medical centers, including 1 international site.

Results

Compared to the multiple mentions labeling heuristic, classifiers showed a mean recall boost of 0.43 with a mean precision loss of 0.17. Performance decreased slightly when classifiers were shared across medical centers, with mean recall and precision decreasing by 0.08 and 0.01, respectively, at a site within the USA, and by 0.18 and 0.10, respectively, at an international site.

Discussion and Conclusion

We demonstrate a high-throughput pipeline for constructing and sharing phenotype classifiers across sites within the OHDSI network using APHRODITE. Classifiers exhibit good portability between sites within the USA but limited portability internationally, indicating that classifier generalizability may have geographic limitations. Consequently, sharing the classifier-building recipe, rather than the pretrained classifiers, may be more useful for facilitating collaborative observational research.

Okui T, Nojiri C, Kimura S, Abe K, Maeno S, Minami M, Maeda Y, Tajima N, Kawamura T, Nakashima N. Performance evaluation of case definitions of type 1 diabetes for health insurance claims data in Japan. BMC Med Inform Decis Mak 2021;21:52. [PMID: 33573645 PMCID: PMC7879626 DOI: 10.1186/s12911-021-01422-z] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2020] [Accepted: 01/25/2021] [Indexed: 12/18/2022] Open

Sprecher VP, Didden EM, Swerdel JN, Muller A. Evaluation of code-based algorithms to identify pulmonary arterial hypertension and chronic thromboembolic pulmonary hypertension patients in large administrative databases. Pulm Circ 2020;10:2045894020961713. [PMID: 33240487 PMCID: PMC7675881 DOI: 10.1177/2045894020961713] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/17/2020] [Accepted: 09/05/2020] [Indexed: 01/27/2023] Open

Abstract

Large administrative healthcare (including insurance claims) databases are used for various retrospective real-world evidence studies. However, in pulmonary arterial hypertension and chronic thromboembolic pulmonary hypertension, identifying patients retrospectively based on administrative codes remains challenging, as it relies on code combinations (algorithms) and the accuracy for patient identification of most of them is unknown. This study aimed to assess the performance of various algorithms in correctly identifying patients with pulmonary arterial hypertension or chronic thromboembolic pulmonary hypertension in administrative databases. A systematic literature review was performed to find publications detailing code-based algorithms used to identify pulmonary arterial hypertension and chronic thromboembolic pulmonary hypertension patients. PheValuator, a diagnostic predictive modelling tool, was applied to three US claims databases, yielding models that estimated the probability of a patient having the disease. These models were used to evaluate the performance characteristics of selected pulmonary arterial hypertension and chronic thromboembolic pulmonary hypertension algorithms. With increasing algorithm complexity, average positive predictive value increased (pulmonary arterial hypertension: 13.4–66.0%; chronic thromboembolic pulmonary hypertension: 10.3–75.1%) and average sensitivity decreased (pulmonary arterial hypertension: 61.5–2.7%; chronic thromboembolic pulmonary hypertension: 20.7–0.2%). Specificities and negative predictive values were high (≥97.5%) for all algorithms. Several of the algorithms performed well overall when considering all of these four performance parameters, and all algorithms performed with similar accuracy across the three claims databases studied, even though most were designed for patient identification in a specific database. 
Therefore, the objective of a study determines which algorithm may be most suitable: one- or two-component algorithms are the most inclusive, whereas three- or four-component algorithms identify the most precise pulmonary arterial hypertension or chronic thromboembolic pulmonary hypertension populations, respectively.
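
The four performance parameters named in this abstract (sensitivity, specificity, positive predictive value, negative predictive value) can all be derived from a 2x2 confusion matrix. The following is a minimal Python sketch of that arithmetic only; it is not PheValuator's implementation, and the class name, function name, and all counts are hypothetical values chosen to illustrate the trade-off between an inclusive and a precise algorithm.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ConfusionCounts:
    """Counts of a phenotype algorithm's results against a reference standard."""
    tp: int  # algorithm positive, truly has the condition
    fp: int  # algorithm positive, does not have the condition
    fn: int  # algorithm negative, truly has the condition
    tn: int  # algorithm negative, does not have the condition


def performance(c: ConfusionCounts) -> dict:
    """Return the four standard performance characteristics as fractions."""
    return {
        "sensitivity": c.tp / (c.tp + c.fn),
        "specificity": c.tn / (c.tn + c.fp),
        "ppv": c.tp / (c.tp + c.fp),
        "npv": c.tn / (c.tn + c.fn),
    }


# Hypothetical counts for 10,000 patients, 100 of whom have the condition.
inclusive = ConfusionCounts(tp=60, fp=340, fn=40, tn=9560)  # simple code list
precise = ConfusionCounts(tp=20, fp=10, fn=80, tn=9890)     # stricter criteria

if __name__ == "__main__":
    for name, counts in (("inclusive", inclusive), ("precise", precise)):
        print(name, performance(counts))
```

With these invented counts, the stricter algorithm gains positive predictive value and loses sensitivity, which is the direction of the trade-off the abstract reports as algorithm complexity increases.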

Weng C, Shah NH, Hripcsak G. Deep phenotyping: Embracing complexity and temporality-Towards scalability, portability, and interoperability. J Biomed Inform 2020;105:103433. [PMID: 32335224 PMCID: PMC7179504 DOI: 10.1016/j.jbi.2020.103433] [Citation(s) in RCA: 37] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2020] [Accepted: 04/20/2020] [Indexed: 01/07/2023]