1
|
Ahmadi N, Zoch M, Guengoeze O, Facchinello C, Mondorf A, Stratmann K, Musleh K, Erasmus HP, Tchertov J, Gebler R, Schaaf J, Frischen LS, Nasirian A, Dai J, Henke E, Tremblay D, Srisuwananukorn A, Bornhäuser M, Röllig C, Eckardt JN, Middeke JM, Wolfien M, Sedlmayr M. How to customize common data models for rare diseases: an OMOP-based implementation and lessons learned. Orphanet J Rare Dis 2024; 19:298. [PMID: 39143600 PMCID: PMC11325822 DOI: 10.1186/s13023-024-03312-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2023] [Accepted: 08/06/2024] [Indexed: 08/16/2024] Open
Abstract
BACKGROUND Given the geographical sparsity of Rare Diseases (RDs), assembling a cohort is often a challenging task. Common data models (CDM) can harmonize disparate sources of data that can be the basis of decision support systems and artificial intelligence-based studies, leading to new insights in the field. This work is sought to support the design of large-scale multi-center studies for rare diseases. METHODS In an interdisciplinary group, we derived a list of elements of RDs in three medical domains (endocrinology, gastroenterology, and pneumonology) according to specialist knowledge and clinical guidelines in an iterative process. We then defined a RDs data structure that matched all our data elements and built Extract, Transform, Load (ETL) processes to transfer the structure to a joint CDM. To ensure interoperability of our developed CDM and its subsequent usage for further RDs domains, we ultimately mapped it to Observational Medical Outcomes Partnership (OMOP) CDM. We then included a fourth domain, hematology, as a proof-of-concept and mapped an acute myeloid leukemia (AML) dataset to the developed CDM. RESULTS We have developed an OMOP-based rare diseases common data model (RD-CDM) using data elements from the three domains (endocrinology, gastroenterology, and pneumonology) and tested the CDM using data from the hematology domain. The total study cohort included 61,697 patients. After aligning our modules with those of Medical Informatics Initiative (MII) Core Dataset (CDS) modules, we leveraged its ETL process. This facilitated the seamless transfer of demographic information, diagnoses, procedures, laboratory results, and medication modules from our RD-CDM to the OMOP. For the phenotypes and genotypes, we developed a second ETL process. We finally derived lessons learned for customizing our RD-CDM for different RDs. DISCUSSION This work can serve as a blueprint for other domains as its modularized structure could be extended towards novel data types. An interdisciplinary group of stakeholders that are actively supporting the project's progress is necessary to reach a comprehensive CDM. CONCLUSION The customized data structure related to our RD-CDM can be used to perform multi-center studies to test data-driven hypotheses on a larger scale and take advantage of the analytical tools offered by the OHDSI community.
Collapse
Affiliation(s)
- Najia Ahmadi
- Institute for Medical Informatics and Biometry, Carl Gustav Carus Faculty of Medicine, TUD Dresden University of Technology, Fetscherstraße 74, 01307, Dresden, Germany.
| | - Michele Zoch
- Institute for Medical Informatics and Biometry, Carl Gustav Carus Faculty of Medicine, TUD Dresden University of Technology, Fetscherstraße 74, 01307, Dresden, Germany
| | - Oya Guengoeze
- Department of Internal Medicine I, University Hospital Frankfurt, Goethe University, Frankfurt, Germany
| | - Carlo Facchinello
- Department of Internal Medicine I, University Hospital Frankfurt, Goethe University, Frankfurt, Germany
| | - Antonia Mondorf
- Department of Internal Medicine I, University Hospital Frankfurt, Goethe University, Frankfurt, Germany
| | - Katharina Stratmann
- Department of Internal Medicine I, University Hospital Frankfurt, Goethe University, Frankfurt, Germany
| | - Khader Musleh
- Department of Internal Medicine I, University Hospital Frankfurt, Goethe University, Frankfurt, Germany
| | - Hans-Peter Erasmus
- Department of Internal Medicine I, University Hospital Frankfurt, Goethe University, Frankfurt, Germany
| | - Jana Tchertov
- Institute for Medical Informatics and Biometry, Carl Gustav Carus Faculty of Medicine, TUD Dresden University of Technology, Fetscherstraße 74, 01307, Dresden, Germany
| | - Richard Gebler
- Institute for Medical Informatics and Biometry, Carl Gustav Carus Faculty of Medicine, TUD Dresden University of Technology, Fetscherstraße 74, 01307, Dresden, Germany
| | - Jannik Schaaf
- Goethe University Frankfurt, University Hospital, Institute of Medical Informatics, Frankfurt, Germany
| | - Lena S Frischen
- University Hospital Frankfurt, Goethe University, Executive Department for Medical IT-Systems and Digitalization, Frankfurt, Germany
| | - Azadeh Nasirian
- Center of Medical Informatics, University Hospital Carl Gustav Carus, TUD Dresden University of Technology, Dresden, Germany
| | - Jiabin Dai
- Institute for Medical Informatics and Biometry, Carl Gustav Carus Faculty of Medicine, TUD Dresden University of Technology, Fetscherstraße 74, 01307, Dresden, Germany
| | - Elisa Henke
- Institute for Medical Informatics and Biometry, Carl Gustav Carus Faculty of Medicine, TUD Dresden University of Technology, Fetscherstraße 74, 01307, Dresden, Germany
| | - Douglas Tremblay
- Tisch Cancer Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | | | - Martin Bornhäuser
- Department of Internal Medicine I, University Hospital Carl Gustav Carus, TUD Dresden University of Technology, Dresden, Germany
| | - Christoph Röllig
- Department of Internal Medicine I, University Hospital Carl Gustav Carus, TUD Dresden University of Technology, Dresden, Germany
| | - Jan-Niklas Eckardt
- Department of Internal Medicine I, University Hospital Carl Gustav Carus, TUD Dresden University of Technology, Dresden, Germany
- Else-Kroener-Fresenius-Center for Digital Health, TUD Dresden University of Technology, Dresden, Germany
| | - Jan Moritz Middeke
- Department of Internal Medicine I, University Hospital Carl Gustav Carus, TUD Dresden University of Technology, Dresden, Germany
- Else-Kroener-Fresenius-Center for Digital Health, TUD Dresden University of Technology, Dresden, Germany
| | - Markus Wolfien
- Institute for Medical Informatics and Biometry, Carl Gustav Carus Faculty of Medicine, TUD Dresden University of Technology, Fetscherstraße 74, 01307, Dresden, Germany
- Center for Scalable Data Analytics and Artificial Intelligence (ScaDS.AI) Dresden/Leipzig, Dresden, Germany
| | - Martin Sedlmayr
- Institute for Medical Informatics and Biometry, Carl Gustav Carus Faculty of Medicine, TUD Dresden University of Technology, Fetscherstraße 74, 01307, Dresden, Germany
| |
Collapse
|
2
|
Scheible R, Thomczyk F, Blum M, Rautenberg M, Prunotto A, Yazijy S, Boeker M. Integrating row level security in i2b2: segregation of medical records into data marts without data replication and synchronization. JAMIA Open 2023; 6:ooad068. [PMID: 37583654 PMCID: PMC10425194 DOI: 10.1093/jamiaopen/ooad068] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2023] [Revised: 07/28/2023] [Accepted: 08/03/2023] [Indexed: 08/17/2023] Open
Abstract
Objective i2b2 offers the possibility to store biomedical data of different projects in subject oriented data marts of the data warehouse, which potentially requires data replication between different projects and also data synchronization in case of data changes. We present an approach that can save this effort and assess its query performance in a case study that reflects real-world scenarios. Material and Methods For data segregation, we used PostgreSQL's row level security (RLS) feature, the unit test framework pgTAP for validation and testing as well as the i2b2 application. No change of the i2b2 code was required. Instead, to leverage orchestration and deployment, we additionally implemented a command line interface (CLI). We evaluated performance using 3 different queries generated by i2b2, which we performed on an enlarged Harvard demo dataset. Results We introduce the open source Python CLI i2b2rls, which orchestrates and manages security roles to implement data marts so that they do not need to be replicated and synchronized as different i2b2 projects. Our evaluation showed that our approach is on average 3.55 and on median 2.71 times slower compared to classic i2b2 data marts, but has more flexibility and easier setup. Conclusion The RLS-based approach is particularly useful in a scenario with many projects, where data is constantly updated, user and group requirements change frequently or complex user authorization requirements have to be defined. The approach applies to both the i2b2 interface and direct database access.
Collapse
Affiliation(s)
- Raphael Scheible
- Institute of Artificial Intelligence and Informatics in Medicine (AIIM), Chair of Medical Informatics, University Hospital rechts der Isar, School of Medicine, Technical University of Munich, Munich, Germany
- Center for Chronic Immunodeficiency (CCI), Medical Center, Faculty of Medicine, University of Freiburg, Freiburg, Germany
| | - Fabian Thomczyk
- Data Inintegration Center (DIC), University of Freiburg, Freiburg, Germany
| | - Marco Blum
- Data Inintegration Center (DIC), University of Freiburg, Freiburg, Germany
| | - Micha Rautenberg
- Institute of Medical Biometry and Statistics, Medical Center, Faculty of Medicine, University of Freiburg, Freiburg, Germany
- Zentrum für Digitalisierung und Informationstechnologie (ZDI), Medical Center, University of Freiburg, Freiburg, Germany
| | - Andrea Prunotto
- Data Inintegration Center (DIC), University of Freiburg, Freiburg, Germany
| | - Suhail Yazijy
- Institute of Medical Biometry and Statistics, Medical Center, Faculty of Medicine, University of Freiburg, Freiburg, Germany
| | - Martin Boeker
- Institute of Artificial Intelligence and Informatics in Medicine (AIIM), Chair of Medical Informatics, University Hospital rechts der Isar, School of Medicine, Technical University of Munich, Munich, Germany
| |
Collapse
|
3
|
Wagholikar KB, Ainsworth L, Zelle D, Chaney K, Mendis M, Klann J, Blood AJ, Miller A, Chulyadyo R, Oates M, Gordon WJ, Aronson SJ, Scirica BM, Murphy SN. I2b2-etl: Python application for importing electronic health data into the informatics for integrating biology and the bedside platform. Bioinformatics 2022; 38:4833-4836. [PMID: 36053173 PMCID: PMC9563689 DOI: 10.1093/bioinformatics/btac595] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2022] [Revised: 07/15/2022] [Accepted: 08/31/2022] [Indexed: 11/14/2022] Open
Abstract
MOTIVATION The i2b2 platform is used at major academic health institutions and research consortia for querying for electronic health data. However, a major obstacle for wider utilization of the platform is the complexity of data loading that entails a steep curve of learning the platform's complex data schemas. To address this problem, we have developed the i2b2-etl package that simplifies the data loading process, which will facilitate wider deployment and utilization of the platform. RESULTS We have implemented i2b2-etl as a Python application that imports ontology and patient data using simplified input file schemas and provides inbuilt record number de-identification and data validation. We describe a real-world deployment of i2b2-etl for a population-management initiative at MassGeneral Brigham. AVAILABILITY AND IMPLEMENTATION i2b2-etl is a free, open-source application implemented in Python available under the Mozilla 2 license. The application can be downloaded as compiled docker images. A live demo is available at https://i2b2clinical.org/demo-i2b2etl/ (username: demo, password: Etl@2021). SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Kavishwar B Wagholikar
- Harvard Medical School, Boston, MA 02115, USA.,Massachusetts General Hospital, Boston, MA 02114, USA
| | | | - David Zelle
- Brigham and Women's Hospital, Boston, MA 02115, USA
| | - Kira Chaney
- Brigham and Women's Hospital, Boston, MA 02115, USA
| | | | - Jeffery Klann
- Harvard Medical School, Boston, MA 02115, USA.,Massachusetts General Hospital, Boston, MA 02114, USA
| | - Alexander J Blood
- Harvard Medical School, Boston, MA 02115, USA.,Brigham and Women's Hospital, Boston, MA 02115, USA
| | | | | | | | - William J Gordon
- Harvard Medical School, Boston, MA 02115, USA.,Mass General Brigham, Boston, MA 02199, USA.,Brigham and Women's Hospital, Boston, MA 02115, USA
| | | | - Benjamin M Scirica
- Harvard Medical School, Boston, MA 02115, USA.,Brigham and Women's Hospital, Boston, MA 02115, USA
| | - Shawn N Murphy
- Harvard Medical School, Boston, MA 02115, USA.,Massachusetts General Hospital, Boston, MA 02114, USA
| |
Collapse
|
4
|
Wagholikar KB, Zelle D, Ainsworth L, Chaney K, Blood AJ, Miller A, Chulyadyo R, Oates M, Gordon WJ, Aronson SJ, Scirica BM, Murphy SN. Use of automatic SQL generation interface to enhance transparency and validity of health-data analysis. INFORMATICS IN MEDICINE UNLOCKED 2022; 31. [PMID: 35874460 PMCID: PMC9306316 DOI: 10.1016/j.imu.2022.100996] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open
Abstract
Analysis of health data typically requires development of queries using structured query language (SQL) by a data-analyst. As the SQL queries are manually created, they are prone to errors. In addition, accurate implementation of the queries depends on effective communication with clinical experts, that further makes the analysis error prone. As a potential resolution, we explore an alternative approach wherein a graphical interface that automatically generates the SQL queries is used to perform the analysis. The latter allows clinical experts to directly perform complex queries on the data, despite their unfamiliarity with SQL syntax. The interface provides an intuitive understanding of the query logic which makes the analysis transparent and comprehensible to the clinical study-staff, thereby enhancing the transparency and validity of the analysis. This study demonstrates the feasibility of using a user-friendly interface that automatically generate SQL for analysis of health data. It outlines challenges that will be useful for designing user-friendly tools to improve transparency and reproducibility of data analysis.
Collapse
|
5
|
Ye Y, Barapatre S, Davis MK, Elliston KO, Davatzikos C, Fedorov A, Fillion-Robin JC, Foster I, Gilbertson JR, Lasso A, Miller JV, Morgan M, Pieper S, Raumann BE, Sarachan BD, Savova G, Silverstein JC, Taylor DP, Zelnis JB, Zhang GQ, Cuticchia J, Becich MJ. Open-source Software Sustainability Models: Initial White Paper From the Informatics Technology for Cancer Research Sustainability and Industry Partnership Working Group. J Med Internet Res 2021; 23:e20028. [PMID: 34860667 PMCID: PMC8686402 DOI: 10.2196/20028] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2020] [Revised: 12/14/2020] [Accepted: 09/23/2021] [Indexed: 11/13/2022] Open
Abstract
Background The National Cancer Institute Informatics Technology for Cancer Research (ITCR) program provides a series of funding mechanisms to create an ecosystem of open-source software (OSS) that serves the needs of cancer research. As the ITCR ecosystem substantially grows, it faces the challenge of the long-term sustainability of the software being developed by ITCR grantees. To address this challenge, the ITCR sustainability and industry partnership working group (SIP-WG) was convened in 2019. Objective The charter of the SIP-WG is to investigate options to enhance the long-term sustainability of the OSS being developed by ITCR, in part by developing a collection of business model archetypes that can serve as sustainability plans for ITCR OSS development initiatives. The working group assembled models from the ITCR program, from other studies, and from the engagement of its extensive network of relationships with other organizations (eg, Chan Zuckerberg Initiative, Open Source Initiative, and Software Sustainability Institute) in support of this objective. Methods This paper reviews the existing sustainability models and describes 10 OSS use cases disseminated by the SIP-WG and others, including 3D Slicer, Bioconductor, Cytoscape, Globus, i2b2 (Informatics for Integrating Biology and the Bedside) and tranSMART, Insight Toolkit, Linux, Observational Health Data Sciences and Informatics tools, R, and REDCap (Research Electronic Data Capture), in 10 sustainability aspects: governance, documentation, code quality, support, ecosystem collaboration, security, legal, finance, marketing, and dependency hygiene. Results Information available to the public reveals that all 10 OSS have effective governance, comprehensive documentation, high code quality, reliable dependency hygiene, strong user and developer support, and active marketing. These OSS include a variety of licensing models (eg, general public license version 2, general public license version 3, Berkeley Software Distribution, and Apache 3) and financial models (eg, federal research funding, industry and membership support, and commercial support). However, detailed information on ecosystem collaboration and security is not publicly provided by most OSS. Conclusions We recommend 6 essential attributes for research software: alignment with unmet scientific needs, a dedicated development team, a vibrant user community, a feasible licensing model, a sustainable financial model, and effective product management. We also stress important actions to be considered in future ITCR activities that involve the discussion of the sustainability and licensing models for ITCR OSS, the establishment of a central library, the allocation of consulting resources to code quality control, ecosystem collaboration, security, and dependency hygiene.
Collapse
Affiliation(s)
- Ye Ye
- Department of Biomedical Informatics, School of Medicine, University of Pittsburgh, Pittsburgh, PA, United States
| | - Seemran Barapatre
- Department of Biomedical Informatics, School of Medicine, University of Pittsburgh, Pittsburgh, PA, United States
| | - Michael K Davis
- Department of Biomedical Informatics, School of Medicine, University of Pittsburgh, Pittsburgh, PA, United States
| | - Keith O Elliston
- Axiomedix, Inc., Bedford, MA, United States.,PHEMI Systems Corp., Vancouver, BC, Canada.,tranSMART foundation, Wakefield, MA, United States
| | - Christos Davatzikos
- Department of Radiology, School of Medicine, University of Pennsylvania, Philadelphia, PA, United States
| | - Andrey Fedorov
- Brigham and Women's Hospital, Harvard Medical School, Boston, MA, United States
| | | | - Ian Foster
- Department of Computer Science, University of Chicago, Chicago, IL, United States
| | - John R Gilbertson
- Department of Biomedical Informatics, School of Medicine, University of Pittsburgh, Pittsburgh, PA, United States
| | - Andras Lasso
- The Perk Lab for Percutaneous Surgery, School of Computing, Queen's University, Kingston, ON, Canada
| | | | - Martin Morgan
- Department of Biostatistics and Bioinformatics, Roswell Park Comprehensive Cancer Center, Buffalo, NY, United States
| | | | | | | | - Guergana Savova
- Boston Children's Hospital, Harvard Medical School, Boston, MA, United States
| | - Jonathan C Silverstein
- Department of Biomedical Informatics, School of Medicine, University of Pittsburgh, Pittsburgh, PA, United States
| | - Donald P Taylor
- Department of Biomedical Informatics, School of Medicine, University of Pittsburgh, Pittsburgh, PA, United States
| | - Joyce B Zelnis
- Department of Biomedical Informatics, School of Medicine, University of Pittsburgh, Pittsburgh, PA, United States
| | - Guo-Qiang Zhang
- The University of Texas Health Science Center at Houston, Houston, TX, United States
| | | | - Michael J Becich
- Department of Biomedical Informatics, School of Medicine, University of Pittsburgh, Pittsburgh, PA, United States
| |
Collapse
|
6
|
Cloud Services for Patient Cohort Identification Using the Informatics for Integrating Biology and the Bedside Platform. BIOMED RESEARCH INTERNATIONAL 2020; 2020:2851713. [PMID: 32724799 PMCID: PMC7366204 DOI: 10.1155/2020/2851713] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/13/2019] [Revised: 06/08/2020] [Accepted: 06/15/2020] [Indexed: 11/17/2022]
Abstract
Despite the widespread use of the “Informatics for Integrating Biology and the Bedside” (i2b2) platform, there are substantial challenges for loading electronic health records (EHR) into i2b2 and for querying i2b2. We have previously presented a simplified framework for semantic abstraction of EHR records into i2b2. Building on our previous work, we have created a proof-of-concept implementation of cloud services on an i2b2 data store for cohort identification. Specifically, we have implemented a graphical user interface (GUI) that declares the key components for data import, transformation, and query of EHR data. The GUI integrates with Azure cloud services to create data pipelines for importing EHR data into i2b2, creation of derived facts, and querying for generating Sankey-like flow diagrams that characterize the patient cohorts. We have evaluated the implementation using the real-world MIMIC-III dataset. We discuss the key features of this implementation and direction for future work, which will advance the efforts of the research community for patient cohort identification.
Collapse
|
7
|
Spengler H, Lang C, Mahapatra T, Gatz I, Kuhn KA, Prasser F. Enabling Agile Clinical and Translational Data Warehousing: Platform Development and Evaluation. JMIR Med Inform 2020; 8:e15918. [PMID: 32706673 PMCID: PMC7404007 DOI: 10.2196/15918] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2019] [Revised: 02/16/2020] [Accepted: 05/06/2020] [Indexed: 01/16/2023] Open
Abstract
Background Modern data-driven medical research provides new insights into the development and course of diseases and enables novel methods of clinical decision support. Clinical and translational data warehouses, such as Informatics for Integrating Biology and the Bedside (i2b2) and tranSMART, are important infrastructure components that provide users with unified access to the large heterogeneous data sets needed to realize this and support use cases such as cohort selection, hypothesis generation, and ad hoc data analysis. Objective Often, different warehousing platforms are needed to support different use cases and different types of data. Moreover, to achieve an optimal data representation within the target systems, specific domain knowledge is needed when designing data-loading processes. Consequently, informaticians need to work closely with clinicians and researchers in short iterations. This is a challenging task as installing and maintaining warehousing platforms can be complex and time consuming. Furthermore, data loading typically requires significant effort in terms of data preprocessing, cleansing, and restructuring. The platform described in this study aims to address these challenges. Methods We formulated system requirements to achieve agility in terms of platform management and data loading. The derived system architecture includes a cloud infrastructure with unified management interfaces for multiple warehouse platforms and a data-loading pipeline with a declarative configuration paradigm and meta-loading approach. The latter compiles data and configuration files into forms required by existing loading tools, thereby automating a wide range of data restructuring and cleansing tasks. We demonstrated the fulfillment of the requirements and the originality of our approach by an experimental evaluation and a comparison with previous work. Results The platform supports both i2b2 and tranSMART with built-in security. Our experiments showed that the loading pipeline accepts input data that cannot be loaded with existing tools without preprocessing. Moreover, it lowered efforts significantly, reducing the size of configuration files required by factors of up to 22 for tranSMART and 1135 for i2b2. The time required to perform the compilation process was roughly equivalent to the time required for actual data loading. Comparison with other tools showed that our solution was the only tool fulfilling all requirements. Conclusions Our platform significantly reduces the efforts required for managing clinical and translational warehouses and for loading data in various formats and structures, such as complex entity-attribute-value structures often found in laboratory data. Moreover, it facilitates the iterative refinement of data representations in the target platforms, as the required configuration files are very compact. The quantitative measurements presented are consistent with our experiences of significantly reduced efforts for building warehousing platforms in close cooperation with medical researchers. Both the cloud-based hosting infrastructure and the data-loading pipeline are available to the community as open source software with comprehensive documentation.
Collapse
Affiliation(s)
- Helmut Spengler
- Institute of Medical Informatics, Statistics and Epidemiology, University Medical Center rechts der Isar, School of Medicine, Technical University of Munich, Munich, Germany
| | - Claudia Lang
- Institute of Medical Informatics, Statistics and Epidemiology, University Medical Center rechts der Isar, School of Medicine, Technical University of Munich, Munich, Germany
| | - Tanmaya Mahapatra
- Institute of Medical Informatics, Statistics and Epidemiology, University Medical Center rechts der Isar, School of Medicine, Technical University of Munich, Munich, Germany
| | - Ingrid Gatz
- Institute of Medical Informatics, Statistics and Epidemiology, University Medical Center rechts der Isar, School of Medicine, Technical University of Munich, Munich, Germany
| | - Klaus A Kuhn
- Institute of Medical Informatics, Statistics and Epidemiology, University Medical Center rechts der Isar, School of Medicine, Technical University of Munich, Munich, Germany
| | - Fabian Prasser
- Charité - Universitätsmedizin Berlin, Berlin, Germany.,Berlin Institute of Health, Berlin, Germany
| |
Collapse
|
8
|
Campion TR, Pompea ST, Turner SP, Sholle ET, Cole CL, Kaushal R. A Method for Integrating Healthcare Provider Organization and Research Sponsor Systems and Workflows to Support Large-Scale Studies. AMIA JOINT SUMMITS ON TRANSLATIONAL SCIENCE PROCEEDINGS. AMIA JOINT SUMMITS ON TRANSLATIONAL SCIENCE 2019; 2019:648-655. [PMID: 31259020 PMCID: PMC6568055] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
Healthcare provider organizations (HPOs) increasingly participate in large-scale research efforts sponsored by external organizations that require use of consent management systems that may not integrate seamlessly with local workflows. The resulting inefficiency can hinder the ability of HPOs to participate in studies. To overcome this challenge, we developed a method using REDCap, a widely adopted electronic data capture system, and novel middleware that can potentially generalize to other settings. In this paper, we describe the method, illustrate its use to support the NIHAll of Us Research Program and PCORI ADAPTABLE studies at our HPO, and encourage other HPOs to test replicability of the method to facilitate similar research efforts. Code is available on GitHub at https://github.com/wcmc-research-informatics/.
Collapse
Affiliation(s)
- Thomas R Campion
- Information Technologies & Services Department, Weill Cornell Medicine, New York, NY
- Department of Healthcare Policy & Research, Weill Cornell Medicine, New York, NY
- Department of Pediatrics, Weill Cornell Medicine, New York, NY
| | - Sean T Pompea
- Information Technologies & Services Department, Weill Cornell Medicine, New York, NY
| | - Scott P Turner
- Information Technologies & Services Department, Weill Cornell Medicine, New York, NY
| | - Evan T Sholle
- Information Technologies & Services Department, Weill Cornell Medicine, New York, NY
| | - Curtis L Cole
- Information Technologies & Services Department, Weill Cornell Medicine, New York, NY
- Department of Medicine, Weill Cornell Medicine, New York, NY
| | - Rainu Kaushal
- Department of Healthcare Policy & Research, Weill Cornell Medicine, New York, NY
| |
Collapse
|
9
|
Wagholikar KB, Ainsworth L, Vernekar VP, Pathak A, Glynn C, Zelle D, Zagade A, Karipineni N, Herrick CD, McPartlin M, Bui TV, Mendis M, Klann J, Oates M, Gordon W, Cannon C, Patel R, Aronson SJ, MacRae CA, Scirica BM, Murphy SN. Extending i2b2 into a framework for semantic abstraction of EHR to facilitate rapid development and portability of Health IT applications. AMIA JOINT SUMMITS ON TRANSLATIONAL SCIENCE PROCEEDINGS. AMIA JOINT SUMMITS ON TRANSLATIONAL SCIENCE 2019; 2019:370-378. [PMID: 31258990 PMCID: PMC6568124] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
The wide gap between a care provider's conceptualization of electronic health record (EHR) and the structures for electronic health record (EHR) data storage and transmission, presents a multitude of obstacles for development of innovative Health IT applications. While developers model the EHR view of the clinicians at one end, they work with a different data view to construct health IT applications. Although there has been considerable progress to bridge this gap by evolution of developer friendly standards and tools for terminology mapping and data warehousing, there is a need for a simplified framework to facilitate development of interoperable applications. To this end, we propose a framework for creating a layer of semantic abstraction on the EHR and describe preliminary work on the implementation of this framework for management of hyperlipidemia and hypertension. Our goal is to facilitate the rapid development and portability of Health IT applications.
Collapse
Affiliation(s)
- Kavishwar B Wagholikar
- Harvard Medical School, Boston, MA
- Massachusetts General Hospital, Boston, MA
- Partners Healthcare Boston, MA
| | | | | | | | | | | | | | - Neelima Karipineni
- Harvard Medical School, Boston, MA
- Brigham and Women's Hospital, Boston, MA
| | | | - Marian McPartlin
- Harvard Medical School, Boston, MA
- Brigham and Women's Hospital, Boston, MA
- Massachusetts General Hospital, Boston, MA
- Persistent Systems, Pune, India
- Partners Healthcare Boston, MA
| | - Tiffany V Bui
- Harvard Medical School, Boston, MA
- Brigham and Women's Hospital, Boston, MA
- Massachusetts General Hospital, Boston, MA
- Persistent Systems, Pune, India
- Partners Healthcare Boston, MA
| | | | - Jeffery Klann
- Harvard Medical School, Boston, MA
- Massachusetts General Hospital, Boston, MA
- Partners Healthcare Boston, MA
| | | | | | - Christopher Cannon
- Harvard Medical School, Boston, MA
- Brigham and Women's Hospital, Boston, MA
- Massachusetts General Hospital, Boston, MA
- Persistent Systems, Pune, India
- Partners Healthcare Boston, MA
| | | | | | - Calum A MacRae
- Harvard Medical School, Boston, MA
- Brigham and Women's Hospital, Boston, MA
| | - Benjamin M Scirica
- Harvard Medical School, Boston, MA
- Brigham and Women's Hospital, Boston, MA
| | - Shawn N Murphy
- Harvard Medical School, Boston, MA
- Massachusetts General Hospital, Boston, MA
| |
Collapse
|