1
|
Kryszyn J, Cywoniuk K, Smolik WT, Wanta D, Wróblewski P, Midura M. Performance of an openEHR based hospital information system. Int J Med Inform 2022; 162:104757. [PMID: 35395475 DOI: 10.1016/j.ijmedinf.2022.104757] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2021] [Revised: 01/28/2022] [Accepted: 03/27/2022] [Indexed: 11/21/2022]
Abstract
BACKGROUND A desirable feature of hospital information systems is interoperability, which is generally quite limited due to the lack of standardization of the data model. This results in high development and maintenance costs for such systems. The openEHR standard addresses this problem. Due to its two-level modelling, it allows the separation of demographic and medical data and the storage of this data so that it can be easily processed and exchanged. However, it introduces an additional software layer that may affect system performance. This article examines the performance of a system based on the openEHR standard and compares it with the performance of a proprietary system developed in a classic way. METHODS Two hospital information systems with the same functionality were designed and developed. One was based on an openEHR server, and another was using proprietary data model having both demographic and medical data. Systems were deployed on Azure platform and load tests using JMeter were conducted to calculate statistics of elapsed time of requests as well as throughput of both systems. RESULTS Endpoints which fetch only demographic data had the same performance, but when medical data had to be queried, a decrease in performance of the openEHR based system was noticed. The system based on a proprietary data had about 6 times bigger throughput in terms of medical data fetching. CONCLUSIONS OpenEHR adds another layer to the architecture of a hospital information system which might result in performance issues. Such a system must be designed to operate on a sufficiently strong architecture if it is intended to serve many users.
Collapse
Affiliation(s)
- Jacek Kryszyn
- Warsaw University of Technology, Faculty of Electronics and Information Technology, Institute of Radioelectronics and Multimedia Technology, Nowowiejska 15/19, 00-665 Warsaw, Poland.
| | - Kamil Cywoniuk
- Warsaw University of Technology, Faculty of Electronics and Information Technology, Institute of Radioelectronics and Multimedia Technology, Nowowiejska 15/19, 00-665 Warsaw, Poland
| | - Waldemar T Smolik
- Warsaw University of Technology, Faculty of Electronics and Information Technology, Institute of Radioelectronics and Multimedia Technology, Nowowiejska 15/19, 00-665 Warsaw, Poland
| | - Damian Wanta
- Warsaw University of Technology, Faculty of Electronics and Information Technology, Institute of Radioelectronics and Multimedia Technology, Nowowiejska 15/19, 00-665 Warsaw, Poland
| | - Przemysław Wróblewski
- Warsaw University of Technology, Faculty of Electronics and Information Technology, Institute of Radioelectronics and Multimedia Technology, Nowowiejska 15/19, 00-665 Warsaw, Poland
| | - Mateusz Midura
- Warsaw University of Technology, Faculty of Electronics and Information Technology, Institute of Radioelectronics and Multimedia Technology, Nowowiejska 15/19, 00-665 Warsaw, Poland
| |
Collapse
|
2
|
Li M, Cai H, Zhi Y, Fu Z, Duan H, Lu X. A configurable method for clinical quality measurement through electronic health records based on openEHR and CQL. BMC Med Inform Decis Mak 2022; 22:37. [PMID: 35144618 PMCID: PMC8830083 DOI: 10.1186/s12911-022-01763-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2021] [Accepted: 01/19/2022] [Indexed: 11/28/2022] Open
Abstract
Background One of the primary obstacles to measure clinical quality is the lack of configurable solutions to make computers understand and compute clinical quality indicators. The paper presents a solution that can help clinical staff develop clinical quality measurement more easily and generate the corresponding data reports and visualization by a configurable method based on openEHR and Clinical Quality Language (CQL). Methods First, expression logic adopted from CQL was combined with openEHR to express clinical quality indicators. Archetype binding provides the clinical information models used in expression logic, terminology binding makes the medical concepts consistent used in clinical quality artifacts and metadata is regarded as the essential component for sharing and management. Then, a systematic approach was put forward to facilitate the development of clinical quality indicators and the generation of corresponding data reports and visualization. Finally, clinical physicians were invited to test our system and give their opinions. Results With the combination of openEHR and CQL, 64 indicators from Centers for Medicare & Medicaid Services (CMS) were expressed for verification and a complicated indicator was shown as an example. 68 indicators from 17 different scenes in the local environment were also expressed and computed in our system. A platform was built to support the development of indicators in a unified way. Also, an execution engine can parse and compute these indicators. Based on a clinical data repository (CDR), indicators were used to generate data reports and visualization and shown in a dashboard. Conclusion Our method is capable of expressing clinical quality indicators formally. With the computer-interpretable indicators, a systematic approach can make it more easily to define clinical indicators and generate medical data reports and visualization, and facilitate the adoption of clinical quality measurements.
Collapse
Affiliation(s)
- Mengyang Li
- College of Biomedical Engineering and Instrument Science, Zhejiang University, Zheda Road 38, Hangzhou, 310027, China.,Key Laboratory for Biomedical Engineering, Ministry of Education, Hangzhou, China
| | - Hailing Cai
- College of Biomedical Engineering and Instrument Science, Zhejiang University, Zheda Road 38, Hangzhou, 310027, China.,Key Laboratory for Biomedical Engineering, Ministry of Education, Hangzhou, China
| | - Yunlong Zhi
- College of Biomedical Engineering and Instrument Science, Zhejiang University, Zheda Road 38, Hangzhou, 310027, China.,Key Laboratory for Biomedical Engineering, Ministry of Education, Hangzhou, China
| | - Zehai Fu
- College of Biomedical Engineering and Instrument Science, Zhejiang University, Zheda Road 38, Hangzhou, 310027, China.,Key Laboratory for Biomedical Engineering, Ministry of Education, Hangzhou, China
| | - Huilong Duan
- College of Biomedical Engineering and Instrument Science, Zhejiang University, Zheda Road 38, Hangzhou, 310027, China.,Key Laboratory for Biomedical Engineering, Ministry of Education, Hangzhou, China
| | - Xudong Lu
- College of Biomedical Engineering and Instrument Science, Zhejiang University, Zheda Road 38, Hangzhou, 310027, China. .,Key Laboratory for Biomedical Engineering, Ministry of Education, Hangzhou, China. .,School of Industrial Engineering, Eindhoven University of Technology, Eindhoven, The Netherlands.
| |
Collapse
|
3
|
Li M, Cai H, Nan S, Li J, Lu X, Duan H. A Patient-Screening Tool for Clinical Research Based on Electronic Health Records Using OpenEHR: Development Study. JMIR Med Inform 2021; 9:e33192. [PMID: 34673526 PMCID: PMC8569542 DOI: 10.2196/33192] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2021] [Revised: 09/27/2021] [Accepted: 09/27/2021] [Indexed: 11/28/2022] Open
Abstract
Background The widespread adoption of electronic health records (EHRs) has facilitated the secondary use of EHR data for clinical research. However, screening eligible patients from EHRs is a challenging task. The concepts in eligibility criteria are not completely matched with EHRs, especially derived concepts. The lack of high-level expression of Structured Query Language (SQL) makes it difficult and time consuming to express them. The openEHR Expression Language (EL) as a domain-specific language based on clinical information models shows promise to represent complex eligibility criteria. Objective The study aims to develop a patient-screening tool based on EHRs for clinical research using openEHR to solve concept mismatch and improve query performance. Methods A patient-screening tool based on EHRs using openEHR was proposed. It uses the advantages of information models and EL in openEHR to provide high-level expressions and improve query performance. First, openEHR archetypes and templates were chosen to define concepts called simple concepts directly from EHRs. Second, openEHR EL was used to generate derived concepts by combining simple concepts and constraints. Third, a hierarchical index corresponding to archetypes in Elasticsearch (ES) was generated to improve query performance for subqueries and join queries related to the derived concepts. Finally, we realized a patient-screening tool for clinical research. Results In total, 500 sentences randomly selected from 4691 eligibility criteria in 389 clinical trials on stroke from the Chinese Clinical Trial Registry (ChiCTR) were evaluated. An openEHR-based clinical data repository (CDR) in a grade A tertiary hospital in China was considered as an experimental environment. Based on these, 589 medical concepts were found in the 500 sentences. Of them, 513 (87.1%) concepts could be represented, while the others could not be, because of a lack of information models and coarse-grained requirements. In addition, our case study on 6 queries demonstrated that our tool shows better query performance among 4 cases (66.67%). Conclusions We developed a patient-screening tool using openEHR. It not only helps solve concept mismatch but also improves query performance to reduce the burden on researchers. In addition, we demonstrated a promising solution for secondary use of EHR data using openEHR, which can be referenced by other researchers.
Collapse
Affiliation(s)
- Mengyang Li
- College of Biomedical Engineering and Instrument Science, Zhejiang University, Hangzhou, China.,Key Laboratory for Biomedical Engineering, Ministry of Education, Hangzhou, Zhejiang, China
| | - Hailing Cai
- College of Biomedical Engineering and Instrument Science, Zhejiang University, Hangzhou, China.,Key Laboratory for Biomedical Engineering, Ministry of Education, Hangzhou, Zhejiang, China
| | - Shan Nan
- Hainan University School of Biomedical Engineering, Haikou City, China
| | - Jialin Li
- College of Biomedical Engineering and Instrument Science, Zhejiang University, Hangzhou, China.,Key Laboratory for Biomedical Engineering, Ministry of Education, Hangzhou, Zhejiang, China
| | - Xudong Lu
- College of Biomedical Engineering and Instrument Science, Zhejiang University, Hangzhou, China.,Key Laboratory for Biomedical Engineering, Ministry of Education, Hangzhou, Zhejiang, China
| | - Huilong Duan
- College of Biomedical Engineering and Instrument Science, Zhejiang University, Hangzhou, China.,Key Laboratory for Biomedical Engineering, Ministry of Education, Hangzhou, Zhejiang, China
| |
Collapse
|
4
|
Min L, Atalag K, Tian Q, Chen Y, Lu X. Verifying the Feasibility of Implementing Semantic Interoperability in Different Countries Based on the OpenEHR Approach: Comparative Study of Acute Coronary Syndrome Registries. JMIR Med Inform 2021; 9:e31288. [PMID: 34665150 PMCID: PMC8564664 DOI: 10.2196/31288] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2021] [Revised: 07/19/2021] [Accepted: 08/01/2021] [Indexed: 11/13/2022] Open
Abstract
Background The semantic interoperability of health care information has been a critical challenge in medical informatics and has influenced the integration, sharing, analysis, and use of medical big data. International standard organizations have developed standards, approaches, and models to improve and implement semantic interoperability. The openEHR approach—one of the standout semantic interoperability approaches—has been implemented worldwide to improve semantic interoperability based on reused archetypes. Objective This study aimed to verify the feasibility of implementing semantic interoperability in different countries by comparing the openEHR-based information models of 2 acute coronary syndrome (ACS) registries from China and New Zealand. Methods A semantic archetype comparison method was proposed to determine the semantics reuse degree of reused archetypes in 2 ACS-related clinical registries from 2 countries. This method involved (1) determining the scope of reused archetypes; (2) identifying corresponding data items within corresponding archetypes; (3) comparing the semantics of corresponding data items; and (4) calculating the number of mappings in corresponding data items and analyzing results. Results Among the related archetypes in the two ACS-related, openEHR-based clinical registries from China and New Zealand, there were 8 pairs of reusable archetypes, which included 89 pairs of corresponding data items and 120 noncorresponding data items. Of the 89 corresponding data item pairs, 87 pairs (98%) were mappable and therefore supported semantic interoperability, and 71 pairs (80%) were labeled as “direct mapping” data items. Of the 120 noncorresponding data items, 114 (95%) data items were generated via archetype evolution, and 6 (5%) data items were generated via archetype localization. Conclusions The results of the semantic comparison between the two ACS-related clinical registries prove the feasibility of establishing the semantic interoperability of health care data from different countries based on the openEHR approach. Archetype reuse provides data on the degree to which semantic interoperability exists when using the openEHR approach. Although the openEHR community has effectively promoted archetype reuse and semantic interoperability by providing archetype modeling methods, tools, model repositories, and archetype design patterns, the uncontrolled evolution of archetypes and inconsistent localization have resulted in major challenges for achieving higher levels of semantic interoperability.
Collapse
Affiliation(s)
- Lingtong Min
- School of Electronics and Information, Northwestern Polytechnical University, Xi'an, China
| | - Koray Atalag
- Auckland Bioengineering Institute, University of Auckland, Auckland, New Zealand
| | - Qi Tian
- College of Biomedical Engineering & Instrument Science, Zhejiang University, Hangzhou, China
| | - Yani Chen
- College of Biomedical Engineering & Instrument Science, Zhejiang University, Hangzhou, China
| | - Xudong Lu
- College of Biomedical Engineering & Instrument Science, Zhejiang University, Hangzhou, China
| |
Collapse
|
5
|
Sun B, Zhang F, Li J, Yang Y, Diao X, Zhao W, Shu T. Using NLP in openEHR archetypes retrieval to promote interoperability: a feasibility study in China. BMC Med Inform Decis Mak 2021; 21:199. [PMID: 34174874 PMCID: PMC8234679 DOI: 10.1186/s12911-021-01554-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2020] [Accepted: 06/08/2021] [Indexed: 11/15/2022] Open
Abstract
BACKGROUND With the development and application of medical information system, semantic interoperability is essential for accurate and advanced health-related computing and electronic health record (EHR) information sharing. The openEHR approach can improve semantic interoperability. One key improvement of openEHR is that it allows for the use of existing archetypes. The crucial problem is how to improve the precision and resolve ambiguity in the archetype retrieval. METHOD Based on the query expansion technology and Word2Vec model in Nature Language Processing (NLP), we propose to find synonyms as substitutes for original search terms in archetype retrieval. Test sets in different medical professional level are used to verify the feasibility. RESULT Applying the approach to each original search term (n = 120) in test sets, a total of 69,348 substitutes were constructed. Precision at 5 (P@5) was improved by 0.767, on average. For the best result, the P@5 was up to 0.975. CONCLUSIONS We introduce a novel approach that using NLP technology and corpus to find synonyms as substitutes for original search terms. Compared to simply mapping the element contained in openEHR to an external dictionary, this approach could greatly improve precision and resolve ambiguity in retrieval tasks. This is helpful to promote the application of openEHR and advance EHR information sharing.
Collapse
Affiliation(s)
- Bo Sun
- Department of Information Center, Fuwai Hospital, National Center for Cardiovascular Diseases, Chinese Academy of Medical Sciences and Peking Union Medical College, No. 167 North Lishi Road, Xicheng District, Beijing, 100037 China
| | - Fei Zhang
- Department of Information Center, Fuwai Hospital, National Center for Cardiovascular Diseases, Chinese Academy of Medical Sciences and Peking Union Medical College, No. 167 North Lishi Road, Xicheng District, Beijing, 100037 China
| | - Jing Li
- Beijing Advanced Innovation Center for Biomedical Engineering, School of Biological Science and Medical Engineering, Beihang University, 37 Xueyuan Road, Haidian District, Beijing, 100191 China
| | - Yicheng Yang
- Fuwai Hospital, National Center for Cardiovascular Diseases, Chinese Academy of Medical Sciences and Peking Union Medical College, No. 167 North Lishi Road, Xicheng District, Beijing, 100191 China
| | - Xiaolin Diao
- Department of Information Center, Fuwai Hospital, National Center for Cardiovascular Diseases, Chinese Academy of Medical Sciences and Peking Union Medical College, No. 167 North Lishi Road, Xicheng District, Beijing, 100037 China
| | - Wei Zhao
- Department of Information Center, Fuwai Hospital, National Center for Cardiovascular Diseases, Chinese Academy of Medical Sciences and Peking Union Medical College, No. 167 North Lishi Road, Xicheng District, Beijing, 100037 China
| | - Ting Shu
- National Institute of Hospital Administration, National Health Commission, Building 3, yard 6, Shouti South Road, Haidian, Beijing, 100044 China
| |
Collapse
|
6
|
Gomes DC, Abreu N, Sousa P, Moro C, Carvalho DR, Cubas MR. Representation of Diagnosis and Nursing Interventions in OpenEHR Archetypes. Appl Clin Inform 2021; 12:340-347. [PMID: 33853142 DOI: 10.1055/s-0041-1728706] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022] Open
Abstract
OBJECTIVE The study aimed to represent the content of nursing diagnosis and interventions in the openEHR standard. METHODS This is a developmental study with the models developed according to ISO 18104: 2014. The Ocean Archetype Editor tool from the openEHR Foundation was used. RESULTS Two archetypes were created; one to represent the nursing diagnosis concept and the other the nursing intervention concept. Existing archetypes available in the Clinical Knowledge Manager were reused in modeling. CONCLUSION The representation of nursing diagnosis and interventions based on the openEHR standard contributes to representing nursing care phenomena and needs in health information systems.
Collapse
Affiliation(s)
- Denilsen Carvalho Gomes
- Graduate Program in Health Technology, Pontifícia Universidade Católica do Paraná, Curitiba, Brazil
| | - Nuno Abreu
- Department of Medicine, Centro Hospitalar Universitário do Porto, Porto, Portugal
| | - Paulino Sousa
- Center for Research in Health Technologies and Information Systems (CINTESIS), Faculty of Medicine, University of Porto, Porto, Portugal
| | - Claudia Moro
- Graduate Program in Health Technology, Pontifícia Universidade Católica do Paraná, Curitiba, Brazil
| | - Deborah Ribeiro Carvalho
- Graduate Program in Health Technology, Pontifícia Universidade Católica do Paraná, Curitiba, Brazil
| | - Marcia Regina Cubas
- Graduate Program in Health Technology, Pontifícia Universidade Católica do Paraná, Curitiba, Brazil
| |
Collapse
|
7
|
Tian Q, Han Z, Yu P, An J, Lu X, Duan H. Application of openEHR archetypes to automate data quality rules for electronic health records: a case study. BMC Med Inform Decis Mak 2021; 21:113. [PMID: 33812388 PMCID: PMC8019503 DOI: 10.1186/s12911-021-01481-2] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2020] [Accepted: 03/25/2021] [Indexed: 12/02/2022] Open
Abstract
Background Ensuring data is of appropriate quality is essential for the secondary use of electronic health records (EHRs) in research and clinical decision support. An effective method of data quality assessment (DQA) is automating data quality rules (DQRs) to replace the time-consuming, labor-intensive manual process of creating DQRs, which is difficult to guarantee standard and comparable DQA results. This paper presents a case study of automatically creating DQRs based on openEHR archetypes in a Chinese hospital to investigate the feasibility and challenges of automating DQA for EHR data. Methods The clinical data repository (CDR) of the Shanxi Dayi Hospital is an archetype-based relational database. Four steps are undertaken to automatically create DQRs in this CDR database. First, the keywords and features relevant to DQA of archetypes were identified via mapping them to a well-established DQA framework, Kahn’s DQA framework. Second, the templates of DQRs in correspondence with these identified keywords and features were created in the structured query language (SQL). Third, the quality constraints were retrieved from archetypes. Fourth, these quality constraints were automatically converted to DQRs according to the pre-designed templates and mapping relationships of archetypes and data tables. We utilized the archetypes of the CDR to automatically create DQRs to meet quality requirements of the Chinese Application-Level Ranking Standard for EHR Systems (CARSES) and evaluated their coverage by comparing with expert-created DQRs. Results We used 27 archetypes to automatically create 359 DQRs. 319 of them are in agreement with the expert-created DQRs, covering 84.97% (311/366) requirements of the CARSES. The auto-created DQRs had varying levels of coverage of the four quality domains mandated by the CARSES: 100% (45/45) of consistency, 98.11% (208/212) of completeness, 54.02% (57/87) of conformity, and 50% (11/22) of timeliness. Conclusion It’s feasible to create DQRs automatically based on openEHR archetypes. This study evaluated the coverage of the auto-created DQRs to a typical DQA task of Chinese hospitals, the CARSES. The challenges of automating DQR creation were identified, such as quality requirements based on semantic, and complex constraints of multiple elements. This research can enlighten the exploration of DQR auto-creation and contribute to the automatic DQA. Supplementary Information The online version contains supplementary material available at 10.1186/s12911-021-01481-2.
Collapse
Affiliation(s)
- Qi Tian
- College of Biomedical Engineering and Instrument Science, Zhejiang University, Zheda Road 38, Hangzhou, 310027, China.,Key Laboratory for Biomedical Engineering, Ministry of Education, Hangzhou, China
| | - Zhexi Han
- College of Biomedical Engineering and Instrument Science, Zhejiang University, Zheda Road 38, Hangzhou, 310027, China.,Key Laboratory for Biomedical Engineering, Ministry of Education, Hangzhou, China
| | - Ping Yu
- Centre for Digital Transformation, School of Computing and Information Technology, University of Wollongong, Wollongong, NSW, 2522, Australia
| | - Jiye An
- College of Biomedical Engineering and Instrument Science, Zhejiang University, Zheda Road 38, Hangzhou, 310027, China.,Key Laboratory for Biomedical Engineering, Ministry of Education, Hangzhou, China
| | - Xudong Lu
- College of Biomedical Engineering and Instrument Science, Zhejiang University, Zheda Road 38, Hangzhou, 310027, China. .,School of Industrial Engineering, Eindhoven University of Technology, Eindhoven, The Netherlands. .,Key Laboratory for Biomedical Engineering, Ministry of Education, Hangzhou, China.
| | - Huilong Duan
- College of Biomedical Engineering and Instrument Science, Zhejiang University, Zheda Road 38, Hangzhou, 310027, China.,Key Laboratory for Biomedical Engineering, Ministry of Education, Hangzhou, China
| |
Collapse
|
8
|
Standardized electronic health record data modeling and persistence: A comparative review. J Biomed Inform 2020; 114:103670. [PMID: 33359548 DOI: 10.1016/j.jbi.2020.103670] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2020] [Revised: 12/15/2020] [Accepted: 12/20/2020] [Indexed: 12/12/2022]
Abstract
With the extensive adoption of electronic health records (EHRs) by several healthcare organizations, more efforts are needed to manage and utilize such massive, various, and complex healthcare data. Databases' performance and suitability to health care tasks are dramatically affected by how their data storage model and query capabilities are well-adapted to the use case scenario. On the other hand, standardized healthcare data modeling is one of the most favorable paths for achieving semantic interoperability, facilitating patient data integration from different healthcare systems. This paper compares the state-of-the-art of the most crucial database management systems used for storing standardized EHRs data. It discusses different database models' appropriateness for meeting different EHRs functions with different database specifications and workload scenarios. Insights into relevant literature show how flexible NoSQL databases (document, column, and graph) effectively deal with standardized EHRs data's distinctive features, especially in the distributed healthcare system, leading to better EHR.
Collapse
|
9
|
Li M, Leslie H, Qi B, Nan S, Feng H, Cai H, Lu X, Duan H. Development of an openEHR Template for COVID-19 Based on Clinical Guidelines. J Med Internet Res 2020; 22:e20239. [PMID: 32496207 PMCID: PMC7288685 DOI: 10.2196/20239] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2020] [Revised: 06/03/2020] [Accepted: 06/03/2020] [Indexed: 01/08/2023] Open
Abstract
Background The coronavirus disease (COVID-19) was discovered in China in December 2019. It has developed into a threatening international public health emergency. With the exception of China, the number of cases continues to increase worldwide. A number of studies about disease diagnosis and treatment have been carried out, and many clinically proven effective results have been achieved. Although information technology can improve the transferring of such knowledge to clinical practice rapidly, data interoperability is still a challenge due to the heterogeneous nature of hospital information systems. This issue becomes even more serious if the knowledge for diagnosis and treatment is updated rapidly as is the case for COVID-19. An open, semantic-sharing, and collaborative-information modeling framework is needed to rapidly develop a shared data model for exchanging data among systems. openEHR is such a framework and is supported by many open software packages that help to promote information sharing and interoperability. Objective This study aims to develop a shared data model based on the openEHR modeling approach to improve the interoperability among systems for the diagnosis and treatment of COVID-19. Methods The latest Guideline of COVID-19 Diagnosis and Treatment in China was selected as the knowledge source for modeling. First, the guideline was analyzed and the data items used for diagnosis and treatment, and management were extracted. Second, the data items were classified and further organized into domain concepts with a mind map. Third, searching was executed in the international openEHR Clinical Knowledge Manager (CKM) to find the existing archetypes that could represent the concepts. New archetypes were developed for those concepts that could not be found. Fourth, these archetypes were further organized into a template using Ocean Template Editor. Fifth, a test case of data exchanging between the clinical data repository and clinical decision support system based on the template was conducted to verify the feasibility of the study. Results A total of 203 data items were extracted from the guideline in China, and 16 domain concepts (16 leaf nodes in the mind map) were organized. There were 22 archetypes used to develop the template for all data items extracted from the guideline. All of them could be found in the CKM and reused directly. The archetypes and templates were reviewed and finally released in a public project within the CKM. The test case showed that the template can facilitate the data exchange and meet the requirements of decision support. Conclusions This study has developed the openEHR template for COVID-19 based on the latest guideline from China using openEHR modeling methodology. It represented the capability of the methodology for rapidly modeling and sharing knowledge through reusing the existing archetypes, which is especially useful in a new and fast-changing area such as with COVID-19.
Collapse
Affiliation(s)
- Mengyang Li
- College of Biomedical Engineering and Instrument Science, Zhejiang University, Hangzhou, Zhejiang, China.,Key Laboratory for Biomedical Engineering, Ministry of Education, Hangzhou, Zhejiang, China
| | - Heather Leslie
- Atomica Informatics, Melbourne, Australia.,openEHR Foundation, London, United Kingdom
| | - Bin Qi
- Hangzhou Joyrun Medical Technology Cooperation, Hangzhou, Zhejiang, China
| | - Shan Nan
- College of Biomedical Engineering and Instrument Science, Zhejiang University, Hangzhou, Zhejiang, China.,Key Laboratory for Biomedical Engineering, Ministry of Education, Hangzhou, Zhejiang, China
| | - Hongshuo Feng
- College of Biomedical Engineering and Instrument Science, Zhejiang University, Hangzhou, Zhejiang, China.,Key Laboratory for Biomedical Engineering, Ministry of Education, Hangzhou, Zhejiang, China
| | - Hailing Cai
- College of Biomedical Engineering and Instrument Science, Zhejiang University, Hangzhou, Zhejiang, China.,Key Laboratory for Biomedical Engineering, Ministry of Education, Hangzhou, Zhejiang, China
| | - Xudong Lu
- College of Biomedical Engineering and Instrument Science, Zhejiang University, Hangzhou, Zhejiang, China.,Key Laboratory for Biomedical Engineering, Ministry of Education, Hangzhou, Zhejiang, China.,School of Industrial Engineering, Eindhoven University of Technology, Eindhoven, Netherlands
| | - Huilong Duan
- College of Biomedical Engineering and Instrument Science, Zhejiang University, Hangzhou, Zhejiang, China.,Key Laboratory for Biomedical Engineering, Ministry of Education, Hangzhou, Zhejiang, China
| |
Collapse
|
10
|
Yang L, Huang X, Li J. Discovering Clinical Information Models Online to Promote Interoperability of Electronic Health Records: A Feasibility Study of OpenEHR. J Med Internet Res 2019; 21:e13504. [PMID: 31140433 PMCID: PMC6658308 DOI: 10.2196/13504] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2019] [Revised: 04/18/2019] [Accepted: 05/02/2019] [Indexed: 02/06/2023] Open
Abstract
Background Clinical information models (CIMs) enabling semantic interoperability are crucial for electronic health record (EHR) data use and reuse. Dual model methodology, which distinguishes the CIMs from the technical domain, could help enable the interoperability of EHRs at the knowledge level. How to help clinicians and domain experts discover CIMs from an open repository online to represent EHR data in a standard manner becomes important. Objective This study aimed to develop a retrieval method to identify CIMs online to represent EHR data. Methods We proposed a graphical retrieval method and validated its feasibility using an online CIM repository: openEHR Clinical Knowledge Manager (CKM). First, we represented CIMs (archetypes) using an extended Bayesian network. Then, an inference process was run in the network to discover relevant archetypes. In the evaluation, we defined three retrieval tasks (medication, laboratory test, and diagnosis) and compared our method with three typical retrieval methods (BM25F, simple Bayesian network, and CKM), using mean average precision (MAP), average precision (AP), and precision at 10 (P@10) as evaluation metrics. Results We downloaded all available archetypes from the CKM. Then, the graphical model was applied to represent the archetypes as a four-level clinical resources network. The network consisted of 5513 nodes, including 3982 data element nodes, 504 concept nodes, 504 duplicated concept nodes, and 523 archetype nodes, as well as 9867 edges. The results showed that our method achieved the best MAP (MAP=0.32), and the AP was almost equal across different retrieval tasks (AP=0.35, 0.31, and 0.30, respectively). In the diagnosis retrieval task, our method could successfully identify the models covering “diagnostic reports,” “problem list,” “patients background,” “clinical decision,” etc, as well as models that other retrieval methods could not find, such as “problems and diagnoses.” Conclusions The graphical retrieval method we propose is an effective approach to meet the uncertainty of finding CIMs. Our method can help clinicians and domain experts identify CIMs to represent EHR data in a standard manner, enabling EHR data to be exchangeable and interoperable.
Collapse
Affiliation(s)
- Lin Yang
- Institute of Medical Information / Medical Library, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, China
| | - Xiaoshuo Huang
- Institute of Medical Information / Medical Library, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, China
| | - Jiao Li
- Institute of Medical Information / Medical Library, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, China
| |
Collapse
|
11
|
Kalogiannis S, Deltouzos K, Zacharaki EI, Vasilakis A, Moustakas K, Ellul J, Megalooikonomou V. Integrating an openEHR-based personalized virtual model for the ageing population within HBase. BMC Med Inform Decis Mak 2019; 19:25. [PMID: 30691467 PMCID: PMC6350370 DOI: 10.1186/s12911-019-0745-8] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2018] [Accepted: 01/14/2019] [Indexed: 11/17/2022] Open
Abstract
Background Frailty is a common clinical syndrome in ageing population that carries an increased risk for adverse health outcomes including falls, hospitalization, disability, and mortality. As these outcomes affect the health and social care planning, during the last years there is a tendency of investing in monitoring and preventing strategies. Although a number of electronic health record (EHR) systems have been developed, including personalized virtual patient models, there are limited ageing population oriented systems. Methods We exploit the openEHR framework for the representation of frailty in ageing population in order to attain semantic interoperability, and we present the methodology for adoption or development of archetypes. We also propose a framework for a one-to-one mapping between openEHR archetypes and a column-family NoSQL database (HBase) aiming at the integration of existing and newly developed archetypes into it. Results The requirement analysis of our study resulted in the definition of 22 coherent and clinically meaningful parameters for the description of frailty in older adults. The implemented openEHR methodology led to the direct use of 22 archetypes, the modification and reuse of two archetypes, and the development of 28 new archetypes. Additionally, the mapping procedure led to two different HBase tables for the storage of the data. Conclusions In this work, an openEHR-based virtual patient model has been designed and integrated into an HBase storage system, exploiting the advantages of the underlying technologies. This framework can serve as a base for the development of a decision support system using the openEHR’s Guideline Definition Language in the future.
Collapse
Affiliation(s)
- Spyridon Kalogiannis
- Computer Engineering and Informatics Department, University of Patras, University Campus, Rio, 26504, Greece
| | - Konstantinos Deltouzos
- Computer Engineering and Informatics Department, University of Patras, University Campus, Rio, 26504, Greece.
| | - Evangelia I Zacharaki
- Computer Engineering and Informatics Department, University of Patras, University Campus, Rio, 26504, Greece
| | - Andreas Vasilakis
- Information Technologies Institute, Centre for Research and Technology Hellas, 6th km Charilaou-Thermi Rd, Thessaloniki, 57001, Greece
| | - Konstantinos Moustakas
- Information Technologies Institute, Centre for Research and Technology Hellas, 6th km Charilaou-Thermi Rd, Thessaloniki, 57001, Greece
| | - John Ellul
- Department of Neurology, School of Medicine, University of Patras, University Campus, Rio, 26504, Greece
| | - Vasileios Megalooikonomou
- Computer Engineering and Informatics Department, University of Patras, University Campus, Rio, 26504, Greece
| |
Collapse
|
12
|
Min L, Tian Q, Lu X, Duan H. Modeling EHR with the openEHR approach: an exploratory study in China. BMC Med Inform Decis Mak 2018; 18:75. [PMID: 30157838 PMCID: PMC6116359 DOI: 10.1186/s12911-018-0650-6] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2017] [Accepted: 07/04/2018] [Indexed: 11/24/2022] Open
Abstract
BACKGROUND The openEHR approach can improve the interoperability of electronic health record (EHR) through two-level modeling. Developing archetypes for the complete EHR dataset is essential for implementing a large-scale interoperable EHR system with the openEHR approach. Although the openEHR approach has been applied in different domains, the feasibility of archetyping a complete EHR dataset in a hospital has not been reported in academic literature, especially in a country where using openEHR is still in its infancy stage, like China. This paper presents a case study of modeling an EHR in China aiming to investigate the feasibility and challenges of archetyping a complete EHR dataset with the openEHR approach. METHODS We proposed an archetype modeling method including an iterative process of collecting requirements, normalizing data elements, organizing concepts, searching corresponding archetypes, editing archetypes and reviewing archetypes. Two representative EHR systems from Chinese vendors and the existing Chinese EHR standards have been used as resources to identify the requirements of EHR in China, and a case study of modeling EHR in China has been conducted. Based on the models developed in this case study, we have implemented a clinical data repository (CDR) to verify the feasibility of modeling EHR with archetypes. RESULTS Sixty four archetypes were developed to represent all requirements of a complete EHR dataset. 59 (91%) archetypes could be found in Clinical Knowledge Manager (CKM), of which 35 could be reused directly without change, and 23 required further development including two revisions, two new versions, 18 extensions and one specialization. Meanwhile, 6 (9%) archetypes were newly developed. The legacy data of the EHR system in hospitals could be integrated into the CDR developed with these archetypes successfully. CONCLUSIONS The existing archetypes in CKM can faithfully represent most of the EHR requirements in China except customizations for local hospital management. This case study verified the feasibility of modeling EHR with the openEHR approach and identified the fact that the challenges such as localization, tool support, and an agile publishing process still exist for a broader application of the openEHR approach.
Collapse
Affiliation(s)
- Lingtong Min
- College of Biomedical Engineering & Instrument Science, Zhejiang University, Zheda Road, Hangzhou, 310027, China
| | - Qi Tian
- College of Biomedical Engineering & Instrument Science, Zhejiang University, Zheda Road, Hangzhou, 310027, China
| | - Xudong Lu
- College of Biomedical Engineering & Instrument Science, Zhejiang University, Zheda Road, Hangzhou, 310027, China.
| | - Huilong Duan
- College of Biomedical Engineering & Instrument Science, Zhejiang University, Zheda Road, Hangzhou, 310027, China
| |
Collapse
|
13
|
iT2DMS: a Standard-Based Diabetic Disease Data Repository and its Pilot Experiment on Diabetic Retinopathy Phenotyping and Examination Results Integration. J Med Syst 2018; 42:131. [DOI: 10.1007/s10916-018-0939-0] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2018] [Accepted: 03/14/2018] [Indexed: 01/18/2023]
|
14
|
Maranhão PA, Bacelar-Silva GM, Ferreira DNG, Calhau C, Vieira-Marques P, Cruz-Correia RJ. Nutrigenomic Information in the openEHR Data Set. Appl Clin Inform 2018; 9:221-231. [PMID: 29590680 DOI: 10.1055/s-0038-1635115] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
Abstract
BACKGROUND The traditional concept of personalized nutrition is based on adapting diets according to individual needs and preferences. Discussions about personalized nutrition have been on since the Human Genome Project, which has sequenced the human genome. Thenceforth, topics such as nutrigenomics have been assessed to help in better understanding the genetic variation influence on the dietary response and association between nutrients and gene expression. Hence, some challenges impaired the understanding about the nowadays important clinical data and about clinical data assumed to be important in the future. OBJECTIVE Finding the main clinical statements in the personalized nutrition field (nutrigenomics) to create the future-proof health information system to the openEHR server based on archetypes, as well as a specific nutrigenomic template. METHODS A systematic literature search was conducted in electronic databases such as PubMed. The aim of this systemic review was to list the chief clinical statements and create archetype and templates for openEHR modeling tools, namely, Ocean Archetype Editor and Ocean Template Design. RESULTS The literature search led to 51 articles; however, just 26 articles were analyzed after all the herein adopted inclusion criteria were assessed. Of these total, 117 clinical statements were identified, as well as 27 archetype-friendly concepts. Our group modeled four new archetypes (waist-to-height ratio, genetic test results, genetic summary, and diet plan) and finally created the specific nutrigenomic template for nutrition care. CONCLUSION The archetypes and the specific openEHR template developed in this study gave dieticians and other health professionals an important tool to their nutrigenomic clinical practices, besides a set of nutrigenomic data to clinical research.
Collapse
Affiliation(s)
- Priscila Alves Maranhão
- Faculty of Medicine, Center for Research in Health Technologies and Information Systems (CINTESIS), University of Porto, Porto, Portugal
| | - Gustavo Marísio Bacelar-Silva
- Faculty of Medicine, Center for Research in Health Technologies and Information Systems (CINTESIS), University of Porto, Porto, Portugal
| | - Duarte Nuno Gonçalves Ferreira
- Faculty of Medicine, Center for Research in Health Technologies and Information Systems (CINTESIS), University of Porto, Porto, Portugal
| | - Conceição Calhau
- Faculty of Medicine, Center for Research in Health Technologies and Information Systems (CINTESIS), University of Porto, Porto, Portugal.,Faculty of Medical Science, Nova de Lisboa University, Nova Medical School, Lisboa, Portugal
| | - Pedro Vieira-Marques
- Faculty of Medicine, Center for Research in Health Technologies and Information Systems (CINTESIS), University of Porto, Porto, Portugal
| | - Ricardo João Cruz-Correia
- Faculty of Medicine, Center for Research in Health Technologies and Information Systems (CINTESIS), University of Porto, Porto, Portugal
| |
Collapse
|
15
|
Min L, Tian Q, Lu X, An J, Duan H. An openEHR based approach to improve the semantic interoperability of clinical data registry. BMC Med Inform Decis Mak 2018; 18:15. [PMID: 29589572 PMCID: PMC5872380 DOI: 10.1186/s12911-018-0596-8] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022] Open
Abstract
Background Clinical data registry is designed to collect and manage information about the practices and outcomes of a patient population for improving the quality and safety of care and facilitating novel researches. Semantic interoperability is a challenge when integrating the data from more than one clinical data registry. The openEHR approach can represent the information and knowledge semantics by multi-level modeling, and it advocates the use of collaborative modeling to facilitate reusing existing archetypes with consistent semantics so as to be a potential solution to improve the semantic interoperability. Methods This paper proposed an openEHR based approach to improve the semantic interoperability of clinical data registry. The approach consists of five steps: clinical data registry meta-information collection, data element definition, archetype modeling, template editing, and implementation. Through collaborative modeling and maximum reusing of existing archetype at the archetype modeling step, the approach can improve semantic interoperability. To verify the feasibility of the approach, this paper conducted a case study of building a Coronary Computed Tomography Angiography (CCTA) registry that can interoperate with an existing Electronic Health Record (EHR) system. Results The CCTA registry includes 183 data elements, which involves 20 archetypes. A total number of 45 CCTA data elements and EHR data elements have semantic overlap. Among them, 38 (84%) CCTA data elements can be found in the 10 reused EHR archetypes. These corresponding clinical data can be collected from the EHR system directly without transformation. The other 7 (16%) CCTA data elements correspond to one coarse-grained EHR data elements, and these clinical data can be collected with mapping rules. The results show that the approach can improve semantic interoperability of clinical data registry. Conclusions Using an openEHR based approach to develop clinical data registry can improve the semantic interoperability. Meanwhile, some challenges for broader semantic interoperability are identified, including domain experts’ involvement, archetype sharing and reusing, and archetype semantic mapping. Collaborative modeling, easy-to-use tools, and semantic relationship establishment are potential solutions for these challenges. This study provides some experience and insight about clinical modeling and clinical data registry development.
Collapse
Affiliation(s)
- Lingtong Min
- College of Biomedical Engineering and Instrument Science, Zhejiang University, Zheda Road, Hanghzou, 310027, China
| | - Qi Tian
- College of Biomedical Engineering and Instrument Science, Zhejiang University, Zheda Road, Hanghzou, 310027, China
| | - Xudong Lu
- College of Biomedical Engineering and Instrument Science, Zhejiang University, Zheda Road, Hanghzou, 310027, China.
| | - Jiye An
- College of Biomedical Engineering and Instrument Science, Zhejiang University, Zheda Road, Hanghzou, 310027, China
| | - Huilong Duan
- College of Biomedical Engineering and Instrument Science, Zhejiang University, Zheda Road, Hanghzou, 310027, China
| |
Collapse
|
16
|
Sánchez-de-Madariaga R, Muñoz A, Castro AL, Moreno O, Pascual M. Executing Complexity-Increasing Queries in Relational (MySQL) and NoSQL (MongoDB and EXist) Size-Growing ISO/EN 13606 Standardized EHR Databases. J Vis Exp 2018. [PMID: 29608174 PMCID: PMC5933229 DOI: 10.3791/57439] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/31/2022] Open
Abstract
This research shows a protocol to assess the computational complexity of querying relational and non-relational (NoSQL (not only Structured Query Language)) standardized electronic health record (EHR) medical information database systems (DBMS). It uses a set of three doubling-sized databases, i.e. databases storing 5000, 10,000 and 20,000 realistic standardized EHR extracts, in three different database management systems (DBMS): relational MySQL object-relational mapping (ORM), document-based NoSQL MongoDB, and native extensible markup language (XML) NoSQL eXist. The average response times to six complexity-increasing queries were computed, and the results showed a linear behavior in the NoSQL cases. In the NoSQL field, MongoDB presents a much flatter linear slope than eXist. NoSQL systems may also be more appropriate to maintain standardized medical information systems due to the special nature of the updating policies of medical information, which should not affect the consistency and efficiency of the data stored in NoSQL databases. One limitation of this protocol is the lack of direct results of improved relational systems such as archetype relational mapping (ARM) with the same data. However, the interpolation of doubling-size database results to those presented in the literature and other published results suggests that NoSQL systems might be more appropriate in many specific scenarios and problems to be solved. For example, NoSQL may be appropriate for document-based tasks such as EHR extracts used in clinical practice, or edition and visualization, or situations where the aim is not only to query medical information, but also to restore the EHR in exactly its original form.
Collapse
Affiliation(s)
| | - Adolfo Muñoz
- Telemedicine and Information Society Department, Health Institute "Carlos III";
| | - Antonio L Castro
- Telemedicine and Information Society Department, Health Institute "Carlos III"
| | - Oscar Moreno
- Telemedicine and Information Society Department, Health Institute "Carlos III"
| | - Mario Pascual
- Telemedicine and Information Society Department, Health Institute "Carlos III"
| |
Collapse
|
17
|
Kopanitsa G. Integration of Hospital Information and Clinical Decision Support Systems to Enable the Reuse of Electronic Health Record Data. Methods Inf Med 2018; 56:238-247. [DOI: 10.3414/me16-01-0057] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2016] [Accepted: 01/10/2017] [Indexed: 01/08/2023]
Abstract
SummaryBackground: The efficiency and acceptance of clinical decision support systems (CDSS) can increase if they reuse medical data captured during health care delivery. High heterogeneity of the existing legacy data formats has become the main barrier for the reuse of data. Thus, we need to apply data modeling mechanisms that provide standardization, transformation, accumulation and querying medical data to allow its reuse.Objectives: In this paper, we focus on the interoperability issues of the hospital information systems (HIS) and CDSS data integration.Materials and Methods: Our study is based on the approach proposed by Marcos et al. where archetypes are used as a standardized mechanism for the interaction of a CDSS with an electronic health record (EHR). We build an integration tool to enable CDSSs collect data from various institutions without a need for modifications in the implementation. The approach implies development of a conceptual level as a set of archetypes representing concepts required by a CDSS.Results: Treatment case data from Regional Clinical Hospital in Tomsk, Russia was extracted, transformed and loaded to the archetype database of a clinical decision support system. Test records’ normalization has been performed by defining transformation and aggregation rules between the EHR data and the archetypes. These mapping rules were used to automatically generate openEHR compliant data. After the transformation, archetype data instances were loaded into the CDSS archetype based data storage. The performance times showed acceptable performance for the extraction stage with a mean of 17.428 s per year (3436 case records). The transformation times were also acceptable with 136.954 s per year (0.039 s per one instance). The accuracy evaluation showed the correctness and applicability of the method for the wide range of HISes. These operations were performed without interrupting the HIS workflow to prevent the HISes from disturbing the service provision to the users.Conclusions: The project results have proven that archetype based technologies are mature enough to be applied in routine operations that require extraction, transformation, loading and querying medical data from heterogeneous EHR systems. Inference models in clinical research and CDSS can benefit from this by defining queries to a valid data set with known structure and constraints. The standard based nature of the archetype approach allows an easy integration of CDSSs with existing EHR systems.
Collapse
|
18
|
Teodoro D, Sundvall E, João Junior M, Ruch P, Miranda Freire S. ORBDA: An openEHR benchmark dataset for performance assessment of electronic health record servers. PLoS One 2018; 13:e0190028. [PMID: 29293556 PMCID: PMC5749730 DOI: 10.1371/journal.pone.0190028] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2017] [Accepted: 12/06/2017] [Indexed: 11/19/2022] Open
Abstract
The openEHR specifications are designed to support implementation of flexible and interoperable Electronic Health Record (EHR) systems. Despite the increasing number of solutions based on the openEHR specifications, it is difficult to find publicly available healthcare datasets in the openEHR format that can be used to test, compare and validate different data persistence mechanisms for openEHR. To foster research on openEHR servers, we present the openEHR Benchmark Dataset, ORBDA, a very large healthcare benchmark dataset encoded using the openEHR formalism. To construct ORBDA, we extracted and cleaned a de-identified dataset from the Brazilian National Healthcare System (SUS) containing hospitalisation and high complexity procedures information and formalised it using a set of openEHR archetypes and templates. Then, we implemented a tool to enrich the raw relational data and convert it into the openEHR model using the openEHR Java reference model library. The ORBDA dataset is available in composition, versioned composition and EHR openEHR representations in XML and JSON formats. In total, the dataset contains more than 150 million composition records. We describe the dataset and provide means to access it. Additionally, we demonstrate the usage of ORBDA for evaluating inserting throughput and query latency performances of some NoSQL database management systems. We believe that ORBDA is a valuable asset for assessing storage models for openEHR-based information systems during the software engineering process. It may also be a suitable component in future standardised benchmarking of available openEHR storage platforms.
Collapse
Affiliation(s)
- Douglas Teodoro
- Departamento de Tecnologia da Informação e Educação em Saúde, Universidade do Estado do Rio de Janeiro, Rio de Janeiro, Brazil
- SIB Text Mining, Swiss Institute of Bioinformatics, Geneva, Switzerland
- Department of Information Science, HEG-Geneva, HES-SO, Geneva, Switzerland
| | - Erik Sundvall
- Department of Biomedical Engineering, Linköping University, Linköping, Sweden
- Region Östergötland, Linköping, Sweden
| | - Mario João Junior
- Departamento de Tecnologia da Informação e Educação em Saúde, Universidade do Estado do Rio de Janeiro, Rio de Janeiro, Brazil
| | - Patrick Ruch
- SIB Text Mining, Swiss Institute of Bioinformatics, Geneva, Switzerland
- Department of Information Science, HEG-Geneva, HES-SO, Geneva, Switzerland
| | - Sergio Miranda Freire
- Departamento de Tecnologia da Informação e Educação em Saúde, Universidade do Estado do Rio de Janeiro, Rio de Janeiro, Brazil
| |
Collapse
|
19
|
|
20
|
Sánchez-de-Madariaga R, Muñoz A, Lozano-Rubí R, Serrano-Balazote P, Castro AL, Moreno O, Pascual M. Examining database persistence of ISO/EN 13606 standardized electronic health record extracts: relational vs. NoSQL approaches. BMC Med Inform Decis Mak 2017; 17:123. [PMID: 28821246 PMCID: PMC5563027 DOI: 10.1186/s12911-017-0515-4] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2016] [Accepted: 07/31/2017] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The objective of this research is to compare the relational and non-relational (NoSQL) database systems approaches in order to store, recover, query and persist standardized medical information in the form of ISO/EN 13606 normalized Electronic Health Record XML extracts, both in isolation and concurrently. NoSQL database systems have recently attracted much attention, but few studies in the literature address their direct comparison with relational databases when applied to build the persistence layer of a standardized medical information system. METHODS One relational and two NoSQL databases (one document-based and one native XML database) of three different sizes have been created in order to evaluate and compare the response times (algorithmic complexity) of six different complexity growing queries, which have been performed on them. Similar appropriate results available in the literature have also been considered. RESULTS Relational and non-relational NoSQL database systems show almost linear algorithmic complexity query execution. However, they show very different linear slopes, the former being much steeper than the two latter. Document-based NoSQL databases perform better in concurrency than in isolation, and also better than relational databases in concurrency. CONCLUSION Non-relational NoSQL databases seem to be more appropriate than standard relational SQL databases when database size is extremely high (secondary use, research applications). Document-based NoSQL databases perform in general better than native XML NoSQL databases. EHR extracts visualization and edition are also document-based tasks more appropriate to NoSQL database systems. However, the appropriate database solution much depends on each particular situation and specific problem.
Collapse
Affiliation(s)
- Ricardo Sánchez-de-Madariaga
- Telemedicine and Information Society Department, Health Institute “Carlos III” (ISCIII), c/Sinesio Delgado, 4 –, 28029 Madrid, Spain
| | - Adolfo Muñoz
- Telemedicine and Information Society Department, Health Institute “Carlos III” (ISCIII), c/Sinesio Delgado, 4 –, 28029 Madrid, Spain
| | - Raimundo Lozano-Rubí
- Medical Informatics, Hospital Clínic, Unit of Medical Informatics, University of Barcelona, Barcelona, Spain
- Department of Computer Science, Autonomous Univerity of Barcelona, Barcelona, Spain
| | | | - Antonio L. Castro
- Telemedicine and Information Society Department, Health Institute “Carlos III” (ISCIII), c/Sinesio Delgado, 4 –, 28029 Madrid, Spain
| | - Oscar Moreno
- Telemedicine and Information Society Department, Health Institute “Carlos III” (ISCIII), c/Sinesio Delgado, 4 –, 28029 Madrid, Spain
| | - Mario Pascual
- Telemedicine and Information Society Department, Health Institute “Carlos III” (ISCIII), c/Sinesio Delgado, 4 –, 28029 Madrid, Spain
| |
Collapse
|
21
|
Bhalla S, Sachdeva S, Batra S. Semantic Interoperability in Electronic Health Record Databases: Standards, Architecture and e-Health Systems. BIG DATA ANALYTICS 2017. [DOI: 10.1007/978-3-319-72413-3_16] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open
|
22
|
Lozano-Rubí R, Muñoz Carrero A, Serrano Balazote P, Pastor X. OntoCR: A CEN/ISO-13606 clinical repository based on ontologies. J Biomed Inform 2016; 60:224-33. [DOI: 10.1016/j.jbi.2016.02.007] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2015] [Revised: 02/01/2016] [Accepted: 02/14/2016] [Indexed: 11/28/2022]
|
23
|
Comparing the Performance of NoSQL Approaches for Managing Archetype-Based Electronic Health Record Data. PLoS One 2016; 11:e0150069. [PMID: 26958859 PMCID: PMC4784924 DOI: 10.1371/journal.pone.0150069] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2015] [Accepted: 02/07/2016] [Indexed: 11/19/2022] Open
Abstract
This study provides an experimental performance evaluation on population-based queries of NoSQL databases storing archetype-based Electronic Health Record (EHR) data. There are few published studies regarding the performance of persistence mechanisms for systems that use multilevel modelling approaches, especially when the focus is on population-based queries. A healthcare dataset with 4.2 million records stored in a relational database (MySQL) was used to generate XML and JSON documents based on the openEHR reference model. Six datasets with different sizes were created from these documents and imported into three single machine XML databases (BaseX, eXistdb and Berkeley DB XML) and into a distributed NoSQL database system based on the MapReduce approach, Couchbase, deployed in different cluster configurations of 1, 2, 4, 8 and 12 machines. Population-based queries were submitted to those databases and to the original relational database. Database size and query response times are presented. The XML databases were considerably slower and required much more space than Couchbase. Overall, Couchbase had better response times than MySQL, especially for larger datasets. However, Couchbase requires indexing for each differently formulated query and the indexing time increases with the size of the datasets. The performances of the clusters with 2, 4, 8 and 12 nodes were not better than the single node cluster in relation to the query response time, but the indexing time was reduced proportionally to the number of nodes. The tested XML databases had acceptable performance for openEHR-based data in some querying use cases and small datasets, but were generally much slower than Couchbase. Couchbase also outperformed the response times of the relational database, but required more disk space and had a much longer indexing time. Systems like Couchbase are thus interesting research targets for scalable storage and querying of archetype-based EHR data when population-based use cases are of interest.
Collapse
|
24
|
Wang L, Min L, Wang R, Lu X, Duan H. Erratum to: Archetype relational mapping - a practical openEHR persistence solution. BMC Med Inform Decis Mak 2016; 16:21. [PMID: 26873222 PMCID: PMC4752799 DOI: 10.1186/s12911-016-0259-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2016] [Accepted: 02/05/2016] [Indexed: 11/10/2022] Open
Affiliation(s)
- Li Wang
- College of Biomedical Engineering and Instrument Science, Zhejiang University, Room 512, Zhouyiqing Building, Zhejiang University, 38 Zheda Road, Hangzhou, Zhejiang, China
| | - Lingtong Min
- College of Biomedical Engineering and Instrument Science, Zhejiang University, Room 512, Zhouyiqing Building, Zhejiang University, 38 Zheda Road, Hangzhou, Zhejiang, China
| | - Rui Wang
- Department of Information Technology, Shanxi Dayi Hospital, Taiyuan, Shanxi, China
| | - Xudong Lu
- College of Biomedical Engineering and Instrument Science, Zhejiang University, Room 512, Zhouyiqing Building, Zhejiang University, 38 Zheda Road, Hangzhou, Zhejiang, China.
| | - Huilong Duan
- College of Biomedical Engineering and Instrument Science, Zhejiang University, Room 512, Zhouyiqing Building, Zhejiang University, 38 Zheda Road, Hangzhou, Zhejiang, China
| |
Collapse
|