1
|
Ahmadi N, Zoch M, Guengoeze O, Facchinello C, Mondorf A, Stratmann K, Musleh K, Erasmus HP, Tchertov J, Gebler R, Schaaf J, Frischen LS, Nasirian A, Dai J, Henke E, Tremblay D, Srisuwananukorn A, Bornhäuser M, Röllig C, Eckardt JN, Middeke JM, Wolfien M, Sedlmayr M. How to customize common data models for rare diseases: an OMOP-based implementation and lessons learned. Orphanet J Rare Dis 2024; 19:298. [PMID: 39143600 PMCID: PMC11325822 DOI: 10.1186/s13023-024-03312-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2023] [Accepted: 08/06/2024] [Indexed: 08/16/2024] Open
Abstract
BACKGROUND Given the geographical sparsity of Rare Diseases (RDs), assembling a cohort is often a challenging task. Common data models (CDM) can harmonize disparate sources of data that can be the basis of decision support systems and artificial intelligence-based studies, leading to new insights in the field. This work is sought to support the design of large-scale multi-center studies for rare diseases. METHODS In an interdisciplinary group, we derived a list of elements of RDs in three medical domains (endocrinology, gastroenterology, and pneumonology) according to specialist knowledge and clinical guidelines in an iterative process. We then defined a RDs data structure that matched all our data elements and built Extract, Transform, Load (ETL) processes to transfer the structure to a joint CDM. To ensure interoperability of our developed CDM and its subsequent usage for further RDs domains, we ultimately mapped it to Observational Medical Outcomes Partnership (OMOP) CDM. We then included a fourth domain, hematology, as a proof-of-concept and mapped an acute myeloid leukemia (AML) dataset to the developed CDM. RESULTS We have developed an OMOP-based rare diseases common data model (RD-CDM) using data elements from the three domains (endocrinology, gastroenterology, and pneumonology) and tested the CDM using data from the hematology domain. The total study cohort included 61,697 patients. After aligning our modules with those of Medical Informatics Initiative (MII) Core Dataset (CDS) modules, we leveraged its ETL process. This facilitated the seamless transfer of demographic information, diagnoses, procedures, laboratory results, and medication modules from our RD-CDM to the OMOP. For the phenotypes and genotypes, we developed a second ETL process. We finally derived lessons learned for customizing our RD-CDM for different RDs. DISCUSSION This work can serve as a blueprint for other domains as its modularized structure could be extended towards novel data types. An interdisciplinary group of stakeholders that are actively supporting the project's progress is necessary to reach a comprehensive CDM. CONCLUSION The customized data structure related to our RD-CDM can be used to perform multi-center studies to test data-driven hypotheses on a larger scale and take advantage of the analytical tools offered by the OHDSI community.
Collapse
Affiliation(s)
- Najia Ahmadi
- Institute for Medical Informatics and Biometry, Carl Gustav Carus Faculty of Medicine, TUD Dresden University of Technology, Fetscherstraße 74, 01307, Dresden, Germany.
| | - Michele Zoch
- Institute for Medical Informatics and Biometry, Carl Gustav Carus Faculty of Medicine, TUD Dresden University of Technology, Fetscherstraße 74, 01307, Dresden, Germany
| | - Oya Guengoeze
- Department of Internal Medicine I, University Hospital Frankfurt, Goethe University, Frankfurt, Germany
| | - Carlo Facchinello
- Department of Internal Medicine I, University Hospital Frankfurt, Goethe University, Frankfurt, Germany
| | - Antonia Mondorf
- Department of Internal Medicine I, University Hospital Frankfurt, Goethe University, Frankfurt, Germany
| | - Katharina Stratmann
- Department of Internal Medicine I, University Hospital Frankfurt, Goethe University, Frankfurt, Germany
| | - Khader Musleh
- Department of Internal Medicine I, University Hospital Frankfurt, Goethe University, Frankfurt, Germany
| | - Hans-Peter Erasmus
- Department of Internal Medicine I, University Hospital Frankfurt, Goethe University, Frankfurt, Germany
| | - Jana Tchertov
- Institute for Medical Informatics and Biometry, Carl Gustav Carus Faculty of Medicine, TUD Dresden University of Technology, Fetscherstraße 74, 01307, Dresden, Germany
| | - Richard Gebler
- Institute for Medical Informatics and Biometry, Carl Gustav Carus Faculty of Medicine, TUD Dresden University of Technology, Fetscherstraße 74, 01307, Dresden, Germany
| | - Jannik Schaaf
- Goethe University Frankfurt, University Hospital, Institute of Medical Informatics, Frankfurt, Germany
| | - Lena S Frischen
- University Hospital Frankfurt, Goethe University, Executive Department for Medical IT-Systems and Digitalization, Frankfurt, Germany
| | - Azadeh Nasirian
- Center of Medical Informatics, University Hospital Carl Gustav Carus, TUD Dresden University of Technology, Dresden, Germany
| | - Jiabin Dai
- Institute for Medical Informatics and Biometry, Carl Gustav Carus Faculty of Medicine, TUD Dresden University of Technology, Fetscherstraße 74, 01307, Dresden, Germany
| | - Elisa Henke
- Institute for Medical Informatics and Biometry, Carl Gustav Carus Faculty of Medicine, TUD Dresden University of Technology, Fetscherstraße 74, 01307, Dresden, Germany
| | - Douglas Tremblay
- Tisch Cancer Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | | | - Martin Bornhäuser
- Department of Internal Medicine I, University Hospital Carl Gustav Carus, TUD Dresden University of Technology, Dresden, Germany
| | - Christoph Röllig
- Department of Internal Medicine I, University Hospital Carl Gustav Carus, TUD Dresden University of Technology, Dresden, Germany
| | - Jan-Niklas Eckardt
- Department of Internal Medicine I, University Hospital Carl Gustav Carus, TUD Dresden University of Technology, Dresden, Germany
- Else-Kroener-Fresenius-Center for Digital Health, TUD Dresden University of Technology, Dresden, Germany
| | - Jan Moritz Middeke
- Department of Internal Medicine I, University Hospital Carl Gustav Carus, TUD Dresden University of Technology, Dresden, Germany
- Else-Kroener-Fresenius-Center for Digital Health, TUD Dresden University of Technology, Dresden, Germany
| | - Markus Wolfien
- Institute for Medical Informatics and Biometry, Carl Gustav Carus Faculty of Medicine, TUD Dresden University of Technology, Fetscherstraße 74, 01307, Dresden, Germany
- Center for Scalable Data Analytics and Artificial Intelligence (ScaDS.AI) Dresden/Leipzig, Dresden, Germany
| | - Martin Sedlmayr
- Institute for Medical Informatics and Biometry, Carl Gustav Carus Faculty of Medicine, TUD Dresden University of Technology, Fetscherstraße 74, 01307, Dresden, Germany
| |
Collapse
|
2
|
Tarride JE, Okoh A, Aryal K, Prada C, Milinkovic D, Keepanasseril A, Iorio A. Scoping review of the recommendations and guidance for improving the quality of rare disease registries. Orphanet J Rare Dis 2024; 19:187. [PMID: 38711103 PMCID: PMC11075280 DOI: 10.1186/s13023-024-03193-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2023] [Accepted: 04/19/2024] [Indexed: 05/08/2024] Open
Abstract
BACKGROUND Rare disease registries (RDRs) are valuable tools for improving clinical care and advancing research. However, they often vary qualitatively, structurally, and operationally in ways that can determine their potential utility as a source of evidence to support decision-making regarding the approval and funding of new treatments for rare diseases. OBJECTIVES The goal of this research project was to review the literature on rare disease registries and identify best practices to improve the quality of RDRs. METHODS In this scoping review, we searched MEDLINE and EMBASE as well as the websites of regulatory bodies and health technology assessment agencies from 2010 to April 2023 for literature offering guidance or recommendations to ensure, improve, or maintain quality RDRs. RESULTS The search yielded 1,175 unique references, of which 64 met the inclusion criteria. The characteristics of RDRs deemed to be relevant to their quality align with three main domains and several sub-domains considered to be best practices for quality RDRs: (1) governance (registry purpose and description; governance structure; stakeholder engagement; sustainability; ethics/legal/privacy; data governance; documentation; and training and support); (2) data (standardized disease classification; common data elements; data dictionary; data collection; data quality and assurance; and data analysis and reporting); and (3) information technology (IT) infrastructure (physical and virtual infrastructure; and software infrastructure guided by FAIR principles (Findability; Accessibility; Interoperability; and Reusability). CONCLUSIONS Although RDRs face numerous challenges due to their small and dispersed populations, RDRs can generate quality data to support healthcare decision-making through the use of standards and principles on strong governance, quality data practices, and IT infrastructure.
Collapse
Affiliation(s)
- J E Tarride
- Department of Health Research Methods, Evidence and Impact, Faculty of Health Sciences, McMaster University, Hamilton, Canada
- Centre for Health Economics and Policy Analysis (CHEPA), McMaster University, Hamilton, Canada
- Programs for the Assessment of Technologies in Health (PATH), The Research Institute of St. Joe's Hamilton, St. Joseph's Healthcare Hamilton, Hamilton, ON, Canada
| | - A Okoh
- Department of Health Research Methods, Evidence and Impact, Faculty of Health Sciences, McMaster University, Hamilton, Canada
| | - K Aryal
- Department of Health Research Methods, Evidence and Impact, Faculty of Health Sciences, McMaster University, Hamilton, Canada
| | - C Prada
- Department of Health Research Methods, Evidence and Impact, Faculty of Health Sciences, McMaster University, Hamilton, Canada
| | - Deborah Milinkovic
- Centre for Health Economics and Policy Analysis (CHEPA), McMaster University, Hamilton, Canada.
| | - A Keepanasseril
- Department of Health Research Methods, Evidence and Impact, Faculty of Health Sciences, McMaster University, Hamilton, Canada
| | - A Iorio
- Department of Health Research Methods, Evidence and Impact, Faculty of Health Sciences, McMaster University, Hamilton, Canada
| |
Collapse
|
3
|
Bebbington E, Miles J, Young A, van Baar ME, Bernal N, Brekke RL, van Dammen L, Elmasry M, Inoue Y, McMullen KA, Paton L, Thamm OC, Tracy LM, Zia N, Singer Y, Dunn K. Exploring the similarities and differences of burn registers globally: Results from a data dictionary comparison study. Burns 2024; 50:850-865. [PMID: 38267291 DOI: 10.1016/j.burns.2024.01.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2023] [Revised: 12/08/2023] [Accepted: 01/10/2024] [Indexed: 01/26/2024]
Abstract
INTRODUCTION Pooling and comparing data from the existing global network of burn registers represents a powerful, yet untapped, opportunity to improve burn prevention and care. There have been no studies investigating whether registers are sufficiently similar to allow data comparisons. It is also not known what differences exist that could bias analyses. Understanding this information is essential prior to any future data sharing. The aim of this project was to compare the variables collected in countrywide and intercountry burn registers to understand their similarities and differences. METHODS Register custodians were invited to participate and share their data dictionaries. Inclusion and exclusion criteria were compared to understand each register population. Descriptive statistics were calculated for the number of unique variables. Variables were classified into themes. Definition, method, timing of measurement, and response options were compared for a sample of register concepts. RESULTS 13 burn registries participated in the study. Inclusion criteria varied between registers. Median number of variables per register was 94 (range 28 - 890), of which 24% (range 4.8 - 100%) were required to be collected. Six themes (patient information, admission details, injury, inpatient, outpatient, other) and 41 subthemes were identified. Register concepts of age and timing of injury show similarities in data collection. Intent, mechanism, inhalational injury, infection, and patient death show greater variation in measurement. CONCLUSIONS We found some commonalities between registers and some differences. Commonalities would assist in any future efforts to pool and compare data between registers. Differences between registers could introduce selection and measurement bias, which needs to be addressed in any strategy aiming to facilitate burn register data sharing. We recommend the development of common data elements used in an international minimum data set for burn injuries, including standard definitions and methods of measurement, as the next step in achieving burn register data sharing.
Collapse
Affiliation(s)
- Emily Bebbington
- Centre for Mental Health and Society, Bangor University, Wrexham Academic Unit, Technology Park, Wrexham LL13 7YP, UK.
| | - Joanna Miles
- Plastic and Reconstructive Surgery Department, Norfolk and Norwich University Hospital, Colney Lane, Norwich NR4 7UY, UK
| | - Amber Young
- Bristol Centre for Surgical Research, Bristol Medical School, Department of Population Health Sciences, Canynge Hall, 39 Whatley Road, Bristol BS8 2PS, UK
| | - Margriet E van Baar
- Dutch Burn Repository R3, Association of Dutch Burn Centres, Maasstad Hospital, Maasstadweg 21, 3079 DZ Rotterdam, the Netherlands
| | - Nicole Bernal
- The Ohio State University Wexner Medical Center, 410 W 10th Ave, Columbus, OH 43235, USA; Burn Care Quality Platform, American Burn Association, 311 S. Wacker Drive, Suite 950, Chicago, IL, USA
| | - Ragnvald Ljones Brekke
- Norwegian Burn Registry, Norwegian National Burn Center, Haukeland University Hospital, Haukelandsveien 22, 5009 Bergen, Norway
| | - Lotte van Dammen
- Burn Centres Outcomes Registry The Netherlands, Dutch Burns Foundation, Zeestraat 29, 1941 AJ Beverwijk, the Netherlands
| | - Moustafa Elmasry
- Burn Unit Database, Swedish Burn Register, Department of Hand Surgery, Plastic Surgery and Burns, Linköping University, Linköping, Sweden
| | - Yoshiaki Inoue
- Japanese Burn Register, Japanese Society for Burn Injuries, Shunkosha Inc. Lambdax Building, 2-4-12 Ohkubo, Shinjuku-ku, Tokyo 169-0072, Japan
| | - Kara A McMullen
- Burn Model System, Burn Model System National Data and Statistical Center, Department of Rehabilitation Medicine, University of Washington, Box 354237, Seattle, WA 98195-4237, USA
| | - Lia Paton
- Care of Burns in Scotland, National Managed Clinical Network, NHS National Services Scotland, Gyle Square, 1 South Gyle Crescent, Edinburgh EH12 9EB, UK
| | - Oliver C Thamm
- German Burn Registry, German Society for Burn Treatment (DGV), Luisenstrasse 58-59, 10117 Berlin, Germany; University of Witten/Herdecke, Alfred-Herrenhausen-Strasse 50, 58455 Witten, Germany
| | - Lincoln M Tracy
- School of Public Health & Preventive Medicine, Monash University, 553 St Kilda Road, Melbourne, VIC 3004, Australia
| | - Nukhba Zia
- South Asia Burn Registry, Johns Hopkins International Injury Research Unit, Department of International Health, Health Systems Program, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD 21205, USA
| | - Yvonne Singer
- School of Nursing and Midwifery, Griffith University, Nathan Campus, 170 Kessels Road, Brisbane, QLD, Australia
| | - Ken Dunn
- Burn Care Informatics Group, NHS, UK
| |
Collapse
|
4
|
Abad-Navarro F, Martínez-Costa C. A knowledge graph-based data harmonization framework for secondary data reuse. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2024; 243:107918. [PMID: 37981455 DOI: 10.1016/j.cmpb.2023.107918] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/22/2023] [Revised: 10/02/2023] [Accepted: 11/05/2023] [Indexed: 11/21/2023]
Abstract
BACKGROUND AND OBJECTIVE The adoption of new technologies in clinical care systems has propitiated the availability of a great amount of valuable data. However, this data is usually heterogeneous, requiring its harmonization to be integrated and analysed. We propose a semantic-driven harmonization framework that (1) enables the meaningful sharing and integration of healthcare data across institutions and (2) facilitates the analysis and exploitation of the shared data. METHODS The framework includes an ontology-based common data model (i.e. SCDM), a data transformation pipeline and a semantic query system. Heterogeneous datasets, mapped to different terminologies, are integrated by using an ontology-based infrastructure rooted in a top-level ontology. A graph database is generated by using these mappings, and web-based semantic query system facilitates data exploration. RESULTS Several datasets from different European institutions have been integrated by using the framework in the context of the European H2020 Precise4Q project. Through the query system, data scientists were able to explore data and use it for building machine learning models. CONCLUSIONS The flexible data representation using RDF, together with the formal semantic underpinning provided by the SCDM, have enabled the semantic integration, query and advanced exploitation of heterogeneous data in the context of the Precise4Q project.
Collapse
Affiliation(s)
- Francisco Abad-Navarro
- Departamento de Informática y Sistemas, Universidad de Murcia, CEIR Campus Mare Nostrum, IMIB-Arrixaca, 30100, Murcia, Spain.
| | - Catalina Martínez-Costa
- Departamento de Informática y Sistemas, Universidad de Murcia, CEIR Campus Mare Nostrum, IMIB-Arrixaca, 30100, Murcia, Spain.
| |
Collapse
|
5
|
van de Velde NM, Krom YD, Bongers J, Hoek RJA, Ikelaar NA, van der Holst M, Naarding KJ, van den Bergen JC, Vroom E, Horemans A, Hendriksen JGM, de Groot IJM, Houwen-van Opstal SLS, Verschuuren JJGM, van Duyvenvoorde HA, Snijder RR, Niks EH. The Dutch Dystrophinopathy Database: A National Registry with Standardized Patient and Clinician Reported Real-World Data. J Neuromuscul Dis 2024; 11:1095-1109. [PMID: 39031379 PMCID: PMC11380288 DOI: 10.3233/jnd-240061] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/22/2024]
Abstract
Background Duchenne and Becker muscular dystrophy lack curative treatments. Registers can facilitate therapy development, serving as a platform to study epidemiology, assess clinical trial feasibility, identify eligible candidates, collect real-world data, perform post-market surveillance, and collaborate in (inter)national data-driven initiatives. Objective In addressing these facets, it's crucial to gather high-quality, interchangeable, and reusable data from a representative population. We introduce the Dutch Dystrophinopathy Database (DDD), a national registry for patients with DMD or BMD, and females with pathogenic DMD variants, outlining its design, governance, and use. Methods The design of DDD is based on a system-independent information model that ensures interoperable and reusable data adhering to international standards. To maximize enrollment, patients can provide consent online and participation is allowed on different levels with contact details and clinical diagnosis as minimal requirement. Participants can opt-in for yearly online questionnaires on disease milestones and medication and to have clinical data stored from visits to one of the national reference centers. Governance involves a general board, advisory board and database management. Results On November 1, 2023, 742 participants were enrolled. Self-reported data were provided by 291 Duchenne, 122 Becker and 38 female participants. 96% of the participants visiting reference centers consented to store clinical data. Eligible patients were informed about clinical studies through DDD, and multiple data requests have been approved to use coded clinical data for quality control, epidemiology and natural history studies. Conclusion The Dutch Dystrophinopathy Database captures long-term patient and high-quality standardized clinician reported healthcare data, supporting trial readiness, post-marketing surveillance, and effective data use using a multicenter design that is scalable to other neuromuscular disorders.
Collapse
Affiliation(s)
- N M van de Velde
- Department of Neurology, Leiden University Medical Center, Leiden, The Netherlands
- Duchenne Center Netherlands, Leiden, The Netherlands
| | - Y D Krom
- Department of Neurology, Leiden University Medical Center, Leiden, The Netherlands
- Duchenne Center Netherlands, Leiden, The Netherlands
| | - J Bongers
- Department of Neurology, Leiden University Medical Center, Leiden, The Netherlands
- Duchenne Center Netherlands, Leiden, The Netherlands
| | - R J A Hoek
- Department of Neurology, Leiden University Medical Center, Leiden, The Netherlands
- Duchenne Center Netherlands, Leiden, The Netherlands
| | - N A Ikelaar
- Department of Neurology, Leiden University Medical Center, Leiden, The Netherlands
- Duchenne Center Netherlands, Leiden, The Netherlands
| | - M van der Holst
- Duchenne Center Netherlands, Leiden, The Netherlands
- Department of Orthopaedics, Rehabilitation and Physiotherapy, Leiden University Medical Center, Leiden, The Netherlands
| | - K J Naarding
- Department of Neurology, Leiden University Medical Center, Leiden, The Netherlands
- Duchenne Center Netherlands, Leiden, The Netherlands
| | - J C van den Bergen
- Department of Neurology, University Medical Center Utrecht, Utrecht, The Netherlands
| | - E Vroom
- Duchenne Parent Project, Veenendaal, The Netherlands
| | - A Horemans
- Spierziekten Nederland, Baarn, The Netherlands
| | - J G M Hendriksen
- Duchenne Center Netherlands, Leiden, The Netherlands
- Kempenhaeghe Center for Neurological Learning Disabilities, Heeze, The Netherlands
| | - I J M de Groot
- Duchenne Center Netherlands, Leiden, The Netherlands
- Department of Rehabilitation, Donders Center of Neuroscience, Radboud University Nijmegen Medical Center, Nijmegen, The Netherlands
| | - S L S Houwen-van Opstal
- Duchenne Center Netherlands, Leiden, The Netherlands
- Department of Rehabilitation, Donders Center of Neuroscience, Radboud University Nijmegen Medical Center, Nijmegen, The Netherlands
| | - J J G M Verschuuren
- Department of Neurology, Leiden University Medical Center, Leiden, The Netherlands
- Duchenne Center Netherlands, Leiden, The Netherlands
| | - H A van Duyvenvoorde
- Duchenne Center Netherlands, Leiden, The Netherlands
- Department of Clinical Genetics, Leiden University Medical Center, Leiden, The Netherlands
| | - R R Snijder
- LUMC Biobank Organization, Leiden University Medical Center, Leiden, The Netherlands
| | - E H Niks
- Department of Neurology, Leiden University Medical Center, Leiden, The Netherlands
- Duchenne Center Netherlands, Leiden, The Netherlands
| |
Collapse
|
6
|
Bernabé CH, Queralt-Rosinach N, Silva Souza VE, Bonino da Silva Santos LO, Mons B, Jacobsen A, Roos M. The use of foundational ontologies in biomedical research. J Biomed Semantics 2023; 14:21. [PMID: 38082345 PMCID: PMC10712036 DOI: 10.1186/s13326-023-00300-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2022] [Accepted: 11/29/2023] [Indexed: 12/18/2023] Open
Abstract
BACKGROUND The FAIR principles recommend the use of controlled vocabularies, such as ontologies, to define data and metadata concepts. Ontologies are currently modelled following different approaches, sometimes describing conflicting definitions of the same concepts, which can affect interoperability. To cope with that, prior literature suggests organising ontologies in levels, where domain specific (low-level) ontologies are grounded in domain independent high-level ontologies (i.e., foundational ontologies). In this level-based organisation, foundational ontologies work as translators of intended meaning, thus improving interoperability. Despite their considerable acceptance in biomedical research, there are very few studies testing foundational ontologies. This paper describes a systematic literature mapping that was conducted to understand how foundational ontologies are used in biomedical research and to find empirical evidence supporting their claimed (dis)advantages. RESULTS From a set of 79 selected papers, we identified that foundational ontologies are used for several purposes: ontology construction, repair, mapping, and ontology-based data analysis. Foundational ontologies are claimed to improve interoperability, enhance reasoning, speed up ontology development and facilitate maintainability. The complexity of using foundational ontologies is the most commonly cited downside. Despite being used for several purposes, there were hardly any experiments (1 paper) testing the claims for or against the use of foundational ontologies. In the subset of 49 papers that describe the development of an ontology, it was observed a low adherence to ontology construction (16 papers) and ontology evaluation formal methods (4 papers). CONCLUSION Our findings have two main implications. First, the lack of empirical evidence about the use of foundational ontologies indicates a need for evaluating the use of such artefacts in biomedical research. Second, the low adherence to formal methods illustrates how the field could benefit from a more systematic approach when dealing with the development and evaluation of ontologies. The understanding of how foundational ontologies are used in the biomedical field can drive future research towards the improvement of ontologies and, consequently, data FAIRness. The adoption of formal methods can impact the quality and sustainability of ontologies, and reusing these methods from other fields is encouraged.
Collapse
Affiliation(s)
- César H Bernabé
- Department of Human Genetics, Leiden University Medical Center, Leiden, The Netherlands.
| | | | | | - Luiz Olavo Bonino da Silva Santos
- Department of Human Genetics, Leiden University Medical Center, Leiden, The Netherlands
- University of Twente, Enschede, The Netherlands
| | - Barend Mons
- Department of Human Genetics, Leiden University Medical Center, Leiden, The Netherlands
| | - Annika Jacobsen
- Department of Human Genetics, Leiden University Medical Center, Leiden, The Netherlands
| | - Marco Roos
- Department of Human Genetics, Leiden University Medical Center, Leiden, The Netherlands.
| |
Collapse
|
7
|
Raycheva R, Kostadinov K, Mitova E, Bogoeva N, Iskrov G, Stefanov G, Stefanov R. Challenges in mapping European rare disease databases, relevant for ML-based screening technologies in terms of organizational, FAIR and legal principles: scoping review. Front Public Health 2023; 11:1214766. [PMID: 37780450 PMCID: PMC10540868 DOI: 10.3389/fpubh.2023.1214766] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2023] [Accepted: 08/30/2023] [Indexed: 10/03/2023] Open
Abstract
Background Given the increased availability of data sources such as hospital information systems, electronic health records, and health-related registries, a novel approach is required to develop artificial intelligence-based decision support that can assist clinicians in their diagnostic decision-making and shorten rare disease patients' diagnostic odyssey. The aim is to identify key challenges in the process of mapping European rare disease databases, relevant to ML-based screening technologies in terms of organizational, FAIR and legal principles. Methods A scoping review was conducted based on the PRISMA-ScR checklist. The primary article search was conducted in three electronic databases (MEDLINE/Pubmed, Scopus, and Web of Science) and a secondary search was performed in Google scholar and on the organizations' websites. Each step of this review was carried out independently by two researchers. A charting form for relevant study analysis was developed and used to categorize data and identify data items in three domains - organizational, FAIR and legal. Results At the end of the screening process, 73 studies were eligible for review based on inclusion and exclusion criteria with more than 60% (n = 46) of the research published in the last 5 years and originated only from EU/EEA countries. Over the ten-year period (2013-2022), there is a clear cycling trend in the publications, with a peak of challenges reporting every four years. Within this trend, the following dynamic was identified: except for 2016, organizational challenges dominated the articles published up to 2018; legal challenges were the most frequently discussed topic from 2018 to 2022. The following distribution of the data items by domains was observed - (1) organizational (n = 36): data accessibility and sharing (20.2%); long-term sustainability (18.2%); governance, planning and design (17.2%); lack of harmonization and standardization (17.2%); quality of data collection (16.2%); and privacy risks and small sample size (11.1%); (2) FAIR (n = 15): findable (17.9%); accessible sustainability (25.0%); interoperable (39.3%); and reusable (17.9%); and (3) legal (n = 33): data protection by all means (34.4%); data management and ownership (22.9%); research under GDPR and member state law (20.8%); trust and transparency (13.5%); and digitalization of health (8.3%). We observed a specific pattern repeated in all domains during the process of data charting and data item identification - in addition to the outlined challenges, good practices, guidelines, and recommendations were also discussed. The proportion of publications addressing only good practices, guidelines, and recommendations for overcoming challenges when mapping RD databases in at least one domain was calculated to be 47.9% (n = 35). Conclusion Despite the opportunities provided by innovation - automation, electronic health records, hospital-based information systems, biobanks, rare disease registries and European Reference Networks - the results of the current scoping review demonstrate a diversity of the challenges that must still be addressed, with immediate actions on ensuring better governance of rare disease registries, implementing FAIR principles, and enhancing the EU legal framework.
Collapse
Affiliation(s)
- Ralitsa Raycheva
- Department of Social Medicine and Public Health, Faculty of Public Health, Medical University of Plovdiv, Plovdiv, Bulgaria
- Bulgarian Association for Promotion of Education and Science, Institute for Rare Disease, Plovdiv, Bulgaria
| | - Kostadin Kostadinov
- Department of Social Medicine and Public Health, Faculty of Public Health, Medical University of Plovdiv, Plovdiv, Bulgaria
- Bulgarian Association for Promotion of Education and Science, Institute for Rare Disease, Plovdiv, Bulgaria
| | - Elena Mitova
- Bulgarian Association for Promotion of Education and Science, Institute for Rare Disease, Plovdiv, Bulgaria
| | - Nataliya Bogoeva
- Bulgarian Association for Promotion of Education and Science, Institute for Rare Disease, Plovdiv, Bulgaria
| | - Georgi Iskrov
- Department of Social Medicine and Public Health, Faculty of Public Health, Medical University of Plovdiv, Plovdiv, Bulgaria
- Bulgarian Association for Promotion of Education and Science, Institute for Rare Disease, Plovdiv, Bulgaria
| | - Georgi Stefanov
- Bulgarian Association for Promotion of Education and Science, Institute for Rare Disease, Plovdiv, Bulgaria
| | - Rumen Stefanov
- Department of Social Medicine and Public Health, Faculty of Public Health, Medical University of Plovdiv, Plovdiv, Bulgaria
- Bulgarian Association for Promotion of Education and Science, Institute for Rare Disease, Plovdiv, Bulgaria
| |
Collapse
|
8
|
Hedley V, Bolz-Johnson M, Hernando I, Kenward R, Nabbout R, Romero C, Schaefer F, Upadhyaya S. Together4RD position statement on collaboration between European reference networks and industry. Orphanet J Rare Dis 2023; 18:272. [PMID: 37670358 PMCID: PMC10478454 DOI: 10.1186/s13023-023-02853-9] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2023] [Accepted: 08/08/2023] [Indexed: 09/07/2023] Open
Abstract
Notwithstanding two decades of policy and legislation in Europe, aimed to foster research and development in rare conditions, only 5-6% of rare diseases have dedicated treatments. Given with the huge number of conditions classed as rare (which is increasing all the time), this equates to major unmet need for patients (over 30 million in the EU alone). Worryingly, the pace of Research and Innovation in Europe is lagging behind other regions of the world, and a seismic shift in the way in which research is planned and delivered is required, in order to remain competitive and-most importantly-bring meaningful, disease-altering treatments to those who desperately need them. The European Reference Networks (ERNs), launched in 2017, hold major potential to alleviate many of these challenges, and more, but only if adequately supported (financially, technically, and via robust policies and infrastructure) to realise that potential: and even then, only if able to forge robust collaborations harnessing the expertise, resources, knowledge and data of all stakeholders involved in rare disease, including Industry. To-date, however, ERN-Industry interactions have been largely limited, for a range of reasons (concerning barriers both tangible and perceived). This Position Statement analyses these barriers, and explains how Together4RD is seeking to move the needle here, by learning from case studies, exploring frameworks for collaboration, and launching pilots to explore how best to plan and deliver multistakeholder interactions addressing real research needs.
Collapse
Affiliation(s)
| | | | | | | | - Rima Nabbout
- Pediatric Neurology Department, Hôpital Necker Enfants Malades, APHP, Universite Paris Cité, Institut Imagine, Paris, France
| | | | - Franz Schaefer
- Center For Pediatrics and Adolescent Medicine, University of Heidelberg, Heidelberg, Germany
| | | |
Collapse
|
9
|
Charlet J, Cui L. Knowledge Representation and Management 2022: Findings in Ontology Development and Applications. Yearb Med Inform 2023; 32:225-229. [PMID: 38147864 PMCID: PMC10751114 DOI: 10.1055/s-0043-1768747] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2023] Open
Abstract
OBJECTIVES To select, present, and summarize the best papers in 2022 for the Knowledge Representation and Management (KRM) section of the International Medical Informatics Association (IMIA) Yearbook. METHODS We conducted PubMed queries and followed the IMIA Yearbook guidelines for performing biomedical informatics literature review to select the best papers in KRM published in 2022. RESULTS We retrieved 1,847 publications from PubMed. We nominated 15 candidate best papers, and two of them were finally selected as the best papers in the KRM section. The topics covered by the candidate papers include ontology and knowledge graph creation, ontology applications, ontology quality assurance, ontology mapping standard, and conceptual model. CONCLUSIONS In the KRM best paper selection for 2022, the candidate best papers encompassed a broad range of topics, with ontology and knowledge graph creation remaining a considerable research focus.
Collapse
Affiliation(s)
- Jean Charlet
- Sorbonne Université, INSERM, Univ Sorbonne Paris Nord, LIMICS, Paris, France
- AP-HP, DRCI, Paris, France
| | - Licong Cui
- McWilliams School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, USA
| | | |
Collapse
|
10
|
Kreuzthaler M, Brochhausen M, Zayas C, Blobel B, Schulz S. Linguistic and ontological challenges of multiple domains contributing to transformed health ecosystems. Front Med (Lausanne) 2023; 10:1073313. [PMID: 37007792 PMCID: PMC10050682 DOI: 10.3389/fmed.2023.1073313] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2022] [Accepted: 02/13/2023] [Indexed: 03/17/2023] Open
Abstract
This paper provides an overview of current linguistic and ontological challenges which have to be met in order to provide full support to the transformation of health ecosystems in order to meet precision medicine (5 PM) standards. It highlights both standardization and interoperability aspects regarding formal, controlled representations of clinical and research data, requirements for smart support to produce and encode content in a way that humans and machines can understand and process it. Starting from the current text-centered communication practices in healthcare and biomedical research, it addresses the state of the art in information extraction using natural language processing (NLP). An important aspect of the language-centered perspective of managing health data is the integration of heterogeneous data sources, employing different natural languages and different terminologies. This is where biomedical ontologies, in the sense of formal, interchangeable representations of types of domain entities come into play. The paper discusses the state of the art of biomedical ontologies, addresses their importance for standardization and interoperability and sheds light to current misconceptions and shortcomings. Finally, the paper points out next steps and possible synergies of both the field of NLP and the area of Applied Ontology and Semantic Web to foster data interoperability for 5 PM.
Collapse
Affiliation(s)
- Markus Kreuzthaler
- Institute for Medical Informatics, Statistics and Documentation, Medical University of Graz, Graz, Austria
| | - Mathias Brochhausen
- Department of Biomedical Informatics, University of Arkansas for Medical Sciences, Little Rock, AR, United States
| | - Cilia Zayas
- Department of Biomedical Informatics, University of Arkansas for Medical Sciences, Little Rock, AR, United States
| | - Bernd Blobel
- Medical Faculty, University of Regensburg, Regensburg, Germany
- eHealth Competence Center Bavaria, Deggendorf Institute of Technology, Deggendorf, Germany
- First Medical Faculty, Charles University Prague, Prague, Czechia
| | - Stefan Schulz
- Institute for Medical Informatics, Statistics and Documentation, Medical University of Graz, Graz, Austria
- Averbis GmbH, Freiburg, Germany
- *Correspondence: Stefan Schulz,
| |
Collapse
|
11
|
Gelain E, Tesi M, Mazzariol M, Vaglio A. Registries of rare diseases: current knowledge and future perspectives. Intern Emerg Med 2023; 18:19-21. [PMID: 36401715 DOI: 10.1007/s11739-022-03151-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/03/2022] [Accepted: 11/02/2022] [Indexed: 11/21/2022]
Affiliation(s)
- Elena Gelain
- Nephrology and Dialysis Unit, Meyer Children's Hospital, Firenze, Italy
| | - Michelangelo Tesi
- Nephrology and Dialysis Unit, Meyer Children's Hospital, Firenze, Italy
| | - Martina Mazzariol
- Nephrology and Dialysis Unit, Meyer Children's Hospital, Firenze, Italy
- Department of Medical Sciences, University of Torino, Torino, Italy
| | - Augusto Vaglio
- Nephrology and Dialysis Unit, Meyer Children's Hospital, Firenze, Italy.
- Department of Biomedical, Clinical and Experimental Sciences "Mario Serio", University of Firenze, Firenze, Italy.
| |
Collapse
|
12
|
Abstract
BACKGROUND One Digital Health (ODH) aims to propose a framework that merges One Health's and Digital Health's specific features into an innovative landscape. FAIR (Findable, Accessible, Interoperable, and Reusable) principles consider applications and computational agents (or, in other terms, data, metadata, and infrastructures) as stakeholders with the capacity to find, access, interoperate, and reuse data with none or minimal human intervention. OBJECTIVES This paper aims to elicit how the ODH framework is compliant with FAIR principles and metrics, providing some thinking guide to investigate and define whether adapted metrics need to be figured out for an effective ODH Intervention setup. METHODS An integrative analysis of the literature was conducted to extract instances of the need-or of the eventual already existing deployment-of FAIR principles, for each of the three layers (keys, perspectives and dimensions) of the ODH framework. The scope was to assess the extent of scatteredness in pursuing the many facets of FAIRness, descending from the lack of a unifying and balanced framework. RESULTS A first attempt to interpret the different technological components existing in the different layers of the ODH framework, in the light of the FAIR principles, was conducted. Although the mature and working examples of workflows for data FAIRification processes currently retrievable in the literature provided a robust ground to work on, a nonsuitable capacity to fully assess FAIR aspects for highly interconnected scenarios, which the ODH-based ones are, has emerged. Rooms for improvement are anyway possible to timely deal with all the underlying features of topics like the delivery of health care in a syndemic scenario, the digital transformation of human and animal health data, or the digital nature conservation through digital technology-based intervention. CONCLUSIONS ODH pillars account for the availability (findability, accessibility) of human, animal, and environmental data allowing a unified understanding of complex interactions (interoperability) over time (reusability). A vision of integration between these two worlds, under the vest of ODH Interventions featuring FAIRness characteristics, toward the development of a systemic lookup of health and ecology in a digitalized way, is therefore auspicable.
Collapse
Affiliation(s)
- Oscar Tamburis
- Institute of Biostructures and Bioimaging, National Research Council of Italy, Naples, Italy
| | - Arriel Benis
- Faculty of Industrial Engineering and Technology Management, Holon Institute of Technology, Holon, Israel,Faculty of Digital Medical Technologies, Holon Institute of Technology, Holon, Israel,Address for correspondence Arriel Benis, PhD Faculty of Industrial Engineering and Technology Management, Holon Institute of TechnologyGolomb St 52, PoB 305, HolonIsrael
| |
Collapse
|