1
|
Sulaieva O, Dudin O, Koshyk O, Panko M, Kobyliak N. Digital pathology implementation in cancer diagnostics: towards informed decision-making. Front Digit Health 2024; 6:1358305. [PMID: 38873358 PMCID: PMC11169727 DOI: 10.3389/fdgth.2024.1358305] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2023] [Accepted: 05/16/2024] [Indexed: 06/15/2024] Open
Abstract
Digital pathology (DP) has become a part of the cancer healthcare system, creating additional value for cancer patients. DP implementation in clinical practice provides plenty of benefits but also harbors hidden ethical challenges affecting physician-patient relationships. This paper addresses the ethical obligation to transform the physician-patient relationship for informed and responsible decision-making when using artificial intelligence (AI)-based tools for cancer diagnostics. DP application allows to improve the performance of the Human-AI Team shifting focus from AI challenges towards the Augmented Human Intelligence (AHI) benefits. AHI enhances analytical sensitivity and empowers pathologists to deliver accurate diagnoses and assess predictive biomarkers for further personalized treatment of cancer patients. At the same time, patients' right to know about using AI tools, their accuracy, strengths and limitations, measures for privacy protection, acceptance of privacy concerns and legal protection defines the duty of physicians to provide the relevant information about AHI-based solutions to patients and the community for building transparency, understanding and trust, respecting patients' autonomy and empowering informed decision-making in oncology.
Collapse
Affiliation(s)
- Oksana Sulaieva
- Medical LaboratoryCSD, Kyiv, Ukraine
- Endocrinology Department, Bogomolets National Medical University, Kyiv, Ukraine
| | | | | | | | - Nazarii Kobyliak
- Medical LaboratoryCSD, Kyiv, Ukraine
- Endocrinology Department, Bogomolets National Medical University, Kyiv, Ukraine
| |
Collapse
|
2
|
Rujano MA, Boiten JW, Ohmann C, Canham S, Contrino S, David R, Ewbank J, Filippone C, Connellan C, Custers I, van Nuland R, Mayrhofer MT, Holub P, Álvarez EG, Bacry E, Hughes N, Freeberg MA, Schaffhauser B, Wagener H, Sánchez-Pla A, Bertolini G, Panagiotopoulou M. Sharing sensitive data in life sciences: an overview of centralized and federated approaches. Brief Bioinform 2024; 25:bbae262. [PMID: 38836701 DOI: 10.1093/bib/bbae262] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2024] [Revised: 04/19/2024] [Indexed: 06/06/2024] Open
Abstract
Biomedical data are generated and collected from various sources, including medical imaging, laboratory tests and genome sequencing. Sharing these data for research can help address unmet health needs, contribute to scientific breakthroughs, accelerate the development of more effective treatments and inform public health policy. Due to the potential sensitivity of such data, however, privacy concerns have led to policies that restrict data sharing. In addition, sharing sensitive data requires a secure and robust infrastructure with appropriate storage solutions. Here, we examine and compare the centralized and federated data sharing models through the prism of five large-scale and real-world use cases of strategic significance within the European data sharing landscape: the French Health Data Hub, the BBMRI-ERIC Colorectal Cancer Cohort, the federated European Genome-phenome Archive, the Observational Medical Outcomes Partnership/OHDSI network and the EBRAINS Medical Informatics Platform. Our analysis indicates that centralized models facilitate data linkage, harmonization and interoperability, while federated models facilitate scaling up and legal compliance, as the data typically reside on the data generator's premises, allowing for better control of how data are shared. This comparative study thus offers guidance on the selection of the most appropriate sharing strategy for sensitive datasets and provides key insights for informed decision-making in data sharing efforts.
Collapse
Affiliation(s)
- Maria A Rujano
- European Clinical Research Infrastructure Network (ECRIN), Boulevard Saint Jacques 30, 75014, Paris, France
| | - Jan-Willem Boiten
- Foundation Lygature, Jaarbeursplein 6, 3521 AL, Utrecht, The Netherlands
| | - Christian Ohmann
- European Clinical Research Infrastructure Network (ECRIN), Boulevard Saint Jacques 30, 75014, Paris, France
| | - Steve Canham
- European Clinical Research Infrastructure Network (ECRIN), Boulevard Saint Jacques 30, 75014, Paris, France
| | - Sergio Contrino
- European Clinical Research Infrastructure Network (ECRIN), Boulevard Saint Jacques 30, 75014, Paris, France
| | - Romain David
- European Research Infrastructure on Highly Pathogenic Agents (ERINHA AISBL), rue du Trône 98/Boîte 4B, 1050, Brussels, Belgium
| | - Jonathan Ewbank
- European Research Infrastructure on Highly Pathogenic Agents (ERINHA AISBL), rue du Trône 98/Boîte 4B, 1050, Brussels, Belgium
| | - Claudia Filippone
- European Research Infrastructure on Highly Pathogenic Agents (ERINHA AISBL), rue du Trône 98/Boîte 4B, 1050, Brussels, Belgium
| | - Claire Connellan
- European Research Infrastructure on Highly Pathogenic Agents (ERINHA AISBL), rue du Trône 98/Boîte 4B, 1050, Brussels, Belgium
| | - Ilse Custers
- Foundation Lygature, Jaarbeursplein 6, 3521 AL, Utrecht, The Netherlands
| | - Rick van Nuland
- Foundation Lygature, Jaarbeursplein 6, 3521 AL, Utrecht, The Netherlands
| | - Michaela Th Mayrhofer
- Biobanking and Biomolecular Resources Research Infrastructure (BBMRI-ERIC), Neue Stiftingtalstrasse 2/B/6, 8010, Graz, Austria
| | - Petr Holub
- Biobanking and Biomolecular Resources Research Infrastructure (BBMRI-ERIC), Neue Stiftingtalstrasse 2/B/6, 8010, Graz, Austria
| | - Eva García Álvarez
- Biobanking and Biomolecular Resources Research Infrastructure (BBMRI-ERIC), Neue Stiftingtalstrasse 2/B/6, 8010, Graz, Austria
| | - Emmanuel Bacry
- Health Data Hub (HDH), rue Georges Pitard 9, 75015, Paris, France
| | - Nigel Hughes
- Janssen Research and Development, Antwerpseweg 15, 2340, Beerse, Belgium
| | - Mallory A Freeberg
- European Molecular Biology Laboratory (EMBL), European Bioinformatics Institute (EBI), Wellcome Genome Campus, CB10 1SD, Hinxton, Cambridgeshire, United Kingdom
| | - Birgit Schaffhauser
- Department of Clinical Neurosciences, Centre Hospitalier Universitaire Vaudois (CHUV), Rue du Bugnon 21, 1011, Lausanne, Switzerland
| | - Harald Wagener
- Center for Digital Health, BIH@Charité University Medicine, Anna-Louisa-Karsch-Straße 2, 10178, Berlin, Germany
| | - Alex Sánchez-Pla
- Department of Genetics, Microbiology and Statistics, Universitat de Barcelona, Diagonal 643, 08028, Barcelona, Spain
| | - Guido Bertolini
- Laboratory of Clinical Epidemiology, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Via GB Camozzi 3, 24020, Ranica (Bergamo), Italy
| | - Maria Panagiotopoulou
- European Clinical Research Infrastructure Network (ECRIN), Boulevard Saint Jacques 30, 75014, Paris, France
| |
Collapse
|
3
|
Brancato V, Esposito G, Coppola L, Cavaliere C, Mirabelli P, Scapicchio C, Borgheresi R, Neri E, Salvatore M, Aiello M. Standardizing digital biobanks: integrating imaging, genomic, and clinical data for precision medicine. J Transl Med 2024; 22:136. [PMID: 38317237 PMCID: PMC10845786 DOI: 10.1186/s12967-024-04891-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2023] [Accepted: 01/14/2024] [Indexed: 02/07/2024] Open
Abstract
Advancements in data acquisition and computational methods are generating a large amount of heterogeneous biomedical data from diagnostic domains such as clinical imaging, pathology, and next-generation sequencing (NGS), which help characterize individual differences in patients. However, this information needs to be available and suitable to promote and support scientific research and technological development, supporting the effective adoption of the precision medicine approach in clinical practice. Digital biobanks can catalyze this process, facilitating the sharing of curated and standardized imaging data, clinical, pathological and molecular data, crucial to enable the development of a comprehensive and personalized data-driven diagnostic approach in disease management and fostering the development of computational predictive models. This work aims to frame this perspective, first by evaluating the state of standardization of individual diagnostic domains and then by identifying challenges and proposing a possible solution towards an integrative approach that can guarantee the suitability of information that can be shared through a digital biobank. Our analysis of the state of the art shows the presence and use of reference standards in biobanks and, generally, digital repositories for each specific domain. Despite this, standardization to guarantee the integration and reproducibility of the numerical descriptors generated by each domain, e.g. radiomic, pathomic and -omic features, is still an open challenge. Based on specific use cases and scenarios, an integration model, based on the JSON format, is proposed that can help address this problem. Ultimately, this work shows how, with specific standardization and promotion efforts, the digital biobank model can become an enabling technology for the comprehensive study of diseases and the effective development of data-driven technologies at the service of precision medicine.
Collapse
Affiliation(s)
| | - Giuseppina Esposito
- Bio Check Up S.R.L, 80121, Naples, Italy
- Department of Advanced Biomedical Sciences, University of Naples Federico II, 80131, Naples, Italy
| | | | | | - Peppino Mirabelli
- UOS Laboratori di Ricerca e Biobanca, AORN Santobono-Pausilipon, Via Teresa Ravaschieri, 8, 80122, Naples, Italy
| | - Camilla Scapicchio
- Academic Radiology, Department of Translational Research, University of Pisa, via Roma, 67, 56126, Pisa, Italy
| | - Rita Borgheresi
- Academic Radiology, Department of Translational Research, University of Pisa, via Roma, 67, 56126, Pisa, Italy
| | - Emanuele Neri
- Academic Radiology, Department of Translational Research, University of Pisa, via Roma, 67, 56126, Pisa, Italy
| | | | | |
Collapse
|
4
|
Wittner R, Holub P, Mascia C, Frexia F, Müller H, Plass M, Allocca C, Betsou F, Burdett T, Cancio I, Chapman A, Chapman M, Courtot M, Curcin V, Eder J, Elliot M, Exter K, Goble C, Golebiewski M, Kisler B, Kremer A, Leo S, Lin‐Gibson S, Marsano A, Mattavelli M, Moore J, Nakae H, Perseil I, Salman A, Sluka J, Soiland‐Reyes S, Strambio‐De‐Castillia C, Sussman M, Swedlow JR, Zatloukal K, Geiger J. Toward a common standard for data and specimen provenance in life sciences. Learn Health Syst 2024; 8:e10365. [PMID: 38249839 PMCID: PMC10797572 DOI: 10.1002/lrh2.10365] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2022] [Revised: 03/17/2023] [Accepted: 03/24/2023] [Indexed: 01/23/2024] Open
Abstract
Open and practical exchange, dissemination, and reuse of specimens and data have become a fundamental requirement for life sciences research. The quality of the data obtained and thus the findings and knowledge derived is thus significantly influenced by the quality of the samples, the experimental methods, and the data analysis. Therefore, a comprehensive and precise documentation of the pre-analytical conditions, the analytical procedures, and the data processing are essential to be able to assess the validity of the research results. With the increasing importance of the exchange, reuse, and sharing of data and samples, procedures are required that enable cross-organizational documentation, traceability, and non-repudiation. At present, this information on the provenance of samples and data is mostly either sparse, incomplete, or incoherent. Since there is no uniform framework, this information is usually only provided within the organization and not interoperably. At the same time, the collection and sharing of biological and environmental specimens increasingly require definition and documentation of benefit sharing and compliance to regulatory requirements rather than consideration of pure scientific needs. In this publication, we present an ongoing standardization effort to provide trustworthy machine-actionable documentation of the data lineage and specimens. We would like to invite experts from the biotechnology and biomedical fields to further contribute to the standard.
Collapse
Affiliation(s)
- Rudolf Wittner
- BBMRI‐ERICGrazAustria
- Institute of Computer Science & Faculty of InformaticsMasaryk UniversityBrnoCzechia
| | - Petr Holub
- BBMRI‐ERICGrazAustria
- Institute of Computer Science & Faculty of InformaticsMasaryk UniversityBrnoCzechia
| | - Cecilia Mascia
- CRS4—Center for Advanced StudiesResearch and Development in SardiniaPulaItaly
| | - Francesca Frexia
- CRS4—Center for Advanced StudiesResearch and Development in SardiniaPulaItaly
| | | | | | - Clare Allocca
- National Institute of Standards and TechnologyGaithersburgMarylandUSA
| | - Fay Betsou
- Biological Resource Center of Institut Pasteur (CRBIP)ParisFrance
| | - Tony Burdett
- EMBL's European Bioinformatics Institute (EMBL‐EBI)CambridgeUK
| | - Ibon Cancio
- Plentzia Marine Station (PiE‐UPV/EHU)University of the Basque Country, EMBRC‐SpainBilbaoSpain
| | | | | | | | | | | | - Mark Elliot
- Department of Social Statistics, School of Social SciencesUniversity of ManchesterManchesterUK
| | - Katrina Exter
- Flanders Marine Institute (VLIZ), EMBRC‐BelgiumOstendBelgium
| | - Carole Goble
- Department of Computer ScienceUniversity of ManchesterManchesterUK
| | - Martin Golebiewski
- Heidelberg Institute for Theoretical Studies (HITS gGmbH)HeidelbergGermany
| | | | | | - Simone Leo
- CRS4—Center for Advanced StudiesResearch and Development in SardiniaPulaItaly
| | | | - Anna Marsano
- Department of BiomedicineUniversity of BaselBaselSwitzerland
| | - Marco Mattavelli
- SCI‐STI‐MMÉcole Politechnique Fédérale de LausanneLausanneSwitzerland
| | - Josh Moore
- Centre for Gene Regulation and Expression and Division of Computational Biology, School of Life SciencesUniversity of DundeeDundeeUK
- German BioImaging–Gesellschaft für Mikroskopie und Bildanalyse e.V.KonstanzGermany
| | - Hiroki Nakae
- Japan bio‐Measurement and Analysis ConsortiumTokyoJapan
| | - Isabelle Perseil
- INSERM–Institut National de la Sante et de la Recherche MedicaleParisFrance
| | - Ayat Salman
- Standards Council of CanadaOttawaOntarioCanada
- Canadian Primary Care Sentinel Surveillance Network (CPCSSN) Department of Family MedicineQueen's UniversityKingstonOntarioCanada
| | - James Sluka
- Biocomplexity InstituteIndiana UniversityBloomingtonIndianaUSA
| | - Stian Soiland‐Reyes
- Department of Computer ScienceUniversity of ManchesterManchesterUK
- Informatics InstituteUniversity of AmsterdamAmsterdamThe Netherlands
| | | | - Michael Sussman
- US Department of AgricultureWashingtonDistrict of ColumbiaUSA
| | - Jason R. Swedlow
- Centre for Gene Regulation and Expression and Division of Computational Biology, School of Life SciencesUniversity of DundeeDundeeUK
| | | | - Jörg Geiger
- Interdisciplinary Bank of Biomaterials and Data Würzburg (ibdw)WürzburgGermany
| |
Collapse
|
5
|
Müller H, Lopes-Dias C, Holub P, Plass M, Jungwirth E, Reihs R, Torke PR, Malatras A, Berger A, Coombs H, Dillner J, Merino-Martinez R. BIBBOX, a FAIR toolbox and App Store for life science research. N Biotechnol 2023; 77:12-19. [PMID: 37295722 DOI: 10.1016/j.nbt.2023.06.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2023] [Revised: 06/05/2023] [Accepted: 06/06/2023] [Indexed: 06/12/2023]
Abstract
Data quality has recently become a critical topic for the research community. European guidelines recommend that scientific data should be made FAIR: findable, accessible, interoperable and reusable. However, as FAIR guidelines do not specify how the stated principles should be implemented, it might not be straightforward for researchers to know how actually to make their data FAIR. This can prevent life-science researchers from sharing their datasets and pipelines, ultimately hindering the progress of research. To address this difficulty, we developed the BIBBOX, which is a platform that supports researchers publishing their datasets and the associated software in a FAIR manner.
Collapse
Affiliation(s)
- Heimo Müller
- Medical University of Graz, Neue Stiftingtalstraße 6, A-8010 Graz, Austria.
| | | | - Petr Holub
- BBMRI-ERIC, Neue Stiftingtalstraße 2/B/6, A-8010 Graz, Austria
| | - Markus Plass
- Medical University of Graz, Neue Stiftingtalstraße 6, A-8010 Graz, Austria
| | - Emilian Jungwirth
- Medical University of Graz, Neue Stiftingtalstraße 6, A-8010 Graz, Austria
| | - Robert Reihs
- Medical University of Graz, Neue Stiftingtalstraße 6, A-8010 Graz, Austria
| | - Paul R Torke
- Medical University of Graz, Neue Stiftingtalstraße 6, A-8010 Graz, Austria
| | | | - Anouk Berger
- International Agency for Research on Cancer (IARC), 25 avenue Tony Garnier, 69366 Lyon, France
| | - Heather Coombs
- International Agency for Research on Cancer (IARC), 25 avenue Tony Garnier, 69366 Lyon, France
| | - Joakim Dillner
- Karolinska Institutet, Alfred Nobels Allé 8, 14152 Huddinge, Sweden
| | | |
Collapse
|
6
|
Giri J, Pezzi L, Cachay R, Gèlvez Ramirez RM, Tami A, Bethencourt S, Lozano A, Gotuzzo Herencia JE, Poje J, Jaenisch T, Chu M. Specimen sharing for epidemic preparedness: Building a virtual biorepository system from local governance to global partnerships. PLOS GLOBAL PUBLIC HEALTH 2023; 3:e0001568. [PMID: 37819913 PMCID: PMC10566708 DOI: 10.1371/journal.pgph.0001568] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/14/2023] [Accepted: 09/03/2023] [Indexed: 10/13/2023]
Abstract
We present a framework for a federated, virtual biorepository system (VBS) with locally collected and managed specimens, as a 'global public good' model based on principles of equitable access and benefit sharing. The VBS is intended to facilitate timely access to biological specimens and associated data for outbreak-prone infectious diseases to accelerate the development and evaluation of diagnostics, assess vaccine efficacy, and to support surveillance and research needs. The VBS is aimed to be aligned with the WHO BioHub and other specimen sharing efforts as a force multiplier to meet the needs of strengthening global tools for countering epidemics. The purpose of our initial research is to lay the basis of the collaboration, management and principles of equitable sharing focused on low- and middle-income country partners. Here we report on surveys and interviews undertaken with biorepository-interested parties to better understand needs and barriers for specimen access and share examples from the ZIKAlliance partnership on the governance and operations of locally organized biorepositories.
Collapse
Affiliation(s)
- Judith Giri
- Center for Global Health, Colorado School of Public Health, Anschutz Medical Center, Aurora, Colorado, United States of America
| | - Laura Pezzi
- Unité des Virus Émergents (UVE: Aix-Marseille Univ-IRD 190-Inserm 1207), Marseille, France
| | - Rodrigo Cachay
- Instituto de Medicina Tropical Alexander von Humboldt, Universidad Peruana Cayetano Heredia, San Martín de Porres, Lima, Peru
| | | | - Adriana Tami
- Facultad de Ciencias de la Salud, Departamento de Parasitología, Universidad de Carabobo, Valencia, Venezuela
| | - Sarah Bethencourt
- Departamento de Estudios Clínicos-Department of Clinical Studies, Universidad de Carabobo, Valencia, Venezuela
| | - Anyela Lozano
- Centro de Investigaciones Epidemiológicas, Universidad Industrial de Santander, Bucamaranga, Colombia
| | - José Eduardo Gotuzzo Herencia
- Instituto de Medicina Tropical Alexander von Humboldt, Universidad Peruana Cayetano Heredia, San Martín de Porres, Lima, Peru
| | - Julia Poje
- Center for Global Health, Colorado School of Public Health, Anschutz Medical Center, Aurora, Colorado, United States of America
| | - Thomas Jaenisch
- Center for Global Health, Colorado School of Public Health, Anschutz Medical Center, Aurora, Colorado, United States of America
| | - May Chu
- Center for Global Health, Colorado School of Public Health, Anschutz Medical Center, Aurora, Colorado, United States of America
| |
Collapse
|
7
|
Inau ET, Sack J, Waltemath D, Zeleke AA. Initiatives, Concepts, and Implementation Practices of the Findable, Accessible, Interoperable, and Reusable Data Principles in Health Data Stewardship: Scoping Review. J Med Internet Res 2023; 25:e45013. [PMID: 37639292 PMCID: PMC10495848 DOI: 10.2196/45013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2022] [Revised: 03/25/2023] [Accepted: 04/14/2023] [Indexed: 08/29/2023] Open
Abstract
BACKGROUND Thorough data stewardship is a key enabler of comprehensive health research. Processes such as data collection, storage, access, sharing, and analytics require researchers to follow elaborate data management strategies properly and consistently. Studies have shown that findable, accessible, interoperable, and reusable (FAIR) data leads to improved data sharing in different scientific domains. OBJECTIVE This scoping review identifies and discusses concepts, approaches, implementation experiences, and lessons learned in FAIR initiatives in health research data. METHODS The Arksey and O'Malley stage-based methodological framework for scoping reviews was applied. PubMed, Web of Science, and Google Scholar were searched to access relevant publications. Articles written in English, published between 2014 and 2020, and addressing FAIR concepts or practices in the health domain were included. The 3 data sources were deduplicated using a reference management software. In total, 2 independent authors reviewed the eligibility of each article based on defined inclusion and exclusion criteria. A charting tool was used to extract information from the full-text papers. The results were reported using the PRISMA-ScR (Preferred Reporting Items for Systematic Reviews and Meta-Analyses extension for Scoping Reviews) guidelines. RESULTS A total of 2.18% (34/1561) of the screened articles were included in the final review. The authors reported FAIRification approaches, which include interpolation, inclusion of comprehensive data dictionaries, repository design, semantic interoperability, ontologies, data quality, linked data, and requirement gathering for FAIRification tools. Challenges and mitigation strategies associated with FAIRification, such as high setup costs, data politics, technical and administrative issues, privacy concerns, and difficulties encountered in sharing health data despite its sensitive nature were also reported. We found various workflows, tools, and infrastructures designed by different groups worldwide to facilitate the FAIRification of health research data. We also uncovered a wide range of problems and questions that researchers are trying to address by using the different workflows, tools, and infrastructures. Although the concept of FAIR data stewardship in the health research domain is relatively new, almost all continents have been reached by at least one network trying to achieve health data FAIRness. Documented outcomes of FAIRification efforts include peer-reviewed publications, improved data sharing, facilitated data reuse, return on investment, and new treatments. Successful FAIRification of data has informed the management and prognosis of various diseases such as cancer, cardiovascular diseases, and neurological diseases. Efforts to FAIRify data on a wider variety of diseases have been ongoing since the COVID-19 pandemic. CONCLUSIONS This work summarises projects, tools, and workflows for the FAIRification of health research data. The comprehensive review shows that implementing the FAIR concept in health data stewardship carries the promise of improved research data management and transparency in the era of big data and open research publishing. INTERNATIONAL REGISTERED REPORT IDENTIFIER (IRRID) RR2-10.2196/22505.
Collapse
Affiliation(s)
- Esther Thea Inau
- Department of Medical Informatics, Institute for Community Medicine, University Medicine Greifswald, Greifswald, Germany
| | - Jean Sack
- International Health Department, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, United States
| | - Dagmar Waltemath
- Department of Medical Informatics, Institute for Community Medicine, University Medicine Greifswald, Greifswald, Germany
| | - Atinkut Alamirrew Zeleke
- Department of Medical Informatics, Institute for Community Medicine, University Medicine Greifswald, Greifswald, Germany
| |
Collapse
|
8
|
Wolfien M, Ahmadi N, Fitzer K, Grummt S, Heine KL, Jung IC, Krefting D, Kühn A, Peng Y, Reinecke I, Scheel J, Schmidt T, Schmücker P, Schüttler C, Waltemath D, Zoch M, Sedlmayr M. Ten Topics to Get Started in Medical Informatics Research. J Med Internet Res 2023; 25:e45948. [PMID: 37486754 PMCID: PMC10407648 DOI: 10.2196/45948] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2023] [Revised: 03/29/2023] [Accepted: 04/11/2023] [Indexed: 07/25/2023] Open
Abstract
The vast and heterogeneous data being constantly generated in clinics can provide great wealth for patients and research alike. The quickly evolving field of medical informatics research has contributed numerous concepts, algorithms, and standards to facilitate this development. However, these difficult relationships, complex terminologies, and multiple implementations can present obstacles for people who want to get active in the field. With a particular focus on medical informatics research conducted in Germany, we present in our Viewpoint a set of 10 important topics to improve the overall interdisciplinary communication between different stakeholders (eg, physicians, computational experts, experimentalists, students, patient representatives). This may lower the barriers to entry and offer a starting point for collaborations at different levels. The suggested topics are briefly introduced, then general best practice guidance is given, and further resources for in-depth reading or hands-on tutorials are recommended. In addition, the topics are set to cover current aspects and open research gaps of the medical informatics domain, including data regulations and concepts; data harmonization and processing; and data evaluation, visualization, and dissemination. In addition, we give an example on how these topics can be integrated in a medical informatics curriculum for higher education. By recognizing these topics, readers will be able to (1) set clinical and research data into the context of medical informatics, understanding what is possible to achieve with data or how data should be handled in terms of data privacy and storage; (2) distinguish current interoperability standards and obtain first insights into the processes leading to effective data transfer and analysis; and (3) value the use of newly developed technical approaches to utilize the full potential of clinical data.
Collapse
Affiliation(s)
- Markus Wolfien
- Institute for Medical Informatics and Biometry, Faculty of Medicine Carl Gustav Carus, Technische Universität Dresden, Dresden, Germany
- Center for Scalable Data Analytics and Artificial Intelligence, Dresden, Germany
| | - Najia Ahmadi
- Institute for Medical Informatics and Biometry, Faculty of Medicine Carl Gustav Carus, Technische Universität Dresden, Dresden, Germany
| | - Kai Fitzer
- Core Unit Data Integration Center, University Medicine Greifswald, Greifswald, Germany
| | - Sophia Grummt
- Institute for Medical Informatics and Biometry, Faculty of Medicine Carl Gustav Carus, Technische Universität Dresden, Dresden, Germany
| | - Kilian-Ludwig Heine
- Institute for Medical Informatics and Biometry, Faculty of Medicine Carl Gustav Carus, Technische Universität Dresden, Dresden, Germany
| | - Ian-C Jung
- Institute for Medical Informatics and Biometry, Faculty of Medicine Carl Gustav Carus, Technische Universität Dresden, Dresden, Germany
| | - Dagmar Krefting
- Department of Medical Informatics, University Medical Center, Goettingen, Germany
| | - Andreas Kühn
- Institute for Medical Informatics and Biometry, Faculty of Medicine Carl Gustav Carus, Technische Universität Dresden, Dresden, Germany
| | - Yuan Peng
- Institute for Medical Informatics and Biometry, Faculty of Medicine Carl Gustav Carus, Technische Universität Dresden, Dresden, Germany
| | - Ines Reinecke
- Institute for Medical Informatics and Biometry, Faculty of Medicine Carl Gustav Carus, Technische Universität Dresden, Dresden, Germany
| | - Julia Scheel
- Department of Systems Biology and Bioinformatics, University of Rostock, Rostock, Germany
| | - Tobias Schmidt
- Institute for Medical Informatics, University of Applied Sciences Mannheim, Mannheim, Germany
| | - Paul Schmücker
- Institute for Medical Informatics, University of Applied Sciences Mannheim, Mannheim, Germany
| | - Christina Schüttler
- Central Biobank Erlangen, University Hospital Erlangen, Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen, Germany
| | - Dagmar Waltemath
- Core Unit Data Integration Center, University Medicine Greifswald, Greifswald, Germany
- Department of Medical Informatics, University Medicine Greifswald, Greifswald, Germany
| | - Michele Zoch
- Institute for Medical Informatics and Biometry, Faculty of Medicine Carl Gustav Carus, Technische Universität Dresden, Dresden, Germany
| | - Martin Sedlmayr
- Institute for Medical Informatics and Biometry, Faculty of Medicine Carl Gustav Carus, Technische Universität Dresden, Dresden, Germany
- Center for Scalable Data Analytics and Artificial Intelligence, Dresden, Germany
| |
Collapse
|
9
|
Holzinger A, Keiblinger K, Holub P, Zatloukal K, Müller H. AI for life: Trends in artificial intelligence for biotechnology. N Biotechnol 2023; 74:16-24. [PMID: 36754147 DOI: 10.1016/j.nbt.2023.02.001] [Citation(s) in RCA: 51] [Impact Index Per Article: 51.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2023] [Revised: 02/05/2023] [Accepted: 02/05/2023] [Indexed: 02/08/2023]
Abstract
Due to popular successes (e.g., ChatGPT) Artificial Intelligence (AI) is on everyone's lips today. When advances in biotechnology are combined with advances in AI unprecedented new potential solutions become available. This can help with many global problems and contribute to important Sustainability Development Goals. Current examples include Food Security, Health and Well-being, Clean Water, Clean Energy, Responsible Consumption and Production, Climate Action, Life below Water, or protect, restore and promote sustainable use of terrestrial ecosystems, sustainably manage forests, combat desertification, and halt and reverse land degradation and halt biodiversity loss. AI is ubiquitous in the life sciences today. Topics include a wide range from machine learning and Big Data analytics, knowledge discovery and data mining, biomedical ontologies, knowledge-based reasoning, natural language processing, decision support and reasoning under uncertainty, temporal and spatial representation and inference, and methodological aspects of explainable AI (XAI) with applications of biotechnology. In this pre-Editorial paper, we provide an overview of open research issues and challenges for each of the topics addressed in this special issue. Potential authors can directly use this as a guideline for developing their paper.
Collapse
Affiliation(s)
- Andreas Holzinger
- University of Natural Resources and Life Sciences Vienna, Austria; Medical University Graz, Austria; Alberta Machine Intelligence Institute Edmonton, Canada.
| | | | | | | | | |
Collapse
|
10
|
Parciak M, Suhr M, Schmidt C, Bönisch C, Löhnhardt B, Kesztyüs D, Kesztyüs T. FAIRness through automation: development of an automated medical data integration infrastructure for FAIR health data in a maximum care university hospital. BMC Med Inform Decis Mak 2023; 23:94. [PMID: 37189148 DOI: 10.1186/s12911-023-02195-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2022] [Accepted: 05/09/2023] [Indexed: 05/17/2023] Open
Abstract
BACKGROUND Secondary use of routine medical data is key to large-scale clinical and health services research. In a maximum care hospital, the volume of data generated exceeds the limits of big data on a daily basis. This so-called "real world data" are essential to complement knowledge and results from clinical trials. Furthermore, big data may help in establishing precision medicine. However, manual data extraction and annotation workflows to transfer routine data into research data would be complex and inefficient. Generally, best practices for managing research data focus on data output rather than the entire data journey from primary sources to analysis. To eventually make routinely collected data usable and available for research, many hurdles have to be overcome. In this work, we present the implementation of an automated framework for timely processing of clinical care data including free texts and genetic data (non-structured data) and centralized storage as Findable, Accessible, Interoperable, Reusable (FAIR) research data in a maximum care university hospital. METHODS We identify data processing workflows necessary to operate a medical research data service unit in a maximum care hospital. We decompose structurally equal tasks into elementary sub-processes and propose a framework for general data processing. We base our processes on open-source software-components and, where necessary, custom-built generic tools. RESULTS We demonstrate the application of our proposed framework in practice by describing its use in our Medical Data Integration Center (MeDIC). Our microservices-based and fully open-source data processing automation framework incorporates a complete recording of data management and manipulation activities. The prototype implementation also includes a metadata schema for data provenance and a process validation concept. All requirements of a MeDIC are orchestrated within the proposed framework: Data input from many heterogeneous sources, pseudonymization and harmonization, integration in a data warehouse and finally possibilities for extraction or aggregation of data for research purposes according to data protection requirements. CONCLUSION Though the framework is not a panacea for bringing routine-based research data into compliance with FAIR principles, it provides a much-needed possibility to process data in a fully automated, traceable, and reproducible manner.
Collapse
Affiliation(s)
- Marcel Parciak
- Department of Medical Informatics, University Medical Center Göttingen, Von-Siebold-Straße 3, 37075, Göttingen, Germany
- University MS Center, Biomedical Research Institute (BIOMED), Hasselt University, Agoralaan Building C, 3590, Diepenbeek, Belgium
- Data Science Institute (DSI), Hasselt University, Agoralaan Building D, 3590, Diepenbeek, Belgium
| | - Markus Suhr
- Department of Medical Informatics, University Medical Center Göttingen, Von-Siebold-Straße 3, 37075, Göttingen, Germany
- NextLytics AG, Kapellenstrasse 37, 65719, Hofheim Am Taunus, Germany
| | - Christian Schmidt
- Department of Medical Informatics, University Medical Center Göttingen, Von-Siebold-Straße 3, 37075, Göttingen, Germany
| | - Caroline Bönisch
- Department of Medical Informatics, University Medical Center Göttingen, Von-Siebold-Straße 3, 37075, Göttingen, Germany
| | - Benjamin Löhnhardt
- Department of Medical Informatics, University Medical Center Göttingen, Von-Siebold-Straße 3, 37075, Göttingen, Germany
| | - Dorothea Kesztyüs
- Department of Medical Informatics, University Medical Center Göttingen, Von-Siebold-Straße 3, 37075, Göttingen, Germany.
| | - Tibor Kesztyüs
- Department of Medical Informatics, University Medical Center Göttingen, Von-Siebold-Straße 3, 37075, Göttingen, Germany
| |
Collapse
|
11
|
Holub P, Müller H, Bíl T, Pireddu L, Plass M, Prasser F, Schlünder I, Zatloukal K, Nenutil R, Brázdil T. Privacy risks of whole-slide image sharing in digital pathology. Nat Commun 2023; 14:2577. [PMID: 37142591 PMCID: PMC10160114 DOI: 10.1038/s41467-023-37991-y] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2022] [Accepted: 04/11/2023] [Indexed: 05/06/2023] Open
Abstract
Access to large volumes of so-called whole-slide images-high-resolution scans of complete pathological slides-has become a cornerstone of the development of novel artificial intelligence methods in pathology for diagnostic use, education/training of pathologists, and research. Nevertheless, a methodology based on risk analysis for evaluating the privacy risks associated with sharing such imaging data and applying the principle "as open as possible and as closed as necessary" is still lacking. In this article, we develop a model for privacy risk analysis for whole-slide images which focuses primarily on identity disclosure attacks, as these are the most important from a regulatory perspective. We introduce a taxonomy of whole-slide images with respect to privacy risks and mathematical model for risk assessment and design . Based on this risk assessment model and the taxonomy, we conduct a series of experiments to demonstrate the risks using real-world imaging data. Finally, we develop guidelines for risk assessment and recommendations for low-risk sharing of whole-slide image data.
Collapse
Affiliation(s)
- Petr Holub
- BBMRI-ERIC, Graz, Austria.
- Institute of Computer Science, Masaryk University, Brno, Czech Republic.
| | - Heimo Müller
- BBMRI.at & Diagnostic & Research Center for Molecular BioMedicine, Medical University of Graz, Graz, A-8010, Austria
| | - Tomáš Bíl
- Institute of Computer Science, Masaryk University, Brno, Czech Republic
| | - Luca Pireddu
- Visual and Data-intensive Computing Group, CRS4, Pula, Italy
| | - Markus Plass
- BBMRI.at & Diagnostic & Research Center for Molecular BioMedicine, Medical University of Graz, Graz, A-8010, Austria
| | - Fabian Prasser
- Berlin Institute of Health @ Charité - Universitätsmedizin Berlin, Berlin, Germany
| | | | - Kurt Zatloukal
- BBMRI.at & Diagnostic & Research Center for Molecular BioMedicine, Medical University of Graz, Graz, A-8010, Austria
| | - Rudolf Nenutil
- BBMRI.cz & Masaryk Memorial Cancer Institute, Brno, Czech Republic
| | - Tomáš Brázdil
- Faculty of Informatics, Masaryk University, Brno, Czech Republic
| |
Collapse
|
12
|
England A, Thompson J, Dorey S, Al-Islam S, Long M, Maiorino C, McEntee MF. A comparison of perceived image quality between computer display monitors and augmented reality smart glasses. Radiography (Lond) 2023; 29:641-646. [PMID: 37130492 DOI: 10.1016/j.radi.2023.04.010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2022] [Revised: 03/28/2023] [Accepted: 04/14/2023] [Indexed: 05/04/2023]
Abstract
INTRODUCTION Augmented-reality (AR) smart glasses provide an alternative to standard computer display monitors (CDM). AR smart glasses may provide an opportunity to improve visualisation during fluoroscopy and interventional radiology (IR) procedures when there can be difficulty in viewing intra-procedural images on a CDM. The aim of this study was to evaluate radiographer perception of image quality (IQ) when comparing CDM and AR smart glasses. METHODS 38 radiographers attending an international congress evaluated ten fluoroscopic-guided surgery and IR images on both a CDM (1920 × 1200 pixels) and a set of Epson Moverio BT-40 AR smart glasses (1920 × 1080 pixels). Participants provided oral responses to pre-defined IQ questions generated by study researchers. Summative IQ scores for each participant/image were compared between CDM and AR smart glasses. RESULTS Of the 38 participants, the mean age was 39 ± 1 years. 23 (60.5%) participants required corrective glasses. In terms of generalisability, participants were from 12 different countries, the majority (n = 9, 23.7%) from the United Kingdom. For eight out of ten images, the AR smart glasses demonstrated a statistically significant increase in perceived IQ (median [IQR] 2.0 [-1.0 to 7.0] points) when compared to the CDM. CONCLUSION AR smart glasses appear to show improvements in perceived IQ when compared to a CDM. AR smart glasses could provide an option for improving the experiences of radiographers involved in image-guided procedures and should be subject to further clinical evaluations. IMPLICATIONS FOR PRACTICE Opportunities exist to improve perceived IQ for radiographers when reviewing fluoroscopy and IR images. AR smart glasses should be further evaluated as a potential opportunity to improve practice when visual attention is split between positioning equipment and image review.
Collapse
Affiliation(s)
- A England
- Discipline of Medical Imaging & Radiation Therapy, University College Cork, Cork, Ireland.
| | - J Thompson
- University Hospitals of Morecambe Bay NHS Foundation Trust, Barrow-in-Furness, UK
| | - S Dorey
- Tameside and Glossop Integrated Care NHS Foundation Trust, Tameside, UK
| | - S Al-Islam
- East Lancashire Hospitals NHS Trust, Blackburn, UK
| | - M Long
- Discipline of Medical Imaging & Radiation Therapy, University College Cork, Cork, Ireland
| | - C Maiorino
- Discipline of Medical Imaging & Radiation Therapy, University College Cork, Cork, Ireland
| | - M F McEntee
- Discipline of Medical Imaging & Radiation Therapy, University College Cork, Cork, Ireland
| |
Collapse
|
13
|
Cano MA, Tsueng G, Zhou X, Xin J, Hughes LD, Mullen JL, Su AI, Wu C. Schema Playground: a tool for authoring, extending, and using metadata schemas to improve FAIRness of biomedical data. BMC Bioinformatics 2023; 24:159. [PMID: 37081398 PMCID: PMC10116472 DOI: 10.1186/s12859-023-05258-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2022] [Accepted: 03/27/2023] [Indexed: 04/22/2023] Open
Abstract
BACKGROUND Biomedical researchers are strongly encouraged to make their research outputs more Findable, Accessible, Interoperable, and Reusable (FAIR). While many biomedical research outputs are more readily accessible through open data efforts, finding relevant outputs remains a significant challenge. Schema.org is a metadata vocabulary standardization project that enables web content creators to make their content more FAIR. Leveraging Schema.org could benefit biomedical research resource providers, but it can be challenging to apply Schema.org standards to biomedical research outputs. We created an online browser-based tool that empowers researchers and repository developers to utilize Schema.org or other biomedical schema projects. RESULTS Our browser-based tool includes features which can help address many of the barriers towards Schema.org-compliance such as: The ability to easily browse for relevant Schema.org classes, the ability to extend and customize a class to be more suitable for biomedical research outputs, the ability to create data validation to ensure adherence of a research output to a customized class, and the ability to register a custom class to our schema registry enabling others to search and re-use it. We demonstrate the use of our tool with the creation of the Outbreak.info schema-a large multi-class schema for harmonizing various COVID-19 related resources. CONCLUSIONS We have created a browser-based tool to empower biomedical research resource providers to leverage Schema.org classes to make their research outputs more FAIR.
Collapse
Affiliation(s)
| | | | | | - Jiwen Xin
- The Scripps Research Institute, San Diego, USA
| | | | | | - Andrew I Su
- The Scripps Research Institute, San Diego, USA
| | - Chunlei Wu
- The Scripps Research Institute, San Diego, USA
| |
Collapse
|
14
|
Baum L, Johns M, Poikela M, Möller R, Ananthasubramaniam B, Prasser F. Data integration and analysis for circadian medicine. Acta Physiol (Oxf) 2023; 237:e13951. [PMID: 36790321 DOI: 10.1111/apha.13951] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2022] [Revised: 02/04/2023] [Accepted: 02/12/2023] [Indexed: 02/16/2023]
Abstract
Data integration, data sharing, and standardized analyses are important enablers for data-driven medical research. Circadian medicine is an emerging field with a particularly high need for coordinated and systematic collaboration between researchers from different disciplines. Datasets in circadian medicine are multimodal, ranging from molecular circadian profiles and clinical parameters to physiological measurements and data obtained from (wearable) sensors or reported by patients. Uniquely, data spanning both the time dimension and the spatial dimension (across tissues) are needed to obtain a holistic view of the circadian system. The study of human rhythms in the context of circadian medicine has to confront the heterogeneity of clock properties within and across subjects and our inability to repeatedly obtain relevant biosamples from one subject. This requires informatics solutions for integrating and visualizing relevant data types at various temporal resolutions ranging from milliseconds and seconds to minutes and several hours. Associated challenges range from a lack of standards that can be used to represent all required data in a common interoperable form, to challenges related to data storage, to the need to perform transformations for integrated visualizations, and to privacy issues. The downstream analysis of circadian rhythms requires specialized approaches for the identification, characterization, and discrimination of rhythms. We conclude that circadian medicine research provides an ideal environment for developing innovative methods to address challenges related to the collection, integration, visualization, and analysis of multimodal multidimensional biomedical data.
Collapse
Affiliation(s)
- Lena Baum
- Berlin Institute of Health at Charité-Universitätsmedizin Berlin, Berlin, Germany
| | - Marco Johns
- Berlin Institute of Health at Charité-Universitätsmedizin Berlin, Berlin, Germany
| | - Maija Poikela
- Berlin Institute of Health at Charité-Universitätsmedizin Berlin, Berlin, Germany
| | - Ralf Möller
- Institute of Information Systems, University of Lübeck, Lübeck, Germany
| | | | - Fabian Prasser
- Berlin Institute of Health at Charité-Universitätsmedizin Berlin, Berlin, Germany
| |
Collapse
|
15
|
Sinaci AA, Gencturk M, Teoman HA, Laleci Erturkmen GB, Alvarez-Romero C, Martinez-Garcia A, Poblador-Plou B, Carmona-Pírez J, Löbe M, Parra-Calderon CL. A Data Transformation Methodology to Create Findable, Accessible, Interoperable, and Reusable Health Data: Software Design, Development, and Evaluation Study. J Med Internet Res 2023; 25:e42822. [PMID: 36884270 PMCID: PMC10034606 DOI: 10.2196/42822] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2022] [Revised: 01/04/2023] [Accepted: 01/31/2023] [Indexed: 03/09/2023] Open
Abstract
BACKGROUND Sharing health data is challenging because of several technical, ethical, and regulatory issues. The Findable, Accessible, Interoperable, and Reusable (FAIR) guiding principles have been conceptualized to enable data interoperability. Many studies provide implementation guidelines, assessment metrics, and software to achieve FAIR-compliant data, especially for health data sets. Health Level 7 (HL7) Fast Healthcare Interoperability Resources (FHIR) is a health data content modeling and exchange standard. OBJECTIVE Our goal was to devise a new methodology to extract, transform, and load existing health data sets into HL7 FHIR repositories in line with FAIR principles, develop a Data Curation Tool to implement the methodology, and evaluate it on health data sets from 2 different but complementary institutions. We aimed to increase the level of compliance with FAIR principles of existing health data sets through standardization and facilitate health data sharing by eliminating the associated technical barriers. METHODS Our approach automatically processes the capabilities of a given FHIR end point and directs the user while configuring mappings according to the rules enforced by FHIR profile definitions. Code system mappings can be configured for terminology translations through automatic use of FHIR resources. The validity of the created FHIR resources can be automatically checked, and the software does not allow invalid resources to be persisted. At each stage of our data transformation methodology, we used particular FHIR-based techniques so that the resulting data set could be evaluated as FAIR. We performed a data-centric evaluation of our methodology on health data sets from 2 different institutions. RESULTS Through an intuitive graphical user interface, users are prompted to configure the mappings into FHIR resource types with respect to the restrictions of selected profiles. Once the mappings are developed, our approach can syntactically and semantically transform existing health data sets into HL7 FHIR without loss of data utility according to our privacy-concerned criteria. In addition to the mapped resource types, behind the scenes, we create additional FHIR resources to satisfy several FAIR criteria. According to the data maturity indicators and evaluation methods of the FAIR Data Maturity Model, we achieved the maximum level (level 5) for being Findable, Accessible, and Interoperable and level 3 for being Reusable. CONCLUSIONS We developed and extensively evaluated our data transformation approach to unlock the value of existing health data residing in disparate data silos to make them available for sharing according to the FAIR principles. We showed that our method can successfully transform existing health data sets into HL7 FHIR without loss of data utility, and the result is FAIR in terms of the FAIR Data Maturity Model. We support institutional migration to HL7 FHIR, which not only leads to FAIR data sharing but also eases the integration with different research networks.
Collapse
Affiliation(s)
- A Anil Sinaci
- Software Research & Development and Consultancy Corporation (SRDC), Cankaya, Turkey
| | - Mert Gencturk
- Software Research & Development and Consultancy Corporation (SRDC), Cankaya, Turkey
- Department of Computer Engineering, Middle East Technical University, Cankaya, Turkey
| | - Huseyin Alper Teoman
- Software Research & Development and Consultancy Corporation (SRDC), Cankaya, Turkey
- Department of Computer Engineering, Middle East Technical University, Cankaya, Turkey
| | | | - Celia Alvarez-Romero
- Group of Computational Health Informatics, Institute of Biomedicine of Seville, Virgen del Rocío University Hospital, Spanish National Research Council, University of Seville, Seville, Spain
| | - Alicia Martinez-Garcia
- Group of Computational Health Informatics, Institute of Biomedicine of Seville, Virgen del Rocío University Hospital, Spanish National Research Council, University of Seville, Seville, Spain
| | - Beatriz Poblador-Plou
- EpiChron Research Group, Aragon Health Sciences Institute (IACS), Aragon Health Research Institute (IIS Aragon), Zaragoza, Spain
| | - Jonás Carmona-Pírez
- EpiChron Research Group, Aragon Health Sciences Institute (IACS), Aragon Health Research Institute (IIS Aragon), Zaragoza, Spain
| | - Matthias Löbe
- Institute for Medical Informatics, Statistics and Epidemiology (IMISE), University of Leipzig, Leipzig, Germany
| | - Carlos Luis Parra-Calderon
- Group of Computational Health Informatics, Institute of Biomedicine of Seville, Virgen del Rocío University Hospital, Spanish National Research Council, University of Seville, Seville, Spain
| |
Collapse
|
16
|
Alharbi E, Skeva R, Juty N, Jay C, Goble C. A FAIR-Decide framework for pharmaceutical R&D: FAIR data cost-benefit assessment. Drug Discov Today 2023; 28:103510. [PMID: 36716952 DOI: 10.1016/j.drudis.2023.103510] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2022] [Revised: 01/24/2023] [Accepted: 01/24/2023] [Indexed: 01/29/2023]
Abstract
The FAIR (findable, accessible, interoperable and reusable) principles are data management and stewardship guidelines aimed at increasing the effective use of scientific research data. Adherence to these principles in managing data assets in pharmaceutical research and development (R&D) offers pharmaceutical companies the potential to maximise the value of such assets, but the endeavour is costly and challenging. We describe the 'FAIR-Decide' framework, which aims to guide decision-making on the retrospective FAIRification of existing datasets by using business analysis techniques to estimate costs and expected benefits. This framework supports decision-making on FAIRification in the pharmaceutical R&D industry and can be integrated into a company's data management strategy.
Collapse
Affiliation(s)
- Ebtisam Alharbi
- College of Computer and Information Systems, Umm Al-Qura University, Mecca, Saudi Arabia.
| | - Rigina Skeva
- Department of Computer Science, University of Manchester, Manchester, UK
| | - Nick Juty
- Department of Computer Science, University of Manchester, Manchester, UK
| | - Caroline Jay
- Department of Computer Science, University of Manchester, Manchester, UK.
| | - Carole Goble
- Department of Computer Science, University of Manchester, Manchester, UK.
| |
Collapse
|
17
|
Haber AC, Sax U, Prasser F. Open tools for quantitative anonymization of tabular phenotype data: literature review. Brief Bioinform 2022; 23:6754758. [PMID: 36215114 PMCID: PMC9677485 DOI: 10.1093/bib/bbac440] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2022] [Revised: 09/05/2022] [Accepted: 09/12/2022] [Indexed: 12/14/2022] Open
Abstract
Precision medicine relies on molecular and systems biology methods as well as bidirectional association studies of phenotypes and (high-throughput) genomic data. However, the integrated use of such data often faces obstacles, especially in regards to data protection. An important prerequisite for research data processing is usually informed consent. But collecting consent is not always feasible, in particular when data are to be analyzed retrospectively. For phenotype data, anonymization, i.e. the altering of data in such a way that individuals cannot be identified, can provide an alternative. Several re-identification attacks have shown that this is a complex task and that simply removing directly identifying attributes such as names is usually not enough. More formal approaches are needed that use mathematical models to quantify risks and guide their reduction. Due to the complexity of these techniques, it is challenging and not advisable to implement them from scratch. Open software libraries and tools can provide a robust alternative. However, also the range of available anonymization tools is heterogeneous and obtaining an overview of their strengths and weaknesses is difficult due to the complexity of the problem space. We therefore performed a systematic review of open anonymization tools for structured phenotype data described in the literature between 1990 and 2021. Through a two-step eligibility assessment process, we selected 13 tools for an in-depth analysis. By comparing the supported anonymization techniques and further aspects, such as maturity, we derive recommendations for tools to use for anonymizing phenotype datasets with different properties.
Collapse
Affiliation(s)
- Anna C Haber
- Health Data Science Center, Berlin Institute of Health at Charité - Universitätsmedizin Berlin, Berlin, Germany
| | - Ulrich Sax
- Department of Medical Informatics, University Medical Center Göttingen, Göttingen, Germany,Campus-Institute Data Science, Georg-August-University Göttingen
| | - Fabian Prasser
- Corresponding author. Fabian Prasser, Health Data Science Center, Berlin Institute of Health at Charité—Universitätsmedizin Berlin. Tel.: +49 152 04 31 80 73; E-mail:
| | | |
Collapse
|
18
|
Niarakis A, Waltemath D, Glazier J, Schreiber F, Keating SM, Nickerson D, Chaouiya C, Siegel A, Noël V, Hermjakob H, Helikar T, Soliman S, Calzone L. Addressing barriers in comprehensiveness, accessibility, reusability, interoperability and reproducibility of computational models in systems biology. Brief Bioinform 2022; 23:bbac212. [PMID: 35671510 PMCID: PMC9294410 DOI: 10.1093/bib/bbac212] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2022] [Revised: 04/20/2022] [Accepted: 05/06/2022] [Indexed: 11/14/2022] Open
Abstract
Computational models are often employed in systems biology to study the dynamic behaviours of complex systems. With the rise in the number of computational models, finding ways to improve the reusability of these models and their ability to reproduce virtual experiments becomes critical. Correct and effective model annotation in community-supported and standardised formats is necessary for this improvement. Here, we present recent efforts toward a common framework for annotated, accessible, reproducible and interoperable computational models in biology, and discuss key challenges of the field.
Collapse
Affiliation(s)
- Anna Niarakis
- Université Paris-Saclay, Laboratoire Européen de Recherche pour la Polyarthrite rhumatoïde - Genhotel, Univ Evry, Evry, France
- Lifeware Group, Inria, Saclay-île de France, 91120 Palaiseau, France
| | - Dagmar Waltemath
- Department of Medical Informatics, University Medicine Greifswald, Greifswald, Germany
| | - James Glazier
- Biocomplexity Institute and Department of Intelligent Systems Engineering, Indiana University, Bloomington, IN, USA
| | - Falk Schreiber
- Department of Computer and Information Science, University of Konstanz, Konstanz, Germany
- Faculty of Information Technology, Monash University, Clayton, Australia
| | | | - David Nickerson
- Auckland Bioengineering Institute, University of Auckland, Auckland, New Zealand
| | | | - Anne Siegel
- Univ Rennes, CNRS, Inria - IRISA lab. Rennes
| | - Vincent Noël
- Institut Curie, PSL Research University, Paris, France
- INSERM, U900, Paris, France
- MINES ParisTech, PSL Research University, CBIO-Centre for Computational Biology, Paris, France
| | - Henning Hermjakob
- EMBL-European Bioinformatics Institute, Wellcome Genome Campus, Cambridge, UK
| | - Tomáš Helikar
- Department of Biochemistry, University of Nebraska-Lincoln, Lincoln, NE, 68588, USA
| | - Sylvain Soliman
- Lifeware Group, Inria, Saclay-île de France, 91120 Palaiseau, France
| | - Laurence Calzone
- Institut Curie, PSL Research University, Paris, France
- INSERM, U900, Paris, France
- MINES ParisTech, PSL Research University, CBIO-Centre for Computational Biology, Paris, France
| |
Collapse
|
19
|
Eva G, Liese G, Stephanie B, Petr H, Leslie M, Roel V, Martine V, Sergi B, Mette H, Sarah J, Laura RM, Arnout S, Morris A S, Jan T, Xenia T, Nina V, Koert VE, Sylvie R, Greet S. Position paper on management of personal data in environment and health research in Europe. ENVIRONMENT INTERNATIONAL 2022; 165:107334. [PMID: 35696847 DOI: 10.1016/j.envint.2022.107334] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/22/2021] [Revised: 05/30/2022] [Accepted: 06/01/2022] [Indexed: 06/15/2023]
Abstract
Management of datasets that include health information and other sensitive personal information of European study participants has to be compliant with the General Data Protection Regulation (GDPR, Regulation (EU) 2016/679). Within scientific research, the widely subscribed'FAIR' data principles should apply, meaning that research data should be findable, accessible, interoperable and re-usable. Balancing the aim of open science driven FAIR data management with GDPR compliant personal data protection safeguards is now a common challenge for many research projects dealing with (sensitive) personal data. In December 2020 a workshop was held with representatives of several large EU research consortia and of the European Commission to reflect on how to apply the FAIR data principles for environment and health research (E&H). Several recent data intensive EU funded E&H research projects face this challenge and work intensively towards developing solutions to access, exchange, store, handle, share, process and use such sensitive personal data, with the aim to support European and transnational collaborations. As a result, several recommendations, opportunities and current limitations were formulated. New technical developments such as federated data management and analysis systems, machine learning together with advanced search software, harmonized ontologies and data quality standards should in principle facilitate the FAIRification of data. To address ethical, legal, political and financial obstacles to the wider re-use of data for research purposes, both specific expertise and underpinning infrastructure are needed. There is a need for the E&H research data to find their place in the European Open Science Cloud. Communities using health and population data, environmental data and other publicly available data have to interconnect and synergize. To maximize the use and re-use of environment and health data, a dedicated supporting European infrastructure effort, such as the EIRENE research infrastructure within the ESFRI roadmap 2021, is needed that would interact with existing infrastructures.
Collapse
Affiliation(s)
- Govarts Eva
- VITO Health, Flemish Institute for Technological Research (VITO), Mol, Belgium.
| | - Gilles Liese
- VITO Health, Flemish Institute for Technological Research (VITO), Mol, Belgium
| | - Bopp Stephanie
- European Commission, Joint Research Centre (JRC), Ispra, Italy
| | | | - Matalonga Leslie
- CNAG-CRG, Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain
| | - Vermeulen Roel
- Institute for Risk Assessment Sciences, Utrecht University, Utrecht, Netherlands
| | - Vrijheid Martine
- ISGlobal, Barcelona, Spain; Spanish Consortium for Research on Epidemiology and Public Health (CIBERESP), Spain; Universitat Pompeu Fabra (UPF), Barcelona, Spain
| | - Beltran Sergi
- CNAG-CRG, Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain; Universitat Pompeu Fabra (UPF), Barcelona, Spain; Departament de Genètica, Microbiologia i Estadística, Facultat de Biologia, Universitat de Barcelona (UB), Barcelona, Spain
| | - Hartlev Mette
- Faculty of Law, University of Copenhagen, Copenhagen, Denmark
| | | | | | - Standaert Arnout
- VITO Health, Flemish Institute for Technological Research (VITO), Mol, Belgium
| | - Swertz Morris A
- Department of Genetics & Genomics Coordination Center, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands
| | - Theunis Jan
- VITO Health, Flemish Institute for Technological Research (VITO), Mol, Belgium
| | - Trier Xenia
- European Environment Agency (EEA), Copenhagen, Denmark
| | - Vogel Nina
- German Environment Agency (UBA), Berlin, Germany
| | | | - Remy Sylvie
- VITO Health, Flemish Institute for Technological Research (VITO), Mol, Belgium
| | - Schoeters Greet
- VITO Health, Flemish Institute for Technological Research (VITO), Mol, Belgium; Department of Biomedical Sciences, University of Antwerp, Antwerp, Belgium
| |
Collapse
|
20
|
Alharbi E, Gadiya Y, Henderson D, Zaliani A, Delfin-Rossaro A, Cambon-Thomsen A, Kohler M, Witt G, Welter D, Juty N, Jay C, Engkvist O, Goble C, Reilly DS, Satagopam V, Ioannidis V, Gu W, Gribbon P. Selection of data sets for FAIRification in drug discovery and development: Which, why, and how? Drug Discov Today 2022; 27:2080-2085. [PMID: 35595012 PMCID: PMC9236643 DOI: 10.1016/j.drudis.2022.05.010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2022] [Revised: 04/28/2022] [Accepted: 05/10/2022] [Indexed: 11/30/2022]
Abstract
Research organisations are focussed on quantifying the costs and benefits of implementing FAIR. Criteria used for the selection of data for FAIRification can be opaque and inconsistent. FAIRification effort depends on individual skills, competencies, resources, and time available. FAIRification should satisfy reuse scenarios, and lead to scientific and economic impacts. Organisational challenges include providing training to individuals and developing a FAIR organisation culture.
Despite the intuitive value of adopting the Findable, Accessible, Interoperable, and Reusable (FAIR) principles in both academic and industrial sectors, challenges exist in resourcing, balancing long- versus short-term priorities, and achieving technical implementation. This situation is exacerbated by the unclear mechanisms by which costs and benefits can be assessed when decisions on FAIR are made. Scientific and research and development (R&D) leadership need reliable evidence of the potential benefits and information on effective implementation mechanisms and remediating strategies. In this article, we describe procedures for cost–benefit evaluation, and identify best-practice approaches to support the decision-making process involved in FAIR implementation.
Collapse
Affiliation(s)
- Ebtisam Alharbi
- Department of Computer Science, The University of Manchester, Oxford Road, Manchester, UK
| | - Yojana Gadiya
- Fraunhofer Institute for Translational Medicine and Pharmacology (ITMP), Schnackenburgallee 114, 22525 Hamburg, and Theodor Stern Kai 7, 60590 Frankfurt, Germany; Fraunhofer Cluster of Excellence for Immune Mediated Diseases (CIMD), Theodor Stern Kai 7, 60590 Frankfurt, Germany
| | - David Henderson
- Bayer AG, Research & Development, Pharmaceuticals, Müllerstrasse 178, 13353 Berlin, Germany
| | - Andrea Zaliani
- Fraunhofer Institute for Translational Medicine and Pharmacology (ITMP), Schnackenburgallee 114, 22525 Hamburg, and Theodor Stern Kai 7, 60590 Frankfurt, Germany; Fraunhofer Cluster of Excellence for Immune Mediated Diseases (CIMD), Theodor Stern Kai 7, 60590 Frankfurt, Germany
| | | | | | - Manfred Kohler
- Fraunhofer Institute for Translational Medicine and Pharmacology (ITMP), Schnackenburgallee 114, 22525 Hamburg, and Theodor Stern Kai 7, 60590 Frankfurt, Germany; Fraunhofer Cluster of Excellence for Immune Mediated Diseases (CIMD), Theodor Stern Kai 7, 60590 Frankfurt, Germany
| | - Gesa Witt
- Fraunhofer Institute for Translational Medicine and Pharmacology (ITMP), Schnackenburgallee 114, 22525 Hamburg, and Theodor Stern Kai 7, 60590 Frankfurt, Germany; Fraunhofer Cluster of Excellence for Immune Mediated Diseases (CIMD), Theodor Stern Kai 7, 60590 Frankfurt, Germany
| | - Danielle Welter
- Luxembourg Centre for Systems Biomedicine, ELIXIR Luxembourg, University of Luxembourg, L-4367 Belval, Luxembourg
| | - Nick Juty
- Department of Computer Science, The University of Manchester, Oxford Road, Manchester, UK
| | - Caroline Jay
- Department of Computer Science, The University of Manchester, Oxford Road, Manchester, UK
| | - Ola Engkvist
- Discovery Sciences, R&D, AstraZeneca, SE-43183 Mölndal, Sweden
| | - Carole Goble
- Department of Computer Science, The University of Manchester, Oxford Road, Manchester, UK
| | - Dorothy S Reilly
- Novartis Institutes for BioMedical Research, Novartis Pharma AG, Basel, Switzerland
| | - Venkata Satagopam
- Luxembourg Centre for Systems Biomedicine, ELIXIR Luxembourg, University of Luxembourg, L-4367 Belval, Luxembourg
| | - Vassilios Ioannidis
- SIB Swiss Institute of Bioinformatics, Quartier Sorge - Batiment Amphipole, 1015 Lausanne, Switzerland.
| | - Wei Gu
- Novartis Institutes for BioMedical Research, Novartis Pharma AG, Basel, Switzerland.
| | - Philip Gribbon
- Fraunhofer Institute for Translational Medicine and Pharmacology (ITMP), Schnackenburgallee 114, 22525 Hamburg, and Theodor Stern Kai 7, 60590 Frankfurt, Germany; Fraunhofer Cluster of Excellence for Immune Mediated Diseases (CIMD), Theodor Stern Kai 7, 60590 Frankfurt, Germany.
| |
Collapse
|
21
|
Stumptner C, Stadlbauer V, O’Neil D, Gessner A, Hiergeist A, Zatloukal K, Abuja PM. The Pre-Analytical CEN/TS Standard for Microbiome Diagnostics-How Can Research and Development Benefit? Nutrients 2022; 14:1976. [PMID: 35565946 PMCID: PMC9104691 DOI: 10.3390/nu14091976] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2022] [Revised: 05/05/2022] [Accepted: 05/07/2022] [Indexed: 11/27/2022] Open
Abstract
Recently, CEN/TS 17626:2021, the European pre-analytical standard for human specimens intended for microbiome DNA analysis, was published. Although this standard relates to diagnostic procedures for microbiome analysis and is relevant for in vitro diagnostic (IVD) manufacturers and diagnostic laboratories, it also has implications for research and development (R&D). We present here why standards are needed in biomedical research, what pre-analytical standards can accomplish, and which elements of the pre-analytical workflow they cover. The benefits of standardization for the generation of FAIR (findable, accessible, interoperable, reusable) data and to support innovation are briefly discussed.
Collapse
Affiliation(s)
- Conny Stumptner
- Diagnostic and Research Center for Molecular Biomedicine, Institute of Pathology, Medical University of Graz, 8010 Graz, Austria; (C.S.); (K.Z.)
| | - Vanessa Stadlbauer
- Department of Internal Medicine, Division of Gastroenterology and Hepatology, Medical University of Graz, 8010 Graz, Austria;
- Center of Biomarker Research CBMed, 8010 Graz, Austria
| | | | - André Gessner
- Institute of Microbiology and Hygiene, 93053 Regensburg, Germany; (A.G.); (A.H.)
| | - Andreas Hiergeist
- Institute of Microbiology and Hygiene, 93053 Regensburg, Germany; (A.G.); (A.H.)
| | - Kurt Zatloukal
- Diagnostic and Research Center for Molecular Biomedicine, Institute of Pathology, Medical University of Graz, 8010 Graz, Austria; (C.S.); (K.Z.)
| | - Peter M. Abuja
- Diagnostic and Research Center for Molecular Biomedicine, Institute of Pathology, Medical University of Graz, 8010 Graz, Austria; (C.S.); (K.Z.)
| |
Collapse
|
22
|
Merino Martinez R, Müller H, Negru S, Ormenisan A, Arroyo Mühr LS, Zhang X, Trier Møller F, Clements MS, Kozlakidis Z, Pimenoff VN, Wilkowski B, Boeckhout M, Öhman H, Chong S, Holzinger A, Lehtinen M, van Veen EB, Bała P, Widschwendter M, Dowling J, Törnroos J, Snyder MP, Dillner J. Human exposome assessment platform. Environ Epidemiol 2021; 5:e182. [PMID: 34909561 PMCID: PMC8663864 DOI: 10.1097/ee9.0000000000000182] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2021] [Accepted: 11/14/2021] [Indexed: 11/26/2022] Open
Abstract
The Human Exposome Assessment Platform (HEAP) is a research resource for the integrated and efficient management and analysis of human exposome data. The project will provide the complete workflow for obtaining exposome actionable knowledge from population-based cohorts. HEAP is a state-of-the-science service composed of computational resources from partner institutions, accessed through a software framework that provides the world's fastest Hadoop platform for data warehousing and applied artificial intelligence (AI). The software, will provide a decision support system for researchers and policymakers. All the data managed and processed by HEAP, together with the analysis pipelines, will be available for future research. In addition, the platform enables adding new data and analysis pipelines. HEAP's final product can be deployed in multiple instances to create a network of shareable and reusable knowledge on the impact of exposures on public health.
Collapse
Affiliation(s)
| | | | | | | | | | | | - Frederik Trier Møller
- Infectious Disease Epidemiology and Prevention, Statens Serum Institut, Copenhagen, Denmark
| | | | - Zisis Kozlakidis
- International Agency for Research on Cancer, World Health Organization, Lyon, France
| | - Ville N. Pimenoff
- Karolinska Institutet, Stockholm, Sweden
- Faculty of Medicine, University of Oulu, Oulu, Finland
- Tampere University, Tampere, Finland
| | | | | | - Hanna Öhman
- Faculty of Medicine, University of Oulu, Oulu, Finland
- Biobank Borealis of Northern Finland, Oulu University Hospital, Oulu, Finland
| | - Steven Chong
- Danish National Biobank, Statens Serum Institut, Copenhagen, Denmark
| | | | - Matti Lehtinen
- Karolinska Institutet, Stockholm, Sweden
- Tampere University, Tampere, Finland
| | | | | | - Martin Widschwendter
- Research Institute for Biomedical Aging Research, Universität Innsbruck, Innsbruck, Austria
| | | | | | | | | |
Collapse
|
23
|
Alvarez-Romero C, Martinez-Garcia A, Ternero Vega J, Díaz-Jimènez P, Jimènez-Juan C, Nieto-Martín MD, Román Villarán E, Kovacevic T, Bokan D, Hromis S, Djekic Malbasa J, Beslać S, Zaric B, Gencturk M, Sinaci AA, Ollero Baturone M, Parra Calderón CL. Predicting 30-days Readmission Risk for COPD Patients Care through a Federated Machine Learning Architecture on FAIR Data: Development and Validation Study (Preprint). JMIR Med Inform 2021; 10:e35307. [PMID: 35653170 PMCID: PMC9204581 DOI: 10.2196/35307] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2021] [Revised: 03/16/2022] [Accepted: 04/21/2022] [Indexed: 12/16/2022] Open
Abstract
Background Owing to the nature of health data, their sharing and reuse for research are limited by legal, technical, and ethical implications. In this sense, to address that challenge and facilitate and promote the discovery of scientific knowledge, the Findable, Accessible, Interoperable, and Reusable (FAIR) principles help organizations to share research data in a secure, appropriate, and useful way for other researchers. Objective The objective of this study was the FAIRification of existing health research data sets and applying a federated machine learning architecture on top of the FAIRified data sets of different health research performing organizations. The entire FAIR4Health solution was validated through the assessment of a federated model for real-time prediction of 30-day readmission risk in patients with chronic obstructive pulmonary disease (COPD). Methods The application of the FAIR principles on health research data sets in 3 different health care settings enabled a retrospective multicenter study for the development of specific federated machine learning models for the early prediction of 30-day readmission risk in patients with COPD. This predictive model was generated upon the FAIR4Health platform. Finally, an observational prospective study with 30 days follow-up was conducted in 2 health care centers from different countries. The same inclusion and exclusion criteria were used in both retrospective and prospective studies. Results Clinical validation was demonstrated through the implementation of federated machine learning models on top of the FAIRified data sets from different health research performing organizations. The federated model for predicting the 30-day hospital readmission risk was trained using retrospective data from 4.944 patients with COPD. The assessment of the predictive model was performed using the data of 100 recruited (22 from Spain and 78 from Serbia) out of 2070 observed (records viewed) patients during the observational prospective study, which was executed from April 2021 to September 2021. Significant accuracy (0.98) and precision (0.25) of the predictive model generated upon the FAIR4Health platform were observed. Therefore, the generated prediction of 30-day readmission risk was confirmed in 87% (87/100) of cases. Conclusions Implementing a FAIR data policy in health research performing organizations to facilitate data sharing and reuse is relevant and needed, following the discovery, access, integration, and analysis of health research data. The FAIR4Health project proposes a technological solution in the health domain to facilitate alignment with the FAIR principles.
Collapse
Affiliation(s)
- Celia Alvarez-Romero
- Computational Health Informatics Group, Institute of Biomedicine of Seville, Virgen del Rocío University Hospital, Consejo Superior de Investigaciones Científicas, University of Seville, Seville, Spain
| | - Alicia Martinez-Garcia
- Computational Health Informatics Group, Institute of Biomedicine of Seville, Virgen del Rocío University Hospital, Consejo Superior de Investigaciones Científicas, University of Seville, Seville, Spain
| | - Jara Ternero Vega
- Internal Medicine Department, Virgen del Rocío University Hospital, Seville, Spain
| | - Pablo Díaz-Jimènez
- Internal Medicine Department, Virgen del Rocío University Hospital, Seville, Spain
| | - Carlos Jimènez-Juan
- Internal Medicine Department, Virgen del Rocío University Hospital, Seville, Spain
| | | | - Esther Román Villarán
- Computational Health Informatics Group, Institute of Biomedicine of Seville, Virgen del Rocío University Hospital, Consejo Superior de Investigaciones Científicas, University of Seville, Seville, Spain
| | - Tomi Kovacevic
- Institute for Pulmonary Diseases of Vojvodina, Sremska Kamenica,
- Medical Faculty, University of Novi Sad, Novi Sad,
| | - Darijo Bokan
- Institute for Pulmonary Diseases of Vojvodina, Sremska Kamenica,
| | - Sanja Hromis
- Institute for Pulmonary Diseases of Vojvodina, Sremska Kamenica,
- Medical Faculty, University of Novi Sad, Novi Sad,
| | - Jelena Djekic Malbasa
- Institute for Pulmonary Diseases of Vojvodina, Sremska Kamenica,
- Medical Faculty, University of Novi Sad, Novi Sad,
| | - Suzana Beslać
- Institute for Pulmonary Diseases of Vojvodina, Sremska Kamenica,
| | - Bojan Zaric
- Institute for Pulmonary Diseases of Vojvodina, Sremska Kamenica,
- Medical Faculty, University of Novi Sad, Novi Sad,
| | - Mert Gencturk
- Software Research & Development and Consultancy Corporation, Ankara, Turkey
| | - A Anil Sinaci
- Software Research & Development and Consultancy Corporation, Ankara, Turkey
| | | | - Carlos Luis Parra Calderón
- Computational Health Informatics Group, Institute of Biomedicine of Seville, Virgen del Rocío University Hospital, Consejo Superior de Investigaciones Científicas, University of Seville, Seville, Spain
| |
Collapse
|
24
|
Devriendt T, Ammann C, W. Asselbergs F, Bernier A, Costas R, Friedrich MG, Gelpi JL, Jarvelin MR, Kuulasmaa K, Lekadir K, Mayrhofer MT, Papez V, Pasterkamp G, Petersen SE, Schmidt CO, Schulz-Menger J, Söderberg S, Shabani M, Veronesi G, Viezzer DS, Borry P. An agenda-setting paper on data sharing platforms: euCanSHare workshop. OPEN RESEARCH EUROPE 2021; 1:80. [PMID: 37645200 PMCID: PMC10445835 DOI: 10.12688/openreseurope.13860.2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Accepted: 11/22/2021] [Indexed: 08/31/2023]
Abstract
Various data sharing platforms are being developed to enhance the sharing of cohort data by addressing the fragmented state of data storage and access systems. However, policy challenges in several domains remain unresolved. The euCanSHare workshop was organized to identify and discuss these challenges and to set the future research agenda. Concerns over the multiplicity and long-term sustainability of platforms, lack of resources, access of commercial parties to medical data, credit and recognition mechanisms in academia and the organization of data access committees are outlined. Within these areas, solutions need to be devised to ensure an optimal functioning of platforms.
Collapse
Affiliation(s)
- Thijs Devriendt
- Centre for Biomedical Ethics and Law, Department of Public Health and Primary Care, KU Leuven, Leuven, Belgium
| | - Clemens Ammann
- Working Group on Cardiovascular Magnetic Resonance, Experimental and Clinical Research Center, a joint cooperation between the Charité - Universitätsmedizin Berlin and the Max-Delbrück-Center for Molecular Medicine, Berlin, Germany
| | - Folkert W. Asselbergs
- Department of Cardiology, Division Heart & Lungs, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands
- Institute of Cardiovascular Science, Faculty of Population Health Sciences, University College London, London, UK
- Institute of Health Informatics, University College London, London, UK
- Health Data Research UK, London, UK
| | - Alexander Bernier
- Centre of Genomics and Policy, Faculty of Medicine, McGill University, Montreal, Canada
| | - Rodrigo Costas
- Centre for Science and Technology Studies (CWTS), Leiden University, Leiden, The Netherlands
| | - Matthias G. Friedrich
- Departments of Medicine and Diagnostic Radiology, McGill University Health Centre, Montreal, Canada
| | - Josep L. Gelpi
- Department of Biochemistry and Molecular Biomedicine, University of Barcelona, Barcelona, Spain
- Barcelona Supercomputing Center (BSC), Barcelona, Spain
| | - Marjo-Riitta Jarvelin
- Department of Life Sciences, College of Health and Life Sciences, Brunel University London, London, UK
- MRC-PHE Centre for Environment and Health, Department of Epidemiology and Biostatistics, School of Public Health, Imperial College London, London, UK
- Center for Life Course Health Research, Faculty of Medicine, University of Oulu, Oulu, Finland
- Unit of Primary Health Care, Oulu University Hospital, Oulu, Finland
| | - Kari Kuulasmaa
- National Institute for Health and Welfare, Helsinki, Finland
| | - Karim Lekadir
- Artificial Intelligence in Medicine Lab (BCN-AIM), Department of Mathematics and Computer Science, University of Barcelona, Barcelona, Spain
| | | | - Vaclav Papez
- Institute of Health Informatics, University College London, London, UK
- Health Data Research UK, London, UK
| | - Gerard Pasterkamp
- Department of Clinical Diagnostics Laboratories, University Medical Center Utrecht, Utrecht University, Utrecht, The Netherlands
| | - Steffen E. Petersen
- Health Data Research UK, London, UK
- Barts Heart Centre, Barts Health NHS Trust, London, UK
- William Harvey Research Institute, NIHR Barts Biomedical Research Centre, Queen Mary University of London, London, UK
- The Alan Turing Institute, London, UK
| | - Carsten Oliver Schmidt
- Institute for Community Medicine, Department SHIP-KEF, Greifswald University Medical Center, Greifswald, Germany
| | - Jeanette Schulz-Menger
- Working Group on Cardiovascular Magnetic Resonance, Experimental and Clinical Research Center, a joint cooperation between the Charité - Universitätsmedizin Berlin and the Max-Delbrück-Center for Molecular Medicine, Berlin, Germany
- DZHK (German Centre for Cardiovascular Research) partner site, Berlin, Germany
- Department of Cardiology and Nephrology, HELIOS Hospital Berlin-Buch, Berlin, Germany
| | - Stefan Söderberg
- Department of Public Health and Clinical Medicine, Heart Centre, Umeå University, Umeå, Sweden
| | - Mahsa Shabani
- METAMEDICA, Department of Law and Criminology, Ghent University, Ghent, Belgium
| | - Giovanni Veronesi
- Research Center in Epidemiology and Preventive Medicine (EPIMED), Department of Medicine and Surgery, University of Insubria in Varese, Varese, Italy
| | - Darian Steven Viezzer
- Working Group on Cardiovascular Magnetic Resonance, Experimental and Clinical Research Center, a joint cooperation between the Charité - Universitätsmedizin Berlin and the Max-Delbrück-Center for Molecular Medicine, Berlin, Germany
- DZHK (German Centre for Cardiovascular Research) partner site, Berlin, Germany
| | - Pascal Borry
- Centre for Biomedical Ethics and Law, Department of Public Health and Primary Care, KU Leuven, Leuven, Belgium
| |
Collapse
|
25
|
Role of Biobanks for Cancer Research and Precision Medicine in Hepatocellular Carcinoma. J Gastrointest Cancer 2021; 52:1232-1247. [PMID: 34807351 DOI: 10.1007/s12029-021-00759-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/05/2021] [Indexed: 10/19/2022]
Abstract
INTRODUCTION Hepatocellular carcinoma (HCC) is a highly complex and deadly cancer. There is an urgent need for new and effective treatment modalities. Since the primary goal in the management of cancer is to cure and improve survival, personalized therapy can increase survival, reduce mortality rates, and improve quality of life. Biobanks hold potential in leading to breakthroughs in biomedical research and precision medicine (PM). They serve as a biorepository, collecting, processing, storing, and supplying specimens and relevant data for basic, translational, and clinical research. OBJECTIVE We aimed to highlight the fundamental role of biobanks, harboring high quality, sustainable collections of patient samples in adequate size and variability, for developing diagnostic, prognostic, and predictive biomarkers to develop and PM approaches in the management of HCC. METHOD We obtained information from previously published articles and BBMRI directory. RESULTS AND CONCLUSION Biobanking of high-quality biospecimens along with patient clinical information provides a fundamental scientific infrastructure for basic, translational, and clinical research. Biobanks that control and eliminate pre-analytical variability of biospecimens, provide a platform to identify reliable biomarkers for the application of PM. We believe, establishing HCC biobanks will empower to underpin molecular mechanisms of HCC and generate strategies for PM. Thus, first, we will review current therapy approaches in HCC care. Then, we will summarize challenges in HCC management. Lastly, we will focus on the best practices for establishing HCC biobanking to support research, translational medicine in the light of new experimental research conducted with the aim of delivering PM for HCC patients.
Collapse
|
26
|
Alharbi E, Skeva R, Juty N, Jay C, Goble C. Exploring the Current Practices, Costs and Benefits of FAIR
Implementation in Pharmaceutical Research and Development: A Qualitative
Interview Study. DATA INTELLIGENCE 2021. [DOI: 10.1162/dint_a_00109] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023] Open
Abstract
The findable, accessible, interoperable, reusable (FAIR) principles for scientific data management and stewardship aim to facilitate data reuse at scale by both humans and machines. Research and development (R&D) in the pharmaceutical industry is becoming increasingly data driven, but managing its data assets according to FAIR principles remains costly and challenging. To date, little scientific evidence exists about how FAIR is currently implemented in practice, what its associated costs and benefits are, and how decisions are made about the retrospective FAIRification of data sets in pharmaceutical R&D. This paper reports the results of semi-structured interviews with 14 pharmaceutical professionals who participate in various stages of drug R&D in seven pharmaceutical businesses. Inductive thematic analysis identified three primary themes of the benefits and costs of FAIRification, and the elements that influence the decision-making process for FAIRifying legacy data sets. Participants collectively acknowledged the potential contribution of FAIRification to data reusability in diverse research domains and the subsequent potential for cost-savings. Implementation costs, however, were still considered a barrier by participants, with the need for considerable expenditure in terms of resources, and cultural change. How decisions were made about FAIRification was influenced by legal and ethical considerations, management commitment, and data prioritisation. The findings have significant implications for those in the pharmaceutical R&D industry who are engaged in driving FAIR implementation, and for external parties who seek to better understand existing practices and challenges.
Collapse
Affiliation(s)
- Ebtisam Alharbi
- School of Computer Science, University of Manchester, Manchester, Manchester M13 9PL, UK
- College of Computer and Information Systems, Umm Al-Qura University, Mecca, Makkah 21421, Saudi Arabia
| | - Rigina Skeva
- School of Computer Science, University of Manchester, Manchester, Manchester M13 9PL, UK
| | - Nick Juty
- School of Computer Science, University of Manchester, Manchester, Manchester M13 9PL, UK
| | - Caroline Jay
- School of Computer Science, University of Manchester, Manchester, Manchester M13 9PL, UK
| | - Carole Goble
- School of Computer Science, University of Manchester, Manchester, Manchester M13 9PL, UK
| |
Collapse
|
27
|
Read KB, Ganshorn H, Rutley S, Scott DR. Data-sharing practices in publications funded by the Canadian Institutes of Health Research: a descriptive analysis. CMAJ Open 2021; 9:E980-E987. [PMID: 34753787 PMCID: PMC8580829 DOI: 10.9778/cmajo.20200303] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
BACKGROUND As Canada increases requirements for research data management and sharing, there is value in identifying how research data are shared and what has been done to make them findable and reusable. This study aimed to understand Canada's data-sharing landscape by reviewing how data funded by the Canadian Institutes of Health Research (CIHR) are shared and comparing researchers' data-sharing practices to best practices for research data management and sharing. METHODS We performed a descriptive analysis of CIHR-funded publications from PubMed and PubMed Central published between 1946 and Dec. 31, 2019, that indicated that the research data underlying the results of the publication were shared. We analyzed each publication to identify how and where data were shared, who shared data and what documentation was included to support data reuse. RESULTS Of 4144 CIHR-funded publications identified, 1876 (45.2%) included accessible data, 935 (22.6%) stated that data were available via request or application, and 300 (7.2%) stated that data sharing was not applicable or possible; we found no evidence of data sharing in 1558 publications (37.6%). Frequent data-sharing methods included via a repository (1549 [37.4%]), within supplementary files (1048 [25.3%]) and via request or application (935 [22.6%]). Overall, 554 publications (13.4%) included documentation that would facilitate data reuse. INTERPRETATION Publications funded by the CIHR largely lack the metadata, access instructions and documentation to facilitate data discovery and reuse. Without measures to address these concerns and enhanced support for researchers seeking to implement best practices for research data management and sharing, much CIHR-funded research data will remain hidden, inaccessible and unusable.
Collapse
Affiliation(s)
- Kevin B Read
- Leslie & Irene Dubé Health Sciences Library (Read), University of Saskatchewan, Saskatoon, Sask.; Taylor Family Digital Library (Ganshorn), University of Calgary, Calgary, Alta.; University Library (Rutley), University of Saskatchewan, Saskatoon, Sask.; University Library (Scott), University of Lethbridge, Lethbridge, Alta.
| | - Heather Ganshorn
- Leslie & Irene Dubé Health Sciences Library (Read), University of Saskatchewan, Saskatoon, Sask.; Taylor Family Digital Library (Ganshorn), University of Calgary, Calgary, Alta.; University Library (Rutley), University of Saskatchewan, Saskatoon, Sask.; University Library (Scott), University of Lethbridge, Lethbridge, Alta
| | - Sarah Rutley
- Leslie & Irene Dubé Health Sciences Library (Read), University of Saskatchewan, Saskatoon, Sask.; Taylor Family Digital Library (Ganshorn), University of Calgary, Calgary, Alta.; University Library (Rutley), University of Saskatchewan, Saskatoon, Sask.; University Library (Scott), University of Lethbridge, Lethbridge, Alta
| | - David R Scott
- Leslie & Irene Dubé Health Sciences Library (Read), University of Saskatchewan, Saskatoon, Sask.; Taylor Family Digital Library (Ganshorn), University of Calgary, Calgary, Alta.; University Library (Rutley), University of Saskatchewan, Saskatoon, Sask.; University Library (Scott), University of Lethbridge, Lethbridge, Alta
| |
Collapse
|
28
|
Cano M, Tsueng G, Zhou X, Hughes LD, Mullen JL, Xin J, Su AI, Wu C. Schema Playground: A tool for authoring, extending, and using metadata schemas to improve FAIRness of biomedical data.. [PMID: 35677074 PMCID: PMC9176648 DOI: 10.1101/2021.09.02.458726] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Background: Biomedical researchers are strongly encouraged to make their research outputs more Findable, Accessible, Interoperable, and Reusable (FAIR). While many biomedical research outputs are more readily accessible through open data efforts, finding relevant outputs remains a significant challenge. Schema.org is a metadata vocabulary standardization project that enables web content creators to make their content more FAIR. Leveraging schema.org could benefit biomedical research resource providers, but it can be challenging to apply schema.org standards to biomedical research outputs. We created an online browser-based tool that empowers researchers and repository developers to utilize schema.org or other biomedical schema projects. Results: Our browser-based tool includes features which can help address many of the barriers towards schema.org-compliance such as: The ability to easily browse for relevant schema.org classes, the ability to extend and customize a class to be more suitable for biomedical research outputs, the ability to create data validation to ensure adherence of a research output to a customized class, and the ability to register a custom class to our schema registry enabling others to search and re-use it. We demonstrate the use of our tool with the creation of the Outbreak.info schema—a large multi-class schema for harmonizing various COVID-19 related resources. Conclusions: We have created a browser-based tool to empower biomedical research resource providers to leverage schema.org classes to make their research outputs more FAIR.
Collapse
|
29
|
Mayer G, Müller W, Schork K, Uszkoreit J, Weidemann A, Wittig U, Rey M, Quast C, Felden J, Glöckner FO, Lange M, Arend D, Beier S, Junker A, Scholz U, Schüler D, Kestler HA, Wibberg D, Pühler A, Twardziok S, Eils J, Eils R, Hoffmann S, Eisenacher M, Turewicz M. Implementing FAIR data management within the German Network for Bioinformatics Infrastructure (de.NBI) exemplified by selected use cases. Brief Bioinform 2021; 22:bbab010. [PMID: 33589928 PMCID: PMC8425304 DOI: 10.1093/bib/bbab010] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2020] [Revised: 12/21/2020] [Accepted: 01/06/2021] [Indexed: 12/21/2022] Open
Abstract
This article describes some use case studies and self-assessments of FAIR status of de.NBI services to illustrate the challenges and requirements for the definition of the needs of adhering to the FAIR (findable, accessible, interoperable and reusable) data principles in a large distributed bioinformatics infrastructure. We address the challenge of heterogeneity of wet lab technologies, data, metadata, software, computational workflows and the levels of implementation and monitoring of FAIR principles within the different bioinformatics sub-disciplines joint in de.NBI. On the one hand, this broad service landscape and the excellent network of experts are a strong basis for the development of useful research data management plans. On the other hand, the large number of tools and techniques maintained by distributed teams renders FAIR compliance challenging.
Collapse
Affiliation(s)
- Gerhard Mayer
- Ruhr University Bochum, Faculty of Medicine, Medizinisches Proteom-Center, Bochum, Germany
- Ruhr University Bochum, Center for Protein Diagnostics (ProDi), Medical Proteome Analysis, Bochum, Germany
- Ulm University, Institute of Medical Systems Biology, Ulm, Germany
| | - Wolfgang Müller
- Heidelberg Institute for Theoretical Studies (HITS gGmbH), Scientific Databases and Visualization Group, Heidelberg, Germany
| | - Karin Schork
- Ruhr University Bochum, Faculty of Medicine, Medizinisches Proteom-Center, Bochum, Germany
- Ruhr University Bochum, Center for Protein Diagnostics (ProDi), Medical Proteome Analysis, Bochum, Germany
| | - Julian Uszkoreit
- Ruhr University Bochum, Faculty of Medicine, Medizinisches Proteom-Center, Bochum, Germany
- Ruhr University Bochum, Center for Protein Diagnostics (ProDi), Medical Proteome Analysis, Bochum, Germany
| | - Andreas Weidemann
- Heidelberg Institute for Theoretical Studies (HITS gGmbH), Scientific Databases and Visualization Group, Heidelberg, Germany
| | - Ulrike Wittig
- Heidelberg Institute for Theoretical Studies (HITS gGmbH), Scientific Databases and Visualization Group, Heidelberg, Germany
| | - Maja Rey
- Heidelberg Institute for Theoretical Studies (HITS gGmbH), Scientific Databases and Visualization Group, Heidelberg, Germany
| | | | - Janine Felden
- Jacobs University Bremen gGmbH, Bremen, Germany
- University of Bremen, MARUM - Center for Marine Environmental Sciences, Bremen, Germany
| | - Frank Oliver Glöckner
- Jacobs University Bremen gGmbH, Bremen, Germany
- University of Bremen, MARUM - Center for Marine Environmental Sciences, Bremen, Germany
- Alfred Wegener Institute - Helmholtz Center for Polar- and Marine Research, Bremerhaven, Germany
| | - Matthias Lange
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany
| | - Daniel Arend
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany
| | - Sebastian Beier
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany
| | - Astrid Junker
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany
| | - Uwe Scholz
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany
| | - Danuta Schüler
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, Germany
| | - Hans A Kestler
- Ulm University, Institute of Medical Systems Biology, Ulm, Germany
- Leibniz Institute on Ageing - Fritz Lipmann Institute, Jena
| | - Daniel Wibberg
- Bielefeld University, Center for Biotechnology (CeBiTec), Bielefeld, Germany
| | - Alfred Pühler
- Bielefeld University, Center for Biotechnology (CeBiTec), Bielefeld, Germany
| | - Sven Twardziok
- Charité - Universitätsmedizin Berlin, corporate member of Freie Universität Berlin, Humboldt-Universität zu Berlin, and Berlin Institute of Health (BIH), Center for Digital Health, Berlin, Germany
| | - Jürgen Eils
- Charité - Universitätsmedizin Berlin, corporate member of Freie Universität Berlin, Humboldt-Universität zu Berlin, and Berlin Institute of Health (BIH), Center for Digital Health, Berlin, Germany
| | - Roland Eils
- Charité - Universitätsmedizin Berlin, corporate member of Freie Universität Berlin, Humboldt-Universität zu Berlin, and Berlin Institute of Health (BIH), Center for Digital Health, Berlin, Germany
- Heidelberg University Hospital and BioQuant, Health Data Science Unit, Heidelberg, Germany
| | - Steve Hoffmann
- Leibniz Institute on Ageing - Fritz Lipmann Institute, Jena
| | - Martin Eisenacher
- Ruhr University Bochum, Faculty of Medicine, Medizinisches Proteom-Center, Bochum, Germany
- Ruhr University Bochum, Center for Protein Diagnostics (ProDi), Medical Proteome Analysis, Bochum, Germany
| | - Michael Turewicz
- Ruhr University Bochum, Faculty of Medicine, Medizinisches Proteom-Center, Bochum, Germany
- Ruhr University Bochum, Center for Protein Diagnostics (ProDi), Medical Proteome Analysis, Bochum, Germany
| |
Collapse
|
30
|
Meyer A, Faverjon C, Hostens M, Stegeman A, Cameron A. Systematic review of the status of veterinary epidemiological research in two species regarding the FAIR guiding principles. BMC Vet Res 2021; 17:270. [PMID: 34380468 PMCID: PMC8355576 DOI: 10.1186/s12917-021-02971-1] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2021] [Accepted: 07/06/2021] [Indexed: 01/08/2023] Open
Abstract
Background The FAIR (Findable, Accessible, Interoperable, Reusable) principles were proposed in 2016 to set a path towards reusability of research datasets. In this systematic review, we assessed the FAIRness of datasets associated with peer-reviewed articles in veterinary epidemiology research published since 2017, specifically looking at salmonids and dairy cattle. We considered the differences in practices between molecular epidemiology, the branch of epidemiology using genetic sequences of pathogens and hosts to describe disease patterns, and non-molecular epidemiology. Results A total of 152 articles were included in the assessment. Consistent with previous assessments conducted in other disciplines, our results showed that most datasets used in non-molecular epidemiological studies were not available (i.e., neither findable nor accessible). Data availability was much higher for molecular epidemiology papers, in line with a strong repository base available to scientists in this discipline. The available data objects generally scored favourably for Findable, Accessible and Reusable indicators, but Interoperability was more problematic. Conclusions None of the datasets assessed in this study met all the requirements set by the FAIR principles. Interoperability, in particular, requires specific skills in data management which may not yet be broadly available in the epidemiology community. In the discussion, we present recommendations on how veterinary research could move towards greater reusability according to FAIR principles. Overall, although many initiatives to improve data access have been started in the research community, their impact on the availability of datasets underlying published articles remains unclear to date. Supplementary Information The online version contains supplementary material available at 10.1186/s12917-021-02971-1.
Collapse
Affiliation(s)
- Anne Meyer
- Ausvet Europe, 3 rue Camille Jordan, 69001, Lyon, France. .,Department of Farm Animal Health, Utrecht University, 3512 JE, Utrecht, the Netherlands.
| | | | - Miel Hostens
- Department of Farm Animal Health, Utrecht University, 3512 JE, Utrecht, the Netherlands
| | - Arjan Stegeman
- Department of Farm Animal Health, Utrecht University, 3512 JE, Utrecht, the Netherlands
| | - Angus Cameron
- Ausvet Europe, 3 rue Camille Jordan, 69001, Lyon, France
| |
Collapse
|
31
|
Sharma A, Nilsen TB, Czerwinska KP, Onitiu D, Brenna L, Johansen D, Johansen HD. Up-to-the-Minute Privacy Policies via Gossips in Participatory Epidemiological Studies. Front Big Data 2021; 4:624424. [PMID: 34056584 PMCID: PMC8155614 DOI: 10.3389/fdata.2021.624424] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2020] [Accepted: 04/01/2021] [Indexed: 12/03/2022] Open
Abstract
Researchers and researched populations are actively involved in participatory epidemiology. Such studies collect many details about an individual. Recent developments in statistical inferences can lead to sensitive information leaks from seemingly insensitive data about individuals. Typical safeguarding mechanisms are vetted by ethics committees; however, the attack models are constantly evolving. Newly discovered threats, change in applicable laws or an individual's perception can raise concerns that affect the study. Addressing these concerns is imperative to maintain trust with the researched population. We are implementing Lohpi: an infrastructure for building accountability in data processing for participatory epidemiology. We address the challenge of data-ownership by allowing institutions to host data on their managed servers while being part of Lohpi. We update data access policies using gossips. We present Lohpi as a novel architecture for research data processing and evaluate the dissemination, overhead, and fault-tolerance.
Collapse
Affiliation(s)
- Aakash Sharma
- Department of Computer Science, UiT The Arctic University of Norway, Tromsø, Norway
| | - Thomas Bye Nilsen
- Department of Computer Science, UiT The Arctic University of Norway, Tromsø, Norway
| | - Katja Pauline Czerwinska
- Faculty of Design, Computer Science, Media, RheinMain University of Applied Sciences, Wiesbaden, Germany
| | - Daria Onitiu
- Northumbria Law School, Northumbria University, Newcastle upon Tyne, United Kingdom
| | - Lars Brenna
- Department of Computer Science, UiT The Arctic University of Norway, Tromsø, Norway
| | - Dag Johansen
- Department of Computer Science, UiT The Arctic University of Norway, Tromsø, Norway
| | - Håvard D Johansen
- Department of Computer Science, UiT The Arctic University of Norway, Tromsø, Norway
| |
Collapse
|
32
|
Martani A, Geneviève LD, Egli SM, Erard F, Wangmo T, Elger BS. Evolution or Revolution? Recommendations to Improve the Swiss Health Data Framework. Front Public Health 2021; 9:668386. [PMID: 34136456 PMCID: PMC8200489 DOI: 10.3389/fpubh.2021.668386] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2021] [Accepted: 04/29/2021] [Indexed: 11/23/2022] Open
Abstract
Background: Facilitating access to health data for public health and research purposes is an important element in the health policy agenda of many countries. Improvements in this sense can only be achieved with the development of an appropriate data infrastructure and the implementations of policies that also respect societal preferences. Switzerland is a revealing example of a country that has been struggling to achieve this aim. The objective of the study is to reflect on stakeholders' recommendations on how to improve the health data framework of this country. Methods: We analysed the recommendations collected as part of a qualitative study including 48 expert stakeholders from Switzerland that have been working principally with health databases. Recommendations were divided in themes and subthemes according to applied thematic analysis. Results: Stakeholders recommended several potential improvements of the health data framework in Switzerland. At the general level of mind-set and attitude, they suggested to foster the development of an explicit health data strategy, better communication and the respect of societal preferences. In terms of infrastructure, there were calls for the creation of a national data center, the improvement of IT solutions and the use of a Unique Identifier for patient data. Lastly, they recommended harmonising procedures for data access and to clarify data protection and consent rules. Conclusion: Recommendations show several potential improvements of the health data framework, but they have to be reconciled with existing policies, infrastructures and ethico-legal limitations. Achieving a gradual implementation of the recommended solutions is the preferable way forward for Switzerland and a lesson for other countries that are also seeking to improve health data access for public health and research purposes.
Collapse
Affiliation(s)
- Andrea Martani
- Institute for Biomedical Ethics, University of Basel, Basel, Switzerland
| | | | - Sophia Mira Egli
- Master Student, Faculty of Medicine, University of Basel, Basel, Switzerland
| | - Frédéric Erard
- SIB Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Tenzin Wangmo
- Institute for Biomedical Ethics, University of Basel, Basel, Switzerland
| | - Bernice Simone Elger
- Institute for Biomedical Ethics, University of Basel, Basel, Switzerland.,University Center of Legal Medicine, University of Geneva, Geneva, Switzerland
| |
Collapse
|
33
|
Open Research Data and Open Peer Review: Perceptions of a Medical and Health Sciences Community in Greece. PUBLICATIONS 2021. [DOI: 10.3390/publications9020014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
Recently significant initiatives have been launched for the dissemination of Open Access as part of the Open Science movement. Nevertheless, two other major pillars of Open Science such as Open Research Data (ORD) and Open Peer Review (OPR) are still in an early stage of development among the communities of researchers and stakeholders. The present study sought to unveil the perceptions of a medical and health sciences community about these issues. Through the investigation of researchers‘ attitudes, valuable conclusions can be drawn, especially in the field of medicine and health sciences, where an explosive growth of scientific publishing exists. A quantitative survey was conducted based on a structured questionnaire, with 179 valid responses. The participants in the survey agreed with the Open Peer Review principles. However, they ignored basic terms like FAIR (Findable, Accessible, Interoperable, and Reusable) and appeared incentivized to permit the exploitation of their data. Regarding Open Peer Review (OPR), participants expressed their agreement, implying their support for a trustworthy evaluation system. Conclusively, researchers need to receive proper training for both Open Research Data principles and Open Peer Review processes which combined with a reformed evaluation system will enable them to take full advantage of the opportunities that arise from the new scholarly publishing and communication landscape.
Collapse
|
34
|
Benis A, Tamburis O, Chronaki C, Moen A. One Digital Health: A Unified Framework for Future Health Ecosystems. J Med Internet Res 2021; 23:e22189. [PMID: 33492240 PMCID: PMC7886486 DOI: 10.2196/22189] [Citation(s) in RCA: 40] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2020] [Revised: 08/09/2020] [Accepted: 01/24/2021] [Indexed: 12/13/2022] Open
Abstract
One Digital Health is a proposed unified structure. The conceptual framework of the One Digital Health Steering Wheel is built around two keys (ie, One Health and digital health), three perspectives (ie, individual health and well-being, population and society, and ecosystem), and five dimensions (ie, citizens’ engagement, education, environment, human and veterinary health care, and Healthcare Industry 4.0). One Digital Health aims to digitally transform future health ecosystems, by implementing a systemic health and life sciences approach that takes into account broad digital technology perspectives on human health, animal health, and the management of the surrounding environment. This approach allows for the examination of how future generations of health informaticians can address the intrinsic complexity of novel health and care scenarios in digitally transformed health ecosystems. In the emerging hybrid landscape, citizens and their health data have been called to play a central role in the management of individual-level and population-level perspective data. The main challenges of One Digital Health include facilitating and improving interactions between One Health and digital health communities, to allow for efficient interactions and the delivery of near–real-time, data-driven contributions in systems medicine and systems ecology. However, digital health literacy; the capacity to understand and engage in health prevention activities; self-management; and collaboration in the prevention, control, and alleviation of potential problems are necessary in systemic, ecosystem-driven public health and data science research. Therefore, people in a healthy One Digital Health ecosystem must use an active and forceful approach to prevent and manage health crises and disasters, such as the COVID-19 pandemic.
Collapse
Affiliation(s)
- Arriel Benis
- Faculty of Technology Management, Holon Institute of Technology, Holon, Israel.,Faculty of Digital Medical Technologies, Holon Institute of Technology, Holon, Israel
| | - Oscar Tamburis
- Department of Veterinary Medicine and Animal Productions, University of Naples Federico II, Naples, Italy
| | | | - Anne Moen
- Faculty of Medicine, Institute for Health and Society, University of Oslo, Oslo, Norway
| |
Collapse
|
35
|
Inau ET, Sack J, Waltemath D, Zeleke AA. Initiatives, Concepts, and Implementation Practices of FAIR (Findable, Accessible, Interoperable, and Reusable) Data Principles in Health Data Stewardship Practice: Protocol for a Scoping Review. JMIR Res Protoc 2021; 10:e22505. [PMID: 33528373 PMCID: PMC7886612 DOI: 10.2196/22505] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2020] [Revised: 12/02/2020] [Accepted: 12/08/2020] [Indexed: 01/21/2023] Open
Abstract
Background Data stewardship is an essential driver of research and clinical practice. Data collection, storage, access, sharing, and analytics are dependent on the proper and consistent use of data management principles among the investigators. Since 2016, the FAIR (findable, accessible, interoperable, and reusable) guiding principles for research data management have been resonating in scientific communities. Enabling data to be findable, accessible, interoperable, and reusable is currently believed to strengthen data sharing, reduce duplicated efforts, and move toward harmonization of data from heterogeneous unconnected data silos. FAIR initiatives and implementation trends are rising in different facets of scientific domains. It is important to understand the concepts and implementation practices of the FAIR data principles as applied to human health data by studying the flourishing initiatives and implementation lessons relevant to improved health research, particularly for data sharing during the coronavirus pandemic. Objective This paper aims to conduct a scoping review to identify concepts, approaches, implementation experiences, and lessons learned in FAIR initiatives in the health data domain. Methods The Arksey and O’Malley stage-based methodological framework for scoping reviews will be used for this review. PubMed, Web of Science, and Google Scholar will be searched to access relevant primary and grey publications. Articles written in English and published from 2014 onwards with FAIR principle concepts or practices in the health domain will be included. Duplication among the 3 data sources will be removed using a reference management software. The articles will then be exported to a systematic review management software. At least two independent authors will review the eligibility of each article based on defined inclusion and exclusion criteria. A pretested charting tool will be used to extract relevant information from the full-text papers. Qualitative thematic synthesis analysis methods will be employed by coding and developing themes. Themes will be derived from the research questions and contents in the included papers. Results The results will be reported using the PRISMA-ScR (Preferred Reporting Items for Systematic Reviews and Meta-analyses Extension for Scoping Reviews) reporting guidelines. We anticipate finalizing the manuscript for this work in 2021. Conclusions We believe comprehensive information about the FAIR data principles, initiatives, implementation practices, and lessons learned in the FAIRification process in the health domain is paramount to supporting both evidence-based clinical practice and research transparency in the era of big data and open research publishing. International Registered Report Identifier (IRRID) PRR1-10.2196/22505
Collapse
Affiliation(s)
- Esther Thea Inau
- Medical Informatics, Institute for Community Medicine, University Medicine Greifswald, Greifswald, Germany
| | - Jean Sack
- International Health Department, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, United States
| | - Dagmar Waltemath
- Medical Informatics, Institute for Community Medicine, University Medicine Greifswald, Greifswald, Germany
| | - Atinkut Alamirrew Zeleke
- Medical Informatics, Institute for Community Medicine, University Medicine Greifswald, Greifswald, Germany
| |
Collapse
|
36
|
Denecke K. Biomedical Standards and Open Health Data. SYSTEMS MEDICINE 2021. [DOI: 10.1016/b978-0-12-801238-3.11527-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022] Open
|
37
|
Koesten L, Vougiouklis P, Simperl E, Groth P. Dataset Reuse: Toward Translating Principles to Practice. PATTERNS (NEW YORK, N.Y.) 2020; 1:100136. [PMID: 33294873 PMCID: PMC7691392 DOI: 10.1016/j.patter.2020.100136] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/28/2020] [Revised: 08/28/2020] [Accepted: 10/12/2020] [Indexed: 10/28/2022]
Abstract
The web provides access to millions of datasets that can have additional impact when used beyond their original context. We have little empirical insight into what makes a dataset more reusable than others and which of the existing guidelines and frameworks, if any, make a difference. In this paper, we explore potential reuse features through a literature review and present a case study on datasets on GitHub, a popular open platform for sharing code and data. We describe a corpus of more than 1.4 million data files, from over 65,000 repositories. Using GitHub's engagement metrics as proxies for dataset reuse, we relate them to reuse features from the literature and devise an initial model, using deep neural networks, to predict a dataset's reusability. This demonstrates the practical gap between principles and actionable insights that allow data publishers and tools designers to implement functionalities that provably facilitate reuse.
Collapse
Affiliation(s)
| | | | | | - Paul Groth
- University of Amsterdam, Amsterdam 1090 GH, the Netherlands
| |
Collapse
|
38
|
Salgado D, Armean IM, Baudis M, Beltran S, Capella-Gutierrez S, Carvalho-Silva D, Dominguez Del Angel V, Dopazo J, Furlong LI, Gao B, Garcia L, Gerloff D, Gut I, Gyenesei A, Habermann N, Hancock JM, Hanauer M, Hovig E, Johansson LF, Keane T, Korbel J, Lauer KB, Laurie S, Leskošek B, Lloyd D, Marques-Bonet T, Mei H, Monostory K, Piñero J, Poterlowicz K, Rath A, Samarakoon P, Sanz F, Saunders G, Sie D, Swertz MA, Tsukanov K, Valencia A, Vidak M, Yenyxe González C, Ylstra B, Béroud C. The ELIXIR Human Copy Number Variations Community: building bioinformatics infrastructure for research. F1000Res 2020; 9. [PMID: 34367618 PMCID: PMC8311797 DOI: 10.12688/f1000research.24887.1] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 07/27/2020] [Indexed: 02/02/2023] Open
Abstract
Copy number variations (CNVs) are major causative contributors both in the genesis of genetic diseases and human neoplasias. While “High-Throughput” sequencing technologies are increasingly becoming the primary choice for genomic screening analysis, their ability to efficiently detect CNVs is still heterogeneous and remains to be developed. The aim of this white paper is to provide a guiding framework for the future contributions of ELIXIR’s recently established
human CNV Community, with implications beyond human disease diagnostics and population genomics. This white paper is the direct result of a strategy meeting that took place in September 2018 in Hinxton (UK) and involved representatives of 11 ELIXIR Nodes. The meeting led to the definition of priority objectives and tasks, to address a wide range of CNV-related challenges ranging from detection and interpretation to sharing and training. Here, we provide suggestions on how to align these tasks within the ELIXIR Platforms strategy, and on how to frame the activities of this new ELIXIR Community in the international context.
Collapse
Affiliation(s)
| | - Irina M Armean
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK
| | - Michael Baudis
- Department of Molecular Life Sciences and Swiss Institute of Bioinformatics, University of Zurich, Zurich, Switzerland
| | - Sergi Beltran
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Baldiri Reixac 4, Barcelona 08028, Spain.,Universitat Pompeu Fabra (UPF), Barcelona, Spain
| | - Salvador Capella-Gutierrez
- Barcelona Supercomputing Center (BSC), Barcelona, Spain.,Spanish National Bioinformatics Institute (INB)/ELIXIR-ES, Barcelona, Spain
| | - Denise Carvalho-Silva
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK.,Open Targets, Wellcome Genome Campus, Hinxton, Cambridgeshire, CB10 1SD, UK
| | | | - Joaquin Dopazo
- Clinical Bioinformatics Area, Fundación Progreso y Salud, CDCA, Hospital Virgen del Rocio, Sevilla, Spain
| | - Laura I Furlong
- Research Programme on Biomedical Informatics (GRIB), Hospital del Mar Medical Research Institute (IMIM), Department of Experimental and Health Sciences, Pompeu Fabra University (UPF), Barcelona, Spain
| | - Bo Gao
- Department of Molecular Life Sciences and Swiss Institute of Bioinformatics, University of Zurich, Zurich, Switzerland
| | - Leyla Garcia
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK.,ZB MED Information Centre for Life Sciences, Cologne, Germany.,ELIXIR Hub, Hinxton, UK
| | - Dietlind Gerloff
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Belvaux, Luxembourg
| | - Ivo Gut
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Baldiri Reixac 4, Barcelona 08028, Spain.,Universitat Pompeu Fabra (UPF), Barcelona, Spain
| | - Attila Gyenesei
- Szentágothai Research Center, University of Pécs, Pécs, Hungary
| | - Nina Habermann
- Genome Biology, European Molecular Biological Laboratory, Heidelberg, Germany
| | | | | | - Eivind Hovig
- Department of Tumor Biology, Institute for Cancer Research, Oslo University Hospital, Oslo, Norway.,Centre for bioinformatics, Department of Informatics, University of Oslo, Oslo, Norway
| | - Lennart F Johansson
- Department of Genetics, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands
| | - Thomas Keane
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK
| | - Jan Korbel
- Genome Biology, European Molecular Biological Laboratory, Heidelberg, Germany
| | | | - Steve Laurie
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Baldiri Reixac 4, Barcelona 08028, Spain
| | - Brane Leskošek
- Faculty of Medicine - ELIXIR Slovenia, University of Ljubljana, Ljubljana, Slovenia
| | | | - Tomas Marques-Bonet
- Institute of Evolutionary Biology (UPF-CSIC), Catalan Institution for Research and Advanced Studies, Barcelona, Spain
| | - Hailiang Mei
- Sequencing Analysis Support Core, Leiden University Medical Center, Leiden, The Netherlands
| | - Katalin Monostory
- Institute of Enzymology, Research Centre for Natural Sciences, Budapest, Hungary
| | - Janet Piñero
- Research Programme on Biomedical Informatics (GRIB), Hospital del Mar Medical Research Institute (IMIM), Department of Experimental and Health Sciences, Pompeu Fabra University (UPF), Barcelona, Spain
| | | | | | - Pubudu Samarakoon
- Department of Medical Genetics, Oslo University Hospital, Oslo, Norway
| | - Ferran Sanz
- Research Programme on Biomedical Informatics (GRIB), Hospital del Mar Medical Research Institute (IMIM), Department of Experimental and Health Sciences, Pompeu Fabra University (UPF), Barcelona, Spain
| | | | - Daoud Sie
- Department of Clinical Genetics, Cancer Center Amsterdam, Amsterdam UMC, Vrije Universiteit Amsterdam, Amsterdam, The Netherlands
| | - Morris A Swertz
- Department of Genetics, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands
| | - Kirill Tsukanov
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK
| | - Alfonso Valencia
- Barcelona Supercomputing Center (BSC), Barcelona, Spain.,Spanish National Bioinformatics Institute (INB)/ELIXIR-ES, Barcelona, Spain.,Catalan Institution of Research and Advanced Studies, Barcelona, Spain
| | - Marko Vidak
- Faculty of Medicine - ELIXIR Slovenia, University of Ljubljana, Ljubljana, Slovenia
| | - Cristina Yenyxe González
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, UK
| | - Bauke Ylstra
- Department of Pathology, Cancer Center Amsterdam, Amsterdam UMC, Vrije Universiteit Amsterdam, Amsterdam, The Netherlands
| | - Christophe Béroud
- Aix Marseille Univ, INSERM, MMG, Marseille, France.,Département de Génétique Médicale et de Biologie Cellulaire, APHM, Hôpital d'enfants de la Timone, 13385 Marseille, France
| |
Collapse
|
39
|
Biobanks for life sciences and personalized medicine: importance of standardization, biosafety, biosecurity, and data management. Curr Opin Biotechnol 2020; 65:45-51. [DOI: 10.1016/j.copbio.2019.12.004] [Citation(s) in RCA: 32] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2019] [Accepted: 12/09/2019] [Indexed: 02/06/2023]
|
40
|
Holub P, Kozera L, Florindi F, van Enckevort E, Swertz M, Reihs R, Wutte A, Valík D, Mayrhofer MT. BBMRI-ERIC's contributions to research and knowledge exchange on COVID-19. Eur J Hum Genet 2020; 28:728-731. [PMID: 32444797 PMCID: PMC7242892 DOI: 10.1038/s41431-020-0634-8] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2020] [Revised: 04/15/2020] [Accepted: 04/20/2020] [Indexed: 11/22/2022] Open
Abstract
During the COVID-19 pandemic, the European biobanking infrastructure is in a unique position to preserve valuable biological material complemented with detailed data for future research purposes. Biobanks can be either integrated into healthcare, where preservation of the biological material is a fork in clinical routine diagnostics and medical treatment processes or they can also host prospective cohorts or material related to clinical trials. The paper discussed objectives of BBMRI-ERIC, the European research infrastructure established to facilitate access to quality-defined biological materials and data for research purposes, with respect to the COVID-19 crisis: (a) to collect information on available European as well as non-European COVID-19-relevant biobanking resources in BBMRI-ERIC Directory and to facilitate access to these via BBMRI-ERIC Negotiator platform; (b) to help harmonizing guidelines on how data and biological material is to be collected to maximize utility for future research, including large-scale data processing in artificial intelligence, by participating in activities such as COVID-19 Host Genetics Initiative; (c) to minimize risks for all involved parties dealing with (potentially) infectious material by developing recommendations and guidelines; (d) to provide a European-wide platform of exchange in relation to ethical, legal, and societal issues (ELSI) specific to the collection of biological material and data during the COVID-19 pandemic.
Collapse
Affiliation(s)
- Petr Holub
- BBMRI-ERIC, Neue Stiftingtalstraße 2/B/6, 8010, Graz, Austria. .,Institute of Computer Science, Masaryk University, Botanická 68a, 60200, Brno, Czech Republic.
| | - Lukasz Kozera
- BBMRI-ERIC, Neue Stiftingtalstraße 2/B/6, 8010, Graz, Austria
| | | | - Esther van Enckevort
- Department of Genetics CB50, University Medical Center Groningen & BBMRI-NL, Hanzeplein 1, 9713 GZ, Groningen, Netherlands
| | - Morris Swertz
- Department of Genetics CB50, University Medical Center Groningen & BBMRI-NL, Hanzeplein 1, 9713 GZ, Groningen, Netherlands
| | - Robert Reihs
- Medical University Graz & BBMRI.at, Auenbruggerpl. 2, 8036, Graz, Austria
| | - Andrea Wutte
- BBMRI-ERIC, Neue Stiftingtalstraße 2/B/6, 8010, Graz, Austria
| | - Dalibor Valík
- Masaryk Memorial Cancer Institute, Žlutý kopec 543/7, 602 00, Brno, Czech Republic
| | | | | |
Collapse
|
41
|
Mamo N, Martin GM, Desira M, Ellul B, Ebejer JP. Dwarna: a blockchain solution for dynamic consent in biobanking. Eur J Hum Genet 2020; 28:609-626. [PMID: 31844175 PMCID: PMC7170942 DOI: 10.1038/s41431-019-0560-9] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2019] [Revised: 11/13/2019] [Accepted: 11/26/2019] [Indexed: 11/08/2022] Open
Abstract
Dynamic consent aims to empower research partners and facilitate active participation in the research process. Used within the context of biobanking, it gives individuals access to information and control to determine how and where their biospecimens and data should be used. We present Dwarna-a web portal for 'dynamic consent' that acts as a hub connecting the different stakeholders of the Malta Biobank: biobank managers, researchers, research partners, and the general public. The portal stores research partners' consent in a blockchain to create an immutable audit trail of research partners' consent changes. Dwarna's structure also presents a solution to the European Union's General Data Protection Regulation's right to erasure-a right that is seemingly incompatible with the blockchain model. Dwarna's transparent structure increases trustworthiness in the biobanking process by giving research partners more control over which research studies they participate in, by facilitating the withdrawal of consent and by making it possible to request that the biospecimen and associated data are destroyed.
Collapse
Affiliation(s)
- Nicholas Mamo
- Centre for Molecular Medicine and Biobanking, Biomedical Sciences Building, University of Malta, Msida, MSD 2080, Malta
| | - Gillian M Martin
- Centre for Molecular Medicine and Biobanking, Biomedical Sciences Building, University of Malta, Msida, MSD 2080, Malta
- Department of Sociology, Faculty of Arts, University of Malta, Msida, MSD 2080, Malta
- BBMRI-ERIC, Neue Stiftingtalstraße 2/B/6, 8010, Graz, Austria
| | - Maria Desira
- Centre for Molecular Medicine and Biobanking, Biomedical Sciences Building, University of Malta, Msida, MSD 2080, Malta
| | - Bridget Ellul
- Department of Pathology, Faculty of Medicine and Surgery, University of Malta, Msida, MSD 2080, Malta
| | - Jean-Paul Ebejer
- Centre for Molecular Medicine and Biobanking, Biomedical Sciences Building, University of Malta, Msida, MSD 2080, Malta.
| |
Collapse
|
42
|
Wittenburg P, de Jong F, van Uytvanck D, Cocco M, Jeffery K, Lautenschlager M, Thiemann H, Hellström M, Asmi A, Holub P. State of FAIRness in ESFRI Projects. DATA INTELLIGENCE 2020. [DOI: 10.1162/dint_a_00045] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022] Open
Abstract
Since 2009 initiatives that were selected for the roadmap of the European Strategy Forum on Research Infrastructures started working to build research infrastructures for a wide range of research disciplines. An important result of the strategic discussions was that distributed infrastructure scenarios were now seen as “complex research facilities” in addition to, for example traditional centralised infrastructures such as CERN. In this paper we look at five typical examples of such distributed infrastructures where many researchers working in different centres are contributing data, tools/services and knowledge and where the major task of the research infrastructure initiative is to create a virtually integrated suite of resources allowing researchers to carry out state-of-the-art research. Careful analysis shows that most of these research infrastructures worked on the Findability, Accessibility, Interoperability and Reusability dimensions before the term “FAIR” was actually coined. The definition of the FAIR principles and their wide acceptance can be seen as a confirmation of what these initiatives were doing and it gives new impulse to close still existing gaps. These initiatives also seem to be ready to take up the next steps which will emerge from the definition of FAIR maturity indicators. Experts from these infrastructures should bring in their 10-years' experience in this definition process.
Collapse
Affiliation(s)
- Peter Wittenburg
- Max Planck Computing and Data Facility, Gießenbachstraße 2, 85748 Garching, Germany
| | | | | | - Massimo Cocco
- EPOS ERIC, Via di Vigna Murata 605, Rome 00143, Italy
| | | | | | - Hannes Thiemann
- DKRZ Ringgold standard institution, Bundesstr. 45a, Hamburg, Hamburg 20146, Germany
| | - Margareta Hellström
- Department of Physical Geography and Ecosystem, Ringgold standard institution, Lund University, Scienc Sölvegatan 12, Lund 22362, Sweden
| | - Ari Asmi
- University of Helsinki Ringgold Standard Institution – Institution of Atmospheric and Earth System Sciences, P.O. Box 64, Helsinki 00014, Finland
| | - Petr Holub
- BBMRI-ERIC Neue Stiftingtalstrasse 2/B/6, Graz 8010, Austria
| |
Collapse
|
43
|
Garg L, Chukwu E, Nasser N, Chakraborty C, Garg G. Anonymity Preserving IoT-Based COVID-19 and Other Infectious Disease Contact Tracing Model. IEEE ACCESS : PRACTICAL INNOVATIONS, OPEN SOLUTIONS 2020; 8:159402-159414. [PMID: 34786286 PMCID: PMC8545304 DOI: 10.1109/access.2020.3020513] [Citation(s) in RCA: 48] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/26/2020] [Accepted: 08/28/2020] [Indexed: 05/18/2023]
Abstract
Automated digital contact tracing is effective and efficient, and one of the non-pharmaceutical complementary approaches to mitigate and manage epidemics like Coronavirus disease 2019 (COVID-19). Despite the advantages of digital contact tracing, it is not widely used in the western world, including the US and Europe, due to strict privacy regulations and patient rights. We categorized the current approaches for contact tracing, namely: mobile service-provider-application, mobile network operators' call detail, citizen-application, and IoT-based. Current measures for infection control and tracing do not include animals and moving objects like cars despite evidence that these moving objects can be infection carriers. In this article, we designed and presented a novel privacy anonymous IoT model. We presented an RFID proof-of-concept for this model. Our model leverages blockchain's trust-oriented decentralization for on-chain data logging and retrieval. Our model solution will allow moving objects to receive or send notifications when they are close to a flagged, probable, or confirmed diseased case, or flagged place or object. We implemented and presented three prototype blockchain smart contracts for our model. We then simulated contract deployments and execution of functions. We presented the cost differentials. Our simulation results show less than one-second deployment and call time for smart contracts, though, in real life, it can be up to 25 seconds on Ethereum public blockchain. Our simulation results also show that it costs an average of $1.95 to deploy our prototype smart contracts, and an average of $0.34 to call our functions. Our model will make it easy to identify clusters of infection contacts and help deliver a notification for mass isolation while preserving individual privacy. Furthermore, it can be used to understand better human connectivity, model similar other infection spread network, and develop public policies to control the spread of COVID-19 while preparing for future epidemics.
Collapse
Affiliation(s)
- Lalit Garg
- Department of Computer Information System (CIS)Faculty of Information Communication Technology (ICT), University of Malta 2080 Msida Malta
| | - Emeka Chukwu
- Department of Computer Information System (CIS)Faculty of Information Communication Technology (ICT), University of Malta 2080 Msida Malta
| | - Nidal Nasser
- College of EngineeringAlfaisal University Riyadh 50927 Saudi Arabia
| | - Chinmay Chakraborty
- Department of Electronics and Communication EngineeringBirla Institute of Technology at Mesra Ranchi 835215 India
| | - Gaurav Garg
- ABV-Indian Institute of Information Technology and Management Gwalior 474015 India
| |
Collapse
|
44
|
Brewster C, Nouwt B, Raaijmakers S, Verhoosel J. Ontology-based Access Control for FAIR Data. DATA INTELLIGENCE 2020. [DOI: 10.1162/dint_a_00029] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022] Open
Abstract
This paper focuses on fine-grained, secure access to FAIR data, for which we propose ontology-based data access policies. These policies take into account both the FAIR aspects of the data relevant to access (such as provenance and licence), expressed as metadata, and additional metadata describing users. With this tripartite approach (data, associated metadata expressing FAIR information, and additional metadata about users), secure and controlled access to object data can be obtained. This yields a security dimension to the “A” (accessible) in FAIR, which is clearly needed in domains like security and intelligence. These domains need data to be shared under tight controls, with widely varying individual access rights. In this paper, we propose an approach called Ontology-Based Access Control (OBAC), which utilizes concepts and relations from a data set's domain ontology. We argue that ontology-based access policies contribute to data reusability and can be reconciled with privacy-aware data access policies. We illustrate our OBAC approach through a proof-of-concept and propose that OBAC to be adopted as a best practice for access management of FAIR data.
Collapse
Affiliation(s)
- Christopher Brewster
- Data Science Department at TNO, Kampweg 55, Soesterberg 3769 DE, The Netherlands
- Institute of Data Science, Maastricht University, Maastricht 6229 ER, The Netherlands
| | - Barry Nouwt
- Data Science Department at TNO, Kampweg 55, Soesterberg 3769 DE, The Netherlands
| | - Stephan Raaijmakers
- Data Science Department at TNO, Kampweg 55, Soesterberg 3769 DE, The Netherlands
| | - Jack Verhoosel
- Data Science Department at TNO, Kampweg 55, Soesterberg 3769 DE, The Netherlands
| |
Collapse
|
45
|
Mechanistic integration of exposure and effects: advances to apply systems toxicology in support of regulatory decision-making. CURRENT OPINION IN TOXICOLOGY 2019. [DOI: 10.1016/j.cotox.2019.09.001] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
|
46
|
Wise J, de Barron AG, Splendiani A, Balali-Mood B, Vasant D, Little E, Mellino G, Harrow I, Smith I, Taubert J, van Bochove K, Romacker M, Walgemoed P, Jimenez RC, Winnenburg R, Plasterer T, Gupta V, Hedley V. Implementation and relevance of FAIR data principles in biopharmaceutical R&D. Drug Discov Today 2019; 24:933-938. [PMID: 30690198 DOI: 10.1016/j.drudis.2019.01.008] [Citation(s) in RCA: 62] [Impact Index Per Article: 12.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2018] [Revised: 12/21/2018] [Accepted: 01/20/2019] [Indexed: 10/27/2022]
Abstract
Biopharmaceutical industry R&D, and indeed other life sciences R&D such as biomedical, environmental, agricultural and food production, is becoming increasingly data-driven and can significantly improve its efficiency and effectiveness by implementing the FAIR (findable, accessible, interoperable, reusable) guiding principles for scientific data management and stewardship. By so doing, the plethora of new and powerful analytical tools such as artificial intelligence and machine learning will be able, automatically and at scale, to access the data from which they learn, and on which they thrive. FAIR is a fundamental enabler for digital transformation.
Collapse
|
47
|
Abstract
With the ethical, legal, and societal issues (ELSI) Knowledge Base, we introduce a key element of the Biobanking and Biomolecular Resources Research Infrastructure-European Research Infrastructure Consortium (BBMRI-ERIC) Common Service ELSI, which provides ethical, legal, and societal support for researchers and biobankers involved in transnational research. In contrast to the customized support provided by the ELSI Helpdesk, the ELSI Knowledge Base will be available to the user on a self-serve basis. The information that is made available through a knowledge base comes from multiple sources, usually from several expert contributors who are well versed in the subject matter. The knowledge base provides users with a first orientation on the subject matter, as well as allowing them to explore more detailed information if desired in a self-service manner. It is crucial that the information and knowledge provided are shared in a manner that is user friendly. Long lists of links, legalistic language, and multiple links have to be avoided wherever possible. The long-term sustainability and accuracy of a knowledge base need to be ensured by placing its expert curation and technical maintenance under the responsibility of an organization rather than a research consortium. In its core, it builds on a scenario-based approach using a nonlegalistic language. In addition, the knowledge base connects to frequently asked questions, promotes contract and informed consent templates, how-to-guides, best-practice models, and scripts. The ELSI Knowledge Base is a key element of the BBMRI-ERIC Common Service ELSI, which currently serves biobanks but will be enlarged to serve the biological and medical sciences community. In contrast to the ELSI Helpdesk, which provides customized support, the ELSI Knowledge Base is available to the user on a self-serve basis. The conceptualization of the ELSI Knowledge Base builds on assessments of several ethical, legal, and societal guidance tools that favor a single sustainable knowledge base for closing the knowledge gap by providing practical hands-on guidance for researchers. Ultimately, the ELSI Knowledge Base aims at promoting practical know-how and skills for conducting responsible research.
Collapse
|
48
|
Zatloukal K, Stumptner C, Kungl P, Mueller H. Biobanks in personalized medicine. EXPERT REVIEW OF PRECISION MEDICINE AND DRUG DEVELOPMENT 2018. [DOI: 10.1080/23808993.2018.1493921] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023]
Affiliation(s)
- Kurt Zatloukal
- Diagnostic and Research Center for Molecular BioMedicine, Medical University of Graz, Graz, Austria
| | - Cornelia Stumptner
- Diagnostic and Research Center for Molecular BioMedicine, Medical University of Graz, Graz, Austria
| | - Penelope Kungl
- Diagnostic and Research Center for Molecular BioMedicine, Medical University of Graz, Graz, Austria
| | - Heimo Mueller
- Diagnostic and Research Center for Molecular BioMedicine, Medical University of Graz, Graz, Austria
| |
Collapse
|
49
|
Gajović S. Knowledge-for-data trade at the interface between precision medicine and person-centered care. Croat Med J 2018; 59:132-135. [PMID: 29972736 PMCID: PMC6045896 DOI: 10.3325/cmj.2018.59.132] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022] Open
Affiliation(s)
- Srećko Gajović
- Srećko Gajović, Croatian Institute for Brain Research, University of Zagreb School of Medicine, Zagreb, Croatia,
| |
Collapse
|
50
|
Abstract
Biobanking and BioMolecular resources Research Infrastructure (BBMRI)- European Research Infrastructure Consortium (ERIC) is the largest infrastructure launched in Europe in health research. By nature it is a distributed infrastructure, in which biological samples and data are hosted by the European Member States biobanks. As of today, BBMRI-ERIC consists of 19 European Member States and 1 international organization, the International Agency for Research on Cancer. This means that BBMRI-ERIC has a population of >500 million individuals in Europe. BBMRI-ERIC is a truly Pan-European Research Infrastructure for health research. Given that BBMRI-ERIC is set up to become a key source for users in both academic and scientific institutions as well as in the pharmaceutical and life science industries, it contributes directly to the Innovation Union concept. It is pan-European because BBMRI-ERIC already shows an excellent geographic and regional coverage all over Europe involving countries from South, East, West, North, and Central Europe. BBMRI-ERIC is a service-driven infrastructure for the European Member States, driven by science. The BBMRI-ERIC Directory consists of 100 million samples and a roadmap for better-defined quality in European biobanks for improving reproducibility and reliability of the biological sample and data.
Collapse
Affiliation(s)
- Jan-Eric Litton
- Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
| |
Collapse
|