1
|
Inau ET, Sack J, Waltemath D, Zeleke AA. Initiatives, Concepts, and Implementation Practices of the Findable, Accessible, Interoperable, and Reusable Data Principles in Health Data Stewardship: Scoping Review. J Med Internet Res 2023; 25:e45013. [PMID: 37639292 PMCID: PMC10495848 DOI: 10.2196/45013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2022] [Revised: 03/25/2023] [Accepted: 04/14/2023] [Indexed: 08/29/2023] Open
Abstract
BACKGROUND Thorough data stewardship is a key enabler of comprehensive health research. Processes such as data collection, storage, access, sharing, and analytics require researchers to follow elaborate data management strategies properly and consistently. Studies have shown that findable, accessible, interoperable, and reusable (FAIR) data leads to improved data sharing in different scientific domains. OBJECTIVE This scoping review identifies and discusses concepts, approaches, implementation experiences, and lessons learned in FAIR initiatives in health research data. METHODS The Arksey and O'Malley stage-based methodological framework for scoping reviews was applied. PubMed, Web of Science, and Google Scholar were searched to access relevant publications. Articles written in English, published between 2014 and 2020, and addressing FAIR concepts or practices in the health domain were included. The 3 data sources were deduplicated using a reference management software. In total, 2 independent authors reviewed the eligibility of each article based on defined inclusion and exclusion criteria. A charting tool was used to extract information from the full-text papers. The results were reported using the PRISMA-ScR (Preferred Reporting Items for Systematic Reviews and Meta-Analyses extension for Scoping Reviews) guidelines. RESULTS A total of 2.18% (34/1561) of the screened articles were included in the final review. The authors reported FAIRification approaches, which include interpolation, inclusion of comprehensive data dictionaries, repository design, semantic interoperability, ontologies, data quality, linked data, and requirement gathering for FAIRification tools. Challenges and mitigation strategies associated with FAIRification, such as high setup costs, data politics, technical and administrative issues, privacy concerns, and difficulties encountered in sharing health data despite its sensitive nature were also reported. We found various workflows, tools, and infrastructures designed by different groups worldwide to facilitate the FAIRification of health research data. We also uncovered a wide range of problems and questions that researchers are trying to address by using the different workflows, tools, and infrastructures. Although the concept of FAIR data stewardship in the health research domain is relatively new, almost all continents have been reached by at least one network trying to achieve health data FAIRness. Documented outcomes of FAIRification efforts include peer-reviewed publications, improved data sharing, facilitated data reuse, return on investment, and new treatments. Successful FAIRification of data has informed the management and prognosis of various diseases such as cancer, cardiovascular diseases, and neurological diseases. Efforts to FAIRify data on a wider variety of diseases have been ongoing since the COVID-19 pandemic. CONCLUSIONS This work summarises projects, tools, and workflows for the FAIRification of health research data. The comprehensive review shows that implementing the FAIR concept in health data stewardship carries the promise of improved research data management and transparency in the era of big data and open research publishing. INTERNATIONAL REGISTERED REPORT IDENTIFIER (IRRID) RR2-10.2196/22505.
Collapse
Affiliation(s)
- Esther Thea Inau
- Department of Medical Informatics, Institute for Community Medicine, University Medicine Greifswald, Greifswald, Germany
| | - Jean Sack
- International Health Department, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, United States
| | - Dagmar Waltemath
- Department of Medical Informatics, Institute for Community Medicine, University Medicine Greifswald, Greifswald, Germany
| | - Atinkut Alamirrew Zeleke
- Department of Medical Informatics, Institute for Community Medicine, University Medicine Greifswald, Greifswald, Germany
| |
Collapse
|
2
|
van Reisen M, Oladipo F, Stokmans M, Mpezamihgo M, Folorunso S, Schultes E, Basajja M, Aktau A, Amare SY, Taye GT, Purnama Jati PH, Chindoza K, Wirtz M, Ghardallou M, van Stam G, Ayele W, Nalugala R, Abdullahi I, Osigwe O, Graybeal J, Medhanyie AA, Kawu AA, Liu F, Wolstencroft K, Flikkenschild E, Lin Y, Stocker J, Musen MA. Design of a FAIR digital data health infrastructure in Africa for COVID-19 reporting and research. ADVANCED GENETICS (HOBOKEN, N.J.) 2021; 2:e10050. [PMID: 34514430 PMCID: PMC8420285 DOI: 10.1002/ggn2.10050] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/30/2021] [Revised: 05/20/2021] [Accepted: 05/21/2021] [Indexed: 12/13/2022]
Abstract
The limited volume of COVID-19 data from Africa raises concerns for global genome research, which requires a diversity of genotypes for accurate disease prediction, including on the provenance of the new SARS-CoV-2 mutations. The Virus Outbreak Data Network (VODAN)-Africa studied the possibility of increasing the production of clinical data, finding concerns about data ownership, and the limited use of health data for quality treatment at point of care. To address this, VODAN Africa developed an architecture to record clinical health data and research data collected on the incidence of COVID-19, producing these as human- and machine-readable data objects in a distributed architecture of locally governed, linked, human- and machine-readable data. This architecture supports analytics at the point of care and-through data visiting, across facilities-for generic analytics. An algorithm was run across FAIR Data Points to visit the distributed data and produce aggregate findings. The FAIR data architecture is deployed in Uganda, Ethiopia, Liberia, Nigeria, Kenya, Somalia, Tanzania, Zimbabwe, and Tunisia.
Collapse
Affiliation(s)
- Mirjam van Reisen
- Leiden UniversityLeidenNetherlands
- Leiden University Medical Centre (LUMC)Leiden UniversityLeidenNetherlands
- Leiden Institute of Advanced Computer Science (LIACS)Leiden UniversityLeidenNetherlands
- Faculty of Humanities and Digital SciencesTilburg UniversityTilburgNetherlands
| | | | - Mia Stokmans
- Faculty of Humanities and Digital SciencesTilburg UniversityTilburgNetherlands
| | | | - Sakinat Folorunso
- Department of Computer ScienceOlabisi Onabanjo UniversityAgo IwoyeNigeria
| | | | - Mariam Basajja
- Leiden UniversityLeidenNetherlands
- Leiden Institute of Advanced Computer Science (LIACS)Leiden UniversityLeidenNetherlands
| | - Aliya Aktau
- Faculty of Humanities and Digital SciencesTilburg UniversityTilburgNetherlands
| | | | - Getu Tadele Taye
- Faculty of Humanities and Digital SciencesTilburg UniversityTilburgNetherlands
- Department of Health informatics, School of Public HealthMekelle UniversityMek'eleEthiopia
| | - Putu Hadi Purnama Jati
- Faculty of Humanities and Digital SciencesTilburg UniversityTilburgNetherlands
- Badan Pusat StatistikCentral JakartaIndonesia
| | - Kudakwashe Chindoza
- Faculty of Humanities and Digital SciencesTilburg UniversityTilburgNetherlands
- Department of Computer ScienceGreat Zimbabwe UniversityMasvingoZimbabwe
| | - Morgane Wirtz
- Faculty of Humanities and Digital SciencesTilburg UniversityTilburgNetherlands
| | | | | | - Wondimu Ayele
- Department of Biostatistics and Epidemiology, School of Public health College of Health SciencesAddis Ababa UniversityAddis AbabaEthiopia
| | | | | | | | - John Graybeal
- Stanford Center for Biomedical Informatics ResearchStanford UniversityStanfordCaliforniaUSA
| | - Araya Abrha Medhanyie
- Department of Reproductive health, School of Public HealthMekelle UniversityMek'eleEthiopia
| | | | | | | | - Erik Flikkenschild
- Leiden University Medical Centre (LUMC)Leiden UniversityLeidenNetherlands
| | - Yi Lin
- Leiden UniversityLeidenNetherlands
| | - Joëlle Stocker
- Department of GeosciencesUtrecht UniversityUtrechtNetherlands
| | - Mark A. Musen
- Stanford Center for Biomedical Informatics ResearchStanford UniversityStanfordCaliforniaUSA
| |
Collapse
|
3
|
Jacobsen A, de Miranda Azevedo R, Juty N, Batista D, Coles S, Cornet R, Courtot M, Crosas M, Dumontier M, Evelo CT, Goble C, Guizzardi G, Hansen KK, Hasnain A, Hettne K, Heringa J, Hooft RW, Imming M, Jeffery KG, Kaliyaperumal R, Kersloot MG, Kirkpatrick CR, Kuhn T, Labastida I, Magagna B, McQuilton P, Meyers N, Montesanti A, van Reisen M, Rocca-Serra P, Pergl R, Sansone SA, da Silva Santos LOB, Schneider J, Strawn G, Thompson M, Waagmeester A, Weigel T, Wilkinson MD, Willighagen EL, Wittenburg P, Roos M, Mons B, Schultes E. FAIR Principles: Interpretations and Implementation Considerations. DATA INTELLIGENCE 2020. [DOI: 10.1162/dint_r_00024] [Citation(s) in RCA: 77] [Impact Index Per Article: 15.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022] Open
Abstract
The FAIR principles have been widely cited, endorsed and adopted by a broad range of stakeholders since their publication in 2016. By intention, the 15 FAIR guiding principles do not dictate specific technological implementations, but provide guidance for improving Findability, Accessibility, Interoperability and Reusability of digital resources. This has likely contributed to the broad adoption of the FAIR principles, because individual stakeholder communities can implement their own FAIR solutions. However, it has also resulted in inconsistent interpretations that carry the risk of leading to incompatible implementations. Thus, while the FAIR principles are formulated on a high level and may be interpreted and implemented in different ways, for true interoperability we need to support convergence in implementation choices that are widely accessible and (re)-usable. We introduce the concept of FAIR implementation considerations to assist accelerated global participation and convergence towards accessible, robust, widespread and consistent FAIR implementations. Any self-identified stakeholder community may either choose to reuse solutions from existing implementations, or when they spot a gap, accept the challenge to create the needed solution, which, ideally, can be used again by other communities in the future. Here, we provide interpretations and implementation considerations (choices and challenges) for each FAIR principle.
Collapse
Affiliation(s)
- Annika Jacobsen
- Leiden University Medical Center, Leiden, 2333 ZA, The Netherlands
| | - Ricardo de Miranda Azevedo
- Institute of Data Science, Maastricht University, Universiteitssingel 60, Maastricht 6229 ER, The Netherlands
| | - Nick Juty
- Department of Computer Science, The University of Manchester, Oxford Road, Manchester M13 9PL, UK
| | - Dominique Batista
- Oxford e-Research Centre, Department of Engineering Sciences, University of Oxford, Oxford OX13PJ, UK
| | - Simon Coles
- School of Chemistry, Faculty of Engineering and Physical Sciences, University of Southampton, SO17 1BJ, UK
| | - Ronald Cornet
- Amsterdam UMC, University of Amsterdam, Amsterdam 1000 GG, The Netherlands
| | - Mélanie Courtot
- European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge, CB10 1SD, UK
| | - Mercè Crosas
- Harvard University, Cambridge, Massachusetts 02138, USA
| | - Michel Dumontier
- Institute of Data Science, Maastricht University, Universiteitssingel 60, Maastricht 6229 ER, The Netherlands
| | - Chris T. Evelo
- Department of Bioinformatics – BiGCaT, NUTRIM, Maastricht University, Maastricht 6229 ER, The Netherlands
| | - Carole Goble
- Department of Computer Science, The University of Manchester, Oxford Road, Manchester M13 9PL, UK
| | - Giancarlo Guizzardi
- Conceptual and Cognitive Modeling Research Group (CORE), Free University of Bozen-Bolzano, Bolzano 39100, Italy
| | | | - Ali Hasnain
- Insight Centre for Data Analytics, National University of Ireland Galway, H91 TK33, Ireland
| | - Kristina Hettne
- Centre for Digital Scholarship, Leiden University Libraries, Leiden, 2333 ZA, The Netherlands
| | - Jaap Heringa
- Department of Computer Science, Vrije Universiteit Amsterdam, De Boelelaan 11051081 HV Amsterdam, The Netherlands
| | - Rob W.W. Hooft
- Department of Computer Science, Vrije Universiteit Amsterdam, De Boelelaan 11051081 HV Amsterdam, The Netherlands
- Dutch Techcentre for Life Sciences (DTL), Utrecht, The Netherlands
| | | | | | | | - Martijn G. Kersloot
- Amsterdam UMC, University of Amsterdam, Amsterdam 1000 GG, The Netherlands
- Castor EDC, Paasheuvelweg 25, Wing 5D, 1105 BP, Amsterdam, The Netherlands
| | - Christine R. Kirkpatrick
- San Diego Supercomputer Center, University of California San Diego, La Jolla, California 92093, USA
| | - Tobias Kuhn
- Department of Computer Science, Vrije Universiteit Amsterdam, De Boelelaan 11051081 HV Amsterdam, The Netherlands
| | - Ignasi Labastida
- Learning and Research Resources Centre (CRAI), Universitat de Barcelona, 08007 Barcelona, Spain
| | | | - Peter McQuilton
- Oxford e-Research Centre, Department of Engineering Sciences, University of Oxford, Oxford OX13PJ, UK
| | | | | | - Mirjam van Reisen
- Liacs Institute of Advanced Computer Science, Leiden University, 2311 GJ Leiden, The Netherlands
| | - Philippe Rocca-Serra
- Oxford e-Research Centre, Department of Engineering Sciences, University of Oxford, Oxford OX13PJ, UK
| | - Robert Pergl
- Czech Technical University in Prague, Faculty of Information Technology (FIT CTU), 160 00 Prague 6, Czech Republic
| | - Susanna-Assunta Sansone
- Oxford e-Research Centre, Department of Engineering Sciences, University of Oxford, Oxford OX13PJ, UK
| | | | - Juliane Schneider
- Harvard Catalyst
- Clinical and Translational Science Center, Boston, MA 02115, USA
| | - George Strawn
- US National Academy of Sciences, Washington DC 20418, USA
| | - Mark Thompson
- Leiden University Medical Center, Leiden, 2333 ZA, The Netherlands
| | | | - Tobias Weigel
- Deutsches Klimarechenzentrum, Bundesstrasse 45a, 20146 Hamburg, Germany
| | - Mark D. Wilkinson
- Center for Plant Biotechnology and Genomics UPM-INIA, Madrid 28040, Spain
| | - Egon L. Willighagen
- Department of Bioinformatics – BiGCaT, NUTRIM, Maastricht University, Maastricht 6229 ER, The Netherlands
| | - Peter Wittenburg
- Max Planck Computing and Data Facility, Gießenbachstraße 2, 85748 Garching, Germany
| | - Marco Roos
- Leiden University Medical Center, Leiden, 2333 ZA, The Netherlands
| | - Barend Mons
- Leiden University Medical Center, Leiden, 2333 ZA, The Netherlands
- GO FAIR International Support & Coordination Office (GFISCO), Leiden, The Netherlands
| | - Erik Schultes
- GO FAIR International Support & Coordination Office (GFISCO), Leiden, The Netherlands
- Leiden Center for Data Science, 2311 EZ Leiden, The Netherlands
| |
Collapse
|
4
|
van Reisen M, Stokmans M, Basajja M, Ong'ayo AO, Kirkpatrick C, Mons B. Towards the Tipping Point for FAIR Implementation. DATA INTELLIGENCE 2020. [DOI: 10.1162/dint_a_00049] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022] Open
Abstract
This article explores the global implementation of the FAIR Guiding Principles for scientific management and data stewardship, which provide that data should be findable, accessible, interoperable and reusable. The implementation of these principles is designed to lead to the stewardship of data as FAIR digital objects and the establishment of the Internet of FAIR Data and Services (IFDS). If implementation reaches a tipping point, IFDS has the potential to revolutionize how data is managed by making machine and human readable data discoverable for reuse. Accordingly, this article examines the expansion of the implementation of FAIR Guiding Principles, especially how and in which geographies (locations) and areas (topic domains) implementation is taking place. A literature review of academic articles published between 2016 and 2019 on the use of FAIR Guiding Principles is presented. The investigation also includes an analysis of the domains in the IFDS Implementation Networks (INs). Its uptake has been mainly in the Western hemisphere. The investigation found that implementation of FAIR Guiding Principles has taken firm hold in the domain of bio and natural sciences. To achieve a tipping point for FAIR implementation, it is now time to ensure the inclusion of non-European ascendants and of other scientific domains. Apart from equal opportunity and genuine global partnership issues, a permanent European bias poses challenges with regard to the representativeness and validity of data and could limit the potential of IFDS to reach across continental boundaries. The article concludes that, despite efforts to be inclusive, acceptance of the FAIR Guiding Principles and IFDS in different scientific communities is limited and there is a need to act now to prevent dampening of the momentum in the development and implementation of the IFDS. It is further concluded that policy entrepreneurs and the GO FAIR INs may contribute to making the FAIR Guiding Principles more flexible in including different research epistemologies, especially through its GO CHANGE pillar.
Collapse
Affiliation(s)
- Mirjam van Reisen
- Liacs Institute of Advanced Computer Science, Leiden University, Leiden, The Netherlands
| | - Mia Stokmans
- Tilburg School of Humanities and Digital Sciences, Tilburg University, 90153 5000 LE Tilburg, The Netherlands
| | - Mariam Basajja
- Liacs Institute of Advanced Computer Science, Leiden University, Leiden, The Netherlands
| | - Antony Otieno Ong'ayo
- International Institute of Social Studies, Erasmus University, 29776 2502 LT The Hague, The Netherlands
| | - Christine Kirkpatrick
- San Diego Supercomputer Center, University of California San Diego, San Diego CA 92093, USA
| | - Barend Mons
- Leiden University Medical Center, Leiden, 2333 ZA, The Netherlands
| |
Collapse
|