1
|
Hardy K, Heyse S. FAIR data policies can benefit biotech startups. Nat Biotechnol 2023; 41:1060-1061. [PMID: 37568019 DOI: 10.1038/s41587-023-01892-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/13/2023]
|
2
|
Parciak M, Suhr M, Schmidt C, Bönisch C, Löhnhardt B, Kesztyüs D, Kesztyüs T. FAIRness through automation: development of an automated medical data integration infrastructure for FAIR health data in a maximum care university hospital. BMC Med Inform Decis Mak 2023; 23:94. [PMID: 37189148 DOI: 10.1186/s12911-023-02195-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2022] [Accepted: 05/09/2023] [Indexed: 05/17/2023] Open
Abstract
BACKGROUND Secondary use of routine medical data is key to large-scale clinical and health services research. In a maximum care hospital, the volume of data generated exceeds the limits of big data on a daily basis. This so-called "real world data" are essential to complement knowledge and results from clinical trials. Furthermore, big data may help in establishing precision medicine. However, manual data extraction and annotation workflows to transfer routine data into research data would be complex and inefficient. Generally, best practices for managing research data focus on data output rather than the entire data journey from primary sources to analysis. To eventually make routinely collected data usable and available for research, many hurdles have to be overcome. In this work, we present the implementation of an automated framework for timely processing of clinical care data including free texts and genetic data (non-structured data) and centralized storage as Findable, Accessible, Interoperable, Reusable (FAIR) research data in a maximum care university hospital. METHODS We identify data processing workflows necessary to operate a medical research data service unit in a maximum care hospital. We decompose structurally equal tasks into elementary sub-processes and propose a framework for general data processing. We base our processes on open-source software-components and, where necessary, custom-built generic tools. RESULTS We demonstrate the application of our proposed framework in practice by describing its use in our Medical Data Integration Center (MeDIC). Our microservices-based and fully open-source data processing automation framework incorporates a complete recording of data management and manipulation activities. The prototype implementation also includes a metadata schema for data provenance and a process validation concept. All requirements of a MeDIC are orchestrated within the proposed framework: Data input from many heterogeneous sources, pseudonymization and harmonization, integration in a data warehouse and finally possibilities for extraction or aggregation of data for research purposes according to data protection requirements. CONCLUSION Though the framework is not a panacea for bringing routine-based research data into compliance with FAIR principles, it provides a much-needed possibility to process data in a fully automated, traceable, and reproducible manner.
Collapse
Affiliation(s)
- Marcel Parciak
- Department of Medical Informatics, University Medical Center Göttingen, Von-Siebold-Straße 3, 37075, Göttingen, Germany
- University MS Center, Biomedical Research Institute (BIOMED), Hasselt University, Agoralaan Building C, 3590, Diepenbeek, Belgium
- Data Science Institute (DSI), Hasselt University, Agoralaan Building D, 3590, Diepenbeek, Belgium
| | - Markus Suhr
- Department of Medical Informatics, University Medical Center Göttingen, Von-Siebold-Straße 3, 37075, Göttingen, Germany
- NextLytics AG, Kapellenstrasse 37, 65719, Hofheim Am Taunus, Germany
| | - Christian Schmidt
- Department of Medical Informatics, University Medical Center Göttingen, Von-Siebold-Straße 3, 37075, Göttingen, Germany
| | - Caroline Bönisch
- Department of Medical Informatics, University Medical Center Göttingen, Von-Siebold-Straße 3, 37075, Göttingen, Germany
| | - Benjamin Löhnhardt
- Department of Medical Informatics, University Medical Center Göttingen, Von-Siebold-Straße 3, 37075, Göttingen, Germany
| | - Dorothea Kesztyüs
- Department of Medical Informatics, University Medical Center Göttingen, Von-Siebold-Straße 3, 37075, Göttingen, Germany.
| | - Tibor Kesztyüs
- Department of Medical Informatics, University Medical Center Göttingen, Von-Siebold-Straße 3, 37075, Göttingen, Germany
| |
Collapse
|
3
|
Alharbi E, Skeva R, Juty N, Jay C, Goble C. A FAIR-Decide framework for pharmaceutical R&D: FAIR data cost-benefit assessment. Drug Discov Today 2023; 28:103510. [PMID: 36716952 DOI: 10.1016/j.drudis.2023.103510] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2022] [Revised: 01/24/2023] [Accepted: 01/24/2023] [Indexed: 01/29/2023]
Abstract
The FAIR (findable, accessible, interoperable and reusable) principles are data management and stewardship guidelines aimed at increasing the effective use of scientific research data. Adherence to these principles in managing data assets in pharmaceutical research and development (R&D) offers pharmaceutical companies the potential to maximise the value of such assets, but the endeavour is costly and challenging. We describe the 'FAIR-Decide' framework, which aims to guide decision-making on the retrospective FAIRification of existing datasets by using business analysis techniques to estimate costs and expected benefits. This framework supports decision-making on FAIRification in the pharmaceutical R&D industry and can be integrated into a company's data management strategy.
Collapse
Affiliation(s)
- Ebtisam Alharbi
- College of Computer and Information Systems, Umm Al-Qura University, Mecca, Saudi Arabia.
| | - Rigina Skeva
- Department of Computer Science, University of Manchester, Manchester, UK
| | - Nick Juty
- Department of Computer Science, University of Manchester, Manchester, UK
| | - Caroline Jay
- Department of Computer Science, University of Manchester, Manchester, UK.
| | - Carole Goble
- Department of Computer Science, University of Manchester, Manchester, UK.
| |
Collapse
|
4
|
Cianflone A, Savoia F, Parasole R, Mirabelli P. Pediatric biobanks to enhance clinical and translational research for children. Eur J Pediatr 2023; 182:1459-1468. [PMID: 36692622 PMCID: PMC9871420 DOI: 10.1007/s00431-023-04818-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/18/2022] [Revised: 11/18/2022] [Accepted: 11/26/2022] [Indexed: 01/25/2023]
Abstract
Including children in biomedical research is an argument for continual reflection and practice refinement from an ethical and legal standpoint. Indeed, as children reach adulthood, a reconsent method should be used, and data connected with samples should ideally be updated based on the children's growth and long-term results. Furthermore, because most pediatric disorders are uncommon, children's research initiatives should conform to standard operating procedures (SOPs) set by worldwide scientific organizations for successfully sharing data and samples. Here, we examine how pediatric biobanks can help address some challenges to improve biomedical research for children. Indeed, modern biobanks are evolving as complex research platforms with specialized employees, dedicated spaces, information technologies services (ITS), and ethical and legal expertise. In the case of research for children, biobanks can collaborate with scientific networks (i.e., BBMRI-ERIC) and provide the collection, storage, and distribution of biosamples in agreement with international standard procedures (ISO-20387). Close collaboration among biobanks provides shared avenues for maximizing scarce biological samples, which is required to promote the translation of scientific breakthroughs for developing clinical care and health policies tailored to the pediatric population. Moreover, biobanks, through their science communication and dissemination activities (i.e., European Biobank Week), may be helpful for children to understand what it means to be engaged in a research study, allowing them to see it as a pleasant, useful, and empowering experience. Additionally, biobanks can notify each participant about which projects have been accomplished (i.e., through their websites, social media networks, etc.); they can facilitate future reconsent procedures and update sample-associated data based on the children's growth. Finally, because of the increasing interest from public and commercial organizations in research efforts that include the sharing and reuse of health data, pediatric biobanks have a crucial role in this context. Consequently, they could benefit from funding opportunities for sustaining research activities even regarding rare pediatric disorders. Conclusion: Pediatric biobanks are helpful for providing biological material for research purposes, addressing ethical and legal issues (i.e. data protection, consent, etc.), and providing control samples from healthy children of various ages and from different geographical regions and ethnicities. Therefore, it is vital to encourage and maintain children's engagement in medical research programs and biobanking activities, especially as children become adults, and reconsent procedures must be applied. What is Known: • Biobanks are critical research infrastructures for medical research, especially in the era of "omic" science. However, in light of their fragility and rights children's participation in biobanking and medical research programs is a complex argument of continuous debate in scientific literature. What is New: • We propose a review of the literature on pediatric biobanks with a particular focus on oncological biobanks. The main current limitations and challenges for pediatric biobanks are presented and possible solutions are discussed.
Collapse
Affiliation(s)
- Alessandra Cianflone
- grid.415247.10000 0004 1756 8081Clinical and Translational Research Unit, Santobono-Pausilipon Children’s Hospital, 80129 Naples, Italy
| | - Fabio Savoia
- grid.415247.10000 0004 1756 8081Childhood Cancer Registry of Campania, Santobono-Pausilipon Children’s Hospital, 80129 Naples, Italy
| | - Rosanna Parasole
- grid.415247.10000 0004 1756 8081Clinical and Translational Research Unit, Santobono-Pausilipon Children’s Hospital, 80129 Naples, Italy
| | - Peppino Mirabelli
- Clinical and Translational Research Unit, Santobono-Pausilipon Children's Hospital, 80129, Naples, Italy.
| |
Collapse
|
5
|
Rehnert M, Takors R. FAIR research data management as community approach in bioengineering. Eng Life Sci 2023; 23:e2200005. [PMID: 36619883 PMCID: PMC9815074 DOI: 10.1002/elsc.202200005] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2022] [Accepted: 02/03/2022] [Indexed: 01/11/2023] Open
Abstract
Research data management (RDM) requires standards, policies, and guidelines. Findable, accessible, interoperable, and reusable (FAIR) data management is critical for sustainable research. Therefore, collaborative approaches for managing FAIR-structured data are becoming increasingly important for long-term, sustainable RDM. However, they are rather hesitantly applied in bioengineering. One of the reasons may be found in the interdisciplinary character of the research field. In addition, bioengineering as application of principles of biology and tools of process engineering, often have to meet different criteria. In consequence, RDM is complicated by the fact that researchers from different scientific institutions must meet the criteria of their home institution, which can lead to additional conflicts. Therefore, centrally provided general repositories implementing a collaborative approach that enables data storage from the outset In a biotechnology research network with over 20 tandem projects, it was demonstrated how FAIR-RDM can be implemented through a collaborative approach and the use of a data structure. In addition, the importance of a structure within a repository was demonstrated to keep biotechnology research data available throughout the entire data lifecycle. Furthermore, the biotechnology research network highlighted the importance of a structure within a repository to keep research data available throughout the entire data lifecycle.
Collapse
Affiliation(s)
- Martina Rehnert
- Institute of Biochemical EngineeringUniversity of StuttgartStuttgartGermany
| | - Ralf Takors
- Institute of Biochemical EngineeringUniversity of StuttgartStuttgartGermany
| |
Collapse
|
6
|
Richter J, Lange F, Scheper T, Solle D, Beutel S. Digitale Zwillinge in der Bioprozesstechnik – Chancen und Möglichkeiten. CHEM-ING-TECH 2022. [DOI: 10.1002/cite.202200166] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Affiliation(s)
- Jannik Richter
- Leibniz Universität Hannover Institut für Technische Chemie Callinstraße 5 30167 Hannover Deutschland
| | - Ferdinand Lange
- Leibniz Universität Hannover Institut für Technische Chemie Callinstraße 5 30167 Hannover Deutschland
| | - Thomas Scheper
- Leibniz Universität Hannover Institut für Technische Chemie Callinstraße 5 30167 Hannover Deutschland
| | - Dörte Solle
- Leibniz Universität Hannover Institut für Technische Chemie Callinstraße 5 30167 Hannover Deutschland
| | - Sascha Beutel
- Leibniz Universität Hannover Institut für Technische Chemie Callinstraße 5 30167 Hannover Deutschland
| |
Collapse
|
7
|
Maximizing data value for biopharma through FAIR and quality implementation: FAIR plus Q. Drug Discov Today 2022; 27:1441-1447. [DOI: 10.1016/j.drudis.2022.01.006] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2021] [Revised: 01/10/2022] [Accepted: 01/17/2022] [Indexed: 12/15/2022]
|
8
|
Alharbi E, Skeva R, Juty N, Jay C, Goble C. Exploring the Current Practices, Costs and Benefits of FAIR
Implementation in Pharmaceutical Research and Development: A Qualitative
Interview Study. DATA INTELLIGENCE 2021. [DOI: 10.1162/dint_a_00109] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023] Open
Abstract
The findable, accessible, interoperable, reusable (FAIR) principles for scientific data management and stewardship aim to facilitate data reuse at scale by both humans and machines. Research and development (R&D) in the pharmaceutical industry is becoming increasingly data driven, but managing its data assets according to FAIR principles remains costly and challenging. To date, little scientific evidence exists about how FAIR is currently implemented in practice, what its associated costs and benefits are, and how decisions are made about the retrospective FAIRification of data sets in pharmaceutical R&D. This paper reports the results of semi-structured interviews with 14 pharmaceutical professionals who participate in various stages of drug R&D in seven pharmaceutical businesses. Inductive thematic analysis identified three primary themes of the benefits and costs of FAIRification, and the elements that influence the decision-making process for FAIRifying legacy data sets. Participants collectively acknowledged the potential contribution of FAIRification to data reusability in diverse research domains and the subsequent potential for cost-savings. Implementation costs, however, were still considered a barrier by participants, with the need for considerable expenditure in terms of resources, and cultural change. How decisions were made about FAIRification was influenced by legal and ethical considerations, management commitment, and data prioritisation. The findings have significant implications for those in the pharmaceutical R&D industry who are engaged in driving FAIR implementation, and for external parties who seek to better understand existing practices and challenges.
Collapse
Affiliation(s)
- Ebtisam Alharbi
- School of Computer Science, University of Manchester, Manchester, Manchester M13 9PL, UK
- College of Computer and Information Systems, Umm Al-Qura University, Mecca, Makkah 21421, Saudi Arabia
| | - Rigina Skeva
- School of Computer Science, University of Manchester, Manchester, Manchester M13 9PL, UK
| | - Nick Juty
- School of Computer Science, University of Manchester, Manchester, Manchester M13 9PL, UK
| | - Caroline Jay
- School of Computer Science, University of Manchester, Manchester, Manchester M13 9PL, UK
| | - Carole Goble
- School of Computer Science, University of Manchester, Manchester, Manchester M13 9PL, UK
| |
Collapse
|
9
|
Furxhi I, Koivisto AJ, Murphy F, Trabucco S, Del Secco B, Arvanitis A. Data Shepherding in Nanotechnology. The Exposure Field Campaign Template. NANOMATERIALS (BASEL, SWITZERLAND) 2021; 11:1818. [PMID: 34361203 PMCID: PMC8308211 DOI: 10.3390/nano11071818] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/15/2021] [Revised: 06/30/2021] [Accepted: 07/09/2021] [Indexed: 12/29/2022]
Abstract
In this paper, we demonstrate the realization process of a pragmatic approach on developing a template for capturing field monitoring data in nanomanufacturing processes. The template serves the fundamental principles which make data scientifically Findable, Accessible, Interoperable and Reusable (FAIR principles), as well as encouraging individuals to reuse it. In our case, the data shepherds' (the guider of data) template creation workflow consists of the following steps: (1) Identify relevant stakeholders, (2) Distribute questionnaires to capture a general description of the data to be generated, (3) Understand the needs and requirements of each stakeholder, (4) Interactive simple communication with the stakeholders for variables/descriptors selection, and (5) Design of the template and annotation of descriptors. We provide an annotated template for capturing exposure field campaign monitoring data, and increase their interoperability, while comparing it with existing templates. This paper enables the data creators of exposure field campaign data to store data in a FAIR way and helps the scientific community, such as data shepherds, by avoiding extensive steps for template creation and by utilizing the pragmatic structure and/or the template proposed herein, in the case of a nanotechnology project (Anticipating Safety Issues at the Design of Nano Product Development, ASINA).
Collapse
Affiliation(s)
- Irini Furxhi
- Transgero Limited, Cullinagh, Newcastle West, V42V384 Limerick, Ireland;
- Department of Accounting and Finance, Kemmy Business School, University of Limerick, V94T9PX Limerick, Ireland
| | - Antti Joonas Koivisto
- Air Pollution Management, Willemoesgade 16, st tv, DK-2100 Copenhagen, Denmark;
- ARCHE Consulting, Liefkensstraat 35D, B-9032 Wondelgem, Belgium
- Institute for Atmospheric and Earth System Research (INAR), University of Helsinki, PL 64, FI-00014 Helsinki, Finland
| | - Finbarr Murphy
- Transgero Limited, Cullinagh, Newcastle West, V42V384 Limerick, Ireland;
- Department of Accounting and Finance, Kemmy Business School, University of Limerick, V94T9PX Limerick, Ireland
| | - Sara Trabucco
- Institute of Atmospheric Sciences and Climate (CNR-ISAC) Via Gobetti 101, 40129 Bologna, Italy; (S.T.); (B.D.S.)
| | - Benedetta Del Secco
- Institute of Atmospheric Sciences and Climate (CNR-ISAC) Via Gobetti 101, 40129 Bologna, Italy; (S.T.); (B.D.S.)
| | - Athanasios Arvanitis
- Environmental Informatics Research Group, Department of Mechanical Engineering, Aristotle University of Thessaloniki, 54124 Thessaloniki, Greece;
| |
Collapse
|
10
|
Data Shepherding in Nanotechnology. The Initiation. NANOMATERIALS 2021; 11:nano11061520. [PMID: 34201308 PMCID: PMC8230087 DOI: 10.3390/nano11061520] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/29/2021] [Revised: 05/25/2021] [Accepted: 06/07/2021] [Indexed: 01/26/2023]
Abstract
In this paper we describe the pragmatic approach of initiating, designing and implementing the Data Management Plan (DMP) and the data FAIRification process in the multidisciplinary Horizon 2020 nanotechnology project, Anticipating Safety Issues at the Design Stage of NAno Product Development (ASINA). We briefly describe the general DMP requirements, emphasizing that the initial steps in the direction towards data FAIRification must be conceptualized and visualized in a systematic way. We demonstrate the use of a generic questionnaire to capture primary data and metadata description from our consortium (data creators/experimentalists and data analysts/modelers). We then display the interactive process with external FAIR data initiatives (data curators/quality assessors), regarding guidance for data and metadata capturing and future integration into repositories. After the preliminary data capturing and FAIRification template is formed, the inner-communication process begins between the partners, which leads to developing case-specific templates. This paper assists future data creators, data analysts, stewards and shepherds engaged in the multi-faceted data shepherding process, in any project, by providing a roadmap, demonstrated in the case of ASINA.
Collapse
|
11
|
Bloemers M, Montesanti A. The FAIR Funding Model: Providing a Framework for Research Funders to Drive the Transition toward FAIR Data Management and Stewardship Practices. DATA INTELLIGENCE 2020. [DOI: 10.1162/dint_a_00039] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022] Open
Abstract
A growing number of research funding organizations (RFOs) are taking responsibility to increase the scientific and social impact of research output. Also reusable research data are recognized as relevant output for gaining impact. RFOs are therefore promoting FAIR research data management and stewardship (RDM) in their research funding cycle. However, the implementation of FAIR RDM still faces important obstacles and challenges. To solve these, stakeholders work together to develop innovative tools and practices. Here we elaborate on the role of RFOs in developing a FAIR funding model to support the FAIR RDM in the funding cycle, integrated with research community specific guidance, criteria and metadata, and enabling automatic assessments of progress and output from RDM. The model facilitates to create research data with a high level of FAIRness that are meaningful for a research community. To fully benefit from the model, RFOs, research institutions and service providers need to implement machine actionability in their FAIR RDM tools and procedures. As many stakeholders still need to get familiar with “human actionable” FAIR data practices, the introduction of the model will be stepwise, with an active role of the RFOs in driving FAIR RDM processes as effectively as possible.
Collapse
Affiliation(s)
- Margreet Bloemers
- The Netherlands Organization for Health Research and Development (ZonMw), 2509 AE, The Hague, The Netherlands
| | | |
Collapse
|
12
|
Mons B, Schultes E, Liu F, Jacobsen A. The FAIR Principles: First Generation Implementation Choices and Challenges. DATA INTELLIGENCE 2020. [DOI: 10.1162/dint_e_00023] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022] Open
Affiliation(s)
- Barend Mons
- Leiden University Medical Center, Leiden, 2333 ZA, The Netherlands
- GO FAIR International Support & Coordination Office (GFISCO), Leiden, The Netherlands
| | - Erik Schultes
- GO FAIR International Support & Coordination Office (GFISCO), Leiden, The Netherlands
| | - Fenghong Liu
- National Science Library, Chinese Academy of Sciences, Beijing 100093, China
- School of Economics and Management, University of Chinese Academy of Sciences, Beijing 100084, China
- School of Information Management, Nanjing University, Nanjing 210023, China
| | - Annika Jacobsen
- Leiden University Medical Center, Leiden, 2333 ZA, The Netherlands
| |
Collapse
|