1
|
Polesie S, Alsterholm M. A systematic review investigating the proportion of clinical images shared in prospective randomized controlled trials involving patients with atopic dermatitis and systemic pharmacotherapy. J DERMATOL TREAT 2024; 35:2338280. [PMID: 38569598 DOI: 10.1080/09546634.2024.2338280] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2024] [Accepted: 03/27/2024] [Indexed: 04/05/2024]
Abstract
For individuals with atopic dermatitis (AD), interpreting scientific papers that present clinical outcomes including the Eczema Area and Severity Index (EASI) and Investigators Global Assessment may be difficult. When compared to tabulated data and graphs, images from before and after treatment are often far more meaningful to these patients that ultimately will be candidates for the treatment. This systematic review focused on determining the frequency of clinical image sharing in AD research. Conducted in accordance with PRISMA guidelines, the review concentrated on randomized controlled trials that investigated predefined and available systemic treatments for AD. The search was performed in the MEDLINE database for studies published from the inception until 21 December 2023. The review included 60 studies, encompassing 17,799 randomized patients. Across these studies, 16 images representing 6 patients were shared in the manuscripts, leading to a sharing rate of 0.3‰. The almost missing inclusion of patient images in clinical trial publications hinders patient understanding. Adding images to scientific manuscripts could significantly improve patients' comprehension of potential treatment outcomes. This review highlights the need for authors, the pharmaceutical industry, study sponsors, and publishers to enhance and promote patient information through increased use of visual data.
Collapse
|
2
|
Rabin BA, Smith JD, Dressler EV, Cohen DJ, Lee RM, Goodman MS, D'Angelo H, Norton WE, Oh AY. Designing for data sharing: Considerations for advancing health equity in data management and dissemination. Transl Behav Med 2024:ibae049. [PMID: 39331485 DOI: 10.1093/tbm/ibae049] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/29/2024] Open
Abstract
Data sharing, the act of making scientific research data available to others, can accelerate innovation and discoveries, and ultimately enhance public health. The National Cancer Institute Implementation Science Centers in Cancer Control convened a diverse group of research scientists, practitioners, and community partners in three interactive workshops (May-June 2022) to identify and discuss factors that must be considered when designing research for equitable data sharing with a specific emphasis on implementation science and social, behavioral, and population health research. This group identified and operationalized a set of seven key considerations for equitable data sharing-conceptualized as an inclusive process that fairly includes the perspectives and priorities of all partners involved in and impacted by data sharing, with consideration of ethics, history, and benefits-that were integrated into a framework. Key data-sharing components particularly important for health equity included: elevating data sharing into a core research activity, incorporating diverse perspectives, and meaningfully engaging partners in data-sharing decisions throughout the project lifecycle. As the process of data sharing grows in research, it is critical to continue considering the potential positive and adverse impact of data sharing on diverse beneficiaries of health data and research.
Collapse
|
3
|
Ranatunge R, Karunapema P, Siribaddana P, Ranwala L, Fernando P. From Fragmentation to Open Access: Building an Open Data Portal for Health Data Dissemination. Stud Health Technol Inform 2024; 318:192-193. [PMID: 39320212 DOI: 10.3233/shti240923] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/26/2024]
Abstract
Open data is defined as data that can be used and redistributed by anyone with minimal or no restrictions. A design science research methodology was applied to the development of an open data portal for theMinistry of Health Sri Lanka (MoH) to share national datasets. Following requirement gathering and literature review, the open data portal was developed using open-source software and implemented at the MoH Sri Lanka. Fifty datasets obtained from the MoH were categorised and published in the open data portal. However, several barriers cast doubt on the long-term feasibility of the open data portal project.
Collapse
|
4
|
Wallace ML, Redline S, Oryshkewych N, Hoepel SJW, Luik AI, Stone KL, Kolko RP, Chung J, Leng Y, Robbins R, Zhang Y, Barnes LL, Lim AS, Yu L, Buysse DJ. Pioneering a multi-phase framework to harmonize self-reported sleep data across cohorts. Sleep 2024; 47:zsae115. [PMID: 38752786 PMCID: PMC11381567 DOI: 10.1093/sleep/zsae115] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2023] [Revised: 03/29/2024] [Indexed: 06/15/2024] Open
Abstract
STUDY OBJECTIVES Harmonizing and aggregating data across studies enables pooled analyses that support external validation and enhance replicability and generalizability. However, the multidimensional nature of sleep poses challenges for data harmonization and aggregation. Here we describe and implement our process for harmonizing self-reported sleep data. METHODS We established a multi-phase framework to harmonize self-reported sleep data: (1) compile items, (2) group items into domains, (3) harmonize items, and (4) evaluate harmonizability. We applied this process to produce a pooled multi-cohort sample of five US cohorts plus a separate yet fully harmonized sample from Rotterdam, Netherlands. Sleep and sociodemographic data are described and compared to demonstrate the utility of harmonization and aggregation. RESULTS We collected 190 unique self-reported sleep items and grouped them into 15 conceptual domains. Using these domains as guiderails, we developed 14 harmonized items measuring aspects of satisfaction, alertness/sleepiness, timing, efficiency, duration, insomnia, and sleep apnea. External raters determined that 13 of these 14 items had moderate-to-high harmonizability. Alertness/Sleepiness items had lower harmonizability, while continuous, quantitative items (e.g. timing, total sleep time, and efficiency) had higher harmonizability. Descriptive statistics identified features that are more consistent (e.g. wake-up time and duration) and more heterogeneous (e.g. time in bed and bedtime) across samples. CONCLUSIONS Our process can guide researchers and cohort stewards toward effective sleep harmonization and provide a foundation for further methodological development in this expanding field. Broader national and international initiatives promoting common data elements across cohorts are needed to enhance future harmonization and aggregation efforts.
Collapse
|
5
|
Alkhatib R, Gaede KI. Data Management in Biobanking: Strategies, Challenges, and Future Directions. BIOTECH 2024; 13:34. [PMID: 39311336 PMCID: PMC11417763 DOI: 10.3390/biotech13030034] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2024] [Revised: 08/23/2024] [Accepted: 08/31/2024] [Indexed: 09/26/2024] Open
Abstract
Biobanking plays a pivotal role in biomedical research by providing standardized processing, precise storing, and management of biological sample collections along with the associated data. Effective data management is a prerequisite to ensure the integrity, quality, and accessibility of these resources. This review provides a current landscape of data management in biobanking, discussing key challenges, existing strategies, and potential future directions. We explore multiple aspects of data management, including data collection, storage, curation, sharing, and ethical considerations. By examining the evolving technologies and methodologies in biobanking, we aim to provide insights into addressing the complexities and maximizing the utility of biobank data for research and clinical applications.
Collapse
|
6
|
Holden NJ. Data sharing considerations to maximize the use of pathogen biological and genomics resources data for public health. J Appl Microbiol 2024; 135:lxae204. [PMID: 39113269 DOI: 10.1093/jambio/lxae204] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2024] [Revised: 07/17/2024] [Accepted: 08/06/2024] [Indexed: 09/05/2024]
Abstract
Public sector data associated with health are a highly valuable resource with multiple potential end-users, from health practitioners, researchers, public bodies, policy makers, and industry. Data for infectious disease agents are used for epidemiological investigations, disease tracking and assessing emerging biological threats. Yet, there are challenges in collating and re-using it. Data may be derived from multiple sources, generated and collected for different purposes. While public sector data should be open access, providers from public health settings or from agriculture, food, or environment sources have sensitivity criteria to meet with ethical restrictions in how the data can be reused. Yet, sharable datasets need to describe the pathogens with sufficient contextual metadata for maximal utility, e.g. associated disease or disease potential and the pathogen source. As data comprise the physical resources of pathogen collections and potentially associated sequences, there is an added emerging technical issue of integration of omics 'big data'. Thus, there is a need to identify suitable means to integrate and safely access diverse data for pathogens. Established genomics alliances and platforms interpret and meet the challenges in different ways depending on their own context. Nonetheless, their templates and frameworks provide a solution for adaption to pathogen datasets.
Collapse
|
7
|
Meghani SH, Mooney-Doyle K, Barnato A, Colborn K, Gillette R, Harrison KL, Hinds PS, Kirilova D, Knafl K, Schulman-Green D, Pollak KI, Ritchie CS, Kutner JS, Karcher S. Lessons Learned Establishing the Palliative Care Research Cooperative's Qualitative Data Repository. J Pain Symptom Manage 2024; 68:308-318. [PMID: 38825257 PMCID: PMC11323161 DOI: 10.1016/j.jpainsymman.2024.05.027] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/10/2024] [Revised: 05/19/2024] [Accepted: 05/22/2024] [Indexed: 06/04/2024]
Abstract
Data sharing is increasingly an expectation in health research as part of a general move toward more open sciences. In the United States, in particular, the implementation of the 2023 National Institutes of Health Data Management and Sharing Policy has made it clear that qualitative studies are not exempt from this data sharing requirement. Recognizing this trend, the Palliative Care Research Cooperative Group (PCRC) realized the value of creating a de-identified qualitative data repository to complement its existing de-identified quantitative data repository. The PCRC Data Informatics and Statistics Core leadership partnered with the Qualitative Data Repository (QDR) to establish the first serious illness and palliative care qualitative data repository in the U.S. We describe the processes used to develop this repository, called the PCRC-QDR, as well as our outreach and education among the palliative care researcher community, which led to the first ten projects to share the data in the new repository. Specifically, we discuss how we co-designed the PCRC-QDR and created tailored guidelines for depositing and sharing qualitative data depending on the original research context, establishing uniform expectations for key components of relevant documentation, and the use of suitable access controls for sensitive data. We also describe how PCRC was able to leverage its existing community to recruit and guide early depositors and outline lessons learned in evaluating the experience. This work advances the establishment of best practices in qualitative data sharing.
Collapse
|
8
|
Reis JA, Almeida JR, Almeida TM, Oliveira JL. A Chatbot-Like Platform to Enhance the Discovery of OMOP CDM Databases. Stud Health Technol Inform 2024; 316:1689-1693. [PMID: 39176535 DOI: 10.3233/shti240748] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/24/2024]
Abstract
Multicentre studies become possible with the current strategies to solve the interoperability problems between databases. With the great adoption of those strategies, new problems regarding data discovery were raised. Some were solved using database catalogues and graphical dashboards for data analysis and comparison. However, when these communities grow, these strategies become obsolete. In this work, we addressed those challenges by proposing a platform with a chatbot-like mechanism to help medical researchers identify databases of interest. The tool was developed using the metadata extracted from OMOP CDM databases.
Collapse
|
9
|
Scheuermann K, Hufeland P, Haarbrandt B, Hanss S, Joseph M, Kohler S, Krefting D, Richter J, Tute E, Wolf KH, Marschollek M. An Open Information Model-Based Repository for Sustainable Re-Use of Heterogeneous Pandemics Research Data. Stud Health Technol Inform 2024; 316:1921-1925. [PMID: 39176867 DOI: 10.3233/shti240808] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/24/2024]
Abstract
The COVID-19 Research Network Lower Saxony (COFONI) is a German state network of experts in Coronavirus research and development of strategies for future pandemics. One of the pillars of the COFONI technology platform is its established research data repository (Available at https://forschungsdb.cofoni.de/), which enables provision of pseudonymised data and cross-location data retrieval for heterogeneous datasets. The platform consistently uses open standards (openEHR) and open source components (EHRbase) for its data repository, taking into account the FAIR criteria. Available data include both clinical and socio-demographic patient information. A comprehensive AQL query builder interface and an integrated research request process enable new research approaches, rapid cohort assembly and customized data export for researchers from participating institutions. Our flexible and scalable platform approach can be regarded as a blueprint. It contributes, to pandemic preparedness by providing easily accessible cross-location research data in a fully standardised and open representation.
Collapse
|
10
|
Mouney F, Pierre-Jean M, Delamarre D, Bouzille G, Cuggia M, Cabon S. Synthesizing Clinical Data Warehouse Content for Enhanced External Collaboration: A Preliminary Proposal. Stud Health Technol Inform 2024; 316:221-225. [PMID: 39176713 DOI: 10.3233/shti240384] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/24/2024]
Abstract
This paper introduces a novel approach aimed at enhancing the accessibility of clinical data warehouses (CDWs) for external users, particularly researchers and biomedical companies interested in developing and testing their solutions. The primary focus is on proposing a clinical data catalogue designed to elucidate the contents of CDWs, facilitating biomedical project launch and completion. The catalogue is designed to address three fundamental inquiries that external users may have regarding CDWs: "What data is available, how much data is present, and how was it generated?" Additionally, the paper showcases a prototype of the catalogue through a visualization example, utilizing data from the CDW of Rennes University Hospital.
Collapse
|
11
|
Tanaka SC, Kasai K, Okamoto Y, Koike S, Hayashi T, Yamashita A, Yamashita O, Johnstone T, Pestilli F, Doya K, Okada G, Shinzato H, Itai E, Takahara Y, Takamiya A, Nakamura M, Itahashi T, Aoki R, Koizumi Y, Shimizu M, Miyata J, Son S, Aki M, Okada N, Morita S, Sawamoto N, Abe M, Oi Y, Sajima K, Kamagata K, Hirose M, Aoshima Y, Hamatani S, Nohara N, Funaba M, Noda T, Inoue K, Hirano J, Mimura M, Takahashi H, Hattori N, Sekiguchi A, Kawato M, Hanakawa T. The status of MRI databases across the world focused on psychiatric and neurological disorders. Psychiatry Clin Neurosci 2024. [PMID: 39162256 DOI: 10.1111/pcn.13717] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/22/2024] [Revised: 05/13/2024] [Accepted: 07/02/2024] [Indexed: 08/21/2024]
Abstract
Neuroimaging databases for neuro-psychiatric disorders enable researchers to implement data-driven research approaches by providing access to rich data that can be used to study disease, build and validate machine learning models, and even redefine disease spectra. The importance of sharing large, multi-center, multi-disorder databases has gradually been recognized in order to truly translate brain imaging knowledge into real-world clinical practice. Here, we review MRI databases that share data globally to serve multiple psychiatric or neurological disorders. We found 42 datasets consisting of 23,293 samples from patients with psychiatry and neurological disorders and healthy controls; 1245 samples from mood disorders (major depressive disorder and bipolar disorder), 2015 samples from developmental disorders (autism spectrum disorder, attention-deficit hyperactivity disorder), 675 samples from schizophrenia, 1194 samples from Parkinson's disease, 5865 samples from dementia (including Alzheimer's disease), We recognize that large, multi-center databases should include governance processes that allow data to be shared across national boundaries. Addressing technical and regulatory issues of existing databases can lead to better design and implementation and improve data access for the research community. The current trend toward the development of shareable MRI databases will contribute to a better understanding of the pathophysiology, diagnosis and assessment, and development of early interventions for neuropsychiatric disorders.
Collapse
|
12
|
Toga AW, Neu S, Sheehan ST, Crawford K. The informatics of ADNI. Alzheimers Dement 2024. [PMID: 39140398 DOI: 10.1002/alz.14099] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2024] [Revised: 05/31/2024] [Accepted: 06/03/2024] [Indexed: 08/15/2024]
Abstract
The Alzheimer's Disease Neuroimaging Initiative (ADNI) has revolutionized the landscape of Alzheimer's research through its Informatics Core, which has facilitated unprecedented data standardization and sharing. Over 20 years, ADNI established a robust informatics framework, enabling the validation of biomarkers and supporting global research efforts. The Informatics Core, centered at the Laboratory of Neuro Imaging (LONI), provides a comprehensive data hub that ensures data quality, accessibility, and security, fostering over 5600 publications and significant scientific advancements. By embracing open data sharing principles, ADNI set a gold standard in data transparency, allowing over 26,000 investigators from 169 countries to access and download a wealth of multimodal data. This collaborative approach not only accelerated biomarker discovery and drug development and advanced our understanding of Alzheimer's disease but also has served as a model for other research initiatives, demonstrating the transformative potential of carefully designed informatics models and shared data in driving global scientific progress. HIGHLIGHTS: Accelerating biomarker discovery and drug development for Alzheimer's disease. Alzheimer's Disease Neuroimaging Initiative's (ADNI's) open data sharing drives scientific progress. Data exploration and coupled analytics to data archives.
Collapse
|
13
|
Cho H, Froelicher D, Dokmai N, Nandi A, Sadhuka S, Hong MM, Berger B. Privacy-Enhancing Technologies in Biomedical Data Science. Annu Rev Biomed Data Sci 2024; 7:317-343. [PMID: 39178425 PMCID: PMC11346580 DOI: 10.1146/annurev-biodatasci-120423-120107] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/25/2024]
Abstract
The rapidly growing scale and variety of biomedical data repositories raise important privacy concerns. Conventional frameworks for collecting and sharing human subject data offer limited privacy protection, often necessitating the creation of data silos. Privacy-enhancing technologies (PETs) promise to safeguard these data and broaden their usage by providing means to share and analyze sensitive data while protecting privacy. Here, we review prominent PETs and illustrate their role in advancing biomedicine. We describe key use cases of PETs and their latest technical advances and highlight recent applications of PETs in a range of biomedical domains. We conclude by discussing outstanding challenges and social considerations that need to be addressed to facilitate a broader adoption of PETs in biomedical data science.
Collapse
|
14
|
Hamilton DG, Everitt S, Page MJ, Fidler F. What do Australians affected by cancer think about oncology researchers sharing research data? A cross-sectional survey. Asia Pac J Clin Oncol 2024; 20:522-530. [PMID: 38708950 DOI: 10.1111/ajco.14075] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2023] [Revised: 04/15/2024] [Accepted: 04/23/2024] [Indexed: 05/07/2024]
Abstract
AIM Previous research has shown patients and the public in Australia generally support medical researchers in making de-identified research data available to other scientists. However, this research has focussed on certain types of data and recipients. We surveyed Australians affected by cancer to characterize their attitudes toward the sharing of research data with multiple third parties, including the public. METHODS A short, anonymous online survey of Australians with a previous diagnosis of cancer was advertised between October 27, 2022, and February 27, 2023. Quantitative responses were analyzed with descriptive statistics. Free-text responses were coded deductively and summarised using content analysis. RESULTS In total, 551 respondents contributed data to the survey. There was strong support for cancer researchers sharing non-human and de-identified human research data with clinicians (90% and 95%, respectively) and non-profit researchers (both 94%). However, fewer participants supported sharing data with for-profit researchers (both 64%) or publicly (both 61%). When asked if they would hypothetically consent to researchers at their treatment location using and sharing their de-identified data publicly, only half agreed. In contrast, after being shown a visual representation of the de-identified survey data, 80% of respondents supported sharing it publicly. CONCLUSION Australians affected by cancer support the sharing of research data, particularly with clinicians and non-profit researchers. Our results also imply that visualization of the data to be shared may enhance support for making it publicly available. These results should help alleviate any concerns about research participants' attitudes toward data sharing, as well as boost researchers' motivation for sharing.
Collapse
|
15
|
Rozhyna A, Somfai GM, Atzori M, DeBuc DC, Saad A, Zoellin J, Müller H. Exploring Publicly Accessible Optical Coherence Tomography Datasets: A Comprehensive Overview. Diagnostics (Basel) 2024; 14:1668. [PMID: 39125544 PMCID: PMC11312046 DOI: 10.3390/diagnostics14151668] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2024] [Revised: 07/15/2024] [Accepted: 07/25/2024] [Indexed: 08/12/2024] Open
Abstract
Artificial intelligence has transformed medical diagnostic capabilities, particularly through medical image analysis. AI algorithms perform well in detecting abnormalities with a strong performance, enabling computer-aided diagnosis by analyzing the extensive amounts of patient data. The data serve as a foundation upon which algorithms learn and make predictions. Thus, the importance of data cannot be underestimated, and clinically corresponding datasets are required. Many researchers face a lack of medical data due to limited access, privacy concerns, or the absence of available annotations. One of the most widely used diagnostic tools in ophthalmology is Optical Coherence Tomography (OCT). Addressing the data availability issue is crucial for enhancing AI applications in the field of OCT diagnostics. This review aims to provide a comprehensive analysis of all publicly accessible retinal OCT datasets. Our main objective is to compile a list of OCT datasets and their properties, which can serve as an accessible reference, facilitating data curation for medical image analysis tasks. For this review, we searched through the Zenodo repository, Mendeley Data repository, MEDLINE database, and Google Dataset search engine. We systematically evaluated all the identified datasets and found 23 open-access datasets containing OCT images, which significantly vary in terms of size, scope, and ground-truth labels. Our findings indicate the need for improvement in data-sharing practices and standardized documentation. Enhancing the availability and quality of OCT datasets will support the development of AI algorithms and ultimately improve diagnostic capabilities in ophthalmology. By providing a comprehensive list of accessible OCT datasets, this review aims to facilitate better utilization and development of AI in medical image analysis.
Collapse
|
16
|
Yi W, Wang C, Kuzmin S, Gerasimov I, Cheng X. Weighted Attribute-Based Proxy Re-Encryption Scheme with Distributed Multi-Authority Attributes. SENSORS (BASEL, SWITZERLAND) 2024; 24:4939. [PMID: 39123985 PMCID: PMC11314711 DOI: 10.3390/s24154939] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/28/2024] [Revised: 07/27/2024] [Accepted: 07/29/2024] [Indexed: 08/12/2024]
Abstract
Existing attribute-based proxy re-encryption schemes suffer from issues like complex access policies, large ciphertext storage space consumption, and an excessive authority of the authorization center, leading to weak security and controllability of data sharing in cloud storage. This study proposes a Weighted Attribute Authority Multi-Authority Proxy Re-Encryption (WAMA-PRE) scheme that introduces attribute weights to elevate the expression of access policies from binary to multi-valued, simplifying policies and reducing ciphertext storage space. Simultaneously, the multiple attribute authorities and the authorization center construct a joint key, reducing reliance on a single authorization center. The proposed distributed attribute authority network enhances the anti-attack capability of cloud storage. Experimental results show that introducing attribute weights can reduce ciphertext storage space by 50%, proxy re-encryption saves 63% time compared to repeated encryption, and the joint key construction time is only 1% of the benchmark scheme. Security analysis proves that WAMA-PRE achieves CPA security under the decisional q-parallel BDHE assumption in the random oracle model. This study provides an effective solution for secure data sharing in cloud storage.
Collapse
|
17
|
Riley M, Kilkenny MF, Robinson K, Leggat SG. Researchers' perceptions of the trustworthiness, for reuse purposes, of government health data in Victoria, Australia: Implications for policy and practice. HEALTH INF MANAG J 2024:18333583241256049. [PMID: 39045683 DOI: 10.1177/18333583241256049] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/25/2024]
Abstract
In 2022 the Australian Data Availability and Transparency Act (DATA) commenced, enabling accredited "data users" to access data from "accredited data service providers." However, the DATA Scheme lacks guidance on "trustworthiness" of the data to be utilised for reuse purposes. Objectives: To determine: (i) Do researchers using government health datasets trust the data? (ii) What factors influence their perceptions of data trustworthiness? and (iii) What are the implications for government and data custodians? Method: Authors of published studies (2008-2020) that utilised Victorian government health datasets were surveyed via a case study approach. Twenty-eight trust constructs (identified via literature review) were grouped into data factors, management properties and provider factors. Results: Fifty experienced health researchers responded. Most (88%) believed that Victorian government health data were trustworthy. When grouped, data factors and management properties were more important than data provider factors in building trust. The most important individual trust constructs were: "compliant with ethical regulation" (100%) and "monitoring privacy and confidentiality" (98%). Constructs of least importance were knowledge of "participant consent" (56%) and "major focus of the data provider was research" (50%). Conclusion: Overall, the researchers trusted government health data, but data factors and data management properties were more important than data provider factors in building trust. Implications: Government should ensure the DATA Scheme incorporates mechanisms to validate those data utilised by accredited data users and data providers have sufficient quality (intrinsic and extrinsic) to meet the requirements of "trustworthiness," and that evidentiary documentation is provided to support these "accredited data."
Collapse
|
18
|
Middleweek B, Klinger L. 'I just LOVE data': perceptions and practices of data sharing and privacy among users of the Lioness. CULTURE, HEALTH & SEXUALITY 2024:1-19. [PMID: 38970796 DOI: 10.1080/13691058.2024.2369596] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/11/2023] [Accepted: 06/14/2024] [Indexed: 07/08/2024]
Abstract
High profile data breaches and the proliferation of self-tracking technologies generating bio-feedback data have raised concerns about data privacy and data sharing practices among users of these devices. However, our understanding of how self-trackers in sexual health populations, where the data may be sensitive, personal, and stigmatising, perceive data privacy and sharing is limited. This study combined industry consultation with a survey of users of the world's first biofeedback smart vibrator, the Lioness, that enables users to monitor and analyse their sexual response intensity and orgasm duration over time. We found users of the Lioness are motivated to self-track by both individual and altruistic goals: to learn more about their bodies, and to contribute to research that leads to better sexual health outcomes. Perceptions of data privacy and data sharing were shaped by an eagerness to collaborate with sexual health researchers to challenge traditional male-centric perspectives in biomedical research on women's sexual health, where gender plays a crucial role in defining healthcare systems and outcomes. This study extends our understanding of the non-digital aspects of self-tracking by emphasising the role of gender and inclusive healthcare advocacy in shaping perceptions of data privacy and sharing within sexual health populations.
Collapse
|
19
|
Jabour AM. Putting patients at the center of health information exchange design: An exploration of patient preferences for data sharing. Health Informatics J 2024; 30:14604582241277029. [PMID: 39142341 DOI: 10.1177/14604582241277029] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/16/2024]
Abstract
BACKGROUND Despite the many benefits of Health Information Exchange (HIE), Studies reported patients concerns about the privacy and security of sharing their health information. To address these concerns, it is important to understand their needs, preferences, and priorities in the design and implementing HIE systems. OBJECTIVE The aim of this study is to investigate patients' preferences for HIE consent option and examine the extent to which they are comfortable sharing the different parts of their medical records. METHOD A self-administered survey was conducted. The survey was administrated online and the total number of respondents was 660 participants. RESULTS The most popular option selected by participants for sharing HIE information was to share information with their permission once when they register (33.3%) followed by the option to share their information temporarily on demand during their clinical visit (23.8%). The types of information which participants were willing to share the most were general data such as age, weight, height, and gender, followed closely by data needed for medical emergency. In contrast, the information which participants were less likely to share were data related to financial status or income, followed by data related to sexual disease, and mental illnesses.
Collapse
|
20
|
Tecuatl C, Ljungquist B, Ascoli GA. Accelerating the continuous community sharing of digital neuromorphology data. FASEB Bioadv 2024; 6:207-221. [PMID: 38974113 PMCID: PMC11226999 DOI: 10.1096/fba.2024-00048] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2024] [Revised: 05/28/2024] [Accepted: 06/03/2024] [Indexed: 07/09/2024] Open
Abstract
The tree-like morphology of neurons and glia is a key cellular determinant of circuit connectivity and metabolic function in the nervous system of essentially all animals. To elucidate the contribution of specific cell types to both physiological and pathological brain states, it is important to access detailed neuroanatomy data for quantitative analysis and computational modeling. NeuroMorpho.Org is the largest online collection of freely available digital neural reconstructions and related metadata and is continuously updated with new uploads. Earlier in the project, we released multiple datasets together yearly, but this process caused an average delay of several months in making the data public. Moreover, in the past 5 years, >80% of invited authors agreed to share their data with the community via NeuroMorpho.Org, up from <20% in the first 5 years of the project. In the same period, the average number of reconstructions per publication increased 600%, creating the need for automatic processing to release more reconstructions in less time. The progressive automation of our pipeline enabled the transition to agile releases of individual datasets as soon as they are ready. The overall time from data identification to public sharing decreased by 63.7%; 78% of the datasets are now released in less than 3 months with an average workflow duration below 40 days. Furthermore, the mean processing time per reconstruction dropped from 3 h to 2 min. With these continuous improvements, NeuroMorpho.Org strives to forge a positive culture of open data. Most importantly, the new, original research enabled through reuse of datasets across the world has a multiplicative effect on science discovery, benefiting both authors and users.
Collapse
|
21
|
DeFranco JF, Roberts J, Ferraiolo D, Compton DC. An infrastructure for secure data sharing: a clinical data implementation. JAMIA Open 2024; 7:ooae040. [PMID: 38751412 PMCID: PMC11095973 DOI: 10.1093/jamiaopen/ooae040] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2023] [Revised: 02/02/2024] [Accepted: 04/23/2024] [Indexed: 05/18/2024] Open
Abstract
Objective To address database interoperability challenges to improve collaboration among disparate organizations. Materials and Methods We developed a lightweight system to allow broad but well-controlled data sharing while preserving local data protection policies. We used 2 NIST-developed technologies-Next-generation Database Access Control (NDAC) and the Data Block Matrix (DBM)-to create a proof-of-concept system called the Secure Federated Data Sharing System (SFDS). NDAC controls access to database resources down to the field level based on attributes assigned to users. The DBM manages and shares authoritative user-attribute assignments across a federation of organizations, implemented using a modified open-source permissioned blockchain, to manage and share authoritative user-attribute assignments across a federation of organizations. We used synthetic data to demonstrate a clinical research data-sharing use case using the SFDS. Results We demonstrated, through consent, the onboarding of previously unknown users into NDAC via assignments to their DBM-validated attributes, allowing those users policy-preserving access to local database resources. The SFDS main system components-NDAC and DBM-also showed excellent performance metrics. Discussion The SFDS provides a generic data-sharing infrastructure that effectively and securely achieves data-sharing objectives. It is completely transparent to the otherwise normal business operations of participating organizations. It requires no changes to database management systems or existing methods of authenticating and authorizing local user access to local resources. Conclusion This efficiency, flexibility of deployment, and granularity of control make this new infrastructure solution practical for meeting the data-sharing and protection objectives of the clinical research community.
Collapse
|
22
|
Trunnell M, Frankenberger C, Hota B, Hughes T, Martinov P, Ravichandran U, Shah NS, Grossman RL. The Pandemic Response Commons. JAMIA Open 2024; 7:ooae025. [PMID: 38617994 PMCID: PMC11009464 DOI: 10.1093/jamiaopen/ooae025] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2022] [Revised: 10/25/2023] [Accepted: 04/05/2024] [Indexed: 04/16/2024] Open
Abstract
Objectives A data commons is a software platform for managing, curating, analyzing, and sharing data with a community. The Pandemic Response Commons (PRC) is a data commons designed to provide a data platform for researchers studying an epidemic or pandemic. Methods The PRC was developed using the open source Gen3 data platform and is based upon consortium, data, and platform agreements developed by the not-for-profit Open Commons Consortium. A formal consortium of Chicagoland area organizations was formed to develop and operate the PRC. Results The consortium developed a general PRC and an instance of it for the Chicagoland region called the Chicagoland COVID-19 Commons. A Gen3 data platform was set up and operated with policies, procedures, and controls for a NIST SP 800-53 revision 4 Moderate system. A consensus data model for the commons was developed, and a variety of datasets were curated, harmonized and ingested, including statistical summary data about COVID cases, patient level clinical data, and SARS-CoV-2 viral variant data. Discussion and conclusions Given the various legal and data agreements required to operate a data commons, a PRC is designed to be in place and operating at a low level prior to the occurrence of an epidemic, with the activities increasing as required during an epidemic. A regional instance of a PRC can also be part of a broader data ecosystem or data mesh consisting of multiple regional commons supporting pandemic response through sharing regional data.
Collapse
|
23
|
Koivisto E, Mäntylä E. Are Open Science instructions targeted to ecologists and evolutionary biologists sufficient? A literature review of guidelines and journal data policies. Ecol Evol 2024; 14:e11698. [PMID: 38994214 PMCID: PMC11237169 DOI: 10.1002/ece3.11698] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/29/2024] [Revised: 06/02/2024] [Accepted: 06/24/2024] [Indexed: 07/13/2024] Open
Abstract
Open science (OS) awareness and skills are increasingly becoming an essential part of everyday scientific work as e.g., many journals require authors to share data. However, following an OS workflow can seem challenging at first. Thus, instructions by journals and other guidelines are important. But how comprehensive are they in the field of ecology and evolutionary biology (Ecol Evol)? To find this out, we reviewed 20 published OS guideline articles aimed for ecologists or evolutionary biologists, together with the data policies of 17 Ecol Evol journals to chart the current landscape of OS guidelines in the field, find potential gaps, identify field-specific barriers for OS and discuss solutions to overcome these challenges. We found that many of the guideline articles covered similar topics, despite being written for a narrow field or specific target audience. Likewise, many of the guideline articles mentioned similar obstacles that could hinder or postpone a transition to open data sharing. Thus, there could be a need for a more widely known, general OS guideline for Ecol Evol. Following the same guideline could also enhance the uniformity of the OS practices carried on in the field. However, some topics, like long-term experiments and physical samples, were mentioned surprisingly seldom, although they are typical issues in Ecol Evol. Of the journals, 15 out of 17 expected or at least encouraged data sharing either for all articles or under specific conditions, e.g. for registered reports and 10 of those required data sharing at the submission phase. The coverage of journal data policies varied greatly between journals, from practically non-existing to very extensive. As journals can contribute greatly by leading the way and making open data useful, we recommend that the publishers and journals would invest in clear and comprehensive data policies and instructions for authors.
Collapse
|
24
|
Scoggins B, Robertson MP. Measuring transparency in the social sciences: political science and international relations. ROYAL SOCIETY OPEN SCIENCE 2024; 11:240313. [PMID: 39076374 PMCID: PMC11285849 DOI: 10.1098/rsos.240313] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/22/2024] [Revised: 05/11/2024] [Accepted: 05/23/2024] [Indexed: 07/31/2024]
Abstract
The scientific method is predicated on transparency-yet the pace at which transparent research practices are being adopted by the scientific community is slow. The replication crisis in psychology showed that published findings employing statistical inference are threatened by undetected errors, data manipulation and data falsification. To mitigate these problems and bolster research credibility, open data and preregistration practices have gained traction in the natural and social sciences. However, the extent of their adoption in different disciplines is unknown. We introduce computational procedures to identify the transparency of a research field using large-scale text analysis and machine learning classifiers. Using political science and international relations as an illustrative case, we examine 93 931 articles across the top 160 political science and international relations journals between 2010 and 2021. We find that approximately 21% of all statistical inference papers have open data and 5% of all experiments are preregistered. Despite this shortfall, the example of leading journals in the field shows that change is feasible and can be effected quickly.
Collapse
|
25
|
Richter G, Krawczak M. How to Elucidate Consent-Free Research Use of Medical Data: A Case for "Health Data Literacy". JMIR Med Inform 2024; 12:e51350. [PMID: 38889087 PMCID: PMC11196244 DOI: 10.2196/51350] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2023] [Revised: 01/19/2024] [Accepted: 04/21/2024] [Indexed: 06/20/2024] Open
Abstract
Unlabelled The extensive utilization of personal health data is one of the key success factors of modern medical research. Obtaining consent to the use of such data during clinical care, however, bears the risk of low and unequal approval rates and risk of consequent methodological problems in the scientific use of the data. In view of these shortcomings, and of the proven willingness of people to contribute to medical research by sharing personal health data, the paradigm of informed consent needs to be reconsidered. The European General Data Protection Regulation gives the European member states considerable leeway with regard to permitting the research use of health data without consent. Following this approach would however require alternative offers of information that compensate for the lack of direct communication with experts during medical care. We therefore introduce the concept of "health data literacy," defined as the capacity to find, understand, and evaluate information about the risks and benefits of the research use of personal health data and to act accordingly. Specifically, health data literacy includes basic knowledge about the goals and methods of data-rich medical research and about the possibilities and limits of data protection. Although the responsibility for developing the necessary resources lies primarily with those directly involved in data-rich medical research, improving health data literacy should ultimately be of concern to everyone interested in the success of this type of research.
Collapse
|