1
|
Zuiderwijk A, Türk BO, Brazier F. Identifying the most important facilitators of open research data sharing and reuse in Epidemiology: A mixed-methods study. PLoS One 2024; 19:e0297969. [PMID: 38330007 PMCID: PMC10852342 DOI: 10.1371/journal.pone.0297969] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2023] [Accepted: 01/15/2024] [Indexed: 02/10/2024] Open
Abstract
To understand how open research data sharing and reuse can be further improved in the field of Epidemiology, this study explores the facilitating role that infrastructural and institutional arrangements play in this research discipline. It addresses two research questions: 1) What influence do infrastructural and institutional arrangements have on open research data sharing and reuse practices in the field of Epidemiology? And 2) how could infrastructural and institutional instruments used in Epidemiology potentially be useful to other research disciplines? First, based on a systematic literature review, a conceptual framework of infrastructural and institutional instruments for open research data facilitation is developed. Second, the conceptual framework is applied in interviews with Epidemiology researchers. The interviews show that two infrastructural and institutional instruments have a very high influence on open research data sharing and reuse practices in the field of Epidemiology, namely (a) access to a powerful search engine that meets open data search needs and (b) support by data stewards and data managers. Third, infrastructural and institutional instruments with a medium, high, or very high influence were discussed in a research workshop involving data stewards and research data officers from different research fields. This workshop suggests that none of the influential instruments identified in the interviews are specific to Epidemiology. Some of our findings thus seem to apply to multiple other disciplines. This study contributes to Science by identifying field-specific facilitators and challenges for open research data in Epidemiology, while at the same time revealing that none of the identified influential infrastructural and institutional instruments were specific to this field. Practically, this implies that open data infrastructure developers, policymakers, and research funding organizations may apply certain infrastructural and institutional arrangements to multiple research disciplines to facilitate and enhance open research data sharing and reuse.
Collapse
Affiliation(s)
- Anneke Zuiderwijk
- Faculty of Technology, Policy and Management, Delft University of Technology, Delft, the Netherlands
| | - Berkay Onur Türk
- Education and Student Affairs, Eindhoven University of Technology, Eindhoven, the Netherlands
| | - Frances Brazier
- Faculty of Technology, Policy and Management, Delft University of Technology, Delft, the Netherlands
| |
Collapse
|
2
|
Kumar S, Kumar H, Kumar G, Singh SP, Bijalwan A, Diwakar M. A methodical exploration of imaging modalities from dataset to detection through machine learning paradigms in prominent lung disease diagnosis: a review. BMC Med Imaging 2024; 24:30. [PMID: 38302883 PMCID: PMC10832080 DOI: 10.1186/s12880-024-01192-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2023] [Accepted: 01/03/2024] [Indexed: 02/03/2024] Open
Abstract
BACKGROUND Lung diseases, both infectious and non-infectious, are the most prevalent cause of mortality overall in the world. Medical research has identified pneumonia, lung cancer, and Corona Virus Disease 2019 (COVID-19) as prominent lung diseases prioritized over others. Imaging modalities, including X-rays, computer tomography (CT) scans, magnetic resonance imaging (MRIs), positron emission tomography (PET) scans, and others, are primarily employed in medical assessments because they provide computed data that can be utilized as input datasets for computer-assisted diagnostic systems. Imaging datasets are used to develop and evaluate machine learning (ML) methods to analyze and predict prominent lung diseases. OBJECTIVE This review analyzes ML paradigms, imaging modalities' utilization, and recent developments for prominent lung diseases. Furthermore, the research also explores various datasets available publically that are being used for prominent lung diseases. METHODS The well-known databases of academic studies that have been subjected to peer review, namely ScienceDirect, arXiv, IEEE Xplore, MDPI, and many more, were used for the search of relevant articles. Applied keywords and combinations used to search procedures with primary considerations for review, such as pneumonia, lung cancer, COVID-19, various imaging modalities, ML, convolutional neural networks (CNNs), transfer learning, and ensemble learning. RESULTS This research finding indicates that X-ray datasets are preferred for detecting pneumonia, while CT scan datasets are predominantly favored for detecting lung cancer. Furthermore, in COVID-19 detection, X-ray datasets are prioritized over CT scan datasets. The analysis reveals that X-rays and CT scans have surpassed all other imaging techniques. It has been observed that using CNNs yields a high degree of accuracy and practicability in identifying prominent lung diseases. Transfer learning and ensemble learning are complementary techniques to CNNs to facilitate analysis. Furthermore, accuracy is the most favored metric for assessment.
Collapse
Affiliation(s)
- Sunil Kumar
- Department of Computer Engineering, J. C. Bose University of Science and Technology, YMCA, Faridabad, India
- Department of Information Technology, School of Engineering and Technology (UIET), CSJM University, Kanpur, India
| | - Harish Kumar
- Department of Computer Engineering, J. C. Bose University of Science and Technology, YMCA, Faridabad, India
| | - Gyanendra Kumar
- Department of Computer and Communication Engineering, Manipal University Jaipur, Jaipur, India
| | | | - Anchit Bijalwan
- Faculty of Electrical and Computer Engineering, Arba Minch University, Arba Minch, Ethiopia.
| | - Manoj Diwakar
- Department of Computer Science and Engineering, Graphic Era Deemed to Be University, Dehradun, India
| |
Collapse
|
3
|
Hamilton DG, Hong K, Fraser H, Rowhani-Farid A, Fidler F, Page MJ. Prevalence and predictors of data and code sharing in the medical and health sciences: systematic review with meta-analysis of individual participant data. BMJ 2023; 382:e075767. [PMID: 37433624 PMCID: PMC10334349 DOI: 10.1136/bmj-2023-075767] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 06/07/2023] [Indexed: 07/13/2023]
Abstract
OBJECTIVES To synthesise research investigating data and code sharing in medicine and health to establish an accurate representation of the prevalence of sharing, how this frequency has changed over time, and what factors influence availability. DESIGN Systematic review with meta-analysis of individual participant data. DATA SOURCES Ovid Medline, Ovid Embase, and the preprint servers medRxiv, bioRxiv, and MetaArXiv were searched from inception to 1 July 2021. Forward citation searches were also performed on 30 August 2022. REVIEW METHODS Meta-research studies that investigated data or code sharing across a sample of scientific articles presenting original medical and health research were identified. Two authors screened records, assessed the risk of bias, and extracted summary data from study reports when individual participant data could not be retrieved. Key outcomes of interest were the prevalence of statements that declared that data or code were publicly or privately available (declared availability) and the success rates of retrieving these products (actual availability). The associations between data and code availability and several factors (eg, journal policy, type of data, trial design, and human participants) were also examined. A two stage approach to meta-analysis of individual participant data was performed, with proportions and risk ratios pooled with the Hartung-Knapp-Sidik-Jonkman method for random effects meta-analysis. RESULTS The review included 105 meta-research studies examining 2 121 580 articles across 31 specialties. Eligible studies examined a median of 195 primary articles (interquartile range 113-475), with a median publication year of 2015 (interquartile range 2012-2018). Only eight studies (8%) were classified as having a low risk of bias. Meta-analyses showed a prevalence of declared and actual public data availability of 8% (95% confidence interval 5% to 11%) and 2% (1% to 3%), respectively, between 2016 and 2021. For public code sharing, both the prevalence of declared and actual availability were estimated to be <0.5% since 2016. Meta-regressions indicated that only declared public data sharing prevalence estimates have increased over time. Compliance with mandatory data sharing policies ranged from 0% to 100% across journals and varied by type of data. In contrast, success in privately obtaining data and code from authors historically ranged between 0% and 37% and 0% and 23%, respectively. CONCLUSIONS The review found that public code sharing was persistently low across medical research. Declarations of data sharing were also low, increasing over time, but did not always correspond to actual sharing of data. The effectiveness of mandatory data sharing policies varied substantially by journal and type of data, a finding that might be informative for policy makers when designing policies and allocating resources to audit compliance. SYSTEMATIC REVIEW REGISTRATION Open Science Framework doi:10.17605/OSF.IO/7SX8U.
Collapse
Affiliation(s)
- Daniel G Hamilton
- MetaMelb Research Group, School of BioSciences, University of Melbourne, Melbourne, VIC, Australia
- Melbourne Medical School, Faculty of Medicine, Dentistry, and Health Sciences, University of Melbourne, Melbourne, VIC, Australia
| | - Kyungwan Hong
- Department of Practice, Sciences, and Health Outcomes Research, University of Maryland School of Pharmacy, Baltimore, MD, USA
| | - Hannah Fraser
- MetaMelb Research Group, School of BioSciences, University of Melbourne, Melbourne, VIC, Australia
| | - Anisa Rowhani-Farid
- Department of Practice, Sciences, and Health Outcomes Research, University of Maryland School of Pharmacy, Baltimore, MD, USA
| | - Fiona Fidler
- MetaMelb Research Group, School of BioSciences, University of Melbourne, Melbourne, VIC, Australia
- School of Historical and Philosophical Studies, University of Melbourne, Melbourne, VIC, Australia
| | - Matthew J Page
- Methods in Evidence Synthesis Unit, School of Public Health and Preventive Medicine, Monash University, Melbourne, VIC, Australia
| |
Collapse
|
4
|
Stott J, Wright T, Holmes J, Wilson J, Griffiths-Jones S, Foster D, Wright B. A systematic review of non-coding RNA genes with differential expression profiles associated with autism spectrum disorders. PLoS One 2023; 18:e0287131. [PMID: 37319303 PMCID: PMC10270643 DOI: 10.1371/journal.pone.0287131] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2023] [Accepted: 05/30/2023] [Indexed: 06/17/2023] Open
Abstract
AIMS To identify differential expression of shorter non-coding RNA (ncRNA) genes associated with autism spectrum disorders (ASD). BACKGROUND ncRNA are functional molecules that derive from non-translated DNA sequence. The HUGO Gene Nomenclature Committee (HGNC) have approved ncRNA gene classes with alignment to the reference human genome. One subset is microRNA (miRNA), which are highly conserved, short RNA molecules that regulate gene expression by direct post-transcriptional repression of messenger RNA. Several miRNA genes are implicated in the development and regulation of the nervous system. Expression of miRNA genes in ASD cohorts have been examined by multiple research groups. Other shorter classes of ncRNA have been examined less. A comprehensive systematic review examining expression of shorter ncRNA gene classes in ASD is timely to inform the direction of research. METHODS We extracted data from studies examining ncRNA gene expression in ASD compared with non-ASD controls. We included studies on miRNA, piwi-interacting RNA (piRNA), small NF90 (ILF3) associated RNA (snaR), small nuclear RNA (snRNA), small nucleolar RNA (snoRNA), transfer RNA (tRNA), vault RNA (vtRNA) and Y RNA. The following electronic databases were searched: Cochrane Library, EMBASE, PubMed, Web of Science, PsycINFO, ERIC, AMED and CINAHL for papers published from January 2000 to May 2022. Studies were screened by two independent investigators with a third resolving discrepancies. Data was extracted from eligible papers. RESULTS Forty-eight eligible studies were included in our systematic review with the majority examining miRNA gene expression alone. Sixty-four miRNA genes had differential expression in ASD compared to controls as reported in two or more studies, but often in opposing directions. Four miRNA genes had differential expression in the same direction in the same tissue type in at least 3 separate studies. Increased expression was reported in miR-106b-5p, miR-155-5p and miR-146a-5p in blood, post-mortem brain, and across several tissue types, respectively. Decreased expression was reported in miR-328-3p in bloods samples. Seven studies examined differential expression from other classes of ncRNA, including piRNA, snRNA, snoRNA and Y RNA. No individual ncRNA genes were reported in more than one study. Six studies reported differentially expressed snoRNA genes in ASD. A meta-analysis was not possible because of inconsistent methodologies, disparate tissue types examined, and varying forms of data presented. CONCLUSION There is limited but promising evidence associating the expression of certain miRNA genes and ASD, although the studies are of variable methodological quality and the results are largely inconsistent. There is emerging evidence associating differential expression of snoRNA genes in ASD. It is not currently possible to say whether the reports of differential expression in ncRNA may relate to ASD aetiology, a response to shared environmental factors linked to ASD such as sleep and nutrition, other molecular functions, human diversity, or chance findings. To improve our understanding of any potential association, we recommend improved and standardised methodologies and reporting of raw data. Further high-quality research is required to shine a light on possible associations, which may yet yield important information.
Collapse
Affiliation(s)
- Jon Stott
- Child Oriented Mental Health Intervention Collaborative (COMIC), University of York in Collaboration with Leeds and York Partnership NHS Foundation Trust, York, United Kingdom
- Tees, Esk & Wear Valleys NHS Foundation Trust, Foss Park Hospital, York, United Kingdom
| | - Thomas Wright
- Manchester Centre for Genomic Medicine, Clinical Genetics Service, Saint Mary’s Hospital, Manchester University NHS Foundation Trust, Manchester, United Kingdom
- Division of Evolution, Infection and Genomics, School of Biological Sciences, Faculty of Biology, Medicine and Health, University of Manchester, Manchester, United Kingdom
| | - Jannah Holmes
- Child Oriented Mental Health Intervention Collaborative (COMIC), University of York in Collaboration with Leeds and York Partnership NHS Foundation Trust, York, United Kingdom
- Hull York Medical School, University of York, Heslington, York, United Kingdom
| | - Julie Wilson
- Department of Mathematics, University of York, Heslington, York, United Kingdom
| | - Sam Griffiths-Jones
- Division of Evolution, Infection and Genomics, School of Biological Sciences, Faculty of Biology, Medicine and Health, University of Manchester, Manchester, United Kingdom
| | - Deborah Foster
- Tees, Esk & Wear Valleys NHS Foundation Trust, Foss Park Hospital, York, United Kingdom
| | - Barry Wright
- Child Oriented Mental Health Intervention Collaborative (COMIC), University of York in Collaboration with Leeds and York Partnership NHS Foundation Trust, York, United Kingdom
- Hull York Medical School, University of York, Heslington, York, United Kingdom
| |
Collapse
|
5
|
Habibi A, Sofyan S, Mukminin A. Factors affecting digital technology access in vocational education. Sci Rep 2023; 13:5682. [PMID: 37029180 PMCID: PMC10080178 DOI: 10.1038/s41598-023-32755-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2022] [Accepted: 04/01/2023] [Indexed: 04/09/2023] Open
Abstract
If policies are not thoroughly designed, technology integration may fail. As a result, users' perceptions of technology, especially access to digital technology, are critical for technology integration in education. This study aimed to develop and validate a scale to model factors affecting digital technology access for instructional use in Indonesian vocational schools. The study also reports the structural model of the path analysis and tests of differences based on geographical areas. A scale adapted from prior studies was established, validated, and examined for its validity and reliability. A total of 1355 responses were measurable; partial least squares structural equation modeling (PLS-SEM) and t-test procedures were applied for the data analysis. The findings informed that the scale was valid and reliable. For the structural model, the strongest relationship emerged between motivational access and skills access, while the lowest existed between material access and skills access. However, motivational access has an insignificant effect on instructional use. The t-test results show that geographical areas were significantly different regarding all involved variables.
Collapse
Affiliation(s)
- Akhmad Habibi
- Fakultas Keguruan dan Ilmu Pendidikan, Universitas Jambi, Jambi, Indonesia.
| | - Sofyan Sofyan
- Fakultas Keguruan dan Ilmu Pendidikan, Universitas Jambi, Jambi, Indonesia
| | - Amirul Mukminin
- Fakultas Keguruan dan Ilmu Pendidikan, Universitas Jambi, Jambi, Indonesia
| |
Collapse
|
6
|
Zhang L, Ma L. Is open science a double-edged sword?: data sharing and the changing citation pattern of Chinese economics articles. Scientometrics 2023; 128:2803-2818. [PMID: 37101973 PMCID: PMC10028759 DOI: 10.1007/s11192-023-04684-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2022] [Accepted: 03/05/2023] [Indexed: 03/24/2023]
Abstract
Data sharing is an important part of open science (OS), and more and more institutions and journals have been enforcing open data (OD) policies. OD is advocated to help increase academic influences and promote scientific discovery and development, but such a proposition has not been elaborated on well. This study explores the nuanced effects of the OD policies on the citation pattern of articles by using the case of Chinese economics journals. China Industrial Economics (CIE) is the first and only Chinese social science journal so far to adopt a compulsory OD policy, requiring all published articles to share original data and processing codes. We use the article-level data and difference-in-differences (DID) approach to compare the citation performance of articles published in CIE and 36 comparable journals. Firstly, we find that the OD policy quickly increased the number of citations, and each article on average received 0.25, 1.19, 0.86, and 0.44 more citations in the first four years after publication respectively. Furthermore, we also found that the citation benefit of the OD policy rapidly decreased over time, and even became negative in the fifth year after publication. In conclusion, this changing citation pattern suggests that an OD policy can be double edged sword, which can quickly increase citation performance but simultaneously accelerate the aging of articles.
Collapse
Affiliation(s)
- Liwei Zhang
- grid.27255.370000 0004 1761 1174School of Innovation and Entrepreneurship, Shandong University, 72 Binhai Road, Jimo District, Qingdao, Shandong Province 266237 China
| | - Liang Ma
- grid.24539.390000 0004 0368 8103School of Public Administration and Policy, Renmin University of China, 59 Zhongguancun Avenue, Haidian District, Beijing, 100872 China
| |
Collapse
|
7
|
Hou J, Wang Y, Zhang Y, Wang D. How do scholars and non-scholars participate in dataset dissemination on Twitter. J Informetr 2022. [DOI: 10.1016/j.joi.2021.101223] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
8
|
Attenuated total reflection FTIR dataset for identification of type 2 diabetes using saliva. Comput Struct Biotechnol J 2022; 20:4542-4548. [PMID: 36090816 PMCID: PMC9428842 DOI: 10.1016/j.csbj.2022.08.038] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2022] [Revised: 07/25/2022] [Accepted: 08/16/2022] [Indexed: 12/04/2022] Open
Abstract
Diabetes is one of the top 5 non-communicable diseases that occur worldwide according to the World Health Organization. Despite not being a fatal disease, a late diagnosis as well as poor control can cause a fatal outcome, because of that, several studies have been carried out with the aim of proposing additional techniques to the gold standard to assist in the diagnosis and control of this disease in a non-invasive way. Considering the above, and in order to provide a solid starting point for future researches, we share a primary research dataset with 1040 saliva samples obtained by Fourier Transform Infrared Spectroscopy considering the Attenuated Total Reflectance method. Database include: gender, age, individuals (patients) with/without diabetes, the glucose value, and the result to the A1C test for the diabetic population. We believe that sharing dataset as is could increase experimentation, research, and analysis of spectra through different strategies broaden its range of applicability by chemists, doctors, physicists, computer scientists, among others, to identify the effects that the virus causes in the body and to propose possible clinical treatments as well as to develop devices that allow us to assist in the characterization of possible carriers.
Collapse
|
9
|
Yi T, Pan I, Collins S, Chen F, Cueto R, Hsieh B, Hsieh C, Smith JL, Yang L, Liao WH, Merck LH, Bai H, Merck D. DICOM Image ANalysis and Archive (DIANA): an Open-Source System for Clinical AI Applications. J Digit Imaging 2021; 34:1405-1413. [PMID: 34727303 PMCID: PMC8669082 DOI: 10.1007/s10278-021-00488-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2020] [Revised: 02/22/2021] [Accepted: 04/19/2021] [Indexed: 12/26/2022] Open
Abstract
In the era of data-driven medicine, rapid access and accurate interpretation of medical images are becoming increasingly important. The DICOM Image ANalysis and Archive (DIANA) system is an open-source, lightweight, and scalable Python interface that enables users to interact with hospital Picture Archiving and Communications Systems (PACS) to access such data. In this work, DIANA functionality was detailed and evaluated in the context of retrospective PACS data retrieval and two prospective clinical artificial intelligence (AI) pipelines: bone age (BA) estimation and intra-cranial hemorrhage (ICH) detection. DIANA orchestrates activity beginning with post-acquisition study discovery and ending with online notifications of findings. For AI applications, system latency (exam completion to system report time) was quantified and compared to that of clinicians (exam completion to initial report creation time). Mean DIANA latency was 9.04 ± 3.83 and 20.17 ± 10.16 min compared to clinician latency of 51.52 ± 58.9 and 65.62 ± 110.39 min for BA and ICH, respectively, with DIANA latencies being significantly lower (p < 0.001). DIANA's capabilities were also explored and found effective in retrieving and anonymizing protected health information for "big-data" medical imaging research and analysis. Mean per-image retrieval times were 1.12 ± 0.50 and 0.08 ± 0.01 s across x-ray and computed tomography studies, respectively. The data herein demonstrate that DIANA can flexibly integrate into existing hospital infrastructure and improve the process by which researchers/clinicians access imaging repository data. This results in a simplified workflow for large data retrieval and clinical integration of AI models.
Collapse
Affiliation(s)
- Thomas Yi
- Department of Diagnostic Imaging, Rhode Island Hospital, Providence, RI, USA
- Warren Alpert Medical School of Brown University, Providence, RI, USA
| | - Ian Pan
- Department of Diagnostic Imaging, Rhode Island Hospital, Providence, RI, USA
- Warren Alpert Medical School of Brown University, Providence, RI, USA
| | - Scott Collins
- Department of Diagnostic Imaging, Rhode Island Hospital, Providence, RI, USA
| | - Fiona Chen
- Department of Diagnostic Imaging, Rhode Island Hospital, Providence, RI, USA
- Warren Alpert Medical School of Brown University, Providence, RI, USA
| | | | - Ben Hsieh
- Department of Diagnostic Imaging, Rhode Island Hospital, Providence, RI, USA
| | - Celina Hsieh
- Department of Diagnostic Imaging, Rhode Island Hospital, Providence, RI, USA
- Warren Alpert Medical School of Brown University, Providence, RI, USA
| | - Jessica L Smith
- Department of Emergency Medicine, Rhode Island Hospital, Providence, RI, USA
| | - Li Yang
- Department of Neurology, Second Xiangya Hospital, Changsha, China
| | - Wei-Hua Liao
- Department of Radiology, Xiangya Hospital, Changsha, China
| | - Lisa H Merck
- Warren Alpert Medical School of Brown University, Providence, RI, USA
- Department of Emergency Medicine, Rhode Island Hospital, Providence, RI, USA
- University of Florida, Gainesville, FL, USA
| | - Harrison Bai
- Department of Diagnostic Imaging, Rhode Island Hospital, Providence, RI, USA.
- Warren Alpert Medical School of Brown University, Providence, RI, USA.
| | - Derek Merck
- Department of Diagnostic Imaging, Rhode Island Hospital, Providence, RI, USA
- Warren Alpert Medical School of Brown University, Providence, RI, USA
- University of Florida, Gainesville, FL, USA
| |
Collapse
|
10
|
Igumbor JO, Bosire EN, Vicente-Crespo M, Igumbor EU, Olalekan UA, Chirwa TF, Kinyanjui SM, Kyobutungi C, Fonn S. Considerations for an integrated population health databank in Africa: lessons from global best practices. Wellcome Open Res 2021; 6:214. [PMID: 35224211 PMCID: PMC8844538 DOI: 10.12688/wellcomeopenres.17000.1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/12/2021] [Indexed: 12/17/2022] Open
Abstract
Background: The rising digitisation and proliferation of data sources and repositories cannot be ignored. This trend expands opportunities to integrate and share population health data. Such platforms have many benefits, including the potential to efficiently translate information arising from such data to evidence needed to address complex global health challenges. There are pockets of quality data on the continent that may benefit from greater integration. Integration of data sources is however under-explored in Africa. The aim of this article is to identify the requirements and provide practical recommendations for developing a multi-consortia public and population health data-sharing framework for Africa. Methods: We conducted a narrative review of global best practices and policies on data sharing and its optimisation. We searched eight databases for publications and undertook an iterative snowballing search of articles cited in the identified publications. The Leximancer software © enabled content analysis and selection of a sample of the most relevant articles for detailed review. Themes were developed through immersion in the extracts of selected articles using inductive thematic analysis. We also performed interviews with public and population health stakeholders in Africa to gather their experiences, perceptions, and expectations of data sharing. Results: Our findings described global stakeholder experiences on research data sharing. We identified some challenges and measures to harness available resources and incentivise data sharing. We further highlight progress made by the different groups in Africa and identified the infrastructural requirements and considerations when implementing data sharing platforms. Furthermore, the review suggests key reforms required, particularly in the areas of consenting, privacy protection, data ownership, governance, and data access. Conclusions: The findings underscore the critical role of inclusion, social justice, public good, data security, accountability, legislation, reciprocity, and mutual respect in developing a responsive, ethical, durable, and integrated research data sharing ecosystem.
Collapse
Affiliation(s)
- Jude O. Igumbor
- School of Public Health, University of the Witwatersrand, Johannesburg, Gauteng, 2193, South Africa
| | - Edna N. Bosire
- School of Public Health, University of the Witwatersrand, Johannesburg, Gauteng, 2193, South Africa
| | - Marta Vicente-Crespo
- School of Public Health, University of the Witwatersrand, Johannesburg, Gauteng, 2193, South Africa
- African Population and Health Research Centre, Nairobi, Kenya
| | - Ehimario U. Igumbor
- Nigeria Centre for Disease Control, Abuja, Nigeria
- School of Public Health, University of the Western Cape, Cape Town, Western Cape, South Africa
| | - Uthman A. Olalekan
- Warwick-Centre for Applied Health Research and Delivery (WCAHRD), Division of Health Sciences, Warwick Medical School, University of Warwick, Coventry, UK
| | - Tobias F. Chirwa
- School of Public Health, University of the Witwatersrand, Johannesburg, Gauteng, 2193, South Africa
| | | | | | - Sharon Fonn
- School of Public Health, University of the Witwatersrand, Johannesburg, Gauteng, 2193, South Africa
| |
Collapse
|
11
|
Almeida JR, Silva JF, Matos S, Oliveira JL. A two-stage workflow to extract and harmonize drug mentions from clinical notes into observational databases. J Biomed Inform 2021; 120:103849. [PMID: 34214696 DOI: 10.1016/j.jbi.2021.103849] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2020] [Revised: 06/04/2021] [Accepted: 06/19/2021] [Indexed: 01/02/2023]
Abstract
BACKGROUND The content of the clinical notes that have been continuously collected along patients' health history has the potential to provide relevant information about treatments and diseases, and to increase the value of structured data available in Electronic Health Records (EHR) databases. EHR databases are currently being used in observational studies which lead to important findings in medical and biomedical sciences. However, the information present in clinical notes is not being used in those studies, since the computational analysis of this unstructured data is much complex in comparison to structured data. METHODS We propose a two-stage workflow for solving an existing gap in Extraction, Transformation and Loading (ETL) procedures regarding observational databases. The first stage of the workflow extracts prescriptions present in patient's clinical notes, while the second stage harmonises the extracted information into their standard definition and stores the resulting information in a common database schema used in observational studies. RESULTS We validated this methodology using two distinct data sets, in which the goal was to extract and store drug related information in a new Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM) database. We analysed the performance of the used annotator as well as its limitations. Finally, we described some practical examples of how users can explore these datasets once migrated to OMOP CDM databases. CONCLUSION With this methodology, we were able to show a strategy for using the information extracted from the clinical notes in business intelligence tools, or for other applications such as data exploration through the use of SQL queries. Besides, the extracted information complements the data present in OMOP CDM databases which was not directly available in the EHR database.
Collapse
Affiliation(s)
- João Rafael Almeida
- DETI/IEETA, University of Aveiro, Aveiro, Portugal; Department of Computation, University of A Coruña, A Coruña, Spain.
| | | | - Sérgio Matos
- DETI/IEETA, University of Aveiro, Aveiro, Portugal.
| | | |
Collapse
|
12
|
Abstract
Journal publishers play an important role in the open research data ecosystem. Through open data policies that include public data archiving mandates and data availability statements, journal publishers help promote transparency in research and wider access to a growing scholarly record. The library and information science (LIS) discipline has a unique relationship with both open data initiatives and academic publishing and may be well-positioned to adopt rigorous open data policies. This study examines the information provided on public-facing websites of LIS journals in order to describe the extent, and nature, of open data guidance provided to prospective authors. Open access journals in the discipline have disproportionately adopted detailed, strict open data policies. Commercial publishers, which account for the largest share of publishing in the discipline, have largely adopted weaker policies. Rigorous policies, adopted by a minority of journals, describe the rationale, application, and expectations for open research data, while most journals that provide guidance on the matter use hesitant and vague language. Recommendations are provided for strengthening journal open data policies.
Collapse
|
13
|
Lucas-Dominguez R, Alonso-Arroyo A, Vidal-Infer A, Aleixandre-Benavent R. The sharing of research data facing the COVID-19 pandemic. Scientometrics 2021; 126:4975-4990. [PMID: 33935332 PMCID: PMC8072296 DOI: 10.1007/s11192-021-03971-6] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2020] [Accepted: 03/24/2021] [Indexed: 11/25/2022]
Abstract
During the previous Ebola and Zika outbreaks, researchers shared their data, allowing many published epidemiological studies to be produced only from open research data, to speed up investigations and control of these infections. This study aims to evaluate the dissemination of the COVID-19 research data underlying scientific publications. Analysis of COVID-19 publications from December 1, 2019, to April 30, 2020, was conducted through the PubMed Central repository to evaluate the research data available through its publication as supplementary material or deposited in repositories. The PubMed Central search generated 5,905 records, of which 804 papers included complementary research data, especially as supplementary material (77.4%). The most productive journals were The New England Journal of Medicine, The Lancet and The Lancet Infectious Diseases, the most frequent keyword was pneumonia, and the most used repositories were GitHub and GenBank. An expected growth in the number of published articles following the course of the pandemics is confirmed in this work, while the underlying research data are only 13.6%. It can be deduced that data sharing is not a common practice, even in health emergencies, such as the present one. High-impact generalist journals have accounted for a large share of global publishing. The topics most often covered are related to epidemiological and public health concepts, genetics, virology and respiratory diseases, such as pneumonia. However, it is essential to interpret these data with caution following the evolution of publications and their funding in the coming months.
Collapse
Affiliation(s)
- Rut Lucas-Dominguez
- Department of the History of Science and Information Science, School of Medicine and Dentistry, University of Valencia, Avda. Blasco Ibañez 15, 46010 Valencia, Spain
- UISYS, Joint Research Unit CSIC–University of Valencia, Pza. Cisneros 4, 46003 Valencia, Spain
- CIBERONC, Valencia, Spain
| | - Adolfo Alonso-Arroyo
- Department of the History of Science and Information Science, School of Medicine and Dentistry, University of Valencia, Avda. Blasco Ibañez 15, 46010 Valencia, Spain
- UISYS, Joint Research Unit CSIC–University of Valencia, Pza. Cisneros 4, 46003 Valencia, Spain
| | - Antonio Vidal-Infer
- Department of the History of Science and Information Science, School of Medicine and Dentistry, University of Valencia, Avda. Blasco Ibañez 15, 46010 Valencia, Spain
- UISYS, Joint Research Unit CSIC–University of Valencia, Pza. Cisneros 4, 46003 Valencia, Spain
| | - Rafael Aleixandre-Benavent
- UISYS, Joint Research Unit CSIC–University of Valencia, Pza. Cisneros 4, 46003 Valencia, Spain
- Ingenio (CSIC-Politechnic University of Valencia), Ciudad Politécnica de La Innovación, Edif 8E 4º, Camino de Vera s/n, 46022 Valencia, Spain
| |
Collapse
|
14
|
Zhang L, Ma L. Does open data boost journal impact: evidence from Chinese economics. Scientometrics 2021; 126:3393-3419. [PMID: 33612885 PMCID: PMC7882418 DOI: 10.1007/s11192-021-03897-z] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2020] [Accepted: 02/02/2021] [Indexed: 11/23/2022]
Abstract
To encourage research transparency and replication, more and more journals have been requiring authors to share original datasets and analytic procedures supporting their publications. Does open data boost journal impact? In this article, we report one of the first empirical studies to assess the effects of open data on journal impact. China Industrial Economics (CIE) mandated authors to open their research data in the end of 2016, which is the first to embrace open data among Chinese journals and provides a natural experiment for policy evaluation. We use the data of 37 Chinese economics journals from 2001 to 2019 and apply synthetic control method to causally estimate the effects of open data, and our results show that open data has significantly increased the citations of journal articles. On average, the current- and second-year citations of articles published with CIE have increased by 1 ~ 4 times, and articles published before the open data policy also benefited from the spillover effect. Our findings suggest that journals can leverage compulsory open data to develop reputation and amplify academic impacts.
Collapse
Affiliation(s)
- Liwei Zhang
- School of Innovation and Entrepreneurship, Shandong University, Jimo District, Qingdao, Shandong province China
| | - Liang Ma
- School of Public Administration and Policy, Renmin University of China, Haidian District, Beijing, China
| |
Collapse
|
15
|
Adolph KE. Oh, Behave!: PRESIDENTIAL ADDRESS, XXth International Conference on Infant Studies New Orleans, LA, US May 2016. INFANCY 2020; 25:374-392. [PMID: 33100922 PMCID: PMC7580788 DOI: 10.1111/infa.12336] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2019] [Accepted: 10/30/2019] [Indexed: 01/18/2023]
Abstract
Behavior is essential for understanding infant learning and development. Although behavior is transient and ephemeral, we have the technology to make it tangible and enduring. Video uniquely captures and preserves the details of behavior and the surrounding context. By sharing videos for documentation and data reuse, we can exploit the tremendous opportuni-ties provided by infancy research and overcome the important challenges in studying behavior. The Datavyu video coding software and Databrary digital video library provide tools and infrastructure for mining and sharing the richness of video. This article is based on my Presidential Address to the International Congress on Infant Studies in New Orleans, May 22, 2016 (Video 1 at https://www.databrary.org/volume/955/slot/39352/-?asset=190106. Given that the article de-scribes the power of video for understanding behavior, I use video clips rather than static images to illustrate most of my points, and the videos are shared on the Databrary library.
Collapse
Affiliation(s)
- Karen E Adolph
- Department of Psychology, New York University, New York, NY, USA
| |
Collapse
|
16
|
Abstract
This work provides an overview of a Spanish survey on research data, which was carried out within the framework of the project Datasea at the beginning of 2015. It is covered by the objectives of sustainable development (goal 9) to support the research. The purpose of the study was to identify the habits and current experiences of Spanish researchers in the health sciences in relation to the management and sharing of raw research data. Method: An electronic questionnaire composed of 40 questions divided into three blocks was designed. The three Section s contained questions on the following aspects: (A) personal information; (B) creation and reuse of data; and (C) preservation of data. The questionnaire was sent by email to a list of universities in Spain to be distributed among their researchers and professors. A total of 1063 researchers completed the questionnaire. More than half of the respondents (54.9%) lacked a data management plan; nearly a quarter had storage systems for the research group; 81.5% used personal computers to store data; “Contact with colleagues” was the most frequent means used to locate and access other researchers’ data; and nearly 60% of researchers stated their data were available to the research group and collaborating colleagues. The main fears about sharing were legal questions (47.9%), misuse or interpretation of data (42.7%), and loss of authorship (28.7%). The results allow us to understand the state of data sharing among Spanish researchers and can serve as a basis to identify the needs of researchers to share data, optimize existing infrastructure, and promote data sharing among those who do not practice it yet.
Collapse
|
17
|
Abstract
Purpose
Chemical databases have had a significant impact on the way scientists search for and use information. The purpose of this paper is to spark informed discussion and fuel debate on the issue of citations to chemical databases.
Design/methodology/approach
A citation analysis to four major chemical databases was undertaken to examine resource coverage and impact in the scientific literature. Two commercial databases (SciFinder and Reaxys) and two public databases (PubChem and ChemSpider) were analyzed using the “Cited Reference Search” in the Science Citation Index Expanded from the Web of Science (WoS) database. Citations to these databases between 2000 and 2016 (inclusive) were evaluated by document types and publication growth curves. A review of the distribution trends of chemical databases in peer-reviewed articles was conducted through a citation count analysis by country, organization, journal and WoS category.
Findings
In total, 862 scholarly articles containing a citation to one or more of the four databases were identified as only steadily increasing since 2000. The study determined that authors at academic institutions worldwide reference chemical databases in high-impact journals from notable publishers and mainly in the field of chemistry.
Originality/value
The research is a first attempt to evaluate the practice of citation to major chemical databases in the scientific literature. This paper proposes that citing chemical databases gives merit and recognition to the resources as well as credibility and validity to the scholarly communication process and also further discusses recommendations for citing and referencing databases.
Collapse
|
18
|
Ju B, Kim Y. The formation of research ethics for data sharing by biological scientists: an empirical analysis. ASLIB J INFORM MANAG 2019. [DOI: 10.1108/ajim-12-2018-0296] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Purpose
The purpose of this paper is to investigate how biological scientists form research ethics for data sharing, and what the major factors affecting biological scientists’ formation of research ethics for data sharing are.
Design/methodology/approach
A research model for data sharing was developed based on the consequential theorists’ perspective of ethics. An online survey of 577 participants was administered, and the proposed research model was validated with a structural equation modeling technique.
Findings
The results show that egoism factors (perceived reputation, perceived risk, perceived effort), utilitarianism factors (perceived community benefit and perceived reciprocity) and norm of practice factors (perceived pressure by funding agency, perceived pressure by journal and norm of data sharing) all contribute to the formation of research ethics for data sharing.
Research limitations/implications
This research employed the consequentialist perspective of ethics for its research model development, and the proposed research model nicely explained how egoism, utilitarianism and norm of practice factors influence biological scientists’ research ethics for data sharing, which eventually leads to their data sharing intentions.
Practical implications
This research provides important practical implications for examining scientists’ data sharing behaviors from the perspective of research ethics. This research suggests that scientists’ data sharing behaviors can be better facilitated by emphasizing their egoism, utilitarianism and normative factors involved in research ethics for data sharing.
Originality/value
The ethical perspectives in data sharing research has been under-studied; this research sheds light on biological scientists’ formation of research ethics for data sharing, which can be applied in promoting scientists’ data sharing behaviors across different disciplines.
Collapse
|
19
|
Abstract
Data sharing can be defined as the release of research data that can be used by others. With the recent open-science movement, there has been a call for free access to data, tools and methods in academia. In recent years, subject-based and institutional repositories and data centres have emerged along with online publishing. Many scientific records, including published articles and data, have been made available via new platforms. In the United Kingdom, most major research funders had a data policy and require researchers to include a ‘data-sharing plan’ when applying for funding. However, there are a number of barriers to the full-scale adoption of data sharing. Those barriers are not only technical, but also psychological and social. A survey was conducted with over 1800 UK-based academics to explore the extent of support of data sharing and the characteristics and factors associated with data-sharing practice. It found that while most academics recognised the importance of sharing research data, most of them had never shared or reused research data. There were differences in the extent of data sharing between different gender, academic disciplines, age and seniority. It also found that the awareness of Research Council UK’s (RCUK) Open-Access (OA) policy, experience of Gold and Green OA publishing, attitudes towards the importance of data sharing and experience of using secondary data were associated with the practice of data sharing. A small group of researchers used social media such as Twitter, blogs and Facebook to promote the research data they had shared online. Our findings contribute to the knowledge and understanding of open science and offer recommendations to academic institutions, journals and funding agencies.
Collapse
|
20
|
Rüegg SR, Nielsen LR, Buttigieg SC, Santa M, Aragrande M, Canali M, Ehlinger T, Chantziaras I, Boriani E, Radeski M, Bruce M, Queenan K, Häsler B. A Systems Approach to Evaluate One Health Initiatives. Front Vet Sci 2018; 5:23. [PMID: 29594154 PMCID: PMC5854661 DOI: 10.3389/fvets.2018.00023] [Citation(s) in RCA: 73] [Impact Index Per Article: 12.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2017] [Accepted: 02/05/2018] [Indexed: 11/13/2022] Open
Abstract
Challenges calling for integrated approaches to health, such as the One Health (OH) approach, typically arise from the intertwined spheres of humans, animals, and ecosystems constituting their environment. Initiatives addressing such wicked problems commonly consist of complex structures and dynamics. As a result of the EU COST Action (TD 1404) "Network for Evaluation of One Health" (NEOH), we propose an evaluation framework anchored in systems theory to address the intrinsic complexity of OH initiatives and regard them as subsystems of the context within which they operate. Typically, they intend to influence a system with a view to improve human, animal, and environmental health. The NEOH evaluation framework consists of four overarching elements, namely: (1) the definition of the initiative and its context, (2) the description of the theory of change with an assessment of expected and unexpected outcomes, (3) the process evaluation of operational and supporting infrastructures (the "OH-ness"), and (4) an assessment of the association(s) between the process evaluation and the outcomes produced. It relies on a mixed methods approach by combining a descriptive and qualitative assessment with a semi-quantitative scoring for the evaluation of the degree and structural balance of "OH-ness" (summarised in an OH-index and OH-ratio, respectively) and conventional metrics for different outcomes in a multi-criteria-decision-analysis. Here, we focus on the methodology for Elements (1) and (3) including ready-to-use Microsoft Excel spreadsheets for the assessment of the "OH-ness". We also provide an overview of Element (2), and refer to the NEOH handbook for further details, also regarding Element (4) (http://neoh.onehealthglobal.net). The presented approach helps researchers, practitioners, and evaluators to conceptualise and conduct evaluations of integrated approaches to health and facilitates comparison and learning across different OH activities thereby facilitating decisions on resource allocation. The application of the framework has been described in eight case studies in the same Frontiers research topic and provides first data on OH-index and OH-ratio, which is an important step towards their validation and the creation of a dataset for future benchmarking, and to demonstrate under which circumstances OH initiatives provide added value compared to disciplinary or conventional health initiatives.
Collapse
Affiliation(s)
- Simon R. Rüegg
- Vetsuisse Faculty, University of Zurich, Zurich, Switzerland
| | | | | | - Mijalche Santa
- Faculty of Economics—Skopje, Saints Cyril and Methodius University of Skopje, Skopje, Macedonia
| | - Maurizio Aragrande
- Department of Agricultural and Food Sciences, University of Bologna, Bologna, Italy
| | - Massimo Canali
- Department of Agricultural and Food Sciences, University of Bologna, Bologna, Italy
| | - Timothy Ehlinger
- Center for Global Health Equity, University of Wisconsin Milwaukee, Milwaukee, WI, United States
| | | | - Elena Boriani
- Global Decision Support Initiative (GDSI), Technical University of Denmark, Kongens Lyngby, Denmark
- National Food Institute, Technical University of Denmark, Kongens Lyngby, Denmark
| | - Miroslav Radeski
- Faculty of Veterinary Medicine, Saints Cyril and Methodius University of Skopje, Skopje, Macedonia
| | - Mieghan Bruce
- School of Veterinary and Life Science, Murdoch University, Perth, WA, Australia
| | | | | |
Collapse
|
21
|
Houtkoop BL, Chambers C, Macleod M, Bishop DVM, Nichols TE, Wagenmakers EJ. Data Sharing in Psychology: A Survey on Barriers and Preconditions. ADVANCES IN METHODS AND PRACTICES IN PSYCHOLOGICAL SCIENCE 2018. [DOI: 10.1177/2515245917751886] [Citation(s) in RCA: 80] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Despite its potential to accelerate academic progress in psychological science, public data sharing remains relatively uncommon. In order to discover the perceived barriers to public data sharing and possible means for lowering them, we conducted a survey, which elicited responses from 600 authors of articles in psychology. The results confirmed that data are shared only infrequently. Perceived barriers included respondents’ belief that sharing is not a common practice in their fields, their preference to share data only upon request, their perception that sharing requires extra work, and their lack of training in sharing data. Our survey suggests that strong encouragement from institutions, journals, and funders will be particularly effective in overcoming these barriers, in combination with educational materials that demonstrate where and how data can be shared effectively.
Collapse
Affiliation(s)
| | - Chris Chambers
- Brain Research Imaging Centre (CUBRIC), School of Psychology, Cardiff University
| | | | | | - Thomas E. Nichols
- Oxford Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, Nuffield Department of Population Health, University of Oxford
- Wellcome Centre for Integrative Neuroimaging, FMRIB, Nuffield Department of Clinical Neurosciences, University of Oxford
- Department of Statistics, University of Warwick
| | | |
Collapse
|
22
|
Mongeon P, Robinson-Garcia N, Jeng W, Costas R. Incorporating data sharing to the reward system of science. ASLIB J INFORM MANAG 2017. [DOI: 10.1108/ajim-01-2017-0024] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Purpose
It is widely recognized that sharing data is beneficial not only for science but also for the common good, and researchers are increasingly expected to share their data. However, many researchers are still not making their data available, one of the reasons being that this activity is not adequately recognized in the current reward system of science. Since the attribution of data sets to individual researchers is necessary if we are to include them in research evaluation processes, the purpose of this paper is to explore the feasibility of linking data set records from DataCite to the authors of articles indexed in the Web of Science.
Design/methodology/approach
DataCite and WoS records are linked together based on the similarity between the names of the data sets’ creators and the articles’ authors, as well as the similarity between the noun phrases in the titles of the data sets and the titles and abstract of the articles.
Findings
The authors report that a large number of DataCite records can be attributed to specific authors in WoS, and the authors demonstrate that the prevalence of data sharing varies greatly depending on the research discipline.
Originality/value
It is yet unclear how data sharing can provide adequate recognition for individual researchers. Bibliometric indicators are commonly used for research evaluation, but to date no large-scale assessment of individual researchers’ data sharing activities has been carried out.
Collapse
|
23
|
Kim Y. Fostering scientists’ data sharing behaviors via data repositories, journal supplements, and personal communication methods. Inf Process Manag 2017. [DOI: 10.1016/j.ipm.2017.03.003] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
|
24
|
Understanding Perspectives on Sharing Neutron Data at Oak Ridge National Laboratory. DATA SCIENCE JOURNAL 2017. [DOI: 10.5334/dsj-2017-035] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
|
25
|
|
26
|
He L, Han Z. Do usage counts of scientific data make sense? An investigation of the Dryad repository. LIBRARY HI TECH 2017. [DOI: 10.1108/lht-12-2016-0158] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Purpose
The purpose of this paper is to evaluate the impact of scientific data in order to assess the reliability of data to support data curation, to establish trust between researchers to support reuse of digital data and encourage researchers to share more data.
Design/methodology/approach
The authors compared the correlations between usage counts of associated data in Dryad and citation counts of articles in Web of Science in different subject areas in order to assess the possibility of using altmetric indicators to evaluate scientific data.
Findings
There are high positive correlations between usage counts of data and citation counts of associated articles. The citation counts of article’s shared data are higher than the average citation counts in most of the subject areas examined by the authors.
Practical implications
The paper suggests that usage counts of data could be potentially used to evaluate scholarly impact of scientific data, especially for those subject areas without special data repositories.
Originality/value
The study examines the possibility to use usage counts to evaluate the impact of scientific data in a generic repository Dryad by different subject categories.
Collapse
|
27
|
Perrier L, Blondal E, Ayala AP, Dearborn D, Kenny T, Lightfoot D, Reka R, Thuna M, Trimble L, MacDonald H. Research data management in academic institutions: A scoping review. PLoS One 2017; 12:e0178261. [PMID: 28542450 PMCID: PMC5441653 DOI: 10.1371/journal.pone.0178261] [Citation(s) in RCA: 31] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2017] [Accepted: 04/26/2017] [Indexed: 11/18/2022] Open
Abstract
OBJECTIVE The purpose of this study is to describe the volume, topics, and methodological nature of the existing research literature on research data management in academic institutions. MATERIALS AND METHODS We conducted a scoping review by searching forty literature databases encompassing a broad range of disciplines from inception to April 2016. We included all study types and data extracted on study design, discipline, data collection tools, and phase of the research data lifecycle. RESULTS We included 301 articles plus 10 companion reports after screening 13,002 titles and abstracts and 654 full-text articles. Most articles (85%) were published from 2010 onwards and conducted within the sciences (86%). More than three-quarters of the articles (78%) reported methods that included interviews, cross-sectional, or case studies. Most articles (68%) included the Giving Access to Data phase of the UK Data Archive Research Data Lifecycle that examines activities such as sharing data. When studies were grouped into five dominant groupings (Stakeholder, Data, Library, Tool/Device, and Publication), data quality emerged as an integral element. CONCLUSION Most studies relied on self-reports (interviews, surveys) or accounts from an observer (case studies) and we found few studies that collected empirical evidence on activities amongst data producers, particularly those examining the impact of research data management interventions. As well, fewer studies examined research data management at the early phases of research projects. The quality of all research outputs needs attention, from the application of best practices in research data management studies, to data producers depositing data in repositories for long-term use.
Collapse
Affiliation(s)
- Laure Perrier
- Gerstein Science Information Centre, University of Toronto, Toronto, Ontario, Canada
| | - Erik Blondal
- Institute of Health Policy, Management and Evaluation, University of Toronto, Toronto, Ontario, Canada
| | - A. Patricia Ayala
- Gerstein Science Information Centre, University of Toronto, Toronto, Ontario, Canada
| | - Dylanne Dearborn
- Gerstein Science Information Centre, University of Toronto, Toronto, Ontario, Canada
| | - Tim Kenny
- Gibson D. Lewis Health Science Library, UNT Health Science Center, Fort Worth, Texas, United States of America
| | - David Lightfoot
- St. Michael’s Hospital Library, St. Michael’s Hospital, Toronto, Ontario, Canada
| | - Roger Reka
- Faculty of Information, University of Toronto, Toronto, Ontario, Canada
| | - Mindy Thuna
- Engineering & Computer Science Library, University of Toronto, Toronto, Ontario, Canada
| | - Leanne Trimble
- Map and Data Library, University of Toronto, Toronto, Ontario, Canada
| | | |
Collapse
|
28
|
Vasilevsky NA, Minnier J, Haendel MA, Champieux RE. Reproducible and reusable research: are journal data sharing policies meeting the mark? PeerJ 2017; 5:e3208. [PMID: 28462024 PMCID: PMC5407277 DOI: 10.7717/peerj.3208] [Citation(s) in RCA: 68] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2016] [Accepted: 03/20/2017] [Indexed: 11/20/2022] Open
Abstract
Background There is wide agreement in the biomedical research community that research data sharing is a primary ingredient for ensuring that science is more transparent and reproducible. Publishers could play an important role in facilitating and enforcing data sharing; however, many journals have not yet implemented data sharing policies and the requirements vary widely across journals. This study set out to analyze the pervasiveness and quality of data sharing policies in the biomedical literature. Methods The online author’s instructions and editorial policies for 318 biomedical journals were manually reviewed to analyze the journal’s data sharing requirements and characteristics. The data sharing policies were ranked using a rubric to determine if data sharing was required, recommended, required only for omics data, or not addressed at all. The data sharing method and licensing recommendations were examined, as well any mention of reproducibility or similar concepts. The data was analyzed for patterns relating to publishing volume, Journal Impact Factor, and the publishing model (open access or subscription) of each journal. Results A total of 11.9% of journals analyzed explicitly stated that data sharing was required as a condition of publication. A total of 9.1% of journals required data sharing, but did not state that it would affect publication decisions. 23.3% of journals had a statement encouraging authors to share their data but did not require it. A total of 9.1% of journals mentioned data sharing indirectly, and only 14.8% addressed protein, proteomic, and/or genomic data sharing. There was no mention of data sharing in 31.8% of journals. Impact factors were significantly higher for journals with the strongest data sharing policies compared to all other data sharing criteria. Open access journals were not more likely to require data sharing than subscription journals. Discussion Our study confirmed earlier investigations which observed that only a minority of biomedical journals require data sharing, and a significant association between higher Impact Factors and journals with a data sharing requirement. Moreover, while 65.7% of the journals in our study that required data sharing addressed the concept of reproducibility, as with earlier investigations, we found that most data sharing policies did not provide specific guidance on the practices that ensure data is maximally available and reusable.
Collapse
Affiliation(s)
- Nicole A Vasilevsky
- OHSU Library, Oregon Health & Science University, Portland, OR, United States.,Department of Medical Informatics and Clinical Epidemiology, Oregon Health & Science University, Portland, OR, United States
| | - Jessica Minnier
- OHSU-PSU School of Public Health, Oregon Health & Science University, Portland, OR, United States
| | - Melissa A Haendel
- OHSU Library, Oregon Health & Science University, Portland, OR, United States.,Department of Medical Informatics and Clinical Epidemiology, Oregon Health & Science University, Portland, OR, United States
| | - Robin E Champieux
- OHSU Library, Oregon Health & Science University, Portland, OR, United States
| |
Collapse
|
29
|
Tripathi M, Chand M, Sonkar SK, Jeevan VKJ. A brief assessment of researchers’ perceptions towards research data in India. IFLA JOURNAL-INTERNATIONAL FEDERATION OF LIBRARY ASSOCIATIONS 2017. [DOI: 10.1177/0340035216686984] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
The present paper dwells upon the importance of raw data for the development of science and research. The study includes an interview of 40 researchers and faculty members to understand their perception towards the raw data. It has suggested that the libraries can play a pivotal role in extending support to the researchers for organizing, archiving and preserving raw data for future use. Libraries may evolve a system at the university level wherein the researchers and faculty members be encouraged to deposit their raw research data in the institutional repositories, which most of the university libraries have developed.
Collapse
Affiliation(s)
| | | | - S. K. Sonkar
- Babasaheb Bhimrao Ambedkar University, Lucknow, India
| | | |
Collapse
|
30
|
Li K, Lin X, Greenberg J. Software citation, reuse and metadata considerations: An exploratory study examining LAMMPS. ACTA ACUST UNITED AC 2016. [DOI: 10.1002/pra2.2016.14505301072] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
Affiliation(s)
- Kai Li
- College of Computing and Informatics; Drexel University; 3141 Chestnut Street Philadelphia PA 19104
| | - Xia Lin
- College of Computing and Informatics; Drexel University; 3141 Chestnut Street Philadelphia PA 19104
| | - Jane Greenberg
- College of Computing and Informatics; Drexel University; 3141 Chestnut Street Philadelphia PA 19104
| |
Collapse
|
31
|
Llamas C, González MA, Hernández C, Vegas J. Open source platform for collaborative construction of wearable sensor datasets for human motion analysis and an application for gait analysis. J Biomed Inform 2016; 63:249-258. [PMID: 27593165 DOI: 10.1016/j.jbi.2016.08.025] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2016] [Revised: 08/26/2016] [Accepted: 08/31/2016] [Indexed: 11/17/2022]
Abstract
Nearly every practical improvement in modeling human motion is well founded in a properly designed collection of data or datasets. These datasets must be made publicly available for the community could validate and accept them. It is reasonable to concede that a collective, guided enterprise could serve to devise solid and substantial datasets, as a result of a collaborative effort, in the same sense as the open software community does. In this way datasets could be complemented, extended and expanded in size with, for example, more individuals, samples and human actions. For this to be possible some commitments must be made by the collaborators, being one of them sharing the same data acquisition platform. In this paper, we offer an affordable open source hardware and software platform based on inertial wearable sensors in a way that several groups could cooperate in the construction of datasets through common software suitable for collaboration. Some experimental results about the throughput of the overall system are reported showing the feasibility of acquiring data from up to 6 sensors with a sampling frequency no less than 118Hz. Also, a proof-of-concept dataset is provided comprising sampled data from 12 subjects suitable for gait analysis.
Collapse
Affiliation(s)
- César Llamas
- Departamento de Informática, Universidad de Valladolid, Spain.
| | | | | | - Jesús Vegas
- Departamento de Informática, Universidad de Valladolid, Spain.
| |
Collapse
|
32
|
Rowhani-Farid A, Barnett AG. Has open data arrived at the British Medical Journal (BMJ)? An observational study. BMJ Open 2016; 6:e011784. [PMID: 27737882 PMCID: PMC5073489 DOI: 10.1136/bmjopen-2016-011784] [Citation(s) in RCA: 43] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/04/2016] [Revised: 05/27/2016] [Accepted: 08/25/2016] [Indexed: 11/16/2022] Open
Abstract
OBJECTIVE To quantify data sharing trends and data sharing policy compliance at the British Medical Journal (BMJ) by analysing the rate of data sharing practices, and investigate attitudes and examine barriers towards data sharing. DESIGN Observational study. SETTING The BMJ research archive. PARTICIPANTS 160 randomly sampled BMJ research articles from 2009 to 2015, excluding meta-analysis and systematic reviews. MAIN OUTCOME MEASURES Percentages of research articles that indicated the availability of their raw data sets in their data sharing statements, and those that easily made their data sets available on request. RESULTS 3 articles contained the data in the article. 50 out of 157 (32%) remaining articles indicated the availability of their data sets. 12 used publicly available data and the remaining 38 were sent email requests to access their data sets. Only 1 publicly available data set could be accessed and only 6 out of 38 shared their data via email. So only 7/157 research articles shared their data sets, 4.5% (95% CI 1.8% to 9%). For 21 clinical trials bound by the BMJ data sharing policy, the per cent shared was 24% (8% to 47%). CONCLUSIONS Despite the BMJ's strong data sharing policy, sharing rates are low. Possible explanations for low data sharing rates could be: the wording of the BMJ data sharing policy, which leaves room for individual interpretation and possible loopholes; that our email requests ended up in researchers spam folders; and that researchers are not rewarded for sharing their data. It might be time for a more effective data sharing policy and better incentives for health and medical researchers to share their data.
Collapse
Affiliation(s)
- Anisa Rowhani-Farid
- Australian Centre for Health Services Innovation (AusHSI), Institute of Health and Biomedical Innovation (IHBI), Queensland University of Technology (QUT), Brisbane, Queensland, Australia
| | - Adrian G Barnett
- Australian Centre for Health Services Innovation (AusHSI), Institute of Health and Biomedical Innovation (IHBI), Queensland University of Technology (QUT), Brisbane, Queensland, Australia
| |
Collapse
|
33
|
Pan X, Yan E, Hua W. Disciplinary differences of software use and impact in scientific literature. Scientometrics 2016. [DOI: 10.1007/s11192-016-2138-4] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023]
|
34
|
Mayernik MS, Hart DL, Maull KE, Weber NM. Assessing and tracing the outcomes and impact of research infrastructures. J Assoc Inf Sci Technol 2016. [DOI: 10.1002/asi.23721] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023]
Affiliation(s)
- Matthew S. Mayernik
- NCAR Library, National Center for Atmospheric Research (NCAR)P.O. Box 3000Boulder CO80307‐3000
| | - David L. Hart
- Computational & Information Systems Laboratory, National Center for Atmospheric Research (NCAR)P.O. Box 3000Boulder CO 80307‐3000
| | - Keith E. Maull
- NCAR Library, National Center for Atmospheric Research (NCAR)P.O. Box 3000Boulder CO80307‐3000
| | - Nicholas M. Weber
- Information School, University of WashingtonBox 352840Seattle WA98195‐2840
| |
Collapse
|
35
|
Abstract
In this study, we explore the citedness of research data, its distribution over time and its relation to the availability of a digital object identifier (DOI) in the Thomson Reuters database Data Citation Index (DCI). We investigate if cited research data "impacts" the (social) web, reflected by altmetrics scores, and if there is any relationship between the number of citations and the sum of altmetrics scores from various social media platforms. Three tools are used to collect altmetrics scores, namely PlumX, ImpactStory, and Altmetric.com, and the corresponding results are compared. We found that out of the three altmetrics tools, PlumX has the best coverage. Our experiments revealed that research data remain mostly uncited (about 85 %), although there has been an increase in citing data sets published since 2008. The percentage of the number of cited research data with a DOI in DCI has decreased in the last years. Only nine repositories are responsible for research data with DOIs and two or more citations. The number of cited research data with altmetrics "foot-prints" is even lower (4-9 %) but shows a higher coverage of research data from the last decade. In our study, we also found no correlation between the number of citations and the total number of altmetrics scores. Yet, certain data types (i.e. survey, aggregate data, and sequence data) are more often cited and also receive higher altmetrics scores. Additionally, we performed citation and altmetric analyses of all research data published between 2011 and 2013 in four different disciplines covered by the DCI. In general, these results correspond very well with the ones obtained for research data cited at least twice and also show low numbers in citations and in altmetrics. Finally, we observed that there are disciplinary differences in the availability and extent of altmetrics scores.
Collapse
|
36
|
Roche DG, Kruuk LEB, Lanfear R, Binning SA. Public Data Archiving in Ecology and Evolution: How Well Are We Doing? PLoS Biol 2015; 13:e1002295. [PMID: 26556502 PMCID: PMC4640582 DOI: 10.1371/journal.pbio.1002295] [Citation(s) in RCA: 130] [Impact Index Per Article: 14.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Policies that mandate public data archiving (PDA) successfully increase accessibility to data underlying scientific publications. However, is the data quality sufficient to allow reuse and reanalysis? We surveyed 100 datasets associated with nonmolecular studies in journals that commonly publish ecological and evolutionary research and have a strong PDA policy. Out of these datasets, 56% were incomplete, and 64% were archived in a way that partially or entirely prevented reuse. We suggest that cultural shifts facilitating clearer benefits to authors are necessary to achieve high-quality PDA and highlight key guidelines to help authors increase their data's reuse potential and compliance with journal data policies.
Collapse
Affiliation(s)
- Dominique G. Roche
- Division of Evolution, Ecology and Genetics, Research School of Biology, The Australian National University, Canberra, Australian Capital Territory, Australia
- Éco-Éthologie, Institut de Biologie, Université de Neuchâtel, Neuchâtel, Switzerland
- * E-mail:
| | - Loeske E. B. Kruuk
- Division of Evolution, Ecology and Genetics, Research School of Biology, The Australian National University, Canberra, Australian Capital Territory, Australia
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, United Kingdom
| | - Robert Lanfear
- Division of Evolution, Ecology and Genetics, Research School of Biology, The Australian National University, Canberra, Australian Capital Territory, Australia
- Department of Biological Sciences, Macquarie University, Sydney, Australia
| | - Sandra A. Binning
- Division of Evolution, Ecology and Genetics, Research School of Biology, The Australian National University, Canberra, Australian Capital Territory, Australia
- Éco-Éthologie, Institut de Biologie, Université de Neuchâtel, Neuchâtel, Switzerland
| |
Collapse
|
37
|
To Share or Not to Share? A Survey of Biomedical Researchers in the U.S. Southwest, an Ethnically Diverse Region. PLoS One 2015; 10:e0138239. [PMID: 26378445 PMCID: PMC4574947 DOI: 10.1371/journal.pone.0138239] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2015] [Accepted: 08/26/2015] [Indexed: 01/14/2023] Open
Abstract
Background Cancer health disparities research depends on access to biospecimens from diverse racial/ethnic populations. This multimethodological study, using mixed methods for quantitative and qualitative analysis of survey results, assessed barriers, concerns, and practices for sharing biospecimens/data among researchers working with biospecimens from minority populations in a 5 state region of the United States (Arizona, Colorado, New Mexico, Oklahoma, and Texas). The ultimate goals of this research were to understand data sharing barriers among biomedical researchers; guide strategies to increase participation in biospecimen research; and strengthen collaborative opportunities among researchers. Methods and Population Email invitations to anonymous participants (n = 605 individuals identified by the NIH RePORT database), resulted in 112 responses. The survey assessed demographics, specimen collection data, and attitudes about virtual biorepositories. Respondents were primarily principal investigators at PhD granting institutions (91.1%) conducting basic (62.3%) research; most were non-Hispanic White (63.4%) and men (60.6%). The low response rate limited the statistical power of the analyses, further the number of respondents for each survey question was variable. Results Findings from this study identified barriers to biospecimen research, including lack of access to sufficient biospecimens, and limited availability of diverse tissue samples. Many of these barriers can be attributed to poor annotation of biospecimens, and researchers’ unwillingness to share existing collections. Addressing these barriers to accessing biospecimens is essential to combating cancer in general and cancer health disparities in particular. This study confirmed researchers’ willingness to participate in a virtual biorepository (n = 50 respondents agreed). However, researchers in this region listed clear specifications for establishing and using such a biorepository: specifications related to standardized procedures, funding, and protections of human subjects and intellectual property. The results help guide strategies to increase data sharing behaviors and to increase participation of researchers with multiethnic biospecimen collections in collaborative research endeavors Conclusions Data sharing by researchers is essential to leveraging knowledge and resources needed for the advancement of research on cancer health disparities. Although U.S. funding entities have guidelines for data and resource sharing, future efforts should address researcher preferences in order to promote collaboration to address cancer health disparities.
Collapse
|
38
|
|
39
|
Kim Y, Burns CS. Norms of data sharing in biological sciences: The roles of metadata, data repository, and journal and funding requirements. J Inf Sci 2015. [DOI: 10.1177/0165551515592098] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023]
Abstract
Institutional environments, comprising regulative pressures by funding agencies and journal publishers, and institutional resources, including the availabilities of data repositories and standards for metadata, function as important determinants in scientists’ data-sharing norms, attitudes and behaviours. This research investigates how these functions influence biological scientists’ data-sharing norms and how the data-sharing norms influence their data-sharing behaviours mediated by attitudes towards data sharing. The research model was developed based on the integration of institutional theory and theory of planned behaviour. The proposed research model was validated based on a total of 608 responses from a national survey conducted in the USA. The Partial Least Squares (PLS) was employed to analyse the survey data. Results show how institutional pressures by funding agencies and journals and the availabilities of data repository and metadata standards all have significant influences on data-sharing norms, which have significant influences on data-sharing behaviours, as mediated by attitudes towards data sharing.
Collapse
Affiliation(s)
- Youngseek Kim
- School of Information Science, University of Kentucky, USA
| | - C. Sean Burns
- School of Information Science, University of Kentucky, USA
| |
Collapse
|
40
|
Kim Y, Zhang P. Understanding data sharing behaviors of STEM researchers: The roles of attitudes, norms, and data repositories. LIBRARY & INFORMATION SCIENCE RESEARCH 2015. [DOI: 10.1016/j.lisr.2015.04.006] [Citation(s) in RCA: 46] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
41
|
Robinson-García N, Jiménez-Contreras E, Torres-Salinas D. Analyzing data citation practices using the data citation index. J Assoc Inf Sci Technol 2015. [DOI: 10.1002/asi.23529] [Citation(s) in RCA: 49] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Affiliation(s)
- Nicolas Robinson-García
- EC3 Evaluación de la Ciencia y de la Documentación Científica; Colegio Máximo de Cartuja; Universidad de Granada; s/n Granada 18071 Spain
- EC3Metrics; Gran Vía de Colón, 48, 5° 10 Granada 18071 Spain
| | - Evaristo Jiménez-Contreras
- EC3 Evaluación de la Ciencia y de la Documentación Científica; Colegio Máximo de Cartuja; Universidad de Granada; s/n Granada 18071 Spain
- EC3Metrics; Gran Vía de Colón, 48, 5° 10 Granada 18071 Spain
| | - Daniel Torres-Salinas
- EC3Metrics; EC3 Evaluación de la Ciencia y de la Documentación Científica; Universidad de Navarra; Gran Vía de Colón, 48, 5° 10 Granada 18071 Spain
| |
Collapse
|
42
|
Faniel IM, Kriesberg A, Yakel E. Social scientists' satisfaction with data reuse. J Assoc Inf Sci Technol 2015. [DOI: 10.1002/asi.23480] [Citation(s) in RCA: 49] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
Affiliation(s)
| | - Adam Kriesberg
- School of Information; University of Michigan; 105 South State Street Ann Arbor MI 48109-1285
| | - Elizabeth Yakel
- School of Information; University of Michigan; 105 South State Street Ann Arbor MI 48109-1285
| |
Collapse
|
43
|
Sturges P, Bamkin M, Anders JH, Hubbard B, Hussain A, Heeley M. Research data sharing: Developing a stakeholder-driven model for journal policies. J Assoc Inf Sci Technol 2015. [DOI: 10.1002/asi.23336] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Affiliation(s)
- Paul Sturges
- School of Business and Economics; Loughborough University; Loughborough LE11 3TU United Kingdom
| | - Marianne Bamkin
- Centre for Research Communications; Nottingham University; Nottingham NG7 2UH United Kingdom
| | - Jane H.S. Anders
- Centre for Research Communications; Nottingham University; Nottingham NG7 2UH United Kingdom
| | - Bill Hubbard
- Centre for Research Communications; Nottingham University; Nottingham NG7 2UH United Kingdom
| | - Azhar Hussain
- Centre for Research Communications; Nottingham University; Nottingham NG7 2UH United Kingdom
| | - Melanie Heeley
- Libraries, Research and Learning Resources; Nottingham University; Nottingham NG7 2UH United Kingdom
| |
Collapse
|
44
|
Aleixandre-Benavent R, Alonso-Arroyo A, Vidal-Infer A, Catalá-López F. [Promotion of public access and share of raw data from scientific research]. Med Clin (Barc) 2015; 144:283-4. [PMID: 24952665 DOI: 10.1016/j.medcli.2014.04.012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2014] [Accepted: 04/10/2014] [Indexed: 10/25/2022]
Affiliation(s)
- Rafael Aleixandre-Benavent
- Unidad de Información e Investigación Social y Sanitaria, Instituto de Historia de la Medicina y de la Ciencia López Piñero (CSIC-Universitat de València), Valencia, España.
| | - Adolfo Alonso-Arroyo
- Departamento de Historia de la Ciencia y Documentación, Universitat de València, Valencia, España
| | - Antonio Vidal-Infer
- Departamento de Historia de la Ciencia y Documentación, Universitat de València, Valencia, España
| | - Ferrán Catalá-López
- División de Farmacoepidemiología y Farmacovigilancia, Agencia Española de Medicamentos y Productos Sanitarios (AEMPS), Madrid, España; Fundación Instituto de Investigación en Servicios de Salud, Valencia, España
| |
Collapse
|
45
|
Kim Y, Stanton JM. Institutional and individual factors affecting scientists' data-sharing behaviors: A multilevel analysis. J Assoc Inf Sci Technol 2015. [DOI: 10.1002/asi.23424] [Citation(s) in RCA: 79] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Affiliation(s)
- Youngseek Kim
- School of Library and Information Science; University of Kentucky; 331 Little Library Building Lexington KY 40506--0224
| | - Jeffrey M. Stanton
- School of Information Studies; Syracuse University; 206 Hinds Hall Syracuse NY 13244-4100
| |
Collapse
|
46
|
Abstract
Despite widespread support from policy makers, funding agencies, and scientific journals, academic researchers rarely make their research data available to others. At the same time, data sharing in research is attributed a vast potential for scientific progress. It allows the reproducibility of study results and the reuse of old data for new research questions. Based on a systematic review of 98 scholarly papers and an empirical survey among 603 secondary data users, we develop a conceptual framework that explains the process of data sharing from the primary researcher’s point of view. We show that this process can be divided into six descriptive categories: Data donor, research organization, research community, norms, data infrastructure, and data recipients. Drawing from our findings, we discuss theoretical implications regarding knowledge creation and dissemination as well as research policy measures to foster academic collaboration. We conclude that research data cannot be regarded as knowledge commons, but research policies that better incentivise data sharing are needed to improve the quality of research results and foster scientific progress.
Collapse
Affiliation(s)
- Benedikt Fecher
- Internet-enabled Innovation, Alexander von Humboldt Institute for Internet and Society, Berlin, Germany
- Research Infrastructure, German Institute for Economic Research, Berlin, Germany
- * E-mail:
| | - Sascha Friesike
- Internet-enabled Innovation, Alexander von Humboldt Institute for Internet and Society, Berlin, Germany
| | - Marcel Hebing
- German Socio-Economic Panel, German Institute for Economic Research, Berlin, Germany
| |
Collapse
|
47
|
Kim Y, Stanton JM. Institutional and individual influences on scientists' data sharing behaviors: A multilevel analysis. ACTA ACUST UNITED AC 2014. [DOI: 10.1002/meet.14505001093] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
48
|
Vines TH, Andrew RL, Bock DG, Franklin MT, Gilbert KJ, Kane NC, Moore JS, Moyers BT, Renaut S, Rennison DJ, Veen T, Yeaman S. Mandated data archiving greatly improves access to research data. FASEB J 2013; 27:1304-8. [PMID: 23288929 DOI: 10.1096/fj.12-218164] [Citation(s) in RCA: 119] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
The data underlying scientific papers should be accessible to researchers both now and in the future, but how best can we ensure that these data are available? Here we examine the effectiveness of four approaches to data archiving: no stated archiving policy, recommending (but not requiring) archiving, and two versions of mandating data deposition at acceptance. We control for differences between data types by trying to obtain data from papers that use a single, widespread population genetic analysis, structure. At one extreme, we found that mandated data archiving policies that require the inclusion of a data availability statement in the manuscript improve the odds of finding the data online almost 1000-fold compared to having no policy. However, archiving rates at journals with less stringent policies were only very slightly higher than those with no policy at all. We also assessed the effectiveness of asking for data directly from authors and obtained over half of the requested datasets, albeit with ∼8 d delay and some disagreement with authors. Given the long-term benefits of data accessibility to the academic community, we believe that journal-based mandatory data archiving policies and mandatory data availability statements should be more widely adopted.
Collapse
Affiliation(s)
- Timothy H Vines
- Biodiversity Department, University of British Columbia, 6270 University Blvd Vancouver BC, Canada, V6T 1Z4.
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
49
|
Sayogo DS, Pardo TA. Exploring the determinants of scientific data sharing: Understanding the motivation to publish research data. GOVERNMENT INFORMATION QUARTERLY 2013. [DOI: 10.1016/j.giq.2012.06.011] [Citation(s) in RCA: 74] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
50
|
Affiliation(s)
- Christine L. Borgman
- UCLA Department of Information Studies; GSEIS Bldg; Rm. 235, Box 951520; Los Angeles; California; 90095-1520
| |
Collapse
|