Carter KW, Francis RW, Bresnahan M, Gissler M, Grønborg TK, Gross R, Gunnes N, Hammond G, Hornig M, Hultman CM, Huttunen J, Langridge A, Leonard H, Newman S, Parner ET, Petersson G, Reichenberg A, Sandin S, Schendel DE, Schalkwyk L, Sourander A, Steadman C, Stoltenberg C, Suominen A, Surén P, Susser E, Sylvester Vethanayagam A, Yusof Z. ViPAR: a software platform for the Virtual Pooling and Analysis of Research Data. Int J Epidemiol 2015; 45:408-416. [PMID: 26452388; PMCID: PMC4864874; DOI: 10.1093/ije/dyv193]
Abstract
Background:
Research studies exploring the determinants of disease require sufficient statistical power to detect meaningful effects. Sample size is often increased through centralized pooling of disparately located datasets, though ethical, privacy and data ownership issues can often hamper this process. Methods that facilitate the sharing of research data that are sympathetic with these issues and which allow flexible and detailed statistical analyses are therefore in critical need. We have created a software platform for the Virtual Pooling and Analysis of Research data (ViPAR), which employs free and open source methods to provide researchers with a web-based platform to analyse datasets housed in disparate locations.
Methods:
Database federation permits controlled access to remotely located datasets from a central location. The Secure Shell protocol allows data to be securely exchanged between devices over an insecure network. ViPAR combines these free technologies into a solution that facilitates ‘virtual pooling’ where data can be temporarily pooled into computer memory and made available for analysis without the need for permanent central storage.
Results:
Within the ViPAR infrastructure, remote sites manage their own harmonized research dataset in a database hosted at their site, while a central server hosts the data federation component and a secure analysis portal. When an analysis is initiated, requested data are retrieved from each remote site and virtually pooled at the central site. The data are then analysed by statistical software and, on completion, results of the analysis are returned to the user and the virtually pooled data are removed from memory.
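The retrieve, pool, analyse and discard cycle described above can be illustrated with a minimal sketch. It is not ViPAR's implementation: in-memory SQLite databases stand in for the remote site databases, no federation layer or Secure Shell transport is involved, and the table name `cohort`, the column names, and the functions `make_site` and `analyse_virtually_pooled` are hypothetical names chosen for illustration.

```python
import sqlite3
from statistics import mean

# Hypothetical harmonized schema shared by every site (an assumption for this sketch).
ALLOWED_COLUMNS = {"birth_year", "outcome"}


def make_site(rows):
    """Create a stand-in 'remote site': an in-memory SQLite database
    holding one harmonized table. A real site would host its own database."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE cohort (birth_year INTEGER, outcome INTEGER)")
    conn.executemany("INSERT INTO cohort VALUES (?, ?)", rows)
    conn.commit()
    return conn


def analyse_virtually_pooled(sites, columns):
    """Fetch the requested columns from every site, pool them in memory,
    run a simple analysis, and discard the pooled rows afterwards."""
    if "outcome" not in columns or not set(columns) <= ALLOWED_COLUMNS:
        raise ValueError("columns must be harmonized and include 'outcome'")
    query = f"SELECT {', '.join(columns)} FROM cohort"
    pooled = []  # exists only for the duration of this call
    try:
        for site_name, conn in sites.items():
            for row in conn.execute(query):
                pooled.append((site_name, *row))
        outcome_idx = columns.index("outcome") + 1  # +1 skips the site label
        return {
            "n": len(pooled),
            "outcome_rate": mean(r[outcome_idx] for r in pooled),
        }
    finally:
        pooled.clear()  # nothing is stored centrally once the analysis ends


if __name__ == "__main__":
    sites = {
        "site_a": make_site([(2000, 1), (2001, 0)]),
        "site_b": make_site([(1999, 1), (2002, 1)]),
    }
    print(analyse_virtually_pooled(sites, ["birth_year", "outcome"]))
```

Only the summary result leaves the function; the pooled rows are cleared in the `finally` clause whether or not the analysis succeeds, which mirrors the idea of removing virtually pooled data from memory on completion.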
Conclusions:
ViPAR is a secure, flexible and powerful analysis platform built on open source technology that is currently in use by large international consortia, and is made publicly available at [http://bioinformatics.childhealthresearch.org.au/software/vipar/].
Affiliation(s)
- K W Carter
  - Telethon Kids Institute, University of Western Australia, Perth, WA, Australia
- R W Francis
  - Telethon Kids Institute, University of Western Australia, Perth, WA, Australia
- M Bresnahan
  - Department of Epidemiology, Mailman School of Public Health, Columbia University, New York, NY, USA; New York State Psychiatric Institute, New York, NY, USA
- M Gissler
  - National Institute for Health and Welfare, Helsinki, Finland; NHV Nordic School of Public Health, Gothenburg, Sweden
- T K Grønborg
  - Department of Public Health, University of Aarhus, Aarhus, Denmark
- R Gross
  - Division of Psychiatry, Sheba Medical Center, Tel Hashomer, Israel; Department of Epidemiology and Preventive Medicine, Sackler Faculty of Medicine, Tel Aviv University, Ramat Aviv, Israel
- N Gunnes
  - Norwegian Institute of Public Health, Oslo, Norway
- G Hammond
  - Telethon Kids Institute, University of Western Australia, Perth, WA, Australia
- M Hornig
  - Department of Epidemiology, Mailman School of Public Health, Columbia University, New York, NY, USA; Center for Infection and Immunity, Mailman School of Public Health, Columbia University, New York, NY, USA
- A Langridge
  - Telethon Kids Institute, University of Western Australia, Perth, WA, Australia
- H Leonard
  - Telethon Kids Institute, University of Western Australia, Perth, WA, Australia
- S Newman
  - Institute of Psychiatry, King's College London, London, UK
- E T Parner
  - Department of Public Health, University of Aarhus, Aarhus, Denmark
- A Reichenberg
  - Department of Psychosis Studies, Institute of Psychiatry, King's College London, London, UK; Departments of Preventative Medicine and Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- S Sandin
  - Karolinska Institutet, Stockholm, Sweden
- D E Schendel
  - Department of Public Health, Section for Epidemiology, University of Aarhus, Aarhus, Denmark; Department of Economics and Business, National Centre for Register-based Research, University of Aarhus, Aarhus, Denmark; Lundbeck Foundation Initiative for Integrative Psychiatric Research, iPSYCH, Copenhagen, Denmark
- L Schalkwyk
  - Institute of Psychiatry, King's College London, London, UK
- A Sourander
  - Child Psychiatry Research Center, Department of Child Psychiatry, Turku University, Turku, Finland; Turku University Hospital, Turku, Finland
- C Steadman
  - Telethon Kids Institute, University of Western Australia, Perth, WA, Australia
- C Stoltenberg
  - Norwegian Institute of Public Health, Oslo, Norway; Department of Global Public Health and Primary Care, University of Bergen, Bergen, Norway
- A Suominen
  - Department of Child Psychiatry, Turku University, Turku, Finland
- P Surén
  - Norwegian Institute of Public Health, Oslo, Norway
- E Susser
  - Department of Epidemiology, Mailman School of Public Health, Columbia University, New York, NY, USA; New York State Psychiatric Institute, New York, NY, USA
- Z Yusof
  - Karolinska Institutet, Stockholm, Sweden