1
|
Stawiski M, Bucciarelli V, Vogel D, Hemm S. Optimizing neuroscience data management by combining REDCap, BIDS and SQLite: a case study in Deep Brain Stimulation. Front Neuroinform 2024; 18:1435971. [PMID: 39301120 PMCID: PMC11410584 DOI: 10.3389/fninf.2024.1435971] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2024] [Accepted: 08/19/2024] [Indexed: 09/22/2024] Open
Abstract
Neuroscience studies entail the generation of massive collections of heterogeneous data (e.g. demographics, clinical records, medical images). Integration and analysis of such data in research centers is pivotal for elucidating disease mechanisms and improving clinical outcomes. However, data collection in clinics often relies on non-standardized methods, such as paper-based documentation. Moreover, diverse data types are collected in different departments hindering efficient data organization, secure sharing and compliance to the FAIR (Findable, Accessible, Interoperable, Reusable) principles. Henceforth, in this manuscript we present a specialized data management system designed to enhance research workflows in Deep Brain Stimulation (DBS), a state-of-the-art neurosurgical procedure employed to treat symptoms of movement and psychiatric disorders. The system leverages REDCap to promote accurate data capture in hospital settings and secure sharing with research institutes, Brain Imaging Data Structure (BIDS) as image storing standard and a DBS-specific SQLite database as comprehensive data store and unified interface to all data types. A self-developed Python tool automates the data flow between these three components, ensuring their full interoperability. The proposed framework has already been successfully employed for capturing and analyzing data of 107 patients from 2 medical institutions. It effectively addresses the challenges of managing, sharing and retrieving diverse data types, fostering advancements in data quality, organization, analysis, and collaboration among medical and research institutions.
Collapse
Affiliation(s)
- Marc Stawiski
- Neuroengineering Group, Institute for Medical Engineering and Medical Informatics, School of Life Sciences, FHNW University of Applied Sciences and Arts Northwestern Switzerland, Muttenz, Switzerland
| | - Vittoria Bucciarelli
- Neuroengineering Group, Institute for Medical Engineering and Medical Informatics, School of Life Sciences, FHNW University of Applied Sciences and Arts Northwestern Switzerland, Muttenz, Switzerland
| | - Dorian Vogel
- Neuroengineering Group, Institute for Medical Engineering and Medical Informatics, School of Life Sciences, FHNW University of Applied Sciences and Arts Northwestern Switzerland, Muttenz, Switzerland
| | - Simone Hemm
- Neuroengineering Group, Institute for Medical Engineering and Medical Informatics, School of Life Sciences, FHNW University of Applied Sciences and Arts Northwestern Switzerland, Muttenz, Switzerland
| |
Collapse
|
2
|
Van Horn JD. Editorial: What the New White House Rules on Equitable Access Mean for the Neurosciences. Neuroinformatics 2023; 21:1-4. [PMID: 36567364 DOI: 10.1007/s12021-022-09618-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/05/2022] [Indexed: 12/27/2022]
Affiliation(s)
- John Darrell Van Horn
- Professor of Psychology and Data Science, University of Virginia, Charlottesville, VA, 22903, USA.
| |
Collapse
|
3
|
Kuplicki R, Touthang J, Al Zoubi O, Mayeli A, Misaki M, Aupperle RL, Teague TK, McKinney BA, Paulus MP, Bodurka J. Common Data Elements, Scalable Data Management Infrastructure, and Analytics Workflows for Large-Scale Neuroimaging Studies. Front Psychiatry 2021; 12:682495. [PMID: 34220587 PMCID: PMC8247461 DOI: 10.3389/fpsyt.2021.682495] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/19/2021] [Accepted: 05/19/2021] [Indexed: 01/16/2023] Open
Abstract
Neuroscience studies require considerable bioinformatic support and expertise. Numerous high-dimensional and multimodal datasets must be preprocessed and integrated to create robust and reproducible analysis pipelines. We describe a common data elements and scalable data management infrastructure that allows multiple analytics workflows to facilitate preprocessing, analysis and sharing of large-scale multi-level data. The process uses the Brain Imaging Data Structure (BIDS) format and supports MRI, fMRI, EEG, clinical, and laboratory data. The infrastructure provides support for other datasets such as Fitbit and flexibility for developers to customize the integration of new types of data. Exemplar results from 200+ participants and 11 different pipelines demonstrate the utility of the infrastructure.
Collapse
Affiliation(s)
- Rayus Kuplicki
- Laureate Institute for Brain Research, Tulsa, OK, United States
| | - James Touthang
- Laureate Institute for Brain Research, Tulsa, OK, United States
| | - Obada Al Zoubi
- Laureate Institute for Brain Research, Tulsa, OK, United States
| | - Ahmad Mayeli
- Laureate Institute for Brain Research, Tulsa, OK, United States
| | - Masaya Misaki
- Laureate Institute for Brain Research, Tulsa, OK, United States
| | - NeuroMAP-Investigators
- Laureate Institute for Brain Research, Tulsa, OK, United States
- Department of Community Medicine, Oxley College of Health Sciences, University of Tulsa, Tulsa, OK, United States
| | - Robin L. Aupperle
- Laureate Institute for Brain Research, Tulsa, OK, United States
- Department of Community Medicine, Oxley College of Health Sciences, University of Tulsa, Tulsa, OK, United States
| | - T. Kent Teague
- Department of Surgery, University of Oklahoma School of Community Medicine, Tulsa, OK, United States
- Department of Psychiatry, University of Oklahoma School of Community Medicine, Tulsa, OK, United States
- Department of Biochemistry and Microbiology, Oklahoma State University Center for Health Sciences, Tulsa, OK, United States
| | - Brett A. McKinney
- Department of Mathematics, University of Tulsa, Tulsa, OK, United States
- Tandy School of Computer Science, University of Tulsa, Tulsa, OK, United States
| | | | - Jerzy Bodurka
- Laureate Institute for Brain Research, Tulsa, OK, United States
- Stephenson School of Biomedical Engineering, University of Oklahoma, Norman, OK, United States
| |
Collapse
|
4
|
Seok D, Smyk N, Jaskir M, Cook P, Elliott M, Girelli T, Scott JC, Balderston N, Beer J, Stock J, Makhoul W, Gur RC, Davatzikos C, Shinohara R, Sheline Y. Dimensional connectomics of anxious misery, a human connectome study related to human disease: Overview of protocol and data quality. NEUROIMAGE-CLINICAL 2020; 28:102489. [PMID: 33395980 PMCID: PMC7708855 DOI: 10.1016/j.nicl.2020.102489] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/15/2020] [Revised: 10/09/2020] [Accepted: 10/27/2020] [Indexed: 11/19/2022]
Abstract
We present a new imaging study of 200 adults experiencing depression and anxiety. Quantitative measures of image quality indicate comparable quality to the HCP-YA. In addition, a comprehensive set of assessments measured patients’ symptom profiles. Data will be publicly available through the NIMH Data Archive starting fall 2020.
Disparate diagnostic categories from the Diagnostic and Statistical Manual of Mental Disorders (DSM), including generalized anxiety disorder, major depressive disorder and post-traumatic stress disorder, share common behavioral and phenomenological dysfunctions. While high levels of comorbidity and common features across these disorders suggest shared mechanisms, past research in psychopathology has largely proceeded based on the syndromal taxonomy established by the DSM rather than on a biologically-informed framework of neural, cognitive and behavioral dysfunctions. In line with the National Institute of Mental Health’s Research Domain Criteria (RDoC) framework, we present a Human Connectome Study Related to Human Disease that is intentionally designed to generate and test novel, biologically-motivated dimensions of psychopathology. The Dimensional Connectomics of Anxious Misery study is collecting neuroimaging, cognitive and behavioral data from a heterogeneous population of adults with varying degrees of depression, anxiety and trauma, as well as a set of healthy comparators (to date, n = 97 and n = 24, respectively). This sample constitutes a dataset uniquely situated to elucidate relationships between brain circuitry and dysfunctions of the Negative Valence construct of the RDoC framework. We present a comprehensive overview of the eligibility criteria, clinical procedures and neuroimaging methods of our project. After describing our protocol, we present group-level activation maps from task fMRI data and independent components maps from resting state data. Finally, using quantitative measures of neuroimaging data quality, we demonstrate excellent data quality relative to a subset of the Human Connectome Project of Young Adults (n = 97), as well as comparable profiles of cortical thickness from T1-weighted imaging and generalized fractional anisotropy from diffusion weighted imaging. This manuscript presents results from the first 121 participants of our full target 250 participant dataset, timed with the release of this data to the National Institute of Mental Health Data Archive in fall 2020, with the remaining half of the dataset to be released in 2021.
Collapse
Affiliation(s)
- Darsol Seok
- Center for Neuromodulation in Depression and Stress, Department of Psychiatry, Perelman School of Medicine, University of Pennsylvania, United States
| | - Nathan Smyk
- Center for Neuromodulation in Depression and Stress, Department of Psychiatry, Perelman School of Medicine, University of Pennsylvania, United States
| | - Marc Jaskir
- Center for Neuromodulation in Depression and Stress, Department of Psychiatry, Perelman School of Medicine, University of Pennsylvania, United States
| | - Philip Cook
- Department of Radiology, Perelman School of Medicine, University of Pennsylvania, United States
| | - Mark Elliott
- Department of Radiology, Perelman School of Medicine, University of Pennsylvania, United States
| | - Tommaso Girelli
- Center for Neuromodulation in Depression and Stress, Department of Psychiatry, Perelman School of Medicine, University of Pennsylvania, United States
| | - J Cobb Scott
- Department of Psychiatry, Perelman School of Medicine, University of Pennsylvania, United States
| | - Nicholas Balderston
- Center for Neuromodulation in Depression and Stress, Department of Psychiatry, Perelman School of Medicine, University of Pennsylvania, United States
| | - Joanne Beer
- Penn Statistics in Imaging and Visualization Center, Department of Biostatistics, Epidemiology, and Informatics, Perelman School of Medicine, University of Pennsylvania, United States
| | - Janet Stock
- Center for Neuromodulation in Depression and Stress, Department of Psychiatry, Perelman School of Medicine, University of Pennsylvania, United States
| | - Walid Makhoul
- Center for Neuromodulation in Depression and Stress, Department of Psychiatry, Perelman School of Medicine, University of Pennsylvania, United States
| | - Ruben C Gur
- Department of Psychiatry, Perelman School of Medicine, University of Pennsylvania, United States
| | - Christos Davatzikos
- Department of Radiology, Perelman School of Medicine, University of Pennsylvania, United States
| | - Russell Shinohara
- Penn Statistics in Imaging and Visualization Center, Department of Biostatistics, Epidemiology, and Informatics, Perelman School of Medicine, University of Pennsylvania, United States
| | - Yvette Sheline
- Center for Neuromodulation in Depression and Stress, Department of Psychiatry, Perelman School of Medicine, University of Pennsylvania, United States; Department of Radiology, Perelman School of Medicine, University of Pennsylvania, United States; Department of Psychiatry, Perelman School of Medicine, University of Pennsylvania, United States.
| |
Collapse
|
5
|
Keshavan A, Poline JB. From the Wet Lab to the Web Lab: A Paradigm Shift in Brain Imaging Research. Front Neuroinform 2019; 13:3. [PMID: 30881299 PMCID: PMC6405692 DOI: 10.3389/fninf.2019.00003] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2018] [Accepted: 01/22/2019] [Indexed: 01/08/2023] Open
Abstract
Web technology has transformed our lives, and has led to a paradigm shift in the computational sciences. As the neuroimaging informatics research community amasses large datasets to answer complex neuroscience questions, we find that the web is the best medium to facilitate novel insights by way of improved collaboration and communication. Here, we review the landscape of web technologies used in neuroimaging research, and discuss future applications, areas for improvement, and the limitations of using web technology in research. Fully incorporating web technology in our research lifecycle requires not only technical skill, but a widespread culture change; a shift from the small, focused "wet lab" to a multidisciplinary and largely collaborative "web lab."
Collapse
Affiliation(s)
- Anisha Keshavan
- Department of Speech and Hearing, Institute for Neuroengineering, eScience Institute, University of Washington, Seattle, WA, United States
| | - Jean-Baptiste Poline
- Faculty of Medicine, McConnell Brain Imaging Centre, Ludmer Centre for Neuroinformatics and Mental Health, Montreal Neurological Institute and Hospital, McGill University, Montreal, QC, Canada
- Henry H. Wheeler Jr. Brain Imaging Center, Helen Wills Neuroscience Institute, University of California, Berkeley, Berkeley, CA, United States
| |
Collapse
|
6
|
Duncan D, Vespa P, Pitkänen A, Braimah A, Lapinlampi N, Toga AW. Big data sharing and analysis to advance research in post-traumatic epilepsy. Neurobiol Dis 2019; 123:127-136. [PMID: 29864492 PMCID: PMC6274619 DOI: 10.1016/j.nbd.2018.05.026] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2018] [Revised: 05/24/2018] [Accepted: 05/31/2018] [Indexed: 11/26/2022] Open
Abstract
We describe the infrastructure and functionality for a centralized preclinical and clinical data repository and analytic platform to support importing heterogeneous multi-modal data, automatically and manually linking data across modalities and sites, and searching content. We have developed and applied innovative image and electrophysiology processing methods to identify candidate biomarkers from MRI, EEG, and multi-modal data. Based on heterogeneous biomarkers, we present novel analytic tools designed to study epileptogenesis in animal model and human with the goal of tracking the probability of developing epilepsy over time.
Collapse
Affiliation(s)
- Dominique Duncan
- Laboratory of Neuro Imaging, USC Stevens Neuroimaging and Informatics Institute, Keck School of Medicine of USC, University of Southern California, Los Angeles, CA, USA.
| | - Paul Vespa
- Division of Neurosurgery and Department of Neurology, University of California at Los Angeles School of Medicine, Los Angeles, CA, USA
| | - Asla Pitkänen
- A.I. Virtanen Institute for Molecular Sciences, University of Eastern Finland, Kuopio, Finland
| | - Adebayo Braimah
- Laboratory of Neuro Imaging, USC Stevens Neuroimaging and Informatics Institute, Keck School of Medicine of USC, University of Southern California, Los Angeles, CA, USA
| | - Niina Lapinlampi
- A.I. Virtanen Institute for Molecular Sciences, University of Eastern Finland, Kuopio, Finland
| | - Arthur W Toga
- Laboratory of Neuro Imaging, USC Stevens Neuroimaging and Informatics Institute, Keck School of Medicine of USC, University of Southern California, Los Angeles, CA, USA
| |
Collapse
|
7
|
Das S, Lecours Boucher X, Rogers C, Makowski C, Chouinard-Decorte F, Oros Klein K, Beck N, Rioux P, Brown ST, Mohaddes Z, Zweber C, Foing V, Forest M, O'Donnell KJ, Clark J, Meaney MJ, Greenwood CMT, Evans AC. Integration of "omics" Data and Phenotypic Data Within a Unified Extensible Multimodal Framework. Front Neuroinform 2018; 12:91. [PMID: 30631270 PMCID: PMC6315165 DOI: 10.3389/fninf.2018.00091] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2018] [Accepted: 11/16/2018] [Indexed: 12/11/2022] Open
Abstract
Analysis of “omics” data is often a long and segmented process, encompassing multiple stages from initial data collection to processing, quality control and visualization. The cross-modal nature of recent genomic analyses renders this process challenging to both automate and standardize; consequently, users often resort to manual interventions that compromise data reliability and reproducibility. This in turn can produce multiple versions of datasets across storage systems. As a result, scientists can lose significant time and resources trying to execute and monitor their analytical workflows and encounter difficulties sharing versioned data. In 2015, the Ludmer Centre for Neuroinformatics and Mental Health at McGill University brought together expertise from the Douglas Mental Health University Institute, the Lady Davis Institute and the Montreal Neurological Institute (MNI) to form a genetics/epigenetics working group. The objectives of this working group are to: (i) design an automated and seamless process for (epi)genetic data that consolidates heterogeneous datasets into the LORIS open-source data platform; (ii) streamline data analysis; (iii) integrate results with provenance information; and (iv) facilitate structured and versioned sharing of pipelines for optimized reproducibility using high-performance computing (HPC) environments via the CBRAIN processing portal. This article outlines the resulting generalizable “omics” framework and its benefits, specifically, the ability to: (i) integrate multiple types of biological and multi-modal datasets (imaging, clinical, demographics and behavioral); (ii) automate the process of launching analysis pipelines on HPC platforms; (iii) remove the bioinformatic barriers that are inherent to this process; (iv) ensure standardization and transparent sharing of processing pipelines to improve computational consistency; (v) store results in a queryable web interface; (vi) offer visualization tools to better view the data; and (vii) provide the mechanisms to ensure usability and reproducibility. This framework for workflows facilitates brain research discovery by reducing human error through automation of analysis pipelines and seamless linking of multimodal data, allowing investigators to focus on research instead of data handling.
Collapse
Affiliation(s)
- Samir Das
- McGill Centre for Integrative Neuroscience, Montreal Neurological Institute, Montreal, QC, Canada.,Montreal Neurological Institute, McGill University, Montreal, QC, Canada
| | - Xavier Lecours Boucher
- McGill Centre for Integrative Neuroscience, Montreal Neurological Institute, Montreal, QC, Canada.,Montreal Neurological Institute, McGill University, Montreal, QC, Canada
| | - Christine Rogers
- McGill Centre for Integrative Neuroscience, Montreal Neurological Institute, Montreal, QC, Canada.,Montreal Neurological Institute, McGill University, Montreal, QC, Canada
| | - Carolina Makowski
- McGill Centre for Integrative Neuroscience, Montreal Neurological Institute, Montreal, QC, Canada.,Montreal Neurological Institute, McGill University, Montreal, QC, Canada.,Douglas Hospital Research Centre, McGill University, Montreal, QC, Canada
| | - François Chouinard-Decorte
- McGill Centre for Integrative Neuroscience, Montreal Neurological Institute, Montreal, QC, Canada.,Montreal Neurological Institute, McGill University, Montreal, QC, Canada
| | - Kathleen Oros Klein
- Ludmer Centre for Neuroinformatics & Mental Health, McGill University, Montreal, QC, Canada.,Lady Davis Institute, Jewish General Hospital, McGill University, Montreal, QC, Canada
| | - Natacha Beck
- McGill Centre for Integrative Neuroscience, Montreal Neurological Institute, Montreal, QC, Canada.,Montreal Neurological Institute, McGill University, Montreal, QC, Canada
| | - Pierre Rioux
- McGill Centre for Integrative Neuroscience, Montreal Neurological Institute, Montreal, QC, Canada.,Montreal Neurological Institute, McGill University, Montreal, QC, Canada
| | - Shawn T Brown
- McGill Centre for Integrative Neuroscience, Montreal Neurological Institute, Montreal, QC, Canada.,Montreal Neurological Institute, McGill University, Montreal, QC, Canada
| | - Zia Mohaddes
- McGill Centre for Integrative Neuroscience, Montreal Neurological Institute, Montreal, QC, Canada.,Montreal Neurological Institute, McGill University, Montreal, QC, Canada
| | - Cole Zweber
- McGill Centre for Integrative Neuroscience, Montreal Neurological Institute, Montreal, QC, Canada.,Montreal Neurological Institute, McGill University, Montreal, QC, Canada
| | - Victoria Foing
- McGill Centre for Integrative Neuroscience, Montreal Neurological Institute, Montreal, QC, Canada.,Montreal Neurological Institute, McGill University, Montreal, QC, Canada
| | - Marie Forest
- Ludmer Centre for Neuroinformatics & Mental Health, McGill University, Montreal, QC, Canada.,Lady Davis Institute, Jewish General Hospital, McGill University, Montreal, QC, Canada
| | - Kieran J O'Donnell
- Douglas Hospital Research Centre, McGill University, Montreal, QC, Canada.,Ludmer Centre for Neuroinformatics & Mental Health, McGill University, Montreal, QC, Canada
| | - Joanne Clark
- Ludmer Centre for Neuroinformatics & Mental Health, McGill University, Montreal, QC, Canada
| | - Michael J Meaney
- Douglas Hospital Research Centre, McGill University, Montreal, QC, Canada.,Ludmer Centre for Neuroinformatics & Mental Health, McGill University, Montreal, QC, Canada
| | - Celia M T Greenwood
- Ludmer Centre for Neuroinformatics & Mental Health, McGill University, Montreal, QC, Canada.,Lady Davis Institute, Jewish General Hospital, McGill University, Montreal, QC, Canada
| | - Alan C Evans
- McGill Centre for Integrative Neuroscience, Montreal Neurological Institute, Montreal, QC, Canada.,Montreal Neurological Institute, McGill University, Montreal, QC, Canada
| |
Collapse
|
8
|
Kiar G, Gorgolewski KJ, Kleissas D, Roncal WG, Litt B, Wandell B, Poldrack RA, Wiener M, Vogelstein RJ, Burns R, Vogelstein JT. Science in the cloud (SIC): A use case in MRI connectomics. Gigascience 2017; 6:1-10. [PMID: 28327935 PMCID: PMC5467033 DOI: 10.1093/gigascience/gix013] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2016] [Revised: 02/13/2017] [Accepted: 03/01/2017] [Indexed: 01/08/2023] Open
Abstract
Modern technologies are enabling scientists to collect extraordinary amounts of complex and sophisticated data across a huge range of scales like never before. With this onslaught of data, we can allow the focal point to shift from data collection to data analysis. Unfortunately, lack of standardized sharing mechanisms and practices often make reproducing or extending scientific results very difficult. With the creation of data organization structures and tools that drastically improve code portability, we now have the opportunity to design such a framework for communicating extensible scientific discoveries. Our proposed solution leverages these existing technologies and standards, and provides an accessible and extensible model for reproducible research, called 'science in the cloud' (SIC). Exploiting scientific containers, cloud computing, and cloud data services, we show the capability to compute in the cloud and run a web service that enables intimate interaction with the tools and data presented. We hope this model will inspire the community to produce reproducible and, importantly, extensible results that will enable us to collectively accelerate the rate at which scientific breakthroughs are discovered, replicated, and extended.
Collapse
Affiliation(s)
- Gregory Kiar
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, USA
- Center for Imaging Science, Johns Hopkins University, Baltimore, MD, USA
| | | | - Dean Kleissas
- Johns Hopkins University Applied Physics Lab, Columbia, MD, USA
| | - William Gray Roncal
- Johns Hopkins University Applied Physics Lab, Columbia, MD, USA
- Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA
| | - Brian Litt
- Department of Bioengineering, University of Pennsylvania, Philadelphia, PA, USA
- Department of Neurology, Hospital of the University of Pennsylvania, Philadelphia, PA, USA
| | - Brian Wandell
- Department of Psychology, Stanford University, Stanford, CA, USA
- Center for Cognitive and Neurobiological Imaging, Stanford University, Stanford, CA, USA
| | | | - Martin Wiener
- Department of Psychology, George Mason University, Fairfax, VA, USA
| | | | - Randal Burns
- Department of Computer Science, Johns Hopkins University, Baltimore, MD, USA
| | - Joshua T. Vogelstein
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, USA
- Center for Imaging Science, Johns Hopkins University, Baltimore, MD, USA
| |
Collapse
|
9
|
Farber GK. Can data repositories help find effective treatments for complex diseases? Prog Neurobiol 2017; 152:200-212. [PMID: 27018167 PMCID: PMC5035561 DOI: 10.1016/j.pneurobio.2016.03.008] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2015] [Revised: 12/31/2015] [Accepted: 03/22/2016] [Indexed: 01/28/2023]
Abstract
There are many challenges to developing treatments for complex diseases. This review explores the question of whether it is possible to imagine a data repository that would increase the pace of understanding complex diseases sufficiently well to facilitate the development of effective treatments. First, consideration is given to the amount of data that might be needed for such a data repository and whether the existing data storage infrastructure is enough. Several successful data repositories are then examined to see if they have common characteristics. An area of science where unsuccessful attempts to develop a data infrastructure is then described to see what lessons could be learned for a data repository devoted to complex disease. Then, a variety of issues related to sharing data are discussed. In some of these areas, it is reasonably clear how to move forward. In other areas, there are significant open questions that need to be addressed by all data repositories. Using that baseline information, the question of whether data archives can be effective in understanding a complex disease is explored. The major goal of such a data archive is likely to be identifying biomarkers that define sub-populations of the disease.
Collapse
Affiliation(s)
- Gregory K Farber
- Office of Technology Development and Coordination, National Institute of Mental Health, National Institutes of Health, 6001 Executive Boulevard, Room 7162, Rockville, MD 20892-9640, USA.
| |
Collapse
|
10
|
Bui AAT, Van Horn JD. Envisioning the future of 'big data' biomedicine. J Biomed Inform 2017; 69:115-117. [PMID: 28366789 PMCID: PMC5613673 DOI: 10.1016/j.jbi.2017.03.017] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2017] [Revised: 03/17/2017] [Accepted: 03/29/2017] [Indexed: 01/23/2023]
Abstract
Through the increasing availability of more efficient data collection procedures, biomedical scientists are now confronting ever larger sets of data, often finding themselves struggling to process and interpret what they have gathered. This, while still more data continues to accumulate. This torrent of biomedical information necessitates creative thinking about how the data are being generated, how they might be best managed, analyzed, and eventually how they can be transformed into further scientific understanding for improving patient care. Recognizing this as a major challenge, the National Institutes of Health (NIH) has spearheaded the "Big Data to Knowledge" (BD2K) program - the agency's most ambitious biomedical informatics effort ever undertaken to date. In this commentary, we describe how the NIH has taken on "big data" science head-on, how a consortium of leading research centers are developing the means for handling large-scale data, and how such activities are being marshalled for the training of a new generation of biomedical data scientists. All in all, the NIH BD2K program seeks to position data science at the heart of 21st Century biomedical research.
Collapse
Affiliation(s)
- Alex A T Bui
- BD2K Centers Coordinating Center (BD2K CCC), University of California, Los Angeles, Los Angeles, CA, USA. http://www.bd2kccc.org
| | - John Darrell Van Horn
- BD2K Training Coordinating Center (BD2K TCC), University of Southern California, Los Angeles, CA, USA. http://www.bigdatau.org
| |
Collapse
|
11
|
Grigis A, Goyard D, Cherbonnier R, Gareau T, Papadopoulos Orfanos D, Chauvat N, Di Mascio A, Schumann G, Spooren W, Murphy D, Frouin V. Neuroimaging, Genetics, and Clinical Data Sharing in Python Using the CubicWeb Framework. Front Neuroinform 2017; 11:18. [PMID: 28360851 PMCID: PMC5352661 DOI: 10.3389/fninf.2017.00018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2016] [Accepted: 02/22/2017] [Indexed: 12/05/2022] Open
Abstract
In neurosciences or psychiatry, the emergence of large multi-center population imaging studies raises numerous technological challenges. From distributed data collection, across different institutions and countries, to final data publication service, one must handle the massive, heterogeneous, and complex data from genetics, imaging, demographics, or clinical scores. These data must be both efficiently obtained and downloadable. We present a Python solution, based on the CubicWeb open-source semantic framework, aimed at building population imaging study repositories. In addition, we focus on the tools developed around this framework to overcome the challenges associated with data sharing and collaborative requirements. We describe a set of three highly adaptive web services that transform the CubicWeb framework into a (1) multi-center upload platform, (2) collaborative quality assessment platform, and (3) publication platform endowed with massive-download capabilities. Two major European projects, IMAGEN and EU-AIMS, are currently supported by the described framework. We also present a Python package that enables end users to remotely query neuroimaging, genetics, and clinical data from scripts.
Collapse
Affiliation(s)
- Antoine Grigis
- UNATI, Neurospin, CEA, Université Paris-Saclay, Gif-sur-Yvette, France
| | - David Goyard
- UNATI, Neurospin, CEA, Université Paris-Saclay, Gif-sur-Yvette, France
| | - Robin Cherbonnier
- UNATI, Neurospin, CEA, Université Paris-Saclay, Gif-sur-Yvette, France
| | - Thomas Gareau
- UNATI, Neurospin, CEA, Université Paris-Saclay, Gif-sur-Yvette, France
| | | | | | | | - Gunter Schumann
- Medical Research Council, Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, Psychology and Neuroscience, King’s College London, London, UK
| | - Will Spooren
- F. Hoffmann-La Roche Pharmaceuticals, Basel, Switzerland
| | | | - Vincent Frouin
- UNATI, Neurospin, CEA, Université Paris-Saclay, Gif-sur-Yvette, France
| |
Collapse
|
12
|
Functional connectomics from a "big data" perspective. Neuroimage 2017; 160:152-167. [PMID: 28232122 DOI: 10.1016/j.neuroimage.2017.02.031] [Citation(s) in RCA: 50] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2016] [Revised: 01/21/2017] [Accepted: 02/13/2017] [Indexed: 01/10/2023] Open
Abstract
In the last decade, explosive growth regarding functional connectome studies has been observed. Accumulating knowledge has significantly contributed to our understanding of the brain's functional network architectures in health and disease. With the development of innovative neuroimaging techniques, the establishment of large brain datasets and the increasing accumulation of published findings, functional connectomic research has begun to move into the era of "big data", which generates unprecedented opportunities for discovery in brain science and simultaneously encounters various challenging issues, such as data acquisition, management and analyses. Big data on the functional connectome exhibits several critical features: high spatial and/or temporal precision, large sample sizes, long-term recording of brain activity, multidimensional biological variables (e.g., imaging, genetic, demographic, cognitive and clinic) and/or vast quantities of existing findings. We review studies regarding functional connectomics from a big data perspective, with a focus on recent methodological advances in state-of-the-art image acquisition (e.g., multiband imaging), analysis approaches and statistical strategies (e.g., graph theoretical analysis, dynamic network analysis, independent component analysis, multivariate pattern analysis and machine learning), as well as reliability and reproducibility validations. We highlight the novel findings in the application of functional connectomic big data to the exploration of the biological mechanisms of cognitive functions, normal development and aging and of neurological and psychiatric disorders. We advocate the urgent need to expand efforts directed at the methodological challenges and discuss the direction of applications in this field.
Collapse
|
13
|
Al-Jawahiri R, Milne E. Resources available for autism research in the big data era: a systematic review. PeerJ 2017; 5:e2880. [PMID: 28097074 PMCID: PMC5237363 DOI: 10.7717/peerj.2880] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2016] [Accepted: 12/07/2016] [Indexed: 12/31/2022] Open
Abstract
Recently, there has been a move encouraged by many stakeholders towards generating big, open data in many areas of research. One area where big, open data is particularly valuable is in research relating to complex heterogeneous disorders such as Autism Spectrum Disorder (ASD). The inconsistencies of findings and the great heterogeneity of ASD necessitate the use of big and open data to tackle important challenges such as understanding and defining the heterogeneity and potential subtypes of ASD. To this end, a number of initiatives have been established that aim to develop big and/or open data resources for autism research. In order to provide a useful data reference for autism researchers, a systematic search for ASD data resources was conducted using the Scopus database, the Google search engine, and the pages on 'recommended repositories' by key journals, and the findings were translated into a comprehensive list focused on ASD data. The aim of this review is to systematically search for all available ASD data resources providing the following data types: phenotypic, neuroimaging, human brain connectivity matrices, human brain statistical maps, biospecimens, and ASD participant recruitment. A total of 33 resources were found containing different types of data from varying numbers of participants. Description of the data available from each data resource, and links to each resource is provided. Moreover, key implications are addressed and underrepresented areas of data are identified.
Collapse
Affiliation(s)
- Reem Al-Jawahiri
- Department of Psychology, University of Sheffield , Sheffield , United Kingdom
| | - Elizabeth Milne
- Department of Psychology, University of Sheffield , Sheffield , United Kingdom
| |
Collapse
|
14
|
Angulo DA, Schneider C, Oliver JH, Charpak N, Hernandez JT. A Multi-facetted Visual Analytics Tool for Exploratory Analysis of Human Brain and Function Datasets. Front Neuroinform 2016; 10:36. [PMID: 27601990 PMCID: PMC4993811 DOI: 10.3389/fninf.2016.00036] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2016] [Accepted: 08/03/2016] [Indexed: 11/13/2022] Open
Abstract
Brain research typically requires large amounts of data from different sources, and often of different nature. The use of different software tools adapted to the nature of each data source can make research work cumbersome and time consuming. It follows that data is not often used to its fullest potential thus limiting exploratory analysis. This paper presents an ancillary software tool called BRAVIZ that integrates interactive visualization with real-time statistical analyses, facilitating access to multi-facetted neuroscience data and automating many cumbersome and error-prone tasks required to explore such data. Rather than relying on abstract numerical indicators, BRAVIZ emphasizes brain images as the main object of the analysis process of individuals or groups. BRAVIZ facilitates exploration of trends or relationships to gain an integrated view of the phenomena studied, thus motivating discovery of new hypotheses. A case study is presented that incorporates brain structure and function outcomes together with different types of clinical data.
Collapse
Affiliation(s)
- Diego A Angulo
- IMAGINE, Systems and Computing Engineering, Universidad de los Andes Bogota, Colombia
| | | | | | | | - Jose T Hernandez
- IMAGINE, Systems and Computing Engineering, Universidad de los Andes Bogota, Colombia
| |
Collapse
|
15
|
The brain imaging data structure, a format for organizing and describing outputs of neuroimaging experiments. Sci Data 2016; 3:160044. [PMID: 27326542 PMCID: PMC4978148 DOI: 10.1038/sdata.2016.44] [Citation(s) in RCA: 793] [Impact Index Per Article: 99.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2015] [Accepted: 05/19/2016] [Indexed: 11/15/2022] Open
Abstract
The development of magnetic resonance imaging (MRI) techniques has defined modern neuroimaging. Since its inception, tens of thousands of studies using techniques such as functional MRI and diffusion weighted imaging have allowed for the non-invasive study of the brain. Despite the fact that MRI is routinely used to obtain data for neuroscience research, there has been no widely adopted standard for organizing and describing the data collected in an imaging experiment. This renders sharing and reusing data (within or between labs) difficult if not impossible and unnecessarily complicates the application of automatic pipelines and quality assurance protocols. To solve this problem, we have developed the Brain Imaging Data Structure (BIDS), a standard for organizing and describing MRI datasets. The BIDS standard uses file formats compatible with existing software, unifies the majority of practices already common in the field, and captures the metadata necessary for most common data processing operations.
Collapse
|
16
|
Das S, Glatard T, MacIntyre LC, Madjar C, Rogers C, Rousseau ME, Rioux P, MacFarlane D, Mohades Z, Gnanasekaran R, Makowski C, Kostopoulos P, Adalat R, Khalili-Mahani N, Niso G, Moreau JT, Evans AC. The MNI data-sharing and processing ecosystem. Neuroimage 2015; 124:1188-1195. [PMID: 26364860 DOI: 10.1016/j.neuroimage.2015.08.076] [Citation(s) in RCA: 40] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2015] [Revised: 08/22/2015] [Accepted: 08/24/2015] [Indexed: 11/29/2022] Open
Abstract
Neuroimaging has been facing a data deluge characterized by the exponential growth of both raw and processed data. As a result, mining the massive quantities of digital data collected in these studies offers unprecedented opportunities and has become paramount for today's research. As the neuroimaging community enters the world of "Big Data", there has been a concerted push for enhanced sharing initiatives, whether within a multisite study, across studies, or federated and shared publicly. This article will focus on the database and processing ecosystem developed at the Montreal Neurological Institute (MNI) to support multicenter data acquisition both nationally and internationally, create database repositories, facilitate data-sharing initiatives, and leverage existing software toolkits for large-scale data processing.
Collapse
Affiliation(s)
- Samir Das
- McGill Centre for Integrative Neuroscience (MCIN), Ludmer Centre for Neuroinformatics and Mental Health, Montreal Neurological Institute (MNI), McGill University, Montreal, Canada.
| | - Tristan Glatard
- McGill Centre for Integrative Neuroscience (MCIN), Ludmer Centre for Neuroinformatics and Mental Health, Montreal Neurological Institute (MNI), McGill University, Montreal, Canada; Université de Lyon, CREATIS ; CNRS UMR5220 ; Inserm U1044 ; INSA-Lyon ; Université Claude Bernard Lyon 1, France
| | - Leigh C MacIntyre
- McGill Centre for Integrative Neuroscience (MCIN), Ludmer Centre for Neuroinformatics and Mental Health, Montreal Neurological Institute (MNI), McGill University, Montreal, Canada
| | - Cecile Madjar
- Douglas Mental Health University Institute, Montreal, Canada
| | - Christine Rogers
- McGill Centre for Integrative Neuroscience (MCIN), Ludmer Centre for Neuroinformatics and Mental Health, Montreal Neurological Institute (MNI), McGill University, Montreal, Canada
| | - Marc-Etienne Rousseau
- McGill Centre for Integrative Neuroscience (MCIN), Ludmer Centre for Neuroinformatics and Mental Health, Montreal Neurological Institute (MNI), McGill University, Montreal, Canada
| | - Pierre Rioux
- McGill Centre for Integrative Neuroscience (MCIN), Ludmer Centre for Neuroinformatics and Mental Health, Montreal Neurological Institute (MNI), McGill University, Montreal, Canada
| | - Dave MacFarlane
- McGill Centre for Integrative Neuroscience (MCIN), Ludmer Centre for Neuroinformatics and Mental Health, Montreal Neurological Institute (MNI), McGill University, Montreal, Canada
| | - Zia Mohades
- McGill Centre for Integrative Neuroscience (MCIN), Ludmer Centre for Neuroinformatics and Mental Health, Montreal Neurological Institute (MNI), McGill University, Montreal, Canada
| | - Rathi Gnanasekaran
- McGill Centre for Integrative Neuroscience (MCIN), Ludmer Centre for Neuroinformatics and Mental Health, Montreal Neurological Institute (MNI), McGill University, Montreal, Canada
| | - Carolina Makowski
- McGill Centre for Integrative Neuroscience (MCIN), Ludmer Centre for Neuroinformatics and Mental Health, Montreal Neurological Institute (MNI), McGill University, Montreal, Canada
| | - Penelope Kostopoulos
- McGill Centre for Integrative Neuroscience (MCIN), Ludmer Centre for Neuroinformatics and Mental Health, Montreal Neurological Institute (MNI), McGill University, Montreal, Canada
| | - Reza Adalat
- McGill Centre for Integrative Neuroscience (MCIN), Ludmer Centre for Neuroinformatics and Mental Health, Montreal Neurological Institute (MNI), McGill University, Montreal, Canada
| | - Najmeh Khalili-Mahani
- McGill Centre for Integrative Neuroscience (MCIN), Ludmer Centre for Neuroinformatics and Mental Health, Montreal Neurological Institute (MNI), McGill University, Montreal, Canada
| | - Guiomar Niso
- McConnell Brain Imaging Centre (BIC), Montreal Neurological Institute (MNI), McGill University, Montreal, Canada
| | - Jeremy T Moreau
- McConnell Brain Imaging Centre (BIC), Montreal Neurological Institute (MNI), McGill University, Montreal, Canada
| | - Alan C Evans
- McGill Centre for Integrative Neuroscience (MCIN), Ludmer Centre for Neuroinformatics and Mental Health, Montreal Neurological Institute (MNI), McGill University, Montreal, Canada
| |
Collapse
|
17
|
Neuroinformatics Software Applications Supporting Electronic Data Capture, Management, and Sharing for the Neuroimaging Community. Neuropsychol Rev 2015; 25:356-68. [PMID: 26267019 DOI: 10.1007/s11065-015-9293-x] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2015] [Accepted: 07/24/2015] [Indexed: 10/22/2022]
Abstract
Accelerating insight into the relation between brain and behavior entails conducting small and large-scale research endeavors that lead to reproducible results. Consensus is emerging between funding agencies, publishers, and the research community that data sharing is a fundamental requirement to ensure all such endeavors foster data reuse and fuel reproducible discoveries. Funding agency and publisher mandates to share data are bolstered by a growing number of data sharing efforts that demonstrate how information technologies can enable meaningful data reuse. Neuroinformatics evaluates scientific needs and develops solutions to facilitate the use of data across the cognitive and neurosciences. For example, electronic data capture and management tools designed to facilitate human neurocognitive research can decrease the setup time of studies, improve quality control, and streamline the process of harmonizing, curating, and sharing data across data repositories. In this article we outline the advantages and disadvantages of adopting software applications that support these features by reviewing the tools available and then presenting two contrasting neuroimaging study scenarios in the context of conducting a cross-sectional and a multisite longitudinal study.
Collapse
|
18
|
The PLORAS Database: A data repository for Predicting Language Outcome and Recovery After Stroke. Neuroimage 2015; 124:1208-1212. [PMID: 25882753 PMCID: PMC4658335 DOI: 10.1016/j.neuroimage.2015.03.083] [Citation(s) in RCA: 76] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2015] [Revised: 03/11/2015] [Accepted: 03/29/2015] [Indexed: 11/21/2022] Open
Abstract
The PLORAS Database is a relational repository of anatomical and functional imaging data that has primarily been acquired from stroke survivors, along with standardized scores on a wide range of sensory, motor and cognitive abilities, demographic details and medical history. As of January 2015, we have data from 750 patients with an expected accrual rate of 200 patients per year. Expansion will accelerate as we extend our collaborations. The main aim of the database is to Predict Language Outcome and Recovery After Stroke (PLORAS) on the basis of a single structural (anatomical) brain scan that indexes the stereotactic location and extent of brain damage. Predictions are made for individual patients by indicating how other patients with the most similar brain damage, cognitive abilities and demographic details recovered their language skills over time. Predictions are validated by longitudinal follow-ups of patients who initially presented with speech and language difficulties. The PLORAS Database can also be used to predict recovery of other cognitive abilities on the basis of anatomical brain scans. The functional imaging data can be used to understand the neural mechanisms that support recovery from brain damage; and all the data can be used to understand the main sources of inter-subject variability in structure–function mappings in the human brain. Data will be made available for sharing, subject to: funding, ethical approval and patient consent. The PLORAS Database is a repository of data from hundreds of stroke patients. Lesion site is identified from T1-weighted structural MRI scans. Impairments are assessed using the Comprehensive Aphasia Test. Functional MRI data are collected from 14 different speech and language tasks. All data contribute to understanding and modeling inter-subject variability.
Collapse
|
19
|
Nguyen TD, Raniga P, Barnes DG, Egan GF. Design, implementation and operation of a multimodality research imaging informatics repository. Health Inf Sci Syst 2015; 3:S6. [PMID: 25870760 PMCID: PMC4383058 DOI: 10.1186/2047-2501-3-s1-s6] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
Background Biomedical imaging research increasingly involves acquiring, managing and processing large amounts of distributed imaging data. Integrated systems that combine data, meta-data and workflows are crucial for realising the opportunities presented by advances in imaging facilities. Methods This paper describes the design, implementation and operation of a multi-modality research imaging data management system that manages imaging data obtained from biomedical imaging scanners operated at Monash Biomedical Imaging (MBI), Monash University in Melbourne, Australia. In addition to Digital Imaging and Communications in Medicine (DICOM) images, raw data and non-DICOM biomedical data can be archived and distributed by the system. Imaging data are annotated with meta-data according to a study-centric data model and, therefore, scientific users can find, download and process data easily. Results The research imaging data management system ensures long-term usability, integrity inter-operability and integration of large imaging data. Research users can securely browse and download stored images and data, and upload processed data via subject-oriented informatics frameworks including the Distributed and Reflective Informatics System (DaRIS), and the Extensible Neuroimaging Archive Toolkit (XNAT).
Collapse
Affiliation(s)
- Toan D Nguyen
- Monash Biomedical Imaging, Monash University, Melbourne, Australia.,Monash e-Research Centre, Monash University, Melbourne, Australia
| | - Parnesh Raniga
- Monash Biomedical Imaging, Monash University, Melbourne, Australia.,CSIRO, Melbourne, Australia
| | - David G Barnes
- Monash Biomedical Imaging, Monash University, Melbourne, Australia.,Monash e-Research Centre, Monash University, Melbourne, Australia.,Faculty of Information Technology, Monash University; VLSCI Life Sciences Computation Centre, Melbourne, Australia
| | - Gary F Egan
- Monash Biomedical Imaging, Monash University, Melbourne, Australia.,School of Psychology and Psychiatry, Monash University, Melbourne, Australia
| |
Collapse
|
20
|
Abstract
The maturation of in vivo neuroimaging has led to incredible quantities of digital information about the human brain. While much is made of the data deluge in science, neuroimaging represents the leading edge of this onslaught of "big data". A range of neuroimaging databasing approaches has streamlined the transmission, storage, and dissemination of data from such brain imaging studies. Yet few, if any, common solutions exist to support the science of neuroimaging. In this article, we discuss how modern neuroimaging research represents a multifactorial and broad ranging data challenge, involving the growing size of the data being acquired; sociological and logistical sharing issues; infrastructural challenges for multi-site, multi-datatype archiving; and the means by which to explore and mine these data. As neuroimaging advances further, e.g. aging, genetics, and age-related disease, new vision is needed to manage and process this information while marshalling of these resources into novel results. Thus, "big data" can become "big" brain science.
Collapse
|
21
|
Making big data open: data sharing in neuroimaging. Nat Neurosci 2014; 17:1510-7. [PMID: 25349916 DOI: 10.1038/nn.3818] [Citation(s) in RCA: 235] [Impact Index Per Article: 23.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2014] [Accepted: 08/21/2014] [Indexed: 12/11/2022]
|
22
|
Goh SYM, Irimia A, Torgerson CM, Horn JDV. Neuroinformatics challenges to the structural, connectomic, functional and electrophysiological multimodal imaging of human traumatic brain injury. Front Neuroinform 2014; 8:19. [PMID: 24616696 PMCID: PMC3935464 DOI: 10.3389/fninf.2014.00019] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2013] [Accepted: 02/11/2014] [Indexed: 01/14/2023] Open
Abstract
Throughout the past few decades, the ability to treat and rehabilitate traumatic brain injury (TBI) patients has become critically reliant upon the use of neuroimaging to acquire adequate knowledge of injury-related effects upon brain function and recovery. As a result, the need for TBI neuroimaging analysis methods has increased in recent years due to the recognition that spatiotemporal computational analyses of TBI evolution are useful for capturing the effects of TBI dynamics. At the same time, however, the advent of such methods has brought about the need to analyze, manage, and integrate TBI neuroimaging data using informatically inspired approaches which can take full advantage of their large dimensionality and informational complexity. Given this perspective, we here discuss the neuroinformatics challenges for TBI neuroimaging analysis in the context of structural, connectivity, and functional paradigms. Within each of these, the availability of a wide range of neuroimaging modalities can be leveraged to fully understand the heterogeneity of TBI pathology; consequently, large-scale computer hardware resources and next-generation processing software are often required for efficient data storage, management, and analysis of TBI neuroimaging data. However, each of these paradigms poses challenges in the context of informatics such that the ability to address them is critical for augmenting current capabilities to perform neuroimaging analysis of TBI and to improve therapeutic efficacy.
Collapse
Affiliation(s)
- S Y Matthew Goh
- Department of Neurology, Institute for Neuroimaging and Informatics, Keck School of Medicine, University of Southern California Los Angeles, CA, USA
| | - Andrei Irimia
- Department of Neurology, Institute for Neuroimaging and Informatics, Keck School of Medicine, University of Southern California Los Angeles, CA, USA
| | - Carinna M Torgerson
- Department of Neurology, Institute for Neuroimaging and Informatics, Keck School of Medicine, University of Southern California Los Angeles, CA, USA
| | - John D Van Horn
- Department of Neurology, Institute for Neuroimaging and Informatics, Keck School of Medicine, University of Southern California Los Angeles, CA, USA
| |
Collapse
|
23
|
Nichols BN, Mejino JL, Detwiler LT, Nilsen TT, Martone ME, Turner JA, Rubin DL, Brinkley JF. Neuroanatomical domain of the foundational model of anatomy ontology. J Biomed Semantics 2014; 5:1. [PMID: 24398054 PMCID: PMC3944952 DOI: 10.1186/2041-1480-5-1] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2013] [Accepted: 12/24/2013] [Indexed: 11/10/2022] Open
Abstract
Background The diverse set of human brain structure and function analysis methods represents a difficult challenge for reconciling multiple views of neuroanatomical organization. While different views of organization are expected and valid, no widely adopted approach exists to harmonize different brain labeling protocols and terminologies. Our approach uses the natural organizing framework provided by anatomical structure to correlate terminologies commonly used in neuroimaging. Description The Foundational Model of Anatomy (FMA) Ontology provides a semantic framework for representing the anatomical entities and relationships that constitute the phenotypic organization of the human body. In this paper we describe recent enhancements to the neuroanatomical content of the FMA that models cytoarchitectural and morphological regions of the cerebral cortex, as well as white matter structure and connectivity. This modeling effort is driven by the need to correlate and reconcile the terms used in neuroanatomical labeling protocols. By providing an ontological framework that harmonizes multiple views of neuroanatomical organization, the FMA provides developers with reusable and computable knowledge for a range of biomedical applications. Conclusions A requirement for facilitating the integration of basic and clinical neuroscience data from diverse sources is a well-structured ontology that can incorporate, organize, and associate neuroanatomical data. We applied the ontological framework of the FMA to align the vocabularies used by several human brain atlases, and to encode emerging knowledge about structural connectivity in the brain. We highlighted several use cases of these extensions, including ontology reuse, neuroimaging data annotation, and organizing 3D brain models.
Collapse
|
24
|
Muehlboeck JS, Westman E, Simmons A. TheHiveDB image data management and analysis framework. Front Neuroinform 2014; 7:49. [PMID: 24432000 PMCID: PMC3880907 DOI: 10.3389/fninf.2013.00049] [Citation(s) in RCA: 69] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2013] [Accepted: 12/19/2013] [Indexed: 01/25/2023] Open
Abstract
The hive database system (theHiveDB) is a web-based brain imaging database, collaboration, and activity system which has been designed as an imaging workflow management system capable of handling cross-sectional and longitudinal multi-center studies. It can be used to organize and integrate existing data from heterogeneous projects as well as data from ongoing studies. It has been conceived to guide and assist the researcher throughout the entire research process, integrating all relevant types of data across modalities (e.g., brain imaging, clinical, and genetic data). TheHiveDB is a modern activity and resource management system capable of scheduling image processing on both private compute resources and the cloud. The activity component supports common image archival and management tasks as well as established pipeline processing (e.g., Freesurfer for extraction of scalar measures from magnetic resonance images). Furthermore, via theHiveDB activity system algorithm developers may grant access to virtual machines hosting versioned releases of their tools to collaborators and the imaging community. The application of theHiveDB is illustrated with a brief use case based on organizing, processing, and analyzing data from the publically available Alzheimer Disease Neuroimaging Initiative.
Collapse
Affiliation(s)
- J-Sebastian Muehlboeck
- Department of Neuroimaging, Institute of Psychiatry, King's College London London, UK ; Department of Neurobiology, Care Sciences and Society, Karolinska Institutet Stockholm, Sweden ; J-S Muehlboeck Inc., Montreal QC, Canada
| | - Eric Westman
- Department of Neuroimaging, Institute of Psychiatry, King's College London London, UK ; Department of Neurobiology, Care Sciences and Society, Karolinska Institutet Stockholm, Sweden
| | - Andrew Simmons
- Department of Neuroimaging, Institute of Psychiatry, King's College London London, UK ; NIHR Biomedical Research Centre for Mental Health, King's College London London, UK ; NIHR Biomedical Research Unit for Dementia, King's College London London, UK
| |
Collapse
|
25
|
Shin DD, Ozyurt IB, Liu TT. The Cerebral Blood Flow Biomedical Informatics Research Network (CBFBIRN) database and analysis pipeline for arterial spin labeling MRI data. Front Neuroinform 2013; 7:21. [PMID: 24151465 PMCID: PMC3798866 DOI: 10.3389/fninf.2013.00021] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2013] [Accepted: 09/26/2013] [Indexed: 11/13/2022] Open
Abstract
Arterial spin labeling (ASL) is a magnetic resonance imaging technique that provides a non-invasive and quantitative measure of cerebral blood flow (CBF). After more than a decade of active research, ASL is now emerging as a robust and reliable CBF measurement technique with increased availability and ease of use. There is a growing number of research and clinical sites using ASL for neuroscience research and clinical care. In this paper, we present an online CBF Database and Analysis Pipeline, collectively called the Cerebral Blood Flow Biomedical Informatics Research Network (CBFBIRN) that allows researchers to upload and share ASL and clinical data. In addition to serving the role as a central data repository, the CBFBIRN provides a streamlined data processing infrastructure for CBF quantification and group analysis, which has the potential to accelerate the discovery of new scientific and clinical knowledge. All capabilities and features built into the CBFBIRN are accessed online using a web browser through a secure login. In this work, we begin with a general description of the CBFBIRN system data model and its architecture, then devote the remainder of the paper to the CBFBIRN capabilities. The latter part of our work is divided into two processing modules: (1) Data Upload and CBF Quantification Module; (2) Group Analysis Module that supports three types of analysis commonly used in neuroscience research. To date, the CBFBIRN hosts CBF maps and associated clinical data from more than 1,300 individual subjects. The data have been contributed by more than 20 different research studies, investigating the effect of various conditions on CBF including Alzheimer’s, schizophrenia, bipolar disorder, depression, traumatic brain injury, HIV, caffeine usage, and methamphetamine abuse. Several example results, generated by the CBFBIRN processing modules, are presented. We conclude with the lessons learned during implementation and deployment of the CBFBIRN and our experience in promoting data sharing.
Collapse
Affiliation(s)
- David D Shin
- Center for Functional Magnetic Resonance Imaging, University of California at San Diego La Jolla, CA, USA
| | | | | |
Collapse
|
26
|
Lin MK, Nicolini O, Waxenegger H, Galloway GJ, Ullmann JFP, Janke AL. Interpretation of medical imaging data with a mobile application: a mobile digital imaging processing environment. Front Neurol 2013; 4:85. [PMID: 23847587 PMCID: PMC3701154 DOI: 10.3389/fneur.2013.00085] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2013] [Accepted: 06/19/2013] [Indexed: 11/28/2022] Open
Abstract
Digital Imaging Processing (DIP) requires data extraction and output from a visualization tool to be consistent. Data handling and transmission between the server and a user is a systematic process in service interpretation. The use of integrated medical services for management and viewing of imaging data in combination with a mobile visualization tool can be greatly facilitated by data analysis and interpretation. This paper presents an integrated mobile application and DIP service, called M-DIP. The objective of the system is to (1) automate the direct data tiling, conversion, pre-tiling of brain images from Medical Imaging NetCDF (MINC), Neuroimaging Informatics Technology Initiative (NIFTI) to RAW formats; (2) speed up querying of imaging measurement; and (3) display high-level of images with three dimensions in real world coordinates. In addition, M-DIP provides the ability to work on a mobile or tablet device without any software installation using web-based protocols. M-DIP implements three levels of architecture with a relational middle-layer database, a stand-alone DIP server, and a mobile application logic middle level realizing user interpretation for direct querying and communication. This imaging software has the ability to display biological imaging data at multiple zoom levels and to increase its quality to meet users’ expectations. Interpretation of bioimaging data is facilitated by an interface analogous to online mapping services using real world coordinate browsing. This allows mobile devices to display multiple datasets simultaneously from a remote site. M-DIP can be used as a measurement repository that can be accessed by any network environment, such as a portable mobile or tablet device. In addition, this system and combination with mobile applications are establishing a virtualization tool in the neuroinformatics field to speed interpretation services.
Collapse
Affiliation(s)
- Meng Kuan Lin
- Centre for Advanced Imaging, The University of Queensland , Brisbane, QLD , Australia
| | | | | | | | | | | |
Collapse
|
27
|
Bui AAT, Hsu W, Arnold C, El-Saden S, Aberle DR, Taira RK. Imaging-based observational databases for clinical problem solving: the role of informatics. J Am Med Inform Assoc 2013; 20:1053-8. [PMID: 23775172 DOI: 10.1136/amiajnl-2012-001340] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022] Open
Abstract
Imaging has become a prevalent tool in the diagnosis and treatment of many diseases, providing a unique in vivo, multi-scale view of anatomic and physiologic processes. With the increased use of imaging and its progressive technical advances, the role of imaging informatics is now evolving--from one of managing images, to one of integrating the full scope of clinical information needed to contextualize and link observations across phenotypic and genotypic scales. Several challenges exist for imaging informatics, including the need for methods to transform clinical imaging studies and associated data into structured information that can be organized and analyzed. We examine some of these challenges in establishing imaging-based observational databases that can support the creation of comprehensive disease models. The development of these databases and ensuing models can aid in medical decision making and knowledge discovery and ultimately, transform the use of imaging to support individually-tailored patient care.
Collapse
Affiliation(s)
- Alex A T Bui
- Medical Imaging Informatics (MII) Group, Department of Radiological Sciences, UCLA David Geffen School of Medicine, Los Angeles, California, USA
| | | | | | | | | | | |
Collapse
|
28
|
Gorgolewski KJ, Margulies DS, Milham MP. Making data sharing count: a publication-based solution. Front Neurosci 2013; 7:9. [PMID: 23390412 PMCID: PMC3565154 DOI: 10.3389/fnins.2013.00009] [Citation(s) in RCA: 72] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2012] [Accepted: 01/11/2013] [Indexed: 11/22/2022] Open
Abstract
The neuroimaging community has been increasingly called up to openly share data. Although data sharing has been a cornerstone of large-scale data consortia, the incentive for the individual researcher remains unclear. Other fields have benefited from embracing a data publication form – the data paper – that allows researchers to publish their datasets as a citable scientific publication. Such publishing mechanisms both give credit that is recognizable within the scientific ecosystem, and also ensure the quality of the published data and metadata through the peer review process. We discuss the specific challenges of adapting data papers to the needs of the neuroimaging community, and we propose guidelines for the structure as well as review process.
Collapse
|
29
|
Schwartz Y, Barbot A, Thyreau B, Frouin V, Varoquaux G, Siram A, Marcus DS, Poline JB. PyXNAT: XNAT in Python. Front Neuroinform 2012; 6:12. [PMID: 22654752 PMCID: PMC3354345 DOI: 10.3389/fninf.2012.00012] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2011] [Accepted: 03/28/2012] [Indexed: 11/13/2022] Open
Abstract
As neuroimaging databases grow in size and complexity, the time researchers spend investigating and managing the data increases to the expense of data analysis. As a result, investigators rely more and more heavily on scripting using high-level languages to automate data management and processing tasks. For this, a structured and programmatic access to the data store is necessary. Web services are a first step toward this goal. They however lack in functionality and ease of use because they provide only low-level interfaces to databases. We introduce here PyXNAT, a Python module that interacts with The Extensible Neuroimaging Archive Toolkit (XNAT) through native Python calls across multiple operating systems. The choice of Python enables PyXNAT to expose the XNAT Web Services and unify their features with a higher level and more expressive language. PyXNAT provides XNAT users direct access to all the scientific packages in Python. Finally PyXNAT aims to be efficient and easy to use, both as a back-end library to build XNAT clients and as an alternative front-end from the command line.
Collapse
|
30
|
Turner JA, Van Horn JD. Electronic data capture, representation, and applications for neuroimaging. Front Neuroinform 2012; 6:16. [PMID: 22586393 PMCID: PMC3345526 DOI: 10.3389/fninf.2012.00016] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2012] [Accepted: 04/11/2012] [Indexed: 11/24/2022] Open
Affiliation(s)
- Jessica A Turner
- Mind Research Network, University of New Mexico Albuquerque, NM, USA
| | | |
Collapse
|
31
|
Data-Brain driven systematic human brain data analysis: A case study in numerical inductive reasoning centric investigation. COGN SYST RES 2012. [DOI: 10.1016/j.cogsys.2010.12.014] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
32
|
Bowman I, Joshi SH, Van Horn JD. Visual systems for interactive exploration and mining of large-scale neuroimaging data archives. Front Neuroinform 2012; 6:11. [PMID: 22536181 PMCID: PMC3332235 DOI: 10.3389/fninf.2012.00011] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2011] [Accepted: 03/19/2012] [Indexed: 02/05/2023] Open
Abstract
While technological advancements in neuroimaging scanner engineering have improved the efficiency of data acquisition, electronic data capture methods will likewise significantly expedite the populating of large-scale neuroimaging databases. As they do and these archives grow in size, a particular challenge lies in examining and interacting with the information that these resources contain through the development of compelling, user-driven approaches for data exploration and mining. In this article, we introduce the informatics visualization for neuroimaging (INVIZIAN) framework for the graphical rendering of, and dynamic interaction with the contents of large-scale neuroimaging data sets. We describe the rationale behind INVIZIAN, detail its development, and demonstrate its usage in examining a collection of over 900 T1-anatomical magnetic resonance imaging (MRI) image volumes from across a diverse set of clinical neuroimaging studies drawn from a leading neuroimaging database. Using a collection of cortical surface metrics and means for examining brain similarity, INVIZIAN graphically displays brain surfaces as points in a coordinate space and enables classification of clusters of neuroanatomically similar MRI images and data mining. As an initial step toward addressing the need for such user-friendly tools, INVIZIAN provides a highly unique means to interact with large quantities of electronic brain imaging archives in ways suitable for hypothesis generation and data mining.
Collapse
Affiliation(s)
- Ian Bowman
- Laboratory of Neuro Imaging, Department of Neurology, David Geffen School of Medicine, University of California Los Angeles, CA, USA
| | | | | |
Collapse
|
33
|
Poline JB, Breeze JL, Ghosh S, Gorgolewski K, Halchenko YO, Hanke M, Haselgrove C, Helmer KG, Keator DB, Marcus DS, Poldrack RA, Schwartz Y, Ashburner J, Kennedy DN. Data sharing in neuroimaging research. Front Neuroinform 2012; 6:9. [PMID: 22493576 PMCID: PMC3319918 DOI: 10.3389/fninf.2012.00009] [Citation(s) in RCA: 148] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2011] [Accepted: 03/09/2012] [Indexed: 11/13/2022] Open
Abstract
Significant resources around the world have been invested in neuroimaging studies of brain function and disease. Easier access to this large body of work should have profound impact on research in cognitive neuroscience and psychiatry, leading to advances in the diagnosis and treatment of psychiatric and neurological disease. A trend toward increased sharing of neuroimaging data has emerged in recent years. Nevertheless, a number of barriers continue to impede momentum. Many researchers and institutions remain uncertain about how to share data or lack the tools and expertise to participate in data sharing. The use of electronic data capture (EDC) methods for neuroimaging greatly simplifies the task of data collection and has the potential to help standardize many aspects of data sharing. We review here the motivations for sharing neuroimaging data, the current data sharing landscape, and the sociological or technical barriers that still need to be addressed. The INCF Task Force on Neuroimaging Datasharing, in conjunction with several collaborative groups around the world, has started work on several tools to ease and eventually automate the practice of data sharing. It is hoped that such tools will allow researchers to easily share raw, processed, and derived neuroimaging data, with appropriate metadata and provenance records, and will improve the reproducibility of neuroimaging studies. By providing seamless integration of data sharing and analysis tools within a commodity research environment, the Task Force seeks to identify and minimize barriers to data sharing in the field of neuroimaging.
Collapse
Affiliation(s)
- Jean-Baptiste Poline
- Neurospin, Commissariat à l'Energie Atomique et aux Energies Alternatives Gif-sur-Yvette, France
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
34
|
Abstract
In neuroscience, collaboration and data sharing are undermined by concerns over the management of protected health information (PHI) and personal identifying information (PII) in neuroimage datasets. The HIPAA Privacy Rule mandates measures for the preservation of subject privacy in neuroimaging studies. Unfortunately for the researcher, the management of information privacy is a burdensome task. Wide scale data sharing of neuroimages is challenging for three primary reasons: (i) A dearth of tools to systematically expunge PHI/PII from neuroimage data sets, (ii) a facility for tracking patient identities in redacted datasets has not been produced, and (iii) a sanitization workflow remains conspicuously absent. This article describes the XNAT Redaction Toolkit-an integrated redaction workflow which extends a popular neuroimage data management toolkit to remove PHI/PII from neuroimages. Quickshear defacing is also presented as a complementary technique for deidentifying the image data itself. Together, these tools improve subject privacy through systematic removal of PII/PHI.
Collapse
Affiliation(s)
- Matt Matlock
- Center for Biological Systems Engineering, Department of Pathology and Immunology, Washington University School of Medicine
| | | | | | | | | |
Collapse
|
35
|
Dickie DA, Job DE, Poole I, Ahearn TS, Staff RT, Murray AD, Wardlaw JM. Do brain image databanks support understanding of normal ageing brain structure? A systematic review. Eur Radiol 2012; 22:1385-94. [PMID: 22354559 DOI: 10.1007/s00330-012-2392-7] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2011] [Revised: 12/05/2011] [Accepted: 12/29/2011] [Indexed: 10/28/2022]
Abstract
OBJECTIVE To document accessible magnetic resonance (MR) brain images, metadata and statistical results from normal older subjects that may be used to improve diagnoses of dementia. METHODS We systematically reviewed published brain image databanks (print literature and Internet) concerned with normal ageing brain structure. RESULTS From nine eligible databanks, there appeared to be 944 normal subjects aged ≥60 years. However, many subjects were in more than one databank and not all were fully representative of normal ageing clinical characteristics. Therefore, there were approximately 343 subjects aged ≥60 years with metadata representative of normal ageing, but only 98 subjects were openly accessible. No databank had the range of MR image sequences, e.g. T2*, fluid-attenuated inversion recovery (FLAIR), required to effectively characterise the features of brain ageing. No databank supported random subject retrieval; therefore, manual selection bias and errors may occur in studies that use these subjects as controls. Finally, no databank stored results from statistical analyses of its brain image and metadata that may be validated with analyses of further data. CONCLUSION Brain image databanks require open access, more subjects, metadata, MR image sequences, searchability and statistical results to improve understanding of normal ageing brain structure and diagnoses of dementia. KEY POINTS • We reviewed databanks with structural MR brain images of normal older people. • Among these nine databanks, 98 normal subjects ≥60 years were openly accessible. • None had all the required sequences, random subject retrieval or statistical results. • More access, subjects, sequences, metadata, searchability and results are needed. • These may improve understanding of normal brain ageing and diagnoses of dementia.
Collapse
Affiliation(s)
- David Alexander Dickie
- Division of Clinical Neurosciences, Western General Hospital, Brain Research Imaging Centre (BRIC), University of Edinburgh, Crewe Road, Edinburgh, EH4 2XU, UK.
| | | | | | | | | | | | | |
Collapse
|
36
|
Scott NA, Murphy TH, Illes J. Incidental findings in neuroimaging research: a framework for anticipating the next frontier. J Empir Res Hum Res Ethics 2012; 7:53-7. [PMID: 22378134 PMCID: PMC10460148 DOI: 10.1525/jer.2012.7.1.53] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
While strategies for handling unusual and possibly clinically significant anatomical findings on brain scans of research volunteers have been developed and implemented across neuroimaging laboratories worldwide, few concrete steps have been taken to consider the next frontier: functional anomalies. Drawing on the genetics literature, early work in neuroimaging considered actionability to be a driving force for determining if and when findings should be disclosed to individuals in whom they are detected, as inherent uncertainty raises potential ethical dilemmas of misdiagnosing and mislabelling people as patients. Here we consider the possibility of incidental findings in brain function during the resting state. Our approach does not anchor the resting state as the sine qua non of functional incidental findings, but as a path to thinking about where they may emerge in the future and how our professional communities need to think about thinking about them. We suggest that considering the issues proactively today, within a framework that is maximally flexible and open to modification, is better than responding reactively after the fact and with no framework at all. We argue that there is a duty to consider possible incidental findings despite the ambiguities of data interpretation, while working hard to prevent unnecessary alarm.
Collapse
Affiliation(s)
- Nadia A Scott
- National Core for Neuroethics, Division of Neurology, Department of Medicine, The University of British Columbia, 2211 Wesbrook Mall, Koerner, Vancouver, BC, Canada
| | | | | |
Collapse
|
37
|
Das S, Zijdenbos AP, Harlap J, Vins D, Evans AC. LORIS: a web-based data management system for multi-center studies. Front Neuroinform 2012; 5:37. [PMID: 22319489 PMCID: PMC3262165 DOI: 10.3389/fninf.2011.00037] [Citation(s) in RCA: 78] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2011] [Accepted: 12/21/2011] [Indexed: 12/04/2022] Open
Abstract
Longitudinal Online Research and Imaging System (LORIS) is a modular and extensible web-based data management system that integrates all aspects of a multi-center study: from heterogeneous data acquisition (imaging, clinical, behavior, and genetics) to storage, processing, and ultimately dissemination. It provides a secure, user-friendly, and streamlined platform to automate the flow of clinical trials and complex multi-center studies. A subject-centric internal organization allows researchers to capture and subsequently extract all information, longitudinal or cross-sectional, from any subset of the study cohort. Extensive error-checking and quality control procedures, security, data management, data querying, and administrative functions provide LORIS with a triple capability (1) continuous project coordination and monitoring of data acquisition (2) data storage/cleaning/querying, (3) interface with arbitrary external data processing “pipelines.” LORIS is a complete solution that has been thoroughly tested through a full 10 year life cycle of a multi-center longitudinal project1 and is now supporting numerous international neurodevelopment and neurodegeneration research projects.
Collapse
Affiliation(s)
- Samir Das
- Montreal Neurological Institute, McGill University Montreal, Canada
| | | | | | | | | |
Collapse
|
38
|
Scott A, Courtney W, Wood D, de la Garza R, Lane S, King M, Wang R, Roberts J, Turner JA, Calhoun VD. COINS: An Innovative Informatics and Neuroimaging Tool Suite Built for Large Heterogeneous Datasets. Front Neuroinform 2011; 5:33. [PMID: 22275896 PMCID: PMC3250631 DOI: 10.3389/fninf.2011.00033] [Citation(s) in RCA: 142] [Impact Index Per Article: 10.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2011] [Accepted: 12/02/2011] [Indexed: 11/23/2022] Open
Abstract
The availability of well-characterized neuroimaging data with large numbers of subjects, especially for clinical populations, is critical to advancing our understanding of the healthy and diseased brain. Such data enables questions to be answered in a much more generalizable manner and also has the potential to yield solutions derived from novel methods that were conceived after the original studies’ implementation. Though there is currently growing interest in data sharing, the neuroimaging community has been struggling for years with how to best encourage sharing data across brain imaging studies. With the advent of studies that are much more consistent across sites (e.g., resting functional magnetic resonance imaging, diffusion tensor imaging, and structural imaging) the potential of pooling data across studies continues to gain momentum. At the mind research network, we have developed the collaborative informatics and neuroimaging suite (COINS; http://coins.mrn.org) to provide researchers with an information system based on an open-source model that includes web-based tools to manage studies, subjects, imaging, clinical data, and other assessments. The system currently hosts data from nine institutions, over 300 studies, over 14,000 subjects, and over 19,000 MRI, MEG, and EEG scan sessions in addition to more than 180,000 clinical assessments. In this paper we provide a description of COINS with comparison to a valuable and popular system known as XNAT. Although there are many similarities between COINS and other electronic data management systems, the differences that may concern researchers in the context of multi-site, multi-organizational data sharing environments with intuitive ease of use and PHI security are emphasized as important attributes.
Collapse
Affiliation(s)
- Adam Scott
- The Mind Research Network Albuquerque, NM, USA
| | | | | | | | | | | | | | | | | | | |
Collapse
|
39
|
Nielsen TA, Nilsson H, Matheson T. A formal mathematical framework for physiological observations, experiments and analyses. J R Soc Interface 2011; 9:1040-50. [PMID: 21976637 DOI: 10.1098/rsif.2011.0616] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Experiments can be complex and produce large volumes of heterogeneous data, which make their execution, analysis, independent replication and meta-analysis difficult. We propose a mathematical model for experimentation and analysis in physiology that addresses these problems. We show that experiments can be composed from time-dependent quantities, and be expressed as purely mathematical equations. Our structure for representing physiological observations can carry information of any type and therefore provides a precise ontology for a wide range of observations. Our framework is concise, allowing entire experiments to be defined unambiguously in a few equations. In order to demonstrate that our approach can be implemented, we show the equations that we have used to run and analyse two non-trivial experiments describing visually stimulated neuronal responses and dynamic clamp of vertebrate neurons. Our ideas could provide a theoretical basis for developing new standards of data acquisition, analysis and communication in neurophysiology.
Collapse
|
40
|
Abstract
Neurological imaging represents a powerful paradigm for investigation of brain structure, physiology and function across different scales. The diverse phenotypes and significant normal and pathological brain variability demand reliable and efficient statistical methodologies to model, analyze and interpret raw neurological images and derived geometric information from these images. The validity, reproducibility and power of any statistical brain map require appropriate inference on large cohorts, significant community validation, and multidisciplinary collaborations between physicians, engineers and statisticians.
Collapse
Affiliation(s)
- Ivo D Dinov
- SOCR Resource and Laboratory of Neuro Imaging, UCLA Statistics, 8125 Mathematical Science Bldg, Los Angeles, CA 90095, USA, Tel.: +1 310 825 8430
| |
Collapse
|
41
|
Visscher KM, Weissman DH. Would the field of cognitive neuroscience be advanced by sharing functional MRI data? BMC Med 2011; 9:34. [PMID: 21477286 PMCID: PMC3080821 DOI: 10.1186/1741-7015-9-34] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/21/2010] [Accepted: 04/08/2011] [Indexed: 11/10/2022] Open
Abstract
During the past two decades, the advent of functional magnetic resonance imaging (fMRI) has fundamentally changed our understanding of brain-behavior relationships. However, the data from any one study add only incrementally to the big picture. This fact raises important questions about the dominant practice of performing studies in isolation. To what extent are the findings from any single study reproducible? Are researchers who lack the resources to conduct a fMRI study being needlessly excluded? Is pre-existing fMRI data being used effectively to train new students in the field? Here, we will argue that greater sharing and synthesis of raw fMRI data among researchers would make the answers to all of these questions more favorable to scientific discovery than they are today and that such sharing is an important next step for advancing the field of cognitive neuroscience.
Collapse
Affiliation(s)
- Kristina M Visscher
- Department of Neurobiology, University of Alabama, Birmingham, AL 35294, USA.
| | | |
Collapse
|
42
|
Hawrylycz M, Baldock RA, Burger A, Hashikawa T, Johnson GA, Martone M, Ng L, Lau C, Larsen SD, Nissanov J, Puelles L, Ruffins S, Verbeek F, Zaslavsky I, Boline J. Digital atlasing and standardization in the mouse brain. PLoS Comput Biol 2011; 7:e1001065. [PMID: 21304938 PMCID: PMC3033370 DOI: 10.1371/journal.pcbi.1001065] [Citation(s) in RCA: 75] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open
Affiliation(s)
- Michael Hawrylycz
- Allen Institute for Brain Science, Seattle, Washington, United States of America
| | - Richard A. Baldock
- MRC Human Genetics Unit, Institute of Genetics and Molecular Medicine, Edinburgh, United Kingdom
| | - Albert Burger
- MRC Human Genetics Unit, Edinburgh and Heriot-Watt University, Edinburgh, United Kingdom
| | | | - G. Allan Johnson
- Duke University, Center for In Vivo Microscopy, Durham, North Carolina, United States of America
| | - Maryann Martone
- National Center for Microscopy and Imaging Research (NCMIR), University of California, San Diego, La Jolla, California, United States of America
| | - Lydia Ng
- Allen Institute for Brain Science, Seattle, Washington, United States of America
| | - Chris Lau
- Allen Institute for Brain Science, Seattle, Washington, United States of America
| | - Stephen D. Larsen
- National Center for Microscopy and Imaging Research (NCMIR), University of California, San Diego, La Jolla, California, United States of America
| | - Jonathan Nissanov
- Department of Basic Sciences, Touro University Nevada, College of Osteopathic Medicine, Henderson, Nevada, United States of America
| | - Luis Puelles
- CIBER en Enfermedades Raras 736 and Faculty of Medicine, University of Murcia, Murcia, Spain
| | - Seth Ruffins
- Laboratory of Neuro Imaging (LONI), University of California, Los Angeles, California, United States of America
| | - Fons Verbeek
- Leiden Institute of Advanced Computer Science, Leiden University, Leiden, The Netherlands
| | - Ilya Zaslavsky
- San Diego Supercomputer Center, University of California San Diego, San Diego, California, United States of America
| | - Jyl Boline
- Informed Minds, Wilton Manors, Florida, United States of America
| |
Collapse
|
43
|
Cognitive neuroscience 2.0: building a cumulative science of human brain function. Trends Cogn Sci 2010; 14:489-96. [PMID: 20884276 DOI: 10.1016/j.tics.2010.08.004] [Citation(s) in RCA: 136] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2010] [Revised: 08/28/2010] [Accepted: 08/30/2010] [Indexed: 11/20/2022]
Abstract
Cognitive neuroscientists increasingly recognize that continued progress in understanding human brain function will require not only the acquisition of new data, but also the synthesis and integration of data across studies and laboratories. Here we review ongoing efforts to develop a more cumulative science of human brain function. We discuss the rationale for an increased focus on formal synthesis of the cognitive neuroscience literature, provide an overview of recently developed tools and platforms designed to facilitate the sharing and integration of neuroimaging data, and conclude with a discussion of several emerging developments that hold even greater promise in advancing the study of human brain function.
Collapse
|
44
|
Ito K. Technical and organizational considerations for the long-term maintenance and development of digital brain atlases and web-based databases. Front Syst Neurosci 2010; 4:26. [PMID: 20661458 PMCID: PMC2907256 DOI: 10.3389/fnsys.2010.00026] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2009] [Accepted: 05/31/2010] [Indexed: 11/25/2022] Open
Abstract
Digital brain atlas is a kind of image database that specifically provide information about neurons and glial cells in the brain. It has various advantages that are unmatched by conventional paper-based atlases. Such advantages, however, may become disadvantages if appropriate cares are not taken. Because digital atlases can provide unlimited amount of data, they should be designed to minimize redundancy and keep consistency of the records that may be added incrementally by different staffs. The fact that digital atlases can easily be revised necessitates a system to assure that users can access previous versions that might have been cited in papers at a particular period. To inherit our knowledge to our descendants, such databases should be maintained for a very long period, well over 100 years, like printed books and papers. Technical and organizational measures to enable long-term archive should be considered seriously. Compared to the initial development of the database, subsequent efforts to increase the quality and quantity of its contents are not regarded highly, because such tasks do not materialize in the form of publications. This fact strongly discourages continuous expansion of, and external contributions to, the digital atlases after its initial launch. To solve these problems, the role of the biocurators is vital. Appreciation of the scientific achievements of the people who do not write papers, and establishment of the secure academic career path for them, are indispensable for recruiting talents for this very important job.
Collapse
Affiliation(s)
- Kei Ito
- Institute of Molecular and Cellular Biosciences, The University of Tokyo Tokyo, Japan
| |
Collapse
|
45
|
Price CJ, Seghier ML, Leff AP. Predicting language outcome and recovery after stroke: the PLORAS system. Nat Rev Neurol 2010; 6:202-10. [PMID: 20212513 PMCID: PMC3556582 DOI: 10.1038/nrneurol.2010.15] [Citation(s) in RCA: 98] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
The ability to comprehend and produce speech after stroke depends on whether the areas of the brain that support language have been damaged. Here, we review two different ways to predict language outcome after stroke. The first depends on understanding the neural circuits that support language. This model-based approach is a challenging endeavor because language is a complex cognitive function that involves the interaction of many different brain areas. The second approach, by contrast, does not require an understanding of why a lesion impairs language; instead, predictions are made on the basis of the recovery of previous patients with the same lesion. This approach requires a database that records the speech and language capabilities of a large population of patients who have, collectively, incurred a comprehensive range of focal brain lesions. In addition, a system is required that converts an MRI scan from a new patient into a three-dimensional description of the lesion and compares this lesion against all others on the database. The outputs of this system are the longitudinal language outcomes of corresponding patients in the database. This approach will provide the patient with a range of probable recovery patterns over a variety of language measures.
Collapse
Affiliation(s)
- Cathy J Price
- Wellcome Trust Center for Neuroimaging, Institute of Neurology, University College London, 12 Queen Square, London WC1N 3BG, UK.
| | | | | |
Collapse
|
46
|
Ashburner J, Klöppel S. Multivariate models of inter-subject anatomical variability. Neuroimage 2010; 56:422-39. [PMID: 20347998 PMCID: PMC3084454 DOI: 10.1016/j.neuroimage.2010.03.059] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2009] [Revised: 01/22/2010] [Accepted: 03/19/2010] [Indexed: 11/28/2022] Open
Abstract
This paper presents a very selective review of some of the approaches for multivariate modelling of inter-subject variability among brain images. It focusses on applying probabilistic kernel-based pattern recognition approaches to pre-processed anatomical MRI, with the aim of most accurately modelling the difference between populations of subjects. Some of the principles underlying the pattern recognition approaches of Gaussian process classification and regression are briefly described, although the reader is advised to look elsewhere for full implementational details. Kernel pattern recognition methods require matrices that encode the degree of similarity between the images of each pair of subjects. This review focusses on similarity measures derived from the relative shapes of the subjects' brains. Pre-processing is viewed as generative modelling of anatomical variability, and there is a special emphasis on the diffeomorphic image registration framework, which provides a very parsimonious representation of relative shapes. Although the review is largely methodological, excessive mathematical notation is avoided as far as possible, as the paper attempts to convey a more intuitive understanding of various concepts. The paper should be of interest to readers wishing to apply pattern recognition methods to MRI data, with the aim of clinical diagnosis or biomarker development. It also tries to explain that the best models are those that most accurately predict, so similar approaches should also be relevant to basic science. Knowledge of some basic linear algebra and probability theory should make the review easier to follow, although it may still have something to offer to those readers whose mathematics may be more limited.
Collapse
|
47
|
Tang Y, Hojatkashani C, Dinov ID, Sun B, Fan L, Lin X, Qi H, Hua X, Liu S, Toga AW. The construction of a Chinese MRI brain atlas: a morphometric comparison study between Chinese and Caucasian cohorts. Neuroimage 2010; 51:33-41. [PMID: 20152910 DOI: 10.1016/j.neuroimage.2010.01.111] [Citation(s) in RCA: 113] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2009] [Revised: 01/22/2010] [Accepted: 01/29/2010] [Indexed: 11/29/2022] Open
Abstract
We developed a novel brain atlas template to facilitate computational brain studies of Chinese subjects and populations using high quality magnetic resonance imaging (MRI) and well-validated image analysis techniques. To explore the ethnicity-based structural brain differences, we used the MRI scans of 35 Chinese male subjects (24.03+/-2.06 years) and compared them to an age-matched cohort of 35 Caucasian males (24.03+/-2.06 years). Global volumetric measures were used to identify significant group differences in the brain length, width, height and AC-PC line distance. Using the LONI BrainParser, 56 brain structures were automatically labeled and analyzed for all subjects. We identified significant ethnicity differences in brain structure volumes, suggesting that a population-specific brain atlas may be more appropriate for studies involving Chinese populations. To address this, we constructed a 3D Chinese brain atlas based on high resolution 3.0T MRI scans of 56 right-handed male Chinese volunteers (24.46+/-1.81 years). All Chinese brains were spatially normalized by using linear and nonlinear transformation via the "AIR Make Atlas" pipeline workflow within the LONI pipeline environment. This high-resolution Chinese brain atlas was compared to the ICBM152 template, which was constructed using Caucasian brains.
Collapse
Affiliation(s)
- Yuchun Tang
- Research Center for Sectional and Imaging Anatomy, Shandong University School of Medicine, Jinan, Shandong 250012, China
| | | | | | | | | | | | | | | | | | | |
Collapse
|
48
|
Redolfi A, McClatchey R, Anjum A, Zijdenbos A, Manset D, Barkhof F, Spenger C, Legré Y, Wahlund LO, di San Pietro CB, Frisoni GB. Grid infrastructures for computational neuroscience: the neuGRID example. FUTURE NEUROLOGY 2009. [DOI: 10.2217/fnl.09.53] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Neuroscience is increasingly making use of statistical and mathematical tools to extract information from images of biological tissues. Computational neuroimaging tools require substantial computational resources and the increasing availability of large image datasets will further enhance this need. Many efforts have been directed towards creating brain image repositories including the recent US Alzheimer Disease Neuroimaging Initiative. Multisite-distributed computing infrastructures have been launched with the goal of fostering shared resources and facilitating data analysis in the study of neurodegenerative diseases. Currently, some Grid- and non-Grid-based projects are aiming to establish distributed e-infrastructures, interconnecting compatible imaging datasets and to supply neuroscientists with the most advanced information and communication technologies tools to study markers of Alzheimer’s and other brain diseases, but they have so far failed to make a difference in the larger neuroscience community. NeuGRID is an Europeon comission-funded effort arising from the needs of the Alzheimer’s disease imaging community, which will allow the collection and archiving of large amounts of imaging data coupled with Grid-based algorithms and sufficiently powered computational resources. The major benefit will be the faster discovery of new disease markers that will be valuable for earlier diagnosis and development of innovative drugs. The initial setup of neuGRID will feature three nodes equipped with supercomputer capabilities and resources of more than 300 processor cores, 300 GB of RAM memory and approximately 20 TB of physical space. The scope of this article is highlights the new perspectives and potential for the study of the neurodegenerative disorders using the emerging Grid technology.
Collapse
Affiliation(s)
- Alberto Redolfi
- Fatebenefratelli – Centro San Giovanni di Dio, Laboratory of Epidemiology & Neuroimaging, Via Pilastroni 4, I-25125 Brescia, Italy
| | - Richard McClatchey
- University of the West of England, The Centre for Complex Cooperative Systems, Frenchay Campus, Coldharbour Lane, Bristol BS16 1QY, UK
| | - Ashiq Anjum
- University of the West of England, The Centre for Complex Cooperative Systems, Frenchay Campus, Coldharbour Lane, Bristol BS16 1QY, UK
| | - Alex Zijdenbos
- Prodema Medical, Industriestrasse 6B, PO Box 51, 9620 Bronschhofen, Switzerland
| | - David Manset
- maat Gknowledge, Immeuble Alliance Entrée A, 74160 Archamps, France
| | - Frederik Barkhof
- VU University Medical Center, Department of Radiology, De Boelelaan 1118, 1081 HV Amsterdam, The Netherlands
| | - Christian Spenger
- Prodema Medical, Industriestrasse 6B, PO Box 51, 9620 Bronschhofen, Switzerland
| | - Yannik Legré
- HealthGrid, 36 rue Charles de Montesquieu, F-63430 Pont-du-Château, France
| | - Lars-Olof Wahlund
- Karolinska Institutet, Stockholm, Department of Neurobiology, Caring Sciences & Society, Division of Clinical Geriatrics Novum 5th floor, 141 86 Stockholm, Sweden
| | - Chiara Barattieri di San Pietro
- Fatebenefratelli – Centro San Giovanni di Dio, Laboratory of Epidemiology & Neuroimaging, Via Pilastroni 4, I-25125 Brescia, Italy
| | - Giovanni B Frisoni
- Fatebenefratelli – Centro San Giovanni di Dio, Laboratory of Epidemiology & Neuroimaging, Via Pilastroni 4, I-25125 Brescia, Italy
| |
Collapse
|