1
|
Pircalabelu E, Claeskens G, Jahfari S, Waldorp LJ. A focused information criterion for graphical models in fMRI connectivity with high-dimensional data. Ann Appl Stat 2015. [DOI: 10.1214/15-aoas882] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
2
|
de Hollander G, Wagenmakers EJ, Waldorp L, Forstmann B. An antidote to the imager's fallacy, or how to identify brain areas that are in limbo. PLoS One 2014; 9:e115700. [PMID: 25546581 PMCID: PMC4278760 DOI: 10.1371/journal.pone.0115700] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2014] [Accepted: 11/26/2014] [Indexed: 11/19/2022] Open
Abstract
Traditionally, fMRI data are analyzed using statistical parametric mapping approaches. Regardless of the precise thresholding procedure, these approaches ultimately divide the brain in regions that do or do not differ significantly across experimental conditions. This binary classification scheme fosters the so-called imager's fallacy, where researchers prematurely conclude that region A is selectively involved in a certain cognitive task because activity in that region reaches statistical significance and activity in region B does not. For such a conclusion to be statistically valid, however, a test on the differences in activation across these two regions is required. Here we propose a simple GLM-based method that defines an "in-between" category of brain regions that are neither significantly active nor inactive, but rather "in limbo". For regions that are in limbo, the activation pattern is inconclusive: it does not differ significantly from baseline, but neither does it differ significantly from regions that do show significant changes from baseline. This pattern indicates that measurement was insufficiently precise. By directly testing differences in activation, our procedure helps reduce the impact of the imager's fallacy. The method is illustrated using concrete examples.
Collapse
Affiliation(s)
- Gilles de Hollander
- University of Amsterdam, Amsterdam Brain and Cognition (ABC), Amsterdam, the Netherlands
- University of Amsterdam, Department of Psychology, Amsterdam, The Netherlands
- * E-mail:
| | | | - Lourens Waldorp
- University of Amsterdam, Department of Psychology, Amsterdam, The Netherlands
| | - Birte Forstmann
- University of Amsterdam, Amsterdam Brain and Cognition (ABC), Amsterdam, the Netherlands
- University of Amsterdam, Department of Psychology, Amsterdam, The Netherlands
| |
Collapse
|
3
|
Guillaume B, Hua X, Thompson PM, Waldorp L, Nichols TE. Fast and accurate modelling of longitudinal and repeated measures neuroimaging data. Neuroimage 2014; 94:287-302. [PMID: 24650594 PMCID: PMC4073654 DOI: 10.1016/j.neuroimage.2014.03.029] [Citation(s) in RCA: 131] [Impact Index Per Article: 13.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2013] [Revised: 02/25/2014] [Accepted: 03/10/2014] [Indexed: 02/01/2023] Open
Abstract
Despite the growing importance of longitudinal data in neuroimaging, the standard analysis methods make restrictive or unrealistic assumptions (e.g., assumption of Compound Symmetry--the state of all equal variances and equal correlations--or spatially homogeneous longitudinal correlations). While some new methods have been proposed to more accurately account for such data, these methods are based on iterative algorithms that are slow and failure-prone. In this article, we propose the use of the Sandwich Estimator method which first estimates the parameters of interest with a simple Ordinary Least Square model and second estimates variances/covariances with the "so-called" Sandwich Estimator (SwE) which accounts for the within-subject correlation existing in longitudinal data. Here, we introduce the SwE method in its classic form, and we review and propose several adjustments to improve its behaviour, specifically in small samples. We use intensive Monte Carlo simulations to compare all considered adjustments and isolate the best combination for neuroimaging data. We also compare the SwE method to other popular methods and demonstrate its strengths and weaknesses. Finally, we analyse a highly unbalanced longitudinal dataset from the Alzheimer's Disease Neuroimaging Initiative and demonstrate the flexibility of the SwE method to fit within- and between-subject effects in a single model. Software implementing this SwE method has been made freely available at http://warwick.ac.uk/tenichols/SwE.
Collapse
Affiliation(s)
- Bryan Guillaume
- Cyclotron Research Centre, University of Liège, 4000 Liège, Belgium; Department of Statistics, University of Warwick, Coventry, UK; Global Imaging Unit, GlaxoSmithKline, Stevenage, UK
| | - Xue Hua
- Imaging Genetics Center, Laboratory of Neuro Imaging, Dept. of Neurology & Psychiatry, UCLA School of Medicine, Los Angeles, CA 90095, USA
| | - Paul M Thompson
- Imaging Genetics Center, Laboratory of Neuro Imaging, Dept. of Neurology & Psychiatry, UCLA School of Medicine, Los Angeles, CA 90095, USA
| | - Lourens Waldorp
- Department of Psychological Methods, University of Amsterdam, Amsterdam, The Netherlands
| | - Thomas E Nichols
- Department of Statistics, University of Warwick, Coventry, UK; Warwick Manufacturing Group, University of Warwick, Coventry, UK; Oxford Centre for Functional MRI of the Brain, University of Oxford, Oxford, UK
| |
Collapse
|
4
|
Development of the Complex General Linear Model in the Fourier Domain: Application to fMRI Multiple Input-Output Evoked Responses for Single Subjects. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2013; 2013:645043. [PMID: 23840281 PMCID: PMC3697143 DOI: 10.1155/2013/645043] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/06/2013] [Revised: 05/03/2013] [Accepted: 05/13/2013] [Indexed: 11/17/2022]
Abstract
A linear time-invariant model based on statistical time series analysis in the Fourier domain for single subjects is further developed and applied to functional MRI (fMRI) blood-oxygen level-dependent (BOLD) multivariate data. This methodology was originally developed to analyze multiple stimulus input evoked response BOLD data. However, to analyze clinical data generated using a repeated measures experimental design, the model has been extended to handle multivariate time series data and demonstrated on control and alcoholic subjects taken from data previously analyzed in the temporal domain. Analysis of BOLD data is typically carried out in the time domain where the data has a high temporal correlation. These analyses generally employ parametric models of the hemodynamic response function (HRF) where prewhitening of the data is attempted using autoregressive (AR) models for the noise. However, this data can be analyzed in the Fourier domain. Here, assumptions made on the noise structure are less restrictive, and hypothesis tests can be constructed based on voxel-specific nonparametric estimates of the hemodynamic transfer function (HRF in the Fourier domain). This is especially important for experimental designs involving multiple states (either stimulus or drug induced) that may alter the form of the response function.
Collapse
|
5
|
Lindquist MA, Spicer J, Asllani I, Wager TD. Estimating and testing variance components in a multi-level GLM. Neuroimage 2011; 59:490-501. [PMID: 21835242 DOI: 10.1016/j.neuroimage.2011.07.077] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2011] [Revised: 07/18/2011] [Accepted: 07/25/2011] [Indexed: 11/19/2022] Open
Abstract
Most analysis of multi-subject fMRI data is concerned with determining whether there exists a significant population-wide 'activation' in a comparison between two or more conditions. This is typically assessed by testing the average value of a contrast of parameter estimates (COPE) against zero in a general linear model (GLM) analysis. However, important information can also be obtained by testing whether there exist significant individual differences in effect magnitude between subjects, i.e. whether the variance of a COPE is significantly different from zero. Intuitively, such a test amounts to testing whether inter-individual differences are larger than would be expected given the within-subject error variance. We compare several methods for estimating variance components, including a) a naïve estimate using ordinary least squares (OLS); b) linear mixed effects in R (LMER); c) a novel Matlab implementation of iterative generalized least squares (IGLS) and its restricted maximum likelihood variant (RIGLS). All methods produced reasonable estimates of within- and between-subject variance components, with IGLS providing an attractive balance between sensitivity and appropriate control of false positives. Finally, we use the IGLS method to estimate inter-subject variance in a perfusion fMRI study (N=18) of social evaluative threat, and show evidence for significant inter-individual differences in ventromedial prefrontal cortex (VMPFC), amygdala, hippocampus and medial temporal lobes, insula, and brainstem, with predicted inverse coupling between VMPFC and the midbrain periaqueductal gray only when high inter-individual variance was used to define the seed for functional connectivity analyses. In sum, tests of variance provides a way of selecting regions that show significant inter-individual variability for subsequent analyses that attempt to explain those individual differences.
Collapse
|
6
|
Monti MM. Statistical Analysis of fMRI Time-Series: A Critical Review of the GLM Approach. Front Hum Neurosci 2011; 5:28. [PMID: 21442013 PMCID: PMC3062970 DOI: 10.3389/fnhum.2011.00028] [Citation(s) in RCA: 146] [Impact Index Per Article: 11.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2010] [Accepted: 03/06/2011] [Indexed: 11/13/2022] Open
Abstract
Functional magnetic resonance imaging (fMRI) is one of the most widely used tools to study the neural underpinnings of human cognition. Standard analysis of fMRI data relies on a general linear model (GLM) approach to separate stimulus induced signals from noise. Crucially, this approach relies on a number of assumptions about the data which, for inferences to be valid, must be met. The current paper reviews the GLM approach to analysis of fMRI time-series, focusing in particular on the degree to which such data abides by the assumptions of the GLM framework, and on the methods that have been developed to correct for any violation of those assumptions. Rather than biasing estimates of effect size, the major consequence of non-conformity to the assumptions is to introduce bias into estimates of the variance, thus affecting test statistics, power, and false positive rates. Furthermore, this bias can have pervasive effects on both individual subject and group-level statistics, potentially yielding qualitatively different results across replications, especially after the thresholding procedures commonly used for inference-making.
Collapse
Affiliation(s)
- Martin M. Monti
- Department of Psychology, University of CaliforniaLos Angeles, CA, USA
| |
Collapse
|
7
|
Effective connectivity of fMRI data using ancestral graph theory: Dealing with missing regions. Neuroimage 2011; 54:2695-705. [DOI: 10.1016/j.neuroimage.2010.10.054] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2009] [Revised: 08/21/2010] [Accepted: 10/18/2010] [Indexed: 11/22/2022] Open
|
8
|
Weeda WD, Waldorp LJ, Grasman RPPP, van Gaal S, Huizenga HM. Functional connectivity analysis of fMRI data using parameterized regions-of-interest. Neuroimage 2010; 54:410-6. [PMID: 20637877 DOI: 10.1016/j.neuroimage.2010.07.022] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2010] [Revised: 07/07/2010] [Accepted: 07/09/2010] [Indexed: 11/30/2022] Open
Abstract
Connectivity analysis of fMRI data requires correct specification of regions-of-interest (ROIs). Selection of ROIs based on outcomes of a GLM analysis may be hindered by conservativeness of the multiple comparison correction, while selection based on brain anatomy may be biased due to inconsistent structure-to-function mapping. To alleviate these problems we propose a method to define functional ROIs without the need for a stringent multiple comparison correction. We extend a flexible framework for fMRI analysis (Activated Region Fitting, Weeda et al. 2009) to connectivity analysis of fMRI data. This method describes an entire fMRI data volume by regions of activation defined by a limited number of parameters. Therefore a less stringent multiple comparison procedure is required. The regions of activation from this analysis can be directly used to estimate functional connectivity. Simulations show that Activated Region Fitting can recover the connectivity of brain regions. An application to real data of a Go/No-Go experiment highlights the advantages of the method.
Collapse
Affiliation(s)
- Wouter D Weeda
- University of Amsterdam, Department of Psychology, Amsterdam, The Netherlands.
| | | | | | | | | |
Collapse
|
9
|
Hajduch M, Hearne LB, Miernyk JA, Casteel JE, Joshi T, Agrawal GK, Song Z, Zhou M, Xu D, Thelen JJ. Systems analysis of seed filling in Arabidopsis: using general linear modeling to assess concordance of transcript and protein expression. PLANT PHYSIOLOGY 2010; 152:2078-87. [PMID: 20118269 PMCID: PMC2850034 DOI: 10.1104/pp.109.152413] [Citation(s) in RCA: 59] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/21/2009] [Accepted: 01/26/2010] [Indexed: 05/18/2023]
Abstract
Previous systems analyses in plants have focused on a single developmental stage or time point, although it is often important to additionally consider time-index changes. During seed development a cascade of events occurs within a relatively brief time scale. We have collected protein and transcript expression data from five sequential stages of Arabidopsis (Arabidopsis thaliana) seed development encompassing the period of reserve polymer accumulation. Protein expression profiling employed two-dimensional gel electrophoresis coupled with tandem mass spectrometry, while transcript profiling used oligonucleotide microarrays. Analyses in biological triplicate yielded robust expression information for 523 proteins and 22,746 genes across the five developmental stages, and established 319 protein/transcript pairs for subsequent pattern analysis. General linear modeling was used to evaluate the protein/transcript expression patterns. Overall, application of this statistical assessment technique showed concurrence for a slight majority (56%) of expression pairs. Many specific examples of discordant protein/transcript expression patterns were detected, suggesting that this approach will be useful in revealing examples of posttranscriptional regulation.
Collapse
|