101
|
Çelik E, Keles U, Kiremitçi İ, Gallant JL, Çukur T. Cortical networks of dynamic scene category representation in the human brain. Cortex 2021; 143:127-147. [PMID: 34411847 DOI: 10.1016/j.cortex.2021.07.008] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2020] [Revised: 06/28/2021] [Accepted: 07/14/2021] [Indexed: 10/20/2022]
Abstract
Humans have an impressive ability to rapidly process global information in natural scenes to infer their category. Yet, it remains unclear whether and how scene categories observed dynamically in the natural world are represented in cerebral cortex beyond few canonical scene-selective areas. To address this question, here we examined the representation of dynamic visual scenes by recording whole-brain blood oxygenation level-dependent (BOLD) responses while subjects viewed natural movies. We fit voxelwise encoding models to estimate tuning for scene categories that reflect statistical ensembles of objects and actions in the natural world. We find that this scene-category model explains a significant portion of the response variance broadly across cerebral cortex. Cluster analysis of scene-category tuning profiles across cortex reveals nine spatially-segregated networks of brain regions consistently across subjects. These networks show heterogeneous tuning for a diverse set of dynamic scene categories related to navigation, human activity, social interaction, civilization, natural environment, non-human animals, motion-energy, and texture, suggesting that the organization of scene category representation is quite complex.
Collapse
Affiliation(s)
- Emin Çelik
- Neuroscience Program, Sabuncu Brain Research Center, Bilkent University, Ankara, Turkey; National Magnetic Resonance Research Center (UMRAM), Bilkent University, Ankara, Turkey.
| | - Umit Keles
- National Magnetic Resonance Research Center (UMRAM), Bilkent University, Ankara, Turkey; Division of Humanities and Social Sciences, California Institute of Technology, Pasadena, CA, USA
| | - İbrahim Kiremitçi
- Neuroscience Program, Sabuncu Brain Research Center, Bilkent University, Ankara, Turkey; National Magnetic Resonance Research Center (UMRAM), Bilkent University, Ankara, Turkey
| | - Jack L Gallant
- Helen Wills Neuroscience Institute, University of California, Berkeley, Berkeley, CA, USA; Department of Bioengineering, University of California, Berkeley, Berkeley, CA, USA; Department of Psychology, University of California, Berkeley, CA, USA
| | - Tolga Çukur
- Neuroscience Program, Sabuncu Brain Research Center, Bilkent University, Ankara, Turkey; National Magnetic Resonance Research Center (UMRAM), Bilkent University, Ankara, Turkey; Department of Electrical and Electronics Engineering, Bilkent University, Ankara, Turkey
| |
Collapse
|
102
|
Fast Estimation of L1-Regularized Linear Models in the Mass-Univariate Setting. Neuroinformatics 2021; 19:385-392. [PMID: 32935193 PMCID: PMC8233242 DOI: 10.1007/s12021-020-09489-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
Abstract
In certain modeling approaches, activation analyses of task-based fMRI data can involve a relatively large number of predictors. For example, in the encoding model approach, complex stimuli are represented in a high-dimensional feature space, resulting in design matrices with many predictors. Similarly, single-trial models and finite impulse response models may also encompass a large number of predictors. In settings where only few of those predictors are expected to be informative, a sparse model fit can be obtained via L1-regularization. However, estimating L1-regularized models requires an iterative fitting procedure, which considerably increases computation time compared to estimating unregularized or L2-regularized models, and complicates the application of L1-regularization on whole-brain data and large sample sizes. Here we provide several functions for estimating L1-regularized models that are optimized for the mass-univariate analysis approach. The package includes a parallel implementation of the coordinate descent algorithm for CPU-only systems and two implementations of the alternating direction method of multipliers algorithm requiring a GPU device. While the core algorithms are implemented in C++/CUDA, data input/output and parameter settings can be conveniently handled via Matlab. The CPU-based implementation is highly memory-efficient and provides considerable speed-up compared to the standard implementation not optimized for the mass-univariate approach. Further acceleration can be achieved on systems equipped with a CUDA-enabled GPU. Using the fastest GPU-based implementation, computation time for whole-brain estimates can be reduced from 9 h to 5 min in an exemplary data setting. Overall, the provided package facilitates the use of L1-regularization for fMRI activation analyses and enables an efficient employment of L1-regularization on whole-brain data and large sample sizes.
Collapse
|
103
|
Russ BE, Petkov CI, Kwok SC, Zhu Q, Belin P, Vanduffel W, Hamed SB. Common functional localizers to enhance NHP & cross-species neuroscience imaging research. Neuroimage 2021; 237:118203. [PMID: 34048898 PMCID: PMC8529529 DOI: 10.1016/j.neuroimage.2021.118203] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2020] [Revised: 05/15/2021] [Accepted: 05/24/2021] [Indexed: 11/25/2022] Open
Abstract
Functional localizers are invaluable as they can help define regions of interest, provide cross-study comparisons, and most importantly, allow for the aggregation and meta-analyses of data across studies and laboratories. To achieve these goals within the non-human primate (NHP) imaging community, there is a pressing need for the use of standardized and validated localizers that can be readily implemented across different groups. The goal of this paper is to provide an overview of the value of localizer protocols to imaging research and we describe a number of commonly used or novel localizers within NHPs, and keys to implement them across studies. As has been shown with the aggregation of resting-state imaging data in the original PRIME-DE submissions, we believe that the field is ready to apply the same initiative for task-based functional localizers in NHP imaging. By coming together to collect large datasets across research group, implementing the same functional localizers, and sharing the localizers and data via PRIME-DE, it is now possible to fully test their robustness, selectivity and specificity. To do this, we reviewed a number of common localizers and we created a repository of well-established localizer that are easily accessible and implemented through the PRIME-RE platform.
Collapse
Affiliation(s)
- Brian E Russ
- Center for Biomedical Imaging and Neuromodulation, Nathan Kline Institute, Orangeburg, NY, United States; Department of Neuroscience, Icahn School of Medicine at Mount Sinai, New York City, NY, United States; Department of Psychiatry, New York University at Langone, New York City, NY, United States.
| | - Christopher I Petkov
- Biosciences Institute, Newcastle University Medical School, Newcastle upon Tyne, United Kingdom
| | - Sze Chai Kwok
- Shanghai Key Laboratory of Brain Functional Genomics, Key Laboratory of Brain Functional Genomics Ministry of Education, Shanghai Key Laboratory of Magnetic Resonance, Affiliated Mental Health Center (ECNU), School of Psychology and Cognitive Science, East China Normal University, Shanghai, China; Division of Natural and Applied Sciences, Duke Kunshan University, Kunshan, Jiangsu, China; NYU-ECNU Institute of Brain and Cognitive Science at NYU Shanghai, Shanghai, China
| | - Qi Zhu
- Cognitive Neuroimaging Unit, INSERM, CEA, Université Paris-Saclay, NeuroSpin Center, 91191 Gif/Yvette, France; Laboratory for Neuro-and Psychophysiology, Department of Neurosciences, KU Leuven Medical School, Leuven, 3000, Belgium
| | - Pascal Belin
- Institut de Neurosciences de La Timone, Aix-Marseille Université et CNRS, Marseille, 13005, France
| | - Wim Vanduffel
- Laboratory for Neuro-and Psychophysiology, Department of Neurosciences, KU Leuven Medical School, Leuven, 3000, Belgium; Leuven Brain Institute, KU Leuven, Leuven, 3000, Belgium; Athinoula A. Martinos Center for Biomedical Imaging, Massachusetts General Hospital, Charlestown, MA 02129, United States; Department of Radiology, Harvard Medical School, Boston, MA 02144, United States.
| | - Suliann Ben Hamed
- Institut des Sciences Cognitives Marc Jeannerod, UMR 5229, Université de Lyon - CNRS, France.
| |
Collapse
|
104
|
|
105
|
Fedorenko E. The early origins and the growing popularity of the individual-subject analytic approach in human neuroscience. Curr Opin Behav Sci 2021. [DOI: 10.1016/j.cobeha.2021.02.023] [Citation(s) in RCA: 26] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
|
106
|
Pasternak T, Tadin D. Linking Neuronal Direction Selectivity to Perceptual Decisions About Visual Motion. Annu Rev Vis Sci 2021; 6:335-362. [PMID: 32936737 DOI: 10.1146/annurev-vision-121219-081816] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Psychophysical and neurophysiological studies of responses to visual motion have converged on a consistent set of general principles that characterize visual processing of motion information. Both types of approaches have shown that the direction and speed of target motion are among the most important encoded stimulus properties, revealing many parallels between psychophysical and physiological responses to motion. Motivated by these parallels, this review focuses largely on more direct links between the key feature of the neuronal response to motion, direction selectivity, and its utilization in memory-guided perceptual decisions. These links were established during neuronal recordings in monkeys performing direction discriminations, but also by examining perceptual effects of widespread elimination of cortical direction selectivity produced by motion deprivation during development. Other approaches, such as microstimulation and lesions, have documented the importance of direction-selective activity in the areas that are active during memory-guided direction comparisons, area MT and the prefrontal cortex, revealing their likely interactions during behavioral tasks.
Collapse
Affiliation(s)
- Tatiana Pasternak
- Department of Neuroscience, University of Rochester, Rochester, New York 14642, USA; , .,Department of Brain and Cognitive Sciences, University of Rochester, Rochester, New York 14627, USA.,Center for Visual Science, University of Rochester, Rochester, New York 14627, USA.,Del Monte Institute for Neuroscience, University of Rochester, Rochester, New York 14642, USA
| | - Duje Tadin
- Department of Neuroscience, University of Rochester, Rochester, New York 14642, USA; , .,Department of Brain and Cognitive Sciences, University of Rochester, Rochester, New York 14627, USA.,Center for Visual Science, University of Rochester, Rochester, New York 14627, USA.,Del Monte Institute for Neuroscience, University of Rochester, Rochester, New York 14642, USA.,Department of Ophthalmology, University of Rochester, Rochester, New York 14642, USA
| |
Collapse
|
107
|
Heins N, Pomp J, Kluger DS, Vinbrüx S, Trempler I, Kohler A, Kornysheva K, Zentgraf K, Raab M, Schubotz RI. Surmising synchrony of sound and sight: Factors explaining variance of audiovisual integration in hurdling, tap dancing and drumming. PLoS One 2021; 16:e0253130. [PMID: 34293800 PMCID: PMC8298114 DOI: 10.1371/journal.pone.0253130] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2020] [Accepted: 05/31/2021] [Indexed: 11/18/2022] Open
Abstract
Auditory and visual percepts are integrated even when they are not perfectly temporally aligned with each other, especially when the visual signal precedes the auditory signal. This window of temporal integration for asynchronous audiovisual stimuli is relatively well examined in the case of speech, while other natural action-induced sounds have been widely neglected. Here, we studied the detection of audiovisual asynchrony in three different whole-body actions with natural action-induced sounds–hurdling, tap dancing and drumming. In Study 1, we examined whether audiovisual asynchrony detection, assessed by a simultaneity judgment task, differs as a function of sound production intentionality. Based on previous findings, we expected that auditory and visual signals should be integrated over a wider temporal window for actions creating sounds intentionally (tap dancing), compared to actions creating sounds incidentally (hurdling). While percentages of perceived synchrony differed in the expected way, we identified two further factors, namely high event density and low rhythmicity, to induce higher synchrony ratings as well. Therefore, we systematically varied event density and rhythmicity in Study 2, this time using drumming stimuli to exert full control over these variables, and the same simultaneity judgment tasks. Results suggest that high event density leads to a bias to integrate rather than segregate auditory and visual signals, even at relatively large asynchronies. Rhythmicity had a similar, albeit weaker effect, when event density was low. Our findings demonstrate that shorter asynchronies and visual-first asynchronies lead to higher synchrony ratings of whole-body action, pointing to clear parallels with audiovisual integration in speech perception. Overconfidence in the naturally expected, that is, synchrony of sound and sight, was stronger for intentional (vs. incidental) sound production and for movements with high (vs. low) rhythmicity, presumably because both encourage predictive processes. In contrast, high event density appears to increase synchronicity judgments simply because it makes the detection of audiovisual asynchrony more difficult. More studies using real-life audiovisual stimuli with varying event densities and rhythmicities are needed to fully uncover the general mechanisms of audiovisual integration.
Collapse
Affiliation(s)
- Nina Heins
- Department of Psychology, University of Muenster, Muenster, Germany
- Otto Creutzfeldt Center for Cognitive and Behavioral Neuroscience, University of Muenster, Muenster, Germany
| | - Jennifer Pomp
- Department of Psychology, University of Muenster, Muenster, Germany
- Otto Creutzfeldt Center for Cognitive and Behavioral Neuroscience, University of Muenster, Muenster, Germany
| | - Daniel S. Kluger
- Otto Creutzfeldt Center for Cognitive and Behavioral Neuroscience, University of Muenster, Muenster, Germany
- Institute for Biomagnetism and Biosignal Analysis, University Hospital Muenster, Muenster, Germany
| | - Stefan Vinbrüx
- Institute of Sport and Exercise Sciences, Human Performance and Training, University of Muenster, Muenster, Germany
| | - Ima Trempler
- Department of Psychology, University of Muenster, Muenster, Germany
- Otto Creutzfeldt Center for Cognitive and Behavioral Neuroscience, University of Muenster, Muenster, Germany
| | - Axel Kohler
- Otto Creutzfeldt Center for Cognitive and Behavioral Neuroscience, University of Muenster, Muenster, Germany
| | - Katja Kornysheva
- School of Psychology and Bangor Neuroimaging Unit, Bangor University, Wales, United Kingdom
| | - Karen Zentgraf
- Department of Movement Science and Training in Sports, Institute of Sport Sciences, Goethe University Frankfurt, Frankfurt, Germany
| | - Markus Raab
- Institute of Psychology, German Sport University Cologne, Cologne, Germany
- School of Applied Sciences, London South Bank University, London, United Kingdom
| | - Ricarda I. Schubotz
- Department of Psychology, University of Muenster, Muenster, Germany
- Otto Creutzfeldt Center for Cognitive and Behavioral Neuroscience, University of Muenster, Muenster, Germany
- * E-mail:
| |
Collapse
|
108
|
Abstract
Selectivity for many basic properties of visual stimuli, such as orientation, is thought to be organized at the scale of cortical columns, making it difficult or impossible to measure directly with noninvasive human neuroscience measurement. However, computational analyses of neuroimaging data have shown that selectivity for orientation can be recovered by considering the pattern of response across a region of cortex. This suggests that computational analyses can reveal representation encoded at a finer spatial scale than is implied by the spatial resolution limits of measurement techniques. This potentially opens up the possibility to study a much wider range of neural phenomena that are otherwise inaccessible through noninvasive measurement. However, as we review in this article, a large body of evidence suggests an alternative hypothesis to this superresolution account: that orientation information is available at the spatial scale of cortical maps and thus easily measurable at the spatial resolution of standard techniques. In fact, a population model shows that this orientation information need not even come from single-unit selectivity for orientation tuning, but instead can result from population selectivity for spatial frequency. Thus, a categorical error of interpretation can result whereby orientation selectivity can be confused with spatial frequency selectivity. This is similarly problematic for the interpretation of results from numerous studies of more complex representations and cognitive functions that have built upon the computational techniques used to reveal stimulus orientation. We suggest in this review that these interpretational ambiguities can be avoided by treating computational analyses as models of the neural processes that give rise to measurement. Building upon the modeling tradition in vision science using considerations of whether population models meet a set of core criteria is important for creating the foundation for a cumulative and replicable approach to making valid inferences from human neuroscience measurements. Expected final online publication date for the Annual Review of Vision Science, Volume 7 is September 2021. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.
Collapse
Affiliation(s)
- Justin L Gardner
- Department of Psychology, Stanford University, Stanford, California 94305, USA;
| | - Elisha P Merriam
- Laboratory of Brain and Cognition, National Institute of Mental Health, National Institutes of Health, Bethesda, Maryland 20892, USA;
| |
Collapse
|
109
|
Vu AT, Feinberg DA. The Role of Cerebral Metabolism in Improving Time Pressured Decisions. Front Psychol 2021; 12:690198. [PMID: 34354635 PMCID: PMC8329240 DOI: 10.3389/fpsyg.2021.690198] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2021] [Accepted: 06/14/2021] [Indexed: 11/13/2022] Open
Abstract
Speed-accuracy tradeoff (SAT) theory dictates that decisions can be made more quickly by sacrificing accuracy. Here we investigate whether the human brain can operate in a brief metabolic overdrive to overcome SAT and successfully make decisions requiring both high levels of speed and accuracy. In the context of BOLD fMRI we expect “a brief metabolic overdrive” to involve an increase in cerebral oxygen metabolism prior to increased cerebral blood flow–a phenomenon known as the “initial dip” which results from a sudden drop in oxyhemoglobin in perfusing blood. Human subjects performed a motion discrimination task consisting of different difficulties while emphasizing either accuracy (i.e., without time pressure) or both speed and accuracy (i.e., with time pressure). Using simultaneous multi-slice fMRI, for very fast (333 ms) measurement of whole brain BOLD activity, revealed two modes of physiological overdrive responses when subjects emphasized both speed and accuracy. The majority of subjects exhibited the hypothesized enhancement of initial dip amplitude in posterior visual cortex (PVC) with the size of the enhancement significantly correlated with improvement in behavioral performance. For these subjects, the traditionally analyzed post-stimulus overshoot was not affected by task emphasis. These results demonstrate the complexity and variability of the BOLD hemodynamic response. The discovered relationships between BOLD response and behavior were only observed when subjects emphasized both speed and accuracy in more difficult trials suggesting that the brain can perform in a state of metabolic overdrive with enhanced neural processing of sensory information specifically in challenging situations.
Collapse
Affiliation(s)
- An Thanh Vu
- San Francisco VA Health Care System, San Francisco, CA, United States.,Department of Radiology and Biomedical Imaging, University of California, San Francisco, San Francisco, CA, United States
| | - David A Feinberg
- Advanced Magnetic Resonance Imaging (MRI) Technologies, Sebastopol, CA, United States.,Helen Wills Neuroscience Institute, University of California, Berkeley, Berkeley, CA, United States
| |
Collapse
|
110
|
Bridewell W, Isaac AMC. Apophatic science: how computational modeling can explain consciousness. Neurosci Conscious 2021; 2021:niab010. [PMID: 34141451 PMCID: PMC8206510 DOI: 10.1093/nc/niab010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2020] [Revised: 02/08/2021] [Accepted: 03/16/2021] [Indexed: 11/17/2022] Open
Abstract
This study introduces a novel methodology for consciousness science. Consciousness as we understand it pretheoretically is inherently subjective, yet the data available to science are irreducibly intersubjective. This poses a unique challenge for attempts to investigate consciousness empirically. We meet this challenge by combining two insights. First, we emphasize the role that computational models play in integrating results relevant to consciousness from across the cognitive sciences. This move echoes Alan Newell’s call that the language and concepts of computer science serve as a lingua franca for integrative cognitive science. Second, our central contribution is a new method for validating computational models that treats them as providing negative data on consciousness: data about what consciousness is not. This method is designed to support a quantitative science of consciousness while avoiding metaphysical commitments. We discuss how this methodology applies to current and future research and address questions that others have raised.
Collapse
Affiliation(s)
- Will Bridewell
- Navy Center for Applied Research in Artificial Intelligence, U.S. Naval Research Laboratory, 4555 Overlook Ave SW, Washington, DC 20375, USA
| | | |
Collapse
|
111
|
Iliski, a software for robust calculation of transfer functions. PLoS Comput Biol 2021; 17:e1008614. [PMID: 34125846 PMCID: PMC8224889 DOI: 10.1371/journal.pcbi.1008614] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2020] [Revised: 06/24/2021] [Accepted: 05/18/2021] [Indexed: 12/27/2022] Open
Abstract
Understanding the relationships between biological processes is paramount to unravel pathophysiological mechanisms. These relationships can be modeled with Transfer Functions (TFs), with no need of a priori hypotheses as to the shape of the transfer function. Here we present Iliski, a software dedicated to TFs computation between two signals. It includes different pre-treatment routines and TF computation processes: deconvolution, deterministic and non-deterministic optimization algorithms that are adapted to disparate datasets. We apply Iliski to data on neurovascular coupling, an ensemble of cellular mechanisms that link neuronal activity to local changes of blood flow, highlighting the software benefits and caveats in the computation and evaluation of TFs. We also propose a workflow that will help users to choose the best computation according to the dataset. Iliski is available under the open-source license CC BY 4.0 on GitHub (https://github.com/alike-aydin/Iliski) and can be used on the most common operating systems, either within the MATLAB environment, or as a standalone application. Iliski is a software helping the user to find the relationship between two sets of data, namely transfer functions. Although transfer functions are widely used in many scientific fields to link two signals, their computation can be tricky due to data features such as multisource noise, or to specific shape requirements imposed by the nature of the signals, e.g. in biological data. Iliski offers a user-friendly graphical interface to ease the computation of transfer functions for both experienced and users with no coding skills. It proposes several signal pre-processing methods and allows rapid testing of different computing approaches, either based on deconvolution or on optimization of multi-parametric functions. This article, combined with a User Manual, provides a detailed description of Iliski functionalities and a thorough description of the advantages and drawbacks of each computing method using experimental biological data. In the era of Big Data, scientists strive to find new models for patho-physiological mechanisms, and Iliski fulfils the requirements of rigorous, flexible, and fast data driven hypothesis testing.
Collapse
|
112
|
Kim YJ, Brackbill N, Batty E, Lee J, Mitelut C, Tong W, Chichilnisky EJ, Paninski L. Nonlinear Decoding of Natural Images From Large-Scale Primate Retinal Ganglion Recordings. Neural Comput 2021; 33:1719-1750. [PMID: 34411268 DOI: 10.1162/neco_a_01395] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2020] [Accepted: 01/25/2021] [Indexed: 11/04/2022]
Abstract
Decoding sensory stimuli from neural activity can provide insight into how the nervous system might interpret the physical environment, and facilitates the development of brain-machine interfaces. Nevertheless, the neural decoding problem remains a significant open challenge. Here, we present an efficient nonlinear decoding approach for inferring natural scene stimuli from the spiking activities of retinal ganglion cells (RGCs). Our approach uses neural networks to improve on existing decoders in both accuracy and scalability. Trained and validated on real retinal spike data from more than 1000 simultaneously recorded macaque RGC units, the decoder demonstrates the necessity of nonlinear computations for accurate decoding of the fine structures of visual stimuli. Specifically, high-pass spatial features of natural images can only be decoded using nonlinear techniques, while low-pass features can be extracted equally well by linear and nonlinear methods. Together, these results advance the state of the art in decoding natural stimuli from large populations of neurons.
Collapse
|
113
|
Nishida S, Blanc A, Maeda N, Kado M, Nishimoto S. Behavioral correlates of cortical semantic representations modeled by word vectors. PLoS Comput Biol 2021; 17:e1009138. [PMID: 34161315 PMCID: PMC8260002 DOI: 10.1371/journal.pcbi.1009138] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2020] [Revised: 07/06/2021] [Accepted: 06/01/2021] [Indexed: 12/03/2022] Open
Abstract
The quantitative modeling of semantic representations in the brain plays a key role in understanding the neural basis of semantic processing. Previous studies have demonstrated that word vectors, which were originally developed for use in the field of natural language processing, provide a powerful tool for such quantitative modeling. However, whether semantic representations in the brain revealed by the word vector-based models actually capture our perception of semantic information remains unclear, as there has been no study explicitly examining the behavioral correlates of the modeled brain semantic representations. To address this issue, we compared the semantic structure of nouns and adjectives in the brain estimated from word vector-based brain models with that evaluated from human behavior. The brain models were constructed using voxelwise modeling to predict the functional magnetic resonance imaging (fMRI) response to natural movies from semantic contents in each movie scene through a word vector space. The semantic dissimilarity of brain word representations was then evaluated using the brain models. Meanwhile, data on human behavior reflecting the perception of semantic dissimilarity between words were collected in psychological experiments. We found a significant correlation between brain model- and behavior-derived semantic dissimilarities of words. This finding suggests that semantic representations in the brain modeled via word vectors appropriately capture our perception of word meanings.
Collapse
Affiliation(s)
- Satoshi Nishida
- Center for Information and Neural Networks (CiNet), National Institute of Information and Communications Technology (NICT), Suita, Osaka, Japan
- Graduate School of Frontier Biosciences, Osaka University, Suita, Osaka, Japan
| | - Antoine Blanc
- Center for Information and Neural Networks (CiNet), National Institute of Information and Communications Technology (NICT), Suita, Osaka, Japan
| | | | | | - Shinji Nishimoto
- Center for Information and Neural Networks (CiNet), National Institute of Information and Communications Technology (NICT), Suita, Osaka, Japan
- Graduate School of Frontier Biosciences, Osaka University, Suita, Osaka, Japan
- Graduate School of Medicine, Osaka University, Suita, Osaka, Japan
| |
Collapse
|
114
|
Nakai T, Yamaguchi HQ, Nishimoto S. Convergence of Modality Invariance and Attention Selectivity in the Cortical Semantic Circuit. Cereb Cortex 2021; 31:4825-4839. [PMID: 33999141 PMCID: PMC8408468 DOI: 10.1093/cercor/bhab125] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2020] [Revised: 04/05/2021] [Accepted: 04/10/2021] [Indexed: 11/12/2022] Open
Abstract
The human linguistic system is characterized by modality invariance and attention selectivity. Previous studies have examined these properties independently and reported perisylvian region involvement for both; however, their relationship and the linguistic information they harbor remain unknown. Participants were assessed by functional magnetic resonance imaging, while spoken narratives (auditory) and written texts (visual) were presented, either separately or simultaneously. Participants were asked to attend to one stimulus when both were presented. We extracted phonemic and semantic features from these auditory and visual modalities, to train multiple, voxel-wise encoding models. Cross-modal examinations of the trained models revealed that perisylvian regions were associated with modality-invariant semantic representations. Attentional selectivity was quantified by examining the modeling performance for attended and unattended conditions. We have determined that perisylvian regions exhibited attention selectivity. Both modality invariance and attention selectivity are both prominent in models that use semantic but not phonemic features. Modality invariance was significantly correlated with attention selectivity in some brain regions; however, we also identified cortical regions associated with only modality invariance or only attention selectivity. Thus, paying selective attention to a specific sensory input modality may regulate the semantic information that is partly processed in brain networks that are shared across modalities.
Collapse
Affiliation(s)
- Tomoya Nakai
- Center for Information and Neural Networks (CiNet), National Institute of Information and Communications Technology, Osaka 565-0871, Japan.,Graduate School of Frontier Biosciences, Osaka University, Osaka 565-0871, Japan
| | - Hiroto Q Yamaguchi
- Center for Information and Neural Networks (CiNet), National Institute of Information and Communications Technology, Osaka 565-0871, Japan.,Graduate School of Frontier Biosciences, Osaka University, Osaka 565-0871, Japan
| | - Shinji Nishimoto
- Center for Information and Neural Networks (CiNet), National Institute of Information and Communications Technology, Osaka 565-0871, Japan.,Graduate School of Frontier Biosciences, Osaka University, Osaka 565-0871, Japan.,Graduate School of Medicine, Osaka University, Osaka 565-0871, Japan
| |
Collapse
|
115
|
Zhang T, Gao JS, Çukur T, Gallant JL. Voxel-Based State Space Modeling Recovers Task-Related Cognitive States in Naturalistic fMRI Experiments. Front Neurosci 2021; 14:565976. [PMID: 34045937 PMCID: PMC8145286 DOI: 10.3389/fnins.2020.565976] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2020] [Accepted: 09/24/2020] [Indexed: 11/23/2022] Open
Abstract
Complex natural tasks likely recruit many different functional brain networks, but it is difficult to predict how such tasks will be represented across cortical areas and networks. Previous electrophysiology studies suggest that task variables are represented in a low-dimensional subspace within the activity space of neural populations. Here we develop a voxel-based state space modeling method for recovering task-related state spaces from human fMRI data. We apply this method to data acquired in a controlled visual attention task and a video game task. We find that each task induces distinct brain states that can be embedded in a low-dimensional state space that reflects task parameters, and that attention increases state separation in the task-related subspace. Our results demonstrate that the state space framework offers a powerful approach for modeling human brain activity elicited by complex natural tasks.
Collapse
Affiliation(s)
- Tianjiao Zhang
- Program in Bioengineering, University of California, Berkeley, Berkeley, CA, United States
| | - James S Gao
- Helen Wills Neuroscience Institute, University of California, Berkeley, Berkeley, CA, United States
| | - Tolga Çukur
- Department of Electrical and Electronics Engineering, Bilkent University, Ankara, Turkey.,National Magnetic Resonance Research Center (UMRAM), Bilkent University, Ankara, Turkey.,Neuroscience Program, Sabuncu Brain Research Center, Bilkent University, Ankara, Turkey
| | - Jack L Gallant
- Program in Bioengineering, University of California, Berkeley, Berkeley, CA, United States.,Helen Wills Neuroscience Institute, University of California, Berkeley, Berkeley, CA, United States.,Department of Psychology, University of California, Berkeley, Berkeley, CA, United States
| |
Collapse
|
116
|
El Ouardi L. The Moroccan Arabic verb and noun test for language mapping (MAVNT-LP) under nTMS and DES. APPLIED NEUROPSYCHOLOGY-ADULT 2021; 29:1413-1424. [PMID: 33689513 DOI: 10.1080/23279095.2021.1883020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
Abstract
To maximize tumor resection and minimize postoperative neurological sequelae, intraoperative Direct Electrical Stimulation (DES) coupled with preoperative Navigated Transcranial Magnetic Stimulation (nTMS) is adopted as a more valid procedure when a tumor is located in or near language-positive cortical and subcortical brain areas/networks. To map language functions peri- and intraoperatively, naming tasks are usually administered given their sensitivity and practicality in mapping language networks and their association with positive postoperative outcomes. Linguistic protocols designed for stimulation under nTMS are relatively scarce, and non-existent in the Arabic language. The present study attempts to fill these gaps by presenting the processes of development, piloting, and standardization of the first (Moroccan) Arabic object and action naming protocol for use preoperatively under nTMS, and intraoperatively under DES. The MAVNT-LP was developed in accordance with both DES and nTMS time requirements and was balanced for relevant psycholinguistic as well as intrinsic factors. The test underwent piloting on a population of 10 Moroccan Arabic (MA)-speaking individuals and was validated on a population of 50 participants. The standardized version of the test consisted of 61 nouns and 61 verbs. The 122 items included in the test were named accurately by at least 80% of the participants and had a high naming agreement. Correlations between psycholinguistic factors and lexical retrieval are reported and discussed. The MAVNT-LP was found to be a valid instrument for use in a clinical setting either as a planning tool or as a protocol used to stimulate eloquent brain areas under awake brain surgery.
Collapse
Affiliation(s)
- Loubna El Ouardi
- Applied Language and Culture Studies Lab, Chouaib Doukalli University, El Jadida, Morocco
| |
Collapse
|
117
|
Slivkoff S, Gallant JL. Design of complex neuroscience experiments using mixed-integer linear programming. Neuron 2021; 109:1433-1448. [PMID: 33689687 DOI: 10.1016/j.neuron.2021.02.019] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2020] [Revised: 02/05/2021] [Accepted: 02/16/2021] [Indexed: 11/29/2022]
Abstract
Over the past few decades, neuroscience experiments have become increasingly complex and naturalistic. Experimental design has in turn become more challenging, as experiments must conform to an ever-increasing diversity of design constraints. In this article, we demonstrate how this design process can be greatly assisted using an optimization tool known as mixed-integer linear programming (MILP). MILP provides a rich framework for incorporating many types of real-world design constraints into a neuroscience experiment. We introduce the mathematical foundations of MILP, compare MILP to other experimental design techniques, and provide four case studies of how MILP can be used to solve complex experimental design challenges.
Collapse
Affiliation(s)
- Storm Slivkoff
- Department of Bioengineering, University of California, Berkeley, Berkeley, CA 94720, USA
| | - Jack L Gallant
- Department of Bioengineering, University of California, Berkeley, Berkeley, CA 94720, USA; Department of Psychology, University of California, Berkeley, Berkeley, CA 94720, USA; Helen Wills Neuroscience Institute, University of California, Berkeley, Berkeley, CA 94720, USA.
| |
Collapse
|
118
|
Huang W, Yan H, Wang C, Yang X, Li J, Zuo Z, Zhang J, Chen H. Deep Natural Image Reconstruction from Human Brain Activity Based on Conditional Progressively Growing Generative Adversarial Networks. Neurosci Bull 2021; 37:369-379. [PMID: 33222145 PMCID: PMC7954952 DOI: 10.1007/s12264-020-00613-4] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2020] [Accepted: 06/16/2020] [Indexed: 01/01/2023] Open
Abstract
Brain decoding based on functional magnetic resonance imaging has recently enabled the identification of visual perception and mental states. However, due to the limitations of sample size and the lack of an effective reconstruction model, accurate reconstruction of natural images is still a major challenge. The current, rapid development of deep learning models provides the possibility of overcoming these obstacles. Here, we propose a deep learning-based framework that includes a latent feature extractor, a latent feature decoder, and a natural image generator, to achieve the accurate reconstruction of natural images from brain activity. The latent feature extractor is used to extract the latent features of natural images. The latent feature decoder predicts the latent features of natural images based on the response signals from the higher visual cortex. The natural image generator is applied to generate reconstructed images from the predicted latent features of natural images and the response signals from the visual cortex. Quantitative and qualitative evaluations were conducted with test images. The results showed that the reconstructed image achieved comparable, accurate reproduction of the presented image in both high-level semantic category information and low-level pixel information. The framework we propose shows promise for decoding the brain activity.
Collapse
Affiliation(s)
- Wei Huang
- The MOE Key Lab for Neuroinformation, University of Electronic Science and Technology of China, Chengdu, 610054, China
| | - Hongmei Yan
- The MOE Key Lab for Neuroinformation, University of Electronic Science and Technology of China, Chengdu, 610054, China.
| | - Chong Wang
- The MOE Key Lab for Neuroinformation, University of Electronic Science and Technology of China, Chengdu, 610054, China
| | - Xiaoqing Yang
- The MOE Key Lab for Neuroinformation, University of Electronic Science and Technology of China, Chengdu, 610054, China
| | - Jiyi Li
- The MOE Key Lab for Neuroinformation, University of Electronic Science and Technology of China, Chengdu, 610054, China
| | - Zhentao Zuo
- State Key Laboratory of Brain and Cognitive Science, Beijing MR Center for Brain Research, Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100101, China.
| | - Jiang Zhang
- Department of Medical Information Engineering, Sichuan University, Chengdu, 610065, China
| | - Huafu Chen
- The MOE Key Lab for Neuroinformation, University of Electronic Science and Technology of China, Chengdu, 610054, China.
| |
Collapse
|
119
|
Zhang Y, Tetrel L, Thirion B, Bellec P. Functional annotation of human cognitive states using deep graph convolution. Neuroimage 2021; 231:117847. [PMID: 33582272 DOI: 10.1016/j.neuroimage.2021.117847] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2020] [Revised: 02/04/2021] [Accepted: 02/05/2021] [Indexed: 01/17/2023] Open
Abstract
A key goal in neuroscience is to understand brain mechanisms of cognitive functions. An emerging approach is "brain decoding", which consists of inferring a set of experimental conditions performed by a participant, using pattern classification of brain activity. Few works so far have attempted to train a brain decoding model that would generalize across many different cognitive tasks drawn from multiple cognitive domains. To tackle this problem, we proposed a multidomain brain decoder that automatically learns the spatiotemporal dynamics of brain response within a short time window using a deep learning approach. We evaluated the decoding model on a large population of 1200 participants, under 21 different experimental conditions spanning six different cognitive domains, acquired from the Human Connectome Project task-fMRI database. Using a 10s window of fMRI response, the 21 cognitive states were identified with a test accuracy of 90% (chance level 4.8%). Performance remained good when using a 6s window (82%). It was even feasible to decode cognitive states from a single fMRI volume (720ms), with the performance following the shape of the hemodynamic response. Moreover, a saliency map analysis demonstrated that the high decoding performance was driven by the response of biologically meaningful brain regions. Together, we provide an automated tool to annotate human brain activity with fine temporal resolution and fine cognitive granularity. Our model shows potential applications as a reference model for domain adaptation, possibly making contributions in a variety of domains, including neurological and psychiatric disorders.
Collapse
Affiliation(s)
- Yu Zhang
- Centre de recherche de l'Institut universitaire de gériatrie de Montréal, Montreal, QC H3W 1W6, Canada; Department of Psychology, Université de Montréal, Montreal QC H3C 3J7, Canada
| | - Loïc Tetrel
- Centre de recherche de l'Institut universitaire de gériatrie de Montréal, Montreal, QC H3W 1W6, Canada
| | - Bertrand Thirion
- Parietal team, INRIA, Neurospin, CEA Saclay, Gif-sur-Yvette, France
| | - Pierre Bellec
- Centre de recherche de l'Institut universitaire de gériatrie de Montréal, Montreal, QC H3W 1W6, Canada; Department of Psychology, Université de Montréal, Montreal QC H3C 3J7, Canada.
| |
Collapse
|
120
|
Cui Y, Qiao K, Zhang C, Wang L, Yan B, Tong L. GaborNet Visual Encoding: A Lightweight Region-Based Visual Encoding Model With Good Expressiveness and Biological Interpretability. Front Neurosci 2021; 15:614182. [PMID: 33613179 PMCID: PMC7893978 DOI: 10.3389/fnins.2021.614182] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2020] [Accepted: 01/04/2021] [Indexed: 11/17/2022] Open
Abstract
Computational visual encoding models play a key role in understanding the stimulus-response characteristics of neuronal populations in the brain visual cortex. However, building such models typically faces challenges in the effective construction of non-linear feature spaces to fit the neuronal responses. In this work, we propose the GaborNet visual encoding (GaborNet-VE) model, a novel end-to-end encoding model for the visual ventral stream. This model comprises a Gabor convolutional layer, two regular convolutional layers, and a fully connected layer. The key design principle for the GaborNet-VE model is to replace regular convolutional kernels in the first convolutional layer with Gabor kernels with learnable parameters. One GaborNet-VE model efficiently and simultaneously encodes all voxels in one region of interest of functional magnetic resonance imaging data. The experimental results show that the proposed model achieves state-of-the-art prediction performance for the primary visual cortex. Moreover, the visualizations demonstrate the regularity of the region of interest fitting to the visual features and the estimated receptive fields. These results suggest that the lightweight region-based GaborNet-VE model based on combining handcrafted and deep learning features exhibits good expressiveness and biological interpretability.
Collapse
Affiliation(s)
| | | | | | | | | | - Li Tong
- Henan Key Laboratory of Imaging and Intelligent Processing, PLA Strategic Support Force Information Engineering University, Zhengzhou, China
| |
Collapse
|
121
|
Seeliger K, Ambrogioni L, Güçlütürk Y, van den Bulk LM, Güçlü U, van Gerven MAJ. End-to-end neural system identification with neural information flow. PLoS Comput Biol 2021; 17:e1008558. [PMID: 33539366 PMCID: PMC7888598 DOI: 10.1371/journal.pcbi.1008558] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2019] [Revised: 02/17/2021] [Accepted: 11/24/2020] [Indexed: 11/19/2022] Open
Abstract
Neural information flow (NIF) provides a novel approach for system identification in neuroscience. It models the neural computations in multiple brain regions and can be trained end-to-end via stochastic gradient descent from noninvasive data. NIF models represent neural information processing via a network of coupled tensors, each encoding the representation of the sensory input contained in a brain region. The elements of these tensors can be interpreted as cortical columns whose activity encodes the presence of a specific feature in a spatiotemporal location. Each tensor is coupled to the measured data specific to a brain region via low-rank observation models that can be decomposed into the spatial, temporal and feature receptive fields of a localized neuronal population. Both these observation models and the convolutional weights defining the information processing within regions are learned end-to-end by predicting the neural signal during sensory stimulation. We trained a NIF model on the activity of early visual areas using a large-scale fMRI dataset recorded in a single participant. We show that we can recover plausible visual representations and population receptive fields that are consistent with empirical findings.
Collapse
Affiliation(s)
- K. Seeliger
- Radboud University, Donders Institute for Brain, Cognition and Behaviour, Nijmegen, The Netherlands
- Max Planck Institute for Human Cognitive and Brain Sciences, Leipzig, Germany
| | - L. Ambrogioni
- Radboud University, Donders Institute for Brain, Cognition and Behaviour, Nijmegen, The Netherlands
| | - Y. Güçlütürk
- Radboud University, Donders Institute for Brain, Cognition and Behaviour, Nijmegen, The Netherlands
| | - L. M. van den Bulk
- Radboud University, Donders Institute for Brain, Cognition and Behaviour, Nijmegen, The Netherlands
| | - U. Güçlü
- Radboud University, Donders Institute for Brain, Cognition and Behaviour, Nijmegen, The Netherlands
| | - M. A. J. van Gerven
- Radboud University, Donders Institute for Brain, Cognition and Behaviour, Nijmegen, The Netherlands
| |
Collapse
|
122
|
Jolly E, Chang LJ. Multivariate spatial feature selection in fMRI. Soc Cogn Affect Neurosci 2021; 16:795-806. [PMID: 33501987 PMCID: PMC8343556 DOI: 10.1093/scan/nsab010] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2020] [Revised: 09/16/2020] [Accepted: 01/25/2021] [Indexed: 01/20/2023] Open
Abstract
Multivariate neuroimaging analyses constitute a powerful class of techniques to identify psychological representations. However, not all psychological processes are represented at the same spatial scale throughout the brain. This heterogeneity is apparent when comparing hierarchically organized local representations of perceptual processes to flexible transmodal representations of more abstract cognitive processes such as social and affective operations. An open question is how the spatial scale of analytic approaches interacts with the spatial scale of the representations under investigation. In this article, we describe how multivariate analyses can be viewed as existing on a spatial spectrum, anchored by searchlights used to identify locally distributed patterns of information on one end, whole brain approach used to identify diffuse neural representations at the other and region-based approaches in between. We describe how these distinctions are an important and often overlooked analytic consideration and provide heuristics to compare these different techniques to choose based on the analyst’s inferential goals.
Collapse
Affiliation(s)
- E Jolly
- Computational Social Affective Neuroscience Laboratory, Department of Psychological and Brain Science, Dartmouth College, Hanover, NH 03755, USA
| | - L J Chang
- Computational Social Affective Neuroscience Laboratory, Department of Psychological and Brain Science, Dartmouth College, Hanover, NH 03755, USA
| |
Collapse
|
123
|
Huang S, Sun L, Yousefnezhad M, Wang M, Zhang D. Temporal Information Guided Generative Adversarial Networks for Stimuli Image Reconstruction from Human Brain Activities. IEEE Trans Cogn Dev Syst 2021. [DOI: 10.1109/tcds.2021.3098743] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
|
124
|
Nakai T, Koide-Majima N, Nishimoto S. Correspondence of categorical and feature-based representations of music in the human brain. Brain Behav 2021; 11:e01936. [PMID: 33164348 PMCID: PMC7821620 DOI: 10.1002/brb3.1936] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/17/2020] [Revised: 09/24/2020] [Accepted: 10/21/2020] [Indexed: 01/11/2023] Open
Abstract
INTRODUCTION Humans tend to categorize auditory stimuli into discrete classes, such as animal species, language, musical instrument, and music genre. Of these, music genre is a frequently used dimension of human music preference and is determined based on the categorization of complex auditory stimuli. Neuroimaging studies have reported that the superior temporal gyrus (STG) is involved in response to general music-related features. However, there is considerable uncertainty over how discrete music categories are represented in the brain and which acoustic features are more suited for explaining such representations. METHODS We used a total of 540 music clips to examine comprehensive cortical representations and the functional organization of music genre categories. For this purpose, we applied a voxel-wise modeling approach to music-evoked brain activity measured using functional magnetic resonance imaging. In addition, we introduced a novel technique for feature-brain similarity analysis and assessed how discrete music categories are represented based on the cortical response pattern to acoustic features. RESULTS Our findings indicated distinct cortical organizations for different music genres in the bilateral STG, and they revealed representational relationships between different music genres. On comparing different acoustic feature models, we found that these representations of music genres could be explained largely by a biologically plausible spectro-temporal modulation-transfer function model. CONCLUSION Our findings have elucidated the quantitative representation of music genres in the human cortex, indicating the possibility of modeling this categorization of complex auditory stimuli based on brain activity.
Collapse
Affiliation(s)
- Tomoya Nakai
- Center for Information and Neural Networks, National Institute of Information and Communications Technology, Suita, Japan.,Graduate School of Frontier Biosciences, Osaka University, Suita, Japan
| | - Naoko Koide-Majima
- Graduate School of Frontier Biosciences, Osaka University, Suita, Japan.,AI Science Research and Development Promotion Center, National Institute of Information and Communications Technology, Suita, Japan
| | - Shinji Nishimoto
- Center for Information and Neural Networks, National Institute of Information and Communications Technology, Suita, Japan.,Graduate School of Frontier Biosciences, Osaka University, Suita, Japan.,Graduate School of Medicine, Osaka University, Suita, Japan
| |
Collapse
|
125
|
Wu H, Zheng N, Chen B. Feature-specific denoising of neural activity for natural image identification. IEEE Trans Cogn Dev Syst 2021. [DOI: 10.1109/tcds.2021.3062067] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
126
|
Livezey JA, Glaser JI. Deep learning approaches for neural decoding across architectures and recording modalities. Brief Bioinform 2020; 22:1577-1591. [PMID: 33372958 DOI: 10.1093/bib/bbaa355] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2020] [Revised: 10/31/2020] [Accepted: 11/04/2020] [Indexed: 12/19/2022] Open
Abstract
Decoding behavior, perception or cognitive state directly from neural signals is critical for brain-computer interface research and an important tool for systems neuroscience. In the last decade, deep learning has become the state-of-the-art method in many machine learning tasks ranging from speech recognition to image segmentation. The success of deep networks in other domains has led to a new wave of applications in neuroscience. In this article, we review deep learning approaches to neural decoding. We describe the architectures used for extracting useful features from neural recording modalities ranging from spikes to functional magnetic resonance imaging. Furthermore, we explore how deep learning has been leveraged to predict common outputs including movement, speech and vision, with a focus on how pretrained deep networks can be incorporated as priors for complex decoding targets like acoustic speech or images. Deep learning has been shown to be a useful tool for improving the accuracy and flexibility of neural decoding across a wide range of tasks, and we point out areas for future scientific development.
Collapse
Affiliation(s)
- Jesse A Livezey
- Neural Systems and Data Science Laboratory at the Lawrence Berkeley National Laboratory. He obtained his PhD in Physics from the University of California, Berkeley
| | - Joshua I Glaser
- Center for Theoretical Neuroscience and Department of Statistics at Columbia University. He obtained his PhD in Neuroscience from Northwestern University
| |
Collapse
|
127
|
Semantic Space Theory: A Computational Approach to Emotion. Trends Cogn Sci 2020; 25:124-136. [PMID: 33349547 DOI: 10.1016/j.tics.2020.11.004] [Citation(s) in RCA: 27] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2020] [Revised: 11/09/2020] [Accepted: 11/13/2020] [Indexed: 12/30/2022]
Abstract
Within affective science, the central line of inquiry, animated by basic emotion theory and constructivist accounts, has been the search for one-to-one mappings between six emotions and their subjective experiences, prototypical expressions, and underlying brain states. We offer an alternative perspective: semantic space theory. This computational approach uses wide-ranging naturalistic stimuli and open-ended statistical techniques to capture systematic variation in emotion-related behaviors. Upwards of 25 distinct varieties of emotional experience have distinct profiles of associated antecedents and expressions. These emotions are high-dimensional, categorical, and often blended. This approach also reveals that specific emotions, more than valence, organize emotional experience, expression, and neural processing. Overall, moving beyond traditional models to study broader semantic spaces of emotion can enrich our understanding of human experience.
Collapse
|
128
|
Bernhard SM, Lee J, Zhu M, Hsu A, Erskine A, Hires SA, Barth AL. An automated homecage system for multiwhisker detection and discrimination learning in mice. PLoS One 2020; 15:e0232916. [PMID: 33264281 PMCID: PMC7710058 DOI: 10.1371/journal.pone.0232916] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2020] [Accepted: 11/16/2020] [Indexed: 12/19/2022] Open
Abstract
Automated, homecage behavioral training for rodents has many advantages: it is low stress, requires little interaction with the experimenter, and can be easily manipulated to adapt to different experimental conditions. We have developed an inexpensive, Arduino-based, homecage training apparatus for sensory association training in freely-moving mice using multiwhisker air current stimulation coupled to a water reward. Animals learn this task readily, within 1–2 days of training, and performance progressively improves with training. We examined the parameters that regulate task acquisition using different stimulus intensities, directions, and reward valence. Learning was assessed by comparing anticipatory licking for the stimulus compared to the no-stimulus (blank) trials. At high stimulus intensities (>9 psi), animals showed markedly less participation in the task. Conversely, very weak air current intensities (1–2 psi) were not sufficient to generate rapid learning behavior. At intermediate stimulus intensities (5–6 psi), a majority of mice learned that the multiwhisker stimulus predicted the water reward after 24–48 hrs of training. Both exposure to isoflurane and lack of whiskers decreased animals’ ability to learn the task. Following training at an intermediate stimulus intensity, mice were able to transfer learning behavior when exposed to a lower stimulus intensity, an indicator of perceptual learning. Mice learned to discriminate between two directions of stimulation rapidly and accurately, even when the angular distance between the stimuli was <15 degrees. Switching the reward to a more desirable reward, aspartame, had little effect on learning trajectory. Our results show that a tactile association task in an automated homecage environment can be monitored by anticipatory licking to reveal rapid and progressive behavioral change. These Arduino-based, automated mouse cages enable high-throughput training that facilitate analysis of large numbers of genetically modified mice with targeted manipulations of neural activity.
Collapse
Affiliation(s)
- Sarah M. Bernhard
- Department of Psychology, Carnegie Mellon University, Pittsburgh, Pennsylvania, United States of America
| | - Jiseok Lee
- Department of Biological Sciences, Carnegie Mellon University, Pittsburgh, Pennsylvania, United States of America
| | - Mo Zhu
- Department of Biological Sciences, Carnegie Mellon University, Pittsburgh, Pennsylvania, United States of America
| | - Alex Hsu
- Department of Biological Sciences, Carnegie Mellon University, Pittsburgh, Pennsylvania, United States of America
| | - Andrew Erskine
- Department of Biological Sciences, Section of Neurobiology, University of Southern California, Los Angeles, California, United States of America
| | - Samuel A. Hires
- Department of Biological Sciences, Section of Neurobiology, University of Southern California, Los Angeles, California, United States of America
| | - Alison L. Barth
- Department of Biological Sciences, Carnegie Mellon University, Pittsburgh, Pennsylvania, United States of America
- * E-mail:
| |
Collapse
|
129
|
Nastase SA, Goldstein A, Hasson U. Keep it real: rethinking the primacy of experimental control in cognitive neuroscience. Neuroimage 2020; 222:117254. [PMID: 32800992 PMCID: PMC7789034 DOI: 10.1016/j.neuroimage.2020.117254] [Citation(s) in RCA: 99] [Impact Index Per Article: 24.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2019] [Revised: 07/08/2020] [Accepted: 08/04/2020] [Indexed: 01/17/2023] Open
Abstract
Naturalistic experimental paradigms in neuroimaging arose from a pressure to test the validity of models we derive from highly-controlled experiments in real-world contexts. In many cases, however, such efforts led to the realization that models developed under particular experimental manipulations failed to capture much variance outside the context of that manipulation. The critique of non-naturalistic experiments is not a recent development; it echoes a persistent and subversive thread in the history of modern psychology. The brain has evolved to guide behavior in a multidimensional world with many interacting variables. The assumption that artificially decoupling and manipulating these variables will lead to a satisfactory understanding of the brain may be untenable. We develop an argument for the primacy of naturalistic paradigms, and point to recent developments in machine learning as an example of the transformative power of relinquishing control. Naturalistic paradigms should not be deployed as an afterthought if we hope to build models of brain and behavior that extend beyond the laboratory into the real world.
Collapse
Affiliation(s)
- Samuel A Nastase
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ, USA.
| | - Ariel Goldstein
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ, USA
| | - Uri Hasson
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ, USA; Department of Psychology, Princeton University, Princeton, NJ, USA
| |
Collapse
|
130
|
Zheng H, Yao L, Chen M, Long Z. 3D Contrast Image Reconstruction From Human Brain Activity. IEEE Trans Neural Syst Rehabil Eng 2020; 28:2699-2710. [PMID: 33147146 DOI: 10.1109/tnsre.2020.3035818] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
Several studies demonstrated that functional magnetic resonance imaging (fMRI) signals in early visual cortex can be used to reconstruct 2-dimensional (2D) visual contents. However, it remains unknown how to reconstruct 3-dimensional (3D) visual stimuli from fMRI signals in visual cortex. 3D visual stimuli contain 2D visual features and depth information. Moreover, binocular disparity is an important cue for depth perception. Thus, it is more challenging to reconstruct 3D visual stimuli than 2D visual stimuli from the fMRI signals of visual cortex. This study aimed to reconstruct 3D visual images by constructing three decoding models: contrast-decoding, disparity-decoding and contrast-disparity-decoding models, and testing these models with fMRI data from humans viewing 3D contrast images. The results revealed that the 3D contrast stimuli can be reconstructed from the visual cortex. And the early visual regions (V1, V2) showed predominant advantages in reconstructing the contrast in 3D images for the contrast-decoding model. The dorsal visual regions (V3A, V7 and MT) showed predominant advantages in decoding the disparity in 3D images for the disparity-decoding model. The combination of the early and dorsal visual regions showed predominant advantages in decoding both the contrast and disparity for the contrast-disparity-decoding model. The results suggested that the contrast and disparity in 3D images were mainly represented in the early and dorsal visual regions separately. The two visual systems may interact with each other to decode 3D-contrast images.
Collapse
|
131
|
Decoding visual information from high-density diffuse optical tomography neuroimaging data. Neuroimage 2020; 226:117516. [PMID: 33137479 PMCID: PMC8006181 DOI: 10.1016/j.neuroimage.2020.117516] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2020] [Revised: 10/12/2020] [Accepted: 10/23/2020] [Indexed: 12/27/2022] Open
Abstract
Background: Neural decoding could be useful in many ways, from serving as a neuroscience research tool to providing a means of augmented communication for patients with neurological conditions. However, applications of decoding are currently constrained by the limitations of traditional neuroimaging modalities. Electrocorticography requires invasive neurosurgery, magnetic resonance imaging (MRI) is too cumbersome for uses like daily communication, and alternatives like functional near-infrared spectroscopy (fNIRS) offer poor image quality. High-density diffuse optical tomography (HD-DOT) is an emerging modality that uses denser optode arrays than fNIRS to combine logistical advantages of optical neuroimaging with enhanced image quality. Despite the resulting promise of HD-DOT for facilitating field applications of neuroimaging, decoding of brain activity as measured by HD-DOT has yet to be evaluated. Objective: To assess the feasibility and performance of decoding with HD-DOT in visual cortex. Methods and Results: To establish the feasibility of decoding at the single-trial level with HD-DOT, a template matching strategy was used to decode visual stimulus position. A receiver operating characteristic (ROC) analysis was used to quantify the sensitivity, specificity, and reproducibility of binary visual decoding. Mean areas under the curve (AUCs) greater than 0.97 across 10 imaging sessions in a highly sampled participant were observed. ROC analyses of decoding across 5 participants established both reproducibility in multiple individuals and the feasibility of inter-individual decoding (mean AUCs > 0.7), although decoding performance varied between individuals. Phase-encoded checkerboard stimuli were used to assess more complex, non-binary decoding with HD-DOT. Across 3 highly sampled participants, the phase of a 60° wide checkerboard wedge rotating 10° per second through 360° was decoded with a within-participant error of 25.8±24.7°. Decoding between participants was also feasible based on permutation-based significance testing. Conclusions: Visual stimulus information can be decoded accurately, reproducibly, and across a range of detail (for both binary and non-binary outcomes) at the single-trial level (without needing to block-average test data) using HD-DOT data. These results lay the foundation for future studies of more complex decoding with HD-DOT and applications in clinical populations.
Collapse
|
132
|
Wang C, Yan H, Huang W, Li J, Yang J, Li R, Zhang L, Li L, Zhang J, Zuo Z, Chen H. ‘When’ and ‘what’ did you see? A novel fMRI-based visual decoding framework. J Neural Eng 2020; 17:056013. [DOI: 10.1088/1741-2552/abb691] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
|
133
|
Aliko S, Huang J, Gheorghiu F, Meliss S, Skipper JI. A naturalistic neuroimaging database for understanding the brain using ecological stimuli. Sci Data 2020; 7:347. [PMID: 33051448 PMCID: PMC7555491 DOI: 10.1038/s41597-020-00680-2] [Citation(s) in RCA: 30] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2020] [Accepted: 09/16/2020] [Indexed: 12/22/2022] Open
Abstract
Neuroimaging has advanced our understanding of human psychology using reductionist stimuli that often do not resemble information the brain naturally encounters. It has improved our understanding of the network organization of the brain mostly through analyses of 'resting-state' data for which the functions of networks cannot be verifiably labelled. We make a 'Naturalistic Neuroimaging Database' (NNDb v1.0) publically available to allow for a more complete understanding of the brain under more ecological conditions during which networks can be labelled. Eighty-six participants underwent behavioural testing and watched one of 10 full-length movies while functional magnetic resonance imaging was acquired. Resulting timeseries data are shown to be of high quality, with good signal-to-noise ratio, few outliers and low movement. Data-driven functional analyses provide further evidence of data quality. They also demonstrate accurate timeseries/movie alignment and how movie annotations might be used to label networks. The NNDb can be used to answer questions previously unaddressed with standard neuroimaging approaches, progressing our knowledge of how the brain works in the real world.
Collapse
Affiliation(s)
- Sarah Aliko
- London Interdisciplinary Biosciences Consortium, University College London, London, UK.
- Experimental Psychology, University College London, London, UK.
| | - Jiawen Huang
- Experimental Psychology, University College London, London, UK
| | | | - Stefanie Meliss
- Experimental Psychology, University College London, London, UK
- School of Psychology and Clinical Language Sciences, University of Reading, Reading, UK
| | | |
Collapse
|
134
|
Oldfield CS, Grossrubatscher I, Chávez M, Hoagland A, Huth AR, Carroll EC, Prendergast A, Qu T, Gallant JL, Wyart C, Isacoff EY. Experience, circuit dynamics, and forebrain recruitment in larval zebrafish prey capture. eLife 2020; 9:e56619. [PMID: 32985972 PMCID: PMC7561350 DOI: 10.7554/elife.56619] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2020] [Accepted: 09/26/2020] [Indexed: 01/16/2023] Open
Abstract
Experience influences behavior, but little is known about how experience is encoded in the brain, and how changes in neural activity are implemented at a network level to improve performance. Here we investigate how differences in experience impact brain circuitry and behavior in larval zebrafish prey capture. We find that experience of live prey compared to inert food increases capture success by boosting capture initiation. In response to live prey, animals with and without prior experience of live prey show activity in visual areas (pretectum and optic tectum) and motor areas (cerebellum and hindbrain), with similar visual area retinotopic maps of prey position. However, prey-experienced animals more readily initiate capture in response to visual area activity and have greater visually-evoked activity in two forebrain areas: the telencephalon and habenula. Consequently, disruption of habenular neurons reduces capture performance in prey-experienced fish. Together, our results suggest that experience of prey strengthens prey-associated visual drive to the forebrain, and that this lowers the threshold for prey-associated visual activity to trigger activity in motor areas, thereby improving capture performance.
Collapse
Affiliation(s)
- Claire S Oldfield
- Helen Wills Neuroscience Institute and Graduate Program, University of California BerkeleyBerkeleyUnited States
| | - Irene Grossrubatscher
- Helen Wills Neuroscience Institute and Graduate Program, University of California BerkeleyBerkeleyUnited States
| | | | - Adam Hoagland
- Department of Molecular and Cell Biology, University of California BerkeleyBerkeleyUnited States
| | - Alex R Huth
- Helen Wills Neuroscience Institute and Graduate Program, University of California BerkeleyBerkeleyUnited States
| | - Elizabeth C Carroll
- Department of Molecular and Cell Biology, University of California BerkeleyBerkeleyUnited States
| | - Andrew Prendergast
- CNRS-UMRParisFrance
- INSERM UMRSParisFrance
- Institut du Cerveau et de la Moelle épinière (ICM), Hôpital de la Pitié-SalpêtrièreParisFrance
| | - Tony Qu
- Department of Molecular and Cell Biology, University of California BerkeleyBerkeleyUnited States
| | - Jack L Gallant
- Helen Wills Neuroscience Institute and Graduate Program, University of California BerkeleyBerkeleyUnited States
- Department of Psychology, University of California, BerkeleyBerkeleyUnited States
| | - Claire Wyart
- CNRS-UMRParisFrance
- INSERM UMRSParisFrance
- Institut du Cerveau et de la Moelle épinière (ICM), Hôpital de la Pitié-SalpêtrièreParisFrance
| | - Ehud Y Isacoff
- Helen Wills Neuroscience Institute and Graduate Program, University of California BerkeleyBerkeleyUnited States
- Department of Molecular and Cell Biology, University of California BerkeleyBerkeleyUnited States
- Bioscience Division, Lawrence Berkeley National LaboratoryBerkeleyUnited States
| |
Collapse
|
135
|
Ellis CT, Skalaban LJ, Yates TS, Bejjanki VR, Córdova NI, Turk-Browne NB. Re-imagining fMRI for awake behaving infants. Nat Commun 2020; 11:4523. [PMID: 32908125 PMCID: PMC7481790 DOI: 10.1038/s41467-020-18286-y] [Citation(s) in RCA: 32] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2020] [Accepted: 08/10/2020] [Indexed: 11/09/2022] Open
Abstract
Thousands of functional magnetic resonance imaging (fMRI) studies have provided important insight into the human brain. However, only a handful of these studies tested infants while they were awake, because of the significant and unique methodological challenges involved. We report our efforts to address these challenges, with the goal of creating methods for awake infant fMRI that can reveal the inner workings of the developing, preverbal mind. We use these methods to collect and analyze two fMRI datasets obtained from infants during cognitive tasks, released publicly with this paper. In these datasets, we explore and evaluate data quantity and quality, task-evoked activity, and preprocessing decisions. We disseminate these methods by sharing two software packages that integrate infant-friendly cognitive tasks and eye-gaze monitoring with fMRI acquisition and analysis. These resources make fMRI a feasible and accessible technique for cognitive neuroscience in awake and behaving human infants.
Collapse
Affiliation(s)
- C T Ellis
- Department of Psychology, Yale University, New Haven, CT, 06511, USA
| | - L J Skalaban
- Department of Psychology, Yale University, New Haven, CT, 06511, USA
| | - T S Yates
- Department of Psychology, Yale University, New Haven, CT, 06511, USA
| | - V R Bejjanki
- Department of Psychology, Hamilton College, Clinton, NY, 13323, USA
| | - N I Córdova
- Department of Psychology, Yale University, New Haven, CT, 06511, USA
| | - N B Turk-Browne
- Department of Psychology, Yale University, New Haven, CT, 06511, USA.
| |
Collapse
|
136
|
Nestor A, Lee ACH, Plaut DC, Behrmann M. The Face of Image Reconstruction: Progress, Pitfalls, Prospects. Trends Cogn Sci 2020; 24:747-759. [PMID: 32674958 PMCID: PMC7429291 DOI: 10.1016/j.tics.2020.06.006] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2020] [Revised: 05/27/2020] [Accepted: 06/15/2020] [Indexed: 10/23/2022]
Abstract
Recent research has demonstrated that neural and behavioral data acquired in response to viewing face images can be used to reconstruct the images themselves. However, the theoretical implications, promises, and challenges of this direction of research remain unclear. We evaluate the potential of this research for elucidating the visual representations underlying face recognition. Specifically, we outline complementary and converging accounts of the visual content, the representational structure, and the neural dynamics of face processing. We illustrate how this research addresses fundamental questions in the study of normal and impaired face recognition, and how image reconstruction provides a powerful framework for uncovering face representations, for unifying multiple types of empirical data, and for facilitating both theoretical and methodological progress.
Collapse
Affiliation(s)
- Adrian Nestor
- Department of Psychology at Scarborough, University of Toronto, Toronto, Ontario, Canada.
| | - Andy C H Lee
- Department of Psychology at Scarborough, University of Toronto, Toronto, Ontario, Canada; Rotman Research Institute, Baycrest Centre, Toronto, Ontario, Canada
| | - David C Plaut
- Department of Psychology, Carnegie Mellon University, Pittsburgh, PA, USA; Carnegie Mellon Neuroscience Institute, Pittsburgh, PA, USA
| | - Marlene Behrmann
- Department of Psychology, Carnegie Mellon University, Pittsburgh, PA, USA; Carnegie Mellon Neuroscience Institute, Pittsburgh, PA, USA
| |
Collapse
|
137
|
Yılmaz Ö, Çelik E, Çukur T. Informed feature regularization in voxelwise modeling for naturalistic fMRI experiments. Eur J Neurosci 2020; 52:3394-3410. [PMID: 32343012 PMCID: PMC9748846 DOI: 10.1111/ejn.14760] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2019] [Revised: 03/18/2020] [Accepted: 04/21/2020] [Indexed: 12/16/2022]
Abstract
Voxelwise modeling is a powerful framework to predict single-voxel functional selectivity for the stimulus features that exist in complex natural stimuli. Yet, because VM disregards potential correlations across stimulus features or neighboring voxels, it may yield suboptimal sensitivity in measuring functional selectivity in the presence of high levels of measurement noise. Here, we introduce a novel voxelwise modeling approach that simultaneously utilizes stimulus correlations in model features and response correlations among voxel neighborhoods. The proposed method performs feature and spatial regularization while still generating single-voxel response predictions. We demonstrated the performance of our approach on a functional magnetic resonance imaging dataset from a natural vision experiment. Compared to VM, the proposed method yields clear improvements in prediction performance, together with increased feature coherence and spatial coherence of voxelwise models. Overall, the proposed method can offer improved sensitivity in modeling of single voxels in naturalistic functional magnetic resonance imaging experiments.
Collapse
Affiliation(s)
- Özgür Yılmaz
- National Magnetic Resonance Research Center, Bilkent University, Ankara, Turkey,Department of Electrical and Electronics Engineering, Bilkent University, Ankara, Turkey
| | - Emin Çelik
- National Magnetic Resonance Research Center, Bilkent University, Ankara, Turkey,Neuroscience Program, Sabuncu Brain Research Center, Bilkent University, Ankara, Turkey
| | - Tolga Çukur
- National Magnetic Resonance Research Center, Bilkent University, Ankara, Turkey,Department of Electrical and Electronics Engineering, Bilkent University, Ankara, Turkey,Neuroscience Program, Sabuncu Brain Research Center, Bilkent University, Ankara, Turkey
| |
Collapse
|
138
|
Koide-Majima N, Nakai T, Nishimoto S. Distinct dimensions of emotion in the human brain and their representation on the cortical surface. Neuroimage 2020; 222:117258. [PMID: 32798681 DOI: 10.1016/j.neuroimage.2020.117258] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2019] [Revised: 08/02/2020] [Accepted: 08/05/2020] [Indexed: 12/24/2022] Open
Abstract
We experience a rich variety of emotions in daily life, and a fundamental goal of affective neuroscience is to determine how these emotions are represented in the brain. Recent psychological studies have used naturalistic stimuli (e.g., movies) to reveal high dimensional representational structures of diverse daily-life emotions. However, relatively little is known about how such diverse emotions are represented in the brain because most of the affective neuroscience studies have used only a small number of controlled stimuli. To reveal that, we measured functional MRI to obtain blood-oxygen-level-dependent (BOLD) responses from human subjects while they watched emotion-inducing audiovisual movies over a period of 3 hours. For each of the one-second movie scenes, we annotated the movies with respect to 80 emotions selected based on a wide range of previous emotion literature. By quantifying canonical correlations between the emotion ratings and the BOLD responses, the results suggest that around 25 distinct dimensions (ranging from 18 to 36 and being subject-dependent) of the emotion ratings contribute to emotion representations in the brain. For demonstrating how the 80 emotion categories were represented in the cortical surface, we visualized a continuous semantic space of the emotion representation and mapped it on the cortical surface. We found that the emotion categories were changed from unimodal to transmodal regions on the cortical surface. This study presents a cortical representation of a rich variety of emotion categories, which covers many of the emotional experiences of daily living.
Collapse
Affiliation(s)
| | - Tomoya Nakai
- Center for Information and Neural Networks (CiNet), National Institute of Information and Communications Technology, Osaka, Japan; Graduate School of Frontier Biosciences, Osaka University, Osaka, Japan
| | - Shinji Nishimoto
- Center for Information and Neural Networks (CiNet), National Institute of Information and Communications Technology, Osaka, Japan; Graduate School of Frontier Biosciences, Osaka University, Osaka, Japan; Graduate School of Medicine, Osaka University, Osaka, Japan.
| |
Collapse
|
139
|
Haxby JV, Gobbini MI, Nastase SA. Naturalistic stimuli reveal a dominant role for agentic action in visual representation. Neuroimage 2020; 216:116561. [DOI: 10.1016/j.neuroimage.2020.116561] [Citation(s) in RCA: 32] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2019] [Revised: 01/07/2020] [Accepted: 01/14/2020] [Indexed: 11/26/2022] Open
|
140
|
Shahdloo M, Çelik E, Çukur T. Biased competition in semantic representation during natural visual search. Neuroimage 2020; 216:116383. [DOI: 10.1016/j.neuroimage.2019.116383] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2019] [Revised: 10/31/2019] [Accepted: 11/18/2019] [Indexed: 10/25/2022] Open
|
141
|
DuPre E, Hanke M, Poline JB. Nature abhors a paywall: How open science can realize the potential of naturalistic stimuli. Neuroimage 2020; 216:116330. [PMID: 31704292 PMCID: PMC7198323 DOI: 10.1016/j.neuroimage.2019.116330] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2019] [Revised: 10/31/2019] [Accepted: 10/31/2019] [Indexed: 11/26/2022] Open
Abstract
Naturalistic stimuli show significant potential to inform behavioral, cognitive, and clinical neuroscience. To date, this impact is still limited by the relative inaccessibility of both generated neuroimaging data as well as the supporting naturalistic stimuli. In this perspective, we highlight currently available naturalistic datasets and technical solutions such as DataLad that continue to advance our ability to share this data. We also review scientific and sociological challenges in selecting naturalistic stimuli for reproducible research. Overall, we encourage researchers to share their naturalistic datasets to the full extent possible under local copyright law.
Collapse
Affiliation(s)
| | - Michael Hanke
- Institute of Neuroscience and Medicine, Brain & Behaviour (INM-7), Research Centre Jülich, Jülich, Germany; Institute of Systems Neuroscience, Medical Faculty, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
| | | |
Collapse
|
142
|
Simony E, Chang C. Analysis of stimulus-induced brain dynamics during naturalistic paradigms. Neuroimage 2020; 216:116461. [PMID: 31843711 PMCID: PMC7418522 DOI: 10.1016/j.neuroimage.2019.116461] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2019] [Revised: 10/19/2019] [Accepted: 12/10/2019] [Indexed: 10/28/2022] Open
Abstract
Naturalistic stimuli offer promising avenues for investigating brain function across the rich, realistic spectrum of human experiences. Functional magnetic resonance imaging (fMRI) studies of brain activity during naturalistic paradigms have provided new information about dynamic neural processing in ecologically valid contexts. Yet, the complex, uncontrolled nature of such stimuli -- and the resulting mixture of neuronal and physiological responses embedded within the fMRI signals -- present challenges with respect to data analysis and interpretation. In this brief commentary, we discuss methods and open challenges in naturalistic fMRI investigations, with a focus on extracting and interpreting stimulus-induced fMRI signals.
Collapse
Affiliation(s)
- Erez Simony
- Faculty of Electrical Engineering, Holon Institute of Technology, Holon, Israel.
| | - Catie Chang
- Department of Electrical Engineering and Computer Science, Vanderbilt University, Nashville, TN, USA; Department of Biomedical Engineering, Vanderbilt University, Nashville, TN, USA; Vanderbilt University Institute of Imaging Science, Vanderbilt University, Nashville, TN, USA.
| |
Collapse
|
143
|
van Assen JJR, Nishida S, Fleming RW. Visual perception of liquids: Insights from deep neural networks. PLoS Comput Biol 2020; 16:e1008018. [PMID: 32813688 PMCID: PMC7437867 DOI: 10.1371/journal.pcbi.1008018] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2019] [Accepted: 06/05/2020] [Indexed: 11/19/2022] Open
Abstract
Visually inferring material properties is crucial for many tasks, yet poses significant computational challenges for biological vision. Liquids and gels are particularly challenging due to their extreme variability and complex behaviour. We reasoned that measuring and modelling viscosity perception is a useful case study for identifying general principles of complex visual inferences. In recent years, artificial Deep Neural Networks (DNNs) have yielded breakthroughs in challenging real-world vision tasks. However, to model human vision, the emphasis lies not on best possible performance, but on mimicking the specific pattern of successes and errors humans make. We trained a DNN to estimate the viscosity of liquids using 100.000 simulations depicting liquids with sixteen different viscosities interacting in ten different scenes (stirring, pouring, splashing, etc). We find that a shallow feedforward network trained for only 30 epochs predicts mean observer performance better than most individual observers. This is the first successful image-computable model of human viscosity perception. Further training improved accuracy, but predicted human perception less well. We analysed the network's features using representational similarity analysis (RSA) and a range of image descriptors (e.g. optic flow, colour saturation, GIST). This revealed clusters of units sensitive to specific classes of feature. We also find a distinct population of units that are poorly explained by hand-engineered features, but which are particularly important both for physical viscosity estimation, and for the specific pattern of human responses. The final layers represent many distinct stimulus characteristics-not just viscosity, which the network was trained on. Retraining the fully-connected layer with a reduced number of units achieves practically identical performance, but results in representations focused on viscosity, suggesting that network capacity is a crucial parameter determining whether artificial or biological neural networks use distributed vs. localized representations.
Collapse
Affiliation(s)
- Jan Jaap R. van Assen
- Human Information Science Laboratory, NTT Communication Science Laboratories, Nippon Telegraph and Telephone Corporation, Atsugi, Kanagawa, Japan
| | - Shin’ya Nishida
- Human Information Science Laboratory, NTT Communication Science Laboratories, Nippon Telegraph and Telephone Corporation, Atsugi, Kanagawa, Japan
- Graduate School of Informatics, Kyoto University, Yoshida-honmachi, Sakyo-ku, Kyoto, Japan
| | - Roland W. Fleming
- Department of Experimental Psychology, University of Giessen, Giessen, Hessen, Germany
- Centre for Mind, Brain and Behaviour (CMBB), University of Marburg and Justus Liebig University, Giessen, Germany
| |
Collapse
|
144
|
Rosenke M, Davidenko N, Grill-Spector K, Weiner KS. Combined Neural Tuning in Human Ventral Temporal Cortex Resolves the Perceptual Ambiguity of Morphed 2D Images. Cereb Cortex 2020; 30:4882-4898. [PMID: 32372098 PMCID: PMC7391265 DOI: 10.1093/cercor/bhaa081] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
Abstract
We have an amazing ability to categorize objects in the world around us. Nevertheless, how cortical regions in human ventral temporal cortex (VTC), which is critical for categorization, support this behavioral ability, is largely unknown. Here, we examined the relationship between neural responses and behavioral performance during the categorization of morphed silhouettes of faces and hands, which are animate categories processed in cortically adjacent regions in VTC. Our results reveal that the combination of neural responses from VTC face- and body-selective regions more accurately explains behavioral categorization than neural responses from either region alone. Furthermore, we built a model that predicts a person's behavioral performance using estimated parameters of brain-behavior relationships from a different group of people. Moreover, we show that this brain-behavior model generalizes to adjacent face- and body-selective regions in lateral occipitotemporal cortex. Thus, while face- and body-selective regions are located within functionally distinct domain-specific networks, cortically adjacent regions from both networks likely integrate neural responses to resolve competing and perceptually ambiguous information from both categories.
Collapse
Affiliation(s)
- Mona Rosenke
- Psychology Department, Stanford University, Stanford, CA 94305, USA
| | - Nicolas Davidenko
- Psychology Department, University of California, Santa Cruz, Santa Cruz, CA 95064, USA
| | - Kalanit Grill-Spector
- Psychology Department, Stanford University, Stanford, CA 94305, USA
- Neuroscience Institute, Stanford University, Stanford, CA 94305, USA
| | - Kevin S Weiner
- Psychology Department, University of California, Berkeley, Berkeley, CA 94720, USA
- Helen Wills Neuroscience Institute, University of California, Berkeley, Berkeley, CA 94720, USA
| |
Collapse
|
145
|
Brain meta-state transitions demarcate thoughts across task contexts exposing the mental noise of trait neuroticism. Nat Commun 2020; 11:3480. [PMID: 32661242 PMCID: PMC7359033 DOI: 10.1038/s41467-020-17255-9] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2019] [Accepted: 05/26/2020] [Indexed: 12/15/2022] Open
Abstract
Researchers have observed large-scale neural meta-state transitions that align to narrative events during movie-viewing. However, group or training-derived priors have been needed to detect them. Here, we introduce methods to sample transitions without any priors. Transitions detected by our methods predict narrative events, are similar across task and rest, and are correlated with activation of regions associated with spontaneous thought. Based on the centrality of semantics to thought, we argue these transitions serve as general, implicit neurobiological markers of new thoughts, and that their frequency, which is stable across contexts, approximates participants' mentation rate. By enabling observation of idiosyncratic transitions, our approach supports many applications, including phenomenological access to the black box of resting cognition. To illustrate the utility of this access, we regress resting fMRI transition rate and movie-viewing transition conformity against trait neuroticism, thereby providing a first neural confirmation of mental noise theory.
Collapse
|
146
|
Cai MB, Shvartsman M, Wu A, Zhang H, Zhu X. Incorporating structured assumptions with probabilistic graphical models in fMRI data analysis. Neuropsychologia 2020; 144:107500. [PMID: 32433952 PMCID: PMC7387580 DOI: 10.1016/j.neuropsychologia.2020.107500] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2019] [Revised: 05/09/2020] [Accepted: 05/15/2020] [Indexed: 01/27/2023]
Abstract
With the wide adoption of functional magnetic resonance imaging (fMRI) by cognitive neuroscience researchers, large volumes of brain imaging data have been accumulated in recent years. Aggregating these data to derive scientific insights often faces the challenge that fMRI data are high-dimensional, heterogeneous across people, and noisy. These challenges demand the development of computational tools that are tailored both for the neuroscience questions and for the properties of the data. We review a few recently developed algorithms in various domains of fMRI research: fMRI in naturalistic tasks, analyzing full-brain functional connectivity, pattern classification, inferring representational similarity and modeling structured residuals. These algorithms all tackle the challenges in fMRI similarly: they start by making clear statements of assumptions about neural data and existing domain knowledge, incorporate those assumptions and domain knowledge into probabilistic graphical models, and use those models to estimate properties of interest or latent structures in the data. Such approaches can avoid erroneous findings, reduce the impact of noise, better utilize known properties of the data, and better aggregate data across groups of subjects. With these successful cases, we advocate wider adoption of explicit model construction in cognitive neuroscience. Although we focus on fMRI, the principle illustrated here is generally applicable to brain data of other modalities.
Collapse
Affiliation(s)
- Ming Bo Cai
- International Research Center for Neurointelligence (WPI-IRCN), UTIAS, The University of Tokyo, Japan; Princeton Neuroscience Institute, Princeton University, United States.
| | | | - Anqi Wu
- Mortimer B. Zuckerman Mind Brain Behavior Institute, Columbia University, United States
| | - Hejia Zhang
- Department of Electrical Engineering, Princeton University, United States
| | - Xia Zhu
- Intel Corporation, United States
| |
Collapse
|
147
|
Abstract
Support vector machines (SVMs) are being used increasingly in affective science as a data-driven classification method and feature reduction technique. Whereas traditional statistical methods typically compare group averages on selected variables, SVMs use a predictive algorithm to learn multivariate patterns that optimally discriminate between groups. In this review, we provide a framework for understanding the methods of SVM-based analyses and summarize the findings of seminal studies that use SVMs for classification or data reduction in the behavioral and neural study of emotion and affective disorders. We conclude by discussing promising directions and potential applications of SVMs in future research in affective science.
Collapse
Affiliation(s)
| | - Matthew D. Sacchet
- Center for Depression, Anxiety, and Stress Research, McLean Hospital, Harvard Medical School, USA
| | - Ian H. Gotlib
- Department of Psychology, Stanford Neurosciences Institute, Stanford University, USA
| |
Collapse
|
148
|
Neuroscience-based Psychiatric Assessments of Criminal Responsibility: Beyond Self-Report? Camb Q Healthc Ethics 2020; 29:446-458. [PMID: 32484135 DOI: 10.1017/s0963180120000195] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Many legal systems have an insanity defense, which means that although a person has committed a crime, she is not held criminally responsible for the act. A challenge with regard to these assessments is that forensic psychiatrists have to rely to a considerable extent on the defendant's self-report. Could neuroscience be a way to make these evaluations more objective? The current value of neuroimaging in insanity assessments will be examined. The author argues that neuroscience can be valuable for diagnosing neurological illnesses, rather than psychiatric disorders. Next, he discusses to what extent neurotechnological 'mind reading' techniques, if they would become available in the future, could be useful to get beyond self-report in forensic psychiatry.
Collapse
|
149
|
Haxby JV, Guntupalli JS, Nastase SA, Feilong M. Hyperalignment: Modeling shared information encoded in idiosyncratic cortical topographies. eLife 2020; 9:e56601. [PMID: 32484439 PMCID: PMC7266639 DOI: 10.7554/elife.56601] [Citation(s) in RCA: 63] [Impact Index Per Article: 15.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2020] [Accepted: 05/14/2020] [Indexed: 01/13/2023] Open
Abstract
Information that is shared across brains is encoded in idiosyncratic fine-scale functional topographies. Hyperalignment captures shared information by projecting pattern vectors for neural responses and connectivities into a common, high-dimensional information space, rather than by aligning topographies in a canonical anatomical space. Individual transformation matrices project information from individual anatomical spaces into the common model information space, preserving the geometry of pairwise dissimilarities between pattern vectors, and model cortical topography as mixtures of overlapping, individual-specific topographic basis functions, rather than as contiguous functional areas. The fundamental property of brain function that is preserved across brains is information content, rather than the functional properties of local features that support that content. In this Perspective, we present the conceptual framework that motivates hyperalignment, its computational underpinnings for joint modeling of a common information space and idiosyncratic cortical topographies, and discuss implications for understanding the structure of cortical functional architecture.
Collapse
Affiliation(s)
- James V Haxby
- Center for Cognitive Neuroscience, Dartmouth CollegeHanoverUnited States
| | | | | | - Ma Feilong
- Center for Cognitive Neuroscience, Dartmouth CollegeHanoverUnited States
| |
Collapse
|
150
|
Pfeiffer C, Hollenstein N, Zhang C, Langer N. Neural dynamics of sentiment processing during naturalistic sentence reading. Neuroimage 2020; 218:116934. [PMID: 32416227 DOI: 10.1016/j.neuroimage.2020.116934] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2019] [Revised: 04/24/2020] [Accepted: 05/07/2020] [Indexed: 12/15/2022] Open
Abstract
When we read, our eyes move through the text in a series of fixations and high-velocity saccades to extract visual information. This process allows the brain to obtain meaning, e.g., about sentiment, or the emotional valence, expressed in the written text. How exactly the brain extracts the sentiment of single words during naturalistic reading is largely unknown. This is due to the challenges of naturalistic imaging, which has previously led researchers to employ highly controlled, timed word-by-word presentations of custom reading materials that lack ecological validity. Here, we aimed to assess the electrical neural correlates of word sentiment processing during naturalistic reading of English sentences. We used a publicly available dataset of simultaneous electroencephalography (EEG), eye-tracking recordings, and word-level semantic annotations from 7129 words in 400 sentences (Zurich Cognitive Language Processing Corpus; Hollenstein et al., 2018). We computed fixation-related potentials (FRPs), which are evoked electrical responses time-locked to the onset of fixations. A general linear mixed model analysis of FRPs cleaned from visual- and motor-evoked activity showed a topographical difference between the positive and negative sentiment condition in the 224-304 ms interval after fixation onset in left-central and right-posterior electrode clusters. An additional analysis that included word-, phrase-, and sentence-level sentiment predictors showed the same FRP differences for the word-level sentiment, but no additional FRP differences for phrase- and sentence-level sentiment. Furthermore, decoding analysis that classified word sentiment (positive or negative) from sentiment-matched 40-trial average FRPs showed a 0.60 average accuracy (95% confidence interval: [0.58, 0.61]). Control analyses ruled out that these results were based on differences in eye movements or linguistic features other than word sentiment. Our results extend previous research by showing that the emotional valence of lexico-semantic stimuli evoke a fast electrical neural response upon word fixation during naturalistic reading. These results provide an important step to identify the neural processes of lexico-semantic processing in ecologically valid conditions and can serve to improve computer algorithms for natural language processing.
Collapse
Affiliation(s)
- Christian Pfeiffer
- Methods of Plasticity Research Laboratory, Department of Psychology, University of Zurich, Switzerland; University Research Priority Program (URPP) Dynamics of Healthy Aging, Zurich, Switzerland.
| | | | - Ce Zhang
- Department of Computer Science, ETH, Zurich, Switzerland
| | - Nicolas Langer
- Methods of Plasticity Research Laboratory, Department of Psychology, University of Zurich, Switzerland; University Research Priority Program (URPP) Dynamics of Healthy Aging, Zurich, Switzerland; Neuroscience Center Zurich (ZNZ), Zurich, Switzerland
| |
Collapse
|