1
|
Zhu S, Oh YJ, Trepka EB, Chen X, Moore T. Dependence of Contextual Modulation in Macaque V1 on Interlaminar Signal Flow. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.04.18.590176. [PMID: 38659877 PMCID: PMC11042257 DOI: 10.1101/2024.04.18.590176] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/26/2024]
Abstract
In visual cortex, neural correlates of subjective perception can be generated by modulation of activity from beyond the classical receptive field (CRF). In macaque V1, activity generated by nonclassical receptive field (nCRF) stimulation involves different intracortical circuitry than activity generated by CRF stimulation, suggesting that interactions between neurons across V1 layers differ under CRF and nCRF stimulus conditions. Using Neuropixels probes, we measured border ownership modulation within large, local populations of V1 neurons. We found that neurons in single columns preferred the same side of objects located outside of the CRF. In addition, we found that cross-correlations between pairs of neurons situated across feedback/horizontal and input layers differed between CRF and nCRF stimulation. Furthermore, independent of the comparison with CRF stimulation, we observed that the magnitude of border ownership modulation increased with the proportion of information flow from feedback/horizontal layers to input layers. These results demonstrate that the flow of signals between layers covaries with the degree to which neurons integrate information from beyond the CRF.
Collapse
|
2
|
Muller L, Churchland PS, Sejnowski TJ. Transformers and cortical waves: encoders for pulling in context across time. Trends Neurosci 2024; 47:788-802. [PMID: 39341729 DOI: 10.1016/j.tins.2024.08.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2024] [Revised: 06/07/2024] [Accepted: 08/09/2024] [Indexed: 10/01/2024]
Abstract
The capabilities of transformer networks such as ChatGPT and other large language models (LLMs) have captured the world's attention. The crucial computational mechanism underlying their performance relies on transforming a complete input sequence - for example, all the words in a sentence - into a long 'encoding vector' that allows transformers to learn long-range temporal dependencies in naturalistic sequences. Specifically, 'self-attention' applied to this encoding vector enhances temporal context in transformers by computing associations between pairs of words in the input sequence. We suggest that waves of neural activity traveling across single cortical areas, or multiple regions on the whole-brain scale, could implement a similar encoding principle. By encapsulating recent input history into a single spatial pattern at each moment in time, cortical waves may enable a temporal context to be extracted from sequences of sensory inputs, the same computational principle as that used in transformers.
Collapse
Affiliation(s)
- Lyle Muller
- Department of Mathematics, Western University, London, Ontario, Canada; Fields Laboratory for Network Science, Fields Institute, Toronto, Ontario, Canada.
| | - Patricia S Churchland
- Department of Philosophy, University of California at San Diego, San Diego, CA, USA.
| | - Terrence J Sejnowski
- Computational Neurobiology Laboratory, Salk Institute for Biological Studies, San Diego, CA, USA; Department of Neurobiology, University of California at San Diego, San Diego, CA, USA.
| |
Collapse
|
3
|
Luo J, Yokoi I, Dumoulin SO, Takemura H. Bistable perception of symbolic numbers. J Vis 2024; 24:12. [PMID: 39287596 PMCID: PMC11421664 DOI: 10.1167/jov.24.9.12] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2024] [Accepted: 08/03/2024] [Indexed: 09/19/2024] Open
Abstract
Numerals, that is, semantic expressions of numbers, enable us to have an exact representation of the amount of things. Visual processing of numerals plays an indispensable role in the recognition and interpretation of numbers. Here, we investigate how visual information from numerals is processed to achieve semantic understanding. We first found that partial occlusion of some digital numerals introduces bistable interpretations. Next, by using the visual adaptation method, we investigated the origin of this bistability in human participants. We showed that adaptation to digital and normal Arabic numerals, as well as homologous shapes, but not Chinese numerals, biases the interpretation of a partially occluded digital numeral. We suggest that this bistable interpretation is driven by intermediate shape processing stages of vision, that is, by features more complex than local visual orientations, but more basic than the abstract concepts of numerals.
Collapse
Affiliation(s)
- Junxiang Luo
- Division of Sensory and Cognitive Brain Mapping, Department of System Neuroscience, National Institute for Physiological Sciences, Okazaki, Japan
| | - Isao Yokoi
- Division of Sensory and Cognitive Brain Mapping, Department of System Neuroscience, National Institute for Physiological Sciences, Okazaki, Japan
- Technical Division, National Institute for Physiological Sciences, Okazaki, Japan
| | - Serge O Dumoulin
- Spinoza Centre for Neuroimaging, Amsterdam, the Netherlands
- Computational Cognitive Neuroscience and Neuroimaging, Netherlands Institute for Neuroscience, Amsterdam, the Netherlands
- Experimental and Applied Psychology, Vrije Universiteit Amsterdam, Amsterdam, the Netherlands
- Experimental Psychology, Utrecht University, Utrecht, the Netherlands
| | - Hiromasa Takemura
- Division of Sensory and Cognitive Brain Mapping, Department of System Neuroscience, National Institute for Physiological Sciences, Okazaki, Japan
- The Graduate Institute for Advanced Studies, SOKENDAI, Hayama, Japan
- Center for Information and Neural Networks (CiNet), Advanced ICT Research Institute, National Institute of Information and Communications Technology, Suita, Japan
| |
Collapse
|
4
|
Li YL, Leu HB, Ting CH, Lim SS, Tsai TY, Wu CH, Chung IF, Liang KH. Predicting long-term time to cardiovascular incidents using myocardial perfusion imaging and deep convolutional neural networks. Sci Rep 2024; 14:3802. [PMID: 38360974 PMCID: PMC10869727 DOI: 10.1038/s41598-024-54139-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2023] [Accepted: 02/08/2024] [Indexed: 02/17/2024] Open
Abstract
Myocardial perfusion imaging (MPI) is a clinical tool which can assess the heart's perfusion status, thereby revealing impairments in patients' cardiac function. Within the MPI modality, the acquired three-dimensional signals are typically represented as a sequence of two-dimensional grayscale tomographic images. Here, we proposed an end-to-end survival training approach for processing gray-scale MPI tomograms to generate a risk score which reflects subsequent time to cardiovascular incidents, including cardiovascular death, non-fatal myocardial infarction, and non-fatal ischemic stroke (collectively known as Major Adverse Cardiovascular Events; MACE) as well as Congestive Heart Failure (CHF). We recruited a total of 1928 patients who had undergone MPI followed by coronary interventions. Among them, 80% (n = 1540) were randomly reserved for the training and 5- fold cross-validation stage, while 20% (n = 388) were set aside for the testing stage. The end-to-end survival training can converge well in generating effective AI models via the fivefold cross-validation approach with 1540 patients. When a candidate model is evaluated using independent images, the model can stratify patients into below-median-risk (n = 194) and above-median-risk (n = 194) groups, the corresponding survival curves of the two groups have significant difference (P < 0.0001). We further stratify the above-median-risk group to the quartile 3 and 4 group (n = 97 each), and the three patient strata, referred to as the high, intermediate and low risk groups respectively, manifest statistically significant difference. Notably, the 5-year cardiovascular incident rate is less than 5% in the low-risk group (accounting for 50% of all patients), while the rate is nearly 40% in the high-risk group (accounting for 25% of all patients). Evaluation of patient subgroups revealed stronger effect size in patients with three blocked arteries (Hazard ratio [HR]: 18.377, 95% CI 3.719-90.801, p < 0.001), followed by those with two blocked vessels at HR 7.484 (95% CI 1.858-30.150; p = 0.005). Regarding stent placement, patients with a single stent displayed a HR of 4.410 (95% CI 1.399-13.904; p = 0.011). Patients with two stents show a HR of 10.699 (95% CI 2.262-50.601; p = 0.003), escalating notably to a HR of 57.446 (95% CI 1.922-1717.207; p = 0.019) for patients with three or more stents, indicating a substantial relationship between the disease severity and the predictive capability of the AI for subsequent cardiovascular inciidents. The success of the MPI AI model in stratifying patients into subgroups with distinct time-to-cardiovascular incidents demonstrated the feasibility of proposed end-to-end survival training approach.
Collapse
Affiliation(s)
- Yi-Lian Li
- Institute of Biomedical Informatics, National Yang Ming Chiao Tung University, Taipei City, Taiwan
| | - Hsin-Bang Leu
- Department of Medicine, Taipei Veterans General Hospital, Taipei City, Taiwan
| | - Chien-Hsin Ting
- Department of Nuclear Medicine, Taipei Veterans General Hospital, Taipei City, Taiwan
| | - Su-Shen Lim
- Department of Medicine, Taipei Veterans General Hospital, Taipei City, Taiwan
| | - Tsung-Ying Tsai
- Department of Medicine, Taipei Veterans General Hospital, Taipei City, Taiwan
| | - Cheng-Hsueh Wu
- Department of Medicine, Taipei Veterans General Hospital, Taipei City, Taiwan
| | - I-Fang Chung
- Institute of Biomedical Informatics, National Yang Ming Chiao Tung University, Taipei City, Taiwan.
| | - Kung-Hao Liang
- Department of Medical Research, Taipei Veterans General Hospital, Taipei City, Taiwan.
| |
Collapse
|
5
|
Wang T, Zhao Y, Jia J. Nonadditive integration of visual information in ensemble processing. iScience 2023; 26:107988. [PMID: 37822498 PMCID: PMC10562869 DOI: 10.1016/j.isci.2023.107988] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2022] [Revised: 09/03/2023] [Accepted: 09/16/2023] [Indexed: 10/13/2023] Open
Abstract
Statistically summarizing information from a stimulus array into an ensemble representation (e.g., the mean) improves the efficiency of visual processing. However, little is known about how the brain computes the ensemble statistics. Here, we propose that ensemble processing is realized by nonadditive integration, rather than linear averaging, of individual items. We used a linear regression model approach to extract EEG responses to three levels of information: the individual items, their local interactions, and their global interaction. The local and global interactions, representing nonadditive integration of individual items, elicited rapid and independent neural responses. Critically, only the neural representation of the global interaction predicted the precision of the ensemble perception at the behavioral level. Furthermore, spreading attention over the global pattern to enhance ensemble processing directly promoted rapid neural representation of the global interaction. Taken together, these findings advocate a global, nonadditive mechanism of ensemble processing in the brain.
Collapse
Affiliation(s)
- Tongyu Wang
- Department of Psychology, Hangzhou Normal University, Hangzhou 311121, Zhejiang, China
| | - Yuqing Zhao
- Department of Psychology, Hangzhou Normal University, Hangzhou 311121, Zhejiang, China
| | - Jianrong Jia
- Department of Psychology, Hangzhou Normal University, Hangzhou 311121, Zhejiang, China
- Zhejiang Philosophy and Social Science Laboratory for Research in Early Development and Childcare, Hangzhou Normal University, Hangzhou 311121, Zhejiang, China
| |
Collapse
|
6
|
Huang J, Zhou Y, Tzvetanov T. Influences of local and global context on local orientation perception. Eur J Neurosci 2023; 58:3503-3517. [PMID: 37547942 DOI: 10.1111/ejn.16105] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2023] [Revised: 07/12/2023] [Accepted: 07/14/2023] [Indexed: 08/08/2023]
Abstract
Visual context modulates perception of local orientation attributes. These spatially very localised effects are considered to correspond to specific excitatory-inhibitory connectivity patterns of early visual areas as V1, creating perceptual tilt repulsion and attraction effects. Here, orientation misperception of small Gabor stimuli was used as a probe of this computational structure by sampling a large spatio-orientation space to reveal expected asymmetries due to the underlying neuronal processing. Surprisingly, the results showed a regular iso-orientation pattern of nearby location effects whose reference point was globally modulated by the spatial structure, without any complex interactions between local positions and orientation. This pattern of results was confirmed by the two perceptual parameters of bias and discrimination ability. Furthermore, the response times to stimulus configuration displayed variations that further provided evidence of how multiple early visual stages affect perception of simple stimuli.
Collapse
Affiliation(s)
- Jinfeng Huang
- Department of Psychology, Hebei Normal University, Shijiazhuang, China
- Hefei National Laboratory for Physical Sciences at Microscale, School of Life Science, University of Science and Technology of China, Hefei, Anhui, China
- State Key Laboratory of Brain and Cognitive Science, Institute of Biophysics, Chinese Academy of Sciences, Beijing, China
| | - Yifeng Zhou
- Hefei National Laboratory for Physical Sciences at Microscale, School of Life Science, University of Science and Technology of China, Hefei, Anhui, China
- State Key Laboratory of Brain and Cognitive Science, Institute of Biophysics, Chinese Academy of Sciences, Beijing, China
| | - Tzvetomir Tzvetanov
- Hefei National Laboratory for Physical Sciences at Microscale, School of Life Science, University of Science and Technology of China, Hefei, Anhui, China
- Anhui Province Key Laboratory of Affective Computing and Advanced Intelligent Machine, School of Computer Science and Information Engineering, Hefei University of Technology, Hefei, China
- NEUROPSYPHY Tzvetomir TZVETANOV EIRL, Horbourg-Wihr, France
- Ciwei Kexue Yanjiu (Shenzhen) Youxian Gongsi , Shenzhen, China
| |
Collapse
|
7
|
Benigno GB, Budzinski RC, Davis ZW, Reynolds JH, Muller L. Waves traveling over a map of visual space can ignite short-term predictions of sensory input. Nat Commun 2023; 14:3409. [PMID: 37296131 PMCID: PMC10256723 DOI: 10.1038/s41467-023-39076-2] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2022] [Accepted: 05/25/2023] [Indexed: 06/12/2023] Open
Abstract
Recent analyses have found waves of neural activity traveling across entire visual cortical areas in awake animals. These traveling waves modulate the excitability of local networks and perceptual sensitivity. The general computational role of these spatiotemporal patterns in the visual system, however, remains unclear. Here, we hypothesize that traveling waves endow the visual system with the capacity to predict complex and naturalistic inputs. We present a network model whose connections can be rapidly and efficiently trained to predict individual natural movies. After training, a few input frames from a movie trigger complex wave patterns that drive accurate predictions many frames into the future solely from the network's connections. When the recurrent connections that drive waves are randomly shuffled, both traveling waves and the ability to predict are eliminated. These results suggest traveling waves may play an essential computational role in the visual system by embedding continuous spatiotemporal structures over spatial maps.
Collapse
Affiliation(s)
- Gabriel B Benigno
- Department of Mathematics, Western University, London, ON, Canada
- Brain and Mind Institute, Western University, London, ON, Canada
- Western Academy for Advanced Research, Western University, London, ON, Canada
| | - Roberto C Budzinski
- Department of Mathematics, Western University, London, ON, Canada
- Brain and Mind Institute, Western University, London, ON, Canada
- Western Academy for Advanced Research, Western University, London, ON, Canada
| | - Zachary W Davis
- The Salk Institute for Biological Studies, La Jolla, CA, USA
| | - John H Reynolds
- The Salk Institute for Biological Studies, La Jolla, CA, USA
| | - Lyle Muller
- Department of Mathematics, Western University, London, ON, Canada.
- Brain and Mind Institute, Western University, London, ON, Canada.
- Western Academy for Advanced Research, Western University, London, ON, Canada.
| |
Collapse
|
8
|
Henderson MM, Tarr MJ, Wehbe L. A Texture Statistics Encoding Model Reveals Hierarchical Feature Selectivity across Human Visual Cortex. J Neurosci 2023; 43:4144-4161. [PMID: 37127366 PMCID: PMC10255092 DOI: 10.1523/jneurosci.1822-22.2023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2022] [Revised: 03/21/2023] [Accepted: 03/26/2023] [Indexed: 05/03/2023] Open
Abstract
Midlevel features, such as contour and texture, provide a computational link between low- and high-level visual representations. Although the nature of midlevel representations in the brain is not fully understood, past work has suggested a texture statistics model, called the P-S model (Portilla and Simoncelli, 2000), is a candidate for predicting neural responses in areas V1-V4 as well as human behavioral data. However, it is not currently known how well this model accounts for the responses of higher visual cortex to natural scene images. To examine this, we constructed single-voxel encoding models based on P-S statistics and fit the models to fMRI data from human subjects (both sexes) from the Natural Scenes Dataset (Allen et al., 2022). We demonstrate that the texture statistics encoding model can predict the held-out responses of individual voxels in early retinotopic areas and higher-level category-selective areas. The ability of the model to reliably predict signal in higher visual cortex suggests that the representation of texture statistics features is widespread throughout the brain. Furthermore, using variance partitioning analyses, we identify which features are most uniquely predictive of brain responses and show that the contributions of higher-order texture features increase from early areas to higher areas on the ventral and lateral surfaces. We also demonstrate that patterns of sensitivity to texture statistics can be used to recover broad organizational axes within visual cortex, including dimensions that capture semantic image content. These results provide a key step forward in characterizing how midlevel feature representations emerge hierarchically across the visual system.SIGNIFICANCE STATEMENT Intermediate visual features, like texture, play an important role in cortical computations and may contribute to tasks like object and scene recognition. Here, we used a texture model proposed in past work to construct encoding models that predict the responses of neural populations in human visual cortex (measured with fMRI) to natural scene stimuli. We show that responses of neural populations at multiple levels of the visual system can be predicted by this model, and that the model is able to reveal an increase in the complexity of feature representations from early retinotopic cortex to higher areas of ventral and lateral visual cortex. These results support the idea that texture-like representations may play a broad underlying role in visual processing.
Collapse
Affiliation(s)
- Margaret M Henderson
- Neuroscience Institute, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213
- Department of Psychology
- Machine Learning Department, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213
| | - Michael J Tarr
- Neuroscience Institute, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213
- Department of Psychology
- Machine Learning Department, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213
| | - Leila Wehbe
- Neuroscience Institute, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213
- Department of Psychology
- Machine Learning Department, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213
| |
Collapse
|
9
|
Schüz S, Gatt A, Zarrieß S. Rethinking symbolic and visual context in Referring Expression Generation. Front Artif Intell 2023; 6:1067125. [PMID: 37026020 PMCID: PMC10072327 DOI: 10.3389/frai.2023.1067125] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2022] [Accepted: 02/28/2023] [Indexed: 03/31/2023] Open
Abstract
Situational context is crucial for linguistic reference to visible objects, since the same description can refer unambiguously to an object in one context but be ambiguous or misleading in others. This also applies to Referring Expression Generation (REG), where the production of identifying descriptions is always dependent on a given context. Research in REG has long represented visual domains throughsymbolicinformation about objects and their properties, to determine identifying sets of target features during content determination. In recent years, research invisual REGhas turned to neural modeling and recasted the REG task as an inherently multimodal problem, looking at more natural settings such as generating descriptions for objects in photographs. Characterizing the precise ways in which context influences generation is challenging in both paradigms, as context is notoriously lacking precise definitions and categorization. In multimodal settings, however, these problems are further exacerbated by the increased complexity and low-level representation of perceptual inputs. The main goal of this article is to provide a systematic review of the types and functions of visual context across various approaches to REG so far and to argue for integrating and extending different perspectives on visual context that currently co-exist in research on REG. By analyzing the ways in which symbolic REG integrates context in rule-based approaches, we derive a set of categories of contextual integration, including the distinction betweenpositiveandnegative semantic forcesexerted by context during reference generation. Using this as a framework, we show that so far existing work in visual REG has considered only some of the ways in which visual context can facilitate end-to-end reference generation. Connecting with preceding research in related areas, as possible directions for future research, we highlight some additional ways in which contextual integration can be incorporated into REG and other multimodal generation tasks.
Collapse
Affiliation(s)
- Simeon Schüz
- Faculty of Linguistics and Literary Studies, Bielefeld University, Bielefeld, Germany
- *Correspondence: Simeon Schüz
| | - Albert Gatt
- Natural Language Processing Group, Department of Information and Computing Sciences, Utrecht University, Utrecht, Netherlands
| | - Sina Zarrieß
- Faculty of Linguistics and Literary Studies, Bielefeld University, Bielefeld, Germany
| |
Collapse
|
10
|
Kirchberger L, Mukherjee S, Self MW, Roelfsema PR. Contextual drive of neuronal responses in mouse V1 in the absence of feedforward input. SCIENCE ADVANCES 2023; 9:eadd2498. [PMID: 36662858 PMCID: PMC9858514 DOI: 10.1126/sciadv.add2498] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/02/2022] [Accepted: 12/19/2022] [Indexed: 06/17/2023]
Abstract
Neurons in the primary visual cortex (V1) respond to stimuli in their receptive field (RF), which is defined by the feedforward input from the retina. However, V1 neurons are also sensitive to contextual information outside their RF, even if the RF itself is unstimulated. Here, we examined the cortical circuits for V1 contextual responses to gray disks superimposed on different backgrounds. Contextual responses began late and were strongest in the feedback-recipient layers of V1. They differed between the three main classes of inhibitory neurons, with particularly strong contextual drive of VIP neurons, indicating a contribution of disinhibitory circuits to contextual drive. Contextual drive was strongest when the gray disk was perceived as figure, occluding its background, rather than a hole. Our results link contextual drive in V1 to perceptual organization and provide previously unknown insight into how recurrent processing shapes the response of sensory neurons to facilitate figure perception.
Collapse
Affiliation(s)
- Lisa Kirchberger
- Department of Vision and Cognition, Netherlands Institute for Neuroscience, Meibergdreef 47, 1105 BA Amsterdam, Netherlands
| | - Sreedeep Mukherjee
- Department of Vision and Cognition, Netherlands Institute for Neuroscience, Meibergdreef 47, 1105 BA Amsterdam, Netherlands
| | - Matthew W. Self
- Department of Vision and Cognition, Netherlands Institute for Neuroscience, Meibergdreef 47, 1105 BA Amsterdam, Netherlands
| | - Pieter R. Roelfsema
- Department of Vision and Cognition, Netherlands Institute for Neuroscience, Meibergdreef 47, 1105 BA Amsterdam, Netherlands
- Department of Integrative Neurophysiology, Center for Neurogenomics and Cognitive Research, VU University, Amsterdam, Netherlands
- Department of Psychiatry, Academic Medical Center, Amsterdam, Netherlands
- Laboratory of Visual Brain Therapy, Sorbonne Université, Institut National de la Santé et de la Recherche Médicale, Centre National de la Recherche Scientifique, Institut de la Vision, Paris F-75012, France
| |
Collapse
|
11
|
Nakamura T, Murakami I. Temporal resolution and temporal extent of orientation repulsion. Vision Res 2022; 200:108104. [PMID: 35878472 DOI: 10.1016/j.visres.2022.108104] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2022] [Revised: 06/24/2022] [Accepted: 07/12/2022] [Indexed: 01/25/2023]
Abstract
A vertical target is perceived as tilted against a slightly tilted inducer surrounding it. To identify the temporal resolution and temporal extent of this phenomenon of orientation repulsion in the same paradigm, we used an alternating pair of inducer stimuli having complementary orientation distributions and quantified repulsion at various alternation frequencies. The duration of each inducer stimulus was inversely proportional to the frequency. When an orthogonal pair of D2 patterns, a type of grating whose luminance modulation in a particular orientation was the second-order partial derivative of an isotropic 2D-Gaussian, was used as the inducer, repulsion occurred when the duration exceeded 20 ms and leveled off at 30 ms and beyond. When a custom-made texture with a narrowband orientation distribution and another texture with a complementary orientation distribution were alternated as the inducer, repulsion gradually increased until the inducer duration reached 200 ms. The gradual increase in repulsion was observed regardless of whether the orientation of the inducer that appeared simultaneously with the target was discernible. These findings reveal that contextual modulation in orientation occurs at a high temporal resolution and continues to a long temporal extent under optimal conditions.
Collapse
Affiliation(s)
- Tomoya Nakamura
- Department of Psychology, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo 113-0033, Japan; Japan Society for the Promotion of Science, 5-3-1 Kojimachi, Chiyoda-ku, Tokyo 102-0083, Japan.
| | - Ikuya Murakami
- Department of Psychology, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo 113-0033, Japan
| |
Collapse
|
12
|
Parmar H, Tahvildar A, Ghasemi E, Jung S, Davis F, Walden E. To download or not to download? Spatial and temporal neural dynamics across the brain regions when deciding to download an app. INTERNATIONAL JOURNAL OF INFORMATION MANAGEMENT 2022. [DOI: 10.1016/j.ijinfomgt.2022.102531] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
13
|
Abstract
Much of forensic practice today involves human decisions about the origins of patterned sensory evidence, such as tool marks and fingerprints discovered at a crime scene. These decisions are made by trained observers who compare the evidential pattern to an exemplar pattern produced by the suspected source of the evidence. The decision consists of a determination as to whether the two patterns are similar enough to have come from the same source. Although forensic pattern comparison disciplines have for decades played a valued role in criminal investigation and prosecution, the extremely high personal and societal costs of failure-the conviction of innocent people-has elicited calls for caution and for the development of better practices. These calls have been heard by the scientific community involved in the study of human information processing, which has begun to offer much-needed perspectives on sensory measurement, discrimination, and classification in a forensic context. Here I draw from a well-established theoretical and empirical approach in sensory science to illustrate the vulnerabilities of contemporary pattern comparison disciplines and to suggest specific strategies for improvement.
Collapse
|
14
|
Facial Expression Recognition: One Attention-Modulated Contextual Spatial Information Network. ENTROPY 2022; 24:e24070882. [PMID: 35885106 PMCID: PMC9324190 DOI: 10.3390/e24070882] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/16/2022] [Revised: 06/18/2022] [Accepted: 06/24/2022] [Indexed: 11/17/2022]
Abstract
Facial expression recognition (FER) in the wild is a challenging task due to some uncontrolled factors such as occlusion, illumination, and pose variation. The current methods perform well in controlled conditions. However, there are still two issues with the in-the-wild FER task: (i) insufficient descriptions of long-range dependency of expression features in the facial information space and (ii) not finely refining subtle inter-classes distinction from multiple expressions in the wild. To overcome the above issues, an end-to-end model for FER, named attention-modulated contextual spatial information network (ACSI-Net), is presented in this paper, with the manner of embedding coordinate attention (CA) modules into a contextual convolutional residual network (CoResNet). Firstly, CoResNet is constituted by arranging contextual convolution (CoConv) blocks of different levels to integrate facial expression features with long-range dependency, which generates a holistic representation of spatial information on facial expression. Then, the CA modules are inserted into different stages of CoResNet, at each of which the subtle information about facial expression acquired from CoConv blocks is first modulated by the corresponding CA module across channels and spatial locations and then flows into the next layer. Finally, to highlight facial regions related to expression, a CA module located at the end of the whole network, which produces attentional masks to multiply by input feature maps, is utilized to focus on salient regions. Different from other models, the ACSI-Net is capable of exploring intrinsic dependencies between features and yielding a discriminative representation for facial expression classification. Extensive experimental results on AffectNet and RAF_DB datasets demonstrate its effectiveness and competitiveness compared to other FER methods.
Collapse
|
15
|
Gepshtein S, Pawar AS, Kwon S, Savel’ev S, Albright TD. Spatially distributed computation in cortical circuits. SCIENCE ADVANCES 2022; 8:eabl5865. [PMID: 35452288 PMCID: PMC9032974 DOI: 10.1126/sciadv.abl5865] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/27/2021] [Accepted: 03/03/2022] [Indexed: 06/14/2023]
Abstract
The traditional view of neural computation in the cerebral cortex holds that sensory neurons are specialized, i.e., selective for certain dimensions of sensory stimuli. This view was challenged by evidence of contextual interactions between stimulus dimensions in which a neuron's response to one dimension strongly depends on other dimensions. Here, we use methods of mathematical modeling, psychophysics, and electrophysiology to address shortcomings of the traditional view. Using a model of a generic cortical circuit, we begin with the simple demonstration that cortical responses are always distributed among neurons, forming characteristic waveforms, which we call neural waves. When stimulated by patterned stimuli, circuit responses arise by interference of neural waves. Results of this process depend on interaction between stimulus dimensions. Comparison of modeled responses with responses of biological vision makes it clear that the framework of neural wave interference provides a useful alternative to the standard concept of neural computation.
Collapse
Affiliation(s)
- Sergei Gepshtein
- Systems Neurobiology Laboratories, Salk Institute for Biological Studies, La Jolla, CA, USA
- Center for Spatial Perception and Concrete Experience, University of Southern California, Los Angeles, CA, USA
| | - Ambarish S. Pawar
- Systems Neurobiology Laboratories, Salk Institute for Biological Studies, La Jolla, CA, USA
| | - Sunwoo Kwon
- Herbert Wertheim School of Optometry & Vision Science, University of California Berkeley, Berkeley, CA, USA
| | - Sergey Savel’ev
- Department of Physics, Loughborough University, Loughborough, UK
| | - Thomas D. Albright
- Systems Neurobiology Laboratories, Salk Institute for Biological Studies, La Jolla, CA, USA
| |
Collapse
|
16
|
Niu X, Huang S, Zhu M, Wang Z, Shi L. Surround Modulation Properties of Tectal Neurons in Pigeons Characterized by Moving and Flashed Stimuli. Animals (Basel) 2022; 12:ani12040475. [PMID: 35203185 PMCID: PMC8868286 DOI: 10.3390/ani12040475] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2021] [Revised: 02/09/2022] [Accepted: 02/10/2022] [Indexed: 11/16/2022] Open
Abstract
Simple Summary Surround modulation is a basic visual attribute of sensory neurons in many species and has been extensively characterized in mammal primary visual cortex, lateral geniculate nucleus, and superior colliculus. Little attention has been paid to birds, which have a highly developed visual system. We undertook a systematic analysis on surround modulation properties of tectal neurons in pigeons (Columba livia). This study complements existing studies on surrounding modulation properties in non-mammalian species and deepens the understanding of mechanisms of figure–background segmentation performed by avians. Abstract Surround modulation has been abundantly studied in several mammalian brain areas, including the primary visual cortex, lateral geniculate nucleus, and superior colliculus (SC), but systematic analysis is lacking in the avian optic tectum (OT, homologous to mammal SC). Here, multi-units were recorded from pigeon (Columba livia) OT, and responses to different sizes of moving, flashed squares, and bars were compared. The statistical results showed that most tectal neurons presented suppressed responses to larger stimuli in both moving and flashed paradigms, and suppression induced by flashed squares was comparable with moving ones when the stimuli center crossed the near classical receptive field (CRF) center, which corresponded to the full surrounding condition. Correspondingly, the suppression grew weaker when the stimuli center moved across the CRF border, equivalent to partially surrounding conditions. Similarly, suppression induced by full surrounding flashed squares was more intense than by partially surrounding flashed bars. These results suggest that inhibitions performed on tectal neurons appear to be full surrounding rather than locally lateral. This study enriches the understanding of surround modulation properties of avian tectum neurons and provides possible hypotheses about the arrangement of inhibitions from other nuclei, both of which are important for clarifying the mechanism of target detection against clutter background performed by avians.
Collapse
Affiliation(s)
- Xiaoke Niu
- Henan Key Laboratory of Brain Science and Brain-Computer Interface Technology, School of Electrical Engineering, Zhengzhou University, Zhengzhou 450001, China; (X.N.); (S.H.); (M.Z.); (Z.W.)
| | - Shuman Huang
- Henan Key Laboratory of Brain Science and Brain-Computer Interface Technology, School of Electrical Engineering, Zhengzhou University, Zhengzhou 450001, China; (X.N.); (S.H.); (M.Z.); (Z.W.)
| | - Minjie Zhu
- Henan Key Laboratory of Brain Science and Brain-Computer Interface Technology, School of Electrical Engineering, Zhengzhou University, Zhengzhou 450001, China; (X.N.); (S.H.); (M.Z.); (Z.W.)
| | - Zhizhong Wang
- Henan Key Laboratory of Brain Science and Brain-Computer Interface Technology, School of Electrical Engineering, Zhengzhou University, Zhengzhou 450001, China; (X.N.); (S.H.); (M.Z.); (Z.W.)
| | - Li Shi
- Henan Key Laboratory of Brain Science and Brain-Computer Interface Technology, School of Electrical Engineering, Zhengzhou University, Zhengzhou 450001, China; (X.N.); (S.H.); (M.Z.); (Z.W.)
- Department of Automation, Tsinghua University, Beijing 100084, China
- Correspondence:
| |
Collapse
|
17
|
|
18
|
Chen T, Ding J, Yue GH, Liu H, Li J, Jiang C. Global-local consistency benefits memory-guided tracking of a moving target. Brain Behav 2022; 12:e2444. [PMID: 34859605 PMCID: PMC8785627 DOI: 10.1002/brb3.2444] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/22/2020] [Revised: 10/31/2021] [Accepted: 11/08/2021] [Indexed: 11/11/2022] Open
Abstract
INTRODUCTION Previous findings have demonstrated that several Gestalt principles do facilitate VSTM performance in change detection tasks. However, few studies have investigated the role of and time-course of global-local consistency in motion perception. METHODS Participants were required to track a moving target surrounded by three different backgrounds: blank, inconsistent, or consistent. Global-local objects were be bound to move together (covariation). During the PMT, participants had to follow the moving target with their eyes and react as fast as possible when the target had just vanished behind the obstruction or would arrive at a predetermined point of interception. Variable error (VE) and constant error (CE) of estimated time-to-contact (TTC) and gain of smooth pursuit eye movements were calculated in various conditions and analyzed qualitatively. RESULTS Experiment 1 established the basic finding that VSTM performance could benefit from global-local consistency. Experiment 2 extended this finding by eye-tracking device. Both in visible phase and in occluded phase, CEs were smaller for the target in a consistent background than for the target in an inconsistent background and for the target in a blank background, with both differences significant (ps < .05). However, the difference in VE among three conditions was not significant. At early stage (100-250 ms), later stage (2750-3000 ms), and termination stage (5750-6000 ms) of smooth pursuit, the velocity gains were higher in the trials with consistent backgrounds than in the trials with inconsistent backgrounds and blank backgrounds (ps < .001). With the exception of 100-250 ms phase, the means did not differ between the inconsistent background and the blank background trials (ps > .1). CONCLUSIONS Global-local consistency could be activated within the first few hundred milliseconds to prioritize the deployment of attention and eye movement to component target. Meanwhile, it also removes ambiguity from motion tracking and TTC estimation under some unpredictable conditions, leading to the consistency advantage during smooth-pursuit termination phase. Global-local consistency may act as an important information source to TTC estimation and oculomotor response in PMT.
Collapse
Affiliation(s)
- Tingting Chen
- School of EducationBeijing Dance AcademyBeijingP.R. China
| | - Jinhong Ding
- Beijing Key Laboratory of Learning and Cognition & School of PsychologyCapital Normal UniversityBeijingP.R. China
| | - Guang H. Yue
- Human Performance and Engineering Research, Kessler FoundationWest OrangeNew Jersey
| | - Haoqiang Liu
- School of EducationShangdong Woman UniversityJinanP.R. China
| | - Jie Li
- Institute of Psychological SciencesHangzhou Normal UniversityHangzhouP.R. China
| | - Changhao Jiang
- Beijing Key Lab of Physical Fitness Evaluation and Tech AnalysisCapital University of Physical Education and SportsBeijingP.R. China
| |
Collapse
|
19
|
Chen H, Naya Y. Reunification of Object and View-Center Background Information in the Primate Medial Temporal Lobe. Front Behav Neurosci 2021; 15:756801. [PMID: 34938164 PMCID: PMC8685287 DOI: 10.3389/fnbeh.2021.756801] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2021] [Accepted: 11/15/2021] [Indexed: 11/13/2022] Open
Abstract
Recent work has shown that the medial temporal lobe (MTL), including the hippocampus (HPC) and its surrounding limbic cortices, plays a role in scene perception in addition to episodic memory. The two basic factors of scene perception are the object (“what”) and location (“where”). In this review, we first summarize the anatomical knowledge related to visual inputs to the MTL and physiological studies examining object-related information processed along the ventral pathway briefly. Thereafter, we discuss the space-related information, the processing of which was unclear, presumably because of its multiple aspects and a lack of appropriate task paradigm in contrast to object-related information. Based on recent electrophysiological studies using non-human primates and the existing literature, we proposed the “reunification theory,” which explains brain mechanisms which construct object-location signals at each gaze. In this reunification theory, the ventral pathway signals a large-scale background image of the retina at each gaze position. This view-center background signal reflects the first person’s perspective and specifies the allocentric location in the environment by similarity matching between images. The spatially invariant object signal and view-center background signal, both of which are derived from the same retinal image, are integrated again (i.e., reunification) along the ventral pathway-MTL stream, particularly in the perirhinal cortex. The conjunctive signal, which represents a particular object at a particular location, may play a role in scene perception in the HPC as a key constituent element of an entire scene.
Collapse
Affiliation(s)
- He Chen
- School of Psychological and Cognitive Sciences, Peking University, Beijing, China
| | - Yuji Naya
- School of Psychological and Cognitive Sciences, Peking University, Beijing, China.,IDG/McGovern Institute for Brain Research, Peking University, Beijing, China.,Beijing Key Laboratory of Behavioral and Mental Health, Faculty of Science, College of Psychology and Cognitive Sciences, Peking University, Beijing, China
| |
Collapse
|
20
|
Abstract
During natural vision, our brains are constantly exposed to complex, but regularly structured environments. Real-world scenes are defined by typical part-whole relationships, where the meaning of the whole scene emerges from configurations of localized information present in individual parts of the scene. Such typical part-whole relationships suggest that information from individual scene parts is not processed independently, but that there are mutual influences between the parts and the whole during scene analysis. Here, we review recent research that used a straightforward, but effective approach to study such mutual influences: By dissecting scenes into multiple arbitrary pieces, these studies provide new insights into how the processing of whole scenes is shaped by their constituent parts and, conversely, how the processing of individual parts is determined by their role within the whole scene. We highlight three facets of this research: First, we discuss studies demonstrating that the spatial configuration of multiple scene parts has a profound impact on the neural processing of the whole scene. Second, we review work showing that cortical responses to individual scene parts are shaped by the context in which these parts typically appear within the environment. Third, we discuss studies demonstrating that missing scene parts are interpolated from the surrounding scene context. Bridging these findings, we argue that efficient scene processing relies on an active use of the scene's part-whole structure, where the visual brain matches scene inputs with internal models of what the world should look like.
Collapse
Affiliation(s)
- Daniel Kaiser
- Justus-Liebig-Universität Gießen, Germany.,Philipps-Universität Marburg, Germany.,University of York, United Kingdom
| | - Radoslaw M Cichy
- Freie Universität Berlin, Germany.,Humboldt-Universität zu Berlin, Germany.,Bernstein Centre for Computational Neuroscience Berlin, Germany
| |
Collapse
|
21
|
Rossi A, Hagenbuchner M, Scarselli F, Tsoi AC. A Study on the effects of recursive convolutional layers in convolutional neural networks. Neurocomputing 2021. [DOI: 10.1016/j.neucom.2021.07.021] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
|
22
|
Niell CM, Scanziani M. How Cortical Circuits Implement Cortical Computations: Mouse Visual Cortex as a Model. Annu Rev Neurosci 2021; 44:517-546. [PMID: 33914591 PMCID: PMC9925090 DOI: 10.1146/annurev-neuro-102320-085825] [Citation(s) in RCA: 40] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The mouse, as a model organism to study the brain, gives us unprecedented experimental access to the mammalian cerebral cortex. By determining the cortex's cellular composition, revealing the interaction between its different components, and systematically perturbing these components, we are obtaining mechanistic insight into some of the most basic properties of cortical function. In this review, we describe recent advances in our understanding of how circuits of cortical neurons implement computations, as revealed by the study of mouse primary visual cortex. Further, we discuss how studying the mouse has broadened our understanding of the range of computations performed by visual cortex. Finally, we address how future approaches will fulfill the promise of the mouse in elucidating fundamental operations of cortex.
Collapse
Affiliation(s)
- Cristopher M. Niell
- Department of Biology and Institute of Neuroscience, University of Oregon, Eugene, Oregon 97403, USA
| | - Massimo Scanziani
- Department of Physiology and Howard Hughes Medical Institute, University of California San Francisco, San Francisco, California 94158, USA;
| |
Collapse
|
23
|
Festa D, Aschner A, Davila A, Kohn A, Coen-Cagli R. Neuronal variability reflects probabilistic inference tuned to natural image statistics. Nat Commun 2021; 12:3635. [PMID: 34131142 PMCID: PMC8206154 DOI: 10.1038/s41467-021-23838-x] [Citation(s) in RCA: 29] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2020] [Accepted: 05/19/2021] [Indexed: 11/23/2022] Open
Abstract
Neuronal activity in sensory cortex fluctuates over time and across repetitions of the same input. This variability is often considered detrimental to neural coding. The theory of neural sampling proposes instead that variability encodes the uncertainty of perceptual inferences. In primary visual cortex (V1), modulation of variability by sensory and non-sensory factors supports this view. However, it is unknown whether V1 variability reflects the statistical structure of visual inputs, as would be required for inferences correctly tuned to the statistics of the natural environment. Here we combine analysis of image statistics and recordings in macaque V1 to show that probabilistic inference tuned to natural image statistics explains the widely observed dependence between spike count variance and mean, and the modulation of V1 activity and variability by spatial context in images. Our results show that the properties of a basic aspect of cortical responses-their variability-can be explained by a probabilistic representation tuned to naturalistic inputs.
Collapse
Affiliation(s)
- Dylan Festa
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, Bronx, NY, USA
| | - Amir Aschner
- Dominick Purpura Department of Neuroscience, Albert Einstein College of Medicine, Bronx, NY, USA
| | - Aida Davila
- Dominick Purpura Department of Neuroscience, Albert Einstein College of Medicine, Bronx, NY, USA
| | - Adam Kohn
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, Bronx, NY, USA
- Dominick Purpura Department of Neuroscience, Albert Einstein College of Medicine, Bronx, NY, USA
- Department of Ophthalmology and Visual Sciences, Albert Einstein College of Medicine, Bronx, NY, USA
| | - Ruben Coen-Cagli
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, Bronx, NY, USA.
- Dominick Purpura Department of Neuroscience, Albert Einstein College of Medicine, Bronx, NY, USA.
| |
Collapse
|
24
|
Hamblin-Frohman Z, Becker SI. The attentional template in high and low similarity search: Optimal tuning or tuning to relations? Cognition 2021; 212:104732. [PMID: 33862440 DOI: 10.1016/j.cognition.2021.104732] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2020] [Revised: 04/08/2021] [Accepted: 04/09/2021] [Indexed: 10/21/2022]
Abstract
The attentional template is often described as the mental representation that drives attentional selection and guidance, for instance, in visual search. Recent research suggests that this template is not a veridical representation of the sought-for target, but instead an altered representation that allows more efficient search. The current paper contrasts two such theories. Firstly, the Optimal Tuning account which posits that the attentional template shifts to an exaggerated target value to maximise the signal-to-noise ratio between similar targets and non-targets. Secondly, the Relational account which states that instead of tuning to feature values, attention is directed to the relative value created by the search context, e.g. all redder items or the reddest item. Both theories are empirically supported, but used different paradigms (perceptual decision tasks vs. visual search), and different attentional measures (probe response accuracy vs. gaze capture). The current design incorporates both paradigms and measures. The results reveal that while Optimal Tuning shifts are observed in probe trials they do not drive early attention or eye- movement behaviour in visual search. Instead, early attention follows the Relational Account, selecting all items with the relative target colour (e.g., redder). This suggests that the masked probe trials used in Optimal Tuning do not probe the attentional template that guides attention. In Experiment 3 we find that optimal tuning shifts correspond in magnitude to purely perceptual shifts created by contrast biases in the visual search arrays. This suggests that the shift in probe responses may in fact be a perceptual artefact rather than a strategic adaptation to optimise the signal-to-noise ratio. These results highlight the distinction between early attentional mechanisms and later, target identification mechanisms. SIGNIFICANCE STATEMENT: Classical theories of attention suggest that attention is guided by a feature-specific target template. In recent designs this has been challenged by an apparent non- veridical tuning of the template in situations where the target stimulus is similar to non-targets. The current studies compare two theories that propose different explanations for non-veridical tuning; the Relational and the Optimal Tuning account. We show that the Relational account describes the mechanism that guides early search behaviour, while the Optimal Tuning account describes perceptual decision-making. Optimal Tuning effects may be due to an artefact that has not been described in visual search before (simultaneous contrast).
Collapse
Affiliation(s)
| | - Stefanie I Becker
- School of Psychology, The University of Queensland, Brisbane, Australia
| |
Collapse
|
25
|
Herrero JL, Thiele A. Effects of muscarinic and nicotinic receptors on contextual modulation in macaque area V1. Sci Rep 2021; 11:8384. [PMID: 33863988 PMCID: PMC8052350 DOI: 10.1038/s41598-021-88044-7] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2020] [Accepted: 04/05/2021] [Indexed: 02/02/2023] Open
Abstract
Context affects the salience and visibility of image elements in visual scenes. Collinear flankers can enhance or decrease the perceptual and neuronal sensitivity to flanked stimuli. These effects are mediated through lateral interactions between neurons in the primary visual cortex (area V1), in conjunction with feedback from higher visual areas. The strength of lateral interactions is affected by cholinergic neuromodulation. Blockade of muscarinic receptors should increase the strength of lateral intracortical interactions, while nicotinic blockade should reduce thalamocortical feed-forward drive. Here we test this proposal through local iontophoretic application of the muscarinic receptor antagonist scopolamine and the nicotinic receptor antagonist mecamylamine, while recording single cells in parafoveal representations in awake fixating macaque V1. Collinear flankers generally reduced neuronal contrast sensitivity. Muscarinic and nicotinic receptor blockade equally reduced neuronal contrast sensitivity. Contrary to our hypothesis, flanker interactions were not systematically affected by either receptor blockade.
Collapse
Affiliation(s)
| | - Alexander Thiele
- Biosciences Institute, Newcastle University, Henry Wellcome Building, Newcastle upon Tyne, NE2 4HH, UK.
| |
Collapse
|
26
|
Federer F, Ta'afua S, Merlin S, Hassanpour MS, Angelucci A. Stream-specific feedback inputs to the primate primary visual cortex. Nat Commun 2021; 12:228. [PMID: 33431862 PMCID: PMC7801467 DOI: 10.1038/s41467-020-20505-5] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2020] [Accepted: 12/03/2020] [Indexed: 11/16/2022] Open
Abstract
The sensory neocortex consists of hierarchically-organized areas reciprocally connected via feedforward and feedback circuits. Feedforward connections shape the receptive field properties of neurons in higher areas within parallel streams specialized in processing specific stimulus attributes. Feedback connections have been implicated in top-down modulations, such as attention, prediction and sensory context. However, their computational role remains unknown, partly because we lack knowledge about rules of feedback connectivity to constrain models of feedback function. For example, it is unknown whether feedback connections maintain stream-specific segregation, or integrate information across parallel streams. Using viral-mediated labeling of feedback connections arising from specific cytochrome-oxidase stripes of macaque visual area V2, here we show that feedback to the primary visual cortex (V1) is organized into parallel streams resembling the reciprocal feedforward pathways. This suggests that functionally-specialized V2 feedback channels modulate V1 responses to specific stimulus attributes, an organizational principle potentially extending to feedback pathways in other sensory systems.
Collapse
Affiliation(s)
- Frederick Federer
- Department of Ophthalmology and Visual Science Moran Eye Institute, University of Utah, 65 Mario Capecchi Drive, Salt Lake City, UT, 84132, USA
| | - Seminare Ta'afua
- Department of Ophthalmology and Visual Science Moran Eye Institute, University of Utah, 65 Mario Capecchi Drive, Salt Lake City, UT, 84132, USA
- Department of Biomedical Engineering, University of Utah, Salt Lake City, UT, 84132, USA
| | - Sam Merlin
- Department of Ophthalmology and Visual Science Moran Eye Institute, University of Utah, 65 Mario Capecchi Drive, Salt Lake City, UT, 84132, USA
- Medical Science, School of Science, Western Sydney University, Campbelltown, Sydney, NSW, 2560, Australia
| | - Mahlega S Hassanpour
- Department of Ophthalmology and Visual Science Moran Eye Institute, University of Utah, 65 Mario Capecchi Drive, Salt Lake City, UT, 84132, USA
| | - Alessandra Angelucci
- Department of Ophthalmology and Visual Science Moran Eye Institute, University of Utah, 65 Mario Capecchi Drive, Salt Lake City, UT, 84132, USA.
| |
Collapse
|
27
|
Chen Z, Whitney D. Inferential affective tracking reveals the remarkable speed of context-based emotion perception. Cognition 2020; 208:104549. [PMID: 33340812 DOI: 10.1016/j.cognition.2020.104549] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2020] [Revised: 12/08/2020] [Accepted: 12/09/2020] [Indexed: 10/22/2022]
Abstract
Understanding the emotional states of others is important for social functioning. Recent studies show that context plays an essential role in emotion recognition. However, it remains unclear whether emotion inference from visual scene context is as efficient as emotion recognition from faces. Here, we measured the speed of context-based emotion perception, using Inferential Affective Tracking (IAT) with naturalistic and dynamic videos. Using cross-correlation analyses, we found that inferring affect based on visual context alone is just as fast as tracking affect with all available information including face and body. We further demonstrated that this approach has high precision and sensitivity to sub-second lags. Our results suggest that emotion recognition from dynamic contextual information might be automatic and immediate. Seemingly complex context-based emotion perception is far more efficient than previously assumed.
Collapse
Affiliation(s)
- Zhimin Chen
- Department of Psychology, University of California, Berkeley, CA 94720, United States of America.
| | - David Whitney
- Department of Psychology, University of California, Berkeley, CA 94720, United States of America; Vision Science Program, University of California, Berkeley, CA 94720, United States of America; Helen Wills Neuroscience Institute, University of California, Berkeley, CA 94720, United States of America
| |
Collapse
|
28
|
Zarei Eskikand P, Kameneva T, Burkitt AN, Grayden DB, Ibbotson MR. Adaptive Surround Modulation of MT Neurons: A Computational Model. Front Neural Circuits 2020; 14:529345. [PMID: 33192335 PMCID: PMC7649322 DOI: 10.3389/fncir.2020.529345] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2020] [Accepted: 09/22/2020] [Indexed: 11/13/2022] Open
Abstract
The classical receptive field (CRF) of a spiking visual neuron is defined as the region in the visual field that can generate spikes when stimulated by a visual stimulus. Many visual neurons also have an extra-classical receptive field (ECRF) that surrounds the CRF. The presence of a stimulus in the ECRF does not generate spikes but rather modulates the response to a stimulus in the neuron's CRF. Neurons in the primate Middle Temporal (MT) area, which is a motion specialist region, can have directionally antagonistic or facilitatory surrounds. The surround's effect switches between directionally antagonistic or facilitatory based on the characteristics of the stimulus, with antagonistic effects when there are directional discontinuities but facilitatory effects when there is directional coherence. Here, we present a computational model of neurons in area MT that replicates this observation and uses computational building blocks that correlate with observed cell types in the visual pathways to explain the mechanism of this modulatory effect. The model shows that the categorization of MT neurons based on the effect of their surround depends on the input stimulus rather than being a property of the neurons. Also, in agreement with neurophysiological findings, the ECRFs of the modeled MT neurons alter their center-surround interactions depending on image contrast.
Collapse
Affiliation(s)
- Parvin Zarei Eskikand
- Department of Biomedical Engineering, The University of Melbourne, Parkville, VIC, Australia
| | - Tatiana Kameneva
- Department of Biomedical Engineering, The University of Melbourne, Parkville, VIC, Australia.,Faculty of Science, Engineering and Technology, Swinburne University of Technology, Hawthorn, VIC, Australia
| | - Anthony N Burkitt
- Department of Biomedical Engineering, The University of Melbourne, Parkville, VIC, Australia
| | - David B Grayden
- Department of Biomedical Engineering, The University of Melbourne, Parkville, VIC, Australia
| | - Michael R Ibbotson
- National Vision Research Institute, Australian College of Optometry, Carlton, VIC, Australia
| |
Collapse
|
29
|
Adaptation to feedback representation of illusory orientation produced from flash grab effect. Nat Commun 2020; 11:3925. [PMID: 32764538 PMCID: PMC7411047 DOI: 10.1038/s41467-020-17786-1] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2019] [Accepted: 07/15/2020] [Indexed: 11/22/2022] Open
Abstract
Adaptation is a ubiquitous property of sensory systems. It is typically considered that neurons adapt to dominant energy in the ambient environment to function optimally. However, perceptual representation of the stimulus, often modulated by feedback signals, sometimes do not correspond to the input state of the stimulus, which tends to be more linked with feedforward signals. Here we investigated the relative contributions to cortical adaptation from feedforward and feedback signals, taking advantage of a visual illusion, the Flash-Grab Effect, to disassociate the feedforward and feedback representation of an adaptor. Results reveal that orientation adaptation is exclusively dependent on the perceived rather than the retinal orientation of the adaptor. Combined fMRI and EEG measurements demonstrate that the perceived orientation of the Flash-Grab Effect is indeed supported by feedback signals in the cortex. These findings highlight the important contribution of feedback signals for cortical neurons to recalibrate their sensitivity. Feedforward-feedback signal interactions are common in the brain during sensory information processing. Here, the authors show that feedback-driven representation of perceived orientation dominates visual adaptation, despite the discrepant feedforward representation of input orientation.
Collapse
|
30
|
Iyer R, Hu B, Mihalas S. Contextual Integration in Cortical and Convolutional Neural Networks. Front Comput Neurosci 2020; 14:31. [PMID: 32390818 PMCID: PMC7192314 DOI: 10.3389/fncom.2020.00031] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2019] [Accepted: 03/24/2020] [Indexed: 11/28/2022] Open
Abstract
It has been suggested that neurons can represent sensory input using probability distributions and neural circuits can perform probabilistic inference. Lateral connections between neurons have been shown to have non-random connectivity and modulate responses to stimuli within the classical receptive field. Large-scale efforts mapping local cortical connectivity describe cell type specific connections from inhibitory neurons and like-to-like connectivity between excitatory neurons. To relate the observed connectivity to computations, we propose a neuronal network model that approximates Bayesian inference of the probability of different features being present at different image locations. We show that the lateral connections between excitatory neurons in a circuit implementing contextual integration in this should depend on correlations between unit activities, minus a global inhibitory drive. The model naturally suggests the need for two types of inhibitory gates (normalization, surround inhibition). First, using natural scene statistics and classical receptive fields corresponding to simple cells parameterized with data from mouse primary visual cortex, we show that the predicted connectivity qualitatively matches with that measured in mouse cortex: neurons with similar orientation tuning have stronger connectivity, and both excitatory and inhibitory connectivity have a modest spatial extent, comparable to that observed in mouse visual cortex. We incorporate lateral connections learned using this model into convolutional neural networks. Features are defined by supervised learning on the task, and the lateral connections provide an unsupervised learning of feature context in multiple layers. Since the lateral connections provide contextual information when the feedforward input is locally corrupted, we show that incorporating such lateral connections into convolutional neural networks makes them more robust to noise and leads to better performance on noisy versions of the MNIST dataset. Decomposing the predicted lateral connectivity matrices into low-rank and sparse components introduces additional cell types into these networks. We explore effects of cell-type specific perturbations on network computation. Our framework can potentially be applied to networks trained on other tasks, with the learned lateral connections aiding computations implemented by feedforward connections when the input is unreliable and demonstrate the potential usefulness of combining supervised and unsupervised learning techniques in real-world vision tasks.
Collapse
Affiliation(s)
- Ramakrishnan Iyer
- Modeling and Theory, Allen Institute for Brain Science, Seattle, WA, United States
| | - Brian Hu
- Modeling and Theory, Allen Institute for Brain Science, Seattle, WA, United States
| | - Stefan Mihalas
- Modeling and Theory, Allen Institute for Brain Science, Seattle, WA, United States
| |
Collapse
|
31
|
Zhang E, Li W. Improved fidelity of orientation perception: a learning effect dissociable from enhanced discriminability. Sci Rep 2020; 10:6572. [PMID: 32313001 PMCID: PMC7171124 DOI: 10.1038/s41598-020-62882-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2019] [Accepted: 03/16/2020] [Indexed: 11/09/2022] Open
Abstract
Visual perception can be influenced by stimulus context, selective attention, and prior experience. Many previous studies have shown complex interactions among these influencing factors, but it remains unclear whether context-induced illusions could be reduced by perceptual training and whether such a change in perceptual fidelity is linked to improved perceptual discriminability. To address this question, we introduced a context-induced tilt illusion into an orientation discrimination training paradigm. This resulted in parallel and long-term improvements in the discriminability and fidelity of orientation perception. The improved discriminability was specific to the task-relevant target stimulus but nonspecific to the task-irrelevant context. By contrast, the improved perceptual fidelity was specific to the task-irrelevant contextual stimulus that induced the illusion, but not specific to the task-relevant target stimulus or task performed on one of its features. These results indicate two dissociable learning effects associated with the same training procedure. Such a dissociation was further supported by the observation that the sizes of the two learning effects were uncorrelated across the subjects. Our findings suggest two parallel learning processes: a task-dependent process giving rise to enhanced discriminability for the task-relevant stimulus attribute, and a context-dependent process leading to improved perceptual fidelity for the attended stimuli.
Collapse
Affiliation(s)
- En Zhang
- State Key Laboratory of Cognitive Neuroscience and Learning & IDG/McGovern Institute for Brain Research, Beijing Normal University, Beijing, 100875, China.
| | - Wu Li
- State Key Laboratory of Cognitive Neuroscience and Learning & IDG/McGovern Institute for Brain Research, Beijing Normal University, Beijing, 100875, China.
| |
Collapse
|
32
|
Huang Y, Wu Q, Wang W, Wang L. Image and Sentence Matching via Semantic Concepts and Order Learning. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2020; 42:636-650. [PMID: 30507493 DOI: 10.1109/tpami.2018.2883466] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
Image and sentence matching has made great progress recently, but it remains challenging due to the existing large visual-semantic discrepancy. This mainly arises from two aspects: 1) images consist of unstructured content which is not semantically abstract as the words in the sentences, so they are not directly comparable, and 2) arranging semantic concepts in different semantic order could lead to quite diverse meanings. The words in the sentences are sequentially arranged in a grammatical manner, while the semantic concepts in the images are usually unorganized. In this work, we propose a semantic concepts and order learning framework for image and sentence matching, which can improve the image representation by first predicting semantic concepts and then organizing them in a correct semantic order. Given an image, we first use a multi-regional multi-label CNN to predict its included semantic concepts in terms of object, property and action. These word-level semantic concepts are directly comparable with the words of noun, adjective and verb in the matched sentence. Then, to organize these concepts and make them express similar meanings as the matched sentence, we use a context-modulated attentional LSTM to learn the semantic order. It regards the predicted semantic concepts and image global scene as context at each timestep, and selectively attends to concept-related image regions by referring to the context in a sequential order. To further enhance the semantic order, we perform additional sentence generation on the image representation, by using the groundtruth order in the matched sentence as supervision. After obtaining the improved image representation, we learn the sentence representation with a conventional LSTM, and then jointly perform image and sentence matching and sentence generation for model learning. Extensive experiments demonstrate the effectiveness of our learned semantic concepts and order, by achieving the state-of-the-art results on two public benchmark datasets.
Collapse
|
33
|
Spitmaan M, Horno O, Chu E, Soltani A. Combinations of low-level and high-level neural processes account for distinct patterns of context-dependent choice. PLoS Comput Biol 2019; 15:e1007427. [PMID: 31609970 PMCID: PMC6812848 DOI: 10.1371/journal.pcbi.1007427] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2019] [Revised: 10/24/2019] [Accepted: 09/20/2019] [Indexed: 11/18/2022] Open
Abstract
Context effects have been explained by either low-level neural adjustments or high-level cognitive processes but not their combination. It is currently unclear how these processes interact to shape individuals’ responses to context. Here, we used a large cohort of human subjects in experiments involving choice between two or three gambles in order to study the dependence of context effects on neural adaptation and individuals’ risk attitudes. Our experiments did not provide any evidence that neural adaptation on long timescales (~100 trials) contributes to context effects. Using post-hoc analyses we identified two groups of subjects with distinct patterns of responses to decoys, both of which depended on individuals’ risk aversion. Subjects in the first group exhibited strong, consistent decoy effects and became more risk averse due to decoy presentation. In contrast, subjects in the second group did not show consistent decoy effects and became more risk seeking. The degree of change in risk aversion due to decoy presentation was positively correlated with the original degrees of risk aversion. To explain these results and reveal underlying neural mechanisms, we developed new models incorporating both low- and high-level processes and used these models to fit individuals’ choice behavior. We found that observed distinct patterns of decoy effects can be explained by a combination of adjustments in neural representations and competitive weighting of reward attributes, both of which depend on risk aversion but in opposite directions. Altogether, our results demonstrate how a combination of low- and high-level processes shapes choice behavior in more naturalistic settings, modulates overall risk preference, and explains distinct behavioral phenotypes. A large body of experimental work has illustrated that the introduction of a new, and often irrelevant, option can influence preference among the existing options, a phenomenon referred to as context or decoy effects. For example, introducing a new option that is worse than one of the two existing options in all its attributes but better than the alternative option in some attributes (and thus should not ever be selected) can increase the preference for the former option. Context effects have been explained by high-level cognitive processes—such as comparisons and competitions between attributes—or low-level adjustments of neural representations. However, it is unclear how these processes interact to shape individuals’ responses to context. Here, we show that both high-level cognitive processes and low-level neural adjustments shift risk preference during choice between multiple risky options but in opposite directions. Moreover, we demonstrate that combinations of these processes can account for distinct patterns of context effects in human subjects.
Collapse
Affiliation(s)
- Mehran Spitmaan
- Department of Psychological and Brain Sciences, Dartmouth College, Hanover, New Hamphire, United States of America
| | - Oihane Horno
- Champalimaud Research, Champalimaud Centre for the Unknown, Lisbon, Portugal
| | - Emily Chu
- Department of Psychological and Brain Sciences, Dartmouth College, Hanover, New Hamphire, United States of America
| | - Alireza Soltani
- Department of Psychological and Brain Sciences, Dartmouth College, Hanover, New Hamphire, United States of America
- * E-mail:
| |
Collapse
|
34
|
Gepshtein S, Snider J. Neuroscience for architecture: The evolving science of perceptual meaning. Proc Natl Acad Sci U S A 2019; 116:14404-14406. [PMID: 31278152 PMCID: PMC6642378 DOI: 10.1073/pnas.1908868116] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Affiliation(s)
- Sergei Gepshtein
- Center for Neurobiology of Vision, Salk Institute for Biological Studies, La Jolla, CA 92037;
- Center for Spatial Perception and Concrete Experience, School of Cinematic Arts, University of Southern California, Los Angeles, CA 90089
| | - Joseph Snider
- Institute for Neural Computation, University of California San Diego, La Jolla, CA 92093
| |
Collapse
|
35
|
Neural mechanisms of contextual modulation in the retinal direction selective circuit. Nat Commun 2019; 10:2431. [PMID: 31160566 PMCID: PMC6547848 DOI: 10.1038/s41467-019-10268-z] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2018] [Accepted: 04/26/2019] [Indexed: 01/07/2023] Open
Abstract
Contextual modulation of neuronal responses by surrounding environments is a fundamental attribute of sensory processing. In the mammalian retina, responses of On–Off direction selective ganglion cells (DSGCs) are modulated by motion contexts. However, the underlying mechanisms are unknown. Here, we show that posterior-preferring DSGCs (pDSGCs) are sensitive to discontinuities of moving contours owing to contextually modulated cholinergic excitation from starburst amacrine cells (SACs). Using a combination of synapse-specific genetic manipulations, patch clamp electrophysiology and connectomic analysis, we identified distinct circuit motifs upstream of On and Off SACs that are required for the contextual modulation of pDSGC activity for bright and dark contrasts. Furthermore, our results reveal a class of wide-field amacrine cells (WACs) with straight, unbranching dendrites that function as “continuity detectors” of moving contours. Therefore, divergent circuit motifs in the On and Off pathways extend the information encoding of On-Off DSGCs beyond their direction selectivity during complex stimuli. The mechanisms of contextual modulation in direction selective ganglion cells in the retina remain unclear. Here, the authors find that that On-Off direction-selective ganglion cells are differentially sensitive to discontinuities of dark and bright moving edges in the visual environment and, using synapse-specific genetic manipulations with functional measurements, reveal the microcircuits underlying this contextual sensitivity.
Collapse
|
36
|
Pawar AS, Gepshtein S, Savel'ev S, Albright TD. Mechanisms of Spatiotemporal Selectivity in Cortical Area MT. Neuron 2019; 101:514-527.e2. [PMID: 30606614 PMCID: PMC6398985 DOI: 10.1016/j.neuron.2018.12.002] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2017] [Revised: 06/28/2018] [Accepted: 12/03/2018] [Indexed: 11/28/2022]
Abstract
Cortical sensory neurons are characterized by selectivity to stimulation. This selectivity was originally viewed as a part of the fundamental "receptive field" characteristic of neurons. This view was later challenged by evidence that receptive fields are modulated by stimuli outside of the classical receptive field. Here, we show that even this modified view of selectivity needs revision. We measured spatial frequency selectivity of neurons in cortical area MT of alert monkeys and found that their selectivity strongly depends on luminance contrast, shifting to higher spatial frequencies as contrast increases. The changes of preferred spatial frequency are large at low temporal frequency, and they decrease monotonically as temporal frequency increases. That is, even interactions among basic stimulus dimensions of luminance contrast, spatial frequency, and temporal frequency strongly influence neuronal selectivity. This dynamic nature of neuronal selectivity is inconsistent with the notion of stimulus preference as a stable characteristic of cortical neurons.
Collapse
Affiliation(s)
- Ambarish S Pawar
- Systems Neurobiology Laboratories, The Salk Institute for Biological Studies, La Jolla, CA 92037, USA.
| | - Sergei Gepshtein
- Systems Neurobiology Laboratories, The Salk Institute for Biological Studies, La Jolla, CA 92037, USA; Center for Spatial Perception and Concrete Experience, School of Cinematic Arts, University of Southern California, Los Angeles, CA 90089, USA
| | - Sergey Savel'ev
- Department of Physics, Loughborough University, Loughborough LE11 3TU, UK
| | - Thomas D Albright
- Systems Neurobiology Laboratories, The Salk Institute for Biological Studies, La Jolla, CA 92037, USA
| |
Collapse
|
37
|
Abstract
It is well known that the human visual system is sensitive to co-circularity among oriented edges, which are ubiquitous features of object contours. Here, we report a novel aftereffect in which the appearance of a texture is dramatically altered after adaptation to a texture composed of elements with co-circular structure. Following prolonged viewing of a texture made of pairs of adjacent Gabor elements arranged to form obtuse angle co-circular pairs, i.e. shallow curves, a subsequently viewed random texture appears to be composed of acute angle, i.e. near-parallel pairs. Conversely, adaptation to a texture made of parallel pairs causes a random texture to appear to be composed of shallow curves. This suggests that mechanisms sensitive to co-circularity are organized in an opponent manner, with one pole sensitive to shallow curves the other parallel shapes. This notion was tested further in a non-adaptation experiment in which co-circular and non-co-circular Gabor pairs were mixed within a single texture. Results revealed summation between pairs that fell on one side of the opponent continuum, and cancellation between pairs that fell on opposite sides of the continuum. Taken together these results support opponent interactions between mechanisms sensitive to pairwise co-circular texture features.
Collapse
|
38
|
Ceccarelli F, La Scaleia B, Russo M, Cesqui B, Gravano S, Mezzetti M, Moscatelli A, d'Avella A, Lacquaniti F, Zago M. Rolling Motion Along an Incline: Visual Sensitivity to the Relation Between Acceleration and Slope. Front Neurosci 2018; 12:406. [PMID: 29988401 PMCID: PMC6023988 DOI: 10.3389/fnins.2018.00406] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2018] [Accepted: 05/25/2018] [Indexed: 11/13/2022] Open
Abstract
People easily intercept a ball rolling down an incline, despite its acceleration varies with the slope in a complex manner. Apparently, however, they are poor at detecting anomalies when asked to judge artificial animations of descending motion. Since the perceptual deficiencies have been reported in studies involving a limited visual context, here we tested the hypothesis that judgments of naturalness of rolling motion are consistent with physics when the visual scene incorporates sufficient cues about environmental reference and metric scale, roughly comparable to those present when intercepting a ball. Participants viewed a sphere rolling down an incline located in the median sagittal plane, presented in 3D wide-field virtual reality. In different experiments, either the slope of the plane or the sphere acceleration were changed in arbitrary combinations, resulting in a kinematics that was either consistent or inconsistent with physics. In Experiment 1 (slope adjustment), participants were asked to modify the slope angle until the resulting motion looked natural for a given ball acceleration. In Experiment 2 (acceleration adjustment), instead, they were asked to modify the acceleration until the motion on a given slope looked natural. No feedback about performance was provided. For both experiments, we found that participants were rather accurate at finding the match between slope angle and ball acceleration congruent with physics, but there was a systematic effect of the initial conditions: accuracy was higher when the participants started the exploration from the combination of slope and acceleration corresponding to the congruent conditions than when they started far away from the congruent conditions. In Experiment 3, participants modified the slope angle based on an adaptive staircase, but the target never coincided with the starting condition. Here we found a generally accurate performance, irrespective of the target slope. We suggest that, provided the visual scene includes sufficient cues about environmental reference and metric scale, joint processing of slope and acceleration may facilitate the detection of natural motion. Perception of rolling motion may rely on the kind of approximate, probabilistic simulations of Newtonian mechanics that have previously been called into play to explain complex inferences in rich visual scenes.
Collapse
Affiliation(s)
| | - Barbara La Scaleia
- Laboratory of Neuromotor Physiology, IRCCS Santa Lucia Foundation, Rome, Italy
| | - Marta Russo
- Centre of Space Bio-Medicine, University of Rome Tor Vergata, Rome, Italy
| | - Benedetta Cesqui
- Centre of Space Bio-Medicine, University of Rome Tor Vergata, Rome, Italy.,Laboratory of Neuromotor Physiology, IRCCS Santa Lucia Foundation, Rome, Italy
| | - Silvio Gravano
- Centre of Space Bio-Medicine, University of Rome Tor Vergata, Rome, Italy
| | - Maura Mezzetti
- Department of Economics and Finance, University of Rome Tor Vergata, Rome, Italy
| | - Alessandro Moscatelli
- Centre of Space Bio-Medicine, University of Rome Tor Vergata, Rome, Italy.,Department of Systems Medicine, University of Rome Tor Vergata, Rome, Italy
| | - Andrea d'Avella
- Laboratory of Neuromotor Physiology, IRCCS Santa Lucia Foundation, Rome, Italy.,Department of Biomedical and Dental Sciences and Morphofunctional Imaging, University of Messina, Messina, Italy
| | - Francesco Lacquaniti
- Centre of Space Bio-Medicine, University of Rome Tor Vergata, Rome, Italy.,Laboratory of Neuromotor Physiology, IRCCS Santa Lucia Foundation, Rome, Italy.,Department of Systems Medicine, University of Rome Tor Vergata, Rome, Italy
| | - Myrka Zago
- Laboratory of Neuromotor Physiology, IRCCS Santa Lucia Foundation, Rome, Italy
| |
Collapse
|
39
|
Khan AG, Hofer SB. Contextual signals in visual cortex. Curr Opin Neurobiol 2018; 52:131-138. [PMID: 29883940 DOI: 10.1016/j.conb.2018.05.003] [Citation(s) in RCA: 46] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2018] [Accepted: 05/11/2018] [Indexed: 11/15/2022]
Abstract
Vision is an active process. What we perceive strongly depends on our actions, intentions and expectations. During visual processing, these internal signals therefore need to be integrated with the visual information from the retina. The mechanisms of how this is achieved by the visual system are still poorly understood. Advances in recording and manipulating neuronal activity in specific cell types and axonal projections together with tools for circuit tracing are beginning to shed light on the neuronal circuit mechanisms of how internal, contextual signals shape sensory representations. Here we review recent work, primarily in mice, that has advanced our understanding of these processes, focusing on contextual signals related to locomotion, behavioural relevance and predictions.
Collapse
Affiliation(s)
- Adil G Khan
- Centre for Developmental Neurobiology, Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, UK
| | - Sonja B Hofer
- Sainsbury Wellcome Centre for Neural Circuits and Behaviour, University College London, London, UK.
| |
Collapse
|
40
|
Wang H, Wang Z, Zhou Y, Tzvetanov T. Near- and Far-Surround Suppression in Human Motion Discrimination. Front Neurosci 2018; 12:206. [PMID: 29651233 PMCID: PMC5884933 DOI: 10.3389/fnins.2018.00206] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2017] [Accepted: 03/15/2018] [Indexed: 11/27/2022] Open
Abstract
The spatial context has strong effects on visual processing. Psychophysics and modeling studies have provided evidence that the surround context can systematically modulate the perception of center stimuli. For motion direction, these center-surround interactions are considered to come from spatio-directional interactions between direction of motion tuned neurons, which are attributed to the middle temporal (MT) area. Here, we investigated through psychophysics experiments on human subjects changes with spatial separation in center-surround inhibition and motion direction interactions. Center-surround motion repulsion effects were measured under near-and far-surround conditions. Using a simple physiological model of the repulsion effect we extracted theoretical population parameters of surround inhibition strength and tuning widths with spatial distance. All 11 subjects showed clear motion repulsion effects under the near-surround condition, while only 10 subjects showed clear motion repulsion effects under the far-surround condition. The model predicted human performance well. Surround inhibition under the near-surround condition was significantly stronger than that under the far-surround condition, and the tuning widths were smaller under the near-surround condition. These results demonstrate that spatial separation can both modulate the surround inhibition strength and surround to center tuning width.
Collapse
Affiliation(s)
- Huan Wang
- Hefei National Laboratory for Physical Sciences at Microscale, School of Life Science, University of Science and Technology of China, Hefei, China
| | | | - Yifeng Zhou
- Hefei National Laboratory for Physical Sciences at Microscale, School of Life Science, University of Science and Technology of China, Hefei, China.,State Key Laboratory of Brain and Cognitive Science, Institute of Biophysics, Chinese Academy of Sciences, Beijing, China
| | - Tzvetomir Tzvetanov
- Hefei National Laboratory for Physical Sciences at Microscale, School of Life Science, University of Science and Technology of China, Hefei, China.,Anhui Province Key Laboratory of Affective Computing and Advanced Intelligent Machine, and School of Computer and Information, Hefei University of Technology, Hefei, China
| |
Collapse
|
41
|
Muller L, Chavane F, Reynolds J, Sejnowski TJ. Cortical travelling waves: mechanisms and computational principles. Nat Rev Neurosci 2018; 19:255-268. [PMID: 29563572 DOI: 10.1038/nrn.2018.20] [Citation(s) in RCA: 241] [Impact Index Per Article: 40.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Multichannel recording technologies have revealed travelling waves of neural activity in multiple sensory, motor and cognitive systems. These waves can be spontaneously generated by recurrent circuits or evoked by external stimuli. They travel along brain networks at multiple scales, transiently modulating spiking and excitability as they pass. Here, we review recent experimental findings that have found evidence for travelling waves at single-area (mesoscopic) and whole-brain (macroscopic) scales. We place these findings in the context of the current theoretical understanding of wave generation and propagation in recurrent networks. During the large low-frequency rhythms of sleep or the relatively desynchronized state of the awake cortex, travelling waves may serve a variety of functions, from long-term memory consolidation to processing of dynamic visual stimuli. We explore new avenues for experimental and computational understanding of the role of spatiotemporal activity patterns in the cortex.
Collapse
Affiliation(s)
- Lyle Muller
- Salk Institute for Biological Studies, La Jolla, CA, USA
| | - Frédéric Chavane
- Institut de Neurosciences de la Timone (INT), Centre National de la Recherche Scientifique (CNRS) and Aix-Marseille Université, Marseille, France
| | - John Reynolds
- Salk Institute for Biological Studies, La Jolla, CA, USA
| | - Terrence J Sejnowski
- Salk Institute for Biological Studies, La Jolla, CA, USA.,Division of Biological Sciences, University of California, La Jolla, CA, USA
| |
Collapse
|
42
|
Akbarian A, Niknam K, Parsa M, Clark K, Noudoost B, Nategh N. Developing a Nonstationary Computational Framework With Application to Modeling Dynamic Modulations in Neural Spiking Responses. IEEE Trans Biomed Eng 2018; 65:241-253. [PMID: 29035203 PMCID: PMC5796416 DOI: 10.1109/tbme.2017.2762687] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
OBJECTIVE This paper aims to develop a computational model that incorporates the functional effects of modulatory covariates (such as context, task, or behavior), which dynamically alter the relationship between the stimulus and the neural response. METHODS We develop a general computational approach along with an efficient estimation procedure in the widely used generalized linear model (GLM) framework to characterize such nonstationary dynamics in spiking response and spatiotemporal characteristics of a neuron at the level of individual trials. The model employs a set of modulatory components, which nonlinearly interact with other stimulus-related signals to reproduce such nonstationary effects. RESULTS The model is tested for its ability to predict the responses of neurons in the middle temporal cortex of macaque monkeys during an eye movement task. The fitted model proves successful in capturing the fast temporal modulations in the response, reproducing the spike response temporal statistics, and accurately accounting for the neurons' dynamic spatiotemporal sensitivities, during eye movements. CONCLUSION The nonstationary GLM framework developed in this study can be used in cases where a time-varying behavioral or cognitive component makes GLM-based models insufficient to describe the dependencies of neural responses on the stimulus-related covariates. SIGNIFICANCE In addition to being quite powerful in encoding time-varying response modulations, this general framework also enables a readout of the neural code while dissociating the influence of other nonstimulus covariates. This framework will advance our ability to understand sensory processing in higher brain areas when modulated by several behavioral or cognitive variables.
Collapse
|
43
|
Tieri G, Morone G, Paolucci S, Iosa M. Virtual reality in cognitive and motor rehabilitation: facts, fiction and fallacies. Expert Rev Med Devices 2018; 15:107-117. [DOI: 10.1080/17434440.2018.1425613] [Citation(s) in RCA: 104] [Impact Index Per Article: 17.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023]
Affiliation(s)
- Gaetano Tieri
- SCNLab, Fondazione Santa Lucia IRCCS, Rome, Italy
- University of Rome Unitelma Sapienza, Italy
| | - Giovanni Morone
- Clinical Laboratory of Experimental Neurorehabilitation, Fondazione Santa Lucia IRCCS, Rome, Italy
| | - Stefano Paolucci
- Clinical Laboratory of Experimental Neurorehabilitation, Fondazione Santa Lucia IRCCS, Rome, Italy
| | - Marco Iosa
- Clinical Laboratory of Experimental Neurorehabilitation, Fondazione Santa Lucia IRCCS, Rome, Italy
| |
Collapse
|
44
|
Rasmussen R, Yonehara K. Circuit Mechanisms Governing Local vs. Global Motion Processing in Mouse Visual Cortex. Front Neural Circuits 2017; 11:109. [PMID: 29311845 PMCID: PMC5743699 DOI: 10.3389/fncir.2017.00109] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2017] [Accepted: 12/14/2017] [Indexed: 11/21/2022] Open
Abstract
A withstanding question in neuroscience is how neural circuits encode representations and perceptions of the external world. A particularly well-defined visual computation is the representation of global object motion by pattern direction-selective (PDS) cells from convergence of motion of local components represented by component direction-selective (CDS) cells. However, how PDS and CDS cells develop their distinct response properties is still unresolved. The visual cortex of the mouse is an attractive model for experimentally solving this issue due to the large molecular and genetic toolbox available. Although mouse visual cortex lacks the highly ordered orientation columns of primates, it is organized in functional sub-networks and contains striate- and extrastriate areas like its primate counterparts. In this Perspective article, we provide an overview of the experimental and theoretical literature on global motion processing based on works in primates and mice. Lastly, we propose what types of experiments could illuminate what circuit mechanisms are governing cortical global visual motion processing. We propose that PDS cells in mouse visual cortex appear as the perfect arena for delineating and solving how individual sensory features extracted by neural circuits in peripheral brain areas are integrated to build our rich cohesive sensory experiences.
Collapse
Affiliation(s)
- Rune Rasmussen
- The Danish Research Institute of Translational Neuroscience-DANDRITE, Nordic EMBL Partnership for Molecular Medicine, Department of Biomedicine, Aarhus University, Aarhus, Denmark
| | - Keisuke Yonehara
- The Danish Research Institute of Translational Neuroscience-DANDRITE, Nordic EMBL Partnership for Molecular Medicine, Department of Biomedicine, Aarhus University, Aarhus, Denmark
| |
Collapse
|
45
|
|
46
|
Baruch O, Kimchi R, Goldsmith M. Attention to distinguishing features in object recognition: An interactive-iterative framework. Cognition 2017; 170:228-244. [PMID: 29078095 DOI: 10.1016/j.cognition.2017.10.007] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2016] [Revised: 10/03/2017] [Accepted: 10/06/2017] [Indexed: 11/17/2022]
Abstract
This article advances a framework that casts object recognition as a process of discrimination between alternative object identities, in which top-down and bottom-up processes interact-iteratively when necessary-with attention to distinguishing features playing a critical role. In two experiments, observers discriminated between different types of artificial fish. In parallel, a secondary, variable-SOA visual-probe detection task was used to examine the dynamics of visual attention. In Experiment 1, the fish varied in three distinguishing features: one indicating the general category (saltwater, freshwater), and one of the two other features indicating the specific type of fish within each category. As predicted, in the course of recognizing each fish, attention was allocated iteratively to the distinguishing features in an optimal manner: first to the general category feature, and then, based on its value, to the second feature that identified the specific fish. In Experiment 2, two types of fish could be discriminated on the basis of either of two distinguishing features, one more visually discriminable than the other. On some of the trials, one of the two alternative distinguishing features was occluded. As predicted, in the course of recognizing each fish, attention was directed initially to the more discriminable distinguishing feature, but when this feature was occluded, it was then redirected to the less discriminable feature. The implications of these findings, and the interactive-iterative framework they support, are discussed with regard to several fundamental issues having a long history in the literatures on object recognition, object categorization, and visual perception in general.
Collapse
|
47
|
Towards building a more complex view of the lateral geniculate nucleus: Recent advances in understanding its role. Prog Neurobiol 2017. [DOI: 10.1016/j.pneurobio.2017.06.002] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
|
48
|
Gothard KM, Mosher CP, Zimmerman PE, Putnam PT, Morrow JK, Fuglevand AJ. New perspectives on the neurophysiology of primate amygdala emerging from the study of naturalistic social behaviors. WILEY INTERDISCIPLINARY REVIEWS. COGNITIVE SCIENCE 2017; 9. [PMID: 28800678 DOI: 10.1002/wcs.1449] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/14/2017] [Revised: 06/03/2017] [Accepted: 06/05/2017] [Indexed: 11/07/2022]
Abstract
A major challenge of primate neurophysiology, particularly in the domain of social neuroscience, is to adopt more natural behaviors without compromising the ability to relate patterns of neural activity to specific actions or sensory inputs. Traditional approaches have identified neural activity patterns in the amygdala in response to simplified versions of social stimuli such as static images of faces. As a departure from this reduced approach, single images of faces were replaced with arrays of images or videos of conspecifics. These stimuli elicited more natural behaviors and new types of neural responses: (1) attention-gated responses to faces, (2) selective responses to eye contact, and (3) selective responses to touch and somatosensory feedback during the production of facial expressions. An additional advance toward more natural social behaviors in the laboratory was the implementation of dyadic social interactions. Under these conditions, neurons encoded similarly rewards that monkeys delivered to self and to their social partner. These findings reinforce the value of bringing natural, ethologically valid, behavioral tasks under neurophysiological scrutiny. WIREs Cogn Sci 2018, 9:e1449. doi: 10.1002/wcs.1449 This article is categorized under: Psychology > Emotion and Motivation Neuroscience > Cognition Neuroscience > Physiology.
Collapse
Affiliation(s)
- Katalin M Gothard
- Department of Physiology, College of Medicine, University of Arizona, Tucson, AZ, USA
| | - Clayton P Mosher
- Department of Physiology, College of Medicine, University of Arizona, Tucson, AZ, USA
| | - Prisca E Zimmerman
- Department of Physiology, College of Medicine, University of Arizona, Tucson, AZ, USA
| | - Philip T Putnam
- Department of Physiology, College of Medicine, University of Arizona, Tucson, AZ, USA
| | - Jeremiah K Morrow
- Department of Physiology, College of Medicine, University of Arizona, Tucson, AZ, USA
| | - Andrew J Fuglevand
- Department of Physiology, College of Medicine, University of Arizona, Tucson, AZ, USA
| |
Collapse
|
49
|
Cuthill IC, Allen WL, Arbuckle K, Caspers B, Chaplin G, Hauber ME, Hill GE, Jablonski NG, Jiggins CD, Kelber A, Mappes J, Marshall J, Merrill R, Osorio D, Prum R, Roberts NW, Roulin A, Rowland HM, Sherratt TN, Skelhorn J, Speed MP, Stevens M, Stoddard MC, Stuart-Fox D, Talas L, Tibbetts E, Caro T. The biology of color. Science 2017; 357:357/6350/eaan0221. [DOI: 10.1126/science.aan0221] [Citation(s) in RCA: 353] [Impact Index Per Article: 50.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023]
|
50
|
Jancke D. Catching the voltage gradient-asymmetric boost of cortical spread generates motion signals across visual cortex: a brief review with special thanks to Amiram Grinvald. NEUROPHOTONICS 2017; 4:031206. [PMID: 28217713 PMCID: PMC5301132 DOI: 10.1117/1.nph.4.3.031206] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/18/2016] [Accepted: 01/12/2017] [Indexed: 06/06/2023]
Abstract
Wide-field voltage imaging is unique in its capability to capture snapshots of activity-across the full gradient of average changes in membrane potentials from subthreshold to suprathreshold levels-of hundreds of thousands of superficial cortical neurons that are simultaneously active. Here, I highlight two examples where voltage-sensitive dye imaging (VSDI) was exploited to track gradual space-time changes of activity within milliseconds across several millimeters of cortex at submillimeter resolution: the line-motion condition, measured in Amiram Grinvald's Laboratory more than 10 years ago and-coming full circle running VSDI in my laboratory-another motion-inducing condition, in which two neighboring stimuli counterchange luminance simultaneously. In both examples, cortical spread is asymmetrically boosted, creating suprathreshold activity drawn out over primary visual cortex. These rapidly propagating waves may integrate brain signals that encode motion independent of direction-selective circuits.
Collapse
Affiliation(s)
- Dirk Jancke
- Ruhr University Bochum, Optical Imaging Group, Institut für Neuroinformatik, Bochum, Germany
| |
Collapse
|