1
|
Yashiro R, Sawayama M, Amano K. Decoding time-resolved neural representations of orientation ensemble perception. Front Neurosci 2024; 18:1387393. [PMID: 39148524 PMCID: PMC11325722 DOI: 10.3389/fnins.2024.1387393] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2024] [Accepted: 07/15/2024] [Indexed: 08/17/2024] Open
Abstract
The visual system can compute summary statistics of several visual elements at a glance. Numerous studies have shown that an ensemble of different visual features can be perceived over 50-200 ms; however, the time point at which the visual system forms an accurate ensemble representation associated with an individual's perception remains unclear. This is mainly because most previous studies have not fully addressed time-resolved neural representations that occur during ensemble perception, particularly lacking quantification of the representational strength of ensembles and their correlation with behavior. Here, we conducted orientation ensemble discrimination tasks and electroencephalogram (EEG) recordings to decode orientation representations over time while human observers discriminated an average of multiple orientations. We modeled EEG signals as a linear sum of hypothetical orientation channel responses and inverted this model to quantify the representational strength of orientation ensemble. Our analysis using this inverted encoding model revealed stronger representations of the average orientation over 400-700 ms. We also correlated the orientation representation estimated from EEG signals with the perceived average orientation reported in the ensemble discrimination task with adjustment methods. We found that the estimated orientation at approximately 600-700 ms significantly correlated with the individual differences in perceived average orientation. These results suggest that although ensembles can be quickly and roughly computed, the visual system may gradually compute an orientation ensemble over several hundred milliseconds to achieve a more accurate ensemble representation.
Collapse
Affiliation(s)
- Ryuto Yashiro
- Graduate School of Information Science and Technology, The University of Tokyo, Tokyo, Japan
| | - Masataka Sawayama
- Graduate School of Information Science and Technology, The University of Tokyo, Tokyo, Japan
| | - Kaoru Amano
- Graduate School of Information Science and Technology, The University of Tokyo, Tokyo, Japan
| |
Collapse
|
2
|
Knox K, Pratt J, Cant JS. Examining the role of action-driven attention in ensemble processing. J Vis 2024; 24:5. [PMID: 38842835 PMCID: PMC11160948 DOI: 10.1167/jov.24.6.5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2023] [Accepted: 04/21/2024] [Indexed: 06/07/2024] Open
Abstract
Ensemble processing allows the visual system to condense visual information into useful summary statistics (e.g., average size), thereby overcoming capacity limitations to visual working memory and attention. To examine the role of attention in ensemble processing, we conducted three experiments using a novel paradigm that merged the action effect (a manipulation of attention) and ensemble processing. Participants were instructed to make a simple action if the feature of a cue word corresponded to a subsequent shape. Immediately after, they were shown an ensemble display of eight ovals of varying sizes and were asked to report either the average size of all ovals or the size of a single oval from the set. In Experiments 1 and 2, participants were cued with a task-relevant feature, and in Experiment 3, participants were cued with a task-irrelevant feature. Overall, the task-relevant cues that elicited an action influenced reports of average size in the ensemble phase more than the cues that were passively viewed, whereas task-irrelevant cues did not bias the reports of average size. The results of this study suggest that attention influences ensemble processing only when it is directed toward a task-relevant feature.
Collapse
Affiliation(s)
- Kristina Knox
- Department of Psychology, University of Toronto, Toronto, Canada
- Department of Psychology, University of Toronto Scarborough, Scarborough, Canada
| | - Jay Pratt
- Department of Psychology, University of Toronto, Toronto, Canada
| | - Jonathan S Cant
- Department of Psychology, University of Toronto Scarborough, Scarborough, Canada
| |
Collapse
|
3
|
Khvostov VA, Iakovlev AU, Wolfe JM, Utochkin IS. What is the basis of ensemble subset selection? Atten Percept Psychophys 2024; 86:776-798. [PMID: 38351233 DOI: 10.3758/s13414-024-02850-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/23/2024] [Indexed: 05/03/2024]
Abstract
The visual system can rapidly calculate the ensemble statistics of a set of objects; for example, people can easily estimate an average size of apples on a tree. To accomplish this, it is not always useful to summarize all the visual information. If there are various types of objects, the visual system should select a relevant subset: only apples, not leaves and branches. Here, we ask what kind of visual information makes a "good" ensemble that can be selectively attended to provide an accurate summary estimate. We tested three candidate representations: basic features, preattentive object files, and full-fledged bound objects. In four experiments, we presented a target and several distractors' sets of differently colored objects. We found that conditions where a target ensemble had at least one unique color (basic feature) provided ensemble averaging performance comparable to the baseline displays without distractors. When the target subset was defined as a conjunction of two colors or color-shape partly shared with distractors (so that they could be differentiated only as preattentive object files), subset averaging was also possible but less accurate than in the baseline and feature conditions. Finally, performance was very poor when the target subset was defined by an exact feature relationship, such as in the spatial conjunction of two colors (spatially bound object). Overall, these results suggest that distinguishable features and, to a lesser degree, preattentive object files can serve as the representational basis of ensemble selection, while bound objects cannot.
Collapse
Affiliation(s)
- Vladislav A Khvostov
- Faculty of Psychology, School of Health Sciences, University of Iceland, Reykjavik, Iceland.
- HSE University, Moscow, Russia.
| | - Aleksei U Iakovlev
- Faculty of Psychology, School of Health Sciences, University of Iceland, Reykjavik, Iceland
| | - Jeremy M Wolfe
- Visual Attention Laboratory, Brigham and Women's Hospital, Boston, MA, USA
- Harvard Medical School, Boston, MA, USA
| | - Igor S Utochkin
- Institute for Mind and Biology, University of Chicago, Chicago, IL, USA
| |
Collapse
|
4
|
Sama MA, Nestor A, Cant JS. The Neural Dynamics of Face Ensemble and Central Face Processing. J Neurosci 2024; 44:e1027232023. [PMID: 38148151 PMCID: PMC10869155 DOI: 10.1523/jneurosci.1027-23.2023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2023] [Revised: 11/21/2023] [Accepted: 12/11/2023] [Indexed: 12/28/2023] Open
Abstract
Extensive work has investigated the neural processing of single faces, including the role of shape and surface properties. However, much less is known about the neural basis of face ensemble perception (e.g., simultaneously viewing several faces in a crowd). Importantly, the contribution of shape and surface properties have not been elucidated in face ensemble processing. Furthermore, how single central faces are processed within the context of an ensemble remains unclear. Here, we probe the neural dynamics of ensemble representation using pattern analyses as applied to electrophysiology data in healthy adults (seven males, nine females). Our investigation relies on a unique set of stimuli, depicting different facial identities, which vary parametrically and independently along their shape and surface properties. These stimuli were organized into ensemble displays consisting of six surround faces arranged in a circle around one central face. Overall, our results indicate that both shape and surface properties play a significant role in face ensemble encoding, with the latter demonstrating a more pronounced contribution. Importantly, we find that the neural processing of the center face precedes that of the surround faces in an ensemble. Further, the temporal profile of center face decoding is similar to that of single faces, while those of single faces and face ensembles diverge extensively from each other. Thus, our work capitalizes on a new center-surround paradigm to elucidate the neural dynamics of ensemble processing and the information that underpins it. Critically, our results serve to bridge the study of single and ensemble face perception.
Collapse
Affiliation(s)
- Marco Agazio Sama
- Department of Psychology, University of Toronto Scarborough, Toronto, Ontario M1C 1A4, Canada
| | - Adrian Nestor
- Department of Psychology, University of Toronto Scarborough, Toronto, Ontario M1C 1A4, Canada
| | - Jonathan Samuel Cant
- Department of Psychology, University of Toronto Scarborough, Toronto, Ontario M1C 1A4, Canada
| |
Collapse
|
5
|
Son G, Im HY, Albohn DN, Kveraga K, Adams RB, Sun J, Chong SC. Americans weigh an attended emotion more than Koreans in overall mood judgments. Sci Rep 2023; 13:19323. [PMID: 37935828 PMCID: PMC10630378 DOI: 10.1038/s41598-023-46723-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2023] [Accepted: 11/04/2023] [Indexed: 11/09/2023] Open
Abstract
Face ensemble coding is the perceptual ability to create a quick and overall impression of a group of faces, triggering social and behavioral motivations towards other people (approaching friendly people or avoiding an angry mob). Cultural differences in this ability have been reported, such that Easterners are better at face ensemble coding than Westerners are. The underlying mechanism has been attributed to differences in processing styles, with Easterners allocating attention globally, and Westerners focusing on local parts. However, the remaining question is how such default attention mode is influenced by salient information during ensemble perception. We created visual displays that resembled a real-world social setting in which one individual in a crowd of different faces drew the viewer's attention while the viewer judged the overall emotion of the crowd. In each trial, one face in the crowd was highlighted by a salient cue, capturing spatial attention before the participants viewed the entire group. American participants' judgment of group emotion more strongly weighed the attended individual face than Korean participants, suggesting a greater influence of local information on global perception. Our results showed that different attentional modes between cultural groups modulate social-emotional processing underlying people's perceptions and attributions.
Collapse
Affiliation(s)
- Gaeun Son
- Yonsei University, Seoul, South Korea
| | - Hee Yeon Im
- University of British Columbia, Vancouver, Canada
| | | | | | | | - Jisoo Sun
- Yonsei University, Seoul, South Korea
| | | |
Collapse
|
6
|
Wang T, Zhao Y, Jia J. Nonadditive integration of visual information in ensemble processing. iScience 2023; 26:107988. [PMID: 37822498 PMCID: PMC10562869 DOI: 10.1016/j.isci.2023.107988] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2022] [Revised: 09/03/2023] [Accepted: 09/16/2023] [Indexed: 10/13/2023] Open
Abstract
Statistically summarizing information from a stimulus array into an ensemble representation (e.g., the mean) improves the efficiency of visual processing. However, little is known about how the brain computes the ensemble statistics. Here, we propose that ensemble processing is realized by nonadditive integration, rather than linear averaging, of individual items. We used a linear regression model approach to extract EEG responses to three levels of information: the individual items, their local interactions, and their global interaction. The local and global interactions, representing nonadditive integration of individual items, elicited rapid and independent neural responses. Critically, only the neural representation of the global interaction predicted the precision of the ensemble perception at the behavioral level. Furthermore, spreading attention over the global pattern to enhance ensemble processing directly promoted rapid neural representation of the global interaction. Taken together, these findings advocate a global, nonadditive mechanism of ensemble processing in the brain.
Collapse
Affiliation(s)
- Tongyu Wang
- Department of Psychology, Hangzhou Normal University, Hangzhou 311121, Zhejiang, China
| | - Yuqing Zhao
- Department of Psychology, Hangzhou Normal University, Hangzhou 311121, Zhejiang, China
| | - Jianrong Jia
- Department of Psychology, Hangzhou Normal University, Hangzhou 311121, Zhejiang, China
- Zhejiang Philosophy and Social Science Laboratory for Research in Early Development and Childcare, Hangzhou Normal University, Hangzhou 311121, Zhejiang, China
| |
Collapse
|
7
|
Robinson MM, Brady TF. A quantitative model of ensemble perception as summed activation in feature space. Nat Hum Behav 2023; 7:1638-1651. [PMID: 37402880 PMCID: PMC10810262 DOI: 10.1038/s41562-023-01602-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2022] [Accepted: 04/14/2023] [Indexed: 07/06/2023]
Abstract
Ensemble perception is a process by which we summarize complex scenes. Despite the importance of ensemble perception to everyday cognition, there are few computational models that provide a formal account of this process. Here we develop and test a model in which ensemble representations reflect the global sum of activation signals across all individual items. We leverage this set of minimal assumptions to formally connect a model of memory for individual items to ensembles. We compare our ensemble model against a set of alternative models in five experiments. Our approach uses performance on a visual memory task for individual items to generate zero-free-parameter predictions of interindividual and intraindividual differences in performance on an ensemble continuous-report task. Our top-down modelling approach formally unifies models of memory for individual items and ensembles and opens a venue for building and comparing models of distinct memory processes and representations.
Collapse
Affiliation(s)
- Maria M Robinson
- Psychology Department, University of California, San Diego, La Jolla, CA, USA.
| | - Timothy F Brady
- Psychology Department, University of California, San Diego, La Jolla, CA, USA.
| |
Collapse
|
8
|
Cho J, Im HY, Yoon YJ, Joo SJ, Chong SC. The effect of masks on the emotion perception of a facial crowd. Sci Rep 2023; 13:14274. [PMID: 37653061 PMCID: PMC10471755 DOI: 10.1038/s41598-023-41366-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2023] [Accepted: 08/25/2023] [Indexed: 09/02/2023] Open
Abstract
The present study investigated the effect of facial masks on people's ability to perceive emotions in crowds. We presented faces with the bottom halves occluded by masks or full faces without occlusion. In two sequentially presented crowds, we varied the number of faces, emotional valence, and intensity of facial expressions, examining the impact of masks on the perception of crowd emotion. Participants reported which of the two crowds they would avoid based on the crowds' average emotions. The participants' ability to judge the average emotion of a crowd, especially a crowd expressing happiness, was impaired when the crowd wore masks. For faces covered by masks, crowd emotion judgments were more negatively biased than those without masks. However, participants could still distinguish the emotional intensities of a crowd wearing masks above chance. Additionally, participants responded more quickly to a crowd with more people without compromising accuracy, despite the perceptual challenges imposed by facial masks. Our results suggest that under ambiguous social situations in which individuals' emotions are partially hidden by masks, a large group may provide stronger social cues than a small group, thereby promoting communication and regulating social behaviors.
Collapse
Affiliation(s)
- Jieun Cho
- Graduate Program in Cognitive Science, Yonsei University, Seoul, South Korea
| | - Hee Yeon Im
- Department of Psychology, University of British Columbia, Vancouver, Canada
| | - Young Jun Yoon
- Department of Psychology, Pusan National University, Busan, South Korea
| | - Sung Jun Joo
- Department of Psychology, Pusan National University, Busan, South Korea
| | - Sang Chul Chong
- Graduate Program in Cognitive Science, Yonsei University, Seoul, South Korea.
- Department of Psychology, Yonsei University, Seoul, South Korea.
| |
Collapse
|
9
|
Zhao Y, Zeng T, Wang T, Fang F, Pan Y, Jia J. Subcortical encoding of summary statistics in humans. Cognition 2023; 234:105384. [PMID: 36736077 DOI: 10.1016/j.cognition.2023.105384] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2022] [Revised: 01/18/2023] [Accepted: 01/20/2023] [Indexed: 02/04/2023]
Abstract
Statistical encoding compresses redundant information from multiple items into a single summary metric (e.g., mean). Such statistical representation has been suggested to be automatic, but at which stage it is extracted is unknown. Here, we examined the involvement of the subcortex in the processing of summary statistics. We presented an array of circles dichoptically or monocularly while matching the number of perceived circles after binocular fusion. Experiments 1 and 2 showed that interocularly suppressed, invisible circles were automatically involved in the summary statistical representation, but only when they were presented to the same eye as the visible circles. This same-eye effect was further observed for consciously processed circles in Experiment 3, in which the estimated mean size of the circles was biased toward the information transmitted by monocular channels. Together, we provide converging evidence that the processing of summary statistics, an assumed high-level cognitive process, is mediated by subcortical structures.
Collapse
Affiliation(s)
- Yuqing Zhao
- Department of Psychology, Hangzhou Normal University, Hangzhou 311121, Zhejiang, China; Center for Cognition and Brain Disorders, The Affiliated Hospital of Hangzhou Normal University, Hangzhou 310015, Zhejiang, China
| | - Ting Zeng
- Center for Cognition and Brain Disorders, The Affiliated Hospital of Hangzhou Normal University, Hangzhou 310015, Zhejiang, China; School of Psychology, Jiangxi Normal University, Nanchang 330022, Jiangxi, China
| | - Tongyu Wang
- Department of Psychology, Hangzhou Normal University, Hangzhou 311121, Zhejiang, China; Center for Cognition and Brain Disorders, The Affiliated Hospital of Hangzhou Normal University, Hangzhou 310015, Zhejiang, China
| | - Fang Fang
- School of Psychological and Cognitive Sciences and Beijing Key Laboratory of Behavior and Mental Health, Peking University, Beijing 100871, China; IDG/McGovern Institute for Brain Research, Peking University, Beijing 100871, China; Peking-Tsinghua Center for Life Sciences, Peking University, Beijing 100871, China
| | - Yi Pan
- Department of Psychology, Hangzhou Normal University, Hangzhou 311121, Zhejiang, China.
| | - Jianrong Jia
- Department of Psychology, Hangzhou Normal University, Hangzhou 311121, Zhejiang, China; Center for Cognition and Brain Disorders, The Affiliated Hospital of Hangzhou Normal University, Hangzhou 310015, Zhejiang, China.
| |
Collapse
|
10
|
Iakovlev AU, Utochkin IS. Ensemble averaging: What can we learn from skewed feature distributions? J Vis 2023; 23:5. [PMID: 36602815 PMCID: PMC9832727 DOI: 10.1167/jov.23.1.5] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2021] [Accepted: 11/23/2022] [Indexed: 01/06/2023] Open
Abstract
Many studies have shown that observers can accurately estimate the average feature of a group of objects. However, the way the visual system relies on the information from each individual item is still under debate. Some models suggest some or all items sampled and averaged arithmetically. Another strategy implies "robust averaging," when middle elements gain greater weight than outliers. One version of a robust averaging model was recently suggested by Teng et al. (2021), who studied motion direction averaging in skewed feature distributions and found systematic biases toward their modes. They interpreted these biases as evidence for robust averaging and suggested a probabilistic weighting model based on minimization of the virtual loss function. In four experiments, we replicated systematic skew-related biases in another feature domain, namely, orientation averaging. Importantly, we show that the magnitude of the bias is not determined by the locations of the mean or mode alone, but is substantially defined by the shape of the whole feature distribution. We test a model that accounts for such distribution-dependent biases and robust averaging in a biologically plausible way. The model is based on well-established mechanisms of spatial pooling and population encoding of local features by neurons with large receptive fields. Both the loss functions model and the population coding model with a winner-take-all decoding rule accurately predicted the observed patterns, suggesting that the pooled population response model can be considered a neural implementation of the computational algorithms of information sampling and robust averaging in ensemble perception.
Collapse
Affiliation(s)
| | - Igor S Utochkin
- Institute for Mind and Biology, University of Chicago, Chicago, IL, USA
| |
Collapse
|
11
|
Kacin M, Cha O, Gauthier I. The Relation between Ensemble Coding of Length and Orientation Does Not Depend on Spatial Attention. VISION (BASEL, SWITZERLAND) 2022; 7:vision7010003. [PMID: 36649050 PMCID: PMC9844274 DOI: 10.3390/vision7010003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/30/2022] [Revised: 12/15/2022] [Accepted: 12/23/2022] [Indexed: 12/31/2022]
Abstract
Most people are good at estimating summary statistics for different features of groups of objects. For instance, people can selectively attend to different features of a group of lines and report ensemble properties such as the mean length or mean orientation and there are reliable individual differences in such ensemble judgment abilities. Our recent study found decisive evidence in support of a correlation between the errors on mean length and mean orientation judgments (r = 0.62). The present study investigates one possible mechanism for this correlation. The ability to allocate spatial attention to single items varies across individuals, and in the recent study, this variability could have contributed to both judgments because the location of lines was unpredictable. Here, we replicate this prior work with arrays of lines with fully predictable spatial locations, to lower the contribution of the ability to distribute attention effectively over all items in a display. We observed a strong positive correlation between errors on the length and orientation averaging tasks (r = 0.65). This provides evidence against individual differences in spatial attention as a common mechanism supporting mean length and orientation judgments. The present result aligns with the growing evidence for at least one ensemble-specific ability that applies across different kinds of features and stimuli.
Collapse
Affiliation(s)
- Melanie Kacin
- Department of Psychology, Vanderbilt University, Nashville, TN 37240, USA
- Department of Psychology, Queens College, City University of New York, Flushing, NY 11367, USA
| | - Oakyoon Cha
- Department of Psychology, Vanderbilt University, Nashville, TN 37240, USA
- Department of Psychology, Sungshin Women’s University, Seoul 02844, Republic of Korea
- Correspondence:
| | - Isabel Gauthier
- Department of Psychology, Vanderbilt University, Nashville, TN 37240, USA
| |
Collapse
|
12
|
The attentional boost effect facilitates visual category learning. Atten Percept Psychophys 2022; 84:2432-2443. [PMID: 36175764 DOI: 10.3758/s13414-022-02579-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/16/2022] [Indexed: 11/08/2022]
Abstract
Increasing evidence has shown that summary visual statistics, such as the mean size or centroid of locations, can be perceived without focal attention. Here, we tested the role of attention in visual category learning - rapid learning of visual similarities among paintings of the same artist. Participants encoded paintings from two famous artists into memory while simultaneously monitoring a rapid serial visual presentation (RSVP) stream of colored squares, pressing the spacebar for target colors and making no response to distractor colors. Paintings encoded with the RSVP targets were better remembered than those encoded with the RSVP distractors, demonstrating an Attentional Boost Effect. Importantly, pairing one artist's paintings with the RSVP targets led to better visual category learning - participants were more accurate at recognizing novel paintings from this artist, relative to another artist whose paintings were presented with the RSVP distractors. Thus, visual category learning is subjected to the same constraint of attention as exemplar memory, demonstrating common mechanisms for exemplar and category learning.
Collapse
|
13
|
Ren Y, Bu X, Wang M, Gong Y, Wang J, Yang Y, Li G, Zhang M, Zhou Y, Han ST. Synaptic plasticity in self-powered artificial striate cortex for binocular orientation selectivity. Nat Commun 2022; 13:5585. [PMID: 36151070 PMCID: PMC9508249 DOI: 10.1038/s41467-022-33393-8] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2022] [Accepted: 09/13/2022] [Indexed: 11/16/2022] Open
Abstract
Get in-depth understanding of each part of visual pathway yields insights to conquer the challenges that classic computer vision is facing. Here, we first report the bioinspired striate cortex with binocular and orientation selective receptive field based on the crossbar array of self-powered memristors which is solution-processed monolithic all-perovskite system with each cross-point containing one CsFAPbI3 solar cell directly stacking on the CsPbBr2I memristor. The plasticity of self-powered memristor can be modulated by optical stimuli following triplet-STDP rules. Furthermore, plasticity of 3 × 3 flexible crossbar array of self-powered memristors has been successfully modulated based on generalized BCM learning rule for optical-encoded pattern recognition. Finally, we implemented artificial striate cortex with binocularity and orientation selectivity based on two simulated 9 × 9 self-powered memristors networks. The emulation of striate cortex with binocular and orientation selectivity will facilitate the brisk edge and corner detection for machine vision in the future applications. Designing efficient bio-inspired vision systems remains a challenge. Here, the authors report a bio-inspired striate visual cortex with binocular and orientation selective receptive field based on self-powered memristor to enable machine vision with brisk edge and corner detection in the future applications.
Collapse
Affiliation(s)
- Yanyun Ren
- Institute for Microscale Optoelectronics, Shenzhen University, Shenzhen, 518060, PR China.,Institute for Advanced Study, Shenzhen University, Shenzhen, 518060, PR China
| | - Xiaobo Bu
- Institute for Advanced Study, Shenzhen University, Shenzhen, 518060, PR China
| | - Ming Wang
- Key Laboratory of Optoelectronic Devices and Systems of Ministry of Education and Guangdong Province, College of Physics and Optoelectronic Engineering, Shenzhen University, Shenzhen, 518060, PR China
| | - Yue Gong
- College of Electronics and Information Engineering, Shenzhen University, Shenzhen, 518060, PR China
| | - Junjie Wang
- College of Electronics and Information Engineering, Shenzhen University, Shenzhen, 518060, PR China
| | - Yuyang Yang
- College of Electronics and Information Engineering, Shenzhen University, Shenzhen, 518060, PR China
| | - Guijun Li
- Key Laboratory of Optoelectronic Devices and Systems of Ministry of Education and Guangdong Province, College of Physics and Optoelectronic Engineering, Shenzhen University, Shenzhen, 518060, PR China
| | - Meng Zhang
- College of Electronics and Information Engineering, Shenzhen University, Shenzhen, 518060, PR China
| | - Ye Zhou
- Institute for Advanced Study, Shenzhen University, Shenzhen, 518060, PR China
| | - Su-Ting Han
- College of Electronics and Information Engineering, Shenzhen University, Shenzhen, 518060, PR China.
| |
Collapse
|
14
|
Jia J, Wang T, Chen S, Ding N, Fang F. Ensemble size perception: Its neural signature and the role of global interaction over individual items. Neuropsychologia 2022; 173:108290. [PMID: 35697088 DOI: 10.1016/j.neuropsychologia.2022.108290] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2021] [Revised: 06/02/2022] [Accepted: 06/07/2022] [Indexed: 10/18/2022]
Abstract
To efficiently process complex visual scenes, the visual system often summarizes statistical information across individual items and represents them as an ensemble. However, due to the lack of techniques to disentangle the representation of the ensemble from that of the individual items constituting the ensemble, whether there exists a specialized neural mechanism for ensemble processing and how ensemble perception is computed in the brain remain unknown. To address these issues, we used a frequency-tagging EEG approach to track brain responses to periodically updated ensemble sizes. Neural responses tracking the ensemble size were detected in parieto-occipital electrodes, revealing a global and specialized neural mechanism of ensemble size perception. We then used the temporal response function to isolate neural responses to the individual sizes and their interactions. Notably, while the individual sizes and their local and global interactions were encoded in the EEG signals, only the global interaction contributed directly to the ensemble size perception. Finally, distributed attention to the global stimulus pattern enhanced the neural signature of the ensemble size, mainly by modulating the neural representation of the global interaction between all individual sizes. These findings advocate a specialized, global neural mechanism of ensemble size perception and suggest that global interaction between individual items contributes to ensemble perception.
Collapse
Affiliation(s)
- Jianrong Jia
- Center for Cognition and Brain Disorders, The Affiliated Hospital of Hangzhou Normal University, Hangzhou, 311121, China; Institute of Psychological Sciences, Hangzhou Normal University, Hangzhou, 311121, China
| | - Tongyu Wang
- Center for Cognition and Brain Disorders, The Affiliated Hospital of Hangzhou Normal University, Hangzhou, 311121, China; Institute of Psychological Sciences, Hangzhou Normal University, Hangzhou, 311121, China
| | - Siqi Chen
- Center for Cognition and Brain Disorders, The Affiliated Hospital of Hangzhou Normal University, Hangzhou, 311121, China; Institute of Psychological Sciences, Hangzhou Normal University, Hangzhou, 311121, China
| | - Nai Ding
- Key Laboratory for Biomedical Engineering of Ministry of Education, College of Biomedical Engineering and Instrument Sciences, Zhejiang University, Hangzhou, 311121, China; Research Center for Advanced Artificial Intelligence Theory, Zhejiang Lab, Hangzhou, 311121, China
| | - Fang Fang
- School of Psychological and Cognitive Sciences and Beijing Key Laboratory of Behavior and Mental Health, Peking University, Beijing, 100871, China; IDG/McGovern Institute for Brain Research, Peking University, Beijing, 100871, China; Key Laboratory of Machine Perception (Ministry of Education), Peking University, Beijing, 100871, China; Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, 100871, China.
| |
Collapse
|
15
|
Abstract
Visual perception is capable of pooling multiple local orientation signals into a single more accurate summary orientation. However, there is still a lack of systematic inquiry into which summary statistics are implemented in that process. Here, the task was to recognize in which direction, clockwise or counter-clockwise, the mean orientation of a set of randomly distributed Gabor patches (N = 1, 2, 4, and 8) was rotated from the implicit vertical. The mean orientation discrimination accuracy did not improve with the increase of the number N of elements in proportion to the square-root-N, as could be expected if noisy internal representations were arithmetically averaged. The Proportion of Informative Elements (PIE), defined as the percentage of elements having an orientation different from the vertical, also affected the discrimination precision, violating the arithmetic averaging rules. The decrease in the orientation discrimination precision with the increase of the PIE would suggest that the orientation pooling could be more adequately described by a quadratic or higher power mean. Thus, we parameterized the averaging process for the power parameter of the generalized mean formula. The results indicate that different pooling rules in different trials may apply, for example, the arithmetic mean in some and the maximal deviation rule in others. It is concluded that pooling of orientation information is a relatively inaccurate process for which different perceptual cues and their combination rules can be used.
Collapse
|
16
|
Franconeri SL, Padilla LM, Shah P, Zacks JM, Hullman J. The Science of Visual Data Communication: What Works. Psychol Sci Public Interest 2021; 22:110-161. [PMID: 34907835 DOI: 10.1177/15291006211051956] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Effectively designed data visualizations allow viewers to use their powerful visual systems to understand patterns in data across science, education, health, and public policy. But ineffectively designed visualizations can cause confusion, misunderstanding, or even distrust-especially among viewers with low graphical literacy. We review research-backed guidelines for creating effective and intuitive visualizations oriented toward communicating data to students, coworkers, and the general public. We describe how the visual system can quickly extract broad statistics from a display, whereas poorly designed displays can lead to misperceptions and illusions. Extracting global statistics is fast, but comparing between subsets of values is slow. Effective graphics avoid taxing working memory, guide attention, and respect familiar conventions. Data visualizations can play a critical role in teaching and communication, provided that designers tailor those visualizations to their audience.
Collapse
Affiliation(s)
| | - Lace M Padilla
- Department of Cognitive and Information Sciences, University of California, Merced
| | - Priti Shah
- Department of Psychology, University of Michigan
| | - Jeffrey M Zacks
- Department of Psychological & Brain Sciences, Washington University in St. Louis
| | | |
Collapse
|
17
|
No effect of spatial attention on the processing of a motion ensemble: Evidence from Posner cueing. Atten Percept Psychophys 2021; 84:1845-1857. [PMID: 34811633 DOI: 10.3758/s13414-021-02392-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/07/2021] [Indexed: 11/08/2022]
Abstract
The formation of ensemble codes is an efficient means through which the visual system represents vast arrays of information. This has led to the claim that ensemble representations are formed with minimal reliance on attentional resources. However, evidence is mixed regarding the effects of attention on ensemble processing, and researchers do not always make it clear how attention is being manipulated by their paradigm of choice. In this study, we examined the effects of Posner cueing - a well-established method of manipulating spatial attention - on the processing of a global motion stimulus, a naturalistic ensemble that requires the pooling of local motion signals. In Experiment 1, using a centrally presented, predictive attentional cue, we found no effect of spatial attention on global motion performance: Accuracy in invalid trials, where attention was misdirected by the cue, did not differ from accuracy in valid trials, where attention was directed to the location of the motion stimulus. In Experiment 2, we maximized the potential for our paradigm to reveal any attentional effects on global motion processing by using a threshold-based measure of performance; however, despite this change, there was again no evidence of an attentional effect on performance. Together, our results show that the processing of a global motion stimulus is unaffected when spatial attention is misdirected, and speak to the efficiency with which such ensemble stimuli are processed.
Collapse
|
18
|
Neural representations of ensemble coding in the occipital and parietal cortices. Neuroimage 2021; 245:118680. [PMID: 34718139 DOI: 10.1016/j.neuroimage.2021.118680] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2021] [Revised: 10/17/2021] [Accepted: 10/23/2021] [Indexed: 11/23/2022] Open
Abstract
The human visual system is able to extract summary statistics from sets of similar items, but the underlying neural mechanism remains poorly understood. Using functional magnetic resonance imaging (fMRI) and an encoding model, we examined how the neural representation of ensemble coding is constructed by manipulating the task-relevance of ensemble features. We found a gradual increase in orientation-selective responses to the mean orientation of multiple stimuli along the visual hierarchy only when these orientations were task-relevant. Such responses to the ensemble orientation were present in the extrastriate area, V3, even when the mean orientation was not task-relevant, indicating that the ensemble representation can co-exist with the task-relevant individual feature representation. Ensemble orientations were also represented in frontal regions, but those representations were robust only when each mean orientation was linked to a motor response dimension. Together, our findings suggest that the neural representation of the ensemble percept is formed by pooling signals at multiple levels of the visual processing stream.
Collapse
|
19
|
Multivariate summary of a complex scene. Vision Res 2021; 189:11-26. [PMID: 34508940 DOI: 10.1016/j.visres.2021.08.006] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2021] [Revised: 07/23/2021] [Accepted: 08/26/2021] [Indexed: 11/21/2022]
Abstract
The current study investigated how people summarize and represent objects with multiple features to cope with the complexity due to the number of objects and feature dimensions. We presented a set of circles whose color and size were either correlated perfectly (r = 1) or not correlated at all (r = 0). Using a membership identification task, we found that participants formed a statistical representation that included information about conjunctions as well as each color and size dimensions. In addition, we found that participants represented different set boundaries depending on the correlation between features of a set. Lastly, a pair-matching task revealed that participants predicted one feature value from the other feature value based on the correlation between features of a set. Our findings suggest that people represent a multi-feature ensemble statistically as a multivariate feature distribution, which is an efficient strategy to cope with scene complexity.
Collapse
|
20
|
The group extremity effect: Group ratings of negatively and positively evaluated groups of faces are more extreme than the average ratings of their members. JOURNAL OF EXPERIMENTAL SOCIAL PSYCHOLOGY 2021. [DOI: 10.1016/j.jesp.2021.104161] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
21
|
Differential neurodynamics and connectivity in the dorsal and ventral visual pathways during perception of emotional crowds and individuals: a MEG study. COGNITIVE AFFECTIVE & BEHAVIORAL NEUROSCIENCE 2021; 21:776-792. [PMID: 33725334 DOI: 10.3758/s13415-021-00880-2] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Accepted: 02/03/2021] [Indexed: 11/08/2022]
Abstract
Reading the prevailing emotion of groups of people ("crowd emotion") is critical to understanding their overall intention and disposition. It alerts us to potential dangers, such as angry mobs or panicked crowds, giving us time to escape. A critical aspect of processing crowd emotion is that it must occur rapidly, because delays often are costly. Although knowing the timing of neural events is crucial for understanding how the brain guides behaviors using coherent signals from a glimpse of multiple faces, this information is currently lacking in the literature on face ensemble coding. Therefore, we used magnetoencephalography to examine the neurodynamics in the dorsal and ventral visual streams and the periamygdaloid cortex to compare perception of groups of faces versus individual faces. Forty-six participants compared two groups of four faces or two individual faces with varying emotional expressions and chose which group or individual they would avoid. We found that the dorsal stream was activated as early as 68 msec after the onset of stimuli containing groups of faces. In contrast, the ventral stream was activated later and predominantly for individual face stimuli. The latencies of the dorsal stream activation peaks correlated with participants' response times for facial crowds. We also found enhanced connectivity earlier between the periamygdaloid cortex and the dorsal stream regions for crowd emotion perception. Our findings suggest that ensemble coding of facial crowds proceeds rapidly and in parallel by engaging the dorsal stream to mediate adaptive social behaviors, via a distinct route from single face perception.
Collapse
|
22
|
Ensemble perception: Extracting the average of perceptual versus numerical stimuli. Atten Percept Psychophys 2021; 83:956-969. [PMID: 33392976 DOI: 10.3758/s13414-020-02192-y] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/25/2020] [Indexed: 11/08/2022]
Abstract
Recent research has established that humans can extract the average perceptual feature over briefly presented arrays of visual elements or the average of a rapid temporal sequence of numbers. Here we compared the extraction of the average over briefly presented arrays, for a perceptual feature (orientations) and for numerical values (1-9 digits), using an identical experimental design for the two tasks. We hypothesized that the averaging of numbers, more than of orientations, would be constrained by capacity limitations. Arrays of Gabor elements or digits were simultaneously presented for 300 ms and observers were required to estimate the average on a continuous response scale. In each trial the elements were sampled from normal distributions (of various means) and we varied the set size (4-12). We found that while for orientation the averaging precision remained constant with set size, for numbers it decreased with set size. Using computational modeling we also extracted capacity parameters (the number of elements that are pooled in the average extraction). Despite marked heterogeneity between observers, the capacity for orientations (around eight items) was much larger than for numbers (around four items). The orientation task also had a larger fraction of participants relying on distributed attention to all elements. Our study thus supports the idea that numbers more than perceptual features are subject to capacity or attentional limitations when observers need to evaluate the average over an ensemble of stimuli.
Collapse
|
23
|
A direct comparison of central tendency recall and temporal integration in the successive field iconic memory task. Atten Percept Psychophys 2021; 83:1337-1356. [PMID: 33389675 PMCID: PMC7778862 DOI: 10.3758/s13414-020-02187-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/21/2020] [Indexed: 11/30/2022]
Abstract
The ensemble coding literature suggests the existence of a fast, automatic formation of some ensemble codes. Can statistical representations, such as memory for the central tendency along a particular visual feature dimension, be extracted from information held in the sensory register? Furthermore, can knowledge of early, iconic memory processes be used to determine how central tendency is extracted? We focused on the potential role of visible persistence mechanisms that support temporal integration. We tested whether mean orientation could be accurately recalled from brief visual displays using the successive field task. On separate blocks of trials, participants were asked to report the location of a split element (requiring differentiation of frames), a missing element (requiring integration across frames), and the average orientation of elements pooled across both frames (central tendency recall). Results replicate the expected tradeoff between differentiation and integration performance across inter-frame interval (IFI). In contrast, precision of mean estimates was high and invariant across IFIs. A manipulation of within-frame distributional similarity coupled with simulations using 12 models supported 2-item subsampling. The results argue against the “strategic” interpretation of subsampling since 2-item readout was predicted by information theoretic estimates of STM encoding rate: the 2 items were not from a superset in STM. Most crucially, the results argue against the various early “preattentive/parallel/global pooling” accounts and instead suggest that non-selective readout of information from iconic memory supplies a relatively small amount of item information to STM, and it is only at this point that the computation of ensemble averages begins.
Collapse
|
24
|
Quality of average representation can be enhanced by refined individual items. Atten Percept Psychophys 2020; 83:970-981. [PMID: 33033987 DOI: 10.3758/s13414-020-02139-3] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/06/2020] [Indexed: 11/08/2022]
Abstract
Ensemble perception is efficient because it summarizes redundant and complex information. However, it loses the fine details of individual items during the averaging process. Such characteristics of ensemble perception are similar to those of coarse processing. Here, we tested whether extracting an average of a set was similar to coarse processing. To manipulate coarse processing, we used the fast flicker adaptation known as suppressing coarse information processed by the magnocellular pathway. We hypothesized that if computing the average of a set relied on coarse processing, the precision of an averaging task should decrease after adaptation compared to baseline (no-adaptation). Across experiments with various features (orientation in Experiment 1, size in Experiment 2, and facial expression in Experiment 3), we found that suppressing coarse information did not disrupt the performance of the averaging tasks. Rather, adaptation increased the precision of mean representation. The precision of mean representation might have increased because fine information was relatively enhanced after adaptation. Our results suggest that the quality of ensemble representation relies on that of individual items.
Collapse
|