1
|
Djambazovska S, Zafer A, Ramezanpour H, Kreiman G, Kar K. The Impact of Scene Context on Visual Object Recognition: Comparing Humans, Monkeys, and Computational Models. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.05.27.596127. [PMID: 38854011 PMCID: PMC11160639 DOI: 10.1101/2024.05.27.596127] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2024]
Abstract
During natural vision, we rarely see objects in isolation but rather embedded in rich and complex contexts. Understanding how the brain recognizes objects in natural scenes by integrating contextual information remains a key challenge. To elucidate neural mechanisms compatible with human visual processing, we need an animal model that behaves similarly to humans, so that inferred neural mechanisms can provide hypotheses relevant to the human brain. Here we assessed whether rhesus macaques could model human context-driven object recognition by quantifying visual object identification abilities across variations in the amount, quality, and congruency of contextual cues. Behavioral metrics revealed strikingly similar context-dependent patterns between humans and monkeys. However, neural responses in the inferior temporal (IT) cortex of monkeys that were never explicitly trained to discriminate objects in context, as well as current artificial neural network models, could only partially explain this cross-species correspondence. The shared behavioral variance unexplained by context-naive neural data or computational models highlights fundamental knowledge gaps. Our findings demonstrate an intriguing alignment of human and monkey visual object processing that defies full explanation by either brain activity in a key visual region or state-of-the-art models.
Collapse
Affiliation(s)
- Sara Djambazovska
- York University, Department of Biology and Centre for Vision Research, Toronto, Canada
- Children’s Hospital, Harvard Medical School, MA, USA
| | - Anaa Zafer
- York University, Department of Biology and Centre for Vision Research, Toronto, Canada
| | - Hamidreza Ramezanpour
- York University, Department of Biology and Centre for Vision Research, Toronto, Canada
| | | | - Kohitij Kar
- York University, Department of Biology and Centre for Vision Research, Toronto, Canada
| |
Collapse
|
2
|
Ansado J, Eynard B, Mirofle N, Mennetrey C, Banchereau J, Sablon M, Lokietek E, Le Vourc'h F, Tissot J, Wrobel J, Martel C, Granon S, Suarez S. Adult norms for the decision-making MindPulse Digital Test. APPLIED NEUROPSYCHOLOGY. ADULT 2024:1-19. [PMID: 38354094 DOI: 10.1080/23279095.2024.2307413] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/16/2024]
Abstract
We present adult normalized data for MindPulse (MP), a new tool evaluating attentional and executive functioning (AEF) in decision-making. We recruited 722 neurotypical participants (18-80 years), with 149 retested. The MP test includes three tasks: Simple Reaction Time (SRT), Go/No-go, and complex Go/No-go, involving perceptual components, motor responses, and measurements of reaction time (RT) and correctness. We compare responses, evaluating 14 cognitive indices (including new composite indices to describe AEF: Executive Speed and Reaction to Difficulty). We adjust for age/sex effects, introduce a difficulty scale, and consider standard deviations, aberrant times, and Spearman Correlation for speed-accuracy balance. Wilcoxon unpaired rank test is used to assess sex effects, and linear regression is employed to assess the age linear dependency model on the normalized database. The study demonstrated age and sex effects on RTs, in all three subtests, and the ability to correct it for individual results. The test showed excellent validity (Cronbach Alpha for the three subtasks is 92, 87, 95%) and high internal consistency (p < 0.001 for each subtask significantly faster than the more complex subtask) of the MP across the wide age range. Results showed correlation within the three RT parts of the test (p < .001 for each) and the independence of SRT, RD, and ES indices. The Retest effect was lower than intersubject variance, showing consistency over time. This study highlights the MP test's strong validity on a homogeneous, large adult sample. It emphasizes assessing AEF and Reaction to Difficulty dynamically with high sensitivity.
Collapse
Affiliation(s)
| | - Bertrand Eynard
- It's Brain SAS, Orsay, France
- IPHT/DRF/CEA Institut de Physique Théorique, Gif-sur-Yvette, France
- CRM Montréal, Montreal, Canada
| | - Nastasia Mirofle
- Institut des Neurosciences de Paris-Saclay, CNRS UMR 9197, Université Paris-Saclay, Paris, France
| | | | | | | | - Eline Lokietek
- Centre SSR Marguerite Boucicaut, Chalon sur Saône, France
| | | | | | | | - Claire Martel
- Centre de Santé Universitaire, St Martin d'Hères, France
| | - Sylvie Granon
- Institut des Neurosciences de Paris-Saclay, CNRS UMR 9197, Université Paris-Saclay, Paris, France
| | | |
Collapse
|
3
|
Conceptualizing values abstractly: Longitudinal development and the role of concept formation and working memory. COGNITIVE DEVELOPMENT 2022. [DOI: 10.1016/j.cogdev.2022.101267] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
4
|
Chen ZS, Zhang X, Long X, Zhang SJ. Are Grid-Like Representations a Component of All Perception and Cognition? Front Neural Circuits 2022; 16:924016. [PMID: 35911570 PMCID: PMC9329517 DOI: 10.3389/fncir.2022.924016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2022] [Accepted: 06/14/2022] [Indexed: 11/24/2022] Open
Abstract
Grid cells or grid-like responses have been reported in the rodent, bat and human brains during various spatial and non-spatial tasks. However, the functions of grid-like representations beyond the classical hippocampal formation remain elusive. Based on accumulating evidence from recent rodent recordings and human fMRI data, we make speculative accounts regarding the mechanisms and functional significance of the sensory cortical grid cells and further make theory-driven predictions. We argue and reason the rationale why grid responses may be universal in the brain for a wide range of perceptual and cognitive tasks that involve locomotion and mental navigation. Computational modeling may provide an alternative and complementary means to investigate the grid code or grid-like map. We hope that the new discussion will lead to experimentally testable hypotheses and drive future experimental data collection.
Collapse
Affiliation(s)
- Zhe Sage Chen
- Department of Psychiatry, Department of Neuroscience and Physiology, Neuroscience Institute, New York University School of Medicine, New York, NY, United States
| | - Xiaohan Zhang
- Department of Psychiatry, Department of Neuroscience and Physiology, Neuroscience Institute, New York University School of Medicine, New York, NY, United States
| | - Xiaoyang Long
- Department of Neurosurgery, Xinqiao Hospital, Army Medical University, Chongqing, China
| | - Sheng-Jia Zhang
- Department of Neurosurgery, Xinqiao Hospital, Army Medical University, Chongqing, China
| |
Collapse
|
5
|
Doradzińska Ł, Furtak M, Bola M. Perception of semantic relations in scenes: A registered report study of attention hold. Conscious Cogn 2022; 100:103315. [PMID: 35339910 DOI: 10.1016/j.concog.2022.103315] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2021] [Revised: 03/07/2022] [Accepted: 03/12/2022] [Indexed: 11/16/2022]
Abstract
To what extent the semantic relations present in scenes guide spatial attention automatically remains a matter of debate. Considering that spatial attention can be understood as a sequence of shifts, engagements, and disengagements, semantic relations might affect each stage of this process differently. Therefore, we investigated whether objects that violate semantic rules engage attention for longer than objects that are expected in a given context. The experiment involved a central presentation of a distractor scene that contained a semantically congruent or incongruent object, and a peripheral presentation of a small target letter. We found that incongruent scenes did not delay responses to the peripheral target, which indicates that they did not hold attention for longer than congruent scenes. Therefore, by showing that violations of semantic relations do not engage attention automatically, our study contributes to a better understanding of how attention operates in naturalistic settings.
Collapse
Affiliation(s)
- Łucja Doradzińska
- Laboratory of Brain Imaging, Nencki Institute of Experimental Biology of Polish Academy of Sciences, 3 Pasteur Street, 02-093 Warsaw, Poland
| | - Marcin Furtak
- Laboratory of Brain Imaging, Nencki Institute of Experimental Biology of Polish Academy of Sciences, 3 Pasteur Street, 02-093 Warsaw, Poland
| | - Michał Bola
- Laboratory of Brain Imaging, Nencki Institute of Experimental Biology of Polish Academy of Sciences, 3 Pasteur Street, 02-093 Warsaw, Poland.
| |
Collapse
|
6
|
Furtak M, Mudrik L, Bola M. The forest, the trees, or both? Hierarchy and interactions between gist and object processing during perception of real-world scenes. Cognition 2021; 221:104983. [PMID: 34968994 DOI: 10.1016/j.cognition.2021.104983] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2021] [Revised: 11/30/2021] [Accepted: 12/02/2021] [Indexed: 11/26/2022]
Abstract
The global-to-local theories of perception assume that the gist of a scene is computed early and automatically, whereas recognition of objects occurs at a later processing stage, requires attentional resources, and is primed by the representation of gist. To test these theoretical predictions, we investigated the processing hierarchy of gist- and object-recognition and their interaction in two experiments (total N = 60). Backward-masked images of real-world scenes were presented for a range of brief durations - between 8 ms and 100 ms, and participants performed either an object or a background classification task, in separate blocks. We report three main findings. First, scenes' backgrounds were generally classified more accurately than foreground objects, but recognition of objects was boosted to the same level as backgrounds by cueing spatial attention to the exact object's location. Second, backgrounds influence objects' recognition, as objects presented within semantically incongruent backgrounds were classified less accurately. Third, objects influence background categorization, as backgrounds comprising incongruent objects were also classified less accurately. Therefore, the first two findings support the global-to-local theories, implying that gists are indeed more readily perceived than objects, probably at an earlier stage. Yet the latter finding that objects also influence gist recognition suggests a more parallel and interactive view of both processes than previously assumed.
Collapse
Affiliation(s)
- Marcin Furtak
- Laboratory of Brain Imaging, Nencki Institute of Experimental Biology of Polish Academy of Sciences, 3 Pasteur Street, 02-093 Warsaw, Poland
| | - Liad Mudrik
- School of Psychological Science, Tel Aviv University, PO Box 39040, Tel Aviv, Israel; Sagol School of Neuroscience, Tel Aviv University, PO Box 39040, Tel Aviv, Israel
| | - Michał Bola
- Laboratory of Brain Imaging, Nencki Institute of Experimental Biology of Polish Academy of Sciences, 3 Pasteur Street, 02-093 Warsaw, Poland.
| |
Collapse
|
7
|
Suarez S, Eynard B, Granon S. A Dissociation of Attention, Executive Function and Reaction to Difficulty: Development of the MindPulse Test, a Novel Digital Neuropsychological Test for Precise Quantification of Perceptual-Motor Decision-Making Processes. Front Neurosci 2021; 15:650219. [PMID: 34349614 PMCID: PMC8326915 DOI: 10.3389/fnins.2021.650219] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2021] [Accepted: 06/15/2021] [Indexed: 12/02/2022] Open
Abstract
Traditionally, neuropsychological testing has assessed processing speed and precision, closely related to the ability to perform high-order cognitive tasks. An individual making a decision under time pressure must constantly rebalance its speed to action in order to account for possible errors. A deficit in processing speed appears to be afrequent disorder caused by cerebral damage — but it can be hard to pinpoint the exact cause of the slowdown. It is therefore important to separate the perceptual-motor component of processing speed from the decision-time component. We present a technique to isolate Reaction Times (RTs): a short digital test to assess the decision-making abilities of individuals by gauging their ability to balance between speed and precision. Our hypothesis is that some subjects willaccelerate, and others slow down in the face of the difficulty. This pilot study, conducted on 83 neurotypical adult volunteers, used images stimuli. The test was designed to measure RTs and correctness. After learning release gesture, the subjects were presented with three tasks: a simple Reaction Time task, a Go/No-Go, and a complex Go/No-Go with 2 simultaneous Choices. All three tasks have in common a perceptual component and a motor response. By measuring the 3 reference points requiring attentional and executive processing, while progressively increasing the conceptual complexity of the task, we were able to compare the processing times for different tasks — thus calculating the deceleration specific to the reaction time linked to difficulty. We defined the difficulty coefficient of a task as being the ratio of the group average time of this task minus the base time/average time of the unit task minus the base time. We found that RTs can be broken down into three elementary, uncorrelated components: Reaction Time, Executive Speed, and Reaction to Difficulty (RD). We hypothesized that RD reflects how the subject reacts to difficulty by accelerating (RD < 0) or decelerating (RD > 0). Thus we provide here a first proof of concept: the ability to measure four axes of the speed-precision trade-off inherent in a subject’s fundamental decision making: perceptual-motor speed, executive speed, subject accuracy, and reaction to difficulty.
Collapse
Affiliation(s)
| | - Bertrand Eynard
- IHES, Institut des Hautes Études Scientifiques, Bures-sur-Yvette, France.,IPHT/DRF/CEA Institut de Physique Théorique, Gif-sur-Yvette, France
| | - Sylvie Granon
- Université Paris-Saclay, CNRS, Institut des Neurosciences Paris-Saclay, Gif-sur-Yvette, France
| |
Collapse
|
8
|
Mansouri FA, Buckley MJ, Fehring DJ, Tanaka K. The Role of Primate Prefrontal Cortex in Bias and Shift Between Visual Dimensions. Cereb Cortex 2021; 30:85-99. [PMID: 31220222 PMCID: PMC7029686 DOI: 10.1093/cercor/bhz072] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2018] [Revised: 03/11/2019] [Accepted: 03/11/2019] [Indexed: 12/23/2022] Open
Abstract
Imaging and neural activity recording studies have shown activation in the primate prefrontal cortex when shifting attention between visual dimensions is necessary to achieve goals. A fundamental unanswered question is whether representations of these dimensions emerge from top-down attentional processes mediated by prefrontal regions or from bottom-up processes within visual cortical regions. We hypothesized a causative link between prefrontal cortical regions and dimension-based behavior. In large cohorts of humans and macaque monkeys, performing the same attention shifting task, we found that both species successfully shifted between visual dimensions, but both species also showed a significant behavioral advantage/bias to a particular dimension; however, these biases were in opposite directions in humans (bias to color) versus monkeys (bias to shape). Monkeys' bias remained after selective bilateral lesions within the anterior cingulate cortex (ACC), frontopolar cortex, dorsolateral prefrontal cortex (DLPFC), orbitofrontal cortex (OFC), or superior, lateral prefrontal cortex. However, lesions within certain regions (ACC, DLPFC, or OFC) impaired monkeys' ability to shift between these dimensions. We conclude that goal-directed processing of a particular dimension for the executive control of behavior depends on the integrity of prefrontal cortex; however, representation of competing dimensions and bias toward them does not depend on top-down prefrontal-mediated processes.
Collapse
Affiliation(s)
- Farshad A Mansouri
- Cognitive Neuroscience Laboratory, Department of Physiology, Monash Biomedicine Discovery Institute, Monash University, Victoria, Australia.,ARC Centre of Excellence for Integrative Brain Function, Monash University, Victoria, Australia
| | - Mark J Buckley
- Department of Experimental Psychology, Oxford University, Oxford, UK
| | - Daniel J Fehring
- Cognitive Neuroscience Laboratory, Department of Physiology, Monash Biomedicine Discovery Institute, Monash University, Victoria, Australia.,ARC Centre of Excellence for Integrative Brain Function, Monash University, Victoria, Australia
| | - Keiji Tanaka
- Cognitive Brain Mapping Laboratory, RIKEN Center for Brain Science, Wako, Saitama, Japan
| |
Collapse
|
9
|
Lai LY, Frömer R, Festa EK, Heindel WC. Age-related changes in the neural dynamics of bottom-up and top-down processing during visual object recognition: an electrophysiological investigation. Neurobiol Aging 2020; 94:38-49. [PMID: 32562874 DOI: 10.1016/j.neurobiolaging.2020.05.010] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2019] [Revised: 03/10/2020] [Accepted: 05/15/2020] [Indexed: 10/24/2022]
Abstract
When recognizing objects in our environments, we rely on both what we see and what we know. While older adults often display increased sensitivity to top-down influences of contextual information during object recognition, the locus of this increased sensitivity remains unresolved. To examine the effects of aging on the neural dynamics of bottom-up and top-down visual processing during rapid object recognition, we probed the differential effects of object perceptual ambiguity and scene context congruity on specific EEG event-related potential components indexing dissociable processes along the visual processing stream. Older adults displayed larger behavioral scene congruity effects than young adults. Older adults' larger visual P2 amplitudes to object perceptual ambiguity (as opposed to the scene congruity P2 effects in young adults) suggest continued resolution of perceptual ambiguity that interfered with scene congruity processing, while post-perceptual semantic integration (as indexed by N400) remained largely intact. These findings suggest that compromised bottom-up perceptual processing in healthy aging leads to an increased involvement of top-down processes to resolve greater perceptual ambiguity during object recognition.
Collapse
Affiliation(s)
- Leslie Y Lai
- Department of Cognitive, Linguistic, and Psychological Sciences, Brown University, Providence, RI 02912
| | - Romy Frömer
- Department of Cognitive, Linguistic, and Psychological Sciences, Brown University, Providence, RI 02912
| | - Elena K Festa
- Department of Cognitive, Linguistic, and Psychological Sciences, Brown University, Providence, RI 02912
| | - William C Heindel
- Department of Cognitive, Linguistic, and Psychological Sciences, Brown University, Providence, RI 02912.
| |
Collapse
|
10
|
Cauchoix M, Chaine AS, Barragan-Jason G. Cognition in Context: Plasticity in Cognitive Performance in Response to Ongoing Environmental Variables. Front Ecol Evol 2020. [DOI: 10.3389/fevo.2020.00106] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
|
11
|
Kreiman G, Serre T. Beyond the feedforward sweep: feedback computations in the visual cortex. Ann N Y Acad Sci 2020; 1464:222-241. [PMID: 32112444 PMCID: PMC7456511 DOI: 10.1111/nyas.14320] [Citation(s) in RCA: 27] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2019] [Revised: 01/24/2020] [Accepted: 01/30/2020] [Indexed: 11/28/2022]
Abstract
Visual perception involves the rapid formation of a coarse image representation at the onset of visual processing, which is iteratively refined by late computational processes. These early versus late time windows approximately map onto feedforward and feedback processes, respectively. State-of-the-art convolutional neural networks, the main engine behind recent machine vision successes, are feedforward architectures. Their successes and limitations provide critical information regarding which visual tasks can be solved by purely feedforward processes and which require feedback mechanisms. We provide an overview of recent work in cognitive neuroscience and machine vision that highlights the possible role of feedback processes for both visual recognition and beyond. We conclude by discussing important open questions for future research.
Collapse
Affiliation(s)
- Gabriel Kreiman
- Children’s Hospital, Harvard Medical School and Center for Brains, Minds, and Machines
| | - Thomas Serre
- Cognitive Linguistic & Psychological Sciences, Carney Institute for Brain Science, Brown University
| |
Collapse
|
12
|
Age effects on the neural processing of object-context associations in briefly flashed natural scenes. Neuropsychologia 2020; 136:107264. [DOI: 10.1016/j.neuropsychologia.2019.107264] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2019] [Revised: 09/30/2019] [Accepted: 11/11/2019] [Indexed: 01/31/2023]
|
13
|
Siuda-Krzywicka K, Boros M, Bartolomeo P, Witzel C. The biological bases of colour categorisation: From goldfish to the human brain. Cortex 2019; 118:82-106. [DOI: 10.1016/j.cortex.2019.04.010] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2018] [Revised: 12/03/2018] [Accepted: 04/12/2019] [Indexed: 01/29/2023]
|
14
|
Cognition in the field: comparison of reversal learning performance in captive and wild passerines. Sci Rep 2017; 7:12945. [PMID: 29021558 PMCID: PMC5636806 DOI: 10.1038/s41598-017-13179-5] [Citation(s) in RCA: 48] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2017] [Accepted: 09/19/2017] [Indexed: 11/08/2022] Open
Abstract
Animal cognitive abilities have traditionally been studied in the lab, but studying cognition in nature could provide several benefits including reduced stress and reduced impact on life-history traits. However, it is not yet clear to what extent cognitive abilities can be properly measured in the wild. Here we present the first comparison of the cognitive performance of individuals from the same population, assessed using an identical test, but in contrasting contexts: in the wild vs. in controlled captive conditions. We show that free-ranging great tits (Parus major) perform similarly to deprived, captive birds in a successive spatial reversal-learning task using automated operant devices. In both captive and natural conditions, more than half of birds that contacted the device were able to perform at least one spatial reversal. Moreover, both captive and wild birds showed an improvement of performance over successive reversals, with very similar learning curves observed in both contexts for each reversal. Our results suggest that it is possible to study cognitive abilities of wild animals directly in their natural environment in much the same way that we study captive animals. Such methods open numerous possibilities to study and understand the evolution and ecology of cognition in natural populations.
Collapse
|
15
|
Perceptual category learning of photographic and painterly stimuli in rhesus macaques (Macaca mulatta) and humans. PLoS One 2017; 12:e0185576. [PMID: 28961270 PMCID: PMC5621688 DOI: 10.1371/journal.pone.0185576] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2016] [Accepted: 09/17/2017] [Indexed: 11/19/2022] Open
Abstract
Humans are highly adept at categorizing visual stimuli, but studies of human categorization are typically validated by verbal reports. This makes it difficult to perform comparative studies of categorization using non-human animals. Interpretation of comparative studies is further complicated by the possibility that animal performance may merely reflect reinforcement learning, whereby discrete features act as discriminative cues for categorization. To assess and compare how humans and monkeys classified visual stimuli, we trained 7 rhesus macaques and 41 human volunteers to respond, in a specific order, to four simultaneously presented stimuli at a time, each belonging to a different perceptual category. These exemplars were drawn at random from large banks of images, such that the stimuli presented changed on every trial. Subjects nevertheless identified and ordered these changing stimuli correctly. Three monkeys learned to order naturalistic photographs; four others, close-up sections of paintings with distinctive styles. Humans learned to order both types of stimuli. All subjects classified stimuli at levels substantially greater than that predicted by chance or by feature-driven learning alone, even when stimuli changed on every trial. However, humans more closely resembled monkeys when classifying the more abstract painting stimuli than the photographic stimuli. This points to a common classification strategy in both species, one that humans can rely on in the absence of linguistic labels for categories.
Collapse
|
16
|
Juan C, Cappe C, Alric B, Roby B, Gilardeau S, Barone P, Girard P. The variability of multisensory processes of natural stimuli in human and non-human primates in a detection task. PLoS One 2017; 12:e0172480. [PMID: 28212416 PMCID: PMC5315309 DOI: 10.1371/journal.pone.0172480] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2016] [Accepted: 02/06/2017] [Indexed: 11/19/2022] Open
Abstract
Background Behavioral studies in both human and animals generally converge to the dogma that multisensory integration improves reaction times (RTs) in comparison to unimodal stimulation. These multisensory effects depend on diverse conditions among which the most studied were the spatial and temporal congruences. Further, most of the studies are using relatively simple stimuli while in everyday life, we are confronted to a large variety of complex stimulations constantly changing our attentional focus over time, a modality switch that can impact on stimuli detection. In the present study, we examined the potential sources of the variability in reaction times and multisensory gains with respect to the intrinsic features of a large set of natural stimuli. Methodology/Principle findings Rhesus macaque monkeys and human subjects performed a simple audio-visual stimulus detection task in which a large collection of unimodal and bimodal natural stimuli with semantic specificities was presented at different saliencies. Although we were able to reproduce the well-established redundant signal effect, we failed to reveal a systematic violation of the race model which is considered to demonstrate multisensory integration. In both monkeys and human species, our study revealed a large range of multisensory gains, with negative and positive values. While modality switch has clear effects on reaction times, one of the main causes of the variability of multisensory gains appeared to be linked to the intrinsic physical parameters of the stimuli. Conclusion/Significance Based on the variability of multisensory benefits, our results suggest that the neuronal mechanisms responsible of the redundant effect (interactions vs. integration) are highly dependent on the stimulus complexity suggesting different implications of uni- and multisensory brain regions. Further, in a simple detection task, the semantic values of individual stimuli tend to have no significant impact on task performances, an effect which is probably present in more cognitive tasks.
Collapse
Affiliation(s)
- Cécile Juan
- Cerco, CNRS UMR 5549, Toulouse, France
- Université de Toulouse, UPS, Centre de Recherche Cerveau et Cognition, Toulouse, France
| | - Céline Cappe
- Cerco, CNRS UMR 5549, Toulouse, France
- Université de Toulouse, UPS, Centre de Recherche Cerveau et Cognition, Toulouse, France
| | - Baptiste Alric
- Cerco, CNRS UMR 5549, Toulouse, France
- Université de Toulouse, UPS, Centre de Recherche Cerveau et Cognition, Toulouse, France
| | - Benoit Roby
- Cerco, CNRS UMR 5549, Toulouse, France
- Université de Toulouse, UPS, Centre de Recherche Cerveau et Cognition, Toulouse, France
| | - Sophie Gilardeau
- Cerco, CNRS UMR 5549, Toulouse, France
- Université de Toulouse, UPS, Centre de Recherche Cerveau et Cognition, Toulouse, France
| | - Pascal Barone
- Cerco, CNRS UMR 5549, Toulouse, France
- Université de Toulouse, UPS, Centre de Recherche Cerveau et Cognition, Toulouse, France
| | - Pascal Girard
- Cerco, CNRS UMR 5549, Toulouse, France
- Université de Toulouse, UPS, Centre de Recherche Cerveau et Cognition, Toulouse, France
- INSERM, Toulouse, France
- * E-mail:
| |
Collapse
|
17
|
Koopman SE, Mahon BZ, Cantlon JF. Evolutionary Constraints on Human Object Perception. Cogn Sci 2016; 41:2126-2148. [PMID: 28032661 DOI: 10.1111/cogs.12470] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2016] [Revised: 09/16/2016] [Accepted: 10/10/2016] [Indexed: 11/30/2022]
Abstract
Language and culture endow humans with access to conceptual information that far exceeds any which could be accessed by a non-human animal. Yet, it is possible that, even without language or specific experiences, non-human animals represent and infer some aspects of similarity relations between objects in the same way as humans. Here, we show that monkeys' discrimination sensitivity when identifying images of animals is predicted by established measures of semantic similarity derived from human conceptual judgments. We used metrics from computer vision and computational neuroscience to show that monkeys' and humans' performance cannot be explained by low-level visual similarity alone. The results demonstrate that at least some of the underlying structure of object representations in humans is shared with non-human primates, at an abstract level that extends beyond low-level visual similarity. Because the monkeys had no experience with the objects we tested, the results suggest that monkeys and humans share a primitive representation of object similarity that is independent of formal knowledge and cultural experience, and likely derived from common evolutionary constraints on object representation.
Collapse
Affiliation(s)
- Sarah E Koopman
- Department of Brain & Cognitive Sciences, University of Rochester
| | - Bradford Z Mahon
- Department of Brain & Cognitive Sciences, University of Rochester.,Department of Neurosurgery, University of Rochester Medical School.,Center for Visual Science, University of Rochester.,Center for Language Science, University of Rochester
| | | |
Collapse
|
18
|
Vanmarcke S, Calders F, Wagemans J. The Time-Course of Ultrarapid Categorization: The Influence of Scene Congruency and Top-Down Processing. Iperception 2016; 7:2041669516673384. [PMID: 27803794 PMCID: PMC5076752 DOI: 10.1177/2041669516673384] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
Although categorization can take place at different levels of abstraction, classic studies on semantic labeling identified the basic level, for example, dog, as entry point for categorization. Ultrarapid categorization tasks have contradicted these findings, indicating that participants are faster at detecting superordinate-level information, for example, animal, in a complex visual image. We argue that both seemingly contradictive findings can be reconciled within the framework of parallel distributed processing and its successor Leabra (Local, Error-driven and Associative, Biologically Realistic Algorithm). The current study aimed at verifying this prediction in an ultrarapid categorization task with a dynamically changing presentation time (PT) for each briefly presented object, followed by a perceptual mask. Furthermore, we manipulated two defining task variables: level of categorization (basic vs. superordinate categorization) and object presentation mode (object-in-isolation vs. object-in-context). In contradiction with previous ultrarapid categorization research, focusing on reaction time, we used accuracy as our main dependent variable. Results indicated a consistent superordinate processing advantage, coinciding with an overall improvement in performance with longer PT and a significantly more accurate detection of objects in isolation, compared with objects in context, at lower stimulus PT. This contextual disadvantage disappeared when PT increased, indicating that figure-ground separation with recurrent processing is vital for meaningful contextual processing to occur.
Collapse
|
19
|
The upper-hemifield advantage for masked face processing: Not just an attentional bias. Atten Percept Psychophys 2015; 78:52-68. [DOI: 10.3758/s13414-015-0965-7] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
20
|
Cauchoix M, Crouzet SM, Fize D, Serre T. Fast ventral stream neural activity enables rapid visual categorization. Neuroimage 2015; 125:280-290. [PMID: 26477655 DOI: 10.1016/j.neuroimage.2015.10.012] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2015] [Revised: 10/01/2015] [Accepted: 10/07/2015] [Indexed: 11/26/2022] Open
Abstract
Primates can recognize objects embedded in complex natural scenes in a glimpse. Rapid categorization paradigms have been extensively used to study our core perceptual abilities when the visual system is forced to operate under strong time constraints. However, the neural underpinning of rapid categorization remains to be understood, and the incredible speed of sight has yet to be reconciled with modern ventral stream cortical theories of object recognition. Here we recorded multichannel subdural electrocorticogram (ECoG) signals from intermediate areas (V4/PIT) of the ventral stream of the visual cortex while monkeys were actively engaged in a rapid animal/non-animal categorization task. A traditional event-related potential (ERP) analysis revealed short visual latencies (<50-70ms) followed by a rapidly developing visual selectivity (within ~20-30ms) for most electrodes. A multi-variate pattern analysis (MVPA) technique further confirmed that reliable animal/non-animal category information was possible from this initial ventral stream neural activity (within ~90-100ms). Furthermore, this early category-selective neural activity was (a) unaffected by the presentation of a backward (pattern) mask, (b) generalized to novel (unfamiliar) stimuli and (c) co-varied with behavioral responses (both accuracy and reaction times). Despite the strong prevalence of task-related information on the neural signal, task-irrelevant visual information could still be decoded independently of monkey behavior. Monkey behavioral responses were also found to correlate significantly with human behavioral responses for the same set of stimuli. Together, the present study establishes that rapid ventral stream neural activity induces a visually selective signal subsequently used to drive rapid visual categorization and that this visual strategy may be shared between human and non-human primates.
Collapse
Affiliation(s)
- Maxime Cauchoix
- Institute for Advanced Studies in Toulouse, France; Centre de Recherche Cerveau et Cognition, Université Paul Sabatier, Université de Toulouse, Toulouse, France; Faculté de Médecine de Purpan, CNRS, UMR 5549, Toulouse, France.
| | - Sébastien M Crouzet
- Centre de Recherche Cerveau et Cognition, Université Paul Sabatier, Université de Toulouse, Toulouse, France; Faculté de Médecine de Purpan, CNRS, UMR 5549, Toulouse, France; Cognitive, Linguistic and Psychological Sciences Department, Institute for Brain Sciences, Brown University, USA
| | - Denis Fize
- Centre de Recherche Cerveau et Cognition, Université Paul Sabatier, Université de Toulouse, Toulouse, France; Faculté de Médecine de Purpan, CNRS, UMR 5549, Toulouse, France
| | - Thomas Serre
- Cognitive, Linguistic and Psychological Sciences Department, Institute for Brain Sciences, Brown University, USA
| |
Collapse
|
21
|
Collet AC, Fize D, VanRullen R. Contextual Congruency Effect in Natural Scene Categorization: Different Strategies in Humans and Monkeys (Macaca mulatta). PLoS One 2015. [PMID: 26207915 PMCID: PMC4514779 DOI: 10.1371/journal.pone.0133721] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open
Abstract
Rapid visual categorization is a crucial ability for survival of many animal species, including monkeys and humans. In real conditions, objects (either animate or inanimate) are never isolated but embedded in a complex background made of multiple elements. It has been shown in humans and monkeys that the contextual background can either enhance or impair object categorization, depending on context/object congruency (for example, an animal in a natural vs. man-made environment). Moreover, a scene is not only a collection of objects; it also has global physical features (i.e phase and amplitude of Fourier spatial frequencies) which help define its gist. In our experiment, we aimed to explore and compare the contribution of the amplitude spectrum of scenes in the context-object congruency effect in monkeys and humans. We designed a rapid visual categorization task, Animal versus Non-Animal, using as contexts both real scenes photographs and noisy backgrounds built from the amplitude spectrum of real scenes but with randomized phase spectrum. We showed that even if the contextual congruency effect was comparable in both species when the context was a real scene, it differed when the foreground object was surrounded by a noisy background: in monkeys we found a similar congruency effect in both conditions, but in humans the congruency effect was absent (or even reversed) when the context was a noisy background.
Collapse
Affiliation(s)
- Anne-Claire Collet
- Université de Toulouse, UPS, Centre de Recherche Cerveau et Cognition, Toulouse, France
- CNRS, CerCo, Toulouse, France
- * E-mail:
| | - Denis Fize
- Université de Toulouse, UPS, Centre de Recherche Cerveau et Cognition, Toulouse, France
- CNRS, CerCo, Toulouse, France
| | - Rufin VanRullen
- Université de Toulouse, UPS, Centre de Recherche Cerveau et Cognition, Toulouse, France
- CNRS, CerCo, Toulouse, France
| |
Collapse
|
22
|
The interplay of holistic shape, local feature and color information in object categorization. Biol Psychol 2015; 109:120-31. [PMID: 25981947 DOI: 10.1016/j.biopsycho.2015.05.002] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2014] [Revised: 04/09/2015] [Accepted: 05/06/2015] [Indexed: 11/20/2022]
Abstract
Although it is widely accepted that colors facilitate object and scene recognition under various circumstances, several studies found no effects of color removal in tasks requiring categorization of briefly presented animals in natural scenes. In this study, three experiments were performed to test the assumption that the discrepancy between empirical data is related to variations of the available meaningful global information such as object shapes and contextual cues. Sixty-one individuals categorized chromatic and achromatic versions of intact and scrambled images containing either cars or birds. While color removal did not affect the classification of intact stimuli, the recognition of moderately scrambled achromatic images was more difficult. This effect was accompanied by amplitude modulations of occipital event-related potentials emerging from approximately 150ms post-stimulus. Our results indicate that colors facilitate stimulus classification, but this effect becomes prominent only in cases when holistic processing is not sufficient for stimulus recognition.
Collapse
|
23
|
Phillips WA, Clark A, Silverstein SM. On the functions, mechanisms, and malfunctions of intracortical contextual modulation. Neurosci Biobehav Rev 2015; 52:1-20. [PMID: 25721105 DOI: 10.1016/j.neubiorev.2015.02.010] [Citation(s) in RCA: 43] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2014] [Revised: 02/02/2015] [Accepted: 02/15/2015] [Indexed: 10/23/2022]
Abstract
A broad neuron-centric conception of contextual modulation is reviewed and re-assessed in the light of recent neurobiological studies of amplification, suppression, and synchronization. Behavioural and computational studies of perceptual and higher cognitive functions that depend on these processes are outlined, and evidence that those functions and their neuronal mechanisms are impaired in schizophrenia is summarized. Finally, we compare and assess the long-term biological functions of contextual modulation at the level of computational theory as formalized by the theories of coherent infomax and free energy reduction. We conclude that those theories, together with the many empirical findings reviewed, show how contextual modulation at the neuronal level enables the cortex to flexibly adapt the use of its knowledge to current circumstances by amplifying and grouping relevant activities and by suppressing irrelevant activities.
Collapse
Affiliation(s)
- W A Phillips
- Department of Psychology, University of Stirling, FK9 4LA, Scotland, UK
| | - A Clark
- School of Philosophy, Psychology, and Language Sciences, University of Edinburgh, EH12 5AY, Scotland, UK
| | - S M Silverstein
- Rutgers Biomedical and Health Sciences, Piscataway, NJ, USA.
| |
Collapse
|
24
|
Abstract
Visual categorization of complex, natural stimuli has been studied for some time in human and nonhuman primates. Recent interest in the rodent as a model for visual perception, including higher-level functional specialization, leads to the question of how rodents would perform on a categorization task using natural stimuli. To answer this question, rats were trained in a two-alternative forced choice task to discriminate movies containing rats from movies containing other objects and from scrambled movies (ordinate-level categorization). Subsequently, transfer to novel, previously unseen stimuli was tested, followed by a series of control probes. The results show that the animals are capable of acquiring a decision rule by abstracting common features from natural movies to generalize categorization to new stimuli. Control probes demonstrate that they did not use single low-level features, such as motion energy or (local) luminance. Significant generalization was even present with stationary snapshots from untrained movies. The variability within and between training and test stimuli, the complexity of natural movies, and the control experiments and analyses all suggest that a more high-level rule based on more complex stimulus features than local luminance-based cues was used to classify the novel stimuli. In conclusion, natural stimuli can be used to probe ordinate-level categorization in rats.
Collapse
|
25
|
Social and nonsocial category discriminations in a chimpanzee (Pan troglodytes) and American black bears (Ursus americanus). Learn Behav 2014; 42:231-45. [DOI: 10.3758/s13420-014-0141-2] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
26
|
Poncet M, Fabre-Thorpe M. Stimulus duration and diversity do not reverse the advantage for superordinate-level representations: the animal is seen before the bird. Eur J Neurosci 2014; 39:1508-16. [DOI: 10.1111/ejn.12513] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2013] [Revised: 12/30/2013] [Accepted: 01/13/2014] [Indexed: 11/26/2022]
Affiliation(s)
- Marlène Poncet
- Centre de Recherche Cerveau et Cognition; UPS; Université de Toulouse; Toulouse France
- CNRS CERCO UMR 5549; Pavillon Baudot CHU Purpan; BP 25202 31052 Toulouse Cedex France
| | - Michèle Fabre-Thorpe
- Centre de Recherche Cerveau et Cognition; UPS; Université de Toulouse; Toulouse France
- CNRS CERCO UMR 5549; Pavillon Baudot CHU Purpan; BP 25202 31052 Toulouse Cedex France
| |
Collapse
|
27
|
Natural category discrimination in chimpanzees (Pan troglodytes) at three levels of abstraction. Learn Behav 2014; 41:271-84. [PMID: 23397185 DOI: 10.3758/s13420-013-0103-0] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Two adult chimpanzees were presented with a series of natural category discrimination tasks on a touch screen computer, in which the discriminations varied in degree of abstraction. At the concrete level, discriminations could be made on the basis of single perceptual features, but at the more abstract level, categories were more inclusive, containing exemplars with variant perceptual features. For instance, at the most abstract level, the chimpanzees were required to select images of animals rather than nonanimals, and exemplars within both categories were perceptually diverse. One chimpanzee showed positive transfer at each level of abstraction but required more sessions to reach criterion as the discriminations became more abstract. The other chimpanzee failed to demonstrate consistent significant acquisition of a concept. The results indicate that unlike other apes and black bears, tested previously, chimpanzees found the most abstract discriminations the most difficult to acquire. Analyses of the features of pictures that yielded high or low accuracy revealed no significant differences on several key features, suggesting that the presence of facial features, eyes, or specific coloration did not control responding. In addition, the chimpanzees performed more accurately with photos judged as less typical exemplars of the category by human raters. However, responses to pictures of particular species suggest that chimpanzees may rely on perceptual similarity to familiar exemplars when acquiring experimenter-defined natural categories.
Collapse
|
28
|
Rémy F, Vayssière N, Pins D, Boucart M, Fabre-Thorpe M. Incongruent object/context relationships in visual scenes: where are they processed in the brain? Brain Cogn 2013; 84:34-43. [PMID: 24280445 DOI: 10.1016/j.bandc.2013.10.008] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2013] [Revised: 10/25/2013] [Accepted: 10/28/2013] [Indexed: 11/18/2022]
Abstract
Rapid object visual categorization in briefly flashed natural scenes is influenced by the surrounding context. The neural correlates underlying reduced categorization performance in response to incongruent object/context associations remain unclear and were investigated in the present study using fMRI. Participants were instructed to categorize objects in briefly presented scenes (exposure duration=100ms). Half of the scenes consisted of objects pasted in an expected (congruent) context, whereas for the other half, objects were embedded in incongruent contexts. Object categorization was more accurate and faster in congruent relative to incongruent scenes. Moreover, we found that the two types of scenes elicited different patterns of cerebral activation. In particular, the processing of incongruent scenes induced increased activations in the parahippocampal cortex, as well as in the right frontal cortex. This higher activity may indicate additional neural processing of the novel (non experienced) contextual associations that were inherent to the incongruent scenes. Moreover, our results suggest that the locus of object categorization impairment due to contextual incongruence is in the right anterior parahippocampal cortex. Indeed in this region activity was correlated with the reaction time increase observed with incongruent scenes. Representations for associations between objects and their usual context of appearance might be encoded in the right anterior parahippocampal cortex.
Collapse
Affiliation(s)
- Florence Rémy
- Université de Toulouse, UPS, Centre de Recherche Cerveau et Cognition, France; CNRS, CerCo, Toulouse, France.
| | - Nathalie Vayssière
- Université de Toulouse, UPS, Centre de Recherche Cerveau et Cognition, France; CNRS, CerCo, Toulouse, France
| | - Delphine Pins
- Université Lille Nord de France, UDSL, Laboratoire Neurosciences Fonctionnelles et Pathologies, CHU Lille, F-59000 Lille, France; CNRS, F-59000 Lille, France
| | - Muriel Boucart
- Université Lille Nord de France, UDSL, Laboratoire Neurosciences Fonctionnelles et Pathologies, CHU Lille, F-59000 Lille, France; CNRS, F-59000 Lille, France
| | - Michèle Fabre-Thorpe
- Université de Toulouse, UPS, Centre de Recherche Cerveau et Cognition, France; CNRS, CerCo, Toulouse, France
| |
Collapse
|
29
|
Munneke J, Brentari V, Peelen MV. The influence of scene context on object recognition is independent of attentional focus. Front Psychol 2013; 4:552. [PMID: 23970878 PMCID: PMC3748376 DOI: 10.3389/fpsyg.2013.00552] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2013] [Accepted: 08/04/2013] [Indexed: 11/13/2022] Open
Abstract
Humans can quickly and accurately recognize objects within briefly presented natural scenes. Previous work has provided evidence that scene context contributes to this process, demonstrating improved naming of objects that were presented in semantically consistent scenes (e.g., a sandcastle on a beach) relative to semantically inconsistent scenes (e.g., a sandcastle on a football field). The current study was aimed at investigating which processes underlie the scene consistency effect. Specifically, we tested: (1) whether the effect is due to increased visual feature and/or shape overlap for consistent relative to inconsistent scene-object pairs; and (2) whether the effect is mediated by attention to the background scene. Experiment 1 replicated the scene consistency effect of a previous report (Davenport and Potter, 2004). Using a new, carefully controlled stimulus set, Experiment 2 showed that the scene consistency effect could not be explained by low-level feature or shape overlap between scenes and target objects. Experiments 3a and 3b investigated whether focused attention modulates the scene consistency effect. By using a location cueing manipulation, participants were correctly informed about the location of the target object on a proportion of trials, allowing focused attention to be deployed toward the target object. Importantly, the effect of scene consistency on target object recognition was independent of spatial attention, and was observed both when attention was focused on the target object and when attention was focused on the background scene. These results indicate that a semantically consistent scene context benefits object recognition independently of the focus of attention. We suggest that the scene consistency effect is primarily driven by global scene properties, or "scene gist", that can be processed with minimal attentional resources.
Collapse
Affiliation(s)
- Jaap Munneke
- Center for Mind/Brain Sciences, University of Trento Trento, Italy ; Department of Cognitive Psychology, Vrije Universiteit Amsterdam, Netherlands
| | | | | |
Collapse
|
30
|
Object recognition in congruent and incongruent natural scenes: a life-span study. Vision Res 2013; 91:36-44. [PMID: 23891714 DOI: 10.1016/j.visres.2013.07.006] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2012] [Revised: 04/15/2013] [Accepted: 07/10/2013] [Indexed: 11/21/2022]
Abstract
Efficient processing of our complex visual environment is essential and many daily visual tasks rely on accurate and fast object recognition. It is therefore important to evaluate how object recognition performance evolves during the course of adulthood. Surprisingly, this ability has not yet been investigated in the aged population, although several neuroimaging studies have reported altered activity in high-level visual ventral regions when elderly subjects process natural stimuli. In the present study, color photographs of various objects embedded in contextual scenes were used to assess object categorization performance in 97 participants aged from 20 to 91. Objects were either animals or pieces of furniture, embedded in either congruent or incongruent contexts. In every age group, subjects showed reduced categorization performance, both in terms of accuracy and speed, when objects were seen in incongruent vs. congruent contexts. In subjects over 60 years old, object categorization was greatly slowed down when compared to young and middle-aged subjects. Moreover, subjects over 75 years old evidenced a significant decrease in categorization accuracy when objects were seen in incongruent contexts. This indicates that incongruence of the scene may be particularly disturbing in late adulthood, therefore impairing object recognition. Our results suggest that daily visual processing of complex natural environments may be less efficient with age, which might impact performance in everyday visual tasks.
Collapse
|
31
|
Willemet R. Reconsidering the evolution of brain, cognition, and behavior in birds and mammals. Front Psychol 2013; 4:396. [PMID: 23847570 PMCID: PMC3696912 DOI: 10.3389/fpsyg.2013.00396] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2013] [Accepted: 06/12/2013] [Indexed: 01/23/2023] Open
Abstract
Despite decades of research, some of the most basic issues concerning the extraordinarily complex brains and behavior of birds and mammals, such as the factors responsible for the diversity of brain size and composition, are still unclear. This is partly due to a number of conceptual and methodological issues. Determining species and group differences in brain composition requires accounting for the presence of taxon-cerebrotypes and the use of precise statistical methods. The role of allometry in determining brain variables should be revised. In particular, bird and mammalian brains appear to have evolved in response to a variety of selective pressures influencing both brain size and composition. “Brain” and “cognition” are indeed meta-variables, made up of the variables that are ecologically relevant and evolutionarily selected. External indicators of species differences in cognition and behavior are limited by the complexity of these differences. Indeed, behavioral differences between species and individuals are caused by cognitive and affective components. Although intra-species variability forms the basis of species evolution, some of the mechanisms underlying individual differences in brain and behavior appear to differ from those between species. While many issues have persisted over the years because of a lack of appropriate data or methods to test them; several fallacies, particularly those related to the human brain, reflect scientists' preconceptions. The theoretical framework on the evolution of brain, cognition, and behavior in birds and mammals should be reconsidered with these biases in mind.
Collapse
|
32
|
Crouzet SM, Joubert OR, Thorpe SJ, Fabre-Thorpe M. Animal detection precedes access to scene category. PLoS One 2012; 7:e51471. [PMID: 23251545 PMCID: PMC3518465 DOI: 10.1371/journal.pone.0051471] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2012] [Accepted: 11/02/2012] [Indexed: 11/30/2022] Open
Abstract
The processes underlying object recognition are fundamental for the understanding of visual perception. Humans can recognize many objects rapidly even in complex scenes, a task that still presents major challenges for computer vision systems. A common experimental demonstration of this ability is the rapid animal detection protocol, where human participants earliest responses to report the presence/absence of animals in natural scenes are observed at 250–270 ms latencies. One of the hypotheses to account for such speed is that people would not actually recognize an animal per se, but rather base their decision on global scene statistics. These global statistics (also referred to as spatial envelope or gist) have been shown to be computationally easy to process and could thus be used as a proxy for coarse object recognition. Here, using a saccadic choice task, which allows us to investigate a previously inaccessible temporal window of visual processing, we showed that animal – but not vehicle – detection clearly precedes scene categorization. This asynchrony is in addition validated by a late contextual modulation of animal detection, starting simultaneously with the availability of scene category. Interestingly, the advantage for animal over scene categorization is in opposition to the results of simulations using standard computational models. Taken together, these results challenge the idea that rapid animal detection might be based on early access of global scene statistics, and rather suggests a process based on the extraction of specific local complex features that might be hardwired in the visual system.
Collapse
Affiliation(s)
- Sébastien M. Crouzet
- Université de Toulouse, UPS, CerCo, Toulouse, France
- CNRS, UMR 5549, Toulouse, France
- Cognitive, Linguistic and Psychological Science, Brown University, Providence, Rhode Island, United States of America
| | - Olivier R. Joubert
- Université de Toulouse, UPS, CerCo, Toulouse, France
- CNRS, UMR 5549, Toulouse, France
| | - Simon J. Thorpe
- Université de Toulouse, UPS, CerCo, Toulouse, France
- CNRS, UMR 5549, Toulouse, France
| | - Michèle Fabre-Thorpe
- Université de Toulouse, UPS, CerCo, Toulouse, France
- CNRS, UMR 5549, Toulouse, France
- * E-mail:
| |
Collapse
|
33
|
|
34
|
Red SD, Patel SS, Sereno AB. Shape effects on reflexive spatial attention are driven by the dorsal stream. Vision Res 2012; 55:32-40. [PMID: 22239962 DOI: 10.1016/j.visres.2011.12.007] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2011] [Revised: 12/09/2011] [Accepted: 12/22/2011] [Indexed: 11/25/2022]
Abstract
In a modified reflexive spatial attention paradigm, when the cue and the target are at the same spatial location, processing of the target is faster when the cue and the target have different shapes compared to same (shape effect). Recent physiological findings suggest distinct population level encoding of shape in ventral versus dorsal cortical visual streams in monkeys. In human observers, we tested whether the effect of shape on reflexive spatial attention could be attributed to ventral and/or dorsal stream encoding of shape. In the modified reflexive spatial attention paradigm, we varied the shapes of the cue and target. Based on data from monkey physiology (Lehky & Sereno, 2007), we selected four pairs of cue and target shapes. In some pairs, cue and target were similarly encoded (similar encoding distance) by a population of cells in the lateral intraparietal cortex, a dorsal stream area, but more dissimilarly encoded (having a greater encoding distance) by a population of cells in the anterior inferotemporal cortex (AIT), a ventral stream area. In other pairs, cue and target were similarly encoded in AIT and had greater dissimilarity in LIP encoding. We found that pairs of cue and target with greater dissimilarity in LIP encoding produced larger and more consistent shape effects up to a cue to target onset asynchrony (CTOA) of 450 ms. The shape effects for cue and target pairs with greater dissimilarity in AIT encoding were smaller and inconsistent, suggesting that shape effects in reflexive spatial attention are largely driven by the dorsal stream.
Collapse
Affiliation(s)
- Stuart D Red
- Department of Neurobiology and Anatomy, University of Texas Medical School at Houston, Houston, TX 77030, USA.
| | | | | |
Collapse
|
35
|
The Neural Dynamics of Visual Processing in Monkey Extrastriate Cortex: A Comparison between Univariate and Multivariate Techniques. ACTA ACUST UNITED AC 2012. [DOI: 10.1007/978-3-642-34713-9_21] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/25/2023]
|
36
|
Fabre-Thorpe M. The characteristics and limits of rapid visual categorization. Front Psychol 2011; 2:243. [PMID: 22007180 PMCID: PMC3184650 DOI: 10.3389/fpsyg.2011.00243] [Citation(s) in RCA: 109] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2011] [Accepted: 09/05/2011] [Indexed: 11/25/2022] Open
Abstract
Visual categorization appears both effortless and virtually instantaneous. The study by Thorpe et al. (1996) was the first to estimate the processing time necessary to perform fast visual categorization of animals in briefly flashed (20 ms) natural photographs. They observed a large differential EEG activity between target and distracter correct trials that developed from 150 ms after stimulus onset, a value that was later shown to be even shorter in monkeys! With such strong processing time constraints, it was difficult to escape the conclusion that rapid visual categorization was relying on massively parallel, essentially feed-forward processing of visual information. Since 1996, we have conducted a large number of studies to determine the characteristics and limits of fast visual categorization. The present chapter will review some of the main results obtained. I will argue that rapid object categorizations in natural scenes can be done without focused attention and are most likely based on coarse and unconscious visual representations activated with the first available (magnocellular) visual information. Fast visual processing proved efficient for the categorization of large superordinate object or scene categories, but shows its limits when more detailed basic representations are required. The representations for basic objects (dogs, cars) or scenes (mountain or sea landscapes) need additional processing time to be activated. This finding is at odds with the widely accepted idea that such basic representations are at the entry level of the system. Interestingly, focused attention is still not required to perform these time consuming basic categorizations. Finally we will show that object and context processing can interact very early in an ascending wave of visual information processing. We will discuss how such data could result from our experience with a highly structured and predictable surrounding world that shaped neuronal visual selectivity.
Collapse
Affiliation(s)
- Michèle Fabre-Thorpe
- CNRS, Université Paul Sabatier, Centre de Recherche Cerveau et CognitionToulouse, France
| |
Collapse
|