1
|
Stereopsis provides a constant feed to visual shape representation. Vision Res 2023; 204:108175. [PMID: 36571983 DOI: 10.1016/j.visres.2022.108175] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2022] [Revised: 11/10/2022] [Accepted: 12/05/2022] [Indexed: 12/25/2022]
Abstract
The contribution of stereopsis in human visual shape perception was examined using stimuli with either null, normal, or reversed binocular disparity in an old/new object recognition task. The highest levels of recognition performance were observed with null and normal binocular disparity displays, which did not differ. However, reversed disparity led to significantly worse performance than either of the other display conditions. This indicates that stereopsis provides a continuous input to the mechanisms involved in shape perception.
Collapse
|
2
|
Arguin M, Marleau I, Aubin M, Zahabi S, Leek EC. A surface-based code contributes to visual shape perception. J Vis 2019; 19:6. [PMID: 31509602 DOI: 10.1167/19.11.6] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open
Abstract
Considerable uncertainty remains regarding the types of features human vision uses for shape representation. Visual-search experiments are reported which assessed the hypothesis of a surface-based (i.e., edge-bounded polygons) code for shape representation in human vision. The results indicate slower search rates and/or longer response times when the target shape shares its constituent surfaces with distractors (conjunction condition) than when the target surfaces are unique in the display (nonconjunction condition). This demonstration is made using test conditions that strictly control any potential artifact pertaining to target-distractor similarity. The surface-based code suggested by this surface-conjunction effect is strictly 2-D, since the effect occurs even when the surfaces are shared between the target and distractors in the 2-D image but not in their 3-D instantiation. Congruently, this latter finding is unaltered by manipulations of the richness of the depth information offered by the stimuli. It is proposed that human vision uses a 2-D surface-based code for shape representation which, considering other key findings in the field, probably coexists with an alternative representation mode based on a type of structural description that can integrate information pertaining to the 3-D aspect of shapes.
Collapse
Affiliation(s)
- Martin Arguin
- Centre de Recherche en Neuropsychologie Expérimentale et Cognition, Département de psychologie, Université de Montréal, Montréal, Canada
| | - Ian Marleau
- CISSS de la Montérégie-Ouest, Installation Longueuil, Longueuil, Canada
| | - Mercédès Aubin
- CÉGEP de Jonquière, Département des Sciences humaines, Jonquière, Canada
| | - Sacha Zahabi
- Centre de Recherche en Neuropsychologie Expérimentale et Cognition, Département de psychologie, Université de Montréal, Montréal, Canada
| | - E Charles Leek
- School of Psychology, Institute of Life and Human Sciences, University of Liverpool, Liverpool, UK
| |
Collapse
|
3
|
Oliver ZJ, Cristino F, Roberts MV, Pegna AJ, Leek EC. Stereo viewing modulates three-dimensional shape processing during object recognition: A high-density ERP study. J Exp Psychol Hum Percept Perform 2018; 44:518-534. [PMID: 29022728 PMCID: PMC5896504 DOI: 10.1037/xhp0000444] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2016] [Revised: 03/29/2017] [Accepted: 04/10/2017] [Indexed: 11/17/2022]
Abstract
The role of stereo disparity in the recognition of 3-dimensional (3D) object shape remains an unresolved issue for theoretical models of the human visual system. We examined this issue using high-density (128 channel) recordings of event-related potentials (ERPs). A recognition memory task was used in which observers were trained to recognize a subset of complex, multipart, 3D novel objects under conditions of either (bi-) monocular or stereo viewing. In a subsequent test phase they discriminated previously trained targets from untrained distractor objects that shared either local parts, 3D spatial configuration, or neither dimension, across both previously seen and novel viewpoints. The behavioral data showed a stereo advantage for target recognition at untrained viewpoints. ERPs showed early differential amplitude modulations to shape similarity defined by local part structure and global 3D spatial configuration. This occurred initially during an N1 component around 145-190 ms poststimulus onset, and then subsequently during an N2/P3 component around 260-385 ms poststimulus onset. For mono viewing, amplitude modulation during the N1 was greatest between targets and distracters with different local parts for trained views only. For stereo viewing, amplitude modulation during the N2/P3 was greatest between targets and distracters with different global 3D spatial configurations and generalized across trained and untrained views. The results show that image classification is modulated by stereo information about the local part, and global 3D spatial configuration of object shape. The findings challenge current theoretical models that do not attribute functional significance to stereo input during the computation of 3D object shape. (PsycINFO Database Record
Collapse
|
4
|
Pegna AJ, Darque A, Roberts MV, Leek EC. Effects of stereoscopic disparity on early ERP components during classification of three-dimensional objects. Q J Exp Psychol (Hove) 2018; 71:1419-1430. [PMID: 28524772 DOI: 10.1080/17470218.2017.1333129] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
Abstract
This study investigates the effects of stereo disparity on the perception of three-dimensional (3D) object shape. We tested the hypothesis that stereo input modulates the brain activity related to perceptual analyses of 3D shape configuration during image classification. High-density (256-channel) electroencephalogram (EEG) was used to record the temporal dynamics of visual shape processing under conditions of two-dimensional (2D) and 3D visual presentation. On each trial, observers made image classification judgements ('Same'/'Different') to two briefly presented, multi-part, novel objects. On different-object trials, stimuli could either share volumetric parts but not the global 3D shape configuration and have different parts but the same global 3D shape configuration or differ on both aspects. Analyses using mass univariate contrasts showed that the earliest sensitivity to 2D versus 3D viewing appeared as a negative deflection over posterior locations on the N1 component between 160 and 220 ms post-stimulus onset. Subsequently, event-related potential (ERP) modulations during the N2 time window between 240 and 370 ms were linked to image classification. N2 activity reflected two distinct components - an early N2 (240-290 ms) and a late N2 (290-370 ms) - that showed different patterns of responses to 2D and 3D input and differential sensitivity to 3D object structure. The results revealed that stereo input modulates the neural correlates of 3D object shape. We suggest that this reflects differential perceptual processing of object shape under conditions of stereo or mono input. These findings challenge current theories that attribute no functional role for stereo input during 3D shape perception.
Collapse
Affiliation(s)
- Alan J Pegna
- 1 Faculty of Psychology and Educational Sciences, University of Geneva, Geneva, Switzerland.,2 School of Psychology, University of Queensland, St Lucia, QLD, Australia
| | - Alexandra Darque
- 1 Faculty of Psychology and Educational Sciences, University of Geneva, Geneva, Switzerland
| | - Mark V Roberts
- 3 Wolfson Centre for Clinical and Cognitive Neuroscience, School of Psychology, Bangor University, Bangor, UK
| | - E Charles Leek
- 3 Wolfson Centre for Clinical and Cognitive Neuroscience, School of Psychology, Bangor University, Bangor, UK.,4 Laboratoire de Psychologie et NeuroCognition (LPNC), Université Grenoble Alpes, Saint-Martin-d'Hères, France
| |
Collapse
|
5
|
Tian M, Yamins D, Grill-Spector K. Learning the 3-D structure of objects from 2-D views depends on shape, not format. J Vis 2016; 16:7. [PMID: 27153196 PMCID: PMC4898268 DOI: 10.1167/16.7.7] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2015] [Indexed: 11/24/2022] Open
Abstract
Humans can learn to recognize new objects just from observing example views. However, it is unknown what structural information enables this learning. To address this question, we manipulated the amount of structural information given to subjects during unsupervised learning by varying the format of the trained views. We then tested how format affected participants' ability to discriminate similar objects across views that were rotated 90° apart. We found that, after training, participants' performance increased and generalized to new views in the same format. Surprisingly, the improvement was similar across line drawings, shape from shading, and shape from shading + stereo even though the latter two formats provide richer depth information compared to line drawings. In contrast, participants' improvement was significantly lower when training used silhouettes, suggesting that silhouettes do not have enough information to generate a robust 3-D structure. To test whether the learned object representations were format-specific or format-invariant, we examined if learning novel objects from example views transfers across formats. We found that learning objects from example line drawings transferred to shape from shading and vice versa. These results have important implications for theories of object recognition because they suggest that (a) learning the 3-D structure of objects does not require rich structural cues during training as long as shape information of internal and external features is provided and (b) learning generates shape-based object representations independent of the training format.
Collapse
|
6
|
Cristino F, Davitt L, Hayward WG, Leek EC. Stereo disparity facilitates view generalization during shape recognition for solid multipart objects. Q J Exp Psychol (Hove) 2015; 68:2419-36. [PMID: 25679983 DOI: 10.1080/17470218.2015.1017512] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
Abstract
Current theories of object recognition in human vision make different predictions about whether the recognition of complex, multipart objects should be influenced by shape information about surface depth orientation and curvature derived from stereo disparity. We examined this issue in five experiments using a recognition memory paradigm in which observers (N = 134) memorized and then discriminated sets of 3D novel objects at trained and untrained viewpoints under either mono or stereo viewing conditions. In order to explore the conditions under which stereo-defined shape information contributes to object recognition we systematically varied the difficulty of view generalization by increasing the angular disparity between trained and untrained views. In one series of experiments, objects were presented from either previously trained views or untrained views rotated (15°, 30°, or 60°) along the same plane. In separate experiments we examined whether view generalization effects interacted with the vertical or horizontal plane of object rotation across 40° viewpoint changes. The results showed robust viewpoint-dependent performance costs: Observers were more efficient in recognizing learned objects from trained than from untrained views, and recognition was worse for extrapolated than for interpolated untrained views. We also found that performance was enhanced by stereo viewing but only at larger angular disparities between trained and untrained views. These findings show that object recognition is not based solely on 2D image information but that it can be facilitated by shape information derived from stereo disparity.
Collapse
Affiliation(s)
| | - Lina Davitt
- a School of Psychology , Bangor University , Bangor , UK
| | - William G Hayward
- b School of Psychology , University of Auckland , Auckland , New Zealand
| | - E Charles Leek
- c Wolfson Centre for Clinical and Cognitive Neuroscience, School of Psychology , Bangor University , Bangor , UK
| |
Collapse
|
7
|
Zhang H, Friedman A, Mou W, Waller D. View combination in recognition of 3-D virtual reality layouts. Psych J 2012; 1:82-9. [PMID: 26272759 DOI: 10.1002/pchj.12] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2011] [Accepted: 06/08/2012] [Indexed: 11/07/2022]
Abstract
We investigated whether a normalization model or view combination model fit the performance of scene recognition of 3-D layouts using a virtual-reality paradigm. Participants learned a layout of seven objects from two training views (e.g., 0° and 48°) by discriminating the "correct" layout from distracters. Later, they performed a discrimination task using the training views (e.g., 0° and 48°), an interpolated view (e.g., 24°), an extrapolated view (e.g., 72°), and a far view (e.g., 96°). The results showed that the interpolated view was easier to discriminate than the extrapolated view and even easier than the training views. These results extend the applicability of view combination accounts of recognition to 3-D stimuli with stereoscopic depth information.
Collapse
Affiliation(s)
- Hui Zhang
- Institute of Psychology, Chinese Academy of Sciences, Beijing, China.,University of California, Davis, CA, USA
| | | | - Weimin Mou
- University of Alberta, Edmonton, AB, Canada
| | | |
Collapse
|