1
|
Wang G, Zheng C, Wu X, Deng Z, Sperandio I, Goodale MA, Chen J. The contribution of semantic distance knowledge to size constancy in perception and grasping when visual cues are limited. Neuropsychologia 2024; 196:108838. [PMID: 38401629 DOI: 10.1016/j.neuropsychologia.2024.108838] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2023] [Revised: 01/04/2024] [Accepted: 02/21/2024] [Indexed: 02/26/2024]
Abstract
To achieve a stable perception of object size in spite of variations in viewing distance, our visual system needs to combine retinal image information and distance cues. Previous research has shown that, not only retinal cues, but also extraretinal sensory signals can provide reliable information about depth and that different neural networks (perception versus action) can exhibit preferences in the use of these different sources of information during size-distance computations. Semantic knowledge of distance, a purely cognitive signal, can also provide distance information. Do the perception and action systems show differences in their ability to use this information in calculating object size and distance? To address this question, we presented 'glow-in-the-dark' objects of different physical sizes at different real distances in a completely dark room. Participants viewed the objects monocularly through a 1-mm pinhole. They either estimated the size and distance of the objects or attempted to grasp them. Semantic knowledge was manipulated by providing an auditory cue about the actual distance of the object: "20 cm", "30 cm", and "40 cm". We found that semantic knowledge of distance contributed to some extent to size constancy operations during perceptual estimation and grasping, but size constancy was never fully restored. Importantly, the contribution of knowledge about distance to size constancy was equivalent between perception and action. Overall, our study reveals similarities and differences between the perception and action systems in the use of semantic distance knowledge and suggests that this cognitive signal is useful but not a reliable depth cue for size constancy under restricted viewing conditions.
Collapse
Affiliation(s)
- Gexiu Wang
- Center for the Study of Applied Psychology, Guangdong Key Laboratory of Mental Health and Cognitive Science, and the School of Psychology, South China Normal University, Guangzhou, Guangdong Province, 510631, China
| | - Chao Zheng
- Center for the Study of Applied Psychology, Guangdong Key Laboratory of Mental Health and Cognitive Science, and the School of Psychology, South China Normal University, Guangzhou, Guangdong Province, 510631, China
| | - Xiaoqian Wu
- Center for the Study of Applied Psychology, Guangdong Key Laboratory of Mental Health and Cognitive Science, and the School of Psychology, South China Normal University, Guangzhou, Guangdong Province, 510631, China
| | - Zhiqing Deng
- Center for the Study of Applied Psychology, Guangdong Key Laboratory of Mental Health and Cognitive Science, and the School of Psychology, South China Normal University, Guangzhou, Guangdong Province, 510631, China
| | - Irene Sperandio
- Department of Psychology and Cognitive Science, University of Trento, Rovereto, TN, 38068, Italy
| | - Melvyn A Goodale
- Western Institute for Neuroscience and the Department of Psychology, The University of Western Ontario, London, ON, N6A 5C2, Canada
| | - Juan Chen
- Center for the Study of Applied Psychology, Guangdong Key Laboratory of Mental Health and Cognitive Science, and the School of Psychology, South China Normal University, Guangzhou, Guangdong Province, 510631, China; Key Laboratory of Brain, Cognition and Education Sciences (South China Normal University), Ministry of Education, Guangzhou, Guangdong Province, 510631, China.
| |
Collapse
|
2
|
Yildiz GY, Skarbez R, Sperandio I, Chen SJ, Mulder IJ, Chouinard PA. Linear perspective cues have a greater effect on the perceptual rescaling of distant stimuli than textures in the virtual environment. Atten Percept Psychophys 2024; 86:653-665. [PMID: 38182938 DOI: 10.3758/s13414-023-02834-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/14/2023] [Indexed: 01/07/2024]
Abstract
The presence of pictorial depth cues in virtual environments is important for minimising distortions driven by unnatural viewing conditions (e.g., vergence-accommodation conflict). Our aim was to determine how different pictorial depth cues affect size constancy in virtual environments under binocular and monocular viewing conditions. We systematically removed linear perspective cues and textures of a hallway in a virtual environment. The experiment was performed using the method of constant stimuli. The task required participants to compare the size of 'far' (10 m) and 'near' (5 m) circles displayed inside a virtual environment with one or both or none of the pictorial depth cues. Participants performed the experiment under binocular and monocular viewing conditions while wearing a virtual reality headset. ANOVA revealed that size constancy was greater for both the far and the near circles in the virtual environment with pictorial depth cues compared to the one without cues. However, the effect of linear perspective cues was stronger than textures, especially for the far circle. We found no difference between the binocular and monocular viewing conditions across the different virtual environments. We conclude that linear perspective cues exert a stronger effect than textures on the perceptual rescaling of far stimuli placed in the virtual environment, and that this effect does not vary between binocular and monocular viewing conditions.
Collapse
Affiliation(s)
- Gizem Y Yildiz
- Department of Psychology, Counselling, and Therapy, La Trobe University, George Singer Building, Room 460, 75 Kingsbury Drive, Bundoora, Victoria, 3086, Australia
- Institute of Neuroscience and Medicine (INM-3), Forschungszentrum Jülich GmbH, Jülich, Germany
| | - Richard Skarbez
- Department of Computer Science and Information Technology, La Trobe University, Melbourne, VIC, Australia
| | - Irene Sperandio
- Department of Psychology and Cognitive Science, University of Trento, Rovereto, TN, Italy
| | - Sandra J Chen
- Department of Psychology, Counselling, and Therapy, La Trobe University, George Singer Building, Room 460, 75 Kingsbury Drive, Bundoora, Victoria, 3086, Australia
| | - Indiana J Mulder
- Department of Psychology, Counselling, and Therapy, La Trobe University, George Singer Building, Room 460, 75 Kingsbury Drive, Bundoora, Victoria, 3086, Australia
| | - Philippe A Chouinard
- Department of Psychology, Counselling, and Therapy, La Trobe University, George Singer Building, Room 460, 75 Kingsbury Drive, Bundoora, Victoria, 3086, Australia.
| |
Collapse
|
3
|
Paire A, Hillairet de Boisferon A, Paeye C. Empirical validation of QUEST+ in PSE and JND estimations in visual discrimination tasks. Behav Res Methods 2023; 55:3984-4001. [PMID: 36538168 PMCID: PMC10700427 DOI: 10.3758/s13428-022-02001-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/04/2022] [Indexed: 12/24/2022]
Abstract
One of the most precise methods to establish psychometric functions and estimate threshold and slope parameters is the constant stimuli procedure. The large distribution of predetermined stimulus values presented to observers enables the psychometric functions to be fully developed, but makes this procedure time-consuming. Adaptive procedures enable reliable threshold estimation while reducing the number of trials by concentrating stimulus presentations around observers' supposed threshold. Here, the stimulus value for the next trial depends on observer's responses to the previous trials. One recent improvement of these procedures is to also estimate the slope (related to discrimination sensitivity). The Bayesian QUEST+ procedure (Watson Journal of Vision, 17(3), 10, 2017), a generalization and extension of the QUEST procedure, includes this refinement. Surprisingly, this procedure is barely used. Our goal was to empirically assess its precision to evaluate size, orientation, or temporal perception, in three yes/no discrimination tasks that increase in demands. In 72 adult participants in total, we compared points of subjective equivalence (PSEs) or simultaneity (PSSs) as well as discrimination sensitivity obtained with the QUEST+, constant stimuli, and simple up-down staircase procedures. While PSEs did not differ between procedures, sensitivity estimates obtained with the 64-trials QUEST+ procedure were overestimated (i.e., just-noticeable differences, or JNDs, were underestimated). Overall, agreement between procedures was good, and was at its best for the easiest tasks. This study empirically confirmed that the QUEST+ procedure can be considered as a method of choice to accelerate PSE estimation, while keeping in mind that sensitivity estimation should be handled with caution.
Collapse
Affiliation(s)
- Adrien Paire
- Université Paris Cité, Vision Action Cognition, F-92100, Boulogne-Billancourt, France
| | | | - Céline Paeye
- Université Paris Cité, Vision Action Cognition, F-92100, Boulogne-Billancourt, France.
| |
Collapse
|
4
|
Rogers B. When is a disparity not a disparity? Toward an old theory of three-dimensional vision. Iperception 2023; 14:20416695231202726. [PMID: 38812612 PMCID: PMC11134170 DOI: 10.1177/20416695231202726] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2023] [Accepted: 09/06/2023] [Indexed: 05/31/2024] Open
Abstract
The aims of this paper are twofold: first, to discuss and analyze the concept of binocular disparity and second, to contrast the traditional "air theory" of three-dimensional vision with the much older "ground theory," first suggested by Ibn al-Haytham more than a thousand years ago. The origins of an "air theory" of perception can be traced back to Descartes and subsequently to the philosopher George Berkeley, who claimed that distance "could not be seen" because points lying along the same line of sight (in an empty space) would all project to the same location on the retina. However, Descartes was also aware that the angle of convergence of the two eyes could solve the problem of the "missing" information for the monocular observer and, since then, most visual scientists have assumed that eye vergence plays an important role both in judging absolute distance and for scaling retinal size and binocular disparities. In contrast, al-Haytham's and Gibson's "ground theories," which are based on the geometry of the textured ground plane surface that has surrounded us throughout evolution and during our lifetimes, are not just more ecologically based but they also obviate the need for disparity scaling.
Collapse
Affiliation(s)
- Brian Rogers
- Experimental Psychology, University of Oxford, Oxford, UK
| |
Collapse
|
5
|
Linton P. Minimal theory of 3D vision: new approach to visual scale and visual shape. Philos Trans R Soc Lond B Biol Sci 2023; 378:20210455. [PMID: 36511406 PMCID: PMC9745885 DOI: 10.1098/rstb.2021.0455] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2022] [Accepted: 07/20/2022] [Indexed: 12/15/2022] Open
Abstract
Since Kepler and Descartes in the early-1600s, vision science has been committed to a triangulation model of stereo vision. But in the early-1800s, we realized that disparities are responsible for stereo vision. And we have spent the past 200 years trying to shoe-horn disparities back into the triangulation account. The first part of this article argues that this is a mistake, and that stereo vision is a solution to a different problem: the eradication of rivalry between the two retinal images, rather than the triangulation of objects in space. This leads to a 'minimal theory of 3D vision', where 3D vision is no longer tied to estimating the scale, shape, and direction of objects in the world. The second part of this article then asks whether the other aspects of 3D vision, which go beyond stereo vision, really operate at the same level of visual experience as stereo vision? I argue they do not. Whilst we want a theory of real-world 3D vision, the literature risks giving us a theory of picture perception instead. And I argue for a two-stage theory, where our purely internal 'minimal' 3D percept (from stereo vision) is linked to the world through cognition. This article is part of a discussion meeting issue 'New approaches to 3D vision'.
Collapse
Affiliation(s)
- Paul Linton
- Presidential Scholars in Society and Neuroscience, Center for Science and Society, Columbia University, New York, NY 10027, USA
- Italian Academy for Advanced Studies in America, Columbia University, New York, NY 10027, USA
- Visual Inference Lab, Zuckerman Mind Brain Behavior Institute, Columbia University, New York, NY 10027, USA
| |
Collapse
|
6
|
The Riemannian Geometry Theory of Visually-Guided Movement Accounts for Afterimage Illusions and Size Constancy. Vision (Basel) 2022; 6:vision6020037. [PMID: 35737424 PMCID: PMC9231332 DOI: 10.3390/vision6020037] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2022] [Revised: 06/14/2022] [Accepted: 06/16/2022] [Indexed: 11/17/2022] Open
Abstract
This discussion paper supplements our two theoretical contributions previously published in this journal on the geometric nature of visual space. We first show here how our Riemannian formulation explains the recent experimental finding (published in this special issue on size constancy) that, contrary to conclusions from past work, vergence does not affect perceived size. We then turn to afterimage experiments connected to that work. Beginning with the Taylor illusion, we explore how our proposed Riemannian visual–somatosensory–hippocampal association memory network accounts in the following way for perceptions that occur when afterimages are viewed in conjunction with body movement. The Riemannian metric incorporated in the association memory network accurately emulates the warping of 3D visual space that is intrinsically introduced by the eye. The network thus accurately anticipates the change in size of retinal images of objects with a change in Euclidean distance between the egocentre and the object. An object will only be perceived to change in size when there is a difference between the actual size of its image on the retina and the anticipated size of that image provided by the network. This provides a central mechanism for size constancy. If the retinal image is the afterimage of a body part, typically a hand, and that hand moves relative to the egocentre, the afterimage remains constant but the proprioceptive signals change to give the new hand position. When the network gives the anticipated size of the hand at its new position this no longer matches the fixed afterimage, hence a size-change illusion occurs.
Collapse
|
7
|
Developmental Trajectories of Size Constancy as Implicitly Examined by Simple Reaction Times. Vision (Basel) 2021; 5:vision5040050. [PMID: 34698311 PMCID: PMC8544729 DOI: 10.3390/vision5040050] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2021] [Revised: 09/24/2021] [Accepted: 10/13/2021] [Indexed: 11/16/2022] Open
Abstract
It is still unclear whether size constancy is an innate ability or whether it develops with age. As many developmental studies are limited to the child's comprehension of the task instructions, here, an implicit measure of perceived size, namely, simple manual reaction time (RT), was opted for based on the assumption that perceptually bigger objects generate faster detection times. We examined size constancy in children (from 5 to 14 years of age) and adults using a simple RT approach. Participants were presented with pictures of tennis balls on a screen that was physically moved to two viewing distances. Visual stimuli were adjusted in physical size in order to subtend the same visual angle across distances, determining two conditions: a small-near tennis ball vs. a big-far tennis ball. Thanks to size constancy, the two tennis balls were perceived as different even though they were of equal size on the retina. Stimuli were also matched in terms of luminance. Participants were asked to react as fast as possible to the onset of the stimuli. The results show that the RTs reflected the perceived rather than the retinal size of the stimuli across the different age groups, such that participants responded faster to stimuli that were perceived as bigger than those perceived as smaller. Hence, these findings are consistent with the idea that size constancy is already present in early childhood, at least from the age of five, and does not require extensive visual learning.
Collapse
|
8
|
Linton P. V1 as an egocentric cognitive map. Neurosci Conscious 2021; 2021:niab017. [PMID: 34532068 PMCID: PMC8439394 DOI: 10.1093/nc/niab017] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2021] [Revised: 05/21/2021] [Accepted: 06/08/2021] [Indexed: 01/20/2023] Open
Abstract
We typically distinguish between V1 as an egocentric perceptual map and the hippocampus as an allocentric cognitive map. In this article, we argue that V1 also functions as a post-perceptual egocentric cognitive map. We argue that three well-documented functions of V1, namely (i) the estimation of distance, (ii) the estimation of size, and (iii) multisensory integration, are better understood as post-perceptual cognitive inferences. This argument has two important implications. First, we argue that V1 must function as the neural correlates of the visual perception/cognition distinction and suggest how this can be accommodated by V1's laminar structure. Second, we use this insight to propose a low-level account of visual consciousness in contrast to mid-level accounts (recurrent processing theory; integrated information theory) and higher-level accounts (higher-order thought; global workspace theory). Detection thresholds have been traditionally used to rule out such an approach, but we explain why it is a mistake to equate visibility (and therefore the presence/absence of visual experience) with detection thresholds.
Collapse
Affiliation(s)
- Paul Linton
- Centre for Applied Vision Research, City, University of London, Northampton Square, London EC1V 0HB, UK
| |
Collapse
|