1
|
Kavšek M, Heil M. Monocular gap stereopsis in infants. J Exp Child Psychol 2025; 249:106107. [PMID: 39447309 DOI: 10.1016/j.jecp.2024.106107] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2024] [Revised: 09/25/2024] [Accepted: 09/25/2024] [Indexed: 10/26/2024]
Abstract
In monocular gap stereopsis, one eye perceives a complete rectangular surface while the other eye perceives two small adjacent rectangular surfaces separated by a narrow vertical gap. Our visual system interprets the difference caused by the unmatched monocular images as a depth difference between two small rectangles. In a spontaneous visual preference study, it was asked whether participants aged 4 months responded to the depth effect generated by a monocular gap. Two experimental conditions were conducted. In one (large outer edge disparity condition), the monocular depth effect was twice as strong as in the other one (small outer edge disparity condition), according to the experimental research with adult participants conducted by Pianta and Gillam (2003, Vision Research, Vol. 43, pp. 1937-1950). In both conditions, it was tested whether the stimulus bearing monocular gap stereopsis was preferred over a comparison stimulus without depth. According to the results, the participants preferred looking at the stimulus with monocular stereopsis in the large outer edge disparity condition over doing so in the small outer edge disparity condition. Moreover, the difference between experimental conditions was significant; that is, the infants displayed a stronger spontaneous preference in the condition with the large outer edge disparity than in the condition with the small outer edge disparity. These findings provide evidence to suggest that infants aged 4 months are able to respond to monocular vertical gap information.
Collapse
Affiliation(s)
| | - Martin Heil
- Institute of Experimental Psychology, Heinrich Heine University Düsseldorf, 40225 Düsseldorf, Germany
| |
Collapse
|
2
|
Kim H, Koike Y, Choi W, Lee J. The effect of different depth planes during a manual tracking task in three-dimensional virtual reality space. Sci Rep 2023; 13:21499. [PMID: 38057361 PMCID: PMC10700492 DOI: 10.1038/s41598-023-48869-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2023] [Accepted: 11/30/2023] [Indexed: 12/08/2023] Open
Abstract
Unlike ballistic arm movements such as reaching, the contribution of depth information to the performance of manual tracking movements is unclear. Thus, to understand how the brain handles information, we investigated how a required movement along the depth axis would affect behavioral tracking performance, postulating that it would be affected by the amount of depth movement. We designed a visually guided planar tracking task that requires movement on three planes with different depths: a fronto-parallel plane called ROT (0), a sagittal plane called ROT (90), and a plane rotated by 45° with respect to the sagittal plane called ROT (45). Fifteen participants performed a circular manual tracking task under binocular and monocular visions in a three-dimensional (3D) virtual reality space. As a result, under binocular vision, ROT (90), which required the largest depth movement among the tasks, showed the greatest error in 3D. Similarly, the errors (deviation from the target path) on the depth axis revealed significant differences among the tasks. Under monocular vision, significant differences in errors were observed only on the lateral axis. Moreover, we observed that the errors in the lateral and depth axes were proportional to the required movement on these axes under binocular vision and confirmed that the required depth movement under binocular vision determined depth error independent of the other axes. This finding implies that the brain may independently process binocular vision information on each axis. Meanwhile, the required depth movement under monocular vision was independent of performance along the depth axis, indicating an intractable behavior. Our findings highlight the importance of handling depth movement, especially when a virtual reality situation, involving tracking tasks, is generated.
Collapse
Affiliation(s)
- Hyeonseok Kim
- Swartz Center for Computational Neuroscience, Institute for Neural Computation, University of California San Diego, La Jolla, CA, 92093, USA
| | - Yasuharu Koike
- Institute of Innovative Research, Tokyo Institute of Technology, Yokohama, 226-8503, Japan
| | - Woong Choi
- College of ICT Construction & Welfare Convergence, Kangnam University, Yongin, 16979, Republic of Korea.
| | - Jongho Lee
- Department of Clinical Engineering, Komatsu University, Komatsu, 923-0961, Japan.
| |
Collapse
|
3
|
Shi Y, Zhang J, Lin W, Chung-Fat-Yim A, Yang Q, Li H. The effect of training on sensitivity and stability of double fusion in Panum's limiting case. Atten Percept Psychophys 2023; 85:2894-2906. [PMID: 37831363 DOI: 10.3758/s13414-023-02795-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/14/2023] [Indexed: 10/14/2023]
Abstract
Panum's limiting case is a phenomenon of monocular occlusion in binocular vision. This occurs when one object is occluded by the other object for one eye, but the two objects are both visible for the other eye. Although previous studies have found that vertical gradient of horizontal disparity and cue conflict are two important factors for double fusion, the effect of training on the sensitivity and stability of Panum's limiting case remains unknown. The current study trained 26 participants for 5 days with several of Panum's configurations (Gilliam, Frisby, and Wang series). The latency and duration of double fusion were recorded to examine the effects of training on sensitivity and stability of double fusion in Panum's limiting case. For each level of vertical gradient of horizontal disparity and cue conflict, the latency of double fusion decreased and the duration of double fusion increased with each additional training session. The results showed that vertical gradient of horizontal disparity and cue conflict interacted, and the duration of high cue conflict was significantly shorter than that of medium and low cue conflict for each level of vertical gradient of horizontal disparity. The findings suggest that there is an effect of training for vertical gradient of horizontal disparity and cue conflict in Panum's limiting case, and that the three factors jointly affect the sensitivity and stability of double fusion.
Collapse
Affiliation(s)
- Yuyu Shi
- School of Psychology, Zhejiang Normal University, Jinhua, 321004, China
- Key Laboratory of Intelligent Education Technology and Application, Zhejiang Normal University, Jinhua, China
| | - Jiaxi Zhang
- School of Psychology, Zhejiang Normal University, Jinhua, 321004, China
- Key Laboratory of Intelligent Education Technology and Application, Zhejiang Normal University, Jinhua, China
| | - Wenmin Lin
- School of English Studies, Shanghai International Studies University, Shanghai, China
| | - Ashley Chung-Fat-Yim
- Department of Communication Sciences and Disorders, Northwestern University, Evanston, IL, USA
| | - Qihang Yang
- College of Foreign Language, Zhejiang Normal University, Jinhua, China
| | - Huayun Li
- School of Psychology, Zhejiang Normal University, Jinhua, 321004, China.
- Key Laboratory of Intelligent Education Technology and Application, Zhejiang Normal University, Jinhua, China.
| |
Collapse
|
4
|
Rosenberg A, Thompson LW, Doudlah R, Chang TY. Neuronal Representations Supporting Three-Dimensional Vision in Nonhuman Primates. Annu Rev Vis Sci 2023; 9:337-359. [PMID: 36944312 DOI: 10.1146/annurev-vision-111022-123857] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/23/2023]
Abstract
The visual system must reconstruct the dynamic, three-dimensional (3D) world from ambiguous two-dimensional (2D) retinal images. In this review, we synthesize current literature on how the visual system of nonhuman primates performs this transformation through multiple channels within the classically defined dorsal (where) and ventral (what) pathways. Each of these channels is specialized for processing different 3D features (e.g., the shape, orientation, or motion of objects, or the larger scene structure). Despite the common goal of 3D reconstruction, neurocomputational differences between the channels impose distinct information-limiting constraints on perception. Convergent evidence further points to the little-studied area V3A as a potential branchpoint from which multiple 3D-fugal processing channels diverge. We speculate that the expansion of V3A in humans may have supported the emergence of advanced 3D spatial reasoning skills. Lastly, we discuss future directions for exploring 3D information transmission across brain areas and experimental approaches that can further advance the understanding of 3D vision.
Collapse
Affiliation(s)
- Ari Rosenberg
- Department of Neuroscience, School of Medicine and Public Health, University of Wisconsin-Madison, Madison, Wisconsin, USA;
| | - Lowell W Thompson
- Department of Neuroscience, School of Medicine and Public Health, University of Wisconsin-Madison, Madison, Wisconsin, USA;
| | - Raymond Doudlah
- Department of Neuroscience, School of Medicine and Public Health, University of Wisconsin-Madison, Madison, Wisconsin, USA;
| | - Ting-Yu Chang
- School of Medicine, National Defense Medical Center, Taipei, Taiwan
| |
Collapse
|
5
|
Li H, Wang X, Li J, Lin W. Double fusion, a depth perception mechanism in Panum's limiting case. Perception 2023; 52:613-628. [PMID: 37408435 DOI: 10.1177/03010066231177513] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/07/2023]
Abstract
The origin of depth in Panum's limiting case is unclear at present, so we investigated the depth perception mechanism using a triangle type of Panum's stimulus with a slant effect and clear criterion. Experiment 1 explored whether participants can correctly perceive fixation and nonfixation features using the fixation point and quick representation of stimuli, then examined whether participants' depth judgments supported double fusion or single fusion. The results of Experiment 1 showed that participants could correctly perceive the depth of fixation and nonfixation features. That is, it supported double fusion. In Experiment 2, we examined whether the depth perceived by observers comes from depth contrast. The results of Experiment 2 showed that the depth of the two features perceived after binocular fusion did not originate from the depth contrast. The findings suggest that the depth perception mechanism of Panum's limiting case is more likely to be double fusion.
Collapse
Affiliation(s)
| | | | - Jing Li
- Zhejiang Normal University, China
| | - Wenmin Lin
- Shanghai International Studies University, China
| |
Collapse
|
6
|
Tozzi A, Mariniello L. Unusual Mathematical Approaches Untangle Nervous Dynamics. Biomedicines 2022; 10:biomedicines10102581. [PMID: 36289843 PMCID: PMC9599563 DOI: 10.3390/biomedicines10102581] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2022] [Revised: 10/10/2022] [Accepted: 10/10/2022] [Indexed: 11/16/2022] Open
Abstract
The massive amount of available neurodata suggests the existence of a mathematical backbone underlying neuronal oscillatory activities. For example, geometric constraints are powerful enough to define cellular distribution and drive the embryonal development of the central nervous system. We aim to elucidate whether underrated notions from geometry, topology, group theory and category theory can assess neuronal issues and provide experimentally testable hypotheses. The Monge’s theorem might contribute to our visual ability of depth perception and the brain connectome can be tackled in terms of tunnelling nanotubes. The multisynaptic ascending fibers connecting the peripheral receptors to the neocortical areas can be assessed in terms of knot theory/braid groups. Presheaves from category theory permit the tackling of nervous phase spaces in terms of the theory of infinity categories, highlighting an approach based on equivalence rather than equality. Further, the physical concepts of soft-matter polymers and nematic colloids might shed new light on neurulation in mammalian embryos. Hidden, unexpected multidisciplinary relationships can be found when mathematics copes with neural phenomena, leading to novel answers for everlasting neuroscientific questions. For instance, our framework leads to the conjecture that the development of the nervous system might be correlated with the occurrence of local thermal changes in embryo–fetal tissues.
Collapse
Affiliation(s)
- Arturo Tozzi
- Center for Nonlinear Science, University of North Texas, Denton, TX 76203-5017, USA
- Correspondence:
| | - Lucio Mariniello
- Department of Pediatrics, University Federico II, 80131 Naples, Italy
| |
Collapse
|
7
|
Goutcher R, Barrington C, Hibbard PB, Graham B. Binocular vision supports the development of scene segmentation capabilities: Evidence from a deep learning model. J Vis 2021; 21:13. [PMID: 34289490 PMCID: PMC8300045 DOI: 10.1167/jov.21.7.13] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2021] [Accepted: 06/16/2021] [Indexed: 11/24/2022] Open
Abstract
The application of deep learning techniques has led to substantial progress in solving a number of critical problems in machine vision, including fundamental problems of scene segmentation and depth estimation. Here, we report a novel deep neural network model, capable of simultaneous scene segmentation and depth estimation from a pair of binocular images. By manipulating the arrangement of binocular image pairs, presenting the model with standard left-right image pairs, identical image pairs or swapped left-right images, we show that performance levels depend on the presence of appropriate binocular image arrangements. Segmentation and depth estimation performance are both impaired when images are swapped. Segmentation performance levels are maintained, however, for identical image pairs, despite the absence of binocular disparity information. Critically, these performance levels exceed those found for an equivalent, monocularly trained, segmentation model. These results provide evidence that binocular image differences support both the direct recovery of depth and segmentation information, and the enhanced learning of monocular segmentation signals. This finding suggests that binocular vision may play an important role in visual development. Better understanding of this role may hold implications for the study and treatment of developmentally acquired perceptual impairments.
Collapse
Affiliation(s)
- Ross Goutcher
- Psychology Division, Faculty of Natural Sciences, University of Stirling, Stirling, UK
| | - Christian Barrington
- Psychology Division, Faculty of Natural Sciences, University of Stirling, Stirling, UK
- Computing Science and Mathematics Division, Faculty of Natural Sciences, University of Stirling, Stirling, UK
| | - Paul B Hibbard
- Department of Psychology, University of Essex, Colchester, UK
| | - Bruce Graham
- Computing Science and Mathematics Division, Faculty of Natural Sciences, University of Stirling, Stirling, UK
| |
Collapse
|
8
|
Basgöze Z, White DN, Burge J, Cooper EA. Natural statistics of depth edges modulate perceptual stability. J Vis 2020; 20:10. [PMID: 32761107 PMCID: PMC7438667 DOI: 10.1167/jov.20.8.10] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open
Abstract
Binocular fusion relies on matching points in the two eyes that correspond to the same physical feature in the world; however, not all world features are binocularly visible. Near depth edges, some regions of a scene are often visible to only one eye (so-called half occlusions). Accurate detection of these monocularly visible regions is likely to be important for stable visual perception. If monocular regions are not detected as such, the visual system may attempt to binocularly fuse non-corresponding points, which can result in unstable percepts. We investigated the hypothesis that the visual system capitalizes on statistical regularities associated with depth edges in natural scenes to aid binocular fusion and facilitate perceptual stability. By sampling from a large set of stereoscopic natural images with co-registered distance information, we found evidence that monocularly visible regions near depth edges primarily result from background occlusions. Accordingly, monocular regions tended to be more visually similar to the adjacent binocularly visible background region than to the adjacent binocularly visible foreground. Consistent with our hypothesis, perceptual experiments showed that perception tended to be more stable when the image properties of the depth edge were statistically more likely given the probability of occurrence in natural scenes (i.e., when monocular regions were more visually similar to the binocular background). The generality of these results was supported by a parametric study with simulated environments. Exploiting regularities in natural environments may allow the visual system to facilitate fusion and perceptual stability when both binocular and monocular regions are visible.
Collapse
|
9
|
Mamassian P, Zannoli M. Sensory loss due to object formation. Vision Res 2020; 174:22-40. [DOI: 10.1016/j.visres.2020.05.005] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2019] [Revised: 05/14/2020] [Accepted: 05/20/2020] [Indexed: 11/29/2022]
|
10
|
Kavšek M. Infants' responsiveness to half-occlusions in phantom stereograms. INFANCY 2020; 25:797-808. [PMID: 32761873 DOI: 10.1111/infa.12362] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2020] [Revised: 06/03/2020] [Accepted: 07/08/2020] [Indexed: 11/30/2022]
Abstract
The present natural preference study investigated infants 4 and 7 months of age for their ability to respond to phantom contoubrs, illusory surfaces generated by half-occlusions in a stereoscopic display consisting of a pair of parallel vertical lines. The left line in the half-image for the right eye and the right line in the half-image for the left eye have a gap in the middle. The visual system accounts for the binocular unmatched gaps by perceiving an illusory contour. Infants in the experimental condition were presented with a standard phantom stereogram displaying a phantom contour versus a non-standard phantom stereogram, the half-images of which were exchanged. This stereogram evokes the impression of two small separate illusory contours. In both stereograms, the gaps moved up and down. The participants aged 7 but not 4 months preferred looking at the standard phantom stereogram. A control condition supported the hypothesis that the infants 7 months of age in the experimental condition indeed responded to the coherent illusory surface instead of simply detecting differences in the geometric arrangement of the half-occlusions. The results hence indicate that infants are able to extract spatial information from monocular regions in a binocular display.
Collapse
Affiliation(s)
- Michael Kavšek
- Unit of Developmental and Educational Psychology, Department of Psychology, University of Bonn, Bonn, Germany
| |
Collapse
|
11
|
Binocular responsiveness of projection neurons of the praying mantis optic lobe in the frontal visual field. J Comp Physiol A Neuroethol Sens Neural Behav Physiol 2020; 206:165-181. [PMID: 32088748 PMCID: PMC7069917 DOI: 10.1007/s00359-020-01405-x] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2019] [Revised: 01/14/2020] [Accepted: 01/18/2020] [Indexed: 11/30/2022]
Abstract
Praying mantids are the only insects proven to have stereoscopic vision (stereopsis): the ability to perceive depth from the slightly shifted images seen by the two eyes. Recently, the first neurons likely to be involved in mantis stereopsis were described and a speculative neuronal circuit suggested. Here we further investigate classes of neurons in the lobula complex of the praying mantis brain and their tuning to stereoscopically-defined depth. We used sharp electrode recordings with tracer injections to identify visual projection neurons with input in the optic lobe and output in the central brain. In order to measure binocular response fields of the cells the animals watched a vertical bar stimulus in a 3D insect cinema during recordings. We describe the binocular tuning of 19 neurons projecting from the lobula complex and the medulla to central brain areas. The majority of neurons (12/19) were binocular and had receptive fields for both eyes that overlapped in the frontal region. Thus, these neurons could be involved in mantis stereopsis. We also find that neurons preferring different contrast polarity (bright vs dark) tend to be segregated in the mantis lobula complex, reminiscent of the segregation for small targets and widefield motion in mantids and other insects.
Collapse
|
12
|
Krekhov A, Cmentowski S, Waschk A, Kruger J. Deadeye Visualization Revisited: Investigation of Preattentiveness and Applicability in Virtual Environments. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS 2020; 26:547-557. [PMID: 31425106 DOI: 10.1109/tvcg.2019.2934370] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Visualizations rely on highlighting to attract and guide our attention. To make an object of interest stand out independently from a number of distractors, the underlying visual cue, e.g., color, has to be preattentive. In our prior work, we introduced Deadeye as an instantly recognizable highlighting technique that works by rendering the target object for one eye only. In contrast to prior approaches, Deadeye excels by not modifying any visual properties of the target. However, in the case of 2D visualizations, the method requires an additional setup to allow dichoptic presentation, which is a considerable drawback. As a follow-up to requests from the community, this paper explores Deadeye as a highlighting technique for 3D visualizations, because such stereoscopic scenarios support dichoptic presentation out of the box. Deadeye suppresses binocular disparities for the target object, so we cannot assume the applicability of our technique as a given fact. With this motivation, the paper presents quantitative evaluations of Deadeye in VR, including configurations with multiple heterogeneous distractors as an important robustness challenge. After confirming the preserved preattentiveness (all average accuracies above 90%) under such real-world conditions, we explore VR volume rendering as an example application scenario for Deadeye. We depict a possible workflow for integrating our technique, conduct an exploratory survey to demonstrate benefits and limitations, and finally provide related design implications.
Collapse
|
13
|
Kavšek M, Braun S. Infant perception of von Szily contours. J Exp Child Psychol 2019; 184:82-97. [PMID: 31015100 DOI: 10.1016/j.jecp.2019.04.001] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2018] [Revised: 04/01/2019] [Accepted: 04/01/2019] [Indexed: 11/28/2022]
Abstract
This habituation-dishabituation study examined infants' perception of subjective von Szily contours, the illusory effect of which is generated by horizontal disparity and half-occlusions. In these contours, a foreground surface appears to partially occlude a background surface. In Experiment 1, participants aged 4 and 5 months were habituated to a von Szily figure and were then tested for their ability to perceive the difference between the habituation figure and the same figure with reversed depth relations. The infants displayed significant novelty preferences during the posthabituation period. This observation indicates that 4- and 5-month-olds respond to the stereoscopically specified depth difference between the two surfaces of von Szily figures. In Experiment 2, participants aged 4 and 5 months were tested for the ability to conduct modal completion, that is, to perceive the surface that is stereoscopically shifted into the foreground as a whole. The infants were habituated to a von Szily figure and then examined for their ability to distinguish between complete and incomplete versions of the foreground surface. Longer looking at the incomplete posthabituation pattern indicates modal completion; the infants recognize the complete pattern as familiar and regard the incomplete pattern as novel. Similarly, Experiment 3 investigated whether infants aged 5 and 7 months amodally complete the background surface, that is, the surface that is partially covered by the foreground surface. Experiment 2 found modal completion in 5-month-olds. Experiment 3 established that 5- and 7-month-olds have developed some ability to conduct amodal completion. In sum, infants perceive the depth information in von Szily contours and conduct modal and amodal completion.
Collapse
Affiliation(s)
- Michael Kavšek
- Unit of Developmental and Educational Psychology, Department of Psychology, University of Bonn, 53111 Bonn, Germany.
| | - Stephanie Braun
- Unit of Developmental and Educational Psychology, Department of Psychology, University of Bonn, 53111 Bonn, Germany
| |
Collapse
|
14
|
Marini F, Breeding KA, Snow JC. Distinct visuo-motor brain dynamics for real-world objects versus planar images. Neuroimage 2019; 195:232-242. [PMID: 30776529 DOI: 10.1016/j.neuroimage.2019.02.026] [Citation(s) in RCA: 33] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2018] [Revised: 01/26/2019] [Accepted: 02/09/2019] [Indexed: 10/27/2022] Open
Abstract
Ultimately, we aim to generalize and translate scientific knowledge to the real world, yet current understanding of human visual perception is based predominantly on studies of two-dimensional (2-D) images. Recent cognitive-behavioral evidence shows that real objects are processed differently to images, although the neural processes that underlie these differences are unknown. Because real objects (unlike images) afford actions, they may trigger stronger or more prolonged activation in neural populations for visuo-motor action planning. Here, we recorded electroencephalography (EEG) when human observers viewed real-world three-dimensional (3-D) objects or closely matched 2-D images of the same items. Although responses to real objects and images were similar overall, there were critical differences. Compared to images, viewing real objects triggered stronger and more sustained event-related desynchronization (ERD) in the μ frequency band (8-13 Hz) - a neural signature of automatic motor preparation. Event-related potentials (ERPs) revealed a transient, early occipital negativity for real objects (versus images), likely reflecting 3-D stereoscopic differences, and a late sustained parietal amplitude modulation consistent with an 'old-new' memory advantage for real objects over images. Together, these findings demonstrate that real-world objects trigger stronger and more sustained action-related brain responses than images do. The results highlight important similarities and differences between brain responses to images and richer, more ecologically relevant, real-world objects.
Collapse
Affiliation(s)
- Francesco Marini
- Department of Psychology, University of Nevada, 1664 N Virginia St, Reno, NV, 89557-0296, USA; Swartz Center for Computational Neuroscience, University of California San Diego, 9500 Gilman Drive, La Jolla, CA, 92093-0559, USA.
| | - Katherine A Breeding
- Department of Psychology, University of Nevada, 1664 N Virginia St, Reno, NV, 89557-0296, USA
| | - Jacqueline C Snow
- Department of Psychology, University of Nevada, 1664 N Virginia St, Reno, NV, 89557-0296, USA.
| |
Collapse
|
15
|
Nityananda V, Read JCA. Stereopsis in animals: evolution, function and mechanisms. ACTA ACUST UNITED AC 2018; 220:2502-2512. [PMID: 28724702 PMCID: PMC5536890 DOI: 10.1242/jeb.143883] [Citation(s) in RCA: 51] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
Stereopsis is the computation of depth information from views acquired simultaneously from different points in space. For many years, stereopsis was thought to be confined to primates and other mammals with front-facing eyes. However, stereopsis has now been demonstrated in many other animals, including lateral-eyed prey mammals, birds, amphibians and invertebrates. The diversity of animals known to have stereo vision allows us to begin to investigate ideas about its evolution and the underlying selective pressures in different animals. It also further prompts the question of whether all animals have evolved essentially the same algorithms to implement stereopsis. If so, this must be the best way to do stereo vision, and should be implemented by engineers in machine stereopsis. Conversely, if animals have evolved a range of stereo algorithms in response to different pressures, that could inspire novel forms of machine stereopsis appropriate for distinct environments, tasks or constraints. As a first step towards addressing these ideas, we here review our current knowledge of stereo vision in animals, with a view towards outlining common principles about the evolution, function and mechanisms of stereo vision across the animal kingdom. We conclude by outlining avenues for future work, including research into possible new mechanisms of stereo vision, with implications for machine vision and the role of stereopsis in the evolution of camouflage. Summary: Stereopsis has evolved independently in different animals. We review the various functions it serves and the variety of mechanisms that could underlie stereopsis in different species.
Collapse
Affiliation(s)
- Vivek Nityananda
- Wissenschaftskolleg zu Berlin, Institute for Advanced Study, Wallotstraße 19, Berlin 14193, Germany .,Newcastle University, Institute of Neuroscience, Henry Wellcome Building, Framlington Place, Newcastle Upon Tyne NE2 4HH, UK
| | - Jenny C A Read
- Newcastle University, Institute of Neuroscience, Henry Wellcome Building, Framlington Place, Newcastle Upon Tyne NE2 4HH, UK
| |
Collapse
|
16
|
Krekhov A, Kruger J. Deadeye: A Novel Preattentive Visualization Technique Based on Dichoptic Presentation. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS 2018; 25:936-945. [PMID: 30130196 DOI: 10.1109/tvcg.2018.2864498] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
Preattentive visual features such as hue or flickering can effectively draw attention to an object of interest - for instance, an important feature in a scientific visualization. These features appear to pop out and can be recognized by our visual system, independently from the number of distractors. Most cues do not take advantage of the fact that most humans have two eyes. In cases where binocular vision is applied, it is almost exclusively used to convey depth by exposing stereo pairs. We present Deadeye, a novel preattentive visualization technique based on presenting different stimuli to each eye. The target object is rendered for one eye only and is instantly detected by our visual system. In contrast to existing cues, Deadeye does not modify any visual properties of the target and, thus, is particularly suited for visualization applications. Our evaluation confirms that Deadeye is indeed perceived preattentively. We also explore a conjunction search based on our technique and show that, in contrast to 3D depth, the task cannot be processed in parallel.
Collapse
|
17
|
Jaschinski W. Individual objective versus subjective fixation disparity as a function of forced vergence. PLoS One 2018; 13:e0199958. [PMID: 29980146 PMCID: PMC6035046 DOI: 10.1371/journal.pone.0199958] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2017] [Accepted: 06/18/2018] [Indexed: 11/19/2022] Open
Abstract
Inaccuracy in the vergence eye position ("fixation disparity") can occur despite a fusion stimulus. When measured with eye trackers, this inaccuracy is referred to as "objective fixation disparity". It is a matter of debate whether objective fixation disparity can be estimated with a technically simple psycho-physical procedure, i.e. the perceived offset of aligned dichoptic nonius targets, referred to as "subjective fixation disparity". To investigate the relation between these two measures, simultaneous tests were made in far vision when placing prisms in front of the eyes (for a few seconds) in order to induce forced vergence, i.e. to vary the absolute disparity (from 1 deg divergent to 3.4 deg convergent). Frequent repeated measurements in 12 observers allowed for individual analyses. Generally, fixation disparity values and the effects of prisms were much smaller in the subjective than in the objective measures. Some observers differed systematically in the characteristics of the two types of prism-induced curves. Individual regressions showed that the subjective vs. objective slope was 8% on the average (with largest individual values of 18%). This suggests that sensory fusion shifts the visual direction of the (peripheral) binocular targets by the full amount of objective fixation disparity (since single vision was achieved); however, for the (central) monocular nonius lines this shift was more or less incomplete so that the dichoptic nonius targets indicated an individual percentage of objective fixation disparity. The subjective-to-objective ratio seems to be an individual characteristic of fixation disparity in terms of the amount and in terms of the effect of prism-induced forced vergence. Therefore, on the group level the subjective measures do not allow for a precise prediction of the objective measures.
Collapse
Affiliation(s)
- Wolfgang Jaschinski
- Leibniz Research Centre of Working Environment and Human Factors, Dortmund, Germany
| |
Collapse
|
18
|
Iyer AV, Burge J. Depth variation and stereo processing tasks in natural scenes. J Vis 2018; 18:4. [PMID: 30029214 PMCID: PMC6005632 DOI: 10.1167/18.6.4] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2017] [Accepted: 03/30/2018] [Indexed: 01/02/2023] Open
Abstract
Local depth variation is a distinctive property of natural scenes, but its effects on perception have only recently begun to be investigated. Depth variation in natural scenes is due to depth edges between objects and surface nonuniformities within objects. Here, we demonstrate how natural depth variation impacts performance in two fundamental tasks related to stereopsis: half-occlusion detection and disparity detection. We report the results of a computational study that uses a large database of natural stereo-images and coregistered laser-based distance measurements. First, we develop a procedure for precisely sampling stereo-image patches from the stereo-images and then quantify the local depth variation in each patch by its disparity contrast. Next, we show that increased disparity contrast degrades half-occlusion detection and disparity detection performance and changes the size and shape of the spatial integration areas ("receptive fields") that optimize performance. Then, we show that a simple image-computable binocular statistic predicts disparity contrast in natural scenes. Finally, we report the most likely spatial patterns of disparity variation and disparity discontinuities (half-occlusions) in natural scenes. Our findings motivate computational and psychophysical investigations of the mechanisms that underlie stereo processing tasks in local regions of natural scenes.
Collapse
Affiliation(s)
- Arvind V Iyer
- Department of Psychology, University of Pennsylvania, Philadelphia, PA, USA
| | - Johannes Burge
- Department of Psychology, University of Pennsylvania, Philadelphia, PA, USA
- Neuroscience Graduate Group, University of Pennsylvania, Philadelphia, PA, USA
- Bioengineering Graduate Group, University of Pennsylvania, Philadelphia, PA, USA
| |
Collapse
|
19
|
Smith D, Ropar D, Allen HA. The Integration of Occlusion and Disparity Information for Judging Depth in Autism Spectrum Disorder. J Autism Dev Disord 2017; 47:3112-3124. [PMID: 28688073 PMCID: PMC5602035 DOI: 10.1007/s10803-017-3234-x] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
In autism spectrum disorder (ASD), atypical integration of visual depth cues may be due to flattened perceptual priors or selective fusion. The current study attempts to disentangle these explanations by psychophysically assessing within-modality integration of ordinal (occlusion) and metric (disparity) depth cues while accounting for sensitivity to stereoscopic information. Participants included 22 individuals with ASD and 23 typically developing matched controls. Although adults with ASD were found to have significantly poorer stereoacuity, they were still able to automatically integrate conflicting depth cues, lending support to the idea that priors are intact in ASD. However, dissimilarities in response speed variability between the ASD and TD groups suggests that there may be differences in the perceptual decision-making aspect of the task.
Collapse
Affiliation(s)
- Danielle Smith
- School of Psychology, University of Nottingham, Nottingham, NG7 2RD, UK
| | - Danielle Ropar
- School of Psychology, University of Nottingham, Nottingham, NG7 2RD, UK
| | - Harriet A Allen
- School of Psychology, University of Nottingham, Nottingham, NG7 2RD, UK.
| |
Collapse
|
20
|
Cammack P, Harris JM. Depth perception in disparity-defined objects: finding the balance between averaging and segregation. Philos Trans R Soc Lond B Biol Sci 2017; 371:rstb.2015.0258. [PMID: 27269601 PMCID: PMC4901452 DOI: 10.1098/rstb.2015.0258] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/09/2016] [Indexed: 11/20/2022] Open
Abstract
Deciding what constitutes an object, and what background, is an essential task for the visual system. This presents a conundrum: averaging over the visual scene is required to obtain a precise signal for object segregation, but segregation is required to define the region over which averaging should take place. Depth, obtained via binocular disparity (the differences between two eyes’ views), could help with segregation by enabling identification of object and background via differences in depth. Here, we explore depth perception in disparity-defined objects. We show that a simple object segregation rule, followed by averaging over that segregated area, can account for depth estimation errors. To do this, we compared objects with smoothly varying depth edges to those with sharp depth edges, and found that perceived peak depth was reduced for the former. A computational model used a rule based on object shape to segregate and average over a central portion of the object, and was able to emulate the reduction in perceived depth. We also demonstrated that the segregated area is not predefined but is dependent on the object shape. We discuss how this segregation strategy could be employed by animals seeking to deter binocular predators. This article is part of the themed issue ‘Vision in our three-dimensional world’.
Collapse
Affiliation(s)
- P Cammack
- School of Psychology and Neuroscience, University of St Andrews, St Andrews KY16 9JP, UK
| | - J M Harris
- School of Psychology and Neuroscience, University of St Andrews, St Andrews KY16 9JP, UK
| |
Collapse
|
21
|
Abstract
SIGNIFICANCE Increasing evidence indicates that childhood binocular vision disorders that lead to stereodeficiency may be treated in adulthood. Reports of patients who gain stereopsis as adults indicate that this achievement provides for a qualitatively different and dramatically improved sense of space and depth. PURPOSE Increasing evidence suggests that stereopsis can be achieved in adult patients despite long-standing binocular disorders. We polled individuals who gained stereopsis as adults to ascertain their initial binocular disorders, the length of time they were stereodeficient, effective treatments, and the nature of their recovered stereovision. METHODS A questionnaire was posted online and announced in a brief article in the journal Vision Development and Rehabilitation. RESULTS Of the 63 responders, 56 (89%) reported strabismus and/or amblyopia, and 55 (87%) indicated that they had been stereodeficient for as long as they could remember. All but seven participants (89%) achieved stereovision through vision training or a combination of surgery and vision training, and many reported vivid visual changes. CONCLUSIONS Despite childhood binocular disorders, patients may be able to achieve stereopsis following interventions in adulthood. This achievement provides for a qualitatively different and dramatically improved sense of space and depth.
Collapse
|
22
|
Abstract
When the corresponding retinal locations in the two eyes are presented with incompatible images, a stable percept gives way to perceptual alternations in which the two images compete for perceptual dominance. As perceptual experience evolves dynamically under constant external inputs, binocular rivalry has been used for studying intrinsic cortical computations and for understanding how the brain regulates competing inputs. Converging behavioral and EEG results have shown that binocular rivalry and attention are intertwined: binocular rivalry ceases when attention is diverted away from the rivalry stimuli. In addition, the competing image in one eye suppresses the target in the other eye through a pattern of gain changes similar to those induced by attention. These results require a revision of the current computational theories of binocular rivalry, in which the role of attention is ignored. Here, we provide a computational model of binocular rivalry. In the model, competition between two images in rivalry is driven by both attentional modulation and mutual inhibition, which have distinct selectivity (feature vs. eye of origin) and dynamics (relatively slow vs. relatively fast). The proposed model explains a wide range of phenomena reported in rivalry, including the three hallmarks: (i) binocular rivalry requires attention; (ii) various perceptual states emerge when the two images are swapped between the eyes multiple times per second; (iii) the dominance duration as a function of input strength follows Levelt's propositions. With a bifurcation analysis, we identified the parameter space in which the model's behavior was consistent with experimental results.
Collapse
|
23
|
Bi F, Ni R, Chen Y. Spatial constraints of binocularly matched information on perceived depth resulted from temporal interocular asynchrony. ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. ANNUAL INTERNATIONAL CONFERENCE 2017; 2017:966-969. [PMID: 29060034 DOI: 10.1109/embc.2017.8036986] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]
Abstract
Quantitative depth of a subjective occluding surface could only be perceived when binocularly matched information is presented along with temporal interocular unmatched (IOUM) information [1]. However, it is still unclear whether the horizontal separation that binocularly matched information is presented from the occluding surface edge would affect the quantitative depth perception. In the present study we aim to answer this question by manipulating the distance between binocularly matched and IOUM information in spatial domains. In Experiment 1, we examined the spatial limitation of binocular matched information along horizontal axes by measuring the threshold for quantitatively perceiving depth from temporal interocular asynchrony. In Experiment 2, within the threshold, temporal IOUM information was presented at three distances from the matched information along horizontal axes, respectively. Our results from these experiments showed a robust effect of binocular information in generating quantitative perceived depth from IOUM information in spatial domains. Moreover, the perceived depth of the subjective surface can be significantly influenced by spatial separations along horizontal axes. This may help to propose a computational theory of the mechanisms underlying "da Vinci stereopsis" in motion and other stereoscopic percepts.
Collapse
|
24
|
Li H, Jia H, Chung-Fat-Yim A, Jin L, Yu D. The neural correlates of vertical disparity gradient and cue conflict in Panum's limiting case. Brain Cogn 2017; 114:11-19. [PMID: 28327354 DOI: 10.1016/j.bandc.2017.02.005] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2016] [Accepted: 02/21/2017] [Indexed: 10/19/2022]
Abstract
Although Panum's limiting case has been extensively researched, only recently has it been discovered that in addition to horizontal disparity, the final perception of depth is influenced by (i) the vertical disparity gradient and (ii) the degree of cue conflict between 2D and 3D shapes. The present study examines the neural correlates of the two factors, using EEG while observers viewed several versions of stereoscopic stimuli, which depicted Panum's limiting case. In these patterns the vertical disparity gradient was varied from 0.1 to 0.6, while the degree of cue conflict was manipulated from low to high. The ERP data showed that the amplitude of the N170 component (exogenous) was modulated by the vertical disparity gradient and cue conflict. In contrast, the N270 component (endogenous) was modulated by cue conflict only. Such findings demonstrate that both factors affect the perception of depth in Panum's limiting case, but at different stages: the vertical disparity gradient at an early stage of processing (N170) and cue conflict at two stages (N170 and N270). Hence, vertical disparity gradient is related to low-level visual stimulus parameters and can modulate exogenous component, while cue conflict is related to both exogenous and endogenous components.
Collapse
Affiliation(s)
- Huayun Li
- Key Laboratory of Child Development and Learning Science of Ministry of Education, Research Center for Learning Science, Southeast University, Nanjing, Jiangsu, China; Centre for Vision Research, Department of Psychology, York University, Toronto, Ontario, Canada
| | - Huibin Jia
- Key Laboratory of Child Development and Learning Science of Ministry of Education, Research Center for Learning Science, Southeast University, Nanjing, Jiangsu, China
| | | | - Laipeng Jin
- Key Laboratory of Child Development and Learning Science of Ministry of Education, Research Center for Learning Science, Southeast University, Nanjing, Jiangsu, China
| | - Dongchuan Yu
- Key Laboratory of Child Development and Learning Science of Ministry of Education, Research Center for Learning Science, Southeast University, Nanjing, Jiangsu, China.
| |
Collapse
|
25
|
Abstract
Interchanging the left and right eye views of a scene (pseudoscopic viewing) has been reported to produce vivid stereoscopic effects under certain conditions. In two separate field studies, we examined the experiences of 124 observers (76 in Study 1 and 48 in Study 2) while pseudoscopically viewing a distant natural outdoor scene. We found large individual differences in both the nature and the timing of their pseudoscopic experiences. While some observers failed to notice anything unusual about the pseudoscopic scene, most experienced multiple pseudoscopic phenomena, including apparent scene depth reversals, apparent object shape reversals, apparent size and flatness changes, apparent reversals of border ownership, and even complex illusory foreground surfaces. When multiple effects were experienced, patterns of co-occurrence suggested possible causal relationships between apparent scene depth reversals and several other pseudoscopic phenomena. The latency for experiencing pseudoscopic phenomena was found to correlate significantly with observer visual acuity, but not stereoacuity, in both studies.
Collapse
Affiliation(s)
- Stephen Palmisano
- Centre for Psychophysics, Psychophysiology and Psychopharmacology, School of Psychology, University of Wollongong, NSW, Australia
| |
Collapse
|
26
|
Snow JC, Skiba RM, Coleman TL, Berryhill ME. Real-world objects are more memorable than photographs of objects. Front Hum Neurosci 2014; 8:837. [PMID: 25368568 PMCID: PMC4202719 DOI: 10.3389/fnhum.2014.00837] [Citation(s) in RCA: 61] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2014] [Accepted: 10/01/2014] [Indexed: 11/13/2022] Open
Abstract
Research studies in psychology typically use two-dimensional (2D) images of objects as proxies for real-world three-dimensional (3D) stimuli. There are, however, a number of important differences between real objects and images that could influence cognition and behavior. Although human memory has been studied extensively, only a handful of studies have used real objects in the context of memory and virtually none have directly compared memory for real objects vs. their 2D counterparts. Here we examined whether or not episodic memory is influenced by the format in which objects are displayed. We conducted two experiments asking participants to freely recall, and to recognize, a set of 44 common household objects. Critically, the exemplars were displayed to observers in one of three viewing conditions: real-world objects, colored photographs, or black and white line drawings. Stimuli were closely matched across conditions for size, orientation, and illumination. Surprisingly, recall and recognition performance was significantly better for real objects compared to colored photographs or line drawings (for which memory performance was equivalent). We replicated this pattern in a second experiment comparing memory for real objects vs. color photos, when the stimuli were matched for viewing angle across conditions. Again, recall and recognition performance was significantly better for the real objects than matched color photos of the same items. Taken together, our data suggest that real objects are more memorable than pictorial stimuli. Our results highlight the importance of studying real-world object cognition and raise the potential for applied use in developing effective strategies for education, marketing, and further research on object-related cognition.
Collapse
Affiliation(s)
- Jacqueline C Snow
- Cognitive and Brain Sciences Group, Department of Psychology, University of Nevada Reno, NV, USA
| | - Rafal M Skiba
- Cognitive and Brain Sciences Group, Department of Psychology, University of Nevada Reno, NV, USA
| | - Taylor L Coleman
- Cognitive and Brain Sciences Group, Department of Psychology, University of Nevada Reno, NV, USA
| | - Marian E Berryhill
- Cognitive and Brain Sciences Group, Department of Psychology, University of Nevada Reno, NV, USA
| |
Collapse
|
27
|
Spang K, Gillam B, Fahle M. Electrophysiological correlates of binocular stereo depth without binocular disparities. PLoS One 2012; 7:e40562. [PMID: 22876281 PMCID: PMC3410887 DOI: 10.1371/journal.pone.0040562] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2012] [Accepted: 06/09/2012] [Indexed: 11/18/2022] Open
Abstract
A small region of background presented to only one eye in an otherwise binocular display may, under certain conditions, be resolved in the visual system by interpreting the region as a small gap between two similar objects placed at different depths, with the gap hidden in one eye by parallax. This has been called monocular gap stereopsis. We investigated the electrophysiological correlate of this type of stereopsis by means of sum potential recordings in 12 observers, comparing VEP's for this stimulus ("Gillam Stereo", Author BG has strong reservations about this term) with those for similar stimuli containing disparity based depth and with no depth (flat). In addition we included several control stimuli. The results show a pronounced early negative potential at a latency of around 170 ms (N170) for all stimuli containing non- identical elements, be they a difference caused by binocular disparity or by completely unmatched monocular contours. A second negative potential with latency around 270 ms (N270), on the other hand, is present only with stimuli leading to fusion and the perception of depth. This second component is similar for disparity-based stereopsis and monocular gap, or "Gillam Stereo" although slightly more pronounced for the former. We conjecture that the first component is related to the detection of differences between the images of the two eyes that may then either be fused, leading to stereopsis and the corresponding second potential, or else to inhibition and rivalry without a later trace in the VEP. The finding that that "Gillam Stereo" leads to cortical responses at the same short latencies as disparity based stereopsis indicates that it may partly rely on quite early cortical mechanisms.
Collapse
Affiliation(s)
- Karoline Spang
- Centre for Cognitive Science, University of Bremen, Bremen, Germany.
| | | | | |
Collapse
|
28
|
Abstract
We examined the interaction between motion and stereo cues to depth order along object boundaries. Relative depth was conveyed by a change in the speed of image motion across a boundary (motion parallax), the disappearance of features on a surface moving behind an occluding object (motion occlusion), or a difference in the stereo disparity of adjacent surfaces. We compared the perceived depth orders for different combinations of cues, incorporating conditions with conflicting depth orders and conditions with varying reliability of the individual cues. We observed large differences in performance between subjects, ranging from those whose depth order judgments were driven largely by the stereo disparity cues to those whose judgments were dominated by motion occlusion. The relative strength of these cues influenced individual subjects' behavior in conditions of cue conflict and reduced reliability.
Collapse
|
29
|
MITSUDO HIROYUKI. A minimal algorithm for computing the likelihood of binocular correspondence1. JAPANESE PSYCHOLOGICAL RESEARCH 2012. [DOI: 10.1111/j.1468-5884.2011.00504.x] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
30
|
Arnold DH. Why is Binocular Rivalry Uncommon? Discrepant Monocular Images in the Real World. Front Hum Neurosci 2011; 5:116. [PMID: 22028689 PMCID: PMC3199540 DOI: 10.3389/fnhum.2011.00116] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2011] [Accepted: 09/28/2011] [Indexed: 11/13/2022] Open
Abstract
When different images project to corresponding points in the two eyes they can instigate a phenomenon called binocular rivalry (BR), wherein each image seems to intermittently disappear such that only one of the two images is seen at a time. Cautious readers may have noted an important caveat in the opening sentence – this situation can instigate BR, but usually it doesn’t. Unmatched monocular images are frequently encountered in daily life due to either differential occlusions of the two eyes or because of selective obstructions of just one eye, but this does not tend to induce BR. Here I will explore the reasons for this and discuss implications for BR in general. It will be argued that BR is resolved in favor of the instantaneously stronger neural signal, and that this process is driven by an adaptation that enhances the visibility of distant fixated objects over that of more proximate obstructions of an eye. Accordingly, BR would reflect the dynamics of an inherently visual operation that usually deals with real-world constraints.
Collapse
Affiliation(s)
- Derek Henry Arnold
- School of Psychology, The University of Queensland St Lucia, QLD, Australia
| |
Collapse
|
31
|
Zannoli M, Mamassian P. The role of transparency in da Vinci stereopsis. Vision Res 2011; 51:2186-97. [PMID: 21906614 DOI: 10.1016/j.visres.2011.08.014] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2011] [Revised: 08/12/2011] [Accepted: 08/16/2011] [Indexed: 10/17/2022]
Abstract
The majority of natural scenes contains zones that are visible to one eye only. Past studies have shown that these monocular regions can be seen at a precise depth even though there are no binocular disparities that uniquely constrain their locations in depth. In the so-called da Vinci stereopsis configuration, the monocular region is a vertical line placed next to a binocular rectangular occluder. The opacity of the occluder has been mentioned to be a necessary condition to obtain da Vinci stereopsis. However, this opacity constraint has never been empirically tested. In the present study, we tested whether da Vinci stereopsis and perceptual transparency can interact using a classical da Vinci configuration in which the opacity of the occluder varied. We used two different monocular objects: a line and a disk. We found no effect of the opacity of the occluder on the perceived depth of the monocular object. A careful analysis of the distribution of perceived depth revealed that the monocular object was perceived at a depth that increased with the distance between the object and the occluder. The analysis of the skewness of the distributions was not consistent with a double fusion explanation, favoring an implication of occlusion geometry in da Vinci stereopsis. A simple model that includes the geometry of the scene could account for the results. In summary, the mechanism responsible to locate monocular regions in depth is not sensitive to the material properties of objects, suggesting that da Vinci stereopsis is solved at relatively early stages of disparity processing.
Collapse
Affiliation(s)
- Marina Zannoli
- Laboratoire Psychologie de la Perception (CNRS UMR 8158), Université Paris Descartes, France.
| | | |
Collapse
|
32
|
Disparity biasing in depth from monocular occlusions. Vision Res 2011; 51:1699-711. [PMID: 21651926 DOI: 10.1016/j.visres.2011.05.012] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2011] [Revised: 03/31/2011] [Accepted: 05/22/2011] [Indexed: 11/23/2022]
Abstract
Monocular occlusions have been shown to play an important role in stereopsis. Among other contributions to binocular depth perception, monocular occlusions can create percepts of illusory occluding surfaces. It has been argued that the precise location in depth of these illusory occluders is based on the constraints imposed by occlusion geometry. Tsirlin et al. (2010) proposed that when these constraints are weak, the depth of the illusory occluder can be biased by a neighboring disparity-defined feature. In the present work we test this hypothesis using a variety of stimuli. We show that when monocular occlusions provide only partial constraints on the magnitude of depth of the illusory occluders, the perceived depth of the occluders can be biased by disparity-defined features in the direction unrestricted by the occlusion geometry. Using this disparity bias phenomenon we also show that in illusory occluder stimuli where disparity information is present, but weak, most observers rely on disparity while some use occlusion information instead to specify the depth of the illusory occluder. Taken together our experiments demonstrate that in binocular depth perception disparity and monocular occlusion cues interact in complex ways to resolve perceptual ambiguity.
Collapse
|
33
|
Rychkova S, Ninio J. Alternation frequency thresholds for stereopsis as a technique for exploring stereoscopic difficulties. Iperception 2011; 2:50-68. [PMID: 23145225 PMCID: PMC3485774 DOI: 10.1068/i0398] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2010] [Revised: 03/07/2011] [Indexed: 11/11/2022] Open
Abstract
When stereoscopic images are presented alternately to the two eyes, stereopsis occurs at F ≥ 1 Hz full-cycle frequencies for very simple stimuli, and F ≥ 3 Hz full-cycle frequencies for random-dot stereograms (eg Ludwig I, Pieper W, Lachnit H, 2007 “Temporal integration of monocular images separated in time: stereopsis, stereoacuity, and binocular luster” Perception & Psychophysics69 92–102). Using twenty different stereograms presented through liquid crystal shutters, we studied the transition to stereopsis with fifteen subjects. The onset of stereopsis was observed during a stepwise increase of the alternation frequency, and its disappearance was observed during a stepwise decrease in frequency. The lowest F values (around 2.5 Hz) were observed with stimuli involving two to four simple disjoint elements (circles, arcs, rectangles). Higher F values were needed for stimuli containing slanted elements or curved surfaces (about 1 Hz increment), overlapping elements at two different depths (about 2.5 Hz increment), or camouflaged overlapping surfaces (> 7 Hz increment). A textured cylindrical surface with a horizontal axis appeared easier to interpret (5.7 Hz) than a pair of slanted segments separated in depth but forming a cross in projection (8 Hz). Training effects were minimal, and F usually increased as disparities were reduced. The hierarchy of difficulties revealed in the study may shed light on various problems that the brain needs to solve during stereoscopic interpretation. During the construction of the three-dimensional percept, the loss of information due to natural decay of the stimuli traces must be compensated by refreshes of visual input. In the discussion an attempt is made to link our results with recent advances in the comprehension of visual scene memory.
Collapse
Affiliation(s)
- Svetlana Rychkova
- Moscow City Psychological and Pedagogical University, 29 Sretenka St, Moscow 127051, Russia; e-mail:
| | | |
Collapse
|
34
|
Blake R, Wilson H. Binocular vision. Vision Res 2010; 51:754-70. [PMID: 20951722 DOI: 10.1016/j.visres.2010.10.009] [Citation(s) in RCA: 122] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2010] [Revised: 10/05/2010] [Accepted: 10/06/2010] [Indexed: 10/18/2022]
Abstract
This essay reviews major developments - empirical and theoretical - in the field of binocular vision during the last 25years. We limit our survey primarily to work on human stereopsis, binocular rivalry and binocular contrast summation, with discussion where relevant of single-unit neurophysiology and human brain imaging. We identify several key controversies that have stimulated important work on these problems. In the case of stereopsis those controversies include position vs. phase encoding of disparity, dependence of disparity limits on spatial scale, role of occlusion in binocular depth and surface perception, and motion in 3D. In the case of binocular rivalry, controversies include eye vs. stimulus rivalry, role of "top-down" influences on rivalry dynamics, and the interaction of binocular rivalry and stereopsis. Concerning binocular contrast summation, the essay focuses on two representative models that highlight the evolving complexity in this field of study.
Collapse
Affiliation(s)
- Randolph Blake
- Department of Brain and Cognitive Sciences, Seoul National University, Seoul, Republic of Korea.
| | | |
Collapse
|
35
|
Ni R, Chen L, Andersen GJ. Visual constraints for the perception of quantitative depth from temporal interocular unmatched features. Vision Res 2010; 50:1571-80. [DOI: 10.1016/j.visres.2010.05.017] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2009] [Revised: 05/13/2010] [Accepted: 05/15/2010] [Indexed: 11/30/2022]
|
36
|
Vision Research reviews 'vision research'. Vision Res 2009; 49:2635. [PMID: 19878768 DOI: 10.1016/j.visres.2009.10.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
|