1
|
Mikawa Y, Fukiage T. Low-Latency Ocular Parallax Rendering and Investigation of Its Effect on Depth Perception in Virtual Reality. IEEE Trans Vis Comput Graph 2024; 30:2228-2238. [PMID: 38442067 DOI: 10.1109/tvcg.2024.3372078] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/07/2024]
Abstract
With a demand for an immersive experience in virtual/augmented reality (VR/AR) displays, recent efforts have incorporated eye states, such as focus and fixation, into display graphics. Among these, ocular parallax, a small parallax generated by eye rotation, has received considerable attention for its impact on depth perception. However, the substantial latency of head-mounted displays (HMDs) has made it challenging to accurately assess its true effect during free eye movements. To address this issue, we propose a high-speed (360 Hz) and low-latency (4.8 ms) ocular parallax rendering system with a custom-built eye tracker. Using this proposed system, we conducted an investigation to determine the latency requirements necessary for achieving perceptually stable ocular parallax rendering. Our findings indicate that, in binocular viewing, ocular parallax rendering is perceived as significantly less stable than conventional rendering when the latency exceeds 43.72 ms at 1.3 D and 21.50 ms at 2.0 D. We also evaluated the effects of ocular parallax rendering on binocular fusion and monocular depth perception under free viewing conditions. The results demonstrated that ocular parallax rendering can enhance binocular fusion but has a limited impact on depth perception under monocular viewing conditions when latency is minimized.
Collapse
|
2
|
Skog E, Meese TS, Sargent IMJ, Ormerod A, Schofield AJ. Classification images for aerial images capture visual expertise for binocular disparity and a prior for lighting from above. J Vis 2024; 24:11. [PMID: 38607637 PMCID: PMC11019598 DOI: 10.1167/jov.24.4.11] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2023] [Accepted: 02/06/2024] [Indexed: 04/13/2024] Open
Abstract
Using a novel approach to classification images (CIs), we investigated the visual expertise of surveyors for luminance and binocular disparity cues simultaneously after screening for stereoacuity. Stereoscopic aerial images of hedges and ditches were classified in 10,000 trials by six trained remote sensing surveyors and six novices. Images were heavily masked with luminance and disparity noise simultaneously. Hedge and ditch images had reversed disparity on around half the trials meaning hedges became ditch-like and vice versa. The hedge and ditch images were also flipped vertically on around half the trials, changing the direction of the light source and completing a 2 × 2 × 2 stimulus design. CIs were generated by accumulating the noise textures associated with "hedge" and "ditch" classifications, respectively, and subtracting one from the other. Typical CIs had a central peak with one or two negative side-lobes. We found clear differences in the amplitudes and shapes of perceptual templates across groups and noise-type, with experts prioritizing binocular disparity and using this more effectively. Contrariwise, novices used luminance cues more than experts meaning that task motivation alone could not explain group differences. Asymmetries in the luminance CIs revealed individual differences for lighting interpretation, with experts less prone to assume lighting from above, consistent with their training on aerial images of UK scenes lit by a southerly sun. Our results show that (i) dual noise in images can be used to produce simultaneous CI pairs, (ii) expertise for disparity cues does not depend on stereoacuity, (iii) CIs reveal the visual strategies developed by experts, (iv) top-down perceptual biases can be overcome with long-term learning effects, and (v) CIs have practical potential for directing visual training.
Collapse
Affiliation(s)
- Emil Skog
- School of Psychology, College of Health and Life Sciences, Aston University, Birmingham, B4 7ET, UK
- Aston Laboratory for Immersive Virtual Environments, College of Health and Life Sciences, Aston University, Birmingham, B4 7ET, UK
- Department of Health, Learning and Technology, Luleå University of Technology, Luleå, Sweden
| | - Timothy S Meese
- Aston Laboratory for Immersive Virtual Environments, College of Health and Life Sciences, Aston University, Birmingham, B4 7ET, UK
- https://research.aston.ac.uk/en/persons/tim-s-meese
| | - Isabel M J Sargent
- Ordnance Survey, Adanac Drive, Southampton, SO16 0AS, UK
- Electronics and Computer Science, University of Southampton, University Road, Southampton, SO17 1BJ, UK
- http://www.os.uk/
| | - Andrew Ormerod
- Ordnance Survey, Adanac Drive, Southampton, SO16 0AS, UK
- http://www.os.uk/
| | - Andrew J Schofield
- School of Psychology, College of Health and Life Sciences, Aston University, Birmingham, B4 7ET, UK
- Aston Laboratory for Immersive Virtual Environments, College of Health and Life Sciences, Aston University, Birmingham, B4 7ET, UK
- https://research.aston.ac.uk/en/persons/andrew-schofield
| |
Collapse
|
3
|
Szekely B, Keys R, MacNeilage P, Alais D. Short communication: Binocular rivalry dynamics during locomotion. PLoS One 2024; 19:e0300222. [PMID: 38558003 PMCID: PMC10984525 DOI: 10.1371/journal.pone.0300222] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2023] [Accepted: 02/25/2024] [Indexed: 04/04/2024] Open
Abstract
Locomotion has been shown to impact aspects of visual processing in both humans and animal models. In the current study, we assess the impact of locomotion on the dynamics of binocular rivalry. We presented orthogonal gratings, one contrast-modulating at 0.8 Hz (matching average step frequency) and the other at 3.2 Hz, to participants using a virtual reality headset. We compared two conditions: stationary and walking. We continuously monitored participants' foot position using tracking devices to measure the step cycle. During the walking condition, participants viewed the rivaling gratings for 60-second trials while walking on a circular path in a virtual reality environment. During the stationary condition, observers viewed the same stimuli and environment while standing still. The task was to continuously indicate the dominant percept via button press using handheld controllers. We found no significant differences between walking and standing for normalized dominance duration distributions, mean normalized dominance distributions, mean alternation rates, or mean fitted frequencies. Although our findings do not align with prior research highlighting distinctions in normalized dominance distributions between walking and standing, our study contributes unique evidence indicating that alternation rates vary across the step cycle. Specifically, we observed that the number of alternations is at its lowest during toe-off phases and reaches its peak at heel strike. This novel insight enhances our understanding of the dynamic nature of alternation patterns throughout the step cycle.
Collapse
Affiliation(s)
- Brian Szekely
- Department of Psychology, University of Nevada, Reno, Reno, Nevada, United States of America
| | - Robert Keys
- School of Psychology, University of Sydney, Sydney, New South Wales, Australia
| | - Paul MacNeilage
- Department of Psychology, University of Nevada, Reno, Reno, Nevada, United States of America
| | - David Alais
- School of Psychology, University of Sydney, Sydney, New South Wales, Australia
| |
Collapse
|
4
|
Ding J, Lu HH, Levi DM. Absolute and relative disparity mechanisms revealed by an equivalent noise analysis. Sci Rep 2024; 14:6863. [PMID: 38514715 PMCID: PMC10958039 DOI: 10.1038/s41598-024-57406-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2023] [Accepted: 03/18/2024] [Indexed: 03/23/2024] Open
Abstract
The precision of stereopsis and vergence are ultimately limited by internal binocular disparity noise. Here we propose an equivalent noise model with both global and local internal disparity noises to provide a unified explanation of both absolute and relative disparity thresholds. To test this model, we developed a psychophysical procedure to measure the equivalent internal disparity noise by adding external disparity noise to random-Gabor-patch stereograms. We used the method of constant stimuli to measure the minimum and maximum disparity thresholds (Dmin and Dmax) for both absolute and relative disparity. Consistent with previous studies, we found that Dmin thresholds are substantially worse for absolute disparity than for relative disparity. We tested three relative disparity mechanisms: (1) the difference between the monocular separations of targets projecting to the two eyes; (2) the direct measurement of relative disparity; and (3) the difference of absolute disparities of targets. Computing the difference of absolute disparities when detecting relative disparity, Mechanism 3 cancels global noise, resulting in a much lower relative Dmin threshold, and provides a reasonable fit to the experimental data. We also found that the presence of as much as 2400 arcsec of external disparity noise does not appear to affect the Dmax threshold. This observation suggests that Dmax is implicated in a mechanism that disregards the disparity variance of individual items, relying instead on the average disparity across all items, supporting the depth model proposed in our previous study (Ding & Levi, 2021), which posits distinct mechanisms governing Dmin and Dmax thresholds.
Collapse
Affiliation(s)
- Jian Ding
- Herbert Wertheim School of Optometry and Vision Science and the Helen Wills Neuroscience Institute, University of California, Berkeley, Berkeley, CA, 94720-2020, USA.
| | - Hilary H Lu
- Herbert Wertheim School of Optometry and Vision Science and the Helen Wills Neuroscience Institute, University of California, Berkeley, Berkeley, CA, 94720-2020, USA
| | - Dennis M Levi
- Herbert Wertheim School of Optometry and Vision Science and the Helen Wills Neuroscience Institute, University of California, Berkeley, Berkeley, CA, 94720-2020, USA
| |
Collapse
|
5
|
Matsuda Y, Aida S, Shimono K. Effect of 3-D depth structure, element size, and area containing elements on total-element overestimation phenomenon. PLoS One 2024; 19:e0299307. [PMID: 38412148 PMCID: PMC10898729 DOI: 10.1371/journal.pone.0299307] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2023] [Accepted: 02/08/2024] [Indexed: 02/29/2024] Open
Abstract
The number of elements distributed in a three-dimensional stimulus is overestimated compared to a two-dimensional stimulus when both stimuli have the same number of elements. We examined the effect of the properties of a three-dimensional stimulus (the number of overlapping stereo surfaces, size of the elements, and size of the area containing elements, on the overestimation phenomenon in four experiments. The two stimuli were presented side-by-side with the same diameters. Observers judged which of the three-dimensional standard and two-dimensional comparison had more elements. The results showed that (a) the overestimation phenomenon occurred for the three-dimensional standard stimuli, (b) the size of the areas affected the amount of overestimation, while the number of overlapping stereo surfaces and size of elements did not, and (c) the amount of overestimation increased when the stimuli included more than 100 elements. Implications of these findings were discussed in the framework of back-surface bias, occlusion, and disparity-processing interference models.
Collapse
Affiliation(s)
- Yusuke Matsuda
- Department of Applied Information Engineering, Faculty of Engineering, Suwa University of Science, Chino, Nagano, Japan
| | - Saori Aida
- Graduate School of Sciences and Technology for Innovation, Yamaguchi University, Ube, Yamaguchi, Japan
| | - Koichi Shimono
- Department of Logistics and Information Engineering, Tokyo University of Marine Science and Technology, Koto-ku, Tokyo, Japan
| |
Collapse
|
6
|
Maehara G, Wang Y, Murakami I. Effects of binocular disparity on binocular luminance combination. J Vis 2024; 24:4. [PMID: 38376853 PMCID: PMC10883336 DOI: 10.1167/jov.24.2.4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/21/2024] Open
Abstract
This study aimed to examine the effects of binocular disparity on binocular combination of brightness information coming from luminance increments and decrements. The point of subjective equality was determined by asking the observers to judge which stimulus appeared brighter-a bar stimulus with variable disparity or another stimulus with zero disparity. For the bar stimulus, the interocular luminance ratio was varied to trace an equal brightness curve. Binocular disparity had no effect on luminance increments presented on a gray or black background. In contrast, when luminance decrements were presented on a gray background, non-zero disparities elevated points of subjective equality for stimuli with interocular luminance differences. This means that the binocular brightness combination of the two monocular signals shifted from winner-take-all summation toward linear averaging. It has been argued that this effect may be caused by non-zero binocular disparities attenuating interocular suppression, which is deemed to operate normally with zero disparity.
Collapse
Affiliation(s)
- Goro Maehara
- Department of Human Science, Kanagawa University, Kanagawa, Japan
| | - Yiqian Wang
- Department of Psychology, The University of Tokyo, Tokyo, Japan
| | - Ikuya Murakami
- Department of Psychology, The University of Tokyo, Tokyo, Japan
| |
Collapse
|
7
|
Manning TS, Alexander E, Cumming BG, DeAngelis GC, Huang X, Cooper EA. Transformations of sensory information in the brain suggest changing criteria for optimality. PLoS Comput Biol 2024; 20:e1011783. [PMID: 38206969 PMCID: PMC10807827 DOI: 10.1371/journal.pcbi.1011783] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2023] [Revised: 01/24/2024] [Accepted: 12/22/2023] [Indexed: 01/13/2024] Open
Abstract
Neurons throughout the brain modulate their firing rate lawfully in response to sensory input. Theories of neural computation posit that these modulations reflect the outcome of a constrained optimization in which neurons aim to robustly and efficiently represent sensory information. Our understanding of how this optimization varies across different areas in the brain, however, is still in its infancy. Here, we show that neural sensory responses transform along the dorsal stream of the visual system in a manner consistent with a transition from optimizing for information preservation towards optimizing for perceptual discrimination. Focusing on the representation of binocular disparities-the slight differences in the retinal images of the two eyes-we re-analyze measurements characterizing neuronal tuning curves in brain areas V1, V2, and MT (middle temporal) in the macaque monkey. We compare these to measurements of the statistics of binocular disparity typically encountered during natural behaviors using a Fisher Information framework. The differences in tuning curve characteristics across areas are consistent with a shift in optimization goals: V1 and V2 population-level responses are more consistent with maximizing the information encoded about naturally occurring binocular disparities, while MT responses shift towards maximizing the ability to support disparity discrimination. We find that a change towards tuning curves preferring larger disparities is a key driver of this shift. These results provide new insight into previously-identified differences between disparity-selective areas of cortex and suggest these differences play an important role in supporting visually-guided behavior. Our findings emphasize the need to consider not just information preservation and neural resources, but also relevance to behavior, when assessing the optimality of neural codes.
Collapse
Affiliation(s)
- Tyler S. Manning
- Herbert Wertheim School of Optometry & Vision Science, University of California, Berkeley
| | - Emma Alexander
- Department of Computer Science, Northwestern University, Illinois, United States of America
| | - Bruce G. Cumming
- Laboratory of Sensorimotor Research, National Eye Institute, National Institutes of Health, Maryland, United States of America
| | - Gregory C. DeAngelis
- Department of Brain and Cognitive Sciences, University of Rochester, New York, United States of America
| | - Xin Huang
- Department of Neuroscience, University of Wisconsin, Madison
| | - Emily A. Cooper
- Herbert Wertheim School of Optometry & Vision Science, University of California, Berkeley
- Helen Wills Neuroscience Institute, University of California, Berkeley
| |
Collapse
|
8
|
Nie S, Katyal S, Engel SA. An Accumulating Neural Signal Underlying Binocular Rivalry Dynamics. J Neurosci 2023; 43:8777-8784. [PMID: 37907256 PMCID: PMC10727184 DOI: 10.1523/jneurosci.1325-23.2023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2023] [Revised: 09/06/2023] [Accepted: 10/23/2023] [Indexed: 11/02/2023] Open
Abstract
During binocular rivalry, conflicting images are presented one to each eye and perception alternates stochastically between them. Despite stable percepts between alternations, modeling suggests that neural signals representing the two images change gradually, and that the duration of stable percepts are determined by the time required for these signals to reach a threshold that triggers an alternation. However, direct physiological evidence for such signals has been lacking. Here, we identify a neural signal in the human visual cortex that shows these predicted properties. We measured steady-state visual evoked potentials (SSVEPs) in 84 human participants (62 females, 22 males) who were presented with orthogonal gratings, one to each eye, flickering at different frequencies. Participants indicated their percept while EEG data were collected. The time courses of the SSVEP amplitudes at the two frequencies were then compared across different percept durations, within participants. For all durations, the amplitude of signals corresponding to the suppressed stimulus increased and the amplitude corresponding to the dominant stimulus decreased throughout the percept. Critically, longer percepts were characterized by more gradual increases in the suppressed signal and more gradual decreases of the dominant signal. Changes in signals were similar and rapid at the end of all percepts, presumably reflecting perceptual transitions. These features of the SSVEP time courses are well predicted by a model in which perceptual transitions are produced by the accumulation of noisy signals. Identification of this signal underlying binocular rivalry should allow strong tests of neural models of rivalry, bistable perception, and neural suppression.SIGNIFICANCE STATEMENT During binocular rivalry, two conflicting images are presented to the two eyes and perception alternates between them, with switches occurring at seemingly random times. Rivalry is an important and longstanding model system in neuroscience, used for understanding neural suppression, intrinsic neural dynamics, and even the neural correlates of consciousness. All models of rivalry propose that it depends on gradually changing neural activity that on reaching some threshold triggers the perceptual switches. This manuscript reports the first physiological measurement of neural signals with that set of properties in human participants. The signals, measured with EEG in human observers, closely match the predictions of recent models of rivalry, and should pave the way for much future work.
Collapse
Affiliation(s)
- Shaozhi Nie
- Department of Psychology, University of Minnesota Twin Cities, Minneapolis, Minnesota 55455
| | - Sucharit Katyal
- Department of Psychology, University of Minnesota Twin Cities, Minneapolis, Minnesota 55455
- Max Planck UCL Centre for Computational Psychiatry and Ageing Research, University College London, London, WC1B 5EH, United Kingdom
| | - Stephen A Engel
- Department of Psychology, University of Minnesota Twin Cities, Minneapolis, Minnesota 55455
| |
Collapse
|
9
|
Wilcox L. Invited Session V: Binocular vision and interactions: Depth perception in virtual environments: The role of experience. J Vis 2023; 23:19. [PMID: 38109629 DOI: 10.1167/jov.23.15.19] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2023] Open
Abstract
In this presentation I'll review results of stereoacuity, disparity matching and depth magnitude estimation studies in which comparison of so-called naïve and experienced observers shows substantive differences in performance. I will describe our working theory that the critical difference between these groups is their tolerance to conflicts between stereopsis and other sources of depth information. Further, I will review some recent data that suggest there are some conflicts that even experienced 3D participants cannot disregard.
Collapse
|
10
|
Shi Y, Zhang J, Lin W, Chung-Fat-Yim A, Yang Q, Li H. The effect of training on sensitivity and stability of double fusion in Panum's limiting case. Atten Percept Psychophys 2023; 85:2894-2906. [PMID: 37831363 DOI: 10.3758/s13414-023-02795-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/14/2023] [Indexed: 10/14/2023]
Abstract
Panum's limiting case is a phenomenon of monocular occlusion in binocular vision. This occurs when one object is occluded by the other object for one eye, but the two objects are both visible for the other eye. Although previous studies have found that vertical gradient of horizontal disparity and cue conflict are two important factors for double fusion, the effect of training on the sensitivity and stability of Panum's limiting case remains unknown. The current study trained 26 participants for 5 days with several of Panum's configurations (Gilliam, Frisby, and Wang series). The latency and duration of double fusion were recorded to examine the effects of training on sensitivity and stability of double fusion in Panum's limiting case. For each level of vertical gradient of horizontal disparity and cue conflict, the latency of double fusion decreased and the duration of double fusion increased with each additional training session. The results showed that vertical gradient of horizontal disparity and cue conflict interacted, and the duration of high cue conflict was significantly shorter than that of medium and low cue conflict for each level of vertical gradient of horizontal disparity. The findings suggest that there is an effect of training for vertical gradient of horizontal disparity and cue conflict in Panum's limiting case, and that the three factors jointly affect the sensitivity and stability of double fusion.
Collapse
Affiliation(s)
- Yuyu Shi
- School of Psychology, Zhejiang Normal University, Jinhua, 321004, China
- Key Laboratory of Intelligent Education Technology and Application, Zhejiang Normal University, Jinhua, China
| | - Jiaxi Zhang
- School of Psychology, Zhejiang Normal University, Jinhua, 321004, China
- Key Laboratory of Intelligent Education Technology and Application, Zhejiang Normal University, Jinhua, China
| | - Wenmin Lin
- School of English Studies, Shanghai International Studies University, Shanghai, China
| | - Ashley Chung-Fat-Yim
- Department of Communication Sciences and Disorders, Northwestern University, Evanston, IL, USA
| | - Qihang Yang
- College of Foreign Language, Zhejiang Normal University, Jinhua, China
| | - Huayun Li
- School of Psychology, Zhejiang Normal University, Jinhua, 321004, China.
- Key Laboratory of Intelligent Education Technology and Application, Zhejiang Normal University, Jinhua, China.
| |
Collapse
|
11
|
Chen PY, Chen CC, Nishida S. Coarse-to-fine interaction on perceived depth in compound grating. J Vis 2023; 23:5. [PMID: 37856108 PMCID: PMC10593133 DOI: 10.1167/jov.23.12.5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2023] [Accepted: 09/11/2023] [Indexed: 10/20/2023] Open
Abstract
To encode binocular disparity, the visual system uses a pair of left eye and right eye bandpass filters with either a position or a phase offset between them. Such pairs are considered to exit at multiple scales to encode a wide range of disparity. However, local disparity measurements by bandpass mechanisms can be ambiguous, particularly when the actual disparity is larger than a half-cycle of the preferred spatial frequency of the filter, which often occurs in fine scales. In this study, we investigated whether the visual system uses a coarse-to-fine interaction to resolve this ambiguity at finer scales for depth estimation from disparity. The stimuli were stereo grating patches composed of a target and comparison patterns. The target patterns contained spatial frequencies of 1 and 4 cycles per degree (cpd). The phase disparity of the low-frequency component was 0° (at the horopter), -90° (uncrossed), or 90° (crossed), and that of the high-frequency components was changed independent of the low-frequency disparity, in the range between -90° (uncrossed) and 90° (crossed). The observers' task was to indicate whether the target appeared closer to the comparison pattern, which always shared the disparity with the low-frequency component of the target. Regardless of whether the comparison pattern was a 1-cpd + 4-cpd compound or a 1-cpd simple grating, the perceived depth order of the target and the comparison varied in accordance with the phase disparity of the high-frequency component of the target. This effect occurred not only when the low-frequency component was at the horopter, but also when it contained a large disparity corresponding to one cycle of the high-frequency component (±90°). Our findings suggest a coarse-to-fine interaction in multiscale disparity processing, in which the depth interpretation of the high-frequency changes based on the disparity of the low-frequency component.
Collapse
Affiliation(s)
- Pei-Yin Chen
- Department of Psychology, National Taiwan University, Taipei, Taiwan
- Department of Intelligence Science and Technology, Graduate School of Informatics, Kyoto University, Kyoto, Japan
| | - Chien-Chung Chen
- Department of Psychology, National Taiwan University, Taipei, Taiwan
- Center for Neurobiology and Cognitive Science, National Taiwan University, Taipei, Taiwan
| | - Shin'ya Nishida
- Department of Intelligence Science and Technology, Graduate School of Informatics, Kyoto University, Kyoto, Japan
- NTT Communication Science Laboratories, Nippon Telegraph and Telephone Corporation, Tokyo, Japan
| |
Collapse
|
12
|
Andoh JE, Ezekwesili AC, Nwanyanwu K, Elam A. Disparities in Eye Care Access and Utilization: A Narrative Review. Annu Rev Vis Sci 2023; 9:15-37. [PMID: 37254050 DOI: 10.1146/annurev-vision-112122-020934] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/01/2023]
Abstract
This narrative review summarizes the literature on factors related to eye care access and utilization in the United States. Using the Healthy People 2030 framework, this review investigates social determinants of health associated with general and follow-up engagement, screenings, diagnostic visits, treatment, technology, and teleophthalmology. We provide hypotheses for these documented eye care disparities, featuring qualitative, patient-centered research. Lastly, we provide recommendations in the hopes of appropriately eliminating these disparities and reimagining eye care.
Collapse
Affiliation(s)
- Joana E Andoh
- Department of Ophthalmology, Wilmer Eye Institute, Johns Hopkins School of Medicine, Baltimore, Maryland, USA
| | - Agnes C Ezekwesili
- Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| | - Kristen Nwanyanwu
- Department of Ophthalmology and Visual Science, Yale School of Medicine, New Haven, Connecticut, USA
| | - Angela Elam
- Department of Ophthalmology, WK Kellogg Eye Center, University of Michigan, Ann Arbor, Michigan, USA;
| |
Collapse
|
13
|
Jiang R, Meng M. Integration and suppression interact in binocular vision. J Vis 2023; 23:17. [PMID: 37750747 PMCID: PMC10541237 DOI: 10.1167/jov.23.10.17] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2023] [Accepted: 08/25/2023] [Indexed: 09/27/2023] Open
Abstract
Contingent on stereo compatibility, two images presented dichoptically can lead to either binocular integration, thus generating stable stereopsis, or interocular suppression that induces binocular rivalry with bistable perception that alternates between the two images. The relationship between binocular integration and interocular suppression concerns how our brain processes binocular inputs to form unified visual awareness but remains unclear. Here, a series of psychophysical experiments were conducted to address this question, revealing that these collaborative and competitive binocular interactions are interconnected and would mediate one another according to their strength. Specifically, Experiments 1a and 1b showed that the presence of binocular rivalry inhibited peripheral stereopsis, significantly elevating the stereo threshold, with higher elevation resulting from increasing rivalry contrast. Experiments 2a and 2b showed that existing stereopsis with increasing binocular disparity balanced the dynamics of peripheral binocular rivalry, rendering more equivalent eye dominance. Based on these interactions, we suggest that binocular integration and interocular suppression may mediate one another through an overlapping mechanism for regulating eye dominance, with strong stereo percepts tending to reduce eye dominance and strong rivalry tending to increase eye dominance.
Collapse
Affiliation(s)
- Rong Jiang
- Philosophy and Social Science Laboratory of Reading and Development in Children and Adolescents (South China Normal University), Ministry of Education, China
| | - Ming Meng
- Philosophy and Social Science Laboratory of Reading and Development in Children and Adolescents (South China Normal University), Ministry of Education, China
- School of Psychology, South China Normal University, Guangzhou, China
| |
Collapse
|
14
|
Fogt N. Topical Review: Methodological Variables in Clinical and Laboratory Measurements of Fixation Disparity. Optom Vis Sci 2023; 100:572-594. [PMID: 37436811 DOI: 10.1097/opx.0000000000002041] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/13/2023] Open
Abstract
SIGNIFICANCE Fixation disparity is a small vergence error that does not disrupt fusion. Fixation disparity measures correlate with binocular symptoms. This article covers methodological differences between clinical fixation disparity measurement devices, findings when objective and subjective fixation disparities are compared, and the potential impact of binocular capture on fixation disparity measurements. Fixation disparity is a small vergence error that occurs in nonstrabismic individuals and does not disrupt fusion. This article reviews clinical fixation disparity variables and their clinical diagnostic value. Clinical devices that are used to measure these variables are described, as are studies in which the output from these devices has been compared. Methodological differences between the devices such as the location of the fusional stimulus, the rate at which judgments of dichoptic alignment are made, and the strength of the accommodative stimulus are all considered. In addition, the article covers theories of the neural origins of fixation disparity and control system models incorporating fixation disparity. Studies in which objective fixation disparities (oculomotor portion of fixation disparity assessed with an eye tracker) and subjective fixation disparities (sensory portion of fixation disparity assessed psychophysically with dichoptic Nonius lines) have been compared are also examined, and consideration is given to why some investigators find differences in these measures, whereas other investigators do not. The conclusion thus far is that there are likely complex interactions between vergence adaptation, accommodation, and the location of the fusional stimulus that lead to differences in objective and subjective fixation disparity measures. Finally, capture of the visual direction of monocular stimuli by adjacent fusional stimuli and the implications for fixation disparity measures are considered.
Collapse
|
15
|
Zhang K, Qian J. Top-down modulation on depth processing: Visual searches for metric and ordinal depth information show a pattern of dissociation. Psychon Bull Rev 2023; 30:1380-1387. [PMID: 36510093 DOI: 10.3758/s13423-022-02232-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/27/2022] [Indexed: 12/14/2022]
Abstract
Depending on the goal, one can selectively process the metric depth or the ordinal depth information in the same scene. It is unknown whether the metric depth and ordinal depth information are processed through a shared or different underlying mechanisms. Here, we investigated the processing of the metric depth and ordinal depth using visual search. Items were presented at multiple depth planes defined by the binocular disparity, with one item per depth plane. In the metric-search task, participants were required to search for the target on a particular depth plane, among one to three distractors. In the ordinal-search task, the target was specified by its depth order indicated by numbers (smaller numbers indicated nearer depth planes). We found that the ordinal search was faster and more accurate than the metric search, and the data showed a pattern of dissociation. Metric search, but not ordinal search, was slowed when the target and distractors were closer in depth, while ordinal search was slower for the middle than the edge positions but metric search was unaffected. These two opposite effects suggest that metric depth and ordinal depth may be processed differently.
Collapse
Affiliation(s)
- Ke Zhang
- Department of Psychology, Sun Yat-Sen University, Guangzhou, 510006, China
- Department of Psychology, Shaoxing University, Shaoxing, China
| | - Jiehui Qian
- Department of Psychology, Sun Yat-Sen University, Guangzhou, 510006, China.
| |
Collapse
|
16
|
Cravo MI, Bernardes R, Castelo-Branco M. Subtractive adaptation is a more effective and general mechanism in binocular rivalry than divisive adaptation. J Vis 2023; 23:18. [PMID: 37505915 PMCID: PMC10405863 DOI: 10.1167/jov.23.7.18] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2023] [Accepted: 06/17/2023] [Indexed: 07/30/2023] Open
Abstract
The activity of neurons is influenced by random fluctuations and can be strongly modulated by firing rate adaptation, particularly in sensory systems. Still, there is ongoing debate about the characteristics of neuronal noise and the mechanisms of adaptation, and even less is known about how exactly they affect perception. Noise and adaptation are critical in binocular rivalry, a visual phenomenon where two images compete for perceptual dominance. Here, we investigated the effects of different noise processes and adaptation mechanisms on visual perception by simulating a model of binocular rivalry with Gaussian white noise, Ornstein-Uhlenbeck noise, and pink noise, in variants with divisive adaptation, subtractive adaptation, and without adaptation. By simulating the nine models in parameter space, we find that white noise only produces rivalry when paired with subtractive adaptation and that subtractive adaptation reduces the influence of noise intensity on rivalry strength and introduces convergence of the mean percept duration, an important metric of binocular rivalry, across all noise processes. In sum, our results show that white noise is an insufficient description of background activity in the brain and that subtractive adaptation is a stronger and more general switching mechanism in binocular rivalry than divisive adaptation, with important noise-filtering properties.
Collapse
Affiliation(s)
- Maria Inês Cravo
- Coimbra Institute for Biomedical Imaging and Translational Research (CIBIT), University of Coimbra, Coimbra, Portugal
- Institute of Nuclear Sciences Applied to Health (ICNAS), University of Coimbra, Coimbra, Portugal
| | - Rui Bernardes
- Coimbra Institute for Biomedical Imaging and Translational Research (CIBIT), University of Coimbra, Coimbra, Portugal
- Institute of Nuclear Sciences Applied to Health (ICNAS), University of Coimbra, Coimbra, Portugal
- Faculty of Medicine, University of Coimbra, Coimbra, Portugal
| | - Miguel Castelo-Branco
- Coimbra Institute for Biomedical Imaging and Translational Research (CIBIT), University of Coimbra, Coimbra, Portugal
- Institute of Nuclear Sciences Applied to Health (ICNAS), University of Coimbra, Coimbra, Portugal
- Faculty of Medicine, University of Coimbra, Coimbra, Portugal
- Brain Imaging Network of Portugal, Portugal
| |
Collapse
|
17
|
Kemp JT, Cesanek E, Domini F. Perceiving depth from texture and disparity cues: Evidence for a non-probabilistic account of cue integration. J Vis 2023; 23:13. [PMID: 37486299 PMCID: PMC10382782 DOI: 10.1167/jov.23.7.13] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2022] [Accepted: 06/12/2023] [Indexed: 07/25/2023] Open
Abstract
Bayesian inference theories have been extensively used to model how the brain derives three-dimensional (3D) information from ambiguous visual input. In particular, the maximum likelihood estimation (MLE) model combines estimates from multiple depth cues according to their relative reliability to produce the most probable 3D interpretation. Here, we tested an alternative theory of cue integration, termed the intrinsic constraint (IC) theory, which postulates that the visual system derives the most stable, not most probable, interpretation of the visual input amid variations in viewing conditions. The vector sum model provides a normative approach for achieving this goal where individual cue estimates are components of a multidimensional vector whose norm determines the combined estimate. Individual cue estimates are not accurate but related to distal 3D properties through a deterministic mapping. In three experiments, we show that the IC theory can more adeptly account for 3D cue integration than MLE models. In Experiment 1, we show systematic biases in the perception of depth from texture and depth from binocular disparity. Critically, we demonstrate that the vector sum model predicts an increase in perceived depth when these cues are combined. In Experiment 2, we illustrate the IC theory radical reinterpretation of the just noticeable difference (JND) and test the related vector sum model prediction of the classic finding of smaller JNDs for combined-cue versus single-cue stimuli. In Experiment 3, we confirm the vector sum prediction that biases found in cue integration experiments cannot be attributed to flatness cues, as the MLE model predicts.
Collapse
Affiliation(s)
- Jovan T Kemp
- Department of Cognitive, Linguistic, and Psychological Sciences, Brown University, Providence, RI, USA
| | - Evan Cesanek
- Mortimer B. Zuckerman Mind Brain Behavior Institute, Columbia University, New York, NY, USA
| | - Fulvio Domini
- Department of Cognitive, Linguistic, and Psychological Sciences, Brown University, Providence, RI, USA
- Italian Institute of Technology, Rovereto, Italy
| |
Collapse
|
18
|
Abdul-Kabir M, Acquah EA, Quainoo EJ. Fixation disparity and refractive error among first-year optometry students. J Optom 2023; 16:100-106. [PMID: 35690554 PMCID: PMC10104788 DOI: 10.1016/j.optom.2022.04.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/10/2022] [Revised: 03/09/2022] [Accepted: 04/20/2022] [Indexed: 06/15/2023]
Abstract
PURPOSE To determine the fixation disparity and refractive error of first-year optometry students to ascertain any relationship between them and also identify any association between fixation disparity and visual symptoms at near. METHOD It was an analytical cross-sectional study involving 85 participants aged 17 to 27 years (18.60 ± 1.37), 41% of whom were males. Subjective refraction was done at 3 m and fixation disparity was measured with and without spectacle correction using the Wesson Fixation Disparity Card. All analysis was set within a 95% confidence interval with a p-value ≤ 0.05 considered statistically significant. RESULTS Refractive error ranged from 0.25 SEQ (spherical equivalent) to 5.50 SEQ. Mean fixation disparity ranged from 2.9 ± 2.6 to 3.9 ± 2.8 min arc. There was no statistically significant correlation between refractive error and fixation disparity without correction (r = -0.180, p = 0.098) and with correction (r = 0.155, p = 0.157). For fixation disparity in the ortho and exo direction, mean fixation disparity with correction of participants who experienced headaches during or after reading (5.1 ± 2.6 min arc) was significantly higher (p = 0.032) than participants who did not (2.0 ± 2.6 min arc). CONCLUSION Myopia is common among first-year optometry students. Refractive error has no significant effect on fixation disparity. Headache is significantly associated with exo fixation disparity at near.
Collapse
Affiliation(s)
- Mohammed Abdul-Kabir
- Department of Optometry and Visual Science, Kwame Nkrumah University of Science and Technology, Kumasi, Ghana
| | - Eldrick Adu Acquah
- Department of Optometry and Visual Science, Kwame Nkrumah University of Science and Technology, Kumasi, Ghana
| | - Ebenezer Justice Quainoo
- Department of Optometry and Visual Science, Kwame Nkrumah University of Science and Technology, Kumasi, Ghana.
| |
Collapse
|
19
|
Mo C, Lu J, Shi C, Fang F. Neural representations of competing stimuli along the dorsal and ventral visual pathways during binocular rivalry. Cereb Cortex 2023; 33:2734-2747. [PMID: 35689650 DOI: 10.1093/cercor/bhac238] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2022] [Revised: 05/07/2022] [Accepted: 05/08/2022] [Indexed: 11/14/2022] Open
Abstract
Binocular rivalry arises when two discrepant stimuli are simultaneously presented to different eyes, during which observers consciously experience vivid perceptual alternations without physical changes in visual inputs. Neural dynamics tracking such perceptual alternations have been identified at both early and late visual areas, leading to the fundamental debate concerning the primary neural substrate underlying binocular rivalry. One promising hypothesis that might reconcile these seemingly paradoxical findings is a gradual shift from interocular competition between monocular neurons to pattern competition among binocular neurons. Here, we examined this hypothesis by investigating how neural representations of rivalrous stimuli evolved along the visual pathway. We found that representations of the dominant and the suppressed stimuli initially co-existed in V1, which were enhanced and attenuated respectively in extrastriate visual areas. Notably, neural activity in V4 was dictated by the representation of the dominant stimulus, while the representation of the suppressed stimulus was only partially inhibited in dorsal areas V3A and MT+. Our findings revealed a progressive transition from the co-existing representations of the rivalrous inputs to the dictatorial representation of the dominant stimulus in the ventral pathway, and advocated different cortical evolutionary patterns of visual representations between the dorsal and the ventral pathways.
Collapse
Affiliation(s)
- Ce Mo
- Department of Psychology, Sun Yat-Sen University, Guangzhou 510006, Guangdong, China
| | - Junshi Lu
- School of Psychological and Cognitive Sciences and Beijing Key Laboratory of Behavior and Mental Health, Peking University, Beijing 100087, China
- IDG/McGovern Institute for Brain Research, Peking University, Beijing 100087, China
| | - Chao Shi
- School of Psychological and Cognitive Sciences and Beijing Key Laboratory of Behavior and Mental Health, Peking University, Beijing 100087, China
- IDG/McGovern Institute for Brain Research, Peking University, Beijing 100087, China
| | - Fang Fang
- School of Psychological and Cognitive Sciences and Beijing Key Laboratory of Behavior and Mental Health, Peking University, Beijing 100087, China
- IDG/McGovern Institute for Brain Research, Peking University, Beijing 100087, China
- Peking-Tsinghua Center for Life Sciences, Peking University, Beijing 100087, China
| |
Collapse
|
20
|
Haenelt D, Trampel R, Nasr S, Polimeni JR, Tootell RBH, Sereno MI, Pine KJ, Edwards LJ, Helbling S, Weiskopf N. High-resolution quantitative and functional MRI indicate lower myelination of thin and thick stripes in human secondary visual cortex. eLife 2023; 12:e78756. [PMID: 36888685 PMCID: PMC9995117 DOI: 10.7554/elife.78756] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2022] [Accepted: 02/08/2023] [Indexed: 03/09/2023] Open
Abstract
The characterization of cortical myelination is essential for the study of structure-function relationships in the human brain. However, knowledge about cortical myelination is largely based on post-mortem histology, which generally renders direct comparison to function impossible. The repeating pattern of pale-thin-pale-thick stripes of cytochrome oxidase (CO) activity in the primate secondary visual cortex (V2) is a prominent columnar system, in which histology also indicates different myelination of thin/thick versus pale stripes. We used quantitative magnetic resonance imaging (qMRI) in conjunction with functional magnetic resonance imaging (fMRI) at ultra-high field strength (7 T) to localize and study myelination of stripes in four human participants at sub-millimeter resolution in vivo. Thin and thick stripes were functionally localized by exploiting their sensitivity to color and binocular disparity, respectively. Resulting functional activation maps showed robust stripe patterns in V2 which enabled further comparison of quantitative relaxation parameters between stripe types. Thereby, we found lower longitudinal relaxation rates (R1) of thin and thick stripes compared to surrounding gray matter in the order of 1-2%, indicating higher myelination of pale stripes. No consistent differences were found for effective transverse relaxation rates (R2*). The study demonstrates the feasibility to investigate structure-function relationships in living humans within one cortical area at the level of columnar systems using qMRI.
Collapse
Affiliation(s)
- Daniel Haenelt
- Department of Neurophysics, Max Planck Institute for Human Cognitive and Brain SciencesLeipzigGermany
- International Max Planck Research School on Neuroscience of Communication: Function, Structure, and PlasticityLeipzigGermany
| | - Robert Trampel
- Department of Neurophysics, Max Planck Institute for Human Cognitive and Brain SciencesLeipzigGermany
| | - Shahin Nasr
- Athinoula A. Martinos Center for Biomedical Imaging, Massachusetts General HospitalCharlestownUnited States
- Department of Radiology, Harvard Medical SchoolBostonUnited States
| | - Jonathan R Polimeni
- Athinoula A. Martinos Center for Biomedical Imaging, Massachusetts General HospitalCharlestownUnited States
- Department of Radiology, Harvard Medical SchoolBostonUnited States
- Harvard-MIT Division of Health Sciences and Technology, Massachusetts Institute of TechnologyCambridgeUnited States
| | - Roger BH Tootell
- Athinoula A. Martinos Center for Biomedical Imaging, Massachusetts General HospitalCharlestownUnited States
- Department of Radiology, Harvard Medical SchoolBostonUnited States
| | - Martin I Sereno
- Department of Psychology, College of Sciences, San Diego State UniversitySan DiegoUnited States
| | - Kerrin J Pine
- Department of Neurophysics, Max Planck Institute for Human Cognitive and Brain SciencesLeipzigGermany
| | - Luke J Edwards
- Department of Neurophysics, Max Planck Institute for Human Cognitive and Brain SciencesLeipzigGermany
| | - Saskia Helbling
- Department of Neurophysics, Max Planck Institute for Human Cognitive and Brain SciencesLeipzigGermany
- Poeppel Lab, Ernst Strüngmann Institute (ESI) for Neuroscience in Cooperation with Max Planck SocietyFrankfurt am MainGermany
| | - Nikolaus Weiskopf
- Department of Neurophysics, Max Planck Institute for Human Cognitive and Brain SciencesLeipzigGermany
- Felix Bloch Institute for Solid State Physics, Faculty of Physics and Earth Sciences, Leipzig UniversityLeipzigGermany
| |
Collapse
|
21
|
Morgan M. Stereopsis for rapidly moving targets. Philos Trans R Soc Lond B Biol Sci 2023; 378:20210462. [PMID: 36511411 PMCID: PMC9745867 DOI: 10.1098/rstb.2021.0462] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2022] [Accepted: 09/07/2022] [Indexed: 12/15/2022] Open
Abstract
Stereoscopic depth perception is possible with luminance-defined target velocities at least as high as 600° s-1, up to the limit of 30 Hz imposed by the high-temporal frequency cut-off of the eye. The limitation for perceiving depth from stereo disparity of moving targets is not their velocity but the temporal frequency bandwidth of the eye, which is affected by adaption state. Stereoacuity for a depth shift in a horizontally moving grating depends not on spatial disparity between corresponding luminance points in spatial units of arc min, but on the spatial shift as a fixed proportion of the period of the grating, in other words, on the phase angle difference between the two eyes, as is also the case for obliquely orientated, stationary gratings. Phase differences explain not only the classic Pulfrich stereophenomenon but its equivalent with dynamic visual noise, and a new effect in which depth results from interocular phase differences in luminance modulation. This article is part of a discussion meeting issue 'New approaches to 3D vision'.
Collapse
Affiliation(s)
- Michael Morgan
- Division of Optometry, City, University of London, London EC1V 0HB, UK
| |
Collapse
|
22
|
Jaschinski W. Reply to: Mallett unit or fully fusionable images for prisms against asthenopia? J Optom 2023; 16:88-89. [PMID: 36184401 PMCID: PMC9811359 DOI: 10.1016/j.optom.2022.08.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 08/15/2022] [Accepted: 08/17/2022] [Indexed: 06/16/2023]
Affiliation(s)
- Wolfgang Jaschinski
- Leibniz Research Centre for Working Environment and Human Factors at the Technical University of Dortmund, Dortmund, Germany.
| |
Collapse
|
23
|
Bogomolova K, Vorstenbosch MATM, El Messaoudi I, Holla M, Hovius SER, van der Hage JA, Hierck BP. Effect of binocular disparity on learning anatomy with stereoscopic augmented reality visualization: A double center randomized controlled trial. Anat Sci Educ 2023; 16:87-98. [PMID: 34894205 PMCID: PMC10078652 DOI: 10.1002/ase.2164] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/27/2020] [Revised: 12/05/2021] [Accepted: 12/09/2021] [Indexed: 06/01/2023]
Abstract
Binocular disparity provides one of the important depth cues within stereoscopic three-dimensional (3D) visualization technology. However, there is limited research on its effect on learning within a 3D augmented reality (AR) environment. This study evaluated the effect of binocular disparity on the acquisition of anatomical knowledge and perceived cognitive load in relation to visual-spatial abilities. In a double-center randomized controlled trial, first-year (bio)medical undergraduates studied lower extremity anatomy in an interactive 3D AR environment either with a stereoscopic 3D view (n = 32) or monoscopic 3D view (n = 34). Visual-spatial abilities were tested with a mental rotation test. Anatomical knowledge was assessed by a validated 30-item written test and 30-item specimen test. Cognitive load was measured by the NASA-TLX questionnaire. Students in the stereoscopic 3D and monoscopic 3D groups performed equally well in terms of percentage correct answers (written test: 47.9 ± 15.8 vs. 49.1 ± 18.3; P = 0.635; specimen test: 43.0 ± 17.9 vs. 46.3 ± 15.1; P = 0.429), and perceived cognitive load scores (6.2 ± 1.0 vs. 6.2 ± 1.3; P = 0.992). Regardless of intervention, visual-spatial abilities were positively associated with the specimen test scores (η2 = 0.13, P = 0.003), perceived representativeness of the anatomy test questions (P = 0.010) and subjective improvement in anatomy knowledge (P < 0.001). In conclusion, binocular disparity does not improve learning anatomy. Motion parallax should be considered as another important depth cue that contributes to depth perception during learning in a stereoscopic 3D AR environment.
Collapse
Affiliation(s)
- Katerina Bogomolova
- Department of SurgeryLeiden University Medical CenterLeidenthe Netherlands
- Center for Innovation of Medical EducationLeiden University Medical CenterLeidenthe Netherlands
| | | | - Inssaf El Messaoudi
- Department of OrthopedicsFaculty of MedicineRadboud University Medical CenterNijmegenthe Netherlands
| | - Micha Holla
- Department of OrthopedicsFaculty of MedicineRadboud University Medical CenterNijmegenthe Netherlands
| | - Steven E. R. Hovius
- Department of Plastic and Reconstructive SurgeryRadboud University Medical CenterNijmegenthe Netherlands
| | - Jos A. van der Hage
- Department of SurgeryLeiden University Medical CenterLeidenthe Netherlands
- Center for Innovation of Medical EducationLeiden University Medical CenterLeidenthe Netherlands
| | - Beerend P. Hierck
- Department of Anatomy and PhysiologyClinical Sciences, Veterinary Medicine FacultyUtrechtthe Netherlands
| |
Collapse
|
24
|
Tanabe S, Fu J, Cang J. Strong tuning for stereoscopic depth indicates orientation-specific recurrent circuitry in tree shrew V1. Curr Biol 2022; 32:5274-5284.e6. [PMID: 36417902 PMCID: PMC9772061 DOI: 10.1016/j.cub.2022.10.063] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2022] [Revised: 09/23/2022] [Accepted: 10/27/2022] [Indexed: 11/23/2022]
Abstract
Neurons in the primary visual cortex (V1) are tuned to specific disparities between the two retinal images, which form the neural substrate for stereoscopic vision. We show that V1 neurons in tree shrews, but not in mice, display highly selective responses to narrow ranges of disparity in random-dot stereograms. Surprisingly, V1 neurons in both species show similarly strong tuning to gratings of varying interocular phase differences. This stimulus-dependent dissociation of disparity tuning can be explained by a network model that combines both feedforward and recurrent connections. The features of the model connections are supported by cortical organizations specific to each species. We validate this model by identifying putative inhibitory neurons and confirming their predicted disparity tuning in both species. Together, our studies establish a foundation for using tree shrews in studying binocular vision and raise an exciting possibility of how cortical columns could be uniquely important in computing stereoscopic depth.
Collapse
Affiliation(s)
- Seiji Tanabe
- Department of Psychology, University of Virginia, Charlottesville, VA 22904, USA.
| | - Jieming Fu
- Department of Biology, University of Virginia, Charlottesville, VA 22904, USA; Neuroscience Graduate Program, University of Virginia, Charlottesville, VA 22904, USA
| | - Jianhua Cang
- Department of Psychology, University of Virginia, Charlottesville, VA 22904, USA; Department of Biology, University of Virginia, Charlottesville, VA 22904, USA.
| |
Collapse
|
25
|
Dhungel D, Stevenson SB. Spatial-temporal contrast sensitivity of the eye alignment reflex. Sci Rep 2022; 12:19480. [PMID: 36376410 PMCID: PMC9663687 DOI: 10.1038/s41598-022-23753-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2022] [Accepted: 11/04/2022] [Indexed: 11/16/2022] Open
Abstract
The binocular alignment of the eyes involves both voluntary and reflexive mechanisms, but little is known about the visual input and neurological pathway of the reflex component. Our studies examined the role of spatiotemporal frequency and contrast in the control of reflex eye alignment, and compared the contrast sensitivity of the alignment reflex with psychophysical contrast sensitivity. We measured the contrast sensitivity of vertical disparity-driven vergence eye movements in response to bandwidth filtered static or 6 Hz counterphase flickering noise and measured psychophysical detection sensitivity for the same stimuli. Contrast thresholds for producing a detectable vertical alignment change (measured with nonius lines) were determined using a staircase method for 7 spatial frequencies [0.25-16 cycles per degree] and 3 vertical disparities [5, 10, and 30 arcmin] in 7 adults with normal or corrected to normal vision. The main findings of this study are, (1) the vertical alignment reflex had overall relatively high contrast sensitivity, comparable to but somewhat less than visual detection thresholds, (2) the most effective stimulus spatial frequency scaled in inverse proportion to the disparity being stimulated, and (3) unlike psychophysical contrast sensitivity, the eye alignment reflex contrast sensitivity was not improved by flickering low spatial frequencies.
Collapse
Affiliation(s)
- Deepa Dhungel
- University of Houston College of Optometry, 4401 Martin Luther King Blvd, Houston, TX, 77204, USA.
| | - Scott B Stevenson
- University of Houston College of Optometry, 4401 Martin Luther King Blvd, Houston, TX, 77204, USA
| |
Collapse
|
26
|
Chen YD, Kaestner M, Norcia AM. Cognitive penetrability of scene representations based on horizontal image disparities. Sci Rep 2022; 12:17902. [PMID: 36284130 PMCID: PMC9596438 DOI: 10.1038/s41598-022-22670-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2022] [Accepted: 10/18/2022] [Indexed: 01/20/2023] Open
Abstract
The structure of natural scenes is signaled by many visual cues. Principal amongst them are the binocular disparities created by the laterally separated viewpoints of the two eyes. Disparity cues are believed to be processed hierarchically, first in terms of local measurements of absolute disparity and second in terms of more global measurements of relative disparity that allow extraction of the depth structure of a scene. Psychophysical and oculomotor studies have suggested that relative disparities are particularly relevant to perception, whilst absolute disparities are not. Here, we compare neural responses to stimuli that isolate the absolute disparity cue with stimuli that contain additional relative disparity cues, using the high temporal resolution of EEG to determine the temporal order of absolute and relative disparity processing. By varying the observers' task, we assess the extent to which each cue is cognitively penetrable. We find that absolute disparity is extracted before relative disparity, and that task effects arise only at or after the extraction of relative disparity. Our results indicate a hierarchy of disparity processing stages leading to the formation of a proto-object representation upon which higher cognitive processes can act.
Collapse
Affiliation(s)
- Yulan D Chen
- Department of Psychology, Stanford University, 450 Jane Stanford Way, Stanford, CA, USA
- Wu-Tsai Neuroscience Institute, Stanford University, 290 Jane Stanford Way, Stanford, CA, USA
| | - Milena Kaestner
- Department of Psychology, Stanford University, 450 Jane Stanford Way, Stanford, CA, USA.
- Wu-Tsai Neuroscience Institute, Stanford University, 290 Jane Stanford Way, Stanford, CA, USA.
| | - Anthony M Norcia
- Department of Psychology, Stanford University, 450 Jane Stanford Way, Stanford, CA, USA
- Wu-Tsai Neuroscience Institute, Stanford University, 290 Jane Stanford Way, Stanford, CA, USA
| |
Collapse
|
27
|
Liu Y, Xu X, Xiang B, Chen G, Gong G, Lu H. Transformer Based Binocular Disparity Prediction with Occlusion Predict and Novel Full Connection Layers. Sensors (Basel) 2022; 22:7577. [PMID: 36236675 PMCID: PMC9570544 DOI: 10.3390/s22197577] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/08/2022] [Revised: 09/29/2022] [Accepted: 09/30/2022] [Indexed: 06/16/2023]
Abstract
The depth estimation algorithm based on the convolutional neural network has many limitations and defects by constructing matching cost volume to calculate the disparity: using a limited disparity range, the authentic disparity beyond the predetermined range can not be acquired; Besides, the matching process lacks constraints on occlusion and matching uniqueness; Also, as a local feature extractor, a convolutional neural network lacks the ability of global context information perception. Aiming at the problems in the matching method of constructing matching cost volume, we propose a disparity prediction algorithm based on Transformer, which specifically comprises the Swin-SPP module for feature extraction based on Swin Transformer, Transformer disparity matching network based on self-attention and cross-attention mechanism, and occlusion prediction sub-network. In addition, we propose a double skip connection fully connected layer to solve the problems of gradient vanishing and explosion during the training process for the Transformer model, thus further enhancing inference accuracy. The proposed model in this paper achieved an EPE (Absolute error) of 0.57 and 0.61, and a 3PE (Percentage error greater than 3 px) of 1.74% and 1.56% on KITTI 2012 and KITTI 2015 datasets, respectively, with an inference time of 0.46 s and parameters as low as only 2.6 M, showing great advantages compared with other algorithms in various evaluation metrics.
Collapse
Affiliation(s)
- Yi Liu
- Institute of Semiconductors, Chinese Academy of Sciences, Beijing 100083, China
- University of Chinese Academy of Sciences, Beijing 100089, China
| | - Xintao Xu
- Institute of Semiconductors, Chinese Academy of Sciences, Beijing 100083, China
- School of Microelectronics, University of Science and Technology of China, Hefei 230026, China
| | - Bajian Xiang
- Institute of Semiconductors, Chinese Academy of Sciences, Beijing 100083, China
- University of Chinese Academy of Sciences, Beijing 100089, China
| | - Gang Chen
- Institute of Semiconductors, Chinese Academy of Sciences, Beijing 100083, China
- University of Chinese Academy of Sciences, Beijing 100089, China
| | - Guoliang Gong
- Institute of Semiconductors, Chinese Academy of Sciences, Beijing 100083, China
- University of Chinese Academy of Sciences, Beijing 100089, China
| | - Huaxiang Lu
- Institute of Semiconductors, Chinese Academy of Sciences, Beijing 100083, China
- University of Chinese Academy of Sciences, Beijing 100089, China
- Materials and Optoelectronics Research Center, University of Chinese Academy of Sciences, Beijing 200031, China
- College of Microelectronics, University of Chinese Academy of Sciences, Beijing 100049, China
- Semiconductor Neural Network Intelligent Perception and Computing Technology Beijing Key Laboratory, Beijing 100083, China
| |
Collapse
|
28
|
Hibbard PB, Asher JM. Robust natural depth for anticorrelated random dot stereogram for edge stimuli, but minimal reversed depth for embedded circular stimuli, irrespective of eccentricity. PLoS One 2022; 17:e0274566. [PMID: 36137132 PMCID: PMC9499282 DOI: 10.1371/journal.pone.0274566] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2021] [Accepted: 08/30/2022] [Indexed: 11/19/2022] Open
Abstract
The small differences between the images formed in our left and right eyes are an important cue to the three-dimensional structure of scenes. These disparities are encoded by binocular neurons in the visual cortex. At the earliest stage of processing, these respond to binocular correlation between images. We assessed the perception of depth in anticorrelated stimuli, in which the contrast polarity in one eye is reversed, as a function of their location in the retinal image, and their depth configuration (a horizontal edge or a circle surrounded by an annulus) We found that, regardless of stimulus eccentricity, participants perceived depth in the natural direction for edge stimuli, and weakened, reversed depth for circular stimuli.
Collapse
Affiliation(s)
- Paul B. Hibbard
- Department of Psychology, University of Essex, Colchester, Essex, United Kingdom
- * E-mail:
| | - Jordi M. Asher
- Department of Psychology, University of Essex, Colchester, Essex, United Kingdom
| |
Collapse
|
29
|
O’Keeffe J, Yap SH, Llamas-Cornejo I, Nityananda V, Read JCA. A computational model of stereoscopic prey capture in praying mantises. PLoS Comput Biol 2022; 18:e1009666. [PMID: 35587948 PMCID: PMC9159633 DOI: 10.1371/journal.pcbi.1009666] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2021] [Revised: 06/01/2022] [Accepted: 04/10/2022] [Indexed: 11/25/2022] Open
Abstract
We present a simple model which can account for the stereoscopic sensitivity of praying mantis predatory strikes. The model consists of a single “disparity sensor”: a binocular neuron sensitive to stereoscopic disparity and thus to distance from the animal. The model is based closely on the known behavioural and neurophysiological properties of mantis stereopsis. The monocular inputs to the neuron reflect temporal change and are insensitive to contrast sign, making the sensor insensitive to interocular correlation. The monocular receptive fields have a excitatory centre and inhibitory surround, making them tuned to size. The disparity sensor combines inputs from the two eyes linearly, applies a threshold and then an exponent output nonlinearity. The activity of the sensor represents the model mantis’s instantaneous probability of striking. We integrate this over the stimulus duration to obtain the expected number of strikes in response to moving targets with different stereoscopic disparity, size and vertical disparity. We optimised the parameters of the model so as to bring its predictions into agreement with our empirical data on mean strike rate as a function of stimulus size and disparity. The model proves capable of reproducing the relatively broad tuning to size and narrow tuning to stereoscopic disparity seen in mantis striking behaviour. Although the model has only a single centre-surround receptive field in each eye, it displays qualitatively the same interaction between size and disparity as we observed in real mantids: the preferred size increases as simulated prey distance increases beyond the preferred distance. We show that this occurs because of a stereoscopic “false match” between the leading edge of the stimulus in one eye and its trailing edge in the other; further work will be required to find whether such false matches occur in real mantises. Importantly, the model also displays realistic responses to stimuli with vertical disparity and to pairs of identical stimuli offering a “ghost match”, despite not being fitted to these data. This is the first image-computable model of insect stereopsis, and reproduces key features of both neurophysiology and striking behaviour. The praying mantis is the only insect so far known to compute depth using stereoscopic (3D) vision. Mantis stereopsis appears to be simpler than human stereopsis and most machine sterovision algorithms. A computational model of mantis stereopsis may therefore be beneficial to the field of robotics, particularly where computational power is limited. Using a combination of behavioural observations and neurophysiological data, we propose a very simple model structure to describe the prey capture response in the praying mantis. We used the limited available data on the mantis’ size and distance preferences for its prey to train our model parameters. Our simple model is able to qualitatively reproduce previously unexplained characteristics of our training data, and predicts key observations in additional empirical data that was not included in the model training. Whilst we believe our model to be only a partial and heavily simplified account of mantis stereopsis, our results are supportive of our model structure as an approximation of the size and disparity sensors used by the mantis when catching its prey.
Collapse
Affiliation(s)
- James O’Keeffe
- Dyson School of Design Engineering, Imperial College, London, United Kingdom
- * E-mail:
| | - Sin Hui Yap
- Biosciences Institute, Newcastle University, Newcastle, United Kingdom
- School of Medical Education, Newcastle University, Johor, Malaysia
| | | | - Vivek Nityananda
- Biosciences Institute, Newcastle University, Newcastle, United Kingdom
| | - Jenny C. A. Read
- Biosciences Institute, Newcastle University, Newcastle, United Kingdom
| |
Collapse
|
30
|
McLean IR, Manning TS, Cooper EA. Perceptual Adaptation to Continuous Versus Intermittent Exposure to Spatial Distortions. Invest Ophthalmol Vis Sci 2022; 63:29. [PMID: 35612838 PMCID: PMC9150834 DOI: 10.1167/iovs.63.5.29] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2022] [Accepted: 04/28/2022] [Indexed: 11/24/2022] Open
Abstract
Purpose To examine perceptual adaptation when people wear spectacles that produce unequal retinal image magnification. Methods Two groups of 15 participants (10 male; mean age 25.6 ± 4.9 years) wore spectacles with a 3.8% horizontal magnifier over one eye. The continuous-wear group wore the spectacles for 5 hours straight. The intermittent-wear group wore them for five 1-hour intervals. To measure slant and shape distortions produced by the spectacles, participants adjusted visual stimuli until they appeared frontoparallel or equiangular, respectively. Adaptation was quantified as the difference in responses at the beginning and end of wearing the spectacles. Aftereffects were quantified as the difference before and after removing the spectacles. We hypothesized that intermittent wear may lead to visual cue reweighting, so we fit a cue combination model to the data and examined changes in weights given to perspective and binocular disparity slant cues. Results Both groups experienced significant shape adaptation and aftereffects. The continuous-wear group underwent significant slant adaptation and the intermittent group did not, but there was no significant difference between groups, suggesting that the difference in adaptation was negligible. There was no evidence for cue reweighting in the intermittent wear group, but unexpectedly, the weight given to binocular disparity cues for slant increased significantly in the continuous-wear group. Conclusions We did not find strong evidence that adaptation to spatial distortions differed between the two groups. However, there may be differences in the cue weighting strategies employed when spectacles are worn intermittently or continuously.
Collapse
Affiliation(s)
- Iona R McLean
- Herbert Wertheim School of Optometry and Vision Science, University of California Berkeley, Berkeley, California, United States
| | - Tyler S Manning
- Herbert Wertheim School of Optometry and Vision Science, University of California Berkeley, Berkeley, California, United States
| | - Emily A Cooper
- Herbert Wertheim School of Optometry and Vision Science, University of California Berkeley, Berkeley, California, United States
- Helen Wills Neuroscience Institute, University of California Berkeley, Berkeley, California, United States
| |
Collapse
|
31
|
Evans BJW, Parmar K, Dickinson C. Reply to: Mallett unit or fully fusionable images for prisms against asthenopia? J Optom 2022; 15:186-187. [PMID: 34670731 PMCID: PMC9068569 DOI: 10.1016/j.optom.2021.09.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [MESH Headings] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Affiliation(s)
| | - Ketan Parmar
- Division of Pharmacy and Optometry, School of Health Sciences, Faculty of Biology, Medicine and Health, University of Manchester, Manchester Academic Health Science Centre, Manchester, UK
| | - Christine Dickinson
- Division of Pharmacy and Optometry, School of Health Sciences, Faculty of Biology, Medicine and Health, University of Manchester, Manchester Academic Health Science Centre, Manchester, UK
| |
Collapse
|
32
|
Kommerell G, Bach M. Mallett unit or fully fusionable images for prisms against asthenopia? J Optom 2022; 15:186. [PMID: 34674970 PMCID: PMC9068562 DOI: 10.1016/j.optom.2021.01.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [MESH Headings] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 01/07/2021] [Accepted: 01/07/2021] [Indexed: 06/13/2023]
Affiliation(s)
- Guntram Kommerell
- Eye Center, Faculty of Medicine, Medical Center - University of Freiburg, University of Freiburg, Germany.
| | - Michael Bach
- Eye Center, Faculty of Medicine, Medical Center - University of Freiburg, University of Freiburg, Germany
| |
Collapse
|
33
|
Sasaki R, Kumano H, Mitani A, Suda Y, Uka T. Task-Specific Employment of Sensory Signals Underlies Rapid Task Switching. Cereb Cortex 2022; 32:4657-4670. [PMID: 35088074 DOI: 10.1093/cercor/bhab508] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2021] [Revised: 12/09/2021] [Accepted: 12/10/2021] [Indexed: 11/13/2022] Open
Abstract
Much of our flexible behavior is dependent on responding efficiently to relevant information while discarding irrelevant information. Little is known, however, about how neural pathways governing sensory-motor associations can rapidly switch to accomplish such flexibility. Here, we addressed this question by electrically microstimulating middle temporal (MT) neurons selective for both motion direction and binocular disparity in monkeys switching between direction and depth discrimination tasks. Surprisingly, we frequently found that the observed psychophysical bias precipitated by delivering microstimulation to neurons whose preferred direction and depth were related to opposite choices in the two tasks was substantially shifted toward a specific movement. Furthermore, these effects correlated with behavioral switching performance. Our findings suggest that the outputs of sensory signals are task specific and that irrelevant sensory-motor pathways are gated depending on task demand so as to accomplish rapid attentional switching.
Collapse
Affiliation(s)
- Ryo Sasaki
- Department of Neuroscience, Graduate School of Medicine, Kyoto University, Kyoto 606-8501, Japan
| | - Hironori Kumano
- Department of Integrative Physiology, Graduate School of Medicine, University of Yamanashi, Yamanashi 409-3898, Japan
- Department of Neurophysiology, Juntendo University Graduate School of Medicine, Tokyo 113-8421, Japan
| | | | - Yuki Suda
- Department of Integrative Physiology, Graduate School of Medicine, University of Yamanashi, Yamanashi 409-3898, Japan
- Department of Neurophysiology, Juntendo University Graduate School of Medicine, Tokyo 113-8421, Japan
- Brain Science Institute, Tamagawa University, Tokyo 194-8610, Japan
| | - Takanori Uka
- Department of Integrative Physiology, Graduate School of Medicine, University of Yamanashi, Yamanashi 409-3898, Japan
- Department of Neurophysiology, Juntendo University Graduate School of Medicine, Tokyo 113-8421, Japan
- Brain Science Institute, Tamagawa University, Tokyo 194-8610, Japan
| |
Collapse
|
34
|
Abstract
This paper explains the basics of depth perception and explores why both eyes are aligned. We concluded that the brain may analyze image features for binocular fusion while calculating the horizontal disparity for depth perception.
Collapse
Affiliation(s)
- Hiroyuki Mitsudo
- Division of Psychology, Department of Human Sciences, Faculty of Human-Environment Studies, Kyushu University
| |
Collapse
|
35
|
Reena Durai CV, Rajendran S, Webster MA, Vempati S, Bharadwaj SR. The magnitude of monocular light attenuation required to elicit the Pulfrich illusion. Vision Res 2021; 187:85-93. [PMID: 34225133 PMCID: PMC8363548 DOI: 10.1016/j.visres.2021.06.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2020] [Revised: 06/17/2021] [Accepted: 06/19/2021] [Indexed: 11/17/2022]
Abstract
In the Pulfrich illusion, the depth of a moving object is misperceived due to induced retinal disparity and/or interocular velocity differences arising from differences in luminance, contrast, or spatial frequency between the two eyes. These effects have been shown to occur both for visual deficits and for optical corrections that introduce significant binocular differences between the retinal images. However, it remains unknown to what extent the illusion might arise given normal variation between the eyes, such as natural interocular variation in pupil diameter (anisocoria). To assess this, we examined the threshold interocular retinal illuminance difference required to experience illusory depth in two random-dot fields moving in opposite directions in 24 normally-sighted observers with dilated pupils. Interocular difference in retinal illuminance was induced by placing neutral density filters of different intensities before the left eye. A minority of subjects (n = 8) did not provide meaningful data on changes in the experience of illusory depth with interocular difference in retinal illuminance and four subjects showed biases >±10% from the 50% point of subjective equality in the psychometric function. For the remaining 12 participants, the retinal illuminance had to differ by approximately 40% for the depth between the planes to become visible at threshold levels. This difference was approximately constant over a range of absolute luminance levels from 10 to 80 cd/m2. Our results suggest that while motion-in-depth illusions due to interocular differences in retinal illuminance may be pronounced in certain ophthalmic diseases or following certain optical interventions, it is unlikely to be manifest as a result of normal interocular variations in retinal illuminance. Further, our results also point towards the existence of substantial individual differences in the experience of what is otherwise thought of as a readily appreciable motion-in-depth illusion.
Collapse
Affiliation(s)
- C Vijay Reena Durai
- Brien Holden Institute of Optometry and Vision Sciences, LV Prasad Eye Institute, Road no. 2, Banjara Hills, Hyderabad 500034, Telangana, India; Prof. Brien Holden Eye Research Centre, Hyderabad Eye Research Foundation, LV Prasad Eye Institute, Road no. 2, Banjara Hills, Hyderabad 500034, Telangana, India
| | | | | | - Sandeep Vempati
- Center for Innovation, LV Prasad Eye Institute, Hyderabad 500034, Telangana, India
| | - Shrikant R Bharadwaj
- Brien Holden Institute of Optometry and Vision Sciences, LV Prasad Eye Institute, Road no. 2, Banjara Hills, Hyderabad 500034, Telangana, India; Prof. Brien Holden Eye Research Centre, Hyderabad Eye Research Foundation, LV Prasad Eye Institute, Road no. 2, Banjara Hills, Hyderabad 500034, Telangana, India.
| |
Collapse
|
36
|
Whritner JA, Czuba TB, Cormack LK, Huk AC. Spatiotemporal integration of isolated binocular three-dimensional motion cues. J Vis 2021; 21:2. [PMID: 34468705 PMCID: PMC8419873 DOI: 10.1167/jov.21.10.2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2021] [Accepted: 07/28/2021] [Indexed: 11/24/2022] Open
Abstract
Two primary binocular cues-based on velocities seen by the two eyes or on temporal changes in binocular disparity-support the perception of three-dimensional (3D) motion. Although these cues support 3D motion perception in different perceptual tasks or regimes, stimulus cross-cue contamination and/or substantial differences in spatiotemporal structure have complicated interpretations. We introduce novel psychophysical stimuli which cleanly isolate the cues, based on a design introduced in oculomotor work (Sheliga, Quaia, FitzGibbon, & Cumming, 2016). We then use these stimuli to characterize and compare the temporal and spatial integration properties of velocity- and disparity-based mechanisms. On average, temporal integration of velocity-based cues progressed more than twice as quickly as disparity-based cues; performance in each pure-cue condition saturated at approximately 200 ms and approximately 500 ms, respectively. This temporal distinction suggests that disparity-based 3D direction judgments may include a post-sensory stage involving additional integration time in some observers, whereas velocity-based judgments are rapid and seem to be more purely sensory in nature. Thus, these two binocular mechanisms appear to support 3D motion perception with distinct temporal properties, reflecting differential mixtures of sensory and decision contributions. Spatial integration profiles for the two mechanisms were similar, and on the scale of receptive fields in area MT. Consistent with prior work, there were substantial individual differences, which we interpret as both sensory and cognitive variations across subjects, further clarifying the case for distinct sets of both cue-specific sensory and cognitive mechanisms. The pure-cue stimuli presented here lay the groundwork for further investigations of velocity- and disparity-based contributions to 3D motion perception.
Collapse
Affiliation(s)
- Jake A Whritner
- Center for Perceptual Systems, Department of Psychology, The University of Texas at Austin, Austin, TX, USA
| | - Thaddeus B Czuba
- Center for Perceptual Systems, Department of Psychology, The University of Texas at Austin, Austin, TX, USA
| | - Lawrence K Cormack
- Center for Perceptual Systems, Department of Psychology, The University of Texas at Austin, Austin, TX, USA
| | - Alexander C Huk
- Center for Perceptual Systems, Departments of Neuroscience & Psychology, The University of Texas at Austin, Austin, TX, USA
| |
Collapse
|
37
|
Zhang B, Slezak E, Wang W, Shevell SK. Binocularly-driven competing neural responses and the perceptual resolution of color. J Vis 2021; 21:15. [PMID: 34542576 PMCID: PMC8458990 DOI: 10.1167/jov.21.10.15] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2020] [Accepted: 08/21/2021] [Indexed: 11/25/2022] Open
Abstract
Competing rivalrous neural representations can be resolved at several levels of the visual system. Sustained percepts during interocular-switch rivalry (ISR), in which rivalrous left- and right-eye stimuli swap between eyes several times a second, often are attributed to competing binocularly driven neural representations of each rivalrous stimulus. An alternative view posits monocular neural competition together with a switch in eye dominance at the moment of each stimulus swap between eyes. Here, a range of experimental conditions was tested that would change the colors seen if mediated by eye dominance but not if by competition between binocularly driven responses. Observers viewed multiple chromatically rivalrous discs in various temporal and spatial patterns, and reported when all discs in view appeared the same color. Unlike typical ISR paradigms that swap the complete stimulus in each eye, some of the rivalrous discs were swapped at a different time, or faster frequency, than other discs. Monocular dominance of one eye at a time implies that all discs will rarely be seen as identical in color when some discs swap at a different frequency than others. On the other hand, competing binocularly driven neural responses are not affected by asynchronous swap timing among the individual discs. Results for every observer are in accord with competing responses at the level of binocularly driven, chromatically tuned neurons. Although an account based on eye dominance can be constructed using many small retinotopic zones that have independent timing for the moment of switching the dominant eye, competing binocularly driven responses are a more parsimonious explanation.
Collapse
Affiliation(s)
- Bobicheng Zhang
- Department of Psychology, University of Chicago, Chicago, Illinois, USA
| | - Emily Slezak
- Department of Psychology, University of Chicago, Chicago, Illinois, USA
- Institute for Mind and Biology and University of Chicago, Chicago, Illinois, USA
| | - Wei Wang
- Department of Psychology, University of Chicago, Chicago, Illinois, USA
- Institute for Mind and Biology and University of Chicago, Chicago, Illinois, USA
| | - Steven K Shevell
- Department of Psychology, University of Chicago, Chicago, Illinois, USA
- Institute for Mind and Biology and University of Chicago, Chicago, Illinois, USA
- Department of Ophthalmology & Visual Science, University of Chicago, Chicago, Illinois, USA
| |
Collapse
|
38
|
Alvarez I, Hurley SA, Parker AJ, Bridge H. Human primary visual cortex shows larger population receptive fields for binocular disparity-defined stimuli. Brain Struct Funct 2021; 226:2819-2838. [PMID: 34347164 PMCID: PMC8541985 DOI: 10.1007/s00429-021-02351-3] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2021] [Accepted: 07/22/2021] [Indexed: 11/26/2022]
Abstract
The visual perception of 3D depth is underpinned by the brain's ability to combine signals from the left and right eyes to produce a neural representation of binocular disparity for perception and behaviour. Electrophysiological studies of binocular disparity over the past 2 decades have investigated the computational role of neurons in area V1 for binocular combination, while more recent neuroimaging investigations have focused on identifying specific roles for different extrastriate visual areas in depth perception. Here we investigate the population receptive field properties of neural responses to binocular information in striate and extrastriate cortical visual areas using ultra-high field fMRI. We measured BOLD fMRI responses while participants viewed retinotopic mapping stimuli defined by different visual properties: contrast, luminance, motion, correlated and anti-correlated stereoscopic disparity. By fitting each condition with a population receptive field model, we compared quantitatively the size of the population receptive field for disparity-specific stimulation. We found larger population receptive fields for disparity compared with contrast and luminance in area V1, the first stage of binocular combination, which likely reflects the binocular integration zone, an interpretation supported by modelling of the binocular energy model. A similar pattern was found in region LOC, where it may reflect the role of disparity as a cue for 3D shape. These findings provide insight into the binocular receptive field properties underlying processing for human stereoscopic vision.
Collapse
Affiliation(s)
- Ivan Alvarez
- Wellcome Centre for Integrative Neuroimaging, Nuffield Department of Clinical Neurosciences, John Radcliffe Hospital, University of Oxford, Oxford, OX3 9DU, UK
| | - Samuel A Hurley
- Wellcome Centre for Integrative Neuroimaging, Nuffield Department of Clinical Neurosciences, John Radcliffe Hospital, University of Oxford, Oxford, OX3 9DU, UK
- Department of Radiology, University of Wisconsin, Madison, WI, 53705, USA
| | - Andrew J Parker
- Department of Physiology, Anatomy and Genetics, University of Oxford, Oxford, OX1 3PT, UK
- Institut für Biologie, Otto-von-Guericke Universität, 39120, Magdeburg, Germany
| | - Holly Bridge
- Wellcome Centre for Integrative Neuroimaging, Nuffield Department of Clinical Neurosciences, John Radcliffe Hospital, University of Oxford, Oxford, OX3 9DU, UK.
| |
Collapse
|
39
|
Abstract
SIGNIFICANCE The range of clear and single binocular vision differs between 3D displays and clinical prism vergences, but this difference is unexplained. This difference prevents clinicians from predicting the range of clear and single binocular vision in 3D-viewing patients. In this study, we tested a hypothesis for this difference. PURPOSE The purpose of this study was to determine whether changing fixation target size in 3D viewing significantly affects the vergence ranges and, if so, then to determine whether the target size effect is driven by fusional vergence gain changes, threshold of blur changes, or both. METHODS Twenty-one visually normal adults aged 18 to 28 years viewed 3D images at 40 cm in an electronic stereoscopic. The fixation target, a Maltese cross, moved in depth at 2∆/s by way of changing crossed or uncrossed disparity until blur and diplopia ensued. We used four target sizes: (1) small (width × height, 0.21° × 0.63°), (2) medium (1.43° × 4.3°), (3) large (3.6° × 10.8°), and (4) 3D (size changing congruently with disparity). The effect of target size on responses was tested by mixed ANOVAs. RESULT Mean convergence blurs and breaks increased with target size by 40% (P < .001) and 71% (P < .001), respectively, and in divergence by 33% (P = .03) and 30% (P = .04), respectively. The increases in break magnitude with target size implicate fusional vergence gain change in the size effect. Increasing target size raised the threshold of blur from 1.06 to 1.82 D in convergence and from 0.97 to 1.48 D in divergence (P = .008). CONCLUSIONS Growing fixation target size in 3D viewing increases fusional vergence gain and blur thresholds, which together increase the limits of clear and single binocular vision. Therefore, the clarity of a 3D image depends not only on its disparity but also on the size of the viewed image.
Collapse
|
40
|
Goutcher R, Barrington C, Hibbard PB, Graham B. Binocular vision supports the development of scene segmentation capabilities: Evidence from a deep learning model. J Vis 2021; 21:13. [PMID: 34289490 PMCID: PMC8300045 DOI: 10.1167/jov.21.7.13] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2021] [Accepted: 06/16/2021] [Indexed: 11/24/2022] Open
Abstract
The application of deep learning techniques has led to substantial progress in solving a number of critical problems in machine vision, including fundamental problems of scene segmentation and depth estimation. Here, we report a novel deep neural network model, capable of simultaneous scene segmentation and depth estimation from a pair of binocular images. By manipulating the arrangement of binocular image pairs, presenting the model with standard left-right image pairs, identical image pairs or swapped left-right images, we show that performance levels depend on the presence of appropriate binocular image arrangements. Segmentation and depth estimation performance are both impaired when images are swapped. Segmentation performance levels are maintained, however, for identical image pairs, despite the absence of binocular disparity information. Critically, these performance levels exceed those found for an equivalent, monocularly trained, segmentation model. These results provide evidence that binocular image differences support both the direct recovery of depth and segmentation information, and the enhanced learning of monocular segmentation signals. This finding suggests that binocular vision may play an important role in visual development. Better understanding of this role may hold implications for the study and treatment of developmentally acquired perceptual impairments.
Collapse
Affiliation(s)
- Ross Goutcher
- Psychology Division, Faculty of Natural Sciences, University of Stirling, Stirling, UK
| | - Christian Barrington
- Psychology Division, Faculty of Natural Sciences, University of Stirling, Stirling, UK
- Computing Science and Mathematics Division, Faculty of Natural Sciences, University of Stirling, Stirling, UK
| | - Paul B Hibbard
- Department of Psychology, University of Essex, Colchester, UK
| | - Bruce Graham
- Computing Science and Mathematics Division, Faculty of Natural Sciences, University of Stirling, Stirling, UK
| |
Collapse
|
41
|
Abstract
Interocular suppression is the phenomenon in which the signal from one eye inhibits the other eye in the presence of dissimilar images. Various clinical and laboratory-based tests have been used to assess suppression, which vary in color, contrast, and stimulus size. These stimulus variations may yield different spatial extents of suppression, which makes it difficult to compare the outcomes. To evaluate the role of stimulus characteristics, we measured the suppression zone using a binocular rivalry paradigm in normally-sighted observers by systematically varying the stimulus parameters. The stimuli consist of a constantly visible horizontal reference seen by one eye while two vertical suppressors were presented to the other eye. With a keypress, the suppressors appeared for 1 s, to induce a transient suppression zone in the middle part of the reference. Subjects adjusted the width between the suppressors to determine the zone. The zone decreased significantly with increasing spatial frequency and lower contrast. The width was 1.4 times larger than the height. The zone was smaller with negative compared to positive contrast polarity but independent of eye dominance, luminance, and colored filters. A departure from scale invariance was captured with a model suggesting a stimulus-dependent and a small fixed non-stimulus-dependent portion.
Collapse
Affiliation(s)
- Wei Hau Lew
- University of Houston College of Optometry, 4901 Calhoun Rd, Houston, TX, 77004, USA.
| | - Scott B Stevenson
- University of Houston College of Optometry, 4901 Calhoun Rd, Houston, TX, 77004, USA
| | - Daniel R Coates
- University of Houston College of Optometry, 4901 Calhoun Rd, Houston, TX, 77004, USA
| |
Collapse
|
42
|
Chen PY, Chen CC, Tyler CW. A gain-control disparity energy model for perceived depth from disparity. Vision Res 2021; 181:38-46. [PMID: 33556821 DOI: 10.1016/j.visres.2020.12.008] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2020] [Revised: 10/08/2020] [Accepted: 12/04/2020] [Indexed: 11/18/2022]
Abstract
Luminance contrast is one of the key factors in the visibility of objects in the world around us. Previous work has shown that the perceived depth from binocular disparity depends profoundly on the luminance contrast of the image. This dependence cannot be explained by existing disparity models, such as the well-established disparity energy model, because they predict no effect of luminance contrast on depth perception. Here, we develop a model for disparity processing that incorporates contrast normalization of the neural response into the disparity energy model to account for the contrast dependence of perceived depth from disparity. Our model contains an array of disparity channels, each with a different disparity selectivity. The binocular images are first processed by the left- and right-eye receptive fields of each channel. The outputs of the two receptive fields are combined linearly as the excitatory disparity sensitivity and then fed into a nonlinear contrast gain control mechanism. The perceived depth is determined by the weighted average of all the disparity channels that respond to the binocular images. This model provides the first analytic account of how luminance contrast affects perceived depth from disparity.
Collapse
Affiliation(s)
- Pei-Yin Chen
- Department of Psychology, National Taiwan University, Taipei, Taiwan
| | - Chien-Chung Chen
- Department of Psychology, National Taiwan University, Taipei, Taiwan; Center for Neurobiology and Cognitive Science, National Taiwan University, Taipei, Taiwan.
| | - Christopher W Tyler
- Division of Optometry and Visual Science, School of Health Sciences, City University, London, United Kingdom; Smith-Kettlewell Eye Research Institute, San Francisco, CA, United States
| |
Collapse
|
43
|
Zewari SZ, Jeung L, Gerlach C, Wessels PH. [Migraine with typical aura without headache; first presentation at older age]. Ned Tijdschr Geneeskd 2021; 164:D4949. [PMID: 33560603] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Transient visual phenomena in later life due to a visual aura in migraine are not uncommon and can also occur without headache. This migraine with aura is not always recognized as such. Since visual phenomena can also have a variety of other causes, they can present a diagnostic dilemma in both primary and secondary care. Visual aura phenomena have various and often complex manifestations and are usually observed binocularly. In case of transient visual phenomena, the patient's history is of utmost importance to determine whether there is a mono- or binocular cause of the complaints. Timely recognition of visual phenomena in the context of migraine can prevent unnecessary diagnostics and treatments.
Collapse
Affiliation(s)
- S Z Zewari
- St. Antonius Ziekenhuis, afd. Neurologie, Utrecht/Nieuwegein
- Contact: S.Z. Zewari
| | - L Jeung
- St. Antonius Ziekenhuis, afd. Neurologie, Utrecht/Nieuwegein
| | - C Gerlach
- St. Antonius Ziekenhuis, afd. Neurologie, Utrecht/Nieuwegein
| | - P H Wessels
- St. Antonius Ziekenhuis, afd. Neurologie, Utrecht/Nieuwegein
| |
Collapse
|
44
|
Abstract
Several studies demonstrated that functional magnetic resonance imaging (fMRI) signals in early visual cortex can be used to reconstruct 2-dimensional (2D) visual contents. However, it remains unknown how to reconstruct 3-dimensional (3D) visual stimuli from fMRI signals in visual cortex. 3D visual stimuli contain 2D visual features and depth information. Moreover, binocular disparity is an important cue for depth perception. Thus, it is more challenging to reconstruct 3D visual stimuli than 2D visual stimuli from the fMRI signals of visual cortex. This study aimed to reconstruct 3D visual images by constructing three decoding models: contrast-decoding, disparity-decoding and contrast-disparity-decoding models, and testing these models with fMRI data from humans viewing 3D contrast images. The results revealed that the 3D contrast stimuli can be reconstructed from the visual cortex. And the early visual regions (V1, V2) showed predominant advantages in reconstructing the contrast in 3D images for the contrast-decoding model. The dorsal visual regions (V3A, V7 and MT) showed predominant advantages in decoding the disparity in 3D images for the disparity-decoding model. The combination of the early and dorsal visual regions showed predominant advantages in decoding both the contrast and disparity for the contrast-disparity-decoding model. The results suggested that the contrast and disparity in 3D images were mainly represented in the early and dorsal visual regions separately. The two visual systems may interact with each other to decode 3D-contrast images.
Collapse
|
45
|
Chen N, Chen Z, Fang F. Functional specialization in human dorsal pathway for stereoscopic depth processing. Exp Brain Res 2020; 238:2581-2588. [PMID: 32886136 DOI: 10.1007/s00221-020-05918-4] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2020] [Accepted: 09/01/2020] [Indexed: 11/25/2022]
Abstract
Binocular disparity, a primary cue for stereoscopic depth perception, is widely represented in visual cortex. However, the functional specialization in the disparity processing network remains unclear. Using magnetic resonance imaging-guided transcranial magnetic stimulation, we studied the causal contributions of V3A and MT+ to stereoscopic depth perception. Subjects viewed random-dot stereograms forming transparent planes with various interplane disparities. Their smallest detectable disparity and largest detectable disparity were measured in two experiments. We found that the smallest detectable disparity was affected by V3A, but not MT+ , stimulation. On the other hand, the largest detectable disparity was affected by both V3A and MT+ stimulation. Our results suggest different roles of V3A and MT+ in stereoscopic depth processing.
Collapse
Affiliation(s)
- Nihong Chen
- Department of Psychology, School of Social Sciences, Tsinghua University, Beijing, 100084, China.
- THU-IDG/McGovern Institute for Brain Research, Beijing, 100084, China.
| | - Zhimin Chen
- School of Psychological and Cognitive Sciences, Peking University, Beijing, 100871, China
| | - Fang Fang
- School of Psychological and Cognitive Sciences, Peking University, Beijing, 100871, China.
- Beijing Key Laboratory of Behavior and Mental Health, Beijing, China.
- PKU-IDG/McGovern Institute for Brain Research, Beijing, China.
- Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, 100871, China.
| |
Collapse
|
46
|
Abstract
Since Kepler (1604) and Descartes (1637), 'vergence' (the angular rotation of the eyes) has been thought of as one of our most important absolute distance cues. But vergence has never been tested as an absolute distance cue divorced from obvious confounding cues such as binocular disparity. In this article, we control for these confounding cues for the first time by gradually manipulating vergence and find that observers fail to accurately judge distance from vergence. We consider several different interpretations of these results and argue that the most principled response to these results is to question the general effectiveness of vergence as an absolute distance cue. Given that other absolute distance cues (such as motion parallax and vertical disparities) are limited in application, this poses a real challenge to our contemporary understanding of visual scale.
Collapse
Affiliation(s)
- Paul Linton
- Centre for Applied Vision Research, City, University of London, Northampton Square, Clerkenwell, London, EC1V 0HB, UK.
| |
Collapse
|
47
|
Wei H, Xu C, Jin Z. Binocular Matching Model Based on Hierarchical V1 and V2 Receptive Fields With Color, Orientation, and Region Feature Information. IEEE Trans Biomed Eng 2020; 67:3141-3150. [PMID: 32142415 DOI: 10.1109/tbme.2020.2977350] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
Binocular matching models serve as the core component in most stereo visual aid systems developed for people with visual impairments. However, purely computational models lack a neuro-biological basis for explaining the phenomena observed in neuro-biology, and therefore offer no support for the development of bioengineering applications, and are overly complex for hardware implementation. In contrast, existing neurobiological models suffer from low matching calculation accuracy. Therefore, the present work proposes a novel binocular matching model based on the receptive field of simple cells rather than on image pixels, and thereby incorporates neurobiological structure, reduces hardware complexity, has enough accuracy and can be used in visual aid system. The proposed model is employed to calculate and optimize the binocular disparity via a cost function. Specifically, we simulate the functions and structures of V1 and V2 neurons according to the discoveries of modern neurobiology. Accordingly, the receptive fields of V1 layer neurons are aggregated to obtain the receptive fields of the V2 layer, and the disparity is obtained in the V2 layer. The accuracy of the proposed model is verified by comparisons of the disparity results obtained using the proposed model with those obtained using other neurobiological model, and thereby demonstrates that the model can guide the design of visual aid systems.
Collapse
|
48
|
Abstract
Many species employ camouflage to disguise their true shape and avoid detection or recognition. Disruptive coloration is a form of camouflage in which high-contrast patterns obscure internal features or break up an animal's outline. In particular, edge enhancement creates illusory, or 'fake' depth edges within the animal's body. Disruptive coloration often co-occurs with background matching, and together, these strategies make it difficult for an observer to visually segment an animal from its background. However, stereoscopic vision could provide a critical advantage in the arms race between perception and camouflage: the depth information provided by binocular disparities reveals the true three-dimensional layout of a scene, and might, therefore, help an observer to overcome the effects of disruptive coloration. Human observers located snake targets embedded in leafy backgrounds. We analysed performance (response time) as a function of edge enhancement, illumination conditions and the availability of binocular depth cues. We confirm that edge enhancement contributes to effective camouflage: observers were slower to find snakes whose patterning contains 'fake' depth edges. Importantly, however, this effect disappeared when binocular depth cues were available. Illumination also affected detection: under directional illumination, where both the leaves and snake produced strong cast shadows, snake targets were localized more quickly than in scenes rendered under ambient illumination. In summary, we show that illusory depth edges, created via disruptive coloration, help to conceal targets from human observers. However, cast shadows and binocular depth information improve detection by providing information about the true three-dimensional structure of a scene. Importantly, the strong interaction between disparity and edge enhancement suggests that stereoscopic vision has a critical role in breaking camouflage, enabling the observer to overcome the disruptive effects of edge enhancement.
Collapse
Affiliation(s)
- Wendy J. Adams
- Department of Psychology, University of Southampton, Southampton SO17 1BJ, UK
| | | | | |
Collapse
|
49
|
Asher JM, Hibbard PB. First- and second-order contributions to depth perception in anti-correlated random dot stereograms. Sci Rep 2018; 8:14120. [PMID: 30237535 PMCID: PMC6148546 DOI: 10.1038/s41598-018-32500-4] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2018] [Accepted: 09/05/2018] [Indexed: 11/08/2022] Open
Abstract
The binocular energy model of neural responses predicts that depth from binocular disparity might be perceived in the reversed direction when the contrast of dots presented to one eye is reversed. While reversed-depth has been found using anti-correlated random-dot stereograms (ACRDS) the findings are inconsistent across studies. The mixed findings may be accounted for by the presence of a gap between the target and surround, or as a result of overlap of dots around the vertical edges of the stimuli. To test this, we assessed whether (1) the gap size (0, 19.2 or 38.4 arc min) (2) the correlation of dots or (3) the border orientation (circular target, or horizontal or vertical edge) affected the perception of depth. Reversed-depth from ACRDS (circular no-gap condition) was seen by a minority of participants, but this effect reduced as the gap size increased. Depth was mostly perceived in the correct direction for ACRDS edge stimuli, with the effect increasing with the gap size. The inconsistency across conditions can be accounted for by the relative reliability of first- and second-order depth detection mechanisms, and the coarse spatial resolution of the latter.
Collapse
Affiliation(s)
- Jordi M Asher
- University of Essex, Department of Psychology, Wivenhoe Park, CO4 3SQ, United Kingdom.
| | - Paul B Hibbard
- University of Essex, Department of Psychology, Wivenhoe Park, CO4 3SQ, United Kingdom
| |
Collapse
|
50
|
Alberti CF, Bex PJ. Binocular contrast summation and inhibition depends on spatial frequency, eccentricity and binocular disparity. Ophthalmic Physiol Opt 2018; 38:525-537. [PMID: 30221370 PMCID: PMC6202146 DOI: 10.1111/opo.12581] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2018] [Accepted: 08/10/2018] [Indexed: 11/30/2022]
Abstract
PURPOSE When central vision is compromised, visually-guided behaviour becomes dependent on peripheral retina, often at a preferred retinal locus (PRL). Previous studies have examined adaptation to central vision loss with monocular 2D paradigms, whereas in real tasks, patients make binocular eye movements to targets of various sizes and depth in 3D environments. METHODS We therefore examined monocular and binocular contrast sensitivity functions with a 26-AFC (alternate forced choice) band-pass filtered letter identification task at 2° or 6° eccentricity in observers with simulated central vision loss. Binocular stimuli were presented in corresponding or non-corresponding stereoscopic retinal locations. Gaze-contingent scotomas (0.5° radius disks of pink noise) were simulated independently in each eye with a 1000 Hz eye tracker and 120 Hz dichoptic shutter glasses. RESULTS Contrast sensitivity was higher for binocular than monocular conditions, but only exceeded probability summation at low-mid spatial frequencies in corresponding retinal locations. At high spatial frequencies or non-corresponding retinal locations, binocular contrast sensitivity showed evidence of interocular suppression. CONCLUSIONS These results suggest that binocular vision deficits may be underestimated by monocular vision tests and identify a method that can be used to select a PRL based on binocular contrast summation.
Collapse
Affiliation(s)
| | - Peter J Bex
- Department of Psychology, Northeastern University, Boston, USA
| |
Collapse
|