1
|
Domijan D, Ivančić I. Accentuation, Boolean maps and perception of (dis)similarity in a neural model of visual segmentation. Vision Res 2024; 225:108506. [PMID: 39486210 DOI: 10.1016/j.visres.2024.108506] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2023] [Revised: 10/10/2024] [Accepted: 10/22/2024] [Indexed: 11/04/2024]
Abstract
We developed an interactive cortical circuit for visual segmentation that integrates bottom-up and top-down processing to segregate or group visual elements. A bottom-up pathway incorporates stimulus-driven saliency computation, top-down feature-based weighting by relevance and winner-take-all selection. A top-down pathway encompasses multiscale feedback projections, an object-based attention network and a visual segmentation network. Computer simulations have shown that a salient element in the stimulus guides spatial attention and further influences the decomposition of the nearby object into its parts, as postulated by the principle of accentuation. By contrast, when no single salient element is present, top-down feature-based attention highlights all locations occupied by the attended feature and the model forms a Boolean map, i.e., a spatial representation that makes the feature-based grouping explicit. The same distinction between bottom-up and top-down influences in perceptual organization can also be applied to texture perception. The model suggests that the principle of accentuation and feature-based similarity grouping are two manifestations of the same cortical circuit designed to detect similarities and dissimilarities of visual elements in a stimulus.
Collapse
|
2
|
Vincent J, Maertens M, Aguilar G. What Fechner could not do: Separating perceptual encoding and decoding with difference scaling. J Vis 2024; 24:5. [PMID: 38722273 PMCID: PMC11090143 DOI: 10.1167/jov.24.5.5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2023] [Accepted: 02/29/2024] [Indexed: 05/15/2024] Open
Abstract
A key question in perception research is how stimulus variations translate into perceptual magnitudes, that is, the perceptual encoding process. As experimenters, we cannot probe perceptual magnitudes directly, but infer the encoding process from responses obtained in a psychophysical experiment. The most prominent experimental technique to measure perceptual appearance is matching, where observers adjust a probe stimulus to match a target in its appearance along the dimension of interest. The resulting data quantify the perceived magnitude of the target in physical units of the probe, and are thus an indirect expression of the underlying encoding process. In this paper, we show analytically and in simulation that data from matching tasks do not sufficiently constrain perceptual encoding functions, because there exist an infinite number of pairs of encoding functions that generate the same matching data. We use simulation to demonstrate that maximum likelihood conjoint measurement (Ho, Landy, & Maloney, 2008; Knoblauch & Maloney, 2012) does an excellent job of recovering the shape of ground truth encoding functions from data that were generated with these very functions. Finally, we measure perceptual scales and matching data for White's effect (White, 1979) and show that the matching data can be predicted from the estimated encoding functions, down to individual differences.
Collapse
Affiliation(s)
- Joris Vincent
- Computational Psychology, Technische Universität, Berlin, Germany
- https://www.psyco.tu-berlin.de/vincent.html
| | - Marianne Maertens
- Computational Psychology, Technische Universität, Berlin, Germany
- https://www.psyco.tu-berlin.de/maertens.html
| | - Guillermo Aguilar
- Computational Psychology, Technische Universität, Berlin, Germany
- https://www.psyco.tu-berlin.de/aguilar.html
| |
Collapse
|
3
|
Blakeslee B, McCourt ME. Isolation of brightness induction effects on target patches from adjacent surrounds and remote backgrounds. Front Hum Neurosci 2023; 16:1082059. [PMID: 36998921 PMCID: PMC10043223 DOI: 10.3389/fnhum.2022.1082059] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2022] [Accepted: 12/12/2022] [Indexed: 03/15/2023] Open
Abstract
The brightness (perceived intensity) of a region of visual space depends on its luminance and on the luminance of nearby regions. This phenomenon is called brightness induction and includes both brightness contrast and assimilation. Historically, and on a purely descriptive level, brightness contrast refers to a directional shift in target brightness away from the brightness of an adjacent region while assimilation refers to a brightness shift toward that of an adjacent region. In order to understand mechanisms, it is important to differentiate the descriptive terms contrast and assimilation from the optical and/or neural processes, often similarly named, which cause the effects. Experiment 1 isolated the effect on target patch (64 cd/m2) matching luminance (brightness) of six surround-ring widths (0.1°–24.5°) varied over 11 surround-ring luminances (32–96 cd/m2). Using the same observers, Experiment 2 examined the effect of the identical surround-ring parameters on target patch matching luminance in the presence of a dark (0.0 cd/m2) and a bright (96 cd/m2) remote background. By differencing the results of Experiment 1 (the isolated effect of the surround-ring) from those of Experiment 2 (the combined effect of the surround-ring with the dark and bright remote background) we further isolated the effect of the remote background. The results reveal that surround-rings and remote backgrounds produce brightness contrast effects in the target patch that are of the same or opposite polarity depending on the luminance polarity of these regions relative to target patch luminance. The strength of brightness contrast from the surround-ring varied with surround-ring luminance and width. Brightness contrast (darkening) in the target from the bright remote background was relatively constant in magnitude across all surround-ring luminances and increased in magnitude with decreasing surround-ring width. Brightness contrast (brightening) from the isolated dark remote background also increased in magnitude with decreasing surround-ring width: however, despite some regional flattening of the functions due to the fixed luminance of the dark remote background, induction magnitude was much reduced in the presence of a surround-ring of greater luminance than the target patch indicating a non-linear interaction between the dark remote background and surround-ring luminance.
Collapse
|
4
|
Nedimović P, Zdravković S, Domijan D. Empirical evaluation of computational models of lightness perception. Sci Rep 2022; 12:22039. [PMID: 36543784 PMCID: PMC9772371 DOI: 10.1038/s41598-022-22395-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2022] [Accepted: 10/13/2022] [Indexed: 12/24/2022] Open
Abstract
Lightness of a surface depends not only on its physical characteristics, but also on the properties of the surrounding context. As a result, varying the context can significantly alter surface lightness, an effect exploited in many lightness illusions. Computational models can produce outcomes similar to human illusory percepts, allowing for demonstrable assessment of the applied mechanisms and principles. We tested 8 computational models on 13 typical displays used in lightness research (11 Illusions and 2 Mondrians), and compared them with results from human participants (N = 85). Results show that HighPass and MIR models predict empirical results for simultaneous lightness contrast (SLC) and its close variations. ODOG and its newer variants (ODOG-2 and L-ODOG) in addition to SLC displays were able to predict effect of White's illusion. RETINEX was able to predict effects of both SLC displays and Dungeon illusion. Dynamic decorrelation model was able to predict obtained effects for all tested stimuli except two SLC variations. Finally, FL-ODOG model was best at simulating human data, as it was able to predict empirical results for all displays, bar the Reversed contrast illusion. Finally, most models underperform on the Mondrian displays that represent most natural stimuli for the human visual system.
Collapse
Affiliation(s)
- Predrag Nedimović
- Laboratory for Experimental Psychology, Department of Psychology, Faculty of Philosophy, University of Belgrade, Belgrade, Serbia.
| | - Sunčica Zdravković
- Laboratory for Experimental Psychology, Department of Psychology, Faculty of Philosophy, University of Belgrade, Belgrade, Serbia
- Laboratory for Experimental Psychology, Department of Psychology, Faculty of Philosophy, University of Novi Sad, Novi Sad, Serbia
| | - Dražen Domijan
- Department of Psychology, Faculty of Humanities and Social Sciences, University of Rijeka, Rijeka, Croatia
| |
Collapse
|
5
|
Abstract
Human vision relies on mechanisms that respond to luminance edges in space and time. Most edge models use orientation-selective mechanisms on multiple spatial scales and operate on static inputs assuming that edge processing occurs within a single fixational instance. Recent studies, however, demonstrate functionally relevant temporal modulations of the sensory input due to fixational eye movements. Here we propose a spatiotemporal model of human edge detection that combines elements of spatial and active vision. The model augments a spatial vision model by temporal filtering and shifts the input images over time, mimicking an active sampling scheme via fixational eye movements. The first model test was White's illusion, a lightness effect that has been shown to depend on edges. The model reproduced the spatial-frequency-specific interference with the edges by superimposing narrowband noise (1–5 cpd), similar to the psychophysical interference observed in White's effect. Second, we compare the model's edge detection performance in natural images in the presence and absence of Gaussian white noise with human-labeled contours for the same (noise-free) images. Notably, the model detects edges robustly against noise in both test cases without relying on orientation-selective processes. Eliminating model components, we demonstrate the relevance of multiscale spatiotemporal filtering and scale-specific normalization for edge detection. The proposed model facilitates efficient edge detection in (artificial) vision systems and challenges the notion that orientation-selective mechanisms are required for edge detection.
Collapse
Affiliation(s)
- Lynn Schmittwilken
- Science of Intelligence and Computational Psychology, Faculty of Electrical Engineering and Computer Science, Technische Universität Berlin, Berlin, Germany.,
| | - Marianne Maertens
- Science of Intelligence and Computational Psychology, Faculty of Electrical Engineering and Computer Science, Technische Universität Berlin, Berlin, Germany.,
| |
Collapse
|
6
|
Lerer A, Supèr H, Keil MS. Dynamic decorrelation as a unifying principle for explaining a broad range of brightness phenomena. PLoS Comput Biol 2021; 17:e1007907. [PMID: 33901165 PMCID: PMC8102013 DOI: 10.1371/journal.pcbi.1007907] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2020] [Revised: 05/06/2021] [Accepted: 04/06/2021] [Indexed: 11/29/2022] Open
Abstract
The visual system is highly sensitive to spatial context for encoding luminance patterns. Context sensitivity inspired the proposal of many neural mechanisms for explaining the perception of luminance (brightness). Here we propose a novel computational model for estimating the brightness of many visual illusions. We hypothesize that many aspects of brightness can be explained by a dynamic filtering process that reduces the redundancy in edge representations on the one hand, while non-redundant activity is enhanced on the other. The dynamic filter is learned for each input image and implements context sensitivity. Dynamic filtering is applied to the responses of (model) complex cells in order to build a gain control map. The gain control map then acts on simple cell responses before they are used to create a brightness map via activity propagation. Our approach is successful in predicting many challenging visual illusions, including contrast effects, assimilation, and reverse contrast with the same set of model parameters.
Collapse
Affiliation(s)
- Alejandro Lerer
- Departament de Cognició, Desenvolupament i Psicologia de l’Educació, Faculty of Psychology, University of Barcelona, Barcelona, Spain
| | - Hans Supèr
- Departament de Cognició, Desenvolupament i Psicologia de l’Educació, Faculty of Psychology, University of Barcelona, Barcelona, Spain
- Institut de Neurociències, Universitat de Barcelona, Barcelona, Spain
- Institut de Recerca Pediàtrica Hospital Sant Joan de Déu, Barcelona, Spain
- Catalan Institute for Advanced Studies (ICREA), Barcelona, Spain
| | - Matthias S. Keil
- Departament de Cognició, Desenvolupament i Psicologia de l’Educació, Faculty of Psychology, University of Barcelona, Barcelona, Spain
- Institut de Neurociències, Universitat de Barcelona, Barcelona, Spain
| |
Collapse
|
7
|
Blakeslee B, Padmanabhan G, McCourt ME. Dissecting the influence of the collinear and flanking bars in White's effect. Vision Res 2016; 127:11-17. [PMID: 27425384 DOI: 10.1016/j.visres.2016.07.001] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2015] [Revised: 06/29/2016] [Accepted: 07/05/2016] [Indexed: 11/18/2022]
Abstract
In White's effect equiluminant test patches placed on the black and white bars of a square-wave grating appear different in brightness. The illusion has generated intense interest because the direction of the brightness effect does not correlate with the amount of black or white border in contact with the test patch, or in its general vicinity. Therefore, unlike brightness induction effects such as simultaneous contrast, White's effect is not consistent with explanations based on contrast or assimilation that depend solely on the relative amounts of black and white surrounding the test patches. We independently manipulated the luminance of the collinear and flanking bars to investigate their influence on test patch matching luminance (brightness). The inducing grating was a 0.5c/d square-wave and test patches measured 1.0° in width and either 0.5° or 3.0° in height. Test patches measuring 0.5° in height had more extensive contact with the collinear bars and test patches measuring 3.0° in height had more extensive contact with the flanking bars. The luminance of the collinear (or flanking) bars assumed twenty values from 3.2 to 124.8cd/m(2), while the luminance of the flanking (or collinear) bars remained white (124.8cd/m(2)) or black (3.2cd/m(2)). Under these conditions the influence of the collinear and flanking bars was found to be purely in the direction of contrast. The effect was dominated by contrast from the collinear bars (which results in White's effect), however, the influence of the flanking bars was also in the contrast direction. The data elucidate the luminance relationships between the collinear and flanking bars which produce the behavior associated with White's effect as well as that associated with "the inverted White effect" which is akin to simultaneous contrast.
Collapse
Affiliation(s)
- Barbara Blakeslee
- Center for Visual and Cognitive Neuroscience, Department of Psychology, North Dakota State University, Fargo, ND 58105-5075, United States.
| | - Ganesh Padmanabhan
- Center for Visual and Cognitive Neuroscience, Department of Psychology, North Dakota State University, Fargo, ND 58105-5075, United States
| | - Mark E McCourt
- Center for Visual and Cognitive Neuroscience, Department of Psychology, North Dakota State University, Fargo, ND 58105-5075, United States
| |
Collapse
|