1
|
Baker DH, Summers RJ, Baldwin AS, Meese TS. A psychophysical performance-based approach to the quality assessment of image processing algorithms. PLoS One 2022; 17:e0267056. [PMID: 35511914 PMCID: PMC9071145 DOI: 10.1371/journal.pone.0267056] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2021] [Accepted: 03/31/2022] [Indexed: 11/23/2022] Open
Abstract
Image processing algorithms are used to improve digital image representations in either their appearance or storage efficiency. The merit of these algorithms depends, in part, on visual perception by human observers. However, in practice, most are assessed numerically, and the perceptual metrics that do exist are criterion sensitive with several shortcomings. Here we propose an objective performance-based perceptual measure of image quality and demonstrate this by comparing the efficacy of a denoising algorithm for a variety of filters. For baseline, we measured detection thresholds for a white noise signal added to one of a pair of natural images in a two-alternative forced-choice (2AFC) paradigm where each image was selected randomly from a set of n = 308 on each trial. In a series of experimental conditions, the stimulus image pairs were passed through various configurations of a denoising algorithm. The differences in noise detection thresholds with and without denoising are objective perceptual measures of the ability of the algorithm to render noise invisible. This was a factor of two (6dB) in our experiment and consistent across a range of filter bandwidths and types. We also found that thresholds in all conditions converged on a common value of PSNR, offering support for this metric. We discuss how the 2AFC approach might be used for other algorithms including compression, deblurring and edge-detection. Finally, we provide a derivation for our Cartesian-separable log-Gabor filters, with polar parameters. For the biological vision community this has some advantages over the more typical (i) polar-separable variety and (ii) Cartesian-separable variety with Cartesian parameters.
Collapse
Affiliation(s)
- Daniel H. Baker
- Department of Psychology and York Biomedical Research Institute, University of York, York, United Kingdom
- * E-mail:
| | - Robert J. Summers
- College of Health & Life Sciences, Aston University, Birmingham, United Kingdom
| | - Alex S. Baldwin
- Department of Ophthalmology & Visual Sciences, McGill Vision Research, McGill University, Montreal, Quebec, Canada
| | - Tim S. Meese
- College of Health & Life Sciences, Aston University, Birmingham, United Kingdom
| |
Collapse
|
2
|
Meese TS, Baker DH, Summers RJ. Perception of global image contrast involves transparent spatial filtering and the integration and suppression of local contrasts (not RMS contrast). ROYAL SOCIETY OPEN SCIENCE 2017; 4:170285. [PMID: 28989735 PMCID: PMC5627075 DOI: 10.1098/rsos.170285] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/28/2017] [Accepted: 07/26/2017] [Indexed: 06/07/2023]
Abstract
When adjusting the contrast setting on a television set, we experience a perceptual change in the global image contrast. But how is that statistic computed? We addressed this using a contrast-matching task for checkerboard configurations of micro-patterns in which the contrasts and spatial spreads of two interdigitated components were controlled independently. When the patterns differed greatly in contrast, the higher contrast determined the perceived global contrast. Crucially, however, low contrast additions of one pattern to intermediate contrasts of the other caused a paradoxical reduction in the perceived global contrast. None of the following metrics/models predicted this: max, linear sum, average, energy, root mean squared (RMS), Legge and Foley. However, a nonlinear gain control model, derived from contrast detection and discrimination experiments, incorporating wide-field summation and suppression, did predict the results with no free parameters, but only when spatial filtering was removed. We conclude that our model describes fundamental processes in human contrast vision (the pattern of results was the same for expert and naive observers), but that above threshold-when contrast pedestals are clearly visible-vision's spatial filtering characteristics become transparent, tending towards those of a delta function prior to spatial summation. The global contrast statistic from our model is as easily derived as the RMS contrast of an image, and since it more closely relates to human perception, we suggest it be used as an image contrast metric in practical applications.
Collapse
Affiliation(s)
- Tim S. Meese
- School of Life and Health Sciences, Aston University, Birmingham B4 7ET, UK
| | - Daniel H. Baker
- School of Life and Health Sciences, Aston University, Birmingham B4 7ET, UK
- Department of Psychology, University of York, York YO10 5DD, UK
| | - Robert J. Summers
- School of Life and Health Sciences, Aston University, Birmingham B4 7ET, UK
| |
Collapse
|
3
|
|
4
|
Huang PC, Chen CC. Contrast Gain Control in Plaid Pattern Detection. PLoS One 2016; 11:e0164171. [PMID: 27764119 PMCID: PMC5072603 DOI: 10.1371/journal.pone.0164171] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2016] [Accepted: 09/21/2016] [Indexed: 11/18/2022] Open
Abstract
A plaid is a combination of two gratings whose orientations are orthogonal to each other with the same or similar contrasts. We used plaid patterns as stimuli to investigate the mechanisms underlying the detection of a plaid to understand how the visual system combines information from orientation-selective channels. We used a masking paradigm in which an observer was required to detect a target (either a spiral or a plaid) superimposed on a pedestal. We measured the target threshold versus pedestal contrast (TvC) functions at 7 pedestal contrasts for various target-pedestal combinations with a temporal 2AFC paradigm and a staircase procedure. All TvC functions, except the one with an orthogonal spiral pedestal, showed a dipper shape, although the position of the dip and the slope varied across conditions. The result can be explained by a multiple-mechanism divisive inhibition model, which contains several orientation-selective mechanisms. The response of each mechanism is the excitation of a linear filter divided by a broadband inhibitory input. The threshold is determined by a nonlinear combination of the responses of those mechanisms. Alternative models with mechanism(s) specific for plaid did not provide a better description of the data. Thus, a plaid pattern is mediated by a combination of orientation-selective mechanisms. An early plaid-specific mechanism is not necessary for plaid detection.
Collapse
Affiliation(s)
- Pi-Chun Huang
- Department of Psychology, National Cheng Kung University, Tainan, Taiwan
| | - Chien-Chung Chen
- Department of Psychology, National Taiwan University, Taipei, Taiwan
- * E-mail:
| |
Collapse
|
5
|
Neuronal basis of perceptual learning in striate cortex. Sci Rep 2016; 6:24769. [PMID: 27094565 PMCID: PMC4837366 DOI: 10.1038/srep24769] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2016] [Accepted: 03/30/2016] [Indexed: 11/08/2022] Open
Abstract
It is well known that, in humans, contrast sensitivity training at high spatial frequency (SF) not only leads to contrast sensitivity improvement, but also results in an improvement in visual acuity as assessed with gratings (direct effect) or letters (transfer effect). However, the underlying neural mechanisms of this high spatial frequency training improvement remain to be elucidated. In the present study, we examined four properties of neurons in primary visual cortex (area 17) of adult cats that exhibited significantly improved acuity after contrast sensitivity training with a high spatial frequency grating and those of untrained control cats. We found no difference in neuronal contrast sensitivity or tuning width (Width) between the trained and untrained cats. However, the trained cats showed a displacement of the cells’ optimal spatial frequency (OSF) to higher spatial frequencies as well as a larger neuronal signal-to-noise ratio (SNR). Furthermore, both the neuronal differences in OSF and SNR were significantly correlated with the improvement of acuity measured behaviorally. These results suggest that striate neurons might mediate the perceptual learning-induced improvement for high spatial frequency stimuli by an alteration in their spatial frequency representation and by an increased SNR.
Collapse
|
6
|
Abstract
This project examined whether previous visual history can bias perceptual dominance during onset rivalry. A predictive sequence of non-rivalrous stimuli preceded dichoptically presented rivalrous displays. One of the dichoptic images was the implied next step of the preceding sequence while the other was not. Observers reported their initial dominant percept. Across five experiments, we found that motion sequences biased perceptual selection such that a rivalrous stimulus that continued a motion sequence tended to dominate one that did not. However, signals generated by complex pattern of motion information or verbal-semantic information had no influence on selection. These results are consistent with the view that onset rivalry is an early phase of rivalry that is likely insensitive to modulation by factors originating beyond the visual system.
Collapse
|
7
|
Perry G. The effects of cross-orientation masking on the visual gamma response in humans. Eur J Neurosci 2015; 41:1484-95. [DOI: 10.1111/ejn.12900] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2014] [Accepted: 03/18/2015] [Indexed: 11/28/2022]
Affiliation(s)
- Gavin Perry
- Cardiff University Brain Imaging Centre (CUBRIC); School of Psychology; Cardiff University; 70 Park Place Cardiff CF10 3AT UK
- Institute of Psychological Medicine and Clinical Neurosciences; School of Medicine; Cardiff University; Cardiff UK
| |
Collapse
|
8
|
Watt R. Edges, curvature, and primal sketches. Perception 2013; 41:1092-115. [PMID: 23409374 DOI: 10.1068/p7308] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Abstract
Marr described two versions of the primal sketch: the basic image-processing level in human vision. In line with his broader view of how one should construct explanatory theories in vision, he provided some details of the computational theory for this stage, the algorithms used, and how they might be implemented in neural systems. In this paper I consider how Marr ideas have continued over the past 30 years. In this regard, I pay particular attention to three stages: locating edges; describing edge curvature; linking local edge segments into elongated contours.
Collapse
Affiliation(s)
- Roger Watt
- School of Natural Sciences, University of Stirling, Stirling FK9 4LA, UK.
| |
Collapse
|
9
|
Meese TS, Baker DH. A common rule for integration and suppression of luminance contrast across eyes, space, time, and pattern. Iperception 2013; 4:1-16. [PMID: 23799184 PMCID: PMC3690412 DOI: 10.1068/i0556] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2012] [Revised: 11/30/2012] [Indexed: 10/27/2022] Open
Abstract
Visual perception begins by dissecting the retinal image into millions of small patches for local analyses by local receptive fields. However, image structures extend well beyond these receptive fields and so further processes must be involved in sewing the image fragments back together to derive representations of higher order (more global) structures. To investigate the integration process, we also need to understand the opposite process of suppression. To investigate both processes together, we measured triplets of dipper functions for targets and pedestals involving interdigitated stimulus pairs (A, B). Previous work has shown that summation and suppression operate over the full contrast range for the domains of ocularity and space. Here, we extend that work to include orientation and time domains. Temporal stimuli were 15-Hz counter-phase sine-wave gratings, where A and B were the positive and negative phases of the oscillation, respectively. For orientation, we used orthogonally oriented contrast patches (A, B) whose sum was an isotropic difference of Gaussians. Results from all four domains could be understood within a common framework in which summation operates separately within the numerator and denominator of a contrast gain control equation. This simple arrangement of summation and counter-suppression achieves integration of various stimulus attributes without distorting the underlying contrast code.
Collapse
Affiliation(s)
- Tim S Meese
- School of Life and Health Sciences, Aston University, Aston Triangle, Birmingham B4 7ET, UK; e-mail:
| | | |
Collapse
|
10
|
McDonald JS, Mannion DJ, Clifford CWG. Gain control in the response of human visual cortex to plaids. J Neurophysiol 2012; 107:2570-80. [PMID: 22378166 DOI: 10.1152/jn.00616.2011] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
A recent intrinsic signal optical imaging study in tree shrew showed, surprisingly, that the population response of V1 to plaid patterns comprising grating components of equal contrast is predicted by the average of the responses to the individual components (MacEvoy SP, Tucker TR, Fitzpatrick D. Nat Neurosci 12: 637-645, 2009). This prompted us to compare responses to plaids and gratings in human visual cortex as a function of contrast and orientation. We found that the functional MRI (fMRI) blood oxygenation level-dependent (BOLD) responses of areas V1-V3 to a plaid comprising superposed grating components of equal contrast are significantly higher than the responses to a single component. Furthermore, the orientation response profile of a plaid is poorly predicted from a linear combination of the responses to its components. Together, these results indicate that the model of MacEvoy et al. (2009) cannot, without modification, account for the fMRI BOLD response to plaids in human visual cortex.
Collapse
Affiliation(s)
- J Scott McDonald
- School of Psychology, The University of New South Wales, Sydney, Australia
| | | | | |
Collapse
|
11
|
Paffen CLE, Alais D. Attentional modulation of binocular rivalry. Front Hum Neurosci 2011; 5:105. [PMID: 22046156 PMCID: PMC3202227 DOI: 10.3389/fnhum.2011.00105] [Citation(s) in RCA: 62] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2011] [Accepted: 09/08/2011] [Indexed: 11/13/2022] Open
Abstract
Ever since Wheatstone initiated the scientific study of binocular rivalry, it has been debated whether the phenomenon is under attentional control. In recent years, the issue of attentional modulation of binocular rivalry has seen a revival. Here we review the classical studies as well as recent advances in the study of attentional modulation of binocular rivalry. We show that (1) voluntary control over binocular rivalry is possible, yet limited, (2) both endogenous and exogenous attention influence perceptual dominance during rivalry, (3) diverting attention from rival displays does not arrest perceptual alternations, and that (4) rival targets by themselves can also attract attention. From a theoretical perspective, we suggest that attention affects binocular rivalry by modulating the effective contrast of the images in competition. This contrast enhancing effect of top-down attention is counteracted by a response attenuating effect of neural adaptation at early levels of visual processing, which weakens the response to the dominant image. Moreover, we conclude that although frontal and parietal brain areas involved in both binocular rivalry and visual attention overlap, an adapting reciprocal inhibition arrangement at early visual cortex is sufficient to trigger switches in perceptual dominance independently of a higher-level "selection" mechanisms. Both of these processes are reciprocal and therefore self-balancing, with the consequence that complete attentional control over binocular rivalry can never be realized.
Collapse
Affiliation(s)
- Chris L. E. Paffen
- Department of Experimental Psychology, Helmholtz Institute Utrecht, Utrecht UniversityUtrecht, Netherlands
| | - David Alais
- School of Psychology, University of Sydney, SydneyNSW, Australia
| |
Collapse
|
12
|
Abstract
When two different images are presented to the two eyes, the percept will alternate between the images (a phenomenon called binocular rivalry). In the present study, we investigate the degree to which such interocular conflict is conspicuous. By using a visual search task, we show that search for interocular conflict is near efficient (15 ms/item) and can lead to a search asymmetry, depending on the contrast in the display. We reconcile our findings with those of Wolfe and Franzel (1988), who reported inefficient search for interocular conflict (26 ms/item) and found no evidence for a search asymmetry. In addition, we provide evidence for the suggestion that differences in search for interocular conflict are contingent on the degree of abnormal fusion of the dissimilar images.
Collapse
|
13
|
Langley K, Anderson SJ. The Riesz transform and simultaneous representations of phase, energy and orientation in spatial vision. Vision Res 2011; 50:1748-65. [PMID: 20685326 DOI: 10.1016/j.visres.2010.05.031] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2010] [Revised: 05/13/2010] [Accepted: 05/17/2010] [Indexed: 11/25/2022]
Abstract
To represent the local orientation and energy of a 1-D image signal, many models of early visual processing employ bandpass quadrature filters, formed by combining the original signal with its Hilbert transform. However, representations capable of estimating an image signal's 2-D phase have been largely ignored. Here, we consider 2-D phase representations using a method based upon the Riesz transform. For spatial images there exist two Riesz transformed signals and one original signal from which orientation, phase and energy may be represented as a vector in 3-D signal space. We show that these image properties may be represented by a Singular Value Decomposition (SVD) of the higher-order derivatives of the original and the Riesz transformed signals. We further show that the expected responses of even and odd symmetric filters from the Riesz transform may be represented by a single signal autocorrelation function, which is beneficial in simplifying Bayesian computations for spatial orientation. Importantly, the Riesz transform allows one to weight linearly across orientation using both symmetric and asymmetric filters to account for some perceptual phase distortions observed in image signals - notably one's perception of edge structure within plaid patterns whose component gratings are either equal or unequal in contrast. Finally, exploiting the benefits that arise from the Riesz definition of local energy as a scalar quantity, we demonstrate the utility of Riesz signal representations in estimating the spatial orientation of second-order image signals. We conclude that the Riesz transform may be employed as a general tool for 2-D visual pattern recognition by its virtue of representing phase, orientation and energy as orthogonal signal quantities.
Collapse
Affiliation(s)
- Keith Langley
- Cognitive, Perceptual and Brain Sciences, University College London, London, UK.
| | | |
Collapse
|
14
|
Hancock S, McGovern DP, Peirce JW. Ameliorating the combinatorial explosion with spatial frequency-matched combinations of V1 outputs. J Vis 2010; 10:7. [PMID: 20884582 DOI: 10.1167/10.8.7] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open
Abstract
Little is known about the way in which the outputs of early orientation-selective neurons are combined. One particular problem is that the number of possible combinations of these outputs greatly outweighs the number of processing units available to represent them. Here we consider two of the possible ways in which the visual system might reduce the impact of this problem. First, the visual system might ameliorate the problem by collapsing across some low-level feature coded by previous processing stages, such as spatial frequency. Second, the visual system may combine only a subset of available outputs, such as those with similar receptive field characteristics. Using plaid-selective contrast adaptation and the curvature aftereffect, we found no evidence for the former solution; both aftereffects were clearly tuned to the spatial frequency of the adaptor relative to the test probe. We did, however, find evidence for the latter with both aftereffects; when the components forming our compound stimuli were dissimilar in spatial frequency, the effects of adapting to them were substantially reduced. This has important implications for mid-level visual processing, both for the combinatorial explosion and for the selective "binding" of common features that are perceived as coming from a single visual object.
Collapse
Affiliation(s)
- Sarah Hancock
- Nottingham Visual Neuroscience, School of Psychology, University of Nottingham, Nottingham, UK.
| | | | | |
Collapse
|
15
|
McGovern DP, Peirce JW. The spatial characteristics of plaid-form-selective mechanisms. Vision Res 2010; 50:796-804. [PMID: 20123109 PMCID: PMC2877803 DOI: 10.1016/j.visres.2010.01.018] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2009] [Revised: 01/26/2010] [Accepted: 01/27/2010] [Indexed: 12/02/2022]
Abstract
Rather little is known about the mechanisms that combine the outputs of orientation- and spatial frequency-selective channels. These can be studied by measuring the selective adaptation to compound stimuli over and above that expected from adaptation to the components alone (Peirce & Taylor, 2006). Here we investigated the contrast- and spatial phase-dependency of such mechanisms. A plaid was adapted in one visual hemi-field, while its constituent gratings were simultaneously adapted in the other hemi-field. Plaid-selective adaptation was most apparent with high-contrast probes, whereas adaptation to the component grating stimuli dominated at low contrasts. The mechanisms underlying this plaid-selective adaptation also appear to be insensitive to the spatial phase of the probes relative to the adaptor, whereas we find a clear phase-dependency for suprathreshold contrast adaptation to grating stimuli. These findings suggest that the visual system is equipped with mechanisms that conduct a global analysis of the plaid pattern, which are likely derived from the non-linear outputs of V1 complex cells.
Collapse
Affiliation(s)
- David P McGovern
- Nottingham Visual Neuroscience, School of Psychology, University of Nottingham, Nottingham NG7 2RD, UK.
| | | |
Collapse
|
16
|
Peeling plaids apart: context counteracts cross-orientation contrast masking. PLoS One 2009; 4:e8123. [PMID: 19956546 PMCID: PMC2780729 DOI: 10.1371/journal.pone.0008123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2009] [Accepted: 09/25/2009] [Indexed: 11/19/2022] Open
Abstract
Background Contrast discrimination for an image is usually harder if another image is superimposed on top. We asked whether such contrast masking may be enhanced or relieved depending on cues promoting integration of both images as a single pattern, versus segmentation into two independent components. Methodology & Principal Findings Contrast discrimination thresholds for a foveal test grating were sharply elevated in the presence of a perfectly overlapping orthogonally-oriented mask grating. However thresholds returned to the unmasked baseline when a surround grating was added, having the same orientation and phase of either the test or mask grating. Both such masking and ‘unmasking’ effects were much stronger for moving than static stimuli. Conclusions & Significance Our results suggest that common-fate motion reinforces the perception of a single coherent plaid pattern, while the surround helps to identify each component independently, thus peeling the plaid apart again. These results challenge current models of early vision, suggesting that higher-level surface organization influences contrast encoding, determining whether the contrast of a grating may be recovered independently from that of its mask.
Collapse
|
17
|
Stevens M, Cuthill IC. Disruptive coloration, crypsis and edge detection in early visual processing. Proc Biol Sci 2006; 273:2141-7. [PMID: 16901833 PMCID: PMC1635512 DOI: 10.1098/rspb.2006.3556] [Citation(s) in RCA: 161] [Impact Index Per Article: 8.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Many animals use concealing markings to reduce the risk of predation. These include background pattern matching (crypsis), where the coloration matches a random sample of the background and disruptive patterns, whose effectiveness has been hypothesized to lie in breaking up the body into a series of apparently unrelated objects. We have previously established the effectiveness of disruptive coloration against avian predators, using artificial moth-like stimuli with colours designed to match natural backgrounds as perceived by birds. Here, we investigate the mechanism by which disruptive patterns reduce detectability, using a computational vision model of edge detection applied to photographs of our experimental stimuli, calibrated for bird colour vision. We show that, disruptive coloration is effective by exploiting edge detection algorithms that we use to model early visual processing. Thus, 'false' edges are detected within the body rather than at its periphery, so inhibiting successful detection of the animal's body outline.
Collapse
Affiliation(s)
- Martin Stevens
- School of Biological Sciences, University of Bristol, Woodland Road, Bristol BS8 1UG, UK.
| | | |
Collapse
|
18
|
Peirce JW, Taylor LJ. Selective mechanisms for complex visual patterns revealed by adaptation. Neuroscience 2006; 141:15-8. [PMID: 16753271 DOI: 10.1016/j.neuroscience.2006.04.039] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2006] [Revised: 04/13/2006] [Accepted: 04/14/2006] [Indexed: 11/27/2022]
Abstract
A great deal is known about the initial steps of visual processing. We know that humans have neural mechanisms selectively tuned to simple patterns of particular spatial frequencies and orientations. We also know that much later in the visual pathway, in inferotemporal cortex, cells respond to extremely complex visual patterns such as images of faces. Very little is known about intermediate levels of visual processing, where early visual signals are presumably combined to represent increasingly complex visual features. Here we show the existence of visual mechanisms in humans, tuned and selective to particular combinations of simple sinusoidal patterns, using a novel method of compound adaptation.
Collapse
Affiliation(s)
- J W Peirce
- Visual Neuroscience Group, School of Psychology, University of Nottingham, University Park, Nottingham NG7 2RD, UK.
| | | |
Collapse
|
19
|
Takeuchi T. The effect of eccentricity and the adapting level on the café wall illusion. PERCEPTION & PSYCHOPHYSICS 2006; 67:1113-27. [PMID: 16502834 DOI: 10.3758/bf03193545] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
The café wall pattern is composed of rows of alternating light and dark tiles, and alternate rows are shifted by one fourth of a cycle. The rows of tiles are separated by narrow horizontal mortar lines whose luminance is between those of the dark and the light tiles. Although the mortar lines are physically parallel, they are perceived to be tilted, which is known as the café wall illusion. In this study, an energy-based model for encoding orientation is implemented in order to estimate the strength of the café wall illusion, and it is shown that the estimated orientation depends on the spatial frequency to which each orientation-encoding unit is tuned. The estimation of mortar line orientation from an orientation-encoding unit tuned to a lower spatial frequency was greater than that from a unit tuned to a higher spatial frequency. It is assumed that the perceived mortar line orientation is the result of an integration of responses from the orientation-encoding units tuned to various spatial frequencies. This leads to the prediction that under viewing conditions in which responses from orientation-encoding units tuned to a higher spatial frequency are presumably weakened, the strength of the café wall illusion increases. In agreement with this prediction, it is shown that the café wall illusion is stronger when the café wall image is presented at the periphery or is observed under low luminance levels. On the other hand, the weighted averaging of the estimated mortar orientations across spatial frequencies overestimates the perceived orientation of the mortar lines. This suggests that the final percept of the café wall illusion could be determined by some kind of nonlinear interaction, such as an inhibitory interaction, between orientation-encoding units.
Collapse
|
20
|
Solomon JA, John A, Morgan MJ. Monocular texture segmentation and proto-rivalry. Vision Res 2005; 46:1488-92. [PMID: 16102795 DOI: 10.1016/j.visres.2005.07.002] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2005] [Revised: 06/28/2005] [Accepted: 07/03/2005] [Indexed: 11/20/2022]
Abstract
When the right eye's target is the left eye's distracter and vice versa, orientation-defined search is impossible unless, as we show here, the elements are close together. More than 1s was required to find inverse-cyclopean texture boundaries when elements were arranged on a 16 x 16 grid. Less than 250 ms was required for a 24 x 24 grid covering the same area. The conventional view is that binocular rivalry requires at least 200 ms to develop, but our results suggest a more rapid access to monocular signals. We call this rapid form of access "proto-rivalry."
Collapse
Affiliation(s)
- Joshua A Solomon
- Department of Optometry and Visual Science, City University, London EC1V 0HB, UK.
| | | | | |
Collapse
|
21
|
Abstract
We explored top-down modulation of spatial frequency (SF) processing. When auditory pre-cueing directed observers' attention to one of two 4-octaves (SF) apart plaid components observers tended to perceive the cued component, suggesting selective attention to the SF channel they expected to carry task relevant information. In agreement, pre-cueing had no effect with components often processed by the same channel (0.5-octaves apart). Further, effects of expectancy were greater than of uncertainty and were SF tuned. Combined our findings suggest top-down modulation of early, cortical, SF processing. We argue this could similarly explain the previously reported influences of categorisation on SF processing.
Collapse
Affiliation(s)
- Paul T Sowden
- Department of Psychology, University of Surrey, Guildford, Surrey GU2 7XH, UK.
| | | | | | | |
Collapse
|
22
|
Sowden PT, Rose D, Davies IRL. Perceptual learning of luminance contrast detection: specific for spatial frequency and retinal location but not orientation. Vision Res 2002; 42:1249-58. [PMID: 12044757 DOI: 10.1016/s0042-6989(02)00019-6] [Citation(s) in RCA: 101] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Performance of a wide range of simple visual tasks improves with practice. Here we ask whether such learning occurs for the fundamental visual task of luminance contrast detection. In two experiments we find that contrast sensitivity increases following extensive practice at detecting briefly presented sinusoidal luminance gratings and that learning is maintained after six months. Learning is spatial frequency tuned, specific to retinal location and can be specific to one eye, but is not selective for orientation. The selectivity of learning implies that it is based on plasticity in early visual, as opposed to central cognitive, processing mechanisms.
Collapse
Affiliation(s)
- Paul T Sowden
- Department of Psychology, University of Surrey, Guildford, Surrey GU2 7XH, UK.
| | | | | |
Collapse
|
23
|
Meese TS, Hess RF, Williams CB. Spatial coherence does not affect contrast discrimination for multiple Gabor stimuli. Perception 2002; 30:1411-22. [PMID: 11817748 DOI: 10.1068/p3159] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
Abstract
Gestalt grouping rules imply a process or mechanism for grouping together local features of an object into a perceptual whole. Several psychophysical experiments have been interpreted as evidence for constrained interactions between nearby spatial filter elements and this has led to the hypothesis that element linking might be mediated by these interactions. A common tacit assumption is that these interactions result in response modulation which disturbs a local contrast code. We addressed this possibility by performing contrast discrimination experiments using two-dimensional arrays of multiple Gabor patches arranged either (i) vertically, (ii) in circles (coherent conditions), or (iii) randomly (incoherent condition), as well as for a single Gabor patch. In each condition, contrast increments were applied to either the entire test stimulus (experiment 1) or a single patch whose position was cued (experiment 2). In experiment 3, the texture stimuli were reduced to a single contour by displaying only the central vertical strip. Performance was better for the multiple-patch conditions than for the single-patch condition, but whether the multiple-patch stimulus was coherent or not had no systematic effect on the results in any of the experiments. We conclude that constrained local interactions do not interfere with a local contrast code for our suprathreshold stimuli, suggesting that, in general, this is not the way in which element linking is achieved. The possibility that interactions are involved in enhancing the detectability of contour elements at threshold remains unchallenged by our experiments.
Collapse
Affiliation(s)
- T S Meese
- Neuroscences Research Institute, Aston University, Birmingham, UK.
| | | | | |
Collapse
|
24
|
Abstract
Two orthogonal patterns presented to the two eyes, respectively, are perceived as alternating in time, a phenomenon often assumed to reflect competition between neuronal activities corresponding to the two eyes, presumably in the primary visual cortex. Recent evidence supports a competition between neuronal activities corresponding to the two patterns (objects) at some higher cortical processing stage after inputs from the two eyes have converged. Here, using textures made of Gabor signals, we present psychophysical data showing that the level of visual processing at which competition takes place and is resolved, is determined by the degree of stimulus coherence. Moreover, depending on stimulus parameters, competition may occur at several levels of processing at the same time.
Collapse
Affiliation(s)
- Y Bonneh
- Keck Center for Integrative Neuroscience, University of California, San Francisco, California 94143, USA.
| | | | | |
Collapse
|
25
|
Abstract
Orientation detection and discrimination thresholds were measured for Gabor 'envelopes' formed from contrast-modulation of luminance 'carriers'. Consistent with previous research differences between carrier and envelope orientation had no effect on sensitivity to envelopes. Using plaid carriers in which the proportion of contrast modulation 'carried' by each plaid component was systematically manipulated, it was shown that this tolerance to carrier-envelope orientation difference reflects linear summation across orientation indicative of a single second-stage channel coding for contrast-defined structure. That contrast envelopes did not exhibit linear summation across spatial-frequency, nor across combinations of orientation and spatial-frequency differences, suggests that these second-order channels operate only within certain spatial scales. Using arrays of Gabor micropatterns as carriers in which the orientation distribution of the carriers was manipulated independently of the difference between envelope orientation and mean carrier orientation, it was further demonstrated that the locus of orientation integration must occur prior to envelope detection. In the context of two-stage models that incorporate a non-linearity between the stages, the pattern of results obtained is consistent with the operation of an orientation pooling process between first-stage and second-stage channels, analogous to having all filters of the first-stage feed into all filters of the second-stage within the same spatial-frequency band.
Collapse
Affiliation(s)
- A J Mussap
- School of Psychology, Deakin University, 221 Burwood Highway, Melbourne, 3125, Australia.
| |
Collapse
|
26
|
Smith S, Wenderoth P, van der Zwan R. Orientation processing mechanisms revealed by the plaid tilt illusion. Vision Res 2001; 41:483-94. [PMID: 11166051 DOI: 10.1016/s0042-6989(00)00268-6] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
The tilt after-effect (TAE) and tilt illusion (TI) have revealed a great deal about the nature of orientation coding of 1-dimensional (1D) lines and gratings. Comparatively little research however has addressed the mechanisms responsible for encoding the orientation of 2-dimensional (2D) plaid stimuli. A multi-stage model of edge detection has recently been proposed [Georgeson, M. A. (1998) Image & Vision Computing, 16(6-7), 389-405] to account for the perceived structure of a plaid stimulus that incorporates extraction of the zero-crossings (ZCs) of the plaid. Data is presented showing that the ZCs of a plaid inducing stimulus can interact with vertical grating test stimulus to induce a standard tilt illusion. However, by considering the second-order structure of a plaid rather than ZCs, it was shown that the perceived orientation of the vertical test grating results from the combination of orientation illusions due to the first- and second-order components of an inducing plaid. The data suggest that the mechanisms encoding the orientation of second-order contours are similar to, and interact directly with, those that encode first-order contours.
Collapse
Affiliation(s)
- S Smith
- Flight Management and Human Factors Division, NASA Ames Research Center, MS 262-2, Moffett Field, CA 94035-1000, USA.
| | | | | |
Collapse
|
27
|
Abstract
In the beginning there was light, and form, and visual mechanisms. This paper traces developments in research on spatial vision over the 20 years of ECVP, with particular emphasis on (1) hyperacuity, (2) peripheral vision, (3) amblyopia and development, and (4) learning and plasticity.
Collapse
Affiliation(s)
- D M Levi
- College of Optometry, University of Houston, TX 77204-6052, USA.
| |
Collapse
|
28
|
Georgeson MA, Meese TS. Adaptive filtering in spatial vision: evidence from feature marking in plaids. Perception 2000; 28:687-702. [PMID: 10664764 DOI: 10.1068/p2836] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]
Abstract
Much evidence shows that early vision employs an array of spatial filters tuned for different spatial frequencies and orientations. We suggest that for moderately low spatial frequencies these preliminary filters are not treated independently, but are used to perform grouping and segmentation in the patchwise Fourier domain. For example, consider a stationary plaid made from two superimposed sinusoidal gratings of the same contrast and spatial frequency oriented +/- 45 degrees from vertical. Most of the energy in a wavelet-like (e.g. simple-cell) transform of this stimulus is in the oblique orientations, but typically it looks like a compound structure containing blurred vertical and horizontal edges. This checkerboard structure corresponds with the locations of zero crossings in the output of an isotropic (circular) filter, synthesised from the linear sum of a set of oriented basis-filters (Georgeson, 1992 Proceedings of the Royal Society of London, Series B 249 235-245). However, the addition of a third harmonic in square-wave phase causes almost complete perceptual segmentation of the plaid into two overlapping oblique gratings. Here we confirm this result psychophysically using a feature-marking technique, and argue that this perceptual segmentation cannot be understood in terms of the zero crossings marked in the output of any static linear filter that is sensitive to all of the plaid's components. If it is assumed that zero crossings or similar are an appropriate feature-primitive in human vision, our results require a flexible process that combines and segments early basis-filters according to prevailing image conditions. Thus, we suggest that combination and segmentation of spatial filters in the patchwise Fourier domain underpins the perceptual segmentation observed in our experiments. Under this kind of image-processing scheme, registration across spatial scales occurs at the level of spatial filters, before features are extracted. This contrasts with many previous schemes where feature correspondence is required between spatial edge-maps at different spatial scales.
Collapse
Affiliation(s)
- M A Georgeson
- Cognitive Science Research Centre, School of Psychology, University of Birmingham, Edgbaston, UK.
| | | |
Collapse
|
29
|
Pardhan S, Rose D. Binocular and monocular detection of Gabor patches in binocular two-dimensional noise. Perception 2000; 28:203-15. [PMID: 10615461 DOI: 10.1068/p2739] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
Abstract
Contrast thresholds for detecting sine-wave Gabor patches in two-dimensional externally added random-pixel noise were measured. Thresholds were obtained for monocular and binocular signals in the presence of spatial correlated (identical) and uncorrelated (independent) noise in the two eyes. Measurements were obtained at four different spectral densities of noise (including zero). Thresholds were higher for monocular stimuli than for binocular, and higher in the presence of correlated noise compared to uncorrelated noise. The magnitude of binocular summation, similar in correlated and uncorrelated noise, decreased with increasing noise strength. The independent contributions of internal noise and sampling efficiency to detection were analysed. Sampling efficiencies were higher for binocular than for monocular viewing for both types of noise, with values being higher with uncorrelated noise. Binocular stimuli showed a lower equivalent noise level compared to the mean monocular case for both types of noise.
Collapse
Affiliation(s)
- S Pardhan
- Department of Optometry, University of Bradford, West Yorkshire, UK
| | | |
Collapse
|
30
|
Abstract
Spatial interactions among orientation-tuned gain control processes are presumed to mediate center-surround contrast-contrast phenomena. In this paper, we assess contributions of gain control processes that pool over orientation. We measured the apparent contrast of a luminance-modulated center disk embedded in various modulated surrounds. In all conditions, observers compared the apparent contrast of the test center to an identically modulated disk with no surround. When center and surround are simple, vertical sinusoids and presented in phase, suppression depends upon surround contrast and is marked at high contrasts. When components are presented 180 degrees out of phase, no suppression occurs at any contrast. When a horizontal component is added to the surround, much less suppression occurs. However, strong suppression is reinstated when both center and surround are plaids. Neither of the latter two effects are phase dependent. We suggest that two different sources of gain control are revealed by the simple sinusoidal and the plaid stimuli. One is orientation tuned and phase-dependent. The other pools over all orientations and includes neurons tuned to multiple phases.
Collapse
Affiliation(s)
- L A Olzak
- Department of Psychology, Miami University of Ohio, Oxford 45056, USA.
| | | |
Collapse
|