1
|
DiMattina C. Second-order boundaries segment more easily when they are density-defined rather than feature-defined. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.07.10.548431. [PMID: 37502940 PMCID: PMC10369903 DOI: 10.1101/2023.07.10.548431] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/29/2023]
Abstract
Previous studies have demonstrated that density is an important perceptual aspect of textural appearance to which the visual system is highly attuned. Furthermore, it is known that density cues not only influence texture segmentation, but can enable segmentation by themselves, in the absence of other cues. A popular computational model of texture segmentation known as the "Filter-Rectify-Filter" (FRF) model predicts that density should be a second-order cue enabling segmentation. For a compound texture boundary defined by superimposing two single-micropattern density boundaries, a version of the FRF model in which different micropattern-specific channels are analyzed separately by different second-stage filters makes the prediction that segmentation thresholds should be identical in two cases: (1) Compound boundaries with an equal number of micropatterns on each side but different relative proportions of each variety (compound feature boundaries) and (2) Compound boundaries with different numbers of micropatterns on each side, but with each side having an identical number of each variety (compound density boundaries). We directly tested this prediction by comparing segmentation thresholds for second-order compound feature and density boundaries, comprised of two superimposed single-micropattern density boundaries comprised of complementary micropattern pairs differing either in orientation or contrast polarity. In both cases, we observed lower segmentation thresholds for compound density boundaries than compound feature boundaries, with identical results when the compound density boundaries were equated for RMS contrast. In a second experiment, we considered how two varieties of micropatterns summate for compound boundary segmentation. In the case where two single micro-pattern density boundaries are superimposed to form a compound density boundary, we find that the two channels combine via probability summation. By contrast, when they are superimposed to form a compound feature boundary, segmentation performance is worse than for either channel alone. From these findings, we conclude that density segmentation may rely on neural mechanisms different from those which underlie feature segmentation, consistent with recent findings suggesting that density comprises a separate psychophysical 'channel'.
Collapse
Affiliation(s)
- Christopher DiMattina
- Computational Perception Laboratory, Florida Gulf Coast University, Fort Myers, FL, USA 33965-6565
- Department of Psychology, Florida Gulf Coast University, Fort Myers, FL, USA 33965-6565
| |
Collapse
|
2
|
DiMattina C. Luminance texture boundaries and luminance step boundaries are segmented using different mechanisms. Vision Res 2022; 190:107968. [PMID: 34794083 PMCID: PMC8712411 DOI: 10.1016/j.visres.2021.107968] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2021] [Revised: 10/18/2021] [Accepted: 10/28/2021] [Indexed: 01/03/2023]
Abstract
In natural scenes, two adjacent surfaces may differ in mean luminance without any sharp change in luminance at their boundary, but rather due to different relative proportions of light and dark regions within each surface. We refer to such boundaries as luminance texture boundaries (LTBs), and in this study we investigate whether LTBs are segmented using different mechanisms than luminance step boundaries (LSBs). We develop a novel method to generate luminance texture boundaries from natural uniform textures, and using these natural LTB stimuli in a boundary segmentation task, we find that observers are much more sensitive to identical luminance differences which are defined by textures (LTBs) than by uniform luminance steps (LSBs), consistent with the possibility of different mechanisms. In a second and third set of experiments, we characterize observer performance segmenting natural LTBs in the presence of masking LSBs which observers are instructed to ignore. We show that there is very little effect of masking LSBs on LTB segmentation performance. Furthermore, any masking effects we find are far less than those observed in a control experiment where both the masker and target are LSBs, and far less than those predicted by a model assuming identical mechanisms. Finally, we perform a fourth set of boundary segmentation experiments using artificial LTB stimuli comprised of differing proportions of white and black dots on opposite sides of the boundary. We find that these stimuli are also highly robust to masking by supra-threshold LSBs, consistent with our results using natural stimuli, and with our earlier studies using similar stimuli. Taken as a whole, these results suggest that the visual system contains mechanisms well suited to detecting surface boundaries that are robust to interference from luminance differences arising from luminance steps like those formed by cast shadows.
Collapse
Affiliation(s)
- Christopher DiMattina
- Computational Perception Laboratory, Fort Myers, FL, USA 33965-6565,Department of Psychology Florida Gulf Coast University, Fort Myers, FL, USA 33965-6565
| |
Collapse
|
3
|
DiMattina C, Baker CL. Modeling second-order boundary perception: A machine learning approach. PLoS Comput Biol 2019; 15:e1006829. [PMID: 30883556 PMCID: PMC6438569 DOI: 10.1371/journal.pcbi.1006829] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2018] [Revised: 03/28/2019] [Accepted: 01/15/2019] [Indexed: 11/18/2022] Open
Abstract
Visual pattern detection and discrimination are essential first steps for scene analysis. Numerous human psychophysical studies have modeled visual pattern detection and discrimination by estimating linear templates for classifying noisy stimuli defined by spatial variations in pixel intensities. However, such methods are poorly suited to understanding sensory processing mechanisms for complex visual stimuli such as second-order boundaries defined by spatial differences in contrast or texture. We introduce a novel machine learning framework for modeling human perception of second-order visual stimuli, using image-computable hierarchical neural network models fit directly to psychophysical trial data. This framework is applied to modeling visual processing of boundaries defined by differences in the contrast of a carrier texture pattern, in two different psychophysical tasks: (1) boundary orientation identification, and (2) fine orientation discrimination. Cross-validation analysis is employed to optimize model hyper-parameters, and demonstrate that these models are able to accurately predict human performance on novel stimulus sets not used for fitting model parameters. We find that, like the ideal observer, human observers take a region-based approach to the orientation identification task, while taking an edge-based approach to the fine orientation discrimination task. How observers integrate contrast modulation across orientation channels is investigated by fitting psychophysical data with two models representing competing hypotheses, revealing a preference for a model which combines multiple orientations at the earliest possible stage. Our results suggest that this machine learning approach has much potential to advance the study of second-order visual processing, and we outline future steps towards generalizing the method to modeling visual segmentation of natural texture boundaries. This study demonstrates how machine learning methodology can be fruitfully applied to psychophysical studies of second-order visual processing. Many naturally occurring visual boundaries are defined by spatial differences in features other than luminance, for example by differences in texture or contrast. Quantitative models of such “second-order” boundary perception cannot be estimated using the standard regression techniques (known as “classification images”) commonly applied to “first-order”, luminance-defined stimuli. Here we present a novel machine learning approach to modeling second-order boundary perception using hierarchical neural networks. In contrast to previous quantitative studies of second-order boundary perception, we directly estimate network model parameters using psychophysical trial data. We demonstrate that our method can reveal different spatial summation strategies that human observers utilize for different kinds of second-order boundary perception tasks, and can be used to compare competing hypotheses of how contrast modulation is integrated across orientation channels. We outline extensions of the methodology to other kinds of second-order boundaries, including those in natural images.
Collapse
Affiliation(s)
- Christopher DiMattina
- Computational Perception Laboratory, Department of Psychology, Florida Gulf Coast University, Fort Myers, Florida, United States of America
- * E-mail:
| | - Curtis L. Baker
- McGill Vision Research Unit, Department of Ophthalmology, McGill University, Montreal, Quebec, Canada
| |
Collapse
|
4
|
Georgeson MA, Schofield AJ. Binocular functional architecture for detection of contrast-modulated gratings. Vision Res 2016; 128:68-82. [PMID: 27664349 DOI: 10.1016/j.visres.2016.09.005] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2016] [Revised: 09/11/2016] [Accepted: 09/12/2016] [Indexed: 10/20/2022]
Abstract
Combination of signals from the two eyes is the gateway to stereo vision. To gain insight into binocular signal processing, we studied binocular summation for luminance-modulated gratings (L or LM) and contrast-modulated gratings (CM). We measured 2AFC detection thresholds for a signal grating (0.75c/deg, 216ms) shown to one eye, both eyes, or both eyes out-of-phase. For LM and CM, the carrier noise was in both eyes, even when the signal was monocular. Mean binocular thresholds for luminance gratings (L) were 5.4dB better than monocular thresholds - close to perfect linear summation (6dB). For LM and CM the binocular advantage was again 5-6dB, even when the carrier noise was uncorrelated, anti-correlated, or at orthogonal orientations in the two eyes. Binocular combination for CM probably arises from summation of envelope responses, and not from summation of these conflicting carrier patterns. Antiphase signals produced no binocular advantage, but thresholds were about 1-3dB higher than monocular ones. This is not consistent with simple linear summation, which should give complete cancellation and unmeasurably high thresholds. We propose a three-channel model in which noisy monocular responses to the envelope are binocularly combined in a contrast-weighted sum, but also remain separately available to perception via a max operator. Vision selects the largest of the three responses. With in-phase gratings the binocular channel dominates, but antiphase gratings cancel in the binocular channel and the monocular channels mediate detection. The small antiphase disadvantage might be explained by a subtle influence of background responses on binocular and monocular detection.
Collapse
Affiliation(s)
- Mark A Georgeson
- School of Life & Health Sciences, Aston University, Birmingham B4 7ET, UK.
| | | |
Collapse
|
5
|
Westrick ZM, Landy MS. Pooling of first-order inputs in second-order vision. Vision Res 2013; 91:108-17. [PMID: 23994031 DOI: 10.1016/j.visres.2013.08.005] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2013] [Revised: 07/22/2013] [Accepted: 08/12/2013] [Indexed: 11/28/2022]
Abstract
The processing of texture patterns has been characterized by a model that first filters the image to isolate one texture component, then applies a rectifying nonlinearity that converts texture variation into intensity variation, and finally processes the resulting pattern with mechanisms similar to those used in processing luminance-defined images (spatial-frequency- and orientation-tuned filters). This model, known as FRF for filter rectify filter, has the appeal of explaining sensitivity to second-order patterns in terms of mechanisms known to exist for processing first-order patterns. This model implies an unexpected interaction between the first and second stages of filtering; if the first-stage filter consists of narrowband mechanisms tuned to detect the carrier texture, then sensitivity to high-frequency texture modulations should be much lower than is observed in humans. We propose that the human visual system must pool over first-order channels tuned to a wide range of spatial frequencies and orientations to achieve texture demodulation, and provide psychophysical evidence for pooling in a cross-carrier adaptation experiment and in an experiment that measures modulation contrast sensitivity at very low first-order contrast.
Collapse
|
6
|
Zavitz E, Baker CL. Texture sparseness, but not local phase structure, impairs second-order segmentation. Vision Res 2013; 91:45-55. [PMID: 23942289 DOI: 10.1016/j.visres.2013.07.018] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2013] [Revised: 07/08/2013] [Accepted: 07/31/2013] [Indexed: 11/29/2022]
Abstract
Texture boundary segmentation is typically thought to reflect a comparison of differences in Fourier energy (i.e. low-order texture statistics) on either side of a boundary. However in a previous study (Arsenault, Yoonessi, & Baker, 2011) we showed that the distribution of energy within a natural texture (i.e. its higher-order statistical structure) also influences segmentation of contrast boundaries. Here we examine the influence of specific higher-order texture statistics on segmentation of contrast- and orientation-defined boundaries. Using naturalistic synthetic textures to manipulate the sparseness, global phase structure, and local phase alignments of carrier textures, we measure segmentation thresholds based on forced-choice judgments of boundary orientation. We find a similar pattern of results for both contrast and orientation boundaries: (1) randomizing all structure by globally phase scrambling the texture reduces segmentation thresholds substantially, (2) decreasing sparseness also reduces thresholds, and (3) removing local phase alignments has little or no effect on segmentation thresholds. We show that a two-stage filter model with an intermediate compressive nonlinearity and expansive output nonlinearity can account for these data using synthetic textures. Furthermore, the model parameter fits obtained using synthetic textures also predict the segmentation thresholds presented in Arsenault, Yoonessi, and Baker (2011) for natural and phase-scrambled natural texture carriers.
Collapse
Affiliation(s)
- Elizabeth Zavitz
- McGill Vision Research, Department of Ophthalmology, McGill University, Montreal, Quebec, Canada; Department of Physiology, Monash University, Clayton, Victoria, Australia.
| | | |
Collapse
|
7
|
Westrick ZM, Henry CA, Landy MS. Inconsistent channel bandwidth estimates suggest winner-take-all nonlinearity in second-order vision. Vision Res 2013; 81:58-68. [PMID: 23416867 DOI: 10.1016/j.visres.2013.01.010] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2012] [Revised: 01/23/2013] [Accepted: 01/29/2013] [Indexed: 10/27/2022]
Abstract
The processing of texture patterns has been characterized by a model that postulates a first-stage linear filter to highlight a component texture, a pointwise rectification stage to convert contrast for the highlighted texture into mean response strength, followed by a second-stage linear filter to detect the texture-defined pattern. We estimated the spatial-frequency bandwidth of the second-stage filter mediating orientation discrimination of orientation-modulated second-order gratings by measuring threshold elevation in the presence of filtered noise added to the modulation signal. This experiment yielded no evidence for frequency tuning. A second experiment, in which subjects had to detect similar second-order gratings while judging their modulation frequency, produced bandwidth estimates of 1-1.5 octaves, similar to estimated bandwidths of first-order channels. We propose that an additional dominant-response-selection nonlinearity can account for these apparently contradictory results.
Collapse
|
8
|
Baker DH, Meese TS, Georgeson MA. Paradoxical psychometric functions ("swan functions") are explained by dilution masking in four stimulus dimensions. Iperception 2013; 4:17-35. [PMID: 23799185 PMCID: PMC3690413 DOI: 10.1068/i0552] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2012] [Revised: 11/27/2012] [Indexed: 11/30/2022] Open
Abstract
The visual system dissects the retinal image into millions of local analyses along numerous visual dimensions. However, our perceptions of the world are not fragmentary, so further processes must be involved in stitching it all back together. Simply summing up the responses would not work because this would convey an increase in image contrast with an increase in the number of mechanisms stimulated. Here, we consider a generic model of signal combination and counter-suppression designed to address this problem. The model is derived and tested for simple stimulus pairings (e.g. A + B), but is readily extended over multiple analysers. The model can account for nonlinear contrast transduction, dilution masking, and signal combination at threshold and above. It also predicts nonmonotonic psychometric functions where sensitivity to signal A in the presence of pedestal B first declines with increasing signal strength (paradoxically dropping below 50% correct in two-interval forced choice), but then rises back up again, producing a contour that follows the wings and neck of a swan. We looked for and found these "swan" functions in four different stimulus dimensions (ocularity, space, orientation, and time), providing some support for our proposal.
Collapse
Affiliation(s)
- Daniel H. Baker
- School of Life and Health Sciences, Aston University, Aston Triangle, Birmingham B4 7ET, UK; e-mail:
| | - Tim S. Meese
- School of Life and Health Sciences, Aston University, Aston Triangle, Birmingham B4 7ET, UK; e-mail:
| | - Mark A. Georgeson
- School of Life and Health Sciences, Aston University, Aston Triangle, Birmingham, B4 7ET, UK; e-mail:
| |
Collapse
|
9
|
Meese TS, Baker DH. A common rule for integration and suppression of luminance contrast across eyes, space, time, and pattern. Iperception 2013; 4:1-16. [PMID: 23799184 PMCID: PMC3690412 DOI: 10.1068/i0556] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2012] [Revised: 11/30/2012] [Indexed: 10/27/2022] Open
Abstract
Visual perception begins by dissecting the retinal image into millions of small patches for local analyses by local receptive fields. However, image structures extend well beyond these receptive fields and so further processes must be involved in sewing the image fragments back together to derive representations of higher order (more global) structures. To investigate the integration process, we also need to understand the opposite process of suppression. To investigate both processes together, we measured triplets of dipper functions for targets and pedestals involving interdigitated stimulus pairs (A, B). Previous work has shown that summation and suppression operate over the full contrast range for the domains of ocularity and space. Here, we extend that work to include orientation and time domains. Temporal stimuli were 15-Hz counter-phase sine-wave gratings, where A and B were the positive and negative phases of the oscillation, respectively. For orientation, we used orthogonally oriented contrast patches (A, B) whose sum was an isotropic difference of Gaussians. Results from all four domains could be understood within a common framework in which summation operates separately within the numerator and denominator of a contrast gain control equation. This simple arrangement of summation and counter-suppression achieves integration of various stimulus attributes without distorting the underlying contrast code.
Collapse
Affiliation(s)
- Tim S Meese
- School of Life and Health Sciences, Aston University, Aston Triangle, Birmingham B4 7ET, UK; e-mail:
| | | |
Collapse
|
10
|
Vancleef K, Putzeys T, Gheorghiu E, Sassi M, Machilsen B, Wagemans J. Spatial arrangement in texture discrimination and texture segregation. Iperception 2013; 4:36-52. [PMID: 23799186 PMCID: PMC3690414 DOI: 10.1068/i0515] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2012] [Revised: 12/17/2012] [Indexed: 11/10/2022] Open
Abstract
We investigated the role of spatial arrangement of texture elements in three psychophysical experiments on texture discrimination and texture segregation. In our stimuli, oriented Gabor elements formed an iso-oriented and a randomly oriented texture region. We manipulated (1) the orientation similarity in the iso-oriented region by adding orientation jitter to the orientation of each Gabor; (2) the spatial arrangement of the Gabors: quasi-random or regular; and (3) the shape of the edge between the two texture regions: straight or curved. In Experiment 1, participants discriminated an iso-oriented stimulus from a stimulus with only randomly oriented elements. Experiment 2 required texture segregation to judge the shape of the texture edge. Experiment 3 replicated Experiment 2 with Gabors of a smaller spatial extent in a denser arrangement. We found comparable performance levels with regular and quasi-random Gabor positions in the discrimination task but not in the segregation tasks. We conclude that spatial arrangement plays a role in a texture segregation task requiring shape discrimination of the texture edge but not in a texture discrimination task in which it is sufficient to discriminate an iso-oriented region from a completely random region.
Collapse
Affiliation(s)
- Kathleen Vancleef
- Laboratory of Experimental Psychology, University of Leuven (KU Leuven), Tiensestraat 102, Box 3711, BE-3000 Leuven, Belgium; e-mail:
| | | | | | | | | | | |
Collapse
|
11
|
Reynaud A, Hess RF. Properties of spatial channels underlying the detection of orientation-modulations. Exp Brain Res 2012; 220:135-45. [PMID: 22623098 DOI: 10.1007/s00221-012-3124-6] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2011] [Accepted: 05/04/2012] [Indexed: 11/28/2022]
Affiliation(s)
- Alexandre Reynaud
- Department of Ophthalmology, McGill Vision Research, McGill University, Montreal, PQ, Canada.
| | | |
Collapse
|
12
|
Meso AI, Hess RF. Orientation gradient detection exhibits variable coupling between first- and second-stage filtering mechanisms. JOURNAL OF THE OPTICAL SOCIETY OF AMERICA. A, OPTICS, IMAGE SCIENCE, AND VISION 2011; 28:1721-1731. [PMID: 21811335 DOI: 10.1364/josaa.28.001721] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
We investigated sensitivity to orientation modulation using visual stimuli with bandpass filtered noise carriers. We characterized the relationship between the spatial parameters of the modulator and the carrier using a 2-AFC detection task. The relationship between these two parameters is potentially informative of the underlying coupling between first- and second-stage filtering mechanisms, which, in turn, may bear on the interrelationship between striate and extrastriate cortical processing. Our previous experiments on analogous motion stimuli found an optimum sensitivity when the ratio of the carrier and modulator spatial frequency parameters (r) was approximately ten. The current results do not exhibit an optimum sensitivity at a given value of the ratio r. Previous experiments involving second-order modulation sensitivity show an inconsistent range of estimates of optimum sensitivity at values of r between 5 and 50. Our results, using a complementary approach, confirm these discrepancies, demonstrating that the coupling between carrier and modulator frequency parameters depends on a number of stimulus-specific factors, such as contrast sensitivity, stimulus eccentricity, and absolute values of the carrier and modulator spatial frequency parameters. We show that these observations are true for a stimulus limited in eccentricity and that this orientation-modulated stimulus does not exhibit scale invariance. Such processing can not be modeled by a generic filter-rectify-filter model.
Collapse
Affiliation(s)
- Andrew Isaac Meso
- McGill Vision Research, Department of Ophthalmology, McGill University, 687 Pine Avenue West Rm H4-14, Montreal QC H3A1A1, Canada.
| | | |
Collapse
|
13
|
Graham NV. Beyond multiple pattern analyzers modeled as linear filters (as classical V1 simple cells): useful additions of the last 25 years. Vision Res 2011; 51:1397-430. [PMID: 21329718 DOI: 10.1016/j.visres.2011.02.007] [Citation(s) in RCA: 57] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2010] [Revised: 02/07/2011] [Accepted: 02/09/2011] [Indexed: 11/28/2022]
Abstract
This review briefly discusses processes that have been suggested in the last 25 years as important to the intermediate stages of visual processing of patterns. Five categories of processes are presented: (1) Higher-order processes including FRF structures; (2) Divisive contrast nonlinearities including contrast normalization; (3) Subtractive contrast nonlinearities including contrast comparison; (4) Non-classical receptive fields (surround suppression, cross-orientation inhibition); (5) Contour integration.
Collapse
Affiliation(s)
- Norma V Graham
- Department of Psychology, Columbia University, NY, NY 10027, USA.
| |
Collapse
|
14
|
Prins N. Texture modulation detection by probability summation among orientation-selective and isotropic mechanisms. Vision Res 2008; 48:2751-66. [PMID: 18831985 DOI: 10.1016/j.visres.2008.09.005] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2008] [Revised: 09/06/2008] [Accepted: 09/08/2008] [Indexed: 10/21/2022]
Abstract
Substantial evidence has accumulated for the notion that modulations of second-order properties in the visual scene are processed by mechanisms which detect contrast variations within narrow orientation/spatial frequency channels. It has also been suggested that mechanisms exist which detect contrast modulations across all orientations. Many naturally occurring texture variations (e.g., modulations in orientation and/or spatial frequency) involve simultaneous contrast modulations in multiple channels. Contrasting conclusions have been drawn regarding the manner in which the information carried in multiple channels is combined. In a series of two experiments it is shown that simultaneous contrast modulations in two narrow orientation bands are detected by three mechanisms, two of which detect contrast modulations within the modulated bands only, the third of which integrates contrast across orientations in order to detect modulations of overall contrast. The three mechanisms combine their efforts by probability summation.
Collapse
Affiliation(s)
- Nicolaas Prins
- Department of Psychology, University of Mississippi, Peabody Building, P.O. Box 1848, University, MS, 38677, USA.
| |
Collapse
|
15
|
Abstract
The initial image-processing stages of visual cortex are well suited to a local (patchwise) analysis of the viewed scene. But the world's structures extend over space as textures and surfaces, suggesting the need for spatial integration. Most models of contrast vision fall shy of this process because (i) the weak area summation at detection threshold is attributed to probability summation (PS) and (ii) there is little or no advantage of area well above threshold. Both of these views are challenged here. First, it is shown that results at threshold are consistent with linear summation of contrast following retinal inhomogeneity, spatial filtering, nonlinear contrast transduction and multiple sources of additive Gaussian noise. We suggest that the suprathreshold loss of the area advantage in previous studies is due to a concomitant increase in suppression from the pedestal. To overcome this confound, a novel stimulus class is designed where: (i) the observer operates on a constant retinal area, (ii) the target area is controlled within this summation field, and (iii) the pedestal is fixed in size. Using this arrangement, substantial summation is found along the entire masking function, including the region of facilitation. Our analysis shows that PS and uncertainty cannot account for the results, and that suprathreshold summation of contrast extends over at least seven target cycles of grating.
Collapse
Affiliation(s)
- Tim S Meese
- School of Life and Health Sciences, Aston University, Birmingham B47ET, UK.
| | | |
Collapse
|
16
|
Motoyoshi I, Kingdom FAA. Differential roles of contrast polarity reveal two streams of second-order visual processing. Vision Res 2007; 47:2047-54. [PMID: 17555787 DOI: 10.1016/j.visres.2007.03.015] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2007] [Accepted: 03/19/2007] [Indexed: 10/23/2022]
Abstract
Humans can easily segregate texture regions based on differences in contrast, orientation, and contrast polarity. It has been suggested that these abilities can be inclusively modeled by 2nd-order visual mechanisms that detect changes in the half-wave rectified outputs of orientation-selective filters. Using a subthreshold-summation paradigm, however, we show that modulations of contrast polarity are detected by mechanisms that pool signals of different orientations while modulations of orientation are detected by mechanisms that pool signals of different contrast polarities. The results support the existence of two streams of 2nd-order processing, one that receives the full-wave rectified inputs from oriented filters, the other separate half-wave rectified outputs from on-center and off-center filters pooled across all orientations. The two-stream model is shown to predict the perceptual effects of changes to the skewness statistics of natural-image textures, and to solve a contradiction among previous data concerning the detection of contrast modulation.
Collapse
Affiliation(s)
- Isamu Motoyoshi
- Human and Information Science Laboratory, NTT Communication Science Laboratories, NTT 3-1 Morinosato-Wakamiya, Atsugi 243-0198, Japan.
| | | |
Collapse
|