1
|
Vacher J, Launay C, Mamassian P, Coen-Cagli R. Measuring uncertainty in human visual segmentation. ARXIV 2023:arXiv:2301.07807v3. [PMID: 36824425 PMCID: PMC9949179] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 02/25/2023]
Abstract
Segmenting visual stimuli into distinct groups of features and visual objects is central to visual function. Classical psychophysical methods have helped uncover many rules of human perceptual segmentation, and recent progress in machine learning has produced successful algorithms. Yet, the computational logic of human segmentation remains unclear, partially because we lack well-controlled paradigms to measure perceptual segmentation maps and compare models quantitatively. Here we propose a new, integrated approach: given an image, we measure multiple pixel-based same-different judgments and perform model-based reconstruction of the underlying segmentation map. The reconstruction is robust to several experimental manipulations and captures the variability of individual participants. We demonstrate the validity of the approach on human segmentation of natural images and composite textures. We show that image uncertainty affects measured human variability, and it influences how participants weigh different visual features. Because any putative segmentation algorithm can be inserted to perform the reconstruction, our paradigm affords quantitative tests of theories of perception as well as new benchmarks for segmentation algorithms.
Collapse
Affiliation(s)
- Jonathan Vacher
- Laboratoire des systèmes perceptifs, Département d’études cognitives, École normale supérieure, PSL University, CNRS, Paris, France
| | - Claire Launay
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, Bronx, NY, USA
| | - Pascal Mamassian
- Laboratoire des systèmes perceptifs, Département d’études cognitives, École normale supérieure, PSL University, CNRS, Paris, France
| | - Ruben Coen-Cagli
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, Bronx, NY, USA
- Dominick P. Purpura Department of Neuroscience, Albert Einstein College of Medicine, Bronx, NY, USA
- Department. of Ophthalmology and Visual Sciences, Albert Einstein College of Medicine, Bronx, NY, USA
| |
Collapse
|
2
|
Vacher J, Launay C, Mamassian P, Coen-Cagli R. Measuring uncertainty in human visual segmentation. PLoS Comput Biol 2023; 19:e1011483. [PMID: 37747914 PMCID: PMC10553811 DOI: 10.1371/journal.pcbi.1011483] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2023] [Revised: 10/05/2023] [Accepted: 08/31/2023] [Indexed: 09/27/2023] Open
Abstract
Segmenting visual stimuli into distinct groups of features and visual objects is central to visual function. Classical psychophysical methods have helped uncover many rules of human perceptual segmentation, and recent progress in machine learning has produced successful algorithms. Yet, the computational logic of human segmentation remains unclear, partially because we lack well-controlled paradigms to measure perceptual segmentation maps and compare models quantitatively. Here we propose a new, integrated approach: given an image, we measure multiple pixel-based same-different judgments and perform model-based reconstruction of the underlying segmentation map. The reconstruction is robust to several experimental manipulations and captures the variability of individual participants. We demonstrate the validity of the approach on human segmentation of natural images and composite textures. We show that image uncertainty affects measured human variability, and it influences how participants weigh different visual features. Because any putative segmentation algorithm can be inserted to perform the reconstruction, our paradigm affords quantitative tests of theories of perception as well as new benchmarks for segmentation algorithms.
Collapse
Affiliation(s)
- Jonathan Vacher
- Laboratoire des systèmes perceptifs, Département d’études cognitives, École normale supérieure, PSL University, CNRS, Paris, France
| | - Claire Launay
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, Bronx, New-York, United States of America
| | - Pascal Mamassian
- Laboratoire des systèmes perceptifs, Département d’études cognitives, École normale supérieure, PSL University, CNRS, Paris, France
| | - Ruben Coen-Cagli
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, Bronx, New-York, United States of America
- Dominick P. Purpura Department of Neuroscience, Albert Einstein College of Medicine, Bronx, New-York, United States of America
- Department of Ophthalmology and Visual Sciences, Albert Einstein College of Medicine, Bronx, New-York, United States of America
| |
Collapse
|
3
|
Barzegaran E, Norcia AM. Neural sources of letter and Vernier acuity. Sci Rep 2020; 10:15449. [PMID: 32963270 PMCID: PMC7509830 DOI: 10.1038/s41598-020-72370-3] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2019] [Accepted: 09/01/2020] [Indexed: 01/23/2023] Open
Abstract
Visual acuity can be measured in many different ways, including with letters and Vernier offsets. Prior psychophysical work has suggested that the two acuities are strongly linked given that they both depend strongly on retinal eccentricity and both are similarly affected in amblyopia. Here we used high-density EEG recordings to ask whether the underlying neural sources are common as suggested by the psychophysics or distinct. To measure visual acuity for letters, we recorded evoked potentials to 3 Hz alternations between intact and scrambled text comprised of letters of varying size. To measure visual acuity for Vernier offsets, we recorded evoked potentials to 3 Hz alternations between bar gratings with and without a set of Vernier offsets. Both alternation types elicited robust activity at the 3 Hz stimulus frequency that scaled in amplitude with both letter and offset size, starting near threshold. Letter and Vernier offset responses differed in both their scalp topography and temporal dynamics. The earliest evoked responses to letters occurred on lateral occipital visual areas, predominantly over the left hemisphere. Later responses were measured at electrodes over early visual cortex, suggesting that letter structure is first extracted in second-tier extra-striate areas and that responses over early visual areas are due to feedback. Responses to Vernier offsets, by contrast, occurred first at medial occipital electrodes, with responses at later time-points being more broadly distributed—consistent with feedforward pathway mediation. The previously observed commonalities between letter and Vernier acuity may be due to common bottlenecks in early visual cortex but not because the two tasks are subserved by a common network of visual areas.
Collapse
Affiliation(s)
- Elham Barzegaran
- Wu Tsai Neurosciences Institute, 290 Jane Stanford Way, Stanford, CA, 94305, USA.
| | - Anthony M Norcia
- Wu Tsai Neurosciences Institute, 290 Jane Stanford Way, Stanford, CA, 94305, USA.
| |
Collapse
|
4
|
Montani V, Chanoine V, Stoianov IP, Grainger J, Ziegler JC. Steady state visual evoked potentials in reading aloud: Effects of lexicality, frequency and orthographic familiarity. BRAIN AND LANGUAGE 2019; 192:1-14. [PMID: 30826643 DOI: 10.1016/j.bandl.2019.01.004] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/24/2017] [Revised: 07/16/2018] [Accepted: 01/24/2019] [Indexed: 06/09/2023]
Abstract
The present study explored the possibility to use Steady-State Visual Evoked Potentials (SSVEPs) as a tool to investigate the core mechanisms in visual word recognition. In particular, we investigated three benchmark effects of reading aloud: lexicality (words vs. pseudowords), frequency (high-frequency vs. low-frequency words), and orthographic familiarity ('familiar' versus 'unfamiliar' pseudowords). We found that words and pseudowords elicited robust SSVEPs. Words showed larger SSVEPs than pseudowords and high-frequency words showed larger SSVEPs than low-frequency words. SSVEPs were not sensitive to orthographic familiarity. We further localized the neural generators of the SSVEP effects. The lexicality effect was located in areas associated with early level of visual processing, i.e. in the right occipital lobe and in the right precuneus. Pseudowords produced more activation than words in left sensorimotor areas, rolandic operculum, insula, supramarginal gyrus and in the right temporal gyrus. These areas are devoted to speech processing and/or spelling-to-sound conversion. The frequency effect involved the left temporal pole and orbitofrontal cortex, areas previously implicated in semantic processing and stimulus-response associations respectively, and the right postcentral and parietal inferior gyri, possibly indicating the involvement of the right attentional network.
Collapse
Affiliation(s)
- Veronica Montani
- Aix-Marseille University and CNRS, Brain and Language Research Institute, 3 Place Victor Hugo, 13331 Marseille Cedex 3, France.
| | - Valerie Chanoine
- Aix-Marseille University, Institute of Language, Communication and the Brain, Brain and Language Research Institute, 13100 Aix-en-Provence, France
| | - Ivilin Peev Stoianov
- Aix-Marseille University and CNRS, LPC, 3 Place Victor Hugo, 13331 Marseille Cedex 3, France; Institute of Cognitive Sciences and Technologies, CNR, Via Martiri della Libertà 2, 35137 Padova, Italy
| | - Jonathan Grainger
- Aix-Marseille University and CNRS, LPC, 3 Place Victor Hugo, 13331 Marseille Cedex 3, France
| | - Johannes C Ziegler
- Aix-Marseille University and CNRS, LPC, 3 Place Victor Hugo, 13331 Marseille Cedex 3, France
| |
Collapse
|
5
|
Kohler PJ, Cottereau BR, Norcia AM. Dynamics of perceptual decisions about symmetry in visual cortex. Neuroimage 2017; 167:316-330. [PMID: 29175495 DOI: 10.1016/j.neuroimage.2017.11.051] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2017] [Revised: 11/14/2017] [Accepted: 11/22/2017] [Indexed: 11/26/2022] Open
Abstract
Neuroimaging studies have identified multiple extra-striate visual areas that are sensitive to symmetry in planar images (Kohler et al., 2016; Sasaki et al., 2005). Here, we investigated which of these areas are directly involved in perceptual decisions about symmetry, by recording high-density EEG in participants (n = 25) who made rapid judgments about whether an exemplar image contained rotation symmetry or not. Stimulus-locked sensor-level analysis revealed symmetry-specific activity that increased with increasing order of rotation symmetry. Response-locked analysis identified activity occurring between 600 and 200 ms before the button-press, that was directly related to perceptual decision making. We then used fMRI-informed EEG source imaging to characterize the dynamics of symmetry-specific activity within an extended network of areas in visual cortex. The most consistent cortical source of the stimulus-locked activity was VO1, a topographically organized area in ventral visual cortex, that was highly sensitive to symmetry in a previous study (Kohler et al., 2016). Importantly, VO1 activity also contained a strong decision-related component, suggesting that this area plays a crucial role in perceptual decisions about symmetry. Other candidate areas, such as lateral occipital cortex, had weak stimulus-locked symmetry responses and no evidence of correlation with response timing.
Collapse
Affiliation(s)
- Peter J Kohler
- Department of Psychology, Stanford University, Jordan Hall, Building 420, 450 Serra Mall, Stanford, CA 94305, United States.
| | - Benoit R Cottereau
- Université de Toulouse, Centre de Recherche Cerveau et Cognition, Toulouse, France; Centre National de la Recherche Scientifique, Toulouse Cedex, France
| | - Anthony M Norcia
- Department of Psychology, Stanford University, Jordan Hall, Building 420, 450 Serra Mall, Stanford, CA 94305, United States
| |
Collapse
|
6
|
Temporal Asymmetry in Dark-Bright Processing Initiates Propagating Activity across Primary Visual Cortex. J Neurosci 2016; 36:1902-13. [PMID: 26865614 DOI: 10.1523/jneurosci.3235-15.2016] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
UNLABELLED Differences between visual pathways representing darks and lights have been shown to affect spatial resolution and detection timing. Both psychophysical and physiological studies suggest an underlying retinal origin with amplification in primary visual cortex (V1). Here we show that temporal asymmetries in the processing of darks and lights create motion in terms of propagating activity across V1. Exploiting the high spatiotemporal resolution of voltage-sensitive dye imaging, we captured population responses to abrupt local changes of luminance in cat V1. For stimulation we used two neighboring small squares presented on either bright or dark backgrounds. When a single square changed from dark to bright or vice versa, we found coherent population activity emerging at the respective retinal input locations. However, faster rising and decay times were obtained for the bright to dark than the dark to bright changes. When the two squares changed luminance simultaneously in opposite polarities, we detected a propagating wave front of activity that originated at the cortical location representing the darkened square and rapidly expanded toward the region representing the brightened location. Thus, simultaneous input led to sequential activation across cortical retinotopy. Importantly, this effect was independent of the squares' contrast with the background. We suggest imbalance in dark-bright processing as a driving force in the generation of wave-like activity. Such propagation may convey motion signals and influence perception of shape whenever abrupt shifts in visual objects or gaze cause counterchange of luminance at high-contrast borders. SIGNIFICANCE STATEMENT An elementary process in vision is the detection of darks and lights through the retina via ON and OFF channels. Psychophysical and physiological studies suggest that differences between these channels affect spatial resolution and detection thresholds. Here we show that temporal asymmetries in the processing of darks and lights create motion signals across visual cortex. Using two neighboring squares, which simultaneously counterchanged luminance, we discovered propagating activity that was strictly drawn out from cortical regions representing the darkened location. Thus, a synchronous stimulus event translated into sequential wave-like brain activation. Such propagation may convey motion signals accessible in higher brain areas, whenever abrupt shifts in visual objects or gaze cause counterchange of luminance at high-contrast borders.
Collapse
|
7
|
Schmidt F, Vancleef K. Response priming evidence for feedforward processing of snake contours but not of ladder contours and textures. Vision Res 2015; 126:174-182. [PMID: 25771400 DOI: 10.1016/j.visres.2015.03.002] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2014] [Revised: 03/02/2015] [Accepted: 03/04/2015] [Indexed: 11/24/2022]
Abstract
In contour integration, increased difficulty in detection and shape discrimination of a chain of parallel elements (a ladder contour) compared to collinear elements (a snake contour) suggests more extensive processing of ladders than of snakes. In addition, conceptual similarities between ladders and textures - which also involve grouping of parallel elements - raises the question whether ladder and texture processing requires feedback from higher visual areas while snakes are processed in a fast feedforward sweep. We tested this in a response priming paradigm, where participants responded as quickly and accurately as possible to the orientation of a diagonal contour in a Gabor array (target). The diagonal was defined either by a snake, ladder, texture, or a continuous line. The target was preceded with varying stimulus onset asynchrony (SOA) by a prime that was either a snake, ladder, or texture, and was consistent or inconsistent to the response demands of the target. Resulting priming effects clearly distinguished between processing of snakes, ladders, and textures. Effects generally increased with SOA but were stronger for snakes and textures compared to ladders. Importantly, only priming effects for snakes were fully present already in the fastest response times, in accordance with a simple feedforward processing model. We conclude that snakes, ladders, and textures do not share similar processing characteristics, with snakes exhibiting a pronounced processing advantage.
Collapse
Affiliation(s)
- Filipp Schmidt
- University of Kaiserslautern, Experimental Psychology, Erwin-Schrödinger-Str. Geb. 57, 67663 Kaiserslautern, Germany; Justus-Liebig-University Giessen, General Psychology, Otto-Behaghel-Str. 10F, 35394 Giessen, Germany.
| | - Kathleen Vancleef
- University of Leuven (KU Leuven), Laboratory of Experimental Psychology, Tiensestraat 102 - Box 1711, 3000 Leuven, Belgium; Newcastle University, Institute of Neuroscience, Framlington Place, Newcastle-upon-Tyne NE2 4HH, United Kingdom.
| |
Collapse
|
8
|
Norcia AM, Appelbaum LG, Ales JM, Cottereau BR, Rossion B. The steady-state visual evoked potential in vision research: A review. J Vis 2015; 15:4. [PMID: 26024451 PMCID: PMC4581566 DOI: 10.1167/15.6.4] [Citation(s) in RCA: 568] [Impact Index Per Article: 63.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2014] [Accepted: 01/05/2015] [Indexed: 02/07/2023] Open
Abstract
Periodic visual stimulation and analysis of the resulting steady-state visual evoked potentials were first introduced over 80 years ago as a means to study visual sensation and perception. From the first single-channel recording of responses to modulated light to the present use of sophisticated digital displays composed of complex visual stimuli and high-density recording arrays, steady-state methods have been applied in a broad range of scientific and applied settings.The purpose of this article is to describe the fundamental stimulation paradigms for steady-state visual evoked potentials and to illustrate these principles through research findings across a range of applications in vision science.
Collapse
|
9
|
Abstract
AbstractThe dissociation of a figure from its background is an essential feat of visual perception, as it allows us to detect, recognize, and interact with shapes and objects in our environment. In order to understand how the human brain gives rise to the perception of figures, we here review experiments that explore the links between activity in visual cortex and performance of perceptual tasks related to figure perception. We organize our review according to a proposed model that attempts to contextualize figure processing within the more general framework of object processing in the brain. Overall, the current literature provides us with individual linking hypotheses as to cortical regions that are necessary for particular tasks related to figure perception. Attempts to reach a more complete understanding of how the brain instantiates figure and object perception, however, will have to consider the temporal interaction between the many regions involved, the details of which may vary widely across different tasks.
Collapse
|
10
|
Vancleef K, Wagemans J, Humphreys GW. Impaired texture segregation but spared contour integration following damage to right posterior parietal cortex. Exp Brain Res 2013; 230:41-57. [PMID: 23831849 DOI: 10.1007/s00221-013-3629-7] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2013] [Accepted: 06/18/2013] [Indexed: 11/27/2022]
Abstract
We examined the relations between texture segregation and contour integration in patients with deficits in spatial attention leading to left or right hemisphere extinction. Patients and control participants were presented with texture and contour stimuli consisting of oriented elements. We induced regularity in the stimuli by manipulating the element orientations resulting in an implicit texture border or explicit contour. Participants had to discriminate curved from straight shapes without making eye movements, while the stimulus presentation time was varied using a QUEST procedure. The results showed that only patients with right hemisphere extinction had a spatial bias, needing a longer presentation time to determine the shape of the border or contour on the contralesional side, especially for borders defined by texture. These results indicate that texture segregation is modulated by attention-related brain areas in the right posterior parietal cortex.
Collapse
Affiliation(s)
- Kathleen Vancleef
- Laboratory of Experimental Psychology, University of Leuven, Tiensestraat 102, Box 3711, 3000 Leuven, Belgium.
| | | | | |
Collapse
|
11
|
Ales JM, Appelbaum LG, Cottereau BR, Norcia AM. The time course of shape discrimination in the human brain. Neuroimage 2012; 67:77-88. [PMID: 23116814 DOI: 10.1016/j.neuroimage.2012.10.044] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2012] [Revised: 09/11/2012] [Accepted: 10/20/2012] [Indexed: 10/27/2022] Open
Abstract
The lateral occipital cortex (LOC) activates selectively to images of intact objects versus scrambled controls, is selective for the figure-ground relationship of a scene, and exhibits at least some degree of invariance for size and position. Because of these attributes, it is considered to be a crucial part of the object recognition pathway. Here we show that human LOC is critically involved in perceptual decisions about object shape. High-density EEG was recorded while subjects performed a threshold-level shape discrimination task on texture-defined figures segmented by either phase or orientation cues. The appearance or disappearance of a figure region from a uniform background generated robust visual evoked potentials throughout retinotopic cortex as determined by inverse modeling of the scalp voltage distribution. Contrasting responses from trials containing shape changes that were correctly detected (hits) with trials in which no change occurred (correct rejects) revealed stimulus-locked, target-selective activity in the occipital visual areas LOC and V4 preceding the subject's response. Activity that was locked to the subjects' reaction time was present in the LOC. Response-locked activity in the LOC was determined to be related to shape discrimination for several reasons: shape-selective responses were silenced when subjects viewed identical stimuli but their attention was directed away from the shapes to a demanding letter discrimination task; shape-selectivity was present across four different stimulus configurations used to define the figure; LOC responses correlated with participants' reaction times. These results indicate that decision-related activity is present in the LOC when subjects are engaged in threshold-level shape discriminations.
Collapse
Affiliation(s)
- Justin M Ales
- Department of Psychology, Stanford University, Stanford, CA 94305, USA.
| | | | | | | |
Collapse
|