1
|
Herzog MH. The Irreducibility of Vision: Gestalt, Crowding and the Fundamentals of Vision. Vision (Basel) 2022; 6:vision6020035. [PMID: 35737422 PMCID: PMC9228288 DOI: 10.3390/vision6020035] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2022] [Revised: 05/25/2022] [Accepted: 05/31/2022] [Indexed: 11/16/2022] Open
Abstract
What is fundamental in vision has been discussed for millennia. For philosophical realists and the physiological approach to vision, the objects of the outer world are truly given, and failures to perceive objects properly, such as in illusions, are just sporadic misperceptions. The goal is to replace the subjectivity of the mind by careful physiological analyses. Continental philosophy and the Gestaltists are rather skeptical or ignorant about external objects. The percepts themselves are their starting point, because it is hard to deny the truth of one own′s percepts. I will show that, whereas both approaches can well explain many visual phenomena with classic visual stimuli, they both have trouble when stimuli become slightly more complex. I suggest that these failures have a deeper conceptual reason, namely that their foundations (objects, percepts) do not hold true. I propose that only physical states exist in a mind independent manner and that everyday objects, such as bottles and trees, are perceived in a mind-dependent way. The fundamental processing units to process objects are extended windows of unconscious processing, followed by short, discrete conscious percepts.
Collapse
Affiliation(s)
- Michael H Herzog
- Laboratory of Psychophysics, Brain Mind Institute, École Polytechnique Fédérale de Lausanne (EPFL), 1015 Lausanne, Switzerland
| |
Collapse
|
2
|
Abstract
Reading requires the correct identification of letters and letter positions within words. Selective attention is, therefore, required to select chunks of the text for sequential processing. Despite the extensive literature on visual attention, the well-known effects of spatial cues in simple perceptual tasks cannot inform us about the role of attention in a task as complex as reading. Here, we systematically manipulate spatial attention in a multi-letter processing task to understand the effects of spatial cues on letter encoding in typical adults. Overall, endogenous (voluntary) cue benefits were larger than exogenous (reflexive). We show that cue benefits are greater in the left than in the right visual field and larger for the most crowded letter positions. Endogenous valid cues reduced errors due to confusing letter positions more than misidentifications, specifically for the most crowded letter positions. Therefore, shifting endogenous attention along a line of text is likely an important mechanism to alleviate the effects of crowding on encoding letters within words. Our results help set the premise for constructing theories about how specific mechanisms of attention support reading development in children. Understanding the link between reading development and attention mechanisms has far-reaching implications for effectively addressing the needs of children with reading disabilities.
Collapse
|
3
|
Theiss JD, Bowen JD, Silver MA. Spatial Attention Enhances Crowded Stimulus Encoding Across Modeled Receptive Fields by Increasing Redundancy of Feature Representations. Neural Comput 2021; 34:190-218. [PMID: 34710898 PMCID: PMC8693207 DOI: 10.1162/neco_a_01447] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2021] [Accepted: 07/01/2021] [Indexed: 11/04/2022]
Abstract
Any visual system, biological or artificial, must make a trade-off between the number of units used to represent the visual environment and the spatial resolution of the sampling array. Humans and some other animals are able to allocate attention to spatial locations to reconfigure the sampling array of receptive fields (RFs), thereby enhancing the spatial resolution of representations without changing the overall number of sampling units. Here, we examine how representations of visual features in a fully convolutional neural network interact and interfere with each other in an eccentricity-dependent RF pooling array and how these interactions are influenced by dynamic changes in spatial resolution across the array. We study these feature interactions within the framework of visual crowding, a well-characterized perceptual phenomenon in which target objects in the visual periphery that are easily identified in isolation are much more difficult to identify when flanked by similar nearby objects. By separately simulating effects of spatial attention on RF size and on the density of the pooling array, we demonstrate that the increase in RF density due to attention is more beneficial than changes in RF size for enhancing target classification for crowded stimuli. Furthermore, by varying target/flanker spacing, as well as the spatial extent of attention, we find that feature redundancy across RFs has more influence on target classification than the fidelity of the feature representations themselves. Based on these findings, we propose a candidate mechanism by which spatial attention relieves visual crowding through enhanced feature redundancy that is mostly due to increased RF density.
Collapse
Affiliation(s)
| | - Joel D Bowen
- University of California, Berkeley, CA 94720, U.S.A.
| | | |
Collapse
|
4
|
Rummens K, Sayim B. Broad attention uncovers benefits of stimulus uniformity in visual crowding. Sci Rep 2021; 11:23976. [PMID: 34907221 PMCID: PMC8671468 DOI: 10.1038/s41598-021-03258-z] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2021] [Accepted: 12/01/2021] [Indexed: 11/08/2022] Open
Abstract
Crowding is the interference by surrounding objects (flankers) with target perception. Low target-flanker similarity usually yields weaker crowding than high similarity ('similarity rule') with less interference, e.g., by opposite- than same-contrast polarity flankers. The advantage of low target-flanker similarity has typically been shown with attentional selection of a single target object. Here, we investigated the validity of the similarity rule when broadening attention to multiple objects. In three experiments, we measured identification for crowded letters (Experiment 1), tumbling Ts (Experiment 2), and tilted lines (Experiment 3). Stimuli consisted of three items that were uniform or alternating in contrast polarity and were briefly presented at ten degrees eccentricity. Observers reported all items (full report) or only the left, central, or right item (single-item report). In Experiments 1 and 2, consistent with the similarity rule, single central item performance was superior with opposite- compared to same-contrast polarity flankers. With full report, the similarity rule was inverted: performance was better for uniform compared to alternating stimuli. In Experiment 3, contrast polarity did not affect performance. We demonstrated a reversal of the similarity rule under broadened attention, suggesting that stimulus uniformity benefits crowded object recognition when intentionally directing attention towards all stimulus elements. We propose that key properties of crowding have only limited validity as they may require a-priori differentiation of target and context.
Collapse
Affiliation(s)
- Koen Rummens
- Institute of Psychology, University of Bern, Bern, Switzerland.
| | - Bilge Sayim
- Institute of Psychology, University of Bern, Bern, Switzerland
- UMR 9193 - SCALab - Sciences Cognitives et Sciences Affectives, Université de Lille, CNRS, 59000, Lille, France
| |
Collapse
|
5
|
Chakravarthi R, Rubruck J, Kipling N, Clarke ADF. Characterizing the in-out asymmetry in visual crowding. J Vis 2021; 21:10. [PMID: 34668932 PMCID: PMC8602924 DOI: 10.1167/jov.21.11.10] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2021] [Accepted: 09/18/2021] [Indexed: 11/24/2022] Open
Abstract
An object's processing is impaired by the presence of nearby clutter. Several distinct mechanisms, such as masking and visual crowding, are thought to contribute to such flanker-induced interference. It is therefore important to determine which mechanism is operational in any given situation. Previous studies have proposed that the in-out asymmetry (IOA), where a peripheral flanker interferes with the target more than a foveal flanker, is diagnostic of crowding. However, several studies have documented inconsistencies in the occurrence of this asymmetry, particularly at locations beyond the horizontal meridian, casting doubt on its ability to delineate crowding. In this study, to determine if IOA is diagnostic of crowding, we extensively charted its properties. We asked a relatively large set of participants (n = 38) to identify a briefly presented peripheral letter flanked by a single inward or outward letter at one of four locations. We also manipulated target location uncertainty and attentional allocation by blocking, randomizing or pre-cueing the target location. Using multilevel Bayesian regression analysis, we found robust IOA at all locations, although its strength was modulated by target location, location uncertainty, and attentional allocation. Our findings suggest that IOA can be an excellent marker of crowding, to the extent that it is not observed in other flanker-interference mechanisms, such as masking.
Collapse
Affiliation(s)
| | - Jirko Rubruck
- School of Psychology, University of Aberdeen, Aberdeen, UK
| | - Nikki Kipling
- Department of Psychology, University of Essex, Essex, UK
| | - Alasdair D F Clarke
- Department of Psychology, University of Essex, Essex, UK
- https://www.essex.ac.uk/people/clark28201/alasdair-clarke
| |
Collapse
|
6
|
Abstract
In crowding, perception of a target deteriorates in the presence of nearby flankers. Surprisingly, perception can be rescued from crowding if additional flankers are added (uncrowding). Uncrowding is a major challenge for all classic models of crowding and vision in general, because the global configuration of the entire stimulus is crucial. However, it is unclear which characteristics of the configuration impact (un)crowding. Here, we systematically dissected flanker configurations and showed that (un)crowding cannot be easily explained by the effects of the sub-parts or low-level features of the stimulus configuration. Our modeling results suggest that (un)crowding requires global processing. These results are well in line with previous studies showing the importance of global aspects in crowding.
Collapse
Affiliation(s)
- Oh-Hyeon Choung
- Laboratory of Psychophysics, Brain Mind Institute, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
| | - Alban Bornet
- Laboratory of Psychophysics, Brain Mind Institute, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
| | - Adrien Doerig
- Laboratory of Psychophysics, Brain Mind Institute, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
- Donders Institute for Brain, Cognition and Behaviour, Radboud University, Nijmegen, The Netherlands
| | - Michael H Herzog
- Laboratory of Psychophysics, Brain Mind Institute, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
| |
Collapse
|
7
|
Abstract
Visual crowding-the deleterious influence of nearby objects on object recognition-is considered to be a major bottleneck for object recognition in cluttered environments. Although crowding has been studied for decades with static and artificial stimuli, it is still unclear how crowding operates when viewing natural dynamic scenes in real-life situations. For example, driving is a frequent and potentially fatal real-life situation where crowding may play a critical role. In order to investigate the role of crowding in this kind of situation, we presented observers with naturalistic driving videos and recorded their eye movements while they performed a simulated driving task. We found that the saccade localization on pedestrians was impacted by visual clutter, in a manner consistent with the diagnostic criteria of crowding (Bouma's rule of thumb, flanker similarity tuning, and the radial-tangential anisotropy). In order to further confirm that altered saccadic localization is a behavioral consequence of crowding, we also showed that crowding occurs in the recognition of cluttered pedestrians in a more conventional crowding paradigm. We asked participants to discriminate the gender of pedestrians in static video frames and found that the altered saccadic localization correlated with the degree of crowding of the saccade targets. Taken together, our results provide strong evidence that crowding impacts both recognition and goal-directed actions in natural driving situations.
Collapse
|
8
|
Abstract
Object recognition in the periphery is limited by clutter. This phenomenon of visual crowding is ameliorated when the objects are dissimilar. This effect of inter-object similarity has been extensively studied for low-level features and is thought to reflect bottom-up processes. Recently, crowding was also found to be reduced when objects belonged to explicitly distinct groups; that is, crowding was weak when they had low group membership similarity. It has been claimed that top-down knowledge is necessary to explain this effect of group membership, implying that the effect of similarity on crowding cannot be a purely bottom-up process. We tested the claim that the effect of group membership relies on knowledge in two experiments and found that neither explicit knowledge about differences in group membership nor the possibility of acquiring knowledge about target identities is necessary to produce the effects. These results suggest that top-down processes need not be invoked to explain the effect of group membership. Instead, we suggest that differences in flanker reportability that emerge from the differences in group membership are the source of the effect. That is, when targets and flankers are sampled from distinct groups, flankers cannot be inadvertently reported, leading to fewer errors and hence weaker crowding. Further, we argue that this effect arises at the stage of response selection. This conclusion is well supported by an analytical model based on these principles. We conclude that previously observed effects in crowding attributed to top-down or higher level processes might instead be due to post-perceptual response selection strategies.
Collapse
|
9
|
Asymmetries in flanker-target interference at different levels of number processing. Acta Psychol (Amst) 2019; 201:102938. [PMID: 31726419 DOI: 10.1016/j.actpsy.2019.102938] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2019] [Revised: 07/31/2019] [Accepted: 09/18/2019] [Indexed: 11/23/2022] Open
Abstract
Visual stimuli presented in peripheries can be barely recognized when they are surrounded by flankers (crowding). The target-flanker interference can be asymmetrical, and this asymmetry depends on a stimulus type. In particular, recognition of a letter or a number is more disturbed by the presence of a leftward flanker, reflecting the direction of reading. So far, such reading-related asymmetry has been observed with visual recognition tasks. In the following studies, we used numbers as stimuli to examine whether the leftward asymmetry in crowding extends to other levels of information processing, i.e. whether it is present when more abstract, semantic features are extracted. We presented participants with numerical triplets in the left or right visual field, and asked them to classify the middle number according to its magnitude (Experiment 1), physical characteristics (Experiment 2) or parity (Experiment 3). We observed that the leftward flanker interfered stronger with the target than the rightward flanker, but only when magnitude and physical characteristics were classified. Our findings suggest that the leftward asymmetry in crowding extends up to the semantic level of number processing, but only selectively, i.e. when a certain sort of information (magnitude) is extracted.
Collapse
|
10
|
Hayashi D. Influence of Multiple Types of Proximity on the Degree of Visual Crowding Effects Within a Single Gap Detection Task. Iperception 2019; 10:2041669519837263. [PMID: 30906517 PMCID: PMC6421615 DOI: 10.1177/2041669519837263] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2018] [Accepted: 02/20/2019] [Indexed: 11/17/2022] Open
Abstract
The visual system cannot recognize an object (target) in peripheral vision when presented with neighboring similar stimuli (flanker). This object recognition disability is known as crowding. Studies have shown that various types of proximity, such as spatial distance or semantic category, affect the degree of crowding. However, thus far, these effects have mostly been studied separately. Hence, their underlying similarities and differences are still unknown. In this study, we developed a novel gap detection task and tested whether the effect of three different types of proximity in crowding (the relative position between target gap and nearest flanker edge, the flanker location compared with the target location, and the semantic category of the target) can be measured within a single task. A psychometric function analysis revealed that two of the assumed types of proximity affected the degree of crowding within a single task.
Collapse
Affiliation(s)
- Daisuke Hayashi
- Department of Psychology, The University of Tokyo, Japan; Faculty of Human Informatics, Aichi Shukutoku University, Japan
| |
Collapse
|
11
|
Gong M, Xuan Y, Smart LJ, Olzak LA. The extraction of natural scene gist in visual crowding. Sci Rep 2018; 8:14073. [PMID: 30232470 PMCID: PMC6145949 DOI: 10.1038/s41598-018-32455-6] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2018] [Accepted: 09/10/2018] [Indexed: 12/02/2022] Open
Abstract
The gist of natural scenes can be extracted very rapidly and even without focal attention. However, it is unclear whether and to what extent the gist of natural scenes can break through the bottleneck of crowding, a phenomenon in which object recognition will be immensely impaired. In the first two experiments, a target scene, either presented alone or surrounded by four flankers, was categorized at basic (Experiment 1) or global levels (Experiment 2). It was showed that the elimination of high-level semantic information of flankers greatly alleviated the crowding effect, demonstrating that high-level information played an important role in crowding of scene gist. More importantly, participants were able to categorize the scenes in crowding at rather high accuracies, suggesting that the extraction of scene gist might be a prioritized process. To test this hypothesis, in Experiment 3 we compared the crowding effect of three types of stimuli, namely, scenes, facial expressions and letter "E"s. The results showed that scenes could be better categorized than the other two types of stimuli in the crowding condition. This scene gist advantage thus supported our hypothesis. Together, the present studies suggest that scene gist is highly recognizable in crowding, probably due to its prioritization in visual processing.
Collapse
Affiliation(s)
- Mingliang Gong
- Department of Psychology, Miami University, Oxford, OH, USA.
| | - Yuming Xuan
- State Key Laboratory of Brain and Cognitive Science, Institute of Psychology, Chinese Academy of Sciences, Beijing, China.
- Department of Psychology, University of the Chinese Academy of Sciences, Beijing, China.
| | - L James Smart
- Department of Psychology, Miami University, Oxford, OH, USA
| | - Lynn A Olzak
- Department of Psychology, Miami University, Oxford, OH, USA
| |
Collapse
|