1
|
Waugh SJ, Fronius M. Landolt C-Tests With "Fixed" Arcmin Separations Detect Amblyopia But Underestimate Crowding in Moderate-to-Severe Amblyopic Children and Adults. Invest Ophthalmol Vis Sci 2024; 65:33. [PMID: 39177974 PMCID: PMC11346165 DOI: 10.1167/iovs.65.10.33] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2023] [Accepted: 07/30/2024] [Indexed: 08/24/2024] Open
Abstract
Purpose Crowding is exaggerated in central vision of strabismic amblyopia, impacting on reading ability. Crowding magnitude and interocular differences (IODs) in acuity are indicators for detection, assessment, and monitoring of treatment. Lateral masking (including contour interaction) also affects acuity and can mimic or ameliorate crowding. We investigated lateral masking/contour interaction and crowding impact on crowding magnitude and IOD measures in healthy and amblyopic pediatric and juvenile/adult groups using two Landolt C-tests with "fixed" arcmin separations. Methods Acuity (logMAR) was measured with Landolt C-tests with specified 2.6' ("crowded") and 35' ("uncrowded") separations. Crowding magnitudes (crowded - uncrowded acuities) and IODs were calculated. Participants were 69 subjects with strabismic amblyopia (n = 39 pediatric, i.e. children ≤8 years of age), 31 subjects with anisometropic amblyopia (n = 14 pediatric), and 76 healthy controls (n = 36 pediatric). Subjects with amblyopia were subgrouped by acuity as low severity (<0.4 logMAR) or high severity (≥0.4 logMAR) using the 35' separation C-test. Results Crowding magnitudes were greater in strabismic than in anisometropic amblyopia and control/fellow eyes. They were higher in pediatric control/fellow eyes than in juvenile/adult eyes. In high severity strabismic amblyopia, crowding magnitudes progressively and significantly reduced (slope = -0.17 ± 0.07, P < 0.05) with worsening acuity. IODs for this group were higher on the 2.6' C-test, but lower than expected. In high severity pediatric subjects with anisometropic amblyopia, seven of eight had lower IODs measured with the "crowded" than the "uncrowded" C-tests. Conclusions These C-tests detect amblyopia but underestimate crowding in children and adults with high severity strabismic amblyopia. Separate isolated optotype acuity and crowding distance tests may better target specific functions, while minimizing the impact of masking.
Collapse
Affiliation(s)
- Sarah J. Waugh
- Centre for Vision across the Life Span, School of Applied Sciences, University of Huddersfield, United Kingdom
| | - Maria Fronius
- Goethe University Hospital, Department of Ophthalmology, Child Vision Research Unit, Frankfurt, Germany
| |
Collapse
|
2
|
L-Miao L, Reynvoet B, Sayim B. The radial-tangential anisotropy of numerosity perception. J Vis 2024; 24:15. [PMID: 39046720 PMCID: PMC11271808 DOI: 10.1167/jov.24.7.15] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2023] [Accepted: 06/17/2024] [Indexed: 07/25/2024] Open
Abstract
Humans can estimate the number of visually presented items without counting. In most studies on numerosity perception, items are uniformly distributed across displays, with identical distributions in central and eccentric parts. However, the neural and perceptual representation of the human visual field differs between the fovea and the periphery. For example, in peripheral vision, there are strong asymmetries with regard to perceptual interferences between visual items. In particular, items arranged radially usually interfere more strongly with each other than items arranged tangentially (the radial-tangential anisotropy). This has been shown for crowding (the deleterious effect of clutter on target identification) and redundancy masking (the reduction of the number of perceived items in repeating patterns). In the present study, we tested how the radial-tangential anisotropy of peripheral vision impacts numerosity perception. In four experiments, we presented displays with varying numbers of discs that were predominantly arranged radially or tangentially, forming strong and weak interference conditions, respectively. Participants were asked to report the number of discs. We found that radial displays were reported as less numerous than tangential displays for all radial and tangential manipulations: weak (Experiment 1), strong (Experiment 2), and when using displays with mixed contrast polarity discs (Experiments 3 and 4). We propose that numerosity perception exhibits a significant radial-tangential anisotropy, resulting from local spatial interactions between items.
Collapse
Affiliation(s)
- Li L-Miao
- Université de Lille, CNRS, UMR 9193-SCALab-Sciences Cognitives et Sciences Affectives, Lille, France
- Faculty of Psychology and Educational Sciences, KU Leuven Kulak, Kortrijk, Belgium
- https://miaoli-psy.github.io/
| | - Bert Reynvoet
- Faculty of Psychology and Educational Sciences, KU Leuven Kulak, Kortrijk, Belgium
- Brain and Cognition, Faculty of Psychology and Educational Sciences, KU Leuven, Leuven, Belgium
- https://www.kuleuven.be/wieiswie/nl/person/00047096
| | - Bilge Sayim
- Université de Lille, CNRS, UMR 9193-SCALab-Sciences Cognitives et Sciences Affectives, Lille, France
- https://www.appearancelab.org/bilge
| |
Collapse
|
3
|
Er G, Sweeny TD. Similarity in motion binds and bends judgments of aspect ratio. Vision Res 2024; 220:108400. [PMID: 38603923 DOI: 10.1016/j.visres.2024.108400] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2023] [Revised: 03/11/2024] [Accepted: 03/23/2024] [Indexed: 04/13/2024]
Abstract
It is well known that objects become grouped in perceptual organization when they share some visual feature, like a common direction of motion. Less well known is that grouping can change how people perceive a set of objects. For example, when a pair of shapes consistently share a common region of space, their aspect ratios tend to be perceived as more similar (are attracted toward each other). Conversely, when shapes are assigned to different regions in space their aspect ratios repel from each other. Here we examine whether the visual system produce both attractive and repulsive distortions when the state of grouping between a pair of shapes changes on a moment-to-moment basis. Observers viewed a pair of ellipses that differed in terms of how flat or tall they were and reported the aspect ratio of one ellipse from the pair. Each ellipse was defined by a cloud of coherently-moving dots, and the dots within the two ellipses had either the same or different directions of motion, varying from trial-to-trial. We found that the cued ellipse's aspect ratio was reported to be repelled from the aspect ratio of the uncued ellipse when the shapes had different directions of motion compared to when they had the same direction of motion. These results suggest that the visual system can adaptively alter visual experience based on grouping, in particular, repelling the appearance of objects when they do not appear to go together, and it can do so quickly and flexibly.
Collapse
Affiliation(s)
- Görkem Er
- Department of Psychology, University of Denver, United States.
| | | |
Collapse
|
4
|
Kim T, Pasupathy A. Neural Correlates of Crowding in Macaque Area V4. J Neurosci 2024; 44:e2260232024. [PMID: 38670806 PMCID: PMC11170949 DOI: 10.1523/jneurosci.2260-23.2024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2023] [Revised: 03/29/2024] [Accepted: 04/17/2024] [Indexed: 04/28/2024] Open
Abstract
Visual crowding refers to the phenomenon where a target object that is easily identifiable in isolation becomes difficult to recognize when surrounded by other stimuli (distractors). Many psychophysical studies have investigated this phenomenon and proposed alternative models for the underlying mechanisms. One prominent hypothesis, albeit with mixed psychophysical support, posits that crowding arises from the loss of information due to pooled encoding of features from target and distractor stimuli in the early stages of cortical visual processing. However, neurophysiological studies have not rigorously tested this hypothesis. We studied the responses of single neurons in macaque (one male, one female) area V4, an intermediate stage of the object-processing pathway, to parametrically designed crowded displays and texture statistics-matched metameric counterparts. Our investigations reveal striking parallels between how crowding parameters-number, distance, and position of distractors-influence human psychophysical performance and V4 shape selectivity. Importantly, we also found that enhancing the salience of a target stimulus could alleviate crowding effects in highly cluttered scenes, and this could be temporally protracted reflecting a dynamical process. Thus, a pooled encoding of nearby stimuli cannot explain the observed responses, and we propose an alternative model where V4 neurons preferentially encode salient stimuli in crowded displays. Overall, we conclude that the magnitude of crowding effects is determined not just by the number of distractors and target-distractor separation but also by the relative salience of targets versus distractors based on their feature attributes-the similarity of distractors and the contrast between target and distractor stimuli.
Collapse
Affiliation(s)
- Taekjun Kim
- Department of Biological Structure, University of Washington, Seattle, Washington 98195
- Washington National Primate Research Center, University of Washington, Seattle, Washington 98195
| | - Anitha Pasupathy
- Department of Biological Structure, University of Washington, Seattle, Washington 98195
- Washington National Primate Research Center, University of Washington, Seattle, Washington 98195
| |
Collapse
|
5
|
Veríssimo IS, Nudelman Z, Olivers CNL. Does crowding predict conjunction search? An individual differences approach. Vision Res 2024; 216:108342. [PMID: 38198971 DOI: 10.1016/j.visres.2023.108342] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2023] [Revised: 11/27/2023] [Accepted: 12/07/2023] [Indexed: 01/12/2024]
Abstract
Searching for objects in the visual environment is an integral part of human behavior. Most of the information used during such visual search comes from the periphery of our vision, and understanding the basic mechanisms of search therefore requires taking into account the inherent limitations of peripheral vision. Our previous work using an individual differences approach has shown that one of the major factors limiting peripheral vision (crowding) is predictive of single feature search, as reflected in response time and eye movement measures. Here we extended this work, by testing the relationship between crowding and visual search in a conjunction-search paradigm. Given that conjunction search involves more fine-grained discrimination and more serial behavior, we predicted it would be strongly affected by crowding. We tested sixty participants with regard to their sensitivity to both orientation and color-based crowding (as measured by critical spacing) and their efficiency in searching for a color/orientation conjunction (as indicated by manual response times and eye movements). While the correlations between the different crowding tasks were high, the correlations between the different crowding measures and search performance were relatively modest, and no higher than those previously observed for single-feature search. Instead, observers showed very strong color selectivity during search. The results suggest that conjunction search behavior relies more on top-down guidance (here by color) and is therefore relatively less determined by individual differences in sensory limitations as caused by crowding.
Collapse
Affiliation(s)
- Inês S Veríssimo
- Department of Experimental and Applied Psychology, Cognitive Psychology Section, Vrije Universiteit Amsterdam, Van der Boechorststraat 7, 1081 BT Amsterdam, The Netherlands; Institute for Brain and Behavior, Van der Boechorststraat 7, 1081 BT Amsterdam, The Netherlands.
| | - Zachary Nudelman
- Department of Experimental and Applied Psychology, Cognitive Psychology Section, Vrije Universiteit Amsterdam, Van der Boechorststraat 7, 1081 BT Amsterdam, The Netherlands
| | - Christian N L Olivers
- Department of Experimental and Applied Psychology, Cognitive Psychology Section, Vrije Universiteit Amsterdam, Van der Boechorststraat 7, 1081 BT Amsterdam, The Netherlands
| |
Collapse
|
6
|
Bondarko VM, Chikhman VN, Danilova MV, Solnushkin SD. Foveal crowding for large and small Landolt Cs: Similarity and Attention. Vision Res 2024; 215:108346. [PMID: 38171199 DOI: 10.1016/j.visres.2023.108346] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2023] [Revised: 12/05/2023] [Accepted: 12/07/2023] [Indexed: 01/05/2024]
Abstract
We compare the recognition of foveal crowded Landolt Cs of two sizes: brief (40 ms), large, low-contrast Cs and high-contrast (1 sec) tests at the resolution limit of the visual system. In different series, the test Landolt C was surrounded by two identical distractors located symmetrically along the horizontal or by a single distractor. The distractors were Landolt Cs or rings. At the resolution limit, the critical spacing was similar in the two series and did not depend on the type of distractor. The result supports the hypothesis that crowding at the resolution limit occurs when both the test and the distractors fall into the same smallest receptive field responsible for the target recognition. For large stimuli, at almost all separations distractors of the same shape caused greater impairment than did rings, and recognition errors were non-random. The critical spacing was equal to 0.5 test diameters only in the presence of one distracting Landolt C. This result suggests that attention is involved: When one distractor is added, involuntary attention, which is directed to the centre of gravity of the stimulus, can lead to confusion of features that are present in both tests and distractors and thus to non-random errors.
Collapse
Affiliation(s)
- V M Bondarko
- IP Pavlov Institute of Physiology, Laboratory of Visual Physiology, Nab.Makarova 6, St. Petersburg 199034, Russia
| | - V N Chikhman
- IP Pavlov Institute of Physiology, Laboratory of Information Technologies and Mathematical Modelling, Nab.Makarova 6, St. Petersburg 199034, Russia
| | - M V Danilova
- IP Pavlov Institute of Physiology, Laboratory of Visual Physiology, Nab.Makarova 6, St. Petersburg 199034, Russia.
| | - S D Solnushkin
- IP Pavlov Institute of Physiology, Laboratory of Information Technologies and Mathematical Modelling, Nab.Makarova 6, St. Petersburg 199034, Russia
| |
Collapse
|
7
|
Shechter A, Medina S, Share DL, Yashar A. Language-universal and script-specific factors in the recognition of letters in visual crowding: The effects of lexicality, hemifield, and transitional probabilities in a right-to-left script. Cortex 2024; 171:319-329. [PMID: 38070387 DOI: 10.1016/j.cortex.2023.10.021] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2023] [Revised: 09/26/2023] [Accepted: 10/23/2023] [Indexed: 02/12/2024]
Abstract
Peripheral letter recognition is fundamentally limited not by the visibility of letters but by the spacing between them, i.e., 'crowding'. Crowding imposes a significant constraint on reading, however, the interplay between crowding and reading is not fully understood. Using a letter recognition task in varying display conditions, we investigated the effects of lexicality (words versus pseudowords), visual hemifield, and transitional letter probability (bigram/trigram frequency) among skilled readers (N = 14. and N = 13) in Hebrew - a script read from right to left. We observed two language-universal effects: a lexicality effect and a right hemifield (left hemisphere) advantage, as well as a strong language-specific effect - a left bigram advantage stemming from the right-to-left reading direction of Hebrew. The latter finding suggests that transitional probabilities are essential for parafoveal letter recognition. The results reveal that script-specific contextual information such as letter combination probabilities is used to accurately identify crowded letters.
Collapse
Affiliation(s)
- Adi Shechter
- The Edmond J. Safra Brain Research Center for the Study of Learning Disabilities, The University of Haifa, Haifa, Israel; Department of Learning Disabilities, Faculty of Education, University of Haifa, Haifa, Israel
| | - Sivan Medina
- Department of Learning and Instructional Sciences, Faculty of Education, University of Haifa, Haifa, Israel
| | - David L Share
- The Edmond J. Safra Brain Research Center for the Study of Learning Disabilities, The University of Haifa, Haifa, Israel; Department of Learning Disabilities, Faculty of Education, University of Haifa, Haifa, Israel
| | - Amit Yashar
- The Edmond J. Safra Brain Research Center for the Study of Learning Disabilities, The University of Haifa, Haifa, Israel; Department of Special Education, Faculty of Education, University of Haifa, Haifa, Israel.
| |
Collapse
|
8
|
Moore CM, Zheng Q. Limited midlevel mediation of visual crowding: Surface completion fails to support uncrowding. J Vis 2024; 24:11. [PMID: 38294775 PMCID: PMC10839818 DOI: 10.1167/jov.24.1.11] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2023] [Accepted: 12/10/2023] [Indexed: 02/01/2024] Open
Abstract
Visual crowding refers to impaired object recognition that is caused by nearby stimuli. It increases with eccentricity. Image-level explanations of crowding maintain that it is caused by information loss within early encoding processes that vary in functionality with eccentricity. Alternative explanations maintain that the interference is not limited to two-dimensional image-level interactions but that it is mediated within representations that reflect three-dimensional scene structure. Uncrowding refers to when adding stimulus information to a display, which increases the noise at an image level, nonetheless decreasing the amount of crowding that occurs. Uncrowding has been interpreted as evidence of midlevel mediation of crowding because the additional information tends to provide an opportunity for perceptually organizing stimuli into distinct and therefore protected representations. It is difficult, however, to rule out image-level explanations of crowding and uncrowding when stimulus differences exist between conditions. We adapted displays of a specific form of uncrowding to minimize stimulus differences across conditions, while retaining the potential for perceptual organization, specifically perceptual surface completion. Uncrowding under these conditions would provide strong support for midlevel mediation of crowding. In five experiments, however, we found no evidence of midlevel mediation of crowding, indicating that at least for this version of uncrowding, image-level explanations cannot be ruled out.
Collapse
Affiliation(s)
- Cathleen M Moore
- Department of Psychological and Brain Sciences, University of Iowa, Iowa City, IA, USA
| | - Qingzi Zheng
- Department of Psychological and Brain Sciences, University of Iowa, Iowa City, IA, USA
| |
Collapse
|
9
|
Smithers SP, Shao Y, Altham J, Bex PJ. Large depth differences between target and flankers can increase crowding: Evidence from a multi-depth plane display. eLife 2023; 12:e85143. [PMID: 37665324 PMCID: PMC10476968 DOI: 10.7554/elife.85143] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2022] [Accepted: 07/20/2023] [Indexed: 09/05/2023] Open
Abstract
Crowding occurs when the presence of nearby features causes highly visible objects to become unrecognizable. Although crowding has implications for many everyday tasks and the tremendous amounts of research reflect its importance, surprisingly little is known about how depth affects crowding. Most available studies show that stereoscopic disparity reduces crowding, indicating that crowding may be relatively unimportant in three-dimensional environments. However, most previous studies tested only small stereoscopic differences in depth in which disparity, defocus blur, and accommodation are inconsistent with the real world. Using a novel multi-depth plane display, this study investigated how large (0.54-2.25 diopters), real differences in target-flanker depth, representative of those experienced between many objects in the real world, affect crowding. Our findings show that large differences in target-flanker depth increased crowding in the majority of observers, contrary to previous work showing reduced crowding in the presence of small depth differences. Furthermore, when the target was at fixation depth, crowding was generally more pronounced when the flankers were behind the target as opposed to in front of it. However, when the flankers were at fixation depth, crowding was generally more pronounced when the target was behind the flankers. These findings suggest that crowding from clutter outside the limits of binocular fusion can still have a significant impact on object recognition and visual perception in the peripheral field.
Collapse
Affiliation(s)
- Samuel P Smithers
- Department of Psychology, Northeastern UniversityBostonUnited States
| | - Yulong Shao
- Department of Psychology, Northeastern UniversityBostonUnited States
| | - James Altham
- Department of Psychology, Northeastern UniversityBostonUnited States
| | - Peter J Bex
- Department of Psychology, Northeastern UniversityBostonUnited States
| |
Collapse
|
10
|
Kurzawski JW, Burchell A, Thapa D, Winawer J, Majaj NJ, Pelli DG. The Bouma law accounts for crowding in 50 observers. J Vis 2023; 23:6. [PMID: 37540179 PMCID: PMC10408772 DOI: 10.1167/jov.23.8.6] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2021] [Accepted: 05/16/2023] [Indexed: 08/05/2023] Open
Abstract
Crowding is the failure to recognize an object due to surrounding clutter. Our visual crowding survey measured 13 crowding distances (or "critical spacings") twice in each of 50 observers. The survey includes three eccentricities (0, 5, and 10 deg), four cardinal meridians, two orientations (radial and tangential), and two fonts (Sloan and Pelli). The survey also tested foveal acuity, twice. Remarkably, fitting a two-parameter model-the well-known Bouma law, where crowding distance grows linearly with eccentricity-explains 82% of the variance for all 13 × 50 measured log crowding distances, cross-validated. An enhanced Bouma law, with factors for meridian, crowding orientation, target kind, and observer, explains 94% of the variance, again cross-validated. These additional factors reveal several asymmetries, consistent with previous reports, which can be expressed as crowding-distance ratios: 0.62 horizontal:vertical, 0.79 lower:upper, 0.78 right:left, 0.55 tangential:radial, and 0.78 Sloan-font:Pelli-font. Across our observers, peripheral crowding is independent of foveal crowding and acuity. Evaluation of the Bouma factor, b (the slope of the Bouma law), as a biomarker of visual health would be easier if there were a way to compare results across crowding studies that use different methods. We define a standardized Bouma factor b' that corrects for differences from Bouma's 25 choice alternatives, 75% threshold criterion, and linearly symmetric flanker placement. For radial crowding on the right meridian, the standardized Bouma factor b' is 0.24 for this study, 0.35 for Bouma (1970), and 0.30 for the geometric mean across five representative modern studies, including this one, showing good agreement across labs, including Bouma's. Simulations, confirmed by data, show that peeking can skew estimates of crowding (e.g., greatly decreasing the mean or doubling the SD of log b). Using gaze tracking to prevent peeking, individual differences are robust, as evidenced by the much larger 0.08 SD of log b across observers than the mere 0.03 test-retest SD of log b measured in half an hour. The ease of measurement of crowding enhances its promise as a biomarker for dyslexia and visual health.
Collapse
Affiliation(s)
- Jan W Kurzawski
- Department of Psychology, New York University, New York, NY, USA
| | - Augustin Burchell
- Cognitive Science & Computer Science, Swarthmore College, Swarthmore, PA, USA
| | - Darshan Thapa
- Center for Neural Science, New York University, New York, NY, USA
| | - Jonathan Winawer
- Department of Psychology, New York University, New York, NY, USA
- Center for Neural Science, New York University, New York, NY, USA
| | - Najib J Majaj
- Center for Neural Science, New York University, New York, NY, USA
| | - Denis G Pelli
- Department of Psychology, New York University, New York, NY, USA
- Center for Neural Science, New York University, New York, NY, USA
| |
Collapse
|
11
|
Haseeb Z, Wolfe B, Kosovicheva A. Spatial variability in localization biases predicts crowding performance. J Vis 2023; 23:9. [PMID: 37432845 DOI: 10.1167/jov.23.7.9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/13/2023] Open
Abstract
Visual processing varies substantially across individuals, and prior work has shown significant individual differences in fundamental processes such as spatial localization. For example, when asked to report the location of a briefly flashed target in the periphery, different observers systematically misperceive its location in an idiosyncratic manner, showing different patterns of reproduction error across visual field locations. In this study, we tested whether these individual differences may propagate to other stages of visual processing, affecting the strength of visual crowding, which depends on the spacing between objects in the periphery. We, therefore, investigated the relationship between observers' idiosyncratic biases in localization and the strength of crowding to determine whether these spatial biases limit peripheral object recognition. To examine this relationship, we measured the strength of crowding at 12 locations at 8° eccentricity, in addition to the perceived spacing between pairs of Gaussian patches at these same locations. These measurements show an association between variability in crowding strength and perceived spacing at the same visual field locations: at locations where a participant experienced stronger crowding, their perceived spacing was smaller, and vice versa. We demonstrate that spatial heterogeneity in perceived spacing affects observers' ability to recognize objects in the periphery. Our results support the idea that variability in both spatial sensitivity and bias contribute to variability in the strength of crowding and bolster the account that variability in spatial coding may propagate across multiple stages of visual processing.
Collapse
Affiliation(s)
- Zainab Haseeb
- Department of Psychology, University of Toronto Mississauga, Mississauga, Ontario, Canada
- https://applylab.org/
| | - Benjamin Wolfe
- Department of Psychology, University of Toronto Mississauga, Mississauga, Ontario, Canada
- https://applylab.org/
| | - Anna Kosovicheva
- Department of Psychology, University of Toronto Mississauga, Mississauga, Ontario, Canada
- https://applylab.org/
| |
Collapse
|
12
|
Doerig A, Sommers RP, Seeliger K, Richards B, Ismael J, Lindsay GW, Kording KP, Konkle T, van Gerven MAJ, Kriegeskorte N, Kietzmann TC. The neuroconnectionist research programme. Nat Rev Neurosci 2023:10.1038/s41583-023-00705-w. [PMID: 37253949 DOI: 10.1038/s41583-023-00705-w] [Citation(s) in RCA: 20] [Impact Index Per Article: 20.0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/21/2023] [Indexed: 06/01/2023]
Abstract
Artificial neural networks (ANNs) inspired by biology are beginning to be widely used to model behavioural and neural data, an approach we call 'neuroconnectionism'. ANNs have been not only lauded as the current best models of information processing in the brain but also criticized for failing to account for basic cognitive functions. In this Perspective article, we propose that arguing about the successes and failures of a restricted set of current ANNs is the wrong approach to assess the promise of neuroconnectionism for brain science. Instead, we take inspiration from the philosophy of science, and in particular from Lakatos, who showed that the core of a scientific research programme is often not directly falsifiable but should be assessed by its capacity to generate novel insights. Following this view, we present neuroconnectionism as a general research programme centred around ANNs as a computational language for expressing falsifiable theories about brain computation. We describe the core of the programme, the underlying computational framework and its tools for testing specific neuroscientific hypotheses and deriving novel understanding. Taking a longitudinal view, we review past and present neuroconnectionist projects and their responses to challenges and argue that the research programme is highly progressive, generating new and otherwise unreachable insights into the workings of the brain.
Collapse
Affiliation(s)
- Adrien Doerig
- Institute of Cognitive Science, University of Osnabrück, Osnabrück, Germany.
- Donders Institute for Brain, Cognition and Behaviour, Nijmegen, The Netherlands.
| | - Rowan P Sommers
- Department of Neurobiology of Language, Max Planck Institute for Psycholinguistics, Nijmegen, The Netherlands
| | - Katja Seeliger
- Max Planck Institute for Human Cognitive and Brain Sciences, Leipzig, Germany
| | - Blake Richards
- Department of Neurology and Neurosurgery, McGill University, Montréal, QC, Canada
- School of Computer Science, McGill University, Montréal, QC, Canada
- Mila, Montréal, QC, Canada
- Montréal Neurological Institute, Montréal, QC, Canada
- Learning in Machines and Brains Program, CIFAR, Toronto, ON, Canada
| | | | | | - Konrad P Kording
- Learning in Machines and Brains Program, CIFAR, Toronto, ON, Canada
- Bioengineering, Neuroscience, University of Pennsylvania, Pennsylvania, PA, USA
| | | | | | | | - Tim C Kietzmann
- Institute of Cognitive Science, University of Osnabrück, Osnabrück, Germany
| |
Collapse
|
13
|
Gong M, Liu T, Liu X, Huangfu B, Geng F. Attention relieves visual crowding: Dissociable effects of peripheral and central cues. J Vis 2023; 23:9. [PMID: 37163245 PMCID: PMC10179668 DOI: 10.1167/jov.23.5.9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/11/2023] Open
Abstract
Visual crowding can be reduced when attention is directed to the target by peripheral cues. However, it is unclear whether central cues relieve visual crowding to the same extent as peripheral cues. In this study, we combined the Posner cueing task and the crowding task to investigate the effect of exogenous and endogenous attention on crowding. In Experiment 1, five different stimulus-onset asychronies (SOAs) between the cue and the target and a predictive validity of 100% were adopted. Both attentional cues were shown to significantly reduce the effect of visual crowding, but the peripheral cue was more effective than the central cue. Furthermore, peripheral cues started to relieve visual crowding at the shortest SOA (100 ms), whereas central cues worked only at later SOAs (275 ms or above). When the predictive validity of the cue was decreased to 70% in Experiment 2, similar results to Experiment 1 were found, but the valid cue was less effective in reducing crowding than that in Experiment 1. In Experiment 3, when the predictive validity was decreased to 50%, a valid peripheral cue improved performance but a valid central cue did not, suggesting that endogenous attention but not exogenous attention can be voluntarily controlled when the cues are not predictive of the target's location. These findings collectively suggest that both peripheral and central cues can alleviate crowding, but they differ in terms of strength, time dynamics, and flexibility of voluntary control.
Collapse
Affiliation(s)
- Mingliang Gong
- School of Psychology, Jiangxi Normal University, Nanchang, China
| | - Tingyu Liu
- School of Psychology, Jiangxi Normal University, Nanchang, China
| | - Xi Liu
- School of Psychology, Jiangxi Normal University, Nanchang, China
| | - Bingzhe Huangfu
- School of Psychology, Jiangxi Normal University, Nanchang, China
| | - Fulei Geng
- School of Psychology, Jiangxi Normal University, Nanchang, China
| |
Collapse
|
14
|
Foveal vision determines the perceived emotion of face ensembles. Atten Percept Psychophys 2023; 85:209-221. [PMID: 36369614 DOI: 10.3758/s13414-022-02614-z] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/03/2022] [Indexed: 11/13/2022]
Abstract
People can extract summary statistical information from groups of similar objects, an ability called ensemble perception. However, not every object in a group is weighted equally. For example, in ensemble emotion perception, faces far from fixation were weighted less than faces close to fixation. Yet the contribution of foveal input in ensemble emotion perception is still unclear. In two experiments, groups of faces with varying emotions were presented for 100 ms at three different eccentricities (0°, 3°, 8°). Observers reported the perceived average emotion of the group. In two conditions, stimuli consisted of a central face flanked by eight faces (flankers) (central-present condition) and eight faces without the central face (central-absent condition). In the central-present condition, the emotion of the central face was either congruent or incongruent with that of the flankers. In Experiment 1, flanker emotions were uniform (identical flankers); in Experiment 2 they were varied. In both experiments, performance in the central-present condition was superior at 3° compared to 0° and 8°. At 0°, performance was superior in the central-absent (i.e., no foveal input) compared to the central-present condition. Poor performance in the central-present condition was driven by the incongruent condition where the foveal face strongly biased responses. At 3° and 8°, performance was comparable between central-present and central-absent conditions. Our results showed how foveal input determined the perceived emotion of face ensembles, suggesting that ensemble perception fails when salient target information is available in central vision.
Collapse
|
15
|
Choung OH, Gordillo D, Roinishvili M, Brand A, Herzog MH, Chkonia E. Intact and deficient contextual processing in schizophrenia patients. Schizophr Res Cogn 2022; 30:100265. [PMID: 36119400 PMCID: PMC9477851 DOI: 10.1016/j.scog.2022.100265] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2022] [Revised: 07/09/2022] [Accepted: 07/09/2022] [Indexed: 11/25/2022] Open
Abstract
Schizophrenia patients are known to have deficits in contextual vision. However, results are often very mixed. In some paradigms, patients do not take the context into account and, hence, perform more veridically than healthy controls. In other paradigms, context deteriorates performance much more strongly in patients compared to healthy controls. These mixed results may be explained by differences in the paradigms as well as by small or biased samples, given the large heterogeneity of patients' deficits. Here, we show that mixed results may also come from idiosyncrasies of the stimuli used because in variants of the same visual paradigm, tested with the same participants, we found intact and deficient processing.
Collapse
Affiliation(s)
- Oh-Hyeon Choung
- Laboratory of Psychophysics, Brain Mind Institute, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
- Corresponding author. http://lpsy.epfl.ch
| | - Dario Gordillo
- Laboratory of Psychophysics, Brain Mind Institute, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
| | - Maya Roinishvili
- Laboratory of Vision Physiology, Ivane Beritashvili Centre of Experimental Biomedicine, Tbilisi, Georgia
- Institute of Cognitive Neurosciences, Free University of Tbilisi, Tbilisi, Georgia
| | - Andreas Brand
- Laboratory of Psychophysics, Brain Mind Institute, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
| | - Michael H. Herzog
- Laboratory of Psychophysics, Brain Mind Institute, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
| | - Eka Chkonia
- Department of Psychiatry, Tbilisi State Medical University, Tbilisi, Georgia
| |
Collapse
|
16
|
Herzog MH, Sayim B. Crowding: Recent advances and perspectives. J Vis 2022; 22:15. [DOI: 10.1167/jov.22.12.15] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Affiliation(s)
- Michael H. Herzog
- Laboratory of Psychophysics, Brain Mind Institute, Ecole Polytechnique Federale de Lausanne (EPFL), Lausanne, Switzerland
| | - Bilge Sayim
- Sciences Cognitives et Sciences Affectives (SCALab), CNRS, UMR 9193, University of Lille, Lille, France
- Institute of Psychology, University of Bern, Bern, Switzerland
| |
Collapse
|
17
|
On letter-specific crowding and reading: Evidence from ERPs. Neuropsychologia 2022; 176:108396. [DOI: 10.1016/j.neuropsychologia.2022.108396] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2022] [Revised: 10/13/2022] [Accepted: 10/14/2022] [Indexed: 11/18/2022]
|
18
|
Neural correlates of lateral modulation and perceptual filling-in in center-surround radial sinusoidal gratings: an fMRI study. Sci Rep 2022; 12:16143. [PMID: 36167763 PMCID: PMC9515077 DOI: 10.1038/s41598-022-20592-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2022] [Accepted: 09/15/2022] [Indexed: 11/08/2022] Open
Abstract
We investigated lateral modulation effects with functional magnetic resonance imaging. We presented radial sinusoidal gratings in random sequence: a scotoma grating with two arc-shaped blank regions (scotomata) in the periphery, one in the left and one in the right visual field, a center grating containing pattern only in the scotoma regions, and a full-field grating where the pattern occupied the whole screen. On each trial, one of the three gratings flickered in counterphase for 10 s, followed by a blank period. Observers were instructed to perform a fixation task and report whether filling-in was experienced during the scotoma condition. The results showed that the blood-oxygen-level-dependent signal was reduced in areas corresponding to the scotoma regions in the full-field compared to the center condition in V1 to V3 areas, indicating a lateral inhibition effect when the surround was added to the center pattern. The univariate analysis results showed no difference between the filling-in and no-filling-in trials. However, multivariate pattern analysis results showed that classifiers trained on activation pattern in V1 to V3 could differentiate between filling-in and no-filling-in trials, suggesting that the neural activation pattern in visual cortex correlated with the subjective percept.
Collapse
|
19
|
Yildirim FZ, Sayim B. High confidence and low accuracy in redundancy masking. Conscious Cogn 2022; 102:103349. [DOI: 10.1016/j.concog.2022.103349] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2021] [Revised: 02/19/2022] [Accepted: 04/25/2022] [Indexed: 11/03/2022]
|
20
|
Rummens K, Sayim B. Multidimensional feature interactions in visual crowding: When configural cues eliminate the polarity advantage. J Vis 2022; 22:2. [PMID: 35503508 PMCID: PMC9078080 DOI: 10.1167/jov.22.6.2] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2021] [Accepted: 03/21/2022] [Indexed: 11/24/2022] Open
Abstract
Crowding occurs when surrounding objects (flankers) impair target perception. A key property of crowding is the weaker interference when target and flankers strongly differ on a given dimension. For instance, identification of a target letter is usually superior with flankers of opposite versus the same contrast polarity as the target (the "polarity advantage"). High performance when target-flanker similarity is low has been attributed to the ungrouping of target and flankers. Here, we show that configural cues can override the usual advantage of low target-flanker similarity, and strong target-flanker grouping can reduce - instead of exacerbate - crowding. In Experiment 1, observers were presented with line triplets in the periphery and reported the tilt (left or right) of the central line. Target and flankers had the same (uniform condition) or opposite contrast polarity (alternating condition). Flanker configurations were either upright (||), unidirectionally tilted (\\ or //), or bidirectionally tilted (\/ or /\). Upright flankers yielded stronger crowding than unidirectional flankers, and weaker crowding than bidirectional flankers. Importantly, our results revealed a clear interaction between contrast polarity and flanker configuration. Triplets with upright and bidirectional flankers, but not unidirectional flankers, showed the polarity advantage. In Experiments 2 and 3, we showed that emergent features and redundancy masking (i.e. the reduction of the number of perceived items in repeating configurations) made it easier to discriminate between uniform triplets when flanker tilts were unidirectional (but not when bidirectional). We propose that the spatial configurations of uniform triplets with unidirectional flankers provided sufficient task-relevant information to enable a similar performance as with alternating triplets: strong-target flanker grouping alleviated crowding. We suggest that features which modulate crowding strength can interact non-additively, limiting the validity of typical crowding rules to contexts where only single, independent dimensions determine the effects of target-flanker similarity.
Collapse
Affiliation(s)
- Koen Rummens
- University of Bern, Institute of Psychology, Bern, Switzerland
| | - Bilge Sayim
- University of Bern, Institute of Psychology, Bern, Switzerland
- Université de Lille, CNRS, UMR 9193 - SCALab - Sciences Cognitives et Sciences Affectives, Lille, France
| |
Collapse
|
21
|
Abstract
Redundancy masking is the reduction of the perceived number of items in repeating patterns. It shares a number of characteristics with crowding, the impairment of target identification in visual clutter. Crowding strongly depends on the location of the target in the visual field. For example, it is stronger in the upper compared to the lower visual field and is usually weakest on the horizontal meridian. This pattern of visual field asymmetries is common in spatial vision, as revealed by tasks measuring, for example, spatial resolution and contrast sensitivity. Here, to characterize redundancy masking and reveal its similarities to and differences from other spatial tasks, we investigated whether redundancy masking shows the same typical visual field asymmetries. Observers were presented with three to six radially arranged lines at 10° eccentricity at one of eight locations around fixation and were asked to report the number of lines. We found asymmetries that differed pronouncedly from those found in crowding. Redundancy masking did not differ between upper and lower visual fields. Importantly, redundancy masking was stronger on the horizontal meridian than on the vertical meridian, the opposite of what is usually found in crowding. These results show that redundancy masking diverges from crowding in regard to visual field asymmetries, suggesting different underlying mechanisms of redundancy masking and crowding. We suggest that the observed atypical visual field asymmetries in redundancy masking are due to the superior extraction of regularity and a more pronounced compression of visual space on the horizontal compared to the vertical meridian.
Collapse
Affiliation(s)
| | - Daniel R Coates
- Institute of Psychology, University of Bern, Bern, Switzerland.,College of Optometry, University of Houston, Houston, TX, USA.,
| | - Bilge Sayim
- Institute of Psychology, University of Bern, Bern, Switzerland.,Sciences Cognitives et Sciences Affectives (SCALab), CNRS, UMR 9193, University of Lille, Lille, France.,
| |
Collapse
|
22
|
Reuther J, Chakravarthi R, Martinovic J. Masking, crowding, and grouping: Connecting low and mid-level vision. J Vis 2022; 22:7. [PMID: 35147663 PMCID: PMC8842520 DOI: 10.1167/jov.22.2.7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2021] [Accepted: 09/30/2021] [Indexed: 11/30/2022] Open
Abstract
An important task for vision science is to build a unitary framework of low- and mid-level vision. As a step on this way, our study examined differences and commonalities between masking, crowding and grouping-three processes that occur through spatial interactions between neighbouring elements. We measured contrast thresholds as functions of inter-element spacing and eccentricity for Gabor detection, discrimination and contour integration, using a common stimulus grid consisting of nine Gabor elements. From these thresholds, we derived a) the baseline contrast necessary to perform each task and b) the spatial extent over which task performance was stable. This spatial window can be taken as an indicator of field size, where elements that fall within a putative field are readily combined. We found that contrast thresholds were universally modulated by inter-element distance, with a shallower and inverted effect for grouping compared with masking and crowding. Baseline contrasts for detecting stimuli and discriminating their properties were positively linked across the tested retinal locations (parafovea and near periphery), whereas those for integrating elements and discriminating their properties were negatively linked. Meanwhile, masking and crowding spatial windows remained uncorrelated across eccentricity, although they were correlated across participants. This suggests that the computation performed by each type of visual field operates over different distances that co-varies across observers, but not across retinal locations. Contrast-processing units may thus lie at the core of the shared idiosyncrasies across tasks reported in many previous studies, despite the fundamental differences in the extent of their spatial windows.
Collapse
Affiliation(s)
| | | | - Jasna Martinovic
- School of Psychology, University of Aberdeen, UK
- Department of Psychology, School of Philosophy, Psychology and Language Sciences, University of Edinburgh, UK
| |
Collapse
|
23
|
Theiss JD, Bowen JD, Silver MA. Spatial Attention Enhances Crowded Stimulus Encoding Across Modeled Receptive Fields by Increasing Redundancy of Feature Representations. Neural Comput 2021; 34:190-218. [PMID: 34710898 PMCID: PMC8693207 DOI: 10.1162/neco_a_01447] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2021] [Accepted: 07/01/2021] [Indexed: 11/04/2022]
Abstract
Any visual system, biological or artificial, must make a trade-off between the number of units used to represent the visual environment and the spatial resolution of the sampling array. Humans and some other animals are able to allocate attention to spatial locations to reconfigure the sampling array of receptive fields (RFs), thereby enhancing the spatial resolution of representations without changing the overall number of sampling units. Here, we examine how representations of visual features in a fully convolutional neural network interact and interfere with each other in an eccentricity-dependent RF pooling array and how these interactions are influenced by dynamic changes in spatial resolution across the array. We study these feature interactions within the framework of visual crowding, a well-characterized perceptual phenomenon in which target objects in the visual periphery that are easily identified in isolation are much more difficult to identify when flanked by similar nearby objects. By separately simulating effects of spatial attention on RF size and on the density of the pooling array, we demonstrate that the increase in RF density due to attention is more beneficial than changes in RF size for enhancing target classification for crowded stimuli. Furthermore, by varying target/flanker spacing, as well as the spatial extent of attention, we find that feature redundancy across RFs has more influence on target classification than the fidelity of the feature representations themselves. Based on these findings, we propose a candidate mechanism by which spatial attention relieves visual crowding through enhanced feature redundancy that is mostly due to increased RF density.
Collapse
Affiliation(s)
| | - Joel D Bowen
- University of California, Berkeley, CA 94720, U.S.A.
| | | |
Collapse
|
24
|
Rummens K, Sayim B. Broad attention uncovers benefits of stimulus uniformity in visual crowding. Sci Rep 2021; 11:23976. [PMID: 34907221 PMCID: PMC8671468 DOI: 10.1038/s41598-021-03258-z] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2021] [Accepted: 12/01/2021] [Indexed: 11/08/2022] Open
Abstract
Crowding is the interference by surrounding objects (flankers) with target perception. Low target-flanker similarity usually yields weaker crowding than high similarity ('similarity rule') with less interference, e.g., by opposite- than same-contrast polarity flankers. The advantage of low target-flanker similarity has typically been shown with attentional selection of a single target object. Here, we investigated the validity of the similarity rule when broadening attention to multiple objects. In three experiments, we measured identification for crowded letters (Experiment 1), tumbling Ts (Experiment 2), and tilted lines (Experiment 3). Stimuli consisted of three items that were uniform or alternating in contrast polarity and were briefly presented at ten degrees eccentricity. Observers reported all items (full report) or only the left, central, or right item (single-item report). In Experiments 1 and 2, consistent with the similarity rule, single central item performance was superior with opposite- compared to same-contrast polarity flankers. With full report, the similarity rule was inverted: performance was better for uniform compared to alternating stimuli. In Experiment 3, contrast polarity did not affect performance. We demonstrated a reversal of the similarity rule under broadened attention, suggesting that stimulus uniformity benefits crowded object recognition when intentionally directing attention towards all stimulus elements. We propose that key properties of crowding have only limited validity as they may require a-priori differentiation of target and context.
Collapse
Affiliation(s)
- Koen Rummens
- Institute of Psychology, University of Bern, Bern, Switzerland.
| | - Bilge Sayim
- Institute of Psychology, University of Bern, Bern, Switzerland
- UMR 9193 - SCALab - Sciences Cognitives et Sciences Affectives, Université de Lille, CNRS, 59000, Lille, France
| |
Collapse
|
25
|
Bornet A, Choung OH, Doerig A, Whitney D, Herzog MH, Manassi M. Global and high-level effects in crowding cannot be predicted by either high-dimensional pooling or target cueing. J Vis 2021; 21:10. [PMID: 34812839 PMCID: PMC8626847 DOI: 10.1167/jov.21.12.10] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2021] [Accepted: 09/30/2021] [Indexed: 11/24/2022] Open
Abstract
In visual crowding, the perception of a target deteriorates in the presence of nearby flankers. Traditionally, target-flanker interactions have been considered as local, mostly deleterious, low-level, and feature specific, occurring when information is pooled along the visual processing hierarchy. Recently, a vast literature of high-level effects in crowding (grouping effects and face-holistic crowding in particular) led to a different understanding of crowding, as a global, complex, and multilevel phenomenon that cannot be captured or explained by simple pooling models. It was recently argued that these high-level effects may still be captured by more sophisticated pooling models, such as the Texture Tiling model (TTM). Unlike simple pooling models, the high-dimensional pooling stage of the TTM preserves rich information about a crowded stimulus and, in principle, this information may be sufficient to drive high-level and global aspects of crowding. In addition, it was proposed that grouping effects in crowding may be explained by post-perceptual target cueing. Here, we extensively tested the predictions of the TTM on the results of six different studies that highlighted high-level effects in crowding. Our results show that the TTM cannot explain any of these high-level effects, and that the behavior of the model is equivalent to a simple pooling model. In addition, we show that grouping effects in crowding cannot be predicted by post-perceptual factors, such as target cueing. Taken together, these results reinforce once more the idea that complex target-flanker interactions determine crowding and that crowding occurs at multiple levels of the visual hierarchy.
Collapse
Affiliation(s)
- Alban Bornet
- Laboratory of Psychophysics, Brain Mind Institute, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
| | - Oh-Hyeon Choung
- Laboratory of Psychophysics, Brain Mind Institute, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
| | - Adrien Doerig
- Laboratory of Psychophysics, Brain Mind Institute, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
- Donders Institute for Brain, Cognition and Behaviour, Nijmegen, Netherlands
| | - David Whitney
- Department of Psychology, University of California, Berkeley, California, USA
- Helen Wills Neuroscience Institute, University of California, Berkeley, California, USA
- Vision Science Group, University of California, Berkeley, California, USA
| | - Michael H Herzog
- Laboratory of Psychophysics, Brain Mind Institute, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
| | - Mauro Manassi
- School of Psychology, University of Aberdeen, King's College, Aberdeen, UK
| |
Collapse
|
26
|
Lee RJ, Reuther J, Chakravarthi R, Martinovic J. Emergence of crowding: The role of contrast and orientation salience. J Vis 2021; 21:20. [PMID: 34709355 PMCID: PMC8556554 DOI: 10.1167/jov.21.11.20] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2020] [Accepted: 09/22/2021] [Indexed: 11/27/2022] Open
Abstract
Crowding causes difficulties in judging attributes of an object surrounded by other objects. We investigated crowding for stimuli that isolated either S-cone or luminance mechanisms or combined them. By targeting different retinogeniculate mechanisms with contrast-matched stimuli, we aim to determine the earliest site at which crowding emerges. Discrimination was measured in an orientation judgment task where Gabor targets were presented parafoveally among flankers. In the first experiment, we assessed flanked and unflanked orientation discrimination thresholds for pure S-cone and achromatic stimuli and their combinations. In the second experiment, to capture individual differences, we measured unflanked detection and orientation sensitivity, along with performance under flanker interference for stimuli containing luminance only or combined with S-cone contrast. We confirmed that orientation sensitivity was lower for unflanked S-cone stimuli. When flanked, the pattern of results for S-cone stimuli was the same as for achromatic stimuli with comparable (i.e. low) contrast levels. We also found that flanker interference exhibited a genuine signature of crowding only when orientation discrimination threshold was reliably surpassed. Crowding, therefore, emerges at a stage that operates on signals representing task-relevant featural (here, orientation) information. Because luminance and S-cone mechanisms have very different spatial tuning properties, it is most parsimonious to conclude that crowding takes place at a neural processing stage after they have been combined.
Collapse
Affiliation(s)
| | - Josephine Reuther
- School of Psychology, University of Aberdeen, Aberdeen, Scotland, UK
| | | | - Jasna Martinovic
- Department of Psychology, School of Philosophy, Psychology and Language Sciences, University of Edinburgh & School of Psychology, University of Aberdeen, Aberdeen, Scotland, UK
| |
Collapse
|
27
|
Coates DR, Ludowici CJH, Chung STL. The generality of the critical spacing for crowded optotypes: From Bouma to the 21st century. J Vis 2021; 21:18. [PMID: 34694326 PMCID: PMC8556556 DOI: 10.1167/jov.21.11.18] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2021] [Accepted: 09/22/2021] [Indexed: 11/24/2022] Open
Abstract
It is rare to find a crowding manuscript that fails to mention "Bouma's law," the rule of thumb stating that flankers within a distance of about one half of the target eccentricity will induce crowding. Here we investigate the generality of this rule (even for just optotypes), the factors that modulate the critical spacing, and the evidence for the rule in Bouma's own data. We explore these questions by reanalyzing a variety of studies from the literature, running several new control experiments, and by utilizing a model that unifies flanked identification measurements between psychophysical paradigms. Specifically, with minimal assumptions (equivalent psychometric slopes across conditions, for example), crowded acuity can be predicted for arbitrary target sizes and flanker spacings, revealing a performance "landscape" that delineates the critical spacing. Last, we present a compact quantitative summary of the effects of different types of stimulus manipulations on optotype crowding.
Collapse
Affiliation(s)
- Daniel R Coates
- College of Optometry, University of Houston, Houston, TX, USA
| | | | - Susana T L Chung
- School of Optometry, Vision Science Graduate Group, University of California, Berkeley, Berkeley, CA, USA
| |
Collapse
|
28
|
Sensitivity to Central Crowding for Faces in Patients With Glaucoma. J Glaucoma 2021; 30:140-147. [PMID: 33074958 DOI: 10.1097/ijg.0000000000001710] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2020] [Accepted: 10/03/2020] [Indexed: 11/25/2022]
Abstract
PRECIS Some patients with glaucoma report difficulties to recognize faces when they are far away. We show that this deficit could result from a higher sensitivity to crowding in central vision. PURPOSE The aim of the study is to investigate whether face recognition difficulties reported by some patients with glaucoma result from a greater sensitivity to inner crowding in central vision. METHODS Seventeen patients with glaucoma and 17 age-matched normally sighted controls participated in the study. An isolated mouth (uncrowded condition) or a mouth within a face (crowded condition) was randomly displayed centrally for 200 ms. For each condition, participants were asked to decide whether the mouth was closed or open. The stimuli were presented at 3 angular sizes (0.6×0.4, 1×0.72, and 1.5×1.08 degrees). Accuracy was measured. RESULTS Crowding affected performance differentially for patients and controls. Consistent with previous studies controls exhibited a "face superiority effect," with a better accuracy when the mouth was located within the face than when it was isolated. Sensitivity to crowding, reflected in a better accuracy with the isolated mouth, was observed in 10 of 17 patients only for small images. Crowding disappeared for larger faces, as the facial features were spaced out. Five patients were not sensitive to crowding. Importantly, no difference was found between the 2 subgroups of patients (sensitive vs. nonsensitive) in terms of mean deviation, contrast sensitivity, acuity, thickness of the retinal nerve fiber layer, or macular ganglion cell-inner plexiform layer. CONCLUSIONS An excessive sensitivity to central crowding might explain the difficulties in face perception and reading reported by some patients with glaucoma. The sensory or cognitive processes underlying this excessive sensitivity must be elucidated to improve central perception in glaucoma.
Collapse
|
29
|
Abstract
In crowding, perception of a target deteriorates in the presence of nearby flankers. Surprisingly, perception can be rescued from crowding if additional flankers are added (uncrowding). Uncrowding is a major challenge for all classic models of crowding and vision in general, because the global configuration of the entire stimulus is crucial. However, it is unclear which characteristics of the configuration impact (un)crowding. Here, we systematically dissected flanker configurations and showed that (un)crowding cannot be easily explained by the effects of the sub-parts or low-level features of the stimulus configuration. Our modeling results suggest that (un)crowding requires global processing. These results are well in line with previous studies showing the importance of global aspects in crowding.
Collapse
Affiliation(s)
- Oh-Hyeon Choung
- Laboratory of Psychophysics, Brain Mind Institute, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
| | - Alban Bornet
- Laboratory of Psychophysics, Brain Mind Institute, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
| | - Adrien Doerig
- Laboratory of Psychophysics, Brain Mind Institute, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
- Donders Institute for Brain, Cognition and Behaviour, Radboud University, Nijmegen, The Netherlands
| | - Michael H Herzog
- Laboratory of Psychophysics, Brain Mind Institute, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
| |
Collapse
|
30
|
Lonnqvist B, Bornet A, Doerig A, Herzog MH. A comparative biology approach to DNN modeling of vision: A focus on differences, not similarities. J Vis 2021; 21:17. [PMID: 34551062 PMCID: PMC8475290 DOI: 10.1167/jov.21.10.17] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2021] [Accepted: 08/26/2021] [Indexed: 11/24/2022] Open
Abstract
Deep neural networks (DNNs) have revolutionized computer science and are now widely used for neuroscientific research. A hot debate has ensued about the usefulness of DNNs as neuroscientific models of the human visual system; the debate centers on to what extent certain shortcomings of DNNs are real failures and to what extent they are redeemable. Here, we argue that the main problem is that we often do not understand which human functions need to be modeled and, thus, what counts as a falsification. Hence, not only is there a problem on the DNN side, but there is also one on the brain side (i.e., with the explanandum-the thing to be explained). For example, should DNNs reproduce illusions? We posit that we can make better use of DNNs by adopting an approach of comparative biology by focusing on the differences, rather than the similarities, between DNNs and humans to improve our understanding of visual information processing in general.
Collapse
Affiliation(s)
- Ben Lonnqvist
- Laboratory of Psychophysics, Brain Mind Institute, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
| | - Alban Bornet
- Laboratory of Psychophysics, Brain Mind Institute, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
| | - Adrien Doerig
- Donders Institute for Brain, Cognition and Behaviour, Nijmegen, Netherlands
| | - Michael H Herzog
- Laboratory of Psychophysics, Brain Mind Institute, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
| |
Collapse
|
31
|
Unraveling brain interactions in vision: The example of crowding. Neuroimage 2021; 240:118390. [PMID: 34271157 DOI: 10.1016/j.neuroimage.2021.118390] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2021] [Revised: 07/09/2021] [Accepted: 07/12/2021] [Indexed: 11/22/2022] Open
Abstract
Crowding, the impairment of target discrimination in clutter, is the standard situation in vision. Traditionally, crowding is explained with (feedforward) models, in which only neighboring elements interact, leading to a "bottleneck" at the earliest stages of vision. It is with this implicit prior that most functional magnetic resonance imaging (fMRI) studies approach the identification of the "neural locus" of crowding, searching for the earliest visual area in which the blood-oxygenation-level-dependent (BOLD) signal is suppressed under crowded conditions. Using this classic approach, we replicated previous findings of crowding-related BOLD suppression starting in V2 and increasing up the visual hierarchy. Surprisingly, under conditions of uncrowding, in which adding flankers improves performance, the BOLD signal was further suppressed. This suggests an important role for top-down connections, which is in line with global models of crowding. To discriminate between various possible models, we used dynamic causal modeling (DCM). We show that recurrent interactions between all visual areas, including higher-level areas like V4 and the lateral occipital complex (LOC), are crucial in crowding and uncrowding. Our results explain the discrepancies in previous findings: in a recurrent visual hierarchy, the crowding effect can theoretically be detected at any stage. Beyond crowding, we demonstrate the need for models like DCM to understand the complex recurrent processing which most likely underlies human perception in general.
Collapse
|
32
|
Bornet A, Doerig A, Herzog MH, Francis G, Van der Burg E. Shrinking Bouma's window: How to model crowding in dense displays. PLoS Comput Biol 2021; 17:e1009187. [PMID: 34228703 PMCID: PMC8284675 DOI: 10.1371/journal.pcbi.1009187] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2020] [Revised: 07/16/2021] [Accepted: 06/16/2021] [Indexed: 11/22/2022] Open
Abstract
In crowding, perception of a target deteriorates in the presence of nearby flankers. Traditionally, it is thought that visual crowding obeys Bouma's law, i.e., all elements within a certain distance interfere with the target, and that adding more elements always leads to stronger crowding. Crowding is predominantly studied using sparse displays (a target surrounded by a few flankers). However, many studies have shown that this approach leads to wrong conclusions about human vision. Van der Burg and colleagues proposed a paradigm to measure crowding in dense displays using genetic algorithms. Displays were selected and combined over several generations to maximize human performance. In contrast to Bouma's law, only the target's nearest neighbours affected performance. Here, we tested various models to explain these results. We used the same genetic algorithm, but instead of selecting displays based on human performance we selected displays based on the model's outputs. We found that all models based on the traditional feedforward pooling framework of vision were unable to reproduce human behaviour. In contrast, all models involving a dedicated grouping stage explained the results successfully. We show how traditional models can be improved by adding a grouping stage.
Collapse
Affiliation(s)
- Alban Bornet
- Laboratory of Psychophysics, Brain Mind Institute, Ecole Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
| | - Adrien Doerig
- Laboratory of Psychophysics, Brain Mind Institute, Ecole Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
- Donders Institute for Brain, Cognition and Behaviour, Radboud University, Nijmegen, The Netherlands
| | - Michael H. Herzog
- Laboratory of Psychophysics, Brain Mind Institute, Ecole Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
| | - Gregory Francis
- Department of Psychological Sciences, Purdue University, West Lafayette, Indiana, United States of America
| | - Erik Van der Burg
- TNO, Human Factors, Soesterberg, The Netherlands
- Brain and Cognition, University of Amsterdam, Amsterdam, The Netherlands
| |
Collapse
|
33
|
Redundancy between spectral and higher-order texture statistics for natural image segmentation. Vision Res 2021; 187:55-65. [PMID: 34217005 DOI: 10.1016/j.visres.2021.06.007] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2020] [Revised: 06/09/2021] [Accepted: 06/11/2021] [Indexed: 11/23/2022]
Abstract
Visual texture, defined by local image statistics, provides important information to the human visual system for perceptual segmentation. Second-order or spectral statistics (equivalent to the Fourier power spectrum) are a well-studied segmentation cue. However, the role of higher-order statistics (HOS) in segmentation remains unclear, particularly for natural images. Recent experiments indicate that, in peripheral vision, the HOS of the widely adopted Portilla-Simoncelli texture model are a weak segmentation cue compared to spectral statistics, despite the fact that both are necessary to explain other perceptual phenomena and to support high-quality texture synthesis. Here we test whether this discrepancy reflects a property of natural image statistics. First, we observe that differences in spectral statistics across segments of natural images are redundant with differences in HOS. Second, using linear and nonlinear classifiers, we show that each set of statistics individually affords high performance in natural scenes and texture segmentation tasks, but combining spectral statistics and HOS produces relatively small improvements. Third, we find that HOS improve segmentation for a subset of images, although these images are difficult to identify. We also find that different subsets of HOS improve segmentation to a different extent, in agreement with previous physiological and perceptual work. These results show that the HOS add modestly to spectral statistics for natural image segmentation. We speculate that tuning to natural image statistics under resource constraints could explain the weak contribution of HOS to perceptual segmentation in human peripheral vision.
Collapse
|
34
|
Chen G, Zhu Z, He Q, Fang F. Offline transcranial direct current stimulation improves the ability to perceive crowded targets. J Vis 2021; 21:1. [PMID: 33533878 PMCID: PMC7862736 DOI: 10.1167/jov.21.2.1] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023] Open
Abstract
The deleterious effect of nearby flankers on target identification in the periphery is known as visual crowding. Studying visual crowding can advance our understanding of the mechanisms of visual awareness and object recognition. Alleviating visual crowding is one of the major ways to improve peripheral vision. The aim of the current study was to examine whether transcranial direct current stimulation (tDCS) was capable of alleviating visual crowding at different visual eccentricities and with different visual tasks. In the present single-blind sham-controlled study, subjects were instructed to perform an orientation discrimination task or a letter identification task with isolated and crowded targets in the periphery, before and after applying 20 minutes of 2 mA anodal tDCS to visual cortex of the hemisphere contralateral or ipsilateral to visual stimuli. Contralateral tDCS significantly alleviated the orientation crowding effect at two different eccentricities and the letter crowding effect. This alleviation was absent after sham or ipsilateral stimulation and could not be fully explained by the performance improvement with the isolated targets. These findings demonstrated that offline tDCS was effective in alleviating visual crowding across different visual eccentricities and tasks, therefore providing a promising way to improve spatial vision rapidly in crowded scenes.
Collapse
Affiliation(s)
- Guanpeng Chen
- School of Psychological and Cognitive Sciences and Beijing Key Laboratory of Behavior and Mental Health, Peking University, Beijing, People's Republic of China.,IDG/McGovern Institute for Brain Research, Peking University, Beijing, People's Republic of China.,Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, People's Republic of China.,
| | - Ziyun Zhu
- School of Psychological and Cognitive Sciences and Beijing Key Laboratory of Behavior and Mental Health, Peking University, Beijing, People's Republic of China.,IDG/McGovern Institute for Brain Research, Peking University, Beijing, People's Republic of China.,Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, People's Republic of China.,
| | - Qing He
- School of Psychological and Cognitive Sciences and Beijing Key Laboratory of Behavior and Mental Health, Peking University, Beijing, People's Republic of China.,IDG/McGovern Institute for Brain Research, Peking University, Beijing, People's Republic of China.,Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, People's Republic of China.,
| | - Fang Fang
- School of Psychological and Cognitive Sciences and Beijing Key Laboratory of Behavior and Mental Health, Peking University, Beijing, People's Republic of China.,IDG/McGovern Institute for Brain Research, Peking University, Beijing, People's Republic of China.,Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, People's Republic of China.,
| |
Collapse
|
35
|
Ringer RV. Investigating Visual Crowding of Objects in Complex Real-World Scenes. Iperception 2021; 12:2041669521994150. [PMID: 35145614 PMCID: PMC8822316 DOI: 10.1177/2041669521994150] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2019] [Accepted: 01/07/2021] [Indexed: 11/23/2022] Open
Abstract
Visual crowding, the impairment of object recognition in peripheral vision due to flanking objects, has generally been studied using simple stimuli on blank backgrounds. While crowding is widely assumed to occur in natural scenes, it has not been shown rigorously yet. Given that scene contexts can facilitate object recognition, crowding effects may be dampened in real-world scenes. Therefore, this study investigated crowding using objects in computer-generated real-world scenes. In two experiments, target objects were presented with four flanker objects placed uniformly around the target. Previous research indicates that crowding occurs when the distance between the target and flanker is approximately less than half the retinal eccentricity of the target. In each image, the spacing between the target and flanker objects was varied considerably above or below the standard (0.5) threshold to either suppress or facilitate the crowding effect. Experiment 1 cued the target location and then briefly flashed the scene image before participants could move their eyes. Participants then selected the target object's category from a 15-alternative forced choice response set (including all objects shown in the scene). Experiment 2 used eye tracking to ensure participants were centrally fixating at the beginning of each trial and showed the image for the duration of the participant's fixation. Both experiments found object recognition accuracy decreased with smaller spacing between targets and flanker objects. Thus, this study rigorously shows crowding of objects in semantically consistent real-world scenes.
Collapse
Affiliation(s)
- Ryan V. Ringer
- Department of Psychology, Wichita State University, Wichita, Kansas, United States
| |
Collapse
|
36
|
Yildirim FZ, Coates DR, Sayim B. Hidden by bias: how standard psychophysical procedures conceal crucial aspects of peripheral visual appearance. Sci Rep 2021; 11:4095. [PMID: 33602975 PMCID: PMC7892995 DOI: 10.1038/s41598-021-83325-7] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2020] [Accepted: 01/27/2021] [Indexed: 01/31/2023] Open
Abstract
The perception of a target depends on other stimuli surrounding it in time and space. This contextual modulation is ubiquitous in visual perception, and is usually quantified by measuring performance on sets of highly similar stimuli. Implicit or explicit comparisons among the stimuli may, however, inadvertently bias responses and conceal strong variability of target appearance. Here, we investigated the influence of contextual stimuli on the perception of a repeating pattern (a line triplet), presented in the visual periphery. In the neutral condition, the triplet was presented a single time to capture its minimally biased perception. In the similar and dissimilar conditions, it was presented within stimulus sets composed of lines similar to the triplet, and distinct shapes, respectively. The majority of observers reported perceiving a line pair in the neutral and dissimilar conditions, revealing 'redundancy masking', the reduction of the perceived number of repeating items. In the similar condition, by contrast, the number of lines was overestimated. Our results show that the similar context did not reveal redundancy masking which was only observed in the neutral and dissimilar context. We suggest that the influence of contextual stimuli has inadvertently concealed this crucial aspect of peripheral appearance.
Collapse
Affiliation(s)
| | - Daniel R Coates
- Institute of Psychology, University of Bern, Fabrikstrasse 8, 3012, Bern, Switzerland
- College of Optometry, University of Houston, Houston, TX, 77204, USA
| | - Bilge Sayim
- Institute of Psychology, University of Bern, Fabrikstrasse 8, 3012, Bern, Switzerland
- SCALab-Sciences Cognitives et Sciences Affectives, CNRS, UMR 9193, University of Lille, 59000, Lille, France
| |
Collapse
|
37
|
Yildirim FZ, Coates DR, Sayim B. Redundancy masking: The loss of repeated items in crowded peripheral vision. J Vis 2021; 20:14. [PMID: 32330230 PMCID: PMC7405779 DOI: 10.1167/jov.20.4.14] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open
Abstract
Crowding is the deterioration of target identification in the presence of neighboring objects. Recent studies using appearance-based methods showed that the perceived number of target elements is often diminished in crowding. Here we introduce a related type of diminishment in repeating patterns (sets of parallel lines), which we term “redundancy masking.” In four experiments, observers were presented with arrays of small numbers of lines centered at 10° eccentricity. The task was to indicate the number of lines. In Experiment 1, spatial characteristics of redundancy masking were examined by varying the inter-line spacing. We found that redundancy masking decreased with increasing inter-line spacing and ceased at spacings of approximately 0.25 times the eccentricity. In Experiment 2, we assessed whether the strength of redundancy masking differed between radial and tangential arrangements of elements as it does in crowding. Redundancy masking was strong with radially arranged lines (horizontally arranged vertical lines), and absent with tangentially arranged lines (vertically arranged horizontal lines). In Experiment 3, we investigated whether target size (line width and length) modulated redundancy masking. There was an effect of width: Thinner lines yielded stronger redundancy masking. We did not find any differences between the tested line lengths. In Experiment 4, we varied the regularity of the line arrays by vertically or horizontally jittering the positions of the lines. Redundancy masking was strongest with regular spacings and weakened with decreasing regularity. Our experiments show under which conditions whole items are lost in crowded displays, and how this redundancy masking resembles—and partly diverges from—crowded identification. We suggest that redundancy masking is a contributor to the deterioration of performance in crowded displays with redundant patterns.
Collapse
|
38
|
Herrera-Esposito D, Coen-Cagli R, Gomez-Sena L. Flexible contextual modulation of naturalistic texture perception in peripheral vision. J Vis 2021; 21:1. [PMID: 33393962 PMCID: PMC7794279 DOI: 10.1167/jov.21.1.1] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2020] [Accepted: 12/01/2020] [Indexed: 11/24/2022] Open
Abstract
Peripheral vision comprises most of our visual field, and is essential in guiding visual behavior. Its characteristic capabilities and limitations, which distinguish it from foveal vision, have been explained by the most influential theory of peripheral vision as the product of representing the visual input using summary statistics. Despite its success, this account may provide a limited understanding of peripheral vision, because it neglects processes of perceptual grouping and segmentation. To test this hypothesis, we studied how contextual modulation, namely the modulation of the perception of a stimulus by its surrounds, interacts with segmentation in human peripheral vision. We used naturalistic textures, which are directly related to summary-statistics representations. We show that segmentation cues affect contextual modulation, and that this is not captured by our implementation of the summary-statistics model. We then characterize the effects of different texture statistics on contextual modulation, providing guidance for extending the model, as well as for probing neural mechanisms of peripheral vision.
Collapse
Affiliation(s)
- Daniel Herrera-Esposito
- Laboratorio de Neurociencias, Facultad de Ciencias, Universidad de la República, Montevideo, Uruguay
| | - Ruben Coen-Cagli
- Department of Systems and Computational Biology and Dominick P. Purpura Department of Neuroscience, Albert Einstein College of Medicine, Bronx, NY, USA
| | - Leonel Gomez-Sena
- Laboratorio de Neurociencias, Facultad de Ciencias, Universidad de la República, Montevideo, Uruguay
| |
Collapse
|
39
|
Abstract
Visual clutter affects our ability to see. Objects that would be identifiable on their own may become unrecognizable when presented close together ("crowding"), but the psychophysical characteristics of crowding have resisted simplification. Image properties initially thought to produce crowding have paradoxically yielded unexpected results; for example, adding flanking objects can ameliorate crowding (Manassi, Sayim, & Herzog, 2012; Herzog, Sayim, Chcherov, & Manassi, 2015; Pachai, Doerig, & Herzog, 2016). The resulting theory revisions have been sufficiently complex and specialized as to make it difficult to discern what principles may underlie the observed phenomena. Here, a generalized formulation of simple visual contrast energy is presented, arising from straightforward analyses of center and surround neurons in the early visual stream. Extant contrast measures, such as root mean square contrast, are easily shown to fall out as reduced special cases. The new generalized contrast energy metric surprisingly predicts the principal findings of a broad range of crowding studies. These early crowding phenomena may thus be said to arise predominantly from contrast or are, at least, severely confounded by contrast effects. Note that these findings may be distinct from accounts of other, likely downstream, "configural" or "semantic" instances of crowding, suggesting at least two separate forms of crowding that may resist unification. The new fundamental contrast energy formulation provides a candidate explanatory framework that addresses multiple psychophysical phenomena beyond crowding.
Collapse
Affiliation(s)
- Antonio Rodriguez
- Department of Psychological and Brain Sciences, Dartmouth College, Hanover, NH, USA
| | - Richard Granger
- Department of Psychological and Brain Sciences, Dartmouth College, Hanover, NH, USA
| |
Collapse
|
40
|
Abstract
In this article, I present a framework that would accommodate the classic ideas of visual information processing together with more recent computational approaches. I used the current knowledge about visual crowding, capacity limitations, attention, and saliency to place these phenomena within a standard neural network model. I suggest some revisions to traditional mechanisms of attention and feature integration that are required to fit better into this framework. The results allow us to explain some apparent theoretical controversies in vision research, suggesting a rationale for the limited spatial extent of crowding, a role of saliency in crowding experiments, and several amendments to the feature integration theory. The scheme can be elaborated or modified by future research.
Collapse
Affiliation(s)
- Endel Põder
- Institute of Psychology, University of Tartu, Tartu, Estonia
- www.ut.ee/~endelp/
| |
Collapse
|
41
|
Abstract
The accurate perception of human crowds is integral to social understanding and interaction. Previous studies have shown that observers are sensitive to several crowd characteristics such as average facial expression, gender, identity, joint attention, and heading direction. In two experiments, we examined ensemble perception of crowd speed using standard point-light walkers (PLW). Participants were asked to estimate the average speed of a crowd consisting of 12 figures moving at different speeds. In Experiment 1, trials of intact PLWs alternated with trials of scrambled PLWs with a viewing duration of 3 seconds. We found that ensemble processing of crowd speed could rely on local motion alone, although a globally intact configuration enhanced performance. In Experiment 2, observers estimated the average speed of intact-PLW crowds that were displayed at reduced viewing durations across five blocks of trials (between 2500 ms and 500 ms). Estimation of fast crowds was precise and accurate regardless of viewing duration, and we estimated that three to four walkers could still be integrated at 500 ms. For slow crowds, we found a systematic deterioration in performance as viewing time reduced, and performance at 500 ms could not be distinguished from a single-walker response strategy. Overall, our results suggest that rapid and accurate ensemble perception of crowd speed is possible, although sensitive to the precise speed range examined.
Collapse
|
42
|
Xi H, Wu R, Wang B, Chen L. Topological difference between target and flankers alleviates crowding effect. J Vis 2020; 20:9. [PMID: 32926072 PMCID: PMC7509911 DOI: 10.1167/jov.20.9.9] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open
Abstract
In the crowding effect, object recognition in the periphery deteriorates when other items flank the target, especially if they share similarities. Here, we report that the similarity defined by topological property (differences in number of holes) influences the crowding effect. Orientation discrimination tasks suggested that the crowding effect was weaker with a topological different (TD) flanker than a topological equivalent (TE) flanker and an existing inward-outward anisotropy phenomenon. In another experiment, both an outer and an inner flanker were used to constitute four different conditions. The performance of an outer TD flanker and an inner TE flanker was superior to that of an outer TE flanker and an inner TD flanker, even though the items of the stimuli were the same. Different stimuli were used to control for local features. To eliminate the possible explanation of confusability, we selected pairs of letters with matched confusability, but one pair was TD and another was TE. The letter identification performance was better for the TD condition. Lastly, we investigated the digit identification under four conditions with varied spacing. Regardless of different spacing, the crowding effect was reduced by a topological different flanker. The results collectively suggest that topological property plays a role in the perceptual grouping, which modulates the crowding effect.
Collapse
Affiliation(s)
- Huanjun Xi
- State Key Laboratory of Brain and Cognitive Science, Institute of Biophysics, Chinese Academy of Sciences, Beijing, China.,University of Chinese Academy of Sciences, Beijing, China
| | - Ruijie Wu
- State Key Laboratory of Brain and Cognitive Science, Institute of Biophysics, Chinese Academy of Sciences, Beijing, China
| | - Bo Wang
- State Key Laboratory of Brain and Cognitive Science, Institute of Biophysics, Chinese Academy of Sciences, Beijing, China.,University of Chinese Academy of Sciences, Beijing, China
| | - Lin Chen
- State Key Laboratory of Brain and Cognitive Science, Institute of Biophysics, Chinese Academy of Sciences, Beijing, China.,University of Chinese Academy of Sciences, Beijing, China.,CAS Center for Excellence in Brain Science and Intelligence Technology, Shanghai, China
| |
Collapse
|
43
|
Strappini F, Martelli M, Cozzo C, di Pace E. Empirical Evidence for Intraspecific Multiple Realization? Front Psychol 2020; 11:1676. [PMID: 32793053 PMCID: PMC7394053 DOI: 10.3389/fpsyg.2020.01676] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2020] [Accepted: 06/19/2020] [Indexed: 11/13/2022] Open
Abstract
Despite the remarkable advances in behavioral and brain sciences over the last decades, the mind-body (brain) problem is still an open debate and one of the most intriguing questions for both cognitive neuroscience and philosophy of mind. Traditional approaches have conceived this problem in terms of a contrast between physicalist monism and Cartesian dualism. However, since the late sixties, the landscape of philosophical views on the problem has become more varied and complex. The Multiple Realization Thesis (MRT) claims that mental properties can be (or are) realized, and mental processes can be (or are) implemented by neural correlates of different kinds. Thus, MRT challenges the psychoneural type-identity theory and the corresponding reductionism. Many philosophers have acknowledged the a priori plausibility of MRT. However, the existence of empirical evidence in favor of intraspecific, human multiple realizations of mental processes and properties is still controversial. Here, we illustrate some cases that provide empirical evidence in support of MRT. Recently, it has been proposed that foveal agnosic vision, like peripheral vision, can be restored by increasing object parts’ spacing (Crutch and Warrington, 2007; Strappini et al., 2017b). Agnosic fovea and normal periphery are both limited by crowding, which impairs object recognition, and provides the signature of visual integration. Here, we define a psychological property of restored object identification, and we cross-reference the data of visually impaired patients with different etiologies. In particular, we compare the data of two stroke patients, two patients with posterior cortical atrophy, six cases of strabismic amblyopia, and one case with restored sight. We also compare these patients with unimpaired subjects tested in the periphery. We show that integration (i.e., restored recognition) seems to describe quite accurately the visual performance in all these cases. Whereas the patients have different etiologies and different neural correlates, the unimpaired subjects have no neural damage. Thus, similarity in the psychological property given the differences in the neural substrate can be interpreted in relation to MRT and provide evidence in its support. Finally, we will frame our contribution within the current debate concerning MRT providing new and compelling empirical evidence.
Collapse
Affiliation(s)
| | | | - Cesare Cozzo
- Department of Philosophy, Sapienza University of Rome, Rome, Italy
| | - Enrico di Pace
- Department of Psychology, Sapienza University of Rome, Rome, Italy
| |
Collapse
|
44
|
Roumani D, Moutoussis K. Invisible Adaptation: The Effect of Awareness on the Strength of the Motion Aftereffect. Perception 2020; 49:835-857. [PMID: 32605425 DOI: 10.1177/0301006620936853] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
The ability to process information despite the lack of perceptual awareness is one of the most fascinating aspects of the visual system. Such unconscious processing is often investigated using adaptation, where any presence of the former can be traced by its footprint on aftereffects following the latter. We have investigated the mechanisms of the motion aftereffect (MAE) using random dot displays of varying motion coherence as well as crowding to modulate both the physical as well as the perceptual strength of the adaptation stimulus. Perceptual strength was quantitatively measured as the performance in a forced-choice direction-discrimination task. A motion-nulling technique was used to quantitatively measure the strength of the MAE. We show that the strength of the dynamic MAE is independently influenced by both the physical stimulus strength as well as the subjective perceptual strength, with the effect of the former being more prominent than that of the latter. We further show that the MAE still persists under conditions of subthreshold perception. Our results suggest that perceptual awareness can influence the strength of visual processing, but the latter is not fully dependent on the former and can still take place at its partial or even total absence.
Collapse
Affiliation(s)
- Daphne Roumani
- Cognitive Science Division, Department of Philosophy and History of Science, National and Kapodistrian University of Athens, Greece
| | - Konstantinos Moutoussis
- Cognitive Science Division, Department of Philosophy and History of Science, National and Kapodistrian University of Athens, Greece
| |
Collapse
|
45
|
Abstract
Purpose Grouping of flankers from the target can modulate crowding in adults. Visual acuity in children is measured clinically using charts with targets and different flankers to enhance spatial interactions. We investigated grouping effects on interactions using visual acuity letters, flanked by contours and letters, in children. Methods Visual acuity for isolated and flanked letters was measured in 155 three- to 11-year old children and 32 adults. Flankers were one stroke width from the target and were a box or four bars and black or red letters. Magnitudes of interaction were flanked minus isolated logMAR acuities. Psychometric function slopes were also examined. Results Magnitudes of interaction by contours did not change significantly with age. They were 0.047 ± 0.014 logMAR more with bars than a box. Interaction from flanking letters reduced with age, adults being not different from 9- to 11-year-olds for black and red letter surrounds. It was weaker by 0.033 ± 0.013 logMAR when a black letter was surrounded by red rather than black letters. Psychometric function slopes for visual acuity were steepest for the youngest children (3-5 years). Conclusions For contour and letter flankers, grouping effects on interaction magnitude are age independent. Grouping bars into a box forming a single object reduces magnitude of effect. Grouping letter flankers by color and ungrouping them from the target reduce interaction magnitude by ∼8%, suggesting that luminance-defined form dominates. Differently colored letter flankers of high-luminance contrast on acuity charts could draw attention to the target but retain significant interaction strength.
Collapse
Affiliation(s)
- Sarah J. Waugh
- Anglia Vision Research, Faculty of Science and Engineering, Anglia Ruskin University, Cambridge, United Kingdom
| | - Monika A. Formankiewicz
- Anglia Vision Research, Faculty of Science and Engineering, Anglia Ruskin University, Cambridge, United Kingdom
| |
Collapse
|
46
|
Chakravarthi R, Bertamini M. Clustering leads to underestimation of numerosity, but crowding is not the cause. Cognition 2020; 198:104195. [DOI: 10.1016/j.cognition.2020.104195] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2018] [Revised: 01/13/2020] [Accepted: 01/16/2020] [Indexed: 10/25/2022]
|
47
|
Strasburger H. Seven Myths on Crowding and Peripheral Vision. Iperception 2020; 11:2041669520913052. [PMID: 32489576 PMCID: PMC7238452 DOI: 10.1177/2041669520913052] [Citation(s) in RCA: 35] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2019] [Accepted: 02/13/2020] [Indexed: 12/03/2022] Open
Abstract
Crowding has become a hot topic in vision research, and some fundamentals are now widely agreed upon. For the classical crowding task, one would likely agree with the following statements. (1) Bouma's law can be stated, succinctly and unequivocally, as saying that critical distance for crowding is about half the target's eccentricity. (2) Crowding is predominantly a peripheral phenomenon. (3) Peripheral vision extends to at most 90° eccentricity. (4) Resolution threshold (the minimal angle of resolution) increases strongly and linearly with eccentricity. Crowding increases at an even steeper rate. (5) Crowding is asymmetric as Bouma has shown. For that inner-outer asymmetry, the peripheral flanker has more effect. (6) Critical crowding distance corresponds to a constant cortical distance in primary visual areas like V1. (7) Except for Bouma's seminal article in 1970, crowding research mostly became prominent starting in the 2000s. I propose the answer is "not really" or "not quite" to these assertions. So should we care? I think we should, before we write the textbook chapters for the next generation.
Collapse
Affiliation(s)
- Hans Strasburger
- Georg-August-Universität, Göttingen, Germany
Ludwig-Maximilians-Universität, München, Germany
| |
Collapse
|
48
|
Lonnqvist B, Clarke ADF, Chakravarthi R. Crowding in humans is unlike that in convolutional neural networks. Neural Netw 2020; 126:262-274. [PMID: 32272430 DOI: 10.1016/j.neunet.2020.03.021] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2019] [Revised: 03/10/2020] [Accepted: 03/23/2020] [Indexed: 10/24/2022]
Abstract
Object recognition is a primary function of the human visual system. It has recently been claimed that the highly successful ability to recognise objects in a set of emergent computer vision systems-Deep Convolutional Neural Networks (DCNNs)-can form a useful guide to recognition in humans. To test this assertion, we systematically evaluated visual crowding, a dramatic breakdown of recognition in clutter, in DCNNs and compared their performance to extant research in humans. We examined crowding in three architectures of DCNNs with the same methodology as that used among humans. We manipulated multiple stimulus factors including inter-letter spacing, letter colour, size, and flanker location to assess the extent and shape of crowding in DCNNs. We found that crowding followed a predictable pattern across architectures that was different from that in humans. Some characteristic hallmarks of human crowding, such as invariance to size, the effect of target-flanker similarity, and confusions between target and flanker identities, were completely missing, minimised or even reversed. These data show that DCNNs, while proficient in object recognition, likely achieve this competence through a set of mechanisms that are distinct from those in humans. They are not necessarily equivalent models of human or primate object recognition and caution must be exercised when inferring mechanisms derived from their operation.
Collapse
Affiliation(s)
- Ben Lonnqvist
- Business School, University of Aberdeen, United Kingdom of Great Britain and Northern Ireland.
| | - Alasdair D F Clarke
- Department of Psychology, University of Essex, United Kingdom of Great Britain and Northern Ireland.
| | - Ramakrishna Chakravarthi
- School of Psychology, University of Aberdeen, United Kingdom of Great Britain and Northern Ireland.
| |
Collapse
|
49
|
McSorley E, Gilchrist ID, McCloy R. The parallel programming of landing position in saccadic eye movement sequences. J Vis 2020; 20:2. [PMID: 31999821 PMCID: PMC7239640 DOI: 10.1167/jov.20.1.2] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022] Open
Abstract
Saccadic eye movements occur in sequences, gathering new information about the visual environment to support successful task completion. Here, we examine the control of these saccadic sequences and specifically the extent to which the spatial aspects of the saccadic responses are programmed in parallel. We asked participants to saccade to a series of visual targets and, while they shifted their gaze around the display, we displaced select targets. We found that saccade landing position was deviated toward the previous location of the target suggesting that partial parallel programming of target location information was occurring. The saccade landing position was also affected by the new target location, which demonstrates that the saccade landing position was also partially updated following the shift. This pattern was present even for targets that were the subject of the next fixation. Having a greater preview about the sequence path influenced saccade accuracy with saccades being less affected by relocations when there is less preview information. The results demonstrate that landing positions from a saccade sequence are programmed in parallel and combined with more immediate visual signals.
Collapse
|
50
|
Abstract
Crowding is the deleterious influence of surrounding objects (flankers) on target identification. A central rule of crowding is that it is stronger when the target and the flankers are similar. Here, we show in three experiments how emergent features break this rule. Observers identified targets with various emergent features consisting of a pair of adjacent chevrons either pointing in opposite ('Diamonds' and 'Xs'), or the same (both up or down) directions. Targets were flanked by Diamonds or Xs, resulting in conditions with different levels of target-flanker similarity. Despite high target-flanker similarity, Diamonds were identified better than Xs when flanked by Diamonds. Participants' judgments of target conspicuity, however, showed that Diamonds were not perceived to stand out more strongly from X than Diamond flankers. Next, we asked observers to indicate whether all presented items were identical. We found superior performance with all Diamonds compared to all Xs, indicating that display uniformity judgments benefitted from the emergent features of Diamonds. Our results showed that emergent features and the information content of the entire display strongly modulated crowding. We suggest that conventional crowding rules only hold when target and flankers are artificially constrained to be mutually independent.
Collapse
|