1
|
Norman JF, Lewis JL, Bryant EN, Conn JD. Aging and temporal integration in the visual perception of object shape. Sci Rep 2023; 13:12748. [PMID: 37550419 PMCID: PMC10406914 DOI: 10.1038/s41598-023-40068-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2023] [Accepted: 08/03/2023] [Indexed: 08/09/2023] Open
Abstract
It has been known for more than 160 years that highly occluded objects that would normally be visually unrecognizable can be successfully identified when they move. This anorthoscopic perception relies on the visual system's ability to integrate information over time to complete the perception of an entire object's shape. In this experiment, 16 younger and older adults (mean ages were 20.5 and 74.6 years, respectively) were familiarized with the (unoccluded) shapes of five naturally-shaped objects (bell peppers, Capsicum annuum) until they could be easily identified (i.e., with accuracies of at least 90 percent correct). All observers then viewed the stimulus objects anorthoscopically as they moved behind narrow slits; only small object fragments could be seen at any given time, because the objects were almost totally occluded from view. Even though the object identification performance for all observers was equivalent when whole object shapes were visible, a large age-related deficit in object identification emerged during anorthoscopic viewing such that the younger adults' identification performance was 45.4 percent higher than that of the older adults. This first ever study of aging and anorthoscopic perception demonstrates that there is an age-related deficit in performing the temporal integration needed for successful object recognition.
Collapse
Affiliation(s)
- J Farley Norman
- Department of Psychological Sciences, Ogden College of Science and Engineering, Western Kentucky University, 1906 College Heights Blvd. #22030, Bowling Green, KY, 42101-2030, USA.
- Center for Applied Science in Health and Aging, Western Kentucky University, Bowling Green, KY, 42101-2030, USA.
| | - Jessica L Lewis
- Department of Psychological Sciences, Ogden College of Science and Engineering, Western Kentucky University, 1906 College Heights Blvd. #22030, Bowling Green, KY, 42101-2030, USA
| | - Emily N Bryant
- Department of Psychological Sciences, Ogden College of Science and Engineering, Western Kentucky University, 1906 College Heights Blvd. #22030, Bowling Green, KY, 42101-2030, USA
| | - Juma D Conn
- Carol Martin Gatton Academy of Mathematics and Science, Bowling Green, KY, USA
| |
Collapse
|
2
|
Lappin JS, Bell HH. Form and Function in Information for Visual Perception. Iperception 2022; 12:20416695211053352. [PMID: 35003612 PMCID: PMC8728782 DOI: 10.1177/20416695211053352] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2021] [Revised: 07/19/2021] [Accepted: 09/27/2021] [Indexed: 11/16/2022] Open
Abstract
Visual perception involves spatially and temporally coordinated variations in diverse
physical systems: environmental surfaces and symbols, optical images, electro-chemical
activity in neural networks, muscles, and bodily movements—each with a distinctly
different material structure and energy. The fundamental problem in the theory of
perception is to characterize the information that enables both perceptual awareness and
real-time dynamic coordination of these diverse physical systems. Gibson's psychophysical
and ecological conception of this problem differed from that of mainstream science both
then and now. The present article aims to incorporate Gibson's ideas within a general
conception of information for visual perception. We emphasize the essential role of
spatiotemporal form, in contrast with symbolic information. We consider contemporary
understanding of surface structure, optical images, and optic flow. Finally, we consider
recent evidence about capacity limitations on the rate of visual perception and
implications for the ecology of vision.
Collapse
|
3
|
Abstract
Shape is an interesting property of objects because it is used in ordinary discourse in ways that seem to have little connection to how it is typically defined in mathematics. The present article describes how the concept of shape can be grounded within Euclidean and non-Euclidean geometry and also to human perception. It considers the formal methods that have been proposed for measuring the differences among shapes and how the performance of those methods compares with shape difference thresholds of human observers. It discusses how different types of shape change can be perceptually categorized. It also evaluates the specific data structures that have been used to represent shape in models of both human and machine vision, and it reviews the psychophysical evidence about the extent to which those models are consistent with human perception. Based on this review of the literature, we argue that shape is not one thing but rather a collection of many object attributes, some of which are more perceptually salient than others. Because the relative importance of these attributes can be context dependent, there is no obvious single definition of shape that is universally applicable in all situations.
Collapse
Affiliation(s)
- James T Todd
- Department of Psychology, The Ohio State University, Columbus, OH, USA
| | | |
Collapse
|
4
|
Wen Y, Lin J, Chen K, Chen CLP, Jia K. Geometry-Aware Generation of Adversarial Point Clouds. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2020; 44:2984-2999. [PMID: 33320808 DOI: 10.1109/tpami.2020.3044712] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]
Abstract
Machine learning models are vulnerable to adversarial examples. While most of the existing adversarial methods are on 2D image, a few recent ones extend the studies to 3D point clouds data. These methods generate point outliers, which are noticeable and easy to defend against using the simple technique of outlier removal. Motivated by the different mechanisms humans perceive by 2D images and 3D shapes, we propose the new design of geometry-aware objectives, whose solutions favor the desired surface properties of smoothness and fairness. To generate adversarial point clouds, we use a misclassification loss that supports continuous pursuit of malicious signals. Regularizing the attack loss with our proposed geometry-aware objectives results in our proposed method, Geometry-Aware Adversarial Attack (GeoA3). The results of GeoA3 tend to be more harmful, harder to defend against, and of the key adversarial characterization of being imperceptible. We also present a simple but effective algorithm termed GeoA+3-IterNormPro towards surface-level adversarial attacks via generation of adversarial point clouds. We evaluate our methods on both synthetic and physical objects. For a qualitative evaluation, we conduct subjective studies by collecting human preferences from Amazon Mechanical Turk. Comparative results in comprehensive experiments confirm the advantages of our proposed methods. Our source codes are publicly available at https://github.com/Yuxin-Wen/GeoA3.
Collapse
|
5
|
Towards describing scenes by animals: Pigeons' ordinal discrimination of objects varying in depth. Learn Behav 2020; 49:85-98. [PMID: 32968857 DOI: 10.3758/s13420-020-00444-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/03/2020] [Indexed: 11/08/2022]
Abstract
The perception of a complex scene requires visual mechanisms that include identifying objects and their relative placement in depth. To examine apparent depth perception in birds, we tested four pigeons with a novel multiple-sequential-choice procedure. We created 3D-rendered scene stimuli containing three objects located at different apparent depths based on a variety of pictorial cues and placed small circular target response areas on them. The pigeons were trained to sequentially choose among the multiple response areas to report the object closest in apparent depth (ordinal position; front then middle object). After the pigeons learned this sequential depth discrimination, their use of three different monocular depth cues (occlusion, relative size, height in field) was tested, and their flexibility evaluated using three novel objects. In addition to the contribution to understanding apparent depth perception in birds, the use of more flexible open-ended choice discriminations, as employed here, has considerable promise for creating informative production-like tasks in nonverbal animals.
Collapse
|
6
|
Wang XM, Lind M, Bingham GP. A stratified process for the perception of objects: From optical transformations to 3D relief structure to 3D similarity structure to slant or aspect ratio. Vision Res 2020; 173:77-89. [PMID: 32480110 DOI: 10.1016/j.visres.2020.04.014] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2019] [Revised: 02/10/2020] [Accepted: 04/11/2020] [Indexed: 11/27/2022]
Abstract
Previously, we developed a stratified process for slant perception. First, optical transformations in structure-from-motion (SFM) and stereo were used to derive 3D relief structure (where depth scaling remains arbitrary). Second, with sufficient continuous perspective change (≥45°), a bootstrap process derived 3D similarity structure. Third, the perceived slant was derived. As predicted by theoretical work on SFM, small visual angle (<5°) viewing requires non-coplanar points. Slanted surfaces with small 3D cuboids or tetrahedrons yielded accurate judgment while planar surfaces did not. Normally, object perception entails non-coplanar points. Now, we apply the stratified process to object perception where, after deriving similarity structure, alternative metric properties of the object can be derived (e.g. slant of the top surface or width-to-depth aspect ratio). First, we tested slant judgments of the smooth planar tops of three different polyhedral objects. We tested rectangular, hexagonal, and asymmetric pentagonal surfaces, finding that symmetry was required to determine the direction of slant (AP&P, 2019, https://doi.org/10.3758/s13414-019-01859-5). Our current results replicated the previous findings. Second, we tested judgments of aspect ratios, finding accurate performance only for symmetric objects. Results from this study suggest that, first, trackable non-coplanar points can be attained in the form of 3D objects. Second, symmetry is necessary to constrain slant and aspect ratio perception. Finally, deriving 3D similarity structure precedes estimating object properties, such as slant or aspect ratio. Together, evidence presented here supports the stratified bootstrap process for 3D object perception. STATEMENT OF SIGNIFICANCE: Planning interactions with objects in the surrounding environment entails the perception of 3D shape and slant. Studying ways through which 3D metric shape and slant can be perceived accurately by moving observers not only sheds light on how the visual system works, but also provides understanding that can be applied to other fields, like machine vision or remote sensing. The current study is a logical extension of previous studies by the same authors and explores the roles of large continuous perspective changes, relief structure, and symmetry in a stratified process for object perception.
Collapse
Affiliation(s)
- Xiaoye Michael Wang
- Department of Psychological and Brain Sciences, Indiana University, Bloomington, IN, USA; Center for Visual Research, York University, Toronto, ON, Canada.
| | - Mats Lind
- Department of Information Technology, Uppsala University, Uppsala, Sweden
| | - Geoffrey P Bingham
- Department of Psychological and Brain Sciences, Indiana University, Bloomington, IN, USA
| |
Collapse
|
7
|
Bootstrapping a better slant: A stratified process for recovering 3D metric slant. Atten Percept Psychophys 2020; 82:1504-1519. [DOI: 10.3758/s13414-019-01860-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
8
|
Höhle B, Fritzsche T, Meß K, Philipp M, Gafos A. Only the right noise? Effects of phonetic and visual input variability on 14-month-olds' minimal pair word learning. Dev Sci 2020; 23:e12950. [PMID: 32052548 DOI: 10.1111/desc.12950] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2019] [Revised: 12/22/2019] [Accepted: 01/27/2020] [Indexed: 12/24/2022]
Abstract
Seminal work by Werker and colleagues (Stager & Werker [1997] Nature, 388, 381-382) has found that 14-month-old infants do not show evidence for learning minimal pairs in the habituation-switch paradigm. However, when multiple speakers produce the minimal pair in acoustically variable ways, infants' performance improves in comparison to a single speaker condition (Rost & McMurray [2009] Developmental Science, 12, 339-349). The current study further extends these results and assesses how different kinds of input variability affect 14-month-olds' minimal pair learning in the habituation-switch paradigm testing German learning infants. The first two experiments investigated word learning when the labels were spoken by a single speaker versus when the labels were spoken by multiple speakers. In the third experiment we studied whether non-acoustic variability, implemented by visual variability of the objects presented together with the labels, would also affect minimal pair learning. We found enhanced learning in the multiple speakers compared to the single speaker condition, confirming previous findings with English-learning infants. In contrast, visual variability of the presented objects did not support learning. These findings both confirm and better delimit the beneficial role of speech-specific variability in minimal pair learning. Finally, we review different proposals on the mechanisms via which variability confers benefits to learning and outline what may be likely principles that underlie this benefit. We highlight among these the multiplicity of acoustic cues signalling phonemic contrasts and the presence of relations among these cues. It is in these relations where we trace part of the source for the apparent paradoxical benefit of variability in learning.
Collapse
Affiliation(s)
- Barbara Höhle
- Department of Linguistics, Cognitive Sciences, University of Potsdam, Potsdam, Germany
| | - Tom Fritzsche
- Department of Linguistics, Cognitive Sciences, University of Potsdam, Potsdam, Germany
| | - Katharina Meß
- Department of Linguistics, Cognitive Sciences, University of Potsdam, Potsdam, Germany
| | - Mareike Philipp
- Department of Linguistics, Cognitive Sciences, University of Potsdam, Potsdam, Germany
| | - Adamantios Gafos
- Department of Linguistics, Cognitive Sciences, University of Potsdam, Potsdam, Germany
| |
Collapse
|
9
|
Norman JF. The Recognition of Solid Object Shape: The Importance of Inhomogeneity. Iperception 2019; 10:2041669519870553. [PMID: 31448073 PMCID: PMC6693026 DOI: 10.1177/2041669519870553] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2019] [Accepted: 07/26/2019] [Indexed: 11/15/2022] Open
Abstract
A single experiment evaluated the haptic-visual cross-modal matching of solid object shape. One set of randomly shaped artificial objects was used (sinusoidally modulated spheres, SMS) as well as two sets of naturally shaped objects (bell peppers, Capsicum annuum and sweet potatoes, Ipomoea batatas). A total of 66 adults participated in the study. The participants' task was to haptically explore a single object on any particular trial and subsequently indicate which of 12 simultaneously visible objects possessed the same shape. The participants' performance for the natural objects was 60.9 and 78.7 percent correct for the bell peppers and sweet potatoes, respectively. The analogous performance for the SMS objects, while better than chance, was far worse (18.6 percent correct). All of these types of stimulus objects possess a rich geometrical structure (e.g., they all possess multiple elliptic, hyperbolic, and parabolic surface regions). Nevertheless, these three types of stimulus objects are perceived differently: Individual members of sweet potatoes and bell peppers are largely identifiable to human participants, while the individual SMS objects are not. Analyses of differential geometry indicate that these natural objects (e.g., bell peppers and sweet potatoes) possess heterogeneous spatial configurations of distinctly curved surface regions, and this heterogeneity is lacking in SMS objects. The current results therefore suggest that increases in surface structure heterogeneity facilitate human object recognition.
Collapse
Affiliation(s)
- J. Farley Norman
- Department of Psychological Sciences, Ogden College of Science and Engineering, Western Kentucky University, Bowling Green, KY, USA
| |
Collapse
|
10
|
Norman JF, Cheeseman JR, Adkins OC, Cox AG, Rogers CE, Dowell CJ, Baxter MW, Norman HF, Reyes CM. Aging and solid shape recognition: Vision and haptics. Vision Res 2015; 115:113-8. [DOI: 10.1016/j.visres.2015.09.001] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2015] [Revised: 08/17/2015] [Accepted: 09/01/2015] [Indexed: 11/24/2022]
|
11
|
Abstract
What are the geometric primitives of binocular disparity? The Venetian blind effect and other converging lines of evidence indicate that stereoscopic depth perception derives from disparities of higher-order structure in images of surfaces. Image structure entails spatial variations of intensity, texture, and motion, jointly structured by observed surfaces. The spatial structure of binocular disparity corresponds to the spatial structure of surfaces. Independent spatial coordinates are not necessary for stereoscopic vision. Stereopsis is highly sensitive to structural disparities associated with local surface shape. Disparate positions on retinal anatomy are neither necessary nor sufficient for stereopsis.
Collapse
|
12
|
Processing convexity and concavity along a 2-D contour: figure-ground, structural shape, and attention. Psychon Bull Rev 2013. [PMID: 23188740 DOI: 10.3758/s13423-012-0347-2] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Interest in convexity has a long history in vision science. For smooth contours in an image, it is possible to code regions of positive (convex) and negative (concave) curvature, and this provides useful information about solid shape. We review a large body of evidence on the role of this information in perception of shape and in attention. This includes evidence from behavioral, neurophysiological, imaging, and developmental studies. A review is necessary to analyze the evidence on how convexity affects (1) separation between figure and ground, (2) part structure, and (3) attention allocation. Despite some broad agreement on the importance of convexity in these areas, there is a lack of consensus on the interpretation of specific claims--for example, on the contribution of convexity to metric depth and on the automatic directing of attention to convexities or to concavities. The focus is on convexity and concavity along a 2-D contour, not convexity and concavity in 3-D, but the important link between the two is discussed. We conclude that there is good evidence for the role of convexity information in figure-ground organization and in parsing, but other, more specific claims are not (yet) well supported.
Collapse
|
13
|
Solid shape discrimination from vision and haptics: natural objects (Capsicum annuum) and Gibson's "feelies". Exp Brain Res 2012; 222:321-32. [PMID: 22918607 DOI: 10.1007/s00221-012-3220-7] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2012] [Accepted: 08/04/2012] [Indexed: 10/28/2022]
Abstract
A set of three experiments evaluated 96 participants' ability to visually and haptically discriminate solid object shape. In the past, some researchers have found haptic shape discrimination to be substantially inferior to visual shape discrimination, while other researchers have found haptics and vision to be essentially equivalent. A primary goal of the present study was to understand these discrepant past findings and to determine the true capabilities of the haptic system. All experiments used the same task (same vs. different shape discrimination) and stimulus objects (James Gibson's "feelies" and a set of naturally shaped objects--bell peppers). However, the methodology varied across experiments. Experiment 1 used random 3-dimensional (3-D) orientations of the stimulus objects, and the conditions were full-cue (active manipulation of objects and rotation of the visual objects in depth). Experiment 2 restricted the 3-D orientations of the stimulus objects and limited the haptic and visual information available to the participants. Experiment 3 compared restricted and full-cue conditions using random 3-D orientations. We replicated both previous findings in the current study. When we restricted visual and haptic information (and placed the stimulus objects in the same orientation on every trial), the participants' visual performance was superior to that obtained for haptics (replicating the earlier findings of Davidson et al. in Percept Psychophys 15(3):539-543, 1974). When the circumstances resembled those of ordinary life (e.g., participants able to actively manipulate objects and see them from a variety of perspectives), we found no significant difference between visual and haptic solid shape discrimination.
Collapse
|
14
|
Warren WH. Does this computational theory solve the right problem? Marr, Gibson, and the goal of vision. Perception 2012; 41:1053-60. [PMID: 23409371 PMCID: PMC3816718 DOI: 10.1068/p7327] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Abstract
David Marr's book Vision attempted to formulate athoroughgoing formal theory of perception. Marr borrowed much of the "computational" level from James Gibson: a proper understanding of the goal of vision, the natural constraints, and the available information are prerequisite to describing the processes and mechanisms by which the goal is achieved. Yet, as a research program leading to a computational model of human vision, Marr's program did not succeed. This article asks why, using the perception of 3D shape as a morality tale. Marr presumed that the goal of vision is to recover a general-purpose Euclidean description of the world, which can be deployed for any task or action. On this formulation, vision is underdetermined by information, which in turn necessitates auxiliary assumptions to solve the problem. But Marr's assumptions did not actually reflect natural constraints, and consequently the solutions were not robust. We now know that humans do not in fact recover Euclidean structure--rather, they reliably perceive qualitative shape (hills, dales, courses, ridges), which is specified by the second-order differential structure of images. By recasting the goals of vision in terms of our perceptual competencies, and doing the hard work of analyzing the information available under ecological constraints, we can reformulate the problem so that perception is determined by information and prior knowledge is unnecessary.
Collapse
Affiliation(s)
- William H Warren
- Department of Cognitive, Linguistic, and Psychological Sciences, Brown University, Providence, RI 02912, USA.
| |
Collapse
|