1
|
Sha O, Zhang H, Bai J, Zhang Y, Yang J. The analysis of the structural parameter influences on measurement errors in a binocular 3D reconstruction system: a portable 3D system. PeerJ Comput Sci 2023; 9:e1610. [PMID: 37810332 PMCID: PMC10557943 DOI: 10.7717/peerj-cs.1610] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2023] [Accepted: 09/01/2023] [Indexed: 10/10/2023]
Abstract
This study used an analytical model to investigate the factors that affect the reconstruction accuracy composed of the baseline length, lens focal length, the angle between the optical axis and baseline, and the field of the view angle. Firstly, the theoretical expressions of the above factors and measurement errors are derived based on the binocular three-dimensional reconstruction model. Then, the structural parameters' impact on the error propagation coefficient is analyzed and simulated using MATLAB software. The results show that structural parameters significantly impact the error propagation coefficient, and the reasonable range of structural parameters is pointed out. When the angle between the optical axis of the binocular camera and the baseline is between 30° and 55°, the ratio of the baseline length to the focal length can be reasonably reduced. In addition, using the field angle of the view that does not exceed 20° could reduce the error propagation coefficient. While the angle between the binocular optical axis and the baseline is between 40° and 50°, the reconstruction result has the highest accuracy, changing the angle out of this range will lead to an increase in the reconstruction error. The angle between the binocular optical axis and the baseline changes 30° through 60° leads to the error propagation coefficient being in a lower range. Finally, experimental verification and simulation results show that selecting reasonable structural parameters could significantly reduce measurement errors. This study proposes a model that constructs a binocular three-dimensional reconstruction system with high precision. A portable three-dimensional reconstruction system is built in the article.
Collapse
Affiliation(s)
- Ou Sha
- Changchun Institute of Optics, Fine Mechanics and Physics, Chinese Academy of Sciences, Changchun, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Hongyu Zhang
- Changchun Institute of Optics, Fine Mechanics and Physics, Chinese Academy of Sciences, Changchun, China
| | - Jing Bai
- Changchun Institute of Optics, Fine Mechanics and Physics, Chinese Academy of Sciences, Changchun, China
| | - Yaoyu Zhang
- Changchun Institute of Optics, Fine Mechanics and Physics, Chinese Academy of Sciences, Changchun, China
| | | |
Collapse
|
2
|
Cooper PS, Colton E, Bode S, Chong TTJ. Standardised images of novel objects created with generative adversarial networks. Sci Data 2023; 10:575. [PMID: 37660073 PMCID: PMC10475029 DOI: 10.1038/s41597-023-02483-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2023] [Accepted: 08/16/2023] [Indexed: 09/04/2023] Open
Abstract
An enduring question in cognitive science is how perceptually novel objects are processed. Addressing this issue has been limited by the absence of a standardised set of object-like stimuli that appear realistic, but cannot possibly have been previously encountered. To this end, we created a dataset, at the core of which are images of 400 perceptually novel objects. These stimuli were created using Generative Adversarial Networks that integrated features of everyday stimuli to produce a set of synthetic objects that appear entirely plausible, yet do not in fact exist. We curated an accompanying dataset of 400 familiar stimuli, which were matched in terms of size, contrast, luminance, and colourfulness. For each object, we quantified their key visual properties (edge density, entropy, symmetry, complexity, and spectral signatures). We also confirmed that adult observers (N = 390) perceive the novel objects to be less familiar, yet similarly engaging, relative to the familiar objects. This dataset serves as an open resource to facilitate future studies on visual perception.
Collapse
Affiliation(s)
- Patrick S Cooper
- Turner Institute for Brain and Mental Health, Monash University, Victoria, 3800, Australia.
- Melbourne School of Psychological Sciences, University of Melbourne, Victoria, 3010, Australia.
| | - Emily Colton
- Turner Institute for Brain and Mental Health, Monash University, Victoria, 3800, Australia
| | - Stefan Bode
- Melbourne School of Psychological Sciences, University of Melbourne, Victoria, 3010, Australia
| | - Trevor T-J Chong
- Turner Institute for Brain and Mental Health, Monash University, Victoria, 3800, Australia.
- Department of Neurology, Alfred Health, Melbourne, Victoria, 3004, Australia.
- Department of Clinical Neurosciences, St Vincent's Hospital, Victoria, 3065, Australia.
| |
Collapse
|
3
|
Stereopsis provides a constant feed to visual shape representation. Vision Res 2023; 204:108175. [PMID: 36571983 DOI: 10.1016/j.visres.2022.108175] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2022] [Revised: 11/10/2022] [Accepted: 12/05/2022] [Indexed: 12/25/2022]
Abstract
The contribution of stereopsis in human visual shape perception was examined using stimuli with either null, normal, or reversed binocular disparity in an old/new object recognition task. The highest levels of recognition performance were observed with null and normal binocular disparity displays, which did not differ. However, reversed disparity led to significantly worse performance than either of the other display conditions. This indicates that stereopsis provides a continuous input to the mechanisms involved in shape perception.
Collapse
|
4
|
Rosenke M, Davidenko N, Grill-Spector K, Weiner KS. Combined Neural Tuning in Human Ventral Temporal Cortex Resolves the Perceptual Ambiguity of Morphed 2D Images. Cereb Cortex 2020; 30:4882-4898. [PMID: 32372098 PMCID: PMC7391265 DOI: 10.1093/cercor/bhaa081] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
Abstract
We have an amazing ability to categorize objects in the world around us. Nevertheless, how cortical regions in human ventral temporal cortex (VTC), which is critical for categorization, support this behavioral ability, is largely unknown. Here, we examined the relationship between neural responses and behavioral performance during the categorization of morphed silhouettes of faces and hands, which are animate categories processed in cortically adjacent regions in VTC. Our results reveal that the combination of neural responses from VTC face- and body-selective regions more accurately explains behavioral categorization than neural responses from either region alone. Furthermore, we built a model that predicts a person's behavioral performance using estimated parameters of brain-behavior relationships from a different group of people. Moreover, we show that this brain-behavior model generalizes to adjacent face- and body-selective regions in lateral occipitotemporal cortex. Thus, while face- and body-selective regions are located within functionally distinct domain-specific networks, cortically adjacent regions from both networks likely integrate neural responses to resolve competing and perceptually ambiguous information from both categories.
Collapse
Affiliation(s)
- Mona Rosenke
- Psychology Department, Stanford University, Stanford, CA 94305, USA
| | - Nicolas Davidenko
- Psychology Department, University of California, Santa Cruz, Santa Cruz, CA 95064, USA
| | - Kalanit Grill-Spector
- Psychology Department, Stanford University, Stanford, CA 94305, USA
- Neuroscience Institute, Stanford University, Stanford, CA 94305, USA
| | - Kevin S Weiner
- Psychology Department, University of California, Berkeley, Berkeley, CA 94720, USA
- Helen Wills Neuroscience Institute, University of California, Berkeley, Berkeley, CA 94720, USA
| |
Collapse
|
5
|
Grill-Spector K, Weiner KS, Gomez J, Stigliani A, Natu VS. The functional neuroanatomy of face perception: from brain measurements to deep neural networks. Interface Focus 2018; 8:20180013. [PMID: 29951193 PMCID: PMC6015811 DOI: 10.1098/rsfs.2018.0013] [Citation(s) in RCA: 44] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/08/2018] [Indexed: 12/14/2022] Open
Abstract
A central goal in neuroscience is to understand how processing within the ventral visual stream enables rapid and robust perception and recognition. Recent neuroscientific discoveries have significantly advanced understanding of the function, structure and computations along the ventral visual stream that serve as the infrastructure supporting this behaviour. In parallel, significant advances in computational models, such as hierarchical deep neural networks (DNNs), have brought machine performance to a level that is commensurate with human performance. Here, we propose a new framework using the ventral face network as a model system to illustrate how increasing the neural accuracy of present DNNs may allow researchers to test the computational benefits of the functional architecture of the human brain. Thus, the review (i) considers specific neural implementational features of the ventral face network, (ii) describes similarities and differences between the functional architecture of the brain and DNNs, and (iii) provides a hypothesis for the computational value of implementational features within the brain that may improve DNN performance. Importantly, this new framework promotes the incorporation of neuroscientific findings into DNNs in order to test the computational benefits of fundamental organizational features of the visual system.
Collapse
Affiliation(s)
- Kalanit Grill-Spector
- Department of Psychology, School of Medicine, Stanford University, Stanford, CA 94305, USA
- Stanford Neurosciences Institute, School of Medicine, Stanford University, Stanford, CA 94305, USA
| | - Kevin S. Weiner
- Department of Psychology, University of California Berkeley, Berkeley, CA 94720, USA
- Helen Wills Neuroscience Institute, University of California Berkeley, Berkeley, CA 94720, USA
| | - Jesse Gomez
- Stanford Neurosciences Program, School of Medicine, Stanford University, Stanford, CA 94305, USA
| | - Anthony Stigliani
- Department of Psychology, School of Medicine, Stanford University, Stanford, CA 94305, USA
| | - Vaidehi S. Natu
- Department of Psychology, School of Medicine, Stanford University, Stanford, CA 94305, USA
| |
Collapse
|