1
|
Fukazawa H, Okada-Shudo Y. Photosynthetic Protein-Based Retinal Ganglion Cell Receptive Fields for Detecting Edges and Brightness Illusions. NANO LETTERS 2023; 23:10983-10990. [PMID: 38048176 PMCID: PMC10723062 DOI: 10.1021/acs.nanolett.3c03257] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/28/2023] [Revised: 11/27/2023] [Accepted: 11/28/2023] [Indexed: 12/06/2023]
Abstract
Bacteriorhodopsin, isolated from a halophilic bacterium, is a photosynthetic protein with a structure and function similar to those of the visual pigment rhodopsin. A voltaic cell with bacteriorhodopsin sandwiched between two transparent electrodes exhibits a time-differential response akin to that observed in retinal ganglion cells. It is intriguing as a means to emulate excitation and inhibition in the neural response. Here, we present a neuromorphic device emulating the retinal ganglion cell receptive field fabricated by patterning bacteriorhodopsin onto two transparent electrodes and encapsulating them with an electrolyte solution. This protein-based artificial ganglion cell receptive field is characterized as a bandpass filter that simultaneously replicates excitatory and inhibitory responses within a single element, successfully detecting image edges and phenomena of brightness illusions. The device naturally emulates the highly interacting ganglion cell receptive fields by exploiting the inherent properties of proteins without the need for electronic components, bias power supply, or an external operating circuit.
Collapse
Affiliation(s)
- Hikaru Fukazawa
- Department of Engineering Science, The University of Electro-Communications, Tokyo 182-8585, Japan
| | - Yoshiko Okada-Shudo
- Department of Engineering Science, The University of Electro-Communications, Tokyo 182-8585, Japan
| |
Collapse
|
2
|
Wendt G, Faul F. Binocular luster - A review. Vision Res 2022; 194:108008. [PMID: 35182893 DOI: 10.1016/j.visres.2022.108008] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2021] [Revised: 12/20/2021] [Accepted: 12/29/2021] [Indexed: 11/30/2022]
Abstract
Binocular luster is a visual phenomenon that can be elicited by dichoptic stimuli showing an interocular difference in color or luminance contrast. For instance, when the two eyes are presented with simple center-surround stimuli in which the center patch in one eye is brighter and in the other eye darker than the common surround, the center patch in the fused percept assumes a lustrous appearance reminiscent of metal or graphite. Soon after the discovery of this phenomenon in the mid-19th century, it was intensively studied and several explanations were proposed. After this initial phase, however, research interest waned significantly. Stimulated by new insights into related phenomena and the underlying physiological mechanisms, the last 20 years have seen an increase in research activity in this field, which has considerably expanded our understanding of binocular luster. In this paper, we provide a detailed review of research on binocular luster over the past 170 years. We present and discuss the existing findings in a number of separate sections, dealing with 1) the phenomenology of binocular luster, 2) different theories that have been proposed, 3) several factors influencing the lustrous impression, 4) the relationship between binocular luster and binocular rivalry, 5) the current understanding of its neural basis, and 6) potential applications based on binocular luster.
Collapse
|
3
|
Bakshi A, Ghosh K. Tiny Squares at the Hermann Grid Corners Can Completely Remove the Illusion. Perception 2020; 49:232-239. [PMID: 31902280 DOI: 10.1177/0301006619897193] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Affiliation(s)
- Ashish Bakshi
- Machine Intelligence Unit, Indian Statistical Institute, Kolkata, India
| | - Kuntal Ghosh
- Machine Intelligence Unit, Indian Statistical Institute, Kolkata, India; Center for Soft Computing Research, Indian Statistical Institute, Kolkata, India
| |
Collapse
|
4
|
Bakshi A, Ghosh K. A parsimonious model of brightness induction. BIOLOGICAL CYBERNETICS 2018; 112:237-251. [PMID: 29354875 DOI: 10.1007/s00422-018-0747-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/15/2014] [Accepted: 01/02/2018] [Indexed: 06/07/2023]
Abstract
We present a parsimonious model of brightness induction which can account for various brightness illusions of both brightness-contrast and brightness-assimilation types. Our model is based on a difference of difference-of-Gaussian filter and a two-pass model of attentive vision based on the parallel channels in the central visual pathway. It overcomes some of the problems that could not be addressed by the well-known oriented difference of Gaussian model like those associated with Mach band and checkerboard illusions. This model attempts to provide insight to the mechanism of attention in brightness perception through the two major complimentary visual channels, viz. the magnocellular and the parvocellular.
Collapse
Affiliation(s)
- Ashish Bakshi
- Machine Intelligence Unit, Indian Statistical Institute, 203 B T Road, Kolkata, 700108, India.
| | - Kuntal Ghosh
- Machine Intelligence Unit, Indian Statistical Institute, 203 B T Road, Kolkata, 700108, India
- Center for Soft Computing Research, Indian Statistical Institute, 203 B T Road, Kolkata, 700108, India
| |
Collapse
|
5
|
Nematzadeh N, Powers DMW, Lewis TW. Bioplausible multiscale filtering in retino-cortical processing as a mechanism in perceptual grouping. Brain Inform 2017; 4:271-293. [PMID: 28887785 PMCID: PMC5709283 DOI: 10.1007/s40708-017-0072-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2017] [Accepted: 08/23/2017] [Indexed: 10/25/2022] Open
Abstract
Why does our visual system fail to reconstruct reality, when we look at certain patterns? Where do Geometrical illusions start to emerge in the visual pathway? How far should we take computational models of vision with the same visual ability to detect illusions as we do? This study addresses these questions, by focusing on a specific underlying neural mechanism involved in our visual experiences that affects our final perception. Among many types of visual illusion, 'Geometrical' and, in particular, 'Tilt Illusions' are rather important, being characterized by misperception of geometric patterns involving lines and tiles in combination with contrasting orientation, size or position. Over the last decade, many new neurophysiological experiments have led to new insights as to how, when and where retinal processing takes place, and the encoding nature of the retinal representation that is sent to the cortex for further processing. Based on these neurobiological discoveries, we provide computer simulation evidence from modelling retinal ganglion cells responses to some complex Tilt Illusions, suggesting that the emergence of tilt in these illusions is partially related to the interaction of multiscale visual processing performed in the retina. The output of our low-level filtering model is presented for several types of Tilt Illusion, predicting that the final tilt percept arises from multiple-scale processing of the Differences of Gaussians and the perceptual interaction of foreground and background elements. The model is a variation of classical receptive field implementation for simple cells in early stages of vision with the scales tuned to the object/texture sizes in the pattern. Our results suggest that this model has a high potential in revealing the underlying mechanism connecting low-level filtering approaches to mid- and high-level explanations such as 'Anchoring theory' and 'Perceptual grouping'.
Collapse
Affiliation(s)
- Nasim Nematzadeh
- College of Science and Engineering, Flinders University, GPO Box 2100, Adelaide, SA, 5001, Australia.
| | - David M W Powers
- College of Science and Engineering, Flinders University, GPO Box 2100, Adelaide, SA, 5001, Australia
| | - Trent W Lewis
- College of Science and Engineering, Flinders University, GPO Box 2100, Adelaide, SA, 5001, Australia
| |
Collapse
|
6
|
Lv Q, Wang B, Zhang L. Saliency computation via whitened frequency band selection. Cogn Neurodyn 2016; 10:255-67. [PMID: 27275381 PMCID: PMC4870405 DOI: 10.1007/s11571-015-9372-y] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2015] [Revised: 12/07/2015] [Accepted: 12/24/2015] [Indexed: 11/29/2022] Open
Abstract
Many saliency computational models have been proposed to simulate bottom-up visual attention mechanism of human visual system. However, most of them only deal with certain kinds of images or aim at specific applications. In fact, human beings have the ability to correctly select attentive focuses of objects with arbitrary sizes within any scenes. This paper proposes a new bottom-up computational model from the perspective of frequency domain based on the biological discovery of non-Classical Receptive Field (nCRF) in the retina. A saliency map can be obtained according to the idea of Extended Classical Receptive Field. The model is composed of three major steps: firstly decompose the input image into several feature maps representing different frequency bands that cover the whole frequency domain by utilizing Gabor wavelet. Secondly, whiten the feature maps to highlight the embedded saliency information. Thirdly, select some optimal maps, simulating the response of receptive field especially nCRF, to generate the saliency map. Experimental results show that the proposed algorithm is able to work with stable effect and outstanding performance in a variety of situations as human beings do and is adaptive to both psychological patterns and natural images. Beyond that, biological plausibility of nCRF and Gabor wavelet transform make this approach reliable.
Collapse
Affiliation(s)
- Qi Lv
- />Key Laboratory for Information Science of Electromagnetic Waves (MoE), Fudan University, Shanghai, 200433 China
- />Research Center of Smart Networks and Systems, School of Information Science and Technology, Fudan University, Shanghai, 200433 China
| | - Bin Wang
- />Key Laboratory for Information Science of Electromagnetic Waves (MoE), Fudan University, Shanghai, 200433 China
- />Research Center of Smart Networks and Systems, School of Information Science and Technology, Fudan University, Shanghai, 200433 China
| | - Liming Zhang
- />Key Laboratory for Information Science of Electromagnetic Waves (MoE), Fudan University, Shanghai, 200433 China
- />Research Center of Smart Networks and Systems, School of Information Science and Technology, Fudan University, Shanghai, 200433 China
| |
Collapse
|
7
|
Mazumdar D, Mitra S, Ghosh K, Bhaumik K. A DOG filter model of the occurrence of Mach bands on spatial contrast discontinuities. BIOLOGICAL CYBERNETICS 2016; 110:229-236. [PMID: 27016101 DOI: 10.1007/s00422-016-0683-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/07/2015] [Accepted: 02/23/2016] [Indexed: 06/05/2023]
Abstract
The present work proposes a unified model to explain two previously reported properties of the Mach band illusion. The first is the frequently referenced fact that Mach bands are prominently visible at ramps, but practically vanish at intensity steps. The second property, less studied, on the other hand may also be related to the first. It concerns the fact that the width of the illusory Mach bands appears to be a function of the slope of the ramp itself. The model proposed here combines the difference of Gaussians (DOG) model of lateral inhibition in receptive fields with the models of feature detection, based on a holistic approach. The sharpness of discontinuity (SOD) concept for Mach band stimulus has been defined and is related to the slope of the ramp. It is suggested that calculation of SOD leads to an adaptive change in inhibitory surround, a notion that has the support of physiological experiments too.
Collapse
Affiliation(s)
- Debasis Mazumdar
- CDAC, Kolkata, Plot- E2/1, Block - GP, Sector - V, Salt Lake City, Kolkata, 700091, India
| | - Soma Mitra
- CDAC, Kolkata, Plot- E2/1, Block - GP, Sector - V, Salt Lake City, Kolkata, 700091, India
| | - Kuntal Ghosh
- Center for Soft Computing Research and Machine Intelligence Unit, Indian Statistical Institute, 203 B T Road, Kolkata, 108, India.
| | - Kamales Bhaumik
- CDAC, Kolkata, Plot- E2/1, Block - GP, Sector - V, Salt Lake City, Kolkata, 700091, India
| |
Collapse
|
8
|
Zeman A, Brooks KR, Ghebreab S. An exponential filter model predicts lightness illusions. Front Hum Neurosci 2015; 9:368. [PMID: 26157381 PMCID: PMC4478851 DOI: 10.3389/fnhum.2015.00368] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2014] [Accepted: 06/11/2015] [Indexed: 12/02/2022] Open
Abstract
Lightness, or perceived reflectance of a surface, is influenced by surrounding context. This is demonstrated by the Simultaneous Contrast Illusion (SCI), where a gray patch is perceived lighter against a black background and vice versa. Conversely, assimilation is where the lightness of the target patch moves toward that of the bounding areas and can be demonstrated in White's effect. Blakeslee and McCourt (1999) introduced an oriented difference-of-Gaussian (ODOG) model that is able to account for both contrast and assimilation in a number of lightness illusions and that has been subsequently improved using localized normalization techniques. We introduce a model inspired by image statistics that is based on a family of exponential filters, with kernels spanning across multiple sizes and shapes. We include an optional second stage of normalization based on contrast gain control. Our model was tested on a well-known set of lightness illusions that have previously been used to evaluate ODOG and its variants, and model lightness values were compared with typical human data. We investigate whether predictive success depends on filters of a particular size or shape and whether pooling information across filters can improve performance. The best single filter correctly predicted the direction of lightness effects for 21 out of 27 illusions. Combining two filters together increased the best performance to 23, with asymptotic performance at 24 for an arbitrarily large combination of filter outputs. While normalization improved prediction magnitudes, it only slightly improved overall scores in direction predictions. The prediction performance of 24 out of 27 illusions equals that of the best performing ODOG variant, with greater parsimony. Our model shows that V1-style orientation-selectivity is not necessary to account for lightness illusions and that a low-level model based on image statistics is able to account for a wide range of both contrast and assimilation effects.
Collapse
Affiliation(s)
- Astrid Zeman
- Department of Cognitive Science, ARC Centre of Excellence in Cognition and its Disorders, Macquarie University Sydney, NSW, Australia ; Commonwealth Scientific and Industrial Research Organisation Marsfield, NSW, Australia ; Perception in Action Research Centre, Macquarie University Sydney, NSW, Australia
| | - Kevin R Brooks
- Perception in Action Research Centre, Macquarie University Sydney, NSW, Australia ; Department of Psychology, Macquarie University Sydney, NSW, Australia
| | - Sennay Ghebreab
- Cognitive Neuroscience Group, Department of Psychology, University of Amsterdam Amsterdam, Netherlands ; Intelligent Systems Lab Amsterdam, Institute of Informatics, University of Amsterdam Amsterdam, Netherlands
| |
Collapse
|
9
|
|
10
|
Wei H, Lang B, Zuo Q. Contour detection model with multi-scale integration based on non-classical receptive field. Neurocomputing 2013. [DOI: 10.1016/j.neucom.2012.09.027] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
11
|
Ghosh K. A possible role and basis of visual pathway selection in brightness induction. SEEING AND PERCEIVING 2012; 25:179-212. [PMID: 22726252 DOI: 10.1163/187847612x629946] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Abstract
It is a well-known fact that the perceived brightness of any surface depends on the brightness of the surfaces that surround it. This phenomenon is termed as brightness induction. Isotropic arrays of multi-scale DoG (Difference of Gaussians) as well as cortical Oriented DoG (ODOG) and extensions thereof, like the Frequency-specific Locally Normalized ODOG (FLODOG) functions have been employed towards prediction of the direction of brightness induction in many brightness perception effects. But the neural basis of such spatial filters is seldom obvious. For instance, the visual information from retinal ganglion cells to such spatial filters, which have been generally speculated to appear at the early stage of cortical processing, are fed by at least three parallel channels viz. Parvocellular (P), Magnocellular (M) and Koniocellular (K) in the subcortical pathway, but the role of such pathways in brightness induction is generally not implicit. In this work, three different spatial filters based on an extended classical receptive field (ECRF) model of retinal ganglion cells, have been approximately related to the spatial contrast sensitivity functions of these three parallel channels. Based on our analysis involving different brightness perception effects, we propose that the M channel, with maximum conduction velocity, may have a special role for an initial sensorial perception. As a result, brightness assimilation may be the consequence of vision at a glance through the M pathway; contrast effect may be the consequence of a subsequent vision with scrutiny through the P channel; and the K pathway response may represent an intermediate situation resulting in ambiguity in brightness perception. The present work attempts to correlate this phenomenon of pathway selection with the complementary nature of these channels in terms of spatial frequency as well as contrast.
Collapse
|
12
|
Wei H, Wang XM, Lai LL. Compact image representation model based on both nCRF and reverse control mechanisms. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2012; 23:150-162. [PMID: 24808464 DOI: 10.1109/tnnls.2011.2178472] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]
Abstract
The aim of this paper is to construct a bio-inspired hierarchical neural network that could accurately represent visual images and facilitate follow-up processing. Our computational model adopted a ganglion cell (GC) mechanism with a receptive field that dynamically self-adjusts according to the characteristics of an input image. For each GC, a micro neural circuit and a reverse control circuit were developed to self-adaptively resize the receptive field. An array was also designed to imitate the layer of GCs that perform image representation. Results revealed that this GC array could represent images from the external environment with a low processing cost, and this nonclassical receptive field mechanism could substantially improve both segmentation and integration processing. This model enables automatic extraction of blocks from images, which makes multiscale representation feasible. Importantly, once an original pixel-level image was reorganized into a GC array, semantic-level features emerged. Because GCs, like symbols, are discrete and separable, this GC-grained compact representation is open to operations that can manipulate images partially and selectively. Thus, the GC-array model provides a basic infrastructure and allows for high-level image processing.
Collapse
|
13
|
Ghosh K, Sarkar S, Bhaumik K. A possible mechanism of stochastic resonance in the light of an extra-classical receptive field model of retinal ganglion cells. BIOLOGICAL CYBERNETICS 2009; 100:351-359. [PMID: 19373486 DOI: 10.1007/s00422-009-0306-9] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/17/2008] [Accepted: 03/22/2009] [Indexed: 05/27/2023]
Abstract
Traditionally the intensity discontinuities in an image are detected as zero-crossings of the second derivative with the help of a Laplacian of Gaussian (LOG) operator that models the receptive field of retinal Ganglion cells. Such zero-crossings supposedly form a raw primal sketch edge map of the external world in the primary visual cortex of the brain. Based on a new operator which is a linear combination of the LOG and a Dirac-delta function that models the extra-classical receptive field of the ganglion cells, we find that zero-crossing points thus generated, store in presence of noise, apart from the edge information, the shading information of the image in the form of density variation of these points. We have also shown that an optimal image contrast produces best mapping of the shading information to such zero-crossing density variation for a given amount of noise contamination. Furthermore, we have observed that an optimal amount of noise contamination reproduces the minimum optimal contrast and hence gives rise to the best representation of the original image. We show that this phenomenon is similar in nature to that of stochastic resonance phenomenon observed in psychophysical experiments.
Collapse
Affiliation(s)
- Kuntal Ghosh
- Machine Intelligence Unit, Center for Soft Computing Research, Indian Statistical Institute, 203 B. T. Road, Kolkata, 700108, India.
| | | | | |
Collapse
|
14
|
Ghosh K, Bhaumik K, Sarkar S. Retinomorphic image processing. PROGRESS IN BRAIN RESEARCH 2008; 168:175-91. [PMID: 18166395 DOI: 10.1016/s0079-6123(07)68015-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/21/2023]
Abstract
The present work is aimed at understanding and explaining some of the aspects of visual signal processing at the retinal level while exploiting the same towards the development of some simple techniques in the domain of digital image processing. Classical studies on retinal physiology revealed the nature of contrast sensitivity of the receptive field of bipolar or ganglion cells, which lie in the outer and inner plexiform layers of the retina. To explain these observations, a difference of Gaussian (DOG) filter was suggested, which was subsequently modified to a Laplacian of Gaussian (LOG) filter for computational ease in handling two-dimensional retinal inputs. Till date almost all image processing algorithms, used in various branches of science and engineering had followed LOG or one of its variants. Recent observations in retinal physiology however, indicate that the retinal ganglion cells receive input from a larger area than the classical receptive fields. We have proposed an isotropic model for the non-classical receptive field of the retinal ganglion cells, corroborated from these recent observations, by introducing higher order derivatives of Gaussian expressed as linear combination of Gaussians only. In digital image processing, this provides a new mechanism of edge detection on one hand and image half-toning on the other. It has also been found that living systems may sometimes prefer to "perceive" the external scenario by adding noise to the received signals in the pre-processing level for arriving at better information on light and shade in the edge map. The proposed model also provides explanation to many brightness-contrast illusions hitherto unexplained not only by the classical isotropic model but also by some other Gestalt and Constructivist models or by non-isotropic multi-scale models. The proposed model is easy to implement both in the analog and digital domain. A scheme for implementation in the analog domain generates a new silicon retina model implemented on a hardware development platform.
Collapse
Affiliation(s)
- Kuntal Ghosh
- Centre for Soft Computing Research, Indian Statistical Institute, 203, B.T. Road, Calcutta, India
| | | | | |
Collapse
|