1
|
Wright D, Dering B, Martinovic J, Gheorghiu E. Neural responses to dynamic adaptation reveal the dissociation between the processing of the shape of contours and textures. Cortex 2020; 127:78-93. [PMID: 32169678 DOI: 10.1016/j.cortex.2020.01.015] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2019] [Revised: 09/20/2019] [Accepted: 01/21/2020] [Indexed: 10/25/2022]
Abstract
Shape-adaptation studies show that surround textures can inhibit the processing of contours. Using event-related potentials (ERP), we examined the time-course of neural processes involved in contour-shape and texture-shape processing following adaptation to contours and textures. Contours were made of Gabor strings whose orientations were either tangential or orthogonal to the contour path, while textures were made of a series of contours arranged in parallel. We focused on two ERP components -P1, related to low-level visual processes and N1, broadly indicative of mid-level vision- and, on ERP difference waves (no-adaptor minus with-adaptor) to isolate the effects of adaptation, which are fundamentally distinct from individual processes driving P1 and N1 components. We found that in the absence of adaptation, the N1 component for contour-tests peaked later and increased in amplitude compared to the N1 for texture-tests. Following adaptation, the ERP difference wave for contour-tests revealed an early and a late component that were differentially affected by the presence of surround texture, but critically not by its orientation. For texture-tests, the early component was of opposite polarity for contours compared to texture adaptors. From the temporal sequence of ERP modulations, we conclude that texture processing begins before contour processing and encompasses the stages of perceptual processing reflected in both the low-level P1 and the mid-level N1 vision-related components. Our study provides novel evidence on the nature of separable and temporally distinct texture and contour processing mechanisms, shown in two difference wave components, that highlights the multi-faceted nature of dynamic adaptation to shape when presented in isolation and in context.
Collapse
Affiliation(s)
- Damien Wright
- University of Stirling, Department of Psychology, Stirling, Scotland, United Kingdom
| | - Benjamin Dering
- University of Stirling, Department of Psychology, Stirling, Scotland, United Kingdom
| | - Jasna Martinovic
- University of Aberdeen, School of Psychology, Aberdeen, Scotland, United Kingdom
| | - Elena Gheorghiu
- University of Stirling, Department of Psychology, Stirling, Scotland, United Kingdom.
| |
Collapse
|
2
|
Lehky SR, Phan AH, Cichocki A, Tanaka K. Face Representations via Tensorfaces of Various Complexities. Neural Comput 2019; 32:281-329. [PMID: 31835006 DOI: 10.1162/neco_a_01258] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]
Abstract
Neurons selective for faces exist in humans and monkeys. However, characteristics of face cell receptive fields are poorly understood. In this theoretical study, we explore the effects of complexity, defined as algorithmic information (Kolmogorov complexity) and logical depth, on possible ways that face cells may be organized. We use tensor decompositions to decompose faces into a set of components, called tensorfaces, and their associated weights, which can be interpreted as model face cells and their firing rates. These tensorfaces form a high-dimensional representation space in which each tensorface forms an axis of the space. A distinctive feature of the decomposition algorithm is the ability to specify tensorface complexity. We found that low-complexity tensorfaces have blob-like appearances crudely approximating faces, while high-complexity tensorfaces appear clearly face-like. Low-complexity tensorfaces require a larger population to reach a criterion face reconstruction error than medium- or high-complexity tensorfaces, and thus are inefficient by that criterion. Low-complexity tensorfaces, however, generalize better when representing statistically novel faces, which are faces falling beyond the distribution of face description parameters found in the tensorface training set. The degree to which face representations are parts based or global forms a continuum as a function of tensorface complexity, with low and medium tensorfaces being more parts based. Given the computational load imposed in creating high-complexity face cells (in the form of algorithmic information and logical depth) and in the absence of a compelling advantage to using high-complexity cells, we suggest face representations consist of a mixture of low- and medium-complexity face cells.
Collapse
Affiliation(s)
- Sidney R Lehky
- Cognitive Brain Mapping Laboratory, RIKEN Center for Brain Science, Wako-shi, Saitama 351-0198, Japan, and Computational Neurobiology Laboratory, Salk Institute, La Jolla, CA 92037, U.S.A.
| | - Anh Huy Phan
- Center for Computational and Data-Intensive Science and Engineering, Skolkovo Institute of Science and Technology, 143026 Moscow, Russia; and Institute of Global Innovation Research, Tokyo University of Agriculture and Technology, Tokyo 183-8538, Japan
| | - Andrzej Cichocki
- Center for Computational and Data-Intensive Science and Engineering, Skolkovo Institute of Science and Technology, 143026 Moscow, Russia; Systems Research Institute, Polish Academy of Sciences, 01447 Warsaw, Poland; College of Computer Science, Hangzhou Dianzu University, Hangzhou 310018, China; and Institute of Global Innovation Research, Tokyo University of Agriculture and Technology, Tokyo 183-8538, Japan
| | - Keiji Tanaka
- Cognitive Brain Mapping Laboratory, RIKEN Center for Brain Science, Wako-shi, Saitama 325-0198, Japan
| |
Collapse
|
3
|
Popovkina DV, Bair W, Pasupathy A. Modeling diverse responses to filled and outline shapes in macaque V4. J Neurophysiol 2019; 121:1059-1077. [PMID: 30699004 DOI: 10.1152/jn.00456.2018] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
Visual area V4 is an important midlevel cortical processing stage that subserves object recognition in primates. Studies investigating shape coding in V4 have largely probed neuronal responses with filled shapes, i.e., shapes defined by both a boundary and an interior fill. As a result, we do not know whether form-selective V4 responses are dictated by boundary features alone or if interior fill is also important. We studied 43 V4 neurons in two male macaque monkeys ( Macaca mulatta) with a set of 362 filled shapes and their corresponding outlines to determine how interior fill modulates neuronal responses in shape-selective neurons. Only a minority of neurons exhibited similar response strength and shape preferences for filled and outline stimuli. A majority responded preferentially to one stimulus category (either filled or outline shapes) and poorly to the other. Our findings are inconsistent with predictions of the hierarchical-max (HMax) V4 model that builds form selectivity from oriented boundary features and takes little account of attributes related to object surface, such as the phase of the boundary edge. We modified the V4 HMax model to include sensitivity to interior fill by either removing phase-pooling or introducing unoriented units at the V1 level; both modifications better explained our data without increasing the number of free parameters. Overall, our results suggest that boundary orientation and interior surface information are both maintained until at least the midlevel visual representation, consistent with the idea that object fill is important for recognition and perception in natural vision. NEW & NOTEWORTHY The shape of an object's boundary is critical for identification; consistent with this idea, models of object recognition predict that filled and outline versions of a shape are encoded similarly. We report that many neurons in a midlevel visual cortical area respond differently to filled and outline shapes and modify a biologically plausible model to account for our data. Our results suggest that representations of boundary shape and surface fill are interrelated in visual cortex.
Collapse
Affiliation(s)
- Dina V Popovkina
- Department of Biological Structure, Washington National Primate Research Center, University of Washington , Seattle, Washington
| | - Wyeth Bair
- Department of Biological Structure, Washington National Primate Research Center, University of Washington , Seattle, Washington
| | - Anitha Pasupathy
- Department of Biological Structure, Washington National Primate Research Center, University of Washington , Seattle, Washington
| |
Collapse
|
4
|
Neural representation for object recognition in inferotemporal cortex. Curr Opin Neurobiol 2016; 37:23-35. [PMID: 26771242 DOI: 10.1016/j.conb.2015.12.001] [Citation(s) in RCA: 57] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2015] [Accepted: 12/01/2015] [Indexed: 11/22/2022]
Abstract
We suggest that population representation of objects in inferotemporal cortex lie on a continuum between a purely structural, parts-based description and a purely holistic description. The intrinsic dimensionality of object representation is estimated to be around 100, perhaps with lower dimensionalities for object representations more toward the holistic end of the spectrum. Cognitive knowledge in the form of semantic information and task information feed back to inferotemporal cortex from perirhinal and prefrontal cortex respectively, providing high-level multimodal-based expectations that assist in the interpretation of object stimuli. Integration of object information across eye movements may also contribute to object recognition through a process of active vision.
Collapse
|
5
|
Ohayon S, Freiwald WA, Tsao DY. What makes a cell face selective? The importance of contrast. Neuron 2012; 74:567-81. [PMID: 22578507 DOI: 10.1016/j.neuron.2012.03.024] [Citation(s) in RCA: 78] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/12/2012] [Indexed: 10/28/2022]
Abstract
Faces are robustly detected by computer vision algorithms that search for characteristic coarse contrast features. Here, we investigated whether face-selective cells in the primate brain exploit contrast features as well. We recorded from face-selective neurons in macaque inferotemporal cortex, while presenting a face-like collage of regions whose luminances were changed randomly. Modulating contrast combinations between regions induced activity changes ranging from no response to a response greater than that to a real face in 50% of cells. The critical stimulus factor determining response magnitude was contrast polarity, for example, nose region brighter than left eye. Contrast polarity preferences were consistent across cells, suggesting a common computational strategy across the population, and matched features used by computer vision algorithms for face detection. Furthermore, most cells were tuned both for contrast polarity and for the geometry of facial features, suggesting cells encode information useful both for detection and recognition.
Collapse
Affiliation(s)
- Shay Ohayon
- Division of Biology, California Institute of Technology, 1200 East California Boulevard, Pasadena, CA 91125, USA.
| | | | | |
Collapse
|
6
|
Schmidtmann G, Kennedy GJ, Orbach HS, Loffler G. Non-linear global pooling in the discrimination of circular and non-circular shapes. Vision Res 2012; 62:44-56. [DOI: 10.1016/j.visres.2012.03.001] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2011] [Revised: 03/02/2012] [Accepted: 03/03/2012] [Indexed: 11/29/2022]
|
7
|
Kim YJ, Grabowecky M, Paller KA, Suzuki S. Differential roles of frequency-following and frequency-doubling visual responses revealed by evoked neural harmonics. J Cogn Neurosci 2010; 23:1875-86. [PMID: 20684661 DOI: 10.1162/jocn.2010.21536] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]
Abstract
Frequency-following and frequency-doubling neurons are ubiquitous in both striate and extrastriate visual areas. However, responses from these two types of neural populations have not been effectively compared in humans because previous EEG studies have not successfully dissociated responses from these populations. We devised a light-dark flicker stimulus that unambiguously distinguished these responses as reflected in the first and second harmonics in the steady-state visual evoked potentials. These harmonics revealed the spatial and functional segregation of frequency-following (the first harmonic) and frequency-doubling (the second harmonic) neural populations. Spatially, the first and second harmonics in steady-state visual evoked potentials exhibited divergent posterior scalp topographies for a broad range of EEG frequencies. The scalp maximum was medial for the first harmonic and contralateral for the second harmonic, a divergence not attributable to absolute response frequency. Functionally, voluntary visual-spatial attention strongly modulated the second harmonic but had negligible effects on the simultaneously elicited first harmonic. These dissociations suggest an intriguing possibility that frequency-following and frequency-doubling neural populations may contribute complementary functions to resolve the conflicting demands of attentional enhancement and signal fidelity--the frequency-doubling population may mediate substantial top-down signal modulation for attentional selection, whereas the frequency-following population may simultaneously preserve relatively undistorted sensory qualities regardless of the observer's cognitive state.
Collapse
Affiliation(s)
- Yee-Joon Kim
- Department of Psychology, Northwestern University, 2029 Sheridan Rd., Evanston, IL 60208, USA
| | | | | | | |
Collapse
|
8
|
Tompa T, Sáry G. A review on the inferior temporal cortex of the macaque. ACTA ACUST UNITED AC 2010; 62:165-82. [PMID: 19853626 DOI: 10.1016/j.brainresrev.2009.10.001] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2009] [Revised: 10/14/2009] [Accepted: 10/14/2009] [Indexed: 10/20/2022]
|
9
|
Hosoda K, Watanabe M, Wersing H, Körner E, Tsujino H, Tamura H, Fujita I. A model for learning topographically organized parts-based representations of objects in visual cortex: topographic nonnegative matrix factorization. Neural Comput 2009; 21:2605-33. [PMID: 19548799 DOI: 10.1162/neco.2009.03-08-722] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]
Abstract
Object representation in the inferior temporal cortex (IT), an area of visual cortex critical for object recognition in the primate, exhibits two prominent properties: (1) objects are represented by the combined activity of columnar clusters of neurons, with each cluster representing component features or parts of objects, and (2) closely related features are continuously represented along the tangential direction of individual columnar clusters. Here we propose a learning model that reflects these properties of parts-based representation and topographic organization in a unified framework. This model is based on a nonnegative matrix factorization (NMF) basis decomposition method. NMF alone provides a parts-based representation where nonnegative inputs are approximated by additive combinations of nonnegative basis functions. Our proposed model of topographic NMF (TNMF) incorporates neighborhood connections between NMF basis functions arranged on a topographic map and attains the topographic property without losing the parts-based property of the NMF. The TNMF represents an input by multiple activity peaks to describe diverse information, whereas conventional topographic models, such as the self-organizing map (SOM), represent an input by a single activity peak in a topographic map. We demonstrate the parts-based and topographic properties of the TNMF by constructing a hierarchical model for object recognition where the TNMF is at the top tier for learning high-level object features. The TNMF showed better generalization performance over NMF for a data set of continuous view change of an image and more robustly preserving the continuity of the view change in its object representation. Comparison of the outputs of our model with actual neural responses recorded in the IT indicates that the TNMF reconstructs the neuronal responses better than the SOM, giving plausibility to the parts-based learning of the model.
Collapse
Affiliation(s)
- Kenji Hosoda
- Department of Quantum Engineering and Systems Science, University of Tokyo, Tokyo, Japan.
| | | | | | | | | | | | | |
Collapse
|
10
|
Binocular properties of curvature-encoding mechanisms revealed through two shape after-effects. Vision Res 2009; 49:1765-74. [DOI: 10.1016/j.visres.2009.04.010] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2008] [Revised: 02/12/2009] [Accepted: 04/09/2009] [Indexed: 11/18/2022]
|
11
|
Gheorghiu E, Kingdom FAA. Spatial properties of curvature-encoding mechanisms revealed through the shape-frequency and shape-amplitude after-effects. Vision Res 2008; 48:1107-24. [PMID: 18353425 DOI: 10.1016/j.visres.2008.02.002] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2007] [Revised: 01/18/2008] [Accepted: 02/07/2008] [Indexed: 11/17/2022]
Abstract
The shape-frequency and shape-amplitude after-effects, or SFAE and SAAE, are phenomena in which adaptation to a sinusoidal-shaped contour results in a shift in, respectively, the perceived shape-frequency and perceived shape-amplitude of a test contour in a direction away from that of the adapting stimulus. Recent evidence shows that the SFAE and SAAE are mediated by mechanisms sensitive to curvature [Gheorghiu, E., & Kingdom, F. A. A. (2007a). The spatial feature underlying the shape-frequency and shape-amplitude after-effects. Vision Research, 47(6), 834-844]. Therefore we have used the SFAE and SAAE as a tool to study curvature processing. We examined whether curvature-encoding mechanisms are selective for (i) shape-phase, (ii) curvature polarity (or sign) and (iii) local orientation. We also investigated whether (iv) the two orthogonal dimensions of a curve, the sag and the cord, are encoded independently, and (v) whether curvature encoders are organized in an opponent manner. SFAEs/SAAEs were measured for adapting and test contours that differed or not in a given spatial property, the rationale being that if the after-effects were smaller when adaptor and test differed in a particular spatial property then curvature-encoding mechanisms must be selective for that spatial property. Our results reveal that SFAEs and SAAEs show (i) a degree of selectivity to curves that are mirror symmetric (in our stimuli half-cycle sine-wave contours in cosine (0/180deg) shape-phase); (ii) a degree of selectivity to the sign or polarity of curvature; (iii) a degree of selectivity to local orientation; (iv) independent coding of the sag and the cord of the curve, and (v) no evidence for opponent-curvature coding. The results agree with neurophysiological studies showing that simple shape dimensions are encoded independently.
Collapse
Affiliation(s)
- Elena Gheorghiu
- Department of Ophthalmology, McGill University, Montreal, Que., Canada H3A 1A1.
| | | |
Collapse
|
12
|
Suzuki S, Grabowecky M. Long-term speeding in perceptual switches mediated by attention-dependent plasticity in cortical visual processing. Neuron 2008; 56:741-53. [PMID: 18031689 DOI: 10.1016/j.neuron.2007.09.028] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2006] [Revised: 06/10/2007] [Accepted: 09/19/2007] [Indexed: 11/18/2022]
Abstract
Binocular rivalry has been extensively studied to understand the mechanisms that control switches in visual awareness and much has been revealed about the contributions of stimulus and cognitive factors. Because visual processes are fundamentally adaptive, however, it is also important to understand how experience alters the dynamics of perceptual switches. When observers viewed binocular rivalry repeatedly over many days, the rate of perceptual switches increased as much as 3-fold. This long-term rivalry speeding exhibited a pattern of image-feature specificity that ruled out primary contributions from strategic and nonsensory factors and implicated neural plasticity occurring in both low- and high-level visual processes in the ventral stream. Furthermore, the speeding occurred only when the rivaling patterns were voluntarily attended, suggesting that the underlying neural plasticity selectively engages when stimuli are behaviorally relevant. Long-term rivalry speeding may thus reflect broader mechanisms that facilitate quick assessments of signals that contain multiple behaviorally relevant interpretations.
Collapse
Affiliation(s)
- Satoru Suzuki
- Department of Psychology and Institute for Neuroscience, Northwestern University, Evanston, IL 60208, USA.
| | | |
Collapse
|
13
|
Itier RJ, Alain C, Sedore K, McIntosh AR. Early face processing specificity: it's in the eyes! J Cogn Neurosci 2008; 19:1815-26. [PMID: 17958484 DOI: 10.1162/jocn.2007.19.11.1815] [Citation(s) in RCA: 184] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]
Abstract
Unlike most other objects that are processed analytically, faces are processed configurally. This configural processing is reflected early in visual processing following face inversion and contrast reversal, as an increase in the N170 amplitude, a scalp-recorded event-related potential. Here, we show that these face-specific effects are mediated by the eye region. That is, they occurred only when the eyes were present, but not when eyes were removed from the face. The N170 recorded to inverted and negative faces likely reflects the processing of the eyes. We propose a neural model of face processing in which face- and eye-selective neurons situated in the superior temporal sulcus region of the human brain respond differently to the face configuration and to the eyes depending on the face context. This dynamic response modulation accounts for the N170 variations reported in the literature. The eyes may be central to what makes faces so special.
Collapse
Affiliation(s)
- Roxane J Itier
- Rotman Research Institute, Baycrest Centre, Toronto, Canada.
| | | | | | | |
Collapse
|
14
|
Gheorghiu E, Kingdom FAA. Chromatic tuning of contour-shape mechanisms revealed through the shape-frequency and shape-amplitude after-effects. Vision Res 2007; 47:1935-49. [PMID: 17499333 DOI: 10.1016/j.visres.2007.03.010] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2006] [Revised: 02/08/2007] [Accepted: 03/13/2007] [Indexed: 11/21/2022]
Abstract
We investigated whether contour-shape processing mechanisms are selective for color direction using the shape-frequency and shape-amplitude after-effects, or SFAE and SAAE [Gheorghiu, E. & Kingdom, F. A. A. (2006). Luminance-contrast properties of contour-shape processing revealed through the shape-frequency after-effect. Vision Research, 46(21), 3603-3615. Gheorghiu, E. & Kingdom, F. A. A. (2007). The spatial feature underlying the shape-frequency and shape-amplitude after-effects. Vision Research, 47(6), 834-844]. All contours were defined along the 'red-green', 'blue-yellow' and 'luminance' axes of cardinal color space. Adapting and test contours were defined along the same or along opposite polarities within a cardinal axis, and along the same or along different cardinal axes. We found (i) little transfer of the after-effects across different within-axis polarities, for all cardinal axes and for both even-symmetric and odd-symmetric contours; (ii) little transfer between the red-green and blue-yellow cardinal axes; (iii) little transfer between the chromatic and luminance cardinal directions for the SAAE; (iv) large transfer between the chromatic and luminance cardinal directions for the SFAE. We conclude that contour-shape mechanisms are selective for within-cardinal axis polarity and for the chromatic axes within the isoluminant plane. However for certain types of contour-shape processing they are poorly selective along the chromatic versus luminance dimension. Overall our results suggest that contour-shape encoding mechanisms are selective for color direction and that color is important for contour-shape processing.
Collapse
Affiliation(s)
- Elena Gheorghiu
- McGill Vision Research, Department of Ophthalmology, McGill University, 687 Pine Avenue W., Montreal H3A 1A1, Que., Canada.
| | | |
Collapse
|
15
|
Kiani R, Esteky H, Mirpour K, Tanaka K. Object category structure in response patterns of neuronal population in monkey inferior temporal cortex. J Neurophysiol 2007; 97:4296-309. [PMID: 17428910 DOI: 10.1152/jn.00024.2007] [Citation(s) in RCA: 298] [Impact Index Per Article: 17.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
Our mental representation of object categories is hierarchically organized, and our rapid and seemingly effortless categorization ability is crucial for our daily behavior. Here, we examine responses of a large number (>600) of neurons in monkey inferior temporal (IT) cortex with a large number (>1,000) of natural and artificial object images. During the recordings, the monkeys performed a passive fixation task. We found that the categorical structure of objects is represented by the pattern of activity distributed over the cell population. Animate and inanimate objects created distinguishable clusters in the population code. The global category of animate objects was divided into bodies, hands, and faces. Faces were divided into primate and nonprimate faces, and the primate-face group was divided into human and monkey faces. Bodies of human, birds, and four-limb animals clustered together, whereas lower animals such as fish, reptile, and insects made another cluster. Thus the cluster analysis showed that IT population responses reconstruct a large part of our intuitive category structure, including the global division into animate and inanimate objects, and further hierarchical subdivisions of animate objects. The representation of categories was distributed in several respects, e.g., the similarity of response patterns to stimuli within a category was maintained by both the cells that maximally responded to the category and the cells that responded weakly to the category. These results advance our understanding of the nature of the IT neural code, suggesting an inherently categorical representation that comprises a range of categories including the amply investigated face category.
Collapse
Affiliation(s)
- Roozbeh Kiani
- Research Group for Brain and Cognitive Sciences, School of Medicine, Shaheed Beheshti University, P.O. Box 19835-181, Tehran, Iran
| | | | | | | |
Collapse
|
16
|
Gheorghiu E, Kingdom FAA. The spatial feature underlying the shape-frequency and shape-amplitude after-effects. Vision Res 2007; 47:834-44. [PMID: 17292437 DOI: 10.1016/j.visres.2006.11.023] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2006] [Revised: 11/06/2006] [Accepted: 11/07/2006] [Indexed: 11/22/2022]
Abstract
The shape-frequency and shape-amplitude after-effects, or SFAE and SAAE, refer respectively to the shifts observed in the perceived shape-frequency and shape-amplitude of a sinusoidal test contour following adaptation to a similar-shaped contour. As with other shape after-effects the shifts are in a direction away from that of the adapting stimulus. Using a variety of procedures we tested whether the spatial feature that was adapted in the SFAE and SAAE was (a) local orientation, (b) average unsigned curvature, (c) periodicity/density, (d) shape-amplitude and (e) local curvature. Our results suggest that the last of these, local curvature, underlies both the SFAE and SAAE. The evidence in favour of local curvature was that the after-effect reached its maximum value when just half-a-cycle of the test contour, in +/-cosine phase, was present. We suggest that the SFAE and SAAE are mediated by intermediate-level mechanisms that encode the shapes of contour fragments with constant sign of curvature. Given the neurophysiological evidence that neurons in area V4 encode parts of shapes with constant sign of curvature, we suggest V4 is the likely neural substrate for both the SFAE and SAAE.
Collapse
Affiliation(s)
- Elena Gheorghiu
- McGill Vision Research, Department of Ophthalmology, McGill University, 687 Pine Avenue West, Montreal, Quebec, Canada H3A 1A1.
| | | |
Collapse
|
17
|
Gheorghiu E, Kingdom FAA. Luminance-contrast properties of contour-shape processing revealed through the shape-frequency after-effect. Vision Res 2006; 46:3603-15. [PMID: 16769101 DOI: 10.1016/j.visres.2006.04.021] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2006] [Revised: 04/16/2006] [Accepted: 04/28/2006] [Indexed: 11/17/2022]
Abstract
We investigated the first-order inputs to contour-shape mechanisms using the shape-frequency after-effect (SFAE), in which adaptation to a sinusoidally modulated contour causes a shift in the apparent shape-frequency of a test contour in a direction away from that of the adapting stimulus [Kingdom F. A. A., & Prins N. (2005a). Different mechanisms encode the shapes of contours and contour-textures. Journal of Vision 5(8), 463, (Abstract)]. We measured SFAEs for adapting and test contours (and edges) that differed in the contrast-polarity, scale (or blur) and magnitude of luminance contrast. The rationale was that if the SFAE was found to be reduced when adaptor and test differed along a particular dimension of luminance contrast, contour-shape mechanisms must be tuned to that dimension. Our results reveal that SFAEs manifest (i) a degree of selectivity to luminance contrast polarity for both even-symmetric (contours only) and odd-symmetric (both contours and edges) luminance profiles; (ii) a degree of selectivity to luminance scale (or blur); (iii) higher selectivity to fine compared to coarse scale for broadband edges (iv) a small preference for equal-in-contrast adaptors and tests. These results suggest that contour shapes are not encoded in the form of a sparse, cartoon-like sketch, as might be presumed by local energy (i.e. non-phase-selective) or form-cue invariant models, but instead in a form that is relatively 'feature-rich.'
Collapse
Affiliation(s)
- Elena Gheorghiu
- McGill Vision Research, Department of Ophthalmology, McGill University, Montreal, Que, Canada.
| | | |
Collapse
|
18
|
Itier RJ, Latinus M, Taylor MJ. Face, eye and object early processing: What is the face specificity? Neuroimage 2006; 29:667-76. [PMID: 16169749 DOI: 10.1016/j.neuroimage.2005.07.041] [Citation(s) in RCA: 205] [Impact Index Per Article: 11.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2005] [Revised: 07/22/2005] [Accepted: 07/27/2005] [Indexed: 11/25/2022] Open
Abstract
We investigated the human face specificity by comparing the effects of inversion and contrast reversal, two manipulations known to disrupt configural face processing, on human and ape faces, isolated eyes and objects, using event-related potentials. The face sensitive marker, N170, was shortest to human faces and delayed by inversion and contrast reversal for all categories and not only for human faces. Most importantly, N170 to inverted or contrast-reversed faces was not different from N170 to eyes that did not differ across manipulations. This suggests the disruption of facial configuration by these manipulations isolates the eye region from the face context, to which eye neurons respond. Our data suggest that (i) the inversion and contrast reversal effects on N170 latency are not specific to human faces and (ii) the similar increase of N170 amplitude by inversion and contrast reversal is unique to human faces and is driven by the eye region. Thus, while inversion and contrast reversal effects on N170 latency are not category-specific, their effects on amplitude are face-specific and reflect mainly the contribution of the eye region.
Collapse
Affiliation(s)
- Roxane J Itier
- The Rotman Research Institute, Baycrest Centre for Geriatric Care, 3560 Bathurst Street, Toronto, Ontario, Canada M6A 2E1.
| | | | | |
Collapse
|
19
|
Tamura H, Kaneko H, Fujita I. Quantitative analysis of functional clustering of neurons in the macaque inferior temporal cortex. Neurosci Res 2005; 52:311-22. [PMID: 15893835 DOI: 10.1016/j.neures.2005.04.006] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2004] [Revised: 02/28/2005] [Accepted: 04/12/2005] [Indexed: 11/30/2022]
Abstract
Neurons with similar preferences for two-dimensional shapes of intermediate complexity cluster in area TE of the monkey inferior temporal cortex. To further characterize the functional structure of area TE, we quantitatively analyzed various aspects of the visual responses of closely located neurons by applying multiple single-unit recording techniques in anesthetized monkeys. Examination of the visual responses elicited with a large, predetermined set of visual stimuli confirmed previous findings that nearby neurons, on average, exhibited positively correlated preferences for a set of visual stimuli. Nearby neurons also tended to be similar in their receptive-field organization and contrast-polarity preference. In contrast, no correlation was found in the size tuning of neighboring neurons. Pooling or subtraction of activities between a pair of nearby neurons was shown to improve stimulus discriminability, if the neuron pair had positively or negatively correlated stimulus preferences, respectively. These results indicate that nearby TE neurons share some aspects of stimulus preference, but their response selectivity differ in other aspects. Both pooling and subtraction between nearby neurons can reduce across-trial response variability, if these decoding strategies are applied to appropriate neuronal pools.
Collapse
Affiliation(s)
- Hiroshi Tamura
- Graduate School of Frontier Biosciences, Osaka University, Laboratory for Cognitive Neuroscience, Toyonaka, Osaka 560-8531, Japan.
| | | | | |
Collapse
|
20
|
Jüttner M, Langguth B, Rentschler I. The impact of context on pattern category learning and representation. VISUAL COGNITION 2004. [DOI: 10.1080/13506280444000058a] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|
21
|
Itier RJ, Taylor MJ, Lobaugh NJ. Spatiotemporal analysis of event-related potentials to upright, inverted, and contrast-reversed faces: Effects on encoding and recognition. Psychophysiology 2004; 41:643-53. [PMID: 15189487 DOI: 10.1111/j.1469-8986.2004.00183.x] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
In an n-back face recognition task where subjects responded to repeated stimuli, ERPs were recorded to upright, inverted, and contrast-reversed faces. The effects of inversion and contrast reversal on face encoding and recognition were investigated using the multivariate spatiotemporal partial least squares (PLS) analysis. The configural manipulations affected early processing (100-200 ms) at posterior sites: Inversion effects were parietal and lateral, whereas contrast-reversal effects were more occipital and medial, suggesting different underlying generators. A later reactivation of face processing areas was unique to inverted faces, likely due to processing difficulties. PLS also indicated that the "old-new" repetition effect was maximal for upright faces and likely involved frontotemporal areas. Marked processing differences between inverted and contrast-reversed faces were seen, but these effects were similar at encoding and recognition.
Collapse
|
22
|
Suzuki S, Grabowecky M. Attention during adaptation weakens negative afterimages. J Exp Psychol Hum Percept Perform 2003; 29:793-807. [PMID: 12967222 DOI: 10.1037/0096-1523.29.4.793] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
The effect of attention during adaptation on subsequent negative afterimages was examined. One of 2 overlapped outline figures was attended during a 7-10-s adaptation period. When the figures were readily perceptually segregated (on the basis of color or motion), the subsequent afterimages were initially weaker for the previously attended figure. This effect was confirmed by demonstrations that the onset of a single afterimage was delayed when an afterimage inducer was attended during adaptation compared with when a central digit stream or an overlapped (brightness-balanced) figure that did not generate an afterimage was attended. The attention effect was further confirmed using a criterion-independent (dot-integration) paradigm. The fact that selective attention during adaptation weakened or delayed afterimages suggests that attention primarily facilitates the adaptation of polarity-independent processes that modulate the visibility of afterimages rather than facilitating the adaptation of polarity-selective processes that mediate the formation of afterimages.
Collapse
Affiliation(s)
- Satoru Suzuki
- Department of Psychology, Northwestern University, Evanston, Illinois 60208, USA.
| | | |
Collapse
|
23
|
Abstract
Prior studies using brief stimulus sequences revealed "opponent shape aftereffects", indicative of direct opponent coding of global shape attributes such as aspect ratio, skew, taper, curvature, and convexity (perhaps in IT). Further, aftereffects from overlapped opponent pairs of adaptor shapes (e.g., concave and convex shapes) were substantially modulated by attention [Vision Res. 41 (2001) 3883]. Hypothetically, (1) attention might weight the attended and ignored contours at early stages of processing, or (2) it might sway opposing neural activity (e.g., of convex- vs. concave-tuned units) at the stage of opponent shape coding. Attentional modulation was equivalent for opponent pairs (producing opposite aftereffects) and non-opponent pairs (producing orthogonal aftereffects) of overlapped adaptor shapes, whether convexity or aspect-ratio aftereffects were measured. Further, the degree of attentional modulation obtained for these aftereffects (approximately 60%) was comparable to that obtained for V4 cells [J. Neurosci. 19 (1999) 1736]. Taken together, differential contour weighting appears to be the primary mechanism of attentional modulation of brief shape aftereffects.
Collapse
Affiliation(s)
- Satoru Suzuki
- Department of Psychology, Northwestern University, 2029 Sheridan Rd., Evanston, IL 60208, USA.
| |
Collapse
|
24
|
Abstract
Recent models of texture processing use low level, spatially parallel computations to extract texture properties. The rapid, preattentive nature of texture segregation suggests that these computations are bottom-up in nature. However, the immunity of texture judgments to top-down influences remains to be tested. Here we investigate the degree to which judgments of texture orientation are susceptible to top-down attentional control. Observers view a brief display composed of variously luminant texture elements (line segments) alternately (in checkerboard arrangement) oriented up/right (at 71.5 degrees ) or up/left (at 108.5 degrees ), and are asked to make various judgments. In a given task, the observer attempts on each trial to judge which oriented population of line segments has an intensity histogram that best matches a given target histogram. Performance demonstrates adaptive flexibility across different tasks, suggesting that observers are able to exercise significant top-down control over texture orientation computations. Specifically, observers can attend selectively to positive contrast texture elements, to negative contrast texture elements, or to high (positive and negative) contrast texture elements. More generally, observers perform well if the target histogram can be approximated by a weighted average of positive and negative half-wave rectifiers. Performance is poor for histograms that cannot be captured in this way. These results suggest that attentional control in these tasks is limited to adjusting the relative gain of the on- and off-center systems.
Collapse
Affiliation(s)
- Charles Chubb
- Department of Cognitive Sciences, University of California at Irvine, Irvine, CA 92697-5100, USA.
| | | |
Collapse
|
25
|
Suzuki S. Attention-dependent brief adaptation to contour orientation: a high-level aftereffect for convexity? Vision Res 2001; 41:3883-902. [PMID: 11738454 DOI: 10.1016/s0042-6989(01)00249-8] [Citation(s) in RCA: 59] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
In contrast to the abundant literature investigating how orientation coding depends on edges defined by various image features, relatively little is known about how coding of orientation might also depend on the two distinct functional roles that oriented edges commonly play. Oriented lines can delineate outline contours of a figure or they can form texture. The results of five experiments using orientation aftereffects measured with brief tests (27 ms, backward masked; adapt-to-test interval=201 ms) provided evidence that brief stimuli (<135 ms) selectively adapt coding of contour-line orientation rather than coding of line-texture orientation. Furthermore, parametric results revealed that the rapidly adapting aftereffects for contour orientation are characterized by (1) broad orientation tuning (peaking at +/-30 degrees to +/-50 degrees from test orientation), (2) indifference as to how the contours are defined (e.g. bright lines, high-pass-filtered lines, faint lines generated by the spatial inhomogeneity of visual sensitivity), (3) rapid saturation at low contrast energy, (4) strong modulation by selective attention, and (5) relative size tolerance. These characteristics appear to parallel those of cells in the high end of the visual form processing pathway (such as inferotemporal cortex). It is thus suggested that the rapidly adapting contour orientation aftereffects reported here may be mediated by high-level neural units that encode global configurations of orientation (e.g. convexity and concavity).
Collapse
Affiliation(s)
- S Suzuki
- Department of Psychology, Northwestern University, 2029 Sheridan Rd., Evanston, IL 60208, USA.
| |
Collapse
|
26
|
Grill-Spector K, Kourtzi Z, Kanwisher N. The lateral occipital complex and its role in object recognition. Vision Res 2001; 41:1409-22. [PMID: 11322983 DOI: 10.1016/s0042-6989(01)00073-6] [Citation(s) in RCA: 903] [Impact Index Per Article: 39.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Here we review recent findings that reveal the functional properties of extra-striate regions in the human visual cortex that are involved in the representation and perception of objects. We characterize both the invariant and non-invariant properties of these regions and we discuss the correlation between activation of these regions and recognition. Overall, these results indicate that the lateral occipital complex plays an important role in human object recognition.
Collapse
Affiliation(s)
- K Grill-Spector
- Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA 02139, USA. kalanit2psyche.mit.edu
| | | | | |
Collapse
|
27
|
Grill-Spector K, Malach R. fMR-adaptation: a tool for studying the functional properties of human cortical neurons. Acta Psychol (Amst) 2001; 107:293-321. [PMID: 11388140 DOI: 10.1016/s0001-6918(01)00019-1] [Citation(s) in RCA: 767] [Impact Index Per Article: 33.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open
Abstract
The invariant properties of human cortical neurons cannot be studied directly by fMRI due to its limited spatial resolution. One voxel obtained from a fMRI scan contains several hundred thousands neurons. Therefore, the fMRI signal may average out a heterogeneous group of highly selective neurons. Here, we present a novel experimental paradigm for fMRI, functional magnetic resonance-adaptation (fMR-A), that enables to tag specific neuronal populations within an area and investigate their functional properties. This approach contrasts with conventional mapping methods that measure the averaged activity of a region. The application of fMR-A to study the functional properties of cortical neurons proceeds in two stages: First, the neuronal population is adapted by repeated presentation of a single stimulus. Second, some property of the stimulus is varied and the recovery from adaptation is assessed. If the signal remains adapted, it will indicate that the neurons are invariant to that attribute. However, if the fMRI signal will recover from the adapted state it would imply that the neurons are sensitive to the property that was varied. Here, an application of fMR-A for studying the invariant properties of high-order object areas (lateral occipital complex--LOC) to changes in object size, position, illumination and rotation is presented. The results show that LOC is less sensitive to changes in object size and position compared to changes of illumination and viewpoint. fMR-A can be extended to other neuronal systems in which adaptation is manifested and can be used with event-related paradigms as well. By manipulating experimental parameters and testing recovery from adaptation it should be possible to gain insight into the functional properties of cortical neurons which are beyond the spatial resolution limits imposed by conventional fMRI.
Collapse
Affiliation(s)
- K Grill-Spector
- Department of Brain and Cognitive Sciences, MIT, NE20-444, 77 Mass. Ave., Cambridge, MA 02139, USA.
| | | |
Collapse
|
28
|
Abstract
We used magnetoencephalography to search spatio-temporally for cortical activity related to the perception of shape defined by various visual cues in humans. The visual stimuli were three kinds of two-dimensional figures: two had fixed shapes (Diamond and Cross), the other did not (Noise). These figures were defined by three visual cues: difference of flicker, texture or luminance between the foreground and the background in the random dot pattern. Using this stimulus, we recorded the magnetic responses from the temporo-occipital regions of nine healthy subjects. Additionally, we measured the reaction time for the subjects to detect the figure by button-pressing. A magnetic component was identified in the responses. The properties of the first magnetic component differed for stimulus condition. The peak latency of the first magnetic component was different for the cues (270 ms for flicker, 360 ms for texture and 250 ms for luminance), but not for the figures. In contrast, the peak amplitude of the first magnetic component was different for the figures (96-144 fT for Diamond or Cross and 52-80 fT for Noise), but not for the cues. The signal source of the first magnetic component was estimated to lie on the ventral side of the extrastriate cortex: In the posterior part of the inferior temporal cortex, probably in the fusiform gyrus in four subjects, and in the lateral part of the occipital cortex which was outside of the primary visual cortex (visual area 1) in one subject. The signal source location was different inter-individually, but almost the same within each subject. Reaction time was 471 ms for flicker, 569 ms for texture and 426 ms for luminance, but the interval between the reaction time and the peak latency was constant (about 200 ms) for each cue. The first magnetic component was more clearly recorded from the right hemisphere than from the left.We found that the shape defined by the different visual cues activates the same localized site in the lateral extrastriate cortex. This spatial convergence suggests that there is a restricted locus that processes the visual shape regardless of the difference of the visual cue. The correspondence between the peak latency and the reaction time suggests that the activity of the area is responsible for the perception of visual shape. The inter-hemispheric difference suggests a dominance of the right hemisphere in visual shape processing.
Collapse
Affiliation(s)
- T Okusa
- Department of Integrative Physiology, National Institute for Physiological Sciences, Okazaki, Japan.
| | | | | |
Collapse
|
29
|
Janssen P, Vogels R, Orban GA. Selectivity for 3D shape that reveals distinct areas within macaque inferior temporal cortex. Science 2000; 288:2054-6. [PMID: 10856221 DOI: 10.1126/science.288.5473.2054] [Citation(s) in RCA: 206] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
Abstract
The anterior part of the macaque inferior temporal cortex, area TE, occupies a large portion of the temporal lobe and is critical for object recognition. Thus far, no relation between anatomical subdivisions of TE and neuronal selectivity has been described. Here, we present evidence that neurons selective for three-dimensional (3D) shape are concentrated in the lower bank of the superior temporal sulcus, whereas neurons in lateral TE are generally unselective for 3D shape, though equally selective for 2D shape. These findings reveal that TE consists of at least two distinct areas, one of which processes a specific object property.
Collapse
Affiliation(s)
- P Janssen
- Laboratorium voor Neuro-en Psychofysiologie, Medical School, Katholieke Universiteit Leuven, Leuven B-3000, Belgium
| | | | | |
Collapse
|
30
|
Grill-Spector K, Kushnir T, Edelman S, Avidan G, Itzchak Y, Malach R. Differential processing of objects under various viewing conditions in the human lateral occipital complex. Neuron 1999; 24:187-203. [PMID: 10677037 DOI: 10.1016/s0896-6273(00)80832-6] [Citation(s) in RCA: 872] [Impact Index Per Article: 34.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
The invariant properties of human cortical neurons cannot be studied directly by fMRI due to its limited spatial resolution. Here, we circumvented this limitation by using fMR adaptation, namely, reduction of the fMR signal due to repeated presentation of identical images. Object-selective regions (lateral occipital complex [LOC]) showed a monotonic signal decrease as repetition frequency increased. The invariant properties of fMR adaptation were studied by presenting the same object in different viewing conditions. LOC exhibited stronger fMR adaptation to changes in size and position (more invariance) compared to illumination and viewpoint. The effect revealed two putative subdivisions within LOC: caudal-dorsal (LO), which exhibited substantial recovery from adaptation under all transformations, and posterior fusiform (PF/LOa), which displayed stronger adaptation. This study demonstrates the utility of fMR adaptation for revealing functional characteristics of neurons in fMRI studies.
Collapse
|
31
|
Wang G, Tanifuji M, Tanaka K. Functional architecture in monkey inferotemporal cortex revealed by in vivo optical imaging. Neurosci Res 1998; 32:33-46. [PMID: 9831250 DOI: 10.1016/s0168-0102(98)00062-5] [Citation(s) in RCA: 106] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
To investigate the functional organization in the monkey inferotemporal cortex, which is the last exclusively visual area along the ventral visual cortical pathway, optical imaging based on intrinsic signals was carried out. We first conducted single-cell recordings with microelectrodes and determined the features critical for the activation of single cells. For the subsequent optical imaging, each critical feature was presented, which evoked multiple dark spots. Individual spots were approximately 0.5 mm in diameter and one of them covered the site of the electrode penetration at which the particular critical feature had been determined. The degree of stimulus selectivity varied from spot to spot, and from region to region even within a spot. Some regions were activated only by one of 12- 16 stimuli, while others by more than three stimuli. There were spots specifically activated by faces, and the position of activation spot changed gradually along the cortical surface as the stimulus face was rotated in depth. The length of the overall region along the direction of shift of these spots was approximately 1 mm. These results confirm the regional clustering of cells with similar stimulus selectivity and suggest larger units in which some parameters of object features are continuously mapped.
Collapse
Affiliation(s)
- G Wang
- RIKEN Brain Science Institute, Saitama, Japan
| | | | | |
Collapse
|
32
|
Abstract
The past year has seen significant advances in our understanding of the role played by the primary visual cortex (V1) in stereoscopic vision. Recently, the mechanism by which complex cells in V1 respond to random-dot stereograms has been characterized; it appears that their response properties greatly reduce the complexity of one of the critical links for stereopsis, the correspondence problem.
Collapse
Affiliation(s)
- I Ohzawa
- School of Optometry, University of California at Berkeley 94720-2020, USA.
| |
Collapse
|
33
|
Kobatake E, Wang G, Tanaka K. Effects of shape-discrimination training on the selectivity of inferotemporal cells in adult monkeys. J Neurophysiol 1998; 80:324-30. [PMID: 9658053 DOI: 10.1152/jn.1998.80.1.324] [Citation(s) in RCA: 269] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023] Open
Abstract
Through extensive training, humans can become "visual experts, " able to visually distinguish subtle differences among similar objects with greater ease than those who are untrained. To understand the neural mechanisms behind this acquired discrimination ability, adult monkeys were fully trained to discriminate 28 moderately complex shapes. The training effects on the stimulus selectivity of cells in area TE of the inferotemporal cortex were then examined in anesthetized preparations. Area TE represents a later stage of the ventral visual cortical pathway that is known to mediate visual object discrimination and recognition. The recordings from the trained monkeys and untrained controls showed that the proportion of TE cells responsive to some member of the 28 stimuli was significantly greater in the trained monkeys than that in the control monkeys. Cell responses recorded from the trained monkeys were not sharply tuned to single training stimuli, but rather broadly covered several training stimuli. The distances among the training stimuli in the response space spanned by responses of the recorded TE cells were significantly greater in the trained monkeys than those in the control monkeys. The subset of training stimuli to which individual cells responded differed from cell to cell with only partial overlaps, suggesting that the cells responded to features common to several stimuli. These results are consistent with a model in which visual expertise is acquired through the development of differential responses by inferotemporal cells to the images of relevant objects.
Collapse
Affiliation(s)
- E Kobatake
- RIKEN Brain Science Institute, Wako-shi, Saitama 351-01, Japan
| | | | | |
Collapse
|
34
|
Abstract
We explore a new approach to shape recognition based on a virtually infinite family of binary features (queries) of the image data, designed to accommodate prior information about shape invariance and regularity. Each query corresponds to a spatial arrangement of several local topographic codes (or tags), which are in themselves too primitive and common to be informative about shape. All the discriminating power derives from relative angles and distances among the tags. The important attributes of the queries are a natural partial ordering corresponding to increasing structure and complexity; semi-invariance, meaning that most shapes of a given class will answer the same way to two queries that are successive in the ordering; and stability, since the queries are not based on distinguished points and substructures. No classifier based on the full feature set can be evaluated, and it is impossible to determine a priori which arrangements are informative. Our approach is to select informative features and build tree classifiers at the same time by inductive learning. In effect, each tree provides an approximation to the full posterior where the features chosen depend on the branch that is traversed. Due to the number and nature of the queries, standard decision tree construction based on a fixed-length feature vector is not feasible. Instead we entertain only a small random sample of queries at each node, constrain their complexity to increase with tree depth, and grow multiple trees. The terminal nodes are labeled by estimates of the corresponding posterior distribution over shape classes. An image is classified by sending it down every tree and aggregating the resulting distributions. The method is applied to classifying handwritten digits and synthetic linear and nonlinear deformations of three hundred [Formula: see text] symbols. State-of-the-art error rates are achieved on the National Institute of Standards and Technology database of digits. The principal goal of the experiments on [Formula: see text] symbols is to analyze invariance, generalization error and related issues, and a comparison with artificial neural networks methods is presented in this context. [Figure: see text]
Collapse
Affiliation(s)
- Yali Amit
- Department of Statistics, University of Chicago, Chicago, IL, 60637, U.S.A
| | - Donald Geman
- Department of Mathematics and Statistics, University of Massachusetts, Amherst, MA 01003, U.S.A
| |
Collapse
|
35
|
|
36
|
Abstract
Cells in area TE of the inferotemporal cortex of the monkey brain selectively respond to various moderately complex object features, and those that respond to similar features cluster in a columnar region elongated vertical to the cortical surface. Columns representing related but different features partially overlap, and at least in some cases they comprise a continuous map of a piece of complex feature space. This continuous mapping is likely used for various computations, such as production of the image of the object at different viewing angles, illumination conditions, and articulation poses. Copyright 1996 Elsevier Science Ltd.
Collapse
Affiliation(s)
- Keiji Tanaka
- The Institute of Physical and Chemical Research (RIKEN), Japan
| |
Collapse
|
37
|
Divergent projections from the anterior inferotemporal area TE to the perirhinal and entorhinal cortices in the macaque monkey. J Neurosci 1996. [PMID: 8764663 DOI: 10.1523/jneurosci.16-15-04757.1996] [Citation(s) in RCA: 138] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
Area TE is located at the latter part of the ventral visual cortical pathway, which is essential for visual recognition of objects. TE projects heavily to the perirhinal region, which is important for visual recognition memory of objects. To study the organization of projections from TE to the perirhinal (areas 35 and 36) and entorhinal (area 28) cortices, we made focal injections of Phaseolus vulgaris leucoagglutinin (PHA-L) and large injections of biocytin or wheat germ agglutinin conjugated to horseradish peroxidase (WGA-HRP) into anterior levels of TE in macaque monkeys. Injections of PHA-L into the ventral part of anterior TE (TEav) resulted in labeling of terminals distributed widely in area 36 (approximately one-half of its total extent), although the injection sites were limited to 0.7 mm in width. The labeled terminals tended to be denser in the medial part of area 36. There was less dense but definite labeling in area 35 and the lateral part of area 28. After a single injection of PHA-L or WGA-HRP into the dorsal part of anterior TE (TEad), labeled terminals were confined to a small region at the lateral part of area 36 (less than one-tenth of its total extent). The projections to areas 35 and 28 from TEad were much sparser than those from TEav. The different patterns of projections to the perirhinal and entorhinal cortices, together with previously reported differences in their afferent and other efferent connections, suggest the functional differentiation between TEav and TEad. The divergent projection from TEav to the perirhinal cortex may facilitate the association of different visual features in the perirhinal cortex.
Collapse
|
38
|
Wang G, Tanaka K, Tanifuji M. Optical imaging of functional organization in the monkey inferotemporal cortex. Science 1996; 272:1665-8. [PMID: 8658144 DOI: 10.1126/science.272.5268.1665] [Citation(s) in RCA: 248] [Impact Index Per Article: 8.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
To investigate the functional organization of object recognition, the technique of optical imaging was applied to the primate inferotemporal cortex, which is thought to be essential for object recognition. The features critical for the activation of single cells were first determined in unit recordings with electrodes. In the subsequent optical imaging, presentation of the critical features activated patchy regions around 0.5 millimeters in diameter, covering the site of the electrode penetration at which the critical feature had been determined. Because signals in optical imaging reflect average neuronal activities in the regions, the result directly indicates the regional clustering of cells responding to similar features.
Collapse
Affiliation(s)
- G Wang
- Laboratory for Neural Information Processing, Frontier Research Program, Institute of Physical and Chemical Research (RIKEN), Kagoshima University, Japan
| | | | | |
Collapse
|
39
|
Abstract
Intrinsic connections in the inferior temporal cortex were analyzed by making extracellular injections of biocytin in Japanese macaques. Analysis was focused mainly on the dorsal part of area TE, in which a functional columnar organization has been shown. Interlaminar connections were analyzed in coronal section after laminar-specific microinjections, and intralaminar connections were examined from tangential sections. After injections at various depths in the dorsal TE, both axons and cell bodies were strongly labeled above or below the injection site in a columnar appearance. Axons from layer 3 ran in bundles towards the white matter and gave off prominent collaterals in layer 5. Ascending axons from lower to upper layers were also present (e.g., layers 4, 5, and 6 to layer 3). In tangential sections, there were abundant axons running parallel to the pia mater. These horizontal axons, particularly those in layers 2 and 3, produced patches of terminals 0.5 +/- 0.1 mm (mean +/- s.d.) in size and cylindrical in shape, spanning layers 1-3 or even to layers 4 and 5. In the tangential plane, they were distributed in an anisotropic manner around the injection. The farthest patch appeared at 4 mm from the injection site. The center-to-center distance between nearest-neighbor patches was 0.7 +/- 0.3 mm. These patches were found only within the dorsal TE and did not extend into the lower bank of the superior temporal sulcus or into the ventral part of area TE. Area TEO, which is a major afferent source to area TE, had axonal patches with spacing similar to those in area TE but with smaller sizes (0.4 +/- 0.1 mm). The results show that intrinsic horizontal axons both in area TE and in area TEO arborize in a patchy manner, as has been reported for several other cortical areas. In are TE, the size and spacing of the terminal patches match those of columns with similar stimulus selectivity. Thus, these patches may be related to the functional modularity in area TE. Vertical connections across layers and cylindrical patches of horizontal axons most likely contribute to the shared stimulus selectivity among cells within a column.
Collapse
Affiliation(s)
- I Fujita
- Precursory Research for Embryonic Science and Technology (PRESTO), Research Development Corporation of Japan, Japan.
| | | |
Collapse
|