1
|
Dai D, Dong C, Yan Q, Sun Y, Zhang C, Li Z, Xu S. I 2U-Net: A dual-path U-Net with rich information interaction for medical image segmentation. Med Image Anal 2024; 97:103241. [PMID: 38897032 DOI: 10.1016/j.media.2024.103241] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2023] [Revised: 04/27/2024] [Accepted: 06/10/2024] [Indexed: 06/21/2024]
Abstract
Although the U-shape networks have achieved remarkable performances in many medical image segmentation tasks, they rarely model the sequential relationship of hierarchical layers. This weakness makes it difficult for the current layer to effectively utilize the historical information of the previous layer, leading to unsatisfactory segmentation results for lesions with blurred boundaries and irregular shapes. To solve this problem, we propose a novel dual-path U-Net, dubbed I2U-Net. The newly proposed network encourages historical information re-usage and re-exploration through rich information interaction among the dual paths, allowing deep layers to learn more comprehensive features that contain both low-level detail description and high-level semantic abstraction. Specifically, we introduce a multi-functional information interaction module (MFII), which can model cross-path, cross-layer, and cross-path-and-layer information interactions via a unified design, making the proposed I2U-Net behave similarly to an unfolded RNN and enjoying its advantage of modeling time sequence information. Besides, to further selectively and sensitively integrate the information extracted by the encoder of the dual paths, we propose a holistic information fusion and augmentation module (HIFA), which can efficiently bridge the encoder and the decoder. Extensive experiments on four challenging tasks, including skin lesion, polyp, brain tumor, and abdominal multi-organ segmentation, consistently show that the proposed I2U-Net has superior performance and generalization ability over other state-of-the-art methods. The code is available at https://github.com/duweidai/I2U-Net.
Collapse
Affiliation(s)
- Duwei Dai
- National-Local Joint Engineering Research Center of Biodiagnosis & Biotherapy, the Second Affiliated Hospital of Xi'an Jiaotong University, Xi'an, 710004, China; Institute of Medical Artificial Intelligence, the Second Affiliated Hospital of Xi'an Jiaotong University, Xi'an, 710004, China
| | - Caixia Dong
- Institute of Medical Artificial Intelligence, the Second Affiliated Hospital of Xi'an Jiaotong University, Xi'an, 710004, China
| | - Qingsen Yan
- School of Computer Science, Northwestern Polytechnical University, Xi'an, 710072, China
| | - Yongheng Sun
- School of Mathematics and Statistics, Xi'an Jiaotong University, Xi'an 710049, China
| | - Chunyan Zhang
- National-Local Joint Engineering Research Center of Biodiagnosis & Biotherapy, the Second Affiliated Hospital of Xi'an Jiaotong University, Xi'an, 710004, China
| | - Zongfang Li
- National-Local Joint Engineering Research Center of Biodiagnosis & Biotherapy, the Second Affiliated Hospital of Xi'an Jiaotong University, Xi'an, 710004, China; Institute of Medical Artificial Intelligence, the Second Affiliated Hospital of Xi'an Jiaotong University, Xi'an, 710004, China.
| | - Songhua Xu
- Institute of Medical Artificial Intelligence, the Second Affiliated Hospital of Xi'an Jiaotong University, Xi'an, 710004, China.
| |
Collapse
|
2
|
Salaün JP, Borst G, Cachia A, Orliac F, Vivien D, Poirel N. Effects of general anaesthesia in early childhood on local and global visual processing: a post hoc analysis of the APEX cohort study. Br J Anaesth 2024:S0007-0912(24)00281-2. [PMID: 38862383 DOI: 10.1016/j.bja.2024.05.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2024] [Revised: 04/28/2024] [Accepted: 05/05/2024] [Indexed: 06/13/2024] Open
Abstract
BACKGROUND Preclinical studies suggest that early exposure to anaesthesia alters the visual system in mice and non-human primates. We investigated whether exposure to general anaesthesia leads to visual attention processing changes in children, which could potentially impact essential life skills, including learning. METHODS This was a post hoc analysis of data from the APprentissages EXécutifs et cerveau chez les enfants d'âge scolaire (APEX) cohort study. A total of 24 healthy 9-10-yr-old children who were or were not exposed to general anaesthesia (for surgery) by a mean age of 3.8 (2.6) yr performed a visual attention task to evaluate ability to process either local details or general global visual information. Whether children were distracted by visual interference during global and local information processing was also assessed. RESULTS Participants included in the analyses (n=12 participants exposed to general anaesthesia and n=12 controls) successfully completed (>90% of correct answers) the trial tasks. Children from both groups were equally distracted by visual interference. However, children who had been exposed to general anaesthesia were more attracted to global visual information than were control children (P=0.03). CONCLUSIONS These findings suggest lasting effects of early-life exposure to general anaesthesia on visuospatial abilities. Further investigations of the mechanisms by which general anaesthesia could have delayed effects on how children perceive their visual environment are needed.
Collapse
Affiliation(s)
- Jean-Philippe Salaün
- Normandie Université, UNICAEN, Université Caen Normandie, INSERM, GIP CYCERON, Institut Blood and Brain @Caen-Normandie (BB@C), UMR-S U1237, Physiopathology and Imaging of Neurological Disorders (PhIND), Caen, France; Department of Anaesthesiology and Critical Care Medicine, Centre Hospitalier Universitaire Caen Normandie, 14000 Caen, France.
| | - Grégoire Borst
- Université Paris Cité, LaPsyDÉ, CNRS, Paris, France; Institut Universitaire de France (IUF), Paris, France
| | - Arnaud Cachia
- Université Paris Cité, LaPsyDÉ, CNRS, Paris, France; Institut Universitaire de France (IUF), Paris, France
| | | | - Denis Vivien
- Normandie Université, UNICAEN, Université Caen Normandie, INSERM, GIP CYCERON, Institut Blood and Brain @Caen-Normandie (BB@C), UMR-S U1237, Physiopathology and Imaging of Neurological Disorders (PhIND), Caen, France; Department of Clinical Research and Innovation, Centre Hospitalier Universitaire Caen Normandie, Caen, France
| | - Nicolas Poirel
- Université Paris Cité, LaPsyDÉ, CNRS, Paris, France; Institut Universitaire de France (IUF), Paris, France; GIP CYCERON, Caen, France
| |
Collapse
|
3
|
Bonfanti D, Mazzi C, Savazzi S. Mapping the routes of perception: Hemispheric asymmetries in signal propagation dynamics. Psychophysiology 2024; 61:e14529. [PMID: 38279560 DOI: 10.1111/psyp.14529] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2023] [Revised: 12/14/2023] [Accepted: 01/11/2024] [Indexed: 01/28/2024]
Abstract
The visual system has long been considered equivalent across hemispheres. However, an increasing amount of data shows that functional differences may exist in this regard. We therefore tried to characterize the emergence of visual perception and the spatiotemporal dynamics resulting from the stimulation of visual cortices in order to detect possible interhemispheric asymmetries. Eighteen participants were tested. Each of them received 360 transcranial magnetic stimulation (TMS) pulses at phosphene threshold intensity over left and right early visual areas while electroencephalography was being recorded. After each single pulse, participants had to report the presence or absence of a phosphene. Local mean field power analysis of TMS-evoked potentials showed an effect of both site (left vs. right TMS) of stimulation and hemisphere (ipsilateral vs. contralateral to the TMS): while right TMS determined early stronger activations, left TMS determined later stronger activity in contralateral electrodes. The interhemispheric signal propagation index revealed differences in how TMS-evoked activity spreads: left TMS-induced activity diffused contralaterally more than right stimulation. With regard to phosphenes perception, distinct electrophysiological patterns were found to reflect similar perceptual experiences: left TMS-evoked phosphenes are associated with early occipito-parietal and frontal activity followed by late central activity; right TMS-evoked phosphenes determine only late, fronto-central, and parietal activations. Our results show that left and right occipital TMS elicits differential electrophysiological patterns in the brain, both per se and as a function of phosphene perception. These distinct activation patterns may suggest a different role of the two hemispheres in processing visual information and giving rise to perception.
Collapse
Affiliation(s)
- Davide Bonfanti
- Perception and Awareness (PandA) Laboratory, Department of Neuroscience, Biomedicine and Movement Sciences, University of Verona, Verona, Italy
| | - Chiara Mazzi
- Perception and Awareness (PandA) Laboratory, Department of Neuroscience, Biomedicine and Movement Sciences, University of Verona, Verona, Italy
| | - Silvia Savazzi
- Perception and Awareness (PandA) Laboratory, Department of Neuroscience, Biomedicine and Movement Sciences, University of Verona, Verona, Italy
| |
Collapse
|
4
|
Ito M, Suzuki A. Discrepancies in perceived humanness between spatially filtered and unfiltered faces and their associations with uncanny feelings. Perception 2024:3010066241252355. [PMID: 38752230 DOI: 10.1177/03010066241252355] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/24/2024]
Abstract
Human and artificial features that coexist in certain types of human-like robots create a discrepancy in perceived humanness and evoke uncanny feelings in human observers. However, whether this perceptual mismatch in humanness occurs for all faces, and whether it is related to the uncanny feelings toward them, is unknown. We investigated this by examining perceived humanness for a variety of natural images of robot and human faces with different spatial frequency (SF) information: that is, faces with only low SF, middle SF, and high SF information, and intact (spatially unfiltered) faces. Uncanny feelings elicited by these faces were also measured. The results showed perceptual mismatches that LSF, MSF, and HSF faces were perceived as more human than intact faces. This was particularly true for intact robot faces that looked slightly human, which tended to evoke strong uncanny feelings. Importantly, the mismatch in perceived humanness between the intact and spatially filtered faces was positively correlated with uncanny feelings toward intact faces. Given that the human visual system performs SF analysis when processing faces, the perceptual mismatches observed in this study likely occur in real life for all faces, and as such might be a ubiquitous source of uncanny feelings in real-life situations.
Collapse
|
5
|
Bertrand Pilon C, Arguin M. The processing of spatial frequencies through time in visual word recognition. Sci Rep 2024; 14:6628. [PMID: 38503810 PMCID: PMC10951267 DOI: 10.1038/s41598-024-57219-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2023] [Accepted: 03/15/2024] [Indexed: 03/21/2024] Open
Abstract
This study examined the temporal profile of spatial frequency processing in a word reading task in 16 normal adult readers. They had to report the word presented in a 200 ms display using a four-alternative forced-choice task (4AFC). The stimuli were made of an additive combination of the signal (i.e. the target word) and of a visual white noise patch wherein the signal-to-noise ratio varied randomly across stimulus duration. Four spatial frequency conditions were defined for the signal component of the stimulus (bandpass Butterworth filters with center frequencies of 1.2, 2.4, 4.8 and 9.6 cycles per degree). In contrast to the coarse-to-fine theory of visual recognition, the results show that the highest spatial frequency range dominates early processing, with a shift toward lower spatial frequencies at later points during stimulus exposure. This pattern interacted in a complex way with the temporal frequency content of signal-to-noise oscillations. The outcome of individual data patterns classification by a machine learning algorithm according to the corresponding spatial frequency band further shows that the most salient spatial frequency signature is obtained when the time dimension within data patterns is recoded into its Fourier transform.
Collapse
Affiliation(s)
- Clémence Bertrand Pilon
- Department of Psychology, Université de Montréal, C.P. 6128, Succ. Centre-ville, Montréal, QC, H3C 3J7, Canada
- Centre interdisciplinaire de recherche sur le cerveau et l'apprentissage (CIRCA), Department of Psychology, Université de Montréal, Montreal, QC, Canada
| | - Martin Arguin
- Department of Psychology, Université de Montréal, C.P. 6128, Succ. Centre-ville, Montréal, QC, H3C 3J7, Canada.
- Centre de recherche de l'Institut universitaire de gériatrie de Montréal, Montreal, QC, Canada.
- Centre interdisciplinaire de recherche sur le cerveau et l'apprentissage (CIRCA), Department of Psychology, Université de Montréal, Montreal, QC, Canada.
| |
Collapse
|
6
|
Nara S, Kaiser D. Integrative processing in artificial and biological vision predicts the perceived beauty of natural images. SCIENCE ADVANCES 2024; 10:eadi9294. [PMID: 38427730 PMCID: PMC10906925 DOI: 10.1126/sciadv.adi9294] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/26/2023] [Accepted: 01/29/2024] [Indexed: 03/03/2024]
Abstract
Previous research shows that the beauty of natural images is already determined during perceptual analysis. However, it is unclear which perceptual computations give rise to the perception of beauty. Here, we tested whether perceived beauty is predicted by spatial integration across an image, a perceptual computation that reduces processing demands by aggregating image parts into more efficient representations of the whole. We quantified integrative processing in an artificial deep neural network model, where the degree of integration was determined by the amount of deviation between activations for the whole image and its constituent parts. This quantification of integration predicted beauty ratings for natural images across four studies with different stimuli and designs. In a complementary functional magnetic resonance imaging study, we show that integrative processing in human visual cortex similarly predicts perceived beauty. Together, our results establish integration as a computational principle that facilitates perceptual analysis and thereby mediates the perception of beauty.
Collapse
Affiliation(s)
- Sanjeev Nara
- Mathematical Institute, Department of Mathematics and Computer Science, Physics, Geography, Justus Liebig University Gießen, Gießen Germany
| | - Daniel Kaiser
- Mathematical Institute, Department of Mathematics and Computer Science, Physics, Geography, Justus Liebig University Gießen, Gießen Germany
- Center for Mind, Brain and Behavior (CMBB), Philipps-University Marburg and Justus Liebig University Gießen, Marburg, Germany
| |
Collapse
|
7
|
Goktepe N, Schütz AC. Frequency-specific and periodic masking of peripheral characters by delayed foveal input. Sci Rep 2024; 14:4642. [PMID: 38409140 PMCID: PMC10897220 DOI: 10.1038/s41598-024-51710-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2023] [Accepted: 01/09/2024] [Indexed: 02/28/2024] Open
Abstract
The foveal-feedback mechanism supports peripheral object recognition by processing information about peripheral objects in foveal retinotopic visual cortex. When a foveal object is asynchronously presented with a peripheral target, peripheral discrimination performance is affected differently depending on the relationship between the foveal and peripheral objects. However, it is not clear whether the delayed foveal input competes for foveal resources with the information processed by foveal-feedback or masks it. In the current study, we tested these hypotheses by measuring the effect of foveal noise at different spatial frequencies on peripheral discrimination of familiar and novel characters. Our results showed that the impairment of foveal-feedback was strongest for low-spatial frequency noise. A control experiment revealed that for spatially overlapping noise, low-spatial frequencies were more effective than medium-spatial frequencies in the periphery, but vice versa in the fovea. This suggests that the delayed foveal input selectively masks foveal-feedback when it is sufficiently similar to the peripheral information. Additionally, this foveal masking was periodic as evidenced by behavioral oscillations at around 5 Hz. Thus, we conclude that foveal-feedback supports peripheral discrimination of familiar and novel objects by periodically processing peripheral object information.
Collapse
Affiliation(s)
- Nedim Goktepe
- AG Allgemeine und Biologische Psychologie, Philipps-Universität Marburg, Marburg, Germany.
| | - Alexander C Schütz
- AG Allgemeine und Biologische Psychologie, Philipps-Universität Marburg, Marburg, Germany
- Center for Mind, Brain and Behavior, Universities of Marburg, Giessen, and Darmstadt, Marburg, Germany
| |
Collapse
|
8
|
Yoshida M, Miura K, Fujimoto M, Yamamori H, Yasuda Y, Iwase M, Hashimoto R. Visual salience is affected in participants with schizophrenia during free-viewing. Sci Rep 2024; 14:4606. [PMID: 38409435 PMCID: PMC10897421 DOI: 10.1038/s41598-024-55359-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2023] [Accepted: 02/22/2024] [Indexed: 02/28/2024] Open
Abstract
Abnormalities in visual exploration affect the daily lives of patients with schizophrenia. For example, scanpath length during free-viewing is shorter in schizophrenia. However, its origin and its relevance to symptoms are unknown. Here we investigate the possibility that abnormalities in eye movements result from abnormalities in visual or visuo-cognitive processing. More specifically, we examined whether such abnormalities reflect visual salience in schizophrenia. Eye movements of 82 patients and 252 healthy individuals viewing natural and/or complex images were examined using saliency maps for static images to determine the contributions of low-level visual features to salience-guided eye movements. The results showed that the mean value for orientation salience at the gazes of the participants with schizophrenia were higher than that of the healthy control subjects. Further analyses revealed that orientation salience defined by the L + M channel of the DKL color space is specifically affected in schizophrenia, suggesting abnormalities in the magnocellular visual pathway. By looking into the computational stages of the visual salience, we found that the difference between schizophrenia and healthy control emerges at the earlier stage, suggesting functional decline in early visual processing. These results suggest that visual salience is affected in schizophrenia, thereby expanding the concept of the aberrant salience hypothesis of psychosis to the visual domain.
Collapse
Affiliation(s)
- Masatoshi Yoshida
- Center for Human Nature, Artificial Intelligence, and Neuroscience (CHAIN), Hokkaido University, Sapporo, Japan.
| | - Kenichiro Miura
- Department of Pathology of Mental Diseases, National Institute of Mental Health, National Center of Neurology and Psychiatry, Kodaira, Japan.
| | - Michiko Fujimoto
- Department of Pathology of Mental Diseases, National Institute of Mental Health, National Center of Neurology and Psychiatry, Kodaira, Japan
- Department of Psychiatry, Osaka University Graduate School of Medicine, Suita, Japan
| | - Hidenaga Yamamori
- Department of Pathology of Mental Diseases, National Institute of Mental Health, National Center of Neurology and Psychiatry, Kodaira, Japan
- Department of Psychiatry, Osaka University Graduate School of Medicine, Suita, Japan
- Japan Community Health Care Organization, Osaka Hospital, Osaka, Japan
| | - Yuka Yasuda
- Department of Pathology of Mental Diseases, National Institute of Mental Health, National Center of Neurology and Psychiatry, Kodaira, Japan
- Medical Corporation Foster, Life Grow Brilliant Mental Clinic, Osaka, Japan
| | - Masao Iwase
- Department of Psychiatry, Osaka University Graduate School of Medicine, Suita, Japan
- Osaka Psychiatric Research Center, Osaka Psychiatric Medical Center, Osaka Prefectural Hospital Organization, Hirakata, Japan
| | - Ryota Hashimoto
- Department of Pathology of Mental Diseases, National Institute of Mental Health, National Center of Neurology and Psychiatry, Kodaira, Japan
| |
Collapse
|
9
|
Adámek P, Grygarová D, Jajcay L, Bakštein E, Fürstová P, Juríčková V, Jonáš J, Langová V, Neskoroďana I, Kesner L, Horáček J. The Gaze of Schizophrenia Patients Captured by Bottom-up Saliency. SCHIZOPHRENIA (HEIDELBERG, GERMANY) 2024; 10:21. [PMID: 38378724 PMCID: PMC10879495 DOI: 10.1038/s41537-024-00438-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/12/2023] [Accepted: 01/19/2024] [Indexed: 02/22/2024]
Abstract
Schizophrenia (SCHZ) notably impacts various human perceptual modalities, including vision. Prior research has identified marked abnormalities in perceptual organization in SCHZ, predominantly attributed to deficits in bottom-up processing. Our study introduces a novel paradigm to differentiate the roles of top-down and bottom-up processes in visual perception in SCHZ. We analysed eye-tracking fixation ground truth maps from 28 SCHZ patients and 25 healthy controls (HC), comparing these with two mathematical models of visual saliency: one bottom-up, based on the physical attributes of images, and the other top-down, incorporating machine learning. While the bottom-up (GBVS) model revealed no significant overall differences between groups (beta = 0.01, p = 0.281, with a marginal increase in SCHZ patients), it did show enhanced performance by SCHZ patients with highly salient images. Conversely, the top-down (EML-Net) model indicated no general group difference (beta = -0.03, p = 0.206, lower in SCHZ patients) but highlighted significantly reduced performance in SCHZ patients for images depicting social interactions (beta = -0.06, p < 0.001). Over time, the disparity between the groups diminished for both models. The previously reported bottom-up bias in SCHZ patients was apparent only during the initial stages of visual exploration and corresponded with progressively shorter fixation durations in this group. Our research proposes an innovative approach to understanding early visual information processing in SCHZ patients, shedding light on the interplay between bottom-up perception and top-down cognition.
Collapse
Affiliation(s)
- Petr Adámek
- Center for Advanced Studies of Brain and Consciousness, National Institute of Mental Health, Klecany, Czech Republic.
- Third Faculty of Medicine, Charles University, Prague, Czech Republic.
| | - Dominika Grygarová
- Center for Advanced Studies of Brain and Consciousness, National Institute of Mental Health, Klecany, Czech Republic
- Third Faculty of Medicine, Charles University, Prague, Czech Republic
| | - Lucia Jajcay
- Center for Advanced Studies of Brain and Consciousness, National Institute of Mental Health, Klecany, Czech Republic
- Institute of Computer Science of the Czech Academy of Sciences, Prague, Czech Republic
- Faculty of Electrical Engineering, Czech Technical University in Prague, Prague, Czech Republic
| | - Eduard Bakštein
- Early Episodes of SMI Research Center, National Institute of Mental Health, Klecany, Czech Republic
- Department of Cybernetics, Faculty of Electrical Engineering, Czech Technical University, Prague, Czech Republic
| | - Petra Fürstová
- Early Episodes of SMI Research Center, National Institute of Mental Health, Klecany, Czech Republic
| | - Veronika Juríčková
- Center for Advanced Studies of Brain and Consciousness, National Institute of Mental Health, Klecany, Czech Republic
- First Faculty of Medicine, Charles University, Prague, Czech Republic
| | - Juraj Jonáš
- Center for Advanced Studies of Brain and Consciousness, National Institute of Mental Health, Klecany, Czech Republic
- Faculty of Humanities, Charles University, Prague, Czech Republic
| | - Veronika Langová
- Center for Advanced Studies of Brain and Consciousness, National Institute of Mental Health, Klecany, Czech Republic
- Third Faculty of Medicine, Charles University, Prague, Czech Republic
| | - Iryna Neskoroďana
- Center for Advanced Studies of Brain and Consciousness, National Institute of Mental Health, Klecany, Czech Republic
| | - Ladislav Kesner
- Center for Advanced Studies of Brain and Consciousness, National Institute of Mental Health, Klecany, Czech Republic
- Department of Art History, Masaryk University, Brno, Czech Republic
| | - Jiří Horáček
- Center for Advanced Studies of Brain and Consciousness, National Institute of Mental Health, Klecany, Czech Republic
- Third Faculty of Medicine, Charles University, Prague, Czech Republic
| |
Collapse
|
10
|
Zhao D, Shen X, Li S, He W. The Impact of Spatial Frequency on the Perception of Crowd Emotion: An fMRI Study. Brain Sci 2023; 13:1699. [PMID: 38137147 PMCID: PMC10742193 DOI: 10.3390/brainsci13121699] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2023] [Revised: 12/06/2023] [Accepted: 12/07/2023] [Indexed: 12/24/2023] Open
Abstract
Recognizing the emotions of faces in a crowd is crucial for understanding overall behavior and intention as well as for smooth and friendly social interactions. However, it is unclear whether the spatial frequency of faces affects the discrimination of crowd emotion. Although high- and low-spatial-frequency information for individual faces is processed by distinct neural channels, there is a lack of evidence on how this applies to crowd faces. Here, we used functional magnetic resonance imaging (fMRI) to investigate neural representations of crowd faces at different spatial frequencies. Thirty-three participants were asked to compare whether a test face was happy or more fearful than a crowd face that varied in high, low, and broad spatial frequencies. Our findings revealed that fearful faces with low spatial frequencies were easier to recognize in terms of accuracy (78.9%) and response time (927 ms). Brain regions, such as the fusiform gyrus, located in the ventral visual stream, were preferentially activated in high spatial frequency crowds, which, however, were the most difficult to recognize behaviorally (68.9%). Finally, the right inferior frontal gyrus was found to be better activated in the broad spatial frequency crowds. Our study suggests that people are more sensitive to fearful crowd faces with low spatial frequency and that high spatial frequency does not promote crowd face recognition.
Collapse
Affiliation(s)
- Dongfang Zhao
- Research Center of Brain and Cognitive Neuroscience, Liaoning Normal University, Dalian 116029, China; (D.Z.); (X.S.); (S.L.)
- Key Laboratory of Brain and Cognitive Neuroscience, Liaoning Province, Dalian 116029, China
| | - Xiangnan Shen
- Research Center of Brain and Cognitive Neuroscience, Liaoning Normal University, Dalian 116029, China; (D.Z.); (X.S.); (S.L.)
- Key Laboratory of Brain and Cognitive Neuroscience, Liaoning Province, Dalian 116029, China
| | - Shuaixia Li
- Research Center of Brain and Cognitive Neuroscience, Liaoning Normal University, Dalian 116029, China; (D.Z.); (X.S.); (S.L.)
- Key Laboratory of Brain and Cognitive Neuroscience, Liaoning Province, Dalian 116029, China
| | - Weiqi He
- Research Center of Brain and Cognitive Neuroscience, Liaoning Normal University, Dalian 116029, China; (D.Z.); (X.S.); (S.L.)
- Key Laboratory of Brain and Cognitive Neuroscience, Liaoning Province, Dalian 116029, China
| |
Collapse
|
11
|
Garric C, Wamain Y, Rouland JF, Lenoble Q. Glaucoma-associated abnormalities in cortical activity during a visuocognitive task. Clin Neurophysiol 2023; 156:47-56. [PMID: 37866076 DOI: 10.1016/j.clinph.2023.09.012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2023] [Revised: 07/27/2023] [Accepted: 09/04/2023] [Indexed: 10/24/2023]
Abstract
OBJECTIVE To investigate neurophysiological dynamics during a visuocognitive task in glaucoma patients vs. healthy controls. METHODS Fifteen patients with early-stage primary open-angle glaucoma (POAG) and fifteen age-matched healthy participants underwent a "go/no-go" task, monitored with EEG. Participants had to semantically categorize visual objects in central vision, with animal or furniture as targets according to the experimental block. RESULTS Early visual processing was delayed by 50 ms in patients with POAG compared to controls. The patients displayed a smaller difference between animal and furniture categorization during higher-level cognitive processing (at 400-600 ms). Regarding behavioral data, the groups differed in accuracy performance and decision criterion. As opposed to the control group, patients did not display facilitation and a higher accuracy rate for animal stimuli. However, patients maintained a consistent decision criterion throughout the experiment, whereas controls displayed a shift towards worse decision criteria in furniture trials, with higher error rate. CONCLUSIONS The comparative analysis of behavioral and neurophysiological data revealed in POAG patients a delay in early visual processing, and potential high-level cognitive compensation during late, task-dependent activations. SIGNIFICANCE To our knowledge, our findings provide the first evidence of modification in cognitive brain dynamics associated with POAG.
Collapse
Affiliation(s)
- Clémentine Garric
- Univ. Lille, INSERM, CHU Lille, U1172 - LilNCog - Lille Neuroscience and Cognition, F-59000 Lille, France.
| | - Yannick Wamain
- Univ. Lille, CNRS, CHU Lille, UMR 9193, SCALab, Sciences Cognitives et Sciences Affectives, F-59000 Lille, France.
| | - Jean-François Rouland
- Ophthalmology Department, Claude Huriez Hospital, University of Lille, F-59000 Lille, France.
| | - Quentin Lenoble
- Univ. Lille, INSERM, CHU Lille, U1172 - LilNCog - Lille Neuroscience and Cognition, F-59000 Lille, France.
| |
Collapse
|
12
|
Orima T, Motoyoshi I. Spatiotemporal cortical dynamics for visual scene processing as revealed by EEG decoding. Front Neurosci 2023; 17:1167719. [PMID: 38027518 PMCID: PMC10646306 DOI: 10.3389/fnins.2023.1167719] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2023] [Accepted: 10/16/2023] [Indexed: 12/01/2023] Open
Abstract
The human visual system rapidly recognizes the categories and global properties of complex natural scenes. The present study investigated the spatiotemporal dynamics of neural signals involved in visual scene processing using electroencephalography (EEG) decoding. We recorded visual evoked potentials from 11 human observers for 232 natural scenes, each of which belonged to one of 13 natural scene categories (e.g., a bedroom or open country) and had three global properties (naturalness, openness, and roughness). We trained a deep convolutional classification model of the natural scene categories and global properties using EEGNet. Having confirmed that the model successfully classified natural scene categories and the three global properties, we applied Grad-CAM to the EEGNet model to visualize the EEG channels and time points that contributed to the classification. The analysis showed that EEG signals in the occipital electrodes at short latencies (approximately 80 ~ ms) contributed to the classifications, whereas those in the frontal electrodes at relatively long latencies (200 ~ ms) contributed to the classification of naturalness and the individual scene category. These results suggest that different global properties are encoded in different cortical areas and with different timings, and that the combination of the EEGNet model and Grad-CAM can be a tool to investigate both temporal and spatial distribution of natural scene processing in the human brain.
Collapse
Affiliation(s)
- Taiki Orima
- Department of Life Sciences, The University of Tokyo, Tokyo, Japan
- Japan Society for the Promotion of Science, Tokyo, Japan
| | - Isamu Motoyoshi
- Department of Life Sciences, The University of Tokyo, Tokyo, Japan
| |
Collapse
|
13
|
Wang H, Lian Y, Wang A, Chen E, Liu C. Face motion form at learning influences the time course of face spatial frequency processing during test. Biol Psychol 2023; 183:108691. [PMID: 37748703 DOI: 10.1016/j.biopsycho.2023.108691] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2023] [Revised: 09/05/2023] [Accepted: 09/21/2023] [Indexed: 09/27/2023]
Abstract
Studies that use static faces suggest that facial processing follows a coarse-to-fine sequence; i.e., holistic precedes featural processing, due to low and high spatial frequencies (LSF, HSF) transmitting holistic/global and featural/local information respectively. Although recent studies have focused on the role of facial movement in holistic facial processing, it is unclear whether moving faces have the same processing mechanism as static ones, especially in the time course of processing. The current study uses the event-related potential technique to investigate this issue by manipulating the facial format at study and face spatial frequency during the test. ERP results showed that the P1 amplitude was increased by LSF faces relative to HSF ones, using both moving and static study faces, with the former larger than the latter. The N170 amplitude was more sensitive to HSF than LSF faces when only static study faces were used, while the P2 amplitude was more sensitive to LSF faces regardless of the facial study format. The above results were not modulated by the race of the faces. These results favor the view that regardless of face race, moving study faces promote holistic processing during the earliest stage of face recognition. Furthermore, holistic processing is observed to be the same for both static and moving study faces at a later stage associated with more in-depth processing. It is evident that facial motion should be factored into further studies of face recognition, given the distinctions between holistic and featural processing for moving and static study faces.
Collapse
Affiliation(s)
- Hailing Wang
- School of Psychology, Shandong Normal University, Jinan 250358, China.
| | - Yujing Lian
- School of Psychology, Shandong Normal University, Jinan 250358, China
| | - Anqing Wang
- School of Psychology, Shandong Normal University, Jinan 250358, China
| | - Enguang Chen
- School of Psychology, Shandong Normal University, Jinan 250358, China
| | - Chengdong Liu
- School of Psychology, Shandong Normal University, Jinan 250358, China
| |
Collapse
|
14
|
Entzmann L, Guyader N, Kauffmann L, Peyrin C, Mermillod M. Detection of emotional faces: The role of spatial frequencies and local features. Vision Res 2023; 211:108281. [PMID: 37421829 DOI: 10.1016/j.visres.2023.108281] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2022] [Revised: 06/18/2023] [Accepted: 06/28/2023] [Indexed: 07/10/2023]
Abstract
Models of emotion processing suggest that threat-related stimuli such as fearful faces can be detected based on the rapid extraction of low spatial frequencies. However, this remains debated as other models argue that the decoding of facial expressions occurs with a more flexible use of spatial frequencies. The purpose of this study was to clarify the role of spatial frequencies and differences in luminance contrast between spatial frequencies, on the detection of facial emotions. We used a saccadic choice task in which emotional-neutral face pairs were presented and participants were asked to make a saccade toward the neutral or the emotional (happy or fearful) face. Faces were displayed either in low, high, or broad spatial frequencies. Results showed that participants were better to saccade toward the emotional face. They were also better for high or broad than low spatial frequencies, and the accuracy was higher with a happy target. An analysis of the eye and mouth saliency ofour stimuli revealed that the mouth saliency of the target correlates with participants' performance. Overall, this study underlines the importance of local more than global information, and of the saliency of the mouth region in the detection of emotional and neutral faces.
Collapse
Affiliation(s)
- Léa Entzmann
- Univ. Grenoble Alpes, Univ. Savoie Mont Blanc, CNRS, LPNC, 38000 Grenoble, France; Univ. Grenoble Alpes, CNRS, Grenoble INP, GIPSA-lab, 38000 Grenoble, France; Icelandic Vision Lab, School of Health Sciences, University of Iceland, Reykjavík, Iceland.
| | - Nathalie Guyader
- Univ. Grenoble Alpes, CNRS, Grenoble INP, GIPSA-lab, 38000 Grenoble, France
| | - Louise Kauffmann
- Univ. Grenoble Alpes, Univ. Savoie Mont Blanc, CNRS, LPNC, 38000 Grenoble, France
| | - Carole Peyrin
- Univ. Grenoble Alpes, Univ. Savoie Mont Blanc, CNRS, LPNC, 38000 Grenoble, France
| | - Martial Mermillod
- Univ. Grenoble Alpes, Univ. Savoie Mont Blanc, CNRS, LPNC, 38000 Grenoble, France
| |
Collapse
|
15
|
Kallmayer A, Võ MLH, Draschkow D. Viewpoint dependence and scene context effects generalize to depth rotated three-dimensional objects. J Vis 2023; 23:9. [PMID: 37707802 PMCID: PMC10506680 DOI: 10.1167/jov.23.10.9] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2022] [Accepted: 08/17/2023] [Indexed: 09/15/2023] Open
Abstract
Viewpoint effects on object recognition interact with object-scene consistency effects. While recognition of objects seen from "noncanonical" viewpoints (e.g., a cup from below) is typically impeded compared to processing of objects seen from canonical viewpoints (e.g., the string-side of a guitar), this effect is reduced by meaningful scene context information. In the present study we investigated if these findings established by using photographic images, generalize to strongly noncanonical orientations of three-dimensional (3D) models of objects. Using 3D models allowed us to probe a broad range of viewpoints and empirically establish viewpoints with very strong noncanonical and canonical orientations. In Experiment 1, we presented 3D models of objects from six different viewpoints (0°, 60°, 120°, 180° 240°, 300°) in color (1a) and grayscaled (1b) in a sequential matching task. Viewpoint had a significant effect on accuracy and response times. Based on the viewpoint effect in Experiments 1a and 1b, we could empirically determine the most canonical and noncanonical viewpoints from our set of viewpoints to use in Experiment 2. In Experiment 2, participants again performed a sequential matching task, however now the objects were paired with scene backgrounds which could be either consistent (e.g., a cup in the kitchen) or inconsistent (e.g., a guitar in the bathroom) to the object. Viewpoint interacted significantly with scene consistency in that object recognition was less affected by viewpoint when consistent scene information was provided, compared to inconsistent information. Our results show that scene context supports object recognition even when using extremely noncanonical orientations of depth rotated 3D objects. This supports the important role object-scene processing plays for object constancy especially under conditions of high uncertainty.
Collapse
Affiliation(s)
- Aylin Kallmayer
- Department of Psychology, Goethe University Frankfurt, Frankfurt am Main, Germany
| | - Melissa L-H Võ
- Department of Psychology, Goethe University Frankfurt, Frankfurt am Main, Germany
| | - Dejan Draschkow
- Department of Experimental Psychology, University of Oxford, Oxford, UK
- Oxford Centre for Human Brain Activity, Wellcome Centre for Integrative Neuroimaging, Department of Psychiatry, University of Oxford, Oxford, UK
| |
Collapse
|
16
|
Nuthmann A, Clark CNL. Pseudoneglect during object search in naturalistic scenes. Exp Brain Res 2023; 241:2345-2360. [PMID: 37610677 PMCID: PMC10471692 DOI: 10.1007/s00221-023-06679-6] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2023] [Accepted: 07/25/2023] [Indexed: 08/24/2023]
Abstract
Pseudoneglect, that is the tendency to pay more attention to the left side of space, is typically assessed with paper-and-pencil tasks, particularly line bisection. In the present study, we used an everyday task with more complex stimuli. Subjects' task was to look for pre-specified objects in images of real-world scenes. In half of the scenes, the search object was located on the left side of the image (L-target); in the other half of the scenes, the target was on the right side (R-target). To control for left-right differences in the composition of the scenes, half of the scenes were mirrored horizontally. Eye-movement recordings were used to track the course of pseudoneglect on a millisecond timescale. Subjects' initial eye movements were biased to the left of the scene, but less so for R-targets than for L-targets, indicating that pseudoneglect was modulated by task demands and scene guidance. We further analyzed how horizontal gaze positions changed over time. When the data for L- and R-targets were pooled, the leftward bias lasted, on average, until the first second of the search process came to an end. Even for right-side targets, the gaze data showed an early left-bias, which was compensated by adjustments in the direction and amplitude of later saccades. Importantly, we found that pseudoneglect affected search efficiency by leading to less efficient scan paths and consequently longer search times for R-targets compared with L-targets. It may therefore be prudent to take spatial asymmetries into account when studying visual search in scenes.
Collapse
Affiliation(s)
- Antje Nuthmann
- Institute of Psychology, University of Kiel, Olshausenstr. 62, 24118, Kiel, Germany.
- Psychology Department, School of Philosophy, Psychology and Language Sciences, University of Edinburgh, Edinburgh, UK.
| | - Christopher N L Clark
- Psychology Department, School of Philosophy, Psychology and Language Sciences, University of Edinburgh, Edinburgh, UK
| |
Collapse
|
17
|
Kim T, Chung M, Jeong E, Cho YS, Kwon OS, Kim SP. Cortical representation of musical pitch in event-related potentials. Biomed Eng Lett 2023; 13:441-454. [PMID: 37519879 PMCID: PMC10382469 DOI: 10.1007/s13534-023-00274-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2022] [Revised: 03/14/2023] [Accepted: 03/18/2023] [Indexed: 08/01/2023] Open
Abstract
Neural coding of auditory stimulus frequency is well-documented; however, the cortical signals and perceptual correlates of pitch have not yet been comprehensively investigated. This study examined the temporal patterns of event-related potentials (ERP) in response to single tones of pitch chroma, with an assumption that these patterns would be more prominent in musically-trained individuals than in non-musically-trained individuals. Participants with and without musical training (N = 20) were presented with seven notes on the C major scale (C4, D4, E4, F4, G4, A4, and B4), and whole-brain activities were recorded. A linear regression analysis between the ERP amplitude and the seven notes showed that the ERP amplitude increased or decreased as the frequency of the pitch increased. Remarkably, these linear correlations were anti-symmetric between the hemispheres. Specifically, we found that ERP amplitudes of the left and right frontotemporal areas decreased and increased, respectively, as the pitch frequency increased. Although linear slopes were significant in both groups, the musically-trained group exhibited marginally steeper slope, and their ERP amplitudes were most discriminant for frequency of tone of pitch at earlier latency than in the non-musically-trained group (~ 460 ms vs ~ 630 ms after stimulus onset). Thus, the ERP amplitudes in frontotemporal areas varied according to the pitch frequency, with the musically-trained participants demonstrating a wider range of amplitudes and inter-hemispheric anti-symmetric patterns. Our findings may provide new insights on cortical processing of musical pitch, revealing anti-symmetric processing of musical pitch between hemispheres, which appears to be more pronounced in musically-trained people. Supplementary Information The online version contains supplementary material available at 10.1007/s13534-023-00274-y.
Collapse
Affiliation(s)
- Taehyoung Kim
- Department of Biomedical Engineering, Ulsan National Institute of Science and Technology, Ulsan, Republic of Korea
| | - Miyoung Chung
- Department of Biomedical Engineering, Ulsan National Institute of Science and Technology, Ulsan, Republic of Korea
| | - Eunju Jeong
- Department of Music and Science for Clinical Practice, College of Interdisciplinary Industrial Studies, Hanyang University, Seoul, Republic of Korea
| | - Yang Seok Cho
- School of Psychology, Korea University, Seoul, Republic of Korea
| | - Oh-Sang Kwon
- Department of Biomedical Engineering, Ulsan National Institute of Science and Technology, Ulsan, Republic of Korea
| | - Sung-Phil Kim
- Department of Biomedical Engineering, Ulsan National Institute of Science and Technology, Ulsan, Republic of Korea
| |
Collapse
|
18
|
He Y, Feng L, Zhou Y, Zhuang Y, Xu Z, Yao Y, Chen X, Jiang R, Yuan J, Ye Q, Wen Y, Jia Y, Liu J, Li J. Characteristics and predictive factors of visual function improvements after monocular perceptual learning in amblyopia. Heliyon 2023; 9:e17281. [PMID: 37416659 PMCID: PMC10320034 DOI: 10.1016/j.heliyon.2023.e17281] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2022] [Revised: 06/02/2023] [Accepted: 06/13/2023] [Indexed: 07/08/2023] Open
Abstract
Monocular perceptual learning has shown promising performance in restoring visual function in amblyopes beyond the critical period in the laboratory. However, the treatment outcome is variable and indeterminate in actual clinical and neuroscientific practice. We aimed to explore the efficacy of monocular perceptual learning in the clinical setting. By combining continuous monitoring of perceptual learning and clinical measurements, we evaluated the efficacy and characteristics of visual acuity and contrast sensitivity function improvement and further explored the individualized effect after perceptual learning. Amblyopes (average age:17 ± 7 years old) were trained in a monocular two-alternative forced choice identification task at the 50% contrast threshold of the amblyopic eye for 10-15 days. We found that monocular perceptual learning improves both visual acuity and contrast sensitivity function in amblyopia. The broader activation of spatial contrast sensitivity, with a significant improvement in lower spatial frequencies, contributed to improving visual acuity. Visual acuity changes in the early stage can predict the endpoint treatment outcomes. Our results confirm the efficacy of monocular perceptual learning and suggest potential predictors of training outcomes to assist in the future management of clinical intervention and vision neuroscience research in amblyopia beyond the critical period of visual plasticity.
Collapse
|
19
|
Sanchez-Garcia M, Chauhan T, Cottereau BR, Beyeler M. Efficient multi-scale representation of visual objects using a biologically plausible spike-latency code and winner-take-all inhibition. BIOLOGICAL CYBERNETICS 2023; 117:95-111. [PMID: 37004546 DOI: 10.1007/s00422-023-00956-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/30/2022] [Accepted: 02/10/2023] [Indexed: 05/05/2023]
Abstract
Deep neural networks have surpassed human performance in key visual challenges such as object recognition, but require a large amount of energy, computation, and memory. In contrast, spiking neural networks (SNNs) have the potential to improve both the efficiency and biological plausibility of object recognition systems. Here we present a SNN model that uses spike-latency coding and winner-take-all inhibition (WTA-I) to efficiently represent visual stimuli using multi-scale parallel processing. Mimicking neuronal response properties in early visual cortex, images were preprocessed with three different spatial frequency (SF) channels, before they were fed to a layer of spiking neurons whose synaptic weights were updated using spike-timing-dependent-plasticity. We investigate how the quality of the represented objects changes under different SF bands and WTA-I schemes. We demonstrate that a network of 200 spiking neurons tuned to three SFs can efficiently represent objects with as little as 15 spikes per neuron. Studying how core object recognition may be implemented using biologically plausible learning rules in SNNs may not only further our understanding of the brain, but also lead to novel and efficient artificial vision systems.
Collapse
Affiliation(s)
| | - Tushar Chauhan
- The Picower Institute for Learning and Memory, Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Boston, MA, USA
- CerCo CNRS UMR5549, Université de Toulouse III-Paul Sabatier, Toulouse, France
| | - Benoit R Cottereau
- CerCo CNRS UMR5549, Université de Toulouse III-Paul Sabatier, Toulouse, France
- IPAL, CNRS IRL 2955, Singapore, Singapore
| | - Michael Beyeler
- Department of Computer Science, University of California, Santa Barbara, CA, USA
- Department of Psychological & Brain Sciences, University of California, Santa Barbara, CA, USA
| |
Collapse
|
20
|
Cheng A, Chen Z, Dilks DD. A stimulus-driven approach reveals vertical luminance gradient as a stimulus feature that drives human cortical scene selectivity. Neuroimage 2023; 269:119935. [PMID: 36764369 PMCID: PMC10044493 DOI: 10.1016/j.neuroimage.2023.119935] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2022] [Revised: 01/19/2023] [Accepted: 02/07/2023] [Indexed: 02/11/2023] Open
Abstract
Human neuroimaging studies have revealed a dedicated cortical system for visual scene processing. But what is a "scene"? Here, we use a stimulus-driven approach to identify a stimulus feature that selectively drives cortical scene processing. Specifically, using fMRI data from BOLD5000, we examined the images that elicited the greatest response in the cortical scene processing system, and found that there is a common "vertical luminance gradient" (VLG), with the top half of a scene image brighter than the bottom half; moreover, across the entire set of images, VLG systematically increases with the neural response in the scene-selective regions (Study 1). Thus, we hypothesized that VLG is a stimulus feature that selectively engages cortical scene processing, and directly tested the role of VLG in driving cortical scene selectivity using tightly controlled VLG stimuli (Study 2). Consistent with our hypothesis, we found that the scene-selective cortical regions-but not an object-selective region or early visual cortex-responded significantly more to images of VLG over control stimuli with minimal VLG. Interestingly, such selectivity was also found for images with an "inverted" VLG, resembling the luminance gradient in night scenes. Finally, we also tested the behavioral relevance of VLG for visual scene recognition (Study 3); we found that participants even categorized tightly controlled stimuli of both upright and inverted VLG to be a place more than an object, indicating that VLG is also used for behavioral scene recognition. Taken together, these results reveal that VLG is a stimulus feature that selectively engages cortical scene processing, and provide evidence for a recent proposal that visual scenes can be characterized by a set of common and unique visual features.
Collapse
Affiliation(s)
- Annie Cheng
- Department of Psychology, Emory University, Atlanta, GA, USA; Department of Psychiatry, Yale School of Medicine, New Haven, CT, USA
| | - Zirui Chen
- Department of Psychology, Emory University, Atlanta, GA, USA; Department of Cognitive Science, Johns Hopkins University, Baltimore, MD, USA
| | - Daniel D Dilks
- Department of Psychology, Emory University, Atlanta, GA, USA.
| |
Collapse
|
21
|
Manzari ON, Ahmadabadi H, Kashiani H, Shokouhi SB, Ayatollahi A. MedViT: A robust vision transformer for generalized medical image classification. Comput Biol Med 2023; 157:106791. [PMID: 36958234 DOI: 10.1016/j.compbiomed.2023.106791] [Citation(s) in RCA: 11] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2022] [Revised: 02/18/2023] [Accepted: 03/11/2023] [Indexed: 03/16/2023]
Abstract
Convolutional Neural Networks (CNNs) have advanced existing medical systems for automatic disease diagnosis. However, there are still concerns about the reliability of deep medical diagnosis systems against the potential threats of adversarial attacks since inaccurate diagnosis could lead to disastrous consequences in the safety realm. In this study, we propose a highly robust yet efficient CNN-Transformer hybrid model which is equipped with the locality of CNNs as well as the global connectivity of vision Transformers. To mitigate the high quadratic complexity of the self-attention mechanism while jointly attending to information in various representation subspaces, we construct our attention mechanism by means of an efficient convolution operation. Moreover, to alleviate the fragility of our Transformer model against adversarial attacks, we attempt to learn smoother decision boundaries. To this end, we augment the shape information of an image in the high-level feature space by permuting the feature mean and variance within mini-batches. With less computational complexity, our proposed hybrid model demonstrates its high robustness and generalization ability compared to the state-of-the-art studies on a large-scale collection of standardized MedMNIST-2D datasets.
Collapse
Affiliation(s)
- Omid Nejati Manzari
- School of Electrical Engineering, Iran University of Science and Technology, Tehran, Iran.
| | - Hamid Ahmadabadi
- School of Electrical Engineering, Iran University of Science and Technology, Tehran, Iran
| | - Hossein Kashiani
- Lane Department of Computer Science and Electrical Engineering, West Virginia University, Morgantown, USA
| | - Shahriar B Shokouhi
- School of Electrical Engineering, Iran University of Science and Technology, Tehran, Iran
| | - Ahmad Ayatollahi
- School of Electrical Engineering, Iran University of Science and Technology, Tehran, Iran
| |
Collapse
|
22
|
Amoruso L, Finisguerra A, Urgesi C. “Left and right prefrontal routes to action comprehension”. Cortex 2023; 163:1-13. [PMID: 37030047 DOI: 10.1016/j.cortex.2023.01.015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2022] [Revised: 12/05/2022] [Accepted: 01/18/2023] [Indexed: 04/03/2023]
Abstract
Successful action comprehension requires the integration of motor information and semantic cues about objects in context. Previous evidence suggests that while motor features are dorsally encoded in the fronto-parietal action observation network (AON); semantic features are ventrally processed in temporal structures. Importantly, these dorsal and ventral routes seem to be preferentially tuned to low (LSF) and high (HSF) spatial frequencies, respectively. Recently, we proposed a model of action comprehension where we hypothesized an additional route to action understanding whereby coarse LSF information about objects in context is projected to the dorsal AON via the prefrontal cortex (PFC), providing a prediction signal of the most likely intention afforded by them. Yet, this model awaits for experimental testing. To this end, we used a perturb-and-measure continuous theta burst stimulation (cTBS) approach, selectively disrupting neural activity in the left and right PFC and then evaluating the participant's ability to recognize filtered action stimuli containing only HSF or LSF. We find that stimulation over PFC triggered different spatial-frequency modulations depending on lateralization: left-cTBS and right-cTBS led to poorer performance on HSF and LSF action stimuli, respectively. Our findings suggest that left and right PFC exploit distinct spatial frequencies to support action comprehension, providing evidence for multiple routes to social perception in humans.
Collapse
|
23
|
Impact of glaucoma on the spatial frequency processing of scenes in central vision. Vis Neurosci 2023; 40:E001. [PMID: 36752177 PMCID: PMC9970733 DOI: 10.1017/s0952523822000086] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/09/2023]
Abstract
Glaucoma is an eye disease characterized by a progressive vision loss usually starting in peripheral vision. However, a deficit for scene categorization is observed even in the preserved central vision of patients with glaucoma. We assessed the processing and integration of spatial frequencies in the central vision of patients with glaucoma during scene categorization, considering the severity of the disease, in comparison to age-matched controls. In the first session, participants had to categorize scenes filtered in low-spatial frequencies (LSFs) and high-spatial frequencies (HSFs) as a natural or an artificial scene. Results showed that the processing of spatial frequencies was impaired only for patients with severe glaucoma, in particular for HFS scenes. In the light of proactive models of visual perception, we investigated how LSF could guide the processing of HSF in a second session. We presented hybrid scenes (combining LSF and HSF from two scenes belonging to the same or different semantic category). Participants had to categorize the scene filtered in HSF while ignoring the scene filtered in LSF. Surprisingly, results showed that the semantic influence of LSF on HSF was greater for patients with early glaucoma than controls, and then disappeared for the severe cases. This study shows that a progressive destruction of retinal ganglion cells affects the spatial frequency processing in central vision. This deficit may, however, be compensated by increased reliance on predictive mechanisms at early stages of the disease which would however decline in more severe cases.
Collapse
|
24
|
Castellotti S, D’Agostino O, Del Viva MM. Fast discrimination of fragmentary images: the role of local optimal information. Front Hum Neurosci 2023; 17:1049615. [PMID: 36845876 PMCID: PMC9945129 DOI: 10.3389/fnhum.2023.1049615] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2022] [Accepted: 01/18/2023] [Indexed: 02/11/2023] Open
Abstract
In naturalistic conditions, objects in the scene may be partly occluded and the visual system has to recognize the whole image based on the little information contained in some visible fragments. Previous studies demonstrated that humans can successfully recognize severely occluded images, but the underlying mechanisms occurring in the early stages of visual processing are still poorly understood. The main objective of this work is to investigate the contribution of local information contained in a few visible fragments to image discrimination in fast vision. It has been already shown that a specific set of features, predicted by a constrained maximum-entropy model to be optimal carriers of information (optimal features), are used to build simplified early visual representations (primal sketch) that are sufficient for fast image discrimination. These features are also considered salient by the visual system and can guide visual attention when presented isolated in artificial stimuli. Here, we explore whether these local features also play a significant role in more natural settings, where all existing features are kept, but the overall available information is drastically reduced. Indeed, the task requires discrimination of naturalistic images based on a very brief presentation (25 ms) of a few small visible image fragments. In the main experiment, we reduced the possibility to perform the task based on global-luminance positional cues by presenting randomly inverted-contrast images, and we measured how much observers' performance relies on the local features contained in the fragments or on global information. The size and the number of fragments were determined in two preliminary experiments. Results show that observers are very skilled in fast image discrimination, even when a drastic occlusion is applied. When observers cannot rely on the position of global-luminance information, the probability of correct discrimination increases when the visible fragments contain a high number of optimal features. These results suggest that such optimal local information contributes to the successful reconstruction of naturalistic images even in challenging conditions.
Collapse
|
25
|
Okrent Smolar AL, Gagrani M, Ghate D. Peripheral visual field loss and activities of daily living. Curr Opin Neurol 2023; 36:19-25. [PMID: 36409221 DOI: 10.1097/wco.0000000000001125] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
PURPOSE OF REVIEW Peripheral visual field (VF) loss affects 13% of the population over 65. Its effect on activities of daily living and higher order visual processing is as important as it is inadequately understood. The purpose of this review is to summarize available literature on the impact of peripheral vision loss on driving, reading, face recognition, scene recognition and scene navigation. RECENT FINDINGS In this review, glaucoma and retrochiasmal cortical damage are utilized as examples of peripheral field loss which typically spare central vision and have patterns respecting the horizontal and vertical meridians, respectively. In both glaucoma and retrochiasmal damage, peripheral field loss causes driving difficulty - especially with lane maintenance - leading to driving cessation, loss of independence, and depression. Likewise, peripheral field loss can lead to slower reading speeds and decreased enjoyment from reading, and anxiety. In glaucoma and retrochiasmal field loss, face processing is impaired which impacts social functioning. Finally, scene recognition and navigation are also adversely affected, impacting wayfinding and hazard detection leading to decreased independence as well as more frequent injury. SUMMARY Peripheral VF loss is an under-recognized cause of patient distress and disability. All peripheral field loss is not the same, differential patterns of loss affect parameters of activities of daily living (ADL) and visual processing in particular ways. Future research should aim to further characterize patterns of deranged ADL and visual processing, their correlation with types of field loss, and associated mechanisms.
Collapse
Affiliation(s)
| | - Meghal Gagrani
- Department of Ophthalmology, University of Pittsburgh School of Medicine Children's Hospital of Pittsburgh, Pittsburgh, Pennsylvania, USA
| | - Deepta Ghate
- Department of Ophthalmology, Emory University School of Medicine, Atlanta, Georgia
| |
Collapse
|
26
|
D’Argenio G, Finisguerra A, Urgesi C. Spatial Frequency Tuning of Body Inversion Effects. Brain Sci 2023; 13:brainsci13020190. [PMID: 36831733 PMCID: PMC9954120 DOI: 10.3390/brainsci13020190] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2022] [Revised: 01/13/2023] [Accepted: 01/18/2023] [Indexed: 01/26/2023] Open
Abstract
Body inversion effects (BIEs) reflect the deployment of the configural processing of body stimuli. BIE modulates the activity of body-selective areas within both the dorsal and the ventral streams, which are tuned to low (LSF) or high spatial frequencies (HSF), respectively. The specific contribution of different bands to the configural processing of bodies along gender and posture dimensions, however, is still unclear. Seventy-two participants performed a delayed matching-to-sample paradigm in which upright and inverted bodies, differing for gender or posture, could be presented in their original intact form or in the LSF- or HSF-filtered version. In the gender discrimination task, participants' performance was enhanced by the presentation of HSF images. Conversely, for the posture discrimination task, a better performance was shown for either HSF or LSF images. Importantly, comparing the amount of BIE across spatial-frequency conditions, we found greater BIEs for HSF than LSF images in both tasks, indicating that configural body processing may be better supported by HSF information, which will bias processing in the ventral stream areas. Finally, the exploitation of HSF information for the configural processing of body postures was lower in individuals with higher autistic traits, likely reflecting a stronger reliance on the local processing of body-part details.
Collapse
Affiliation(s)
- Giulia D’Argenio
- PhD Program in Neural and Cognitive Sciences, Department of Life Sciences, University of Trieste, 34128 Trieste, Italy
- Laboratory of Cognitive Neuroscience, Department of Languages and Literatures, Communication, Education and Society, University of Udine, 33100 Udine, Italy
- Correspondence: (G.D.); (C.U.)
| | | | - Cosimo Urgesi
- Laboratory of Cognitive Neuroscience, Department of Languages and Literatures, Communication, Education and Society, University of Udine, 33100 Udine, Italy
- Scientific Institute, IRCCS E. Medea, Pasian di Prato (Udine), 33037 Udine, Italy
- Correspondence: (G.D.); (C.U.)
| |
Collapse
|
27
|
Doucet GE, Hamlin N, Kruse JA, Taylor BK, Poirel N. Link between fluid/crystallized intelligence and global/local visual abilities across adulthood. Conscious Cogn 2022; 106:103429. [PMID: 36306570 PMCID: PMC10481540 DOI: 10.1016/j.concog.2022.103429] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2022] [Revised: 10/10/2022] [Accepted: 10/13/2022] [Indexed: 01/27/2023]
Abstract
Human visual processing involves the extraction of both global and local information from a visual stimulus. Such processing may be related to cognitive abilities, which is likely going to change over time as we age. We aimed to investigate the impact of healthy aging on the association between visual global vs local processing and intelligence. In this context, we collected behavioral data during a visual search task in 103 adults (50 younger/53 older). We extracted three metrics reflecting global advantage (faster global than local processing), and visual interference in detecting either local or global features (based on interfering visual distractors). We found that older, but not younger, adults with higher levels of fluid and crystallized intelligence showed stronger signs of global advantage and interference effects during local processing, respectively. The present findings also provide promising clues regarding how participants consider and process their visual world in healthy aging.
Collapse
Affiliation(s)
- Gaelle E Doucet
- Institute for Human Neuroscience, Boys Town National Research Hospital, Omaha, NE, USA; Creighton University School of Medicine, Omaha, NE, USA.
| | - Noah Hamlin
- Institute for Human Neuroscience, Boys Town National Research Hospital, Omaha, NE, USA
| | - Jordanna A Kruse
- Institute for Human Neuroscience, Boys Town National Research Hospital, Omaha, NE, USA
| | - Brittany K Taylor
- Institute for Human Neuroscience, Boys Town National Research Hospital, Omaha, NE, USA; Creighton University School of Medicine, Omaha, NE, USA
| | - Nicolas Poirel
- Université Paris Cité, LaPsyDÉ, CNRS, F-75005 Paris, France; GIP Cyceron, Caen, France; Institut Universitaire de France, Paris, France
| |
Collapse
|
28
|
Abstract
The ability to judge numbers exists in various vertebrate species but also in honey bees, thus raising the question of the phylogenetic origins of numerosity systems. Here, we studied if bees, like humans, organize numbers spatially from left to right according to their magnitude. As the cultural vs. biological origins of this mental number line (MNL) are a subject of debate, our study provides an important perspective for this discussion. We show that bees order numbers from left to right according to their magnitude and that the location of a number on that line varies with the reference number previously trained. Thus, the MNL is a biological numeric representation that is common to the nervous system with distant evolutionary origins. The “mental number line” (MNL) is a form of spatial numeric representation that associates small and large numbers with the left and right spaces, respectively. This spatio-numeric organization can be found in adult humans and has been related to cultural factors such as writing and reading habits. Yet, both human newborns and birds order numbers consistently with an MNL, thus raising the question of whether culture is a main explanation for MNL. Here, we explored the numeric sense of honey bees and show that after being trained to associate numbers with a sucrose reward, they order numbers not previously experienced from left to right according to their magnitude. Importantly, the location of a number on that scale varies with the reference number previously trained and does not depend on low-level cues present on numeric stimuli. We provide a series of neural explanations for this effect based on the extensive knowledge accumulated on the neural underpinnings of visual processing in honey bees and conclude that the MNL is a form of numeric representation that is evolutionarily conserved across nervous systems endowed with a sense of number, irrespective of their neural complexity.
Collapse
|
29
|
Valenzo D, Ciria A, Schillaci G, Lara B. Grounding Context in Embodied Cognitive Robotics. Front Neurorobot 2022; 16:843108. [PMID: 35812785 PMCID: PMC9262126 DOI: 10.3389/fnbot.2022.843108] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2021] [Accepted: 05/10/2022] [Indexed: 11/13/2022] Open
Abstract
Biological agents are context-dependent systems that exhibit behavioral flexibility. The internal and external information agents process, their actions, and emotions are all grounded in the context within which they are situated. However, in the field of cognitive robotics, the concept of context is far from being clear with most studies making little to no reference to it. The aim of this paper is to provide an interpretation of the notion of context and its core elements based on different studies in natural agents, and how these core contextual elements have been modeled in cognitive robotics, to introduce a new hypothesis about the interactions between these contextual elements. Here, global context is categorized as agent-related, environmental, and task-related context. The interaction of their core elements, allows agents to first select self-relevant tasks depending on their current needs, or for learning and mastering their environment through exploration. Second, to perform a task and continuously monitor its performance. Third, to abandon a task in case its execution is not going as expected. Here, the monitoring of prediction error, the difference between sensorimotor predictions and incoming sensory information, is at the core of behavioral flexibility during situated action cycles. Additionally, monitoring prediction error dynamics and its comparison with the expected reduction rate should indicate the agent its overall performance on executing the task. Sensitivity to performance evokes emotions that function as the driving element for autonomous behavior which, at the same time, depends on the processing of the interacting core elements. Taking all these into account, an interactionist model of contexts and their core elements is proposed. The model is embodied, affective, and situated, by means of the processing of the agent-related and environmental core contextual elements. Additionally, it is grounded in the processing of the task-related context and the associated situated action cycles during task execution. Finally, the model proposed here aims to guide how artificial agents should process the core contextual elements of the agent-related and environmental context to give rise to the task-related context, allowing agents to autonomously select a task, its planning, execution, and monitoring for behavioral flexibility.
Collapse
Affiliation(s)
- Diana Valenzo
- Laboratorio de Robótica Cognitiva, Centro de Investigación en Ciencias, Universidad Autónoma del Estado de Morelos, Cuernavaca, Mexico
| | - Alejandra Ciria
- Facultad de Psicología, Universidad Nacional Autónoma de México, Mexico City, Mexico
| | | | - Bruno Lara
- Laboratorio de Robótica Cognitiva, Centro de Investigación en Ciencias, Universidad Autónoma del Estado de Morelos, Cuernavaca, Mexico
- *Correspondence: Bruno Lara
| |
Collapse
|
30
|
Trouilloud A, Rossel P, Faurite C, Roux-Sibilon A, Kauffmann L, Peyrin C. Influence of physical features from peripheral vision on scene categorization in central vision. VISUAL COGNITION 2022. [DOI: 10.1080/13506285.2022.2087814] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Affiliation(s)
- Audrey Trouilloud
- Univ. Grenoble Alpes, Univ. Savoie Mont Blanc, CNRS, LPNC, Grenoble, France
| | - Pauline Rossel
- Univ. Grenoble Alpes, Univ. Savoie Mont Blanc, CNRS, LPNC, Grenoble, France
| | - Cynthia Faurite
- Univ. Grenoble Alpes, Univ. Savoie Mont Blanc, CNRS, LPNC, Grenoble, France
| | - Alexia Roux-Sibilon
- Psychological Sciences Research Institute (IPSY), UC Louvain, Louvain-la-Neuve, Belgium
| | - Louise Kauffmann
- Univ. Grenoble Alpes, Univ. Savoie Mont Blanc, CNRS, LPNC, Grenoble, France
| | - Carole Peyrin
- Univ. Grenoble Alpes, Univ. Savoie Mont Blanc, CNRS, LPNC, Grenoble, France
| |
Collapse
|
31
|
Dasgupta D, Warner TPA, Erskine A, Schaefer AT. Coupling of Mouse Olfactory Bulb Projection Neurons to Fluctuating Odor Pulses. J Neurosci 2022; 42:4278-4296. [PMID: 35440491 PMCID: PMC9145232 DOI: 10.1523/jneurosci.1422-21.2022] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2021] [Revised: 03/24/2022] [Accepted: 03/29/2022] [Indexed: 11/28/2022] Open
Abstract
Odors are transported by turbulent air currents, creating complex temporal fluctuations in odor concentration that provide a potentially informative stimulus dimension. We have shown that mice are able to discriminate odor stimuli based on their temporal structure, indicating that information contained in the temporal structure of odor plumes can be extracted by the mouse olfactory system. Here, using in vivo extracellular and intracellular electrophysiological recordings, we show that mitral cells (MCs) and tufted cells (TCs) of the male C57BL/6 mouse olfactory bulb can encode the dominant temporal frequencies present in odor stimuli up to at least 20 Hz. A substantial population of cell-odor pairs showed significant coupling of their subthreshold membrane potential with the odor stimulus at both 2 Hz (29/70) and the suprasniff frequency 20 Hz (24/70). Furthermore, mitral/tufted cells (M/TCs) show differential coupling of their membrane potential to odor concentration fluctuations with tufted cells coupling more strongly for the 20 Hz stimulation. Frequency coupling was always observed to be invariant to odor identity, and M/TCs that coupled well to a mixture also coupled to at least one of the components of the mixture. Interestingly, pharmacological blocking of the inhibitory circuitry strongly modulated frequency coupling of cell-odor pairs at both 2 Hz (10/15) and 20 Hz (9/15). These results provide insight into how both cellular and circuit properties contribute to the encoding of temporal odor features in the mouse olfactory bulb.SIGNIFICANCE STATEMENT Odors in the natural environment have a strong temporal structure that can be extracted and used by mice in their behavior. Here, using in vivo extracellular and intracellular electrophysiological techniques, we show that the projection neurons in the olfactory bulb can encode and couple to the dominant frequency present in an odor stimulus. Furthermore, frequency coupling was observed to be differential between mitral and tufted cells and was odor invariant but strongly modulated by local inhibitory circuits. In summary, this study provides insight into how both cellular and circuit properties modulate encoding of odor temporal features in the mouse olfactory bulb.
Collapse
Affiliation(s)
- Debanjan Dasgupta
- Sensory Circuits and Neurotechnology Laboratory, Francis Crick Institute, London NW1 1AT, United Kingdom
- Department of Neuroscience, Physiology and Pharmacology, University College London, London WC1E 6BT, United Kingdom
| | - Tom P A Warner
- Sensory Circuits and Neurotechnology Laboratory, Francis Crick Institute, London NW1 1AT, United Kingdom
| | - Andrew Erskine
- Sensory Circuits and Neurotechnology Laboratory, Francis Crick Institute, London NW1 1AT, United Kingdom
| | - Andreas T Schaefer
- Sensory Circuits and Neurotechnology Laboratory, Francis Crick Institute, London NW1 1AT, United Kingdom
- Department of Neuroscience, Physiology and Pharmacology, University College London, London WC1E 6BT, United Kingdom
| |
Collapse
|
32
|
The forest, the trees, and the leaves across adulthood: Age-related changes on a visual search task containing three-level hierarchical stimuli. Atten Percept Psychophys 2022; 84:1004-1015. [PMID: 35013995 PMCID: PMC9001546 DOI: 10.3758/s13414-021-02438-3] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/27/2021] [Indexed: 11/08/2022]
Abstract
Selecting relevant visual information in complex scenes by processing either global information or local parts helps us act efficiently within our environment and achieve goals. A global advantage (faster global than local processing) and global interference (global processing interferes with local processing) comprise an evidentiary global precedence phenomenon in early adulthood. However, the impact of healthy aging on this phenomenon remains unclear. As such, we collected behavioral data during a visual search task, including three-levels hierarchical stimuli (i.e., global, intermediate, and local levels) with several hierarchical distractors, in 50 healthy adults (26 younger (mean age: 26 years) and 24 older (mean age: 62 years)). Results revealed that processing information presented at the global and intermediate levels was independent of age. Conversely, older adults were slower for local processing compared to the younger adults, suggesting lower efficiency to deal with visual distractors during detail-oriented visual search. Although healthy older adults continued exhibiting a global precedence phenomenon, they were disproportionately less efficient during local aspects of information processing, especially when multiple visual information was displayed. Our results could have important implications for many life situations by suggesting that visual information processing is impacted by healthy aging, even with similar visual stimuli objectively presented.
Collapse
|
33
|
Adámek P, Langová V, Horáček J. Early-stage visual perception impairment in schizophrenia, bottom-up and back again. SCHIZOPHRENIA (HEIDELBERG, GERMANY) 2022; 8:27. [PMID: 35314712 PMCID: PMC8938488 DOI: 10.1038/s41537-022-00237-9] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/30/2021] [Accepted: 02/17/2022] [Indexed: 01/01/2023]
Abstract
Visual perception is one of the basic tools for exploring the world. However, in schizophrenia, this modality is disrupted. So far, there has been no clear answer as to whether the disruption occurs primarily within the brain or in the precortical areas of visual perception (the retina, visual pathways, and lateral geniculate nucleus [LGN]). A web-based comprehensive search of peer-reviewed journals was conducted based on various keyword combinations including schizophrenia, saliency, visual cognition, visual pathways, retina, and LGN. Articles were chosen with respect to topic relevance. Searched databases included Google Scholar, PubMed, and Web of Science. This review describes the precortical circuit and the key changes in biochemistry and pathophysiology that affect the creation and characteristics of the retinal signal as well as its subsequent modulation and processing in other parts of this circuit. Changes in the characteristics of the signal and the misinterpretation of visual stimuli associated with them may, as a result, contribute to the development of schizophrenic disease.
Collapse
Affiliation(s)
- Petr Adámek
- Third Faculty of Medicine, Charles University, Prague, Czech Republic. .,Center for Advanced Studies of Brain and Consciousness, National Institute of Mental Health, Klecany, Czech Republic.
| | - Veronika Langová
- Third Faculty of Medicine, Charles University, Prague, Czech Republic.,Center for Advanced Studies of Brain and Consciousness, National Institute of Mental Health, Klecany, Czech Republic
| | - Jiří Horáček
- Third Faculty of Medicine, Charles University, Prague, Czech Republic.,Center for Advanced Studies of Brain and Consciousness, National Institute of Mental Health, Klecany, Czech Republic
| |
Collapse
|
34
|
Fernandes TP, Oliveira MEC, Silva GM, Santos NA. Improvement in visual performance after nicotine gum administration in tobacco use disorder: a case report. J Addict Dis 2022; 40:568-576. [PMID: 35264083 DOI: 10.1080/10550887.2021.2020045] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Abstract
Chronic tobacco consumption, identified as Tobacco Use Disorder (TUD), is a public health problem. We present a case report of a 37-year-old Brazilian male diagnosed with TUD at age 26, with no comorbidities, that presented visual improvements (i.e., lower thresholds and better discrimination) after nicotine gum administration. Here, we assessed contrast sensitivity and chromatic discrimination using the Metropsis and the Cambridge Colour Test, respectively. Results showed lower thresholds for both visual tasks after the use of nicotine gum. Even considering this is a single case report, our intent is to open new avenues for research involving smoking, addiction and the use of nicotine gum as a replacement tool or adjuvant tool for improvement of visual and/or cognitive processing. It is well known that nicotine gum has protective effects for some diseases, and improves some cognitive functions. However, unclear were its effects on visual processing of people with TUD.
Collapse
Affiliation(s)
- Thiago P Fernandes
- Perception, Neuroscience and Behaviour Lab, Federal University of Paraiba, Joao Pessoa, Brazil.,Department of Psychology, Federal University of Paraiba, Joao Pessoa, Brazil
| | - Milena E C Oliveira
- Perception, Neuroscience and Behaviour Lab, Federal University of Paraiba, Joao Pessoa, Brazil.,Department of Psychology, Federal University of Paraiba, Joao Pessoa, Brazil
| | - Gabriella M Silva
- Perception, Neuroscience and Behaviour Lab, Federal University of Paraiba, Joao Pessoa, Brazil.,Department of Psychology, Federal University of Paraiba, Joao Pessoa, Brazil
| | - Natanael A Santos
- Perception, Neuroscience and Behaviour Lab, Federal University of Paraiba, Joao Pessoa, Brazil.,Department of Psychology, Federal University of Paraiba, Joao Pessoa, Brazil
| |
Collapse
|
35
|
Almeida NL, Silva JBS, Oliveira MEC, Fernandes TP, Santos NA. Eye movement impairments in children with malnutrition. INTERNATIONAL JOURNAL OF PSYCHOLOGY 2022; 57:644-651. [PMID: 35262928 DOI: 10.1002/ijop.12838] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2021] [Accepted: 01/30/2022] [Indexed: 11/08/2022]
Abstract
Malnutrition is characterised by deficient nutrient ingestion and absorption and is still one of the most important causes of morbidity and mortality in children worldwide. Our main rationale was that protein-energy malnutrition (PEM) may affect eye movement in children with malnutrition. Twenty children without PEM (mean age = 10.8; SD = 1.0 years) and 18 children with PEM (mean age = 10.9; SD = 1.2 years) were included in the present study. We applied three types of tests: one that consisted of a maze and two versions of the Spot the Seven Errors test using boats and elephants. Our results indicated that children with PEM exhibited performance deficits in the maze test (p < .001) and Spot the Seven Errors test for both boats (p < .001) and elephants (p < .001). These data suggest that nutritional impairments during the first year of life (i.e., a critical period) can directly impact eye movement. Eye tracking is a reliable technique to investigate higher-order processes, but our results should be interpreted with caution. Our findings highlight the relevance of cognitive development in malnourished children, which can negatively affect their development. Screening, assessment and rehabilitation strategies are essential in this at-risk population.
Collapse
Affiliation(s)
- Natalia L Almeida
- Department of Psychology, Federal University of Paraiba, Joao Pessoa, Brazil.,Perception, Neuroscience and Behaviour Laboratory, Federal University of Paraiba, Joao Pessoa, Brazil
| | - Jessica B S Silva
- Department of Psychology, Minas Gerais State University, Belo Horizonte, Brazil
| | - Milena E C Oliveira
- Department of Psychology, Federal University of Paraiba, Joao Pessoa, Brazil.,Perception, Neuroscience and Behaviour Laboratory, Federal University of Paraiba, Joao Pessoa, Brazil
| | - Thiago P Fernandes
- Department of Psychology, Federal University of Paraiba, Joao Pessoa, Brazil.,Perception, Neuroscience and Behaviour Laboratory, Federal University of Paraiba, Joao Pessoa, Brazil
| | - Natanael A Santos
- Department of Psychology, Federal University of Paraiba, Joao Pessoa, Brazil.,Perception, Neuroscience and Behaviour Laboratory, Federal University of Paraiba, Joao Pessoa, Brazil
| |
Collapse
|
36
|
Cummins DD, Caton MT, Shah V, Meisel K, Glastonbury C, Amans MR. MRI and MR angiography evaluation of pulsatile tinnitus: A focused, physiology-based protocol. J Neuroimaging 2022; 32:253-263. [PMID: 34910345 PMCID: PMC8917066 DOI: 10.1111/jon.12955] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2021] [Revised: 11/19/2021] [Accepted: 11/22/2021] [Indexed: 01/02/2023] Open
Abstract
BACKGROUND AND PURPOSE Pulsatile tinnitus (PT) is the subjective sensation of a pulse-synchronous sound, most often due to a cerebrovascular etiology. PT can severely impact quality of life and may indicate a life-threatening process, yet a timely and accurate diagnosis can often lead to effective treatment. Clinical assessment with a history and physical examination can often suggest a diagnosis for PT, but is rarely definitive. Therefore, PT should be evaluated with a comprehensive and targeted radiographic imaging protocol. MR imaging provides a safe and effective means to evaluate PT. Specific MR sequences may be used to highlight different elements of cerebrovascular anatomy and physiology. However, routine MR evaluation of PT must comply with economic and practical constraints, while effectively capturing both common and rarer, life-threatening etiologies of PT. METHODS In this state-of-the-art review, we describe our institutional MR protocol for evaluating PT. RESULTS This protocol includes the following dedicated sequences: time-of-flight magnetic resonance angiography; arterial spin labeling; spoiled gradient recalled acquisition in the steady state; time-resolved imaging of contrast kinetics; diffusion weighted imaging, and 3-dimensional fluid-attenuated inversion recovery. CONCLUSIONS We describe the physiologic and clinical rationale for including each MR sequence in a comprehensive PT imaging protocol, and detail the role of MR within the broader evaluation of PT, from clinical presentation to treatment.
Collapse
Affiliation(s)
- Daniel D. Cummins
- School of Medicine, University of California, San Francisco, San Francisco, CA, USA
| | - Michael T. Caton
- Department of Radiology and Biomedical Imaging, University of California, San Francisco, San Francisco, CA, USA
| | - Vinil Shah
- Department of Radiology and Biomedical Imaging, University of California, San Francisco, San Francisco, CA, USA
| | - Karl Meisel
- Department of Neurology, University of California, San Francisco, San Francisco, CA, USA
| | - Christine Glastonbury
- Department of Radiology and Biomedical Imaging, University of California, San Francisco, San Francisco, CA, USA
| | - Matthew R. Amans
- Department of Radiology and Biomedical Imaging, University of California, San Francisco, San Francisco, CA, USA,Corresponding author: Matthew R. Amans, Address: 505 Parnassus Ave, Room L349, San Francisco, CA 94143, Telephone: 415-353-1863, Fax: 415-353-8606,
| |
Collapse
|
37
|
DiNuzzo M, Mangia S, Moraschi M, Mascali D, Hagberg GE, Giove F. Perception is associated with the brain's metabolic response to sensory stimulation. eLife 2022; 11:71016. [PMID: 35225790 PMCID: PMC9038191 DOI: 10.7554/elife.71016] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2021] [Accepted: 02/25/2022] [Indexed: 11/13/2022] Open
Abstract
Processing of incoming sensory stimulation triggers an increase of cerebral perfusion and blood oxygenation (neurovascular response) as well as an alteration of the metabolic neurochemical profile (neurometabolic response). Here we show in human primary visual cortex (V1) that perceived and unperceived isoluminant chromatic flickering stimuli designed to have similar neurovascular responses as measured by blood oxygenation level dependent functional MRI (BOLD-fMRI) have markedly different neurometabolic responses as measured by functional MRS. In particular, a significant regional buildup of lactate, an index of aerobic glycolysis, and glutamate, an index of malate-aspartate shuttle, occurred in V1 only when the flickering was perceived, without any relation with behavioral or physiological variables. Whereas the BOLD-fMRI signal in V1, a proxy for input to V1, was insensitive to flickering perception by design, the BOLD-fMRI signal in secondary visual areas was larger during perceived than unperceived flickering, indicating increased output from V1. These results demonstrate that the upregulation of energy metabolism induced by visual stimulation depends on the type of information processing taking place in V1, and that 1H-fMRS provides unique information about local input/output balance that is not measured by BOLD fMRI.
Collapse
Affiliation(s)
- Mauro DiNuzzo
- Museo Storico della Fisica e Centro Studi e Ricerche Enrico Fermi, Rome, Italy
| | - Silvia Mangia
- Center for Magnetic Resonance Research, University of Minnesota, Minneapolis, United States
| | - Marta Moraschi
- Department of Radiation Oncology, University of Rome, Rome, Italy
| | - Daniele Mascali
- Dipartimento di Neuroscienze, Università Gabriele D'Annunzio, Chieti, Italy
| | - Gisela E Hagberg
- High-Field Magnetic Resonance, Max Planck Institute for Biological Cybernetics and Biomedical Magnetic Resonance, Tübingen, Germany
| | - Federico Giove
- Museo Storico della Fisica e Centro Studi e Ricerche Enrico Fermi, Rome, Italy
| |
Collapse
|
38
|
Improving Functional Connectivity in Developmental Dyslexia through Combined Neurofeedback and Visual Training. Symmetry (Basel) 2022. [DOI: 10.3390/sym14020369] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
This study examined the effects of combined neurofeedback (NF) and visual training (VT) on children with developmental dyslexia (DD). Although NF is the first noninvasive approach to support neurological disorders, the mechanisms of its effects on the brain functional connectivity are still unclear. A key question is whether the functional connectivities of the EEG frequency networks change after the combined NF–VT training of DD children (postD). NF sessions of voluntary α/θ rhythm control were applied in a low-spatial-frequency (LSF) illusion contrast discrimination, which provides feedback with visual cues to improve the brain signals and cognitive abilities in DD children. The measures of connectivity, which are defined by small-world propensity, were sensitive to the properties of the brain electrical oscillations in the quantitative EEG-NF training. In the high-contrast LSF illusion, the z-NF reduced the α/θ scores in the frontal areas, and in the right ventral temporal, occipital–temporal, and middle occipital areas in the postD (vs. the preD) because of their suppression in the local hub θ-network and the altered global characteristics of the functional θ-frequency network. In the low-contrast condition, the z-NF stimulated increases in the α/θ scores, which induced hubs in the left-side α-frequency network of the postD, and changes in the global characteristics of the functional α-frequency network. Because of the anterior, superior, and middle temporal deficits affecting the ventral and occipital–temporal pathways, the z-NF–VT compensated for the more ventral brain regions, mainly in the left hemispheres of the postD group in the low-contrast LSF illusion. Compared to pretraining, the NF–VT increased the segregation of the α, β (low-contrast), and θ networks (high-contrast), as well as the γ2-network integration (both contrasts) after the termination of the training of the children with developmental dyslexia. The remediation compensated more for the dorsal (prefrontal, premotor, occipital–parietal connectivities) dysfunction of the θ network in the developmental dyslexia in the high-contrast LSF illusion. Our findings provide neurobehavioral evidence for the exquisite brain functional plasticity and direct effect of NF–VT on cognitive disabilities in DD children.
Collapse
|
39
|
Mendoza-Medialdea MT, Ruiz-Padial E. Exploration of Exogenous Attention to Disgust and Fear Pictures with Different Spatial Frequencies Through Event-related Potentials. Neuroscience 2022; 481:1-11. [PMID: 34843895 DOI: 10.1016/j.neuroscience.2021.11.029] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2021] [Revised: 11/11/2021] [Accepted: 11/19/2021] [Indexed: 11/26/2022]
Abstract
The study of the effects of fear and disgust on the capture of automatic attention is gaining interest. Most findings reveal a more efficient capture of exogenous attention by disgust than by fear stimuli, although the underlying mechanisms are not completely understood. The manipulation of their spatial frequency may provide new insight that may contribute to clarify this issue. The present study aimed to explore differential processing of disgust and fear scenes containing only low spatial frequencies (LSF) or all spatial frequencies (intact) presented as distractors in an exogenous attention task. Event-related potentials (ERPs) and behavioral responses were recorded as dependent variables from forty participants (29 women). The results showed that disgust and fear distractors captured exogenous attention equally early, as indicated by the augmented amplitude of the N2p, and later disgust distractors are the ones eliciting the highest amplitude of the LPP component. While in an initial stage, both stimuli seem to have similar preferential access to further processing allowing fast responding in both cases, disgust is more deeply processed at a later stage probably facilitating its examination. These findings suggest that exploring the temporal course of processing is relevant for the understanding of the differential capture of exogenous attention by disgust and fear distractors.
Collapse
|
40
|
Zhang Q, Lai J, Feng Z, Xie X. Seeing Like a Human: Asynchronous Learning With Dynamic Progressive Refinement for Person Re-Identification. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2021; 31:352-365. [PMID: 34807829 DOI: 10.1109/tip.2021.3128330] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Learning discriminative and rich features is an important research task for person re-identification. Previous studies have attempted to capture global and local features at the same time and layer of the model in a non-interactive manner, which are called synchronous learning. However, synchronous learning leads to high similarity, and further defects in model performance. To this end, we propose asynchronous learning based on the human visual perception mechanism. Asynchronous learning emphasizes the time asynchrony and space asynchrony of feature learning and achieves mutual promotion and cyclical interaction for feature learning. Furthermore, we design a dynamic progressive refinement module to improve local features with the guidance of global features. The dynamic property allows this module to adaptively adjust the network parameters according to the input image, in both the training and testing stage. The progressive property narrows the semantic gap between the global and local features, which is due to the guidance of global features. Finally, we have conducted several experiments on four datasets, including Market1501, CUHK03, DukeMTMC-ReID, and MSMT17. The experimental results show that asynchronous learning can effectively improve feature discrimination and achieve strong performance.
Collapse
|
41
|
Geuzebroek AC, Woutersen K, van den Berg AV. When You Do Not Get the Whole Picture: Scene Perception After Occipital Cortex Lesions. Front Neurosci 2021; 15:716273. [PMID: 34966253 PMCID: PMC8710569 DOI: 10.3389/fnins.2021.716273] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2021] [Accepted: 11/08/2021] [Indexed: 11/29/2022] Open
Abstract
Background: Occipital cortex lesions (OCLs) typically result in visual field defects (VFDs) contralateral to the damage. VFDs are usually mapped with perimetry involving the detection of point targets. This, however, ignores the important role of integration of visual information across locations in many tasks of everyday life. Here, we ask whether standard perimetry can fully characterize the consequences of OCLs. We compare performance on a rapid scene discrimination task of OCL participants and healthy observers with simulated VFDs. While the healthy observers will only suffer the loss of part of the visual scene, the damage in the OCL participants may further compromise global visual processing. Methods: VFDs were mapped with Humphrey perimetry, and participants performed two rapid scene discrimination tasks. In healthy participants, the VFDs were simulated with hemi- and quadrant occlusions. Additionally, the GIST model, a computational model of scene recognition, was used to make individual predictions based on the VFDs. Results: The GIST model was able to predict the performance of controls regarding the effects of the local occlusion. Using the individual predictions of the GIST model, we can determine that the variability between the OCL participants is much larger than the extent of the VFD could account for. The OCL participants can further be categorized as performing worse, the same, or better as their VFD would predict. Conclusions: While in healthy observers the extent of the simulated occlusion accounts for their performance loss, the OCL participants' performance is not fully determined by the extent or shape of their VFD as measured with Humphrey perimetry. While some OCL participants are indeed only limited by the local occlusion of the scene, for others, the lesions compromised the visual network in a more global and disruptive way. Yet one outperformed a healthy observer, suggesting a possible adaptation to the VFD. Preliminary analysis of neuroimaging data suggests that damage to the lateral geniculate nucleus and corpus callosum might be associated with the larger disruption of rapid scene discrimination. We believe our approach offers a useful behavioral tool for investigating why similar VFDs can produce widely differing limitations in everyday life.
Collapse
Affiliation(s)
- Anna C. Geuzebroek
- Donders Institute for Brain, Cognition and Behavior, Center for Cognitive Neuroscience, Radboud University, Nijmegen, Netherlands
- School of Electrical and Electronic Engineering, University College Dublin, Dublin, Ireland
| | - Karlijn Woutersen
- Donders Institute for Brain, Cognition and Behavior, Center for Cognitive Neuroscience, Radboud University Medical Center (RadboudUMC), Nijmegen, Netherlands
| | - Albert V. van den Berg
- Donders Institute for Brain, Cognition and Behavior, Center for Cognitive Neuroscience, Radboud University Medical Center (RadboudUMC), Nijmegen, Netherlands
| |
Collapse
|
42
|
Cermeño-Aínsa S. The perception/cognition distincton: Challenging the representational account. Conscious Cogn 2021; 95:103216. [PMID: 34649065 DOI: 10.1016/j.concog.2021.103216] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2020] [Revised: 09/22/2021] [Accepted: 09/23/2021] [Indexed: 11/30/2022]
Abstract
A central goal for cognitive science and philosophy of mind is to distinguish between perception and cognition. The representational approach has emerged as a prominent candidate to draw such a distinction. The idea is that perception and cognition differ in the content and the format in which the information is represented -just as perceptual representations are nonconceptual in content and iconic in format, cognitive representations are conceptual in content and discursive in format. This paper argues against this view. I argue that both perception and cognition can use conceptual and nonconceptual contents and be vehiculated in iconic and discursive formats. If correct, the representational strategy to distinguish perception from cognition fails.
Collapse
Affiliation(s)
- Sergio Cermeño-Aínsa
- Autonomous University of Barcelona, Cognitive Science and Language (CCiL), Edifici B, Campus de la UAB, 08193 Bellaterra, (Cerdanyola del Vallès), Spain.
| |
Collapse
|
43
|
Chaisilprungraung T, Park S. "Scene" from inside: The representation of Observer's space in high-level visual cortex. Neuropsychologia 2021; 161:108010. [PMID: 34454940 DOI: 10.1016/j.neuropsychologia.2021.108010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2021] [Revised: 07/30/2021] [Accepted: 08/23/2021] [Indexed: 10/20/2022]
Abstract
Human observers are remarkably adept at perceiving and interacting with visual stimuli around them. Compared to visual stimuli like objects or faces, scenes are unique in that they provide enclosures for observers. An observer looks at a scene by being physically inside the scene. The current research explored this unique observer-scene relationship by studying the neural representation of scenes' spatial boundaries. Previous studies hypothesized that scenes' boundaries were processed in sets of high-level visual cortices. Notably, the parahippocampal place area (PPA), exhibited neural sensitivity to scenes that had closed vs. open spatial boundaries (Kravitz et al., 2011; Park et al., 2011). We asked whether this sensitivity reflected the openness of landscape (e.g., forest vs. beach), or the openness of the environment immediately surrounding the observer (i.e., whether a scene was viewed from inside vs. outside a room). Across two human fMRI experiments, we found that the PPA, as well as another well-known navigation-processing area, the occipital place area (OPA), processed scenes' boundaries according to the observer's space rather than the landscape. Moreover, we found that the PPA's activation pattern was susceptible to manipulations involving mid-level perceptual properties of scenes (e.g., rectilinear pattern of window frames), while the OPA's response was not. Our results have important implications for research in visual scene processing and suggest an important role of an observer's location in representing the spatial boundary, beyond the low-level visual input of a landscape.
Collapse
Affiliation(s)
| | - Soojin Park
- Department of Psychology, Yonsei University, Seoul, South Korea.
| |
Collapse
|
44
|
Avberšek LK, Zeman A, Op de Beeck H. Training for object recognition with increasing spatial frequency: A comparison of deep learning with human vision. J Vis 2021; 21:14. [PMID: 34533580 PMCID: PMC8458991 DOI: 10.1167/jov.21.10.14] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open
Abstract
The ontogenetic development of human vision and the real-time neural processing of visual input exhibit a striking similarity—a sensitivity toward spatial frequencies that progresses in a coarse-to-fine manner. During early human development, sensitivity for higher spatial frequencies increases with age. In adulthood, when humans receive new visual input, low spatial frequencies are typically processed first before subsequent processing of higher spatial frequencies. We investigated to what extent this coarse-to-fine progression might impact visual representations in artificial vision and compared this to adult human representations. We simulated the coarse-to-fine progression of image processing in deep convolutional neural networks (CNNs) by gradually increasing spatial frequency information during training. We compared CNN performance after standard and coarse-to-fine training with a wide range of datasets from behavioral and neuroimaging experiments. In contrast to humans, CNNs that are trained using the standard protocol are very insensitive to low spatial frequency information, showing very poor performance in being able to classify such object images. By training CNNs using our coarse-to-fine method, we improved the classification accuracy of CNNs from 0% to 32% on low-pass-filtered images taken from the ImageNet dataset. The coarse-to-fine training also made the CNNs more sensitive to low spatial frequencies in hybrid images with conflicting information in different frequency bands. When comparing differently trained networks on images containing full spatial frequency information, we saw no representational differences. Overall, this integration of computational, neural, and behavioral findings shows the relevance of the exposure to and processing of inputs with variation in spatial frequency content for some aspects of high-level object representations.
Collapse
Affiliation(s)
- Lev Kiar Avberšek
- Department of Brain and Cognition, Leuven Brain Institute, Faculty of Psychology & Educational Sciences, KU Leuven, Leuven, Belgium.,Department of Psychology, Faculty of Arts, University of Ljubljana, Ljubljana, Slovenia.,
| | - Astrid Zeman
- Department of Brain and Cognition, Leuven Brain Institute, Faculty of Psychology & Educational Sciences, KU Leuven, Leuven, Belgium.,
| | - Hans Op de Beeck
- Department of Brain and Cognition, Leuven Brain Institute, Faculty of Psychology & Educational Sciences, KU Leuven, Leuven, Belgium.,
| |
Collapse
|
45
|
Luo F, Li X, Gao BB, Yan J. A Coarse-to-Fine Instance Segmentation Network with Learning Boundary Representation. 2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) 2021. [DOI: 10.1109/ijcnn52387.2021.9533399] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/01/2023]
|
46
|
Flexible time course of spatial frequency use during scene categorization. Sci Rep 2021; 11:14079. [PMID: 34234183 PMCID: PMC8263560 DOI: 10.1038/s41598-021-93252-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2021] [Accepted: 06/22/2021] [Indexed: 11/09/2022] Open
Abstract
Human observers can quickly and accurately categorize scenes. This remarkable ability is related to the usage of information at different spatial frequencies (SFs) following a coarse-to-fine pattern: Low SFs, conveying coarse layout information, are thought to be used earlier than high SFs, representing more fine-grained information. Alternatives to this pattern have rarely been considered. Here, we probed all possible SF usage strategies randomly with high resolution in both the SF and time dimensions at two categorization levels. We show that correct basic-level categorizations of indoor scenes are linked to the sampling of relatively high SFs, whereas correct outdoor scene categorizations are predicted by an early use of high SFs and a later use of low SFs (fine-to-coarse pattern of SF usage). Superordinate-level categorizations (indoor vs. outdoor scenes) rely on lower SFs early on, followed by a shift to higher SFs and a subsequent shift back to lower SFs in late stages. In summary, our results show no consistent pattern of SF usage across tasks and only partially replicate the diagnostic SFs found in previous studies. We therefore propose that SF sampling strategies of observers differ with varying stimulus and task characteristics, thus favouring the notion of flexible SF usage.
Collapse
|
47
|
Kim C, Chong SC. Partial awareness can be induced by independent cognitive access to different spatial frequencies. Cognition 2021; 212:104692. [PMID: 33773425 DOI: 10.1016/j.cognition.2021.104692] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2020] [Revised: 03/17/2021] [Accepted: 03/18/2021] [Indexed: 01/15/2023]
Abstract
Partial awareness-an intermediate state between complete consciousness and unconsciousness-has been explained by independent cognitive access to different levels of representation in hierarchical visual processing. This account, however, cannot explain graded visual experiences in low levels. We aimed to explain partial awareness in low levels of visual processing by independent cognitive access to different spatial frequencies. To observe partial awareness stably, we used a novel method. Stimuli were presented briefly (12 ms) and repeatedly with a specific inter-stimulus interval, ranging from 0 to 235 ms. By using various stimuli containing high and low spatial frequencies (superimposed sinusoidal gratings, Navon letters, and scenes), we found that conscious percept was degraded with increasing inter-stimulus intervals. However, the degree of degradation was smaller for low spatial frequency than for high spatial frequency information. Our results reveal that cognitive access to different spatial frequencies can occur independently and this can explain partial awareness in low levels of visual processing.
Collapse
Affiliation(s)
- Cheongil Kim
- Graduate Program in Cognitive Science, Yonsei University, South Korea
| | - Sang Chul Chong
- Graduate Program in Cognitive Science, Department of Psychology, Yonsei University, South Korea.
| |
Collapse
|
48
|
Fernandes TP, Almeida NL, Silva GM, Santos NA. Nicotine gum enhances visual processing in healthy nonsmokers. Brain Imaging Behav 2021; 15:2593-2605. [PMID: 33675460 DOI: 10.1007/s11682-021-00461-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/04/2021] [Indexed: 10/22/2022]
Abstract
OBJECTIVE The main purpose of this study was to investigate the isolated effects of nicotine on visual processing, namely contrast processing. METHODS Thirteen participants, aged 18-40 years, were enrolled in this double blind, randomized and pilot controlled trial involving nicotine gum administration (placebo, 2-mg and 4-mg doses). The participants' instruction was to detect the location of vertical gratings (0.2; 1.0; 3.3; 5.7; 8.8; 13.2 and 15.9 cycles per degree) when it was presented either left or right on the monitor screen. A repeated multivariate analysis of variance was conducted to analyse the results for the visual processing tasks. Bayesian analyses were also carried out considering maximum robustness to avoid bias. RESULTS The findings that nicotine gum administration resulted in better contrast discrimination when compared to placebo gum (p < .001). More specifically, the 4-mg resulted in better visual sensitivity when compared to the 2-mg (p < .01) and the placebo (p < .001) gum. Demographic data were not related to the outcomes. CONCLUSIONS These data bring the need for support the findings. If proved, it is possible that nicotine, in small doses, can have a potential therapeutic use for those populations with low vision. TRIAL REGISTRATION NUMBER RBR-46tjy3.
Collapse
Affiliation(s)
- Thiago P Fernandes
- Federal University of Paraiba, Joao Pessoa, Brazil. .,Perception, Neuroscience and Behaviour Laboratory, Federal University of Paraiba, Joao Pessoa, Brazil.
| | - Natalia L Almeida
- Federal University of Paraiba, Joao Pessoa, Brazil. .,Perception, Neuroscience and Behaviour Laboratory, Federal University of Paraiba, Joao Pessoa, Brazil.
| | - Gabriella M Silva
- Federal University of Paraiba, Joao Pessoa, Brazil.,Perception, Neuroscience and Behaviour Laboratory, Federal University of Paraiba, Joao Pessoa, Brazil
| | - Natanael A Santos
- Federal University of Paraiba, Joao Pessoa, Brazil.,Perception, Neuroscience and Behaviour Laboratory, Federal University of Paraiba, Joao Pessoa, Brazil
| |
Collapse
|
49
|
Cheng A, Walther DB, Park S, Dilks DD. Concavity as a diagnostic feature of visual scenes. Neuroimage 2021; 232:117920. [PMID: 33652147 PMCID: PMC8256888 DOI: 10.1016/j.neuroimage.2021.117920] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2020] [Revised: 02/07/2021] [Accepted: 02/21/2021] [Indexed: 11/29/2022] Open
Abstract
Despite over two decades of research on the neural mechanisms underlying human visual scene, or place, processing, it remains unknown what exactly a “scene” is. Intuitively, we are always inside a scene, while interacting with the outside of objects. Hence, we hypothesize that one diagnostic feature of a scene may be concavity, portraying “inside”, and predict that if concavity is a scene-diagnostic feature, then: 1) images that depict concavity, even non-scene images (e.g., the “inside” of an object – or concave object), will be behaviorally categorized as scenes more often than those that depict convexity, and 2) the cortical scene-processing system will respond more to concave images than to convex images. As predicted, participants categorized concave objects as scenes more often than convex objects, and, using functional magnetic resonance imaging (fMRI), two scene-selective cortical regions (the parahippocampal place area, PPA, and the occipital place area, OPA) responded significantly more to concave than convex objects. Surprisingly, we found no behavioral or neural differences between images of concave versus convex buildings. However, in a follow-up experiment, using tightly-controlled images, we unmasked a selective sensitivity to concavity over convexity of scene boundaries (i.e., walls) in PPA and OPA. Furthermore, we found that even highly impoverished line drawings of concave shapes are behaviorally categorized as scenes more often than convex shapes. Together, these results provide converging behavioral and neural evidence that concavity is a diagnostic feature of visual scenes.
Collapse
Affiliation(s)
- Annie Cheng
- Department of Psychology, Emory University, Atlanta, GA 30322, USA
| | - Dirk B Walther
- Department of Psychology, University of Toronto, Toronto, ON, Canada
| | - Soojin Park
- Department of Psychology, Yonsei University, Seoul, Republic of Korea.
| | - Daniel D Dilks
- Department of Psychology, Emory University, Atlanta, GA 30322, USA.
| |
Collapse
|
50
|
Peyrin C, Roux-Sibilon A, Trouilloud A, Khazaz S, Joly M, Pichat C, Boucart M, Krainik A, Kauffmann L. Semantic and Physical Properties of Peripheral Vision Are Used for Scene Categorization in Central Vision. J Cogn Neurosci 2021; 33:799-813. [PMID: 33571079 DOI: 10.1162/jocn_a_01689] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]
Abstract
Theories of visual recognition postulate that our ability to understand our visual environment at a glance is based on the extraction of the gist of the visual scene, a first global and rudimentary visual representation. Gist perception would be based on the rapid analysis of low spatial frequencies in the visual signal and would allow a coarse categorization of the scene. We aimed to study whether the low spatial resolution information available in peripheral vision could modulate the processing of visual information presented in central vision. We combined behavioral measures (Experiments 1 and 2) and fMRI measures (Experiment 2). Participants categorized a scene presented in central vision (artificial vs. natural categories) while ignoring another scene, either semantically congruent or incongruent, presented in peripheral vision. The two scenes could either share the same physical properties (similar amplitude spectrum and spatial configuration) or not. Categorization of the central scene was impaired by a semantically incongruent peripheral scene, in particular when the two scenes were physically similar. This semantic interference effect was associated with increased activation of the inferior frontal gyrus. When the two scenes were semantically congruent, the dissimilarity of their physical properties impaired the categorization of the central scene. This effect was associated with increased activation in occipito-temporal areas. In line with the hypothesis of predictive mechanisms involved in visual recognition, results suggest that semantic and physical properties of the information coming from peripheral vision would be automatically used to generate predictions that guide the processing of signal in central vision.
Collapse
|