1
|
Stecher R, Cichy RM, Kaiser D. Decoding the rhythmic representation and communication of visual contents. Trends Neurosci 2025:S0166-2236(24)00248-0. [PMID: 39818499 DOI: 10.1016/j.tins.2024.12.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2024] [Revised: 11/18/2024] [Accepted: 12/11/2024] [Indexed: 01/18/2025]
Abstract
Rhythmic neural activity is considered essential for adaptively modulating responses in the visual system. In this opinion article we posit that visual brain rhythms also serve a key function in the representation and communication of visual contents. Collating a set of recent studies that used multivariate decoding methods on rhythmic brain signals, we highlight such rhythmic content representations in visual perception, imagery, and prediction. We argue that characterizing representations across frequency bands allows researchers to elegantly disentangle content transfer in feedforward and feedback directions. We further propose that alpha dynamics are central to content-specific feedback propagation in the visual system. We conclude that considering rhythmic content codes is pivotal for understanding information coding in vision and beyond.
Collapse
Affiliation(s)
- Rico Stecher
- Neural Computation Group, Department of Mathematics and Computer Science, Physics, Geography, Justus-Liebig-Universität Gießen, Gießen 35392, Germany.
| | - Radoslaw Martin Cichy
- Department of Education and Psychology, Freie Universität Berlin, Berlin 14195, Germany
| | - Daniel Kaiser
- Neural Computation Group, Department of Mathematics and Computer Science, Physics, Geography, Justus-Liebig-Universität Gießen, Gießen 35392, Germany; Center for Mind, Brain and Behavior (CMBB), Philipps-Universität Marburg, Justus-Liebig-Universität Gießen & Technische Universität Darmstadt, Marburg 35032, Germany.
| |
Collapse
|
2
|
Li B, Zhang S. Registered Report Stage II: Decoding the category information from evoked potentials to visible and invisible visual objects. Int J Psychophysiol 2024; 205:112446. [PMID: 39389167 DOI: 10.1016/j.ijpsycho.2024.112446] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2024] [Revised: 09/18/2024] [Accepted: 10/07/2024] [Indexed: 10/12/2024]
Abstract
Previous studies that use decoding methods and EEG to investigate the neural representation of the category information of visual objects focused mainly on consciously processed visual objects. It remains unclear whether the category information of unconsciously processed visual objects can be decoded and whether the decoding performance is different for consciously and unconsciously processed visual objects. The present study compared the neural decoding of the animacy category of visible and invisible visual objects via EEG and decoding methods. The results revealed that the animacy of visible visual objects could be decoded above the chance level by the P200, N300, and N400, but not by the early N/P100. However, the animacy of invisible visual objects could not be decoded above the chance level by neither early nor late ERP components. The decoding accuracy was greater for visible visual objects than that for invisible visual objects for the P200, N300 and N400. These results suggested that access to animacy category information for visual objects requires conscious processing.
Collapse
Affiliation(s)
- Bingbing Li
- School of Education Science, Jiangsu Normal University, Xuzhou, Jiangsu, China.
| | - Shuhui Zhang
- School of Education Science, Jiangsu Normal University, Xuzhou, Jiangsu, China.
| |
Collapse
|
3
|
Bezsudnova Y, Quinn AJ, Wynn SC, Jensen O. Spatiotemporal Properties of Common Semantic Categories for Words and Pictures. J Cogn Neurosci 2024; 36:1760-1769. [PMID: 38739567 DOI: 10.1162/jocn_a_02182] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/16/2024]
Abstract
The timing of semantic processing during object recognition in the brain is a topic of ongoing discussion. One way of addressing this question is by applying multivariate pattern analysis to human electrophysiological responses to object images of different semantic categories. However, although multivariate pattern analysis can reveal whether neuronal activity patterns are distinct for different stimulus categories, concerns remain on whether low-level visual features also contribute to the classification results. To circumvent this issue, we applied a cross-decoding approach to magnetoencephalography data from stimuli from two different modalities: images and their corresponding written words. We employed items from three categories and presented them in a randomized order. We show that if the classifier is trained on words, pictures are classified between 150 and 430 msec after stimulus onset, and when training on pictures, words are classified between 225 and 430 msec. The topographical map, identified using a searchlight approach for cross-modal activation in both directions, showed left lateralization, confirming the involvement of linguistic representations. These results point to semantic activation of pictorial stimuli occurring at ∼150 msec, whereas for words, the semantic activation occurs at ∼230 msec.
Collapse
Affiliation(s)
| | | | - Syanah C Wynn
- University of Birmingham
- Gutenberg University Medical Center Mainz
| | | |
Collapse
|
4
|
Ghazaryan G, van Vliet M, Lammi L, Lindh-Knuutila T, Kivisaari S, Hultén A, Salmelin R. Cortical time-course of evidence accumulation during semantic processing. Commun Biol 2023; 6:1242. [PMID: 38066098 PMCID: PMC10709650 DOI: 10.1038/s42003-023-05611-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2022] [Accepted: 11/20/2023] [Indexed: 12/18/2023] Open
Abstract
Our understanding of the surrounding world and communication with other people are tied to mental representations of concepts. In order for the brain to recognize an object, it must determine which concept to access based on information available from sensory inputs. In this study, we combine magnetoencephalography and machine learning to investigate how concepts are represented and accessed in the brain over time. Using brain responses from a silent picture naming task, we track the dynamics of visual and semantic information processing, and show that the brain gradually accumulates information on different levels before eventually reaching a plateau. The timing of this plateau point varies across individuals and feature models, indicating notable temporal variation in visual object recognition and semantic processing.
Collapse
Affiliation(s)
- Gayane Ghazaryan
- Department of Neuroscience and Biomedical Engineering, Aalto University, P.O. Box 12200, FI-00076, Aalto, Finland.
| | - Marijn van Vliet
- Department of Neuroscience and Biomedical Engineering, Aalto University, P.O. Box 12200, FI-00076, Aalto, Finland
| | - Lotta Lammi
- Department of Neuroscience and Biomedical Engineering, Aalto University, P.O. Box 12200, FI-00076, Aalto, Finland
| | - Tiina Lindh-Knuutila
- Department of Neuroscience and Biomedical Engineering, Aalto University, P.O. Box 12200, FI-00076, Aalto, Finland
| | - Sasa Kivisaari
- Department of Neuroscience and Biomedical Engineering, Aalto University, P.O. Box 12200, FI-00076, Aalto, Finland
| | - Annika Hultén
- Department of Neuroscience and Biomedical Engineering, Aalto University, P.O. Box 12200, FI-00076, Aalto, Finland
- Aalto NeuroImaging, Aalto University, P.O. Box 12200, Aalto, FI-00076, Finland
| | - Riitta Salmelin
- Department of Neuroscience and Biomedical Engineering, Aalto University, P.O. Box 12200, FI-00076, Aalto, Finland
- Aalto NeuroImaging, Aalto University, P.O. Box 12200, Aalto, FI-00076, Finland
| |
Collapse
|
5
|
Newell FN, McKenna E, Seveso MA, Devine I, Alahmad F, Hirst RJ, O'Dowd A. Multisensory perception constrains the formation of object categories: a review of evidence from sensory-driven and predictive processes on categorical decisions. Philos Trans R Soc Lond B Biol Sci 2023; 378:20220342. [PMID: 37545304 PMCID: PMC10404931 DOI: 10.1098/rstb.2022.0342] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2023] [Accepted: 06/29/2023] [Indexed: 08/08/2023] Open
Abstract
Although object categorization is a fundamental cognitive ability, it is also a complex process going beyond the perception and organization of sensory stimulation. Here we review existing evidence about how the human brain acquires and organizes multisensory inputs into object representations that may lead to conceptual knowledge in memory. We first focus on evidence for two processes on object perception, multisensory integration of redundant information (e.g. seeing and feeling a shape) and crossmodal, statistical learning of complementary information (e.g. the 'moo' sound of a cow and its visual shape). For both processes, the importance attributed to each sensory input in constructing a multisensory representation of an object depends on the working range of the specific sensory modality, the relative reliability or distinctiveness of the encoded information and top-down predictions. Moreover, apart from sensory-driven influences on perception, the acquisition of featural information across modalities can affect semantic memory and, in turn, influence category decisions. In sum, we argue that both multisensory processes independently constrain the formation of object categories across the lifespan, possibly through early and late integration mechanisms, respectively, to allow us to efficiently achieve the everyday, but remarkable, ability of recognizing objects. This article is part of the theme issue 'Decision and control processes in multisensory perception'.
Collapse
Affiliation(s)
- F. N. Newell
- School of Psychology and Institute of Neuroscience, Trinity College Dublin, College Green, Dublin D02 PN40, Ireland
| | - E. McKenna
- School of Psychology and Institute of Neuroscience, Trinity College Dublin, College Green, Dublin D02 PN40, Ireland
| | - M. A. Seveso
- School of Psychology and Institute of Neuroscience, Trinity College Dublin, College Green, Dublin D02 PN40, Ireland
| | - I. Devine
- School of Psychology and Institute of Neuroscience, Trinity College Dublin, College Green, Dublin D02 PN40, Ireland
| | - F. Alahmad
- School of Psychology and Institute of Neuroscience, Trinity College Dublin, College Green, Dublin D02 PN40, Ireland
| | - R. J. Hirst
- School of Psychology and Institute of Neuroscience, Trinity College Dublin, College Green, Dublin D02 PN40, Ireland
| | - A. O'Dowd
- School of Psychology and Institute of Neuroscience, Trinity College Dublin, College Green, Dublin D02 PN40, Ireland
| |
Collapse
|
6
|
Kabulska Z, Lingnau A. The cognitive structure underlying the organization of observed actions. Behav Res Methods 2023; 55:1890-1906. [PMID: 35788973 PMCID: PMC10250259 DOI: 10.3758/s13428-022-01894-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/26/2022] [Indexed: 11/08/2022]
Abstract
In daily life, we frequently encounter actions performed by other people. Here we aimed to examine the key categories and features underlying the organization of a wide range of actions in three behavioral experiments (N = 378 participants). In Experiment 1, we used a multi-arrangement task of 100 different actions. Inverse multidimensional scaling and hierarchical clustering revealed 11 action categories, including Locomotion, Communication, and Aggressive actions. In Experiment 2, we used a feature-listing paradigm to obtain a wide range of action features that were subsequently reduced to 59 key features and used in a rating study (Experiment 3). A direct comparison of the feature ratings obtained in Experiment 3 between actions belonging to the categories identified in Experiment 1 revealed a number of features that appear to be critical for the distinction between these categories, e.g., the features Harm and Noise for the category Aggressive actions, and the features Targeting a person and Contact with others for the category Interaction. Finally, we found that a part of the category-based organization is explained by a combination of weighted features, whereas a significant proportion of variability remained unexplained, suggesting that there are additional sources of information that contribute to the categorization of observed actions. The characterization of action categories and their associated features serves as an important extension of previous studies examining the cognitive structure of actions. Moreover, our results may serve as the basis for future behavioral, neuroimaging and computational modeling studies.
Collapse
Affiliation(s)
- Zuzanna Kabulska
- Department of Psychology, Faculty of Human Sciences, University of Regensburg, Universitätsstraße 31, 93053, Regensburg, Germany
| | - Angelika Lingnau
- Department of Psychology, Faculty of Human Sciences, University of Regensburg, Universitätsstraße 31, 93053, Regensburg, Germany.
| |
Collapse
|
7
|
Gusein-zade NG, Slezkin AA, Allahyarov E. Statistical processing of time slices of electroencephalography signals during brain reaction to visual stimuli. Biomed Signal Process Control 2023. [DOI: 10.1016/j.bspc.2023.104656] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/10/2023]
|
8
|
Taubert J, Wardle SG, Tardiff CT, Patterson A, Yu D, Baker CI. Clutter Substantially Reduces Selectivity for Peripheral Faces in the Macaque Brain. J Neurosci 2022; 42:6739-6750. [PMID: 35868861 PMCID: PMC9436017 DOI: 10.1523/jneurosci.0232-22.2022] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2022] [Revised: 04/29/2022] [Accepted: 06/06/2022] [Indexed: 11/21/2022] Open
Abstract
According to a prominent view in neuroscience, visual stimuli are coded by discrete cortical networks that respond preferentially to specific categories, such as faces or objects. However, it remains unclear how these category-selective networks respond when viewing conditions are cluttered, i.e., when there is more than one stimulus in the visual field. Here, we asked three questions: (1) Does clutter reduce the response and selectivity for faces as a function of retinal location? (2) Is the preferential response to faces uniform across the visual field? And (3) Does the ventral visual pathway encode information about the location of cluttered faces? We used fMRI to measure the response of the face-selective network in awake, fixating macaques (two female, five male). Across a series of four experiments, we manipulated the presence and absence of clutter, as well as the location of the faces relative to the fovea. We found that clutter reduces the response to peripheral faces. When presented in isolation, without clutter, the selectivity for faces is fairly uniform across the visual field, but, when clutter is present, there is a marked decrease in the selectivity for peripheral faces. We also found no evidence of a contralateral visual field bias when faces were presented in clutter. Nonetheless, multivariate analyses revealed that the location of cluttered faces could be decoded from the multivoxel response of the face-selective network. Collectively, these findings demonstrate that clutter blunts the selectivity of the face-selective network to peripheral faces, although information about their retinal location is retained.SIGNIFICANCE STATEMENT Numerous studies that have measured brain activity in macaques have found visual regions that respond preferentially to faces. Although these regions are thought to be essential for social behavior, their responses have typically been measured while faces were presented in isolation, a situation atypical of the real world. How do these regions respond when faces are presented with other stimuli? We report that, when clutter is present, the preferential response to foveated faces is spared but preferential response to peripheral faces is reduced. Our results indicate that the presence of clutter changes the response of the face-selective network.
Collapse
Affiliation(s)
- Jessica Taubert
- Laboratory of Brain and Cognition, National Institute of Mental Health, Bethesda, Maryland 20814
- School of Psychology, The University of Queensland, Brisbane, Queensland 4072, Australia
| | - Susan G Wardle
- Laboratory of Brain and Cognition, National Institute of Mental Health, Bethesda, Maryland 20814
| | - Clarissa T Tardiff
- Laboratory of Brain and Cognition, National Institute of Mental Health, Bethesda, Maryland 20814
| | - Amanda Patterson
- Laboratory of Brain and Cognition, National Institute of Mental Health, Bethesda, Maryland 20814
| | - David Yu
- Neurophysiology Imaging Facility, National Institutes of Health, Bethesda, Maryland 20814
| | - Chris I Baker
- Laboratory of Brain and Cognition, National Institute of Mental Health, Bethesda, Maryland 20814
| |
Collapse
|
9
|
Aziz A, Sreeharsha PSS, Natesh R, Chakravarthy VS. An integrated deep learning‐based model of spatial cells that combines self‐motion with sensory information. Hippocampus 2022; 32:716-730. [DOI: 10.1002/hipo.23461] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2021] [Revised: 07/10/2022] [Accepted: 07/16/2022] [Indexed: 11/06/2022]
Affiliation(s)
- Azra Aziz
- Computational Neuroscience Lab Indian Institute of Technology Madras Chennai India
| | | | - Rohan Natesh
- Department of Electronics Engineering Indian Institute of Technology (BHU) Varanasi India
| | - Vaddadhi S. Chakravarthy
- Computational Neuroscience Lab Indian Institute of Technology Madras Chennai India
- Center for Complex Systems and Dynamics Indian Institute of Technology Madras Chennai India
| |
Collapse
|
10
|
Grootswagers T, Zhou I, Robinson AK, Hebart MN, Carlson TA. Human EEG recordings for 1,854 concepts presented in rapid serial visual presentation streams. Sci Data 2022; 9:3. [PMID: 35013331 PMCID: PMC8748587 DOI: 10.1038/s41597-021-01102-7] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2021] [Accepted: 11/03/2021] [Indexed: 01/07/2023] Open
Abstract
The neural basis of object recognition and semantic knowledge has been extensively studied but the high dimensionality of object space makes it challenging to develop overarching theories on how the brain organises object knowledge. To help understand how the brain allows us to recognise, categorise, and represent objects and object categories, there is a growing interest in using large-scale image databases for neuroimaging experiments. In the current paper, we present THINGS-EEG, a dataset containing human electroencephalography responses from 50 subjects to 1,854 object concepts and 22,248 images in the THINGS stimulus set, a manually curated and high-quality image database that was specifically designed for studying human vision. The THINGS-EEG dataset provides neuroimaging recordings to a systematic collection of objects and concepts and can therefore support a wide array of research to understand visual object processing in the human brain.
Collapse
Affiliation(s)
- Tijl Grootswagers
- The MARCS Institute for Brain, Behaviour and Development, Western Sydney University, Sydney, Australia.
- School of Psychology, The University of Sydney, Sydney, Australia.
| | - Ivy Zhou
- School of Psychology, The University of Sydney, Sydney, Australia
| | | | - Martin N Hebart
- Vision and Computational Cognition Group, Max Planck Institute for Human Cognitive and Brain Sciences, Leipzig, Germany
| | - Thomas A Carlson
- School of Psychology, The University of Sydney, Sydney, Australia
| |
Collapse
|
11
|
Wu C, Ferreira F, Fox M, Harel N, Hattangadi-Gluth J, Horn A, Jbabdi S, Kahan J, Oswal A, Sheth SA, Tie Y, Vakharia V, Zrinzo L, Akram H. Clinical applications of magnetic resonance imaging based functional and structural connectivity. Neuroimage 2021; 244:118649. [PMID: 34648960 DOI: 10.1016/j.neuroimage.2021.118649] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2021] [Revised: 09/24/2021] [Accepted: 10/10/2021] [Indexed: 12/23/2022] Open
Abstract
Advances in computational neuroimaging techniques have expanded the armamentarium of imaging tools available for clinical applications in clinical neuroscience. Non-invasive, in vivo brain MRI structural and functional network mapping has been used to identify therapeutic targets, define eloquent brain regions to preserve, and gain insight into pathological processes and treatments as well as prognostic biomarkers. These tools have the real potential to inform patient-specific treatment strategies. Nevertheless, a realistic appraisal of clinical utility is needed that balances the growing excitement and interest in the field with important limitations associated with these techniques. Quality of the raw data, minutiae of the processing methodology, and the statistical models applied can all impact on the results and their interpretation. A lack of standardization in data acquisition and processing has also resulted in issues with reproducibility. This limitation has had a direct impact on the reliability of these tools and ultimately, confidence in their clinical use. Advances in MRI technology and computational power as well as automation and standardization of processing methods, including machine learning approaches, may help address some of these issues and make these tools more reliable in clinical use. In this review, we will highlight the current clinical uses of MRI connectomics in the diagnosis and treatment of neurological disorders; balancing emerging applications and technologies with limitations of connectivity analytic approaches to present an encompassing and appropriate perspective.
Collapse
Affiliation(s)
- Chengyuan Wu
- Department of Neurological Surgery, Vickie and Jack Farber Institute for Neuroscience, Thomas Jefferson University, 909 Walnut Street, Third Floor, Philadelphia, PA 19107, USA; Jefferson Integrated Magnetic Resonance Imaging Center, Department of Radiology, Thomas Jefferson University, 909 Walnut Street, First Floor, Philadelphia, PA 19107, USA.
| | - Francisca Ferreira
- Victor Horsley Department of Neurosurgery, National Hospital for Neurology and Neurosurgery, 33 Queen Square, London WC1N 3BG, UK; Unit of Functional Neurosurgery, UCL Queen Square Institute of Neurology, 33 Queen Square, London WC1N 3BG, UK.
| | - Michael Fox
- Center for Brain Circuit Therapeutics, Departments of Neurology, Psychiatry, Radiology, and Neurosurgery, Brigham and Women's Hospital, Harvard Medical School, 60 Fenwood Road, Boston, MA 02115, USA.
| | - Noam Harel
- Center for Magnetic Resonance Research, University of Minnesota, 2021 Sixth Street S.E., Minneapolis, MN 55455, USA.
| | - Jona Hattangadi-Gluth
- Department of Radiation Medicine and Applied Sciences, Center for Precision Radiation Medicine, University of California, San Diego, 3855 Health Sciences Drive, La Jolla, CA 92037, USA.
| | - Andreas Horn
- Neurology Department, Movement Disorders and Neuromodulation Section, Charité - University Medicine Berlin, Charitéplatz 1, D-10117, Berlin, Germany.
| | - Saad Jbabdi
- Wellcome Centre for Integrative Neuroimaging, Centre for Functional MRI of the Brain, Nuffield Department of Clinical Neurosciences, John Radcliffe Hospital, University of Oxford, Oxford OX3 9DU, UK.
| | - Joshua Kahan
- Department of Neurology, Weill Cornell Medicine, 525 East 68th Street, New York, NY, 10065, USA.
| | - Ashwini Oswal
- Medical Research Council Brain Network Dynamics Unit, University of Oxford, Mansfield Rd, Oxford OX1 3TH, UK.
| | - Sameer A Sheth
- Department of Neurosurgery, Baylor College of Medicine, 7200 Cambridge, Ninth Floor, Houston, TX 77030, USA.
| | - Yanmei Tie
- Center for Brain Circuit Therapeutics, Departments of Neurology, Psychiatry, Radiology, and Neurosurgery, Brigham and Women's Hospital, Harvard Medical School, 60 Fenwood Road, Boston, MA 02115, USA; Department of Neurosurgery, Brigham and Women's Hospital, Harvard Medical School, 60 Fenwood Road, Boston, MA 02115, USA.
| | - Vejay Vakharia
- Victor Horsley Department of Neurosurgery, National Hospital for Neurology and Neurosurgery, 33 Queen Square, London WC1N 3BG, UK.
| | - Ludvic Zrinzo
- Victor Horsley Department of Neurosurgery, National Hospital for Neurology and Neurosurgery, 33 Queen Square, London WC1N 3BG, UK; Unit of Functional Neurosurgery, UCL Queen Square Institute of Neurology, 33 Queen Square, London WC1N 3BG, UK.
| | - Harith Akram
- Victor Horsley Department of Neurosurgery, National Hospital for Neurology and Neurosurgery, 33 Queen Square, London WC1N 3BG, UK; Unit of Functional Neurosurgery, UCL Queen Square Institute of Neurology, 33 Queen Square, London WC1N 3BG, UK.
| |
Collapse
|
12
|
Grootswagers T, Robinson AK. Overfitting the Literature to One Set of Stimuli and Data. Front Hum Neurosci 2021; 15:682661. [PMID: 34305552 PMCID: PMC8295535 DOI: 10.3389/fnhum.2021.682661] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2021] [Accepted: 06/16/2021] [Indexed: 12/02/2022] Open
Abstract
A large number of papers in Computational Cognitive Neuroscience are developing and testing novel analysis methods using one specific neuroimaging dataset and problematic experimental stimuli. Publication bias and confirmatory exploration will result in overfitting to the limited available data. We highlight the problems with this specific dataset and argue for the need to collect more good quality open neuroimaging data using a variety of experimental stimuli, in order to test the generalisability of current published results, and allow for more robust results in future work.
Collapse
Affiliation(s)
- Tijl Grootswagers
- The MARCS Institute for Brain, Behaviour and Development, Sydney, NSW, Australia.,School of Psychology, Western Sydney University, Sydney, NSW, Australia.,School of Psychology, University of Sydney, Sydney, NSW, Australia
| | | |
Collapse
|
13
|
De Cesarei A, Cavicchi S, Cristadoro G, Lippi M. Do Humans and Deep Convolutional Neural Networks Use Visual Information Similarly for the Categorization of Natural Scenes? Cogn Sci 2021; 45:e13009. [PMID: 34170027 PMCID: PMC8365760 DOI: 10.1111/cogs.13009] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2020] [Revised: 05/19/2021] [Accepted: 05/31/2021] [Indexed: 11/28/2022]
Abstract
The investigation of visual categorization has recently been aided by the introduction of deep convolutional neural networks (CNNs), which achieve unprecedented accuracy in picture classification after extensive training. Even if the architecture of CNNs is inspired by the organization of the visual brain, the similarity between CNN and human visual processing remains unclear. Here, we investigated this issue by engaging humans and CNNs in a two-class visual categorization task. To this end, pictures containing animals or vehicles were modified to contain only low/high spatial frequency (HSF) information, or were scrambled in the phase of the spatial frequency spectrum. For all types of degradation, accuracy increased as degradation was reduced for both humans and CNNs; however, the thresholds for accurate categorization varied between humans and CNNs. More remarkable differences were observed for HSF information compared to the other two types of degradation, both in terms of overall accuracy and image-level agreement between humans and CNNs. The difficulty with which the CNNs were shown to categorize high-passed natural scenes was reduced by picture whitening, a procedure which is inspired by how visual systems process natural images. The results are discussed concerning the adaptation to regularities in the visual environment (scene statistics); if the visual characteristics of the environment are not learned by CNNs, their visual categorization may depend only on a subset of the visual information on which humans rely, for example, on low spatial frequency information.
Collapse
Affiliation(s)
| | | | | | - Marco Lippi
- Department of Sciences and Methods for EngineeringUniversity of Modena and Reggio Emilia
| |
Collapse
|