1
|
Shain C, Kean H, Casto C, Lipkin B, Affourtit J, Siegelman M, Mollica F, Fedorenko E. Distributed Sensitivity to Syntax and Semantics throughout the Language Network. J Cogn Neurosci 2024; 36:1427-1471. [PMID: 38683732 DOI: 10.1162/jocn_a_02164] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/02/2024]
Abstract
Human language is expressive because it is compositional: The meaning of a sentence (semantics) can be inferred from its structure (syntax). It is commonly believed that language syntax and semantics are processed by distinct brain regions. Here, we revisit this claim using precision fMRI methods to capture separation or overlap of function in the brains of individual participants. Contrary to prior claims, we find distributed sensitivity to both syntax and semantics throughout a broad frontotemporal brain network. Our results join a growing body of evidence for an integrated network for language in the human brain within which internal specialization is primarily a matter of degree rather than kind, in contrast with influential proposals that advocate distinct specialization of different brain areas for different types of linguistic functions.
Collapse
Affiliation(s)
| | - Hope Kean
- Massachusetts Institute of Technology
| | | | | | | | | | | | | |
Collapse
|
2
|
Webb TW, Frankland SM, Altabaa A, Segert S, Krishnamurthy K, Campbell D, Russin J, Giallanza T, O'Reilly R, Lafferty J, Cohen JD. The relational bottleneck as an inductive bias for efficient abstraction. Trends Cogn Sci 2024:S1364-6613(24)00080-9. [PMID: 38729852 DOI: 10.1016/j.tics.2024.04.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2023] [Revised: 03/29/2024] [Accepted: 04/01/2024] [Indexed: 05/12/2024]
Abstract
A central challenge for cognitive science is to explain how abstract concepts are acquired from limited experience. This has often been framed in terms of a dichotomy between connectionist and symbolic cognitive models. Here, we highlight a recently emerging line of work that suggests a novel reconciliation of these approaches, by exploiting an inductive bias that we term the relational bottleneck. In that approach, neural networks are constrained via their architecture to focus on relations between perceptual inputs, rather than the attributes of individual inputs. We review a family of models that employ this approach to induce abstractions in a data-efficient manner, emphasizing their potential as candidate models for the acquisition of abstract concepts in the human mind and brain.
Collapse
|
3
|
Murphy E, Holmes E, Friston K. Natural language syntax complies with the free-energy principle. SYNTHESE 2024; 203:154. [PMID: 38706520 PMCID: PMC11068586 DOI: 10.1007/s11229-024-04566-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/09/2023] [Accepted: 03/15/2024] [Indexed: 05/07/2024]
Abstract
Natural language syntax yields an unbounded array of hierarchically structured expressions. We claim that these are used in the service of active inference in accord with the free-energy principle (FEP). While conceptual advances alongside modelling and simulation work have attempted to connect speech segmentation and linguistic communication with the FEP, we extend this program to the underlying computations responsible for generating syntactic objects. We argue that recently proposed principles of economy in language design-such as "minimal search" criteria from theoretical syntax-adhere to the FEP. This affords a greater degree of explanatory power to the FEP-with respect to higher language functions-and offers linguistics a grounding in first principles with respect to computability. While we mostly focus on building new principled conceptual relations between syntax and the FEP, we also show through a sample of preliminary examples how both tree-geometric depth and a Kolmogorov complexity estimate (recruiting a Lempel-Ziv compression algorithm) can be used to accurately predict legal operations on syntactic workspaces, directly in line with formulations of variational free energy minimization. This is used to motivate a general principle of language design that we term Turing-Chomsky Compression (TCC). We use TCC to align concerns of linguists with the normative account of self-organization furnished by the FEP, by marshalling evidence from theoretical linguistics and psycholinguistics to ground core principles of efficient syntactic computation within active inference.
Collapse
Affiliation(s)
- Elliot Murphy
- Vivian L. Smith Department of Neurosurgery, McGovern Medical School, University of Texas Health Science Center, Houston, TX 77030 USA
- Texas Institute for Restorative Neurotechnologies, University of Texas Health Science Center, Houston, TX 77030 USA
| | - Emma Holmes
- Department of Speech Hearing and Phonetic Sciences, University College London, London, WC1N 1PF UK
- The Wellcome Centre for Human Neuroimaging, UCL Queen Square Institute of Neurology, London, WC1N 3AR UK
| | - Karl Friston
- The Wellcome Centre for Human Neuroimaging, UCL Queen Square Institute of Neurology, London, WC1N 3AR UK
| |
Collapse
|
4
|
Benjamin L, Sablé-Meyer M, Fló A, Dehaene-Lambertz G, Al Roumi F. Long-Horizon Associative Learning Explains Human Sensitivity to Statistical and Network Structures in Auditory Sequences. J Neurosci 2024; 44:e1369232024. [PMID: 38408873 PMCID: PMC10993028 DOI: 10.1523/jneurosci.1369-23.2024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2023] [Revised: 01/16/2024] [Accepted: 02/07/2024] [Indexed: 02/28/2024] Open
Abstract
Networks are a useful mathematical tool for capturing the complexity of the world. In a previous behavioral study, we showed that human adults were sensitive to the high-level network structure underlying auditory sequences, even when presented with incomplete information. Their performance was best explained by a mathematical model compatible with associative learning principles, based on the integration of the transition probabilities between adjacent and nonadjacent elements with a memory decay. In the present study, we explored the neural correlates of this hypothesis via magnetoencephalography (MEG). Participants (N = 23, 16 females) passively listened to sequences of tones organized in a sparse community network structure comprising two communities. An early difference (∼150 ms) was observed in the brain responses to tone transitions with similar transition probability but occurring either within or between communities. This result implies a rapid and automatic encoding of the sequence structure. Using time-resolved decoding, we estimated the duration and overlap of the representation of each tone. The decoding performance exhibited exponential decay, resulting in a significant overlap between the representations of successive tones. Based on this extended decay profile, we estimated a long-horizon associative learning novelty index for each transition and found a correlation of this measure with the MEG signal. Overall, our study sheds light on the neural mechanisms underlying human sensitivity to network structures and highlights the potential role of Hebbian-like mechanisms in supporting learning at various temporal scales.
Collapse
Affiliation(s)
- Lucas Benjamin
- Cognitive Neuroimaging Unit, CNRS ERL 9003, INSERM U992, CEA, Université Paris-Saclay, NeuroSpin Center, 91190 Gif/Yvette, France
| | - Mathias Sablé-Meyer
- Cognitive Neuroimaging Unit, CNRS ERL 9003, INSERM U992, CEA, Université Paris-Saclay, NeuroSpin Center, 91190 Gif/Yvette, France
- Sainsbury Wellcome Centre for Neural Circuits and Behaviour, University College London, London W1T 4JG, United Kingdom
| | - Ana Fló
- Cognitive Neuroimaging Unit, CNRS ERL 9003, INSERM U992, CEA, Université Paris-Saclay, NeuroSpin Center, 91190 Gif/Yvette, France
- Department of Developmental Psychology and Socialization, University of Padova, Padova 35131, Italy
| | - Ghislaine Dehaene-Lambertz
- Cognitive Neuroimaging Unit, CNRS ERL 9003, INSERM U992, CEA, Université Paris-Saclay, NeuroSpin Center, 91190 Gif/Yvette, France
| | - Fosca Al Roumi
- Cognitive Neuroimaging Unit, CNRS ERL 9003, INSERM U992, CEA, Université Paris-Saclay, NeuroSpin Center, 91190 Gif/Yvette, France
| |
Collapse
|
5
|
Pesnot Lerousseau J, Summerfield C. Space as a scaffold for rotational generalisation of abstract concepts. eLife 2024; 13:RP93636. [PMID: 38568075 PMCID: PMC10990485 DOI: 10.7554/elife.93636] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/05/2024] Open
Abstract
Learning invariances allows us to generalise. In the visual modality, invariant representations allow us to recognise objects despite translations or rotations in physical space. However, how we learn the invariances that allow us to generalise abstract patterns of sensory data ('concepts') is a longstanding puzzle. Here, we study how humans generalise relational patterns in stimulation sequences that are defined by either transitions on a nonspatial two-dimensional feature manifold, or by transitions in physical space. We measure rotational generalisation, i.e., the ability to recognise concepts even when their corresponding transition vectors are rotated. We find that humans naturally generalise to rotated exemplars when stimuli are defined in physical space, but not when they are defined as positions on a nonspatial feature manifold. However, if participants are first pre-trained to map auditory or visual features to spatial locations, then rotational generalisation becomes possible even in nonspatial domains. These results imply that space acts as a scaffold for learning more abstract conceptual invariances.
Collapse
|
6
|
Liu YF, Wilson C, Bedny M. Contribution of the language network to the comprehension of Python programming code. BRAIN AND LANGUAGE 2024; 251:105392. [PMID: 38387220 DOI: 10.1016/j.bandl.2024.105392] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/16/2023] [Revised: 02/08/2024] [Accepted: 02/14/2024] [Indexed: 02/24/2024]
Abstract
Does the perisylvian language network contribute to comprehension of programming languages, like Python? Univariate neuroimaging studies find high responses to code in fronto-parietal executive areas but not in fronto-temporal language areas, suggesting the language network does little. We used multivariate-pattern-analysis to test whether the language network encodes Python functions. Python programmers read functions while undergoing fMRI. A linear SVM decoded for-loops from if-conditionals based on activity in lateral temporal (LT) language cortex. In searchlight analysis, decoding accuracy was higher in LT language cortex than anywhere else. Follow up analysis showed that decoding was not driven by presence of different words across functions, "for" vs "if," but by compositional program properties. Finally, univariate responses to code peaked earlier in LT language-cortex than in the fronto-parietal network. We propose that the language system forms initial "surface meaning" representations of programs, which input to the reasoning network for processing of algorithms.
Collapse
Affiliation(s)
- Yun-Fei Liu
- Department of Psychological and Brain Sciences, Johns Hopkins Universtiy, 232 Ames Hall, 3400 N. Charles Street, Baltimore, MD 21218, USA.
| | - Colin Wilson
- Department of Cognitive Science, Johns Hopkins University, 237 Krieger Hall, 3400 N. Charles Street, Baltimore, MD 21218, USA
| | - Marina Bedny
- Department of Psychological and Brain Sciences, Johns Hopkins Universtiy, 232 Ames Hall, 3400 N. Charles Street, Baltimore, MD 21218, USA
| |
Collapse
|
7
|
Desbordes T, King JR, Dehaene S. Tracking the neural codes for words and phrases during semantic composition, working-memory storage, and retrieval. Cell Rep 2024; 43:113847. [PMID: 38412098 DOI: 10.1016/j.celrep.2024.113847] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2023] [Revised: 11/02/2023] [Accepted: 02/07/2024] [Indexed: 02/29/2024] Open
Abstract
The ability to compose successive words into a meaningful phrase is a characteristic feature of human cognition, yet its neural mechanisms remain incompletely understood. Here, we analyze the cortical mechanisms of semantic composition using magnetoencephalography (MEG) while participants read one-word, two-word, and five-word noun phrases and compared them with a subsequent image. Decoding of MEG signals revealed three processing stages. During phrase comprehension, the representation of individual words was sustained for a variable duration depending on phrasal context. During the delay period, the word code was replaced by a working-memory code whose activation increased with semantic complexity. Finally, the speed and accuracy of retrieval depended on semantic complexity and was faster for surface than for deep semantic properties. In conclusion, we propose that the brain initially encodes phrases using factorized dimensions for successive words but later compresses them in working memory and requires a period of decompression to access them.
Collapse
Affiliation(s)
- Théo Desbordes
- Meta AI, Paris, France; Cognitive Neuroimaging Unit, NeuroSpin Center, 91191 Gif-sur-Yvette, France.
| | - Jean-Rémi King
- Meta AI, Paris, France; École Normale Supérieure, PSL University, Paris, France
| | - Stanislas Dehaene
- Université Paris Saclay, INSERM, CEA, Cognitive Neuroimaging Unit, NeuroSpin Center, 91191 Gif-sur-Yvette, France; Collège de France, PSL University, Paris, France
| |
Collapse
|
8
|
Zhou Y, Feinman R, Lake BM. Compositional diversity in visual concept learning. Cognition 2024; 244:105711. [PMID: 38224649 DOI: 10.1016/j.cognition.2023.105711] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2023] [Revised: 12/20/2023] [Accepted: 12/21/2023] [Indexed: 01/17/2024]
Abstract
Humans leverage compositionality to efficiently learn new concepts, understanding how familiar parts can combine together to form novel objects. In contrast, popular computer vision models struggle to make the same types of inferences, requiring more data and generalizing less flexibly than people do. Here, we study these distinctively human abilities across a range of different types of visual composition, examining how people classify and generate "alien figures" with rich relational structure. We also develop a Bayesian program induction model which searches for the best programs for generating the candidate visual figures, utilizing a large program space containing different compositional mechanisms and abstractions. In few shot classification tasks, we find that people and the program induction model can make a range of meaningful compositional generalizations, with the model providing a strong account of the experimental data as well as interpretable parameters that reveal human assumptions about the factors invariant to category membership (here, to rotation and changing part attachment). In few shot generation tasks, both people and the models are able to construct compelling novel examples, with people behaving in additional structured ways beyond the model capabilities, e.g. making choices that complete a set or reconfigure existing parts in new ways. To capture these additional behavioral patterns, we develop an alternative model based on neuro-symbolic program induction: this model also composes new concepts from existing parts yet, distinctively, it utilizes neural network modules to capture residual statistical structure. Together, our behavioral and computational findings show how people and models can produce a variety of compositional behavior when classifying and generating visual objects.
Collapse
Affiliation(s)
- Yanli Zhou
- Center for Data Science, New York University, United States of America.
| | - Reuben Feinman
- Center for Neural Science, New York University, United States of America.
| | - Brenden M Lake
- Center for Data Science, New York University, United States of America; Department of Psychology, New York University, United States of America.
| |
Collapse
|
9
|
Zoefel B, Kösem A. Neural tracking of continuous acoustics: properties, speech-specificity and open questions. Eur J Neurosci 2024; 59:394-414. [PMID: 38151889 DOI: 10.1111/ejn.16221] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2023] [Revised: 11/17/2023] [Accepted: 11/22/2023] [Indexed: 12/29/2023]
Abstract
Human speech is a particularly relevant acoustic stimulus for our species, due to its role of information transmission during communication. Speech is inherently a dynamic signal, and a recent line of research focused on neural activity following the temporal structure of speech. We review findings that characterise neural dynamics in the processing of continuous acoustics and that allow us to compare these dynamics with temporal aspects in human speech. We highlight properties and constraints that both neural and speech dynamics have, suggesting that auditory neural systems are optimised to process human speech. We then discuss the speech-specificity of neural dynamics and their potential mechanistic origins and summarise open questions in the field.
Collapse
Affiliation(s)
- Benedikt Zoefel
- Centre de Recherche Cerveau et Cognition (CerCo), CNRS UMR 5549, Toulouse, France
- Université de Toulouse III Paul Sabatier, Toulouse, France
| | - Anne Kösem
- Lyon Neuroscience Research Center (CRNL), INSERM U1028, Bron, France
| |
Collapse
|
10
|
Zariņa L, Šķilters J. Combining and segmenting geometric shapes into parts depending on symmetry type: Evidence from children and adults. Iperception 2024; 15:20416695231226157. [PMID: 38268785 PMCID: PMC10807397 DOI: 10.1177/20416695231226157] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2023] [Accepted: 12/27/2023] [Indexed: 01/26/2024] Open
Abstract
Symmetry is an important geometric feature that affects object segmentation into parts, though De Winter and Wagemans note that partly occluded objects can still be identified by the remaining visible parts. In two sets of experiments with children (n = 31, age 7-11, M = 8.8, SD = 1.4) and adults (n = 19, age 17-57, M = 30.4, SD = 12.6), we used 13 basic geometric figures distinguished by symmetry types to test how they are naturally segmented or combined and what the developmental impacts are on the segmentation and combination. In the first experiment, participants were asked to cut figures into two along a straight line; in the second experiment, participants had to create five sets of connected two-figure combinations where overlapping figures were allowed. The results confirmed the importance of the symmetry axis in both tasks. Other relevant criteria were dividing into half, maximal/minimal curvature, and use of edges or corners for reference. This study allows comparisons of the impact of symmetry type on the segmentation and combining of geometric figures and indicates developmental differences between children and adults.
Collapse
Affiliation(s)
- Līga Zariņa
- Laboratory for Perceptual and Cognitive Systems at the Faculty of Computing, University of Latvia, Riga, Latvia
| | - Jurģis Šķilters
- Laboratory for Perceptual and Cognitive Systems at the Faculty of Computing, University of Latvia, Riga, Latvia
| |
Collapse
|
11
|
Fitch WT. Cellular computation and cognition. Front Comput Neurosci 2023; 17:1107876. [PMID: 38077750 PMCID: PMC10702520 DOI: 10.3389/fncom.2023.1107876] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2022] [Accepted: 10/09/2023] [Indexed: 05/28/2024] Open
Abstract
Contemporary neural network models often overlook a central biological fact about neural processing: that single neurons are themselves complex, semi-autonomous computing systems. Both the information processing and information storage abilities of actual biological neurons vastly exceed the simple weighted sum of synaptic inputs computed by the "units" in standard neural network models. Neurons are eukaryotic cells that store information not only in synapses, but also in their dendritic structure and connectivity, as well as genetic "marking" in the epigenome of each individual cell. Each neuron computes a complex nonlinear function of its inputs, roughly equivalent in processing capacity to an entire 1990s-era neural network model. Furthermore, individual cells provide the biological interface between gene expression, ongoing neural processing, and stored long-term memory traces. Neurons in all organisms have these properties, which are thus relevant to all of neuroscience and cognitive biology. Single-cell computation may also play a particular role in explaining some unusual features of human cognition. The recognition of the centrality of cellular computation to "natural computation" in brains, and of the constraints it imposes upon brain evolution, thus has important implications for the evolution of cognition, and how we study it.
Collapse
Affiliation(s)
- W. Tecumseh Fitch
- Faculty of Life Sciences and Vienna Cognitive Science Hub, University of Vienna, Vienna, Austria
| |
Collapse
|
12
|
Kirschhock ME, Nieder A. Association neurons in the crow telencephalon link visual signs to numerical values. Proc Natl Acad Sci U S A 2023; 120:e2313923120. [PMID: 37903264 PMCID: PMC10636302 DOI: 10.1073/pnas.2313923120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2023] [Accepted: 09/20/2023] [Indexed: 11/01/2023] Open
Abstract
Many animals can associate signs with numerical values and use these signs in a goal-directed way during task performance. However, the neuronal basis of this semantic association has only rarely been investigated, and so far only in primates. How mechanisms of number associations are implemented in the distinctly evolved brains of other animal taxa such as birds is currently unknown. Here, we explored this semantic number-sign mapping by recording single-neuron activity in the crows' nidopallium caudolaterale (NCL), a brain structure critically involved in avian numerical cognition. Crows were trained to associate visual shapes with varying numbers of items in a number production task. The responses of many NCL neurons during stimulus presentation reflected the numerical values associated with visual shapes in a behaviorally relevant way. Consistent with the crow's better behavioral performance with signs, neuronal representations of numerical values extracted from shapes were more selective compared to those from dot arrays. The existence of number association neurons in crows points to a phylogenetic preadaptation of the brains of cognitively advanced vertebrates to link visual shapes with numerical meaning.
Collapse
Affiliation(s)
- Maximilian E. Kirschhock
- Animal Physiology Unit, Institute of Neurobiology, University of Tübingen, Tübingen72076, Germany
| | - Andreas Nieder
- Animal Physiology Unit, Institute of Neurobiology, University of Tübingen, Tübingen72076, Germany
| |
Collapse
|
13
|
Matsumoto D, Nakai T. Syntactic theory of mathematical expressions. Cogn Psychol 2023; 146:101606. [PMID: 37748253 DOI: 10.1016/j.cogpsych.2023.101606] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2023] [Revised: 06/28/2023] [Accepted: 09/16/2023] [Indexed: 09/27/2023]
Abstract
Mathematical expressions consist of recursive combinations of numbers, variables, and operators. According to theoretical linguists, the syntactic mechanisms of natural language also provide a basis for mathematics. To date, however, no theoretically rigorous investigation has been conducted to support such arguments. Therefore, this study uses a methodology based on theoretical linguistics to analyze the syntactic properties of mathematical expressions. Through a review of recent behavioral and neuroimaging studies on mathematical syntax, we report several inconsistencies with theoretical linguistics, such as the use of ternary structures. To address these, we propose that a syntactic category called Applicative plays a central role in analyzing mathematical expressions with seemingly ternary structures by combining binary structures. Besides basic arithmetic expressions, we also examine algebraic equations and complex expressions such as integral and differential calculi. This study is the first attempt at building a comprehensive framework for analyzing the syntactic structures of mathematical expressions.
Collapse
Affiliation(s)
- Daiki Matsumoto
- Graduate School of Human and Environmental Studies, Kyoto University, Kyoto, Japan; Department of Humanities, Kanazawa Seiryo University, Kanazawa, Japan
| | - Tomoya Nakai
- Lyon Neuroscience Research Center (CRNL), (INSERM/CNRS/University of Lyon), Bron, France; Araya Inc., Tokyo, Japan; Advanced Comprehensive Research Organization, Teikyo University, Tokyo, Japan.
| |
Collapse
|
14
|
Al Roumi F, Planton S, Wang L, Dehaene S. Brain-imaging evidence for compression of binary sound sequences in human memory. eLife 2023; 12:e84376. [PMID: 37910588 PMCID: PMC10619979 DOI: 10.7554/elife.84376] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2022] [Accepted: 10/14/2023] [Indexed: 11/03/2023] Open
Abstract
According to the language-of-thought hypothesis, regular sequences are compressed in human memory using recursive loops akin to a mental program that predicts future items. We tested this theory by probing memory for 16-item sequences made of two sounds. We recorded brain activity with functional MRI and magneto-encephalography (MEG) while participants listened to a hierarchy of sequences of variable complexity, whose minimal description required transition probabilities, chunking, or nested structures. Occasional deviant sounds probed the participants' knowledge of the sequence. We predicted that task difficulty and brain activity would be proportional to the complexity derived from the minimal description length in our formal language. Furthermore, activity should increase with complexity for learned sequences, and decrease with complexity for deviants. These predictions were upheld in both fMRI and MEG, indicating that sequence predictions are highly dependent on sequence structure and become weaker and delayed as complexity increases. The proposed language recruited bilateral superior temporal, precentral, anterior intraparietal, and cerebellar cortices. These regions overlapped extensively with a localizer for mathematical calculation, and much less with spoken or written language processing. We propose that these areas collectively encode regular sequences as repetitions with variations and their recursive composition into nested structures.
Collapse
Affiliation(s)
- Fosca Al Roumi
- Cognitive Neuroimaging Unit, Université Paris-Saclay, INSERM, CEA, CNRS, NeuroSpin centerGif/YvetteFrance
| | - Samuel Planton
- Cognitive Neuroimaging Unit, Université Paris-Saclay, INSERM, CEA, CNRS, NeuroSpin centerGif/YvetteFrance
| | - Liping Wang
- Institute of Neuroscience, Key Laboratory of Primate Neurobiology, CAS Center for Excellence in Brain Science and Intelligence Technology, Chinese Academy of SciencesShanghaiChina
| | - Stanislas Dehaene
- Cognitive Neuroimaging Unit, Université Paris-Saclay, INSERM, CEA, CNRS, NeuroSpin centerGif/YvetteFrance
- Collège de France, Université Paris Sciences Lettres (PSL)ParisFrance
| |
Collapse
|
15
|
Bramley NR, Zhao B, Quillien T, Lucas CG. Local Search and the Evolution of World Models. Top Cogn Sci 2023. [PMID: 37850714 DOI: 10.1111/tops.12703] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2023] [Revised: 10/02/2023] [Accepted: 10/03/2023] [Indexed: 10/19/2023]
Abstract
An open question regarding how people develop their models of the world is how new candidates are generated for consideration out of infinitely many possibilities. We discuss the role that evolutionary mechanisms play in this process. Specifically, we argue that when it comes to developing a global world model, innovation is necessarily incremental, involving the generation and selection among random local mutations and recombinations of (parts of) one's current model. We argue that, by narrowing and guiding exploration, this feature of cognitive search is what allows human learners to discover better theories, without ever grappling directly with the problem of finding a "global optimum," or best possible world model. We suggest this aspect of cognitive processing works analogously to how blind variation and selection mechanisms drive biological evolution. We propose algorithms developed for program synthesis provide candidate mechanisms for how human minds might achieve this. We discuss objections and implications of this perspective, finally suggesting that a better process-level understanding of how humans incrementally explore compositional theory spaces can shed light on how we think, and provide explanatory traction on fundamental cognitive biases, including anchoring, probability matching, and confirmation bias.
Collapse
Affiliation(s)
| | - Bonan Zhao
- Department of Psychology, University of Edinburgh
| | - Tadeg Quillien
- Institute of Language, Cognition & Computation, Informatics University of Edinburgh
| | - Christopher G Lucas
- Institute of Language, Cognition & Computation, Informatics University of Edinburgh
| |
Collapse
|
16
|
Moss ME, Zhang M, Mayr U. The effect of abstract inter-chunk relationships on serial-order control. Cognition 2023; 239:105578. [PMID: 37541029 DOI: 10.1016/j.cognition.2023.105578] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2022] [Revised: 06/05/2023] [Accepted: 07/25/2023] [Indexed: 08/06/2023]
Abstract
Hierarchical control is often thought to dissect a complex task space into isolated subspaces in order to eliminate interference. Yet, there is also evidence from serial-order control tasks that our cognitive system can make use of abstract relationships between different parts (chunks) of a sequence. Past evidence in this regard was limited to situations with ordered stimuli (e.g., numbers or positions) that may have aided the detection of relationships and allowed gradual learning and hypothesis testing. Therefore, we used a modified task-span paradigm (with no ordered elements between tasks) in which participants performed memorized sequences of tasks that were encoded in terms of separate chunks of three tasks each. To allow examination of learning effects, each sequence was "cycled" through repeatedly. Importantly, we compared sequences whose chunks were governed by a common, abstract grammar with sequences whose chunks were governed by different grammars. Experiment 1 examined the effect of relationships between shared-element chunks (e.g., ABB-BAA vs. ABB-BAB), Experiment 2 and 3 between different-element chunks (e.g., ABA-CDC vs. ABA-CCD), and Experiment 4 examined second-order relationships (e.g., ABA-ABB--CDC-CDD vs. ABA-ABB--CDC-CCD). Robust evidence in favor of beneficial effects of abstract inter-chunk relationships was obtained across all four experiments. Importantly, these effects were at least as strong in initial cycles of performing a given sequence as during later cycles, suggesting that the cognitive system operates with an "expectation of abstract relationships," rather than benefitting from them through gradual learning. We discuss the implications of these results for models of hierarchical control.
Collapse
Affiliation(s)
| | - Min Zhang
- University of Oregon, United States of America
| | - Ulrich Mayr
- University of Oregon, United States of America.
| |
Collapse
|
17
|
Canudas-Grabolosa I, Martín-Salguero A, Bonatti LL. Natural logic and baby LoTH. Behav Brain Sci 2023; 46:e266. [PMID: 37766633 DOI: 10.1017/s0140525x23001942] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/29/2023]
Abstract
Language-of-thought hypothesis (LoTH) is having a profound impact on cognition studies. However, much remains unknown about its basic primitives and generative operations. Infant studies are fundamental, but methodologically very challenging. By distilling potential primitives from work in natural-language semantics, an approach beyond the corset of standard formal logic may be undertaken. Still, the road ahead is challenging and long.
Collapse
Affiliation(s)
| | - Ana Martín-Salguero
- Center for Brain and Cognition, Universitat Pompeu Fabra, Barcelona, Spain
- Cognitive Neuroimaging Unit, CEA, INSERM, Université Paris-Saclay, NeuroSpin Center, Gif/Yvette, France
| | - Luca L Bonatti
- Center for Brain and Cognition, Universitat Pompeu Fabra, Barcelona, Spain
- ICREA, Barcelona, Spain
| |
Collapse
|
18
|
Demetriou A. Developmental and multiple languages-of-thought. Behav Brain Sci 2023; 46:e273. [PMID: 37766605 DOI: 10.1017/s0140525x23002005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/29/2023]
Abstract
We agree with the target article that assuming language-of-thought (LoT) is useful for the development of cognitive and developmental theories. We note that the target article is weak in its assumptions about development of LoT and possible existence of multiple LoTs. In response to these weaknesses, we outline several developmental principles for LoT development, showing how a developmental theory of LoT springs from probabilistic LoT. We suggest a system 1.5 of reasoning allowing interchange between Bayesian and logical rules as it fits purposes or domain.
Collapse
Affiliation(s)
- Andreas Demetriou
- Cyprus Academy of Sciences, Letters, and Arts, Nicosia, Cyprus
- University of Cyprus, Nicosia, Cyprus
- University of Nicosia, Nicosia, Cyprus
| |
Collapse
|
19
|
Benson I, Marriott N, McCandliss BD. Interventions to improve equational reasoning: replication and extension of the Cuisenaire-Gattegno curriculum effect. Front Psychol 2023; 14:1116555. [PMID: 37736155 PMCID: PMC10509469 DOI: 10.3389/fpsyg.2023.1116555] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2022] [Accepted: 08/07/2023] [Indexed: 09/23/2023] Open
Abstract
Introduction The ability to reason about equations in a robust and fluent way requires both instrumental knowledge of symbolic forms, syntax, and operations, as well as relational knowledge of how such formalisms map to meaningful relationships captured within mental models. A recent systematic review of studies contrasting the Cuisenaire-Gattegno (Cui) curriculum approach vs. traditional rote schooling on equational reasoning has demonstrated the positive efficacy of pedagogies that focus on integrating these two forms of knowledge. Methods Here we seek to replicate and extend the most efficacious of these studies (Brownell) by implementing the curriculum to a high degree of fidelity, as well as capturing longitudinal changes within learners via a novel tablet-based assessment of accuracy and fluency with equational reasoning. We examined arithmetic fluency as a function of relational reasoning to equate initial performance across diverse groups and to track changes over four growth assessment points. Results Results showed that the intervention condition that stressed relational reasoning leads to advances in fluency for addition and subtraction with small numbers. We also showed that this intervention leads to changes in problem solving dispositions toward complex challenges, wherein students in the CUI intervention were more inclined to solve challenging problems relative to those in the control who gave up significantly earlier on multi-step problems. This shift in disposition was associated with higher accuracy on complex equational reasoning problems. A treatment by aptitude interaction emerged for both arithmetic equation reasoning and complex multi-step equational reasoning problems, both of which showed that the intervention had greatest impact for children with lower initial mathematical aptitude. Two years of intervention contrast revealed a large effect (d = 1) for improvements in equational reasoning for the experimental (CUI) group relative to control. Discussion The strong replication and extension findings substantiate the importance of embedding these teaching aides within the theory grounded curricula that gave rise to them.
Collapse
Affiliation(s)
- Ian Benson
- School of Education, University of Roehampton, London, United Kingdom
- Ian Benson and Partners Ltd., London, United Kingdom
| | - Nigel Marriott
- Marriott Statistical Consulting Ltd., Bath, United Kingdom
| | - Bruce D. McCandliss
- Graduate School of Education, Stanford University, Stanford, CA, United States
| |
Collapse
|
20
|
McCarty MJ, Murphy E, Scherschligt X, Woolnough O, Morse CW, Snyder K, Mahon BZ, Tandon N. Intraoperative cortical localization of music and language reveals signatures of structural complexity in posterior temporal cortex. iScience 2023; 26:107223. [PMID: 37485361 PMCID: PMC10362292 DOI: 10.1016/j.isci.2023.107223] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2022] [Revised: 06/01/2023] [Accepted: 06/22/2023] [Indexed: 07/25/2023] Open
Abstract
Language and music involve the productive combination of basic units into structures. It remains unclear whether brain regions sensitive to linguistic and musical structure are co-localized. We report an intraoperative awake craniotomy in which a left-hemispheric language-dominant professional musician underwent cortical stimulation mapping (CSM) and electrocorticography of music and language perception and production during repetition tasks. Musical sequences were melodic or amelodic, and differed in algorithmic compressibility (Lempel-Ziv complexity). Auditory recordings of sentences differed in syntactic complexity (single vs. multiple phrasal embeddings). CSM of posterior superior temporal gyrus (pSTG) disrupted music perception and production, along with speech production. pSTG and posterior middle temporal gyrus (pMTG) activated for language and music (broadband gamma; 70-150 Hz). pMTG activity was modulated by musical complexity, while pSTG activity was modulated by syntactic complexity. This points to shared resources for music and language comprehension, but distinct neural signatures for the processing of domain-specific structural features.
Collapse
Affiliation(s)
- Meredith J. McCarty
- Vivian L. Smith Department of Neurosurgery, McGovern Medical School, University of Texas Health Science Center at Houston, Houston, TX 77030, USA
- Texas Institute for Restorative Neurotechnologies, University of Texas Health Science Center at Houston, Houston, TX 77030, USA
| | - Elliot Murphy
- Vivian L. Smith Department of Neurosurgery, McGovern Medical School, University of Texas Health Science Center at Houston, Houston, TX 77030, USA
- Texas Institute for Restorative Neurotechnologies, University of Texas Health Science Center at Houston, Houston, TX 77030, USA
| | - Xavier Scherschligt
- Vivian L. Smith Department of Neurosurgery, McGovern Medical School, University of Texas Health Science Center at Houston, Houston, TX 77030, USA
- Texas Institute for Restorative Neurotechnologies, University of Texas Health Science Center at Houston, Houston, TX 77030, USA
| | - Oscar Woolnough
- Vivian L. Smith Department of Neurosurgery, McGovern Medical School, University of Texas Health Science Center at Houston, Houston, TX 77030, USA
- Texas Institute for Restorative Neurotechnologies, University of Texas Health Science Center at Houston, Houston, TX 77030, USA
| | - Cale W. Morse
- Vivian L. Smith Department of Neurosurgery, McGovern Medical School, University of Texas Health Science Center at Houston, Houston, TX 77030, USA
- Texas Institute for Restorative Neurotechnologies, University of Texas Health Science Center at Houston, Houston, TX 77030, USA
| | - Kathryn Snyder
- Vivian L. Smith Department of Neurosurgery, McGovern Medical School, University of Texas Health Science Center at Houston, Houston, TX 77030, USA
- Texas Institute for Restorative Neurotechnologies, University of Texas Health Science Center at Houston, Houston, TX 77030, USA
| | - Bradford Z. Mahon
- Department of Psychology, Carnegie Mellon University, Pittsburgh, PA 15213, USA
- Neuroscience Institute, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| | - Nitin Tandon
- Vivian L. Smith Department of Neurosurgery, McGovern Medical School, University of Texas Health Science Center at Houston, Houston, TX 77030, USA
- Texas Institute for Restorative Neurotechnologies, University of Texas Health Science Center at Houston, Houston, TX 77030, USA
- Memorial Hermann Hospital, Texas Medical Center, Houston, TX 77030, USA
| |
Collapse
|
21
|
Benítez-Burraco A, Nikolsky A. The (Co)Evolution of Language and Music Under Human Self-Domestication. HUMAN NATURE (HAWTHORNE, N.Y.) 2023; 34:229-275. [PMID: 37097428 PMCID: PMC10354115 DOI: 10.1007/s12110-023-09447-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Accepted: 03/27/2023] [Indexed: 04/26/2023]
Abstract
Together with language, music is perhaps the most distinctive behavioral trait of the human species. Different hypotheses have been proposed to explain why only humans perform music and how this ability might have evolved in our species. In this paper, we advance a new model of music evolution that builds on the self-domestication view of human evolution, according to which the human phenotype is, at least in part, the outcome of a process similar to domestication in other mammals, triggered by the reduction in reactive aggression responses to environmental changes. We specifically argue that self-domestication can account for some of the cognitive changes, and particularly for the behaviors conducive to the complexification of music through a cultural mechanism. We hypothesize four stages in the evolution of music under self-domestication forces: (1) collective protomusic; (2) private, timbre-oriented music; (3) small-group, pitch-oriented music; and (4) collective, tonally organized music. This line of development encompasses the worldwide diversity of music types and genres and parallels what has been hypothesized for languages. Overall, music diversity might have emerged in a gradual fashion under the effects of the enhanced cultural niche construction as shaped by the progressive decrease in reactive (i.e., impulsive, triggered by fear or anger) aggression and the increase in proactive (i.e., premeditated, goal-directed) aggression.
Collapse
Affiliation(s)
- Antonio Benítez-Burraco
- Department of Spanish Language, Linguistics and Literary Theory (Linguistics), Faculty of Philology, University of Seville, Seville, Spain.
- Departamento de Lengua Española, Facultad de Filología, Área de Lingüística General, Lingüística y Teoría de la Literatura, Universidad de Sevilla, C/ Palos de la Frontera s/n, Sevilla, 41007, España.
| | | |
Collapse
|
22
|
Poli F, Ghilardi T, Mars RB, Hinne M, Hunnius S. Eight-Month-Old Infants Meta-Learn by Downweighting Irrelevant Evidence. Open Mind (Camb) 2023; 7:141-155. [PMID: 37416070 PMCID: PMC10320826 DOI: 10.1162/opmi_a_00079] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2022] [Accepted: 04/06/2023] [Indexed: 07/08/2023] Open
Abstract
Infants learn to navigate the complexity of the physical and social world at an outstanding pace, but how they accomplish this learning is still largely unknown. Recent advances in human and artificial intelligence research propose that a key feature to achieving quick and efficient learning is meta-learning, the ability to make use of prior experiences to learn how to learn better in the future. Here we show that 8-month-old infants successfully engage in meta-learning within very short timespans after being exposed to a new learning environment. We developed a Bayesian model that captures how infants attribute informativity to incoming events, and how this process is optimized by the meta-parameters of their hierarchical models over the task structure. We fitted the model with infants' gaze behavior during a learning task. Our results reveal how infants actively use past experiences to generate new inductive biases that allow future learning to proceed faster.
Collapse
Affiliation(s)
- Francesco Poli
- Donders Center for Cognition, Radboud University Nijmegen, Nijmegen, The Netherlands
| | - Tommaso Ghilardi
- Donders Center for Cognition, Radboud University Nijmegen, Nijmegen, The Netherlands
| | - Rogier B. Mars
- Donders Center for Cognition, Radboud University Nijmegen, Nijmegen, The Netherlands
- Nuffield Department of Clinical Neurosciences, Wellcome Centre for Integrative Neuroimaging, FMRIB, University of Oxford, John Radcliffe Hospital, Headington, Oxford, UK
| | - Max Hinne
- Donders Center for Cognition, Radboud University Nijmegen, Nijmegen, The Netherlands
| | - Sabine Hunnius
- Donders Center for Cognition, Radboud University Nijmegen, Nijmegen, The Netherlands
| |
Collapse
|
23
|
Reiner A. Could theropod dinosaurs have evolved to a human level of intelligence? J Comp Neurol 2023; 531:975-1006. [PMID: 37029483 PMCID: PMC10106414 DOI: 10.1002/cne.25458] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2022] [Revised: 01/05/2023] [Accepted: 01/11/2023] [Indexed: 04/09/2023]
Abstract
Noting that some theropod dinosaurs had large brains, large grasping hands, and likely binocular vision, paleontologist Dale Russell suggested that a branch of these dinosaurs might have evolved to a human intelligence level, had dinosaurs not become extinct. I offer reasons why the likely pallial organization in dinosaurs would have made this improbable, based on four assumptions. First, it is assumed that achieving human intelligence requires evolving an equivalent of the about 200 functionally specialized cortical areas characteristic of humans. Second, it is assumed that dinosaurs had an avian nuclear type of pallial organization, in contrast to the mammalian cortical organization. Third, it is assumed that the interactions between the different neuron types making up an information processing unit within pallium are critical to its role in analyzing information. Finally, it is assumed that increasing axonal length between the neuron sets carrying out this operation impairs its efficacy. Based on these assumptions, I present two main reasons why dinosaur pallium might have been unable to add the equivalent of 200 efficiently functioning cortical areas. First, a nuclear pattern of pallial organization would require increasing distances between the neuron groups corresponding to the separate layers of any given mammalian cortical area, as more sets of nuclei equivalent to a cortical area are interposed between the existing sets, increasing axon length and thereby impairing processing efficiency. Second, because of its nuclear organization, dinosaur pallium could not reduce axon length by folding to bring adjacent areas closer together, as occurs in cerebral cortex.
Collapse
Affiliation(s)
- Anton Reiner
- Department of Anatomy and Neurobiology, University of Tennessee Health Science Center, Memphis, Tennessee, USA
| |
Collapse
|
24
|
Woolnough O, Donos C, Murphy E, Rollo PS, Roccaforte ZJ, Dehaene S, Tandon N. Spatiotemporally distributed frontotemporal networks for sentence reading. Proc Natl Acad Sci U S A 2023; 120:e2300252120. [PMID: 37068244 PMCID: PMC10151604 DOI: 10.1073/pnas.2300252120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2023] [Accepted: 03/14/2023] [Indexed: 04/19/2023] Open
Abstract
Reading a sentence entails integrating the meanings of individual words to infer more complex, higher-order meaning. This highly rapid and complex human behavior is known to engage the inferior frontal gyrus (IFG) and middle temporal gyrus (MTG) in the language-dominant hemisphere, yet whether there are distinct contributions of these regions to sentence reading is still unclear. To probe these neural spatiotemporal dynamics, we used direct intracranial recordings to measure neural activity while reading sentences, meaning-deficient Jabberwocky sentences, and lists of words or pseudowords. We isolated two functionally and spatiotemporally distinct frontotemporal networks, each sensitive to distinct aspects of word and sentence composition. The first distributed network engages the IFG and MTG, with IFG activity preceding MTG. Activity in this network ramps up over the duration of a sentence and is reduced or absent during Jabberwocky and word lists, implying its role in the derivation of sentence-level meaning. The second network engages the superior temporal gyrus and the IFG, with temporal responses leading those in frontal lobe, and shows greater activation for each word in a list than those in sentences, suggesting that sentential context enables greater efficiency in the lexical and/or phonological processing of individual words. These adjacent, yet spatiotemporally dissociable neural mechanisms for word- and sentence-level processes shed light on the richly layered semantic networks that enable us to fluently read. These results imply distributed, dynamic computation across the frontotemporal language network rather than a clear dichotomy between the contributions of frontal and temporal structures.
Collapse
Affiliation(s)
- Oscar Woolnough
- Vivian L. Smith Department of Neurosurgery, McGovern Medical School at UT Health Houston, Houston, TX77030
- Texas Institute for Restorative Neurotechnologies, University of Texas Health Science Center at Houston, Houston, TX77030
| | - Cristian Donos
- Vivian L. Smith Department of Neurosurgery, McGovern Medical School at UT Health Houston, Houston, TX77030
- Faculty of Physics, University of Bucharest, 050663Bucharest, Romania
| | - Elliot Murphy
- Vivian L. Smith Department of Neurosurgery, McGovern Medical School at UT Health Houston, Houston, TX77030
- Texas Institute for Restorative Neurotechnologies, University of Texas Health Science Center at Houston, Houston, TX77030
| | - Patrick S. Rollo
- Vivian L. Smith Department of Neurosurgery, McGovern Medical School at UT Health Houston, Houston, TX77030
- Texas Institute for Restorative Neurotechnologies, University of Texas Health Science Center at Houston, Houston, TX77030
| | - Zachary J. Roccaforte
- Vivian L. Smith Department of Neurosurgery, McGovern Medical School at UT Health Houston, Houston, TX77030
- Texas Institute for Restorative Neurotechnologies, University of Texas Health Science Center at Houston, Houston, TX77030
| | - Stanislas Dehaene
- Cognitive Neuroimaging Unit, Université Paris-Saclay, INSERM, CEA, NeuroSpin Center, 91191Gif-sur-Yvette, France
- Collège de France, 75005Paris, France
| | - Nitin Tandon
- Vivian L. Smith Department of Neurosurgery, McGovern Medical School at UT Health Houston, Houston, TX77030
- Texas Institute for Restorative Neurotechnologies, University of Texas Health Science Center at Houston, Houston, TX77030
- Memorial Hermann Hospital, Texas Medical Center, Houston, TX77030
| |
Collapse
|
25
|
Dedhe AM, Piantadosi ST, Cantlon JF. Cognitive Mechanisms Underlying Recursive Pattern Processing in Human Adults. Cogn Sci 2023; 47:e13273. [PMID: 37051878 PMCID: PMC11097651 DOI: 10.1111/cogs.13273] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2022] [Revised: 02/24/2023] [Accepted: 03/05/2023] [Indexed: 04/14/2023]
Abstract
The capacity to generate recursive sequences is a marker of rich, algorithmic cognition, and perhaps unique to humans. Yet, the precise processes driving recursive sequence generation remain mysterious. We investigated three potential cognitive mechanisms underlying recursive pattern processing: hierarchical reasoning, ordinal reasoning, and associative chaining. We developed a Bayesian mixture model to quantify the extent to which these three cognitive mechanisms contribute to adult humans' performance in a sequence generation task. We further tested whether recursive rule discovery depends upon relational information, either perceptual or semantic. We found that the presence of relational information facilitates hierarchical reasoning and drives the generation of recursive sequences across novel depths of center embedding. In the absence of relational information, the use of ordinal reasoning predominates. Our results suggest that hierarchical reasoning is an important cognitive mechanism underlying recursive pattern processing and can be deployed across embedding depths and relational domains.
Collapse
Affiliation(s)
- Abhishek M Dedhe
- Department of Psychology, Carnegie Mellon University
- Center for the Neural Basis of Cognition, Carnegie Mellon University
| | | | - Jessica F Cantlon
- Department of Psychology, Carnegie Mellon University
- Center for the Neural Basis of Cognition, Carnegie Mellon University
| |
Collapse
|
26
|
Flesch T, Saxe A, Summerfield C. Continual task learning in natural and artificial agents. Trends Neurosci 2023; 46:199-210. [PMID: 36682991 PMCID: PMC10914671 DOI: 10.1016/j.tins.2022.12.006] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2022] [Revised: 12/07/2022] [Accepted: 12/15/2022] [Indexed: 01/21/2023]
Abstract
How do humans and other animals learn new tasks? A wave of brain recording studies has investigated how neural representations change during task learning, with a focus on how tasks can be acquired and coded in ways that minimise mutual interference. We review recent work that has explored the geometry and dimensionality of neural task representations in neocortex, and computational models that have exploited these findings to understand how the brain may partition knowledge between tasks. We discuss how ideas from machine learning, including those that combine supervised and unsupervised learning, are helping neuroscientists understand how natural tasks are learned and coded in biological brains.
Collapse
Affiliation(s)
- Timo Flesch
- Department of Experimental Psychology, University of Oxford, Oxford, UK
| | - Andrew Saxe
- Gatsby Computational Neuroscience Unit & Sainsbury Wellcome Centre, UCL, London, UK.
| | | |
Collapse
|
27
|
Kurth-Nelson Z, Behrens T, Wayne G, Miller K, Luettgau L, Dolan R, Liu Y, Schwartenbeck P. Replay and compositional computation. Neuron 2023; 111:454-469. [PMID: 36640765 DOI: 10.1016/j.neuron.2022.12.028] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2022] [Revised: 08/11/2022] [Accepted: 12/18/2022] [Indexed: 01/15/2023]
Abstract
Replay in the brain has been viewed as rehearsal or, more recently, as sampling from a transition model. Here, we propose a new hypothesis: that replay is able to implement a form of compositional computation where entities are assembled into relationally bound structures to derive qualitatively new knowledge. This idea builds on recent advances in neuroscience, which indicate that the hippocampus flexibly binds objects to generalizable roles and that replay strings these role-bound objects into compound statements. We suggest experiments to test our hypothesis, and we end by noting the implications for AI systems which lack the human ability to radically generalize past experience to solve new problems.
Collapse
Affiliation(s)
- Zeb Kurth-Nelson
- DeepMind, London, UK; Max Planck UCL Centre for Computational Psychiatry and Ageing Research, London, UK.
| | - Timothy Behrens
- Wellcome Centre for Human Neuroimaging, University College London, London, UK; Wellcome Centre for Integrative Neuroimaging, University of Oxford, Oxford, UK
| | | | - Kevin Miller
- DeepMind, London, UK; Institute of Ophthalmology, University College London, London, UK
| | - Lennart Luettgau
- Max Planck UCL Centre for Computational Psychiatry and Ageing Research, London, UK
| | - Ray Dolan
- Max Planck UCL Centre for Computational Psychiatry and Ageing Research, London, UK; Wellcome Centre for Human Neuroimaging, University College London, London, UK
| | - Yunzhe Liu
- State Key Laboratory of Cognitive Neuroscience and Learning, IDG/McGovern Institute for Brain Research, Beijing Normal University, Beijing, China; Chinese Institute for Brain Research, Beijing, China
| | - Philipp Schwartenbeck
- Max Planck Institute for Biological Cybernetics, Tubingen, Germany; University of Tubingen, Tubingen, Germany
| |
Collapse
|
28
|
Dedhe AM, Clatterbuck H, Piantadosi ST, Cantlon JF. Origins of Hierarchical Logical Reasoning. Cogn Sci 2023; 47:e13250. [PMID: 36739520 PMCID: PMC11057913 DOI: 10.1111/cogs.13250] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2022] [Revised: 12/21/2022] [Accepted: 01/06/2023] [Indexed: 02/06/2023]
Abstract
Hierarchical cognitive mechanisms underlie sophisticated behaviors, including language, music, mathematics, tool-use, and theory of mind. The origins of hierarchical logical reasoning have long been, and continue to be, an important puzzle for cognitive science. Prior approaches to hierarchical logical reasoning have often failed to distinguish between observable hierarchical behavior and unobservable hierarchical cognitive mechanisms. Furthermore, past research has been largely methodologically restricted to passive recognition tasks as compared to active generation tasks that are stronger tests of hierarchical rules. We argue that it is necessary to implement learning studies in humans, non-human species, and machines that are analyzed with formal models comparing the contribution of different cognitive mechanisms implicated in the generation of hierarchical behavior. These studies are critical to advance theories in the domains of recursion, rule-learning, symbolic reasoning, and the potentially uniquely human cognitive origins of hierarchical logical reasoning.
Collapse
Affiliation(s)
- Abhishek M. Dedhe
- Department of Psychology, Carnegie Mellon University
- Center for the Neural Basis of Cognition, Carnegie Mellon University
| | | | | | - Jessica F. Cantlon
- Department of Psychology, Carnegie Mellon University
- Center for the Neural Basis of Cognition, Carnegie Mellon University
| |
Collapse
|
29
|
Sablé-Meyer M, Ellis K, Tenenbaum J, Dehaene S. A language of thought for the mental representation of geometric shapes. Cogn Psychol 2022; 139:101527. [PMID: 36403385 DOI: 10.1016/j.cogpsych.2022.101527] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2021] [Revised: 10/26/2022] [Accepted: 10/31/2022] [Indexed: 11/18/2022]
Abstract
In various cultures and at all spatial scales, humans produce a rich complexity of geometric shapes such as lines, circles or spirals. Here, we propose that humans possess a language of thought for geometric shapes that can produce line drawings as recursive combinations of a minimal set of geometric primitives. We present a programming language, similar to Logo, that combines discrete numbers and continuous integration to form higher-level structures based on repetition, concatenation and embedding, and we show that the simplest programs in this language generate the fundamental geometric shapes observed in human cultures. On the perceptual side, we propose that shape perception in humans involves searching for the shortest program that correctly draws the image (program induction). A consequence of this framework is that the mental difficulty of remembering a shape should depend on its minimum description length (MDL) in the proposed language. In two experiments, we show that encoding and processing of geometric shapes is well predicted by MDL. Furthermore, our hypotheses predict additive laws for the psychological complexity of repeated, concatenated or embedded shapes, which we confirm experimentally.
Collapse
Affiliation(s)
- Mathias Sablé-Meyer
- Unicog, CEA, INSERM, Université Paris-Saclay, NeuroSpin Center, 91191 Gif/Yvette, France; Collège de France, Université Paris-Sciences-Lettres (PSL), 75005 Paris, France.
| | - Kevin Ellis
- Cornell University, Ithaca, NY, United States
| | - Josh Tenenbaum
- Massachusetts Institute of Technology, Cambridge, MA, United States
| | - Stanislas Dehaene
- Unicog, CEA, INSERM, Université Paris-Saclay, NeuroSpin Center, 91191 Gif/Yvette, France; Collège de France, Université Paris-Sciences-Lettres (PSL), 75005 Paris, France
| |
Collapse
|
30
|
Mandelbaum E, Dunham Y, Feiman R, Firestone C, Green EJ, Harris D, Kibbe MM, Kurdi B, Mylopoulos M, Shepherd J, Wellwood A, Porot N, Quilty-Dunn J. Problems and Mysteries of the Many Languages of Thought. Cogn Sci 2022; 46:e13225. [PMID: 36537721 DOI: 10.1111/cogs.13225] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2022] [Revised: 11/15/2022] [Accepted: 11/21/2022] [Indexed: 12/24/2022]
Abstract
"What is the structure of thought?" is as central a question as any in cognitive science. A classic answer to this question has appealed to a Language of Thought (LoT). We point to emerging research from disparate branches of the field that supports the LoT hypothesis, but also uncovers diversity in LoTs across cognitive systems, stages of development, and species. Our letter formulates open research questions for cognitive science concerning the varieties of rules and representations that underwrite various LoT-based systems and how these variations can help researchers taxonomize cognitive systems.
Collapse
Affiliation(s)
- Eric Mandelbaum
- Department of Philosophy, Baruch College.,Departments of Philosophy & Psychology, CUNY Graduate Center
| | | | - Roman Feiman
- Department of Cognitive, Linguistic, and Psychological Sciences, Brown University
| | - Chaz Firestone
- Department of Psychological and Brain Sciences, Johns Hopkins University
| | - E J Green
- Department of Linguistics and Philosophy, Massachusetts Institute of Technology
| | - Daniel Harris
- Department of Philosophy, Hunter College & CUNY Graduate Center
| | - Melissa M Kibbe
- Department of Psychological & Brain Sciences, Boston University
| | | | - Myrto Mylopoulos
- Departments of Philosophy and Cognitive Science, Carleton University
| | - Joshua Shepherd
- Department of Philosophy, Carleton College.,Department of Philosophy, University of Barcelona
| | | | - Nicolas Porot
- Africa Institute for Research in Economics and Social Sciences, Mohammed VI Polytechnic University
| | - Jake Quilty-Dunn
- Department of Philosophy & Philosophy-Neuroscience-Psychology, Washington University in St Louis
| |
Collapse
|
31
|
Liao DA, Brecht KF, Johnston M, Nieder A. Recursive sequence generation in crows. SCIENCE ADVANCES 2022; 8:eabq3356. [PMID: 36322648 PMCID: PMC9629703 DOI: 10.1126/sciadv.abq3356] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/01/2022] [Accepted: 09/13/2022] [Indexed: 05/16/2023]
Abstract
Recursion, the process of embedding structures within similar structures, is often considered a foundation of symbolic competence and a uniquely human capability. To understand its evolution, we can study the recursive aptitudes of nonhuman animals. We adopted the behavioral protocol of a recent study demonstrating that humans and nonhuman primates grasp recursion. We presented sequences of bracket pair stimuli (e.g., [ ] and { }) to crows who were instructed to peck at training lists. They were then tested on their ability to transfer center-embedded structure to never-before-seen pairings of brackets. We reveal that crows have recursive capacities; they perform on par with children and even outperform macaques. The crows continued to produce recursive sequences after extending to longer and thus deeper embeddings. These results demonstrate that recursive capabilities are not limited to the primate genealogy and may have occurred separately from or before human symbolic competence in different animal taxa.
Collapse
|
32
|
Safron A, Çatal O, Verbelen T. Generalized Simultaneous Localization and Mapping (G-SLAM) as unification framework for natural and artificial intelligences: towards reverse engineering the hippocampal/entorhinal system and principles of high-level cognition. Front Syst Neurosci 2022; 16:787659. [PMID: 36246500 PMCID: PMC9563348 DOI: 10.3389/fnsys.2022.787659] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2021] [Accepted: 09/02/2022] [Indexed: 11/24/2022] Open
Abstract
Simultaneous localization and mapping (SLAM) represents a fundamental problem for autonomous embodied systems, for which the hippocampal/entorhinal system (H/E-S) has been optimized over the course of evolution. We have developed a biologically-inspired SLAM architecture based on latent variable generative modeling within the Free Energy Principle and Active Inference (FEP-AI) framework, which affords flexible navigation and planning in mobile robots. We have primarily focused on attempting to reverse engineer H/E-S "design" properties, but here we consider ways in which SLAM principles from robotics may help us better understand nervous systems and emergent minds. After reviewing LatentSLAM and notable features of this control architecture, we consider how the H/E-S may realize these functional properties not only for physical navigation, but also with respect to high-level cognition understood as generalized simultaneous localization and mapping (G-SLAM). We focus on loop-closure, graph-relaxation, and node duplication as particularly impactful architectural features, suggesting these computational phenomena may contribute to understanding cognitive insight (as proto-causal-inference), accommodation (as integration into existing schemas), and assimilation (as category formation). All these operations can similarly be describable in terms of structure/category learning on multiple levels of abstraction. However, here we adopt an ecological rationality perspective, framing H/E-S functions as orchestrating SLAM processes within both concrete and abstract hypothesis spaces. In this navigation/search process, adaptive cognitive equilibration between assimilation and accommodation involves balancing tradeoffs between exploration and exploitation; this dynamic equilibrium may be near optimally realized in FEP-AI, wherein control systems governed by expected free energy objective functions naturally balance model simplicity and accuracy. With respect to structure learning, such a balance would involve constructing models and categories that are neither too inclusive nor exclusive. We propose these (generalized) SLAM phenomena may represent some of the most impactful sources of variation in cognition both within and between individuals, suggesting that modulators of H/E-S functioning may potentially illuminate their adaptive significances as fundamental cybernetic control parameters. Finally, we discuss how understanding H/E-S contributions to G-SLAM may provide a unifying framework for high-level cognition and its potential realization in artificial intelligences.
Collapse
Affiliation(s)
- Adam Safron
- Center for Psychedelic and Consciousness Research, Johns Hopkins University School of Medicine, Baltimore, MD, United States
- Cognitive Science Program, Indiana University, Bloomington, IN, United States
- Institute for Advanced Consciousness Studies, Santa Monica, CA, United States
| | - Ozan Çatal
- IDLab, Department of Information Technology, Ghent University—imec, Ghent, Belgium
| | - Tim Verbelen
- IDLab, Department of Information Technology, Ghent University—imec, Ghent, Belgium
| |
Collapse
|