1
|
Ward R, Ramsey R. Integrating Social Cognition Into Domain-General Control: Interactive Activation and Competition for the Control of Action (ICON). Cogn Sci 2024; 48:e13415. [PMID: 38407496 DOI: 10.1111/cogs.13415] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2023] [Revised: 02/02/2024] [Accepted: 02/06/2024] [Indexed: 02/27/2024]
Abstract
Social cognition differs from general cognition in its focus on understanding, perceiving, and interpreting social information. However, we argue that the significance of domain-general processes for controlling cognition has been historically undervalued in social cognition and social neuroscience research. We suggest much of social cognition can be characterized as specialized feature representations supported by domain-general cognitive control systems. To test this proposal, we develop a comprehensive working model, based on an interactive activation and competition architecture and applied to the control of action. As such, we label the model "ICON" (interactive activation and competition model for the control of action). We used the ICON model to simulate human performance across various laboratory tasks. Our simulations emphasize that many laboratory-based social tasks do not require socially specific control systems, such as those that are argued to rely on neural networks associated with theory-of-mind. Moreover, our model clarifies that perceived disruptions in social cognition, even in what appears to be disruption to the control of social cognition, can stem from deficits in social representation instead. We advocate for a "default stance" in social cognition, where control is usually general, but representation is specific. This study underscores the importance of integrating social cognition within the broader realm of domain-general control processing, offering a unified perspective on task processing.
Collapse
Affiliation(s)
- Robert Ward
- Cognitive Neuroscience Institute, Department of Psychology, Bangor University
| | - Richard Ramsey
- Department of Health Sciences and Technology and Department of Humanities, Social and Political Sciences, ETH Zürich
| |
Collapse
|
2
|
Magnuson JS, Crinnion AM, Luthra S, Gaston P, Grubb S. Contra assertions, feedback improves word recognition: How feedback and lateral inhibition sharpen signals over noise. Cognition 2024; 242:105661. [PMID: 37944313 DOI: 10.1016/j.cognition.2023.105661] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2022] [Revised: 10/17/2023] [Accepted: 11/02/2023] [Indexed: 11/12/2023]
Abstract
Whether top-down feedback modulates perception has deep implications for cognitive theories. Debate has been vigorous in the domain of spoken word recognition, where competing computational models and agreement on at least one diagnostic experimental paradigm suggest that the debate may eventually be resolvable. Norris and Cutler (2021) revisit arguments against lexical feedback in spoken word recognition models. They also incorrectly claim that recent computational demonstrations that feedback promotes accuracy and speed under noise (Magnuson et al., 2018) were due to the use of the Luce choice rule rather than adding noise to inputs (noise was in fact added directly to inputs). They also claim that feedback cannot improve word recognition because feedback cannot distinguish signal from noise. We have two goals in this paper. First, we correct the record about the simulations of Magnuson et al. (2018). Second, we explain how interactive activation models selectively sharpen signals via joint effects of feedback and lateral inhibition that boost lexically-coherent sublexical patterns over noise. We also review a growing body of behavioral and neural results consistent with feedback and inconsistent with autonomous (non-feedback) architectures, and conclude that parsimony supports feedback. We close by discussing the potential for synergy between autonomous and interactive approaches.
Collapse
Affiliation(s)
- James S Magnuson
- University of Connecticut. Storrs, CT, USA; BCBL. Basque Center on Cognition Brain and Language, Donostia-San Sebastián, Spain; Ikerbasque. Basque Foundation for Science, Bilbao, Spain.
| | | | | | | | | |
Collapse
|
3
|
Brothers T, Morgan E, Yacovone A, Kuperberg G. Multiple predictions during language comprehension: Friends, foes, or indifferent companions? Cognition 2023; 241:105602. [PMID: 37716311 PMCID: PMC10783882 DOI: 10.1016/j.cognition.2023.105602] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2023] [Accepted: 08/14/2023] [Indexed: 09/18/2023]
Abstract
To comprehend language, we continually use prior context to pre-activate expected upcoming information, resulting in facilitated processing of incoming words that confirm these predictions. But what are the consequences of disconfirming prior predictions? To address this question, most previous studies have examined unpredictable words appearing in contexts that constrain strongly for a single continuation. However, during natural language processing, it is far more common to encounter contexts that constrain for multiple potential continuations, each with some probability. Here, we ask whether and how pre-activating both higher and lower probability alternatives influences the processing of the lower probability incoming word. One possibility is that, similar to language production, there is continuous pressure to select the higher-probability pre-activated alternative through competitive inhibition. During comprehension, this would result in relative costs in processing the lower probability target. A second possibility is that if the two pre-activated alternatives share semantic features, they mutually enhance each other's pre-activation. This would result in greater facilitation in processing the lower probability target. To distinguish between these accounts, we recorded ERPs as participants read three-sentence scenarios that constrained either for a single word or for two potential continuations - a higher probability expected candidate and a lower probability second-best candidate. We found no evidence that competitive pre-activation between the expected and second-best candidates resulted in costs in processing the second-best target, either during lexico-semantic processing (indexed by the N400) or at later stages of processing (indexed by a later frontal positivity). Instead, we found only benefits of pre-activating multiple alternatives, with evidence of enhanced graded facilitation on lower-probability targets that were semantically related to a higher-probability pre-activated alternative. These findings are consistent with a previous eye-tracking study by Luke and Christianson (2016, Cogn Psychol) using corpus-based materials. They have significant theoretical implications for models of predictive language processing, indicating that routine graded prediction in language comprehension does not operate through the same competitive mechanisms that are engaged in language production. Instead, our results align more closely with hierarchical probabilistic accounts of language comprehension, such as predictive coding.
Collapse
Affiliation(s)
- Trevor Brothers
- Department of Psychology, North Carolina A&T, United States of America; Department of Psychology, Tufts University, United States of America
| | - Emily Morgan
- Department of Linguistics, University of California, Davis, United States of America
| | - Anthony Yacovone
- Department of Psychology, Tufts University, United States of America; Department of Psychiatry and the Athinoula A. Martinos Center for Biomedical Imaging, Massachusetts General Hospital, United States of America
| | - Gina Kuperberg
- Department of Psychology, Tufts University, United States of America; Department of Psychiatry and the Athinoula A. Martinos Center for Biomedical Imaging, Massachusetts General Hospital, United States of America.
| |
Collapse
|
4
|
Simon D, Read SJ. Toward a General Framework of Biased Reasoning: Coherence-Based Reasoning. PERSPECTIVES ON PSYCHOLOGICAL SCIENCE 2023:17456916231204579. [PMID: 37983541 DOI: 10.1177/17456916231204579] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2023]
Abstract
A considerable amount of experimental research has been devoted to uncovering biased forms of reasoning. Notwithstanding the richness and overall empirical soundness of the bias research, the field can be described as disjointed, incomplete, and undertheorized. In this article, we seek to address this disconnect by offering "coherence-based reasoning" as a parsimonious theoretical framework that explains a sizable number of important deviations from normative forms of reasoning. Represented in connectionist networks and processed through constraint-satisfaction processing, coherence-based reasoning serves as a ubiquitous, essential, and overwhelmingly adaptive apparatus in people's mental toolbox. This adaptive process, however, can readily be overrun by bias when the network is dominated by nodes or links that are incorrect, overweighted, or otherwise nonnormative. We apply this framework to explain a variety of well-established biased forms of reasoning, including confirmation bias, the halo effect, stereotype spillovers, hindsight bias, motivated reasoning, emotion-driven reasoning, ideological reasoning, and more.
Collapse
Affiliation(s)
- Dan Simon
- Gould School of Law, University of Southern California
- Department of Psychology, University of Southern California
| | - Stephen J Read
- Department of Psychology, University of Southern California
| |
Collapse
|
5
|
Zaleskiewicz T, Traczyk J, Sobkow A. Decision making and mental imagery: A conceptual synthesis and new research directions. JOURNAL OF COGNITIVE PSYCHOLOGY 2023. [DOI: 10.1080/20445911.2023.2198066] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/08/2023]
Affiliation(s)
- Tomasz Zaleskiewicz
- SWPS University of Social Sciences and Humanities, Center for Research in Economic Behavior (CREB), Wroclaw, Poland
| | - Jakub Traczyk
- SWPS University of Social Sciences and Humanities, Center for Research on Improving Decision Making, Wroclaw, Poland
| | - Agata Sobkow
- SWPS University of Social Sciences and Humanities, Center for Research on Improving Decision Making, Wroclaw, Poland
| |
Collapse
|
6
|
Gussow AE. Language production under message uncertainty: When, how, and why we speak before we think. PSYCHOLOGY OF LEARNING AND MOTIVATION 2023. [DOI: 10.1016/bs.plm.2023.02.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/29/2023]
|
7
|
Rosner A, Basieva I, Barque-Duran A, Glöckner A, von Helversen B, Khrennikov A, Pothos EM. Ambivalence in decision making: An eye tracking study. Cogn Psychol 2022; 134:101464. [PMID: 35298978 DOI: 10.1016/j.cogpsych.2022.101464] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2020] [Revised: 02/04/2022] [Accepted: 02/08/2022] [Indexed: 11/19/2022]
Abstract
An intuition of ambivalence in cognition is particularly strong for complex decisions, for which the merits and demerits of different options are roughly equal but hard to compare. We examined information search in an experimental paradigm which tasked participants with an ambivalent question, while monitoring attentional dynamics concerning the information relevant to each option in different Areas of Interest (AOIs). We developed two dynamical models for describing eye tracking curves, for each response separately. The models incorporated a drift mechanism towards the various options, as in standard drift diffusion theory. In addition, they included a mechanism for intrinsic oscillation, which competed with the drift process and undermined eventual stabilization of the dynamics. The two models varied in the range of drift processes postulated. Higher support was observed for the simpler model, which only included drifts from an uncertainty state to either of two certainty states. In addition, model parameters could be weakly related to the eventual decision, complementing our knowledge of the way eye tracking structure relates to decision (notably the gaze cascade effect).
Collapse
Affiliation(s)
- Agnes Rosner
- Department of Psychology, University of Zurich, 8050 Zurich, Switzerland.
| | - Irina Basieva
- Department of Psychology, City, University of London, London EC1V 0HB, UK.
| | - Albert Barque-Duran
- Department of Psychology, City, University of London, London EC1V 0HB, UK; Department of Computer Science, Universitat de Lleida, Carrer de Jaume II, 67, 25001 Lleida, España.
| | - Andreas Glöckner
- Faculty of Human Sciences, University of Cologne, 50931 Cologne, Germany.
| | - Bettina von Helversen
- Department of Psychology, University of Zurich, 8050 Zurich, Switzerland; Department of Psychology, Bremen University, 28359 Bremen, Germany.
| | - Andrei Khrennikov
- International Center for Mathematical Modeling in Physics and Cognitive Sciences Linnaeus University, Sweden.
| | - Emmanuel M Pothos
- Department of Psychology, City, University of London, London EC1V 0HB, UK.
| |
Collapse
|
8
|
Abstract
Researchers often disagree as to whether emotions are largely consistent across people and over time, or whether they are variable. They also disagree as to whether emotions are initiated by appraisals, or whether they may be initiated in diverse ways. We draw upon Parallel-Distributed-Processing to offer an algorithmic account in which features of an emotion instance are bi-directionally connected to each other via conjunction units. We propose that such indirect connections may be innate as well as learned. These ideas lead to the development of the Interactive Activation and Competition framework for Emotion (IAC-E) which allows us to specify when emotions are consistent and when they are variable, as well as when they are appraisal-led and when they are input-agnostic.
Collapse
Affiliation(s)
- Gaurav Suri
- Department of Psychology, San Francisco State University, San Francisco, CA, USA
| | - James J. Gross
- Department of Psychology, Stanford University, Stanford, CA, USA
| |
Collapse
|
9
|
A self-organized sentence processing theory of gradience: The case of islands. Cognition 2022; 222:104943. [PMID: 35026687 DOI: 10.1016/j.cognition.2021.104943] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2021] [Revised: 09/01/2021] [Accepted: 10/18/2021] [Indexed: 11/20/2022]
Abstract
Formal theories of grammar and traditional parsing models, insofar as they presuppose a categorical notion of grammar, face the challenge of accounting for gradient effects (sentences receive gradient acceptability judgments, speakers report a gradient ability to comprehend sentences that deviate from idealized grammatical forms, and various online sentence processing measures yield gradient effects). This challenge is traditionally met by explaining gradient effects in terms of extra-grammatical factors, positing a purely categorical core for the language system. We present a new way of accounting for gradience in a self-organized sentence processing (SOSP) model. SOSP generates structures with a continuous range of grammaticality values by assuming a flexible structure-formation system in which the parses are formed even under sub-optimal circumstances by coercing elements to play roles that do not optimally suit them. We focus on islands, a family of syntactic domains out of which movement is generally prohibited. Islands are interesting because, although many linguistic theories treat them as fully ungrammatical and uninterpretable, experimental studies have revealed gradient patterns of acceptability and evidence for their interpretability. We describe the conceptual framework of SOSP, showing that it largely respects island constraints, but in certain cases, consistent with empirical data, coerces elements that block dependencies into elements that allow them.
Collapse
|
10
|
Shteynberg G, Hirsh JB, Garthoff J, Bentley RA. Agency and Identity in the Collective Self. PERSONALITY AND SOCIAL PSYCHOLOGY REVIEW 2021; 26:35-56. [PMID: 34969333 DOI: 10.1177/10888683211065921] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Contemporary research on human sociality is heavily influenced by the social identity approach, positioning social categorization as the primary mechanism governing social life. Building on the distinction between agency and identity in the individual self ("I" vs. "Me"), we emphasize the analogous importance of distinguishing collective agency from collective identity ("We" vs. "Us"). While collective identity is anchored in the unique characteristics of group members, collective agency involves the adoption of a shared subjectivity that is directed toward some object of our attention, desire, emotion, belief, or action. These distinct components of the collective self are differentiated in terms of their mental representations, neurocognitive underpinnings, conditions of emergence, mechanisms of social convergence, and functional consequences. Overall, we show that collective agency provides a useful complement to the social categorization approach, with unique implications for multiple domains of human social life, including collective action, responsibility, dignity, violence, dominance, ritual, and morality.
Collapse
|
11
|
Mechtenberg H, Xie X, Myers EB. Sentence predictability modulates cortical response to phonetic ambiguity. BRAIN AND LANGUAGE 2021; 218:104959. [PMID: 33930722 PMCID: PMC8513138 DOI: 10.1016/j.bandl.2021.104959] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/13/2020] [Revised: 03/02/2021] [Accepted: 04/09/2021] [Indexed: 06/12/2023]
Abstract
Phonetic categories have undefined edges, such that individual tokens that belong to different speech sound categories may occupy the same region in acoustic space. In continuous speech, there are multiple sources of top-down information (e.g., lexical, semantic) that help to resolve the identity of an ambiguous phoneme. Of interest is how these top-down constraints interact with ambiguity at the phonetic level. In the current fMRI study, participants passively listened to sentences that varied in semantic predictability and in the amount of naturally-occurring phonetic competition. The left middle frontal gyrus, angular gyrus, and anterior inferior frontal gyrus were sensitive to both semantic predictability and the degree of phonetic competition. Notably, greater phonetic competition within non-predictive contexts resulted in a negatively-graded neural response. We suggest that uncertainty at the phonetic-acoustic level interacts with uncertainty at the semantic level-perhaps due to a failure of the network to construct a coherent meaning.
Collapse
Affiliation(s)
- Hannah Mechtenberg
- Department of Speech, Language, and Hearing Sciences, University of Connecticut, Storrs, Mansfield, CT 06269, USA.
| | - Xin Xie
- Department of Brain and Cognitive Sciences, University of Rochester, Rochester, NY 14627, USA.
| | - Emily B Myers
- Department of Speech, Language, and Hearing Sciences, University of Connecticut, Storrs, Mansfield, CT 06269, USA; Department of Psychological Sciences, University of Connecticut, Storrs, Mansfield, CT 06269, USA.
| |
Collapse
|
12
|
Does signal reduction imply predictive coding in models of spoken word recognition? Psychon Bull Rev 2021; 28:1381-1389. [PMID: 33852158 PMCID: PMC8367925 DOI: 10.3758/s13423-021-01924-x] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/24/2021] [Indexed: 12/29/2022]
Abstract
Pervasive behavioral and neural evidence for predictive processing has led to claims that language processing depends upon predictive coding. Formally, predictive coding is a computational mechanism where only deviations from top-down expectations are passed between levels of representation. In many cognitive neuroscience studies, a reduction of signal for expected inputs is taken as being diagnostic of predictive coding. In the present work, we show that despite not explicitly implementing prediction, the TRACE model of speech perception exhibits this putative hallmark of predictive coding, with reductions in total lexical activation, total lexical feedback, and total phoneme activation when the input conforms to expectations. These findings may indicate that interactive activation is functionally equivalent or approximant to predictive coding or that caution is warranted in interpreting neural signal reduction as diagnostic of predictive coding.
Collapse
|
13
|
Luthra S, Peraza‐Santiago G, Beeson K, Saltzman D, Crinnion AM, Magnuson JS. Robust Lexically Mediated Compensation for Coarticulation: Christmash Time Is Here Again. Cogn Sci 2021; 45:e12962. [PMID: 33877697 PMCID: PMC8243960 DOI: 10.1111/cogs.12962] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2020] [Revised: 02/10/2021] [Accepted: 02/19/2021] [Indexed: 11/30/2022]
Abstract
A long-standing question in cognitive science is how high-level knowledge is integrated with sensory input. For example, listeners can leverage lexical knowledge to interpret an ambiguous speech sound, but do such effects reflect direct top-down influences on perception or merely postperceptual biases? A critical test case in the domain of spoken word recognition is lexically mediated compensation for coarticulation (LCfC). Previous LCfC studies have shown that a lexically restored context phoneme (e.g., /s/ in Christma#) can alter the perceived place of articulation of a subsequent target phoneme (e.g., the initial phoneme of a stimulus from a tapes-capes continuum), consistent with the influence of an unambiguous context phoneme in the same position. Because this phoneme-to-phoneme compensation for coarticulation is considered sublexical, scientists agree that evidence for LCfC would constitute strong support for top-down interaction. However, results from previous LCfC studies have been inconsistent, and positive effects have often been small. Here, we conducted extensive piloting of stimuli prior to testing for LCfC. Specifically, we ensured that context items elicited robust phoneme restoration (e.g., that the final phoneme of Christma# was reliably identified as /s/) and that unambiguous context-final segments (e.g., a clear /s/ at the end of Christmas) drove reliable compensation for coarticulation for a subsequent target phoneme. We observed robust LCfC in a well-powered, preregistered experiment with these pretested items (N = 40) as well as in a direct replication study (N = 40). These results provide strong evidence in favor of computational models of spoken word recognition that include top-down feedback.
Collapse
Affiliation(s)
| | | | | | | | | | - James S. Magnuson
- Psychological SciencesUniversity of Connecticut
- BCBL, Basque Center on Cognition Brain and Language
- Ikerbasque, Basque Foundation for Science
| |
Collapse
|
14
|
Nadeau SE. Basal Ganglia and Thalamic Contributions to Language Function: Insights from A Parallel Distributed Processing Perspective. Neuropsychol Rev 2021; 31:495-515. [PMID: 33512608 DOI: 10.1007/s11065-020-09466-0] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2020] [Accepted: 11/10/2020] [Indexed: 11/25/2022]
Abstract
Cerebral representations are encoded as patterns of activity involving billions of neurons. Parallel distributed processing (PDP) across these neuronal populations provides the basis for a number of emergent properties: 1) processing occurs and knowledge (long term memories) is stored (as synaptic connection strengths) in exactly the same networks; 2) networks have the capacity for setting into stable attractor states corresponding to concepts, symbols, implicit rules, or data transformations; 3) networks provide the scaffold for the acquisition of knowledge but knowledge is acquired through experience; 4) PDP networks are adept at incorporating the statistical regularities of experience as well as frequency and age of acquisition effects; 5) networks enable content-addressable memory; 6) because knowledge is distributed throughout networks, they exhibit the property of graceful degradation; 7) networks intrinsically provide the capacity for inference. This paper details the features of the basal ganglia and thalamic systems (recurrent and distributed connectivity) that support PDP. The PDP lens and an understanding of the attractor trench dynamics of the basal ganglia provide a natural explanation for the peculiar dysfunctions of Parkinson's disease and the mechanisms by which dopamine deficiency is causal. The PDP lens, coupled with the fact that the basal ganglia of humans bears strong homology to the basal ganglia of lampreys and the central complex of arthropods, reveals that the fundamental function of the basal ganglia is computational and involves the reduction of the vast dimensionality of a complex multi-dimensional array of sensorimotor input into the optimal choice from a small repertoire of behavioral options - the essence of reactive intention (automatic responses to sensory input). There is strong evidence that the sensorimotor basal ganglia make no contributions to cognitive or motor function in humans but can cause serious dysfunction when pathological. It appears that humans, through the course of evolution, have developed cortical capacities (working memory and volitional and reactive attention) for managing sensory input, however complex, that obviate the need for the basal ganglia. The functions of the dorsal tier thalamus, however, even viewed with an understanding of the properties of population encoded representations, remain somewhat more obscure. Possibilities include the enabling of attractor state constellations that optimize function by taking advantage of simultaneous input from multiple cortical areas; selective engagement of cortical representations; and support of the gamma frequency synchrony that enables binding of the multiple network representations that comprise a full concept representation.
Collapse
Affiliation(s)
- Stephen E Nadeau
- Research Service and the Brain Rehabilitation Research Center, Malcom Randall VA Medical Center and the Department of Neurology, University of Florida College of Medicine, 1601 SW Archer Road, Gainesville, FL, 32608-1197, US.
| |
Collapse
|
15
|
Lampinen AK, McClelland JL. Transforming task representations to perform novel tasks. Proc Natl Acad Sci U S A 2020; 117:32970-32981. [PMID: 33303652 PMCID: PMC7777120 DOI: 10.1073/pnas.2008852117] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
An important aspect of intelligence is the ability to adapt to a novel task without any direct experience (zero shot), based on its relationship to previous tasks. Humans can exhibit this cognitive flexibility. By contrast, models that achieve superhuman performance in specific tasks often fail to adapt to even slight task alterations. To address this, we propose a general computational framework for adapting to novel tasks based on their relationship to prior tasks. We begin by learning vector representations of tasks. To adapt to new tasks, we propose metamappings, higher-order tasks that transform basic task representations. We demonstrate the effectiveness of this framework across a wide variety of tasks and computational paradigms, ranging from regression to image classification and reinforcement learning. We compare to both human adaptability and language-based approaches to zero-shot learning. Across these domains, metamapping is successful, often achieving 80 to 90% performance, without any data, on a novel task, even when the new task directly contradicts prior experience. We further show that metamapping can not only generalize to new tasks via learned relationships, but can also generalize using novel relationships unseen during training. Finally, using metamapping as a starting point can dramatically accelerate later learning on a new task and reduce learning time and cumulative error substantially. Our results provide insight into a possible computational basis of intelligent adaptability and offer a possible framework for modeling cognitive flexibility and building more flexible artificial intelligence systems.
Collapse
|
16
|
Luthra S, You H, Rueckl JG, Magnuson JS. Friends in Low-Entropy Places: Orthographic Neighbor Effects on Visual Word Identification Differ Across Letter Positions. Cogn Sci 2020; 44:e12917. [PMID: 33274485 PMCID: PMC8211392 DOI: 10.1111/cogs.12917] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2019] [Revised: 06/24/2020] [Accepted: 10/01/2020] [Indexed: 11/28/2022]
Abstract
Visual word recognition is facilitated by the presence of orthographic neighbors that mismatch the target word by a single letter substitution. However, researchers typically do not consider where neighbors mismatch the target. In light of evidence that some letter positions are more informative than others, we investigate whether the influence of orthographic neighbors differs across letter positions. To do so, we quantify the number of enemies at each letter position (how many neighbors mismatch the target word at that position). Analyses of reaction time data from a visual word naming task indicate that the influence of enemies differs across letter positions, with the negative impacts of enemies being most pronounced at letter positions where readers have low prior uncertainty about which letters they will encounter (i.e., positions with low entropy). To understand the computational mechanisms that give rise to such positional entropy effects, we introduce a new computational model, VOISeR (Visual Orthographic Input Serial Reader), which receives orthographic inputs in parallel and produces an over-time sequence of phonemes as output. VOISeR produces a similar pattern of results as in the human data, suggesting that positional entropy effects may emerge even when letters are not sampled serially. Finally, we demonstrate that these effects also emerge in human subjects' data from a lexical decision task, illustrating the generalizability of positional entropy effects across visual word recognition paradigms. Taken together, such work suggests that research into orthographic neighbor effects in visual word recognition should also consider differences between letter positions.
Collapse
Affiliation(s)
- Sahil Luthra
- Department of Psychological Sciences, University of Connecticut
- Connecticut Institute for the Brain and Cognitive Sciences
| | - Heejo You
- Department of Psychological Sciences, University of Connecticut
| | - Jay G. Rueckl
- Department of Psychological Sciences, University of Connecticut
- Connecticut Institute for the Brain and Cognitive Sciences
- Haskins Laboratories
| | - James S. Magnuson
- Department of Psychological Sciences, University of Connecticut
- Connecticut Institute for the Brain and Cognitive Sciences
- Haskins Laboratories
| |
Collapse
|
17
|
Sohoglu E, Davis MH. Rapid computations of spectrotemporal prediction error support perception of degraded speech. eLife 2020; 9:e58077. [PMID: 33147138 PMCID: PMC7641582 DOI: 10.7554/elife.58077] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2020] [Accepted: 10/19/2020] [Indexed: 12/15/2022] Open
Abstract
Human speech perception can be described as Bayesian perceptual inference but how are these Bayesian computations instantiated neurally? We used magnetoencephalographic recordings of brain responses to degraded spoken words and experimentally manipulated signal quality and prior knowledge. We first demonstrate that spectrotemporal modulations in speech are more strongly represented in neural responses than alternative speech representations (e.g. spectrogram or articulatory features). Critically, we found an interaction between speech signal quality and expectations from prior written text on the quality of neural representations; increased signal quality enhanced neural representations of speech that mismatched with prior expectations, but led to greater suppression of speech that matched prior expectations. This interaction is a unique neural signature of prediction error computations and is apparent in neural responses within 100 ms of speech input. Our findings contribute to the detailed specification of a computational model of speech perception based on predictive coding frameworks.
Collapse
Affiliation(s)
- Ediz Sohoglu
- School of Psychology, University of SussexBrightonUnited Kingdom
| | - Matthew H Davis
- MRC Cognition and Brain Sciences UnitCambridgeUnited Kingdom
| |
Collapse
|
18
|
McClelland JL, Hill F, Rudolph M, Baldridge J, Schütze H. Placing language in an integrated understanding system: Next steps toward human-level performance in neural language models. Proc Natl Acad Sci U S A 2020; 117:25966-25974. [PMID: 32989131 PMCID: PMC7585006 DOI: 10.1073/pnas.1910416117] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Language is crucial for human intelligence, but what exactly is its role? We take language to be a part of a system for understanding and communicating about situations. In humans, these abilities emerge gradually from experience and depend on domain-general principles of biological neural networks: connection-based learning, distributed representation, and context-sensitive, mutual constraint satisfaction-based processing. Current artificial language processing systems rely on the same domain general principles, embodied in artificial neural networks. Indeed, recent progress in this field depends on query-based attention, which extends the ability of these systems to exploit context and has contributed to remarkable breakthroughs. Nevertheless, most current models focus exclusively on language-internal tasks, limiting their ability to perform tasks that depend on understanding situations. These systems also lack memory for the contents of prior situations outside of a fixed contextual span. We describe the organization of the brain's distributed understanding system, which includes a fast learning system that addresses the memory problem. We sketch a framework for future models of understanding drawing equally on cognitive neuroscience and artificial intelligence and exploiting query-based attention. We highlight relevant current directions and consider further developments needed to fully capture human-level language understanding in a computational system.
Collapse
Affiliation(s)
- James L McClelland
- Department of Psychology, Stanford University, Stanford, CA 94305;
- DeepMind, London N1C 4AG, United Kingdom
| | - Felix Hill
- DeepMind, London N1C 4AG, United Kingdom;
| | - Maja Rudolph
- Bosch Center for Artificial Intelligence, Renningen 71272, Germany;
| | | | - Hinrich Schütze
- Center for Information and Language Processing, Ludwig Maximilian University of Munich, Munich 80538, Germany
| |
Collapse
|
19
|
Bermúdez-Margaretto B, Beltrán D, Shtyrov Y, Dominguez A, Cuetos F. Neurophysiological Correlates of Top-Down Phonological and Semantic Influence during the Orthographic Processing of Novel Visual Word-Forms. Brain Sci 2020; 10:E717. [PMID: 33050157 PMCID: PMC7601445 DOI: 10.3390/brainsci10100717] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2020] [Revised: 09/25/2020] [Accepted: 10/06/2020] [Indexed: 11/17/2022] Open
Abstract
The acquisition of new vocabulary is usually mediated by previous experience with language. In the visual domain, the representation of orthographically unfamiliar words at the phonological or conceptual levels may facilitate their orthographic learning. The neural correlates of this advantage were investigated by recording EEG activity during reading novel and familiar words across three different experiments (n = 22 each), manipulating the availability of previous knowledge on the novel written words. A different pattern of event-related potential (ERP) responses was found depending on the previous training, resembling cross-level top-down interactive effects during vocabulary acquisition. Thus, whereas previous phonological experience caused a modulation at the post-lexical stages of the visual recognition of novel written words (~520 ms), additional semantic training influenced their processing at a lexico-semantic stage (~320 ms). Moreover, early lexical differences (~180 ms) elicited in the absence of previous training did not emerge after both phonological and semantic training, reflecting similar orthographic processing and word-form access.
Collapse
Affiliation(s)
- Beatriz Bermúdez-Margaretto
- Centre for Cognition and Decision Making, Institute for Cognitive Neuroscience, National Research University Higher School of Economics, 101000 Moscow, Russia
| | - David Beltrán
- Instituto Universitario de Neurociencia (IUNE), Universidad de La Laguna, 38071 Tenerife, Spain; (D.B.); (A.D.)
- Facultad de Psicología, Universidad de La Laguna, 38071 Tenerife, Spain
| | - Yury Shtyrov
- Institute for Clinical Medicine—Center for Functionally Integrative Neuroscience (CFIN), Aarhus University, 8000 Aarhus, Denmark;
| | - Alberto Dominguez
- Instituto Universitario de Neurociencia (IUNE), Universidad de La Laguna, 38071 Tenerife, Spain; (D.B.); (A.D.)
- Facultad de Psicología, Universidad de La Laguna, 38071 Tenerife, Spain
| | - Fernando Cuetos
- Facultad de Psicología, Universidad de Oviedo, 33001 Oviedo, Spain;
| |
Collapse
|
20
|
Fox NP, Leonard M, Sjerps MJ, Chang EF. Transformation of a temporal speech cue to a spatial neural code in human auditory cortex. eLife 2020; 9:e53051. [PMID: 32840483 PMCID: PMC7556862 DOI: 10.7554/elife.53051] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2019] [Accepted: 08/21/2020] [Indexed: 11/28/2022] Open
Abstract
In speech, listeners extract continuously-varying spectrotemporal cues from the acoustic signal to perceive discrete phonetic categories. Spectral cues are spatially encoded in the amplitude of responses in phonetically-tuned neural populations in auditory cortex. It remains unknown whether similar neurophysiological mechanisms encode temporal cues like voice-onset time (VOT), which distinguishes sounds like /b/ and/p/. We used direct brain recordings in humans to investigate the neural encoding of temporal speech cues with a VOT continuum from /ba/ to /pa/. We found that distinct neural populations respond preferentially to VOTs from one phonetic category, and are also sensitive to sub-phonetic VOT differences within a population's preferred category. In a simple neural network model, simulated populations tuned to detect either temporal gaps or coincidences between spectral cues captured encoding patterns observed in real neural data. These results demonstrate that a spatial/amplitude neural code underlies the cortical representation of both spectral and temporal speech cues.
Collapse
Affiliation(s)
- Neal P Fox
- Department of Neurological Surgery, University of California, San FranciscoSan FranciscoUnited States
| | - Matthew Leonard
- Department of Neurological Surgery, University of California, San FranciscoSan FranciscoUnited States
| | - Matthias J Sjerps
- Donders Institute for Brain, Cognition and Behaviour, Centre for Cognitive Neuroimaging, Radboud UniversityNijmegenNetherlands
- Max Planck Institute for PsycholinguisticsNijmegenNetherlands
| | - Edward F Chang
- Department of Neurological Surgery, University of California, San FranciscoSan FranciscoUnited States
- Weill Institute for Neurosciences, University of California, San FranciscoSan FranciscoUnited States
| |
Collapse
|
21
|
Gibbs RW, Colston HL. Pragmatics Always Matters: An Expanded Vision of Experimental Pragmatics. Front Psychol 2020; 11:1619. [PMID: 32793043 PMCID: PMC7393237 DOI: 10.3389/fpsyg.2020.01619] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2020] [Accepted: 06/16/2020] [Indexed: 12/05/2022] Open
Abstract
Much of the work in experimental pragmatics is devoted to testing empirical hypotheses that arise within the study of linguistic and philosophical pragmatics. The focus in much of this work is focused on those aspects of communicated meaning that are “inferred” rather than understood through linguistic “coding” processes. Under this view, pragmatic meanings emerge secondarily after purely linguistic meanings are accessed or computed. Our aim in this article is to greatly broaden the scope of experimental pragmatic studies by calling for much greater emphasis on the complete pragmatics of language use. Pragmatics is continuously present and constrains people’s real-time production and processing of language in context. Experimental pragmatics should attend more to the particularities of pragmatic experience through closer examination of the people we study, the specific tasks used to assess understanding, as well as the actual complex meanings people interpret in diverse contexts. The many specifics of human pragmatics demand the study and theoretical inclusion of many bodily, linguistic, and situational factors that make up each instance of meaning making.
Collapse
Affiliation(s)
- Raymond W. Gibbs
- Independent Researcher, Soquel, CA, United States
- *Correspondence: Raymond W. Gibbs Jr.,
| | | |
Collapse
|
22
|
Bock AJ, Warglien M, George G. A simulation-based approach to business model design and organizational Change. INNOVATION-ORGANIZATION & MANAGEMENT 2020. [DOI: 10.1080/14479338.2020.1769482] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
Affiliation(s)
- Adam J. Bock
- University of Wisconsin, Wisconsin School of Business, 975 University Avenue, Madison, WI, USA
| | - Massimo Warglien
- Ca’ Foscari University of Venice, Advanced School of Economics, Venice, Italy
| | - Gerard George
- Lee Kong Chian School of Business, Singapore Management University, Singapore, Singapore
| |
Collapse
|
23
|
Early lexical influences on sublexical processing in speech perception: Evidence from electrophysiology. Cognition 2020; 197:104162. [DOI: 10.1016/j.cognition.2019.104162] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2019] [Revised: 12/11/2019] [Accepted: 12/16/2019] [Indexed: 11/17/2022]
|
24
|
Nadeau SE. Neural Population Dynamics and Cognitive Function. Front Hum Neurosci 2020; 14:50. [PMID: 32226366 PMCID: PMC7080985 DOI: 10.3389/fnhum.2020.00050] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2019] [Accepted: 02/04/2020] [Indexed: 12/27/2022] Open
Abstract
Representations in the brain are encoded as patterns of activity of large populations of neurons. The science of population encoded representations, also known as parallel distributed processing (PDP), achieves neurological verisimilitude and has been able to account for a large number of cognitive phenomena in normal people, including reaction times (and reading latencies), stimulus recognition, the effect of stimulus salience on attention, perceptual invariance, simultaneous egocentric and allocentric visual processing, top-down/bottom-up processing, language errors, the effect of statistical regularities of experience, frequency, and age of acquisition, instantiation of rules and symbols, content addressable memory and the capacity for pattern completion, preservation of function in the face of noisy or distorted input, inference, parallel constraint satisfaction, the binding problem and gamma coherence, principles of hippocampal function, the location of knowledge in the brain, limitations in the scope and depth of knowledge acquired through experience, and Piagetian stages of cognitive development. PDP studies have been able to provide a coherent account for impairment in a variety of language functions resulting from stroke or dementia in a large number of languages and the phenomenon of graceful degradation observed in such studies. They have also made important contributions to our understanding of attention (including hemispatial neglect), emotional function, executive function, motor planning, visual processing, decision making, and neuroeconomics. The relationship of neural network population dynamics to electroencephalographic rhythms is starting to emerge. Nevertheless, PDP approaches have scarcely penetrated major areas of study of cognition, including neuropsychology and cognitive neuropsychology, as well as much of cognitive psychology. This article attempts to provide an overview of PDP principles and applications that addresses a broader audience.
Collapse
Affiliation(s)
- Stephen E. Nadeau
- Research Service and the Brain Rehabilitation Research Center, Malcom Randall VA Medical Center, Department of Neurology, College of Medicine, University of Florida, Gainesville, FL, United States
| |
Collapse
|
25
|
de Almeida RG, Di Nardo J, Antal C, von Grünau MW. Understanding Events by Eye and Ear: Agent and Verb Drive Non-anticipatory Eye Movements in Dynamic Scenes. Front Psychol 2019; 10:2162. [PMID: 31649574 PMCID: PMC6795699 DOI: 10.3389/fpsyg.2019.02162] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2018] [Accepted: 09/09/2019] [Indexed: 11/13/2022] Open
Abstract
As Macnamara (1978) once asked, how can we talk about what we see? We report on a study manipulating realistic dynamic scenes and sentences aiming to understand the interaction between linguistic and visual representations in real-world situations. Specifically, we monitored participants' eye movements as they watched video clips of everyday scenes while listening to sentences describing these scenes. We manipulated two main variables. The first was the semantic class of the verb in the sentence and the second was the action/motion of the agent in the unfolding event. The sentences employed two verb classes-causatives (e.g., break) and perception/psychological (e.g., notice)-which impose different constraints on the nouns that serve as their grammatical complements. The scenes depicted events in which agents either moved toward a target object (always the referent of the verb-complement noun), away from it, or remained neutral performing a given activity (such as cooking). Scenes and sentences were synchronized such that the verb onset corresponded to the first video frame of the agent motion toward or away from the object. Results show effects of agent motion but weak verb-semantic restrictions: causatives draw more attention to potential referents of their grammatical complements than perception verbs only when the agent moves toward the target object. Crucially, we found no anticipatory verb-driven eye movements toward the target object, contrary to studies using non-naturalistic and static scenes. We propose a model in which linguistic and visual computations in real-world situations occur largely independent of each other during the early moments of perceptual input, but rapidly interact at a central, conceptual system using a common, propositional code. Implications for language use in real world contexts are discussed.
Collapse
Affiliation(s)
| | - Julia Di Nardo
- Department of Psychology, Concordia University, Montreal, QC, Canada
| | - Caitlyn Antal
- Department of Psychology, Concordia University, Montreal, QC, Canada.,Department of Linguistics, Yale University, New Haven, CT, United States
| | | |
Collapse
|
26
|
Delaney-Busch N, Morgan E, Lau E, Kuperberg GR. Neural evidence for Bayesian trial-by-trial adaptation on the N400 during semantic priming. Cognition 2019; 187:10-20. [PMID: 30797099 PMCID: PMC6552672 DOI: 10.1016/j.cognition.2019.01.001] [Citation(s) in RCA: 31] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2018] [Revised: 12/21/2018] [Accepted: 01/03/2019] [Indexed: 10/27/2022]
Abstract
When semantic information is activated by a context prior to new bottom-up input (i.e. when a word is predicted), semantic processing of that incoming word is typically facilitated, attenuating the amplitude of the N400 event related potential (ERP) - a direct neural measure of semantic processing. N400 modulation is observed even when the context is a single semantically related "prime" word. This so-called "N400 semantic priming effect" is sensitive to the probability of encountering a related prime-target pair within an experimental block, suggesting that participants may be adapting the strength of their predictions to the predictive validity of their broader experimental environment. We formalize this adaptation using a Bayesian learning model that estimates and updates the probability of encountering a related versus an unrelated prime-target pair on each successive trial. We found that our model's trial-by-trial estimates of target word probability accounted for significant variance in trial-by-trial N400 amplitude. These findings suggest that Bayesian principles contribute to how comprehenders adapt their semantic predictions to the statistical structure of their broader environment, with implications for the functional significance of the N400 component and the predictive nature of language processing.
Collapse
Affiliation(s)
| | - Emily Morgan
- Department of Psychology, Tufts University, USA; Department of Linguistics, University of California, Davis, USA.
| | - Ellen Lau
- Department of Linguistics, University of Maryland, USA
| | - Gina R Kuperberg
- Department of Psychology, Tufts University, USA; Department of Psychiatry and the Athinoula A. Martinos Center for Biomedical Imaging, Massachusetts General Hospital, Harvard Medical School, USA
| |
Collapse
|
27
|
Norris D, McQueen JM, Cutler A. Commentary on "Interaction in Spoken Word Recognition Models". Front Psychol 2018; 9:1568. [PMID: 30233453 PMCID: PMC6129619 DOI: 10.3389/fpsyg.2018.01568] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2018] [Accepted: 08/07/2018] [Indexed: 11/26/2022] Open
Affiliation(s)
- Dennis Norris
- MRC Cognition and Brain Sciences Unit, University of Cambridge, Cambridge, United Kingdom
| | - James M McQueen
- Donders Institute for Brain, Cognition and Behaviour, Radboud University, Nijmegen, Netherlands.,Max Planck Institute for Psycholinguistics, Nijmegen, Netherlands
| | - Anne Cutler
- Max Planck Institute for Psycholinguistics, Nijmegen, Netherlands.,MARCS Institute, Western Sydney University, Penrith, NSW, Australia
| |
Collapse
|
28
|
Rutishauser U, Slotine JJ, Douglas RJ. Solving Constraint-Satisfaction Problems with Distributed Neocortical-Like Neuronal Networks. Neural Comput 2018; 30:1359-1393. [PMID: 29566357 PMCID: PMC5930080 DOI: 10.1162/neco_a_01074] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023]
Abstract
Finding actions that satisfy the constraints imposed by both external inputs and internal representations is central to decision making. We demonstrate that some important classes of constraint satisfaction problems (CSPs) can be solved by networks composed of homogeneous cooperative-competitive modules that have connectivity similar to motifs observed in the superficial layers of neocortex. The winner-take-all modules are sparsely coupled by programming neurons that embed the constraints onto the otherwise homogeneous modular computational substrate. We show rules that embed any instance of the CSP's planar four-color graph coloring, maximum independent set, and sudoku on this substrate and provide mathematical proofs that guarantee these graph coloring problems will convergence to a solution. The network is composed of nonsaturating linear threshold neurons. Their lack of right saturation allows the overall network to explore the problem space driven through the unstable dynamics generated by recurrent excitation. The direction of exploration is steered by the constraint neurons. While many problems can be solved using only linear inhibitory constraints, network performance on hard problems benefits significantly when these negative constraints are implemented by nonlinear multiplicative inhibition. Overall, our results demonstrate the importance of instability rather than stability in network computation and offer insight into the computational role of dual inhibitory mechanisms in neural circuits.
Collapse
Affiliation(s)
- Ueli Rutishauser
- Computation and Neural Systems, Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA 91125, U.S.A., and Cedars-Sinai Medical Center, Departments of Neurosurgery, Neurology and Biomedical Sciences, Los Angeles, CA 90048, U.S.A.
| | - Jean-Jacques Slotine
- Nonlinear Systems Laboratory, Department of Mechanical Engineering and Department of Brain and Cognitive Sciences, MIT, Cambridge, MA 02139, U.S.A.
| | - Rodney J Douglas
- Institute of Neuroinformatics, University and ETH Zurich, Zurich 8057, Switzerland
| |
Collapse
|
29
|
Magnuson JS, Mirman D, Luthra S, Strauss T, Harris HD. Interaction in Spoken Word Recognition Models: Feedback Helps. Front Psychol 2018; 9:369. [PMID: 29666593 PMCID: PMC5891609 DOI: 10.3389/fpsyg.2018.00369] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2017] [Accepted: 03/06/2018] [Indexed: 11/13/2022] Open
Abstract
Human perception, cognition, and action requires fast integration of bottom-up signals with top-down knowledge and context. A key theoretical perspective in cognitive science is the interactive activation hypothesis: forward and backward flow in bidirectionally connected neural networks allows humans and other biological systems to approximate optimal integration of bottom-up and top-down information under real-world constraints. An alternative view is that online feedback is neither necessary nor helpful; purely feed forward alternatives can be constructed for any feedback system, and online feedback could not improve processing and would preclude veridical perception. In the domain of spoken word recognition, the latter view was apparently supported by simulations using the interactive activation model, TRACE, with and without feedback: as many words were recognized more quickly without feedback as were recognized faster with feedback, However, these simulations used only a small set of words and did not address a primary motivation for interaction: making a model robust in noise. We conducted simulations using hundreds of words, and found that the majority were recognized more quickly with feedback than without. More importantly, as we added noise to inputs, accuracy and recognition times were better with feedback than without. We follow these simulations with a critical review of recent arguments that online feedback in interactive activation models like TRACE is distinct from other potentially helpful forms of feedback. We conclude that in addition to providing the benefits demonstrated in our simulations, online feedback provides a plausible means of implementing putatively distinct forms of feedback, supporting the interactive activation hypothesis.
Collapse
Affiliation(s)
- James S. Magnuson
- Department of Psychological Sciences, University of Connecticut, Storrs, CT, United States
- Connecticut Institute for the Brain and Cognitive Sciences, University of Connecticut, Storrs, CT, United States
| | - Daniel Mirman
- Department of Psychology, University of Alabama at Birmingham, Birmingham, AL, United States
| | - Sahil Luthra
- Department of Psychological Sciences, University of Connecticut, Storrs, CT, United States
- Connecticut Institute for the Brain and Cognitive Sciences, University of Connecticut, Storrs, CT, United States
| | - Ted Strauss
- McConnell Brain Imaging Centre, McGill University, Montreal, QC, Canada
| | - Harlan D. Harris
- Department of Psychological Sciences, University of Connecticut, Storrs, CT, United States
- Connecticut Institute for the Brain and Cognitive Sciences, University of Connecticut, Storrs, CT, United States
| |
Collapse
|
30
|
Marewski JN, Bröder A, Glöckner A. Some Metatheoretical Reflections on Adaptive Decision Making and the Strategy Selection Problem. JOURNAL OF BEHAVIORAL DECISION MAKING 2018. [DOI: 10.1002/bdm.2075] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Affiliation(s)
- Julian N. Marewski
- Faculty of Business and Economics; University of Lausanne; Lausanne Switzerland
| | - Arndt Bröder
- School of Social Sciences; University of Mannheim; Mannheim Germany
| | - Andreas Glöckner
- Institute for Psychology; University of Hagen; Hagen Germany
- Max Planck Institute for Research on Collective Goods; Bonn Germany
| |
Collapse
|
31
|
Hayes BK, Heit E. Inductive reasoning 2.0. WILEY INTERDISCIPLINARY REVIEWS. COGNITIVE SCIENCE 2017; 9:e1459. [PMID: 29283506 DOI: 10.1002/wcs.1459] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/11/2017] [Revised: 07/09/2017] [Accepted: 10/23/2017] [Indexed: 11/08/2022]
Abstract
Inductive reasoning entails using existing knowledge to make predictions about novel cases. The first part of this review summarizes key inductive phenomena and critically evaluates theories of induction. We highlight recent theoretical advances, with a special emphasis on the structured statistical approach, the importance of sampling assumptions in Bayesian models, and connectionist modeling. A number of new research directions in this field are identified including comparisons of inductive and deductive reasoning, the identification of common core processes in induction and memory tasks and induction involving category uncertainty. The implications of induction research for areas as diverse as complex decision-making and fear generalization are discussed. This article is categorized under: Psychology > Reasoning and Decision Making Psychology > Learning.
Collapse
Affiliation(s)
- Brett K Hayes
- Department of Psychology, University of New South Wales, Sydney, Australia
| | - Evan Heit
- School of Social Sciences, Humanities and Arts, University of California, Merced, California
| |
Collapse
|
32
|
|
33
|
Winn MB. Rapid Release From Listening Effort Resulting From Semantic Context, and Effects of Spectral Degradation and Cochlear Implants. Trends Hear 2016; 20:2331216516669723. [PMID: 27698260 PMCID: PMC5051669 DOI: 10.1177/2331216516669723] [Citation(s) in RCA: 84] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2016] [Revised: 08/26/2016] [Accepted: 08/26/2016] [Indexed: 11/15/2022] Open
Abstract
People with hearing impairment are thought to rely heavily on context to compensate for reduced audibility. Here, we explore the resulting cost of this compensatory behavior, in terms of effort and the efficiency of ongoing predictive language processing. The listening task featured predictable or unpredictable sentences, and participants included people with cochlear implants as well as people with normal hearing who heard full-spectrum/unprocessed or vocoded speech. The crucial metric was the growth of the pupillary response and the reduction of this response for predictable versus unpredictable sentences, which would suggest reduced cognitive load resulting from predictive processing. Semantic context led to rapid reduction of listening effort for people with normal hearing; the reductions were observed well before the offset of the stimuli. Effort reduction was slightly delayed for people with cochlear implants and considerably more delayed for normal-hearing listeners exposed to spectrally degraded noise-vocoded signals; this pattern of results was maintained even when intelligibility was perfect. Results suggest that speed of sentence processing can still be disrupted, and exertion of effort can be elevated, even when intelligibility remains high. We discuss implications for experimental and clinical assessment of speech recognition, in which good performance can arise because of cognitive processes that occur after a stimulus, during a period of silence. Because silent gaps are not common in continuous flowing speech, the cognitive/linguistic restorative processes observed after sentences in such studies might not be available to listeners in everyday conversations, meaning that speech recognition in conventional tests might overestimate sentence-processing capability.
Collapse
Affiliation(s)
- Matthew B. Winn
- Department of Speech & Hearing Sciences, University of Washington, Seattle, WA, USA
| |
Collapse
|
34
|
Grossberg S, Kazerounian S. Phoneme restoration and empirical coverage of Interactive Activation and Adaptive Resonance models of human speech processing. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2016; 140:1130. [PMID: 27586743 DOI: 10.1121/1.4946760] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
Magnuson [J. Acoust. Soc. Am. 137, 1481-1492 (2015)] makes claims for Interactive Activation (IA) models and against Adaptive Resonance Theory (ART) models of speech perception. Magnuson also presents simulations that claim to show that the TRACE model can simulate phonemic restoration, which was an explanatory target of the cARTWORD ART model. The theoretical analysis and review herein show that these claims are incorrect. More generally, the TRACE and cARTWORD models illustrate two diametrically opposed types of neural models of speech and language. The TRACE model embodies core assumptions with no analog in known brain processes. The cARTWORD model defines a hierarchy of cortical processing regions whose networks embody cells in laminar cortical circuits as part of the paradigm of laminar computing. cARTWORD further develops ART speech and language models that were introduced in the 1970s. It builds upon Item-Order-Rank working memories, which activate learned list chunks that unitize sequences to represent phonemes, syllables, and words. Psychophysical and neurophysiological data support Item-Order-Rank mechanisms and contradict TRACE representations of time, temporal order, silence, and top-down processing that exhibit many anomalous properties, including hallucinations of non-occurring future phonemes. Computer simulations of the TRACE model are presented that demonstrate these failures.
Collapse
Affiliation(s)
- Stephen Grossberg
- Departments of Mathematics, Psychology, and Biomedical Engineering, Center for Adaptive Systems, Graduate Program in Cognitive and Neural Systems, Center for Computational Neuroscience and Neural Technology, Boston University, Boston, Massachusetts 02215, USA
| | - Sohrob Kazerounian
- Nuance Communications, Inc., 1 Wayside Road, Burlington, Massachusetts 01803, USA
| |
Collapse
|
35
|
Predictive coding as a model of cognition. Cogn Process 2016; 17:279-305. [DOI: 10.1007/s10339-016-0765-6] [Citation(s) in RCA: 44] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2015] [Accepted: 04/06/2016] [Indexed: 10/21/2022]
|
36
|
An fMRI study investigating effects of conceptually related sentences on the perception of degraded speech. Cortex 2016; 79:57-74. [PMID: 27100909 DOI: 10.1016/j.cortex.2016.03.014] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2015] [Revised: 01/06/2016] [Accepted: 03/15/2016] [Indexed: 11/20/2022]
Abstract
Prior research has shown that the perception of degraded speech is influenced by within sentence meaning and recruits one or more components of a frontal-temporal-parietal network. The goal of the current study is to examine whether the overall conceptual meaning of a sentence, made up of one set of words, influences the perception of a second acoustically degraded sentence, made up of a different set of words. Using functional magnetic resonance imaging (fMRI), we presented an acoustically clear sentence followed by an acoustically degraded sentence and manipulated the semantic relationship between them: Related in meaning (but consisting of different content words), Unrelated in meaning, or Same. Results showed that listeners' word recognition accuracy for the acoustically degraded sentences was significantly higher when the target sentence was preceded by a conceptually related compared to a conceptually unrelated sentence. Sensitivity to conceptual relationships was associated with enhanced activity in middle and inferior frontal, temporal, and parietal areas. In addition, the left middle frontal gyrus (LMFG), left inferior frontal gyrus (LIFG), and left middle temporal gyrus (LMTG) showed activity that correlated with individual performance on the Related condition. The superior temporal gyrus (STG) showed increased activation in the Same condition suggesting that it is sensitive to perceptual similarity rather than the integration of meaning between the sentence pairs. A fronto-temporo-parietal network appears to consolidate information sources across multiple levels of language (acoustic, lexical, syntactic, semantic) to build, and ultimately integrate conceptual information across sentences and facilitate the perception of a degraded speech signal. However, the nature of the sources of information that are available differentially recruit specific regions and modulate their activity within this network. Implications of these findings for the functional architecture of the network are considered.
Collapse
|
37
|
Norris D, McQueen JM, Cutler A. Prediction, Bayesian inference and feedback in speech recognition. LANGUAGE, COGNITION AND NEUROSCIENCE 2016; 31:4-18. [PMID: 26740960 PMCID: PMC4685608 DOI: 10.1080/23273798.2015.1081703] [Citation(s) in RCA: 60] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/18/2015] [Accepted: 08/05/2015] [Indexed: 05/19/2023]
Abstract
Speech perception involves prediction, but how is that prediction implemented? In cognitive models prediction has often been taken to imply that there is feedback of activation from lexical to pre-lexical processes as implemented in interactive-activation models (IAMs). We show that simple activation feedback does not actually improve speech recognition. However, other forms of feedback can be beneficial. In particular, feedback can enable the listener to adapt to changing input, and can potentially help the listener to recognise unusual input, or recognise speech in the presence of competing sounds. The common feature of these helpful forms of feedback is that they are all ways of optimising the performance of speech recognition using Bayesian inference. That is, listeners make predictions about speech because speech recognition is optimal in the sense captured in Bayesian models.
Collapse
Affiliation(s)
- Dennis Norris
- MRC Cognition and Brain Sciences Unit, Cambridge, UK
| | - James M. McQueen
- Donders Institute for Brain, Cognition and Behaviour, Radboud University, Nijmegen, The Netherlands
- Max Planck Institute for Psycholinguistics, Nijmegen, The Netherlands
| | - Anne Cutler
- Max Planck Institute for Psycholinguistics, Nijmegen, The Netherlands
- MARCS Institute, University of Western Sydney, Penrith South, NSW2751, Australia
| |
Collapse
|
38
|
|
39
|
Abstract
Can what we know change what we see? Does language affect cognition and perception? The last few years have seen increased attention to these seemingly disparate questions, but with little theoretical advance. We argue that substantial clarity can be gained by considering these questions through the lens of predictive processing, a framework in which mental representations—from the perceptual to the cognitive—reflect an interplay between downward-flowing predictions and upward-flowing sensory signals. This framework provides a parsimonious account of how (and when) what we know ought to change what we see and helps us understand how a putatively high-level trait such as language can impact putatively low-level processes such as perception. Within this framework, language begins to take on a surprisingly central role in cognition by providing a uniquely focused and flexible means of constructing predictions against which sensory signals can be evaluated. Predictive processing thus provides a plausible mechanism for many of the reported effects of language on perception, thought, and action, and new insights on how and when speakers of different languages construct the same “reality” in alternate ways.
Collapse
Affiliation(s)
- Gary Lupyan
- Department of Psychology, University of Wisconsin–Madison
| | - Andy Clark
- School of Philosophy, Psychology, and Language Sciences, Edinburgh University
| |
Collapse
|
40
|
Lee TS. The visual system's internal model of the world. PROCEEDINGS OF THE IEEE. INSTITUTE OF ELECTRICAL AND ELECTRONICS ENGINEERS 2015; 103:1359-1378. [PMID: 26566294 PMCID: PMC4638327 DOI: 10.1109/jproc.2015.2434601] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/21/2023]
Abstract
The Bayesian paradigm has provided a useful conceptual theory for understanding perceptual computation in the brain. While the detailed neural mechanisms of Bayesian inference are not fully understood, recent computational and neurophysiological works have illuminated the underlying computational principles and representational architecture. The fundamental insights are that the visual system is organized as a modular hierarchy to encode an internal model of the world, and that perception is realized by statistical inference based on such internal model. In this paper, I will discuss and analyze the varieties of representational schemes of these internal models and how they might be used to perform learning and inference. I will argue for a unified theoretical framework for relating the internal models to the observed neural phenomena and mechanisms in the visual cortex.
Collapse
Affiliation(s)
- Tai Sing Lee
- Professor in the Computer Science Department and the Center for the Neural Basis of Cognition, Carnegie Mellon University, Rm 115, Mellon Institute, 4400 Fifth Avenue, Pittsburgh, PA 15213, U.S.A
| |
Collapse
|
41
|
Testolin A, Stoianov I, Sperduti A, Zorzi M. Learning Orthographic Structure With Sequential Generative Neural Networks. Cogn Sci 2015; 40:579-606. [DOI: 10.1111/cogs.12258] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2013] [Revised: 11/21/2014] [Accepted: 02/02/2015] [Indexed: 11/28/2022]
Affiliation(s)
- Alberto Testolin
- Department of Developmental Psychology and Socialisation; University of Padova
- Department of General Psychology; University of Padova
| | - Ivilin Stoianov
- Department of General Psychology; University of Padova
- Cognitive Psychology Laboratory; CNRS & Aix-Marseille University
| | | | - Marco Zorzi
- Department of General Psychology; University of Padova
- Center for Cognitive Neuroscience; University of Padova
- IRCCS San Camillo Neurorehabilitation Hospital
| |
Collapse
|
42
|
Chen Q, Mirman D. Interaction between phonological and semantic representations: time matters. Cogn Sci 2015; 39:538-58. [PMID: 25155249 PMCID: PMC4607034 DOI: 10.1111/cogs.12156] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2012] [Revised: 08/26/2013] [Accepted: 01/07/2014] [Indexed: 11/30/2022]
Abstract
Computational modeling and eye-tracking were used to investigate how phonological and semantic information interact to influence the time course of spoken word recognition. We extended our recent models (Chen & Mirman, 2012; Mirman, Britt, & Chen, 2013) to account for new evidence that competition among phonological neighbors influences activation of semantically related concepts during spoken word recognition (Apfelbaum, Blumstein, & McMurray, 2011). The model made a novel prediction: Semantic input modulates the effect of phonological neighbors on target word processing, producing an approximately inverted-U-shaped pattern with a high phonological density advantage at an intermediate level of semantic input-in contrast to the typical disadvantage for high phonological density words in spoken word recognition. This prediction was confirmed with a new analysis of the Apfelbaum et al. data and in a visual world paradigm experiment with preview duration serving as a manipulation of strength of semantic input. These results are consistent with our previous claim that strongly active neighbors produce net inhibitory effects and weakly active neighbors produce net facilitative effects.
Collapse
Affiliation(s)
- Qi Chen
- Center for Studies of Psychological Application and School of Psychology, South China Normal University
- Moss Rehabilitation Research Institute
| | - Daniel Mirman
- Moss Rehabilitation Research Institute
- Department of Psychology, Drexel University
| |
Collapse
|
43
|
Magnuson JS. Phoneme restoration and empirical coverage of interactive activation and adaptive resonance models of human speech processing. THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA 2015; 137:1481-92. [PMID: 25786959 PMCID: PMC4368586 DOI: 10.1121/1.4904543] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/26/2014] [Revised: 11/17/2014] [Accepted: 11/25/2014] [Indexed: 06/04/2023]
Abstract
Grossberg and Kazerounian [(2011). J. Acoust. Soc. Am. 130, 440-460] present a model of sequence representation for spoken word recognition, the cARTWORD model, which simulates essential aspects of phoneme restoration. Grossberg and Kazerounian also include simulations with the TRACE model presented by McClelland and Elman [(1986). Cognit. Psychol. 18, 1-86] that seem to indicate that TRACE cannot simulate phoneme restoration. Grossberg and Kazerounian also claim cARTWORD should be preferred to TRACE because of TRACE's implausible approach to sequence representation (reduplication of time-specific units) and use of non-modulatory feedback (i.e., without position-specific bottom-up support). This paper responds to Grossberg and Kazerounian first with TRACE simulations that account for phoneme restoration when appropriately constructed noise is used (and with minor changes to TRACE phoneme definitions), then reviews the case for reduplicated units and feedback as implemented in TRACE, as well as TRACE's broad and deep coverage of empirical data. Finally, it is argued that cARTWORD is not comparable to TRACE because cARTWORD cannot represent sequences with repeated elements, has only been implemented with small phoneme and lexical inventories, and has been applied to only one phenomenon (phoneme restoration). Without evidence that cARTWORD captures a similar range and detail of human spoken language processing as alternative models, it is premature to prefer cARTWORD to TRACE.
Collapse
Affiliation(s)
- James S Magnuson
- Department of Psychology, University of Connecticut, Storrs, Connecticut 06269
| |
Collapse
|
44
|
Rogers TT, McClelland JL. Parallel Distributed Processing at 25: Further Explorations in the Microstructure of Cognition. Cogn Sci 2014; 38:1024-77. [DOI: 10.1111/cogs.12148] [Citation(s) in RCA: 63] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2013] [Revised: 04/02/2014] [Accepted: 04/09/2014] [Indexed: 11/26/2022]
|
45
|
Zorzi M, Testolin A, Stoianov IP. Modeling language and cognition with deep unsupervised learning: a tutorial overview. Front Psychol 2013; 4:515. [PMID: 23970869 PMCID: PMC3747356 DOI: 10.3389/fpsyg.2013.00515] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2013] [Accepted: 07/20/2013] [Indexed: 11/22/2022] Open
Abstract
Deep unsupervised learning in stochastic recurrent neural networks with many layers of hidden units is a recent breakthrough in neural computation research. These networks build a hierarchy of progressively more complex distributed representations of the sensory data by fitting a hierarchical generative model. In this article we discuss the theoretical foundations of this approach and we review key issues related to training, testing and analysis of deep networks for modeling language and cognitive processing. The classic letter and word perception problem of McClelland and Rumelhart (1981) is used as a tutorial example to illustrate how structured and abstract representations may emerge from deep generative learning. We argue that the focus on deep architectures and generative (rather than discriminative) learning represents a crucial step forward for the connectionist modeling enterprise, because it offers a more plausible model of cortical learning as well as a way to bridge the gap between emergentist connectionist models and structured Bayesian models of cognition.
Collapse
Affiliation(s)
- Marco Zorzi
- Computational Cognitive Neuroscience Lab, Department of General Psychology, University of PadovaPadova, Italy
- IRCCS San Camillo Neurorehabilitation HospitalVenice-Lido, Italy
| | - Alberto Testolin
- Computational Cognitive Neuroscience Lab, Department of General Psychology, University of PadovaPadova, Italy
| | - Ivilin P. Stoianov
- Computational Cognitive Neuroscience Lab, Department of General Psychology, University of PadovaPadova, Italy
- Institute of Cognitive Sciences and Technologies, National Research CouncilRome, Italy
| |
Collapse
|
46
|
McClelland JL. Integrating probabilistic models of perception and interactive neural networks: a historical and tutorial review. Front Psychol 2013; 4:503. [PMID: 23970868 PMCID: PMC3747375 DOI: 10.3389/fpsyg.2013.00503] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2013] [Accepted: 07/17/2013] [Indexed: 11/18/2022] Open
Abstract
This article seeks to establish a rapprochement between explicitly Bayesian models of contextual effects in perception and neural network models of such effects, particularly the connectionist interactive activation (IA) model of perception. The article is in part an historical review and in part a tutorial, reviewing the probabilistic Bayesian approach to understanding perception and how it may be shaped by context, and also reviewing ideas about how such probabilistic computations may be carried out in neural networks, focusing on the role of context in interactive neural networks, in which both bottom-up and top-down signals affect the interpretation of sensory inputs. It is pointed out that connectionist units that use the logistic or softmax activation functions can exactly compute Bayesian posterior probabilities when the bias terms and connection weights affecting such units are set to the logarithms of appropriate probabilistic quantities. Bayesian concepts such the prior, likelihood, (joint and marginal) posterior, probability matching and maximizing, and calculating vs. sampling from the posterior are all reviewed and linked to neural network computations. Probabilistic and neural network models are explicitly linked to the concept of a probabilistic generative model that describes the relationship between the underlying target of perception (e.g., the word intended by a speaker or other source of sensory stimuli) and the sensory input that reaches the perceiver for use in inferring the underlying target. It is shown how a new version of the IA model called the multinomial interactive activation (MIA) model can sample correctly from the joint posterior of a proposed generative model for perception of letters in words, indicating that interactive processing is fully consistent with principled probabilistic computation. Ways in which these computations might be realized in real neural systems are also considered.
Collapse
Affiliation(s)
- James L McClelland
- Department of Psychology and Center for Mind, Brain, and Computation, Stanford University Stanford, CA, USA
| |
Collapse
|
47
|
Kumaran D, McClelland JL. Generalization through the recurrent interaction of episodic memories: a model of the hippocampal system. Psychol Rev 2012; 119:573-616. [PMID: 22775499 PMCID: PMC3444305 DOI: 10.1037/a0028681] [Citation(s) in RCA: 192] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2010] [Revised: 04/10/2012] [Accepted: 04/19/2012] [Indexed: 11/25/2022]
Abstract
In this article, we present a perspective on the role of the hippocampal system in generalization, instantiated in a computational model called REMERGE (recurrency and episodic memory results in generalization). We expose a fundamental, but neglected, tension between prevailing computational theories that emphasize the function of the hippocampus in pattern separation (Marr, 1971; McClelland, McNaughton, & O'Reilly, 1995), and empirical support for its role in generalization and flexible relational memory (Cohen & Eichenbaum, 1993; Eichenbaum, 1999). Our account provides a means by which to resolve this conflict, by demonstrating that the basic representational scheme envisioned by complementary learning systems theory (McClelland et al., 1995), which relies upon orthogonalized codes in the hippocampus, is compatible with efficient generalization-as long as there is recurrence rather than unidirectional flow within the hippocampal circuit or, more widely, between the hippocampus and neocortex. We propose that recurrent similarity computation, a process that facilitates the discovery of higher-order relationships between a set of related experiences, expands the scope of classical exemplar-based models of memory (e.g., Nosofsky, 1984) and allows the hippocampus to support generalization through interactions that unfold within a dynamically created memory space.
Collapse
Affiliation(s)
- Dharshan Kumaran
- Department of Psychology, Stanford University
- Institute of Cognitive Neuroscience, University College London, London, England
| | | |
Collapse
|