1
|
Dufau S, Yeaton J, Badier JM, Chen S, Holcomb PJ, Grainger J. Sentence superiority in the reading brain. Neuropsychologia 2024; 198:108885. [PMID: 38604495 DOI: 10.1016/j.neuropsychologia.2024.108885] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2023] [Revised: 02/06/2024] [Accepted: 04/07/2024] [Indexed: 04/13/2024]
Abstract
When a sequence of written words is briefly presented and participants are asked to identify just one word at a post-cued location, then word identification accuracy is higher when the word is presented in a grammatically correct sequence compared with an ungrammatical sequence. This sentence superiority effect has been reported in several behavioral studies and two EEG investigations. Taken together, the results of these studies support the hypothesis that the sentence superiority effect is primarily driven by rapid access to a sentence-level representation via partial word identification processes that operate in parallel over several words. Here we used MEG to examine the neural structures involved in this early stage of written sentence processing, and to further specify the timing of the different processes involved. Source activities over time showed grammatical vs. ungrammatical differences first in the left inferior frontal gyrus (IFG: 321-406 ms), then the left anterior temporal lobe (ATL: 466-531 ms), and finally in both left IFG (549-602 ms) and left posterior superior temporal gyrus (pSTG: 553-622 ms). We interpret the early IFG activity as reflecting the rapid bottom-up activation of sentence-level representations, including syntax, enabled by partly parallel word processing. Subsequent activity in ATL and pSTG is thought to reflect the constraints imposed by such sentence-level representations on on-going word-based semantic activation (ATL), and the subsequent development of a more detailed sentence-level representation (pSTG). These results provide further support for a cascaded interactive-activation account of sentence reading.
Collapse
Affiliation(s)
- Stéphane Dufau
- Laboratoire de Psychologie Cognitive, Centre National de la Recherche Scientifique, Aix-Marseille University, Marseille, France; Institute for Language, Communication, and the Brain, Aix-Marseille University, Aix-en-Provence, France
| | - Jeremy Yeaton
- Laboratoire de Psychologie Cognitive, Centre National de la Recherche Scientifique, Aix-Marseille University, Marseille, France; Department of Language Science, University of California, Irvine, CA, USA
| | - Jean-Michel Badier
- Institute for Language, Communication, and the Brain, Aix-Marseille University, Aix-en-Provence, France; Institut de Neurosciences des Systèmes (INS), INSERM, Aix-Marseille University, Marseille, France
| | - Sophie Chen
- Institute for Language, Communication, and the Brain, Aix-Marseille University, Aix-en-Provence, France; Institut de Neurosciences des Systèmes (INS), INSERM, Aix-Marseille University, Marseille, France
| | - Phillip J Holcomb
- Department of Psychology, San Diego State University, San Diego, CA, USA
| | - Jonathan Grainger
- Laboratoire de Psychologie Cognitive, Centre National de la Recherche Scientifique, Aix-Marseille University, Marseille, France; Institute for Language, Communication, and the Brain, Aix-Marseille University, Aix-en-Provence, France.
| |
Collapse
|
2
|
Hosseini EA, Schrimpf M, Zhang Y, Bowman S, Zaslavsky N, Fedorenko E. Artificial Neural Network Language Models Predict Human Brain Responses to Language Even After a Developmentally Realistic Amount of Training. NEUROBIOLOGY OF LANGUAGE (CAMBRIDGE, MASS.) 2024; 5:43-63. [PMID: 38645622 PMCID: PMC11025646 DOI: 10.1162/nol_a_00137] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/29/2023] [Accepted: 01/09/2024] [Indexed: 04/23/2024]
Abstract
Artificial neural networks have emerged as computationally plausible models of human language processing. A major criticism of these models is that the amount of training data they receive far exceeds that of humans during language learning. Here, we use two complementary approaches to ask how the models' ability to capture human fMRI responses to sentences is affected by the amount of training data. First, we evaluate GPT-2 models trained on 1 million, 10 million, 100 million, or 1 billion words against an fMRI benchmark. We consider the 100-million-word model to be developmentally plausible in terms of the amount of training data given that this amount is similar to what children are estimated to be exposed to during the first 10 years of life. Second, we test the performance of a GPT-2 model trained on a 9-billion-token dataset to reach state-of-the-art next-word prediction performance on the human benchmark at different stages during training. Across both approaches, we find that (i) the models trained on a developmentally plausible amount of data already achieve near-maximal performance in capturing fMRI responses to sentences. Further, (ii) lower perplexity-a measure of next-word prediction performance-is associated with stronger alignment with human data, suggesting that models that have received enough training to achieve sufficiently high next-word prediction performance also acquire representations of sentences that are predictive of human fMRI responses. In tandem, these findings establish that although some training is necessary for the models' predictive ability, a developmentally realistic amount of training (∼100 million words) may suffice.
Collapse
Affiliation(s)
- Eghbal A. Hosseini
- Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA, USA
- McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Martin Schrimpf
- The MIT Quest for Intelligence Initiative, Cambridge, MA, USA
- Swiss Federal Institute of Technology, Lausanne, Switzerland
| | - Yian Zhang
- Computer Science Department, Stanford University, Stanford, CA, USA
| | - Samuel Bowman
- Center for Data Science, New York University, New York, NY, USA
- Department of Linguistics, New York University, New York, NY, USA
- Department of Computer Science, New York University, New York, NY, USA
| | - Noga Zaslavsky
- Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA, USA
- McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, MA, USA
- K. Lisa Yang Integrative Computational Neuroscience (ICoN) Center, Massachusetts Institute of Technology, Cambridge, MA, USA
- Department of Language Science, University of California, Irvine, CA, USA
| | - Evelina Fedorenko
- Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA, USA
- McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, MA, USA
- The MIT Quest for Intelligence Initiative, Cambridge, MA, USA
- Speech and Hearing Bioscience and Technology Program, Harvard University, Boston, MA, USA
| |
Collapse
|
3
|
Kauf C, Tuckute G, Levy R, Andreas J, Fedorenko E. Lexical-Semantic Content, Not Syntactic Structure, Is the Main Contributor to ANN-Brain Similarity of fMRI Responses in the Language Network. NEUROBIOLOGY OF LANGUAGE (CAMBRIDGE, MASS.) 2024; 5:7-42. [PMID: 38645614 PMCID: PMC11025651 DOI: 10.1162/nol_a_00116] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/15/2022] [Accepted: 07/11/2023] [Indexed: 04/23/2024]
Abstract
Representations from artificial neural network (ANN) language models have been shown to predict human brain activity in the language network. To understand what aspects of linguistic stimuli contribute to ANN-to-brain similarity, we used an fMRI data set of responses to n = 627 naturalistic English sentences (Pereira et al., 2018) and systematically manipulated the stimuli for which ANN representations were extracted. In particular, we (i) perturbed sentences' word order, (ii) removed different subsets of words, or (iii) replaced sentences with other sentences of varying semantic similarity. We found that the lexical-semantic content of the sentence (largely carried by content words) rather than the sentence's syntactic form (conveyed via word order or function words) is primarily responsible for the ANN-to-brain similarity. In follow-up analyses, we found that perturbation manipulations that adversely affect brain predictivity also lead to more divergent representations in the ANN's embedding space and decrease the ANN's ability to predict upcoming tokens in those stimuli. Further, results are robust as to whether the mapping model is trained on intact or perturbed stimuli and whether the ANN sentence representations are conditioned on the same linguistic context that humans saw. The critical result-that lexical-semantic content is the main contributor to the similarity between ANN representations and neural ones-aligns with the idea that the goal of the human language system is to extract meaning from linguistic strings. Finally, this work highlights the strength of systematic experimental manipulations for evaluating how close we are to accurate and generalizable models of the human language network.
Collapse
Affiliation(s)
- Carina Kauf
- Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA, USA
- McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Greta Tuckute
- Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA, USA
- McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Roger Levy
- Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Jacob Andreas
- Computer Science & Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Evelina Fedorenko
- Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA, USA
- McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, MA, USA
- Program in Speech and Hearing Bioscience and Technology, Harvard University, Cambridge, MA, USA
| |
Collapse
|
4
|
Sugimoto Y, Yoshida R, Jeong H, Koizumi M, Brennan JR, Oseki Y. Localizing Syntactic Composition with Left-Corner Recurrent Neural Network Grammars. NEUROBIOLOGY OF LANGUAGE (CAMBRIDGE, MASS.) 2024; 5:201-224. [PMID: 38645619 PMCID: PMC11025653 DOI: 10.1162/nol_a_00118] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/06/2023] [Accepted: 07/24/2023] [Indexed: 04/23/2024]
Abstract
In computational neurolinguistics, it has been demonstrated that hierarchical models such as recurrent neural network grammars (RNNGs), which jointly generate word sequences and their syntactic structures via the syntactic composition, better explained human brain activity than sequential models such as long short-term memory networks (LSTMs). However, the vanilla RNNG has employed the top-down parsing strategy, which has been pointed out in the psycholinguistics literature as suboptimal especially for head-final/left-branching languages, and alternatively the left-corner parsing strategy has been proposed as the psychologically plausible parsing strategy. In this article, building on this line of inquiry, we investigate not only whether hierarchical models like RNNGs better explain human brain activity than sequential models like LSTMs, but also which parsing strategy is more neurobiologically plausible, by developing a novel fMRI corpus where participants read newspaper articles in a head-final/left-branching language, namely Japanese, through the naturalistic fMRI experiment. The results revealed that left-corner RNNGs outperformed both LSTMs and top-down RNNGs in the left inferior frontal and temporal-parietal regions, suggesting that there are certain brain regions that localize the syntactic composition with the left-corner parsing strategy.
Collapse
Affiliation(s)
- Yushi Sugimoto
- Graduate School of Arts and Sciences, University of Tokyo, Tokyo, Japan
| | - Ryo Yoshida
- Graduate School of Arts and Sciences, University of Tokyo, Tokyo, Japan
| | - Hyeonjeong Jeong
- Graduate School of International Cultural Studies, Tohoku University, Sendai, Japan
| | - Masatoshi Koizumi
- Department of Linguistics, Graduate School of Arts and Letters, Tohoku University, Sendai, Japan
| | | | - Yohei Oseki
- Graduate School of Arts and Sciences, University of Tokyo, Tokyo, Japan
| |
Collapse
|
5
|
Huber E, Sauppe S, Isasi-Isasmendi A, Bornkessel-Schlesewsky I, Merlo P, Bickel B. Surprisal From Language Models Can Predict ERPs in Processing Predicate-Argument Structures Only if Enriched by an Agent Preference Principle. NEUROBIOLOGY OF LANGUAGE (CAMBRIDGE, MASS.) 2024; 5:167-200. [PMID: 38645615 PMCID: PMC11025647 DOI: 10.1162/nol_a_00121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/03/2022] [Accepted: 08/30/2023] [Indexed: 04/23/2024]
Abstract
Language models based on artificial neural networks increasingly capture key aspects of how humans process sentences. Most notably, model-based surprisals predict event-related potentials such as N400 amplitudes during parsing. Assuming that these models represent realistic estimates of human linguistic experience, their success in modeling language processing raises the possibility that the human processing system relies on no other principles than the general architecture of language models and on sufficient linguistic input. Here, we test this hypothesis on N400 effects observed during the processing of verb-final sentences in German, Basque, and Hindi. By stacking Bayesian generalised additive models, we show that, in each language, N400 amplitudes and topographies in the region of the verb are best predicted when model-based surprisals are complemented by an Agent Preference principle that transiently interprets initial role-ambiguous noun phrases as agents, leading to reanalysis when this interpretation fails. Our findings demonstrate the need for this principle independently of usage frequencies and structural differences between languages. The principle has an unequal force, however. Compared to surprisal, its effect is weakest in German, stronger in Hindi, and still stronger in Basque. This gradient is correlated with the extent to which grammars allow unmarked NPs to be patients, a structural feature that boosts reanalysis effects. We conclude that language models gain more neurobiological plausibility by incorporating an Agent Preference. Conversely, theories of human processing profit from incorporating surprisal estimates in addition to principles like the Agent Preference, which arguably have distinct evolutionary roots.
Collapse
Affiliation(s)
- Eva Huber
- Department of Comparative Language Science, University of Zurich, Zurich, Switzerland
- Center for the Interdisciplinary Study of Language Evolution, University of Zurich, Zurich, Switzerland
| | - Sebastian Sauppe
- Department of Comparative Language Science, University of Zurich, Zurich, Switzerland
- Center for the Interdisciplinary Study of Language Evolution, University of Zurich, Zurich, Switzerland
- Department of Psychology, University of Zurich, Zurich, Switzerland
| | - Arrate Isasi-Isasmendi
- Department of Comparative Language Science, University of Zurich, Zurich, Switzerland
- Center for the Interdisciplinary Study of Language Evolution, University of Zurich, Zurich, Switzerland
| | - Ina Bornkessel-Schlesewsky
- Cognitive Neuroscience Laboratory, Australian Research Centre for Interactive and Virtual Environments, University of South Australia, Adelaide, Australia
| | - Paola Merlo
- Department of Linguistics, University of Geneva, Geneva, Switzerland
- University Center for Computer Science, University of Geneva, Geneva, Switzerland
| | - Balthasar Bickel
- Department of Comparative Language Science, University of Zurich, Zurich, Switzerland
- Center for the Interdisciplinary Study of Language Evolution, University of Zurich, Zurich, Switzerland
| |
Collapse
|
6
|
Giglio L, Ostarek M, Sharoh D, Hagoort P. Diverging neural dynamics for syntactic structure building in naturalistic speaking and listening. Proc Natl Acad Sci U S A 2024; 121:e2310766121. [PMID: 38442171 PMCID: PMC10945772 DOI: 10.1073/pnas.2310766121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2023] [Accepted: 01/31/2024] [Indexed: 03/07/2024] Open
Abstract
The neural correlates of sentence production are typically studied using task paradigms that differ considerably from the experience of speaking outside of an experimental setting. In this fMRI study, we aimed to gain a better understanding of syntactic processing in spontaneous production versus naturalistic comprehension in three regions of interest (BA44, BA45, and left posterior middle temporal gyrus). A group of participants (n = 16) was asked to speak about the events of an episode of a TV series in the scanner. Another group of participants (n = 36) listened to the spoken recall of a participant from the first group. To model syntactic processing, we extracted word-by-word metrics of phrase-structure building with a top-down and a bottom-up parser that make different hypotheses about the timing of structure building. While the top-down parser anticipates syntactic structure, sometimes before it is obvious to the listener, the bottom-up parser builds syntactic structure in an integratory way after all of the evidence has been presented. In comprehension, neural activity was found to be better modeled by the bottom-up parser, while in production, it was better modeled by the top-down parser. We additionally modeled structure building in production with two strategies that were developed here to make different predictions about the incrementality of structure building during speaking. We found evidence for highly incremental and anticipatory structure building in production, which was confirmed by a converging analysis of the pausing patterns in speech. Overall, this study shows the feasibility of studying the neural dynamics of spontaneous language production.
Collapse
Affiliation(s)
- Laura Giglio
- Max Planck Institute for Psycholinguistics, Nijmegen6525XD, The Netherlands
- Radboud University, Donders Institute for Brain, Cognition and Behaviour, Nijmegen6525EN, The Netherlands
| | - Markus Ostarek
- Max Planck Institute for Psycholinguistics, Nijmegen6525XD, The Netherlands
| | - Daniel Sharoh
- Max Planck Institute for Psycholinguistics, Nijmegen6525XD, The Netherlands
- Radboud University, Donders Institute for Brain, Cognition and Behaviour, Nijmegen6525EN, The Netherlands
| | - Peter Hagoort
- Max Planck Institute for Psycholinguistics, Nijmegen6525XD, The Netherlands
- Radboud University, Donders Institute for Brain, Cognition and Behaviour, Nijmegen6525EN, The Netherlands
| |
Collapse
|
7
|
Shain C, Meister C, Pimentel T, Cotterell R, Levy R. Large-scale evidence for logarithmic effects of word predictability on reading time. Proc Natl Acad Sci U S A 2024; 121:e2307876121. [PMID: 38422017 DOI: 10.1073/pnas.2307876121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2023] [Accepted: 11/11/2023] [Indexed: 03/02/2024] Open
Abstract
During real-time language comprehension, our minds rapidly decode complex meanings from sequences of words. The difficulty of doing so is known to be related to words' contextual predictability, but what cognitive processes do these predictability effects reflect? In one view, predictability effects reflect facilitation due to anticipatory processing of words that are predictable from context. This view predicts a linear effect of predictability on processing demand. In another view, predictability effects reflect the costs of probabilistic inference over sentence interpretations. This view predicts either a logarithmic or a superlogarithmic effect of predictability on processing demand, depending on whether it assumes pressures toward a uniform distribution of information over time. The empirical record is currently mixed. Here, we revisit this question at scale: We analyze six reading datasets, estimate next-word probabilities with diverse statistical language models, and model reading times using recent advances in nonlinear regression. Results support a logarithmic effect of word predictability on processing difficulty, which favors probabilistic inference as a key component of human language processing.
Collapse
Affiliation(s)
- Cory Shain
- Department of Brain & Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA 02139
| | - Clara Meister
- Department of Computer Science, Institute for Machine Learning, ETH Zürich, Zürich 8092, Schweiz
| | - Tiago Pimentel
- Department of Computer Science and Technology, University of Cambridge, Cambridge CB3 0FD, United Kingdom
| | - Ryan Cotterell
- Department of Computer Science, Institute for Machine Learning, ETH Zürich, Zürich 8092, Schweiz
| | - Roger Levy
- Department of Brain & Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA 02139
| |
Collapse
|
8
|
Tuckute G, Sathe A, Srikant S, Taliaferro M, Wang M, Schrimpf M, Kay K, Fedorenko E. Driving and suppressing the human language network using large language models. Nat Hum Behav 2024; 8:544-561. [PMID: 38172630 DOI: 10.1038/s41562-023-01783-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2023] [Accepted: 11/10/2023] [Indexed: 01/05/2024]
Abstract
Transformer models such as GPT generate human-like language and are predictive of human brain responses to language. Here, using functional-MRI-measured brain responses to 1,000 diverse sentences, we first show that a GPT-based encoding model can predict the magnitude of the brain response associated with each sentence. We then use the model to identify new sentences that are predicted to drive or suppress responses in the human language network. We show that these model-selected novel sentences indeed strongly drive and suppress the activity of human language areas in new individuals. A systematic analysis of the model-selected sentences reveals that surprisal and well-formedness of linguistic input are key determinants of response strength in the language network. These results establish the ability of neural network models to not only mimic human language but also non-invasively control neural activity in higher-level cortical areas, such as the language network.
Collapse
Affiliation(s)
- Greta Tuckute
- Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA, USA.
- McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, MA, USA.
| | - Aalok Sathe
- Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA, USA
- McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Shashank Srikant
- Computer Science & Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA, USA
- MIT-IBM Watson AI Lab, Cambridge, MA, USA
| | - Maya Taliaferro
- Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA, USA
- McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Mingye Wang
- Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA, USA
- McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Martin Schrimpf
- McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, MA, USA
- Quest for Intelligence, Massachusetts Institute of Technology, Cambridge, MA, USA
- Neuro-X Institute, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
| | - Kendrick Kay
- Center for Magnetic Resonance Research, University of Minnesota, Minneapolis, MN, USA
| | - Evelina Fedorenko
- Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA, USA.
- McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, MA, USA.
- Program in Speech and Hearing Bioscience and Technology, Harvard University, Cambridge, MA, USA.
| |
Collapse
|
9
|
Tuckute G, Sathe A, Srikant S, Taliaferro M, Wang M, Schrimpf M, Kay K, Fedorenko E. Driving and suppressing the human language network using large language models. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.04.16.537080. [PMID: 37090673 PMCID: PMC10120732 DOI: 10.1101/2023.04.16.537080] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/25/2023]
Abstract
Transformer models such as GPT generate human-like language and are highly predictive of human brain responses to language. Here, using fMRI-measured brain responses to 1,000 diverse sentences, we first show that a GPT-based encoding model can predict the magnitude of brain response associated with each sentence. Then, we use the model to identify new sentences that are predicted to drive or suppress responses in the human language network. We show that these model-selected novel sentences indeed strongly drive and suppress activity of human language areas in new individuals. A systematic analysis of the model-selected sentences reveals that surprisal and well-formedness of linguistic input are key determinants of response strength in the language network. These results establish the ability of neural network models to not only mimic human language but also noninvasively control neural activity in higher-level cortical areas, like the language network.
Collapse
Affiliation(s)
- Greta Tuckute
- Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA 02139 USA
- McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, MA 02139 USA
| | - Aalok Sathe
- Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA 02139 USA
- McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, MA 02139 USA
| | - Shashank Srikant
- Computer Science & Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA 02139 USA
- MIT-IBM Watson AI Lab, Cambridge, MA 02142, USA
| | - Maya Taliaferro
- Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA 02139 USA
- McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, MA 02139 USA
| | - Mingye Wang
- Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA 02139 USA
- McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, MA 02139 USA
| | - Martin Schrimpf
- McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, MA 02139 USA
- Quest for Intelligence, Massachusetts Institute of Technology, Cambridge, MA 02139 USA
- Neuro-X Institute, École Polytechnique Fédérale de Lausanne, CH-1015 Lausanne, Switzerland
| | - Kendrick Kay
- Center for Magnetic Resonance Research, University of Minnesota, Minneapolis, MN 55455 USA
| | - Evelina Fedorenko
- Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA 02139 USA
- McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, MA 02139 USA
- The Program in Speech and Hearing Bioscience and Technology, Harvard University, Cambridge, MA 02138 USA
| |
Collapse
|
10
|
Zhou W, Wang S, Yan M. Fixation-related fMRI analysis reveals the neural basis of natural reading of unspaced and spaced Chinese sentences. Cereb Cortex 2023; 33:10401-10410. [PMID: 37566912 DOI: 10.1093/cercor/bhad290] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2023] [Revised: 07/15/2023] [Accepted: 07/16/2023] [Indexed: 08/13/2023] Open
Abstract
Although there are many eye-movement studies focusing on natural sentence reading and functional magnetic resonance imaging research on reading with serial visual presentation paradigms, there is a scarcity of investigations into the neural mechanism of natural sentence reading. The present study recruited 33 adults to read unspaced and spaced Chinese sentences with the eye tracking and functional magnetic resonance imaging data recorded simultaneously. By using fixation-related functional magnetic resonance imaging analysis, this study showed that natural reading of Chinese sentences produced activations in ventral visual, dorsal attention, and semantic brain regions, which were modulated by the properties of words such as word length and word frequency. The multivoxel pattern analysis showed that the activity pattern in the left middle temporal gyrus could significantly predict the visual layout categories (i.e. unspaced vs. spaced conditions). Dynamic causal modeling analysis showed that there were bidirectional brain connections between the left middle temporal gyrus and the left inferior occipital cortex in the unspaced Chinese sentence reading but not in the spaced reading. These results provide a neural mechanism for the natural reading of Chinese sentences from the perspective of word segmentation.
Collapse
Affiliation(s)
- Wei Zhou
- Beijing Key Lab of Learning and Cognition, School of Psychology, Capital Normal University, Beijing 100048, China
- Chinese Institute for Brain Research, Beijing 102206, China
| | - Sile Wang
- Beijing Key Lab of Learning and Cognition, School of Psychology, Capital Normal University, Beijing 100048, China
- School of Psychology, South China Normal University, Guangzhou 510631, China
| | - Ming Yan
- Department of Psychology and Center for Cognitive and Brain Sciences, University of Macau, Taipa, Macau
| |
Collapse
|
11
|
Stanojević M, Brennan JR, Dunagan D, Steedman M, Hale JT. Modeling Structure-Building in the Brain With CCG Parsing and Large Language Models. Cogn Sci 2023; 47:e13312. [PMID: 37417470 DOI: 10.1111/cogs.13312] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2022] [Revised: 06/07/2023] [Accepted: 06/17/2023] [Indexed: 07/08/2023]
Abstract
To model behavioral and neural correlates of language comprehension in naturalistic environments, researchers have turned to broad-coverage tools from natural-language processing and machine learning. Where syntactic structure is explicitly modeled, prior work has relied predominantly on context-free grammars (CFGs), yet such formalisms are not sufficiently expressive for human languages. Combinatory categorial grammars (CCGs) are sufficiently expressive directly compositional models of grammar with flexible constituency that affords incremental interpretation. In this work, we evaluate whether a more expressive CCG provides a better model than a CFG for human neural signals collected with functional magnetic resonance imaging (fMRI) while participants listen to an audiobook story. We further test between variants of CCG that differ in how they handle optional adjuncts. These evaluations are carried out against a baseline that includes estimates of next-word predictability from a transformer neural network language model. Such a comparison reveals unique contributions of CCG structure-building predominantly in the left posterior temporal lobe: CCG-derived measures offer a superior fit to neural signals compared to those derived from a CFG. These effects are spatially distinct from bilateral superior temporal effects that are unique to predictability. Neural effects for structure-building are thus separable from predictability during naturalistic listening, and those effects are best characterized by a grammar whose expressive power is motivated on independent linguistic grounds.
Collapse
Affiliation(s)
| | | | | | | | - John T Hale
- Google DeepMind
- Department of Linguistics, University of Georgia
| |
Collapse
|
12
|
Chang M, Zhang K, Sun Y, Li S, Wang J. The graded predictive pre-activation in Chinese sentence reading: evidence from eye movements. Front Psychol 2023; 14:1136488. [PMID: 37457059 PMCID: PMC10342199 DOI: 10.3389/fpsyg.2023.1136488] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2023] [Accepted: 06/12/2023] [Indexed: 07/18/2023] Open
Abstract
Previous research has revealed that graded pre-activation rather than specific lexical prediction is more likely to be the mechanism for the word predictability effect in English. However, whether graded pre-activation underlies the predictability effect in Chinese reading is unknown. Accordingly, the present study tested the generality of the graded pre-activation account in Chinese reading. We manipulated the contextual constraint of sentences and the predictability of target words as independent variables. Readers' eye movement behaviors were recorded via an eye tracker. We examined whether processing an unpredictable word in a solid constraining context incurs a prediction error cost when this unpredictable word has a predictable alternative. The results showed no cues of prediction error cost on the early eye movement measures, supported by the Bayes Factor analyses. The current research indicates that graded predictive pre-activation underlies the predictability effect in Chinese reading.
Collapse
Affiliation(s)
- Min Chang
- School of Education Science, Nantong University, Nantong, China
| | - Kuo Zhang
- Department of Social Psychology, Nankai University, Tianjin, China
| | - Yue Sun
- Faculty of Psychology, Tianjin Normal University, Tianjin, China
- Academy of Psychology and Behavior, Tianjin Normal University, Tianjin, China
| | - Sha Li
- School of Psychology, Fujian Normal University, Fuzhou, China
| | - Jingxin Wang
- Faculty of Psychology, Tianjin Normal University, Tianjin, China
- Academy of Psychology and Behavior, Tianjin Normal University, Tianjin, China
| |
Collapse
|
13
|
Kauf C, Tuckute G, Levy R, Andreas J, Fedorenko E. Lexical semantic content, not syntactic structure, is the main contributor to ANN-brain similarity of fMRI responses in the language network. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.05.05.539646. [PMID: 37205405 PMCID: PMC10187317 DOI: 10.1101/2023.05.05.539646] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/21/2023]
Abstract
Representations from artificial neural network (ANN) language models have been shown to predict human brain activity in the language network. To understand what aspects of linguistic stimuli contribute to ANN-to-brain similarity, we used an fMRI dataset of responses to n=627 naturalistic English sentences (Pereira et al., 2018) and systematically manipulated the stimuli for which ANN representations were extracted. In particular, we i) perturbed sentences' word order, ii) removed different subsets of words, or iii) replaced sentences with other sentences of varying semantic similarity. We found that the lexical semantic content of the sentence (largely carried by content words) rather than the sentence's syntactic form (conveyed via word order or function words) is primarily responsible for the ANN-to-brain similarity. In follow-up analyses, we found that perturbation manipulations that adversely affect brain predictivity also lead to more divergent representations in the ANN's embedding space and decrease the ANN's ability to predict upcoming tokens in those stimuli. Further, results are robust to whether the mapping model is trained on intact or perturbed stimuli, and whether the ANN sentence representations are conditioned on the same linguistic context that humans saw. The critical result-that lexical-semantic content is the main contributor to the similarity between ANN representations and neural ones-aligns with the idea that the goal of the human language system is to extract meaning from linguistic strings. Finally, this work highlights the strength of systematic experimental manipulations for evaluating how close we are to accurate and generalizable models of the human language network.
Collapse
Affiliation(s)
- Carina Kauf
- Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology
- McGovern Institute for Brain Research, Massachusetts Institute of Technology
| | - Greta Tuckute
- Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology
- McGovern Institute for Brain Research, Massachusetts Institute of Technology
| | - Roger Levy
- Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology
| | - Jacob Andreas
- Computer Science & Artificial Intelligence Laboratory, Massachusetts Institute of Technology
| | - Evelina Fedorenko
- Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology
- McGovern Institute for Brain Research, Massachusetts Institute of Technology
- Program in Speech and Hearing Bioscience and Technology, Harvard University
| |
Collapse
|
14
|
Weiss KL, Hawelka S, Hutzler F, Schuster S. Stronger functional connectivity during reading contextually predictable words in slow readers. Sci Rep 2023; 13:5989. [PMID: 37045976 PMCID: PMC10097649 DOI: 10.1038/s41598-023-33231-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2022] [Accepted: 04/10/2023] [Indexed: 04/14/2023] Open
Abstract
The effect of word predictability is well-documented in terms of local brain activation, but less is known about the functional connectivity among those regions associated with processing predictable words. Evidence from eye movement studies showed that the effect is much more pronounced in slow than in fast readers, suggesting that speed-impaired readers rely more on sentence context to compensate for their difficulties with visual word recognition. The present study aimed to investigate differences in functional connectivity of fast and slow readers within core regions associated with processing predictable words. We hypothesize a stronger synchronization between higher-order language areas, such as the left middle temporal (MTG) and inferior frontal gyrus (IFG), and the left occipito-temporal cortex (OTC) in slow readers. Our results show that slow readers exhibit more functional correlations among these connections; especially between the left IFG and OTC. We interpret our results in terms of the lexical quality hypothesis which postulates a stronger involvement of semantics on orthographic processing in (speed-)impaired readers.
Collapse
Affiliation(s)
| | - Stefan Hawelka
- Department of Psychology, Centre for Cognitive Neuroscience, Paris-Lodron-University of Salzburg, Hellbrunnerstr. 34, 5020, Salzburg, Austria
| | - Florian Hutzler
- Department of Psychology, Centre for Cognitive Neuroscience, Paris-Lodron-University of Salzburg, Hellbrunnerstr. 34, 5020, Salzburg, Austria
| | - Sarah Schuster
- Department of Psychology, Centre for Cognitive Neuroscience, Paris-Lodron-University of Salzburg, Hellbrunnerstr. 34, 5020, Salzburg, Austria.
| |
Collapse
|
15
|
Kulik V, Reyes LD, Sherwood CC. Coevolution of language and tools in the human brain: An ALE meta-analysis of neural activation during syntactic processing and tool use. PROGRESS IN BRAIN RESEARCH 2023; 275:93-115. [PMID: 36841572 DOI: 10.1016/bs.pbr.2022.10.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
Language and complex tool use are often cited as behaviors unique to humans and may be evolutionarily linked owing to the underlying cognitive processes they have in common. We executed a quantitative activation likelihood estimation (ALE) meta-analysis (GingerALE 2.3) on published, whole-brain neuroimaging studies to identify areas associated with syntactic processing and/or tool use in humans. Significant clusters related to syntactic processing were identified in areas known to be related to language production and comprehension, including bilateral Broca's area in the inferior frontal gyrus. Tool use activation clusters were all in the left hemisphere and included the primary motor cortex and premotor cortex, in addition to other areas involved with sensorimotor transformation. Activation shared by syntactic processing and tool use was only significant at one cluster, located in the pars opercularis of the left inferior frontal gyrus. This minimal overlap between syntactic processing and tool use activation from our meta-analysis of neuroimaging studies indicates that there is not a widespread common neural network between the two. Broca's area may serve as an important hub that was initially recruited in early human evolution in the context of simple tool use, but was eventually co-opted for linguistic purposes, including the sequential and hierarchical ordering processes that characterize syntax. In the future, meta-analyses of additional components of language may allow for a more comprehensive examination of the functional networks that underlie the coevolution of human language and complex tool use.
Collapse
Affiliation(s)
- Veronika Kulik
- Department of Anthropology and Center for the Advanced Study of Human Paleobiology, The George Washington University, Washington, DC, United States
| | - Laura D Reyes
- Department of Anthropology and Center for the Advanced Study of Human Paleobiology, The George Washington University, Washington, DC, United States
| | - Chet C Sherwood
- Department of Anthropology and Center for the Advanced Study of Human Paleobiology, The George Washington University, Washington, DC, United States.
| |
Collapse
|
16
|
Ma X, Liu Y, Clariana R, Gu C, Li P. From eye movements to scanpath networks: A method for studying individual differences in expository text reading. Behav Res Methods 2023; 55:730-750. [PMID: 35445941 PMCID: PMC10027820 DOI: 10.3758/s13428-022-01842-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/17/2022] [Indexed: 11/08/2022]
Abstract
Eye movements have been examined as an index of attention and comprehension during reading in the literature for over 30 years. Although eye-movement measurements are acknowledged as reliable indicators of readers' comprehension skill, few studies have analyzed eye-movement patterns using network science. In this study, we offer a new approach to analyze eye-movement data. Specifically, we recorded visual scanpaths when participants were reading expository science text, and used these to construct scanpath networks that reflect readers' processing of the text. Results showed that low ability and high ability readers' scanpath networks exhibited distinctive properties, which are reflected in different network metrics including density, centrality, small-worldness, transitivity, and global efficiency. Such patterns provide a new way to show how skilled readers, as compared with less skilled readers, process information more efficiently. Implications of our analyses are discussed in light of current theories of reading comprehension.
Collapse
Affiliation(s)
- Xiaochuan Ma
- Department of Psychology, The Pennsylvania State University, Moore Building, University Park, PA, 16802, USA
| | - Yikang Liu
- Department of Biomedical Engineering, The Pennsylvania State University, Millennium Science Complex, University Park, PA, 16802, USA
| | - Roy Clariana
- Department of Learning and Performance Systems, Keller Building, The Pennsylvania State University, University Park, PA, 16802, USA
| | - Chanyuan Gu
- Department of Chinese and Bilingual Studies, Faculty of Humanities, The Hong Kong Polytechnic University, Hung Hom, Kowloon, Hong Kong
| | - Ping Li
- Department of Chinese and Bilingual Studies, Faculty of Humanities, The Hong Kong Polytechnic University, Hung Hom, Kowloon, Hong Kong.
| |
Collapse
|
17
|
Kasai C, Sumiya M, Koike T, Yoshimoto T, Maki H, Sadato N. Neural underpinning of Japanese particle processing in non-native speakers. Sci Rep 2022; 12:18740. [PMID: 36335170 PMCID: PMC9637203 DOI: 10.1038/s41598-022-23382-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2021] [Accepted: 10/31/2022] [Indexed: 11/08/2022] Open
Abstract
Grammar acquisition by non-native learners (L2) is typically less successful and may produce fundamentally different grammatical systems than that by native speakers (L1). The neural representation of grammatical processing between L1 and L2 speakers remains controversial. We hypothesized that working memory is the primary source of L1/L2 differences, by considering working memory within the predictive coding account, which models grammatical processes as higher-level neuronal representations of cortical hierarchies, generating predictions (forward model) of lower-level representations. A functional MRI study was conducted with L1 Japanese speakers and highly proficient Japanese learners requiring oral production of grammatically correct Japanese particles. We assumed selecting proper particles requires forward model-dependent processes of working memory as their functions are highly context-dependent. As a control, participants read out a visually designated mora indicated by underlining. Particle selection by L1/L2 groups commonly activated the bilateral inferior frontal gyrus/insula, pre-supplementary motor area, left caudate, middle temporal gyrus, and right cerebellum, which constituted the core linguistic production system. In contrast, the left inferior frontal sulcus, known as the neural substrate of verbal working memory, showed more prominent activation in L2 than in L1. Thus, the working memory process causes L1/L2 differences even in highly proficient L2 learners.
Collapse
Affiliation(s)
- Chise Kasai
- grid.256342.40000 0004 0370 4927Faculty of Regional Studies, Gifu University, Yanagido, 501-1193 Japan ,grid.467811.d0000 0001 2272 1771Division of Cerebral Integration, National Institute for Physiological Sciences, Okazaki, 444-8585 Japan
| | - Motofumi Sumiya
- grid.505613.40000 0000 8937 6696Research Center for Child Mental Development, Hamamatsu University School of Medicine, Hamamatsu, 431-3192 Japan
| | - Takahiko Koike
- grid.467811.d0000 0001 2272 1771Division of Cerebral Integration, National Institute for Physiological Sciences, Okazaki, 444-8585 Japan ,grid.275033.00000 0004 1763 208XDepartment of Physiological Sciences, SOKENDAI (The Graduate University for Advanced Studies), Hayama, 240-0193 Japan
| | - Takaaki Yoshimoto
- grid.467811.d0000 0001 2272 1771Division of Cerebral Integration, National Institute for Physiological Sciences, Okazaki, 444-8585 Japan ,grid.262576.20000 0000 8863 9909Research Organization of Science and Technology, Ritsumeikan University, Kusatsu, 525-8577 Japan
| | - Hideki Maki
- grid.256342.40000 0004 0370 4927Faculty of Regional Studies, Gifu University, Yanagido, 501-1193 Japan
| | - Norihiro Sadato
- grid.467811.d0000 0001 2272 1771Division of Cerebral Integration, National Institute for Physiological Sciences, Okazaki, 444-8585 Japan ,grid.275033.00000 0004 1763 208XDepartment of Physiological Sciences, SOKENDAI (The Graduate University for Advanced Studies), Hayama, 240-0193 Japan ,grid.262576.20000 0000 8863 9909Research Organization of Science and Technology, Ritsumeikan University, Kusatsu, 525-8577 Japan
| |
Collapse
|
18
|
Maran M, Numssen O, Hartwigsen G, Zaccarella E. Online neurostimulation of Broca's area does not interfere with syntactic predictions: A combined TMS-EEG approach to basic linguistic combination. Front Psychol 2022; 13:968836. [PMID: 36619118 PMCID: PMC9815778 DOI: 10.3389/fpsyg.2022.968836] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2022] [Accepted: 09/13/2022] [Indexed: 01/11/2023] Open
Abstract
Categorical predictions have been proposed as the key mechanism supporting the fast pace of syntactic composition in language. Accordingly, grammar-based expectations are formed-e.g., the determiner "a" triggers the prediction for a noun-and facilitate the analysis of incoming syntactic information, which is then checked against a single or few other word categories. Previous functional neuroimaging studies point towards Broca's area in the left inferior frontal gyrus (IFG) as one fundamental cortical region involved in categorical prediction during incremental language processing. Causal evidence for this hypothesis is however still missing. In this study, we combined Electroencephalography (EEG) and Transcranial Magnetic Stimulation (TMS) to test whether Broca's area is functionally relevant in predictive mechanisms for language. We transiently perturbed Broca's area during the first word in a two-word construction, while simultaneously measuring the Event-Related Potential (ERP) correlates of syntactic composition. We reasoned that if Broca's area is involved in predictive mechanisms for syntax, disruptive TMS during the first word would mitigate the difference in the ERP responses for predicted and unpredicted categories in basic two-word constructions. Contrary to this hypothesis, perturbation of Broca's area at the predictive stage did not affect the ERP correlates of basic composition. The correlation strength between the electrical field induced by TMS and the ERP responses further confirmed this pattern. We discuss the present results considering an alternative account of the role of Broca's area in syntactic composition, namely the bottom-up integration of words into constituents, and of compensatory mechanisms within the language predictive network.
Collapse
Affiliation(s)
- Matteo Maran
- Department of Neuropsychology, Max Planck Institute for Human Cognitive and Brain Sciences, Leipzig, Germany,International Max Planck Research School on Neuroscience of Communication: Function, Structure, and Plasticity, Leipzig, Germany,*Correspondence: Matteo Maran,
| | - Ole Numssen
- Lise Meitner Research Group Cognition and Plasticity, Max Planck Institute for Human Cognitive and Brain Sciences, Leipzig, Germany
| | - Gesa Hartwigsen
- Lise Meitner Research Group Cognition and Plasticity, Max Planck Institute for Human Cognitive and Brain Sciences, Leipzig, Germany
| | - Emiliano Zaccarella
- Department of Neuropsychology, Max Planck Institute for Human Cognitive and Brain Sciences, Leipzig, Germany
| |
Collapse
|
19
|
van der Burght CL, Numssen O, Schlaak B, Goucha T, Hartwigsen G. Differential contributions of inferior frontal gyrus subregions to sentence processing guided by intonation. Hum Brain Mapp 2022; 44:585-598. [PMID: 36189774 PMCID: PMC9842926 DOI: 10.1002/hbm.26086] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2022] [Revised: 08/30/2022] [Accepted: 08/31/2022] [Indexed: 01/25/2023] Open
Abstract
Auditory sentence comprehension involves processing content (semantics), grammar (syntax), and intonation (prosody). The left inferior frontal gyrus (IFG) is involved in sentence comprehension guided by these different cues, with neuroimaging studies preferentially locating syntactic and semantic processing in separate IFG subregions. However, this regional specialisation has not been confirmed with a neurostimulation method. Consequently, the causal role of such a specialisation remains unclear. This study probed the role of the posterior IFG (pIFG) for syntactic processing and the anterior IFG (aIFG) for semantic processing with repetitive transcranial magnetic stimulation (rTMS) in a task that required the interpretation of the sentence's prosodic realisation. Healthy participants performed a sentence completion task with syntactic and semantic decisions, while receiving 10 Hz rTMS over either left aIFG, pIFG, or vertex (control). Initial behavioural analyses showed an inhibitory effect on accuracy without task-specificity. However, electric field simulations revealed differential effects for both subregions. In the aIFG, stronger stimulation led to slower semantic processing, with no effect of pIFG stimulation. In contrast, we found a facilitatory effect on syntactic processing in both aIFG and pIFG, where higher stimulation strength was related to faster responses. Our results provide first evidence for the functional relevance of left aIFG in semantic processing guided by intonation. The stimulation effect on syntactic responses emphasises the importance of the IFG for syntax processing, without supporting the hypothesis of a pIFG-specific involvement. Together, the results support the notion of functionally specialised IFG subregions for diverse but fundamental cues for language processing.
Collapse
Affiliation(s)
- Constantijn L. van der Burght
- Department of NeuropsychologyMax Planck Institute for Human Cognitive and Brain SciencesLeipzigGermany,Lise Meitner Research Group Cognition and PlasticityMax Planck Institute for Human Cognitive and Brain SciencesLeipzigGermany,Psychology of Language DepartmentMax Planck Institute for PsycholinguisticsNijmegen
| | - Ole Numssen
- Lise Meitner Research Group Cognition and PlasticityMax Planck Institute for Human Cognitive and Brain SciencesLeipzigGermany
| | - Benito Schlaak
- Lise Meitner Research Group Cognition and PlasticityMax Planck Institute for Human Cognitive and Brain SciencesLeipzigGermany
| | - Tomás Goucha
- Department of NeuropsychologyMax Planck Institute for Human Cognitive and Brain SciencesLeipzigGermany
| | - Gesa Hartwigsen
- Lise Meitner Research Group Cognition and PlasticityMax Planck Institute for Human Cognitive and Brain SciencesLeipzigGermany
| |
Collapse
|
20
|
Abstract
Understanding spoken language requires transforming ambiguous acoustic streams into a hierarchy of representations, from phonemes to meaning. It has been suggested that the brain uses prediction to guide the interpretation of incoming input. However, the role of prediction in language processing remains disputed, with disagreement about both the ubiquity and representational nature of predictions. Here, we address both issues by analyzing brain recordings of participants listening to audiobooks, and using a deep neural network (GPT-2) to precisely quantify contextual predictions. First, we establish that brain responses to words are modulated by ubiquitous predictions. Next, we disentangle model-based predictions into distinct dimensions, revealing dissociable neural signatures of predictions about syntactic category (parts of speech), phonemes, and semantics. Finally, we show that high-level (word) predictions inform low-level (phoneme) predictions, supporting hierarchical predictive processing. Together, these results underscore the ubiquity of prediction in language processing, showing that the brain spontaneously predicts upcoming language at multiple levels of abstraction.
Collapse
|
21
|
Heilbron M, Armeni K, Schoffelen JM, Hagoort P, de Lange FP. A hierarchy of linguistic predictions during natural language comprehension. Proc Natl Acad Sci U S A 2022; 119:e2201968119. [PMID: 35921434 DOI: 10.1101/2020.12.03.410399] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/21/2023] Open
Abstract
Understanding spoken language requires transforming ambiguous acoustic streams into a hierarchy of representations, from phonemes to meaning. It has been suggested that the brain uses prediction to guide the interpretation of incoming input. However, the role of prediction in language processing remains disputed, with disagreement about both the ubiquity and representational nature of predictions. Here, we address both issues by analyzing brain recordings of participants listening to audiobooks, and using a deep neural network (GPT-2) to precisely quantify contextual predictions. First, we establish that brain responses to words are modulated by ubiquitous predictions. Next, we disentangle model-based predictions into distinct dimensions, revealing dissociable neural signatures of predictions about syntactic category (parts of speech), phonemes, and semantics. Finally, we show that high-level (word) predictions inform low-level (phoneme) predictions, supporting hierarchical predictive processing. Together, these results underscore the ubiquity of prediction in language processing, showing that the brain spontaneously predicts upcoming language at multiple levels of abstraction.
Collapse
Affiliation(s)
- Micha Heilbron
- Donders Institute, Radboud University, 6525 EN Nijmegen, The Netherlands
- Max Planck Institute for Psycholinguistics, 6525 XD Nijmegen, The Netherlands
| | - Kristijan Armeni
- Donders Institute, Radboud University, 6525 EN Nijmegen, The Netherlands
| | | | - Peter Hagoort
- Donders Institute, Radboud University, 6525 EN Nijmegen, The Netherlands
- Max Planck Institute for Psycholinguistics, 6525 XD Nijmegen, The Netherlands
| | - Floris P de Lange
- Donders Institute, Radboud University, 6525 EN Nijmegen, The Netherlands
| |
Collapse
|
22
|
Russo AG, De Martino M, Elia A, Di Salle F, Esposito F. Negative correlation between word-level surprisal and intersubject neural synchronization during narrative listening. Cortex 2022; 155:132-149. [DOI: 10.1016/j.cortex.2022.07.005] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2021] [Revised: 02/10/2022] [Accepted: 07/06/2022] [Indexed: 11/30/2022]
|
23
|
Kragel JE, Voss JL. Looking for the neural basis of memory. Trends Cogn Sci 2022; 26:53-65. [PMID: 34836769 PMCID: PMC8678329 DOI: 10.1016/j.tics.2021.10.010] [Citation(s) in RCA: 17] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2021] [Revised: 10/14/2021] [Accepted: 10/15/2021] [Indexed: 01/03/2023]
Abstract
Memory neuroscientists often measure neural activity during task trials designed to recruit specific memory processes. Behavior is championed as crucial for deciphering brain-memory linkages but is impoverished in typical experiments that rely on summary judgments. We criticize this approach as being blind to the multiple cognitive, neural, and behavioral processes that occur rapidly within a trial to support memory. Instead, time-resolved behaviors such as eye movements occur at the speed of cognition and neural activity. We highlight successes using eye-movement tracking with in vivo electrophysiology to link rapid hippocampal oscillations to encoding and retrieval processes that interact over hundreds of milliseconds. This approach will improve research on the neural basis of memory because it pinpoints discrete moments of brain-behavior-cognition correspondence.
Collapse
Affiliation(s)
- James E Kragel
- Department of Neurology, The University of Chicago, 5841 South Maryland Avenue, Chicago, IL 60637, USA.
| | - Joel L Voss
- Department of Neurology, The University of Chicago, 5841 South Maryland Avenue, Chicago, IL 60637, USA
| |
Collapse
|
24
|
Zhang H, Ille S, Sogerer L, Schwendner M, Schröder A, Meyer B, Wiestler B, Krieg SM. Elucidating the structural-functional connectome of language in glioma-induced aphasia using nTMS and DTI. Hum Brain Mapp 2021; 43:1836-1849. [PMID: 34951084 PMCID: PMC8933329 DOI: 10.1002/hbm.25757] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2021] [Revised: 12/01/2021] [Accepted: 12/01/2021] [Indexed: 12/24/2022] Open
Abstract
Glioma‐induced aphasia (GIA) is frequently observed in patients with newly diagnosed gliomas. Previous studies showed an impact of gliomas not only on local brain regions but also on the functionality and structure of brain networks. The current study used navigated transcranial magnetic stimulation (nTMS) to localize language‐related regions and to explore language function at the network level in combination with connectome analysis. Thirty glioma patients without aphasia (NA) and 30 patients with GIA were prospectively enrolled. Tumors were located in the vicinity of arcuate fasciculus‐related cortical and subcortical regions. The visualized ratio (VR) of each tract was calculated based on their respective fractional anisotropy (FA) and maximal FA. Using a thresholding method of each tract at 25% VR and 50% VR, DTI‐based tractography was performed to construct structural brain networks for graph‐based connectome analysis, containing functional data acquired by nTMS. The average degree of left hemispheric networks (Mleft) was higher in the NA group than in the GIA group for both VR thresholds. Differences of global and local efficiency between 25% and 50% VR thresholds were significantly lower in the NA group than in the GIA group. Aphasia levels correlated with connectome properties in Mleft and networks based on positive nTMS mapping regions (Mpos). A more substantial relation to language performance was found in Mpos and Mleft compared to the network of negative mapping regions (Mneg). Gliomas causing deterioration of language are related to various cerebral networks. In NA patients, mainly Mneg was impacted, while Mpos was impacted in GIA patients.
Collapse
Affiliation(s)
- Haosu Zhang
- Department of Neurosurgery, Klinikum rechts der Isar, Technical University of Munich, Munich, Germany.,School of Medicine, Technical University of Munich, Munich, Germany
| | - Sebastian Ille
- Department of Neurosurgery, Klinikum rechts der Isar, Technical University of Munich, Munich, Germany.,School of Medicine, Technical University of Munich, Munich, Germany.,TUM-Neuroimaging Center, Technical University of Munich, Munich, Germany
| | - Lisa Sogerer
- Department of Neurosurgery, Klinikum rechts der Isar, Technical University of Munich, Munich, Germany.,School of Medicine, Technical University of Munich, Munich, Germany
| | - Maximilian Schwendner
- Department of Neurosurgery, Klinikum rechts der Isar, Technical University of Munich, Munich, Germany.,School of Medicine, Technical University of Munich, Munich, Germany
| | - Axel Schröder
- Department of Neurosurgery, Klinikum rechts der Isar, Technical University of Munich, Munich, Germany.,School of Medicine, Technical University of Munich, Munich, Germany
| | - Bernhard Meyer
- Department of Neurosurgery, Klinikum rechts der Isar, Technical University of Munich, Munich, Germany.,School of Medicine, Technical University of Munich, Munich, Germany
| | - Benedikt Wiestler
- School of Medicine, Technical University of Munich, Munich, Germany.,Department of Diagnostic and Interventional Neuroradiology, Klinikum rechts der Isar, Technical University of Munich, Munich, Germany.,Center for Translational Cancer Research of the TUM (TranslaTUM), Technical University of Munich, Munich, Germany
| | - Sandro M Krieg
- Department of Neurosurgery, Klinikum rechts der Isar, Technical University of Munich, Munich, Germany.,School of Medicine, Technical University of Munich, Munich, Germany.,TUM-Neuroimaging Center, Technical University of Munich, Munich, Germany
| |
Collapse
|
25
|
Schrimpf M, Blank IA, Tuckute G, Kauf C, Hosseini EA, Kanwisher N, Tenenbaum JB, Fedorenko E. The neural architecture of language: Integrative modeling converges on predictive processing. Proc Natl Acad Sci U S A 2021; 118:e2105646118. [PMID: 34737231 PMCID: PMC8694052 DOI: 10.1073/pnas.2105646118] [Citation(s) in RCA: 108] [Impact Index Per Article: 36.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/03/2021] [Indexed: 01/30/2023] Open
Abstract
The neuroscience of perception has recently been revolutionized with an integrative modeling approach in which computation, brain function, and behavior are linked across many datasets and many computational models. By revealing trends across models, this approach yields novel insights into cognitive and neural mechanisms in the target domain. We here present a systematic study taking this approach to higher-level cognition: human language processing, our species' signature cognitive skill. We find that the most powerful "transformer" models predict nearly 100% of explainable variance in neural responses to sentences and generalize across different datasets and imaging modalities (functional MRI and electrocorticography). Models' neural fits ("brain score") and fits to behavioral responses are both strongly correlated with model accuracy on the next-word prediction task (but not other language tasks). Model architecture appears to substantially contribute to neural fit. These results provide computationally explicit evidence that predictive processing fundamentally shapes the language comprehension mechanisms in the human brain.
Collapse
Affiliation(s)
- Martin Schrimpf
- Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA 02139;
- McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, MA 02139
- Center for Brains, Minds and Machines, Massachusetts Institute of Technology, Cambridge, MA 02139
| | - Idan Asher Blank
- Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA 02139
- Department of Psychology, University of California, Los Angeles, CA 90095
| | - Greta Tuckute
- Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA 02139
- McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, MA 02139
| | - Carina Kauf
- Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA 02139
- McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, MA 02139
| | - Eghbal A Hosseini
- Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA 02139
- McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, MA 02139
| | - Nancy Kanwisher
- Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA 02139;
- McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, MA 02139
- Center for Brains, Minds and Machines, Massachusetts Institute of Technology, Cambridge, MA 02139
| | - Joshua B Tenenbaum
- Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA 02139
- Center for Brains, Minds and Machines, Massachusetts Institute of Technology, Cambridge, MA 02139
| | - Evelina Fedorenko
- Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA 02139;
- McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, MA 02139
| |
Collapse
|
26
|
Ferreira F, Qiu Z. Predicting syntactic structure. Brain Res 2021; 1770:147632. [PMID: 34453937 DOI: 10.1016/j.brainres.2021.147632] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2021] [Revised: 08/20/2021] [Accepted: 08/21/2021] [Indexed: 11/19/2022]
Abstract
Prediction in language processing has been a topic of major interest in psycholinguistics for at least the last two decades, but most investigations focus on semantic rather than syntactic prediction. This review begins with a discussion of some influential models of parsing which assume that comprehenders have the ability to anticipate syntactic nodes, beginning with left-corner parsers and the garden-path model and ending with current information-theoretic approaches that emphasize online probabilistic prediction. We then turn to evidence for the prediction of specific syntactic forms, including coordinate clauses and noun phrases, verb arguments, and individual nouns, as well as studies that use morphosyntactic constraints to assess whether a specific semantic prediction has been made. The last section considers the implications of syntactic prediction for theories of language architecture and describes four avenues for future research.
Collapse
Affiliation(s)
| | - Zhuang Qiu
- University of California, Davis, United States.
| |
Collapse
|
27
|
Giglio L, Ostarek M, Weber K, Hagoort P. Commonalities and Asymmetries in the Neurobiological Infrastructure for Language Production and Comprehension. Cereb Cortex 2021; 32:1405-1418. [PMID: 34491301 PMCID: PMC8971077 DOI: 10.1093/cercor/bhab287] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2021] [Revised: 07/19/2021] [Accepted: 07/20/2021] [Indexed: 01/30/2023] Open
Abstract
The neurobiology of sentence production has been largely understudied compared to the neurobiology of sentence comprehension, due to difficulties with experimental control and motion-related artifacts in neuroimaging. We studied the neural response to constituents of increasing size and specifically focused on the similarities and differences in the production and comprehension of the same stimuli. Participants had to either produce or listen to stimuli in a gradient of constituent size based on a visual prompt. Larger constituent sizes engaged the left inferior frontal gyrus (LIFG) and middle temporal gyrus (LMTG) extending to inferior parietal areas in both production and comprehension, confirming that the neural resources for syntactic encoding and decoding are largely overlapping. An ROI analysis in LIFG and LMTG also showed that production elicited larger responses to constituent size than comprehension and that the LMTG was more engaged in comprehension than production, while the LIFG was more engaged in production than comprehension. Finally, increasing constituent size was characterized by later BOLD peaks in comprehension but earlier peaks in production. These results show that syntactic encoding and parsing engage overlapping areas, but there are asymmetries in the engagement of the language network due to the specific requirements of production and comprehension.
Collapse
Affiliation(s)
- Laura Giglio
- Max Planck Institute for Psycholinguistics, 6525 XD Nijmegen, The Netherlands.,Donders Institute for Cognition, Brain and Behaviour, Radboud University, 6525 AJ Nijmegen, The Netherlands
| | - Markus Ostarek
- Max Planck Institute for Psycholinguistics, 6525 XD Nijmegen, The Netherlands.,Donders Institute for Cognition, Brain and Behaviour, Radboud University, 6525 AJ Nijmegen, The Netherlands
| | - Kirsten Weber
- Max Planck Institute for Psycholinguistics, 6525 XD Nijmegen, The Netherlands.,Donders Institute for Cognition, Brain and Behaviour, Radboud University, 6525 AJ Nijmegen, The Netherlands
| | - Peter Hagoort
- Max Planck Institute for Psycholinguistics, 6525 XD Nijmegen, The Netherlands.,Donders Institute for Cognition, Brain and Behaviour, Radboud University, 6525 AJ Nijmegen, The Netherlands
| |
Collapse
|
28
|
Wehbe L, Blank IA, Shain C, Futrell R, Levy R, von der Malsburg T, Smith N, Gibson E, Fedorenko E. Incremental Language Comprehension Difficulty Predicts Activity in the Language Network but Not the Multiple Demand Network. Cereb Cortex 2021. [PMID: 33895807 DOI: 10.1101/2020.04.15.043844] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/11/2023] Open
Abstract
What role do domain-general executive functions play in human language comprehension? To address this question, we examine the relationship between behavioral measures of comprehension and neural activity in the domain-general "multiple demand" (MD) network, which has been linked to constructs like attention, working memory, inhibitory control, and selection, and implicated in diverse goal-directed behaviors. Specifically, functional magnetic resonance imaging data collected during naturalistic story listening are compared with theory-neutral measures of online comprehension difficulty and incremental processing load (reading times and eye-fixation durations). Critically, to ensure that variance in these measures is driven by features of the linguistic stimulus rather than reflecting participant- or trial-level variability, the neuroimaging and behavioral datasets were collected in nonoverlapping samples. We find no behavioral-neural link in functionally localized MD regions; instead, this link is found in the domain-specific, fronto-temporal "core language network," in both left-hemispheric areas and their right hemispheric homotopic areas. These results argue against strong involvement of domain-general executive circuits in language comprehension.
Collapse
Affiliation(s)
- Leila Wehbe
- Carnegie Mellon University, Machine Learning Department PA 15213, USA
| | - Idan Asher Blank
- Massachusetts Institute of Technology, Department of Brain and Cognitive Sciences MA 02139, USA
- University of California Los Angeles, Department of Psychology CA 90095, USA
| | - Cory Shain
- Ohio State University, Department of Linguistics OH 43210, USA
| | - Richard Futrell
- Massachusetts Institute of Technology, Department of Brain and Cognitive Sciences MA 02139, USA
- University of California Irvine, Department of Linguistics CA 92697, USA
| | - Roger Levy
- Massachusetts Institute of Technology, Department of Brain and Cognitive Sciences MA 02139, USA
- University of California San Diego, Department of Linguistics CA 92161, USA
| | - Titus von der Malsburg
- Massachusetts Institute of Technology, Department of Brain and Cognitive Sciences MA 02139, USA
- University of Stuttgart, Institute of Linguistics, 70049 Stuttgart, Germany
| | - Nathaniel Smith
- University of California San Diego, Department of Linguistics CA 92161, USA
| | - Edward Gibson
- Massachusetts Institute of Technology, Department of Brain and Cognitive Sciences MA 02139, USA
| | - Evelina Fedorenko
- Massachusetts Institute of Technology, Department of Brain and Cognitive Sciences MA 02139, USA
- Massachusetts Institute of Technology, McGovern Institute for Brain ResearchMA 02139, USA
| |
Collapse
|
29
|
Wehbe L, Blank IA, Shain C, Futrell R, Levy R, von der Malsburg T, Smith N, Gibson E, Fedorenko E. Incremental Language Comprehension Difficulty Predicts Activity in the Language Network but Not the Multiple Demand Network. Cereb Cortex 2021; 31:4006-4023. [PMID: 33895807 DOI: 10.1093/cercor/bhab065] [Citation(s) in RCA: 29] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2020] [Revised: 01/15/2021] [Accepted: 02/21/2021] [Indexed: 12/28/2022] Open
Abstract
What role do domain-general executive functions play in human language comprehension? To address this question, we examine the relationship between behavioral measures of comprehension and neural activity in the domain-general "multiple demand" (MD) network, which has been linked to constructs like attention, working memory, inhibitory control, and selection, and implicated in diverse goal-directed behaviors. Specifically, functional magnetic resonance imaging data collected during naturalistic story listening are compared with theory-neutral measures of online comprehension difficulty and incremental processing load (reading times and eye-fixation durations). Critically, to ensure that variance in these measures is driven by features of the linguistic stimulus rather than reflecting participant- or trial-level variability, the neuroimaging and behavioral datasets were collected in nonoverlapping samples. We find no behavioral-neural link in functionally localized MD regions; instead, this link is found in the domain-specific, fronto-temporal "core language network," in both left-hemispheric areas and their right hemispheric homotopic areas. These results argue against strong involvement of domain-general executive circuits in language comprehension.
Collapse
Affiliation(s)
- Leila Wehbe
- Carnegie Mellon University, Machine Learning Department PA 15213, USA
| | - Idan Asher Blank
- Massachusetts Institute of Technology, Department of Brain and Cognitive Sciences MA 02139, USA.,University of California Los Angeles, Department of Psychology CA 90095, USA
| | - Cory Shain
- Ohio State University, Department of Linguistics OH 43210, USA
| | - Richard Futrell
- Massachusetts Institute of Technology, Department of Brain and Cognitive Sciences MA 02139, USA.,University of California Irvine, Department of Linguistics CA 92697, USA
| | - Roger Levy
- Massachusetts Institute of Technology, Department of Brain and Cognitive Sciences MA 02139, USA.,University of California San Diego, Department of Linguistics CA 92161, USA
| | - Titus von der Malsburg
- Massachusetts Institute of Technology, Department of Brain and Cognitive Sciences MA 02139, USA.,University of Stuttgart, Institute of Linguistics, 70049 Stuttgart, Germany
| | - Nathaniel Smith
- University of California San Diego, Department of Linguistics CA 92161, USA
| | - Edward Gibson
- Massachusetts Institute of Technology, Department of Brain and Cognitive Sciences MA 02139, USA
| | - Evelina Fedorenko
- Massachusetts Institute of Technology, Department of Brain and Cognitive Sciences MA 02139, USA.,Massachusetts Institute of Technology, McGovern Institute for Brain ResearchMA 02139, USA
| |
Collapse
|
30
|
Continuous-time deconvolutional regression for psycholinguistic modeling. Cognition 2021; 215:104735. [PMID: 34303182 DOI: 10.1016/j.cognition.2021.104735] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2019] [Revised: 04/01/2021] [Accepted: 04/11/2021] [Indexed: 12/28/2022]
Abstract
The influence of stimuli in psycholinguistic experiments diffuses across time because the human response to language is not instantaneous. The linear models typically used to analyze psycholinguistic data are unable to account for this phenomenon due to strong temporal independence assumptions, while existing deconvolutional methods for estimating diffuse temporal structure model time discretely and therefore cannot be directly applied to natural language stimuli where events (words) have variable duration. In light of evidence that continuous-time deconvolutional regression (CDR) can address these issues (Shain & Schuler, 2018), this article motivates the use of CDR for many experimental settings, exposits some of its mathematical properties, and empirically evaluates the influence of various experimental confounds (noise, multicollinearity, and impulse response misspecification), hyperparameter settings, and response types (behavioral and fMRI). Results show that CDR (1) yields highly consistent estimates across a variety of hyperparameter configurations, (2) faithfully recovers the data-generating model on synthetic data, even under adverse training conditions, and (3) outperforms widely-used statistical approaches when applied to naturalistic reading and fMRI data. In addition, procedures for testing scientific hypotheses using CDR are defined and demonstrated, and empirically-motivated best-practices for CDR modeling are proposed. Results support the use of CDR for analyzing psycholinguistic time series, especially in a naturalistic experimental paradigm.
Collapse
|
31
|
Kocagoncu E, Klimovich-Gray A, Hughes LE, Rowe JB. Evidence and implications of abnormal predictive coding in dementia. Brain 2021; 144:3311-3321. [PMID: 34240109 PMCID: PMC8677549 DOI: 10.1093/brain/awab254] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2020] [Revised: 03/15/2021] [Accepted: 06/17/2021] [Indexed: 11/14/2022] Open
Abstract
The diversity of cognitive deficits and neuropathological processes associated with dementias has encouraged divergence in pathophysiological explanations of disease. Here, we review an alternative framework that emphasizes convergent critical features of cognitive pathophysiology. Rather than the loss of ‘memory centres’ or ‘language centres’, or singular neurotransmitter systems, cognitive deficits are interpreted in terms of aberrant predictive coding in hierarchical neural networks. This builds on advances in normative accounts of brain function, specifically the Bayesian integration of beliefs and sensory evidence in which hierarchical predictions and prediction errors underlie memory, perception, speech and behaviour. We describe how analogous impairments in predictive coding in parallel neurocognitive systems can generate diverse clinical phenomena, including the characteristics of dementias. The review presents evidence from behavioural and neurophysiological studies of perception, language, memory and decision-making. The reformulation of cognitive deficits in terms of predictive coding has several advantages. It brings diverse clinical phenomena into a common framework; it aligns cognitive and movement disorders; and it makes specific predictions on cognitive physiology that support translational and experimental medicine studies. The insights into complex human cognitive disorders from the predictive coding framework may therefore also inform future therapeutic strategies.
Collapse
Affiliation(s)
- Ece Kocagoncu
- Cambridge Centre for Frontotemporal Dementia, Department of Clinical Neurosciences, University of Cambridge, Cambridge, UK
| | | | - Laura E Hughes
- Cambridge Centre for Frontotemporal Dementia, Department of Clinical Neurosciences, University of Cambridge, Cambridge, UK.,Medical Research Council Cognition and Brain Sciences Unit, University of Cambridge, Cambridge, UK
| | - James B Rowe
- Cambridge Centre for Frontotemporal Dementia, Department of Clinical Neurosciences, University of Cambridge, Cambridge, UK.,Medical Research Council Cognition and Brain Sciences Unit, University of Cambridge, Cambridge, UK
| |
Collapse
|
32
|
Huber-Huber C, Buonocore A, Melcher D. The extrafoveal preview paradigm as a measure of predictive, active sampling in visual perception. J Vis 2021; 21:12. [PMID: 34283203 PMCID: PMC8300052 DOI: 10.1167/jov.21.7.12] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2020] [Accepted: 05/18/2021] [Indexed: 01/02/2023] Open
Abstract
A key feature of visual processing in humans is the use of saccadic eye movements to look around the environment. Saccades are typically used to bring relevant information, which is glimpsed with extrafoveal vision, into the high-resolution fovea for further processing. With the exception of some unusual circumstances, such as the first fixation when walking into a room, our saccades are mainly guided based on this extrafoveal preview. In contrast, the majority of experimental studies in vision science have investigated "passive" behavioral and neural responses to suddenly appearing and often temporally or spatially unpredictable stimuli. As reviewed here, a growing number of studies have investigated visual processing of objects under more natural viewing conditions in which observers move their eyes to a stationary stimulus, visible previously in extrafoveal vision, during each trial. These studies demonstrate that the extrafoveal preview has a profound influence on visual processing of objects, both for behavior and neural activity. Starting from the preview effect in reading research we follow subsequent developments in vision research more generally and finally argue that taking such evidence seriously leads to a reconceptualization of the nature of human visual perception that incorporates the strong influence of prediction and action on sensory processing. We review theoretical perspectives on visual perception under naturalistic viewing conditions, including theories of active vision, active sensing, and sampling. Although the extrafoveal preview paradigm has already provided useful information about the timing of, and potential mechanisms for, the close interaction of the oculomotor and visual systems while reading and in natural scenes, the findings thus far also raise many new questions for future research.
Collapse
Affiliation(s)
- Christoph Huber-Huber
- Radboud University, Donders Institute for Brain, Cognition and Behaviour, The Netherlands
- CIMeC, University of Trento, Italy
| | - Antimo Buonocore
- Werner Reichardt Centre for Integrative Neuroscience, Tübingen University, Tübingen, BW, Germany
- Hertie Institute for Clinical Brain Research, Tübingen University, Tübingen, BW, Germany
| | - David Melcher
- CIMeC, University of Trento, Italy
- Division of Science, New York University Abu Dhabi, UAE
| |
Collapse
|
33
|
Beier EJ, Chantavarin S, Rehrig G, Ferreira F, Miller LM. Cortical Tracking of Speech: Toward Collaboration between the Fields of Signal and Sentence Processing. J Cogn Neurosci 2021; 33:574-593. [PMID: 33475452 DOI: 10.1162/jocn_a_01676] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]
Abstract
In recent years, a growing number of studies have used cortical tracking methods to investigate auditory language processing. Although most studies that employ cortical tracking stem from the field of auditory signal processing, this approach should also be of interest to psycholinguistics-particularly the subfield of sentence processing-given its potential to provide insight into dynamic language comprehension processes. However, there has been limited collaboration between these fields, which we suggest is partly because of differences in theoretical background and methodological constraints, some mutually exclusive. In this paper, we first review the theories and methodological constraints that have historically been prioritized in each field and provide concrete examples of how some of these constraints may be reconciled. We then elaborate on how further collaboration between the two fields could be mutually beneficial. Specifically, we argue that the use of cortical tracking methods may help resolve long-standing debates in the field of sentence processing that commonly used behavioral and neural measures (e.g., ERPs) have failed to adjudicate. Similarly, signal processing researchers who use cortical tracking may be able to reduce noise in the neural data and broaden the impact of their results by controlling for linguistic features of their stimuli and by using simple comprehension tasks. Overall, we argue that a balance between the methodological constraints of the two fields will lead to an overall improved understanding of language processing as well as greater clarity on what mechanisms cortical tracking of speech reflects. Increased collaboration will help resolve debates in both fields and will lead to new and exciting avenues for research.
Collapse
|
34
|
Chen Z, Hale JT. Quantifying Structural and Non‐structural Expectations in Relative Clause Processing. Cogn Sci 2021; 45:e12927. [DOI: 10.1111/cogs.12927] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2019] [Revised: 10/16/2020] [Accepted: 10/29/2020] [Indexed: 11/30/2022]
Affiliation(s)
- Zhong Chen
- Department of Modern Languages and Cultures Rochester Institute of Technology
| | | |
Collapse
|
35
|
Engelhardt PE, Yuen MKY, Kenning EA, Filipovic L. Are Linguistic Prediction Deficits Characteristic of Adults with Dyslexia? Brain Sci 2021; 11:59. [PMID: 33418904 PMCID: PMC7825117 DOI: 10.3390/brainsci11010059] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2020] [Revised: 12/18/2020] [Accepted: 12/31/2020] [Indexed: 11/16/2022] Open
Abstract
Individuals with dyslexia show deficits in phonological abilities, rapid automatized naming, short-term/working memory, processing speed, and some aspects of sensory and visual processing. There is currently one report in the literature that individuals with dyslexia also show impairments in linguistic prediction. The current study sought to investigate prediction in language processing in dyslexia. Forty-one adults with dyslexia and 43 typically-developing controls participated. In the experiment, participants made speeded-acceptability judgements in sentences with word final cloze manipulations. The final word was a high-cloze probability word, a low-cloze probability word, or a semantically anomalous word. Reaction time from the onset of the final word to participants' response was recorded. Results indicated that individuals with dyslexia showed longer reaction times, and crucially, they showed clear differences from controls in low predictability sentences, which is consistent with deficits in linguistic prediction. Conclusions focus on the mechanism supporting prediction in language comprehension and possible reasons why individuals with dyslexia show less prediction.
Collapse
Affiliation(s)
| | | | - Elise A. Kenning
- School of Psychology, University of East Anglia, Norwich NR7 7TJ, UK;
| | - Luna Filipovic
- School of Politics, Philosophy, Language, and Communication Studies, University of East Anglia, Norwich NR4 7TJ, UK;
| |
Collapse
|
36
|
Lukic S, Thompson CK, Barbieri E, Chiappetta B, Bonakdarpour B, Kiran S, Rapp B, Parrish TB, Caplan D. Common and distinct neural substrates of sentence production and comprehension. Neuroimage 2021; 224:117374. [PMID: 32949711 PMCID: PMC10134242 DOI: 10.1016/j.neuroimage.2020.117374] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2020] [Revised: 09/08/2020] [Accepted: 09/12/2020] [Indexed: 01/08/2023] Open
Abstract
Functional neuroimaging and lesion-symptom mapping investigations implicate a left frontal-temporal-parietal network for sentence processing. The majority of studies have focused on sentence comprehension, with fewer in the domain of sentence production, which have not fully elucidated overlapping and/or unique brain structures associated with the two domains, particularly for sentences with noncanonical word order. Using voxel-based lesion symptom mapping (VLSM) we examined the relationship between lesions within the left hemisphere language network and both sentence comprehension and production of simple and complex syntactic structures in 76 participants with chronic stroke-induced aphasia. Results revealed shared regions across domains in the anterior and posterior superior temporal gyri (aSTG, pSTG), and the temporal pole (adjusted for verb production/comprehension). Additionally, comprehension was associated with lesions in the anterior and posterior middle temporal gyri (aMTG, pMTG), the MTG temporooccipital regions, SMG/AG, central and parietal operculum, and the insula. Subsequent VLSM analyses (production versus comprehension) revealed critical regions associated with each domain: anterior temporal lesions were associated with production; posterior temporo-parietal lesions were associated with comprehension, implicating important roles for regions within the ventral and dorsal stream processing routes, respectively. Processing of syntactically complex, noncanonical (adjusted for canonical), sentences was associated with damage to the pSTG across domains, with additional damage to the pMTG and IPL associated with impaired sentence comprehension, suggesting that the pSTG is crucial for computing noncanonical sentences across domains and that the pMTG, and IPL are necessary for re-analysis of thematic roles as required for resolution of long-distance dependencies. These findings converge with previous studies and extend our knowledge of the neural mechanisms of sentence comprehension to production, highlighting critical regions associated with both domains, and further address the mechanism engaged for syntactic computation, controlled for the contribution of verb processing.
Collapse
|
37
|
Schuster S, Himmelstoss NA, Hutzler F, Richlan F, Kronbichler M, Hawelka S. Cloze enough? Hemodynamic effects of predictive processing during natural reading. Neuroimage 2020; 228:117687. [PMID: 33385553 DOI: 10.1016/j.neuroimage.2020.117687] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2020] [Revised: 11/25/2020] [Accepted: 12/23/2020] [Indexed: 10/22/2022] Open
Abstract
Evidence accrues that readers form multiple hypotheses about upcoming words. The present study investigated the hemodynamic effects of predictive processing during natural reading by means of combining fMRI and eye movement recordings. In particular, we investigated the neural and behavioral correlates of precision-weighted prediction errors, which are thought to be indicative of subsequent belief updating. Participants silently read sentences in which we manipulated the cloze probability and the semantic congruency of the final word that served as an index for precision and prediction error respectively. With respect to the neural correlates, our findings indicate an enhanced activation within the left inferior frontal and middle temporal gyrus suggesting an effect of precision on prediction update in higher (lexico-)semantic levels. Despite being evident at the neural level, we did not observe any evidence that this mechanism resulted in disproportionate reading times on participants' eye movements. The results speak against discrete predictions, but favor the notion that multiple words are activated in parallel during reading.
Collapse
Affiliation(s)
- Sarah Schuster
- Paris-Lodron-University of Salzburg, Department of Psychology, Centre for Cognitive Neuroscience, Hellbrunnerstr. 34, 5020 Salzburg, Austria
| | - Nicole Alexandra Himmelstoss
- Paris-Lodron-University of Salzburg, Department of Psychology, Centre for Cognitive Neuroscience, Hellbrunnerstr. 34, 5020 Salzburg, Austria
| | - Florian Hutzler
- Paris-Lodron-University of Salzburg, Department of Psychology, Centre for Cognitive Neuroscience, Hellbrunnerstr. 34, 5020 Salzburg, Austria.
| | - Fabio Richlan
- Paris-Lodron-University of Salzburg, Department of Psychology, Centre for Cognitive Neuroscience, Hellbrunnerstr. 34, 5020 Salzburg, Austria
| | - Martin Kronbichler
- Paris-Lodron-University of Salzburg, Department of Psychology, Centre for Cognitive Neuroscience, Hellbrunnerstr. 34, 5020 Salzburg, Austria; Neuroscience Institute and Department of Neurology, Christian Doppler Clinic, Paracelsus Private Medical University, Ignaz-Harrer-Str. 79, 5020 Salzburg, Austria
| | - Stefan Hawelka
- Paris-Lodron-University of Salzburg, Department of Psychology, Centre for Cognitive Neuroscience, Hellbrunnerstr. 34, 5020 Salzburg, Austria
| |
Collapse
|
38
|
Henderson JM, Goold JE, Choi W, Hayes TR. Neural Correlates of Fixated Low- and High-level Scene Properties during Active Scene Viewing. J Cogn Neurosci 2020; 32:2013-2023. [PMID: 32573384 PMCID: PMC11164273 DOI: 10.1162/jocn_a_01599] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022]
Abstract
During real-world scene perception, viewers actively direct their attention through a scene in a controlled sequence of eye fixations. During each fixation, local scene properties are attended, analyzed, and interpreted. What is the relationship between fixated scene properties and neural activity in the visual cortex? Participants inspected photographs of real-world scenes in an MRI scanner while their eye movements were recorded. Fixation-related fMRI was used to measure activation as a function of lower- and higher-level scene properties at fixation, operationalized as edge density and meaning maps, respectively. We found that edge density at fixation was most associated with activation in early visual areas, whereas semantic content at fixation was most associated with activation along the ventral visual stream including core object and scene-selective areas (lateral occipital complex, parahippocampal place area, occipital place area, and retrosplenial cortex). The observed activation from semantic content was not accounted for by differences in edge density. The results are consistent with active vision models in which fixation gates detailed visual analysis for fixated scene regions, and this gating influences both lower and higher levels of scene analysis.
Collapse
Affiliation(s)
| | | | - Wonil Choi
- Gwangju Institute of Science and Technology
| | | |
Collapse
|
39
|
Russo AG, De Martino M, Mancuso A, Iaconetta G, Manara R, Elia A, Laudanna A, Di Salle F, Esposito F. Semantics-weighted lexical surprisal modeling of naturalistic functional MRI time-series during spoken narrative listening. Neuroimage 2020; 222:117281. [PMID: 32828929 DOI: 10.1016/j.neuroimage.2020.117281] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2020] [Revised: 06/22/2020] [Accepted: 08/13/2020] [Indexed: 11/16/2022] Open
Abstract
Probabilistic language models are increasingly used to provide neural representations of linguistic features under naturalistic settings. Word surprisal models can be applied to continuous fMRI recordings during task-free listening of narratives, to detect regions linked to language prediction and comprehension. Here, to this purpose, a novel semantics-weighted lexical surprisal is applied to naturalistic fMRI data. FMRI was performed at 3 Tesla in 31 subjects during task-free listening to a 12-minute audiobook played in both original and word-reversed (control) version. Lexical-only and semantics-weighted lexical surprisal models were estimated for the original and control word series. The two series were alternatively chosen to build the predictor of interest in the first-level general linear model and were compared in the second-level (group) analysis. The addition of the surprisal predictor to the stimulus-related predictors significantly improved the fitting of the neural signal. In average, the semantics-weighted model yielded lower surprisal values and, in some areas, better fitting of the fMRI data compared to the lexical-only model. The two models produced both overlapping and distinct activations: while lexical-only surprisal activated secondary auditory areas in the superior temporal gyri and the cerebellum, semantics-weighted surprisal additionally activated the left inferior frontal gyrus. These results confirm the usefulness of surprisal models in the naturalistic fMRI analysis of linguistic processes and suggest that the use of semantic information may increase the sensitivity of a probabilistic language model in higher-order language-related areas, with possible implications for future naturalistic fMRI studies of language under normal and (clinically or pharmacologically) modified conditions.
Collapse
Affiliation(s)
- Andrea G Russo
- Department of Political and Communication Sciences, University of Salerno, Fisciano, Salerno, Italy; Department of Medicine, Surgery and Dentistry, "Scuola Medica Salernitana", University of Salerno, Baronissi, Salerno, Italy.
| | - Maria De Martino
- Department of Political and Communication Sciences, University of Salerno, Fisciano, Salerno, Italy
| | - Azzurra Mancuso
- Department of Political and Communication Sciences, University of Salerno, Fisciano, Salerno, Italy
| | - Giorgio Iaconetta
- Department of Medicine, Surgery and Dentistry, "Scuola Medica Salernitana", University of Salerno, Baronissi, Salerno, Italy; Department of Diagnostic Imaging, University Hospital "San Giovanni di Dio e Ruggi D'Aragona", Salerno, Italy
| | - Renzo Manara
- Department of Medicine, Surgery and Dentistry, "Scuola Medica Salernitana", University of Salerno, Baronissi, Salerno, Italy; Department of Diagnostic Imaging, University Hospital "San Giovanni di Dio e Ruggi D'Aragona", Salerno, Italy
| | - Annibale Elia
- Department of Political and Communication Sciences, University of Salerno, Fisciano, Salerno, Italy
| | - Alessandro Laudanna
- Department of Political and Communication Sciences, University of Salerno, Fisciano, Salerno, Italy
| | - Francesco Di Salle
- Department of Medicine, Surgery and Dentistry, "Scuola Medica Salernitana", University of Salerno, Baronissi, Salerno, Italy; Department of Diagnostic Imaging, University Hospital "San Giovanni di Dio e Ruggi D'Aragona", Salerno, Italy
| | - Fabrizio Esposito
- Department of Medicine, Surgery and Dentistry, "Scuola Medica Salernitana", University of Salerno, Baronissi, Salerno, Italy; Department of Diagnostic Imaging, University Hospital "San Giovanni di Dio e Ruggi D'Aragona", Salerno, Italy
| |
Collapse
|
40
|
Carter BT, Luke SG. Best practices in eye tracking research. Int J Psychophysiol 2020; 155:49-62. [PMID: 32504653 DOI: 10.1016/j.ijpsycho.2020.05.010] [Citation(s) in RCA: 61] [Impact Index Per Article: 15.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2019] [Revised: 05/26/2020] [Accepted: 05/27/2020] [Indexed: 12/14/2022]
Abstract
This guide describes best practices in using eye tracking technology for research in a variety of disciplines. A basic outline of the anatomy and physiology of the eyes and of eye movements is provided, along with a description of the sorts of research questions eye tracking can address. We then explain how eye tracking technology works and what sorts of data it generates, and provide guidance on how to select and use an eye tracker as well as selecting appropriate eye tracking measures. Challenges to the validity of eye tracking studies are described, along with recommendations for overcoming these challenges. We then outline correct reporting standards for eye tracking studies.
Collapse
|
41
|
Brennan JR, Dyer C, Kuncoro A, Hale JT. Localizing syntactic predictions using recurrent neural network grammars. Neuropsychologia 2020; 146:107479. [PMID: 32428530 DOI: 10.1016/j.neuropsychologia.2020.107479] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2019] [Revised: 04/03/2020] [Accepted: 04/29/2020] [Indexed: 12/14/2022]
Abstract
Brain activity in numerous perisylvian brain regions is modulated by the expectedness of linguistic stimuli. We leverage recent advances in computational parsing models to test what representations guide the processes reflected by this activity. Recurrent Neural Network Grammars (RNNGs) are generative models of (tree, string) pairs that use neural networks to drive derivational choices. Parsing with them yields a variety of incremental complexity metrics that we evaluate against a publicly available fMRI data-set recorded while participants simply listen to an audiobook story. Surprisal, which captures a word's un-expectedness, correlates with a wide range of temporal and frontal regions when it is calculated based on word-sequence information using a top-performing LSTM neural network language model. The explicit encoding of hierarchy afforded by the RNNG additionally captures activity in left posterior temporal areas. A separate metric tracking the number of derivational steps taken between words correlates with activity in the left temporal lobe and inferior frontal gyrus. This pattern of results narrows down the kinds of linguistic representations at play during predictive processing across the brain's language network.
Collapse
|
42
|
Mesmoudi S, Hommet S, Peschanski D. Eye-tracking and learning experience: gaze trajectories to better understand the behavior of memorial visitors. J Eye Mov Res 2020; 13. [PMID: 33828795 PMCID: PMC7992043 DOI: 10.16910/jemr.13.2.3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Eye-tracking technology is increasingly introduced in museums to assess their role in learning and knowledge transfer. However, their use provide limited quantitative and/or qualitative measures such as viewing time and/or gaze trajectory on an isolated object or image (Region of Interest "ROI").The aim of this work is to evaluate the potential of the mobile eye-tracking to quantify the students’ experience and behaviors through their visit of the "Genocide and mass violence" area of the Caen memorial. In this study, we collected eye-tracking data from 17 students during their visit to the memorial. In addition, all visitors filled out a questionnaire before the visit, and a focus group was conducted before and after the visit. The first results of this study allowed us to analyze the viewing time spent by each visitor in front of 19-selected ROIs, and some of their specific sub-parts. The other important result was the reconstruction of the gaze trajectory through these ROIs. Our global trajectory approach allowed to complete the information obtained from an isolated ROI, and to identify some behaviors such as avoidance. Clustering analysis revealed some typical trajectories performed by specific sub-groups. The eye-tracking results were consolidated by the participants' answers during the focus group.
Collapse
Affiliation(s)
- Salma Mesmoudi
- CNRS, Univ. Paris 1 Panthéon-Sorbonne, EHESS, HESAM Univ., UMR8209, CESSP, MATRICE, France
| | | | - Denis Peschanski
- CNRS, Univ. Paris 1 Panthéon-Sorbonne, EHESS, HESAM Univ., UMR8209, CESSP, MATRICE, France
| |
Collapse
|
43
|
Pfeiffer C, Hollenstein N, Zhang C, Langer N. Neural dynamics of sentiment processing during naturalistic sentence reading. Neuroimage 2020; 218:116934. [PMID: 32416227 DOI: 10.1016/j.neuroimage.2020.116934] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2019] [Revised: 04/24/2020] [Accepted: 05/07/2020] [Indexed: 12/15/2022] Open
Abstract
When we read, our eyes move through the text in a series of fixations and high-velocity saccades to extract visual information. This process allows the brain to obtain meaning, e.g., about sentiment, or the emotional valence, expressed in the written text. How exactly the brain extracts the sentiment of single words during naturalistic reading is largely unknown. This is due to the challenges of naturalistic imaging, which has previously led researchers to employ highly controlled, timed word-by-word presentations of custom reading materials that lack ecological validity. Here, we aimed to assess the electrical neural correlates of word sentiment processing during naturalistic reading of English sentences. We used a publicly available dataset of simultaneous electroencephalography (EEG), eye-tracking recordings, and word-level semantic annotations from 7129 words in 400 sentences (Zurich Cognitive Language Processing Corpus; Hollenstein et al., 2018). We computed fixation-related potentials (FRPs), which are evoked electrical responses time-locked to the onset of fixations. A general linear mixed model analysis of FRPs cleaned from visual- and motor-evoked activity showed a topographical difference between the positive and negative sentiment condition in the 224-304 ms interval after fixation onset in left-central and right-posterior electrode clusters. An additional analysis that included word-, phrase-, and sentence-level sentiment predictors showed the same FRP differences for the word-level sentiment, but no additional FRP differences for phrase- and sentence-level sentiment. Furthermore, decoding analysis that classified word sentiment (positive or negative) from sentiment-matched 40-trial average FRPs showed a 0.60 average accuracy (95% confidence interval: [0.58, 0.61]). Control analyses ruled out that these results were based on differences in eye movements or linguistic features other than word sentiment. Our results extend previous research by showing that the emotional valence of lexico-semantic stimuli evoke a fast electrical neural response upon word fixation during naturalistic reading. These results provide an important step to identify the neural processes of lexico-semantic processing in ecologically valid conditions and can serve to improve computer algorithms for natural language processing.
Collapse
Affiliation(s)
- Christian Pfeiffer
- Methods of Plasticity Research Laboratory, Department of Psychology, University of Zurich, Switzerland; University Research Priority Program (URPP) Dynamics of Healthy Aging, Zurich, Switzerland.
| | | | - Ce Zhang
- Department of Computer Science, ETH, Zurich, Switzerland
| | - Nicolas Langer
- Methods of Plasticity Research Laboratory, Department of Psychology, University of Zurich, Switzerland; University Research Priority Program (URPP) Dynamics of Healthy Aging, Zurich, Switzerland; Neuroscience Center Zurich (ZNZ), Zurich, Switzerland
| |
Collapse
|
44
|
Artoni F, d'Orio P, Catricalà E, Conca F, Bottoni F, Pelliccia V, Sartori I, Russo GL, Cappa SF, Micera S, Moro A. High gamma response tracks different syntactic structures in homophonous phrases. Sci Rep 2020; 10:7537. [PMID: 32372065 PMCID: PMC7200802 DOI: 10.1038/s41598-020-64375-9] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2019] [Accepted: 04/13/2020] [Indexed: 11/08/2022] Open
Abstract
Syntax is a species-specific component of human language combining a finite set of words in a potentially infinite number of sentences. Since words are by definition expressed by sound, factoring out syntactic information is normally impossible. Here, we circumvented this problem in a novel way by designing phrases with exactly the same acoustic content but different syntactic structures depending on the other words they occur with. In particular, we used phrases merging an article with a noun yielding a Noun Phrase (NP) or a clitic with a verb yielding a Verb Phrase (VP). We performed stereo-electroencephalographic (SEEG) recordings in epileptic patients. We measured a different electrophysiological correlates of verb phrases vs. noun phrases in multiple cortical areas in both hemispheres, including language areas and their homologous in the non-dominant hemisphere. The high gamma band activity (150-300 Hz frequency), which plays a crucial role in inter-regional cortical communications, showed a significant difference during the presentation of the homophonous phrases, depending on whether the phrase was a verb phrase or a noun phrase. Our findings contribute to the ultimate goal of a complete neural decoding of linguistic structures from the brain.
Collapse
Affiliation(s)
- Fiorenzo Artoni
- The Biorobotics Institute and Department of Excellence in AI and Robotics, Scuola Superiore Sant'Anna, Pisa, Italy
- Translational Neural Engineering Laboratory, Center for Neuroprosthetics and Institute of Bioengineering, EPFL - Campus Biotech, Geneve, Switzerland
| | - Piergiorgio d'Orio
- "Claudio Munari" Center for Epilepsy Surgery, ASST Grande Ospedale Metropolitano Niguarda, Milan, Italy
- Institute of Neuroscience, CNR, Parma, Italy
| | - Eleonora Catricalà
- Neurocognition Epistemology and theoretical Syntax Research Center (NEtS), Scuola Universitaria Superiore IUSS, Pavia, Italy
| | - Francesca Conca
- Neurocognition Epistemology and theoretical Syntax Research Center (NEtS), Scuola Universitaria Superiore IUSS, Pavia, Italy
| | | | - Veronica Pelliccia
- "Claudio Munari" Center for Epilepsy Surgery, ASST Grande Ospedale Metropolitano Niguarda, Milan, Italy
- Institute of Neuroscience, CNR, Parma, Italy
| | - Ivana Sartori
- "Claudio Munari" Center for Epilepsy Surgery, ASST Grande Ospedale Metropolitano Niguarda, Milan, Italy
| | - Giorgio Lo Russo
- "Claudio Munari" Center for Epilepsy Surgery, ASST Grande Ospedale Metropolitano Niguarda, Milan, Italy
| | - Stefano F Cappa
- Neurocognition Epistemology and theoretical Syntax Research Center (NEtS), Scuola Universitaria Superiore IUSS, Pavia, Italy
- IRCCS Mondino Foundation National Institute of Neurology, Pavia, Italy
| | - Silvestro Micera
- The Biorobotics Institute and Department of Excellence in AI and Robotics, Scuola Superiore Sant'Anna, Pisa, Italy.
- Translational Neural Engineering Laboratory, Center for Neuroprosthetics and Institute of Bioengineering, EPFL - Campus Biotech, Geneve, Switzerland.
| | - Andrea Moro
- Neurocognition Epistemology and theoretical Syntax Research Center (NEtS), Scuola Universitaria Superiore IUSS, Pavia, Italy.
| |
Collapse
|
45
|
Shain C, Blank IA, van Schijndel M, Schuler W, Fedorenko E. fMRI reveals language-specific predictive coding during naturalistic sentence comprehension. Neuropsychologia 2020; 138:107307. [PMID: 31874149 PMCID: PMC7140726 DOI: 10.1016/j.neuropsychologia.2019.107307] [Citation(s) in RCA: 75] [Impact Index Per Article: 18.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2019] [Revised: 12/02/2019] [Accepted: 12/13/2019] [Indexed: 11/19/2022]
Abstract
Much research in cognitive neuroscience supports prediction as a canonical computation of cognition across domains. Is such predictive coding implemented by feedback from higher-order domain-general circuits, or is it locally implemented in domain-specific circuits? What information sources are used to generate these predictions? This study addresses these two questions in the context of language processing. We present fMRI evidence from a naturalistic comprehension paradigm (1) that predictive coding in the brain's response to language is domain-specific, and (2) that these predictions are sensitive both to local word co-occurrence patterns and to hierarchical structure. Using a recently developed continuous-time deconvolutional regression technique that supports data-driven hemodynamic response function discovery from continuous BOLD signal fluctuations in response to naturalistic stimuli, we found effects of prediction measures in the language network but not in the domain-general multiple-demand network, which supports executive control processes and has been previously implicated in language comprehension. Moreover, within the language network, surface-level and structural prediction effects were separable. The predictability effects in the language network were substantial, with the model capturing over 37% of explainable variance on held-out data. These findings indicate that human sentence processing mechanisms generate predictions about upcoming words using cognitive processes that are sensitive to hierarchical structure and specialized for language processing, rather than via feedback from high-level executive control mechanisms.
Collapse
Affiliation(s)
| | - Idan Asher Blank
- University of California Los Angeles, 90024, USA; Massachusetts Institute of Technology, 02139, USA.
| | | | - William Schuler
- The Ohio State University, 43210, USA; Massachusetts General Hospital, Program in Speech and Hearing Bioscience and Technology, 02115, USA.
| | - Evelina Fedorenko
- Massachusetts General Hospital, Program in Speech and Hearing Bioscience and Technology, 02115, USA.
| |
Collapse
|
46
|
Weissbart H, Kandylaki KD, Reichenbach T. Cortical Tracking of Surprisal during Continuous Speech Comprehension. J Cogn Neurosci 2020; 32:155-166. [DOI: 10.1162/jocn_a_01467] [Citation(s) in RCA: 37] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
Abstract
Speech comprehension requires rapid online processing of a continuous acoustic signal to extract structure and meaning. Previous studies on sentence comprehension have found neural correlates of the predictability of a word given its context, as well as of the precision of such a prediction. However, they have focused on single sentences and on particular words in those sentences. Moreover, they compared neural responses to words with low and high predictability, as well as with low and high precision. However, in speech comprehension, a listener hears many successive words whose predictability and precision vary over a large range. Here, we show that cortical activity in different frequency bands tracks word surprisal in continuous natural speech and that this tracking is modulated by precision. We obtain these results through quantifying surprisal and precision from naturalistic speech using a deep neural network and through relating these speech features to EEG responses of human volunteers acquired during auditory story comprehension. We find significant cortical tracking of surprisal at low frequencies, including the delta band as well as in the higher frequency beta and gamma bands, and observe that the tracking is modulated by the precision. Our results pave the way to further investigate the neurobiology of natural speech comprehension.
Collapse
|
47
|
Bilingual language processing: A meta-analysis of functional neuroimaging studies. Neurosci Biobehav Rev 2020; 108:834-853. [DOI: 10.1016/j.neubiorev.2019.12.014] [Citation(s) in RCA: 47] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2019] [Revised: 12/05/2019] [Accepted: 12/09/2019] [Indexed: 01/27/2023]
|
48
|
Do domain-general executive resources play a role in linguistic prediction? Re-evaluation of the evidence and a path forward. Neuropsychologia 2020; 136:107258. [DOI: 10.1016/j.neuropsychologia.2019.107258] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2019] [Revised: 11/07/2019] [Accepted: 11/07/2019] [Indexed: 12/13/2022]
|
49
|
Lau EF, Namyst A. fMRI evidence that left posterior temporal cortex contributes to N400 effects of predictability independent of congruity. BRAIN AND LANGUAGE 2019; 199:104697. [PMID: 31585316 DOI: 10.1016/j.bandl.2019.104697] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/25/2018] [Revised: 09/17/2019] [Accepted: 09/18/2019] [Indexed: 06/10/2023]
Abstract
Previous electrophysiological work argues that predictability and semantic incongruity rapidly impact comprehension, as indicated by modulation of the N400 component between ~300 and 500 ms. An ongoing question is whether effects of predictability in fact reflect pre-activation in long-term memory as opposed to modulating the kind of integration processes triggered by incongruity. Using fMRI, we compared the impact of predictability and incongruity in adjective-noun phrases, in regions identified with lexical and phrasal localizer scans. We found that predictability impacted activity in left posterior middle temporal gyrus (pMTG), while incongruity impacted activity in left precentral gyrus. Together with parallel data from ERP, these data are consistent with the hypothesis that left pMTG activity is a key contributor to N400 effects of predictability and that the relevant mechanism is reduced activation of stored lexical representations. We tentatively suggest that the left precentral region may play a role in reanalysis when incongruity is encountered.
Collapse
Affiliation(s)
- Ellen F Lau
- University of Maryland, Department of Linguistics, College Park, MD, United States.
| | - Anna Namyst
- University of Maryland, Department of Linguistics, College Park, MD, United States; NIMH MEG Core Facility, National Institutes of Health, Bethesda, MD, United States.
| |
Collapse
|
50
|
Matar S, Pylkkänen L, Marantz A. Left occipital and right frontal involvement in syntactic category prediction: MEG evidence from Standard Arabic. Neuropsychologia 2019; 135:107230. [DOI: 10.1016/j.neuropsychologia.2019.107230] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2019] [Revised: 09/25/2019] [Accepted: 10/11/2019] [Indexed: 02/07/2023]
|