1
|
Jamali M, Grannan B, Cai J, Khanna AR, Muñoz W, Caprara I, Paulk AC, Cash SS, Fedorenko E, Williams ZM. Semantic encoding during language comprehension at single-cell resolution. Nature 2024; 631:610-616. [PMID: 38961302 PMCID: PMC11254762 DOI: 10.1038/s41586-024-07643-2] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2020] [Accepted: 05/31/2024] [Indexed: 07/05/2024]
Abstract
From sequences of speech sounds1,2 or letters3, humans can extract rich and nuanced meaning through language. This capacity is essential for human communication. Yet, despite a growing understanding of the brain areas that support linguistic and semantic processing4-12, the derivation of linguistic meaning in neural tissue at the cellular level and over the timescale of action potentials remains largely unknown. Here we recorded from single cells in the left language-dominant prefrontal cortex as participants listened to semantically diverse sentences and naturalistic stories. By tracking their activities during natural speech processing, we discover a fine-scale cortical representation of semantic information by individual neurons. These neurons responded selectively to specific word meanings and reliably distinguished words from nonwords. Moreover, rather than responding to the words as fixed memory representations, their activities were highly dynamic, reflecting the words' meanings based on their specific sentence contexts and independent of their phonetic form. Collectively, we show how these cell ensembles accurately predicted the broad semantic categories of the words as they were heard in real time during speech and how they tracked the sentences in which they appeared. We also show how they encoded the hierarchical structure of these meaning representations and how these representations mapped onto the cell population. Together, these findings reveal a finely detailed cortical organization of semantic representations at the neuron scale in humans and begin to illuminate the cellular-level processing of meaning during language comprehension.
Collapse
Affiliation(s)
- Mohsen Jamali
- Department of Neurosurgery, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
| | - Benjamin Grannan
- Department of Neurosurgery, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
| | - Jing Cai
- Department of Neurosurgery, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
| | - Arjun R Khanna
- Department of Neurosurgery, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
| | - William Muñoz
- Department of Neurosurgery, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
| | - Irene Caprara
- Department of Neurosurgery, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
| | - Angelique C Paulk
- Department of Neurology, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
- Center for Neurotechnology and Neurorecovery, Department of Neurology, Massachusetts General Hospital, Boston, MA, USA
| | - Sydney S Cash
- Department of Neurology, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
- Center for Neurotechnology and Neurorecovery, Department of Neurology, Massachusetts General Hospital, Boston, MA, USA
| | - Evelina Fedorenko
- Department of Brain and Cognitive Sciences and McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Ziv M Williams
- Department of Neurosurgery, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA.
- Harvard-MIT Division of Health Sciences and Technology, Boston, MA, USA.
- Harvard Medical School, Program in Neuroscience, Boston, MA, USA.
| |
Collapse
|
2
|
Budel G, Jin Y, Van Mieghem P, Kitsak M. Topological properties and organizing principles of semantic networks. Sci Rep 2023; 13:11728. [PMID: 37474614 PMCID: PMC10359341 DOI: 10.1038/s41598-023-37294-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2023] [Accepted: 06/19/2023] [Indexed: 07/22/2023] Open
Abstract
Interpreting natural language is an increasingly important task in computer algorithms due to the growing availability of unstructured textual data. Natural Language Processing (NLP) applications rely on semantic networks for structured knowledge representation. The fundamental properties of semantic networks must be taken into account when designing NLP algorithms, yet they remain to be structurally investigated. We study the properties of semantic networks from ConceptNet, defined by 7 semantic relations from 11 different languages. We find that semantic networks have universal basic properties: they are sparse, highly clustered, and many exhibit power-law degree distributions. Our findings show that the majority of the considered networks are scale-free. Some networks exhibit language-specific properties determined by grammatical rules, for example networks from highly inflected languages, such as e.g. Latin, German, French and Spanish, show peaks in the degree distribution that deviate from a power law. We find that depending on the semantic relation type and the language, the link formation in semantic networks is guided by different principles. In some networks the connections are similarity-based, while in others the connections are more complementarity-based. Finally, we demonstrate how knowledge of similarity and complementarity in semantic networks can improve NLP algorithms in missing link inference.
Collapse
Affiliation(s)
- Gabriel Budel
- Faculty of Electrical Engineering, Mathematics and Computer Science, Delft University of Technology, 2628 CD, Delft, The Netherlands
| | - Ying Jin
- Faculty of Electrical Engineering, Mathematics and Computer Science, Delft University of Technology, 2628 CD, Delft, The Netherlands
| | - Piet Van Mieghem
- Faculty of Electrical Engineering, Mathematics and Computer Science, Delft University of Technology, 2628 CD, Delft, The Netherlands
| | - Maksim Kitsak
- Faculty of Electrical Engineering, Mathematics and Computer Science, Delft University of Technology, 2628 CD, Delft, The Netherlands.
| |
Collapse
|
3
|
Gomes M, Picó Pérez M, Castro I, Moreira P, Ribeiro S, Mota NB, Morgado P. Speech graph analysis in obsessive-compulsive disorder: The relevance of dream reports. J Psychiatr Res 2023; 161:358-363. [PMID: 37004408 DOI: 10.1016/j.jpsychires.2023.03.035] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/20/2022] [Revised: 03/12/2023] [Accepted: 03/27/2023] [Indexed: 04/04/2023]
Abstract
Obsessive-compulsive disorder (OCD) is a distressing disorder characterized by the presence of intrusive thoughts, images or urges (obsessions) and/or behavioral efforts to reduce the anxiety (compulsions). OCD lifetime prevalence varies between 1% and 3% in the general population and there are no reliable markers that support the diagnosis. In order to fill this gap, Computational Psychiatry employs multiple types of quantitative analyses to improve the understanding, diagnosis, prediction, and treatment of mental illnesses including OCD. One of these computational tools is speech graphs analysis. A graph represents a network of nodes connected by edges: in non-semantic speech graphs, nodes correspond to words and edges correspond to the directed link between consecutive words. Using non-semantic speech graphs, we compared free speech samples from OCD patients and healthy controls (HC), to test whether speech graphs analysis can grasp structural differences in speech between these groups. To this end, 39 OCD patients and 37 HC were interviewed and recorded during six types of speech reports: yesterday, dream, old memory, positive image, negative image and neutral image. Also, the Obsessive-Compulsive Inventory-Revised (OCI-R) and the Yale Brown Obsessive-Compulsive Scale (Y-BOCS) were used to assess symptom severity. The graph-theoretical structural analysis of dream reports showed that OCD patients have significantly smaller lexical diversity, lower speech connectedness and a higher recurrence of words in comparison with HC. The other five report types failed to show differences between the groups, adding to the notion that dream reports are especially informative of speech structure in different psychiatric states. Further investigation is necessary to completely assess the potential of this tool in OCD.
Collapse
Affiliation(s)
- Matilde Gomes
- Life and Health Sciences Research Institute (ICVS), School of Medicine, University of Minho, Campus de Gualtar, 4710-057, Braga, Portugal; ICVS/3B's, PT Government Associate Laboratory, Braga, Guimarães, Portugal; Clinical Academic Center - Braga, Portugal
| | - Maria Picó Pérez
- Life and Health Sciences Research Institute (ICVS), School of Medicine, University of Minho, Campus de Gualtar, 4710-057, Braga, Portugal; ICVS/3B's, PT Government Associate Laboratory, Braga, Guimarães, Portugal; Clinical Academic Center - Braga, Portugal; Departamento de Psicología Básica, Clínica y Psicobiología, Universitat Jaume I, Castelló de la Plana, Spain
| | - Inês Castro
- Life and Health Sciences Research Institute (ICVS), School of Medicine, University of Minho, Campus de Gualtar, 4710-057, Braga, Portugal; ICVS/3B's, PT Government Associate Laboratory, Braga, Guimarães, Portugal; Clinical Academic Center - Braga, Portugal
| | - Pedro Moreira
- Life and Health Sciences Research Institute (ICVS), School of Medicine, University of Minho, Campus de Gualtar, 4710-057, Braga, Portugal; ICVS/3B's, PT Government Associate Laboratory, Braga, Guimarães, Portugal; Clinical Academic Center - Braga, Portugal; School of Psychology, University of Minho, Campus de Gualtar, 4710-057, Braga, Portugal
| | - Sidarta Ribeiro
- Brain Institute, Federal University of Rio Grande do Norte, 59078-900, Natal, Brazil
| | - Natália B Mota
- Institute of Psychiatry, Federal University of Rio de Janeiro, Brazil
| | - Pedro Morgado
- Life and Health Sciences Research Institute (ICVS), School of Medicine, University of Minho, Campus de Gualtar, 4710-057, Braga, Portugal; ICVS/3B's, PT Government Associate Laboratory, Braga, Guimarães, Portugal; Clinical Academic Center - Braga, Portugal.
| |
Collapse
|
4
|
Mota NB. How can computational tools help to understand language patterns in mental suffering considering social diversity. Psychiatry Res 2023; 319:114995. [PMID: 36495617 DOI: 10.1016/j.psychres.2022.114995] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/25/2022] [Revised: 11/20/2022] [Accepted: 11/27/2022] [Indexed: 11/30/2022]
Abstract
The complex interaction between biological and social factors challenges measuring human behavior. Language has been a crucial source of information that mirrors inner processes like thoughts. The development of a novel computational strategy that helps to understand language needs to consider social factors that could also impact human behavior. Ten years ago, I developed a computational approach based on graph theory to measure structural aspects of the narrative's mental organization expressed in spontaneous oral reports. It was possible to measure the decrease in narrative graph connectedness associated with the schizophrenia diagnosis and negative symptoms severity. However, I was worried that the psychiatric field neglected factors from diverse social realities (such as poor access to education). Formal education impacts language by mastering grammar and syntax. Changes in language structure could be related to symptoms and lack of exposure to formal education. Indeed, the same connectedness markers increase according to typical cognitive and academic development. In this paper, I describe the reasons and methods for investigating both factors (psychiatric symptoms and formal education) on language patterns. Further, I evaluate concerns and future challenges of using computational strategies that include social diversity in mental health conditions.
Collapse
Affiliation(s)
- Natália Bezerra Mota
- Institute of Psychiatry at Federal University of Rio de Janeiro - IPUB/UFRJ, Rio de Janeiro, Brazil; Research department at Motrix Lab - Motrix, Rio de Janeiro, Brazil.
| |
Collapse
|
5
|
Kölbl M, Kyogoku Y, Philipp JN, Richter M, Rietdorf C, Yousef T. Beyond the Failure of Direct-Matching in Keyword Evaluation: A Sketch of a Graph Based Solution. Front Artif Intell 2022; 5:801564. [PMID: 35402902 PMCID: PMC8988042 DOI: 10.3389/frai.2022.801564] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2021] [Accepted: 02/23/2022] [Indexed: 11/13/2022] Open
Abstract
The starting point of this paper is the observation that methods based on the direct match of keywords are inadequate because they do not consider the cognitive ability of concept formation and abstraction. We argue that keyword evaluation needs to be based on a semantic model of language capturing the semantic relatedness of words to satisfy the claim of the human-like ability of concept formation and abstraction and achieve better evaluation results. Evaluation of keywords is difficult since semantic informedness is required for this purpose. This model must be capable of identifying semantic relationships such as synonymy, hypernymy, hyponymy, and location-based abstraction. For example, when gathering texts from online sources, one usually finds a few keywords with each text. Still, these keyword sets are neither complete for the text nor are they in themselves closed, i.e., in most cases, the keywords are a random subset of all possible keywords and not that informative w.r.t. the complete keyword set. Therefore all algorithms based on this cannot achieve good evaluation results and provide good/better keywords or even a complete keyword set for a text. As a solution, we propose a word graph that captures all these semantic relationships for a given language. The problem with the hyponym/hyperonym relationship is that, unlike synonyms, it is not bidirectional. Thus the space of keyword sets requires a metric that is non-symmetric, in other words, a quasi-metric. We sketch such a metric that works on our graph. Since it is nearly impossible to obtain such a complete word graph for a language, we propose for the keyword task a simpler graph based on the base text upon which the keyword sets should be evaluated. This reduction is usually sufficient for evaluating keyword sets.
Collapse
|
6
|
Kimmelman V, Komarova A, Luchkova L, Vinogradova V, Alekseeva O. Exploring Networks of Lexical Variation in Russian Sign Language. Front Psychol 2022; 12:740734. [PMID: 35069319 PMCID: PMC8766300 DOI: 10.3389/fpsyg.2021.740734] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2021] [Accepted: 12/13/2021] [Indexed: 11/25/2022] Open
Abstract
When describing variation at the lexical level in sign languages, researchers often distinguish between phonological and lexical variants, using the following principle: if two signs differ in only one of the major phonological components (handshape, orientation, movement, location), then they are considered phonological variants, otherwise they are considered separate lexemes. We demonstrate that this principle leads to contradictions in some simple and more complex cases of variation. We argue that it is useful to visualize the relations between variants as graphs, and we describe possible networks of variants that can arise using this visualization tool. We further demonstrate that these scenarios in fact arise in the case of variation in color terms and kinship terms in Russian Sign Language (RSL), using a newly created database of lexical variation in RSL. We show that it is possible to develop a set of formal rules that can help distinguish phonological and lexical variation also in the problematic scenarios. However, we argue that it might be a mistake to dismiss the actual patterns of variant relations in order to arrive at the binary lexical vs. phonological variant opposition.
Collapse
Affiliation(s)
- Vadim Kimmelman
- Department of Linguistic, Literary, and Aesthetic Studies, University of Bergen, Bergen, Norway
| | - Anna Komarova
- Galina Zaitseva Centre for Deaf Studies and Sign Language, Moscow, Russia.,Department of English Stylistics, English Language Faculty, Moscow State Linguistic University, Moscow, Russia
| | - Lyudmila Luchkova
- Inclusive Programs Department, Garage Museum of Contemporary Art, Moscow, Russia
| | | | - Oksana Alekseeva
- Department of English Stylistics, English Language Faculty, Moscow State Linguistic University, Moscow, Russia
| |
Collapse
|
7
|
Jang H, Yoon B. TechWordNet: Development of semantic relation for technology information analysis using F-term and natural language processing. Inf Process Manag 2021. [DOI: 10.1016/j.ipm.2021.102752] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
|
8
|
Cheng M, Yin C, Nazarian S, Bogdan P. Deciphering the laws of social network-transcendent COVID-19 misinformation dynamics and implications for combating misinformation phenomena. Sci Rep 2021; 11:10424. [PMID: 34001937 PMCID: PMC8128875 DOI: 10.1038/s41598-021-89202-7] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2021] [Accepted: 04/21/2021] [Indexed: 02/03/2023] Open
Abstract
The global rise of COVID-19 health risk has triggered the related misinformation infodemic. We present the first analysis of COVID-19 misinformation networks and determine few of its implications. Firstly, we analyze the spread trends of COVID-19 misinformation and discover that the COVID-19 misinformation statistics are well fitted by a log-normal distribution. Secondly, we form misinformation networks by taking individual misinformation as a node and similarity between misinformation nodes as links, and we decipher the laws of COVID-19 misinformation network evolution: (1) We discover that misinformation evolves to optimize the network information transfer over time with the sacrifice of robustness. (2) We demonstrate the co-existence of fit get richer and rich get richer phenomena in misinformation networks. (3) We show that a misinformation network evolution with node deletion mechanism captures well the public attention shift on social media. Lastly, we present a network science inspired deep learning framework to accurately predict which Twitter posts are likely to become central nodes (i.e., high centrality) in a misinformation network from only one sentence without the need to know the whole network topology. With the network analysis and the central node prediction, we propose that if we correctly suppress certain central nodes in the misinformation network, the information transfer of network would be severely impacted.
Collapse
Affiliation(s)
- Mingxi Cheng
- University of Southern California, Los Angeles, USA
| | | | | | - Paul Bogdan
- University of Southern California, Los Angeles, USA.
| |
Collapse
|
9
|
Corcoran CM. Commentary on "Lower speech connectedness linked to incidence of psychosis in people at clinical high risk": The promise of graph theory and network neuroscience. Schizophr Res 2021; 228:481-482. [PMID: 33046332 PMCID: PMC7987843 DOI: 10.1016/j.schres.2020.09.024] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/07/2020] [Revised: 09/26/2020] [Accepted: 09/26/2020] [Indexed: 10/23/2022]
|
10
|
|
11
|
A Study on Differences between Simplified and Traditional Chinese Based on Complex Network Analysis of the Word Co-Occurrence Networks. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2020; 2020:8863847. [PMID: 33343654 PMCID: PMC7728479 DOI: 10.1155/2020/8863847] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/22/2020] [Revised: 09/15/2020] [Accepted: 10/19/2020] [Indexed: 11/17/2022]
Abstract
Currently, most work on comparing differences between simplified and traditional Chinese only focuses on the character or lexical level, without taking the global differences into consideration. In order to solve this problem, this paper proposes to use complex network analysis of word co-occurrence networks, which have been successfully applied to the language analysis research and can tackle global characters and explore the differences between simplified and traditional Chinese. Specially, we first constructed a word co-occurrence network for simplified and traditional Chinese using selected news corpora. Then, the complex network analysis methods were performed, including network statistics analysis, kernel lexicon comparison, and motif analysis, to gain a global understanding of these networks. After that, the networks were compared based on the properties obtained. Through comparison, we can obtain three interesting results: first, the co-occurrence networks of simplified Chinese and traditional Chinese are both small-world and scale-free networks. However, given the same corpus size, the co-occurrence networks of traditional Chinese tend to have more nodes, which may be due to a large number of one-to-many character/word mappings from simplified Chinese to traditional Chinese; second, since traditional Chinese retains more ancient Chinese words and uses fewer weak verbs, the traditional Chinese kernel lexicons have more entries than the simplified Chinese kernel lexicons; third, motif analysis shows that there is no difference between the simplified Chinese network and the corresponding traditional Chinese network, which means that simplified and traditional Chinese are semantically consistent.
Collapse
|
12
|
Corcoran CM, Mittal VA, Bearden CE, E Gur R, Hitczenko K, Bilgrami Z, Savic A, Cecchi GA, Wolff P. Language as a biomarker for psychosis: A natural language processing approach. Schizophr Res 2020; 226:158-166. [PMID: 32499162 PMCID: PMC7704556 DOI: 10.1016/j.schres.2020.04.032] [Citation(s) in RCA: 77] [Impact Index Per Article: 19.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/28/2020] [Revised: 04/22/2020] [Accepted: 04/24/2020] [Indexed: 12/21/2022]
Abstract
Human ratings of conceptual disorganization, poverty of content, referential cohesion and illogical thinking have been shown to predict psychosis onset in prospective clinical high risk (CHR) cohort studies. The potential value of linguistic biomarkers has been significantly magnified, however, by recent advances in natural language processing (NLP) and machine learning (ML). Such methodologies allow for the rapid and objective measurement of language features, many of which are not easily recognized by human raters. Here we review the key findings on language production disturbance in psychosis. We also describe recent advances in the computational methods used to analyze language data, including methods for the automatic measurement of discourse coherence, syntactic complexity, poverty of content, referential coherence, and metaphorical language. Linguistic biomarkers of psychosis risk are now undergoing cross-validation, with attention to harmonization of methods. Future directions in extended CHR networks include studies of sources of variance, and combination with other promising biomarkers of psychosis risk, such as cognitive and sensory processing impairments likely to be related to language. Implications for the broader study of social communication, including reciprocal prosody, face expression and gesture, are discussed.
Collapse
Affiliation(s)
- Cheryl M Corcoran
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Vijay A Mittal
- Department of Psychology, Northwestern University, Evanston, IL, USA
| | - Carrie E Bearden
- Department of Psychiatry and Biobehavioral Sciences, University of California Los Angeles, CA, USA; Department of Psychology, Semel Institute for Neuroscience and Human Behavior, Brain Research Institute, University of California Los Angeles, CA, USA; Department of Psychology, University of California Los Angeles, CA USA
| | - Raquel E Gur
- Brain Behavior Laboratory, Neuropsychiatry Division, Department of Psychiatry, Philadelphia, PA 19104, USA
| | - Kasia Hitczenko
- Department of Linguistics, Northwestern University, Evanston, IL, USA
| | - Zarina Bilgrami
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Aleksandar Savic
- Department of Diagnostics and Intensive Care, University Psychiatric Hospital Vrapce, Zagreb, Croatia
| | - Guillermo A Cecchi
- Computational Biology Center-Neuroscience, IBM T.J. Watson Research Center, Yorktown Heights, NY, USA
| | - Phillip Wolff
- Department of Psychology, Emory University, Atlanta, GA, USA.
| |
Collapse
|
13
|
Castro N, Stella M, Siew CSQ. Quantifying the Interplay of Semantics and Phonology During Failures of Word Retrieval by People With Aphasia Using a Multiplex Lexical Network. Cogn Sci 2020; 44:e12881. [PMID: 32893389 DOI: 10.1111/cogs.12881] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2019] [Revised: 07/06/2020] [Accepted: 07/14/2020] [Indexed: 11/30/2022]
Abstract
Investigating instances where lexical selection fails can lead to deeper insights into the cognitive machinery and architecture supporting successful word retrieval and speech production. In this paper, we used a multiplex lexical network approach that combines semantic and phonological similarities among words to model the structure of the mental lexicon. Network measures at different levels of analysis (degree, network distance, and closeness centrality) were used to investigate the influence of network structure on picture naming accuracy and errors by people with Anomic, Broca's, Conduction, and Wernicke's aphasia. Our results reveal that word retrieval is influenced by the multiplex lexical network structure in at least two ways-(a) the accuracy of production and error type on incorrect productions were influenced by the degree and closeness centrality of the target word, and (b) error type also varied in terms of network distance between the target word and produced error word. Taken together, the analyses demonstrate that network science techniques, particularly the use of the multiplex lexical network to simultaneously represent semantic and phonological relationships among words, reveal how the structure of the mental lexicon influences language processes beyond traditionally examined psycholinguistic variables. We propose a framework for how the multiplex lexical network approach allows for understanding the influence of mental lexicon structure on word retrieval processes, with an eye toward a better understanding of the nature of clinical impairments, like aphasia.
Collapse
Affiliation(s)
- Nichol Castro
- Department of Psychology, Georgia Institute of Technology.,Department of Speech and Hearing Sciences, University of Washington
| | - Massimo Stella
- Institute for Complex Systems Simulation, University of Southampton.,Complex Science Consulting
| | | |
Collapse
|
14
|
Thompson B, Roberts SG, Lupyan G. Cultural influences on word meanings revealed through large-scale semantic alignment. Nat Hum Behav 2020; 4:1029-1038. [PMID: 32778801 DOI: 10.1038/s41562-020-0924-8] [Citation(s) in RCA: 34] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2019] [Accepted: 07/02/2020] [Indexed: 01/01/2023]
Abstract
If the structure of language vocabularies mirrors the structure of natural divisions that are universally perceived, then the meanings of words in different languages should closely align. By contrast, if shared word meanings are a product of shared culture, history and geography, they may differ between languages in substantial but predictable ways. Here, we analysed the semantic neighbourhoods of 1,010 meanings in 41 languages. The most-aligned words were from semantic domains with high internal structure (number, quantity and kinship). Words denoting natural kinds, common actions and artefacts aligned much less well. Languages that are more geographically proximate, more historically related and/or spoken by more-similar cultures had more aligned word meanings. These results provide evidence that the meanings of common words vary in ways that reflect the culture, history and geography of their users.
Collapse
Affiliation(s)
- Bill Thompson
- Department of Computer Science, Princeton University, Princeton, NJ, USA.
| | - Seán G Roberts
- School of English, Communication and Philosophy, Cardiff University, Cardiff, UK.,Department of Anthropology and Archaeology, University of Bristol, Bristol, UK
| | - Gary Lupyan
- Department of Psychology, University of Wisconsin-Madison, Madison, WI, USA
| |
Collapse
|
15
|
Corcoran CM, Cecchi GA. Using Language Processing and Speech Analysis for the Identification of Psychosis and Other Disorders. BIOLOGICAL PSYCHIATRY: COGNITIVE NEUROSCIENCE AND NEUROIMAGING 2020; 5:770-779. [PMID: 32771179 DOI: 10.1016/j.bpsc.2020.06.004] [Citation(s) in RCA: 31] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/16/2020] [Revised: 06/09/2020] [Accepted: 06/09/2020] [Indexed: 01/12/2023]
Abstract
Increasingly, data-driven methods have been implemented to understand psychopathology. Language is the main source of information in psychiatry and represents "big data" at the level of the individual. Language and behavior are amenable to computational natural language processing (NLP) analytics, which may help operationalize the mental status examination. In this review, we highlight the application of NLP to schizophrenia and its risk states as an exemplar of its use, operationalizing tangential and concrete speech as reductions in semantic coherence and syntactic complexity, respectively. Other clinical applications are reviewed, including forecasting suicide risk and detecting intoxication. Challenges and future directions are discussed, including biomarker development, harmonization, and application of NLP more broadly to behavior, including intonation/prosody, facial expression and gesture, and the integration of these in dyads and during discourse. Similar NLP analytics can also be applied beyond humans to behavioral motifs across species, important for modeling psychopathology in animal models. Finally, clinical neuroscience can inform the development of artificial intelligence.
Collapse
Affiliation(s)
- Cheryl Mary Corcoran
- Icahn School of Medicine at Mount Sinai, New York; James J. Peters Veterans Administration Medical Center, Bronx.
| | - Guillermo A Cecchi
- Thomas J. Watson Research Center, IBM Corporation, Yorktown Heights, New York
| |
Collapse
|
16
|
Solé R, Valverde S. Evolving complexity: how tinkering shapes cells, software and ecological networks. Philos Trans R Soc Lond B Biol Sci 2020; 375:20190325. [PMID: 32089118 PMCID: PMC7061959 DOI: 10.1098/rstb.2019.0325] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/03/2020] [Indexed: 01/09/2023] Open
Abstract
A common trait of complex systems is that they can be represented by means of a network of interacting parts. It is, in fact, the network organization (more than the parts) that largely conditions most higher-level properties, which are not reducible to the properties of the individual parts. Can the topological organization of these webs provide some insight into their evolutionary origins? Both biological and artificial networks share some common architectural traits. They are often heterogeneous and sparse, and most exhibit different types of correlations, such as nestedness, modularity or hierarchical patterns. These properties have often been attributed to the selection of functionally meaningful traits. However, a proper formulation of generative network models suggests a rather different picture. Against the standard selection-optimization argument, some networks reveal the inevitable generation of complex patterns resulting from reuse and can be modelled using duplication-rewiring rules lacking functionality. These give rise to the observed heterogeneous, scale-free and modular architectures. Here, we examine the evidence for tinkering in cellular, technological and ecological webs and its impact in shaping their architecture. Our analysis suggests a serious consideration of the role played by selection as the origin of network topology. Instead, we suggest that the amplification processes associated with reuse might shape these graphs at the topological level. In biological systems, selection forces would take advantage of emergent patterns. This article is part of the theme issue 'Unifying the essential concepts of biological networks: biological insights and philosophical foundations'.
Collapse
Affiliation(s)
- Ricard Solé
- ICREA-Complex Systems Lab, Universitat Pompeu Fabra, Dr. Aiguader 88, Barcelona 08003, Spain
- Institut de Biologia Evolutiva (UPF-CSIC), Pg. Maritim 37, Barcelona 08003, Spain
- Santa Fe Institute, 1399 Hyde Park Road, Santa Fe, NM 87501, USA
- European Centre for Living Technology, S. Marco 2940, 30124 Venice, Italy
| | - Sergi Valverde
- European Centre for Living Technology, S. Marco 2940, 30124 Venice, Italy
- Evolution of Technology Lab, Institut de Biologia Evolutiva (UPF-CSIC), Pg. Maritim 37, Barcelona 08003, Spain
| |
Collapse
|
17
|
Castro N. The multiplex structure of the mental lexicon influences picture naming in people with aphasia. JOURNAL OF COMPLEX NETWORKS 2019; 7:913-931. [PMID: 31984136 PMCID: PMC6961494 DOI: 10.1093/comnet/cnz012] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/26/2018] [Accepted: 02/25/2019] [Indexed: 06/10/2023]
Abstract
An emerging area of research in cognitive science is the utilization of networks to model the structure and processes of the mental lexicon in healthy and clinical populations, like aphasia. Previous research has focused on only one type of word similarity at a time (e.g., semantic relationships), even though words are multi-faceted. Here, we investigate lexical retrieval in a picture naming task from people with Broca's and Wernicke's aphasia and healthy controls by utilizing a multiplex network structure that accounts for the interplay between multiple semantic and phonological relationships among words in the mental lexicon. Extending upon previous work, we focused on the global network measure of closeness centrality which is known to capture spreading activation, an important process supporting lexical retrieval. We conducted a series of logistic regression models predicting the probability of correct picture naming. We tested whether multiplex closeness centrality was a better predictor of picture naming performance than single-layer closeness centralities, other network measures assessing local and meso-scale structure, psycholinguistic variables and group differences. We also examined production gaps, or the difference between the likelihood of producing a word with the lowest and highest closeness centralities. Our results indicated that multiplex closeness centrality was a significant predictor of picture naming performance, where words with high closeness centrality were more likely to be produced than words with low closeness centrality. Additionally, multiplex closeness centrality outperformed single-layer closeness centralities and other multiplex network measures, and remained a significant predictor after controlling for psycholinguistic variables and group differences. Furthermore, we found that the facilitative effect of closeness centrality was similar for both types of aphasia. Our results underline the importance of integrating multiple measures of word similarities in cognitive language networks for better understanding lexical retrieval in aphasia, with an eye towards future clinical applications.
Collapse
Affiliation(s)
- Nichol Castro
- School of Psychology, Georgia Institute of Technology, 654 Cherry Street, Atlanta, Georgia, 30332 USA
| |
Collapse
|
18
|
Viability in Multiplex Lexical Networks and Machine Learning Characterizes Human Creativity. BIG DATA AND COGNITIVE COMPUTING 2019. [DOI: 10.3390/bdcc3030045] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Previous studies have shown how individual differences in creativity relate to differences in the structure of semantic memory. However, the latter is only one aspect of the whole mental lexicon, a repository of conceptual knowledge that is considered to simultaneously include multiple types of conceptual similarities. In the current study, we apply a multiplex network approach to compute a representation of the mental lexicon combining semantics and phonology and examine how it relates to individual differences in creativity. This multiplex combination of 150,000 phonological and semantic associations identifies a core of words in the mental lexicon known as viable cluster, a kernel containing simpler to parse, more general, concrete words acquired early during language learning. We focus on low (N = 47) and high (N = 47) creative individuals’ performance in generating animal names during a semantic fluency task. We model this performance as the outcome of a mental navigation on the multiplex lexical network, going within, outside, and in-between the viable cluster. We find that low and high creative individuals differ substantially in their access to the viable cluster during the semantic fluency task. Higher creative individuals tend to access the viable cluster less frequently, with a lower uncertainty/entropy, reaching out to more peripheral words and covering longer multiplex network distances between concepts in comparison to lower creative individuals. We use these differences for constructing a machine learning classifier of creativity levels, which leads to an accuracy of 65 . 0 ± 0 . 9 % and an area under the curve of 68 . 0 ± 0 . 8 % , which are both higher than the random expectation of 50%. These results highlight the potential relevance of combining psycholinguistic measures with multiplex network models of the mental lexicon for modelling mental navigation and, consequently, classifying people automatically according to their creativity levels.
Collapse
|
19
|
Kannampallil T, Awadalla SS, Jones S, Abraham J. A graph-based approach for characterizing resident and nurse handoff conversations. J Biomed Inform 2019; 94:103178. [PMID: 31002936 DOI: 10.1016/j.jbi.2019.103178] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2018] [Revised: 04/13/2019] [Accepted: 04/15/2019] [Indexed: 11/26/2022]
Abstract
Prior research has used a variety of qualitative and quantitative approaches for studying handoff communication. Due to the dynamic and interactive nature of handoffs, characterizing the structure and content of these conversations is challenging. In this paper, we use a graph-based approach to characterize handoff communication as a conversation network. Conversation networks were used to compare the structural properties of resident-resident and nurse-nurse handoff communication. Resident (n = 149) and nurse (n = 126) handoff conversations from general medicine units were coded using a previously validated clinical content framework. The coded conversations were then translated into separate resident and nurse conversation networks, and were compared using 11 network measures. Transition probabilities were used to identify commonly repeating sub-networks within resident and nurse conversations. There were significant differences between resident and nurse conversation networks in 10 of the 11 network measures. There were also significant differences in the structure of conversations: compared to resident conversations, nurse conversations were focused on fewer clinical content categories and had more branching and switching between clinical content categories; however, there were clinically-relevant organic relationships in the order of presentation of clinical content among both resident and nurse handoff conversations. We discuss the potential for using graph-based approach as an alternative method for characterizing interactive conversations and also suggest future directions for using network-based approaches for analyzing handoff conversations.
Collapse
Affiliation(s)
- Thomas Kannampallil
- Department of Anesthesiology & Institute for Informatics, School of Medicine, Washington University in St Louis, St. Louis, MO, United States.
| | - Saria S Awadalla
- Division of Epidemiology & Biostatistics, School of Public Health, University of Illinois at Chicago, Chicago, IL, United States
| | - Steve Jones
- Department of Communication, College of Liberal Arts and Sciences, University of Illinois at Chicago, Chicago, IL, United States
| | - Joanna Abraham
- Department of Anesthesiology & Institute for Informatics, School of Medicine, Washington University in St Louis, St. Louis, MO, United States
| |
Collapse
|
20
|
Modelling Early Word Acquisition through Multiplex Lexical Networks and Machine Learning. BIG DATA AND COGNITIVE COMPUTING 2019. [DOI: 10.3390/bdcc3010010] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Early language acquisition is a complex cognitive task. Recent data-informed approaches showed that children do not learn words uniformly at random but rather follow specific strategies based on the associative representation of words in the mental lexicon, a conceptual system enabling human cognitive computing. Building on this evidence, the current investigation introduces a combination of machine learning techniques, psycholinguistic features (i.e., frequency, length, polysemy and class) and multiplex lexical networks, representing the semantics and phonology of the mental lexicon, with the aim of predicting normative acquisition of 529 English words by toddlers between 22 and 26 months. Classifications using logistic regression and based on four psycholinguistic features achieve the best baseline cross-validated accuracy of 61.7% when half of the words have been acquired. Adding network information through multiplex closeness centrality enhances accuracy (up to 67.7%) more than adding multiplex neighbourhood density/degree (62.4%) or multiplex PageRank versatility (63.0%) or the best single-layer network metric, i.e., free association degree (65.2%), instead. Multiplex closeness operationalises the structural relevance of words for semantic and phonological information flow. These results indicate that the whole, global, multi-level flow of information and structure of the mental lexicon influence word acquisition more than single-layer or local network features of words when considered in conjunction with language norms. The highlighted synergy of multiplex lexical structure and psycholinguistic norms opens new ways for understanding human cognition and language processing through powerful and data-parsimonious cognitive computing approaches.
Collapse
|
21
|
Benham S, Goffman L, Schweickert R. An Application of Network Science to Phonological Sequence Learning in Children With Developmental Language Disorder. JOURNAL OF SPEECH, LANGUAGE, AND HEARING RESEARCH : JSLHR 2018; 61:2275-2291. [PMID: 30167667 PMCID: PMC6195047 DOI: 10.1044/2018_jslhr-l-18-0036] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/29/2018] [Accepted: 05/06/2018] [Indexed: 05/16/2023]
Abstract
PURPOSE Network science has been a valuable tool in language research for investigating relationships between complex linguistic elements but has not yet been applied to sound sequencing in production. In the present work, we used standard error-based accuracy and articulatory kinematic approaches as well as novel measures from network science to evaluate variability and sequencing errors in speech production in children with developmental language disorder (DLD; aka specific language impairment). METHOD Twelve preschoolers with DLD and 12 age-matched controls participated in a 3-day novel word learning study. Transcription and articulatory movement data were collected to measure accuracy and variability of productions, and networks of speech productions were generated to analyze syllable co-occurrence patterns. RESULTS Results indicated that children with DLD were less accurate than children with typical language at the segmental level. Crucially, these findings did not align with performance at the articulatory level, where there were no differences in movement variability between children with DLD and those with typical language. Network analyses revealed characteristics that were not captured by standard measures of phonetic accuracy, including a larger inventory of syllable forms, more connections between the forms, and less consistent production patterns. CONCLUSIONS Network science provides significant insights into phonological learning trajectories in children with DLD and their typically developing peers. Importantly, errors in word production by children with DLD do not surface as a result of weakness in articulatory control. Instead, results suggest that speech errors in DLD may relate to deficits in sound sequencing.
Collapse
Affiliation(s)
- Sara Benham
- School of Behavioral and Brain Sciences, University of Texas at Dallas
| | - Lisa Goffman
- School of Behavioral and Brain Sciences, University of Texas at Dallas
| | | |
Collapse
|
22
|
Goh WP, Luke KK, Cheong SA. Functional shortcuts in language co-occurrence networks. PLoS One 2018; 13:e0203025. [PMID: 30204769 PMCID: PMC6133353 DOI: 10.1371/journal.pone.0203025] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2016] [Accepted: 08/14/2018] [Indexed: 11/18/2022] Open
Abstract
Human language contains regular syntactic structures and grammatical patterns that should be detectable in their co-occurence networks. However, most standard complex network measures can hardly differentiate between co-occurence networks built from an empirical corpus and a body of scrambled text. In this work, we employ a motif extraction procedure to show that empirical networks have much greater motif densities. We demonstrate that motifs function as efficient and effective shortcuts in language networks, potentially explaining why we are able to generate and decipher language expressions so rapidly. Finally we suggest a link between motifs and constructions in Construction Grammar as well as speculate on the mechanisms behind the emergence of constructions in the early stages of language acquisition.
Collapse
Affiliation(s)
- Woon Peng Goh
- Interdisciplinary Graduate School, Nanyang Technological University, Singapore, Singapore
- Complexity Institute, Nanyang Technological University, Singapore, Singapore
| | - Kang-Kwong Luke
- School of Humanities, Nanyang Technological University, Singapore, Singapore
| | - Siew Ann Cheong
- Complexity Institute, Nanyang Technological University, Singapore, Singapore
- School of Physical and Mathematical Sciences, Nanyang Technological University, Singapore, Singapore
| |
Collapse
|
23
|
Stella M, Beckage NM, Brede M, De Domenico M. Multiplex model of mental lexicon reveals explosive learning in humans. Sci Rep 2018; 8:2259. [PMID: 29396497 PMCID: PMC5797130 DOI: 10.1038/s41598-018-20730-5] [Citation(s) in RCA: 46] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2017] [Accepted: 01/19/2018] [Indexed: 11/28/2022] Open
Abstract
Word similarities affect language acquisition and use in a multi-relational way barely accounted for in the literature. We propose a multiplex network representation of this mental lexicon of word similarities as a natural framework for investigating large-scale cognitive patterns. Our representation accounts for semantic, taxonomic, and phonological interactions and it identifies a cluster of words which are used with greater frequency, are identified, memorised, and learned more easily, and have more meanings than expected at random. This cluster emerges around age 7 through an explosive transition not reproduced by null models. We relate this explosive emergence to polysemy - redundancy in word meanings. Results indicate that the word cluster acts as a core for the lexicon, increasing both lexical navigability and robustness to linguistic degradation. Our findings provide quantitative confirmation of existing conjectures about core structure in the mental lexicon and the importance of integrating multi-relational word-word interactions in psycholinguistic frameworks.
Collapse
Affiliation(s)
- Massimo Stella
- Institute for Complex Systems Simulation, University of Southampton, Southampton, UK.
- Fondazione Bruno Kessler, Trento, Italy.
| | - Nicole M Beckage
- Department of Electrical Engineering and Computer Science, University of Kansas, Kansas, USA
| | - Markus Brede
- Institute for Complex Systems Simulation, University of Southampton, Southampton, UK
| | - Manlio De Domenico
- Fondazione Bruno Kessler, Trento, Italy
- School of Computer Science and Mathematics, Universitat Rovira i Virgili, Virgili, Spain
| |
Collapse
|
24
|
Altszyler E, Ribeiro S, Sigman M, Fernández Slezak D. The interpretation of dream meaning: Resolving ambiguity using Latent Semantic Analysis in a small corpus of text. Conscious Cogn 2017; 56:178-187. [PMID: 28943127 DOI: 10.1016/j.concog.2017.09.004] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2017] [Revised: 08/25/2017] [Accepted: 09/10/2017] [Indexed: 12/27/2022]
Abstract
Computer-based dreams content analysis relies on word frequencies within predefined categories in order to identify different elements in text. As a complementary approach, we explored the capabilities and limitations of word-embedding techniques to identify word usage patterns among dream reports. These tools allow us to quantify words associations in text and to identify the meaning of target words. Word-embeddings have been extensively studied in large datasets, but only a few studies analyze semantic representations in small corpora. To fill this gap, we compared Skip-gram and Latent Semantic Analysis (LSA) capabilities to extract semantic associations from dream reports. LSA showed better performance than Skip-gram in small size corpora in two tests. Furthermore, LSA captured relevant word associations in dream collection, even in cases with low-frequency words or small numbers of dreams. Word associations in dreams reports can thus be quantified by LSA, which opens new avenues for dream interpretation and decoding.
Collapse
Affiliation(s)
- Edgar Altszyler
- Depto. de Computación, Universidad de Buenos Aires, Ciudad universitaria, CONICET, Pabellon 1, C1428EGA, Argentina.
| | - Sidarta Ribeiro
- Instituto do Cérebro, Universidade Federal do Rio Grande do Norte, Natal, Brazil
| | | | - Diego Fernández Slezak
- Depto. de Computación, Universidad de Buenos Aires, Ciudad universitaria, CONICET, Pabellon 1, C1428EGA, Argentina
| |
Collapse
|
25
|
Stella M, Beckage NM, Brede M. Multiplex lexical networks reveal patterns in early word acquisition in children. Sci Rep 2017; 7:46730. [PMID: 28436476 PMCID: PMC5402256 DOI: 10.1038/srep46730] [Citation(s) in RCA: 53] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2016] [Accepted: 03/20/2017] [Indexed: 11/18/2022] Open
Abstract
Network models of language have provided a way of linking cognitive processes to language structure. However, current approaches focus only on one linguistic relationship at a time, missing the complex multi-relational nature of language. In this work, we overcome this limitation by modelling the mental lexicon of English-speaking toddlers as a multiplex lexical network, i.e. a multi-layered network where N = 529 words/nodes are connected according to four relationship: (i) free association, (ii) feature sharing, (iii) co-occurrence, and (iv) phonological similarity. We investigate the topology of the resulting multiplex and then proceed to evaluate single layers and the full multiplex structure on their ability to predict empirically observed age of acquisition data of English speaking toddlers. We find that the multiplex topology is an important proxy of the cognitive processes of acquisition, capable of capturing emergent lexicon structure. In fact, we show that the multiplex structure is fundamentally more powerful than individual layers in predicting the ordering with which words are acquired. Furthermore, multiplex analysis allows for a quantification of distinct phases of lexical acquisition in early learners: while initially all the multiplex layers contribute to word learning, after about month 23 free associations take the lead in driving word acquisition.
Collapse
Affiliation(s)
- Massimo Stella
- Institute for Complex System Simulations, University of Southampton, UK
| | - Nicole M. Beckage
- Department of Electrical Engineering and Computer Science, University of Kansas, USA
| | - Markus Brede
- Institute for Complex System Simulations, University of Southampton, UK
| |
Collapse
|
26
|
Inácio-Barbosa LTM, Teixeira F, Reis VM, Ribeiro LODM, Boente ANP, Fróes MM. Evaluating the Effects of Artistic Impregnation of Scientific Objects on Qualifiers of Perceptual Assessment Through Self-Report Questionnaires: Implications for an Emerging Experimental Neuroepistemology. JOURNAL OF COGNITION AND CULTURE 2017. [DOI: 10.1163/15685373-12342195] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Abstract
This report summarizes our preliminary efforts to delineate, at controlled experimentation, the impact of art, and artistic aesthetics, on the way we assess science. Our results suggest that the analytic-synthetic axis of perceptual cognitive handling of the scientific object is unaffected by its artistic non-conventional contextualization, while cognitive abstraction, positive emotions and aesthetic impressions are favoured. Implications to philosophical foundations of the Cartesian scientific method are considered.
Collapse
Affiliation(s)
- Leonardo Toledo Miranda Inácio-Barbosa
- Laboratory of Experimental Neuroepistemology, Federal University of Rio de JaneiroAvenida Athos da Silveira 274, ccmn Cidade Universitária, rj 21941-916Brazil
| | - Fernanda Teixeira
- Laboratory of Experimental Neuroepistemology, Federal University of Rio de JaneiroAvenida Athos da Silveira 274, ccmn Cidade Universitária, rj 21941-916Brazil
| | - Vivian Maia Reis
- Laboratory of Experimental Neuroepistemology, Federal University of Rio de JaneiroLaboratory of Technological Innovation and Decision Making, Institute Tercio Pacitti of Computational Applications and Research, Federal University of Rio de JaneiroFaculty of Technological Education of the State of Rio de Janeiro (FAETERJ)Avenida Athos da Silveira 274, ccmn Cidade Universitária, rj 21941-916Brazil
| | - Luis Otávio de Marins Ribeiro
- Laboratory of Experimental Neuroepistemology, Federal University of Rio de JaneiroLaboratory of Technological Innovation and Decision Making, Institute Tercio Pacitti of Computational Applications and Research, Federal University of Rio de JaneiroFaculty of Technological Education of the State of Rio de Janeiro (FAETERJ)Avenida Athos da Silveira 274, ccmn Cidade Universitária, rj 21941-916Brazil
| | - Alfredo Nazareno Pereira Boente
- Laboratory of Experimental Neuroepistemology, Federal University of Rio de JaneiroLaboratory of Technological Innovation and Decision Making, Institute Tercio Pacitti of Computational Applications and Research, Federal University of Rio de JaneiroFaculty of Technological Education of the State of Rio de Janeiro (FAETERJ)Avenida Athos da Silveira 274, ccmn Cidade Universitária, rj 21941-916Brazil
| | - Maira Monteiro Fróes
- *Corresponding author, e-mail:
- Laboratory of Experimental Neuroepistemology, Graduate Program of History of Sciences and Technics and Epistemology, Federal University of Rio de JaneiroAvenida Athos da Silveira 274, ccmn Cidade Universitária, rj 21941-916Brazil
| |
Collapse
|
27
|
Akimushkin C, Amancio DR, Oliveira ON. Text Authorship Identified Using the Dynamics of Word Co-Occurrence Networks. PLoS One 2017; 12:e0170527. [PMID: 28125703 PMCID: PMC5268788 DOI: 10.1371/journal.pone.0170527] [Citation(s) in RCA: 51] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2016] [Accepted: 12/24/2016] [Indexed: 11/18/2022] Open
Abstract
Automatic identification of authorship in disputed documents has benefited from complex network theory as this approach does not require human expertise or detailed semantic knowledge. Networks modeling entire books can be used to discriminate texts from different sources and understand network growth mechanisms, but only a few studies have probed the suitability of networks in modeling small chunks of text to grasp stylistic features. In this study, we introduce a methodology based on the dynamics of word co-occurrence networks representing written texts to classify a corpus of 80 texts by 8 authors. The texts were divided into sections with equal number of linguistic tokens, from which time series were created for 12 topological metrics. Since 73% of all series were stationary (ARIMA(p, 0, q)) and the remaining were integrable of first order (ARIMA(p, 1, q)), probability distributions could be obtained for the global network metrics. The metrics exhibit bell-shaped non-Gaussian distributions, and therefore distribution moments were used as learning attributes. With an optimized supervised learning procedure based on a nonlinear transformation performed by Isomap, 71 out of 80 texts were correctly classified using the K-nearest neighbors algorithm, i.e. a remarkable 88.75% author matching success rate was achieved. Hence, purely dynamic fluctuations in network metrics can characterize authorship, thus paving the way for a robust description of large texts in terms of small evolving networks.
Collapse
Affiliation(s)
- Camilo Akimushkin
- São Carlos Institute of Physics, University of São Paulo, São Carlos, São Paulo, Brazil
| | - Diego Raphael Amancio
- Institute of Mathematics and Computer Science, University of São Paulo, São Carlos, São Paulo, Brazil
| | | |
Collapse
|
28
|
Ke X, Zeng Y, Luo H. Autoscoring Essays Based on Complex Networks. JOURNAL OF EDUCATIONAL MEASUREMENT 2016. [DOI: 10.1111/jedm.12127] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
29
|
García AM, Carrillo F, Orozco-Arroyave JR, Trujillo N, Vargas Bonilla JF, Fittipaldi S, Adolfi F, Nöth E, Sigman M, Fernández Slezak D, Ibáñez A, Cecchi GA. How language flows when movements don't: An automated analysis of spontaneous discourse in Parkinson's disease. BRAIN AND LANGUAGE 2016; 162:19-28. [PMID: 27501386 DOI: 10.1016/j.bandl.2016.07.008] [Citation(s) in RCA: 53] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/06/2016] [Revised: 04/20/2016] [Accepted: 07/25/2016] [Indexed: 06/06/2023]
Abstract
To assess the impact of Parkinson's disease (PD) on spontaneous discourse, we conducted computerized analyses of brief monologues produced by 51 patients and 50 controls. We explored differences in semantic fields (via latent semantic analysis), grammatical choices (using part-of-speech tagging), and word-level repetitions (with graph embedding tools). Although overall output was quantitatively similar between groups, patients relied less heavily on action-related concepts and used more subordinate structures. Also, a classification tool operating on grammatical patterns identified monologues as pertaining to patients or controls with 75% accuracy. Finally, while the incidence of dysfluent word repetitions was similar between groups, it allowed inferring the patients' level of motor impairment with 77% accuracy. Our results highlight the relevance of studying naturalistic discourse features to tap the integrity of neural (and, particularly, motor) networks, beyond the possibilities of standard token-level instruments.
Collapse
Affiliation(s)
- Adolfo M García
- Laboratory of Experimental Psychology and Neuroscience (LPEN), Institute of Cognitive and Translational Neuroscience (INCyT), INECO Foundation, Favaloro University, Pacheco de Melo 1860, C1126AAB Buenos Aires, Argentina; National Scientific and Technical Research Council (CONICET), Av. Rivadavia 1917, C1033AAJ Buenos Aires, Argentina; Faculty of Elementary and Special Education (FEEyE), National University of Cuyo (UNCuyo), Sobremonte 74, C5500 Mendoza, Argentina.
| | - Facundo Carrillo
- National Scientific and Technical Research Council (CONICET), Av. Rivadavia 1917, C1033AAJ Buenos Aires, Argentina; Department of Computer Science, School of Sciences, University of Buenos Aires, Pabellón I, Ciudad Universitaria, C1428EGA Buenos Aires, Argentina
| | - Juan Rafael Orozco-Arroyave
- Faculty of Engineering, University of Antioquia, Calle 67 N° 53-108, C1226 Medellín, Colombia; Pattern Recognition Lab, Friedrich-Alexander-Universität, Martensstrasse 3, 91058 Erlangen-Nürnberg, Germany
| | - Natalia Trujillo
- Neuroscience Group, Faculty of Medicine, University of Antioquia, Calle 62 N° 52-59, C1226 Medellín, Colombia; School of Public Health, University of Antioquia, Calle 62 N° 52-59, C1226 Medellín, Colombia
| | - Jesús F Vargas Bonilla
- Faculty of Engineering, University of Antioquia, Calle 67 N° 53-108, C1226 Medellín, Colombia
| | - Sol Fittipaldi
- Laboratory of Experimental Psychology and Neuroscience (LPEN), Institute of Cognitive and Translational Neuroscience (INCyT), INECO Foundation, Favaloro University, Pacheco de Melo 1860, C1126AAB Buenos Aires, Argentina
| | - Federico Adolfi
- Laboratory of Experimental Psychology and Neuroscience (LPEN), Institute of Cognitive and Translational Neuroscience (INCyT), INECO Foundation, Favaloro University, Pacheco de Melo 1860, C1126AAB Buenos Aires, Argentina
| | - Elmar Nöth
- Pattern Recognition Lab, Friedrich-Alexander-Universität, Martensstrasse 3, 91058 Erlangen-Nürnberg, Germany
| | - Mariano Sigman
- Laboratory of Integrative Neuroscience, Torcuato Di Tella University, Av. Figueroa Alcorta 7350, C1428BCW Buenos Aires, Argentina
| | - Diego Fernández Slezak
- National Scientific and Technical Research Council (CONICET), Av. Rivadavia 1917, C1033AAJ Buenos Aires, Argentina; Department of Computer Science, School of Sciences, University of Buenos Aires, Pabellón I, Ciudad Universitaria, C1428EGA Buenos Aires, Argentina
| | - Agustín Ibáñez
- Laboratory of Experimental Psychology and Neuroscience (LPEN), Institute of Cognitive and Translational Neuroscience (INCyT), INECO Foundation, Favaloro University, Pacheco de Melo 1860, C1126AAB Buenos Aires, Argentina; National Scientific and Technical Research Council (CONICET), Av. Rivadavia 1917, C1033AAJ Buenos Aires, Argentina; Universidad Autónoma del Caribe, Calle 90, N° 46-112, C2754 Barranquilla, Colombia; Center for Social and Cognitive Neuroscience (CSCN), School of Psychology, Universidad Adolfo Ibáñez, Diagonal Las Torres 2640, Santiago, Chile; Centre of Excellence in Cognition and its Disorders, Australian Research Council (ACR), 16 University Avenue, Macquarie University, Sydney, NSW 2109, Australia
| | - Guillermo A Cecchi
- Computational Biology Center, IBM, T.J. Watson Research Center, Yorktown Heights, 1101 Kitchawan Rd., Yorktwon Heights, New York, NY 10598, USA
| |
Collapse
|
30
|
Comparing the topological properties of real and artificially generated scientific manuscripts. Scientometrics 2015. [DOI: 10.1007/s11192-015-1637-z] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
|
31
|
Fast Distributed Dynamics of Semantic Networks via Social Media. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2015; 2015:712835. [PMID: 26074953 PMCID: PMC4449913 DOI: 10.1155/2015/712835] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/03/2014] [Revised: 04/01/2015] [Accepted: 04/20/2015] [Indexed: 12/04/2022]
Abstract
We investigate the dynamics of semantic organization using social media, a collective expression of human thought. We propose a novel, time-dependent semantic similarity measure (TSS), based on the social network Twitter. We show that TSS is consistent with static measures of similarity but provides high temporal resolution for the identification of real-world events and induced changes in the distributed structure of semantic relationships across the entire lexicon. Using TSS, we measured the evolution of a concept and its movement along the semantic neighborhood, driven by specific news/events. Finally, we showed that particular events may trigger a temporary reorganization of elements in the semantic network.
Collapse
|
32
|
Cong J, Liu H. Approaching human language with complex networks. Phys Life Rev 2014; 11:598-618. [DOI: 10.1016/j.plrev.2014.04.004] [Citation(s) in RCA: 138] [Impact Index Per Article: 13.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2014] [Revised: 04/07/2014] [Accepted: 04/15/2014] [Indexed: 11/28/2022]
|
33
|
A window into the intoxicated mind? Speech as an index of psychoactive drug effects. Neuropsychopharmacology 2014; 39:2340-8. [PMID: 24694926 PMCID: PMC4138742 DOI: 10.1038/npp.2014.80] [Citation(s) in RCA: 53] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/08/2013] [Revised: 01/20/2014] [Accepted: 03/24/2014] [Indexed: 11/09/2022]
Abstract
Abused drugs can profoundly alter mental states in ways that may motivate drug use. These effects are usually assessed with self-report, an approach that is vulnerable to biases. Analyzing speech during intoxication may present a more direct, objective measure, offering a unique 'window' into the mind. Here, we employed computational analyses of speech semantic and topological structure after ±3,4-methylenedioxymethamphetamine (MDMA; 'ecstasy') and methamphetamine in 13 ecstasy users. In 4 sessions, participants completed a 10-min speech task after MDMA (0.75 and 1.5 mg/kg), methamphetamine (20 mg), or placebo. Latent Semantic Analyses identified the semantic proximity between speech content and concepts relevant to drug effects. Graph-based analyses identified topological speech characteristics. Group-level drug effects on semantic distances and topology were assessed. Machine-learning analyses (with leave-one-out cross-validation) assessed whether speech characteristics could predict drug condition in the individual subject. Speech after MDMA (1.5 mg/kg) had greater semantic proximity than placebo to the concepts friend, support, intimacy, and rapport. Speech on MDMA (0.75 mg/kg) had greater proximity to empathy than placebo. Conversely, speech on methamphetamine was further from compassion than placebo. Classifiers discriminated between MDMA (1.5 mg/kg) and placebo with 88% accuracy, and MDMA (1.5 mg/kg) and methamphetamine with 84% accuracy. For the two MDMA doses, the classifier performed at chance. These data suggest that automated semantic speech analyses can capture subtle alterations in mental state, accurately discriminating between drugs. The findings also illustrate the potential for automated speech-based approaches to characterize clinically relevant alterations to mental state, including those occurring in psychiatric illness.
Collapse
|
34
|
Ribeiro S. The onset of data-driven mental archeology. Front Neurosci 2014; 8:249. [PMID: 25165431 PMCID: PMC4131232 DOI: 10.3389/fnins.2014.00249] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2014] [Accepted: 07/28/2014] [Indexed: 11/13/2022] Open
|
35
|
Graph analysis of dream reports is especially informative about psychosis. Sci Rep 2014; 4:3691. [PMID: 24424108 PMCID: PMC3892182 DOI: 10.1038/srep03691] [Citation(s) in RCA: 73] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2013] [Accepted: 11/25/2013] [Indexed: 12/15/2022] Open
Abstract
Early psychiatry investigated dreams to understand psychopathologies. Contemporary psychiatry, which neglects dreams, has been criticized for lack of objectivity. In search of quantitative insight into the structure of psychotic speech, we investigated speech graph attributes (SGA) in patients with schizophrenia, bipolar disorder type I, and non-psychotic controls as they reported waking and dream contents. Schizophrenic subjects spoke with reduced connectivity, in tight correlation with negative and cognitive symptoms measured by standard psychometric scales. Bipolar and control subjects were undistinguishable by waking reports, but in dream reports bipolar subjects showed significantly less connectivity. Dream-related SGA outperformed psychometric scores or waking-related data for group sorting. Altogether, the results indicate that online and offline processing, the two most fundamental modes of brain operation, produce nearly opposite effects on recollections: While dreaming exposes differences in the mnemonic records across individuals, waking dampens distinctions. The results also demonstrate the feasibility of the differential diagnosis of psychosis based on the analysis of dream graphs, pointing to a fast, low-cost and language-invariant tool for psychiatric diagnosis and the objective search for biomarkers. The Freudian notion that "dreams are the royal road to the unconscious" is clinically useful, after all.
Collapse
|
36
|
Romano S, Sigman M, Figueira S. $LT^2C^2$ : A language of thought with Turing-computable Kolmogorov complexity. PAPERS IN PHYSICS 2013. [DOI: 10.4279/pip.050001] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022] Open
|
37
|
Petersen AM, Tenenbaum JN, Havlin S, Stanley HE, Perc M. Languages cool as they expand: allometric scaling and the decreasing need for new words. Sci Rep 2012; 2:943. [PMID: 23230508 PMCID: PMC3517984 DOI: 10.1038/srep00943] [Citation(s) in RCA: 142] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2012] [Accepted: 10/24/2012] [Indexed: 11/23/2022] Open
Abstract
We analyze the occurrence frequencies of over 15 million words recorded in millions of books published during the past two centuries in seven different languages. For all languages and chronological subsets of the data we confirm that two scaling regimes characterize the word frequency distributions, with only the more common words obeying the classic Zipf law. Using corpora of unprecedented size, we test the allometric scaling relation between the corpus size and the vocabulary size of growing languages to demonstrate a decreasing marginal need for new words, a feature that is likely related to the underlying correlations between words. We calculate the annual growth fluctuations of word use which has a decreasing trend as the corpus size increases, indicating a slowdown in linguistic evolution following language expansion. This "cooling pattern" forms the basis of a third statistical regularity, which unlike the Zipf and the Heaps law, is dynamical in nature.
Collapse
Affiliation(s)
- Alexander M. Petersen
- Laboratory for the Analysis of Complex Economic Systems, IMT Lucca Institute for Advanced Studies, Lucca 55100, Italy
| | - Joel N. Tenenbaum
- Center for Polymer Studies and Department of Physics, Boston University, Boston, Massachusetts 02215, USA
- Operations and Technology Management, School of Management, Boston University, Boston, Massachusetts 02215, USA
| | - Shlomo Havlin
- Minerva Center and Department of Physics, Bar-Ilan University, Ramat-Gan 52900, Israel
| | - H. Eugene Stanley
- Center for Polymer Studies and Department of Physics, Boston University, Boston, Massachusetts 02215, USA
| | - Matjaž Perc
- Department of Physics, Faculty of Natural Sciences and Mathematics, University of Maribor, Koroška cesta 160, SI-2000 Maribor, Slovenia
| |
Collapse
|
38
|
Abstract
We contrasted the predictive power of three measures of semantic richness—number of features (NFs), contextual dispersion (CD), and a novel measure of number of semantic neighbors (NSN)—for a large set of concrete and abstract concepts on lexical decision and naming tasks. NSN (but not NF) facilitated processing for abstract concepts, while NF (but not NSN) facilitated processing for the most concrete concepts, consistent with claims that linguistic information is more relevant for abstract concepts in early processing. Additionally, converging evidence from two datasets suggests that when NSN and CD are controlled for, the features that most facilitate processing are those associated with a concept's physical characteristics and real-world contexts. These results suggest that rich linguistic contexts (many semantic neighbors) facilitate early activation of abstract concepts, whereas concrete concepts benefit more from rich physical contexts (many associated objects and locations).
Collapse
Affiliation(s)
- Gabriel Recchia
- Department of Cognitive Science, Indiana University Bloomington, IN, USA
| | | |
Collapse
|
39
|
Diuk CG, Slezak DF, Raskovsky I, Sigman M, Cecchi GA. A quantitative philology of introspection. Front Integr Neurosci 2012; 6:80. [PMID: 23015783 PMCID: PMC3449397 DOI: 10.3389/fnint.2012.00080] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2012] [Accepted: 09/04/2012] [Indexed: 11/13/2022] Open
Abstract
The cultural evolution of introspective thought has been recognized to undergo a drastic change during the middle of the first millennium BC. This period, known as the “Axial Age,” saw the birth of religions and philosophies still alive in modern culture, as well as the transition from orality to literacy—which led to the hypothesis of a link between introspection and literacy. Here we set out to examine the evolution of introspection in the Axial Age, studying the cultural record of the Greco-Roman and Judeo-Christian literary traditions. Using a statistical measure of semantic similarity, we identify a single “arrow of time” in the Old and New Testaments of the Bible, and a more complex non-monotonic dynamics in the Greco-Roman tradition reflecting the rise and fall of the respective societies. A comparable analysis of the twentieth century cultural record shows a steady increase in the incidence of introspective topics, punctuated by abrupt declines during and preceding the First and Second World Wars. Our results show that (a) it is possible to devise a consistent metric to quantify the history of a high-level concept such as introspection, cementing the path for a new quantitative philology and (b) to the extent that it is captured in the cultural record, the increased ability of human thought for self-reflection that the Axial Age brought about is still heavily determined by societal contingencies beyond the orality-literacy nexus.
Collapse
Affiliation(s)
- Carlos G Diuk
- Department of Psychology, Princeton Neuroscience Institute, Princeton University Princeton, NJ, USA
| | | | | | | | | |
Collapse
|
40
|
Measuring the evolution of contemporary western popular music. Sci Rep 2012; 2:521. [PMID: 22837813 PMCID: PMC3405292 DOI: 10.1038/srep00521] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2012] [Accepted: 07/04/2012] [Indexed: 11/08/2022] Open
Abstract
Popular music is a key cultural expression that has captured listeners' attention for ages. Many of the structural regularities underlying musical discourse are yet to be discovered and, accordingly, their historical evolution remains formally unknown. Here we unveil a number of patterns and metrics characterizing the generic usage of primary musical facets such as pitch, timbre, and loudness in contemporary western popular music. Many of these patterns and metrics have been consistently stable for a period of more than fifty years. However, we prove important changes or trends related to the restriction of pitch transitions, the homogenization of the timbral palette, and the growing loudness levels. This suggests that our perception of the new would be rooted on these changing characteristics. Hence, an old tune could perfectly sound novel and fashionable, provided that it consisted of common harmonic progressions, changed the instrumentation, and increased the average loudness.
Collapse
|
41
|
Mota NB, Vasconcelos NAP, Lemos N, Pieretti AC, Kinouchi O, Cecchi GA, Copelli M, Ribeiro S. Speech graphs provide a quantitative measure of thought disorder in psychosis. PLoS One 2012; 7:e34928. [PMID: 22506057 PMCID: PMC3322168 DOI: 10.1371/journal.pone.0034928] [Citation(s) in RCA: 142] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2011] [Accepted: 03/07/2012] [Indexed: 01/31/2023] Open
Abstract
BACKGROUND Psychosis has various causes, including mania and schizophrenia. Since the differential diagnosis of psychosis is exclusively based on subjective assessments of oral interviews with patients, an objective quantification of the speech disturbances that characterize mania and schizophrenia is in order. In principle, such quantification could be achieved by the analysis of speech graphs. A graph represents a network with nodes connected by edges; in speech graphs, nodes correspond to words and edges correspond to semantic and grammatical relationships. METHODOLOGY/PRINCIPAL FINDINGS To quantify speech differences related to psychosis, interviews with schizophrenics, manics and normal subjects were recorded and represented as graphs. Manics scored significantly higher than schizophrenics in ten graph measures. Psychopathological symptoms such as logorrhea, poor speech, and flight of thoughts were grasped by the analysis even when verbosity differences were discounted. Binary classifiers based on speech graph measures sorted schizophrenics from manics with up to 93.8% of sensitivity and 93.7% of specificity. In contrast, sorting based on the scores of two standard psychiatric scales (BPRS and PANSS) reached only 62.5% of sensitivity and specificity. CONCLUSIONS/SIGNIFICANCE The results demonstrate that alterations of the thought process manifested in the speech of psychotic patients can be objectively measured using graph-theoretical tools, developed to capture specific features of the normal and dysfunctional flow of thought, such as divergence and recurrence. The quantitative analysis of speech graphs is not redundant with standard psychometric scales but rather complementary, as it yields a very accurate sorting of schizophrenics and manics. Overall, the results point to automated psychiatric diagnosis based not on what is said, but on how it is said.
Collapse
Affiliation(s)
- Natalia B. Mota
- Brain Institute, Federal University of Rio Grande do Norte, Natal, Brazil
- Hospital Onofre Lopes, Federal University of Rio Grande do Norte, Natal, Brazil
- Edmond and Lily Safra International Institute of Neuroscience of Natal, Natal, Brazil
| | - Nivaldo A. P. Vasconcelos
- Brain Institute, Federal University of Rio Grande do Norte, Natal, Brazil
- Faculdade Natalense para o Desenvolvimento do Rio Grande do Norte, Natal, Brazil
- Department of Systems and Computation, Federal University of Campina Grande, Campina Grande, Brazil
| | - Nathalia Lemos
- Brain Institute, Federal University of Rio Grande do Norte, Natal, Brazil
| | - Ana C. Pieretti
- Brain Institute, Federal University of Rio Grande do Norte, Natal, Brazil
| | - Osame Kinouchi
- Department of Physics, Universidade de São Paulo, Ribeirão Preto, Brazil
| | - Guillermo A. Cecchi
- Biometaphorical Computing, Computational Biology Center, IBM Research Division, IBM T. J. Watson Research Center, Yorktown Heights, New York, United States of America
| | - Mauro Copelli
- Department of Physics, Federal University of Pernambuco, Recife, Brazil
| | - Sidarta Ribeiro
- Brain Institute, Federal University of Rio Grande do Norte, Natal, Brazil
- * E-mail:
| |
Collapse
|
42
|
Petersen AM, Tenenbaum J, Havlin S, Stanley HE. Statistical laws governing fluctuations in word use from word birth to word death. Sci Rep 2012; 2:313. [PMID: 22423321 PMCID: PMC3304511 DOI: 10.1038/srep00313] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2012] [Accepted: 02/24/2012] [Indexed: 11/09/2022] Open
Abstract
We analyze the dynamic properties of 10(7) words recorded in English, Spanish and Hebrew over the period 1800-2008 in order to gain insight into the coevolution of language and culture. We report language independent patterns useful as benchmarks for theoretical models of language evolution. A significantly decreasing (increasing) trend in the birth (death) rate of words indicates a recent shift in the selection laws governing word use. For new words, we observe a peak in the growth-rate fluctuations around 40 years after introduction, consistent with the typical entry time into standard dictionaries and the human generational timescale. Pronounced changes in the dynamics of language during periods of war shows that word correlations, occurring across time and between words, are largely influenced by coevolutionary social, technological, and political factors. We quantify cultural memory by analyzing the long-term correlations in the use of individual words using detrended fluctuation analysis.
Collapse
|
43
|
Marupaka N, Iyer LR, Minai AA. Connectivity and thought: the influence of semantic network structure in a neurodynamical model of thinking. Neural Netw 2012; 32:147-58. [PMID: 22397950 DOI: 10.1016/j.neunet.2012.02.004] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2011] [Revised: 02/02/2012] [Accepted: 02/07/2012] [Indexed: 11/20/2022]
Abstract
Understanding cognition has been a central focus for psychologists, neuroscientists and philosophers for thousands of years, but many of its most fundamental processes remain very poorly understood. Chief among these is the process of thought itself: the spontaneous emergence of specific ideas within the stream of consciousness. It is widely accepted that ideas, both familiar and novel, arise from the combination of existing concepts. From this perspective, thought is an emergent attribute of memory, arising from the intrinsic dynamics of the neural substrate in which information is embedded. An important issue in any understanding of this process is the relationship between the emergence of conceptual combinations and the dynamics of the underlying neural networks. Virtually all theories of ideation hypothesize that ideas arise during the thought process through association, each one triggering the next through some type of linkage, e.g., structural analogy, semantic similarity, polysemy, etc. In particular, it has been suggested that the creativity of ideation in individuals reflects the qualitative structure of conceptual associations in their minds. Interestingly, psycholinguistic studies have shown that semantic networks across many languages have a particular type of structure with small-world, scale free connectivity. So far, however, these related insights have not been brought together, in part because there has been no explicitly neural model for the dynamics of spontaneous thought. Recently, we have developed such a model. Though simplistic and abstract, this model attempts to capture the most basic aspects of the process hypothesized by theoretical models within a neurodynamical framework. It represents semantic memory as a recurrent semantic neural network with itinerant dynamics. Conceptual combinations arise through this dynamics as co-active groups of neural units, and either dissolve quickly or persist for a time as emergent metastable attractors and are recognized consciously as ideas. The work presented in this paper describes this model in detail, and uses it to systematically study the relationship between the structure of conceptual associations in the neural substrate and the ideas arising from this system's dynamics. In particular, we consider how the small-world and scale-free characteristics influence the effectiveness of the thought process under several metrics, and show that networks with both attributes indeed provide significant advantages in generating unique conceptual combinations.
Collapse
Affiliation(s)
- Nagendra Marupaka
- School of Electronic and Computing Systems, University of Cincinnati, Cincinnati, OH 45221, USA
| | | | | |
Collapse
|
44
|
|
45
|
|
46
|
Network based models of cognitive and social dynamics of human languages. COMPUT SPEECH LANG 2011. [DOI: 10.1016/j.csl.2011.02.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
|
47
|
|
48
|
Masucci AP, Kalampokis A, Eguíluz VM, Hernández-García E. Wikipedia information flow analysis reveals the scale-free architecture of the semantic space. PLoS One 2011; 6:e17333. [PMID: 21407801 PMCID: PMC3046238 DOI: 10.1371/journal.pone.0017333] [Citation(s) in RCA: 41] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2011] [Accepted: 01/26/2011] [Indexed: 11/18/2022] Open
Abstract
In this paper we extract the topology of the semantic space in its encyclopedic acception, measuring the semantic flow between the different entries of the largest modern encyclopedia, Wikipedia, and thus creating a directed complex network of semantic flows. Notably at the percolation threshold the semantic space is characterised by scale-free behaviour at different levels of complexity and this relates the semantic space to a wide range of biological, social and linguistics phenomena. In particular we find that the cluster size distribution, representing the size of different semantic areas, is scale-free. Moreover the topology of the resulting semantic space is scale-free in the connectivity distribution and displays small-world properties. However its statistical properties do not allow a classical interpretation via a generative model based on a simple multiplicative process. After giving a detailed description and interpretation of the topological properties of the semantic space, we introduce a stochastic model of content-based network, based on a copy and mutation algorithm and on the Heaps' law, that is able to capture the main statistical properties of the analysed semantic space, including the Zipf's law for the word frequency distribution.
Collapse
Affiliation(s)
- Adolfo Paolo Masucci
- Instituto de Física Interdisciplinar y Sistemas Complejos, Consejo Superior de Investigaciones Científicas - Universitat de les Illes Balears, Palma de Mallorca, Spain
- * E-mail:
| | | | - Victor Martínez Eguíluz
- Instituto de Física Interdisciplinar y Sistemas Complejos, Consejo Superior de Investigaciones Científicas - Universitat de les Illes Balears, Palma de Mallorca, Spain
| | - Emilio Hernández-García
- Instituto de Física Interdisciplinar y Sistemas Complejos, Consejo Superior de Investigaciones Científicas - Universitat de les Illes Balears, Palma de Mallorca, Spain
| |
Collapse
|
49
|
SciReader enables reading of medical content with instantaneous definitions. BMC Med Inform Decis Mak 2011; 11:4. [PMID: 21266060 PMCID: PMC3038137 DOI: 10.1186/1472-6947-11-4] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2010] [Accepted: 01/25/2011] [Indexed: 11/30/2022] Open
Abstract
Background A major problem patients encounter when reading about health related issues is document interpretation, which limits reading comprehension and therefore negatively impacts health care. Currently, searching for medical definitions from an external source is time consuming, distracting, and negatively impacts reading comprehension and memory of the material. Methods SciReader was built as a Java application with a Flex-based front-end client. The dictionary used by SciReader was built by consolidating data from several sources and generating new definitions with a standardized syntax. The application was evaluated by measuring the percentage of words defined in different documents. A survey was used to test the perceived effect of SciReader on reading time and comprehension. Results We present SciReader, a web-application that simplifies document interpretation by allowing users to instantaneously view medical, English, and scientific definitions as they read any document. This tool reveals the definitions of any selected word in a small frame at the top of the application. SciReader relies on a dictionary of ~750,000 unique Biomedical and English word definitions. Evaluation of the application shows that it maps ~98% of words in several different types of documents and that most users tested in a survey indicate that the application decreases reading time and increases comprehension. Conclusions SciReader is a web application useful for reading medical and scientific documents. The program makes jargon-laden content more accessible to patients, educators, health care professionals, and the general public.
Collapse
|
50
|
Goñi J, Arrondo G, Sepulcre J, Martincorena I, Vélez de Mendizábal N, Corominas-Murtra B, Bejarano B, Ardanza-Trevijano S, Peraita H, Wall DP, Villoslada P. The semantic organization of the animal category: evidence from semantic verbal fluency and network theory. Cogn Process 2010; 12:183-96. [PMID: 20938799 DOI: 10.1007/s10339-010-0372-x] [Citation(s) in RCA: 60] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2010] [Accepted: 09/16/2010] [Indexed: 01/31/2023]
Abstract
Semantic memory is the subsystem of human memory that stores knowledge of concepts or meanings, as opposed to life-specific experiences. How humans organize semantic information remains poorly understood. In an effort to better understand this issue, we conducted a verbal fluency experiment on 200 participants with the aim of inferring and representing the conceptual storage structure of the natural category of animals as a network. This was done by formulating a statistical framework for co-occurring concepts that aims to infer significant concept-concept associations and represent them as a graph. The resulting network was analyzed and enriched by means of a missing links recovery criterion based on modularity. Both network models were compared to a thresholded co-occurrence approach. They were evaluated using a random subset of verbal fluency tests and comparing the network outcomes (linked pairs are clustering transitions and disconnected pairs are switching transitions) to the outcomes of two expert human raters. Results show that the network models proposed in this study overcome a thresholded co-occurrence approach, and their outcomes are in high agreement with human evaluations. Finally, the interplay between conceptual structure and retrieval mechanisms is discussed.
Collapse
Affiliation(s)
- Joaquín Goñi
- Department of Neurosciences. Center for Applied Medical Research, University of Navarra, Pamplona, Spain
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|