1
|
Kearney E, McMahon KL, Guenther F, Arciuli J, de Zubicaray GI. Revisiting the concreteness effect: Non-arbitrary mappings between form and concreteness of English words influence lexical processing. Cognition 2024; 254:105972. [PMID: 39388784 DOI: 10.1016/j.cognition.2024.105972] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2024] [Revised: 08/08/2024] [Accepted: 09/28/2024] [Indexed: 10/12/2024]
Abstract
How do we represent and process abstract and concrete concepts? The "concreteness effect", in which words with more concrete meanings are processed more quickly and accurately across a range of language tasks compared to abstract ones, suggests a differential conceptual organization of these words in the brain. However, concrete words tend to be marked by specific phonotactic features, such as having fewer syllables and more phonological neighbours. It is unclear whether these non-arbitrary form-meaning relationships that systematically denote the concreteness of a word impact language processing. In the current study, we first establish the extent of systematic mappings between phonological/phonetic features and concreteness ratings in a large set of monosyllabic and polysyllabic English words (i.e., concreteness form typicality), then demonstrate that they significantly influence lexical processing using behavioural megastudy datasets. Surface form features predicted a significant proportion of variance in concreteness ratings of monomorphemic words (25 %) which increased with the addition of polymorphemic forms (43 %). In addition, concreteness form typicality was a significant predictor of performance on visual and auditory lexical decision, naming, and semantic (concrete/abstract) decision tasks, after controlling for a range of psycholinguistic variables and concreteness ratings. Overall, our results provide the first evidence that concreteness form typicality influences lexical processing. We discuss theoretical implications for interpretations of the concreteness effect and models of language processing that have yet to incorporate non-arbitrary relationships between form and meaning into their feature sets.
Collapse
Affiliation(s)
- Elaine Kearney
- School of Psychology and Counselling, Faculty of Health, Queensland University of Technology (QUT), Kelvin Grove, QLD 4059, Australia.
| | - Katie L McMahon
- School of Clinical Sciences, Centre for Biomedical Technologies, QUT, Kelvin Grove, QLD 4059, Australia; Herston Imaging Research Facility, Royal Brisbane & Women's Hospital, Herston, QLD 4029, Australia
| | - Frank Guenther
- Department of Speech, Language, and Hearing Sciences, Boston University, Boston, MA 02215, USA; Department of Biomedical Engineering, Boston University, Boston, MA 02215, USA
| | - Joanne Arciuli
- College of Nursing and Health Sciences, Flinders University, Bedford Park, SA 5042, Australia
| | - Greig I de Zubicaray
- School of Psychology and Counselling, Faculty of Health, Queensland University of Technology (QUT), Kelvin Grove, QLD 4059, Australia
| |
Collapse
|
2
|
Wiley RW, Singh S, Baig Y, Key K, Purcell JJ. The English Sublexical Toolkit: Methods for indexing sound-spelling consistency. Behav Res Methods 2024; 56:6826-6861. [PMID: 38594441 PMCID: PMC11362297 DOI: 10.3758/s13428-024-02395-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/08/2024] [Indexed: 04/11/2024]
Abstract
This work introduces the English Sublexical Toolkit, a suite of tools that utilizes an experience-dependent learning framework of sublexical knowledge to extract regularities from the English lexicon. The Toolkit quantifies the empirical regularity of sublexical units in both the reading and spelling directions (i.e., grapheme-to-phoneme and phoneme-to-grapheme) and at multiple grain sizes (i.e., phoneme/grapheme and onset/rime unit size). It can extract multiple experience-dependent regularity indices for words or pseudowords, including both frequency indices (e.g., grapheme frequency) and conditional probability indices (e.g., grapheme-to-phoneme probability). These tools provide (1) superior estimates of the regularities that better reflect the complexity of the sublexical system relative to previously published indices and (2) completely novel indices of sublexical units such as phonographeme frequency (i.e., combined units of individual phonemes and graphemes that are independent of processing direction). We demonstrate that measures from the toolkit explain significant amounts of variance in empirical data (naming of real words and lexical decision), and either outperform or are comparable to the best available consistency measures. The flexibility of the toolkit is further demonstrated by its ability to readily index the probability of different pseudowords pronunciations, and we report that the measures account for the majority of variance in these empirically observed probabilities. Overall, this work provides a framework and resources that can be flexibly used to identify optimal corpus-based consistency measures that help explain reading/spelling behaviors for real and pseudowords.
Collapse
Affiliation(s)
- Robert W Wiley
- Department of Psychology, University of North Carolina at Greensboro, 296 Eberhart Building, Greensboro, NC, 27402, USA.
| | - Sartaj Singh
- Department of Psychology, University of North Carolina at Greensboro, 296 Eberhart Building, Greensboro, NC, 27402, USA
| | - Yusuf Baig
- Department of Cognitive Science, Johns Hopkins University, Baltimore, MD, USA
| | - Kristin Key
- Department of Psychology, University of North Carolina at Greensboro, 296 Eberhart Building, Greensboro, NC, 27402, USA
| | - Jeremy J Purcell
- Maryland Neuroimaging Center, University of Maryland, College Park, MD, USA
| |
Collapse
|
3
|
Stein R, Frost R, Siegelman N. HeLP: The Hebrew Lexicon project. Behav Res Methods 2024:10.3758/s13428-024-02502-4. [PMID: 39251528 DOI: 10.3758/s13428-024-02502-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/17/2024] [Indexed: 09/11/2024]
Abstract
Lexicon projects (LPs) are large-scale data resources in different languages that present behavioral results from visual word recognition tasks. Analyses using LP data in multiple languages provide evidence regarding cross-linguistic differences as well as similarities in visual word recognition. Here we present the first LP in a Semitic language-the Hebrew Lexicon Project (HeLP). HeLP assembled lexical decision (LD) responses to 10,000 Hebrew words and nonwords, and naming responses to a subset of 5000 Hebrew words. We used the large-scale HeLP data to estimate the impact of general predictors (lexicality, frequency, word length, orthographic neighborhood density), and Hebrew-specific predictors (Semitic structure, presence of clitics, phonological entropy) of visual word recognition performance. Our results revealed the typical effects of lexicality and frequency obtained in many languages, but more complex impact of word length and neighborhood density. Considering Hebrew-specific characteristics, HeLP data revealed better recognition of words with a Semitic structure than words that do not conform to it, and a drop in performance for words comprising clitics. These effects varied, however, across LD and naming tasks. Lastly, a significant inhibitory effect of phonological ambiguity was found in both naming and LD. The implications of these findings for understanding reading in a Semitic language are discussed.
Collapse
Affiliation(s)
- Roni Stein
- Department of Psychology, The Hebrew University of Jerusalem, Mount Scopus Campus, 9190501, Jerusalem, Israel
| | - Ram Frost
- Department of Psychology, The Hebrew University of Jerusalem, Mount Scopus Campus, 9190501, Jerusalem, Israel
- BCBL, Basque Center of Cognition, Brain and Language, San Sebastian, Spain
| | - Noam Siegelman
- Department of Psychology, The Hebrew University of Jerusalem, Mount Scopus Campus, 9190501, Jerusalem, Israel.
| |
Collapse
|
4
|
de Zubicaray GI, Arciuli J, Guenther FH, McMahon KL, Kearney E. Non-arbitrary mappings between size and sound of English words: Form typicality effects during lexical access and memory. Q J Exp Psychol (Hove) 2024; 77:943-963. [PMID: 37332149 PMCID: PMC11032636 DOI: 10.1177/17470218231184940] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2023] [Revised: 05/04/2023] [Accepted: 06/09/2023] [Indexed: 06/20/2023]
Abstract
A century of research has provided evidence of limited size sound symbolism in English, that is, certain vowels are non-arbitrarily associated with words denoting small versus large referents (e.g., /i/ as in teensy and /ɑ/ as in tall). In the present study, we investigated more extensive statistical regularities between surface form properties of English words and ratings of their semantic size, that is, form typicality, and its impact on language and memory processing. Our findings provide the first evidence of significant word form typicality for semantic size. In five empirical studies using behavioural megastudy data sets of performance on written and auditory lexical decision, reading aloud, semantic decision, and recognition memory tasks, we show that form typicality for size is a stronger and more consistent predictor of lexical access during word comprehension and production than semantic size, in addition to playing a significant role in verbal memory. The empirical results demonstrate that statistical information about non-arbitrary form-size mappings is accessed automatically during language and verbal memory processing, unlike semantic size that is largely dependent on task contexts that explicitly require participants to access size knowledge. We discuss how a priori knowledge about non-arbitrary form-meaning associations in the lexicon might be incorporated in models of language processing that implement Bayesian statistical inference.
Collapse
Affiliation(s)
- Greig I de Zubicaray
- School of Psychology and Counselling, Faculty of Health, Queensland University of Technology, Brisbane, QLD, Australia
| | - Joanne Arciuli
- College of Nursing and Health Sciences, Flinders University, Adelaide, SA, Australia
| | - Frank H Guenther
- Department of Speech, Language & Hearing Sciences, Boston University, Boston, MA, USA
- Department of Biomedical Engineering, Boston University, Boston, MA, USA
| | - Katie L McMahon
- School of Clinical Sciences, Centre for Biomedical Technologies, Queensland University of Technology, Brisbane, QLD, Australia
- Herston Imaging Research Facility, Royal Brisbane and Women’s Hospital, Herston, QLD, Australia
| | - Elaine Kearney
- School of Psychology and Counselling, Faculty of Health, Queensland University of Technology, Brisbane, QLD, Australia
| |
Collapse
|
5
|
Edwards AA, Rigobon VM, Steacy LM, Compton DL. Spelling-to-pronunciation transparency ratings for the 20,000 most frequently written English words. Behav Res Methods 2024; 56:2828-2841. [PMID: 37587326 DOI: 10.3758/s13428-023-02205-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/24/2023] [Indexed: 08/18/2023]
Abstract
Given English orthography's quasi-regular nature, applying common decoding rules to a word does not always result in a correct pronunciation matching the stored phonological form (e.g., the word tongue). To arrive at a correct pronunciation, developing readers must make the match between a decoded pronunciation and a word's correct pronunciation stored in memory. Developmentally, this matching process varies as a function of child skill (e.g., decoding, vocabulary) and word characteristics (e.g., spelling-to-pronunciation transparency, concreteness), with each being continuously distributed. Spelling-to-pronunciation transparency ratings represent a global measure of the ease of arriving at a word's correct pronunciation from its decoded pronunciation and in experimental studies has been shown to be a critical dimension in assessing the difficulty of a word for developing readers (e.g., Steacy et al., 2022a, 2022b). This study aimed to create a database of spelling-to-pronunciation transparency ratings for the 23,282 most frequently written English words, made available in the supplemental materials for future analyses. We asked adults to rate words' spelling-to-pronunciation transparency on a scale of 1-6 (1 = very easy to match, 6 = very difficult). Results of multiple regression analyses revealed variance in ratings to be unaccounted for by other word features, demonstrating the uniqueness of these ratings. Furthermore, words that are considered irregular, classified previously as strange, or contained at least one schwa received higher ratings, demonstrating strong associations between transparency and regularity. Lastly, these ratings significantly predicted both adult word naming time and child word reading accuracy above and beyond other word features known to predict reading.
Collapse
Affiliation(s)
- Ashley A Edwards
- Florida Center for Reading Research, Florida State University, Tallahassee, FL, USA.
| | - Valeria M Rigobon
- Florida Center for Reading Research, Florida State University, Tallahassee, FL, USA
| | - Laura M Steacy
- Florida Center for Reading Research, Florida State University, Tallahassee, FL, USA
| | - Donald L Compton
- Florida Center for Reading Research, Florida State University, Tallahassee, FL, USA
| |
Collapse
|
6
|
Khanna MM, Cortese MJ, Hughes K, Anderson K. Spelling monosyllabic English words. Q J Exp Psychol (Hove) 2024; 77:729-746. [PMID: 37211676 DOI: 10.1177/17470218231178507] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Abstract
We conducted a megastudy to examine the spelling of American English monosyllables with typewritten responses. We related both sublexical and lexical/semantic factors to spelling accuracy and reaction time (RT) for the first keypress and response duration for spelling 1,856 monophonic monosyllables. We found that (a) each of 13 predictor variables was significantly related to performance for at least one measure, (b) orthographic length was unrelated to the first key RT, but did relate to accuracy and response duration, (c) sound-spelling and spelling-sound consistency was related to performance, and in particular, onset consistency related to accuracy and first key RT, but was unrelated to response duration, (d) contextual diversity was consistently related to performance across all measures, and (e) age of acquisition (AoA) was related to all measures, but was related more to the first key RT than response duration. The results indicate that people begin the spelling process once they identify the first letter, and they continue to process the spelling pattern as the response unfolds. These results are best explained by a parallel-distributed-processing framework.
Collapse
Affiliation(s)
- Maya M Khanna
- Department of Psychological Science, Creighton University, Omaha, NE, USA
| | | | - Kylie Hughes
- Department of Psychological Science, Creighton University, Omaha, NE, USA
- Doane University, Crete, NE, USA
| | - Kayley Anderson
- Department of Psychological Science, Creighton University, Omaha, NE, USA
- University of Nebraska Medical Center, Omaha, NE, USA
| |
Collapse
|
7
|
Lim A, O'Brien B, Onnis L. Orthography-phonology consistency in English: Theory- and data-driven measures and their impact on auditory vs. visual word recognition. Behav Res Methods 2024; 56:1283-1313. [PMID: 37553536 PMCID: PMC10991026 DOI: 10.3758/s13428-023-02094-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/15/2023] [Indexed: 08/10/2023]
Abstract
Research on orthographic consistency in English words has selectively identified different sub-syllabic units in isolation (grapheme, onset, vowel, coda, rime), yet there is no comprehensive assessment of how these measures affect word identification when taken together. To study which aspects of consistency are more psychologically relevant, we investigated their independent and composite effects on human reading behavior using large-scale databases. Study 1 found effects on adults' naming responses of both feedforward consistency (orthography to phonology) and feedback consistency (phonology to orthography). Study 2 found feedback but no feedforward consistency effects on visual and auditory lexical decision tasks, with the best predictor being a composite measure of consistency across grapheme, rime, OVC, and word-initial letter-phoneme. In Study 3, we explicitly modeled the reading process with forward and backward flow in a bidirectionally connected neural network. The model captured latent dimensions of quasi-regular mapping that explain additional variance in human reading and spelling behavior, compared to the established measures. Together, the results suggest interactive activation between phonological and orthographic word representations. They also validate the role of computational analyses of language to better understand how print maps to sound, and what properties of natural language affect reading complexity.
Collapse
Affiliation(s)
- Alfred Lim
- School of Psychology, University of Nottingham Malaysia, Semenyih, Selangor, Malaysia
- Centre for Research in Child Development (CRCD), National Institute of Education, Singapore, Singapore
| | - Beth O'Brien
- Centre for Research in Child Development (CRCD), National Institute of Education, Singapore, Singapore
- Centre for Research and Development on Learning (CRADLE), Nanyang Technological University, Singapore, Singapore
| | - Luca Onnis
- Centre for Multilingualism in Society across the Lifespan, University of Oslo, Semenyih, Selangor, Malaysia.
- Department of Linguistics and Scandinavian Studies, University of Oslo, Oslo, Norway.
| |
Collapse
|
8
|
Gao C, Shinkareva SV, Desai RH. SCOPE: The South Carolina psycholinguistic metabase. Behav Res Methods 2023; 55:2853-2884. [PMID: 35971041 PMCID: PMC10231664 DOI: 10.3758/s13428-022-01934-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/13/2022] [Indexed: 11/08/2022]
Abstract
The number of databases that provide various measurements of lexical properties for psycholinguistic research has increased rapidly in recent years. The proliferation of lexical variables, and the multitude of associated databases, makes the choice, comparison, and standardization of these variables in psycholinguistic research increasingly difficult. Here, we introduce The South Carolina Psycholinguistic Metabase (SCOPE), which is a metabase (or a meta-database) containing an extensive, curated collection of psycholinguistic variable values from major databases. The metabase currently contains 245 lexical variables, organized into seven major categories: General (e.g., frequency), Orthographic (e.g., bigram frequency), Phonological (e.g., phonological uniqueness point), Orth-Phon (e.g., consistency), Semantic (e.g., concreteness), Morphological (e.g., number of morphemes), and Response variables (e.g., lexical decision latency). We hope that SCOPE will become a valuable resource for researchers in psycholinguistics and affiliated disciplines such as cognitive neuroscience of language, computational linguistics, and communication disorders. The availability and ease of use of the metabase with comprehensive set of variables can facilitate the understanding of the unique contribution of each of the variables to word processing, and that of interactions between variables, as well as new insights and development of improved models and theories of word processing. It can also help standardize practice in psycholinguistics. We demonstrate use of the metabase by measuring relationships between variables in multiple ways and testing their individual contribution towards a number of dependent measures, in the most comprehensive analysis of this kind to date. The metabase is freely available at go.sc.edu/scope.
Collapse
Affiliation(s)
- Chuanji Gao
- Donders Institute for Brain, Cognition and Behaviour, Radboud University, Nijmegen, Netherlands
| | - Svetlana V Shinkareva
- Department of Psychology, Institute for Mind and Brain, University of South Carolina, Columbia, SC, 29201, USA.
| | - Rutvik H Desai
- Department of Psychology, Institute for Mind and Brain, University of South Carolina, Columbia, SC, 29201, USA.
| |
Collapse
|
9
|
Distinct neural sources underlying visual word form processing as revealed by steady state visual evoked potentials (SSVEP). Sci Rep 2021; 11:18229. [PMID: 34521874 PMCID: PMC8440525 DOI: 10.1038/s41598-021-95627-x] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2021] [Accepted: 07/21/2021] [Indexed: 11/29/2022] Open
Abstract
EEG has been central to investigations of the time course of various neural functions underpinning visual word recognition. Recently the steady-state visual evoked potential (SSVEP) paradigm has been increasingly adopted for word recognition studies due to its high signal-to-noise ratio. Such studies, however, have been typically framed around a single source in the left ventral occipitotemporal cortex (vOT). Here, we combine SSVEP recorded from 16 adult native English speakers with a data-driven spatial filtering approach—Reliable Components Analysis (RCA)—to elucidate distinct functional sources with overlapping yet separable time courses and topographies that emerge when contrasting words with pseudofont visual controls. The first component topography was maximal over left vOT regions with a shorter latency (approximately 180 ms). A second component was maximal over more dorsal parietal regions with a longer latency (approximately 260 ms). Both components consistently emerged across a range of parameter manipulations including changes in the spatial overlap between successive stimuli, and changes in both base and deviation frequency. We then contrasted word-in-nonword and word-in-pseudoword to test the hierarchical processing mechanisms underlying visual word recognition. Results suggest that these hierarchical contrasts fail to evoke a unitary component that might be reasonably associated with lexical access.
Collapse
|