1
|
Tse CS, Chan YL, Yap MJ, Tsang HC. The Chinese Lexicon Project II: A megastudy of speeded naming performance for 25,000+ traditional Chinese two-character words. Behav Res Methods 2023; 55:4382-4402. [PMID: 36443581 PMCID: PMC9707223 DOI: 10.3758/s13428-022-02022-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/06/2022] [Indexed: 11/29/2022]
Abstract
Using a megastudy approach, (Tse et al., 2017 Behavior Research Methods, 49, 1503-1519) established a large-scale repository of lexical variables and lexical decision responses for more than 25,000 traditional Chinese two-character words. In the current study, we expand their database by collecting norms for speeded naming reaction times (RTs) and accuracy rates, and compiling more lexical variables (e.g., phonological consistency and semantic neighborhood size). Following Tse et al.'s procedure, about 33 college-aged native Cantonese speakers in Hong Kong read aloud each word. We conducted item-level regression analyses to test the relative predictive power of orthographic variables (e.g., stroke count), phonological variables (e.g., phonological consistency), and semantic variables (e.g., semantic transparency) in naming performance. We also compared the effects of lexical variables on naming performance and Tse et al.'s lexical decision performance to examine the extent to which effects are task-specific or task-general. Freely accessible to the research community, this resource provides a valuable addition to other influential mega-databases, such as the English Lexicon Project (Balota et al., 2004 Journal of Experimental Psychology: General, 133, 283-316), and furthers our understanding of Chinese word recognition processes.
Collapse
Affiliation(s)
- Chi-Shing Tse
- Department of Educational Psychology, The Chinese University of Hong Kong, Hong Kong, China.
- Centre for Learning Sciences and Technologies, The Chinese University of Hong Kong, Hong Kong, New Territories, China.
| | - Yuen-Lai Chan
- Department of Educational Psychology, The Chinese University of Hong Kong, Hong Kong, China
| | - Melvin J Yap
- Department of Psychology, National University of Singapore, Singapore, Singapore
| | - Ho Chung Tsang
- Department of Educational Psychology, The Chinese University of Hong Kong, Hong Kong, China
| |
Collapse
|
2
|
Su Y, Li Y, Li H. Familiarity ratings for 24,325 simplified Chinese words. Behav Res Methods 2023; 55:1496-1509. [PMID: 35668341 DOI: 10.3758/s13428-022-01878-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/16/2022] [Indexed: 11/08/2022]
Abstract
The present work collected familiarity norms for 20,275 two-character, 1231 three-character, and 2819 four-character simplified Chinese words from 1300 native speakers of Mandarin Chinese. The familiarity of each word was rated on a 7-point scale by at least 21 participants. The results supported the reliability and validity of the present familiarity ratings, which is the first large familiarity database for Chinese in the field. These familiarity norms can be downloaded from the supplemental materials. Furthermore, the contribution of familiarity to Chinese lexical processing was investigated using the present familiarity ratings and previous data (lexical features and visual lexical decision), mainly from two major Chinese lexicon projects, MELD-SCH and CLP. Regression analysis suggests that familiarity explained a substantial percentage of the variance in lexical processing of both simplified and traditional Chinese words, over and above the effects of word frequency and other lexical features, including age of acquisition (AoA). Further analysis identified a significantly greater familiarity effect for lower-frequency words than that for higher-frequency words. Together, among the first, our findings support the important contribution of familiarity with Chinese words to lexical processing, especially for low-frequency words.
Collapse
Affiliation(s)
- Yongqiang Su
- Beijing Key Laboratory of Applied Experimental Psychology, National Demonstration Center for Experimental Psychology Education (Beijing Normal University), Institute of Children's Reading and Learning, Faculty of Psychology, Beijing Normal University, Room 1415, Houzhu Building, No.19 Xinjiekouwai Street, Haidian, Beijing, China
| | - Yixun Li
- Department of Early Childhood Education, The Education University of Hong Kong, Tai Po, Hong Kong SAR, China
| | - Hong Li
- Beijing Key Laboratory of Applied Experimental Psychology, National Demonstration Center for Experimental Psychology Education (Beijing Normal University), Institute of Children's Reading and Learning, Faculty of Psychology, Beijing Normal University, Room 1415, Houzhu Building, No.19 Xinjiekouwai Street, Haidian, Beijing, China.
| |
Collapse
|
3
|
Imageability ratings for 10,426 Chinese two-character words and their contribution to lexical processing. CURRENT PSYCHOLOGY 2022. [DOI: 10.1007/s12144-022-03404-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
4
|
CCLOWW: A grade-level Chinese children's lexicon of written words. Behav Res Methods 2022:10.3758/s13428-022-01890-9. [PMID: 35776384 DOI: 10.3758/s13428-022-01890-9] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/24/2022] [Indexed: 11/08/2022]
Abstract
In this article, we present the Chinese Children's Lexicon of Written Words (CCLOWW), the first grade-level database that provides frequency statistics of simplified Chinese characters and words for children. The database computes from a corpus of 34,671,424 character tokens and 22,427,010 word tokens (including single- and multicharacter words), extracted from 2131 books. It contains 6746 different character types and 153,079 different word types. CCLOWW provides several frequency indices of simplified Chinese for three grade levels (grade 2 and below, grades 3-4, grades 5-6) to profile children's experience with written Chinese in and outside of school. We describe in this article the distributions of frequency and contextual diversity of the characters and words, as well as word length and syntactic categories of the words in the corpus and the subcorpora. We also report results of correlation analyses with other written corpora and of several naming and lexicon decision experiments. The findings suggest that CCLOWW frequency measures correlate well with other corpora. Importantly, they could reliably predict children's and adults' naming and lexical decision performances. They could also explain variance in adults' visual word recognition, in addition to frequency measures computed in an adult corpus, indicating that early print exposure might influence readers' lexical processing later on beyond an age of acquisition effect. CCLOWW will help researchers in language processing and development as well as educators with selecting language materials appropriate for children's developmental stages. The database is freely available online at https://www.learn2read.cn/database/ .
Collapse
|
5
|
Xiong J, Zhang Y, Ju P. The Effects of Orthographic Neighborhood Size and the Influence of Individual Differences in Linguistic Skills During the Recognition of Chinese Words. Front Psychol 2021; 12:727894. [PMID: 34858267 PMCID: PMC8631189 DOI: 10.3389/fpsyg.2021.727894] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2021] [Accepted: 10/19/2021] [Indexed: 11/13/2022] Open
Abstract
There are still inconsistencies as to whether frequency and orthographic neighborhood size affect the reading and recognition of Chinese words. In addition, research on Chinese reading still adheres to the view that "all skilled readers read in the same way" and pays little attention to the influence of individual differences in linguistic skills on word recognition. In this research, we studied the recognition of Chinese two-character words in a lexical decision task (LDT) by manipulating neighborhood size and word frequency and controlling the frequency of the initial constituent character. Individual differences in linguistic skills were assessed through tests of spelling and reading comprehension. The results showed that: (1) A larger orthographic neighborhood size of the initial character had a facilitative effect on Chinese word recognition. The orthographic neighborhood size effect is modulated by word frequency, but this modulation effect was not stable. (2) Spelling and reading comprehension skills are good indicators to assess individual differences in Chinese linguistic skills, and they are significantly correlated. (3) Individual differences in linguistic skills influence the neighborhood size effect, which is moderated by word frequency.
Collapse
Affiliation(s)
- Jianping Xiong
- Department of Psychology, Faculty of Education, Henan Normal University, Xinxiang, China
| | - Yujie Zhang
- Department of Psychology, Faculty of Education, Henan Normal University, Xinxiang, China
| | - Ping Ju
- Department of Psychology, Faculty of Education, Henan Normal University, Xinxiang, China
| |
Collapse
|
6
|
Abstract
Compounds are morphologically complex words made of different linguistic parts. They are very prevalent in a number of languages such as French. Different psycholinguistic characteristics of compounds have been used in certain studies to investigate the mechanisms involved in compound processing (see Table 7). We provide psycholinguistic norms for a set of 506 French compound words. The words were normed on seven characteristics: lexeme meaning dominance, semantic transparency, sensory experience, conceptual familiarity, imageability, age of acquisition (AoA) and subjective frequency. Reliability measures were computed for the collected norms. Descriptive statistical analyses, and correlational and multiple regression analyses were performed. We also report some comparisons made between our normative data and certain norms obtained in other similar studies. The entire set of norms, which will be very useful to researchers investigating the processing of compounds, is available as Supplemental Material.
Collapse
|
7
|
Wang Y, Jiang M, Huang Y, Qiu P. An ERP Study on the Role of Phonological Processing in Reading Two-Character Compound Chinese Words of High and Low Frequency. Front Psychol 2021; 12:637238. [PMID: 33716906 PMCID: PMC7947322 DOI: 10.3389/fpsyg.2021.637238] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2020] [Accepted: 02/04/2021] [Indexed: 11/17/2022] Open
Abstract
Unlike in English, the role of phonology in word recognition in Chinese is unclear. In this event-related potential experiment, we investigated the role of phonology in reading both high- and low-frequency two-character compound Chinese words. Participants executed semantic and homophone judgment tasks of the same precede-target pairs. Each pair of either high- or low-frequency words were either unrelated (control condition) or related semantically or phonologically (homophones). The induced P200 component was greater for low- than for high-frequency word-pairs both in semantic and phonological tasks. Homophones in the semantic judgment task and semantically-related words in the phonology task both elicited a smaller N400 than the control condition, word frequency-independently. However, for low-frequency words in the phonological judgment task, it was found that the semantically related pairs released a significantly larger P200 than the control condition. Thus, the semantic activation of both high- and low-frequency words may be no later than phonological activation.
Collapse
Affiliation(s)
- Yuling Wang
- Center for Psychology and Cognitive Science, Tsinghua University, Beijing, China
| | - Minghu Jiang
- Center for Psychology and Cognitive Science, Tsinghua University, Beijing, China
| | - Yunlong Huang
- Advanced Innovation Center for Future Education, Beijing Normal University, Beijing, China
| | - Peijun Qiu
- Laboratory of Cognitive Linguistics, Department of Foreign Languages and Literature, Tsinghua University, Beijing, China.,College of International Sport Organizations, Beijing Sport University, Beijing, China
| |
Collapse
|
8
|
Lim RY, Yap MJ, Tse CS. Individual differences in Cantonese Chinese word recognition: Insights from the Chinese Lexicon Project. Q J Exp Psychol (Hove) 2020; 73:504-518. [PMID: 32019427 DOI: 10.1177/1747021820906566] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
The Chinese Lexicon Project is a repository of lexical decision data for 25,286 Cantonese Chinese two-character compound words. To create that repository, 594 participants responded to approximately 1,404 words and 1,404 nonwords over three sessions. Using the data in this repository, the present study examines the variability and reliability of Chinese lexical decision performance, along with the moderating influence of individual differences on lexical processing. We generally found high to very high within- and between-session reliabilities for mean response times, ex-Gaussian parameters, accuracy rates, and a composite proficiency measure tapping lexical processing fluency. Using linear mixed effects models, we also found reliable interactions between fluency and two lexical effects. Specifically, more fluent readers showed larger effects of word frequency and semantic transparency. These results attest to the stability of Chinese word recognition performance, and are most consistent with a flexible lexical processing system that adapts optimally to task demands.
Collapse
Affiliation(s)
- Rosemarie Yh Lim
- Department of Psychology, Faculty of Arts and Social Sciences, National University of Singapore, Singapore
| | - Melvin J Yap
- Department of Psychology, Faculty of Arts and Social Sciences, National University of Singapore, Singapore
| | - Chi-Shing Tse
- The Chinese University of Hong Kong, Shatin, Hong Kong
| |
Collapse
|
9
|
Abstract
We present word prevalence data for 61,858 English words. Word prevalence refers to the number of people who know the word. The measure was obtained on the basis of an online crowdsourcing study involving over 220,000 people. Word prevalence data are useful for gauging the difficulty of words and, as such, for matching stimulus materials in experimental conditions or selecting stimulus materials for vocabulary tests. Word prevalence also predicts word processing times, over and above the effects of word frequency, word length, similarity to other words, and age of acquisition, in line with previous findings in the Dutch language.
Collapse
|
10
|
Normative data for Chinese-English paired associates. Behav Res Methods 2019; 52:440-445. [PMID: 30963462 DOI: 10.3758/s13428-019-01240-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Paired-associate learning is one of the most commonly used paradigms to study human memory. In many of these studies, participants are typically told to learn foreign language-English translations, such as Swahili-English or Lithuanian-English pairs. One limitation of these currently available foreign language-English translation norms is that their foreign languages are based on the alphabetic writing system, thereby preventing researchers from generalizing their findings to languages based on logographic writing systems. In the present study we collected normative data for 160 Chinese-English word pairs. Participants completed three study-test cycles, followed by metacognitive judgments on their learning experience. For each pair, we report recall performance, recall latency, ease of learning, and judgments of learning. A simultaneous multiple regression analysis with frequency (of both the English word and the Chinese character), word length (English), and number of strokes (Chinese) as predictors revealed that a greater number of strokes (or higher visual complexity) for the Chinese characters predicted lower target recall.
Collapse
|