1
|
Alkhurayyif Y, Wahab Sait AR. Deep learning-driven dyslexia detection model using multi-modality data. PeerJ Comput Sci 2024; 10:e2077. [PMID: 38983227 PMCID: PMC11232624 DOI: 10.7717/peerj-cs.2077] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2024] [Accepted: 05/02/2024] [Indexed: 07/11/2024]
Abstract
Background Dyslexia is a neurological disorder that affects an individual's language processing abilities. Early care and intervention can help dyslexic individuals succeed academically and socially. Recent developments in deep learning (DL) approaches motivate researchers to build dyslexia detection models (DDMs). DL approaches facilitate the integration of multi-modality data. However, there are few multi-modality-based DDMs. Methods In this study, the authors built a DL-based DDM using multi-modality data. A squeeze and excitation (SE) integrated MobileNet V3 model, self-attention mechanisms (SA) based EfficientNet B7 model, and early stopping and SA-based Bi-directional long short-term memory (Bi-LSTM) models were developed to extract features from magnetic resonance imaging (MRI), functional MRI, and electroencephalography (EEG) data. In addition, the authors fine-tuned the LightGBM model using the Hyperband optimization technique to detect dyslexia using the extracted features. Three datasets containing FMRI, MRI, and EEG data were used to evaluate the performance of the proposed DDM. Results The findings supported the significance of the proposed DDM in detecting dyslexia with limited computational resources. The proposed model outperformed the existing DDMs by producing an optimal accuracy of 98.9%, 98.6%, and 98.8% for the FMRI, MRI, and EEG datasets, respectively. Healthcare centers and educational institutions can benefit from the proposed model to identify dyslexia in the initial stages. The interpretability of the proposed model can be improved by integrating vision transformers-based feature extraction.
Collapse
Affiliation(s)
| | - Abdul Rahaman Wahab Sait
- Department of Documents and Archive, Center of Documents and Administrative Communication, King Faisal University, Al-Ahsa, Saudi Arabia
| |
Collapse
|
2
|
Improved Graph Convolutional Neural Network for Dance Tracking and Pose Estimation. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2022; 2022:7133491. [PMID: 35795760 PMCID: PMC9252656 DOI: 10.1155/2022/7133491] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/30/2022] [Accepted: 05/31/2022] [Indexed: 12/03/2022]
Abstract
Movement recognition technology is widely used in various practical application scenarios, but there are few researches on dance movement recognition at present. Aiming at the problem of low accuracy of dance movement recognition due to complex pose changes in dance movements, this paper designed an improved graph convolutional neural network algorithm for dance tracking and pose estimation. In this method, the spatial and temporal characteristics of motion are extracted from the skeleton joint diagram of human body. Then, GCN (graph convolutional neural) is used to extract potential spatial information between skeleton nodes. Finally, LSTM (long short-term memory) extracts the time series features before and after human actions as a supplement and performs late fusion of the prediction outputs of the two networks, respectively, to improve the problem of insufficient generalization ability of single network. The experimental results show that this method can effectively improve the accuracy of dance movement recognition in general movement recognition data set and dance pose data set. It has certain application value in dance self-help teaching, professional dancer movement correction, and other application scenarios.
Collapse
|
3
|
Visual analysis of eye movements during micro-stories reading. J Vis (Tokyo) 2022. [DOI: 10.1007/s12650-022-00845-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|
4
|
Fan K, Cao J, Meng Z, Zhu J, Ma H, Ng ACM, Ng T, Qian W, Qi S. Predicting the Reader's English Level from Reading Fixation Patterns Using the Siamese Convolutional Neural Network. IEEE Trans Neural Syst Rehabil Eng 2022; 30:1071-1080. [PMID: 35259110 DOI: 10.1109/tnsre.2022.3157768] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
Abstract-Among numerous functions performed by the human eye, reading is a common task that best reflects an individual's understanding and cognitive patterns. Previous studies showed that text comprehension may be determined by comprehension monitoring, a metacognitive process that evaluates and regulates the pattern of comprehension. Herein, we propose a hypothesis: an individual's cognitive pattern during reading is predictive of the level of reading comprehension. According to the criteria of the College English Test Band Six (CET-6), 80 participants (sophomore and junior) were divided into a pass group (n = 40) and a non-pass group (n = 40). Heatmaps of eye fixation counts were collected by an eye-tracker while each participant executed four reading comprehension tests. Using these heatmaps as inputs, we proposed the Siamese convolutional neural network models to predict the English level of participants. Both strategies of "Trained from scratch" and "Pretrained with fine-tuning" were employed. "Soft Voting" was applied to integrate the predictions from four tests. Results showed that the Siamese network model trained by the datasets with the cluster radius of fixation equal to 25 pixels and connection layer by L1 norm distance has a satisfactory or superior performance to other comparative experiments. The AUC values of Siamese networks trained by the two strategies reach 0.941 and 0.956, respectively. This indicates that the individual reading cognitive pattern captured by the eye-tracker can predict the level of reading comprehension through advanced deep learning models.
Collapse
|
5
|
Eye movements during text reading align with the rate of speech production. Nat Hum Behav 2021; 6:429-442. [PMID: 34873275 DOI: 10.1038/s41562-021-01215-4] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2019] [Accepted: 09/10/2021] [Indexed: 11/08/2022]
Abstract
Across languages, the speech signal is characterized by a predominant modulation of the amplitude spectrum between about 4.3 and 5.5 Hz, reflecting the production and processing of linguistic information chunks (syllables and words) every ~200 ms. Interestingly, ~200 ms is also the typical duration of eye fixations during reading. Prompted by this observation, we demonstrate that German readers sample written text at ~5 Hz. A subsequent meta-analysis of 142 studies from 14 languages replicates this result and shows that sampling frequencies vary across languages between 3.9 Hz and 5.2 Hz. This variation systematically depends on the complexity of the writing systems (character-based versus alphabetic systems and orthographic transparency). Finally, we empirically demonstrate a positive correlation between speech spectrum and eye movement sampling in low-skilled non-native readers, with tentative evidence from post hoc analysis suggesting the same relationship in low-skilled native readers. On the basis of this convergent evidence, we propose that during reading, our brain's linguistic processing systems imprint a preferred processing rate-that is, the rate of spoken language production and perception-onto the oculomotor system.
Collapse
|
6
|
|
7
|
Southwell R, Gregg J, Bixler R, D'Mello SK. What Eye Movements Reveal About Later Comprehension of Long Connected Texts. Cogn Sci 2021; 44:e12905. [PMID: 33029808 DOI: 10.1111/cogs.12905] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2019] [Revised: 09/02/2020] [Accepted: 09/02/2020] [Indexed: 11/29/2022]
Abstract
We know that reading involves coordination between textual characteristics and visual attention, but research linking eye movements during reading and comprehension assessed after reading is surprisingly limited, especially for reading long connected texts. We tested two competing possibilities: (a) the weak association hypothesis: Links between eye movements and comprehension are weak and short-lived, versus (b) the strong association hypothesis: The two are robustly linked, even after a delay. Using a predictive modeling approach, we trained regression models to predict comprehension scores from global eye movement features, using participant-level cross-validation to ensure that the models generalize across participants. We used data from three studies in which readers (Ns = 104, 130, 147) answered multiple-choice comprehension questions ~30 min after reading a 6,500-word text, or after reading up to eight 1,000-word texts. The models generated accurate predictions of participants' text comprehension scores (correlations between observed and predicted comprehension: 0.384, 0.362, 0.372, ps < .001), in line with the strong association hypothesis. We found that making more, but shorter fixations, consistently predicted comprehension across all studies. Furthermore, models trained on one study's data could successfully predict comprehension on the others, suggesting generalizability across studies. Collectively, these findings suggest that there is a robust link between eye movements and subsequent comprehension of a long connected text, thereby connecting theories of low-level eye movements with those of higher order text processing during reading.
Collapse
Affiliation(s)
- Rosy Southwell
- Institute of Cognitive Science, University of Colorado, Boulder
| | - Julie Gregg
- Institute of Cognitive Science, University of Colorado, Boulder
| | - Robert Bixler
- Department of Computer Science and Engineering, University of Notre Dame
| | | |
Collapse
|
8
|
Zhang G, Yuan B, Hua H, Lou Y, Lin N, Li X. Individual differences in first-pass fixation duration in reading are related to resting-state functional connectivity. BRAIN AND LANGUAGE 2021; 213:104893. [PMID: 33360162 DOI: 10.1016/j.bandl.2020.104893] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/29/2020] [Revised: 12/04/2020] [Accepted: 12/06/2020] [Indexed: 06/12/2023]
Abstract
Although there are considerable individual differences in eye movements during text reading, their neural correlates remain unclear. In this study, we investigated the relationship between the first-pass fixation duration (FPFD) in natural reading and resting-state functional connectivity (RSFC) in the brain. We defined the brain regions associated with early visual processing, word identification, attention shifts, and oculomotor control as seed regions. The results showed that individual FPFDs were positively correlated with individual RSFCs between the early visual network, visual word form area, and eye movement control/dorsal attention network. Our findings provide new evidence on the neural correlates of eye movements in text reading and indicate that individual differences in fixation time may shape the RSFC differences in the brain through the time-on-task effect and the mechanism of Hebbian learning.
Collapse
Affiliation(s)
- Guangyao Zhang
- CAS Key Laboratory of Behavioral Science, Institute of Psychology, Beijing, China; Department of Psychology, University of Chinese Academy of Sciences, Beijing, China
| | - Binke Yuan
- Center for Language and Brain, Shenzhen Institute of Neuroscience, Shenzhen, China
| | - Huimin Hua
- CAS Key Laboratory of Behavioral Science, Institute of Psychology, Beijing, China; Department of Psychology, University of Chinese Academy of Sciences, Beijing, China
| | - Ya Lou
- Beijing Institute of Education, Beijing, China
| | - Nan Lin
- CAS Key Laboratory of Behavioral Science, Institute of Psychology, Beijing, China; Department of Psychology, University of Chinese Academy of Sciences, Beijing, China.
| | - Xingshan Li
- CAS Key Laboratory of Behavioral Science, Institute of Psychology, Beijing, China; Department of Psychology, University of Chinese Academy of Sciences, Beijing, China.
| |
Collapse
|
9
|
Kaakinen JK. What Can Eye Movements Tell us about Visual Perception Processes in Classroom Contexts? Commentary on a Special Issue. EDUCATIONAL PSYCHOLOGY REVIEW 2020. [DOI: 10.1007/s10648-020-09573-7] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
Abstract
AbstractIn this commentary to the Special Issue of Educational Psychology Review on visual perceptual processes, I tie the empirical studies reported in the issue with previous research in other domains to offer some points to be considered in future studies. First, I will point out to issues related to the operationalization of the theoretical constructs. The empirical papers in this Special Issue use eye tracking to study students’ engagement, teachers’ expertise, and student-teacher interaction. However, it is not always clear how the observed eye movement patterns reflect these theoretical concepts and the underlying psychological processes. Second, I will reflect on the analyses of the eye movement data presented in the papers. The main advantage of the methodology is that it can provide detailed information about the time-course of processing, and to fully engage its potential, it should be complemented with adequate statistical methods. In my view, the papers in this Special Issue provide valuable novel information about the complex processes underlying learning in variable contexts, and offer an excellent starting point for future research.
Collapse
|
10
|
D’Mello SK, Southwell R, Gregg J. Machine-Learned Computational Models Can Enhance the Study of Text and Discourse: A Case Study Using Eye Tracking to Model Reading Comprehension. DISCOURSE PROCESSES 2020. [DOI: 10.1080/0163853x.2020.1739600] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
Affiliation(s)
| | - Rosy Southwell
- Institute of Cognitive Science, University of Colorado Boulder
| | - Julie Gregg
- Institute of Cognitive Science, University of Colorado Boulder
| |
Collapse
|
11
|
Chang J, Yu R. Hippocampal connectivity in the aftermath of acute social stress. Neurobiol Stress 2019; 11:100195. [PMID: 31832509 PMCID: PMC6889252 DOI: 10.1016/j.ynstr.2019.100195] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2019] [Revised: 08/06/2019] [Accepted: 09/13/2019] [Indexed: 12/12/2022] Open
Abstract
The hippocampus is a core brain region that responds to stress. Previous studies have found a dysconnectivity between hippocampus and other brain regions under acute and chronic stress. However, whether and how acute social stress influences the directed connectivity patterns from and to the hippocampus remains unclear. In this study, using a within-subject design and Granger causal analysis (GCA), we investigated the alterations of resting state effective connectivity from and to hippocampal subregions after an acute social stressor (the Trier Social Stress Test). Participants were engaged in stress and control conditions spaced approximately one month apart. Our findings showed that stress altered the information flows in the thalamus-hippocampus-insula/midbrain circuit. The changes in this circuit could also predict with high accuracy the stress and control conditions at the subject level. These hippocampus-related brain networks have been documented to be involved in emotional information processing and storage, as well as habitual responses. We speculate that alterations of the effective connectivity between these brain regions may be associated with the registering and encoding of threatening stimuli under stress. Our investigation of hippocampal functional connectivity at a subregional level may help elucidate the functional neurobiology of stress-related psychiatric disorders.
Collapse
Affiliation(s)
- Jingjing Chang
- Guangdong Key Laboratory of Mental Health and Cognitive Science, Center for Studies of Psychological Application, School of Psychology, South China Normal University, Guangzhou, China
| | - Rongjun Yu
- Department of Psychology, National University of Singapore, Singapore
| |
Collapse
|
12
|
Xue S, Lüdtke J, Sylvester T, Jacobs AM. Reading Shakespeare Sonnets: Combining Quantitative Narrative Analysis and Predictive Modeling -an Eye Tracking Study. J Eye Mov Res 2019; 12:10.16910/jemr.12.5.2. [PMID: 33828746 PMCID: PMC7968390 DOI: 10.16910/jemr.12.5.2] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
As a part of a larger interdisciplinary project on Shakespeare sonnets' reception (1, 2), the present study analyzed the eye movement behavior of participants reading three of the 154 sonnets as a function of seven lexical features extracted via Quantitative Narrative Analysis (QNA). Using a machine learning-based predictive modeling approach five 'surface' features (word length, orthographic neighborhood density, word frequency, orthographic dissimilarity and sonority score) were detected as important predictors of total reading time and fixation probability in poetry reading. The fact that one phonological feature, i.e., sonority score, also played a role is in line with current theorizing on poetry reading. Our approach opens new ways for future eye movement research on reading poetic texts and other complex literary materials(3).
Collapse
|
13
|
Strukelj A, Niehorster DC. One page of text: Eye movements during regular and thorough reading, skimming, and spell checking. J Eye Mov Res 2018; 11. [PMID: 33828678 PMCID: PMC7198234 DOI: 10.16910/jemr.11.1.1] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Eye movements during regular reading, thorough reading, skimming, and spell checking of single pages of text were measured, to investigate how high-level reading tasks elicited by instructions affect reading behavior. Word frequency and word length effects were found. All results were compared to regular reading. Thorough reading involved longer total reading times and more rereading, and resulted in higher comprehension scores. Skimming involved longer saccades, shorter average fixation durations, more word skipping, shorter total reading times evenly distributed across the page, and resulted in lower comprehension scores. Spell checking involved shorter saccades, longer average fixation durations, less word skipping, longer total reading times evenly distributed across the entire page, and resulted in lower comprehension scores. Replicating local effects shows that paragraphs maintain sufficient experimental rigor, while also enabling reading analyses from a global perspective. Compared to regular reading, thorough reading was more elaborate and less uniform, skimming was faster and more uniform, and spell checking was slower and more uniform.
Collapse
Affiliation(s)
- Alexander Strukelj
- Centre for Languages and Literature, Lund University, Sweden.,The Humanities Laboratory, Lund University, Sweden
| | - Diederick C Niehorster
- The Humanities Laboratory, Lund University, Sweden.,Department of Psychology, Lund University, Sweden
| |
Collapse
|