1. Wang J, Shen Y, Zhao J, Wang X, Chen Z, Han T, Huang Y, Wang Y, Zhao W, Wen W, Zhou X, Xu Y. Algorithmic and sensor-based research on Chinese children's and adolescents' screen use behavior and light environment. Front Public Health 2024; 12:1352759. [PMID: 38454995; PMCID: PMC10917963; DOI: 10.3389/fpubh.2024.1352759]
Abstract
Background: Myopia poses a global health concern and is influenced by both genetic and environmental factors. The incidence of myopia tends to increase during infectious outbreaks, such as the COVID-19 pandemic. This study examined the screen-time behaviors of Chinese children and adolescents and investigated the efficacy of artificial intelligence (AI)-based alerts in modifying screen-time practices.
Methods: A cross-sectional analysis was performed using data from 6,716 children and adolescents with AI-enhanced tablets that monitored and recorded their behavior and environmental light during screen time.
Results: The median daily screen time of all participants was 58.82 min. Among all age groups, elementary-school students had the longest median daily screen time, which was 87.25 min and exceeded 4 h per week. Children younger than 2 years engaged with tablets for a median of 41.84 min per day. Learning accounted for 54.88% of participants' screen time, and 51.03% (3,390/6,643) of the participants used tablets for 1 h at an average distance <50 cm. The distance and posture alarms were triggered 807,355 and 509,199 times, respectively. In the study, 70.65% of the participants used the tablet under an illuminance of <300 lux during the day and 61.11% under an illuminance of <100 lux at night. The ambient light of 85.19% of the participants exceeded a color temperature of 4,000 K at night. Most incorrect viewing habits (65.49% in viewing distance; 86.48% in viewing posture) were rectified swiftly following AI notifications (all p < 0.05).
Conclusion: Young children are increasingly using digital screens, with school-age children and adolescents showing longer screen time than preschoolers. The study highlighted inadequate lighting conditions during screen use. AI alerts proved effective in prompting users to correct their screen-related behavior promptly.
Affiliation(s)
- Jifang Wang
- Eye Institute and Department of Ophthalmology, Eye & ENT Hospital, Fudan University, Shanghai, China
- NHC Key Laboratory of Myopia (Fudan University), Key Laboratory of Myopia, Chinese Academy of Medical Sciences, Shanghai, China
- Shanghai Research Center of Ophthalmology and Optometry, Shanghai, China
- Shanghai Engineering Research Center of Laser and Autostereoscopic 3D for Vision Care, Shanghai, China
- Department of Nursing, Eye & ENT Hospital, Fudan University, Shanghai, China
- Yang Shen
- Eye Institute and Department of Ophthalmology, Eye & ENT Hospital, Fudan University, Shanghai, China
- NHC Key Laboratory of Myopia (Fudan University), Key Laboratory of Myopia, Chinese Academy of Medical Sciences, Shanghai, China
- Shanghai Research Center of Ophthalmology and Optometry, Shanghai, China
- Shanghai Engineering Research Center of Laser and Autostereoscopic 3D for Vision Care, Shanghai, China
- Jing Zhao
- Eye Institute and Department of Ophthalmology, Eye & ENT Hospital, Fudan University, Shanghai, China
- NHC Key Laboratory of Myopia (Fudan University), Key Laboratory of Myopia, Chinese Academy of Medical Sciences, Shanghai, China
- Shanghai Research Center of Ophthalmology and Optometry, Shanghai, China
- Shanghai Engineering Research Center of Laser and Autostereoscopic 3D for Vision Care, Shanghai, China
- Xiaoying Wang
- Eye Institute and Department of Ophthalmology, Eye & ENT Hospital, Fudan University, Shanghai, China
- NHC Key Laboratory of Myopia (Fudan University), Key Laboratory of Myopia, Chinese Academy of Medical Sciences, Shanghai, China
- Shanghai Research Center of Ophthalmology and Optometry, Shanghai, China
- Shanghai Engineering Research Center of Laser and Autostereoscopic 3D for Vision Care, Shanghai, China
- Zhi Chen
- Eye Institute and Department of Ophthalmology, Eye & ENT Hospital, Fudan University, Shanghai, China
- NHC Key Laboratory of Myopia (Fudan University), Key Laboratory of Myopia, Chinese Academy of Medical Sciences, Shanghai, China
- Shanghai Research Center of Ophthalmology and Optometry, Shanghai, China
- Shanghai Engineering Research Center of Laser and Autostereoscopic 3D for Vision Care, Shanghai, China
- Tian Han
- Eye Institute and Department of Ophthalmology, Eye & ENT Hospital, Fudan University, Shanghai, China
- NHC Key Laboratory of Myopia (Fudan University), Key Laboratory of Myopia, Chinese Academy of Medical Sciences, Shanghai, China
- Shanghai Research Center of Ophthalmology and Optometry, Shanghai, China
- Shanghai Engineering Research Center of Laser and Autostereoscopic 3D for Vision Care, Shanghai, China
- Yangyi Huang
- Eye Institute and Department of Ophthalmology, Eye & ENT Hospital, Fudan University, Shanghai, China
- NHC Key Laboratory of Myopia (Fudan University), Key Laboratory of Myopia, Chinese Academy of Medical Sciences, Shanghai, China
- Shanghai Research Center of Ophthalmology and Optometry, Shanghai, China
- Shanghai Engineering Research Center of Laser and Autostereoscopic 3D for Vision Care, Shanghai, China
- Yuliang Wang
- Eye Institute and Department of Ophthalmology, Eye & ENT Hospital, Fudan University, Shanghai, China
- NHC Key Laboratory of Myopia (Fudan University), Key Laboratory of Myopia, Chinese Academy of Medical Sciences, Shanghai, China
- Shanghai Research Center of Ophthalmology and Optometry, Shanghai, China
- Shanghai Engineering Research Center of Laser and Autostereoscopic 3D for Vision Care, Shanghai, China
- Wuxiao Zhao
- Eye Institute and Department of Ophthalmology, Eye & ENT Hospital, Fudan University, Shanghai, China
- NHC Key Laboratory of Myopia (Fudan University), Key Laboratory of Myopia, Chinese Academy of Medical Sciences, Shanghai, China
- Shanghai Research Center of Ophthalmology and Optometry, Shanghai, China
- Shanghai Engineering Research Center of Laser and Autostereoscopic 3D for Vision Care, Shanghai, China
- Center for Optometry and Visual Science, Guangxi Academy of Medical Sciences, Nanning, China
- Wen Wen
- Eye Institute and Department of Ophthalmology, Eye & ENT Hospital, Fudan University, Shanghai, China
- NHC Key Laboratory of Myopia (Fudan University), Key Laboratory of Myopia, Chinese Academy of Medical Sciences, Shanghai, China
- Shanghai Research Center of Ophthalmology and Optometry, Shanghai, China
- Shanghai Engineering Research Center of Laser and Autostereoscopic 3D for Vision Care, Shanghai, China
- Xingtao Zhou
- Eye Institute and Department of Ophthalmology, Eye & ENT Hospital, Fudan University, Shanghai, China
- NHC Key Laboratory of Myopia (Fudan University), Key Laboratory of Myopia, Chinese Academy of Medical Sciences, Shanghai, China
- Shanghai Research Center of Ophthalmology and Optometry, Shanghai, China
- Shanghai Engineering Research Center of Laser and Autostereoscopic 3D for Vision Care, Shanghai, China
- Ye Xu
- Eye Institute and Department of Ophthalmology, Eye & ENT Hospital, Fudan University, Shanghai, China
- NHC Key Laboratory of Myopia (Fudan University), Key Laboratory of Myopia, Chinese Academy of Medical Sciences, Shanghai, China
- Shanghai Research Center of Ophthalmology and Optometry, Shanghai, China
- Shanghai Engineering Research Center of Laser and Autostereoscopic 3D for Vision Care, Shanghai, China
2. Li N, Ross R. Invoking and identifying task-oriented interlocutor confusion in human-robot interaction. Front Robot AI 2023; 10:1244381. [PMID: 38054199; PMCID: PMC10694506; DOI: 10.3389/frobt.2023.1244381]
Abstract
Successful conversational interaction with a social robot requires not only an assessment of a user's contribution to an interaction, but also awareness of their emotional and attitudinal states as the interaction unfolds. To this end, our research aims to systematically trigger, and then interpret, human behaviors in order to track different states of potential user confusion in interaction, so that systems can be primed to adjust their policies when users enter confusion states. In this paper, we present a detailed human-robot interaction study to prompt, investigate, and eventually detect confusion states in users. The study employs a Wizard-of-Oz (WoZ) style design with a Pepper robot to prompt confusion states for task-oriented dialogues in a well-defined manner. The data collected from 81 participants include audio and visual data, from both the robot's perspective and the environment, as well as participant survey data. From these data, we evaluated the correlations of induced confusion conditions with multimodal data, including eye gaze estimation, head pose estimation, facial emotion detection, silence duration, and user speech analysis, including emotion and pitch analysis. The analysis shows significant differences in participants' behaviors in states of confusion based on these signals, as well as a strong correlation between confusion conditions and participants' own self-reported confusion scores. The paper establishes strong correlations between confusion levels and these observable features, and lays the groundwork for a more complete social- and affect-oriented strategy for task-oriented human-robot interaction. The contributions of this paper include the methodology applied, the dataset, and our systematic analysis.
Affiliation(s)
- Na Li
- School of Computer Science, Technological University Dublin, Ireland
3. Zhong R, He L, Wang H, Yuan L, Li K, Liu Z. Attention-Guided Huber Loss for Head Pose Estimation Based on Improved Capsule Network. Entropy (Basel) 2023; 25:1024. [PMID: 37509971; PMCID: PMC10378512; DOI: 10.3390/e25071024]
Abstract
Head pose estimation is an important technology for analyzing human behavior and has been widely researched and applied in areas such as human-computer interaction and fatigue detection. However, traditional head pose estimation networks easily lose spatial structure information, particularly in complex scenarios where occlusions and multiple object detections are common, resulting in low accuracy. To address these issues, we propose a head pose estimation model based on the residual network and the capsule network. First, a deep residual network is used to extract features from three stages, capturing spatial structure information at different levels, and a global attention block is employed to enhance the spatial weight of feature extraction. To effectively avoid the loss of spatial structure information, the features are encoded and transmitted to the output using an improved capsule network, whose generalization ability is enhanced through a self-attention routing mechanism. To improve the robustness of the model, we optimize the Huber loss, which is applied here to head pose estimation for the first time. Finally, experiments are conducted on three popular public datasets, 300W-LP, AFLW2000, and BIWI. The results demonstrate that the proposed method achieves state-of-the-art results, particularly in scenarios with occlusions.
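The Huber loss referred to in this abstract is a standard robust-regression loss: quadratic for small errors and linear for large ones. The paper's attention-guided variant is not reproduced here; as an illustrative sketch of the plain form only:

```python
def huber_loss(error, delta=1.0):
    """Standard Huber loss: quadratic for |error| <= delta, linear beyond it,
    which makes regression less sensitive to outlier pose angles."""
    abs_err = abs(error)
    if abs_err <= delta:
        return 0.5 * error ** 2
    return delta * (abs_err - 0.5 * delta)
```

For pose regression, `error` would be the difference between a predicted and a ground-truth angle; `delta` controls where the loss switches from quadratic to linear.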
Affiliation(s)
- Runhao Zhong
- School of Mechanical Engineering, Xinjiang University, Urumqi 830046, China
- Li He
- School of Mechanical Engineering, Xinjiang University, Urumqi 830046, China
- Hongwei Wang
- School of Mechanical Engineering, Xinjiang University, Urumqi 830046, China
- Liang Yuan
- School of Mechanical Engineering, Xinjiang University, Urumqi 830046, China
- School of Information Science and Technology, Beijing University of Chemical Technology, Beijing 100029, China
- Kexin Li
- School of Mechanical Engineering, Xinjiang University, Urumqi 830046, China
- Zhening Liu
- School of Mechanical Engineering, Xinjiang University, Urumqi 830046, China
4. Xu H, Zhang J, Sun H, Qi M, Kong J. Analyzing students' attention by gaze tracking and object detection in classroom teaching. Data Technologies and Applications 2023. [DOI: 10.1108/dta-09-2021-0236]
Abstract
Purpose: Attention is one of the most important factors affecting the academic performance of students. Effectively analyzing students' attention in class can promote teachers' precise teaching and students' personalized learning. To intelligently analyze students' attention in the classroom from the first-person perspective, this paper proposes a fusion model based on gaze tracking and object detection. In particular, the proposed attention analysis model does not depend on any smart equipment.
Design/methodology/approach: Given a first-person view video of students' learning, the authors first estimate the gazing point by using the deep space–time neural network. Second, single shot multi-box detector and fast segmentation convolutional neural network are comparatively adopted to accurately detect the objects in the video. Third, they predict the gazing objects by combining the results of gazing point estimation and object detection. Finally, the personalized attention of students is analyzed based on the predicted gazing objects and the measurable eye movement criteria.
Findings: A large number of experiments are carried out on a public database and a new dataset built in a real classroom. The experimental results show that the proposed model not only accurately tracks the students' gazing trajectory and effectively analyzes the fluctuation of attention of the individual student and all students, but also provides a valuable reference to evaluate the learning process of students.
Originality/value: The contributions of this paper can be summarized as follows. The analysis of students' attention plays an important role in improving teaching quality and student achievement, yet there is little research on how to automatically and intelligently analyze students' attention. To alleviate this problem, this paper focuses on analyzing students' attention by gaze tracking and object detection in classroom teaching, which is significant for practical application in the field of education. The authors propose an effective, intelligent fusion model based on the deep neural network, which mainly includes the gazing point module and the object detection module, to analyze students' attention in classroom teaching without relying on any smart wearable device. They introduce the attention mechanism into the gazing point module to improve the performance of gazing point detection and perform comparison experiments on the public dataset to prove that the gazing point module can achieve better performance. They associate the eye movement criteria with visual gaze to obtain quantifiable objective data for students' attention analysis, which can provide a valuable basis to evaluate the learning process of students, provide useful learning information for both parents and teachers, and support the development of individualized teaching. They built a new database containing the first-person view videos of 11 subjects in a real classroom and employ it to evaluate the effectiveness and feasibility of the proposed model.
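The fusion step described above, combining a gazing-point estimate with detected objects to predict what a student is looking at, reduces in its simplest form to a point-in-box test. This is an illustrative sketch, not the authors' implementation; the function name and data layout are assumptions:

```python
def gazed_object(gaze_point, detections):
    """Return the label of the first detected object whose bounding box
    contains the estimated gaze point, or None if the gaze falls on no object.
    `detections` is a list of (label, (x1, y1, x2, y2)) boxes in pixel coords."""
    gx, gy = gaze_point
    for label, (x1, y1, x2, y2) in detections:
        if x1 <= gx <= x2 and y1 <= gy <= y2:
            return label
    return None
```

Running this per frame over a gaze trajectory yields the sequence of gazed objects from which attention fluctuation can then be summarized.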
5. Eyvazpour R, Shoaran M, Karimian G. Hardware implementation of SLAM algorithms: a survey on implementation approaches and platforms. Artif Intell Rev 2022. [DOI: 10.1007/s10462-022-10310-5]
6. Vankayalapati HD, Kuchibhotla S, Chadalavada MSK, Dargar SK, Anne KR, Kyandoghere K. A Novel Zernike Moment-Based Real-Time Head Pose and Gaze Estimation Framework for Accuracy-Sensitive Applications. Sensors (Basel) 2022; 22:8449. [PMID: 36366147; PMCID: PMC9658879; DOI: 10.3390/s22218449]
Abstract
A real-time head pose and gaze estimation (HPGE) algorithm has excellent potential for technological advancements in both human-machine and human-robot interactions. For example, in accuracy-sensitive applications such as Driver Assistance Systems (DAS), HPGE plays a crucial role in avoiding accidents and road hazards. In this paper, the authors propose a new hybrid framework for improved estimation that combines the appearance-based and geometric-based conventional methods to extract local and global features. The Zernike moments algorithm is used to extract rotation-, scale-, and illumination-invariant features, after which conventional discriminant algorithms classify the head poses and gaze direction. Furthermore, experiments were performed on standard datasets and real-time images to analyze the accuracy of the proposed algorithm. The proposed framework immediately estimated the range of direction changes under different illumination conditions. We obtained an accuracy of ~85%; the average response times were 21.52 ms and 7.483 ms for estimating head pose and gaze, respectively, independent of illumination, background, and occlusion. The proposed method is promising for the future development of a robust system that remains invariant even under blurring conditions and thus achieves much more significant performance enhancement.
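The rotation-invariant features mentioned in this abstract come from Zernike moments, whose radial component has a standard closed form. As an illustrative sketch (not the authors' code), the radial polynomial R_n^m can be computed as:

```python
from math import factorial

def zernike_radial(n, m, rho):
    """Radial component R_n^m(rho) of the Zernike polynomial, for 0 <= rho <= 1.
    Returns 0 when n - |m| is odd, per the standard definition. The magnitudes
    of Zernike moments built from these polynomials are rotation invariant."""
    m = abs(m)
    if (n - m) % 2:
        return 0.0
    return sum(
        (-1) ** k * factorial(n - k)
        / (factorial(k) * factorial((n + m) // 2 - k) * factorial((n - m) // 2 - k))
        * rho ** (n - 2 * k)
        for k in range((n - m) // 2 + 1)
    )
```

For example, R_2^0(rho) = 2·rho² − 1, so `zernike_radial(2, 0, 0.5)` evaluates to −0.5.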
Affiliation(s)
- Hima Deepthi Vankayalapati
- Department of Electronics and Communication Engineering, Kalasalingam Academy of Research and Education, Krishnankovil 626126, India
- Swarna Kuchibhotla
- Department of Computer Science and Engineering, Koneru Lakshmaiah Education Foundation, Vaddeswaram 522302, India
- Mohan Sai Kumar Chadalavada
- Department of Electronics and Communication Engineering, VelTech Rangarajan Dr. Sagunthala R&D Institute of Science and Technology, Chennai 600062, India
- Shashi Kant Dargar
- Department of Electronics and Communication Engineering, Kalasalingam Academy of Research and Education, Krishnankovil 626126, India
- Koteswara Rao Anne
- Department of Computer Science and Engineering, Kalasalingam Academy of Research and Education, Krishnankovil 626126, India
- Kyamakya Kyandoghere
- Institute for Smart Systems Technologies, University Klagenfurt, 9020 Klagenfurt am Wörthersee, Austria
7. Thai C, Tran V, Bui M, Nguyen D, Ninh H, Tran H. Real-time masked face classification and head pose estimation for RGB facial image via knowledge distillation. Inf Sci (N Y) 2022. [DOI: 10.1016/j.ins.2022.10.074]
8. Hammadi Y, Grondin F, Ferland F, Lebel K. Evaluation of Various State of the Art Head Pose Estimation Algorithms for Clinical Scenarios. Sensors (Basel) 2022; 22:6850. [PMID: 36146199; PMCID: PMC9502716; DOI: 10.3390/s22186850]
Abstract
Head pose assessment can reveal important clinical information on human motor control. Quantitative assessments have the potential to objectively evaluate head pose and movement specifics in order to monitor the progression of a disease or the effectiveness of a treatment. Optoelectronic camera-based motion-capture systems, recognized as a gold standard in clinical biomechanics, have been proposed for head pose estimation. However, these systems require markers to be positioned on the person's face, which is impractical for everyday clinical practice. Furthermore, the limited access to this type of equipment and the emerging trend to assess mobility in natural environments support the development of algorithms capable of estimating head orientation using off-the-shelf sensors, such as RGB cameras. Although artificial vision is a popular field of research, limited validation of human pose estimation based on image recognition suitable for clinical applications has been performed. This paper first provides a brief review of available head pose estimation algorithms in the literature. Current state-of-the-art head pose algorithms designed to capture the facial geometry from videos, OpenFace 2.0, MediaPipe, and 3DDFA_V2, are then further evaluated and compared. Accuracy is assessed by comparing each approach to a baseline measured with an optoelectronic camera-based motion-capture system. Results reveal a mean error lower than or equal to 5.6° for 3DDFA_V2 depending on the plane of movement, while the mean error reaches 14.1° and 11.0° for OpenFace 2.0 and MediaPipe, respectively. This demonstrates the superiority of the 3DDFA_V2 algorithm in estimating head pose in different directions of motion, and suggests that this algorithm can be used in clinical scenarios.
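The mean errors reported here are angular, so any comparable evaluation must account for wraparound when differencing angles. As a hedged sketch (the paper's exact protocol is not reproduced), a wrap-aware mean absolute error over a sequence of Euler angles can be computed as:

```python
def angular_mae(predicted, reference):
    """Mean absolute error between two equal-length lists of angles in degrees,
    wrapping each difference into [-180, 180) so that e.g. 350 deg vs 10 deg
    counts as a 20 deg error rather than 340 deg."""
    assert len(predicted) == len(reference) and predicted
    errors = [abs(((p - r + 180.0) % 360.0) - 180.0)
              for p, r in zip(predicted, reference)]
    return sum(errors) / len(errors)
```

Computing this separately per rotation axis (roll, yaw, pitch) matches the per-plane reporting style used in the abstract.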
Affiliation(s)
- Yassine Hammadi
- Department of Electrical and Computer Engineering, Faculty of Engineering, Université de Sherbrooke, Sherbrooke, QC J1H 5N4, Canada
- Research Center on Aging, Sherbrooke, QC J1H 4C4, Canada
- François Grondin
- Department of Electrical and Computer Engineering, Faculty of Engineering, Université de Sherbrooke, Sherbrooke, QC J1H 5N4, Canada
- Research Center on Aging, Sherbrooke, QC J1H 4C4, Canada
- Interdisciplinary Institute for Technological Innovation (3IT), Université de Sherbrooke, Sherbrooke, QC J1K 0A5, Canada
- François Ferland
- Department of Electrical and Computer Engineering, Faculty of Engineering, Université de Sherbrooke, Sherbrooke, QC J1H 5N4, Canada
- Interdisciplinary Institute for Technological Innovation (3IT), Université de Sherbrooke, Sherbrooke, QC J1K 0A5, Canada
- Karina Lebel
- Department of Electrical and Computer Engineering, Faculty of Engineering, Université de Sherbrooke, Sherbrooke, QC J1H 5N4, Canada
- Research Center on Aging, Sherbrooke, QC J1H 4C4, Canada
9. An improved hand gesture recognition system using keypoints and hand bounding boxes. Array 2022. [DOI: 10.1016/j.array.2022.100251]
10. Ahmad MI, Refik R. “No Chit Chat!” A Warning From a Physical Versus Virtual Robot Invigilator: Which Matters Most? Front Robot AI 2022; 9:908013. [PMID: 35937616; PMCID: PMC9355029; DOI: 10.3389/frobt.2022.908013]
Abstract
Past work has not considered social robots as proctors or monitors to prevent cheating or maintain discipline in the context of exam invigilation with adults. Further, the invigilation role of a robot presented in two different embodiments (physical vs. virtual) has not been investigated. We demonstrate a system that enables a robot (physical and virtual) to act as an invigilator, and deploy an exam setup in which two participants complete a programming task. We conducted two studies (an online video-based survey and an in-person evaluation) to understand participants' perceptions of the invigilator robot presented in two different embodiments. Additionally, we investigated whether participants showed cheating behaviours in one condition more than the other. The findings showed that participants' ratings did not differ significantly between embodiments. Further, participants were more talkative in the virtual robot condition than in the physical robot condition. These findings are promising and call for further research into the invigilation role of social robots in more subtle and complex exam-like settings.
11. An Improved Tiered Head Pose Estimation Network with Self-Adjust Loss Function. Entropy 2022; 24:e24070974. [PMID: 35885197; PMCID: PMC9320982; DOI: 10.3390/e24070974]
Abstract
As an important task in computer vision, head pose estimation has been widely applied in both academia and industry. However, there remain two challenges in the field of head pose estimation: (1) even for the same task (e.g., tiredness detection), existing algorithms usually treat the estimation of the three angles (i.e., roll, yaw, and pitch) as separate facets, which disregards their interplay as well as their differences, and thus share the same parameters for all layers; and (2) the discontinuity in angle estimation definitely reduces accuracy. To solve these two problems, a THESL-Net (tiered head pose estimation with self-adjust loss network) model is proposed in this study. First, an idea of stepped estimation using distinct network layers is proposed, allowing greater freedom during angle estimation. Furthermore, the reasons for the discontinuity in angle estimation are revealed, including not only labeling the dataset with quaternions or Euler angles, but also loss functions that simply add the classification and regression losses. Subsequently, a self-adjustment constraint is applied to the loss function, making the angle estimation more consistent. Finally, to examine the influence of different angle ranges on the proposed model, experiments are conducted on three popular public benchmark datasets, BIWI, AFLW2000, and UPNA, demonstrating that the proposed model outperforms state-of-the-art approaches.
12. Zeng D, Wu Z, Ding C, Ren Z, Yang Q, Xie S. Labeled-Robust Regression: Simultaneous Data Recovery and Classification. IEEE Transactions on Cybernetics 2022; 52:5026-5039. [PMID: 33151887; DOI: 10.1109/tcyb.2020.3026101]
Abstract
Rank minimization is widely used to extract low-dimensional subspaces. As a convex relaxation of the rank minimization, the problem of nuclear norm minimization has been attracting widespread attention. However, the standard nuclear norm minimization usually results in overcompression of data in all subspaces and eliminates the discrimination information between different categories of data. To overcome these drawbacks, in this article, we introduce the label information into the nuclear norm minimization problem and propose a labeled-robust principal component analysis (L-RPCA) to realize nuclear norm minimization on multisubspace data. Compared with the standard nuclear norm minimization, our method can effectively utilize the discriminant information in multisubspace rank minimization and avoid excessive elimination of local information and multisubspace characteristics of the data. Then, an effective labeled-robust regression (L-RR) method is proposed to simultaneously recover the data and labels of the observed data. Experiments on real datasets show that our proposed methods are superior to other state-of-the-art methods.
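Nuclear norm minimization problems of the kind described above are typically solved by repeatedly applying singular value thresholding, the proximal operator of the nuclear norm. As an illustrative sketch only, not the authors' L-RPCA/L-RR algorithm:

```python
import numpy as np

def svt(X, tau):
    """Singular value thresholding: shrink each singular value of X by tau
    (clipping at zero). This is the proximal operator of tau * nuclear norm,
    the basic step in nuclear-norm-minimization solvers such as RPCA."""
    U, s, Vt = np.linalg.svd(X, full_matrices=False)
    return U @ np.diag(np.maximum(s - tau, 0.0)) @ Vt
```

Because small singular values are zeroed outright, iterating this step drives the estimate toward a low-rank matrix; the over-compression the authors criticize comes from applying the same shrinkage across all subspaces.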
13. Zohary E, Harari D, Ullman S, Ben-Zion I, Doron R, Attias S, Porat Y, Sklar AY, Mckyton A. Gaze following requires early visual experience. Proc Natl Acad Sci U S A 2022; 119:e2117184119. [PMID: 35549552; PMCID: PMC9171757; DOI: 10.1073/pnas.2117184119]
Abstract
Gaze understanding—a suggested precursor for understanding others’ intentions—requires recovery of gaze direction from the observed person's head and eye position. This challenging computation is naturally acquired at infancy without explicit external guidance, but can it be learned later if vision is extremely poor throughout early childhood? We addressed this question by studying gaze following in Ethiopian patients with early bilateral congenital cataracts diagnosed and treated by us only at late childhood. This sight restoration provided a unique opportunity to directly address basic issues on the roles of “nature” and “nurture” in development, as it caused a selective perturbation to the natural process, eliminating some gaze-direction cues while leaving others still available. Following surgery, the patients’ visual acuity typically improved substantially, allowing discrimination of pupil position in the eye. Yet, the patients failed to show eye gaze-following effects and fixated less than controls on the eyes—two spontaneous behaviors typically seen in controls. Our model for unsupervised learning of gaze direction explains how head-based gaze following can develop under severe image blur, resembling preoperative conditions. It also suggests why, despite acquiring sufficient resolution to extract eye position, automatic eye gaze following is not established after surgery due to lack of detailed early visual experience. We suggest that visual skills acquired in infancy in an unsupervised manner will be difficult or impossible to acquire when internal guidance is no longer available, even when sufficient image resolution for the task is restored. This creates fundamental barriers to spontaneous vision recovery following prolonged deprivation in early age.
Affiliation(s)
- Ehud Zohary
- The Edmond and Lily Safra Center for Brain Sciences, The Hebrew University of Jerusalem, Jerusalem 91904, Israel
- Daniel Harari
- Department of Computer Science and Applied Mathematics, Weizmann Institute of Science, Rehovot 7610001, Israel
- Shimon Ullman
- Department of Computer Science and Applied Mathematics, Weizmann Institute of Science, Rehovot 7610001, Israel
- Itay Ben-Zion
- Department of Ophthalmology, Padeh Medical Center, Poriya 15208, Israel
- Ravid Doron
- Department of Optometry and Vision Science, Hadassah Academic College, Jerusalem 91010, Israel
- Sara Attias
- The Edmond and Lily Safra Center for Brain Sciences, The Hebrew University of Jerusalem, Jerusalem 91904, Israel
- Yuval Porat
- The Edmond and Lily Safra Center for Brain Sciences, The Hebrew University of Jerusalem, Jerusalem 91904, Israel
| | - Asael Y. Sklar
- The Edmond and Lily Safra Center for Brain Sciences, The Hebrew University of Jerusalem, Jerusalem 91904, Israel
| | - Ayelet Mckyton
- Neurology Department, Hadassah Medical Organization and Faculty of Medicine, Jerusalem 91120, Israel
14
Face Image Analysis Using Machine Learning: A Survey on Recent Trends and Applications. ELECTRONICS 2022. [DOI: 10.3390/electronics11081210] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
Human face image analysis using machine learning is an important element of computer vision. The human face conveys information such as age, gender, identity, emotion, race, and attractiveness to both humans and computer systems. Over the last ten years, face analysis methods using machine learning have received immense attention due to their diverse applications in various tasks. Although many methods have been reported, face image analysis still represents a complicated challenge, particularly for images obtained under ’in the wild’ conditions. This survey paper presents a comprehensive review of methods in both controlled and uncontrolled conditions. Our work illustrates the merits and demerits of each previously proposed method, starting from seminal works on face image analysis and ending with the latest ideas exploiting deep learning frameworks. We compare the performance of previous methods on standard datasets and also present some promising future directions on the topic.
15
Geng X, Qian X, Huo Z, Zhang Y. Head Pose Estimation Based on Multivariate Label Distribution. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2022; 44:1974-1991. [PMID: 33031033 DOI: 10.1109/tpami.2020.3029585] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Accurate ground-truth pose is essential to the training of most existing head pose estimation methods. However, in many cases, the "ground truth" pose is obtained in rather subjective ways, such as asking subjects to stare at different markers on a wall. Thus, it is preferable to use soft labels rather than explicit hard labels to indicate the pose of a face image. This paper proposes to associate a multivariate label distribution (MLD) with each image. An MLD covers a neighborhood around the original pose. Labeling images with MLDs not only alleviates the problem of inaccurate pose labels but also boosts the number of training examples associated with each pose without actually increasing the total number of training examples. Four algorithms are proposed to learn from MLDs. Furthermore, a hierarchical extension of MLD, named hierarchical multivariate label distribution (HMLD), is proposed to handle fine-grained head pose estimation. Experimental results show that the MLD-based methods perform significantly better than the compared state-of-the-art head pose estimation algorithms. Moreover, the MLD-based methods are much more robust against label noise in the training set than the compared baseline methods.
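The soft-label idea can be illustrated with a short sketch (our own assumption of a plausible construction, not the authors' exact formulation): an MLD is a discretized 2D Gaussian over (yaw, pitch) bins centered on the nominal pose, so neighboring poses receive nonzero label mass.

```python
import numpy as np

def multivariate_label_distribution(yaw, pitch, bins, sigma=15.0):
    """Discretized 2D Gaussian over (yaw, pitch) pose bins, centered on
    the nominal label and normalized so the soft labels sum to 1."""
    yy, pp = np.meshgrid(bins, bins, indexing="ij")
    d2 = (yy - yaw) ** 2 + (pp - pitch) ** 2
    dist = np.exp(-d2 / (2.0 * sigma ** 2))
    return dist / dist.sum()

bins = np.arange(-90, 91, 15)              # pose bins in 15-degree steps
mld = multivariate_label_distribution(30.0, -15.0, bins)
# The mass peaks at the (30, -15) bin and decays over neighboring poses.
```

Training against such a distribution (e.g., with a KL-divergence loss) is what lets each image also supervise the bins around its nominal pose.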
16
Real-Time Gender Recognition for Juvenile and Adult Faces. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2022; 2022:1503188. [PMID: 35341170 PMCID: PMC8947889 DOI: 10.1155/2022/1503188] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/23/2021] [Revised: 12/09/2021] [Accepted: 01/18/2022] [Indexed: 11/18/2022]
Abstract
Facial gender recognition is a crucial research topic due to its wide range of use cases, including demographic gender surveys, visitor profile identification, targeted advertisement, access control, security, and CCTV surveillance. In these real-time applications, a face can be oriented at any angle to the camera axis, and the person can belong to any age group, including juveniles. A child’s face has immature craniofacial feature points in texture and edges compared to an adult face, making gender recognition from a child’s face very hard. Real-world faces captured in unconstrained environments further complicate correct gender prediction due to orientation. These factors reduce the accuracy of the existing state-of-the-art models for real-time facial gender prediction. This paper presents a novel approach to facial gender recognition for juvenile, adult, and unconstrained, arbitrarily oriented faces. In the proposed model, the progressive calibration network (PCN) detects rotation-invariant faces. A Gabor filter is then applied to extract distinctive edge and texture features from the detected face. The Gabor filter is invariant to illumination but produces texture and edge features with redundant coefficients in large dimensions. These drawbacks of redundancy and high dimensionality are resolved by the proposed meanDWT feature-optimization method, which improves the system’s accuracy, model size, and computation time. The proposed feature-engineering model is evaluated with different classifiers, including Naïve Bayes, Logistic Regression, and SVM with linear and RBF kernels. Its results are compared with state-of-the-art techniques, and a detailed experimental analysis is presented to support the argument. We also review conventional and deep learning approaches to facial gender recognition, with their pros and cons, on the datasets available for the task.
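As a rough sketch of the Gabor feature stage (kernel parameters are our own illustrative choices, and the per-orientation mean below is only a crude stand-in for the paper's meanDWT reduction):

```python
import numpy as np

def gabor_kernel(ksize=11, sigma=3.0, theta=0.0, lam=6.0):
    """Real part of a Gabor kernel: a Gaussian-windowed cosine grating
    oriented at angle `theta` (radians) with wavelength `lam` (pixels)."""
    half = ksize // 2
    y, x = np.mgrid[-half:half + 1, -half:half + 1]
    xr = x * np.cos(theta) + y * np.sin(theta)
    return np.exp(-(x**2 + y**2) / (2 * sigma**2)) * np.cos(2 * np.pi * xr / lam)

def gabor_features(patch, orientations=4):
    """Mean absolute filter response per orientation, via FFT convolution."""
    feats = []
    for k in range(orientations):
        kern = gabor_kernel(theta=k * np.pi / orientations)
        resp = np.fft.ifft2(np.fft.fft2(patch) * np.fft.fft2(kern, patch.shape))
        feats.append(np.abs(resp).mean())
    return np.array(feats)

patch = np.zeros((32, 32))
patch[:, ::4] = 1.0            # vertical stripes: intensity varies along x
f = gabor_features(patch)      # strongest response at theta = 0
```

A bank like this, over several orientations and scales, yields the large redundant feature vector that the paper's dimensionality-reduction step then compresses.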
17
Detecting Groups and Estimating F-Formations for Social Human–Robot Interactions. MULTIMODAL TECHNOLOGIES AND INTERACTION 2022. [DOI: 10.3390/mti6030018] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
The ability of a robot to detect and join groups of people is of increasing importance in social contexts and for collaboration between teams of humans and robots. In this paper, we propose a framework, autonomous group interactions for robots (AGIR), that endows a robot with the ability to detect such groups while following the principles of F-formations. Using on-board sensors, this method accommodates a wide spectrum of robot systems, ranging from autonomous service robots to telepresence robots. The presented framework detects individuals, estimates their position and orientation, detects groups, determines their F-formations, and is able to suggest a position for the robot to enter the social group. For evaluation, two simulation scenes were developed based on standard real-world datasets. The first scene contains 20 virtual agents (VAs) interacting in 7 groups of varying sizes and 3 different formations. The second scene contains 36 VAs positioned in 13 groups of varying sizes and 6 different formations. A model of a Pepper robot is placed in both simulated scenes at randomly generated positions. The robot's ability to estimate orientation, detect groups, and estimate F-formations at various locations is used to validate the approach. The results show high accuracy within each simulated scenario and demonstrate that the framework can operate from an egocentric view with a robot in real time.
18
Sei M, Utsumi A, Yamazoe H, Lee JH. Personalized face-pose estimation network using incrementally updated face shape parameters. APPL INTELL 2022. [DOI: 10.1007/s10489-021-02888-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]
19
A Study on the Teaching Design of a Hybrid Civics Course Based on the Improved Attention Mechanism. APPLIED SCIENCES-BASEL 2022. [DOI: 10.3390/app12031243] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
As an important vehicle for moral education, the moral indicators of civics and political science textbooks are naturally among the most important criteria for revising those textbooks. However, the textbook text dataset suffers from excessive textual information, ambiguous features, and unbalanced sample distributions. To address these problems, this paper combines a novel data-augmentation method with word-vector-based classification. For the problem of unbalanced sample sizes, it proposes a network model based on the attention mechanism that combines the ideas of SMOTE and EDA: a self-built stop-word list and a synonym forest are used for synonym queries to oversample the minority categories, and sentence order and intra-sentence word order are randomly shuffled to build a balanced dataset. The experimental results show that the data-augmentation method effectively improves model performance, yielding a clear gain in the F1-measure. The model incorporating the attention mechanism generalizes better than the one without it and holds a significant advantage over the reference models in other settings. Compared with the original text classifier, the proposed scheme effectively improves the evaluation quality and reliability of teaching design for a civics course.
20
Robot System Assistant (RoSA): Towards Intuitive Multi-Modal and Multi-Device Human-Robot Interaction. SENSORS 2022; 22:s22030923. [PMID: 35161671 PMCID: PMC8838571 DOI: 10.3390/s22030923] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/23/2021] [Revised: 01/18/2022] [Accepted: 01/19/2022] [Indexed: 01/09/2023]
Abstract
This paper presents an implementation of RoSA, a Robot System Assistant, for safe and intuitive human-machine interaction. The interaction modalities were chosen based on a prior Wizard of Oz study, which revealed a strong preference for speech and pointing gestures. Building on these findings, we design and implement a new multi-modal system for contactless human-machine interaction based on speech, facial, and gesture recognition. We evaluate the proposed system in an extensive study with multiple subjects to examine user experience and interaction efficiency. The results show that our method achieves usability scores similar to the entirely human-remote-controlled robot interaction of our Wizard of Oz study. Furthermore, the framework’s implementation is based on the Robot Operating System (ROS), providing modularity and extensibility for our multi-device and multi-user method.
21
Yuan G, Wang Y, Yan H, Fu X. Self-calibrated driver gaze estimation via gaze pattern learning. Knowl Based Syst 2022. [DOI: 10.1016/j.knosys.2021.107630] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
22
Barra P, Distasi R, Pero C, Ricciardi S, Tucci M. Gradient boosting regression for faster Partitioned Iterated Function Systems‐based head pose estimation. IET BIOMETRICS 2021. [DOI: 10.1049/bme2.12061] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Affiliation(s)
- Paola Barra: Department of Computer Science, Sapienza University of Rome, Rome, Italy
- Riccardo Distasi: Department of Computer Science, University of Salerno, Salerno, Italy
- Chiara Pero: Department of Computer Science, University of Salerno, Salerno, Italy
- Stefano Ricciardi: Department of Biosciences and Territory, University of Molise, Pesche, Italy
- Maurizio Tucci: Department of Computer Science, University of Salerno, Salerno, Italy
23
Malek S, Rossi S. Head pose estimation using facial-landmarks classification for children rehabilitation games. Pattern Recognit Lett 2021. [DOI: 10.1016/j.patrec.2021.11.002] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
24
Berral-Soler R, Madrid-Cuevas FJ, Muñoz-Salinas R, Marín-Jiménez MJ. RealHePoNet: a robust single-stage ConvNet for head pose estimation in the wild. Neural Comput Appl 2021. [DOI: 10.1007/s00521-020-05511-4] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
25
Pardoe HR, Martin SP, Zhao Y, George A, Yuan H, Zhou J, Liu W, Devinsky O. Estimation of in-scanner head pose changes during structural MRI using a convolutional neural network trained on eye tracker video. Magn Reson Imaging 2021; 81:101-108. [PMID: 34147591 DOI: 10.1016/j.mri.2021.06.010] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2020] [Revised: 05/06/2021] [Accepted: 06/15/2021] [Indexed: 10/21/2022]
Abstract
INTRODUCTION In-scanner head motion is a common cause of reduced image quality in neuroimaging, and causes systematic brain-wide changes in cortical thickness and volumetric estimates derived from structural MRI scans. There are few widely available methods for measuring head motion during structural MRI. Here, we train a deep learning predictive model to estimate changes in head pose using video obtained from an in-scanner eye tracker during an EPI-BOLD acquisition in which participants undertook deliberate in-scanner head movements. The predictive model was used to estimate head pose changes during structural MRI scans, and these estimates were correlated with cortical thickness and subcortical volume estimates. METHODS 21 healthy controls (age 32 ± 13 years, 11 female) were studied. Participants carried out a series of stereotyped, prompted in-scanner head motions during acquisition of an EPI-BOLD sequence with simultaneous recording of eye tracker video. Motion-affected and motion-free whole-brain T1-weighted MRI were also obtained. Image coregistration was used to estimate changes in head pose over the duration of the EPI-BOLD scan, and these estimates were used to train a predictive model to estimate head pose changes from the video data. Model performance was quantified using the coefficient of determination (R2). We evaluated the utility of our technique by assessing the relationship between video-based head pose changes during structural MRI and (i) vertex-wise cortical thickness and (ii) subcortical volume estimates. RESULTS Video-based head pose estimates were significantly correlated with ground-truth head pose changes estimated from EPI-BOLD imaging in a hold-out dataset. We observed a general brain-wide reduction in cortical thickness with increased head motion, with some isolated regions showing increased cortical thickness estimates with increased motion. Subcortical volumes were generally reduced in motion-affected scans. CONCLUSIONS We trained a predictive model to estimate changes in head pose during structural MRI scans using in-scanner eye tracker video. The method is independent of individual image acquisition parameters and does not require markers to be fixed to the patient, suggesting it may be well suited to clinical imaging and research environments. Head pose changes estimated using our approach can be used as covariates in morphometric image analyses to improve the neurobiological validity of structural imaging studies of brain development and disease.
Affiliation(s)
- Heath R Pardoe: Comprehensive Epilepsy Center, Department of Neurology, NYU Grossman School of Medicine, New York, USA
- Samantha P Martin: Comprehensive Epilepsy Center, Department of Neurology, NYU Grossman School of Medicine, New York, USA
- Allan George: Comprehensive Epilepsy Center, Department of Neurology, NYU Grossman School of Medicine, New York, USA
- Hui Yuan: Fordham University, New York, USA
- Wei Liu: Fordham University, New York, USA
- Orrin Devinsky: Comprehensive Epilepsy Center, Department of Neurology, NYU Grossman School of Medicine, New York, USA
26
Gullapalli AR, Anderson NE, Yerramsetty R, Harenski CL, Kiehl KA. Quantifying the psychopathic stare: Automated assessment of head motion is related to antisocial traits in forensic interviews. JOURNAL OF RESEARCH IN PERSONALITY 2021. [DOI: 10.1016/j.jrp.2021.104093] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
27
Pataky TC, Yagi M, Ichihashi N, Cox PG. Landmark-free, parametric hypothesis tests regarding two-dimensional contour shapes using coherent point drift registration and statistical parametric mapping. PeerJ Comput Sci 2021; 7:e542. [PMID: 34084938 PMCID: PMC8157043 DOI: 10.7717/peerj-cs.542] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2020] [Accepted: 04/22/2021] [Indexed: 06/12/2023]
Abstract
This paper proposes a computational framework for automated, landmark-free hypothesis testing of 2D contour shapes (i.e., shape outlines), and implements one realization of that framework. The proposed framework consists of point set registration, point correspondence determination, and parametric full-shape hypothesis testing. The results are calculated quickly (<2 s), yield morphologically rich detail in an easy-to-understand visualization, and are complemented by parametrically (or nonparametrically) calculated probability values. These probability values represent the likelihood that, in the absence of a true shape effect, smooth, random Gaussian shape changes would yield an effect as large as the observed one. The proposed framework nevertheless possesses a number of limitations, including sensitivity to algorithm parameters. As a number of algorithms and algorithm parameters could be substituted at each stage in the proposed data processing chain, sensitivity analysis would be necessary for robust statistical conclusions. In this paper, the proposed technique is applied to nine public datasets using a two-sample design, and an ANCOVA design is then applied to a synthetic dataset to demonstrate how the proposed method generalizes to the family of classical hypothesis tests. Extension to the analysis of 3D shapes is discussed.
Affiliation(s)
- Todd C. Pataky: Department of Human Health Sciences, Kyoto University, Kyoto, Japan
- Masahide Yagi: Department of Human Health Sciences, Kyoto University, Kyoto, Japan
- Philip G. Cox: Department of Archaeology, University of York, York, United Kingdom; Hull York Medical School, University of York, York, United Kingdom
28
Liu T, Wang J, Yang B, Wang X. NGDNet: Nonuniform Gaussian-label distribution learning for infrared head pose estimation and on-task behavior understanding in the classroom. Neurocomputing 2021. [DOI: 10.1016/j.neucom.2020.12.090] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
29
Liu H, Nie H, Zhang Z, Li YF. Anisotropic angle distribution learning for head pose estimation and attention understanding in human-computer interaction. Neurocomputing 2021. [DOI: 10.1016/j.neucom.2020.09.068] [Citation(s) in RCA: 52] [Impact Index Per Article: 17.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
30
How frontal is a face? Quantitative estimation of face pose based on CNN and geometric projection. Neural Comput Appl 2021. [DOI: 10.1007/s00521-020-05167-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
31
Liu L, Ke Z, Huo J, Chen J. Head Pose Estimation through Keypoints Matching between Reconstructed 3D Face Model and 2D Image. SENSORS (BASEL, SWITZERLAND) 2021; 21:1841. [PMID: 33800750 PMCID: PMC7961623 DOI: 10.3390/s21051841] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/12/2021] [Revised: 02/26/2021] [Accepted: 03/02/2021] [Indexed: 11/27/2022]
Abstract
Mainstream methods treat head pose estimation as a supervised classification/regression problem, whose performance heavily depends on the accuracy of the ground-truth labels of the training data. However, it is rather difficult to obtain accurate head pose labels in practice, due to the lack of effective equipment and reasonable approaches for head pose labeling. In this paper, we propose a method that does not need to be trained with head pose labels, but instead matches keypoints between a reconstructed 3D face model and the 2D input image, for head pose estimation. The proposed head pose estimation method consists of two components: 3D face reconstruction and 3D-2D keypoint matching. At the 3D face reconstruction phase, a personalized 3D face model is reconstructed from the input head image using convolutional neural networks, which are jointly optimized by an asymmetric Euclidean loss and a keypoint loss. At the 3D-2D keypoint matching phase, an iterative optimization algorithm is proposed to efficiently match the keypoints between the reconstructed 3D face model and the 2D input image under the constraint of perspective transformation. The proposed method is extensively evaluated on five widely used head pose estimation datasets, including Pointing'04, BIWI, AFLW2000, Multi-PIE, and Pandora. The experimental results demonstrate that the proposed method achieves excellent cross-dataset performance and surpasses most of the existing state-of-the-art approaches, with average MAEs of 4.78° on Pointing'04, 6.83° on BIWI, 7.05° on AFLW2000, 5.47° on Multi-PIE, and 5.06° on Pandora, although the model of the proposed method is not trained on any of these five datasets.
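A minimal sketch of the perspective constraint underlying such 3D-2D matching (our own simplified pinhole model; the paper's loss and iterative optimizer are not reproduced here): project the model keypoints under a candidate pose and score the reprojection error against the detected 2D keypoints.

```python
import numpy as np

def euler_to_rotation(yaw, pitch, roll):
    """Rotation matrix from Euler angles (radians), Z-Y-X convention."""
    cy, sy = np.cos(yaw), np.sin(yaw)
    cp, sp = np.cos(pitch), np.sin(pitch)
    cr, sr = np.cos(roll), np.sin(roll)
    Rz = np.array([[cy, -sy, 0], [sy, cy, 0], [0, 0, 1]])
    Ry = np.array([[cp, 0, sp], [0, 1, 0], [-sp, 0, cp]])
    Rx = np.array([[1, 0, 0], [0, cr, -sr], [0, sr, cr]])
    return Rz @ Ry @ Rx

def project(points3d, rotation, translation, focal=500.0):
    """Pinhole perspective projection of Nx3 model keypoints to Nx2 pixels."""
    cam = points3d @ rotation.T + translation
    return focal * cam[:, :2] / cam[:, 2:3]

def reprojection_error(points3d, points2d, rotation, translation):
    """Mean Euclidean distance between projected and detected keypoints."""
    proj = project(points3d, rotation, translation)
    return np.mean(np.linalg.norm(proj - points2d, axis=1))

# With the true pose, the reprojection error vanishes; an iterative pose
# search would minimize this quantity over (yaw, pitch, roll, translation).
model = np.array([[0.0, 0.0, 0.0], [30.0, 0.0, 0.0], [0.0, 30.0, 0.0]])
true_R = euler_to_rotation(0.2, 0.1, 0.0)
t = np.array([0.0, 0.0, 500.0])
observed = project(model, true_R, t)
```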
Affiliation(s)
- Leyuan Liu: National Engineering Research Center for E-Learning, Central China Normal University, Wuhan 430079, China; National Engineering Laboratory for Educational Big Data, Central China Normal University, Wuhan 430079, China
- Zeran Ke: National Engineering Research Center for E-Learning, Central China Normal University, Wuhan 430079, China
- Jiao Huo: National Engineering Research Center for E-Learning, Central China Normal University, Wuhan 430079, China
- Jingying Chen: National Engineering Research Center for E-Learning, Central China Normal University, Wuhan 430079, China; National Engineering Laboratory for Educational Big Data, Central China Normal University, Wuhan 430079, China
32
Bisogni C, Nappi M, Pero C, Ricciardi S. FASHE: A FrActal Based Strategy for Head Pose Estimation. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2021; 30:3192-3203. [PMID: 33617454 DOI: 10.1109/tip.2021.3059409] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Head pose estimation (HPE) is a topic central to many relevant research fields and characterized by a wide application range. In particular, HPE performed on a single RGB frame is particularly suitable for best-frame-selection problems. This explains a growing interest witnessed by a large number of contributions, most of which exploit deep learning architectures and require extensive training sessions to achieve accuracy and robustness in estimating head rotations on three axes. However, methods alternative to machine learning approaches can achieve similar if not better performance. In this regard, we present FASHE, an approach based on partitioned iterated function systems (PIFS) that represents auto-similarities within a face image through a contractive affine function, transforming the domain blocks (extracted only once from a single frontal reference image) into a good approximation of the range blocks into which the target image has been partitioned. Pose estimation is achieved by finding the closest match between the fractal code of the target image and a reference array by means of Hamming distance. The results of our experiments exceed the state of the art on both the BIWI and Pointing'04 datasets and approach those of the best-performing methods on the challenging AFLW2000 database. In addition, the application to the GOTCHA video dataset demonstrates that FASHE successfully operates in the wild.
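The final matching step can be sketched as a nearest-neighbor search under Hamming distance (the binary codes below are toy values of ours, not actual PIFS fractal codes):

```python
import numpy as np

def closest_pose_by_hamming(target_code, reference_codes):
    """Return (index, distance) of the reference code with minimum
    Hamming distance to the target's binary code."""
    dists = [int(np.sum(target_code != ref)) for ref in reference_codes]
    best = int(np.argmin(dists))
    return best, dists[best]

# Each row stands in for the fractal code of one reference pose.
refs = np.array([[0, 1, 1, 0],
                 [1, 1, 0, 0],
                 [0, 1, 0, 0]])
idx, d = closest_pose_by_hamming(np.array([0, 1, 0, 1]), refs)
# The nearest code is refs[2], at Hamming distance 1.
```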
33
Abbas A, Yadav V, Smith E, Ramjas E, Rutter SB, Benavidez C, Koesmahargyo V, Zhang L, Guan L, Rosenfield P, Perez-Rodriguez M, Galatzer-Levy IR. Computer Vision-Based Assessment of Motor Functioning in Schizophrenia: Use of Smartphones for Remote Measurement of Schizophrenia Symptomatology. Digit Biomark 2021; 5:29-36. [PMID: 33615120 DOI: 10.1159/000512383] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2020] [Accepted: 10/14/2020] [Indexed: 11/19/2022] Open
Abstract
Introduction Motor abnormalities have been shown to be a distinct component of schizophrenia symptomatology. However, objective and scalable methods for assessing motor functioning in schizophrenia are lacking. Advancements in machine learning-based digital tools have enabled automated and remote "digital phenotyping" of disease symptomatology. Here, we assess the performance of a computer vision-based assessment of motor functioning as a characteristic of schizophrenia using video data collected remotely through smartphones. Methods Eighteen patients with schizophrenia and 9 healthy controls were asked to participate remotely in smartphone-based assessments daily for 14 days. Video recorded by the smartphone's front-facing camera during these assessments was used to quantify the Euclidean distance of head movement between frames through a pretrained computer vision model. We assessed the ability of head movement measurements to distinguish patients from healthy controls, as well as their relationship to schizophrenia symptom severity as measured by traditional clinical scores. Results The rate of head movement differed significantly between participants with schizophrenia (1.48 mm/frame) and those without (2.50 mm/frame; p = 0.01), and logistic regression demonstrated that head movement was a significant predictor of schizophrenia diagnosis (p = 0.02). Linear regression between head movement and clinical scores showed that head movement has a negative relationship with schizophrenia symptom severity (p = 0.04), primarily with negative symptoms of schizophrenia. Conclusions Remote, smartphone-based assessments captured meaningful visual behavior for computer vision-based objective measurement of head movement. The acquired head movement measurements were able to accurately classify schizophrenia diagnosis and quantify symptom severity in patients with schizophrenia.
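The head-movement metric reported here (mean Euclidean displacement between consecutive frames) can be sketched directly; the coordinates below are illustrative, and the pretrained landmark model that would produce them is not shown:

```python
import numpy as np

def mean_head_displacement(head_positions):
    """Mean Euclidean distance moved between consecutive frames, given
    an Nx2 (or Nx3) array of per-frame head coordinates in mm."""
    steps = np.diff(head_positions, axis=0)     # frame-to-frame deltas
    return float(np.linalg.norm(steps, axis=1).mean())

track = np.array([[0.0, 0.0],
                  [3.0, 4.0],
                  [3.0, 4.0]])
mean_head_displacement(track)   # (5.0 + 0.0) / 2 = 2.5 mm/frame
```

A per-participant value like this is what the study then fed into logistic and linear regressions against diagnosis and symptom scores.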
Affiliation(s)
- Emma Smith: Icahn School of Medicine at Mount Sinai, New York, New York, USA
- Elizabeth Ramjas: Icahn School of Medicine at Mount Sinai, New York, New York, USA
- Sarah B Rutter: Icahn School of Medicine at Mount Sinai, New York, New York, USA
- Li Zhang: AiCure, LLC, New York, New York, USA
- Lei Guan: AiCure, LLC, New York, New York, USA
- Paul Rosenfield: Icahn School of Medicine at Mount Sinai, New York, New York, USA
- Isaac R Galatzer-Levy: AiCure, LLC, New York, New York, USA; Psychiatry, New York University School of Medicine, New York, New York, USA
34

35
Driver Distraction Detection Method Based on Continuous Head Pose Estimation. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2020. [DOI: 10.1155/2020/9606908] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Because the detection of driver distraction is a pressing issue, this study chooses the driver's head pose as the evaluation parameter for driving distraction and proposes a head-pose-based driver distraction detection method. The effects of single regression and of classification combined with regression are compared in terms of accuracy, and four classical networks are improved and trained using the 300W-LP and AFLW datasets. HPE_Resnet50, the most accurate, is selected as the head pose estimator and applied to the ten-category distracted driving dataset SF3D to obtain 20,000 sets of head pose data. The differences between classes are discussed qualitatively and quantitatively. Analysis of variance shows a statistically significant difference in head pose between safe driving and all kinds of distracted driving at the 95% and 90% confidence levels, and the poses of the various driving actions are distributed within specific Euler angle ranges, which provides a characteristic basis for the design of subsequent recognition methods. In addition, exploiting the continuity of human movement, this paper also analyzes 90 drivers' videos frame by frame for differences in head pose between safe and distracted driving. By calculating spatial distances and sample statistics, the results provide the reference point, spatial range, and threshold of safe driving under this driving condition. Experimental results show that the average error of HPE_Resnet50 on AFLW2000 is 6.17° and that there is an average difference of 12.4° to 54.9° in Euler angles between safe driving and the nine kinds of distracted driving in SF3D.
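The pose-space distances underlying such comparisons can be sketched as Euclidean distances between (yaw, pitch, roll) triples; the poses below are illustrative values of ours, not SF3D statistics:

```python
import numpy as np

def euler_distance(pose_a, pose_b):
    """Euclidean distance between two (yaw, pitch, roll) poses in degrees;
    a simple stand-in for the paper's spatial-distance comparison."""
    return float(np.linalg.norm(np.asarray(pose_a) - np.asarray(pose_b)))

safe = (0.0, -5.0, 0.0)          # illustrative safe-driving reference pose
texting = (-35.0, -40.0, 10.0)   # illustrative distracted pose
euler_distance(safe, texting)    # about 50.5 degrees from the reference
```

Thresholding such a distance against a safe-driving reference region is one way a downstream detector could flag distracted frames.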
36
Agarwala R, Leube A, Wahl S. Utilizing minicomputer technology for low-cost photorefraction: a feasibility study. BIOMEDICAL OPTICS EXPRESS 2020; 11:6108-6121. [PMID: 33282478 PMCID: PMC7687974 DOI: 10.1364/boe.400720] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/22/2020] [Revised: 09/03/2020] [Accepted: 09/13/2020] [Indexed: 06/12/2023]
Abstract
Eccentric photorefraction is an objective technique for determining the refractive errors of the eye. To address the rising prevalence of visual impairment, especially in rural areas, a minicomputer-based low-cost infrared photorefractor was developed from off-the-shelf hardware components. Clinical validation revealed that the developed infrared photorefractor has a linear working range between +4.0 D and -6.0 D at 50 cm. Further, measurements of astigmatism in the human eye showed an absolute cylinder error of 0.3 D and a high correlation for axis assessment. In conclusion, feasibility was shown for a low-cost, portable, low-power, stand-alone device that objectively determines refractive errors, showing potential for screening applications. The developed photorefractor creates a new avenue for telemedicine in ophthalmic measurements.
Affiliation(s)
- Rajat Agarwala
- Institute for Ophthalmic Research, Eberhard Karls University Tuebingen, Elfriede-Aulhorn-Str. 7, Tuebingen, 72076, Germany
- Alexander Leube
- Institute for Ophthalmic Research, Eberhard Karls University Tuebingen, Elfriede-Aulhorn-Str. 7, Tuebingen, 72076, Germany
- Carl Zeiss Vision International GmbH, Turnstr. 27, Aalen, 73430, Germany
- Siegfried Wahl
- Institute for Ophthalmic Research, Eberhard Karls University Tuebingen, Elfriede-Aulhorn-Str. 7, Tuebingen, 72076, Germany
- Carl Zeiss Vision International GmbH, Turnstr. 27, Aalen, 73430, Germany
|
37
|
Orlandi S, Hotze F, Lim D, Estrada SG, Muir D, Friesen HA, Chau T. Customized Access Technology for Children using Head Movement Recognition. ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY 2020; 2020:1783-1786. [PMID: 33018344 DOI: 10.1109/embc44109.2020.9175747] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Children with cerebral palsy and complex communication needs face limitations in their access technology (AT) usage. Speech recognition software and conventional ATs (e.g., mechanical switches) can be insufficient for those with speech impairment and limited control of voluntary motion. Automatic recognition of head movements represents a promising pathway. Previous studies have shown the robustness of head pose estimation algorithms on adult participants, but further research is needed to use these methods with children. An algorithm for head movement recognition was implemented and evaluated on videos recorded in a naturalistic environment when children were playing a videogame. A face-tracking algorithm was used to detect the main facial landmarks. Head poses were then estimated using the Pose from Orthography and Scaling with Iterations (POSIT) algorithm and three head movements were classified through Hidden Markov Models (HMMs). Preliminary classification results obtained from the analysis of videos of five typically developing children showed an accuracy of up to 95.6% in predicting head movements.
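The final stage of this pipeline, HMM-based movement classification, can be sketched with the discrete-HMM forward algorithm: one model per head movement, with the observation sequence assigned to the model of highest likelihood. The model parameters below are toy placeholders, not the study's trained HMMs:

```python
def forward_likelihood(obs, start, trans, emit):
    """Forward algorithm: P(obs | model) for a discrete-observation HMM.

    start[s]   : initial probability of state s
    trans[p][s]: transition probability from state p to state s
    emit[s][o] : probability that state s emits symbol o
    """
    alpha = [start[s] * emit[s][obs[0]] for s in range(len(start))]
    for o in obs[1:]:
        alpha = [
            sum(alpha[p] * trans[p][s] for p in range(len(start))) * emit[s][o]
            for s in range(len(start))
        ]
    return sum(alpha)

def classify(obs, models):
    """Assign obs to the movement class whose HMM scores it highest."""
    return max(models, key=lambda name: forward_likelihood(obs, *models[name]))
```

In the cited setup, each candidate head movement (e.g., nod, shake) would have its own HMM trained on pose sequences from the POSIT stage.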
|
38
|
Tan C, Ceballos G, Kasabov N, Puthanmadam Subramaniyam N. FusionSense: Emotion Classification Using Feature Fusion of Multimodal Data and Deep Learning in a Brain-Inspired Spiking Neural Network. SENSORS (BASEL, SWITZERLAND) 2020; 20:E5328. [PMID: 32957655 PMCID: PMC7571195 DOI: 10.3390/s20185328] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/18/2020] [Revised: 09/04/2020] [Accepted: 09/11/2020] [Indexed: 01/22/2023]
Abstract
Using multimodal signals to solve the problem of emotion recognition is one of the emerging trends in affective computing. Several studies have utilized state-of-the-art deep learning methods and combined physiological signals, such as the electrocardiogram (ECG), electroencephalogram (EEG), and skin temperature, along with facial expressions, voice, and posture, to classify emotions. Spiking neural networks (SNNs) represent the third generation of neural networks and employ biologically plausible models of neurons. SNNs have been shown to handle spatio-temporal data, which is essentially the nature of the data encountered in the emotion recognition problem, in an efficient manner. In this work, for the first time, we propose the application of SNNs to solve the emotion recognition problem with a multimodal dataset. Specifically, we use the NeuCube framework, which employs an evolving SNN architecture to classify emotional valence, and evaluate the performance of our approach on the MAHNOB-HCI dataset. The multimodal data used in our work consist of facial expressions along with physiological signals such as ECG, skin temperature, skin conductance, respiration signal, mouth length, and pupil size. We perform classification under the Leave-One-Subject-Out (LOSO) cross-validation mode. Our results show that the proposed approach achieves an accuracy of 73.15% for classifying binary valence when applying feature-level fusion, which is comparable to other deep learning methods. We achieve this accuracy even without using EEG, which other deep learning methods have relied on to achieve this level of accuracy. In conclusion, we have demonstrated that the SNN can be successfully used for solving the emotion recognition problem with multimodal data, and we provide directions for future research utilizing SNNs for affective computing.
In addition to its good accuracy, the SNN recognition system is incrementally trainable on new data in an adaptive way and requires only one pass of training, which makes it suitable for practical and online applications. These features are not manifested in other methods for this problem.
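Feature-level fusion, as used here, amounts to concatenating the per-modality feature vectors into one input vector before classification. A minimal sketch (the modality names and values are illustrative, not taken from MAHNOB-HCI):

```python
def feature_level_fusion(modalities: dict) -> list:
    """Concatenate per-modality feature vectors into a single fused vector.

    Iterating in sorted key order keeps the fused layout and dimensionality
    stable across samples, which the downstream classifier requires.
    """
    fused = []
    for name in sorted(modalities):
        fused.extend(modalities[name])
    return fused
```

The classifier (in the cited work, the evolving SNN) is then trained on these fused vectors rather than on each modality separately.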
Affiliation(s)
- Clarence Tan
- Knowledge Engineering and Discovery Research Institute, Auckland University of Technology, Auckland 1010, New Zealand
- Gerardo Ceballos
- School of Electrical Engineering, University of Los Andes, Merida 5101, Venezuela
- Nikola Kasabov
- Knowledge Engineering and Discovery Research Institute, Auckland University of Technology, Auckland 1010, New Zealand
- Narayan Puthanmadam Subramaniyam
- Faculty of Medicine and Health Technology and BioMediTech Institute, Tampere University, 33520 Tampere, Finland
- Department of Neuroscience and Biomedical Engineering, School of Science, Aalto University, 02150 Espoo, Finland
|
39
|
Learning from discrete Gaussian label distribution and spatial channel-aware residual attention for head pose estimation. Neurocomputing 2020. [DOI: 10.1016/j.neucom.2020.05.010] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
|
40
|
When I Look into Your Eyes: A Survey on Computer Vision Contributions for Human Gaze Estimation and Tracking. SENSORS 2020; 20:s20133739. [PMID: 32635375 PMCID: PMC7374327 DOI: 10.3390/s20133739] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/19/2020] [Revised: 06/18/2020] [Accepted: 06/30/2020] [Indexed: 11/16/2022]
Abstract
The automatic detection of eye positions, their temporal consistency, and their mapping into a line of sight in the real world (to find where a person is looking) is reported in the scientific literature as gaze tracking. This has become a very hot topic in the field of computer vision during the last decades, with a surprising and continuously growing number of application fields. A very long journey has been made from the first pioneering works, and this continuous search for more accurate solutions has been further boosted in the last decade, when deep neural networks revolutionized the whole machine learning area, gaze tracking included. In this arena, it is increasingly useful to find guidance through survey/review articles collecting the most relevant works, laying out the clear pros and cons of existing techniques, and introducing a precise taxonomy. Such manuscripts allow researchers and technicians to choose the best way to move towards their application or scientific goals. In the literature, there exist holistic and technology-specific survey documents (even if not up to date), but, unfortunately, there is no overview discussing how the great advancements in computer vision have impacted gaze tracking. Thus, this work represents an attempt to fill this gap, also introducing a wider point of view that leads to a new taxonomy (extending the consolidated ones) by considering gaze tracking as a more exhaustive task that aims at estimating the gaze target from different perspectives: from the eye of the beholder (first-person view), from an external camera framing the beholder, from a third-person view looking at the scene in which the beholder is placed, and from an external view independent of the beholder.
|
41
|
Stirling L, Kelty-Stephen D, Fineman R, Jones MLH, Daniel Park BK, Reed MP, Parham J, Choi HJ. Static, Dynamic, and Cognitive Fit of Exosystems for the Human Operator. HUMAN FACTORS 2020; 62:424-440. [PMID: 32004106 DOI: 10.1177/0018720819896898] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
OBJECTIVE To define static, dynamic, and cognitive fit and their interactions as they pertain to exosystems and to document open research needs in using these fit characteristics to inform exosystem design. BACKGROUND Initial exosystem sizing and fit evaluations are currently based on scalar anthropometric dimensions and subjective assessments. As fit depends on ongoing interactions related to task setting and user, attempts to tailor equipment have limitations when optimizing for this limited fit definition. METHOD A targeted literature review was conducted to inform a conceptual framework defining three characteristics of exosystem fit: static, dynamic, and cognitive. Details are provided on the importance of differentiating fit characteristics for developing exosystems. RESULTS Static fit considers alignment between human and equipment and requires understanding anthropometric characteristics of target users and geometric equipment features. Dynamic fit assesses how the human and equipment move and interact with each other, with a focus on the relative alignment between the two systems. Cognitive fit considers the stages of human-information processing, including somatosensation, executive function, and motor selection. Human cognitive capabilities should remain available to process task- and stimulus-related information in the presence of an exosystem. Dynamic and cognitive fit are operationalized in a task-specific manner, while static fit can be considered for predefined postures. CONCLUSION A deeper understanding of how an exosystem fits an individual is needed to ensure good human-system performance. Development of methods for evaluating different fit characteristics is necessary. APPLICATION Methods are presented to inform exosystem evaluation across physical and cognitive characteristics.
Affiliation(s)
- Richard Fineman
- Harvard-MIT Health Science and Technology Program, Cambridge, MA, USA
- Monica L H Jones
- University of Michigan Transportation Research Institute, Ann Arbor, USA
- Matthew P Reed
- University of Michigan Transportation Research Institute, Ann Arbor, USA
- Joseph Parham
- U.S. Army Combat Capabilities Development Command Soldier Center, Natick, MA, USA
- Hyeg Joo Choi
- U.S. Army Combat Capabilities Development Command Soldier Center, Natick, MA, USA
|
42
|
Wang S, Li J, Yang P, Gao T, Bowers AR, Luo G. Towards Wide Range Tracking of Head Scanning Movement in Driving. INT J PATTERN RECOGN 2020; 34. [PMID: 34267412 DOI: 10.1142/s0218001420500330] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Gaining environmental awareness through lateral head scanning (yaw rotations) is important for driving safety, especially when approaching intersections. Therefore, head scanning movements could be an important behavioral metric for driving safety research and driving risk mitigation systems. Tracking head scanning movements with a single in-car camera is preferred hardware-wise, but it is very challenging to track the head over almost a 180° range. In this paper we investigate two state-of-the-art methods, a multi-loss deep residual learning method with 50 layers (multi-loss ResNet-50) and an ORB feature-based simultaneous localization and mapping method (ORB-SLAM). While deep learning methods have been extensively studied for head pose detection, this is the first study in which SLAM has been employed to innovatively track head scanning over a very wide range. Our laboratory experimental results showed that ORB-SLAM was more accurate than multi-loss ResNet-50, which often failed when many facial features were not in view. In contrast, ORB-SLAM was able to continue tracking, as it does not rely on particular facial features. Testing with real driving videos demonstrated the feasibility of using ORB-SLAM for tracking large lateral head scans in naturalistic video data.
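A hedged sketch of the two building blocks implied here: extracting lateral yaw from an estimated head rotation matrix (assuming a Z-Y-X Euler convention, which may differ from the trackers' internal conventions) and counting scan excursions beyond a yaw threshold. The 20° threshold is an illustrative assumption:

```python
import math

def yaw_from_rotation(R):
    """Extract yaw (lateral head rotation, degrees) from a 3x3 rotation
    matrix, assuming a Z-Y-X Euler decomposition."""
    return math.degrees(math.atan2(R[1][0], R[0][0]))

def count_scans(yaws, threshold=20.0):
    """Count lateral head scans: excursions of |yaw| beyond the threshold.

    A new scan is counted when yaw first exceeds the threshold; the counter
    re-arms once yaw returns inside it.
    """
    scans, in_scan = 0, False
    for y in yaws:
        if abs(y) >= threshold and not in_scan:
            scans += 1
            in_scan = True
        elif abs(y) < threshold:
            in_scan = False
    return scans
```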
Affiliation(s)
- Shuhang Wang
- Schepens Eye Research Institute of Massachusetts Eye and Ear, Boston, MA, USA; Department of Ophthalmology, Harvard Medical School, Boston, MA, USA
- Jianfeng Li
- Department of Electrical and Computer Engineering, University of Toronto, Toronto, ON M5S 1A1, Canada
- Pengshuai Yang
- Department of Automation, Tsinghua University, Beijing, 100084, China
- Tianxiao Gao
- Institute of Digital Media, Peking University, Beijing, 100871, China
- Alex R Bowers
- Schepens Eye Research Institute of Massachusetts Eye and Ear, Boston, MA, USA; Department of Ophthalmology, Harvard Medical School, Boston, MA, USA
- Gang Luo
- Schepens Eye Research Institute of Massachusetts Eye and Ear, Boston, MA, USA; Department of Ophthalmology, Harvard Medical School, Boston, MA, USA
|
43
|
Borghi G, Fabbri M, Vezzani R, Calderara S, Cucchiara R. Face-from-Depth for Head Pose Estimation on Depth Images. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2020; 42:596-609. [PMID: 30530311 DOI: 10.1109/tpami.2018.2885472] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
Depth cameras allow reliable solutions for people monitoring and behavior understanding to be set up, especially when unstable or poor illumination renders common RGB sensors unusable. Therefore, we propose a complete framework for the estimation of head and shoulder pose based on depth images only. A head detection and localization module is also included, in order to develop a complete end-to-end system. The core element of the framework is a Convolutional Neural Network, called POSEidon+, that receives three types of images as input and provides the 3D angles of the pose as output. Moreover, a Face-from-Depth component based on a Deterministic Conditional GAN model is able to hallucinate a face from the corresponding depth image. We empirically demonstrate that this positively impacts system performance. We test the proposed framework on two public datasets, namely Biwi Kinect Head Pose and ICT-3DHP, and on Pandora, a new challenging dataset mainly inspired by the automotive setup. Experimental results show that our method outperforms several recent state-of-the-art works based on both intensity and depth input data, running in real time at more than 30 frames per second.
|
44
|
Khan K, Attique M, Khan RU, Syed I, Chung TS. A Multi-Task Framework for Facial Attributes Classification through End-to-End Face Parsing and Deep Convolutional Neural Networks. SENSORS (BASEL, SWITZERLAND) 2020; 20:E328. [PMID: 31935996 PMCID: PMC7014093 DOI: 10.3390/s20020328] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/13/2019] [Revised: 12/29/2019] [Accepted: 12/30/2019] [Indexed: 11/17/2022]
Abstract
Human face image analysis is an active research area within computer vision. In this paper we propose a framework for face image analysis, addressing three challenging problems of race, age, and gender recognition through face parsing. We manually labeled face images for training an end-to-end face parsing model through Deep Convolutional Neural Networks. The deep learning-based segmentation model parses a face image into seven dense classes. We use a probabilistic classification method and create probability maps for each face class. The probability maps are used as feature descriptors. We trained another Convolutional Neural Network model by extracting features from the probability maps of the corresponding class for each demographic task (race, age, and gender). We perform extensive experiments on state-of-the-art datasets and obtain much better results than those previously reported.
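The descriptor step described here, turning a per-pixel class-probability map into a fixed-length feature vector, can be sketched as channel-wise averaging. This is a simplified stand-in for the paper's descriptors, not its exact construction:

```python
def probability_map_descriptor(prob_map):
    """Collapse a per-pixel class-probability map (H x W x C nested lists)
    into a C-dimensional descriptor by averaging each class channel.

    Averaging yields a fixed-length vector regardless of image size, so it
    can feed a conventional classifier.
    """
    h, w = len(prob_map), len(prob_map[0])
    c = len(prob_map[0][0])
    desc = [0.0] * c
    for row in prob_map:
        for pixel in row:
            for k, p in enumerate(pixel):
                desc[k] += p
    return [v / (h * w) for v in desc]
```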
Affiliation(s)
- Khalil Khan
- Department of Electrical Engineering, University of Azad Jammu and Kashmir, Muzaffarabad 13100, Pakistan
- Intelligent Analytics Group (IAG), College of Computer, Qassim University, Al-Mulida 51431, Saudi Arabia
- Rehan Ullah Khan
- Department of Information Technology, College of Computer, Qassim University, Al-Mulida 51431, Saudi Arabia
- Intelligent Analytics Group (IAG), College of Computer, Qassim University, Al-Mulida 51431, Saudi Arabia
- Ikram Syed
- Department of Computer Science, The Superior College, Lahore 54000, Pakistan
- Tae-Sun Chung
- Department of Computer Engineering, Ajou University, Ajou 16499, Korea
|
45
|
Singh J, Modi N. Use of information modelling techniques to understand research trends in eye gaze estimation methods: An automated review. Heliyon 2019; 5:e03033. [PMID: 31890964 PMCID: PMC6928306 DOI: 10.1016/j.heliyon.2019.e03033] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2019] [Revised: 10/22/2019] [Accepted: 12/10/2019] [Indexed: 10/31/2022] Open
Abstract
Eye gaze tracking has been used to study the influence of visual stimuli on consumer behavior and attentional processes. Eye gaze tracking techniques have made substantial contributions in advertisement design, human-computer interaction, virtual reality, and disease diagnosis. Eye gaze estimation is considered critical for the prediction of human attention, and hence indispensable for better understanding human activities. In this paper, Latent Semantic Analysis is used to develop an information model for identifying emerging research trends within eye gaze estimation techniques. An exhaustive collection of 423 titles and abstracts of research papers published during 2005-2018 was used. Five major research areas and ten research trends were classified based upon this study.
Affiliation(s)
- Jaiteg Singh
- Department of Computer Applications, Chitkara University Institute of Engineering and Technology, Chitkara University, Punjab, 140401, India
- Nandini Modi
- Department of Computer Science and Engineering, Chitkara University Institute of Engineering and Technology, Chitkara University, Punjab, 140401, India
|
46
|
|
47
|
Martinikorena I, Larumbe-Bergera A, Ariz M, Porta S, Cabeza R, Villanueva A. Low cost gaze estimation: knowledge-based solutions. IEEE TRANSACTIONS ON IMAGE PROCESSING : A PUBLICATION OF THE IEEE SIGNAL PROCESSING SOCIETY 2019; 29:2328-2343. [PMID: 31634835 DOI: 10.1109/tip.2019.2946452] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Eye tracking technology in low-resolution scenarios is not a completely solved issue to date. The possibility of using eye tracking in a mobile gadget is a challenging objective that would permit spreading this technology to unexplored fields. In this paper, a knowledge-based approach is presented to solve gaze estimation in low-resolution settings. Understanding the high-resolution paradigm makes it possible to propose alternative models for gaze estimation. In this manner, three models are presented: a geometrical model, an interpolation model, and a compound model, as solutions for gaze estimation in remote low-resolution systems. Since this work considers head position essential to improving gaze accuracy, a method for head pose estimation is also proposed. The methods are validated in an optimal framework, the I2Head database, which combines head and gaze data. The experimental validation of the models demonstrates their sensitivity to image processing inaccuracies, critical in the case of the geometrical model. Static and extreme-movement scenarios are analyzed, showing the higher robustness of the compound and geometrical models in the presence of user displacement. Accuracy values of about 3° have been obtained, increasing to values close to 5° in extreme displacement settings, results fully comparable with the state-of-the-art.
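Of the three model families, the interpolation model is the simplest to illustrate: a regression fitted on calibration points maps an eye feature to a screen coordinate. A one-dimensional least-squares sketch (real systems typically fit 2D polynomial mappings of pupil-glint vectors):

```python
def fit_linear(xs, ys):
    """Least-squares line fit y = a*x + b: the simplest instance of the
    interpolation-based gaze model family, mapping an eye feature (x) to a
    screen coordinate (y) from calibration samples."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    a = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sum((x - mx) ** 2 for x in xs)
    return a, my - a * mx
```

After calibration, gaze is predicted for a new eye-feature value as `a * x + b`; the geometrical and compound models replace this fitted mapping with an explicit eye/camera geometry.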
|
48
|
Derkach D, Ruiz A, Sukno FM. Tensor Decomposition and Non-linear Manifold Modeling for 3D Head Pose Estimation. Int J Comput Vis 2019. [DOI: 10.1007/s11263-019-01208-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
|
49
|
3D Approaches and Challenges in Facial Expression Recognition Algorithms—A Literature Review. APPLIED SCIENCES-BASEL 2019. [DOI: 10.3390/app9183904] [Citation(s) in RCA: 31] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/21/2023]
Abstract
In recent years, facial expression analysis and recognition (FER) have emerged as an active research topic with applications in several different areas, including the human-computer interaction domain. Solutions based on 2D models are not entirely satisfactory for real-world applications, as they present problems with pose variations and illumination related to the nature of the data. Thanks to technological development, 3D facial data, both still images and video sequences, have become increasingly used to improve the accuracy of FER systems. Despite the advance in 3D algorithms, these solutions still have some drawbacks that make purely three-dimensional techniques convenient only for a set of specific applications; a viable solution to overcome such limitations is adopting a multimodal 2D+3D analysis. In this paper, we analyze the limits and strengths of traditional and deep-learning FER techniques, intending to provide the research community with an overview of the results obtained and a look to the near future. Furthermore, we describe in detail the databases most used to address the problem of facial expressions and emotions, highlighting the results obtained by the various authors. The different techniques used are compared, and some conclusions are drawn concerning the best recognition rates achieved.
|
50
|
Khan K, Attique M, Syed I, Sarwar G, Irfan MA, Khan RU. A Unified Framework for Head Pose, Age and Gender Classification through End-to-End Face Segmentation. ENTROPY 2019; 21:e21070647. [PMID: 33267361 PMCID: PMC7515140 DOI: 10.3390/e21070647] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/02/2019] [Revised: 06/23/2019] [Accepted: 06/24/2019] [Indexed: 11/16/2022]
Abstract
Accurate face segmentation strongly benefits the human face image analysis problem. In this paper we propose a unified framework for face image analysis through end-to-end semantic face segmentation. The proposed framework contains a set of stacked components for face understanding, which includes head pose estimation, age classification, and gender recognition. A manually labeled face dataset is used for training the Conditional Random Fields (CRFs) based segmentation model. A multi-class face segmentation framework developed through CRFs segments a facial image into six parts. A probabilistic classification strategy is used, and probability maps are generated for each class. The probability maps are used as feature descriptors, and a Random Decision Forest (RDF) classifier is modeled for each task (head pose, age, and gender). We assess the performance of the proposed framework on several datasets and report better results than those previously published.
Affiliation(s)
- Khalil Khan
- Department of Electrical Engineering, University of Azad Jammu and Kashmir, Muzaffarabad 13100, Pakistan
- Correspondence: (K.K.); (M.A.)
- Muhammad Attique
- Department of Software Engineering, Sejong University, Seoul 05006, Korea
- Correspondence: (K.K.); (M.A.)
- Ikram Syed
- Department of Software Engineering, University of Azad Jammu and Kashmir, Muzaffarabad 13100, Pakistan
- Ghulam Sarwar
- Department of Software Engineering, University of Azad Jammu and Kashmir, Muzaffarabad 13100, Pakistan
- Muhammad Abeer Irfan
- Dipartimento di Elettronica e Telecomunicazioni (DET), Politecnico di Torino, 10156 Torino, Italy
- Rehan Ullah Khan
- IT Department, College of Computer, Qassim University, Al-Mulida 51431, Saudi Arabia
|