1
Ahn YA, Moffitt JM, Tao Y, Custode S, Parlade M, Beaumont A, Cardona S, Hale M, Durocher J, Alessandri M, Shyu ML, Perry LK, Messinger DS. Objective Measurement of Social Gaze and Smile Behaviors in Children with Suspected Autism Spectrum Disorder During Administration of the Autism Diagnostic Observation Schedule, 2nd Edition. J Autism Dev Disord 2024; 54:2124-2137. [PMID: 37103660] [DOI: 10.1007/s10803-023-05990-z]
Abstract
Best practice for the assessment of autism spectrum disorder (ASD) symptom severity relies on clinician ratings of the Autism Diagnostic Observation Schedule, 2nd Edition (ADOS-2), but the association of these ratings with objective measures of children's social gaze and smiling is unknown. Sixty-six preschool-age children (49 boys, M = 39.97 months, SD = 10.58) with suspected ASD (61 confirmed ASD) were administered the ADOS-2 and provided social affect calibrated severity scores (SA CSS). Children's social gaze and smiling during the ADOS-2, captured with a camera contained in eyeglasses worn by the examiner and parent, were obtained via a computer vision processing pipeline. Children who gazed more at their parents (p = .04) and whose gaze at their parents involved more smiling (p = .02) received lower social affect severity scores, indicating fewer social affect symptoms, adjusted R2 = .15, p = .003.
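The adjusted R² reported above penalizes the ordinary R² for the number of predictors relative to the sample size. A minimal sketch of that adjustment, not the authors' analysis pipeline; the raw R² input below is a hypothetical value chosen for illustration:

```python
# Sketch of the adjusted R^2 statistic reported above (adjusted R^2 = .15).
# Illustrative only; the raw R^2 value below is hypothetical.

def adjusted_r2(r2: float, n: int, p: int) -> float:
    """Adjusted R^2 for n observations and p predictors."""
    return 1.0 - (1.0 - r2) * (n - 1) / (n - p - 1)

# Example: 66 children, 2 predictors (gaze at parent, smiling while gazing).
value = adjusted_r2(0.176, 66, 2)
```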
Affiliation(s)
- Yeojin A Ahn
- Department of Psychology, University of Miami, Coral Gables, FL, USA
- Yudong Tao
- Department of Electrical and Computer Engineering, University of Miami, Coral Gables, FL, USA
- Stephanie Custode
- Department of Psychology, University of Miami, Coral Gables, FL, USA
- Meaghan Parlade
- Department of Psychology, University of Miami, Coral Gables, FL, USA
- Amy Beaumont
- Department of Psychology, University of Miami, Coral Gables, FL, USA
- Sandra Cardona
- Department of Psychology, University of Miami, Coral Gables, FL, USA
- Melissa Hale
- Department of Psychology, University of Miami, Coral Gables, FL, USA
- Jennifer Durocher
- Department of Psychology, University of Miami, Coral Gables, FL, USA
- Mei-Ling Shyu
- Department of Electrical and Computer Engineering, University of Miami, Coral Gables, FL, USA
- Lynn K Perry
- Department of Psychology, University of Miami, Coral Gables, FL, USA
- Daniel S Messinger
- Department of Psychology, University of Miami, Coral Gables, FL, USA
- Department of Electrical and Computer Engineering, University of Miami, Coral Gables, FL, USA
- Departments of Pediatrics and Music Engineering, University of Miami, Coral Gables, FL, USA
- Department of Psychology, University of Miami, 5665 Ponce de Leon Blvd., P.O. Box 248185, Coral Gables, FL, 33124, USA
2
So WC, Wong E, Ng W, Fuego J, Lay S, So MT, Lee YY, Chan WY, Chua LY, Lam HL, Lam WT, Li HM, Leung WT, Ng YH, Wong WT. Seeing through a robot's eyes: A cross-sectional exploratory study in developing a robotic screening technology for autism. Autism Res 2024; 17:366-380. [PMID: 38183409] [DOI: 10.1002/aur.3087]
Abstract
The present exploratory cross-sectional case-control study sought to develop a reliable and scalable screening tool for autism using a social robot. The robot HUMANE, equipped with computer vision and linked with recognition technology, detected the direction of children's eye gaze. Children aged 3-8 (M = 5.52; N = 199) participated: 87 had confirmed autism, 55 were suspected to have autism, and 57 raised no concern for autism. Before a session, a human experimenter instructed HUMANE to narrate a story to a child. HUMANE prompted the child to return his/her eye gaze to the robot if the child looked away, and praised the child for re-establishing eye gaze quickly after a prompt. The reliability of eye gaze detection was checked across all pairs of human raters and HUMANE and reached 0.90, indicating excellent interrater agreement. Using the pre-specified reference standard (Autism Spectrum Quotient), the sensitivity and specificity of the index tests (i.e., the number of robot prompts and the duration of inattentiveness) reached 0.88 or above, and the diagnostic odds ratios exceeded 190. These results show that social robots may detect atypical eye-gaze patterns, suggesting a potential future for autism screening with social robots.
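The sensitivity, specificity, and diagnostic odds ratio quoted above all derive from a 2x2 confusion table. A sketch of those definitions; the counts below are hypothetical, not the study's data:

```python
# Screening statistics from a 2x2 table: true/false positives and negatives.
# The diagnostic odds ratio (DOR) is (TP*TN)/(FN*FP).

def screening_stats(tp, fn, fp, tn):
    sens = tp / (tp + fn)          # sensitivity: true-positive rate
    spec = tn / (tn + fp)          # specificity: true-negative rate
    dor = (tp * tn) / (fn * fp)    # diagnostic odds ratio
    return sens, spec, dor

# Hypothetical counts for illustration only.
sens, spec, dor = screening_stats(tp=84, fn=3, fp=4, tn=53)
```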
Affiliation(s)
- Wing-Chee So
- Department of Educational Psychology, The Chinese University of Hong Kong, Shatin, Hong Kong
- Elsa Wong
- NEC Hong Kong Limited, Hung Hom, Hong Kong
- Wingo Ng
- NEC Hong Kong Limited, Hung Hom, Hong Kong
- John Fuego
- NEC Hong Kong Limited, Hung Hom, Hong Kong
- Sally Lay
- NEC Hong Kong Limited, Hung Hom, Hong Kong
- Ming-Ting So
- Department of Educational Psychology, The Chinese University of Hong Kong, Shatin, Hong Kong
- Yuen-Yung Lee
- Department of Educational Psychology, The Chinese University of Hong Kong, Shatin, Hong Kong
- Wai-Yan Chan
- Department of Educational Psychology, The Chinese University of Hong Kong, Shatin, Hong Kong
- Lok-Ying Chua
- Department of Educational Psychology, The Chinese University of Hong Kong, Shatin, Hong Kong
- Hiu-Lok Lam
- Department of Educational Psychology, The Chinese University of Hong Kong, Shatin, Hong Kong
- Wing-Tung Lam
- Department of Educational Psychology, The Chinese University of Hong Kong, Shatin, Hong Kong
- Hin-Miu Li
- Department of Educational Psychology, The Chinese University of Hong Kong, Shatin, Hong Kong
- Wing-To Leung
- Department of Educational Psychology, The Chinese University of Hong Kong, Shatin, Hong Kong
- Yu-Hei Ng
- Department of Educational Psychology, The Chinese University of Hong Kong, Shatin, Hong Kong
- Wing-Ting Wong
- Department of Educational Psychology, The Chinese University of Hong Kong, Shatin, Hong Kong
3
Alshammari RFN, Abd Rahman AH, Arshad H, Albahri OS. Real-Time Robotic Presentation Skill Scoring Using Multi-Model Analysis and Fuzzy Delphi-Analytic Hierarchy Process. Sensors (Basel) 2023; 23:9619. [PMID: 38139465] [PMCID: PMC10747450] [DOI: 10.3390/s23249619]
Abstract
Existing methods for scoring student presentations predominantly rely on computer-based implementations and do not incorporate a robotic multi-classification model. This limitation can result in potential misclassification issues as these approaches lack active feature learning capabilities due to fixed camera positions. Moreover, these scoring methods often solely focus on facial expressions and neglect other crucial factors, such as eye contact, hand gestures and body movements, thereby leading to potential biases or inaccuracies in scoring. To address these limitations, this study introduces Robotics-based Presentation Skill Scoring (RPSS), which employs a multi-model analysis. RPSS captures and analyses four key presentation parameters in real time, namely facial expressions, eye contact, hand gestures and body movements, and applies the fuzzy Delphi method for criteria selection and the analytic hierarchy process for weighting, thereby enabling decision makers or managers to assign varying weights to each criterion based on its relative importance. RPSS identifies five academic facial expressions and evaluates eye contact to achieve a comprehensive assessment and enhance its scoring accuracy. Specific sub-models are employed for each presentation parameter, namely EfficientNet for facial emotions, DeepEC for eye contact and an integrated Kalman and heuristic approach for hand and body movements. The scores are determined based on predefined rules. RPSS is implemented on a robot, and the results highlight its practical applicability. Each sub-model is rigorously evaluated offline and compared against benchmarks for selection. Real-world evaluations are also conducted by incorporating a novel active learning approach to improve performance by leveraging the robot's mobility. In a comparative evaluation with human tutors, RPSS achieves a remarkable average agreement of 99%, showcasing its effectiveness in assessing students' presentation skills.
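The analytic hierarchy process (AHP) weighting step described above derives criterion weights from a pairwise-comparison matrix. A sketch using the standard row-geometric-mean approximation; the comparison values are hypothetical and this is not necessarily the authors' exact procedure:

```python
# AHP priority weights via row geometric means (a standard approximation to
# the principal-eigenvector method). Matrix values are hypothetical.
import math

def ahp_weights(pairwise):
    """Normalize row geometric means of a pairwise-comparison matrix."""
    gm = [math.prod(row) ** (1.0 / len(row)) for row in pairwise]
    total = sum(gm)
    return [g / total for g in gm]

# Criteria: facial expression, eye contact, hand gesture, body movement.
# pairwise[i][j] = importance of criterion i relative to criterion j.
M = [[1,     2,   3, 3],
     [1/2,   1,   2, 2],
     [1/3, 1/2,   1, 1],
     [1/3, 1/2,   1, 1]]
weights = ahp_weights(M)
assert abs(sum(weights) - 1.0) < 1e-9  # weights form a unit simplex
```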
Affiliation(s)
- Rafeef Fauzi Najim Alshammari
- Center for Artificial Intelligence Technology, Faculty of Information Science & Technology, Universiti Kebangsaan Malaysia (UKM), Bangi 43600, Selangor, Malaysia
- College of Science, University of Kerbala, Karbala 56001, Iraq
- Abdul Hadi Abd Rahman
- Center for Artificial Intelligence Technology, Faculty of Information Science & Technology, Universiti Kebangsaan Malaysia (UKM), Bangi 43600, Selangor, Malaysia
- Haslina Arshad
- Center for Artificial Intelligence Technology, Faculty of Information Science & Technology, Universiti Kebangsaan Malaysia (UKM), Bangi 43600, Selangor, Malaysia
- Osamah Shihab Albahri
- Victorian Institute of Technology (VIT), Melbourne, VIC 3000, Australia
- Computer Techniques Engineering Department, Mazaya University College, Nasiriyah 64001, Iraq
4
Wu Z, Zhang C, Gu X, Duporge I, Hughey LF, Stabach JA, Skidmore AK, Hopcraft JGC, Lee SJ, Atkinson PM, McCauley DJ, Lamprey R, Ngene S, Wang T. Deep learning enables satellite-based monitoring of large populations of terrestrial mammals across heterogeneous landscape. Nat Commun 2023; 14:3072. [PMID: 37244940] [DOI: 10.1038/s41467-023-38901-y]
Abstract
New satellite remote sensing and machine learning techniques offer untapped possibilities to monitor global biodiversity with unprecedented speed and precision. These efficiencies promise to reveal novel ecological insights at spatial scales which are germane to the management of populations and entire ecosystems. Here, we present a robust transferable deep learning pipeline to automatically locate and count large herds of migratory ungulates (wildebeest and zebra) in the Serengeti-Mara ecosystem using fine-resolution (38-50 cm) satellite imagery. The results achieve accurate detection of nearly 500,000 individuals across thousands of square kilometers and multiple habitat types, with an overall F1-score of 84.75% (Precision: 87.85%, Recall: 81.86%). This research demonstrates the capability of satellite remote sensing and machine learning techniques to automatically and accurately count very large populations of terrestrial mammals across a highly heterogeneous landscape. We also discuss the potential for satellite-derived species detections to advance basic understanding of animal behavior and ecology.
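The overall F1-score above is the harmonic mean of precision and recall; a quick check reproduces the reported 84.75% from the stated precision and recall:

```python
# F1 = 2PR/(P+R): harmonic mean of precision and recall, here in percent.

def f1(precision: float, recall: float) -> float:
    return 2 * precision * recall / (precision + recall)

score = round(f1(87.85, 81.86), 2)  # 84.75, matching the reported F1-score
```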
Affiliation(s)
- Zijing Wu
- Department of Natural Resources, Faculty of Geo-Information Science and Earth Observation, University of Twente, Enschede, The Netherlands
- Ce Zhang
- Lancaster Environment Center, Lancaster University, Lancaster, UK
- UK Centre for Ecology & Hydrology, Lancaster, UK
- Xiaowei Gu
- School of Computing, University of Kent, Canterbury, UK
- Isla Duporge
- Department of Ecology and Evolutionary Biology, Princeton University, Princeton, NJ, USA
- U.S. Army Research Laboratory, Army Research Office, Durham, NC, USA
- The National Academies of Sciences, Washington, D.C., USA
- Lacey F Hughey
- Conservation Ecology Center, Smithsonian National Zoo and Conservation Biology Institute, Front Royal, VA, USA
- Jared A Stabach
- Conservation Ecology Center, Smithsonian National Zoo and Conservation Biology Institute, Front Royal, VA, USA
- Andrew K Skidmore
- Department of Natural Resources, Faculty of Geo-Information Science and Earth Observation, University of Twente, Enschede, The Netherlands
- School of Natural Sciences, Macquarie University, Sydney, NSW, Australia
- J Grant C Hopcraft
- Institute of Biodiversity, Animal Health, and Comparative Medicine, University of Glasgow, Glasgow, UK
- Stephen J Lee
- U.S. Army Research Laboratory, Army Research Office, Durham, NC, USA
- Peter M Atkinson
- Lancaster Environment Center, Lancaster University, Lancaster, UK
- Geography and Environmental Science, University of Southampton, Southampton, UK
- Douglas J McCauley
- Department of Ecology, Evolution and Marine Biology, University of California, Santa Barbara, CA, USA
- Richard Lamprey
- Department of Natural Resources, Faculty of Geo-Information Science and Earth Observation, University of Twente, Enschede, The Netherlands
- Shadrack Ngene
- Wildlife Research and Training Institute, Naivasha, Kenya
- Tiejun Wang
- Department of Natural Resources, Faculty of Geo-Information Science and Earth Observation, University of Twente, Enschede, The Netherlands
5
Li Y, Reed A, Kavoussi N, Wu JY. Eye gaze metrics for skill assessment and feedback in kidney stone surgery. Int J Comput Assist Radiol Surg 2023. [PMID: 37202714] [DOI: 10.1007/s11548-023-02901-6]
Abstract
PURPOSE
Surgical skill assessment is essential for safe operations. In endoscopic kidney stone surgery, surgeons must perform a highly skill-dependent mental mapping from the pre-operative scan to the intraoperative endoscope image. Poor mental mapping can lead to incomplete exploration of the kidney and high reoperation rates. Yet there are few objective ways to evaluate competency. We propose to use unobtrusive eye-gaze measurements in the task space to evaluate skill and provide feedback.
METHODS
We capture the surgeons' eye gaze on the surgical monitor with the Microsoft HoloLens 2. To enable stable and accurate gaze detection, we develop a calibration algorithm to refine the eye tracking of the HoloLens. In addition, we use a QR code to locate the eye gaze on the surgical monitor. We then run a user study with three expert and three novice surgeons. Each surgeon is tasked to locate three needles representing kidney stones in three different kidney phantoms.
RESULTS
We find that experts have more focused gaze patterns: they complete the task faster, have a smaller total gaze area, and gaze outside the area of interest fewer times. While the fixation to non-fixation ratio did not show a significant difference in our findings, tracking the ratio over time reveals different patterns between novices and experts.
CONCLUSION
We show that a non-negligible difference holds between novice and expert surgeons' gaze metrics in kidney stone identification in phantoms. Expert surgeons demonstrate more targeted gaze throughout a trial, indicating their higher level of proficiency. To improve the skill acquisition process for novice surgeons, we suggest providing sub-task-specific feedback. This approach presents an objective and non-invasive method to assess surgical competence.
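Gaze metrics like those above can be computed from a stream of gaze points and an area of interest (AOI) on the monitor. A sketch, not the authors' implementation; the gaze samples and AOI rectangle below are hypothetical:

```python
# Two simple gaze metrics from (x, y) samples and a rectangular AOI:
# the fraction of samples inside the AOI and the number of excursions
# (transitions from inside to outside).

def aoi_metrics(gaze, aoi):
    x0, y0, x1, y1 = aoi
    inside = [x0 <= x <= x1 and y0 <= y <= y1 for x, y in gaze]
    frac = sum(inside) / len(inside)
    excursions = sum(1 for a, b in zip(inside, inside[1:]) if a and not b)
    return frac, excursions

# Hypothetical normalized screen coordinates; AOI covers the unit square.
gaze = [(0.5, 0.5), (0.6, 0.5), (1.2, 0.9), (0.5, 0.4), (0.5, 0.5)]
frac, excursions = aoi_metrics(gaze, aoi=(0.0, 0.0, 1.0, 1.0))
```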
Affiliation(s)
- Yizhou Li
- Department of Computer Science, Vanderbilt University, 2301 Vanderbilt Pl, Nashville, TN, 37240, USA
- Amy Reed
- Department of Urology, Vanderbilt University Medical Center, 1211 Medical Center Dr, Nashville, TN, 37232, USA
- Nicholas Kavoussi
- Department of Urology, Vanderbilt University Medical Center, 1211 Medical Center Dr, Nashville, TN, 37232, USA
- Jie Ying Wu
- Department of Computer Science, Vanderbilt University, 2301 Vanderbilt Pl, Nashville, TN, 37240, USA
6
Lakkapragada A, Kline A, Mutlu OC, Paskov K, Chrisman B, Stockham N, Washington P, Wall DP. The Classification of Abnormal Hand Movement to Aid in Autism Detection: Machine Learning Study. JMIR Biomedical Engineering 2022. [DOI: 10.2196/33771]
Abstract
Background
A formal autism diagnosis can be an inefficient and lengthy process. Families may wait several months or longer before receiving a diagnosis for their child despite evidence that earlier intervention leads to better treatment outcomes. Digital technologies that detect the presence of behaviors related to autism can scale access to pediatric diagnoses. Self-stimulatory behaviors such as hand flapping are a strong indicator of the presence of autism.
Objective
This study aims to demonstrate the feasibility of deep learning technologies for the detection of hand flapping from unstructured home videos as a first step toward validation of whether statistical models coupled with digital technologies can be leveraged to aid in the automatic behavioral analysis of autism. To support the widespread sharing of such home videos, we explored privacy-preserving modifications to the input space via conversion of each video to hand landmark coordinates and measured the performance of corresponding time series classifiers.
Methods
We used the Self-Stimulatory Behavior Dataset (SSBD) that contains 75 videos of hand flapping, head banging, and spinning exhibited by children. From this data set, we extracted 100 hand flapping videos and 100 control videos, each between 2 to 5 seconds in duration. We evaluated five separate feature representations: four privacy-preserved subsets of hand landmarks detected by MediaPipe and one feature representation obtained from the output of the penultimate layer of a MobileNetV2 model fine-tuned on the SSBD. We fed these feature vectors into a long short-term memory network that predicted the presence of hand flapping in each video clip.
Results
The highest-performing model used MobileNetV2 to extract features and achieved a test F1 score of 84 (SD 3.7; precision 89.6, SD 4.3 and recall 80.4, SD 6) using 5-fold cross-validation for 100 random seeds on the SSBD data (500 total distinct folds). Of the models we trained on privacy-preserved data, the model trained with all hand landmarks reached an F1 score of 66.6 (SD 3.35). Another such model trained with a select 6 landmarks reached an F1 score of 68.3 (SD 3.6). A privacy-preserved model trained using a single landmark at the base of the hands and a model trained with the average of the locations of all the hand landmarks reached an F1 score of 64.9 (SD 6.5) and 64.2 (SD 6.8), respectively.
Conclusions
We created five lightweight neural networks that can detect hand flapping from unstructured videos. Training a long short-term memory network with convolutional feature vectors outperformed training with feature vectors of hand coordinates and used almost 900,000 fewer model parameters. This study provides the first step toward developing precise deep learning methods for activity detection of autism-related behaviors.
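The privacy-preserving reductions described in the Methods can be illustrated by collapsing a frame of 21 hand-landmark (x, y) pairs (MediaPipe's hand topology, with index 0 at the wrist) into either one landmark or the mean of all landmarks. A sketch with synthetic data, not the study's code:

```python
# Two privacy-preserving per-frame feature reductions over hand landmarks.

def single_landmark(frame, index=0):
    """Keep one landmark per frame (index 0 is the wrist in MediaPipe)."""
    return frame[index]

def mean_landmark(frame):
    """Average all landmark coordinates into one (x, y) point per frame."""
    n = len(frame)
    return (sum(x for x, _ in frame) / n, sum(y for _, y in frame) / n)

# One synthetic video frame of 21 normalized (x, y) landmarks.
frame = [(i / 21, i / 21) for i in range(21)]
wrist = single_landmark(frame)
center = mean_landmark(frame)
```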
7
Chi NA, Washington P, Kline A, Husic A, Hou C, He C, Dunlap K, Wall DP. Classifying Autism From Crowdsourced Semistructured Speech Recordings: Machine Learning Model Comparison Study. JMIR Pediatr Parent 2022; 5:e35406. [PMID: 35436234] [PMCID: PMC9052034] [DOI: 10.2196/35406]
Abstract
BACKGROUND
Autism spectrum disorder (ASD) is a neurodevelopmental disorder that results in altered behavior, social development, and communication patterns. In recent years, autism prevalence has tripled, with 1 in 44 children now affected. Given that traditional diagnosis is a lengthy, labor-intensive process that requires the work of trained physicians, significant attention has been given to developing systems that automatically detect autism. We work toward this goal by analyzing audio data, as prosody abnormalities are a signal of autism, with affected children displaying speech idiosyncrasies such as echolalia, monotonous intonation, atypical pitch, and irregular linguistic stress patterns.
OBJECTIVE
We aimed to test the ability of machine learning approaches to aid in the detection of autism in self-recorded speech audio captured from children with ASD and neurotypical (NT) children in their home environments.
METHODS
We considered three methods to detect autism in child speech: (1) random forests trained on extracted audio features (including Mel-frequency cepstral coefficients); (2) convolutional neural networks trained on spectrograms; and (3) fine-tuned wav2vec 2.0, a state-of-the-art transformer-based speech recognition model. We trained our classifiers on our novel data set of cellphone-recorded child speech audio curated from the Guess What? mobile game, an app designed to crowdsource videos of children with ASD and NT children in a natural home environment.
RESULTS
The random forest classifier achieved 70% accuracy, the fine-tuned wav2vec 2.0 model achieved 77% accuracy, and the convolutional neural network achieved 79% accuracy when classifying children's audio as either ASD or NT. We used 5-fold cross-validation to evaluate model performance.
CONCLUSIONS
Our models were able to predict autism status when trained on a varied selection of home audio clips with inconsistent recording qualities, which may be more representative of real-world conditions. The results demonstrate that machine learning methods offer promise in detecting autism automatically from speech without specialized equipment.
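The 5-fold cross-validation protocol used above partitions the clips so each appears in exactly one held-out fold, and the reported accuracy is the mean across folds. A minimal sketch of the fold assignment, illustrative rather than the authors' split:

```python
# Partition item indices into k folds of near-equal size for k-fold CV.

def k_fold_indices(n_items, k=5):
    folds = [[] for _ in range(k)]
    for i in range(n_items):
        folds[i % k].append(i)  # round-robin assignment
    return folds

folds = k_fold_indices(200, k=5)
assert sum(len(f) for f in folds) == 200  # every clip lands in one fold
```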
Affiliation(s)
- Nathan A Chi
- Division of Systems Medicine, Department of Pediatrics, Stanford University, Palo Alto, CA, United States
- Peter Washington
- Department of Bioengineering, Stanford University, Stanford, CA, United States
- Aaron Kline
- Division of Systems Medicine, Department of Pediatrics, Stanford University, Palo Alto, CA, United States
- Arman Husic
- Division of Systems Medicine, Department of Pediatrics, Stanford University, Palo Alto, CA, United States
- Cathy Hou
- Department of Computer Science, Stanford University, Stanford, CA, United States
- Chloe He
- Department of Biomedical Data Science, Stanford University, Stanford, CA, United States
- Kaitlyn Dunlap
- Division of Systems Medicine, Department of Pediatrics, Stanford University, Palo Alto, CA, United States
- Dennis P Wall
- Division of Systems Medicine, Department of Pediatrics, Stanford University, Palo Alto, CA, United States
- Department of Biomedical Data Science, Stanford University, Stanford, CA, United States
- Department of Psychiatry and Behavioral Sciences, Stanford University, Stanford, CA, United States
8
Lombardi M, Maiettini E, De Tommaso D, Wykowska A, Natale L. Toward an Attentive Robotic Architecture: Learning-Based Mutual Gaze Estimation in Human–Robot Interaction. Front Robot AI 2022; 9:770165. [PMID: 35321344] [PMCID: PMC8935014] [DOI: 10.3389/frobt.2022.770165]
Abstract
Social robotics is an emerging field that is expected to grow rapidly in the near future. It is increasingly common for robots to operate in close proximity to humans or even collaborate with them in joint tasks. In this context, how to endow a humanoid robot with the social behavioral skills typical of human–human interactions is still an open problem. Among the countless social cues needed to establish natural social attunement, this article reports our research toward implementing a mechanism for estimating gaze direction, focusing in particular on mutual gaze as a fundamental social cue in face-to-face interactions. We propose a learning-based framework to automatically detect eye contact events in online interactions with human partners. The proposed solution achieved high performance both in silico and in experimental scenarios. Our work is a first step toward an attentive architecture able to support scenarios in which robots are perceived as social partners.
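Frame-level classifiers like the one described above are commonly post-processed into discrete eye-contact events by requiring a minimum run length of positive frames. A sketch of that step; the minimum length and the prediction sequence are hypothetical, not the authors' parameters:

```python
# Turn per-frame eye-contact predictions (0/1) into (start, end) events,
# keeping only runs of at least min_len consecutive positive frames.

def contact_events(frames, min_len=3):
    events, start = [], None
    for i, f in enumerate(frames + [0]):  # sentinel 0 closes a trailing run
        if f and start is None:
            start = i
        elif not f and start is not None:
            if i - start >= min_len:
                events.append((start, i))
            start = None
    return events

events = contact_events([0, 1, 1, 1, 0, 1, 0, 1, 1, 1, 1])
```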
Affiliation(s)
- Maria Lombardi
- Humanoid Sensing and Perception, Istituto Italiano di Tecnologia, Genova, Italy
- Correspondence: Maria Lombardi
- Elisa Maiettini
- Humanoid Sensing and Perception, Istituto Italiano di Tecnologia, Genova, Italy
- Davide De Tommaso
- Social Cognition in Human-Robot Interaction, Istituto Italiano di Tecnologia, Genova, Italy
- Agnieszka Wykowska
- Social Cognition in Human-Robot Interaction, Istituto Italiano di Tecnologia, Genova, Italy
- Lorenzo Natale
- Humanoid Sensing and Perception, Istituto Italiano di Tecnologia, Genova, Italy
9
Messinger DS, Perry LK, Mitsven SG, Tao Y, Moffitt J, Fasano RM, Custode SA, Jerry CM. Computational approaches to understanding interaction and development. Adv Child Dev Behav 2022; 62:191-230. [PMID: 35249682] [PMCID: PMC9840818] [DOI: 10.1016/bs.acdb.2021.12.002]
Abstract
Audio-visual recording and location tracking produce enormous quantities of digital data with which researchers can document children's everyday interactions in naturalistic settings and assessment contexts. Machine learning and other computational approaches can produce replicable, automated measurements of these big behavioral data. The economies of scale afforded by repeated automated measurements offer a potent approach to investigating linkages between real-time behavior and developmental change. In our work, automated measurements of audio from child-worn recorders, which quantify the frequency of child and adult speech and index its phonemic complexity, are paired with ultra-wideband radio tracking of children's location and interpersonal orientation. Applications of objective measurement indicate the influence of adult behavior on both expert ratings of attachment behavior and ratings of autism severity, suggesting the role of dyadic factors in these "child" assessments. In the preschool classroom, location/orientation measures provide data-driven indices of children's social contact, fertile ground for vocal interactions. Both the velocity of children's movement toward one another and their social contact with one another evidence homophily: children with autism spectrum disorder, children with other developmental disabilities, and typically developing children were each more likely to interact with children in the same group, even in inclusive preschool classrooms designed to promote interchange among all children. In the vocal domain, the frequency of peer speech and the phonemic complexity of teacher speech predict the frequency and phonemic complexity of children's own speech over multiple timescales. Moreover, children's own speech predicts their assessed language abilities across disability groups, suggesting how everyday interactions facilitate development.
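A homophily analysis in the spirit of the classroom findings above compares the observed fraction of same-group contacts with the fraction expected under random mixing. A minimal sketch with synthetic data, not the chapter's actual method:

```python
# Observed vs. expected same-group contact fraction.
from collections import Counter

def homophily(groups, contacts):
    """groups: dict child -> group label; contacts: list of (child, child)."""
    observed = sum(groups[a] == groups[b] for a, b in contacts) / len(contacts)
    counts = Counter(groups.values())
    n = len(groups)
    # Chance that a random ordered pair of distinct children shares a group.
    expected = sum(c * (c - 1) for c in counts.values()) / (n * (n - 1))
    return observed, expected

# Synthetic classroom of four children in two groups.
groups = {"a": "ASD", "b": "ASD", "c": "TD", "d": "TD"}
obs, exp = homophily(groups, [("a", "b"), ("a", "c"), ("c", "d")])
```

An observed fraction above the expected one indicates homophily in the contact data.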
Affiliation(s)
- D S Messinger
- Department of Psychology, University of Miami, Coral Gables, FL, United States
- Department of Electrical and Computer Engineering, University of Miami, Coral Gables, FL, United States
- Departments of Pediatrics and Music Engineering, University of Miami, Coral Gables, FL, United States
- L K Perry
- Department of Psychology, University of Miami, Coral Gables, FL, United States
- S G Mitsven
- Department of Psychology, University of Miami, Coral Gables, FL, United States
- Y Tao
- Departments of Pediatrics and Music Engineering, University of Miami, Coral Gables, FL, United States
- J Moffitt
- Department of Psychology, University of Miami, Coral Gables, FL, United States
- R M Fasano
- Department of Psychology, University of Miami, Coral Gables, FL, United States
- S A Custode
- Department of Psychology, University of Miami, Coral Gables, FL, United States
- C M Jerry
- Department of Psychology, University of Miami, Coral Gables, FL, United States
- Department of Psychology, Indiana University, Bloomington, IN, United States
10
Sumioka H, Shiomi M, Honda M, Nakazawa A. Technical Challenges for Smooth Interaction With Seniors With Dementia: Lessons From Humanitude™. Front Robot AI 2021; 8:650906. [PMID: 34150858] [PMCID: PMC8207295] [DOI: 10.3389/frobt.2021.650906]
Abstract
Due to cognitive and socio-emotional decline and mental diseases, senior citizens, especially people with dementia (PwD), struggle to interact smoothly with their caregivers. Therefore, various care techniques have been proposed to develop good relationships with seniors. Among them, Humanitude is one promising technique that provides caregivers with useful interaction skills to improve their relationships with PwD from four perspectives: face-to-face interaction, verbal communication, touch interaction, and helping care receivers stand up (physical interaction). Despite advances in elderly care techniques, current social robots interact with seniors in the same manner as they do with younger adults and thus lack several important functions. For example, Humanitude emphasizes the importance of interaction at a relatively intimate distance to facilitate communication with seniors. Unfortunately, few studies have developed an interaction model for clinical care communication. In this paper, we discuss the current challenges in developing a social robot that can smoothly interact with PwD and review the interaction skills used in Humanitude as well as the existing technologies.
Affiliation(s)
- Hidenobu Sumioka
- Advanced Telecommunications Research Institute International, Kyoto, Japan
- Masahiro Shiomi
- Advanced Telecommunications Research Institute International, Kyoto, Japan
- Miwako Honda
- National Hospital Organization Tokyo Medical Center, Tokyo, Japan