1
|
Washington P. A Perspective on Crowdsourcing and Human-in-the-Loop Workflows in Precision Health. J Med Internet Res 2024; 26:e51138. [PMID: 38602750 PMCID: PMC11046386 DOI: 10.2196/51138] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2023] [Revised: 11/15/2023] [Accepted: 01/30/2024] [Indexed: 04/12/2024] Open
Abstract
Modern machine learning approaches have led to performant diagnostic models for a variety of health conditions. Several machine learning approaches, such as decision trees and deep neural networks, can, in principle, approximate any function. However, this power can be considered to be both a gift and a curse, as the propensity toward overfitting is magnified when the input data are heterogeneous and high dimensional and the output class is highly nonlinear. This issue can especially plague diagnostic systems that predict behavioral and psychiatric conditions that are diagnosed with subjective criteria. An emerging solution to this issue is crowdsourcing, where crowd workers are paid to annotate complex behavioral features in return for monetary compensation or a gamified experience. These labels can then be used to derive a diagnosis, either directly or by using the labels as inputs to a diagnostic machine learning model. This viewpoint describes existing work in this emerging field and discusses ongoing challenges and opportunities with crowd-powered diagnostic systems, a nascent field of study. With the correct considerations, the addition of crowdsourcing to human-in-the-loop machine learning workflows for the prediction of complex and nuanced health conditions can accelerate screening, diagnostics, and ultimately access to care.
Collapse
Affiliation(s)
- Peter Washington
- Information and Computer Sciences, University of Hawaii at Manoa, Honolulu, HI, United States
| |
Collapse
|
2
|
Jaiswal A, Kruiper R, Rasool A, Nandkeolyar A, Wall DP, Washington P. Digitally Diagnosing Multiple Developmental Delays Using Crowdsourcing Fused With Machine Learning: Protocol for a Human-in-the-Loop Machine Learning Study. JMIR Res Protoc 2024; 13:e52205. [PMID: 38329783 PMCID: PMC10884895 DOI: 10.2196/52205] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2023] [Revised: 12/17/2023] [Accepted: 12/26/2023] [Indexed: 02/09/2024] Open
Abstract
BACKGROUND A considerable number of minors in the United States are diagnosed with developmental or psychiatric conditions, potentially influenced by underdiagnosis factors such as cost, distance, and clinician availability. Despite the potential of digital phenotyping tools with machine learning (ML) approaches to expedite diagnoses and enhance diagnostic services for pediatric psychiatric conditions, existing methods face limitations because they use a limited set of social features for prediction tasks and focus on a single binary prediction, resulting in uncertain accuracies. OBJECTIVE This study aims to propose the development of a gamified web system for data collection, followed by a fusion of novel crowdsourcing algorithms with ML behavioral feature extraction approaches to simultaneously predict diagnoses of autism spectrum disorder and attention-deficit/hyperactivity disorder in a precise and specific manner. METHODS The proposed pipeline will consist of (1) gamified web applications to curate videos of social interactions adaptively based on the needs of the diagnostic system, (2) behavioral feature extraction techniques consisting of automated ML methods and novel crowdsourcing algorithms, and (3) the development of ML models that classify several conditions simultaneously and that adaptively request additional information based on uncertainties about the data. RESULTS A preliminary version of the web interface has been implemented, and a prior feature selection method has highlighted a core set of behavioral features that can be targeted through the proposed gamified approach. CONCLUSIONS The prospect for high reward stems from the possibility of creating the first artificial intelligence-powered tool that can identify complex social behaviors well enough to distinguish conditions with nuanced differentiators such as autism spectrum disorder and attention-deficit/hyperactivity disorder. INTERNATIONAL REGISTERED REPORT IDENTIFIER (IRRID) PRR1-10.2196/52205.
Collapse
Affiliation(s)
- Aditi Jaiswal
- Department of Information and Computer Sciences, University of Hawaii at Manoa, Honolulu, HI, United States
| | - Ruben Kruiper
- Department of Information and Computer Sciences, University of Hawaii at Manoa, Honolulu, HI, United States
| | - Abdur Rasool
- Department of Information and Computer Sciences, University of Hawaii at Manoa, Honolulu, HI, United States
| | - Aayush Nandkeolyar
- Department of Information and Computer Sciences, University of Hawaii at Manoa, Honolulu, HI, United States
| | - Dennis P Wall
- Department of Pediatrics (Systems Medicine), Stanford University School of Medicine, Stanford, CA, United States
- Department of Biomedical Data Science, Stanford University School of Medicine, Stanford, CA, United States
- Department of Psychiatry and Behavioral Sciences, Stanford University School of Medicine, Stanford, CA, United States
| | - Peter Washington
- Department of Information and Computer Sciences, University of Hawaii at Manoa, Honolulu, HI, United States
| |
Collapse
|
3
|
Todowede O, Lewandowski F, Kotera Y, Ashmore A, Rennick-Egglestone S, Boyd D, Moran S, Ørjasæter KB, Repper J, Robotham D, Rowe M, Katsampa D, Slade M. Best practice guidelines for citizen science in mental health research: systematic review and evidence synthesis. Front Psychiatry 2023; 14:1175311. [PMID: 37743990 PMCID: PMC10515389 DOI: 10.3389/fpsyt.2023.1175311] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/27/2023] [Accepted: 08/25/2023] [Indexed: 09/26/2023] Open
Abstract
Partnering with people most affected by mental health problems can transform mental health outcomes. Citizen science as a research approach enables partnering with the public at a substantial scale, but there is scarce guidance on its use in mental health research. To develop best practise guidelines for conducting and reporting research, we conducted a systematic review of studies reporting mental health citizen science research. Documents were identified from electronic databases (n = 10), grey literature, conference proceedings, hand searching of specific journals and citation tracking. Document content was organised in NVIVO using the ten European Citizen Science Association (ECSA) citizen science principles. Best practise guidelines were developed by (a) identifying approaches specific to mental health research or where citizen science and mental health practises differ, (b) identifying relevant published reporting guidelines and methodologies already used in mental health research, and (c) identifying specific elements to include in reporting studies. A total of 14,063 documents were screened. Nine studies were included, from Australia, Belgium, Canada, Denmark, Netherlands, Spain, the UK, and the United States. Citizen scientists with lived experience of mental health problems were involved in data collection, analysis, project design, leadership, and dissemination of results. Most studies reported against some ECSA principles but reporting against these principles was often unclear and unstated. Best practise guidelines were developed, which identified mental health-specific issues relevant to citizen science, and reporting recommendations. These included citizen science as a mechanism for empowering people affected by mental health problems, attending to safeguarding issues such as health-related advice being shared between contributors, the use of existing health research reporting guidelines, evaluating the benefits for contributors and impact on researchers, explicit reporting of participation at each research stage, naming the citizen science platform and data repository, and clear reporting of consent processes, data ownership, and data sharing arrangements. We conclude that citizen science is feasible in mental health and can be complementary to other participatory approaches. It can contribute to active involvement, engagement, and knowledge production with the public. The proposed guidelines will support the quality of citizen science reporting.
Collapse
Affiliation(s)
- Olamide Todowede
- School of Health Sciences, Institute of Mental Health, University of Nottingham, Nottingham, United Kingdom
| | - Felix Lewandowski
- School of Psychology, University of Nottingham, Nottingham, United Kingdom
| | - Yasuhiro Kotera
- School of Health Sciences, Institute of Mental Health, University of Nottingham, Nottingham, United Kingdom
| | - Alison Ashmore
- University of Nottingham Libraries, Nottingham, United Kingdom
| | - Stefan Rennick-Egglestone
- School of Health Sciences, Institute of Mental Health, University of Nottingham, Nottingham, United Kingdom
| | - Doreen Boyd
- School of Geography, University of Nottingham, Nottingham, United Kingdom
| | - Stuart Moran
- Information Services, University of Nottingham, Nottingham, United Kingdom
| | - Kristin Berre Ørjasæter
- Nord University, Faculty of Nursing and Health Sciences, Health and Community Participation Division, Namsos, Norway
| | - Julie Repper
- ImROC, Nottinghamshire Healthcare NHS Foundation Trust, Nottingham, United Kingdom
| | | | - Michael Rowe
- Program for Recovery and Community Health, Yale University, New Haven, CT, United States
| | - Dafni Katsampa
- National Elf Service, London, United Kingdom
- School of Psychology, University of Hertfordshire, Hatfield, United Kingdom
| | - Mike Slade
- School of Health Sciences, Institute of Mental Health, University of Nottingham, Nottingham, United Kingdom
- Nord University, Faculty of Nursing and Health Sciences, Health and Community Participation Division, Namsos, Norway
| |
Collapse
|
4
|
Washington P, Wall DP. A Review of and Roadmap for Data Science and Machine Learning for the Neuropsychiatric Phenotype of Autism. Annu Rev Biomed Data Sci 2023; 6:211-228. [PMID: 37137169 PMCID: PMC11093217 DOI: 10.1146/annurev-biodatasci-020722-125454] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/05/2023]
Abstract
Autism spectrum disorder (autism) is a neurodevelopmental delay that affects at least 1 in 44 children. Like many neurological disorder phenotypes, the diagnostic features are observable, can be tracked over time, and can be managed or even eliminated through proper therapy and treatments. However, there are major bottlenecks in the diagnostic, therapeutic, and longitudinal tracking pipelines for autism and related neurodevelopmental delays, creating an opportunity for novel data science solutions to augment and transform existing workflows and provide increased access to services for affected families. Several efforts previously conducted by a multitude of research labs have spawned great progress toward improved digital diagnostics and digital therapies for children with autism. We review the literature on digital health methods for autism behavior quantification and beneficial therapies using data science. We describe both case-control studies and classification systems for digital phenotyping. We then discuss digital diagnostics and therapeutics that integrate machine learning models of autism-related behaviors, including the factors that must be addressed for translational use. Finally, we describe ongoing challenges and potential opportunities for the field of autism data science. Given the heterogeneous nature of autism and the complexities of the relevant behaviors, this review contains insights that are relevant to neurological behavior analysis and digital psychiatry more broadly.
Collapse
Affiliation(s)
- Peter Washington
- Department of Information and Computer Sciences, University of Hawai'i at Mānoa, Honolulu, Hawai'i, USA
| | - Dennis P Wall
- Departments of Pediatrics (Systems Medicine), Biomedical Data Science, and Psychiatry and Behavioral Sciences, Stanford University School of Medicine, Stanford, California, USA;
| |
Collapse
|
5
|
Banerjee A, Mutlu OC, Kline A, Surabhi S, Washington P, Wall DP. Training and Profiling a Pediatric Facial Expression Classifier for Children on Mobile Devices: Machine Learning Study. JMIR Form Res 2023; 7:e39917. [PMID: 35962462 PMCID: PMC10131663 DOI: 10.2196/39917] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2022] [Revised: 08/01/2022] [Accepted: 08/09/2022] [Indexed: 11/13/2022] Open
Abstract
BACKGROUND Implementing automated facial expression recognition on mobile devices could provide an accessible diagnostic and therapeutic tool for those who struggle to recognize facial expressions, including children with developmental behavioral conditions such as autism. Despite recent advances in facial expression classifiers for children, existing models are too computationally expensive for smartphone use. OBJECTIVE We explored several state-of-the-art facial expression classifiers designed for mobile devices, used posttraining optimization techniques for both classification performance and efficiency on a Motorola Moto G6 phone, evaluated the importance of training our classifiers on children versus adults, and evaluated the models' performance against different ethnic groups. METHODS We collected images from 12 public data sets and used video frames crowdsourced from the GuessWhat app to train our classifiers. All images were annotated for 7 expressions: neutral, fear, happiness, sadness, surprise, anger, and disgust. We tested 3 copies for each of 5 different convolutional neural network architectures: MobileNetV3-Small 1.0x, MobileNetV2 1.0x, EfficientNetB0, MobileNetV3-Large 1.0x, and NASNetMobile. We trained the first copy on images of children, second copy on images of adults, and third copy on all data sets. We evaluated each model against the entire Child Affective Facial Expression (CAFE) set and by ethnicity. We performed weight pruning, weight clustering, and quantize-aware training when possible and profiled each model's performance on the Moto G6. RESULTS Our best model, a MobileNetV3-Large network pretrained on ImageNet, achieved 65.78% accuracy and 65.31% F1-score on the CAFE and a 90-millisecond inference latency on a Moto G6 phone when trained on all data. This accuracy is only 1.12% lower than the current state of the art for CAFE, a model with 13.91x more parameters that was unable to run on the Moto G6 due to its size, even when fully optimized. When trained solely on children, this model achieved 60.57% accuracy and 60.29% F1-score. When trained only on adults, the model received 53.36% accuracy and 53.10% F1-score. Although the MobileNetV3-Large trained on all data sets achieved nearly a 60% F1-score across all ethnicities, the data sets for South Asian and African American children achieved lower accuracy (as much as 11.56%) and F1-score (as much as 11.25%) than other groups. CONCLUSIONS With specialized design and optimization techniques, facial expression classifiers can become lightweight enough to run on mobile devices and achieve state-of-the-art performance. There is potentially a "data shift" phenomenon between facial expressions of children compared with adults; our classifiers performed much better when trained on children. Certain underrepresented ethnic groups (e.g., South Asian and African American) also perform significantly worse than groups such as European Caucasian despite similar data quality. Our models can be integrated into mobile health therapies to help diagnose autism spectrum disorder and provide targeted therapeutic treatment to children.
Collapse
Affiliation(s)
- Agnik Banerjee
- Department of Pediatrics (Systems Medicine), Stanford University, Stanford, CA, United States
| | - Onur Cezmi Mutlu
- Department of Electrical Engineering, Stanford University, Stanford, CA, United States
| | - Aaron Kline
- Department of Pediatrics (Systems Medicine), Stanford University, Stanford, CA, United States
| | - Saimourya Surabhi
- Department of Pediatrics (Systems Medicine), Stanford University, Stanford, CA, United States
| | - Peter Washington
- Department of Information and Computer Sciences, University of Hawai`i at Mānoa, Honolulu, HI, United States
| | - Dennis Paul Wall
- Department of Pediatrics (Systems Medicine), Stanford University, Stanford, CA, United States
- Department of Biomedical Data Science, Stanford University, Stanford, CA, United States
- Department of Psychiatry and Behavioral Sciences, Stanford University, Stanford, CA, United States
| |
Collapse
|
6
|
Washington P. Digitally Diagnosing Multiple Developmental Delays using Crowdsourcing fused with Machine Learning: A Research Protocol. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.03.05.23286817. [PMID: 36945467 PMCID: PMC10029023 DOI: 10.1101/2023.03.05.23286817] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/09/2023]
Abstract
Background Roughly 17% percent of minors in the United States aged 3 through 17 years have a diagnosis of one or more developmental or psychiatric conditions, with the true prevalence likely being higher due to underdiagnosis in rural areas and for minority populations. Unfortunately, timely diagnostic services are inaccessible to a large portion of the United States and global population due to cost, distance, and clinician availability. Digital phenotyping tools have the potential to shorten the time-to-diagnosis and to bring diagnostic services to more people by enabling accessible evaluations. While automated machine learning (ML) approaches for detection of pediatric psychiatry conditions have garnered increased research attention in recent years, existing approaches use a limited set of social features for the prediction task and focus on a single binary prediction. Objective I propose the development of a gamified web system for data collection followed by a fusion of novel crowdsourcing algorithms with machine learning behavioral feature extraction approaches to simultaneously predict diagnoses of Autism Spectrum Disorder (ASD) and Attention-Deficit/Hyperactivity Disorder (ADHD) in a precise and specific manner. Methods The proposed pipeline will consist of: (1) a gamified web applications to curate videos of social interactions adaptively based on needs of the diagnostic system, (2) behavioral feature extraction techniques consisting of automated ML methods and novel crowdsourcing algorithms, and (3) development of ML models which classify several conditions simultaneously and which adaptively request additional information based on uncertainties about the data. Conclusions The prospective for high reward stems from the possibility of creating the first AI-powered tool which can identify complex social behaviors well enough to distinguish conditions with nuanced differentiators such as ASD and ADHD.
Collapse
|
7
|
Previously Marzena Szkodo MOR, Micai M, Caruso A, Fulceri F, Fazio M, Scattoni ML. Technologies to support the diagnosis and/or treatment of neurodevelopmental disorders: A systematic review. Neurosci Biobehav Rev 2023; 145:105021. [PMID: 36581169 DOI: 10.1016/j.neubiorev.2022.105021] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2022] [Revised: 12/13/2022] [Accepted: 12/23/2022] [Indexed: 12/27/2022]
Abstract
In recent years, there has been a great interest in utilizing technology in mental health research. The rapid technological development has encouraged researchers to apply technology as a part of a diagnostic process or treatment of Neurodevelopmental Disorders (NDDs). With the large number of studies being published comes an urgent need to inform clinicians and researchers about the latest advances in this field. Here, we methodically explore and summarize findings from studies published between August 2019 and February 2022. A search strategy led to the identification of 4108 records from PubMed and APA PsycInfo databases. 221 quantitative studies were included, covering a wide range of technologies used for diagnosis and/or treatment of NDDs, with the biggest focus on Autism Spectrum Disorder (ASD). The most popular technologies included machine learning, functional magnetic resonance imaging, electroencephalogram, magnetic resonance imaging, and neurofeedback. The results of the review indicate that technology-based diagnosis and intervention for NDD population is promising. However, given a high risk of bias of many studies, more high-quality research is needed.
Collapse
Affiliation(s)
| | - Martina Micai
- Research Coordination and Support Service, Istituto Superiore di Sanità, Viale Regina Elena 299, 00161 Rome, Italy.
| | - Angela Caruso
- Research Coordination and Support Service, Istituto Superiore di Sanità, Viale Regina Elena 299, 00161 Rome, Italy.
| | - Francesca Fulceri
- Research Coordination and Support Service, Istituto Superiore di Sanità, Viale Regina Elena 299, 00161 Rome, Italy.
| | - Maria Fazio
- Department of Mathematics, Computer Science, Physics and Earth Sciences (MIFT), University of Messina, Viale F. Stagno d'Alcontres, 31, 98166 Messina, Italy.
| | - Maria Luisa Scattoni
- Research Coordination and Support Service, Istituto Superiore di Sanità, Viale Regina Elena 299, 00161 Rome, Italy.
| |
Collapse
|
8
|
How to Make the Unpredictable Foreseeable? Effective Forms of Assistance for Children with Autism Spectrum Disorder (ASD) during the COVID-19 Pandemic. Diagnostics (Basel) 2023; 13:diagnostics13030407. [PMID: 36766512 PMCID: PMC9914931 DOI: 10.3390/diagnostics13030407] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2022] [Revised: 01/18/2023] [Accepted: 01/21/2023] [Indexed: 01/24/2023] Open
Abstract
Symptomatology in patients with the diagnosis of autism spectrum disorder (ASD) is very heterogeneous. The symptoms they present include communication difficulties, behavior problems, upbringing problems from their parents, and comorbidities (e.g., epilepsy, intellectual disability). A predictable and stable environment and the continuity of therapeutic interactions are crucial in this population. The COVID-19 pandemic has created much concern, and the need for home isolation to limit the spread of the virus has disrupted the functioning routine of children/adolescents with ASD. Are there effective diagnostic and therapeutic alternatives to limit the consequences of disturbing the daily routine of young patients during the unpredictable times of the pandemic? Modern technology and telemedicine have come to the rescue. This narrative review aims to present a change in the impact profile in the era of isolation and assess the directions of changes that specialists may choose when dealing with patients with ASD.
Collapse
|
9
|
Washington PY, Puniwai N, Kamaka M, Gürsoy G, Tatonetti N, Brenner SE, Wall DP. Session Introduction: TOWARDS ETHICAL BIOMEDICAL INFORMATICS: LEARNING FROM OLELO NOEAU, HAWAIIAN PROVERBS. PACIFIC SYMPOSIUM ON BIOCOMPUTING. PACIFIC SYMPOSIUM ON BIOCOMPUTING 2023; 28:461-471. [PMID: 36541000 PMCID: PMC11095408] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
Innovations in human-centered biomedical informatics are often developed with the eventual goal of real-world translation. While biomedical research questions are usually answered in terms of how a method performs in a particular context, we argue that it is equally important to consider and formally evaluate the ethical implications of informatics solutions. Several new research paradigms have arisen as a result of the consideration of ethical issues, including but not limited for privacy-preserving computation and fair machine learning. In the spirit of the Pacific Symposium on Biocomputing, we discuss broad and fundamental principles of ethical biomedical informatics in terms of Olelo Noeau, or Hawaiian proverbs and poetical sayings that capture Hawaiian values. While we emphasize issues related to privacy and fairness in particular, there are a multitude of facets to ethical biomedical informatics that can benefit from a critical analysis grounded in ethics.
Collapse
Affiliation(s)
- Peter Y Washington
- Department of Information & Computer Sciences, University of Hawaii at Manoa Honolulu, HI 96822, USA,
| | | | | | | | | | | | | |
Collapse
|
10
|
Deveau N, Washington P, Leblanc E, Husic A, Dunlap K, Penev Y, Kline A, Mutlu OC, Wall DP. Machine learning models using mobile game play accurately classify children with autism. INTELLIGENCE-BASED MEDICINE 2022; 6:100057. [PMID: 36035501 PMCID: PMC9398788 DOI: 10.1016/j.ibmed.2022.100057] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/06/2021] [Revised: 01/10/2022] [Accepted: 03/29/2022] [Indexed: 11/23/2022]
Abstract
Digitally-delivered healthcare is well suited to address current inequities in the delivery of care due to barriers of access to healthcare facilities. As the COVID-19 pandemic phases out, we have a unique opportunity to capitalize on the current familiarity with telemedicine approaches and continue to advocate for mainstream adoption of remote care delivery. In this paper, we specifically focus on the ability of GuessWhat? a smartphone-based charades-style gamified therapeutic intervention for autism spectrum disorder (ASD) to generate a signal that distinguishes children with ASD from neurotypical (NT) children. We demonstrate the feasibility of using "in-the-wild", naturalistic gameplay data to distinguish between ASD and NT by children by training a random forest classifier to discern the two classes (AU-ROC = 0.745, recall = 0.769). This performance demonstrates the potential for GuessWhat? to facilitate screening for ASD in historically difficult-to-reach communities. To further examine this potential, future work should expand the size of the training sample and interrogate differences in predictive ability by demographic.
Collapse
Affiliation(s)
- Nicholas Deveau
- Biomedical Data Science, Stanford University, Stanford, 94305, California, United States
| | - Peter Washington
- Bioengineering, Stanford University, Stanford, 94305, California, United States
| | - Emilie Leblanc
- Pediatrics, Stanford University, Stanford, 94305, California, United States
| | - Arman Husic
- Pediatrics, Stanford University, Stanford, 94305, California, United States
| | - Kaitlyn Dunlap
- Pediatrics, Stanford University, Stanford, 94305, California, United States
| | - Yordan Penev
- Pediatrics, Stanford University, Stanford, 94305, California, United States
| | - Aaron Kline
- Pediatrics, Stanford University, Stanford, 94305, California, United States
| | - Onur Cezmi Mutlu
- Electrical Engineering, Stanford University, Stanford, 94305, California, United States
| | - Dennis P Wall
- Biomedical Data Science, Stanford University, Stanford, 94305, California, United States
- Pediatrics, Stanford University, Stanford, 94305, California, United States
| |
Collapse
|
11
|
Lakkapragada A, Kline A, Mutlu OC, Paskov K, Chrisman B, Stockham N, Washington P, Wall DP. The Classification of Abnormal Hand Movement to Aid in Autism Detection: Machine Learning Study. JMIR BIOMEDICAL ENGINEERING 2022. [DOI: 10.2196/33771] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023] Open
Abstract
Background
A formal autism diagnosis can be an inefficient and lengthy process. Families may wait several months or longer before receiving a diagnosis for their child despite evidence that earlier intervention leads to better treatment outcomes. Digital technologies that detect the presence of behaviors related to autism can scale access to pediatric diagnoses. A strong indicator of the presence of autism is self-stimulatory behaviors such as hand flapping.
Objective
This study aims to demonstrate the feasibility of deep learning technologies for the detection of hand flapping from unstructured home videos as a first step toward validation of whether statistical models coupled with digital technologies can be leveraged to aid in the automatic behavioral analysis of autism. To support the widespread sharing of such home videos, we explored privacy-preserving modifications to the input space via conversion of each video to hand landmark coordinates and measured the performance of corresponding time series classifiers.
Methods
We used the Self-Stimulatory Behavior Dataset (SSBD) that contains 75 videos of hand flapping, head banging, and spinning exhibited by children. From this data set, we extracted 100 hand flapping videos and 100 control videos, each between 2 to 5 seconds in duration. We evaluated five separate feature representations: four privacy-preserved subsets of hand landmarks detected by MediaPipe and one feature representation obtained from the output of the penultimate layer of a MobileNetV2 model fine-tuned on the SSBD. We fed these feature vectors into a long short-term memory network that predicted the presence of hand flapping in each video clip.
Results
The highest-performing model used MobileNetV2 to extract features and achieved a test F1 score of 84 (SD 3.7; precision 89.6, SD 4.3 and recall 80.4, SD 6) using 5-fold cross-validation for 100 random seeds on the SSBD data (500 total distinct folds). Of the models we trained on privacy-preserved data, the model trained with all hand landmarks reached an F1 score of 66.6 (SD 3.35). Another such model trained with a select 6 landmarks reached an F1 score of 68.3 (SD 3.6). A privacy-preserved model trained using a single landmark at the base of the hands and a model trained with the average of the locations of all the hand landmarks reached an F1 score of 64.9 (SD 6.5) and 64.2 (SD 6.8), respectively.
Conclusions
We created five lightweight neural networks that can detect hand flapping from unstructured videos. Training a long short-term memory network with convolutional feature vectors outperformed training with feature vectors of hand coordinates and used almost 900,000 fewer model parameters. This study provides the first step toward developing precise deep learning methods for activity detection of autism-related behaviors.
Collapse
|
12
|
Sleiman E, Mutlu OC, Surabhi S, Husic A, Kline A, Washington P, Wall DP. Deep Learning-Based Autism Spectrum Disorder Detection Using Emotion Features From Video Recordings (Preprint). JMIR BIOMEDICAL ENGINEERING 2022. [DOI: 10.2196/39982] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
|
13
|
Chi NA, Washington P, Kline A, Husic A, Hou C, He C, Dunlap K, Wall DP. Classifying Autism From Crowdsourced Semistructured Speech Recordings: Machine Learning Model Comparison Study. JMIR Pediatr Parent 2022; 5:e35406. [PMID: 35436234 PMCID: PMC9052034 DOI: 10.2196/35406] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/09/2021] [Revised: 01/18/2022] [Accepted: 01/25/2022] [Indexed: 01/27/2023] Open
Abstract
BACKGROUND Autism spectrum disorder (ASD) is a neurodevelopmental disorder that results in altered behavior, social development, and communication patterns. In recent years, autism prevalence has tripled, with 1 in 44 children now affected. Given that traditional diagnosis is a lengthy, labor-intensive process that requires the work of trained physicians, significant attention has been given to developing systems that automatically detect autism. We work toward this goal by analyzing audio data, as prosody abnormalities are a signal of autism, with affected children displaying speech idiosyncrasies such as echolalia, monotonous intonation, atypical pitch, and irregular linguistic stress patterns. OBJECTIVE We aimed to test the ability for machine learning approaches to aid in detection of autism in self-recorded speech audio captured from children with ASD and neurotypical (NT) children in their home environments. METHODS We considered three methods to detect autism in child speech: (1) random forests trained on extracted audio features (including Mel-frequency cepstral coefficients); (2) convolutional neural networks trained on spectrograms; and (3) fine-tuned wav2vec 2.0-a state-of-the-art transformer-based speech recognition model. We trained our classifiers on our novel data set of cellphone-recorded child speech audio curated from the Guess What? mobile game, an app designed to crowdsource videos of children with ASD and NT children in a natural home environment. RESULTS The random forest classifier achieved 70% accuracy, the fine-tuned wav2vec 2.0 model achieved 77% accuracy, and the convolutional neural network achieved 79% accuracy when classifying children's audio as either ASD or NT. We used 5-fold cross-validation to evaluate model performance. CONCLUSIONS Our models were able to predict autism status when trained on a varied selection of home audio clips with inconsistent recording qualities, which may be more representative of real-world conditions. The results demonstrate that machine learning methods offer promise in detecting autism automatically from speech without specialized equipment.
Collapse
Affiliation(s)
- Nathan A Chi
- Division of Systems Medicine, Department of Pediatrics, Stanford University, Palo Alto, CA, United States
| | - Peter Washington
- Department of Bioengineering, Stanford University, Stanford, CA, United States
| | - Aaron Kline
- Division of Systems Medicine, Department of Pediatrics, Stanford University, Palo Alto, CA, United States
| | - Arman Husic
- Division of Systems Medicine, Department of Pediatrics, Stanford University, Palo Alto, CA, United States
| | - Cathy Hou
- Department of Computer Science, Stanford University, Stanford, CA, United States
| | - Chloe He
- Department of Biomedical Data Science, Stanford University, Stanford, CA, United States
| | - Kaitlyn Dunlap
- Division of Systems Medicine, Department of Pediatrics, Stanford University, Palo Alto, CA, United States
| | - Dennis P Wall
- Division of Systems Medicine, Department of Pediatrics, Stanford University, Palo Alto, CA, United States
- Department of Biomedical Data Science, Stanford University, Stanford, CA, United States
- Department of Psychiatry and Behavioral Sciences, Stanford University, Stanford, CA, United States
| |
Collapse
|
14
|
Varma M, Washington P, Chrisman B, Kline A, Leblanc E, Paskov K, Stockham N, Jung JY, Sun MW, Wall DP. Identification of Social Engagement Indicators Associated With Autism Spectrum Disorder Using a Game-Based Mobile App: Comparative Study of Gaze Fixation and Visual Scanning Methods. J Med Internet Res 2022; 24:e31830. [PMID: 35166683 PMCID: PMC8889483 DOI: 10.2196/31830] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2021] [Revised: 12/20/2021] [Accepted: 12/22/2021] [Indexed: 01/26/2023] Open
Abstract
BACKGROUND Autism spectrum disorder (ASD) is a widespread neurodevelopmental condition with a range of potential causes and symptoms. Standard diagnostic mechanisms for ASD, which involve lengthy parent questionnaires and clinical observation, often result in long waiting times for results. Recent advances in computer vision and mobile technology hold potential for speeding up the diagnostic process by enabling computational analysis of behavioral and social impairments from home videos. Such techniques can improve objectivity and contribute quantitatively to the diagnostic process. OBJECTIVE In this work, we evaluate whether home videos collected from a game-based mobile app can be used to provide diagnostic insights into ASD. To the best of our knowledge, this is the first study attempting to identify potential social indicators of ASD from mobile phone videos without the use of eye-tracking hardware, manual annotations, and structured scenarios or clinical environments. METHODS Here, we used a mobile health app to collect over 11 hours of video footage depicting 95 children engaged in gameplay in a natural home environment. We used automated data set annotations to analyze two social indicators that have previously been shown to differ between children with ASD and their neurotypical (NT) peers: (1) gaze fixation patterns, which represent regions of an individual's visual focus and (2) visual scanning methods, which refer to the ways in which individuals scan their surrounding environment. We compared the gaze fixation and visual scanning methods used by children during a 90-second gameplay video to identify statistically significant differences between the 2 cohorts; we then trained a long short-term memory (LSTM) neural network to determine if gaze indicators could be predictive of ASD. RESULTS Our results show that gaze fixation patterns differ between the 2 cohorts; specifically, we could identify 1 statistically significant region of fixation (P<.001). In addition, we also demonstrate that there are unique visual scanning patterns that exist for individuals with ASD when compared to NT children (P<.001). A deep learning model trained on coarse gaze fixation annotations demonstrates mild predictive power in identifying ASD. CONCLUSIONS Ultimately, our study demonstrates that heterogeneous video data sets collected from mobile devices hold potential for quantifying visual patterns and providing insights into ASD. We show the importance of automated labeling techniques in generating large-scale data sets while simultaneously preserving the privacy of participants, and we demonstrate that specific social engagement indicators associated with ASD can be identified and characterized using such data.
Collapse
Affiliation(s)
- Maya Varma
- Department of Computer Science, Stanford University, Stanford, CA, United States
| | - Peter Washington
- Department of Bioengineering, Stanford University, Stanford, CA, United States
| | - Brianna Chrisman
- Department of Bioengineering, Stanford University, Stanford, CA, United States
| | - Aaron Kline
- Department of Pediatrics and Biomedical Data Science, Stanford University, Stanford, CA, United States
| | - Emilie Leblanc
- Department of Pediatrics and Biomedical Data Science, Stanford University, Stanford, CA, United States
| | - Kelley Paskov
- Department of Biomedical Data Science, Stanford University, Stanford, CA, United States
| | - Nate Stockham
- Department of Neuroscience, Stanford University, Stanford, CA, United States
| | - Jae-Yoon Jung
- Department of Pediatrics and Biomedical Data Science, Stanford University, Stanford, CA, United States
| | - Min Woo Sun
- Department of Biomedical Data Science, Stanford University, Stanford, CA, United States
| | - Dennis P Wall
- Department of Pediatrics and Biomedical Data Science, Stanford University, Stanford, CA, United States
| |
Collapse
|
15
|
Crowd annotations can approximate clinical autism impressions from short home videos with privacy protections. INTELLIGENCE-BASED MEDICINE 2022; 6. [PMID: 35634270 PMCID: PMC9139408 DOI: 10.1016/j.ibmed.2022.100056] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Artificial Intelligence (A.I.) solutions are increasingly considered for telemedicine. For these methods to serve children and their families in home settings, it is crucial to ensure the privacy of the child and parent or caregiver. To address this challenge, we explore the potential for global image transformations to provide privacy while preserving the quality of behavioral annotations. Crowd workers have previously been shown to reliably annotate behavioral features in unstructured home videos, allowing machine learning classifiers to detect autism using the annotations as input. We evaluate this method with videos altered via pixelation, dense optical flow, and Gaussian blurring. On a balanced test set of 30 videos of children with autism and 30 neurotypical controls, we find that the visual privacy alterations do not drastically alter any individual behavioral annotation at the item level. The AUROC on the evaluation set was 90.0% ±7.5% for unaltered videos, 85.0% ±9.0% for pixelation, 85.0% ±9.0% for optical flow, and 83.3% ±9.3% for blurring, demonstrating that an aggregation of small changes across behavioral questions can collectively result in increased misdiagnosis rates. We also compare crowd answers against clinicians who provided the same annotations for the same videos as crowd workers, and we find that clinicians have higher sensitivity in their recognition of autism-related symptoms. We also find that there is a linear correlation (r = 0.75, p < 0.0001) between the mean Clinical Global Impression (CGI) score provided by professional clinicians and the corresponding score emitted by a previously validated autism classifier with crowd inputs, indicating that the classifier’s output probability is a reliable estimate of the clinical impression of autism. A significant correlation is maintained with privacy alterations, indicating that crowd annotations can approximate clinician-provided autism impression from home videos in a privacy-preserved manner.
Collapse
|
16
|
Washington P, Kalantarian H, Kent J, Husic A, Kline A, Leblanc E, Hou C, Mutlu C, Dunlap K, Penev Y, Stockham N, Chrisman B, Paskov K, Jung JY, Voss C, Haber N, Wall DP. Training Affective Computer Vision Models by Crowdsourcing Soft-Target Labels. Cognit Comput 2021; 13:1363-1373. [PMID: 35669554 PMCID: PMC9165031 DOI: 10.1007/s12559-021-09936-4] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2021] [Accepted: 09/12/2021] [Indexed: 01/12/2023]
Abstract
Background/Introduction Emotion detection classifiers traditionally predict discrete emotions. However, emotion expressions are often subjective, thus requiring a method to handle compound and ambiguous labels. We explore the feasibility of using crowdsourcing to acquire reliable soft-target labels and evaluate an emotion detection classifier trained with these labels. We hypothesize that training with labels that are representative of the diversity of human interpretation of an image will result in predictions that are similarly representative on a disjoint test set. We also hypothesize that crowdsourcing can generate distributions which mirror those generated in a lab setting. Methods We center our study on the Child Affective Facial Expression (CAFE) dataset, a gold standard collection of images depicting pediatric facial expressions along with 100 human labels per image. To test the feasibility of crowdsourcing to generate these labels, we used Microworkers to acquire labels for 207 CAFE images. We evaluate both unfiltered workers as well as workers selected through a short crowd filtration process. We then train two versions of a ResNet-152 neural network on soft-target CAFE labels using the original 100 annotations provided with the dataset: (1) a classifier trained with traditional one-hot encoded labels, and (2) a classifier trained with vector labels representing the distribution of CAFE annotator responses. We compare the resulting softmax output distributions of the two classifiers with a 2-sample independent t-test of L1 distances between the classifier's output probability distribution and the distribution of human labels. Results While agreement with CAFE is weak for unfiltered crowd workers, the filtered crowd agree with the CAFE labels 100% of the time for happy, neutral, sad and "fear + surprise", and 88.8% for "anger + disgust". While the F1-score for a one-hot encoded classifier is much higher (94.33% vs. 78.68%) with respect to the ground truth CAFE labels, the output probability vector of the crowd-trained classifier more closely resembles the distribution of human labels (t=3.2827, p=0.0014). Conclusions For many applications of affective computing, reporting an emotion probability distribution that accounts for the subjectivity of human interpretation can be more useful than an absolute label. Crowdsourcing, including a sufficient filtering mechanism for selecting reliable crowd workers, is a feasible solution for acquiring soft-target labels.
Collapse
Affiliation(s)
| | - Haik Kalantarian
- Department of Pediatrics (Systems Medicine), Stanford University
| | - Jack Kent
- Department of Pediatrics (Systems Medicine), Stanford University
| | - Arman Husic
- Department of Pediatrics (Systems Medicine), Stanford University
| | - Aaron Kline
- Department of Pediatrics (Systems Medicine), Stanford University
| | - Emilie Leblanc
- Department of Pediatrics (Systems Medicine), Stanford University
| | - Cathy Hou
- Department of Computer Science, Stanford University
| | - Cezmi Mutlu
- Department of Electrical Engineering, Stanford University
| | | | - Yordan Penev
- Department of Pediatrics (Systems Medicine), Stanford University
| | | | | | - Kelley Paskov
- Department of Biomedical Data Science, Stanford University
| | - Jae-Yoon Jung
- Department of Pediatrics (Systems Medicine), Stanford University
| | - Catalin Voss
- Department of Computer Science, Stanford University
| | - Nick Haber
- Graduate School of Education, Stanford University
| | - Dennis P. Wall
- Departments of Pediatrics (Systems Medicine), Biomedical Data Science, and Psychiatry and Behavioral Sciences, Stanford University
| |
Collapse
|