1
|
Adekkanattu P, Furmanchuk A, Wu Y, Pathak A, Patra BG, Bost S, Morrow D, Wang GHM, Yang Y, Forrest NJ, Luo Y, Walunas TL, Lo-Ciganic W, Gelad W, Bian J, Bao Y, Weiner M, Oslin D, Pathak J. Deep learning for identifying personal and family history of suicidal thoughts and behaviors from EHRs. NPJ Digit Med 2024; 7:260. [PMID: 39341983 PMCID: PMC11439010 DOI: 10.1038/s41746-024-01266-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2024] [Accepted: 09/14/2024] [Indexed: 10/01/2024] Open
Abstract
Personal and family history of suicidal thoughts and behaviors (PSH and FSH, respectively) are significant risk factors associated with suicides. Research is limited in automatic identification of such data from clinical notes in Electronic Health Records. This study developed deep learning (DL) tools utilizing transformer models (Bio_ClinicalBERT and GatorTron) to detect PSH and FSH in clinical notes derived from three academic medical centers, and compared their performance with a rule-based natural language processing tool. For detecting PSH, the rule-based approach obtained an F1-score of 0.75 ± 0.07, while the Bio_ClinicalBERT and GatorTron DL tools scored 0.83 ± 0.09 and 0.84 ± 0.07, respectively. For detecting FSH, the rule-based approach achieved an F1-score of 0.69 ± 0.11, compared to 0.89 ± 0.10 for Bio_ClinicalBERT and 0.92 ± 0.07 for GatorTron. Across sites, the DL tools identified more than 80% of patients at elevated risk for suicide who remain undiagnosed and untreated.
Collapse
Affiliation(s)
| | - Al'ona Furmanchuk
- Northwestern University Feinberg School of Medicine, Chicago, IL, USA
| | - Yonghui Wu
- University of Florida College of Medicine, Gainesville, FL, USA
| | - Aman Pathak
- University of Florida College of Medicine, Gainesville, FL, USA
| | | | - Sarah Bost
- University of Florida College of Medicine, Gainesville, FL, USA
| | | | | | - Yuyang Yang
- Northwestern University Feinberg School of Medicine, Chicago, IL, USA
| | | | - Yuan Luo
- Northwestern University Feinberg School of Medicine, Chicago, IL, USA
| | - Theresa L Walunas
- Northwestern University Feinberg School of Medicine, Chicago, IL, USA
| | - Weihsuan Lo-Ciganic
- University of Florida College of Medicine, Gainesville, FL, USA
- University of Pittsburgh School of Medicine, Pittsburgh, PA, USA
| | - Walid Gelad
- University of Pittsburgh School of Medicine, Pittsburgh, PA, USA
| | - Jiang Bian
- University of Florida College of Medicine, Gainesville, FL, USA
| | - Yuhua Bao
- Weill Cornell Medicine, New York, NY, USA
| | | | - David Oslin
- Corporal Michael J Crescenz Veterans Affairs Medical Center, Philadelphia, PA, USA
| | | |
Collapse
|
2
|
Thaipisutikul T, Vitoochuleechoti P, Thaipisutikul P, Tuarob S. MONDEP: A unified SpatioTemporal MONitoring Framework for National DEPression Forecasting. Heliyon 2024; 10:e36877. [PMID: 39281477 PMCID: PMC11402176 DOI: 10.1016/j.heliyon.2024.e36877] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2024] [Revised: 08/10/2024] [Accepted: 08/23/2024] [Indexed: 09/18/2024] Open
Abstract
Depression has become a prevalent mental disorder that significantly affects a person's emotions, behaviors, physical health, ability to perform daily tasks, and ability to maintain healthy relationships. Untreated depression can escalate the risk of suicide, making the situation even worse. Despite an abundance of models previously proposed for forecasting depression, the issue of foretelling the overall number of patients at each administrative level remains under-investigated. Therefore, in this paper, we propose a simple but effective SpatioTemporal Monitoring Framework for National Depression Forecasting (MONDEP). In particular, we analyze national depression statistics data in Thailand as a case study and create prediction models for a real-time depression forecasting system using machine learning and deep learning approaches. In order to forecast the prevalence of depression at various administrative levels, we use the hierarchical structure of depression aggregation. The proposed framework consists of three modules: Data Pre-processing to extract and pre-process the raw data, Exploratory Data Analysis (EDA) to visualize and analyze the data to get insight, and Model Training and Testing to predict future depression cases. The objective of our research is to construct a comprehensive MONDEP framework that utilizes machine learning and deep learning to predict depression profiles at the district and national levels using multivariate time series across various administrative levels. Our study illustrates the considerable association between a spatial-temporal component and demonstrates how depression profiles may be represented by employing lower administrative-level data to estimate the general level of mental health across the nation. Additionally, the best performance across all criteria is obtained when a deep learning model is used to exploit multivariate time series, showing a 13% improvement in MAE measure compared to the SARIMAX baseline. We believe the proposed framework could be used as a point of reference for decision-making regarding the management of depression and has the potential to be incredibly helpful for policymakers in successfully managing mental health services on time.
Collapse
Affiliation(s)
- Tipajin Thaipisutikul
- Faculty of Information and Communication Technology, Mahidol University, Nakhon Pathom, Thailand
| | | | - Papan Thaipisutikul
- Department of Psychiatry, Faculty of Medicine Ramathibodi Hospital, Mahidol University, Bangkok, Thailand
| | - Suppawong Tuarob
- Faculty of Information and Communication Technology, Mahidol University, Nakhon Pathom, Thailand
| |
Collapse
|
3
|
Sharma CM, Chariar VM. Diagnosis of mental disorders using machine learning: Literature review and bibliometric mapping from 2012 to 2023. Heliyon 2024; 10:e32548. [PMID: 38975193 PMCID: PMC11225745 DOI: 10.1016/j.heliyon.2024.e32548] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2024] [Revised: 06/05/2024] [Accepted: 06/05/2024] [Indexed: 07/09/2024] Open
Abstract
Background Mental disorders (MDs) are becoming a leading burden in non-communicable diseases (NCDs). As per the World Health Organization's 2022 assessment report, there was a steep increase of 25 % in MDs during the COVID-19 pandemic. Early diagnosis of MDs can significantly improve treatment outcome and save disability-adjusted life years (DALYs). In recent times, the application of machine learning (ML) and deep learning (DL)) has shown promising results in the diagnosis of MDs, and the field has witnessed a huge research output in the form of research publications. Therefore, a bibliometric mapping along with a review of recent advancements is required. Methods This study presents a bibliometric analysis and review of the research, published over the last 10 years. Literature searches were conducted in the Scopus database for the period from January 1, 2012, to June 9, 2023. The data was filtered and screened to include only relevant and reliable publications. A total of 2811 journal articles were found. The data was exported to a comma-separated value (CSV) format for further analysis. Furthermore, a review of 40 selected studies was performed. Results The popularity of ML techniques in diagnosing MDs has been growing, with an annual research growth rate of 17.05 %. The Journal of Affective Disorders published the most documents (n = 97), while Wang Y. (n = 64) has published the most articles. Lotka's law is observed, with a minority of authors contributing the majority of publications. The top affiliating institutes are the West China Hospital of Sichuan University followed by the University of California, with China and the US dominating the top 10 institutes. While China has more publications, papers affiliated with the US receive more citations. Depression and schizophrenia are the primary focuses of ML and deep learning (DL) in mental disease detection. Co-occurrence network analysis reveals that ML is associated with depression, schizophrenia, autism, anxiety, ADHD, obsessive-compulsive disorder, and PTSD. Popular algorithms include support vector machine (SVM) classifier, decision tree classifier, and random forest classifier. Furthermore, DL is linked to neuroimaging techniques such as MRI, fMRI, and EEG, as well as bipolar disorder. Current research trends encompass DL, LSTM, generalized anxiety disorder, feature fusion, and convolutional neural networks.
Collapse
Affiliation(s)
- Chandra Mani Sharma
- CRDT, Indian Institute of Technology Delhi, Hauz Khas, New Delhi, 110016, India
- School of Computer Science, UPES, Dehradun, Uttarakhand, India
| | | |
Collapse
|
4
|
Khosravi H, Ahmed I, Choudhury A. Predicting Suicidal Ideation, Planning, and Attempts among the Adolescent Population of the United States. Healthcare (Basel) 2024; 12:1262. [PMID: 38998797 PMCID: PMC11241284 DOI: 10.3390/healthcare12131262] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2024] [Revised: 06/20/2024] [Accepted: 06/22/2024] [Indexed: 07/14/2024] Open
Abstract
Suicide is the second leading cause of death among individuals aged 5 to 24 in the United States (US). However, the precursors to suicide often do not surface, making suicide prevention challenging. This study aims to develop a machine learning model for predicting suicide ideation (SI), suicide planning (SP), and suicide attempts (SA) among adolescents in the US during the coronavirus pandemic. We used the 2021 Adolescent Behaviors and Experiences Survey Data. Class imbalance was addressed using the proposed data augmentation method tailored for binary variables, Modified Synthetic Minority Over-Sampling Technique. Five different ML models were trained and compared. SHapley Additive exPlanations analysis was conducted for explainability. The Logistic Regression model, identified as the most effective, showed superior performance across all targets, achieving high scores in recall: 0.82, accuracy: 0.80, and area under the Receiver Operating Characteristic curve: 0.88. Variables such as sad feelings, hopelessness, sexual behavior, and being overweight were noted as the most important predictors. Our model holds promise in helping health policymakers design effective public health interventions. By identifying vulnerable sub-groups within regions, our model can guide the implementation of tailored interventions that facilitate early identification and referral to medical treatment.
Collapse
Affiliation(s)
- Hamed Khosravi
- Industrial and Management Systems Engineering, West Virginia University, Morgantown, WV 26506, USA
| | - Imtiaz Ahmed
- Industrial and Management Systems Engineering, West Virginia University, Morgantown, WV 26506, USA
| | - Avishek Choudhury
- Industrial and Management Systems Engineering, West Virginia University, Morgantown, WV 26506, USA
| |
Collapse
|
5
|
Hornstein S, Scharfenberger J, Lueken U, Wundrack R, Hilbert K. Predicting recurrent chat contact in a psychological intervention for the youth using natural language processing. NPJ Digit Med 2024; 7:132. [PMID: 38762694 PMCID: PMC11102489 DOI: 10.1038/s41746-024-01121-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2023] [Accepted: 04/23/2024] [Indexed: 05/20/2024] Open
Abstract
Chat-based counseling hotlines emerged as a promising low-threshold intervention for youth mental health. However, despite the resulting availability of large text corpora, little work has investigated Natural Language Processing (NLP) applications within this setting. Therefore, this preregistered approach (OSF: XA4PN) utilizes a sample of approximately 19,000 children and young adults that received a chat consultation from a 24/7 crisis service in Germany. Around 800,000 messages were used to predict whether chatters would contact the service again, as this would allow the provision of or redirection to additional treatment. We trained an XGBoost Classifier on the words of the anonymized conversations, using repeated cross-validation and bayesian optimization for hyperparameter search. The best model was able to achieve an AUROC score of 0.68 (p < 0.01) on the previously unseen 3942 newest consultations. A shapely-based explainability approach revealed that words indicating younger age or female gender and terms related to self-harm and suicidal thoughts were associated with a higher chance of recontacting. We conclude that NLP-based predictions of recurrent contact are a promising path toward personalized care at chat hotlines.
Collapse
Affiliation(s)
- Silvan Hornstein
- Department of Psychology, Humboldt-Universität zu Berlin, 10099 Berlin, Germany.
| | | | - Ulrike Lueken
- Department of Psychology, Humboldt-Universität zu Berlin, 10099 Berlin, Germany
- German Center for Mental Health (DZPG), partner site Berlin/Potsdam, Potsdam, Germany
| | - Richard Wundrack
- Department of Psychology, Humboldt-Universität zu Berlin, 10099 Berlin, Germany
| | - Kevin Hilbert
- Department of Psychology, Humboldt-Universität zu Berlin, 10099 Berlin, Germany
| |
Collapse
|
6
|
Carson NJ, Yang X, Mullin B, Stettenbauer E, Waddington M, Zhang A, Williams P, Rios Perez GE, Cook BL. Predicting adolescent suicidal behavior following inpatient discharge using structured and unstructured data. J Affect Disord 2024; 350:382-387. [PMID: 38158050 PMCID: PMC10923087 DOI: 10.1016/j.jad.2023.12.059] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/31/2023] [Revised: 11/30/2023] [Accepted: 12/24/2023] [Indexed: 01/03/2024]
Abstract
BACKGROUND The objective was to develop and assess performance of an algorithm predicting suicide-related ICD codes within three months of psychiatric discharge. METHODS This prognostic study used a retrospective cohort of EHR data from 2789 youth (12 to 20 years old) hospitalized in a safety net institution in the Northeastern United States. The dataset combined structured data with unstructured data obtained through natural language processing of clinical notes. Machine learning approaches compared gradient boosting to random forest analyses. RESULTS Area under the ROC and precision-recall curve were 0.88 and 0.17, respectively, for the final Gradient Boosting model. The cutoff point of the model-generated predicted probabilities of suicide that optimally classified the individual as high risk or not was 0.009. When applying the chosen cutoff (0.009) to the hold-out testing set, the model correctly identified 8 positive cases out of 10, and 418 negative cases out 548. The corresponding performance metrics showed 80 % sensitivity, 76 % specificity, 6 % PPV, 99 % NPV, F-1 score of 0.11, and an accuracy of 76 %. LIMITATIONS The data in this study comes from a single health system, possibly introducing bias in the model's algorithm. Thus, the model may have underestimated the incidence of suicidal behavior in the study population. Further research should include multiple system EHRs. CONCLUSIONS These performance metrics suggest a benefit to including both unstructured and structured data in design of predictive algorithms for suicidal behavior, which can be integrated into psychiatric services to help assess risk.
Collapse
Affiliation(s)
- Nicholas J Carson
- Health Equity Research Lab, Cambridge Health Alliance, 1035 Cambridge Street, Cambridge, MA 02139, USA.
| | - Xinyu Yang
- Parexel, 275 Grove St., Suite 101C, Newton, MA 02466, USA
| | - Brian Mullin
- Health Equity Research Lab, Cambridge Health Alliance, 1035 Cambridge Street, Cambridge, MA 02139, USA
| | | | - Marin Waddington
- Division of Gastroenterology at Brigham and Women's Hospital, Resnek Family Center for PSC Research, 75 Francis Street, Boston, MA 02115, USA
| | - Alice Zhang
- Department of Psychology, New York University, 6 Washington Place, New York, NY 10003, USA
| | - Peyton Williams
- Health Equity Research Lab, Cambridge Health Alliance, 1035 Cambridge Street, Cambridge, MA 02139, USA
| | - Gabriel E Rios Perez
- Health Equity Research Lab, Cambridge Health Alliance, 1035 Cambridge Street, Cambridge, MA 02139, USA
| | - Benjamin Lê Cook
- Health Equity Research Lab, Cambridge Health Alliance, 1035 Cambridge Street, Cambridge, MA 02139, USA
| |
Collapse
|
7
|
Tang H, Miri Rekavandi A, Rooprai D, Dwivedi G, Sanfilippo FM, Boussaid F, Bennamoun M. Analysis and evaluation of explainable artificial intelligence on suicide risk assessment. Sci Rep 2024; 14:6163. [PMID: 38485985 PMCID: PMC10940617 DOI: 10.1038/s41598-024-53426-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2023] [Accepted: 01/31/2024] [Indexed: 03/18/2024] Open
Abstract
This study explores the effectiveness of Explainable Artificial Intelligence (XAI) for predicting suicide risk from medical tabular data. Given the common challenge of limited datasets in health-related Machine Learning (ML) applications, we use data augmentation in tandem with ML to enhance the identification of individuals at high risk of suicide. We use SHapley Additive exPlanations (SHAP) for XAI and traditional correlation analysis to rank feature importance, pinpointing primary factors influencing suicide risk and preventive measures. Experimental results show the Random Forest (RF) model is excelling in accuracy, F1 score, and AUC (>97% across metrics). According to SHAP, anger issues, depression, and social isolation emerge as top predictors of suicide risk, while individuals with high incomes, esteemed professions, and higher education present the lowest risk. Our findings underscore the effectiveness of ML and XAI in suicide risk assessment, offering valuable insights for psychiatrists and facilitating informed clinical decisions.
Collapse
Affiliation(s)
- Hao Tang
- Department of Computer Science and Software Engineering, The University of Western Australia, Perth, Australia
| | - Aref Miri Rekavandi
- Department of Computer Science and Software Engineering, The University of Western Australia, Perth, Australia
| | - Dharjinder Rooprai
- Armadale Mental Health Service, Perth, Australia.
- Bethesda Clinic, Perth, Australia.
| | - Girish Dwivedi
- Advanced Clinical and Translational Cardiovascular Imaging, Harry Perkins Institute of Medical Research, The University of Western Australia, Perth, Australia
- Department of Cardiology, Fiona Stanley Hospital, Murdoch, WA, Australia
| | - Frank M Sanfilippo
- School of Population and Global Health, University of Western Australia, Perth, Australia
| | - Farid Boussaid
- Department of Electrical, Electronic and Computer Engineering, The University of Western Australia, Perth, Australia
| | - Mohammed Bennamoun
- Department of Computer Science and Software Engineering, The University of Western Australia, Perth, Australia.
| |
Collapse
|
8
|
Uludag K. Exploring the Association Between Textual Parameters and Psychological and Cognitive Factors. Psychol Res Behav Manag 2024; 17:1139-1150. [PMID: 38505355 PMCID: PMC10949372 DOI: 10.2147/prbm.s460503] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2024] [Accepted: 03/07/2024] [Indexed: 03/21/2024] Open
Abstract
Background Textual data analysis has become a popular method for examining complex human behavior in various fields, including psychology, psychiatry, sociology, computer science, data mining, forensic sciences, and communication studies. However, identifying the most relevant textual parameters for analyzing complex behavior is still a challenge. Goal of Study This paper aims to explore potential textual parameters that could be useful in analyzing behavior through complex textual data. Furthermore, we have examined the randomly generated text based on different textual parameters. Methods To achieve this goal, we conducted a comprehensive review of the literature on textual data analysis and identified several potential topics that could be relevant, such as sentiment analysis, discourse analysis, lexical analysis, and syntactic analysis. We discuss the theoretical background and practical implications of each parameter and provide examples of how they have been used in previous research. Furthermore, we highlight the importance of considering the context in which these parameters are applied and the need for interdisciplinary collaboration to gain a deeper understanding of complex behavior through textual data analysis. Furthermore, we have provided Python code in the Supplementary Materials to facilitate a comprehensive analysis of such behaviors. In addition, to generate the text for analysis, we utilized ChatGPT 3.5 Turbo by requesting it to generate a random text of 1000 words divided into five paragraphs. Afterwards, we applied the provided Python code to analyze the randomly generated text. Conclusion Overall, this paper provides a foundation for researchers to identify relevant textual parameters to analyze complex human behavior in their respective fields such as linguistics, sociology, psychiatry, and psychology.
Collapse
Affiliation(s)
- Kadir Uludag
- Shanghai Mental Health Center, School of Medicine, Shanghai Jiao Tong University, Shanghai, People’s Republic of China
| |
Collapse
|
9
|
Adekkanattu P, Furmanchuk A, Wu Y, Pathak A, Patra BG, Bost S, Morrow D, Wang GHM, Yang Y, Forrest NJ, Luo Y, Walunas TL, Jenny WHLC, Gelad W, Bian J, Bao Y, Weiner M, Oslin D, Pathak J. Detection of Personal and Family History of Suicidal Thoughts and Behaviors using Deep Learning and Natural Language Processing: A Multi-Site Study. RESEARCH SQUARE 2024:rs.3.rs-4014472. [PMID: 38559051 PMCID: PMC10980141 DOI: 10.21203/rs.3.rs-4014472/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/04/2024]
Abstract
Objective Personal and family history of suicidal thoughts and behaviors (PSH and FSH, respectively) are significant risk factors associated with future suicide events. These are often captured in narrative clinical notes in electronic health records (EHRs). Collaboratively, Weill Cornell Medicine (WCM), Northwestern Medicine (NM), and the University of Florida (UF) developed and validated deep learning (DL)-based natural language processing (NLP) tools to detect PSH and FSH from such notes. The tool's performance was further benchmarked against a method relying exclusively on ICD-9/10 diagnosis codes. Materials and Methods We developed DL-based NLP tools utilizing pre-trained transformer models Bio_ClinicalBERT and GatorTron, and compared them with expert-informed, rule-based methods. The tools were initially developed and validated using manually annotated clinical notes at WCM. Their portability and performance were further evaluated using clinical notes at NM and UF. Results The DL tools outperformed the rule-based NLP tool in identifying PSH and FHS. For detecting PSH, the rule-based system obtained an F1-score of 0.75 ± 0.07, while the Bio_ClinicalBERT and GatorTron DL tools scored 0.83 ± 0.09 and 0.84 ± 0.07, respectively. For detecting FSH, the rule-based NLP tool's F1-score was 0.69 ± 0.11, compared to 0.89 ± 0.10 for Bio_ClinicalBERT and 0.92 ± 0.07 for GatorTron. For the gold standard corpora across the three sites, only 2.2% (WCM), 9.3% (NM), and 7.8% (UF) of patients reported to have an ICD-9/10 diagnosis code for suicidal thoughts and behaviors prior to the clinical notes report date. The best performing GatorTron DL tool identified 93.0% (WCM), 80.4% (NM), and 89.0% (UF) of patients with documented PSH, and 85.0%(WCM), 89.5%(NM), and 100%(UF) of patients with documented FSH in their notes. Discussion While PSH and FSH are significant risk factors for future suicide events, little effort has been made previously to identify individuals with these history. To address this, we developed a transformer based DL method and compared with conventional rule-based NLP approach. The varying effectiveness of the rule-based tools across sites suggests a need for improvement in its dictionary-based approach. In contrast, the performances of the DL tools were higher and comparable across sites. Furthermore, DL tools were fine-tuned using only small number of annotated notes at each site, underscores its greater adaptability to local documentation practices and lexical variations. Conclusion Variations in local documentation practices across health care systems pose challenges to rule-based NLP tools. In contrast, the developed DL tools can effectively extract PSH and FSH information from unstructured clinical notes. These tools will provide clinicians with crucial information for assessing and treating patients at elevated risk for suicide who are rarely been diagnosed.
Collapse
Affiliation(s)
| | - Al'ona Furmanchuk
- Northwestern University Feinberg School of Medicine, Chicago, IL, USA
| | - Yonghui Wu
- University of Florida College of Medicine, Gainesville, FL, USA
| | - Aman Pathak
- University of Florida College of Medicine, Gainesville, FL, USA
| | | | - Sarah Bost
- University of Florida College of Medicine, Gainesville, FL, USA
| | | | | | - Yuyang Yang
- Northwestern University Feinberg School of Medicine, Chicago, IL, USA
| | | | - Yuan Luo
- Northwestern University Feinberg School of Medicine, Chicago, IL, USA
| | - Theresa L Walunas
- Northwestern University Feinberg School of Medicine, Chicago, IL, USA
| | - Wei-Hsuan Lo-Ciganic Jenny
- University of Florida College of Medicine, Gainesville, FL, USA
- University of Pittsburgh School of Medicine, Pittsburgh, PA, USA
| | - Walid Gelad
- University of Pittsburgh School of Medicine, Pittsburgh, PA, USA
| | - Jiang Bian
- University of Florida College of Medicine, Gainesville, FL, USA
| | - Yuhua Bao
- Weill Cornell Medicine, New York, NY, USA
| | | | - David Oslin
- Corporal Michael J Crescenz Veterans Affairs Medical Center, Philadelphia, PA, USA
| | | |
Collapse
|
10
|
Pigoni A, Delvecchio G, Turtulici N, Madonna D, Pietrini P, Cecchetti L, Brambilla P. Machine learning and the prediction of suicide in psychiatric populations: a systematic review. Transl Psychiatry 2024; 14:140. [PMID: 38461283 PMCID: PMC10925059 DOI: 10.1038/s41398-024-02852-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/05/2023] [Revised: 02/22/2024] [Accepted: 02/22/2024] [Indexed: 03/11/2024] Open
Abstract
Machine learning (ML) has emerged as a promising tool to enhance suicidal prediction. However, as many large-sample studies mixed psychiatric and non-psychiatric populations, a formal psychiatric diagnosis emerged as a strong predictor of suicidal risk, overshadowing more subtle risk factors specific to distinct populations. To overcome this limitation, we conducted a systematic review of ML studies evaluating suicidal behaviors exclusively in psychiatric clinical populations. A systematic literature search was performed from inception through November 17, 2022 on PubMed, EMBASE, and Scopus following the PRISMA guidelines. Original research using ML techniques to assess the risk of suicide or predict suicide attempts in the psychiatric population were included. An assessment for bias risk was performed using the transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD) guidelines. About 1032 studies were retrieved, and 81 satisfied the inclusion criteria and were included for qualitative synthesis. Clinical and demographic features were the most frequently employed and random forest, support vector machine, and convolutional neural network performed better in terms of accuracy than other algorithms when directly compared. Despite heterogeneity in procedures, most studies reported an accuracy of 70% or greater based on features such as previous attempts, severity of the disorder, and pharmacological treatments. Although the evidence reported is promising, ML algorithms for suicidal prediction still present limitations, including the lack of neurobiological and imaging data and the lack of external validation samples. Overcoming these issues may lead to the development of models to adopt in clinical practice. Further research is warranted to boost a field that holds the potential to critically impact suicide mortality.
Collapse
Affiliation(s)
- Alessandro Pigoni
- Social and Affective Neuroscience Group, MoMiLab, IMT School for Advanced Studies Lucca, Lucca, Italy
- Department of Neurosciences and Mental Health, Fondazione IRCCS Ca' Granda, Ospedale Maggiore Policlinico, Milan, Italy
| | - Giuseppe Delvecchio
- Department of Neurosciences and Mental Health, Fondazione IRCCS Ca' Granda, Ospedale Maggiore Policlinico, Milan, Italy
| | - Nunzio Turtulici
- Department of Pathophysiology and Transplantation, University of Milan, Milan, Italy
| | - Domenico Madonna
- Department of Neurosciences and Mental Health, Fondazione IRCCS Ca' Granda, Ospedale Maggiore Policlinico, Milan, Italy
| | - Pietro Pietrini
- MoMiLab, IMT School for Advanced Studies Lucca, Lucca, Italy
| | - Luca Cecchetti
- Social and Affective Neuroscience Group, MoMiLab, IMT School for Advanced Studies Lucca, Lucca, Italy
| | - Paolo Brambilla
- Department of Neurosciences and Mental Health, Fondazione IRCCS Ca' Granda, Ospedale Maggiore Policlinico, Milan, Italy.
- Department of Pathophysiology and Transplantation, University of Milan, Milan, Italy.
| |
Collapse
|
11
|
Dutta R, Gkotsis G, Velupillai SU, Downs J, Roberts A, Stewart R, Hotopf M. Identifying features of risk periods for suicide attempts using document frequency and language use in electronic health records. Front Psychiatry 2023; 14:1217649. [PMID: 38152362 PMCID: PMC10752595 DOI: 10.3389/fpsyt.2023.1217649] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/05/2023] [Accepted: 11/13/2023] [Indexed: 12/29/2023] Open
Abstract
Background Individualising mental healthcare at times when a patient is most at risk of suicide involves shifting research emphasis from static risk factors to those that may be modifiable with interventions. Currently, risk assessment is based on a range of extensively reported stable risk factors, but critical to dynamic suicide risk assessment is an understanding of each individual patient's health trajectory over time. The use of electronic health records (EHRs) and analysis using machine learning has the potential to accelerate progress in developing early warning indicators. Setting EHR data from the South London and Maudsley NHS Foundation Trust (SLaM) which provides secondary mental healthcare for 1.8 million people living in four South London boroughs. Objectives To determine whether the time window proximal to a hospitalised suicide attempt can be discriminated from a distal period of lower risk by analysing the documentation and mental health clinical free text data from EHRs and (i) investigate whether the rate at which EHR documents are recorded per patient is associated with a suicide attempt; (ii) compare document-level word usage between documents proximal and distal to a suicide attempt; and (iii) compare n-gram frequency related to third-person pronoun use proximal and distal to a suicide attempt using machine learning. Methods The Clinical Record Interactive Search (CRIS) system allowed access to de-identified information from the EHRs. CRIS has been linked with Hospital Episode Statistics (HES) data for Admitted Patient Care. We analysed document and event data for patients who had at some point between 1 April 2006 and 31 March 2013 been hospitalised with a HES ICD-10 code related to attempted suicide (X60-X84; Y10-Y34; Y87.0/Y87.2). Findings n = 8,247 patients were identified to have made a hospitalised suicide attempt. Of these, n = 3,167 (39.8%) of patients had at least one document available in their EHR prior to their first suicide attempt. N = 1,424 (45.0%) of these patients had been "monitored" by mental healthcare services in the past 30 days. From 60 days prior to a first suicide attempt, there was a rapid increase in the monitoring level (document recording of the past 30 days) increasing from 35.1 to 45.0%. Documents containing words related to prescribed medications/drugs/overdose/poisoning/addiction had the highest odds of being a risk indicator used proximal to a suicide attempt (OR 1.88; precision 0.91 and recall 0.93), and documents with words citing a care plan were associated with the lowest risk for a suicide attempt (OR 0.22; precision 1.00 and recall 1.00). Function words, word sequence, and pronouns were most common in all three representations (uni-, bi-, and tri-gram). Conclusion EHR documentation frequency and language use can be used to distinguish periods distal from and proximal to a suicide attempt. However, in our study 55.0% of patients with documentation, prior to their first suicide attempt, did not have a record in the preceding 30 days, meaning that there are a high number who are not seen by services at their most vulnerable point.
Collapse
Affiliation(s)
- Rina Dutta
- King’s College London, IoPPN, London, United Kingdom
- South London and Maudsley NHS Foundation Trust, London, United Kingdom
| | | | | | - Johnny Downs
- King’s College London, IoPPN, London, United Kingdom
- South London and Maudsley NHS Foundation Trust, London, United Kingdom
| | - Angus Roberts
- King’s College London, IoPPN, London, United Kingdom
| | - Robert Stewart
- King’s College London, IoPPN, London, United Kingdom
- South London and Maudsley NHS Foundation Trust, London, United Kingdom
| | - Matthew Hotopf
- King’s College London, IoPPN, London, United Kingdom
- South London and Maudsley NHS Foundation Trust, London, United Kingdom
| |
Collapse
|
12
|
Zantvoort K, Scharfenberger J, Boß L, Lehr D, Funk B. Finding the Best Match - a Case Study on the (Text-)Feature and Model Choice in Digital Mental Health Interventions. JOURNAL OF HEALTHCARE INFORMATICS RESEARCH 2023; 7:447-479. [PMID: 37927375 PMCID: PMC10620349 DOI: 10.1007/s41666-023-00148-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2022] [Accepted: 08/29/2023] [Indexed: 11/07/2023]
Abstract
With the need for psychological help long exceeding the supply, finding ways of scaling, and better allocating mental health support is a necessity. This paper contributes by investigating how to best predict intervention dropout and failure to allow for a need-based adaptation of treatment. We systematically compare the predictive power of different text representation methods (metadata, TF-IDF, sentiment and topic analysis, and word embeddings) in combination with supplementary numerical inputs (socio-demographic, evaluation, and closed-question data). Additionally, we address the research gap of which ML model types - ranging from linear to sophisticated deep learning models - are best suited for different features and outcome variables. To this end, we analyze nearly 16.000 open-text answers from 849 German-speaking users in a Digital Mental Health Intervention (DMHI) for stress. Our research proves that - contrary to previous findings - there is great promise in using neural network approaches on DMHI text data. We propose a task-specific LSTM-based model architecture to tackle the challenge of long input sequences and thereby demonstrate the potential of word embeddings (AUC scores of up to 0.7) for predictions in DMHIs. Despite the relatively small data set, sequential deep learning models, on average, outperform simpler features such as metadata and bag-of-words approaches when predicting dropout. The conclusion is that user-generated text of the first two sessions carries predictive power regarding patients' dropout and intervention failure risk. Furthermore, the match between the sophistication of features and models needs to be closely considered to optimize results, and additional non-text features increase prediction results. Supplementary Information The online version contains supplementary material available at 10.1007/s41666-023-00148-z.
Collapse
Affiliation(s)
- Kirsten Zantvoort
- Institute of Information Systems, Leuphana University, Lüneburg, Germany
| | | | - Leif Boß
- Institute of Psychology, Leuphana University, Lüneburg, Germany
| | - Dirk Lehr
- Institute of Psychology, Leuphana University, Lüneburg, Germany
| | - Burkhardt Funk
- Institute of Information Systems, Leuphana University, Lüneburg, Germany
| |
Collapse
|
13
|
Swaminathan A, López I, Mar RAG, Heist T, McClintock T, Caoili K, Grace M, Rubashkin M, Boggs MN, Chen JH, Gevaert O, Mou D, Nock MK. Natural language processing system for rapid detection and intervention of mental health crisis chat messages. NPJ Digit Med 2023; 6:213. [PMID: 37990134 PMCID: PMC10663535 DOI: 10.1038/s41746-023-00951-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2023] [Accepted: 10/24/2023] [Indexed: 11/23/2023] Open
Abstract
Patients experiencing mental health crises often seek help through messaging-based platforms, but may face long wait times due to limited message triage capacity. Here we build and deploy a machine-learning-enabled system to improve response times to crisis messages in a large, national telehealth provider network. We train a two-stage natural language processing (NLP) system with key word filtering followed by logistic regression on 721 electronic medical record chat messages, of which 32% are potential crises (suicidal/homicidal ideation, domestic violence, or non-suicidal self-injury). Model performance is evaluated on a retrospective test set (4/1/21-4/1/22, N = 481) and a prospective test set (10/1/22-10/31/22, N = 102,471). In the retrospective test set, the model has an AUC of 0.82 (95% CI: 0.78-0.86), sensitivity of 0.99 (95% CI: 0.96-1.00), and PPV of 0.35 (95% CI: 0.309-0.4). In the prospective test set, the model has an AUC of 0.98 (95% CI: 0.966-0.984), sensitivity of 0.98 (95% CI: 0.96-0.99), and PPV of 0.66 (95% CI: 0.626-0.692). The daily median time from message receipt to crisis specialist triage ranges from 8 to 13 min, compared to 9 h before the deployment of the system. We demonstrate that a NLP-based machine learning model can reliably identify potential crisis chat messages in a telehealth setting. Our system integrates into existing clinical workflows, suggesting that with appropriate training, humans can successfully leverage ML systems to facilitate triage of crisis messages.
Collapse
Affiliation(s)
- Akshay Swaminathan
- Cerebral Inc, Claymont, DE, USA.
- Stanford University School of Medicine, Stanford, CA, USA.
| | - Iván López
- Cerebral Inc, Claymont, DE, USA
- Stanford University School of Medicine, Stanford, CA, USA
| | | | | | | | | | | | | | | | - Jonathan H Chen
- Stanford Center for Biomedical Informatics Research, Division of Hospital Medicine, Clinical Excellence Research Center, Department of Medicine, Stanford, CA, USA
| | - Olivier Gevaert
- Stanford Center for Biomedical Informatics Research (BMIR), Department of Medicine and Department of Biomedical Data Science, Stanford, CA, USA
| | - David Mou
- Cerebral Inc, Claymont, DE, USA
- Massachusetts General Hospital Department of Psychiatry, Boston, MA, USA
| | - Matthew K Nock
- Harvard University, Department of Psychology, Cambridge, MA, USA
| |
Collapse
|
14
|
Garriga R, Buda TS, Guerreiro J, Omaña Iglesias J, Estella Aguerri I, Matić A. Combining clinical notes with structured electronic health records enhances the prediction of mental health crises. Cell Rep Med 2023; 4:101260. [PMID: 37913776 PMCID: PMC10694623 DOI: 10.1016/j.xcrm.2023.101260] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2023] [Revised: 07/12/2023] [Accepted: 10/05/2023] [Indexed: 11/03/2023]
Abstract
An automatic prediction of mental health crises can improve caseload prioritization and enable preventative interventions, improving patient outcomes and reducing costs. We combine structured electronic health records (EHRs) with clinical notes from 59,750 de-identified patients to predict the risk of mental health crisis relapse within the next 28 days. The results suggest that an ensemble machine learning model that relies on structured EHRs and clinical notes when available, and relying solely on structured data when the notes are unavailable, offers superior performance over models trained with either of the two data streams alone. Furthermore, the study provides key takeaways related to the required amount of clinical notes to add value in predictive analytics. This study sheds light on the untapped potential of clinical notes in the prediction of mental health crises and highlights the importance of choosing an appropriate machine learning method to combine structured and unstructured EHRs.
Collapse
Affiliation(s)
- Roger Garriga
- Koa Health, 08019 Barcelona, Spain; Department of Information and Communication Technologies, Universitat Pompeu Fabra, 08018 Barcelona, Spain.
| | | | | | | | | | | |
Collapse
|
15
|
Workman TE, Goulet JL, Brandt CA, Warren AR, Eleazer J, Skanderson M, Lindemann L, Blosnich JR, O'Leary J, Zeng‐Treitler Q. Identifying suicide documentation in clinical notes through zero-shot learning. Health Sci Rep 2023; 6:e1526. [PMID: 37706016 PMCID: PMC10495736 DOI: 10.1002/hsr2.1526] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2023] [Revised: 08/08/2023] [Accepted: 08/11/2023] [Indexed: 09/15/2023] Open
Abstract
Background and Aims In deep learning, a major difficulty in identifying suicidality and its risk factors in clinical notes is the lack of training samples given the small number of true positive instances among the number of patients screened. This paper describes a novel methodology that identifies suicidality in clinical notes by addressing this data sparsity issue through zero-shot learning. Our general aim was to develop a tool that leveraged zero-shot learning to effectively identify suicidality documentation in all types of clinical notes. Methods US Veterans Affairs clinical notes served as data. The training data set label was determined using diagnostic codes of suicide attempt and self-harm. We used a base string associated with the target label of suicidality to provide auxiliary information by narrowing the positive training cases to those containing the base string. We trained a deep neural network by mapping the training documents' contents to a semantic space. For comparison, we trained another deep neural network using the identical training data set labels, and bag-of-words features. Results The zero-shot learning model outperformed the baseline model in terms of area under the curve, sensitivity, specificity, and positive predictive value at multiple probability thresholds. In applying a 0.90 probability threshold, the methodology identified notes documenting suicidality but not associated with a relevant ICD-10-CM code, with 94% accuracy. Conclusion This method can effectively identify suicidality without manual annotation.
Collapse
Affiliation(s)
- Terri Elizabeth Workman
- Biomedical Informatics CenterThe George Washington UniversityWashingtonDistrict of ColumbiaUSA
- VA Medical CenterWashingtonDistrict of ColumbiaUSA
| | - Joseph L. Goulet
- Department of Emergency MedicineYale School of MedicineNew HavenConnecticutUSA
- VA Connecticut Healthcare SystemWest HavenConnecticutUSA
| | - Cynthia A. Brandt
- Department of Emergency MedicineYale School of MedicineNew HavenConnecticutUSA
- VA Connecticut Healthcare SystemWest HavenConnecticutUSA
| | - Allison R. Warren
- PRIME Center, VA Connecticut Healthcare SystemWest HavenConnecticutUSA
| | - Jacob Eleazer
- PRIME Center, VA Connecticut Healthcare SystemWest HavenConnecticutUSA
| | | | - Luke Lindemann
- VA Connecticut Healthcare SystemWest HavenConnecticutUSA
| | - John R. Blosnich
- Suzanne Dworak‐Peck School of Social WorkUniversity of Southern CaliforniaLos AngelesCaliforniaUSA
| | - John O'Leary
- VA Connecticut Healthcare SystemWest HavenConnecticutUSA
- Department of Internal MedicineYale School of MedicineWest HavenConnecticutUSA
| | - Qing Zeng‐Treitler
- Biomedical Informatics CenterThe George Washington UniversityWashingtonDistrict of ColumbiaUSA
- VA Medical CenterWashingtonDistrict of ColumbiaUSA
| |
Collapse
|
16
|
Abstract
OBJECTIVES Through a scoping review, we examine in this survey what ways health equity has been promoted in clinical research informatics with patient implications and especially published in the year of 2021 (and some in 2022). METHOD A scoping review was conducted guided by using methods described in the Joanna Briggs Institute Manual. The review process consisted of five stages: 1) development of aim and research question, 2) literature search, 3) literature screening and selection, 4) data extraction, and 5) accumulate and report results. RESULTS From the 478 identified papers in 2021 on the topic of clinical research informatics with focus on health equity as a patient implication, 8 papers met our inclusion criteria. All included papers focused on artificial intelligence (AI) technology. The papers addressed health equity in clinical research informatics either through the exposure of inequity in AI-based solutions or using AI as a tool for promoting health equity in the delivery of healthcare services. While algorithmic bias poses a risk to health equity within AI-based solutions, AI has also uncovered inequity in traditional treatment and demonstrated effective complements and alternatives that promotes health equity. CONCLUSIONS Clinical research informatics with implications for patients still face challenges of ethical nature and clinical value. However, used prudently-for the right purpose in the right context-clinical research informatics could bring powerful tools in advancing health equity in patient care.
Collapse
|
17
|
Li H, Gerkin RC, Bakke A, Norel R, Cecchi G, Laudamiel C, Niv MY, Ohla K, Hayes JE, Parma V, Meyer P. Text-based predictions of COVID-19 diagnosis from self-reported chemosensory descriptions. COMMUNICATIONS MEDICINE 2023; 3:104. [PMID: 37500763 PMCID: PMC10374642 DOI: 10.1038/s43856-023-00334-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2023] [Accepted: 07/19/2023] [Indexed: 07/29/2023] Open
Abstract
BACKGROUND There is a prevailing view that humans' capacity to use language to characterize sensations like odors or tastes is poor, providing an unreliable source of information. METHODS Here, we developed a machine learning method based on Natural Language Processing (NLP) using Large Language Models (LLM) to predict COVID-19 diagnosis solely based on text descriptions of acute changes in chemosensation, i.e., smell, taste and chemesthesis, caused by the disease. The dataset of more than 1500 subjects was obtained from survey responses early in the COVID-19 pandemic, in Spring 2020. RESULTS When predicting COVID-19 diagnosis, our NLP model performs comparably (AUC ROC ~ 0.65) to models based on self-reported changes in function collected via quantitative rating scales. Further, our NLP model could attribute importance of words when performing the prediction; sentiment and descriptive words such as "smell", "taste", "sense", had strong contributions to the predictions. In addition, adjectives describing specific tastes or smells such as "salty", "sweet", "spicy", and "sour" also contributed considerably to predictions. CONCLUSIONS Our results show that the description of perceptual symptoms caused by a viral infection can be used to fine-tune an LLM model to correctly predict and interpret the diagnostic status of a subject. In the future, similar models may have utility for patient verbatims from online health portals or electronic health records.
Collapse
Affiliation(s)
- Hongyang Li
- Health Care and Life Sciences, IBM T.J. Watson Research Center, Yorktown Heights, NY, USA
| | - Richard C Gerkin
- School of Life Sciences, Arizona State University, Tempe, AZ, USA
- Osmo, Cambridge, MA, USA
| | - Alyssa Bakke
- Department of Food Science, The Pennsylvania State University, University Park, PA, USA
| | - Raquel Norel
- Health Care and Life Sciences, IBM T.J. Watson Research Center, Yorktown Heights, NY, USA
| | - Guillermo Cecchi
- Health Care and Life Sciences, IBM T.J. Watson Research Center, Yorktown Heights, NY, USA
| | | | - Masha Y Niv
- The Faculty of Agriculture, Food and Environment, The Hebrew University of Jerusalem, Rehovot, Israel
| | - Kathrin Ohla
- Department of Food Science, The Pennsylvania State University, University Park, PA, USA
- Science & Research, dsm-firmenich, Satigny, Switzerland
| | - John E Hayes
- Department of Food Science, The Pennsylvania State University, University Park, PA, USA
| | | | - Pablo Meyer
- Health Care and Life Sciences, IBM T.J. Watson Research Center, Yorktown Heights, NY, USA.
| |
Collapse
|
18
|
Levis M, Levy J, Dufort V, Russ CJ, Shiner B. Dynamic suicide topic modelling: Deriving population-specific, psychosocial and time-sensitive suicide risk variables from Electronic Health Record psychotherapy notes. Clin Psychol Psychother 2023; 30:795-810. [PMID: 36797651 PMCID: PMC11172400 DOI: 10.1002/cpp.2842] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2023] [Accepted: 02/14/2023] [Indexed: 02/18/2023]
Abstract
In the machine learning subfield of natural language processing, a topic model is a type of unsupervised method that is used to uncover abstract topics within a corpus of text. Dynamic topic modelling (DTM) is used for capturing change in these topics over time. The study deploys DTM on corpus of electronic health record psychotherapy notes. This retrospective study examines whether DTM helps distinguish closely matched patients that did and did not die by suicide. Cohort consists of United States Department of Veterans Affairs (VA) patients diagnosed with Posttraumatic Stress Disorder (PTSD) between 2004 and 2013. Each case (those who died by suicide during the year following diagnosis) was matched with five controls (those who remained alive) that shared psychotherapists and had similar suicide risk based on VA's suicide prediction algorithm. Cohort was restricted to patients who received psychotherapy for 9+ months after initial PTSD diagnoses (cases = 77; controls = 362). For cases, psychotherapy notes from diagnosis until death were examined. For controls, psychotherapy notes from diagnosis until matched case's death date were examined. A Python-based DTM algorithm was utilized. Derived topics identified population-specific themes, including PTSD, psychotherapy, medication, communication and relationships. Control topics changed significantly more over time than case topics. Topic differences highlighted engagement, expressivity and therapeutic alliance. This study strengthens groundwork for deriving population-specific, psychosocial and time-sensitive suicide risk variables.
Collapse
Affiliation(s)
- Maxwell Levis
- White River Junction VA Medical Center, Hartford, Vermont, USA
- Geisel School of Medicine at Dartmouth, Hanover, New Hampshire, USA
| | - Joshua Levy
- Geisel School of Medicine at Dartmouth, Hanover, New Hampshire, USA
| | - Vincent Dufort
- White River Junction VA Medical Center, Hartford, Vermont, USA
| | - Carey J. Russ
- White River Junction VA Medical Center, Hartford, Vermont, USA
- Geisel School of Medicine at Dartmouth, Hanover, New Hampshire, USA
| | - Brian Shiner
- White River Junction VA Medical Center, Hartford, Vermont, USA
- Geisel School of Medicine at Dartmouth, Hanover, New Hampshire, USA
- National Center for PTSD Executive Division, Hartford, Vermont, USA
| |
Collapse
|
19
|
Levis M, Levy J, Dent KR, Dufort V, Gobbel GT, Watts BV, Shiner B. Leveraging Natural Language Processing to Improve Electronic Health Record Suicide Risk Prediction for Veterans Health Administration Users. J Clin Psychiatry 2023; 84:22m14568. [PMID: 37341477 PMCID: PMC11157783 DOI: 10.4088/jcp.22m14568] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 06/22/2023]
Abstract
Background: Suicide risk prediction models frequently rely on structured electronic health record (EHR) data, including patient demographics and health care usage variables. Unstructured EHR data, such as clinical notes, may improve predictive accuracy by allowing access to detailed information that does not exist in structured data fields. To assess comparative benefits of including unstructured data, we developed a large case-control dataset matched on a state-of-the-art structured EHR suicide risk algorithm, utilized natural language processing (NLP) to derive a clinical note predictive model, and evaluated to what extent this model provided predictive accuracy over and above existing predictive thresholds. Methods: We developed a matched case-control sample of Veterans Health Administration (VHA) patients in 2017 and 2018. Each case (all patients that died by suicide in that interval, n = 4,584) was matched with 5 controls (patients who remained alive during treatment year) who shared the same suicide risk percentile. All sample EHR notes were selected and abstracted using NLP methods. We applied machine-learning classification algorithms to NLP output to develop predictive models. We calculated area under the curve (AUC) and suicide risk concentration to evaluate predictive accuracy overall and for high-risk patients. Results: The best performing NLP-derived models provided 19% overall additional predictive accuracy (AUC = 0.69; 95% CI, 0.67, 0.72) and 6-fold additional risk concentration for patients at the highest risk tier (top 0.1%), relative to the structured EHR model. Conclusions: The NLP-supplemented predictive models provided considerable benefit when compared to conventional structured EHR models. Results support future structured and unstructured EHR risk model integrations.
Collapse
Affiliation(s)
- Maxwell Levis
- VAMC White River Junction, White River Junction, Vermont
- Department of Psychiatry, Geisel School of Medicine, Hanover, New Hampshire
- Corresponding Author: Maxwell Levis, PhD, White River Junction VA Medical Center, 163 Veterans Dr, White River Junction, VT 05009
| | - Joshua Levy
- Departments of Pathology and Laboratory Medicine, Geisel School of Medicine, Hanover, New Hampshire
| | - Kallisse R Dent
- VA Serious Mental Illness Treatment Resource and Evaluation Center, Ann Arbor, Michigan
| | - Vincent Dufort
- VAMC White River Junction, White River Junction, Vermont
| | - Glenn T Gobbel
- Department of Biomedical Informatics, Nashville, Tennessee
| | - Bradley V Watts
- VAMC White River Junction, White River Junction, Vermont
- Department of Psychiatry, Geisel School of Medicine, Hanover, New Hampshire
- VA Office of Systems Redesign and Improvement, White River Junction, Vermont
| | - Brian Shiner
- VAMC White River Junction, White River Junction, Vermont
- Department of Psychiatry, Geisel School of Medicine, Hanover, New Hampshire
- National Center for PTSD, White River Junction, Vermont
| |
Collapse
|
20
|
Allen KC, Davis A, Krishnamurti T. Indirect Identification of Perinatal Psychosocial Risks from Natural Language. IEEE TRANSACTIONS ON AFFECTIVE COMPUTING 2023; 14:1506-1519. [PMID: 37266391 PMCID: PMC10234606 DOI: 10.1109/taffc.2021.3079282] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]
Abstract
During the perinatal period, psychosocial health risks, including depression and intimate partner violence, are associated with serious adverse health outcomes for birth parents and children. To appropriately intervene, healthcare professionals must first identify those at risk, yet stigma often prevents people from directly disclosing the information needed to prompt an assessment. In this research we use short diary entries to indirectly elicit information that could indicate psychosocial risks, then examine patterns that emerge in the language of those at risk. We find that diary entries exhibit consistent themes, extracted using topic modeling, and emotional perspective, drawn from dictionary-informed sentiment features. Using these features, we use regularized regression to predict screening measures for depression and psychological aggression by an intimate partner. Journal text entries quantified through topic models and sentiment features show promise for depression prediction, corresponding with self-reported screening measures almost as well as closed-form questions. Text-based features are less useful in predicting intimate partner violence, but topic models generate themes that align with known risk correlates. The indirect features uncovered in this research could aid in the detection and analysis of stigmatized risks.
Collapse
|
21
|
Sedano-Capdevila A, Toledo-Acosta M, Barrigon ML, Morales-González E, Torres-Moreno D, Martínez-Zaldivar B, Hermosillo-Valadez J, Baca-García E, Artes-Rodriguez A, Baca-García E, Berrouiguet S, Billot R, Carballo-Belloso JJ, Courtet P, Gomez DD, Lopez-Castroman J, Rodriguez MP, Aznar-Carbone J, Cegla F, Gutiérrez-Recacha P, Izaguirre-Gamir L, Herrera-Sanchez J, Borja MM, Palomar-Ciria N, Martínez ASE, Vasquez M, Vallejo-Oñate S, Vera-Varela C, Amodeo-Escribano S, Arrua E, Bautista O, Barrigón ML, Carmona R, Caro-Cañizares I, Carollo-Vivian S, Chamorro J, González-Granado M, Iza M, Jiménez-Giménez M, López-Gómez A, Mata-Iturralde L, Miguelez C, Muñoz-Lorenzo L, Navarro-Jiménez R, Ovejero S, Palacios ML, Pérez-Fominaya M, Peñuelas-Calvo I, Pérez-Colmenero S, Rico-Romano A, Rodriguez-Jover A, SánchezAlonso S, Sevilla-Vicente J, Vigil-López C, Villoria-Borrego L, Martin-Calvo M, Alcón-Durán A, Stasio ED, García-Vega JM, Martín-Calvo P, Ortega AJ, Segura-Valverde M, Bañón-González SM, Crespo-Llanos E, Codesal-Julián R, Frade-Ciudad A, Merino EH, Álvarez-García R, Coll-Font JM, Portillo-de Antonio P, Puras-Rico P, Sedano-Capdevila A, Serrano-Marugán L. Text mining methods for the characterisation of suicidal thoughts and behaviour. Psychiatry Res 2023; 322:115090. [PMID: 36803841 DOI: 10.1016/j.psychres.2023.115090] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/26/2022] [Revised: 01/23/2023] [Accepted: 01/28/2023] [Indexed: 02/07/2023]
Abstract
Traditional research methods have shown low predictive value for suicidal risk assessments and limitations to be applied in clinical practice. The authors sought to evaluate natural language processing as a new tool for assessing self-injurious thoughts and behaviors and emotions related. We used MEmind project to assess 2838 psychiatric outpatients. Anonymous unstructured responses to the open-ended question "how are you feeling today?" were collected according to their emotional state. Natural language processing was used to process the patients' writings. The texts were automatically represented (corpus) and analyzed to determine their emotional content and degree of suicidal risk. Authors compared the patients' texts with a question used to assess lack of desire to live, as a suicidal risk assessment tool. Corpus consists of 5,489 short free-text documents containing 12,256 tokenized or unique words. The natural language processing showed an ROC-AUC score of 0.9638 when compared with the responses to lack of a desire to live question. Natural language processing shows encouraging results for classifying subjects according to their desire not to live as a measure of suicidal risk using patients' free texts. It is also easily applicable to clinical practice and facilitates real-time communication with patients, allowing better intervention strategies to be designed.
Collapse
Affiliation(s)
| | - Mauricio Toledo-Acosta
- Centro de Investigación en Ciencias, Universidad Autónoma del Estado de Morelos, 62209 Cuernavaca, Morelos, México
| | - María Luisa Barrigon
- Department of Psychiatry, University Hospital Jimenez Diaz Foundation, Madrid, Spain; Institute of Psychiatry and Mental Health, Hospital General Universitario Gregorio Marañón, Madrid, Spain
| | - Eliseo Morales-González
- Centro de Investigación en Ciencias, Universidad Autónoma del Estado de Morelos, 62209 Cuernavaca, Morelos, México
| | - David Torres-Moreno
- Centro de Investigación en Ciencias, Universidad Autónoma del Estado de Morelos, 62209 Cuernavaca, Morelos, México
| | - Bolívar Martínez-Zaldivar
- Centro de Investigación en Ciencias, Universidad Autónoma del Estado de Morelos, 62209 Cuernavaca, Morelos, México
| | - Jorge Hermosillo-Valadez
- Centro de Investigación en Ciencias, Universidad Autónoma del Estado de Morelos, 62209 Cuernavaca, Morelos, México
| | - Enrique Baca-García
- Department of Psychiatry, University Hospital Rey Juan Carlos, Mostoles, Spain; Department of Psychiatry, University Hospital Jimenez Diaz Foundation, Madrid, Spain; Department of Psychiatry, General Hospital of Villalba, Madrid, Spain; Department of Psychiatry, University Hospital Infanta Elena, Valdemoro, Spain; Department of Psychiatry, Madrid Autonomous University, Madrid, Spain; CIBERSAM (Centro de Investigación en Salud Mental), Carlos III Institute of Health, Madrid, Spain; Universidad Católica del Maule, Talca, Chile; Department of psychiatry. Centre Hospitalier Universitaire de Nîmes, France.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
22
|
A review of natural language processing in the identification of suicidal behavior. JOURNAL OF AFFECTIVE DISORDERS REPORTS 2023. [DOI: 10.1016/j.jadr.2023.100507] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/25/2023] Open
|
23
|
Rosser AA, Qadadha YM, Thompson RJ, Jung HS, Jung S. Measuring the impact of simulation debriefing on the practices of interprofessional trauma teams using natural language processing. Am J Surg 2023; 225:394-399. [PMID: 36207174 DOI: 10.1016/j.amjsurg.2022.09.018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2022] [Revised: 08/29/2022] [Accepted: 09/13/2022] [Indexed: 11/24/2022]
Abstract
BACKGROUND Natural language processing (NLP) may be a tool for automating trauma teamwork assessment in simulated scenarios. METHODS Using the Trauma Nontechnical Skills Assessment (T-NOTECHS), raters assessed video recordings of trauma teams in simulated pre-debrief (Sim1) and post-debrief (Sim2) trauma resuscitations. We developed codes through directed content analysis and created algorithms capturing teamwork-related discourse through NLP. Using a within subjects pre-post design (n = 150), we compared changes in teams' Sim1 versus Sim2 T-NOTECHS scores and automatically coded discourse to identify which NLP algorithms could identify skills assessed by the T-NOTECHS. RESULTS Automatically coded behaviors revealed significant post-debrief increases in teams' simulation discourse: Verbalizing Findings, Acknowledging Communication, Directed Communication, Directing Assessment and Role Assignment, and Leader as Hub for Information. CONCLUSIONS Our results suggest NLP can capture changes in trauma team discourse. These findings have implications for the expedition of team assessment and innovations in real-time feedback when paired with speech-to-text technology.
Collapse
Affiliation(s)
| | - Yazeed M Qadadha
- Department of Surgery at the University of Wisconsin, Madison, USA.
| | - Ryan J Thompson
- Department of Emergency Medicine at the University of Wisconsin, Madison, USA.
| | - Hee Soo Jung
- Department of Surgery at University of Wisconsin-Madison As Well As Program Director of the Surgical Critical Care Fellowship and Director of Surgical Critical Care Services, USA.
| | - Sarah Jung
- Department of Surgery at University of Wisconsin, Madison, USA.
| |
Collapse
|
24
|
Arowosegbe A, Oyelade T. Application of Natural Language Processing (NLP) in Detecting and Preventing Suicide Ideation: A Systematic Review. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2023; 20:1514. [PMID: 36674270 PMCID: PMC9859480 DOI: 10.3390/ijerph20021514] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/08/2022] [Revised: 01/04/2023] [Accepted: 01/11/2023] [Indexed: 06/17/2023]
Abstract
(1) Introduction: Around a million people are reported to die by suicide every year, and due to the stigma associated with the nature of the death, this figure is usually assumed to be an underestimate. Machine learning and artificial intelligence such as natural language processing has the potential to become a major technique for the detection, diagnosis, and treatment of people. (2) Methods: PubMed, EMBASE, MEDLINE, PsycInfo, and Global Health databases were searched for studies that reported use of NLP for suicide ideation or self-harm. (3) Result: The preliminary search of 5 databases generated 387 results. Removal of duplicates resulted in 158 potentially suitable studies. Twenty papers were finally included in this review. (4) Discussion: Studies show that combining structured and unstructured data in NLP data modelling yielded more accurate results than utilizing either alone. Additionally, to reduce suicides, people with mental problems must be continuously and passively monitored. (5) Conclusions: The use of AI&ML opens new avenues for considerably guiding risk prediction and advancing suicide prevention frameworks. The review's analysis of the included research revealed that the use of NLP may result in low-cost and effective alternatives to existing resource-intensive methods of suicide prevention.
Collapse
Affiliation(s)
- Abayomi Arowosegbe
- Institute of Health Informatics, University College London, London NW1 2DA, UK
- Division of Informatics, Imaging & Data Sciences, University of Manchester, Manchester M13 9PL, UK
| | - Tope Oyelade
- Division of Medicine, University College London, London NW3 2PF, UK
| |
Collapse
|
25
|
Chadha A, Kaushik B. A Hybrid Deep Learning Model Using Grid Search and Cross-Validation for Effective Classification and Prediction of Suicidal Ideation from Social Network Data. NEW GENERATION COMPUTING 2022; 40:889-914. [PMID: 36267123 PMCID: PMC9573777 DOI: 10.1007/s00354-022-00191-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/15/2021] [Accepted: 10/01/2022] [Indexed: 06/16/2023]
Abstract
Suicide deaths due to depression and mental stress are growing rapidly at an alarming rate. People freely express their feelings and emotions on social network sites while they feel hesitant to express such feelings during face-to-face interactions with their dear ones. In this study, a dataset comprising 20,000 posts was taken from Reddit and preprocessed into tokens using a variety of effective word2vec techniques. A new hybrid approach is proposed by combining the attention model in a convolutional neural network and long-short-term- memory. The objective of this research is to develop an effective learning model to evaluate the data on social media for the efficient and accurate identification of people with suicidal ideation. The proposed attention convolution long short-term memory (ACL) model uses hyperparameter tuning using a grid search to select optimized hyperparameters. From the experimental evaluation, it is shown that the proposed model, that is, ACL with Glove embedding after hyperparameter tuning gives the highest Accuracy of 88.48%, Precision of 87.36%, F1 score of 90.82% and specificity of 79.23% and ACL with Random embedding gives the highest Recall of 94.94% when compared to the state-of-the-art algorithms.
Collapse
Affiliation(s)
- Akshma Chadha
- School of Computer Science and Engineering, Shri Mata Vaishno Devi University, Katra, Jammu India
| | - Baijnath Kaushik
- School of Computer Science and Engineering, Shri Mata Vaishno Devi University, Katra, Jammu India
| |
Collapse
|
26
|
Krishnamurti T, Allen K, Hayani L, Rodriguez S, Davis AL. Identification of maternal depression risk from natural language collected in a mobile health app. PROCEDIA COMPUTER SCIENCE 2022; 206:132-140. [PMID: 36712815 PMCID: PMC9879299 DOI: 10.1016/j.procs.2022.09.092] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
Abstract
Depression is one of the most common pregnancy complications, affecting approximately 15% of pregnant people. While valid psychometric measures of depression risk exist, they are not consistently administered at routine prenatal care, exacerbating the problem of adequate detection. The language we use in daily life offers a window into our psychological wellbeing. In this longitudinal observational cohort study of prenatal patients using a prenatal care mobile health app, we examine how features of app-entered natural language and other app-entered patient-reported data may be used as indicators for validated depression risk measures. Patient participants (n=1091) were prescribed a prenatal care app as part of a quality improvement initiative in the UPMC healthcare system from September 2019 - May 2022. Natural language from open-ended writing prompts in the app and self-reported daily mood, were entered by patients using the tool. Participants also completed a validated measure of depression risk - the Edinburgh Postnatal Depression Scale (EPDS) - at least once in their pregnancy. A variety of natural language processing tools were used to score sentiment, categorize topics, and capture other semantic and syntactic information from text entries. LASSO was used to model the relationship between the natural language features and depression risk. Open-ended text within a 30-day and 60-day timeframe of completing an EPDS was found to be moderately predictive of moderate to severe depression risk (AUROC=0.66 and 0.67, for each respective timeframe). When combined with average daily reported mood, open-ended text showed good predictive power (AUROC=0.87). Consistently predictive language features across all models included themes of "money" and "sadness." The combination of natural language and other user-reported data collected through a mobile health app offers an opportunity for identifying depression risk among a pregnant population.
Collapse
Affiliation(s)
- Tamar Krishnamurti
- Division of General Internal Medicine, University of Pittsburgh, Pittsburgh, PA, USA
| | - Kristen Allen
- Department of Engineering and Public Policy, Carnegie Mellon University, Pittsburgh, PA, USA
| | | | | | - Alexander L. Davis
- Department of Engineering and Public Policy, Carnegie Mellon University, Pittsburgh, PA, USA
| |
Collapse
|
27
|
Levis M, Levy J, Dufort V, Gobbel GT, Watts BV, Shiner B. Leveraging unstructured electronic medical record notes to derive population-specific suicide risk models. Psychiatry Res 2022; 315:114703. [PMID: 35841702 DOI: 10.1016/j.psychres.2022.114703] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/30/2022] [Revised: 06/17/2022] [Accepted: 06/29/2022] [Indexed: 01/11/2023]
Abstract
Electronic medical record (EMR)-based suicide risk prediction methods typically rely on analysis of structured variables such as demographics, visit history, and prescription data. Leveraging unstructured EMR notes may improve predictive accuracy by allowing access to nuanced clinical information. We utilized natural language processing (NLP) to analyze a large EMR note corpus to develop a data-driven suicide risk prediction model. We developed a matched case-control sample of U.S. Department of Veterans Affairs (VA) patients in 2015 and 2016. We randomly matched each case (all patients that died by suicide in that interval, n = 5029) with five controls (patients that remained alive). We processed note corpus using NLP methods and applied machine-learning classification algorithms to output. We calculated area under the curve (AUC) and risk tiers to determine predictive accuracy. NLP-derived models demonstrated strong predictive accuracy. Patients that scored within top 10% of risk model accounted for up to 29% of suicide decedents. NLP-derived model compares positively to other leading prediction methods. Our approach is highly implementable, only requiring access to text data and open-source software. Additional studies should evaluate ensemble models incorporating NLP-derived information alongside more typical structured variables.
Collapse
Affiliation(s)
- Maxwell Levis
- VAMC White River Junction, 163 Veterans Dr., White River Junction VT, 05009 United States; Department of Psychiatry, Geisel School of Medicine, 1 Rope Ferry Rd, Hanover NH, 03755 United States.
| | - Joshua Levy
- Departments of Pathology and Laboratory Medicine, Geisel School of Medicine, 1 Rope Ferry Rd, Hanover NH, 03755 United States
| | - Vincent Dufort
- VAMC White River Junction, 163 Veterans Dr., White River Junction VT, 05009 United States
| | - Glenn T Gobbel
- Department of Biomedical Informatics, 2201 West End Ave, Nashville TN, 37235 United States
| | - Bradley V Watts
- VAMC White River Junction, 163 Veterans Dr., White River Junction VT, 05009 United States; Department of Psychiatry, Geisel School of Medicine, 1 Rope Ferry Rd, Hanover NH, 03755 United States; VA Office of Systems Redesign and Improvement, 215 North Main Street, White River Junction VT, 05009, United States
| | - Brian Shiner
- VAMC White River Junction, 163 Veterans Dr., White River Junction VT, 05009 United States; Department of Psychiatry, Geisel School of Medicine, 1 Rope Ferry Rd, Hanover NH, 03755 United States; National Center for PTSD, White River Junction, VT, United States
| |
Collapse
|
28
|
Mahmoudi E, Wu W, Najarian C, Aikens J, Bynum J, Vydiswaran VV. Identify Caregiver Availability Using Medical Notes: Rule-Based Natural Language Processing. JMIR Aging 2022; 5:e40241. [PMID: 35998328 PMCID: PMC9539648 DOI: 10.2196/40241] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2022] [Revised: 07/28/2022] [Accepted: 08/16/2022] [Indexed: 11/23/2022] Open
Abstract
Background Identifying caregiver availability, particularly for patients with dementia or those with a disability, is critical to informing the appropriate care planning by the health systems, hospitals, and providers. This information is not readily available, and there is a paucity of pragmatic approaches to automatically identifying caregiver availability and type. Objective Our main objective was to use medical notes to assess caregiver availability and type for hospitalized patients with dementia. Our second objective was to identify whether the patient lived at home or resided at an institution. Methods In this retrospective cohort study, we used 2016-2019 telephone-encounter medical notes from a single institution to develop a rule-based natural language processing (NLP) algorithm to identify the patient’s caregiver availability and place of residence. Using note-level data, we compared the results of the NLP algorithm with human-conducted chart abstraction for both training (749/976, 77%) and test sets (227/976, 23%) for a total of 223 adults aged 65 years and older diagnosed with dementia. Our outcomes included determining whether the patients (1) reside at home or in an institution, (2) have a formal caregiver, and (3) have an informal caregiver. Results Test set results indicated that our NLP algorithm had high level of accuracy and reliability for identifying whether patients had an informal caregiver (F1=0.94, accuracy=0.95, sensitivity=0.97, and specificity=0.93), but was relatively less able to identify whether the patient lived at an institution (F1=0.64, accuracy=0.90, sensitivity=0.51, and specificity=0.98). The most common explanations for NLP misclassifications across all categories were (1) incomplete or misspelled facility names; (2) past, uncertain, or undecided status; (3) uncommon abbreviations; and (4) irregular use of templates. Conclusions This innovative work was the first to use medical notes to pragmatically determine caregiver availability. Our NLP algorithm identified whether hospitalized patients with dementia have a formal or informal caregiver and, to a lesser extent, whether they lived at home or in an institutional setting. There is merit in using NLP to identify caregivers. This study serves as a proof of concept. Future work can use other approaches and further identify caregivers and the extent of their availability.
Collapse
Affiliation(s)
- Elham Mahmoudi
- Department of Family Medicine, Medical School, University of Michigan, Institute for healthcare Policy and Innovation, University of Michigan, NCRC Building 14, Room G2342800 Plymouth Rd., Ann Arbor, US
| | - Wenbo Wu
- Department of Biostatistics, School of Public Health, University of Michigan, Ann Arbor, US
| | - Cyrus Najarian
- University of Michigan Medical School, University of Michigan, Ann Arbor, US
| | - James Aikens
- Department of Family Medicine, Medical School, University of Michigan, Ann Arbor, US
| | - Julie Bynum
- Medical School, University of Michigan, Ann Arbor, US
| | - Vg Vinod Vydiswaran
- Department of Learning Health Sciences, Medical School, University of Michigan, Ann Arbor, US
| |
Collapse
|
29
|
Coulthard B, Taylor BJ. Natural language processing to identify case factors in child protection court proceedings. METHODOLOGICAL INNOVATIONS 2022. [DOI: 10.1177/20597991221115967] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Social work case files hold rich detail about the lives and needs of vulnerable groups. Traditional case-reading studies to gain generalisable knowledge are resource-intensive, however, and sample sizes thereby limited. The advent of ‘big data’ technology, and vast repositories of centrally stored electronic records offer social work researchers novel alternatives, including data linkage and predictive risk modelling using administrative data. Free-text documents, however – including assessments, reports, and case chronologies – remain a largely untapped resource. This paper describes how 5000 social work court statements held by the Child and Family Court Advisory Support Service in England (Cafcass) were analysed using natural language processing (NLP) based on simple rules and mathematical principles. Thirteen factors relating to harm and risk to children involved in care proceedings in England were identified by automated computer techniques, and almost 90% agreement with professional readers achieved when the factors were clear-cut. The study represents an innovative approach for social work research on complex social problems. In conclusion, the paper discusses learning points; practical implications; future research avenues; and the technical and ethical challenges of NLP.
Collapse
Affiliation(s)
- Beth Coulthard
- School of Applied Social and Policy Sciences, Ulster University, Northern Ireland, UK
| | - Brian J Taylor
- School of Applied Social and Policy Sciences, Ulster University, Northern Ireland, UK
| |
Collapse
|
30
|
Boggs JM, Kafka JM. A Critical Review of Text Mining Applications for Suicide Research. CURR EPIDEMIOL REP 2022; 9:126-134. [PMID: 35911089 PMCID: PMC9315081 DOI: 10.1007/s40471-022-00293-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/23/2022] [Indexed: 11/28/2022]
Abstract
Purpose of Review Applying text mining to suicide research holds a great deal of promise. In this manuscript, literature from 2019 to 2021 is critically reviewed for text mining projects that use electronic health records, social media data, and death records. Recent Findings Text mining has helped identify risk factors for suicide in general and specific populations (e.g., older adults), has been combined with structured variables in EHRs to predict suicide risk, and has been used to track trends in social media suicidal discourse following population level events (e.g., COVID-19, celebrity suicides). Summary Future research should utilize text mining along with data linkage methods to capture more complete information on risk factors and outcomes across data sources (e.g., combining death records and EHRs), evaluate effectiveness of NLP-based intervention programs that use suicide risk prediction, establish standards for reporting accuracy of text mining programs to enable comparison across studies, and incorporate implementation science to understand feasibility, acceptability, and technical considerations.
Collapse
Affiliation(s)
- Jennifer M Boggs
- Kaiser Permanente Colorado, Institute for Health Research, Aurora, CO USA
| | - Julie M Kafka
- Department of Health Behavior, Gillings School of Global Public Health at University of North Carolina Chapel Hill, Chapel Hill, NC USA
| |
Collapse
|
31
|
Khan NZ, Javed MA. Use of Artificial Intelligence-Based Strategies for Assessing Suicidal Behavior and Mental Illness: A Literature Review. Cureus 2022; 14:e27225. [PMID: 36035036 PMCID: PMC9400551 DOI: 10.7759/cureus.27225] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/24/2022] [Indexed: 11/12/2022] Open
Abstract
Mental illness leading to suicide attempts is prevalent in a large portion of the population especially in low and middle-income nations. There remains a significant social stigma associated with mental illness that can lead to stigmatization of patients. Hence, patients are reluctant to communicate their problems to health care providers. Physicians have difficulty in timely identification of patients at risk for suicide. Novel and rigorously designed strategies are needed to determine the population at risk for suicide. This would be the first step in overcoming the multitude of barriers in the management of mental illness. Clinical tools and the use of electronic medical records (EMR) are time intensive. Recently, several artificial intelligence (AI)-based predictive technologies have gained momentum. The aim of this review is to summarize the recent advances in this landscape.
Collapse
|
32
|
Kalvesmaki AF, Chapman AB, Peterson KS, Pugh MJ, Jones M, Gleason TC. Analysis of a national response to a White House directive for ending veteran suicide. Health Serv Res 2022; 57 Suppl 1:32-41. [PMID: 35238027 DOI: 10.1111/1475-6773.13931] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2021] [Revised: 12/01/2021] [Accepted: 12/06/2021] [Indexed: 11/27/2022] Open
Abstract
OBJECTIVE Analyze responses to a national request for information (RFI) to uncover gaps in policy, practice, and understanding of veteran suicide to inform federal research strategy. DATA SOURCE An RFI with 21 open-ended questions generated from Presidential Executive Order #1386, administered nationally from July 3 to August 5, 2019. STUDY DESIGN Semi-structured, open-ended responses analyzed using a collaborative qualitative and text-mining data process. DATA EXTRACTION METHODS We aligned traditional qualitative methods with natural language processing (NLP) text-mining techniques to analyze 9040 open-ended question responses from 722 respondents to provide results within 3 months. Narrative inquiry and the medical explanatory model guided the data extraction and analytic process. RESULTS Five major themes were identified: risk factors, risk assessment, prevention and intervention, barriers to care, and data/research. Individuals and organizations mentioned different concepts within the same themes. In responses about risk factors, individuals frequently mentioned generic terms like "illness" while organizations mentioned specific terms like "traumatic brain injury." Organizations and individuals described unique barriers to care and emphasized ways to integrate data and research to improve points of care. Organizations often identified lack of funding as barriers while individuals often identified key moments for prevention such as military transitions and ensuring care providers have military cultural understanding. CONCLUSIONS This study provides an example of a rapid, adaptive analysis of a large body of qualitative, public response data about veteran suicide to support a federal strategy for an important public health topic. Combining qualitative and text-mining methods allowed a representation of voices and perspectives including the lived experiences of individuals who described stories of military transition, treatments that worked or did not, and the perspective of organizations treating veterans for suicide. The results supported the development of a national strategy to reduce suicide risks for veterans as well as civilians.
Collapse
Affiliation(s)
- Andrea F Kalvesmaki
- Informatics, Decision-Enhancement and Analytic Sciences Center (IDEAS) Center, VA Salt Lake City Health Care System, Salt Lake City, Utah, USA.,Division of Epidemiology, University of Utah School of Medicine, Salt Lake City, Utah, USA
| | - Alec B Chapman
- Informatics, Decision-Enhancement and Analytic Sciences Center (IDEAS) Center, VA Salt Lake City Health Care System, Salt Lake City, Utah, USA.,Division of Epidemiology, University of Utah School of Medicine, Salt Lake City, Utah, USA.,Department of Population Health Sciences, University of Utah School of Medicine, Salt Lake City, Utah, USA
| | - Kelly S Peterson
- Informatics, Decision-Enhancement and Analytic Sciences Center (IDEAS) Center, VA Salt Lake City Health Care System, Salt Lake City, Utah, USA.,Division of Epidemiology, University of Utah School of Medicine, Salt Lake City, Utah, USA.,Veteran Health Administration Office of Analytics and Performance Integration
| | - Mary Jo Pugh
- Informatics, Decision-Enhancement and Analytic Sciences Center (IDEAS) Center, VA Salt Lake City Health Care System, Salt Lake City, Utah, USA.,Division of Epidemiology, University of Utah School of Medicine, Salt Lake City, Utah, USA
| | - Makoto Jones
- Informatics, Decision-Enhancement and Analytic Sciences Center (IDEAS) Center, VA Salt Lake City Health Care System, Salt Lake City, Utah, USA.,Division of Epidemiology, University of Utah School of Medicine, Salt Lake City, Utah, USA
| | - Theresa C Gleason
- Department of Veteran Affairs, Clinical Science Research & Development Service (CSRD), Washington, District of Columbia, USA
| |
Collapse
|
33
|
Machine learning for suicidal ideation identification: A systematic literature review. COMPUTERS IN HUMAN BEHAVIOR 2022. [DOI: 10.1016/j.chb.2021.107095] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
|
34
|
Cerna S, Guyeux C, Laiymani D. The usefulness of NLP techniques for predicting peaks in firefighter interventions due to rare events. Neural Comput Appl 2022; 34:10117-10132. [PMID: 35250179 PMCID: PMC8881897 DOI: 10.1007/s00521-022-06996-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2021] [Accepted: 01/30/2022] [Indexed: 11/26/2022]
Abstract
In some countries such as France, the number of operations assisted by firefighters has shown an almost linear increase over the years, contrary to their resource capacity. For this reason, predicting the number of interventions has become a necessity. Initially, time series models were developed with several types of qualitative and quantitative features, including the alert level of the bulletins, to predict the operational load. We realized that interventions related to human activities are quite predictable. However, the recognition of interventions due to rare events such as storms or floods needs more than quantitative meteorological data to be identified, since there are almost always zero cases. Thus, this work proposes the application of natural language processing techniques, namely long short-term memory, convolutional neural networks, FlauBERT, and CamemBERT to extract features from the texts of weather bulletins in order to recognize periods with peak interventions, where the intense workload of firefighters is caused by rare events. Four categories identified as Emergency Person Rescue, Total Person Rescue, interventions related to Heating, and Storm/Flood were our targets for the multilabel classification models developed. The results showed a remarkable accuracy of 80%, 86%, 92%, and 86% for Emergency Rescue People, Total Rescue People, Heating, and Storm/Flood, respectively.
Collapse
Affiliation(s)
- Selene Cerna
- Femto-ST Institute, UMR 6174 CNRS, University Bourgogne Franche-Comté, Belfort, France
| | - Christophe Guyeux
- Femto-ST Institute, UMR 6174 CNRS, University Bourgogne Franche-Comté, Belfort, France
| | - David Laiymani
- Femto-ST Institute, UMR 6174 CNRS, University Bourgogne Franche-Comté, Belfort, France
| |
Collapse
|
35
|
Liu Z, Peach RL, Lawrance EL, Noble A, Ungless MA, Barahona M. Listening to Mental Health Crisis Needs at Scale: Using Natural Language Processing to Understand and Evaluate a Mental Health Crisis Text Messaging Service. Front Digit Health 2021; 3:779091. [PMID: 34939068 PMCID: PMC8685221 DOI: 10.3389/fdgth.2021.779091] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2021] [Accepted: 11/12/2021] [Indexed: 11/24/2022] Open
Abstract
The current mental health crisis is a growing public health issue requiring a large-scale response that cannot be met with traditional services alone. Digital support tools are proliferating, yet most are not systematically evaluated, and we know little about their users and their needs. Shout is a free mental health text messaging service run by the charity Mental Health Innovations, which provides support for individuals in the UK experiencing mental or emotional distress and seeking help. Here we study a large data set of anonymised text message conversations and post-conversation surveys compiled through Shout. This data provides an opportunity to hear at scale from those experiencing distress; to better understand mental health needs for people not using traditional mental health services; and to evaluate the impact of a novel form of crisis support. We use natural language processing (NLP) to assess the adherence of volunteers to conversation techniques and formats, and to gain insight into demographic user groups and their behavioural expressions of distress. Our textual analyses achieve accurate classification of conversation stages (weighted accuracy = 88%), behaviours (1-hamming loss = 95%) and texter demographics (weighted accuracy = 96%), exemplifying how the application of NLP to frontline mental health data sets can aid with post-hoc analysis and evaluation of quality of service provision in digital mental health services.
Collapse
Affiliation(s)
- Zhaolu Liu
- Department of Mathematics, Imperial College London, London, United Kingdom
| | - Robert L Peach
- Department of Mathematics, Imperial College London, London, United Kingdom.,Department of Neurology, University Hospital Würzburg, Würzburg, Germany.,Department of Brain Sciences, Imperial College London, London, United Kingdom
| | - Emma L Lawrance
- Institute of Global Health Innovation, Imperial College London, London, United Kingdom.,Mental Health Innovations, London, United Kingdom
| | - Ariele Noble
- Mental Health Innovations, London, United Kingdom
| | | | - Mauricio Barahona
- Department of Mathematics, Imperial College London, London, United Kingdom
| |
Collapse
|
36
|
Khalili-Mahani N, Holowka E, Woods S, Khaled R, Roy M, Lashley M, Glatard T, Timm-Bottos J, Dahan A, Niesters M, Hovey RB, Simon B, Kirmayer LJ. Play the Pain: A Digital Strategy for Play-Oriented Research and Action. Front Psychiatry 2021; 12:746477. [PMID: 34975566 PMCID: PMC8714795 DOI: 10.3389/fpsyt.2021.746477] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/23/2021] [Accepted: 11/11/2021] [Indexed: 12/26/2022] Open
Abstract
The value of understanding patients' illness experience and social contexts for advancing medicine and clinical care is widely acknowledged. However, methodologies for rigorous and inclusive data gathering and integrative analysis of biomedical, cultural, and social factors are limited. In this paper, we propose a digital strategy for large-scale qualitative health research, using play (as a state of being, a communication mode or context, and a set of imaginative, expressive, and game-like activities) as a research method for recursive learning and action planning. Our proposal builds on Gregory Bateson's cybernetic approach to knowledge production. Using chronic pain as an example, we show how pragmatic, structural and cultural constraints that define the relationship of patients to the healthcare system can give rise to conflicted messaging that impedes inclusive health research. We then review existing literature to illustrate how different types of play including games, chatbots, virtual worlds, and creative art making can contribute to research in chronic pain. Inspired by Frederick Steier's application of Bateson's theory to designing a science museum, we propose DiSPORA (Digital Strategy for Play-Oriented Research and Action), a virtual citizen science laboratory which provides a framework for delivering health information, tools for play-based experimentation, and data collection capacity, but is flexible in allowing participants to choose the mode and the extent of their interaction. Combined with other data management platforms used in epidemiological studies of neuropsychiatric illness, DiSPORA offers a tool for large-scale qualitative research, digital phenotyping, and advancing personalized medicine.
Collapse
Affiliation(s)
- Najmeh Khalili-Mahani
- McGill Centre for Integrative Neuroscience, Montreal Neurological Institute, McGill University, Montreal, QC, Canada
- Division of Social & Transcultural Psychiatry, McGill University, Montreal, QC, Canada
- Culture and Mental Health Research Unit, Lady Davis Institute, Jewish General Hospital, Montreal, QC, Canada
- Technoculture, Arts and Game Centre, Milieux Institute for Art, Culture and Technology, Concordia University, Montreal, QC, Canada
| | - Eileen Holowka
- Technoculture, Arts and Game Centre, Milieux Institute for Art, Culture and Technology, Concordia University, Montreal, QC, Canada
| | | | - Rilla Khaled
- Technoculture, Arts and Game Centre, Milieux Institute for Art, Culture and Technology, Concordia University, Montreal, QC, Canada
| | - Mathieu Roy
- Department of Psychology, McGill University, Montreal, QC, Canada
| | - Myrna Lashley
- Division of Social & Transcultural Psychiatry, McGill University, Montreal, QC, Canada
- Culture and Mental Health Research Unit, Lady Davis Institute, Jewish General Hospital, Montreal, QC, Canada
| | - Tristan Glatard
- Department of Computer Science, Concordia University, Montreal, QC, Canada
- PERFORM Centre, Concordia University, Montreal, QC, Canada
| | - Janis Timm-Bottos
- Department of Creative Art Therapies, Concordia University, Montreal, QC, Canada
| | - Albert Dahan
- Department of Anesthesiology, Leiden University Medical Centre, Leiden University, Leiden, Netherlands
| | - Marieke Niesters
- Department of Anesthesiology, Leiden University Medical Centre, Leiden University, Leiden, Netherlands
| | | | - Bart Simon
- Technoculture, Arts and Game Centre, Milieux Institute for Art, Culture and Technology, Concordia University, Montreal, QC, Canada
- Department of Sociology, Concordia University, Montreal, QC, Canada
| | - Laurence J. Kirmayer
- Division of Social & Transcultural Psychiatry, McGill University, Montreal, QC, Canada
- Culture and Mental Health Research Unit, Lady Davis Institute, Jewish General Hospital, Montreal, QC, Canada
| |
Collapse
|
37
|
Yang YC, Al-Garadi MA, Love JS, Perrone J, Sarker A. Automatic gender detection in Twitter profiles for health-related cohort studies. JAMIA Open 2021; 4:ooab042. [PMID: 34169232 PMCID: PMC8220305 DOI: 10.1093/jamiaopen/ooab042] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2021] [Revised: 04/27/2021] [Accepted: 05/04/2021] [Indexed: 11/17/2022] Open
Abstract
Objective Biomedical research involving social media data is gradually moving from population-level to targeted, cohort-level data analysis. Though crucial for biomedical studies, social media user’s demographic information (eg, gender) is often not explicitly known from profiles. Here, we present an automatic gender classification system for social media and we illustrate how gender information can be incorporated into a social media-based health-related study. Materials and Methods We used a large Twitter dataset composed of public, gender-labeled users (Dataset-1) for training and evaluating the gender detection pipeline. We experimented with machine learning algorithms including support vector machines (SVMs) and deep-learning models, and public packages including M3. We considered users’ information including profile and tweets for classification. We also developed a meta-classifier ensemble that strategically uses the predicted scores from the classifiers. We then applied the best-performing pipeline to Twitter users who have self-reported nonmedical use of prescription medications (Dataset-2) to assess the system’s utility. Results and Discussion We collected 67 181 and 176 683 users for Dataset-1 and Dataset-2, respectively. A meta-classifier involving SVM and M3 performed the best (Dataset-1 accuracy: 94.4% [95% confidence interval: 94.0–94.8%]; Dataset-2: 94.4% [95% confidence interval: 92.0–96.6%]). Including automatically classified information in the analyses of Dataset-2 revealed gender-specific trends—proportions of females closely resemble data from the National Survey of Drug Use and Health 2018 (tranquilizers: 0.50 vs 0.50; stimulants: 0.50 vs 0.45), and the overdose Emergency Room Visit due to Opioids by Nationwide Emergency Department Sample (pain relievers: 0.38 vs 0.37). Conclusion Our publicly available, automated gender detection pipeline may aid cohort-specific social media data analyses (https://bitbucket.org/sarkerlab/gender-detection-for-public).
Collapse
Affiliation(s)
- Yuan-Chi Yang
- Department of Biomedical Informatics, School of Medicine, Emory University, Atlanta, Georgia, USA
| | - Mohammed Ali Al-Garadi
- Department of Biomedical Informatics, School of Medicine, Emory University, Atlanta, Georgia, USA
| | - Jennifer S Love
- Department of Emergency Medicine, School of Medicine, Oregon Health & Science University, Portland, Oregon, USA
| | - Jeanmarie Perrone
- Department of Emergency Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| | - Abeed Sarker
- Department of Biomedical Informatics, School of Medicine, Emory University, Atlanta, Georgia, USA.,Department of Biomedical Engineering, Georgia Institute of Technology and Emory University, Atlanta, Georgia, USA
| |
Collapse
|
38
|
Su Z, McDonnell D, Bentley BL, He J, Shi F, Cheshmehzangi A, Ahmad J, Jia P. Addressing Biodisaster X Threats With Artificial Intelligence and 6G Technologies: Literature Review and Critical Insights. J Med Internet Res 2021; 23:e26109. [PMID: 33961583 PMCID: PMC8153034 DOI: 10.2196/26109] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2020] [Revised: 01/21/2021] [Accepted: 04/07/2021] [Indexed: 12/28/2022] Open
Abstract
BACKGROUND With advances in science and technology, biotechnology is becoming more accessible to people of all demographics. These advances inevitably hold the promise to improve personal and population well-being and welfare substantially. It is paradoxical that while greater access to biotechnology on a population level has many advantages, it may also increase the likelihood and frequency of biodisasters due to accidental or malicious use. Similar to "Disease X" (describing unknown naturally emerging pathogenic diseases with a pandemic potential), we term this unknown risk from biotechnologies "Biodisaster X." To date, no studies have examined the potential role of information technologies in preventing and mitigating Biodisaster X. OBJECTIVE This study aimed to explore (1) what Biodisaster X might entail and (2) solutions that use artificial intelligence (AI) and emerging 6G technologies to help monitor and manage Biodisaster X threats. METHODS A review of the literature on applying AI and 6G technologies for monitoring and managing biodisasters was conducted on PubMed, using articles published from database inception through to November 16, 2020. RESULTS Our findings show that Biodisaster X has the potential to upend lives and livelihoods and destroy economies, essentially posing a looming risk for civilizations worldwide. To shed light on Biodisaster X threats, we detailed effective AI and 6G-enabled strategies, ranging from natural language processing to deep learning-based image analysis to address issues ranging from early Biodisaster X detection (eg, identification of suspicious behaviors), remote design and development of pharmaceuticals (eg, treatment development), and public health interventions (eg, reactive shelter-at-home mandate enforcement), as well as disaster recovery (eg, sentiment analysis of social media posts to shed light on the public's feelings and readiness for recovery building). CONCLUSIONS Biodisaster X is a looming but avoidable catastrophe. Considering the potential human and economic consequences Biodisaster X could cause, actions that can effectively monitor and manage Biodisaster X threats must be taken promptly and proactively. Rather than solely depending on overstretched professional attention of health experts and government officials, it is perhaps more cost-effective and practical to deploy technology-based solutions to prevent and control Biodisaster X threats. This study discusses what Biodisaster X could entail and emphasizes the importance of monitoring and managing Biodisaster X threats by AI techniques and 6G technologies. Future studies could explore how the convergence of AI and 6G systems may further advance the preparedness for high-impact, less likely events beyond Biodisaster X.
Collapse
Affiliation(s)
- Zhaohui Su
- Center on Smart and Connected Health Technologies, Mays Cancer Center, School of Nursing, UT Health San Antonio, San Antonio, TX, United States
| | - Dean McDonnell
- Department of Humanities, Institute of Technology Carlow, Carlow, Ireland
| | - Barry L Bentley
- Cardiff School of Technologies, Cardiff Metropolitan University, Cardiff, United Kingdom
| | - Jiguang He
- Centre for Wireless Communications, University of Oulu, Oulu, Finland
| | - Feng Shi
- Department of Research and Development, Shanghai United Imaging Intelligence, Shanghai, China
| | - Ali Cheshmehzangi
- Faculty of Science and Engineering, University of Nottingham Ningbo China, Ningbo, China
- Network for Education and Research on Peace and Sustainability, Hiroshima University, Hiroshima, Japan
| | - Junaid Ahmad
- Prime Institute of Public Health, Peshawar Medical College, Peshawar, Pakistan
| | - Peng Jia
- Department of Land Surveying and Geo-Informatics, The Hong Kong Polytechnic University, Hong Kong, China
- International Institute of Spatial Lifecourse Epidemiology, Hong Kong, China
| |
Collapse
|
39
|
Le Glaz A, Haralambous Y, Kim-Dufor DH, Lenca P, Billot R, Ryan TC, Marsh J, DeVylder J, Walter M, Berrouiguet S, Lemey C. Machine Learning and Natural Language Processing in Mental Health: Systematic Review. J Med Internet Res 2021; 23:e15708. [PMID: 33944788 PMCID: PMC8132982 DOI: 10.2196/15708] [Citation(s) in RCA: 94] [Impact Index Per Article: 31.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2019] [Revised: 04/18/2020] [Accepted: 10/02/2020] [Indexed: 01/22/2023] Open
Abstract
BACKGROUND Machine learning systems are part of the field of artificial intelligence that automatically learn models from data to make better decisions. Natural language processing (NLP), by using corpora and learning approaches, provides good performance in statistical tasks, such as text classification or sentiment mining. OBJECTIVE The primary aim of this systematic review was to summarize and characterize, in methodological and technical terms, studies that used machine learning and NLP techniques for mental health. The secondary aim was to consider the potential use of these methods in mental health clinical practice. METHODS This systematic review follows the PRISMA (Preferred Reporting Items for Systematic Review and Meta-analysis) guidelines and is registered with PROSPERO (Prospective Register of Systematic Reviews; number CRD42019107376). The search was conducted using 4 medical databases (PubMed, Scopus, ScienceDirect, and PsycINFO) with the following keywords: machine learning, data mining, psychiatry, mental health, and mental disorder. The exclusion criteria were as follows: languages other than English, anonymization process, case studies, conference papers, and reviews. No limitations on publication dates were imposed. RESULTS A total of 327 articles were identified, of which 269 (82.3%) were excluded and 58 (17.7%) were included in the review. The results were organized through a qualitative perspective. Although studies had heterogeneous topics and methods, some themes emerged. Population studies could be grouped into 3 categories: patients included in medical databases, patients who came to the emergency room, and social media users. The main objectives were to extract symptoms, classify severity of illness, compare therapy effectiveness, provide psychopathological clues, and challenge the current nosography. Medical records and social media were the 2 major data sources. With regard to the methods used, preprocessing used the standard methods of NLP and unique identifier extraction dedicated to medical texts. Efficient classifiers were preferred rather than transparent functioning classifiers. Python was the most frequently used platform. CONCLUSIONS Machine learning and NLP models have been highly topical issues in medicine in recent years and may be considered a new paradigm in medical research. However, these processes tend to confirm clinical hypotheses rather than developing entirely new information, and only one major category of the population (ie, social media users) is an imprecise cohort. Moreover, some language-specific features can improve the performance of NLP methods, and their extension to other languages should be more closely investigated. However, machine learning and NLP techniques provide useful information from unexplored data (ie, patients' daily habits that are usually inaccessible to care providers). Before considering It as an additional tool of mental health care, ethical issues remain and should be discussed in a timely manner. Machine learning and NLP methods may offer multiple perspectives in mental health research but should also be considered as tools to support clinical practice.
Collapse
Affiliation(s)
- Aziliz Le Glaz
- URCI Mental Health Department, Brest Medical University Hospital, Brest, France
| | | | - Deok-Hee Kim-Dufor
- URCI Mental Health Department, Brest Medical University Hospital, Brest, France
| | - Philippe Lenca
- IMT Atlantique, Lab-STICC, UMR CNRS 6285, F-29238, Brest, France
| | - Romain Billot
- IMT Atlantique, Lab-STICC, UMR CNRS 6285, F-29238, Brest, France
| | - Taylor C Ryan
- Department of Mental Health, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, United States
| | - Jonathan Marsh
- Fordham University Graduate School of Social Service, New York, NY, United States
| | - Jordan DeVylder
- Fordham University Graduate School of Social Service, New York, NY, United States
| | - Michel Walter
- URCI Mental Health Department, Brest Medical University Hospital, Brest, France
- EA 7479 SPURBO, Université de Bretagne Occidentale, Brest, France
| | - Sofian Berrouiguet
- URCI Mental Health Department, Brest Medical University Hospital, Brest, France
- IMT Atlantique, Lab-STICC, UMR CNRS 6285, F-29238, Brest, France
- EA 7479 SPURBO, Université de Bretagne Occidentale, Brest, France
- LaTIM, INSERM, UMR 1101, Brest, France
| | - Christophe Lemey
- URCI Mental Health Department, Brest Medical University Hospital, Brest, France
- IMT Atlantique, Lab-STICC, UMR CNRS 6285, F-29238, Brest, France
- EA 7479 SPURBO, Université de Bretagne Occidentale, Brest, France
| |
Collapse
|
40
|
Rassy J, Bardon C, Dargis L, Côté LP, Corthésy-Blondin L, Mörch CM, Labelle R. Information and Communication Technology Use in Suicide Prevention: Scoping Review. J Med Internet Res 2021; 23:e25288. [PMID: 33820754 PMCID: PMC8132980 DOI: 10.2196/25288] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2020] [Revised: 02/10/2021] [Accepted: 03/16/2021] [Indexed: 12/16/2022] Open
Abstract
BACKGROUND The use of information and communication technology (ICT) in suicide prevention has progressed rapidly over the past decade. ICT plays a major role in suicide prevention, but research on best and promising practices has been slow. OBJECTIVE This paper aims to explore the existing literature on ICT use in suicide prevention to answer the following question: what are the best and most promising ICT practices for suicide prevention? METHODS A scoping search was conducted using the following databases: PubMed, PsycINFO, Sociological Abstracts, and IEEE Xplore. These databases were searched for articles published between January 1, 2013, and December 31, 2018. The five stages of the scoping review process were as follows: identifying research questions; targeting relevant studies; selecting studies; charting data; and collating, summarizing, and reporting the results. The World Health Organization suicide prevention model was used according to the continuum of universal, selective, and indicated prevention. RESULTS Of the 3848 studies identified, 115 (2.99%) were selected. Of these, 10 regarded the use of ICT in universal suicide prevention, 53 referred to the use of ICT in selective suicide prevention, and 52 dealt with the use of ICT in indicated suicide prevention. CONCLUSIONS The use of ICT plays a major role in suicide prevention, and many promising programs were identified through this scoping review. However, large-scale evaluation studies are needed to further examine the effectiveness of these programs and strategies. In addition, safety and ethics protocols for ICT-based interventions are recommended.
Collapse
Affiliation(s)
- Jessica Rassy
- Center for Research and Intervention on Suicide, Ethical Issues and End-of-Life Practices, Université du Québec à Montréal, Montréal, QC, Canada
- Research Center, Institut universitaire en santé mentale de Montréal, Montréal, QC, Canada
- School of Nursing, Université de Sherbrooke, Longueuil, QC, Canada
- Quebec Network on Nursing Intervention Research, Montréal, QC, Canada
| | - Cécile Bardon
- Center for Research and Intervention on Suicide, Ethical Issues and End-of-Life Practices, Université du Québec à Montréal, Montréal, QC, Canada
- Department of Psychology, Université du Québec à Montréal, Montréal, QC, Canada
| | - Luc Dargis
- Center for Research and Intervention on Suicide, Ethical Issues and End-of-Life Practices, Université du Québec à Montréal, Montréal, QC, Canada
| | - Louis-Philippe Côté
- Center for Research and Intervention on Suicide, Ethical Issues and End-of-Life Practices, Université du Québec à Montréal, Montréal, QC, Canada
- Department of Psychology, Université du Québec à Montréal, Montréal, QC, Canada
| | - Laurent Corthésy-Blondin
- Center for Research and Intervention on Suicide, Ethical Issues and End-of-Life Practices, Université du Québec à Montréal, Montréal, QC, Canada
- Department of Psychology, Université du Québec à Montréal, Montréal, QC, Canada
| | - Carl-Maria Mörch
- Center for Research and Intervention on Suicide, Ethical Issues and End-of-Life Practices, Université du Québec à Montréal, Montréal, QC, Canada
- Department of Psychology, Université du Québec à Montréal, Montréal, QC, Canada
- Algora Lab, Université de Montréal, Montréal, QC, Canada
- Mila, Quebec Artificial Intelligence Institute, Montréal, QC, Canada
| | - Réal Labelle
- Center for Research and Intervention on Suicide, Ethical Issues and End-of-Life Practices, Université du Québec à Montréal, Montréal, QC, Canada
- Research Center, Institut universitaire en santé mentale de Montréal, Montréal, QC, Canada
- Department of Psychology, Université du Québec à Montréal, Montréal, QC, Canada
- Department of Psychiatry, Université de Montréal, Montréal, QC, Canada
| |
Collapse
|
41
|
|
42
|
Resnik P, Foreman A, Kuchuk M, Musacchio Schafer K, Pinkham B. Naturally occurring language as a source of evidence in suicide prevention. Suicide Life Threat Behav 2021; 51:88-96. [PMID: 32914479 DOI: 10.1111/sltb.12674] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Abstract
We discuss computational language analysis as it pertains to suicide prevention research, with an emphasis on providing non-technologists with an understanding of key issues and, equally important, considering its relation to the broader enterprise of suicide prevention. Our emphasis here is on naturally occurring language in social media, motivated by its non-intrusive ability to yield high-value information that in the past has been largely unavailable to clinicians.
Collapse
Affiliation(s)
| | - April Foreman
- American Association of Suicidology, Washington, District of Columbia, USA
| | - Michelle Kuchuk
- Vibrant Emotional Health, New York, New York, USA.,National Suicide Prevention Lifeline, New York, New York, USA
| | | | - Beau Pinkham
- American Association of Suicidology, Washington, District of Columbia, USA.,National Suicide Prevention Lifeline, New York, New York, USA.,International Council for Helplines, Nashville, Tennessee, USA
| |
Collapse
|
43
|
Jacobucci R, Ammerman BA, Tyler Wilcox K. The use of text-based responses to improve our understanding and prediction of suicide risk. Suicide Life Threat Behav 2021; 51:55-64. [PMID: 33624877 DOI: 10.1111/sltb.12668] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
OBJECTIVE Text-based responses may provide significant contributions to suicide risk prediction, yet research including text data is limited. This may be due to a lack of exposure and familiarity with statistical analyses for this data structure. METHOD The current study provides an overview of data processing and statistical algorithms for text data, guided by an empirical example of 947 online participants who completed both open-ended items and traditional self-report measures. We give an introduction to a number of text-based statistical approaches, including dictionary-based methods, topic modeling, word embeddings, and deep learning. RESULTS We analyze responses from the open-ended question "How do you feel today?", detailing characteristics of the responses, as well as predicting past-year suicidal ideation. CONCLUSIONS We see the analysis of text from social media, open-ended questions, and other text sources (i.e., medical records) as an important form of complementary assessment to traditional scales, shedding insight on what we are missing in our current set of questionnaires, which may ultimately serve to improve both our understanding and prediction of suicide.
Collapse
Affiliation(s)
- Ross Jacobucci
- Department of Psychology, University of Notre Dame, Notre Dame, Indiana
| | - Brooke A Ammerman
- Department of Psychology, University of Notre Dame, Notre Dame, Indiana
| | | |
Collapse
|
44
|
AIM in Oncology. Artif Intell Med 2021. [DOI: 10.1007/978-3-030-58080-3_94-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
45
|
Bako AT, Taylor HL, Wiley K, Zheng J, Walter-McCabe H, Kasthurirathne SN, Vest JR. Using natural language processing to classify social work interventions. AMERICAN JOURNAL OF MANAGED CARE 2021; 27:e24-e31. [PMID: 33471465 DOI: 10.37765/ajmc.2021.88580] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
OBJECTIVES Health care organizations are increasingly employing social workers to address patients' social needs. However, social work (SW) activities in health care settings are largely captured as text data within electronic health records (EHRs), making measurement and analysis difficult. This study aims to extract and classify, from EHR notes, interventions intended to address patients' social needs using natural language processing (NLP) and machine learning (ML) algorithms. STUDY DESIGN Secondary data analysis of a longitudinal cohort. METHODS We extracted 815 SW encounter notes from the EHR system of a federally qualified health center. We reviewed the literature to derive a 10-category classification scheme for SW interventions. We applied NLP and ML algorithms to categorize the documented SW interventions in EHR notes according to the 10-category classification scheme. RESULTS Most of the SW notes (n = 598; 73.4%) contained at least 1 SW intervention. The most frequent interventions offered by social workers included care coordination (21.5%), education (21.0%), financial planning (18.5%), referral to community services and organizations (17.1%), and supportive counseling (15.3%). High-performing classification algorithms included the kernelized support vector machine (SVM) (accuracy, 0.97), logistic regression (accuracy, 0.96), linear SVM (accuracy, 0.95), and multinomial naive Bayes classifier (accuracy, 0.92). CONCLUSIONS NLP and ML can be utilized for automated identification and classification of SW interventions documented in EHRs. Health care administrators can leverage this automated approach to gain better insight into the most needed social interventions in the patient population served by their organizations. Such information can be applied in managerial decisions related to SW staffing, resource allocation, and patients' social needs.
Collapse
Affiliation(s)
- Abdulaziz Tijjani Bako
- Department of Health Policy and Management, Richard M. Fairbanks School of Public Health, Indiana University-Purdue University Indianapolis, 1050 Wishard Blvd, Indianapolis, IN 46202.
| | | | | | | | | | | | | |
Collapse
|
46
|
Badal VD, Graham SA, Depp CA, Shinkawa K, Yamada Y, Palinkas LA, Kim HC, Jeste DV, Lee EE. Prediction of Loneliness in Older Adults Using Natural Language Processing: Exploring Sex Differences in Speech. Am J Geriatr Psychiatry 2020; 29:853-866. [PMID: 33039266 PMCID: PMC7486862 DOI: 10.1016/j.jagp.2020.09.009] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/22/2020] [Revised: 09/02/2020] [Accepted: 09/04/2020] [Indexed: 12/17/2022]
Abstract
OBJECTIVE The growing pandemic of loneliness has great relevance to aging populations, though assessments are limited by self-report approaches. This paper explores the use of artificial intelligence (AI) technology to evaluate interviews on loneliness, notably, employing natural language processing (NLP) to quantify sentiment and features that indicate loneliness in transcribed speech text of older adults. DESIGN Participants completed semi-structured qualitative interviews regarding the experience of loneliness and a quantitative self-report scale (University of California Los Angeles or UCLA Loneliness scale) to assess loneliness. Lonely and non-lonely participants (based on qualitative and quantitative assessments) were compared. SETTING Independent living sector of a senior housing community in San Diego County. PARTICIPANTS Eighty English-speaking older adults with age range 66-94 (mean 83 years). MEASUREMENTS Interviews were audiotaped and manually transcribed. Transcripts were examined using NLP approaches to quantify sentiment and expressed emotions. RESULTS Lonely individuals (by qualitative assessments) had longer responses with greater expression of sadness to direct questions about loneliness. Women were more likely to endorse feeling lonely during the qualitative interview. Men used more fearful and joyful words in their responses. Using linguistic features, machine learning models could predict qualitative loneliness with 94% precision (sensitivity = 0.90, specificity = 1.00) and quantitative loneliness with 76% precision (sensitivity = 0.57, specificity = 0.89). CONCLUSIONS AI (e.g., NLP and machine learning approaches) can provide unique insights into how linguistic features of transcribed speech data may reflect loneliness. Eventually linguistic features could be used to assess loneliness of individuals, despite limitations of commercially developed natural language understanding programs.
Collapse
Affiliation(s)
- Varsha D Badal
- Department of Psychiatry (VDB, SAG, CAD, DVJ, EEL), University of California San Diego, San Diego, CA; Sam and Rose Stein Institute for Research on Aging (VDB, SAG, CAD, DVJ, EEL), University of California San Diego, San Diego, CA
| | - Sarah A Graham
- Department of Psychiatry (VDB, SAG, CAD, DVJ, EEL), University of California San Diego, San Diego, CA; Sam and Rose Stein Institute for Research on Aging (VDB, SAG, CAD, DVJ, EEL), University of California San Diego, San Diego, CA
| | - Colin A Depp
- Department of Psychiatry (VDB, SAG, CAD, DVJ, EEL), University of California San Diego, San Diego, CA; Sam and Rose Stein Institute for Research on Aging (VDB, SAG, CAD, DVJ, EEL), University of California San Diego, San Diego, CA; VA San Diego Healthcare System (CAD, EEL), La Jolla, CA
| | - Kaoru Shinkawa
- Accessibility and Aging, IBM Research-Tokyo (KS, YY), Tokyo, Japan
| | - Yasunori Yamada
- Accessibility and Aging, IBM Research-Tokyo (KS, YY), Tokyo, Japan
| | - Lawrence A Palinkas
- Suzanne Dworak Peck School of Social Work (LAP), University of Southern California, Los Angeles, CA
| | - Ho-Cheol Kim
- AI and Cognitive Software, IBM Research-Almaden (HCK), San Jose, CA
| | - Dilip V Jeste
- Department of Psychiatry (VDB, SAG, CAD, DVJ, EEL), University of California San Diego, San Diego, CA; Sam and Rose Stein Institute for Research on Aging (VDB, SAG, CAD, DVJ, EEL), University of California San Diego, San Diego, CA; Department of Neurosciences (DVJ), University of California San Diego, La Jolla, CA
| | - Ellen E Lee
- Department of Psychiatry (VDB, SAG, CAD, DVJ, EEL), University of California San Diego, San Diego, CA; Sam and Rose Stein Institute for Research on Aging (VDB, SAG, CAD, DVJ, EEL), University of California San Diego, San Diego, CA; VA San Diego Healthcare System (CAD, EEL), La Jolla, CA.
| |
Collapse
|
47
|
Parrott S, Britt BC, Hayes JL, Albright DL. Social Media and Suicide: A Validation of Terms to Help Identify Suicide-related Social Media Posts. JOURNAL OF EVIDENCE-BASED SOCIAL WORK (2019) 2020; 17:624-634. [PMID: 32619146 DOI: 10.1080/26408066.2020.1788478] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
BACKGROUND Communication plays an important role in the prevention of suicide, a leading cause of death in the United States. Prior research suggests people who die by suicide often communicate their intent to more than one member of their social network. The ubiquity of social media in modern society means an individual's social network may be larger than ever before, which has contributed to a proliferation of colloquial terms and phrases to describe suicide. AIMS The present study collected and validated suicide-related terms from the U.S. English language in 2018-2019. By validating clinical and lay terms with people on the front lines of suicide prevention, the study provides a necessary foundation for lexical analyses of suicide communication on social media. METHOD 98 terms related to suicide were collected from online, academic, and other sources. Mental health professionals and members of the electronic mailing list of the American Association of Suicidology were asked to validate terms. RESULTS The survey validated common terms used to communicate about suicide. LIMITATIONS The lexicon did not capture international phrases. It also did not document less direct language, such as expressions of emotion.
Collapse
|
48
|
Bernert RA, Hilberg AM, Melia R, Kim JP, Shah NH, Abnousi F. Artificial Intelligence and Suicide Prevention: A Systematic Review of Machine Learning Investigations. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2020; 17:E5929. [PMID: 32824149 PMCID: PMC7460360 DOI: 10.3390/ijerph17165929] [Citation(s) in RCA: 58] [Impact Index Per Article: 14.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/20/2020] [Accepted: 07/28/2020] [Indexed: 12/12/2022]
Abstract
Suicide is a leading cause of death that defies prediction and challenges prevention efforts worldwide. Artificial intelligence (AI) and machine learning (ML) have emerged as a means of investigating large datasets to enhance risk detection. A systematic review of ML investigations evaluating suicidal behaviors was conducted using PubMed/MEDLINE, PsychInfo, Web-of-Science, and EMBASE, employing search strings and MeSH terms relevant to suicide and AI. Databases were supplemented by hand-search techniques and Google Scholar. Inclusion criteria: (1) journal article, available in English, (2) original investigation, (3) employment of AI/ML, (4) evaluation of a suicide risk outcome. N = 594 records were identified based on abstract search, and 25 hand-searched reports. N = 461 reports remained after duplicates were removed, n = 316 were excluded after abstract screening. Of n = 149 full-text articles assessed for eligibility, n = 87 were included for quantitative synthesis, grouped according to suicide behavior outcome. Reports varied widely in methodology and outcomes. Results suggest high levels of risk classification accuracy (>90%) and Area Under the Curve (AUC) in the prediction of suicidal behaviors. We report key findings and central limitations in the use of AI/ML frameworks to guide additional research, which hold the potential to impact suicide on broad scale.
Collapse
Affiliation(s)
- Rebecca A. Bernert
- Stanford Suicide Prevention Research Laboratory, Department of Psychiatry and Behavioral Sciences, Stanford University School of Medicine, Stanford, CA 94304, USA
| | - Amanda M. Hilberg
- Stanford Suicide Prevention Research Laboratory, Department of Psychiatry and Behavioral Sciences, Stanford University School of Medicine, Stanford, CA 94304, USA
| | - Ruth Melia
- Stanford Suicide Prevention Research Laboratory, Department of Psychiatry and Behavioral Sciences, Stanford University School of Medicine, Stanford, CA 94304, USA
- Department of Psychology, National University of Ireland, Galway, Ireland
| | - Jane Paik Kim
- Stanford Suicide Prevention Research Laboratory, Department of Psychiatry and Behavioral Sciences, Stanford University School of Medicine, Stanford, CA 94304, USA
| | - Nigam H. Shah
- Department of Medicine, Center for Biomedical Informatics Research, Stanford University School of Medicine, Stanford, CA 94304, USA
- Informatics, Stanford Center for Clinical and Translational Research, and Education (Spectrum), Stanford University, Stanford CA 94304, USA
| | - Freddy Abnousi
- Facebook, Menlo Park, CA 94025, USA
- Yale University School of Medicine, New Haven, CT 06510, USA
| |
Collapse
|
49
|
Funk B, Sadeh-Sharvit S, Fitzsimmons-Craft EE, Trockel MT, Monterubio GE, Goel NJ, Balantekin KN, Eichen DM, Flatt RE, Firebaugh ML, Jacobi C, Graham AK, Hoogendoorn M, Wilfley DE, Taylor CB. A Framework for Applying Natural Language Processing in Digital Health Interventions. J Med Internet Res 2020; 22:e13855. [PMID: 32130118 PMCID: PMC7059510 DOI: 10.2196/13855] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2019] [Revised: 07/28/2019] [Accepted: 07/28/2019] [Indexed: 01/12/2023] Open
Abstract
BACKGROUND Digital health interventions (DHIs) are poised to reduce target symptoms in a scalable, affordable, and empirically supported way. DHIs that involve coaching or clinical support often collect text data from 2 sources: (1) open correspondence between users and the trained practitioners supporting them through a messaging system and (2) text data recorded during the intervention by users, such as diary entries. Natural language processing (NLP) offers methods for analyzing text, augmenting the understanding of intervention effects, and informing therapeutic decision making. OBJECTIVE This study aimed to present a technical framework that supports the automated analysis of both types of text data often present in DHIs. This framework generates text features and helps to build statistical models to predict target variables, including user engagement, symptom change, and therapeutic outcomes. METHODS We first discussed various NLP techniques and demonstrated how they are implemented in the presented framework. We then applied the framework in a case study of the Healthy Body Image Program, a Web-based intervention trial for eating disorders (EDs). A total of 372 participants who screened positive for an ED received a DHI aimed at reducing ED psychopathology (including binge eating and purging behaviors) and improving body image. These users generated 37,228 intervention text snippets and exchanged 4285 user-coach messages, which were analyzed using the proposed model. RESULTS We applied the framework to predict binge eating behavior, resulting in an area under the curve between 0.57 (when applied to new users) and 0.72 (when applied to new symptom reports of known users). In addition, initial evidence indicated that specific text features predicted the therapeutic outcome of reducing ED symptoms. CONCLUSIONS The case study demonstrates the usefulness of a structured approach to text data analytics. NLP techniques improve the prediction of symptom changes in DHIs. We present a technical framework that can be easily applied in other clinical trials and clinical presentations and encourage other groups to apply the framework in similar contexts.
Collapse
Affiliation(s)
- Burkhardt Funk
- Leuphana University, Institute of Information Systems, Lueneburg, Germany
| | - Shiri Sadeh-Sharvit
- Palo Alto University, Center for m2Health, Palo Alto, CA, United States
- Stanford University, Department of Psychiatry and Behavioral Sciences, Stanford, CA, United States
| | | | - Mickey Todd Trockel
- Stanford University, Department of Psychiatry and Behavioral Sciences, Stanford, CA, United States
| | - Grace E Monterubio
- Washington University in St Louis, Department of Psychiatry, St Louis, MO, United States
| | - Neha J Goel
- Palo Alto University, Center for m2Health, Palo Alto, CA, United States
- Stanford University, Department of Psychiatry and Behavioral Sciences, Stanford, CA, United States
| | - Katherine N Balantekin
- Washington University in St Louis, Department of Psychiatry, St Louis, MO, United States
- University at Buffalo, Department of Exercise and Nutrition Sciences, Buffalo, NY, United States
| | - Dawn M Eichen
- Washington University in St Louis, Department of Psychiatry, St Louis, MO, United States
- University of California San Diego, Department of Pediatrics, San Diego, CA, United States
| | - Rachael E Flatt
- Palo Alto University, Center for m2Health, Palo Alto, CA, United States
- Stanford University, Department of Psychiatry and Behavioral Sciences, Stanford, CA, United States
- University of North Carolina at Chapel Hill, Department of Psychology and Neurosciences, Chapel Hill, NC, United States
| | - Marie-Laure Firebaugh
- Washington University in St Louis, Department of Psychiatry, St Louis, MO, United States
| | - Corinna Jacobi
- Technische Universität, Institute of Clinical Psychology and Psychotherapy, Dresden, Germany
| | - Andrea K Graham
- Northwestern University, Department of Medical Social Sciences, Chicago, IL, United States
| | - Mark Hoogendoorn
- Vrije Universiteit, Department of Computer Science, Amsterdam, Netherlands
| | - Denise E Wilfley
- Washington University in St Louis, Department of Psychiatry, St Louis, MO, United States
| | - C Barr Taylor
- Palo Alto University, Center for m2Health, Palo Alto, CA, United States
- Stanford University, Department of Psychiatry and Behavioral Sciences, Stanford, CA, United States
| |
Collapse
|
50
|
Walsh CG, Chaudhry B, Dua P, Goodman KW, Kaplan B, Kavuluru R, Solomonides A, Subbian V. Stigma, biomarkers, and algorithmic bias: recommendations for precision behavioral health with artificial intelligence. JAMIA Open 2020; 3:9-15. [PMID: 32607482 PMCID: PMC7309258 DOI: 10.1093/jamiaopen/ooz054] [Citation(s) in RCA: 39] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2019] [Revised: 07/29/2019] [Accepted: 10/30/2019] [Indexed: 12/22/2022] Open
Abstract
Effective implementation of artificial intelligence in behavioral healthcare delivery depends on overcoming challenges that are pronounced in this domain. Self and social stigma contribute to under-reported symptoms, and under-coding worsens ascertainment. Health disparities contribute to algorithmic bias. Lack of reliable biological and clinical markers hinders model development, and model explainability challenges impede trust among users. In this perspective, we describe these challenges and discuss design and implementation recommendations to overcome them in intelligent systems for behavioral and mental health.
Collapse
Affiliation(s)
- Colin G Walsh
- Biomedical Informatics, Medicine and Psychiatry, Vanderbilt University Medical Center, 2525 West End, Suite 1475, Nashville, TN, USA
| | - Beenish Chaudhry
- School of Computing and Informatics, University of Louisiana at Lafayette, Lafayette, Louisiana, USA
| | - Prerna Dua
- Department of Health Informatics and Information Management, Louisiana Tech University, Ruston, Louisiana, USA
| | - Kenneth W Goodman
- Institute for Bioethics and Health Policy, University of Miami, Miller School of Medicine, Miami, Florida, USA
| | - Bonnie Kaplan
- Yale Center for Medical Informatics, Yale Bioethics Center, Yale Information Society, Yale Solomon Center for Health Law & Policy, Yale University, New Haven, Connecticut, USA
| | - Ramakanth Kavuluru
- Division of Biomedical Informatics, Department of Internal Medicine, University of Kentucky, Lexington, Kentucky, USA
| | - Anthony Solomonides
- Outcomes Research and Biomedical Informatics, NorthShore University HealthSystem, Research Institute, Evanston, Illinois, USA
| | - Vignesh Subbian
- Department of Biomedical Engineering, Department of Systems and Industrial Engineering, The University of Arizona, Tucson, Arizona, USA
| |
Collapse
|