Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Abbe A, Grouin C, Zweigenbaum P, Falissard B. Text mining applications in psychiatry: a systematic literature review. Int J Methods Psychiatr Res 2016;25:86-100. [PMID: 26184780 PMCID: PMC6877250 DOI: 10.1002/mpr.1481] [Citation(s) in RCA: 59] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/05/2014] [Revised: 01/21/2015] [Accepted: 04/09/2015] [Indexed: 11/08/2022] Open

For:	Abbe A, Grouin C, Zweigenbaum P, Falissard B. Text mining applications in psychiatry: a systematic literature review. Int J Methods Psychiatr Res 2016;25:86-100. [PMID: 26184780 PMCID: PMC6877250 DOI: 10.1002/mpr.1481] [Citation(s) in RCA: 59] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/05/2014] [Revised: 01/21/2015] [Accepted: 04/09/2015] [Indexed: 11/08/2022] Open

Number

Cited by Other Article(s)

Stanhope V, Yoo N, Matthews E, Baslock D, Hu Y. The Impact of Collaborative Documentation on Person-Centered Care: Textual Analysis of Clinical Notes. JMIR Med Inform 2024;12:e52678. [PMID: 39302636 DOI: 10.2196/52678] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2023] [Revised: 06/07/2024] [Accepted: 06/26/2024] [Indexed: 09/22/2024] Open

Abstract

Background

Collaborative documentation (CD) is a behavioral health practice involving shared writing of clinic visit notes by providers and consumers. Despite widespread dissemination of CD, research on its effectiveness or impact on person-centered care (PCC) has been limited. Principles of PCC planning, a recovery-based approach to service planning that operationalizes PCC, can inform the measurement of person-centeredness within clinical documentation.

Objective

This study aims to use the clinical informatics approach of natural language processing (NLP) to examine the impact of CD on person-centeredness in clinic visit notes. Using a dictionary-based approach, this study conducts a textual analysis of clinic notes from a community mental health center before and after staff were trained in CD.

Methods

This study used visit notes (n=1981) from 10 providers in a community mental health center 6 months before and after training in CD. LIWC-22 was used to assess all notes using the Linguistic Inquiry and Word Count (LIWC) dictionary, which categorizes over 5000 linguistic and psychological words. Twelve LIWC categories were selected and mapped onto PCC planning principles through the consensus of 3 domain experts. The LIWC-22 contextualizer was used to extract sentence fragments from notes corresponding to LIWC categories. Then, fixed-effects modeling was used to identify differences in notes before and after CD training while accounting for nesting within the provider.

Results

Sentence fragments identified by the contextualizing process illustrated how visit notes demonstrated PCC. The fixed effects analysis found a significant positive shift toward person-centeredness; this was observed in 6 of the selected LIWC categories post CD. Specifically, there was a notable increase in words associated with achievement (β=.774, P<.001), power (β=.831, P<.001), money (β=.204, P<.001), physical health (β=.427, P=.03), while leisure words decreased (β=-.166, P=.002).

Conclusions

By using a dictionary-based approach, the study identified how CD might influence the integration of PCC principles within clinical notes. Although the results were mixed, the findings highlight the potential effectiveness of CD in enhancing person-centeredness in clinic notes. By leveraging NLP techniques, this research illuminated the value of narrative clinical notes in assessing the quality of care in behavioral health contexts. These findings underscore the promise of NLP for quality assurance in health care settings and emphasize the need for refining algorithms to more accurately measure PCC.

Collapse

Shin D, Kim H, Lee S, Cho Y, Jung W. Using Large Language Models to Detect Depression From User-Generated Diary Text Data as a Novel Approach in Digital Mental Health Screening: Instrument Validation Study. J Med Internet Res 2024;26:e54617. [PMID: 39292502 DOI: 10.2196/54617] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2023] [Revised: 05/17/2024] [Accepted: 08/11/2024] [Indexed: 09/19/2024] Open

Abstract

BACKGROUND

Depressive disorders have substantial global implications, leading to various social consequences, including decreased occupational productivity and a high disability burden. Early detection and intervention for clinically significant depression have gained attention; however, the existing depression screening tools, such as the Center for Epidemiologic Studies Depression Scale, have limitations in objectivity and accuracy. Therefore, researchers are identifying objective indicators of depression, including image analysis, blood biomarkers, and ecological momentary assessments (EMAs). Among EMAs, user-generated text data, particularly from diary writing, have emerged as a clinically significant and analyzable source for detecting or diagnosing depression, leveraging advancements in large language models such as ChatGPT.

OBJECTIVE

We aimed to detect depression based on user-generated diary text through an emotional diary writing app using a large language model (LLM). We aimed to validate the value of the semistructured diary text data as an EMA data source.

METHODS

Participants were assessed for depression using the Patient Health Questionnaire and suicide risk was evaluated using the Beck Scale for Suicide Ideation before starting and after completing the 2-week diary writing period. The text data from the daily diaries were also used in the analysis. The performance of leading LLMs, such as ChatGPT with GPT-3.5 and GPT-4, was assessed with and without GPT-3.5 fine-tuning on the training data set. The model performance comparison involved the use of chain-of-thought and zero-shot prompting to analyze the text structure and content.

RESULTS

We used 428 diaries from 91 participants; GPT-3.5 fine-tuning demonstrated superior performance in depression detection, achieving an accuracy of 0.902 and a specificity of 0.955. However, the balanced accuracy was the highest (0.844) for GPT-3.5 without fine-tuning and prompt techniques; it displayed a recall of 0.929.

CONCLUSIONS

Both GPT-3.5 and GPT-4.0 demonstrated relatively reasonable performance in recognizing the risk of depression based on diaries. Our findings highlight the potential clinical usefulness of user-generated text data for detecting depression. In addition to measurable indicators, such as step count and physical activity, future research should increasingly emphasize qualitative digital expression.

Collapse

Reutens S, Dandolo C, Looi RCH, Karystianis GC, Looi JCL. The uses and misuses of artificial intelligence in psychiatry: Promises and challenges. Australas Psychiatry 2024:10398562241280348. [PMID: 39222479 DOI: 10.1177/10398562241280348] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 09/04/2024]

Barruel D, Hilbey J, Charlet J, Chaumette B, Krebs MO, Dauriac-Le Masson V. Predicting treatment resistance in schizophrenia patients: Machine learning highlights the role of early pathophysiologic features. Schizophr Res 2024;270:1-10. [PMID: 38823319 DOI: 10.1016/j.schres.2024.05.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/06/2023] [Revised: 05/10/2024] [Accepted: 05/13/2024] [Indexed: 06/03/2024]

Abstract

Detecting patients with a high-risk profile for treatment-resistant schizophrenia (TRS) can be beneficial for implementing individually adapted therapeutic strategies and better understanding the TRS etiology. The aim of this study was to explore, with machine learning methods, the impact of demographic and clinical patient characteristics on TRS prediction, for already established risk factors and unexplored ones. This was a retrospective study of 500 patients admitted during 2020 to the University Hospital Group for Paris Psychiatry. We hypothesized potential TRS risk factors. The selected features were coded into structured variables in a new dataset, by processing patients discharge summaries and medical narratives with natural-language processing methods. We compared three machine learning models (XGBoost, logistic elastic net regression, logistic regression without regularization) for predicting TRS outcome. We analysed feature impact on the models, suggesting the following factors as markers of a high-risk TRS profile: early age at first contact with psychiatry, antipsychotic treatment interruptions due to non-adherence, absence of positive symptoms at baseline, educational problems and adolescence mental disorders in the personal psychiatric history. Specifically, we found a significant association with TRS outcome for age at first contact with psychiatry and medication non-adherence. Our findings on TRS risk factors are consistent with the review of the literature and suggest potential in using early pathophysiologic features for TRS prediction. Results were encouraging with the use of natural-langage processing techniques to leverage raw data provided by discharge summaries, combined with machine leaning models. These findings are a promising step for helping clinicians adapt their guidelines to early detection of TRS.

Collapse

Nunez JJ, Leung B, Ho C, Ng RT, Bates AT. Predicting which patients with cancer will see a psychiatrist or counsellor from their initial oncology consultation document using natural language processing. COMMUNICATIONS MEDICINE 2024;4:69. [PMID: 38589545 PMCID: PMC11001970 DOI: 10.1038/s43856-024-00495-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2023] [Accepted: 03/28/2024] [Indexed: 04/10/2024] Open

Yoo N, Matthews E, Baslock D, Stanhope V. Impact of Collaborative Documentation on Completeness and Length of Clinical Notes in Behavioral Health Settings. Psychiatr Serv 2024;75:186-190. [PMID: 37528697 DOI: 10.1176/appi.ps.20230118] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 08/03/2023]

Romano MF, Shih LC, Paschalidis IC, Au R, Kolachalama VB. Large Language Models in Neurology Research and Future Practice. Neurology 2023;101:1058-1067. [PMID: 37816646 PMCID: PMC10752640 DOI: 10.1212/wnl.0000000000207967] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2023] [Accepted: 09/06/2023] [Indexed: 10/12/2023] Open

Affiliation(s)

Michael F Romano From the Department of Medicine (M.F.R., R.A., V.B.K.), Boston University Chobanian & Avedisian School of Medicine, MA; Department of Radiology and Biomedical Imaging (M.F.R.), University of California, San Francisco; Department of Neurology (L.C.S., R.A.), Boston University Chobanian & Avedisian School of Medicine; Department of Electrical and Computer Engineering (I.C.P.), Division of Systems Engineering, and Department of Biomedical Engineering; Faculty of Computing and Data Sciences (I.C.P., V.B.K.), Boston University; Department of Anatomy and Neurobiology (R.A.); The Framingham Heart Study, Boston University Chobanian & Avedisian School of Medicine; Department of Epidemiology, Boston University School of Public Health; Boston University Alzheimer's Disease Research Center (R.A.); and Department of Computer Science (V.B.K.), Boston University, MA
Ludy C Shih From the Department of Medicine (M.F.R., R.A., V.B.K.), Boston University Chobanian & Avedisian School of Medicine, MA; Department of Radiology and Biomedical Imaging (M.F.R.), University of California, San Francisco; Department of Neurology (L.C.S., R.A.), Boston University Chobanian & Avedisian School of Medicine; Department of Electrical and Computer Engineering (I.C.P.), Division of Systems Engineering, and Department of Biomedical Engineering; Faculty of Computing and Data Sciences (I.C.P., V.B.K.), Boston University; Department of Anatomy and Neurobiology (R.A.); The Framingham Heart Study, Boston University Chobanian & Avedisian School of Medicine; Department of Epidemiology, Boston University School of Public Health; Boston University Alzheimer's Disease Research Center (R.A.); and Department of Computer Science (V.B.K.), Boston University, MA
Ioannis C Paschalidis From the Department of Medicine (M.F.R., R.A., V.B.K.), Boston University Chobanian & Avedisian School of Medicine, MA; Department of Radiology and Biomedical Imaging (M.F.R.), University of California, San Francisco; Department of Neurology (L.C.S., R.A.), Boston University Chobanian & Avedisian School of Medicine; Department of Electrical and Computer Engineering (I.C.P.), Division of Systems Engineering, and Department of Biomedical Engineering; Faculty of Computing and Data Sciences (I.C.P., V.B.K.), Boston University; Department of Anatomy and Neurobiology (R.A.); The Framingham Heart Study, Boston University Chobanian & Avedisian School of Medicine; Department of Epidemiology, Boston University School of Public Health; Boston University Alzheimer's Disease Research Center (R.A.); and Department of Computer Science (V.B.K.), Boston University, MA
Rhoda Au From the Department of Medicine (M.F.R., R.A., V.B.K.), Boston University Chobanian & Avedisian School of Medicine, MA; Department of Radiology and Biomedical Imaging (M.F.R.), University of California, San Francisco; Department of Neurology (L.C.S., R.A.), Boston University Chobanian & Avedisian School of Medicine; Department of Electrical and Computer Engineering (I.C.P.), Division of Systems Engineering, and Department of Biomedical Engineering; Faculty of Computing and Data Sciences (I.C.P., V.B.K.), Boston University; Department of Anatomy and Neurobiology (R.A.); The Framingham Heart Study, Boston University Chobanian & Avedisian School of Medicine; Department of Epidemiology, Boston University School of Public Health; Boston University Alzheimer's Disease Research Center (R.A.); and Department of Computer Science (V.B.K.), Boston University, MA
Vijaya B Kolachalama From the Department of Medicine (M.F.R., R.A., V.B.K.), Boston University Chobanian & Avedisian School of Medicine, MA; Department of Radiology and Biomedical Imaging (M.F.R.), University of California, San Francisco; Department of Neurology (L.C.S., R.A.), Boston University Chobanian & Avedisian School of Medicine; Department of Electrical and Computer Engineering (I.C.P.), Division of Systems Engineering, and Department of Biomedical Engineering; Faculty of Computing and Data Sciences (I.C.P., V.B.K.), Boston University; Department of Anatomy and Neurobiology (R.A.); The Framingham Heart Study, Boston University Chobanian & Avedisian School of Medicine; Department of Epidemiology, Boston University School of Public Health; Boston University Alzheimer's Disease Research Center (R.A.); and Department of Computer Science (V.B.K.), Boston University, MA.

Collapse

Chafjiri FMA, Reece L, Voke L, Landschaft A, Clark J, Kimia AA, Loddenkemper T. Natural language processing for identification of refractory status epilepticus in children. Epilepsia 2023;64:3227-3237. [PMID: 37804085 DOI: 10.1111/epi.17789] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2023] [Revised: 10/03/2023] [Accepted: 10/03/2023] [Indexed: 10/08/2023]

Zantvoort K, Scharfenberger J, Boß L, Lehr D, Funk B. Finding the Best Match - a Case Study on the (Text-)Feature and Model Choice in Digital Mental Health Interventions. JOURNAL OF HEALTHCARE INFORMATICS RESEARCH 2023;7:447-479. [PMID: 37927375 PMCID: PMC10620349 DOI: 10.1007/s41666-023-00148-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2022] [Accepted: 08/29/2023] [Indexed: 11/07/2023]

Abstract

With the need for psychological help long exceeding the supply, finding ways of scaling, and better allocating mental health support is a necessity. This paper contributes by investigating how to best predict intervention dropout and failure to allow for a need-based adaptation of treatment. We systematically compare the predictive power of different text representation methods (metadata, TF-IDF, sentiment and topic analysis, and word embeddings) in combination with supplementary numerical inputs (socio-demographic, evaluation, and closed-question data). Additionally, we address the research gap of which ML model types - ranging from linear to sophisticated deep learning models - are best suited for different features and outcome variables. To this end, we analyze nearly 16.000 open-text answers from 849 German-speaking users in a Digital Mental Health Intervention (DMHI) for stress. Our research proves that - contrary to previous findings - there is great promise in using neural network approaches on DMHI text data. We propose a task-specific LSTM-based model architecture to tackle the challenge of long input sequences and thereby demonstrate the potential of word embeddings (AUC scores of up to 0.7) for predictions in DMHIs. Despite the relatively small data set, sequential deep learning models, on average, outperform simpler features such as metadata and bag-of-words approaches when predicting dropout. The conclusion is that user-generated text of the first two sessions carries predictive power regarding patients' dropout and intervention failure risk. Furthermore, the match between the sophistication of features and models needs to be closely considered to optimize results, and additional non-text features increase prediction results.

Supplementary Information

The online version contains supplementary material available at 10.1007/s41666-023-00148-z.

Collapse

Niu H, Pan Q, Xu K. Hybrid deep learning models with multi-classification investor sentiment to forecast the prices of China's leading stocks. PLoS One 2023;18:e0294460. [PMID: 38011183 PMCID: PMC10681238 DOI: 10.1371/journal.pone.0294460] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2023] [Accepted: 10/31/2023] [Indexed: 11/29/2023] Open

Crubezy M, Douay C, Michel P, Haesebaert J. Using patient comments from a standardised experience survey to investigate their perceptions and prioritise improvement actions: a thematic and syntactic analysis. BMC Health Serv Res 2023;23:988. [PMID: 37710317 PMCID: PMC10503051 DOI: 10.1186/s12913-023-09953-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2022] [Accepted: 08/22/2023] [Indexed: 09/16/2023] Open

Washington P, Wall DP. A Review of and Roadmap for Data Science and Machine Learning for the Neuropsychiatric Phenotype of Autism. Annu Rev Biomed Data Sci 2023;6:211-228. [PMID: 37137169 PMCID: PMC11093217 DOI: 10.1146/annurev-biodatasci-020722-125454] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/05/2023]

What users’ musical preference on Twitter reveals about psychological disorders. Inf Process Manag 2023. [DOI: 10.1016/j.ipm.2023.103269] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]

Crubezy M, Haesebaert J, Geig A, Michel P. [E-Satis : A new method for analysis of Patient-Reported Outcome Measures (PROMs)]. Rev Epidemiol Sante Publique 2023;71:101839. [PMID: 37120979 DOI: 10.1016/j.respe.2023.101839] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2022] [Revised: 03/13/2023] [Accepted: 03/13/2023] [Indexed: 05/02/2023] Open

Abstract

OBJECTIVE

Almost 80% of the patients responding to the nationwide French patient experience and satisfaction survey (e-Satis) provided free text comments. The objective of this article is to describe an innovative methodology for analysis of this qualitative data.

METHODOLOGY

This methodological approach is based on analysis of qualitative data from the comments (verbatims) of respondents to the e-Satis survey. Analysis of the verbatims consists in three main steps: (i) analysis of the meaning of the words, with constitution of a thematic dictionary through exploratory research without preconceived notions; (ii) analysis of the syntax, i.e., the way in which the ideas are articulated, which will enable calculation of a linguistic indicator of speakers' involvement in their speech; (iii) production of statistics and characterisation of the themes, which will include three indicators: occurrence of the themes, the average satisfaction shown in the respondents' discourse, and the positive and negative involvement with which they express themselves. Given these results, a priority matrix of four categories of action is established: strong points, priority areas, good practices, and weak signals.

RESULTS

This methodological approach was applied to 5868 e-Satis questionnaires out of a total of 10,061 verbatims by respondents hospitalised at the Hospices Civils de Lyon between 2018 and 2019. The analysis identified 28 major themes with 184 sub-themes. An extract is presented in this article for illustration purposes.

DISCUSSION

A methodological approach based on analysis of qualitative data will enable transformation of unstructured data (verbatims) into measurable and comparable data. This methodology is structured to overcome the limitations of closed questions; open questions allow respondents to describe their experiences and perceptions in their own words. Moreover, it is a first step toward comparability of results over time with those of other establishments. This approach is unique in France on account of (a) its exploratory thematic research without preconceived notions and (b) its syntactic analysis of verbatims.

CONCLUSIONS

This verbatim analysis methodology should enable precise and operational characterization of Patient Experience and induce prioritized improvement actions in healthcare institutions.

Collapse

Gauld C, Pignon B, Fourneret P, Dubertret C, Tebeka S. Comparison of relative areas of interest between major depression disorder and postpartum depression. Prog Neuropsychopharmacol Biol Psychiatry 2023;121:110671. [PMID: 36341842 DOI: 10.1016/j.pnpbp.2022.110671] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/20/2022] [Revised: 10/11/2022] [Accepted: 10/26/2022] [Indexed: 11/06/2022]

Yew ANJ, Schraagen M, Otte WM, van Diessen E. Transforming epilepsy research: A systematic review on natural language processing applications. Epilepsia 2023;64:292-305. [PMID: 36462150 PMCID: PMC10108221 DOI: 10.1111/epi.17474] [Citation(s) in RCA: 14] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2022] [Revised: 11/23/2022] [Accepted: 12/01/2022] [Indexed: 12/05/2022]

Luo L, You W, DelBello MP, Gong Q, Li F. Recent advances in psychoradiology. Phys Med Biol 2022;67. [PMID: 36279868 DOI: 10.1088/1361-6560/ac9d1e] [Citation(s) in RCA: 20] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2022] [Accepted: 10/24/2022] [Indexed: 11/24/2022]

Cellini P, Pigoni A, Delvecchio G, Moltrasio C, Brambilla P. Machine learning in the prediction of postpartum depression: A review. J Affect Disord 2022;309:350-357. [PMID: 35460742 DOI: 10.1016/j.jad.2022.04.093] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/09/2021] [Revised: 03/29/2022] [Accepted: 04/13/2022] [Indexed: 02/06/2023]

Walsh J, Dwumfour C, Cave J, Griffiths F. Spontaneously generated online patient experience data - how and why is it being used in health research: an umbrella scoping review. BMC Med Res Methodol 2022;22:139. [PMID: 35562661 PMCID: PMC9106384 DOI: 10.1186/s12874-022-01610-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2021] [Accepted: 04/13/2022] [Indexed: 11/10/2022] Open

Seo HY, Song GY, Ku JW, Park HY, Myung W, Kim HJ, Baek CH, Lee N, Sohn JH, Yoo HJ, Park JE. Perceived barriers to psychiatric help-seeking in South Korea by age groups: text mining analyses of social media big data. BMC Psychiatry 2022;22:332. [PMID: 35562709 PMCID: PMC9102713 DOI: 10.1186/s12888-022-03969-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/04/2021] [Accepted: 04/11/2022] [Indexed: 11/21/2022] Open

Natural Language Processing and Machine Learning Supporting the Work of a Psychologist and Its Evaluation on the Example of Support for Psychological Diagnosis of Anorexia. APPLIED SCIENCES-BASEL 2022. [DOI: 10.3390/app12094702] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

Wiegersma S, Hidajat M, Schrieken B, Veldkamp B, Olff M. Improving Web-Based Treatment Intake for Multiple Mental and Substance Use Disorders by Text Mining and Machine Learning: Algorithm Development and Validation. JMIR Ment Health 2022;9:e21111. [PMID: 35404261 PMCID: PMC9039807 DOI: 10.2196/21111] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/05/2020] [Revised: 11/01/2020] [Accepted: 09/28/2021] [Indexed: 11/13/2022] Open

Tagliazucchi E. Language as a Window Into the Altered State of Consciousness Elicited by Psychedelic Drugs. Front Pharmacol 2022;13:812227. [PMID: 35392561 PMCID: PMC8980225 DOI: 10.3389/fphar.2022.812227] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2021] [Accepted: 03/01/2022] [Indexed: 11/22/2022] Open

Rubeis G. iHealth: The ethics of artificial intelligence and big data in mental healthcare. Internet Interv 2022;28:100518. [PMID: 35257003 PMCID: PMC8897624 DOI: 10.1016/j.invent.2022.100518] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/17/2021] [Revised: 01/11/2022] [Accepted: 02/24/2022] [Indexed: 01/13/2023] Open

Bastiaansen JAJ, Veldhuizen EE, De Schepper K, Scheepers FE. Experiences of Siblings of Children With Neurodevelopmental Disorders: Comparing Qualitative Analysis and Machine Learning to Study Narratives. Front Psychiatry 2022;13:719598. [PMID: 35573373 PMCID: PMC9096451 DOI: 10.3389/fpsyt.2022.719598] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/02/2021] [Accepted: 04/06/2022] [Indexed: 11/13/2022] Open

Abstract

INTRODUCTION

Relatively few studies have focused on the wellbeing, experiences and needs of the siblings of children with a psychiatric diagnosis. However, the studies that have been conducted suggest that the impact of such circumstances on these siblings is significant. Studying narratives of diagnosed children or relatives has proven to be a successful approach to gain insights that could help improve care. Only a few attempts have been made to study narratives in psychiatry utilizing a machine learning approach.

METHOD

In this current study, 13 narratives of the experiences of siblings of children with a neurodevelopmental disorders were collected through largely unstructured interviews. The interviews were analyzed using the traditional qualitative, hermeneutic phenomenology method as well as latent Dirichlet allocation (LDA), an unsupervised machine learning method clustering words from documents into topics. One aim of this study was to evaluate the experiences of the siblings in order to find leads to improve care and support for these siblings. Furthermore, the outcomes of both analyses were compared to evaluate the role of machine learning in analyzing narratives.

RESULTS

Qualitative analysis of the interviews led to the formulation of nine main themes: confrontation with conflicts, coping strategies siblings, need for rest and time for myself, need for support and attention from personal circle, wish for normality, influence on personal choices and possibilities for development, doing things together, recommendations and advices, ambivalence and loyalty. Using unsupervised machine learning (LDA) 24 topics were formed that mostly overlapped with the qualitative themes found. Both the qualitative analysis and the LDA analysis detected themes that were unique to the respective analysis.

CONCLUSION

The present study found that studying narratives of siblings of children with a neurodevelopmental disorder contributes to a better understanding of the subjects' experiences. Siblings cope with ambivalent feelings toward their brother or sister and this emotional conflict often leads to adapted behavior. Several coping strategies are developed to deal with the behavior of their brother or sister like seeking support or ignoring. Devoted support, time and attention from close relatives, especially parents, is needed. The LDA analysis didn't appear useful to distract meaning and context from the narratives, but it was proposed that machine learning could be a valuable and quick addition to the traditional qualitative methods by finding overlooked topics and giving a rudimental overview of topics in narratives.

Collapse

Crema C, Attardi G, Sartiano D, Redolfi A. Natural language processing in clinical neuroscience and psychiatry: A review. Front Psychiatry 2022;13:946387. [PMID: 36186874 PMCID: PMC9515453 DOI: 10.3389/fpsyt.2022.946387] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/17/2022] [Accepted: 08/22/2022] [Indexed: 11/13/2022] Open

Kung B, Chiang M, Perera G, Pritchard M, Stewart R. Identifying subtypes of depression in clinician-annotated text: a retrospective cohort study. Sci Rep 2021;11:22426. [PMID: 34789827 PMCID: PMC8599474 DOI: 10.1038/s41598-021-01954-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2021] [Accepted: 11/08/2021] [Indexed: 11/23/2022] Open

Kesler SR, Henneghan AM, Thurman W, Rao V. Identifying themes for assessing cancer-related cognitive impairment identified by topic modeling and qualitative content analysis of public online comments (Preprint). JMIR Cancer 2021;8:e34828. [PMID: 35612878 PMCID: PMC9178450 DOI: 10.2196/34828] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2021] [Revised: 04/28/2022] [Accepted: 05/01/2022] [Indexed: 11/28/2022] Open

Walsh J, Cave J, Griffiths F. Spontaneously Generated Online Patient Experience of Modafinil: A Qualitative and NLP Analysis. Front Digit Health 2021;3:598431. [PMID: 34713085 PMCID: PMC8521895 DOI: 10.3389/fdgth.2021.598431] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2020] [Accepted: 01/27/2021] [Indexed: 11/16/2022] Open

Abstract

Objective: To compare the findings from a qualitative and a natural language processing (NLP) based analysis of online patient experience posts on patient experience of the effectiveness and impact of the drug Modafinil.

Methods: Posts (n = 260) from 5 online social media platforms where posts were publicly available formed the dataset/corpus. Three platforms asked posters to give a numerical rating of Modafinil. Thematic analysis: data was coded and themes generated. Data were categorized into PreModafinil, Acquisition, Dosage, and PostModafinil and compared to identify each poster's own view of whether taking Modafinil was linked to an identifiable outcome. We classified this as positive, mixed, negative, or neutral and compared this with numerical ratings. NLP: Corpus text was speech tagged and keywords and key terms extracted. We identified the following entities: drug names, condition names, symptoms, actions, and side-effects. We searched for simple relationships, collocations, and co-occurrences of entities. To identify causal text, we split the corpus into PreModafinil and PostModafinil and used n-gram analysis. To evaluate sentiment, we calculated the polarity of each post between −1 (negative) and +1 (positive). NLP results were mapped to qualitative results.

Results: Posters had used Modafinil for 33 different primary conditions. Eight themes were identified: the reason for taking (condition or symptom), impact of symptoms, acquisition, dosage, side effects, other interventions tried or compared to, effectiveness of Modafinil, and quality of life outcomes. Posters reported perceived effectiveness as follows: 68% positive, 12% mixed, 18% negative. Our classification was consistent with poster ratings. Of the most frequent 100 keywords/keyterms identified by term extraction 88/100 keywords and 84/100 keyterms mapped directly to the eight themes. Seven keyterms indicated negation and temporal states. Sentiment was as follows 72% positive sentiment 4% neutral 24% negative. Matching of sentiment between the qualitative and NLP methods was accurate in 64.2% of posts. If we allow for one category difference matching was accurate in 85% of posts.

Conclusions: User generated patient experience is a rich resource for evaluating real world effectiveness, understanding patient perspectives, and identifying research gaps. Both methods successfully identified the entities and topics contained in the posts. In contrast to current evidence, posters with a wide range of other conditions found Modafinil effective. Perceived causality and effectiveness were identified by both methods demonstrating the potential to augment existing knowledge.

Collapse

Hudon A, Beaudoin M, Phraxayavong K, Dellazizzo L, Potvin S, Dumais A. Use of Automated Thematic Annotations for Small Data Sets in a Psychotherapeutic Context: Systematic Review of Machine Learning Algorithms. JMIR Ment Health 2021;8:e22651. [PMID: 34677133 PMCID: PMC8571689 DOI: 10.2196/22651] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/19/2020] [Revised: 10/06/2020] [Accepted: 07/27/2021] [Indexed: 11/21/2022] Open

Abstract

BACKGROUND

A growing body of literature has detailed the use of qualitative analyses to measure the therapeutic processes and intrinsic effectiveness of psychotherapies, which yield small databases. Nonetheless, these approaches have several limitations and machine learning algorithms are needed.

OBJECTIVE

The objective of this study is to conduct a systematic review of the use of machine learning for automated text classification for small data sets in the fields of psychiatry, psychology, and social sciences. This review will identify available algorithms and assess if automated classification of textual entities is comparable to the classification done by human evaluators.

METHODS

A systematic search was performed in the electronic databases of Medline, Web of Science, PsycNet (PsycINFO), and Google Scholar from their inception dates to 2021. The fields of psychiatry, psychology, and social sciences were selected as they include a vast array of textual entities in the domain of mental health that can be reviewed. Additional records identified through cross-referencing were used to find other studies.

RESULTS

This literature search identified 5442 articles that were eligible for our study after the removal of duplicates. Following abstract screening, 114 full articles were assessed in their entirety, of which 107 were excluded. The remaining 7 studies were analyzed. Classification algorithms such as naive Bayes, decision tree, and support vector machine classifiers were identified. Support vector machine is the most used algorithm and best performing as per the identified articles. Prediction classification scores for the identified algorithms ranged from 53%-91% for the classification of textual entities in 4-7 categories. In addition, 3 of the 7 studies reported an interjudge agreement statistic; these were consistent with agreement statistics for text classification done by human evaluators.

CONCLUSIONS

A systematic review of available machine learning algorithms for automated text classification for small data sets in several fields (psychiatry, psychology, and social sciences) was conducted. We compared automated classification with classification done by human evaluators. Our results show that it is possible to automatically classify textual entities of a transcript based solely on small databases. Future studies are nevertheless needed to assess whether such algorithms can be implemented in the context of psychotherapies.

Collapse

Ji M, Xie W, Huang R, Qian X. Forecasting the Suitability of Online Mental Health Information for Effective Self-Care Developing Machine Learning Classifiers Using Natural Language Features. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2021;18:ijerph181910048. [PMID: 34639348 PMCID: PMC8507671 DOI: 10.3390/ijerph181910048] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/15/2021] [Revised: 09/10/2021] [Accepted: 09/16/2021] [Indexed: 11/16/2022]

Abstract

Background: Online mental health information represents important resources for people living with mental health issues. Suitability of mental health information for effective self-care remains understudied, despite the increasing needs for more actionable mental health resources, especially among young people. Objective: We aimed to develop Bayesian machine learning classifiers as data-based decision aids for the assessment of the actionability of credible mental health information for people with mental health issues and diseases. Methods: We collected and classified creditable online health information on mental health issues into generic mental health (GEN) information and patient-specific (PAS) mental health information. GEN and PAS were both patient-oriented health resources developed by health authorities of mental health and public health promotion. GENs were non-classified online health information without indication of targeted readerships; PASs were developed purposefully for specific populations (young, elderly people, pregnant women, and men) as indicated by their website labels. To ensure the generalisability of our model, we chose to develop a sparse Bayesian machine learning classifier using Relevance Vector Machine (RVM). Results: Using optimisation and normalisation techniques, we developed a best-performing classifier through joint optimisation of natural language features and min-max normalisation of feature frequencies. The AUC (0.957), sensitivity (0.900), and specificity (0.953) of the best model were statistically higher (p < 0.05) than other models using parallel optimisation of structural and semantic features with or without feature normalisation. We subsequently evaluated the diagnostic utility of our model in the clinic by comparing its positive (LR+) and negative likelihood ratios (LR−) and 95% confidence intervals (95% C.I.) as we adjusted the probability thresholds with the range of 0.1 and 0.9. We found that the best pair of LR+ (18.031, 95% C.I.: 10.992, 29.577) and LR− (0.100, 95% C.I.: 0.068, 0.148) was found when the probability threshold was set to 0.45 associated with a sensitivity of 0.905 (95%: 0.867, 0.942) and specificity of 0.950 (95% C.I.: 0.925, 0.975). These statistical properties of our model suggested its applicability in the clinic. Conclusion: Our study found that PAS had significant advantage over GEN mental health information regarding information actionability, engagement, and suitability for specific populations with distinct mental health issues. GEN is more suitable for general mental health information acquisition, whereas PAS can effectively engage patients and provide more effective and needed self-care support. The Bayesian machine learning classifier developed provided automatic tools to support decision making in the clinic to identify more actionable resources, effective to support self-care among different populations.

Collapse

Grzenda A, Kraguljac NV, McDonald WM, Nemeroff C, Torous J, Alpert JE, Rodriguez CI, Widge AS. Evaluating the Machine Learning Literature: A Primer and User's Guide for Psychiatrists. Am J Psychiatry 2021;178:715-729. [PMID: 34080891 DOI: 10.1176/appi.ajp.2020.20030250] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]

Affiliation(s)

Adrienne Grzenda Department of Psychiatry and Biobehavioral Sciences, David Geffen School of Medicine, University of California, Los Angeles, and Olive View-UCLA Medical Center, Sylmar (Grzenda); Department of Psychiatry and Behavioral Neurobiology, University of Alabama at Birmingham (Kraguljac); Department of Psychiatry and Behavioral Sciences, Emory University School of Medicine, Atlanta (McDonald); Department of Psychiatry, University of Texas Dell Medical School, Austin (Nemeroff); Department of Psychiatry, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston (Torous); Department of Psychiatry and Behavioral Sciences, Albert Einstein School of Medicine, Bronx, N.Y. (Alpert); Department of Psychiatry and Behavioral Sciences, Stanford University, Stanford, Calif., and Veterans Affairs Palo Alto Health Care System, Palo Alto, Calif. (Rodriguez); Department of Psychiatry and Behavioral Sciences, University of Minnesota, Minneapolis (Widge)
Nina V Kraguljac Department of Psychiatry and Biobehavioral Sciences, David Geffen School of Medicine, University of California, Los Angeles, and Olive View-UCLA Medical Center, Sylmar (Grzenda); Department of Psychiatry and Behavioral Neurobiology, University of Alabama at Birmingham (Kraguljac); Department of Psychiatry and Behavioral Sciences, Emory University School of Medicine, Atlanta (McDonald); Department of Psychiatry, University of Texas Dell Medical School, Austin (Nemeroff); Department of Psychiatry, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston (Torous); Department of Psychiatry and Behavioral Sciences, Albert Einstein School of Medicine, Bronx, N.Y. (Alpert); Department of Psychiatry and Behavioral Sciences, Stanford University, Stanford, Calif., and Veterans Affairs Palo Alto Health Care System, Palo Alto, Calif. (Rodriguez); Department of Psychiatry and Behavioral Sciences, University of Minnesota, Minneapolis (Widge)
William M McDonald Department of Psychiatry and Biobehavioral Sciences, David Geffen School of Medicine, University of California, Los Angeles, and Olive View-UCLA Medical Center, Sylmar (Grzenda); Department of Psychiatry and Behavioral Neurobiology, University of Alabama at Birmingham (Kraguljac); Department of Psychiatry and Behavioral Sciences, Emory University School of Medicine, Atlanta (McDonald); Department of Psychiatry, University of Texas Dell Medical School, Austin (Nemeroff); Department of Psychiatry, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston (Torous); Department of Psychiatry and Behavioral Sciences, Albert Einstein School of Medicine, Bronx, N.Y. (Alpert); Department of Psychiatry and Behavioral Sciences, Stanford University, Stanford, Calif., and Veterans Affairs Palo Alto Health Care System, Palo Alto, Calif. (Rodriguez); Department of Psychiatry and Behavioral Sciences, University of Minnesota, Minneapolis (Widge)
Charles Nemeroff Department of Psychiatry and Biobehavioral Sciences, David Geffen School of Medicine, University of California, Los Angeles, and Olive View-UCLA Medical Center, Sylmar (Grzenda); Department of Psychiatry and Behavioral Neurobiology, University of Alabama at Birmingham (Kraguljac); Department of Psychiatry and Behavioral Sciences, Emory University School of Medicine, Atlanta (McDonald); Department of Psychiatry, University of Texas Dell Medical School, Austin (Nemeroff); Department of Psychiatry, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston (Torous); Department of Psychiatry and Behavioral Sciences, Albert Einstein School of Medicine, Bronx, N.Y. (Alpert); Department of Psychiatry and Behavioral Sciences, Stanford University, Stanford, Calif., and Veterans Affairs Palo Alto Health Care System, Palo Alto, Calif. (Rodriguez); Department of Psychiatry and Behavioral Sciences, University of Minnesota, Minneapolis (Widge)
John Torous Department of Psychiatry and Biobehavioral Sciences, David Geffen School of Medicine, University of California, Los Angeles, and Olive View-UCLA Medical Center, Sylmar (Grzenda); Department of Psychiatry and Behavioral Neurobiology, University of Alabama at Birmingham (Kraguljac); Department of Psychiatry and Behavioral Sciences, Emory University School of Medicine, Atlanta (McDonald); Department of Psychiatry, University of Texas Dell Medical School, Austin (Nemeroff); Department of Psychiatry, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston (Torous); Department of Psychiatry and Behavioral Sciences, Albert Einstein School of Medicine, Bronx, N.Y. (Alpert); Department of Psychiatry and Behavioral Sciences, Stanford University, Stanford, Calif., and Veterans Affairs Palo Alto Health Care System, Palo Alto, Calif. (Rodriguez); Department of Psychiatry and Behavioral Sciences, University of Minnesota, Minneapolis (Widge)
Jonathan E Alpert Department of Psychiatry and Biobehavioral Sciences, David Geffen School of Medicine, University of California, Los Angeles, and Olive View-UCLA Medical Center, Sylmar (Grzenda); Department of Psychiatry and Behavioral Neurobiology, University of Alabama at Birmingham (Kraguljac); Department of Psychiatry and Behavioral Sciences, Emory University School of Medicine, Atlanta (McDonald); Department of Psychiatry, University of Texas Dell Medical School, Austin (Nemeroff); Department of Psychiatry, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston (Torous); Department of Psychiatry and Behavioral Sciences, Albert Einstein School of Medicine, Bronx, N.Y. (Alpert); Department of Psychiatry and Behavioral Sciences, Stanford University, Stanford, Calif., and Veterans Affairs Palo Alto Health Care System, Palo Alto, Calif. (Rodriguez); Department of Psychiatry and Behavioral Sciences, University of Minnesota, Minneapolis (Widge)
Carolyn I Rodriguez Department of Psychiatry and Biobehavioral Sciences, David Geffen School of Medicine, University of California, Los Angeles, and Olive View-UCLA Medical Center, Sylmar (Grzenda); Department of Psychiatry and Behavioral Neurobiology, University of Alabama at Birmingham (Kraguljac); Department of Psychiatry and Behavioral Sciences, Emory University School of Medicine, Atlanta (McDonald); Department of Psychiatry, University of Texas Dell Medical School, Austin (Nemeroff); Department of Psychiatry, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston (Torous); Department of Psychiatry and Behavioral Sciences, Albert Einstein School of Medicine, Bronx, N.Y. (Alpert); Department of Psychiatry and Behavioral Sciences, Stanford University, Stanford, Calif., and Veterans Affairs Palo Alto Health Care System, Palo Alto, Calif. (Rodriguez); Department of Psychiatry and Behavioral Sciences, University of Minnesota, Minneapolis (Widge)
Alik S Widge Department of Psychiatry and Biobehavioral Sciences, David Geffen School of Medicine, University of California, Los Angeles, and Olive View-UCLA Medical Center, Sylmar (Grzenda); Department of Psychiatry and Behavioral Neurobiology, University of Alabama at Birmingham (Kraguljac); Department of Psychiatry and Behavioral Sciences, Emory University School of Medicine, Atlanta (McDonald); Department of Psychiatry, University of Texas Dell Medical School, Austin (Nemeroff); Department of Psychiatry, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston (Torous); Department of Psychiatry and Behavioral Sciences, Albert Einstein School of Medicine, Bronx, N.Y. (Alpert); Department of Psychiatry and Behavioral Sciences, Stanford University, Stanford, Calif., and Veterans Affairs Palo Alto Health Care System, Palo Alto, Calif. (Rodriguez); Department of Psychiatry and Behavioral Sciences, University of Minnesota, Minneapolis (Widge)

Collapse

AlSaieedi A, Salhi A, Tifratene F, Raies AB, Hungler A, Uludag M, Van Neste C, Bajic VB, Gojobori T, Essack M. DES-Tcell is a knowledgebase for exploring immunology-related literature. Sci Rep 2021;11:14344. [PMID: 34253812 PMCID: PMC8275784 DOI: 10.1038/s41598-021-93809-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2021] [Accepted: 06/24/2021] [Indexed: 12/02/2022] Open

Abstract

T-cells are a subtype of white blood cells circulating throughout the body, searching for infected and abnormal cells. They have multifaceted functions that include scanning for and directly killing cells infected with intracellular pathogens, eradicating abnormal cells, orchestrating immune response by activating and helping other immune cells, memorizing encountered pathogens, and providing long-lasting protection upon recurrent infections. However, T-cells are also involved in immune responses that result in organ transplant rejection, autoimmune diseases, and some allergic diseases. To support T-cell research, we developed the DES-Tcell knowledgebase (KB). This KB incorporates text- and data-mined information that can expedite retrieval and exploration of T-cell relevant information from the large volume of published T-cell-related research. This KB enables exploration of data through concepts from 15 topic-specific dictionaries, including immunology-related genes, mutations, pathogens, and pathways. We developed three case studies using DES-Tcell, one of which validates effective retrieval of known associations by DES-Tcell. The second and third case studies focuses on concepts that are common to Grave’s disease (GD) and Hashimoto’s thyroiditis (HT). Several reports have shown that up to 20% of GD patients treated with antithyroid medication develop HT, thus suggesting a possible conversion or shift from GD to HT disease. DES-Tcell found miR-4442 links to both GD and HT, and that miR-4442 possibly targets the autoimmune disease risk factor CD6, which provides potential new knowledge derived through the use of DES-Tcell. According to our understanding, DES-Tcell is the first KB dedicated to exploring T-cell-relevant information via literature-mining, data-mining, and topic-specific dictionaries.

Collapse

Affiliation(s)

Ahdab AlSaieedi Department of Medical Laboratory Technology (MLT), Faculty of Applied Medical Sciences (FAMS), King Abdulaziz University (KAU), Jeddah, 21589-80324, Saudi Arabia
Adil Salhi Computer, Electrical, and Mathematical Sciences and Engineering Division (CEMSE), Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
Faroug Tifratene Computer, Electrical, and Mathematical Sciences and Engineering Division (CEMSE), Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
Arwa Bin Raies Computer, Electrical, and Mathematical Sciences and Engineering Division (CEMSE), Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
Arnaud Hungler Computer, Electrical, and Mathematical Sciences and Engineering Division (CEMSE), Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
Mahmut Uludag Computer, Electrical, and Mathematical Sciences and Engineering Division (CEMSE), Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
Christophe Van Neste Computer, Electrical, and Mathematical Sciences and Engineering Division (CEMSE), Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
Vladimir B Bajic Computer, Electrical, and Mathematical Sciences and Engineering Division (CEMSE), Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
Takashi Gojobori Computer, Electrical, and Mathematical Sciences and Engineering Division (CEMSE), Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
Magbubah Essack Computer, Electrical, and Mathematical Sciences and Engineering Division (CEMSE), Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia.

Collapse

Withall A, Karystianis G, Duncan D, Hwang YI, Hagos Kidane A, Butler T. Domestic Violence in Residential Care Facilities in New South Wales, Australia: A Text Mining Study. THE GERONTOLOGIST 2021;62:223-231. [PMID: 34023902 DOI: 10.1093/geront/gnab068] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2020] [Indexed: 11/14/2022] Open

Turchin A, Florez Builes LF. Using Natural Language Processing to Measure and Improve Quality of Diabetes Care: A Systematic Review. J Diabetes Sci Technol 2021;15:553-560. [PMID: 33736486 PMCID: PMC8120048 DOI: 10.1177/19322968211000831] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]

Khan AH, Abbe A, Falissard B, Carita P, Bachert C, Mullol J, Reaney M, Chao J, Mannent LP, Amin N, Mahajan P, Pirozzi G, Eckert L. Data Mining of Free-Text Responses: An Innovative Approach to Analyzing Patient Perspectives on Treatment for Chronic Rhinosinusitis with Nasal Polyps in a Phase IIa Proof-of-Concept Study for Dupilumab. Patient Prefer Adherence 2021;15:2577-2586. [PMID: 34848949 PMCID: PMC8611726 DOI: 10.2147/ppa.s320242] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/15/2021] [Accepted: 11/05/2021] [Indexed: 11/23/2022] Open

Abstract

PURPOSE

Patient perspective is an important and increasingly sought-after complement to clinical assessment. The aim of this study was to transcribe individual patients' experience of treatment in a dupilumab clinical trial through free-text responses with analysis using natural language processing (NLP) to obtain the unique perspective of patients on disease impact and unmet needs with existing treatment to inform future trial design.

PATIENTS AND METHODS

Patients with chronic rhinosinusitis with nasal polyps (CRSwNP) who were enrolled in a Phase IIa randomized controlled trial comparing dupilumab with placebo (NCT01920893) were invited to complete a self-assessment of treatment (SAT) tool at the end of treatment, asking, "What is your opinion on the treatment you had during the trial? What did you like or dislike about the treatment?" Free-text responses were analyzed for the overall cohort and according to treatment assignment using natural language processing including sentiment scoring. In a mixed-methods approach, quantitative patient-reported outcome (PRO) results were utilized to complement the qualitative analysis of free-text responses.

RESULTS

Of 60 patients enrolled in the study, 43 (71.6%) completed the SAT and responses from 37 patients were analyzed (placebo, n = 16; dupilumab, n = 21). Word analyses showed that the most common words were "smell," "improve," "staff," "great," "time," and "good." Across the whole cohort, "smell" was the most common symptom-related word. The words "smell" and "experience" were more likely to occur in patients treated with dupilumab. Patients treated with dupilumab also had more positive sentiment in their SAT responses than those who received placebo. The results from this qualitative analysis were reflected in quantitative PRO results.

CONCLUSION

"Smell" was important to patients with CRSwNP, highlighting its importance as a patient-centric efficacy outcome measure in the context of clinical trials in CRSwNP.

TRIAL REGISTRATION

ClinicalTrials.gov, NCT01920893. Registered 12 August 2013, https://www.clinicaltrials.gov/ct2/show/NCT01920893.

Collapse

Karystianis G, Adily A, Schofield PW, Wand H, Lukmanjaya W, Buchan I, Nenadic G, Butler T. Surveillance of Domestic Violence Using Text Mining Outputs From Australian Police Records. Front Psychiatry 2021;12:787792. [PMID: 35222105 PMCID: PMC8863744 DOI: 10.3389/fpsyt.2021.787792] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/04/2021] [Accepted: 12/01/2021] [Indexed: 11/23/2022] Open

Karystianis G, Simpson A, Adily A, Schofield P, Greenberg D, Wand H, Nenadic G, Butler T. Prevalence of Mental Illnesses in Domestic Violence Police Records: Text Mining Study. J Med Internet Res 2020;22:e23725. [PMID: 33361056 PMCID: PMC7790609 DOI: 10.2196/23725] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2020] [Revised: 09/17/2020] [Accepted: 11/23/2020] [Indexed: 01/22/2023] Open

Menadue CB. Pandemics, epidemics, viruses, plagues, and disease: Comparative frequency analysis of a cultural pathology reflected in science fiction magazines from 1926 to 2015. ACTA ACUST UNITED AC 2020;2:100048. [PMID: 34173491 PMCID: PMC7480741 DOI: 10.1016/j.ssaho.2020.100048] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2020] [Revised: 07/13/2020] [Accepted: 07/13/2020] [Indexed: 12/03/2022]

Smith Y, Garcia-Torres R, Coughlin SS, Ling J, Marin T, Su S, Young L. Effectiveness of Social Cognitive Theory-Based Interventions for Glycemic Control in Adults With Type 2 Diabetes Mellitus: Protocol for a Systematic Review and Meta-Analysis. JMIR Res Protoc 2020;9:e17148. [PMID: 32673210 PMCID: PMC7495254 DOI: 10.2196/17148] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2019] [Revised: 05/27/2020] [Accepted: 06/14/2020] [Indexed: 11/13/2022] Open

Abstract

BACKGROUND

For those living with type 2 diabetes mellitus (T2DM), failing to engage in self-management behaviors leads to poor glycemic control. Social cognitive theory (SCT) has been shown to improve health behaviors by altering cognitive processes and increasing an individual's belief in their ability to accomplish a task.

OBJECTIVE

We aim to present a protocol for a systematic review and meta-analysis to systematically identify, evaluate, and analyze the effect of SCT-based interventions to improve glycemic control in adults with T2DM.

METHODS

This protocol follows the 2009 Preferred Reporting Items for Systematic Review and Meta-Analysis (PRISMA) guidelines. Data sources will include PubMed, Cumulative Index to Nursing and Allied Health Literature (CINAHL), PsychINFO, Cochrane Library, and Web of Science, and data will be reviewed with the use of customized text mining software. Studies examining SCT-based behavioral interventions for adults diagnosed with T2DM in randomized controlled trials located in the outpatient setting will be included. Intervention effectiveness will be compared with routine care. Screening and data collection will be performed in multiple stages with three reviewers as follows: (1) an independent review of titles/abstracts, (2) a full review, and (3) data collection with alternating teams of two reviewers for disputes to be resolved by a third reviewer. Study quality and risk of bias will be assessed by three reviewers using the Cochrane risk of bias tool. Standardized mean differences will be used to describe the intervention effect sizes with regard to self-efficacy and diabetes knowledge. The raw mean difference of HbA1c will be provided in a random effects model and presented in a forest plot. The expected limitations of this study are incomplete data, the need to contact authors, and analysis of various types of glycemic control measures accurately within the same data set.

RESULTS

This protocol was granted institutional review board exemption on October 7, 2019. PROSPERO registration (ID: CRD42020147105) was received on April 28, 2020. The review began on April 29, 2020. The results of the review will be disseminated through conference presentations, peer-reviewed journals, and meetings.

CONCLUSIONS

This systematic review will appraise the effectiveness of SCT-based interventions for adults diagnosed with T2DM and provide the most effective interventions for improving health behaviors in these patients.

TRIAL REGISTRATION

PROSPERO CRD42020147105; https://www.crd.york.ac.uk/prospero/display_record.php?RecordID=147105.

INTERNATIONAL REGISTERED REPORT IDENTIFIER (IRRID)

PRR1-10.2196/17148.

Collapse

Falissard B, Simpson EL, Guttman-Yassky E, Papp KA, Barbarot S, Gadkari A, Saba G, Gautier L, Abbe A, Eckert L. Qualitative Assessment of Adult Patients' Perception of Atopic Dermatitis Using Natural Language Processing Analysis in a Cross-Sectional Study. Dermatol Ther (Heidelb) 2020;10:297-305. [PMID: 32006346 PMCID: PMC7090107 DOI: 10.1007/s13555-020-00356-0] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2019] [Indexed: 11/24/2022] Open

Abstract

INTRODUCTION

Atopic dermatitis (AD) is an incurable, inflammatory skin disease characterized by skin barrier disruption and immune dysregulation. Although AD is considered a childhood disease, adult onset is possible, presenting with daily sleep disturbance and functional impairment associated with itch, neuropsychiatric issues (anxiety and depression), and reduced health-related quality of life. Although such aspects of adult AD disease burden have been measured through standardized assessments and based on population-level data, the understanding of the disease experienced at the patient level remains poor. This text-mining study assessed the impact of AD on the lives of adult patients as described from an experiential perspective.

METHODS

Natural language processing (NLP) was applied to qualitative patient response data from two large-scale international cross-sectional surveys conducted in the USA and countries outside of the USA (non-USA; Canada, France, Germany, Italy, Spain, and the UK). Descriptive analysis was conducted on patient responses to an open-ended question on how they felt about their AD and how the disease affected their life. Character length, word count, and stop word (common words) count were evaluated; centrality analysis identified concepts that were most strongly interlinked.

RESULTS

Patients with AD in all countries were most frequently impacted by itch, pain, and embarrassment across all levels of disease severity. Patients with moderate-to-severe AD were more likely than patients with mild AD to describe sleep disturbances, fatigue, and feelings of depression, anxiety, and a lack of hope that were directly associated with AD. Centrality analysis revealed sleep disturbance was strongly linked with itch. Collectively, these concepts revealed that patients with AD are impacted by both physical and emotional burdens that are intricately connected.

CONCLUSIONS

Qualitative data from NLP, being more patient-centric than data from clinical standardized measures, provide a more comprehensive view of the burden of AD to inform disease management.

Collapse

DES-ROD: Exploring Literature to Develop New Links between RNA Oxidation and Human Diseases. OXIDATIVE MEDICINE AND CELLULAR LONGEVITY 2020;2020:5904315. [PMID: 32308806 PMCID: PMC7142358 DOI: 10.1155/2020/5904315] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/09/2020] [Accepted: 02/21/2020] [Indexed: 12/27/2022]

Abstract

Normal cellular physiology and biochemical processes require undamaged RNA molecules. However, RNAs are frequently subjected to oxidative damage. Overproduction of reactive oxygen species (ROS) leads to RNA oxidation and disturbs redox (oxidation-reduction reaction) homeostasis. When oxidation damage affects RNA carrying protein-coding information, this may result in the synthesis of aberrant proteins as well as a lower efficiency of translation. Both of these, as well as imbalanced redox homeostasis, may lead to numerous human diseases. The number of studies on the effects of RNA oxidative damage in mammals is increasing by year due to the understanding that this oxidation fundamentally leads to numerous human diseases. To enable researchers in this field to explore information relevant to RNA oxidation and effects on human diseases, we developed DES-ROD, an online knowledgebase that contains processed information from 298,603 relevant documents that consist of PubMed abstracts and PubMed Central full-text articles. The system utilizes concepts/terms from 38 curated thematic dictionaries mapped to the analyzed documents. Researchers can explore enriched concepts, as well as enriched pairs of putatively associated concepts. In this way, one can explore mutual relationships between any combinations of two concepts from used dictionaries. Dictionaries cover a wide range of biomedical topics, such as human genes and proteins, pathways, Gene Ontology categories, mutations, noncoding RNAs, enzymes, toxins, metabolites, and diseases. This makes insights into different facets of the effects of RNA oxidation and the control of this process possible. The usefulness of the DES-ROD system is demonstrated by case studies on some known information, as well as potentially novel information involving RNA oxidation and diseases. DES-ROD is the first knowledgebase based on text and data mining that focused on the exploration of RNA oxidation and human diseases.

Collapse

Low DM, Bentley KH, Ghosh SS. Automated assessment of psychiatric disorders using speech: A systematic review. Laryngoscope Investig Otolaryngol 2020;5:96-116. [PMID: 32128436 PMCID: PMC7042657 DOI: 10.1002/lio2.354] [Citation(s) in RCA: 138] [Impact Index Per Article: 34.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2019] [Revised: 12/31/2019] [Accepted: 01/17/2020] [Indexed: 12/31/2022] Open

Abstract

OBJECTIVE

There are many barriers to accessing mental health assessments including cost and stigma. Even when individuals receive professional care, assessments are intermittent and may be limited partly due to the episodic nature of psychiatric symptoms. Therefore, machine-learning technology using speech samples obtained in the clinic or remotely could one day be a biomarker to improve diagnosis and treatment. To date, reviews have only focused on using acoustic features from speech to detect depression and schizophrenia. Here, we present the first systematic review of studies using speech for automated assessments across a broader range of psychiatric disorders.

METHODS

We followed the Preferred Reporting Items for Systematic Reviews and Meta-Analysis (PRISMA) guidelines. We included studies from the last 10 years using speech to identify the presence or severity of disorders within the Diagnostic and Statistical Manual of Mental Disorders (DSM-5). For each study, we describe sample size, clinical evaluation method, speech-eliciting tasks, machine learning methodology, performance, and other relevant findings.

RESULTS

1395 studies were screened of which 127 studies met the inclusion criteria. The majority of studies were on depression, schizophrenia, and bipolar disorder, and the remaining on post-traumatic stress disorder, anxiety disorders, and eating disorders. 63% of studies built machine learning predictive models, and the remaining 37% performed null-hypothesis testing only. We provide an online database with our search results and synthesize how acoustic features appear in each disorder.

CONCLUSION

Speech processing technology could aid mental health assessments, but there are many obstacles to overcome, especially the need for comprehensive transdiagnostic and longitudinal studies. Given the diverse types of data sets, feature extraction, computational methodologies, and evaluation criteria, we provide guidelines for both acquiring data and building machine learning models with a focus on testing hypotheses, open science, reproducibility, and generalizability.

LEVEL OF EVIDENCE

3a.

Collapse

Walsh CG, Chaudhry B, Dua P, Goodman KW, Kaplan B, Kavuluru R, Solomonides A, Subbian V. Stigma, biomarkers, and algorithmic bias: recommendations for precision behavioral health with artificial intelligence. JAMIA Open 2020;3:9-15. [PMID: 32607482 PMCID: PMC7309258 DOI: 10.1093/jamiaopen/ooz054] [Citation(s) in RCA: 39] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2019] [Revised: 07/29/2019] [Accepted: 10/30/2019] [Indexed: 12/22/2022] Open

Wu CS, Kuo CJ, Su CH, Wang SH, Dai HJ. Using text mining to extract depressive symptoms and to validate the diagnosis of major depressive disorder from electronic health records. J Affect Disord 2020;260:617-623. [PMID: 31541973 DOI: 10.1016/j.jad.2019.09.044] [Citation(s) in RCA: 59] [Impact Index Per Article: 14.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/10/2019] [Revised: 07/29/2019] [Accepted: 09/08/2019] [Indexed: 10/26/2022]

Smink W, Sools AM, van der Zwaan JM, Wiegersma S, Veldkamp BP, Westerhof GJ. Towards text mining therapeutic change: A systematic review of text-based methods for Therapeutic Change Process Research. PLoS One 2019;14:e0225703. [PMID: 31805093 PMCID: PMC6894756 DOI: 10.1371/journal.pone.0225703] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2019] [Accepted: 11/11/2019] [Indexed: 01/21/2023] Open

Spasić I, Owen D, Smith A, Button K. KLOSURE: Closing in on open-ended patient questionnaires with text mining. J Biomed Semantics 2019;10:24. [PMID: 31711536 PMCID: PMC6849171 DOI: 10.1186/s13326-019-0215-3] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open

Abstract

Background

Knee injury and Osteoarthritis Outcome Score (KOOS) is an instrument used to quantify patients’ perceptions about their knee condition and associated problems. It is administered as a 42-item closed-ended questionnaire in which patients are asked to self-assess five outcomes: pain, other symptoms, activities of daily living, sport and recreation activities, and quality of life. We developed KLOG as a 10-item open-ended version of the KOOS questionnaire in an attempt to obtain deeper insight into patients’ opinions including their unmet needs. However, the open–ended nature of the questionnaire incurs analytical overhead associated with the interpretation of responses. The goal of this study was to automate such analysis. We implemented KLOSURE as a system for mining free–text responses to the KLOG questionnaire. It consists of two subsystems, one concerned with feature extraction and the other one concerned with classification of feature vectors. Feature extraction is performed by a set of four modules whose main functionalities are linguistic pre-processing, sentiment analysis, named entity recognition and lexicon lookup respectively. Outputs produced by each module are combined into feature vectors. The structure of feature vectors will vary across the KLOG questions. Finally, Weka, a machine learning workbench, was used for classification of feature vectors.

Results

The precision of the system varied between 62.8 and 95.3%, whereas the recall varied from 58.3 to 87.6% across the 10 questions. The overall performance in terms of F–measure varied between 59.0 and 91.3% with an average of 74.4% and a standard deviation of 8.8.

Conclusions

We demonstrated the feasibility of mining open-ended patient questionnaires. By automatically mapping free text answers onto a Likert scale, we can effectively measure the progress of rehabilitation over time. In comparison to traditional closed-ended questionnaires, our approach offers much richer information that can be utilised to support clinical decision making. In conclusion, we demonstrated how text mining can be used to combine the benefits of qualitative and quantitative analysis of patient experiences.

Collapse

Essack M, Salhi A, Stanimirovic J, Tifratene F, Bin Raies A, Hungler A, Uludag M, Van Neste C, Trpkovic A, Bajic VP, Bajic VB, Isenovic ER. Literature-Based Enrichment Insights into Redox Control of Vascular Biology. OXIDATIVE MEDICINE AND CELLULAR LONGEVITY 2019;2019:1769437. [PMID: 31223421 PMCID: PMC6542245 DOI: 10.1155/2019/1769437] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/07/2019] [Revised: 04/11/2019] [Accepted: 05/02/2019] [Indexed: 02/07/2023]

Kim YM. Discovering major opioid-related research themes over time: A text mining technique. AMIA JOINT SUMMITS ON TRANSLATIONAL SCIENCE PROCEEDINGS. AMIA JOINT SUMMITS ON TRANSLATIONAL SCIENCE 2019;2019:751-760. [PMID: 31259032 PMCID: PMC6568063] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]

Sheikhalishahi S, Miotto R, Dudley JT, Lavelli A, Rinaldi F, Osmani V. Natural Language Processing of Clinical Notes on Chronic Diseases: Systematic Review. JMIR Med Inform 2019;7:e12239. [PMID: 31066697 PMCID: PMC6528438 DOI: 10.2196/12239] [Citation(s) in RCA: 204] [Impact Index Per Article: 40.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2018] [Revised: 03/04/2019] [Accepted: 03/24/2019] [Indexed: 01/08/2023] Open

Abstract

BACKGROUND

Novel approaches that complement and go beyond evidence-based medicine are required in the domain of chronic diseases, given the growing incidence of such conditions on the worldwide population. A promising avenue is the secondary use of electronic health records (EHRs), where patient data are analyzed to conduct clinical and translational research. Methods based on machine learning to process EHRs are resulting in improved understanding of patient clinical trajectories and chronic disease risk prediction, creating a unique opportunity to derive previously unknown clinical insights. However, a wealth of clinical histories remains locked behind clinical narratives in free-form text. Consequently, unlocking the full potential of EHR data is contingent on the development of natural language processing (NLP) methods to automatically transform clinical text into structured clinical data that can guide clinical decisions and potentially delay or prevent disease onset.

OBJECTIVE

The goal of the research was to provide a comprehensive overview of the development and uptake of NLP methods applied to free-text clinical notes related to chronic diseases, including the investigation of challenges faced by NLP methodologies in understanding clinical narratives.

METHODS

Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines were followed and searches were conducted in 5 databases using "clinical notes," "natural language processing," and "chronic disease" and their variations as keywords to maximize coverage of the articles.

RESULTS

Of the 2652 articles considered, 106 met the inclusion criteria. Review of the included papers resulted in identification of 43 chronic diseases, which were then further classified into 10 disease categories using the International Classification of Diseases, 10th Revision. The majority of studies focused on diseases of the circulatory system (n=38) while endocrine and metabolic diseases were fewest (n=14). This was due to the structure of clinical records related to metabolic diseases, which typically contain much more structured data, compared with medical records for diseases of the circulatory system, which focus more on unstructured data and consequently have seen a stronger focus of NLP. The review has shown that there is a significant increase in the use of machine learning methods compared to rule-based approaches; however, deep learning methods remain emergent (n=3). Consequently, the majority of works focus on classification of disease phenotype with only a handful of papers addressing extraction of comorbidities from the free text or integration of clinical notes with structured data. There is a notable use of relatively simple methods, such as shallow classifiers (or combination with rule-based methods), due to the interpretability of predictions, which still represents a significant issue for more complex methods. Finally, scarcity of publicly available data may also have contributed to insufficient development of more advanced methods, such as extraction of word embeddings from clinical notes.

CONCLUSIONS

Efforts are still required to improve (1) progression of clinical NLP methods from extraction toward understanding; (2) recognition of relations among entities rather than entities in isolation; (3) temporal extraction to understand past, current, and future clinical events; (4) exploitation of alternative sources of clinical knowledge; and (5) availability of large-scale, de-identified clinical corpora.

Collapse