1
|
Gupta N, Doad J, Singh R, Chien D, Cotroneo M, Reid DBC, Cloney M, Paul D. Temporal Trends in the Epidemiology of Lower Back Pain in the United States. Spine (Phila Pa 1976) 2024; 49:E394-E403. [PMID: 39262199 DOI: 10.1097/brs.0000000000005158] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/08/2024] [Accepted: 08/31/2024] [Indexed: 09/13/2024]
Abstract
STUDY DESIGN This was an observational study. OBJECTIVE This study aims to explore sociodemographic and regional geographic variations in lower back pain (LBP) incidence, prevalence, and burden in the United States (US from 2000 to 2019). SUMMARY OF BACKGROUND DATA LBP is a major contributor to lost wages and disability in the United States. As LBP is associated with increasing age and sedentary lifestyle, the incidence of LBP is expected to rise. Due to LBP's multifactorial causes, US epidemiological trends lack sufficient data. MATERIALS AND METHODS Descriptive epidemiological data including disability-adjusted life years (DALYs), incidence, and prevalence per 100,000 population from 2000 to 2019 were collected from the Global Burden of Disease database. State-level data regarding poverty, insurance and employment status were obtained from the US Government Census Bureau and US Department of Labor. Statistical significance was indicated by P <0.05. RESULTS From 2000 to 2019, the US demonstrated reductions in LBP incidence, prevalence, and DALYs. Regional analysis demonstrated the Midwest to have the greatest mean incidence, prevalence, and DALYs; with Midwestern females significantly more affected than females in other regions. Those aged 25 to 49 in the Midwest were impacted significantly more across all measures compared with age-matched populations in other regions. Nationally, there were no significant associations between unemployment and LBP. Poverty was inversely correlated with LBP incidence. Uninsured status was positively correlated with prevalence and DALYs. CONCLUSION Although there has been progress in reducing the impact of LBP in the United States, the Midwest region has greater rates for all measures compared with other US regions. Further, females and those aged 25 to 49 in the Midwest were more likely to be affected by LBP compared with counterparts in other regions. Future studies should identify specific factors contributing to elevated LBP rates in the Midwest in order to guide targeted interventions to reduce the incidence and burden of LBP there.
Collapse
Affiliation(s)
- Nithin Gupta
- Campbell University School of Osteopathic Medicine, Lillington, NC
- Department of Orthopedic Spine Surgery, Conway Medical Center, Conway, SC
| | - Jagroop Doad
- Campbell University School of Osteopathic Medicine, Lillington, NC
| | - Rohin Singh
- Department of Neurosurgery, University of Rochester Medical Center, Rochester, NY
| | - Derek Chien
- Department of Neurosurgery, University of Rochester Medical School, Rochester, NY
| | - Matthew Cotroneo
- Department of Neurosurgery, University of Rochester Medical School, Rochester, NY
| | - Daniel B C Reid
- Department of Orthopedic Spine Surgery, Conway Medical Center, Conway, SC
| | - Michael Cloney
- Department of Neurosurgery, University of Pittsburgh, Pittsburgh, PA
| | - David Paul
- Department of Neurosurgery, University of Rochester Medical Center, Rochester, NY
| |
Collapse
|
2
|
Meier TA, Refahi MS, Hearne G, Restifo DS, Munoz-Acuna R, Rosen GL, Woloszynek S. The Role and Applications of Artificial Intelligence in the Treatment of Chronic Pain. Curr Pain Headache Rep 2024; 28:769-784. [PMID: 38822995 DOI: 10.1007/s11916-024-01264-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/28/2024] [Indexed: 06/03/2024]
Abstract
PURPOSE OF REVIEW This review aims to explore the interface between artificial intelligence (AI) and chronic pain, seeking to identify areas of focus for enhancing current treatments and yielding novel therapies. RECENT FINDINGS In the United States, the prevalence of chronic pain is estimated to be upwards of 40%. Its impact extends to increased healthcare costs, reduced economic productivity, and strain on healthcare resources. Addressing this condition is particularly challenging due to its complexity and the significant variability in how patients respond to treatment. Current options often struggle to provide long-term relief, with their benefits rarely outweighing the risks, such as dependency or other side effects. Currently, AI has impacted four key areas of chronic pain treatment and research: (1) predicting outcomes based on clinical information; (2) extracting features from text, specifically clinical notes; (3) modeling 'omic data to identify meaningful patient subgroups with potential for personalized treatments and improved understanding of disease processes; and (4) disentangling complex neuronal signals responsible for pain, which current therapies attempt to modulate. As AI advances, leveraging state-of-the-art architectures will be essential for improving chronic pain treatment. Current efforts aim to extract meaningful representations from complex data, paving the way for personalized medicine. The identification of unique patient subgroups should reveal targets for tailored chronic pain treatments. Moreover, enhancing current treatment approaches is achievable by gaining a more profound understanding of patient physiology and responses. This can be realized by leveraging AI on the increasing volume of data linked to chronic pain.
Collapse
Affiliation(s)
| | - Mohammad S Refahi
- Ecological and Evolutionary Signal-Processing and Informatics (EESI) Laboratory, Department of Electrical and Computer Engineering, Drexel University, Philadelphia, PA, USA
| | - Gavin Hearne
- Ecological and Evolutionary Signal-Processing and Informatics (EESI) Laboratory, Department of Electrical and Computer Engineering, Drexel University, Philadelphia, PA, USA
| | | | - Ricardo Munoz-Acuna
- Anesthesia, Critical Care, and Pain Medicine, Beth Israel Deaconess Medical Center, Boston, MA, USA
| | - Gail L Rosen
- Ecological and Evolutionary Signal-Processing and Informatics (EESI) Laboratory, Department of Electrical and Computer Engineering, Drexel University, Philadelphia, PA, USA
| | - Stephen Woloszynek
- Anesthesia, Critical Care, and Pain Medicine, Beth Israel Deaconess Medical Center, Boston, MA, USA.
| |
Collapse
|
3
|
Badreau M, Fadel M, Roquelaure Y, Bertin M, Rapicault C, Gilbert F, Porro B, Descatha A. Comparison of Machine Learning Methods in the Study of Cancer Survivors' Return to Work: An Example of Breast Cancer Survivors with Work-Related Factors in the CONSTANCES Cohort. JOURNAL OF OCCUPATIONAL REHABILITATION 2023; 33:750-756. [PMID: 36935460 DOI: 10.1007/s10926-023-10112-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Accepted: 03/03/2023] [Indexed: 06/18/2023]
Abstract
PURPOSE Machine learning (ML) methods showed a higher accuracy in identifying individuals without cancer who were unable to return to work (RTW) compared to the classical methods (e.g. logistic regression models). We therefore aim to discuss the value of these methods in relation to RTW for cancer survivors. METHODS Breast cancer (BC) survivors who were working at diagnosis within the CONSTANCES cohort were included in the study. RTW was assessed five years after the BC diagnosis (early retirement was considered as non-RTW). Age and occupation at diagnosis, and physical occupational job exposures assessed using the Job Exposure Matrix, JEM-CONSTANCES, were evaluated as predictors of RTW five years after BC diagnosis. The following four ML methods were used: (i) k-nearest neighbors; (ii) random forest; (iii) neural network; and (iv) elastic net. RESULTS The training sample included 683 BC survivors (RTW: 85.7%), and the test sample 171 (RTW: 85.4%). The elastic net method had the best results despite low sensitivity (accuracy = 76.6%; sensitivity = 31.7%; specificity = 90.8%), and the random forest model was the most accurate (= 79.5%) but also the least sensitive (= 14.3%). CONCLUSION This study takes a first step towards opening up new possibilities for identifying the occupational determinants of cancer survivors' RTW. Further work, including a larger sample size, and more predictor variables, is now needed.
Collapse
Affiliation(s)
- Marie Badreau
- Univ Angers, Univ Rennes, Inserm, EHESP, Irset (Institut de recherche en santé, environnement et travail), UMR_S 1085, IRSET-ESTER, SFR ICAT, Angers, F-49000, France
| | - Marc Fadel
- Univ Angers, Univ Rennes, Inserm, EHESP, Irset (Institut de recherche en santé, environnement et travail), UMR_S 1085, IRSET-ESTER, SFR ICAT, Angers, F-49000, France
- Univ Angers, CHU Angers, Univ Rennes, Inserm, EHESP, Irset (Institut de recherche en santé, environnement et travail), UMR_S 1085, IRSET ESTER, SFR ICAT, Angers, F-49000, France
| | - Yves Roquelaure
- Univ Angers, Univ Rennes, Inserm, EHESP, Irset (Institut de recherche en santé, environnement et travail), UMR_S 1085, IRSET-ESTER, SFR ICAT, Angers, F-49000, France
- Univ Angers, CHU Angers, Univ Rennes, Inserm, EHESP, Irset (Institut de recherche en santé, environnement et travail), UMR_S 1085, IRSET ESTER, SFR ICAT, Angers, F-49000, France
| | - Mélanie Bertin
- Univ Angers, CHU Angers, Univ Rennes, Inserm, EHESP, Irset (Institut de recherche en santé, environnement et travail), UMR_S 1085, IRSET ESTER, SFR ICAT, Angers, F-49000, France
- Univ Rennes, EHESP, CNRS, Inserm, Arènes - UMR 6051, RSMS (Recherche sur les Services et Management en Santé) - U 1309, Rennes, F-35000, France
| | - Clémence Rapicault
- Univ Angers, Univ Rennes, Inserm, EHESP, Irset (Institut de recherche en santé, environnement et travail), UMR_S 1085, IRSET-ESTER, SFR ICAT, Angers, F-49000, France
| | - Fabien Gilbert
- Univ Angers, Univ Rennes, Inserm, EHESP, Irset (Institut de recherche en santé, environnement et travail), UMR_S 1085, IRSET-ESTER, SFR ICAT, Angers, F-49000, France
| | - Bertrand Porro
- Univ Angers, Univ Rennes, Inserm, EHESP, Irset (Institut de recherche en santé, environnement et travail), UMR_S 1085, IRSET-ESTER, SFR ICAT, Angers, F-49000, France.
- Department of Human and Social Sciences, Institut de Cancerologie de l'Ouest (ICO), Angers, 49055, France.
| | - Alexis Descatha
- Univ Angers, Univ Rennes, Inserm, EHESP, Irset (Institut de recherche en santé, environnement et travail), UMR_S 1085, IRSET-ESTER, SFR ICAT, Angers, F-49000, France
- Univ Angers, CHU Angers, Univ Rennes, Inserm, EHESP, Irset (Institut de recherche en santé, environnement et travail), UMR_S 1085, IRSET ESTER, SFR ICAT, Angers, F-49000, France
- Centre antipoison et de toxicovigilance Grand Ouest, CHU Angers, CHU Angers, Angers, France
- Department of Occupational Medicine, Epidemiology and Prevention, Donald and Barbara Zucker School of Medicine, Hofstra, Northwell, USA
| |
Collapse
|
4
|
Vaid A, Landi I, Nadkarni G, Nabeel I. Using fine-tuned large language models to parse clinical notes in musculoskeletal pain disorders. Lancet Digit Health 2023; 5:S2589-7500(23)00202-9. [PMID: 39492289 DOI: 10.1016/s2589-7500(23)00202-9] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2023] [Revised: 09/06/2023] [Accepted: 09/22/2023] [Indexed: 11/05/2024]
Affiliation(s)
- Akhil Vaid
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA; The Division of Data Driven and Digital Medicine, Department of Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
| | - Isotta Landi
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA; The Division of Data Driven and Digital Medicine, Department of Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Girish Nadkarni
- The Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA; The Division of Data Driven and Digital Medicine, Department of Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Ismail Nabeel
- Department of Environmental Medicine and Public Health, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| |
Collapse
|
5
|
Jaiswal A, Katz A, Nesca M, Milios E. Identifying Risk Factors Associated With Lower Back Pain in Electronic Medical Record Free Text: Deep Learning Approach Using Clinical Note Annotations. JMIR Med Inform 2023; 11:e45105. [PMID: 37584559 PMCID: PMC10461403 DOI: 10.2196/45105] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2022] [Revised: 05/11/2023] [Accepted: 06/03/2023] [Indexed: 08/17/2023] Open
Abstract
Background Lower back pain is a common weakening condition that affects a large population. It is a leading cause of disability and lost productivity, and the associated medical costs and lost wages place a substantial burden on individuals and society. Recent advances in artificial intelligence and natural language processing have opened new opportunities for the identification and management of risk factors for lower back pain. In this paper, we propose and train a deep learning model on a data set of clinical notes that have been annotated with relevant risk factors, and we evaluate the model's performance in identifying risk factors in new clinical notes. Objective The primary objective is to develop a novel deep learning approach to detect risk factors for underlying disease in patients presenting with lower back pain in clinical encounter notes. The secondary objective is to propose solutions to potential challenges of using deep learning and natural language processing techniques for identifying risk factors in electronic medical record free text and make practical recommendations for future research in this area. Methods We manually annotated clinical notes for the presence of six risk factors for severe underlying disease in patients presenting with lower back pain. Data were highly imbalanced, with only 12% (n=296) of the annotated notes having at least one risk factor. To address imbalanced data, a combination of semantic textual similarity and regular expressions was used to further capture notes for annotation. Further analysis was conducted to study the impact of downsampling, binary formulation of multi-label classification, and unsupervised pretraining on classification performance. Results Of 2749 labeled clinical notes, 347 exhibited at least one risk factor, while 2402 exhibited none. The initial analysis shows that downsampling the training set to equalize the ratio of clinical notes with and without risk factors improved the macro-area under the receiver operating characteristic curve (AUROC) by 2%. The Bidirectional Encoder Representations from Transformers (BERT) model improved the macro-AUROC by 15% over the traditional machine learning baseline. In experiment 2, the proposed BERT-convolutional neural network (CNN) model for longer texts improved (4% macro-AUROC) over the BERT baseline, and the multitask models are more stable for minority classes. In experiment 3, domain adaptation of BERTCNN using masked language modeling improved the macro-AUROC by 2%. Conclusions Primary care clinical notes are likely to require manipulation to perform meaningful free-text analysis. The application of BERT models for multi-label classification on downsampled annotated clinical notes is useful in detecting risk factors suggesting an indication for imaging for patients with lower back pain.
Collapse
Affiliation(s)
- Aman Jaiswal
- Faculty of Computer Science, Dalhousie University, Halifax, NS, Canada
| | - Alan Katz
- Manitoba Centre for Health Policy, Department of Community Health Sciences, University of Manitoba, Winnipeg, MB, Canada
| | - Marcello Nesca
- Manitoba Centre for Health Policy, Department of Community Health Sciences, University of Manitoba, Winnipeg, MB, Canada
| | - Evangelos Milios
- Faculty of Computer Science, Dalhousie University, Halifax, NS, Canada
| |
Collapse
|
6
|
Cascella M, Schiavo D, Cuomo A, Ottaiano A, Perri F, Patrone R, Migliarelli S, Bignami EG, Vittori A, Cutugno F. Artificial Intelligence for Automatic Pain Assessment: Research Methods and Perspectives. Pain Res Manag 2023; 2023:6018736. [PMID: 37416623 PMCID: PMC10322534 DOI: 10.1155/2023/6018736] [Citation(s) in RCA: 18] [Impact Index Per Article: 18.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2022] [Revised: 02/03/2023] [Accepted: 04/20/2023] [Indexed: 07/08/2023]
Abstract
Although proper pain evaluation is mandatory for establishing the appropriate therapy, self-reported pain level assessment has several limitations. Data-driven artificial intelligence (AI) methods can be employed for research on automatic pain assessment (APA). The goal is the development of objective, standardized, and generalizable instruments useful for pain assessment in different clinical contexts. The purpose of this article is to discuss the state of the art of research and perspectives on APA applications in both research and clinical scenarios. Principles of AI functioning will be addressed. For narrative purposes, AI-based methods are grouped into behavioral-based approaches and neurophysiology-based pain detection methods. Since pain is generally accompanied by spontaneous facial behaviors, several approaches for APA are based on image classification and feature extraction. Language features through natural language strategies, body postures, and respiratory-derived elements are other investigated behavioral-based approaches. Neurophysiology-based pain detection is obtained through electroencephalography, electromyography, electrodermal activity, and other biosignals. Recent approaches involve multimode strategies by combining behaviors with neurophysiological findings. Concerning methods, early studies were conducted by machine learning algorithms such as support vector machine, decision tree, and random forest classifiers. More recently, artificial neural networks such as convolutional and recurrent neural network algorithms are implemented, even in combination. Collaboration programs involving clinicians and computer scientists must be aimed at structuring and processing robust datasets that can be used in various settings, from acute to different chronic pain conditions. Finally, it is crucial to apply the concepts of explainability and ethics when examining AI applications for pain research and management.
Collapse
Affiliation(s)
- Marco Cascella
- Division of Anesthesia and Pain Medicine, Istituto Nazionale Tumori IRCCS Fondazione G. Pascale, Naples 80131, Italy
| | - Daniela Schiavo
- Division of Anesthesia and Pain Medicine, Istituto Nazionale Tumori IRCCS Fondazione G. Pascale, Naples 80131, Italy
| | - Arturo Cuomo
- Division of Anesthesia and Pain Medicine, Istituto Nazionale Tumori IRCCS Fondazione G. Pascale, Naples 80131, Italy
| | - Alessandro Ottaiano
- SSD-Innovative Therapies for Abdominal Metastases, Istituto Nazionale Tumori di Napoli IRCCS “G. Pascale”, Via M. Semmola, Naples 80131, Italy
| | - Francesco Perri
- Head and Neck Oncology Unit, Istituto Nazionale Tumori IRCCS-Fondazione “G. Pascale”, Naples 80131, Italy
| | - Renato Patrone
- Dieti Department, University of Naples, Naples, Italy
- Division of Hepatobiliary Surgical Oncology, Istituto Nazionale Tumori IRCCS, Fondazione Pascale-IRCCS di Napoli, Naples, Italy
| | - Sara Migliarelli
- Department of Pharmacology, Faculty of Medicine and Psychology, University Sapienza of Rome, Rome, Italy
| | - Elena Giovanna Bignami
- Anesthesiology, Critical Care and Pain Medicine Division, Department of Medicine and Surgery, University of Parma, Parma, Italy
| | - Alessandro Vittori
- Department of Anesthesia and Critical Care, ARCO ROMA, Ospedale Pediatrico Bambino Gesù IRCCS, Rome 00165, Italy
| | - Francesco Cutugno
- Department of Electrical Engineering and Information Technologies, University of Naples “Federico II”, Naples 80100, Italy
| |
Collapse
|
7
|
Bacco L, Russo F, Ambrosio L, D’Antoni F, Vollero L, Vadalà G, Dell’Orletta F, Merone M, Papalia R, Denaro V. Natural language processing in low back pain and spine diseases: A systematic review. Front Surg 2022; 9:957085. [PMID: 35910476 PMCID: PMC9329654 DOI: 10.3389/fsurg.2022.957085] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2022] [Accepted: 06/27/2022] [Indexed: 11/13/2022] Open
Abstract
Natural Language Processing (NLP) is a discipline at the intersection between Computer Science (CS), Artificial Intelligence (AI), and Linguistics that leverages unstructured human-interpretable (natural) language text. In recent years, it gained momentum also in health-related applications and research. Although preliminary, studies concerning Low Back Pain (LBP) and other related spine disorders with relevant applications of NLP methodologies have been reported in the literature over the last few years. It motivated us to systematically review the literature comprised of two major public databases, PubMed and Scopus. To do so, we first formulated our research question following the PICO guidelines. Then, we followed a PRISMA-like protocol by performing a search query including terminologies of both technical (e.g., natural language and computational linguistics) and clinical (e.g., lumbar and spine surgery) domains. We collected 221 non-duplicated studies, 16 of which were eligible for our analysis. In this work, we present these studies divided into sub-categories, from both tasks and exploited models’ points of view. Furthermore, we report a detailed description of techniques used to extract and process textual features and the several evaluation metrics used to assess the performance of the NLP models. However, what is clear from our analysis is that additional studies on larger datasets are needed to better define the role of NLP in the care of patients with spinal disorders.
Collapse
Affiliation(s)
- Luca Bacco
- Department of Engineering, Unit of Computer Systems and Bioinformatics, Campus Bio-Medico University of Rome, Rome, Italy
- ItaliaNLP Lab, National Research Council, Istituto di Linguistica Computazionale “Antonio Zampolli”, Pisa, Italy
- R&D Lab, Webmonks S.r.l., Rome, Italy
| | - Fabrizio Russo
- Department of Orthopaedic and Trauma Surgery, Campus Bio-Medico University Hospital Foundation, Rome, Italy
- Research Unit of Orthopaedic and Trauma Surgery, Campus Bio-Medico University of Rome, Rome, Italy
- Correspondence: Mario Merone Fabrizio Russo
| | - Luca Ambrosio
- Research Unit of Orthopaedic and Trauma Surgery, Campus Bio-Medico University of Rome, Rome, Italy
| | - Federico D’Antoni
- Department of Engineering, Unit of Computer Systems and Bioinformatics, Campus Bio-Medico University of Rome, Rome, Italy
| | - Luca Vollero
- Department of Engineering, Unit of Computer Systems and Bioinformatics, Campus Bio-Medico University of Rome, Rome, Italy
| | - Gianluca Vadalà
- Department of Orthopaedic and Trauma Surgery, Campus Bio-Medico University Hospital Foundation, Rome, Italy
- Research Unit of Orthopaedic and Trauma Surgery, Campus Bio-Medico University of Rome, Rome, Italy
| | - Felice Dell’Orletta
- ItaliaNLP Lab, National Research Council, Istituto di Linguistica Computazionale “Antonio Zampolli”, Pisa, Italy
| | - Mario Merone
- Department of Engineering, Unit of Computer Systems and Bioinformatics, Campus Bio-Medico University of Rome, Rome, Italy
- Correspondence: Mario Merone Fabrizio Russo
| | - Rocco Papalia
- Department of Orthopaedic and Trauma Surgery, Campus Bio-Medico University Hospital Foundation, Rome, Italy
- Research Unit of Orthopaedic and Trauma Surgery, Campus Bio-Medico University of Rome, Rome, Italy
| | - Vincenzo Denaro
- Department of Orthopaedic and Trauma Surgery, Campus Bio-Medico University Hospital Foundation, Rome, Italy
| |
Collapse
|
8
|
Lu H, Ehwerhemuepha L, Rakovski C. A comparative study on deep learning models for text classification of unstructured medical notes with various levels of class imbalance. BMC Med Res Methodol 2022; 22:181. [PMID: 35780100 PMCID: PMC9250736 DOI: 10.1186/s12874-022-01665-y] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2021] [Accepted: 05/23/2022] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Discharge medical notes written by physicians contain important information about the health condition of patients. Many deep learning algorithms have been successfully applied to extract important information from unstructured medical notes data that can entail subsequent actionable results in the medical domain. This study aims to explore the model performance of various deep learning algorithms in text classification tasks on medical notes with respect to different disease class imbalance scenarios. METHODS In this study, we employed seven artificial intelligence models, a CNN (Convolutional Neural Network), a Transformer encoder, a pretrained BERT (Bidirectional Encoder Representations from Transformers), and four typical sequence neural networks models, namely, RNN (Recurrent Neural Network), GRU (Gated Recurrent Unit), LSTM (Long Short-Term Memory), and Bi-LSTM (Bi-directional Long Short-Term Memory) to classify the presence or absence of 16 disease conditions from patients' discharge summary notes. We analyzed this question as a composition of 16 binary separate classification problems. The model performance of the seven models on each of the 16 datasets with various levels of imbalance between classes were compared in terms of AUC-ROC (Area Under the Curve of the Receiver Operating Characteristic), AUC-PR (Area Under the Curve of Precision and Recall), F1 Score, and Balanced Accuracy as well as the training time. The model performances were also compared in combination with different word embedding approaches (GloVe, BioWordVec, and no pre-trained word embeddings). RESULTS The analyses of these 16 binary classification problems showed that the Transformer encoder model performs the best in nearly all scenarios. In addition, when the disease prevalence is close to or greater than 50%, the Convolutional Neural Network model achieved a comparable performance to the Transformer encoder, and its training time was 17.6% shorter than the second fastest model, 91.3% shorter than the Transformer encoder, and 94.7% shorter than the pre-trained BERT-Base model. The BioWordVec embeddings slightly improved the performance of the Bi-LSTM model in most disease prevalence scenarios, while the CNN model performed better without pre-trained word embeddings. In addition, the training time was significantly reduced with the GloVe embeddings for all models. CONCLUSIONS For classification tasks on medical notes, Transformer encoders are the best choice if the computation resource is not an issue. Otherwise, when the classes are relatively balanced, CNNs are a leading candidate because of their competitive performance and computational efficiency.
Collapse
Affiliation(s)
- Hongxia Lu
- Schmid College of Science and Technology, Chapman University, 1 University Dr, Orange, CA, 92866, USA
| | - Louis Ehwerhemuepha
- Schmid College of Science and Technology, Chapman University, 1 University Dr, Orange, CA, 92866, USA.,Children's Health of Orange County (CHOC), Orange, CA, 92868, USA
| | - Cyril Rakovski
- Schmid College of Science and Technology, Chapman University, 1 University Dr, Orange, CA, 92866, USA.
| |
Collapse
|
9
|
D’Antoni F, Russo F, Ambrosio L, Bacco L, Vollero L, Vadalà G, Merone M, Papalia R, Denaro V. Artificial Intelligence and Computer Aided Diagnosis in Chronic Low Back Pain: A Systematic Review. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2022; 19:ijerph19105971. [PMID: 35627508 PMCID: PMC9141006 DOI: 10.3390/ijerph19105971] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/15/2022] [Revised: 05/09/2022] [Accepted: 05/12/2022] [Indexed: 12/10/2022]
Abstract
Low Back Pain (LBP) is currently the first cause of disability in the world, with a significant socioeconomic burden. Diagnosis and treatment of LBP often involve a multidisciplinary, individualized approach consisting of several outcome measures and imaging data along with emerging technologies. The increased amount of data generated in this process has led to the development of methods related to artificial intelligence (AI), and to computer-aided diagnosis (CAD) in particular, which aim to assist and improve the diagnosis and treatment of LBP. In this manuscript, we have systematically reviewed the available literature on the use of CAD in the diagnosis and treatment of chronic LBP. A systematic research of PubMed, Scopus, and Web of Science electronic databases was performed. The search strategy was set as the combinations of the following keywords: “Artificial Intelligence”, “Machine Learning”, “Deep Learning”, “Neural Network”, “Computer Aided Diagnosis”, “Low Back Pain”, “Lumbar”, “Intervertebral Disc Degeneration”, “Spine Surgery”, etc. The search returned a total of 1536 articles. After duplication removal and evaluation of the abstracts, 1386 were excluded, whereas 93 papers were excluded after full-text examination, taking the number of eligible articles to 57. The main applications of CAD in LBP included classification and regression. Classification is used to identify or categorize a disease, whereas regression is used to produce a numerical output as a quantitative evaluation of some measure. The best performing systems were developed to diagnose degenerative changes of the spine from imaging data, with average accuracy rates >80%. However, notable outcomes were also reported for CAD tools executing different tasks including analysis of clinical, biomechanical, electrophysiological, and functional imaging data. Further studies are needed to better define the role of CAD in LBP care.
Collapse
Affiliation(s)
- Federico D’Antoni
- Unit of Computer Systems and Bioinformatics, Università Campus Bio-Medico di Roma, Via Alvaro Del Portillo, 21, 00128 Rome, Italy; (F.D.); (L.B.); (L.V.)
| | - Fabrizio Russo
- Department of Orthopaedic Surgery, Università Campus Bio-Medico di Roma, Via Alvaro Del Portillo, 200, 00128 Rome, Italy; (L.A.); (G.V.); (R.P.); (V.D.)
- Correspondence: (F.R.); (M.M.)
| | - Luca Ambrosio
- Department of Orthopaedic Surgery, Università Campus Bio-Medico di Roma, Via Alvaro Del Portillo, 200, 00128 Rome, Italy; (L.A.); (G.V.); (R.P.); (V.D.)
| | - Luca Bacco
- Unit of Computer Systems and Bioinformatics, Università Campus Bio-Medico di Roma, Via Alvaro Del Portillo, 21, 00128 Rome, Italy; (F.D.); (L.B.); (L.V.)
- ItaliaNLP Lab, Istituto di Linguistica Computazionale “Antonio Zampolli”, National Research Council, Via Giuseppe Moruzzi, 1, 56124 Pisa, Italy
- Webmonks S.r.l., Via del Triopio, 5, 00178 Rome, Italy
| | - Luca Vollero
- Unit of Computer Systems and Bioinformatics, Università Campus Bio-Medico di Roma, Via Alvaro Del Portillo, 21, 00128 Rome, Italy; (F.D.); (L.B.); (L.V.)
| | - Gianluca Vadalà
- Department of Orthopaedic Surgery, Università Campus Bio-Medico di Roma, Via Alvaro Del Portillo, 200, 00128 Rome, Italy; (L.A.); (G.V.); (R.P.); (V.D.)
| | - Mario Merone
- Unit of Computer Systems and Bioinformatics, Università Campus Bio-Medico di Roma, Via Alvaro Del Portillo, 21, 00128 Rome, Italy; (F.D.); (L.B.); (L.V.)
- Correspondence: (F.R.); (M.M.)
| | - Rocco Papalia
- Department of Orthopaedic Surgery, Università Campus Bio-Medico di Roma, Via Alvaro Del Portillo, 200, 00128 Rome, Italy; (L.A.); (G.V.); (R.P.); (V.D.)
| | - Vincenzo Denaro
- Department of Orthopaedic Surgery, Università Campus Bio-Medico di Roma, Via Alvaro Del Portillo, 200, 00128 Rome, Italy; (L.A.); (G.V.); (R.P.); (V.D.)
| |
Collapse
|
10
|
Bousquet C, Coulet A. Supporting Diagnosis With Next-Generation Artificial Intelligence. JAMA 2022; 327:1400. [PMID: 35412572 DOI: 10.1001/jama.2022.2303] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]
Affiliation(s)
- Cedric Bousquet
- Sorbonne Université, Inserm, Université Paris 13 Laboratoire d'Informatique Médicale et d'Ingénierie des Connaissances en e-Santé, Paris, France
| | | |
Collapse
|
11
|
Jujjavarapu C, Pejaver V, Cohen TA, Mooney SD, Heagerty PJ, Jarvik JG. A Comparison of Natural Language Processing Methods for the Classification of Lumbar Spine Imaging Findings Related to Lower Back Pain. Acad Radiol 2022; 29 Suppl 3:S188-S200. [PMID: 34862122 PMCID: PMC8917985 DOI: 10.1016/j.acra.2021.09.005] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2021] [Revised: 08/22/2021] [Accepted: 09/04/2021] [Indexed: 11/28/2022]
Abstract
RATIONALE AND OBJECTIVES The use of natural language processing (NLP) in radiology provides an opportunity to assist clinicians with phenotyping patients. However, the performance and generalizability of NLP across healthcare systems is uncertain. We assessed the performance within and generalizability across four healthcare systems of different NLP representational methods, coupled with elastic-net logistic regression to classify lower back pain-related findings from lumbar spine imaging reports. MATERIALS AND METHODS We used a dataset of 871 X-ray and magnetic resonance imaging reports sampled from a prospective study across four healthcare systems between October 2013 and September 2016. We annotated each report for 26 findings potentially related to lower back pain. Our framework applied four different NLP methods to convert text into feature sets (representations). For each representation, our framework used an elastic-net logistic regression model for each finding (i.e., 26 binary or "one-vs.-rest" classification models). For performance evaluation, we split data into training (80%, 697/871) and testing (20%, 174/871). In the training set, we used cross validation to identify the optimal hyperparameter value and then retrained on the full training set. We then assessed performance based on area under the curve (AUC) for the test set. We repeated this process 25 times with each repeat using a different random train/test split of the data, so that we could estimate 95% confidence intervals, and assess significant difference in performance between representations. For generalizability evaluation, we trained models on data from three healthcare systems with cross validation and then tested on the fourth. We repeated this process for each system, then calculated mean and standard deviation (SD) of AUC across the systems. RESULTS For individual representations, n-grams had the best average performance across all 26 findings (AUC: 0.960). For generalizability, document embeddings had the most consistent average performance across systems (SD: 0.010). Out of these 26 findings, we considered eight as potentially clinically important (any stenosis, central stenosis, lateral stenosis, foraminal stenosis, disc extrusion, nerve root displacement compression, endplate edema, and listhesis grade 2) since they have a relatively greater association with a history of lower back pain compared to the remaining 18 classes. We found a similar pattern for these eight in which n-grams and document embeddings had the best average performance (AUC: 0.954) and generalizability (SD: 0.007), respectively. CONCLUSION Based on performance assessment, we found that n-grams is the preferred method if classifier development and deployment occur at the same system. However, for deployment at multiple systems outside of the development system, or potentially if physician behavior changes within a system, one should consider document embeddings since embeddings appear to have the most consistent performance across systems.
Collapse
Affiliation(s)
- Chethan Jujjavarapu
- Department of Biomedical Informatics and Medical Education, School
of Medicine, University of Washington, Seattle, Washington
| | - Vikas Pejaver
- Department of Biomedical Informatics and Medical Education, School
of Medicine, University of Washington, Seattle, Washington
| | - Trevor A. Cohen
- Department of Biomedical Informatics and Medical Education, School
of Medicine, University of Washington, Seattle, Washington
| | - Sean D. Mooney
- Department of Biomedical Informatics and Medical Education, School
of Medicine, University of Washington, Seattle, Washington
| | - Patrick J. Heagerty
- Department of Biostatistics, University of Washington, Seattle,
Washington,Center for Biomedical Statistics, University of Washington,
Seattle, Washington
| | - Jeffrey G. Jarvik
- Department of Radiology, University of Washington, 1959 NE Pacific
Street, Seattle WA 98195,Department of Neurological Surgery, University of Washington,
Seattle, Washington,Department of Health Services, University of Washington, Seattle
Washington,Clinical Learning, Evidence And Research Center, University of
Washington, Seattle, Washington
| |
Collapse
|
12
|
Al-Hihi E, Gibson C, Lee J, Mount RR, Irani N, McGowan C. Improving appropriate imaging for non-specific low back pain. BMJ Open Qual 2022; 11:bmjoq-2021-001539. [PMID: 35190485 PMCID: PMC8862455 DOI: 10.1136/bmjoq-2021-001539] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2021] [Accepted: 01/15/2022] [Indexed: 12/03/2022] Open
Abstract
Non-specific low back pain (LBP) is a common condition in the USA, with approximately 80% of adults who will have LBP at some point during their life and roughly 30% of the adult population suffering from LBP at any given time. Although LBP is the most common cause of disability in the USA, it often has no identifiable anatomic or physiologic cause. Many patients seeking care for non-specific LBP receive X-rays and other imaging studies. However, for most acute LBP patients, symptoms resolve within 4 weeks and the use of routine imaging may result in unnecessary radiation exposure and add unnecessary costs and wasted time for patients without contributing to patient outcomes. The specific aim of the quality improvement (QI) project was to determine the effect of a multicomponent intervention to enhance the appropriate imaging utilisation for acute LBP to ≥90%. During the first 6 months of the QI project, 191 patients with LBP were seen. Of those patients, 156 (81.7%) received appropriate imaging over the 6-month intervention period, missing our targeted goal. Furthermore, this rate declined to baseline values after termination of the intervention, suggesting the need for additional prompts to sustain the initial intervention effect. Following a health system-wide deployment of practice-based alerts and quality score cards, the appropriate utilisation rate increased again and quickly to the target rate of 90%. To reduce variability in our clinical practice and to sustain an appropriate utilisation rate will require continued work. Health systems must find efficient methods to reduce LBP imaging and increase appropriate management of non-specific LBP in primary care. Increasing concordance with imaging guidelines can lessen harm associated with unnecessary radiation exposure and result in significant cost savings.
Collapse
Affiliation(s)
- Eyad Al-Hihi
- Internal Medicine, The University of Kansas Health System, Kansas City, Kansas, USA
| | - Cheryl Gibson
- Internal Medicine, The University of Kansas Health System, Kansas City, Kansas, USA
| | - Jaehoon Lee
- Educational Psychology, Leadership, and Counseling, Texas Tech University, Lubbock, Texas, USA
| | - Rebecca R Mount
- Internal Medicine, The University of Kansas Health System, Kansas City, Kansas, USA
| | - Neville Irani
- Radiology, The University of Kansas Health System, Kansas City, Kansas, USA
| | - Caylin McGowan
- Internal Medicine, The University of Kansas Health System, Kansas City, Kansas, USA
| |
Collapse
|
13
|
Abstract
Electronic health records (EHRs) are becoming a vital source of data for healthcare quality improvement, research, and operations. However, much of the most valuable information contained in EHRs remains buried in unstructured text. The field of clinical text mining has advanced rapidly in recent years, transitioning from rule-based approaches to machine learning and, more recently, deep learning. With new methods come new challenges, however, especially for those new to the field. This review provides an overview of clinical text mining for those who are encountering it for the first time (e.g., physician researchers, operational analytics teams, machine learning scientists from other domains). While not a comprehensive survey, this review describes the state of the art, with a particular focus on new tasks and methods developed over the past few years. It also identifies key barriers between these remarkable technical advances and the practical realities of implementation in health systems and in industry.
Collapse
Affiliation(s)
- Bethany Percha
- Department of Medicine and Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10025, USA;
| |
Collapse
|