1
|
Pilehvari S, Morgan Y, Peng W. An analytical review on the use of artificial intelligence and machine learning in diagnosis, prediction, and risk factor analysis of multiple sclerosis. Mult Scler Relat Disord 2024; 89:105761. [PMID: 39018642 DOI: 10.1016/j.msard.2024.105761] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2023] [Revised: 06/19/2024] [Accepted: 07/04/2024] [Indexed: 07/19/2024]
Abstract
Medical research offers potential for disease prediction, like Multiple Sclerosis (MS). This neurological disorder damages nerve cell sheaths, with treatments focusing on symptom relief. Manual MS detection is time-consuming and error prone. Though MS lesion detection has been studied, limited attention has been paid to clinical analysis and computational risk factor prediction. Artificial intelligence (AI) techniques and Machine Learning (ML) methods offer accurate and effective alternatives to mapping MS progression. However, there are challenges in accessing clinical data and interdisciplinary collaboration. By analyzing 103 papers, we recognize the trends, strengths and weaknesses of AI, ML, and statistical methods applied to MS diagnosis. AI/ML-based approaches are suggested to identify MS risk factors, select significant MS features, and improve the diagnostic accuracy, such as Rule-based Fuzzy Logic (RBFL), Adaptive Fuzzy Inference System (ANFIS), Artificial Neural Network methods (ANN), Support Vector Machine (SVM), and Bayesian Networks (BNs). Meanwhile, applications of the Expanded Disability Status Scale (EDSS) and Magnetic Resonance Imaging (MRI) can enhance MS diagnostic accuracy. By examining established risk factors like obesity, smoking, and education, some research tackled the issue of disease progression. The performance metrics varied across different aspects of MS studies: Diagnosis: Sensitivity ranged from 60 % to 98 %, specificity from 60 % to 98 %, and accuracy from 61 % to 97 %. Prediction: Sensitivity ranged from 76 % to 98 %, specificity from 65 % to 98 %, and accuracy from 62 % to 99 %. Segmentation: Accuracy ranged up to 96.7 %. Classification: Sensitivity ranged from 78 % to 97.34 %, specificity from 65 % to 99.32 %, and accuracy from 71 % to 97.94 %. Furthermore, the literature shows that combining techniques can improve efficiency, exploiting their strengths for better overall performance.
Collapse
Affiliation(s)
- Shima Pilehvari
- University of Regina, 3737 Wascana Parkway, Regina, SK, S4S 0A2, Canada
| | - Yasser Morgan
- University of Regina, 3737 Wascana Parkway, Regina, SK, S4S 0A2, Canada
| | - Wei Peng
- University of Regina, 3737 Wascana Parkway, Regina, SK, S4S 0A2, Canada.
| |
Collapse
|
2
|
Vázquez-Marrufo M, Sarrias-Arrabal E, García-Torres M, Martín-Clemente R, Izquierdo G. A systematic review of the application of machine-learning algorithms in multiple sclerosis. Neurologia 2023; 38:577-590. [PMID: 35843587 DOI: 10.1016/j.nrleng.2020.10.013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2020] [Accepted: 10/11/2020] [Indexed: 10/17/2022] Open
Abstract
INTRODUCTION The applications of artificial intelligence, and in particular automatic learning or "machine learning" (ML), constitute both a challenge and a great opportunity in numerous scientific, technical, and clinical disciplines. Specific applications in the study of multiple sclerosis (MS) have been no exception, and constitute an area of increasing interest in recent years. OBJECTIVE We present a systematic review of the application of ML algorithms in MS. MATERIALS AND METHODS We used the PubMed search engine, which allows free access to the MEDLINE medical database, to identify studies including the keywords "machine learning" and "multiple sclerosis." We excluded review articles, studies written in languages other than English or Spanish, and studies that were mainly technical and did not specifically apply to MS. The final selection included 76 articles, and 38 were rejected. CONCLUSIONS After the review process, we established 4 main applications of ML in MS: 1) classifying MS subtypes; 2) distinguishing patients with MS from healthy controls and individuals with other diseases; 3) predicting progression and response to therapeutic interventions; and 4) other applications. Results found to date have shown that ML algorithms may offer great support for health professionals both in clinical settings and in research into MS.
Collapse
Affiliation(s)
- M Vázquez-Marrufo
- Departamento de Psicología Experimental, Facultad de Psicología, Universidad de Sevilla, Sevilla, Spain.
| | - E Sarrias-Arrabal
- Departamento de Psicología Experimental, Facultad de Psicología, Universidad de Sevilla, Sevilla, Spain
| | - M García-Torres
- Escuela Politécnica Superior, Universidad Pablo de Olavide, Sevilla, Spain
| | - R Martín-Clemente
- Departamento de Teoría de la Señal y Comunicaciones, Escuela Técnica Superior de Ingeniería, Universidad de Sevilla, Sevilla, Spain
| | - G Izquierdo
- Unidad de Esclerosis Múltiple, Hospital VITHAS, Sevilla, Spain
| |
Collapse
|
3
|
Costantini E, Masciarelli E, Casorri L, Di Luigi M, Reale M. Medicinal herbs and multiple sclerosis: Overview on the hard balance between new therapeutic strategy and occupational health risk. Front Cell Neurosci 2022; 16:985943. [PMID: 36439198 PMCID: PMC9688751 DOI: 10.3389/fncel.2022.985943] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2022] [Accepted: 10/20/2022] [Indexed: 11/11/2022] Open
Abstract
Multiple sclerosis (MS) is an autoimmune disease characterized by demyelination and axonal loss of the central nervous system (CNS). Despite its spread throughout the world, the mechanisms that determine its onset are still to be defined. Immunological, genetic, viral, and environmental factors and exposure to chemicals may trigger MS. Many studies have highlighted the anti-inflammatory and anti-oxidant effects of medicinal herbs, which make them a natural and complementary treatment for neurodegenerative diseases. A severe reduction of several MS symptoms occurs with herbal therapy. Thus, the request for medicinal plants with potential beneficial effects, for MS patients, is constantly increasing. Consequently, a production increase needs. Unfortunately, many medicinal herbs were untested and their action mechanism, possible adverse effects, contraindications, or interactions with other drugs, are poorly or not investigated. Keeping in mind the pathological mechanisms of MS and the oxidative damages and mitochondrial dysfunctions induced by pesticides, it is important to understand if pesticides used to increase agricultural productivity and their residues in medicinal plants, may increase the risk of developing MS in both workers and consumers. Studies providing some indication about the relationship between environmental exposure to pesticides and MS disease incidence are few, fragmentary, and discordant. The aim of this article is to provide a glance at the therapeutic potential of medicinal plants and at the risk for MS onset of pesticides used by medicinal plant growers and present in medicinal herbs.
Collapse
Affiliation(s)
- Erica Costantini
- Department of Medicine and Science of Aging, G. d’Annunzio University of Chieti–Pescara, Chieti, Italy
| | - Eva Masciarelli
- Department of Technological Innovations and Safety of Plants, Products and Anthropic Settlements, National Institute for Insurance Against Accidents at Work, Rome, Italy
| | - Laura Casorri
- Department of Technological Innovations and Safety of Plants, Products and Anthropic Settlements, National Institute for Insurance Against Accidents at Work, Rome, Italy
| | - Marco Di Luigi
- Department of Occupational and Environmental Medicine, Epidemiology and Hygiene, INAIL Research Center, National Institute for Insurance Against Accidents at Work, Rome, Italy
| | - Marcella Reale
- Department of Innovative Technologies in Medicine and Dentistry, G. d’Annunzio University of Chieti–Pescara, Chieti, Italy
- *Correspondence: Marcella Reale,
| |
Collapse
|
4
|
Comparison of Machine Learning Methods Using Spectralis OCT for Diagnosis and Disability Progression Prognosis in Multiple Sclerosis. Ann Biomed Eng 2022; 50:507-528. [PMID: 35220529 PMCID: PMC9001622 DOI: 10.1007/s10439-022-02930-3] [Citation(s) in RCA: 20] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2021] [Accepted: 02/10/2022] [Indexed: 12/28/2022]
Abstract
Machine learning approaches in diagnosis and prognosis of multiple sclerosis (MS) were analysed using retinal nerve fiber layer (RNFL) thickness, measured by optical coherence tomography (OCT). A cross-sectional study (72 MS patients and 30 healthy controls) was used for diagnosis. These 72 MS patients were involved in a 10-year longitudinal follow-up study for prognostic purposes. Structural measurements of RNFL thickness were performed using different Spectralis OCT protocols: fast macular thickness protocol to measure macular RNFL, and fast RNFL thickness protocol and fast RNFL-N thickness protocol to measure peripapillary RNFL. Binary classifiers such as multiple linear regression (MLR), support vector machines (SVM), decision tree (DT), k-nearest neighbours (k-NN), Naïve Bayes (NB), ensemble classifier (EC) and long short-term memory (LSTM) recurrent neural network were tested. For MS diagnosis, the best acquisition protocol was fast macular thickness protocol using k-NN (accuracy: 95.8%; sensitivity: 94.4%; specificity: 97.2%; precision: 97.1%; AUC: 0.958). For MS prognosis, our model with a 3-year follow up to predict disability progression 8 years later was the best predictive model. DT performed best for fast macular thickness protocol (accuracy: 91.3%; sensitivity: 90.0%; specificity: 92.5%; precision: 92.3%; AUC: 0.913) and SVM for fast RNFL-N thickness protocol (accuracy: 91.3%; sensitivity: 87.5%; specificity: 95.0%; precision: 94.6%; AUC: 0.913). This work concludes that measurements of RNFL thickness obtained with Spectralis OCT have a good ability to diagnose MS and to predict disability progression in MS patients. This machine learning approach would help clinicians to have valuable information.
Collapse
|
5
|
Childhood pet ownership and multiple sclerosis: A systematic review and meta-analysis. Mult Scler Relat Disord 2021; 53:103046. [PMID: 34090130 DOI: 10.1016/j.msard.2021.103046] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2021] [Revised: 04/20/2021] [Accepted: 05/17/2021] [Indexed: 11/22/2022]
Abstract
BACKGROUND Many studies have been conducted investigating a range of environmental factors which have been implicated in the pathogenesis of multiple sclerosis (MS). We collated available data about exposure to domestic animals before symptom onset in MS to perform a systematic review and meta-analysis. METHODS Medline, Embase and Cinahl were searched for relevant articles, based on pre-defined inclusion and exclusion criteria and reference lists were hand-searched. Data were extracted and critical analysis was conducted using the Newcastle-Ottawa criteria. Meta-analysis used random effects. RESULTS Study heterogeneity was high and study quality was variable. Random effects meta-analysis showed no associations with any pet ownership and development of MS. CONCLUSION It is not possible to draw definitive conclusions from this work. The studies included had a high level of heterogeneity. There are many variables involved in pet ownership and exposure and the nature of the way these have been studied makes the analysis challenging.
Collapse
|
6
|
Montolío A, Martín-Gallego A, Cegoñino J, Orduna E, Vilades E, Garcia-Martin E, Palomar APD. Machine learning in diagnosis and disability prediction of multiple sclerosis using optical coherence tomography. Comput Biol Med 2021; 133:104416. [PMID: 33946022 DOI: 10.1016/j.compbiomed.2021.104416] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2020] [Revised: 03/25/2021] [Accepted: 04/16/2021] [Indexed: 01/02/2023]
Abstract
BACKGROUND Multiple sclerosis (MS) is a neurodegenerative disease that affects the central nervous system, especially the brain, spinal cord, and optic nerve. Diagnosis of this disease is a very complex process and generally requires a lot of time. In addition, treatments are applied without any information on the disability course in each MS patient. For these two reasons, the objective of this study was to improve the MS diagnosis and predict the long-term course of disability in MS patients based on clinical data and retinal nerve fiber layer (RNFL) thickness, measured by optical coherence tomography (OCT). MATERIAL AND METHODS A total of 104 healthy controls and 108 MS patients, 82 of whom had a 10-year follow-up, were enrolled. Classification algorithms such as multiple linear regression (MLR), support vector machines (SVM), decision tree (DT), k-nearest neighbours (k-NN), Naïve Bayes (NB), ensemble classifier (EC) and long short-term memory (LSTM) recurrent neural network were tested to develop two predictive models: MS diagnosis model and MS disability course prediction model. RESULTS For MS diagnosis, the best result was obtained using EC (accuracy: 87.7%; sensitivity: 87.0%; specificity: 88.5%; precision: 88.7%; AUC: 0.8775). In line with this good performance, the accuracy was 85.4% using k-NN and 84.4% using SVM. And, for long-term prediction of MS disability course, LSTM recurrent neural network was the most appropriate classifier (accuracy: 81.7%; sensitivity: 81.1%; specificity: 82.2%; precision: 78.9%; AUC: 0.8165). The use of MLR, SVM and k-NN also showed a good performance (AUC ≥ 0.8). CONCLUSIONS This study demonstrated that machine learning techniques, using clinical and OCT data, can help establish an early diagnosis and predict the course of MS. This advance could help clinicians select more specific treatments for each MS patient. Therefore, our findings underscore the potential of RNFL thickness as a reliable MS biomarker.
Collapse
Affiliation(s)
- Alberto Montolío
- Group of Biomaterials, Aragon Institute of Engineering Research (I3A), University of Zaragoza, Zaragoza, Spain; Department of Mechanical Engineering, University of Zaragoza, Zaragoza, Spain
| | - Alejandro Martín-Gallego
- Group of Biomaterials, Aragon Institute of Engineering Research (I3A), University of Zaragoza, Zaragoza, Spain; Department of Mechanical Engineering, University of Zaragoza, Zaragoza, Spain
| | - José Cegoñino
- Group of Biomaterials, Aragon Institute of Engineering Research (I3A), University of Zaragoza, Zaragoza, Spain; Department of Mechanical Engineering, University of Zaragoza, Zaragoza, Spain
| | - Elvira Orduna
- Ophthalmology Department, Miguel Servet University Hospital, Zaragoza, Spain; GIMSO Research and Innovative Group, Aragon Institute for Health Research (IIS Aragon), Zaragoza, Spain
| | - Elisa Vilades
- Ophthalmology Department, Miguel Servet University Hospital, Zaragoza, Spain; GIMSO Research and Innovative Group, Aragon Institute for Health Research (IIS Aragon), Zaragoza, Spain
| | - Elena Garcia-Martin
- Ophthalmology Department, Miguel Servet University Hospital, Zaragoza, Spain; GIMSO Research and Innovative Group, Aragon Institute for Health Research (IIS Aragon), Zaragoza, Spain
| | - Amaya Pérez Del Palomar
- Group of Biomaterials, Aragon Institute of Engineering Research (I3A), University of Zaragoza, Zaragoza, Spain; Department of Mechanical Engineering, University of Zaragoza, Zaragoza, Spain.
| |
Collapse
|
7
|
Hartmann M, Fenton N, Dobson R. Current review and next steps for artificial intelligence in multiple sclerosis risk research. Comput Biol Med 2021; 132:104337. [PMID: 33773193 DOI: 10.1016/j.compbiomed.2021.104337] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2021] [Revised: 03/09/2021] [Accepted: 03/10/2021] [Indexed: 12/30/2022]
Abstract
In the last few decades, the prevalence of multiple sclerosis (MS), a chronic inflammatory disease of the nervous system, has increased, particularly in Northern European countries, the United States, and United Kingdom. The promise of artificial intelligence (AI) and machine learning (ML) as tools to address problems in MS research has attracted increasing interest in these methods. Bayesian networks offer a clear advantage since they can integrate data and causal knowledge allowing for visualizing interactions between dependent variables and potential confounding factors. A review of AI/ML research methods applied to MS found 216 papers using terms "Multiple Sclerosis", "machine learning", "artificial intelligence", "Bayes", and "Bayesian", of which 90 were relevant and recently published. More than half of these involve the detection and segmentation of MS lesions for quantitative analysis; however clinical and lifestyle risk factor assessment and prediction have largely been ignored. Of those that address risk factors, most provide only association studies for some factors and often fail to include the potential impact of confounding factors and bias (especially where these have causal explanations) that could affect data interpretation, such as reporting quality and medical care access in various countries. To address these gaps in the literature, we propose a causal Bayesian network approach to assessing risk factors for MS, which can address deficiencies in current epidemiological methods of producing risk measurements and makes better use of observational data.
Collapse
Affiliation(s)
- Morghan Hartmann
- Risk and Information Management Research Group, School of Electronic Engineering and Computer Science, Queen Mary University of London, London, E1 4NS, UK.
| | - Norman Fenton
- Risk and Information Management Research Group, School of Electronic Engineering and Computer Science, Queen Mary University of London, London, E1 4NS, UK
| | - Ruth Dobson
- Preventive Neurology Unit, Wolfson Institute of Preventive Medicine, Queen Mary University of London, London, E1 4NS, UK
| |
Collapse
|
8
|
Vázquez-Marrufo M, Sarrias-Arrabal E, García-Torres M, Martín-Clemente R, Izquierdo G. A systematic review of the application of machine-learning algorithms in multiple sclerosis. Neurologia 2021; 38:S0213-4853(20)30431-X. [PMID: 33549371 DOI: 10.1016/j.nrl.2020.10.017] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2020] [Revised: 08/20/2020] [Accepted: 10/11/2020] [Indexed: 11/24/2022] Open
Abstract
INTRODUCTION The applications of artificial intelligence, and in particular automatic learning or "machine learning" (ML), constitute both a challenge and a great opportunity in numerous scientific, technical, and clinical disciplines. Specific applications in the study of multiple sclerosis (MS) have been no exception, and constitute an area of increasing interest in recent years. OBJECTIVE We present a systematic review of the application of ML algorithms in MS. MATERIALS AND METHODS We used the PubMed search engine, which allows free access to the MEDLINE medical database, to identify studies including the keywords "machine learning" and "multiple sclerosis." We excluded review articles, studies written in languages other than English or Spanish, and studies that were mainly technical and did not specifically apply to MS. The final selection included 76 articles, and 38 were rejected. CONCLUSIONS After the review process, we established 4 main applications of ML in MS: 1) classifying MS subtypes; 2) distinguishing patients with MS from healthy controls and individuals with other diseases; 3) predicting progression and response to therapeutic interventions; and 4) other applications. Results found to date have shown that ML algorithms may offer great support for health professionals both in clinical settings and in research into MS.
Collapse
Affiliation(s)
- M Vázquez-Marrufo
- Departamento de Psicología Experimental, Facultad de Psicología, Universidad de Sevilla, Sevilla, España.
| | - E Sarrias-Arrabal
- Departamento de Psicología Experimental, Facultad de Psicología, Universidad de Sevilla, Sevilla, España
| | - M García-Torres
- Escuela Politécnica Superior, Universidad Pablo de Olavide, Sevilla, España
| | - R Martín-Clemente
- Departamento de Teoría de la Señal y Comunicaciones, Escuela Técnica Superior de Ingeniería, Universidad de Sevilla, Sevilla, España
| | - G Izquierdo
- Unidad de Esclerosis Múltiple, Hospital VITHAS, Sevilla, España
| |
Collapse
|
9
|
Alshamrani R, Althbiti A, Alshamrani Y, Alkomah F, Ma X. Model-Driven Decision Making in Multiple Sclerosis Research: Existing Works and Latest Trends. PATTERNS 2020; 1:100121. [PMID: 33294867 PMCID: PMC7691382 DOI: 10.1016/j.patter.2020.100121] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
Multiple sclerosis (MS) is a neurological disorder that strikes the central nervous system. Due to the complexity of this disease, healthcare sectors are increasingly in need of shared clinical decision-making tools to provide practitioners with insightful knowledge and information about MS. These tools ought to be comprehensible by both technical and non-technical healthcare audiences. To aid this cause, this literature review analyzes the state-of-the-art decision support systems (DSSs) in MS research with a special focus on model-driven decision-making processes. The review clusters common methodologies used to support the decision-making process in classifying, diagnosing, predicting, and treating MS. This work observes that the majority of the investigated DSSs rely on knowledge-based and machine learning (ML) approaches, so the utilization of ontology and ML in the MS domain is observed to extend the scope of this review. Finally, this review summarizes the state-of-the-art DSSs, discusses the methods that have commonalities, and addresses the future work of applying DSS technologies in the MS field. Multiple sclerosis (MS) is a disorder that strikes the central nervous system of the human body. This article reviews state-of-the-art decision support systems (DSSs) in MS research, as recent studies have highlighted the importance of DSSs in the medical realm. However, the utilization of decision support systems for MS remains an open challenge. A special focus in this article is given to model-driven DSSs, which uses knowledge representation to simplify the complex process for decision makers. We find that most investigated studies use knowledge-based and machine learning approaches. Based on the literature review, we suggest some future work of applying DSSs in the MS domain. Potential future directions should focus on applying DSS technologies to understand the MS patterns, etiology, effects on the quality-of-life, and correlations with other disorders.
Collapse
Affiliation(s)
- Rayan Alshamrani
- Department of Computer Science, University of Idaho, Moscow, ID 83844-1010, USA.,Department of Information Technology, Taif University, Taif, Makkah 26571, Saudi Arabia
| | - Ashrf Althbiti
- Department of Computer Science, University of Idaho, Moscow, ID 83844-1010, USA.,Department of Information Technology, Taif University, Taif, Makkah 26571, Saudi Arabia
| | - Yara Alshamrani
- Department of Information Technology, Taif University, Taif, Makkah 26571, Saudi Arabia.,INTO Program, Washington State University, Pullman, WA 99164-3251, USA
| | - Fatimah Alkomah
- Department of Computer Science, University of Idaho, Moscow, ID 83844-1010, USA.,Department of Information Systems, Princess Nourah Bint Abdulrahman University, Riyadh 11671, Saudi Arabia
| | - Xiaogang Ma
- Department of Computer Science, University of Idaho, Moscow, ID 83844-1010, USA
| |
Collapse
|
10
|
Stafford IS, Kellermann M, Mossotto E, Beattie RM, MacArthur BD, Ennis S. A systematic review of the applications of artificial intelligence and machine learning in autoimmune diseases. NPJ Digit Med 2020; 3:30. [PMID: 32195365 PMCID: PMC7062883 DOI: 10.1038/s41746-020-0229-3] [Citation(s) in RCA: 102] [Impact Index Per Article: 25.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2019] [Accepted: 01/17/2020] [Indexed: 02/07/2023] Open
Abstract
Autoimmune diseases are chronic, multifactorial conditions. Through machine learning (ML), a branch of the wider field of artificial intelligence, it is possible to extract patterns within patient data, and exploit these patterns to predict patient outcomes for improved clinical management. Here, we surveyed the use of ML methods to address clinical problems in autoimmune disease. A systematic review was conducted using MEDLINE, embase and computers and applied sciences complete databases. Relevant papers included "machine learning" or "artificial intelligence" and the autoimmune diseases search term(s) in their title, abstract or key words. Exclusion criteria: studies not written in English, no real human patient data included, publication prior to 2001, studies that were not peer reviewed, non-autoimmune disease comorbidity research and review papers. 169 (of 702) studies met the criteria for inclusion. Support vector machines and random forests were the most popular ML methods used. ML models using data on multiple sclerosis, rheumatoid arthritis and inflammatory bowel disease were most common. A small proportion of studies (7.7% or 13/169) combined different data types in the modelling process. Cross-validation, combined with a separate testing set for more robust model evaluation occurred in 8.3% of papers (14/169). The field may benefit from adopting a best practice of validation, cross-validation and independent testing of ML models. Many models achieved good predictive results in simple scenarios (e.g. classification of cases and controls). Progression to more complex predictive models may be achievable in future through integration of multiple data types.
Collapse
Affiliation(s)
- I. S. Stafford
- Department of Human Genetics and Genomic Medicine, University of Southampton, Southampton, UK
- Institute for Life Sciences, University of Southampton, Southampton, UK
| | - M. Kellermann
- Department of Human Genetics and Genomic Medicine, University of Southampton, Southampton, UK
| | - E. Mossotto
- Department of Human Genetics and Genomic Medicine, University of Southampton, Southampton, UK
- Institute for Life Sciences, University of Southampton, Southampton, UK
| | - R. M. Beattie
- Department of Paediatric Gastroenterology, Southampton Children’s Hospital, Southampton, UK
| | - B. D. MacArthur
- Institute for Life Sciences, University of Southampton, Southampton, UK
| | - S. Ennis
- Department of Human Genetics and Genomic Medicine, University of Southampton, Southampton, UK
| |
Collapse
|