1
|
Dauphinot V, Laurent M, Prodel M, Civet A, Vainchtock A, Moutet C, Krolak-Salmon P, Garnier-Crussard A. Identification of profiles associated with conversions between the Alzheimer's disease stages, using a machine learning approach. Alzheimers Res Ther 2024; 16:166. [PMID: 39061107 PMCID: PMC11282744 DOI: 10.1186/s13195-024-01533-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2023] [Accepted: 07/16/2024] [Indexed: 07/28/2024]
Abstract
BACKGROUND The identification of factors involved in the conversion across the different Alzheimer's disease (AD) stages is crucial to prevent or slow the disease progression. We aimed to assess the factors and their combination associated with the conversion across the AD stages, from mild cognitive impairment to dementia, at a mild, moderate or severe stage and to identify profiles associated with earliest/latest conversion across the AD stages. METHODS In this study conducted on the real-life MEMORA cohort data collected from January 1, 2013, and December 31, 2019, three cohorts were selected depending on the baseline neurocognitive stage from a consecutive sample of patients attending a memory center, aged between 50 and 90 years old, with a diagnosis of AD during the follow-up, and with at least 2 visits at 6 months to 1 year of interval. A machine learning approach was used to assess the relationship between factors including socio-demographic characteristics, comorbidities and history of diseases, prescription of drugs, and geriatric hospitalizations, and the censored time to conversion from mild cognitive impairment to AD dementia, from the mild stage of dementia to the moderate or severe stages of AD dementia, and from the moderate stage of AD dementia to the severe stage. Profiles of earliest/latest conversion compared to median time to conversion across stages were identified. The median time to conversion was estimated with a Kaplan-Meier estimator. RESULTS Overall, 2891 patients were included (mean age 77±9 years old, 65% women). The median time of follow-up was 28 months for mild cognitive impairment (MCI) patients, 33 months for mild AD dementia and 30 months for moderate AD dementia. Among the 1264 patients at MCI stage, 61% converted to AD dementia (median time to conversion: 25 months). Among the 1142 patients with mild AD dementia, 59% converted to moderate/severe stage (median time: 23 months) and among the 1332 patients with moderate AD dementia, 23% converted to severe stage (Q3 time to conversion: 22 months). Among the studied factors, cardiovascular comorbidities, anxiety, social isolation, osteoporosis, and hearing disorders were identified as being associated with earlier conversion across stages. Symptomatic treatment i.e. cholinesterase inhibitors for AD was associated with later conversion from mild stage of dementia to moderate/severe stages. CONCLUSION This study based on a machine learning approach allowed to identify potentially modifiable factors associated with conversion across AD stages for which timely interventions may be implemented to delay disease progression.
Collapse
Affiliation(s)
- Virginie Dauphinot
- Clinical and Research Memory Centre, Lyon Institute For Aging, Charpennes Hospital, Hospices Civils de Lyon, 27 rue Gabriel Péri, Villeurbanne, Lyon, 69100, France.
| | | | | | | | | | - Claire Moutet
- Clinical and Research Memory Centre, Lyon Institute For Aging, Charpennes Hospital, Hospices Civils de Lyon, 27 rue Gabriel Péri, Villeurbanne, Lyon, 69100, France
| | - Pierre Krolak-Salmon
- Clinical and Research Memory Centre, Lyon Institute For Aging, Charpennes Hospital, Hospices Civils de Lyon, 27 rue Gabriel Péri, Villeurbanne, Lyon, 69100, France
| | - Antoine Garnier-Crussard
- Clinical and Research Memory Centre, Lyon Institute For Aging, Charpennes Hospital, Hospices Civils de Lyon, 27 rue Gabriel Péri, Villeurbanne, Lyon, 69100, France
- PhIND "Physiopathology and Imaging of Neurological Disorders", Neuropresage Team, Normandie Univ, UNICAEN, INSERM, U1237, Cyceron, Caen, 14000, France
| |
Collapse
|
2
|
Casanova IJ, Campos M, Juarez JM, Gomariz A, Canovas-Segura B, Lorente-Ros M, Lorente JA. Surprising and novel multivariate sequential patterns using odds ratio for temporal evolution in healthcare. BMC Med Inform Decis Mak 2024; 24:165. [PMID: 38872146 PMCID: PMC11170878 DOI: 10.1186/s12911-024-02566-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2023] [Accepted: 06/05/2024] [Indexed: 06/15/2024] Open
Abstract
BACKGROUND Pattern mining techniques are helpful tools when extracting new knowledge in real practice, but the overwhelming number of patterns is still a limiting factor in the health-care domain. Current efforts concerning the definition of measures of interest for patterns are focused on reducing the number of patterns and quantifying their relevance (utility/usefulness). However, although the temporal dimension plays a key role in medical records, few efforts have been made to extract temporal knowledge about the patient's evolution from multivariate sequential patterns. METHODS In this paper, we propose a method to extract a new type of patterns in the clinical domain called Jumping Diagnostic Odds Ratio Sequential Patterns (JDORSP). The aim of this method is to employ the odds ratio to identify a concise set of sequential patterns that represent a patient's state with a statistically significant protection factor (i.e., a pattern associated with patients that survive) and those extensions whose evolution suddenly changes the patient's clinical state, thus making the sequential patterns a statistically significant risk factor (i.e., a pattern associated with patients that do not survive), or vice versa. RESULTS The results of our experiments highlight that our method reduces the number of sequential patterns obtained with state-of-the-art pattern reduction methods by over 95%. Only by achieving this drastic reduction can medical experts carry out a comprehensive clinical evaluation of the patterns that might be considered medical knowledge regarding the temporal evolution of the patients. We have evaluated the surprisingness and relevance of the sequential patterns with clinicians, and the most interesting fact is the high surprisingness of the extensions of the patterns that become a protection factor, that is, the patients that recover after several days of being at high risk of dying. CONCLUSIONS Our proposed method with which to extract JDORSP generates a set of interpretable multivariate sequential patterns with new knowledge regarding the temporal evolution of the patients. The number of patterns is greatly reduced when compared to those generated by other methods and measures of interest. An additional advantage of this method is that it does not require any parameters or thresholds, and that the reduced number of patterns allows a manual evaluation.
Collapse
Affiliation(s)
- Isidoro J Casanova
- AIKE research team (INTICO), Facultad de Informatica, University of Murcia, Campus de Espinardo, Murcia, 30100, Spain.
| | - Manuel Campos
- AIKE research team (INTICO), Facultad de Informatica, University of Murcia, Campus de Espinardo, Murcia, 30100, Spain
- Murcian Bio-Health Institute (IMIB-Arrixaca), Murcia, Spain
| | - Jose M Juarez
- AIKE research team (INTICO), Facultad de Informatica, University of Murcia, Campus de Espinardo, Murcia, 30100, Spain
| | - Antonio Gomariz
- AIKE research team (INTICO), Facultad de Informatica, University of Murcia, Campus de Espinardo, Murcia, 30100, Spain
| | - Bernardo Canovas-Segura
- AIKE research team (INTICO), Facultad de Informatica, University of Murcia, Campus de Espinardo, Murcia, 30100, Spain
| | - Marta Lorente-Ros
- Department of Cardiology, Washington Hospital Center, Georgetown University, Washington, DC, USA
| | - Jose A Lorente
- CIBER de Enfermedades Respiratorias, Instituto de Salud Carlos III, Madrid, Spain
- University Hospital of Getafe, Getafe, Spain
- European University of Madrid, Madrid, Spain
| |
Collapse
|
3
|
Vagliano I, Kingma MY, Dongelmans DA, de Lange DW, de Keizer NF, Schut MC. Automated identification of patient subgroups: A case-study on mortality of COVID-19 patients admitted to the ICU. Comput Biol Med 2023; 163:107146. [PMID: 37356293 PMCID: PMC10266884 DOI: 10.1016/j.compbiomed.2023.107146] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2023] [Revised: 05/31/2023] [Accepted: 06/06/2023] [Indexed: 06/27/2023]
Abstract
BACKGROUND - Subgroup discovery (SGD) is the automated splitting of the data into complex subgroups. Various SGD methods have been applied to the medical domain, but none have been extensively evaluated. We assess the numerical and clinical quality of SGD methods. METHOD - We applied the improved Subgroup Set Discovery (SSD++), Patient Rule Induction Method (PRIM) and APRIORI - Subgroup Discovery (APRIORI-SD) algorithms to obtain patient subgroups on observational data of 14,548 COVID-19 patients admitted to 73 Dutch intensive care units. Hospital mortality was the clinical outcome. Numerical significance of the subgroups was assessed with information-theoretic measures. Clinical significance of the subgroups was assessed by comparing variable importance on population and subgroup levels and by expert evaluation. RESULTS - The tested algorithms varied widely in the total number of discovered subgroups (5-62), the number of selected variables, and the predictive value of the subgroups. Qualitative assessment showed that the found subgroups make clinical sense. SSD++ found most subgroups (n = 62), which added predictive value and generally showed high potential for clinical use. APRIORI-SD and PRIM found fewer subgroups (n = 5 and 6), which did not add predictive value and were clinically less relevant. CONCLUSION - Automated SGD methods find clinical subgroups that are relevant when assessed quantitatively (yield added predictive value) and qualitatively (intensivists consider the subgroups significant). Different methods yield different subgroups with varying degrees of predictive performance and clinical quality. External validation is needed to generalize the results to other populations and future research should explore which algorithm performs best in other settings.
Collapse
Affiliation(s)
- I Vagliano
- Dept. of Medical Informatics, Amsterdam UMC, University of Amsterdam, Meibergdreef 15, 1105 AZ, Amsterdam, the Netherlands; Amsterdam Public Health (APH), Postbus 7057, 1007 MB, Amsterdam, the Netherlands.
| | - M Y Kingma
- Dept. of Medical Informatics, Amsterdam UMC, University of Amsterdam, Meibergdreef 15, 1105 AZ, Amsterdam, the Netherlands
| | - D A Dongelmans
- Amsterdam Public Health (APH), Postbus 7057, 1007 MB, Amsterdam, the Netherlands; Dept. of Intensive Care Medicine, Amsterdam UMC, University of Amsterdam, Meibergdreef 15, 1105 AZ, Amsterdam, the Netherlands; National Intensive Care Evaluation (NICE) Foundation, Postbus 23640, 1100 EC, Amsterdam, the Netherlands
| | - D W de Lange
- National Intensive Care Evaluation (NICE) Foundation, Postbus 23640, 1100 EC, Amsterdam, the Netherlands; Dept. of Intensive Care, University Medical Center Utrecht, University Utrecht, Heidelberglaan 100, 3584 CX, Utrecht, the Netherlands
| | - N F de Keizer
- Dept. of Medical Informatics, Amsterdam UMC, University of Amsterdam, Meibergdreef 15, 1105 AZ, Amsterdam, the Netherlands; Amsterdam Public Health (APH), Postbus 7057, 1007 MB, Amsterdam, the Netherlands; National Intensive Care Evaluation (NICE) Foundation, Postbus 23640, 1100 EC, Amsterdam, the Netherlands
| | - M C Schut
- Dept. of Medical Informatics, Amsterdam UMC, University of Amsterdam, Meibergdreef 15, 1105 AZ, Amsterdam, the Netherlands; Amsterdam Public Health (APH), Postbus 7057, 1007 MB, Amsterdam, the Netherlands; Dept. of Clinical Chemistry, Amsterdam UMC, Vrije Universiteit Amsterdam, Meibergdreef 15, 1105 AZ, Amsterdam, the Netherlands
| |
Collapse
|
4
|
Moingeon P, Chenel M, Rousseau C, Voisin E, Guedj M. Virtual patients, digital twins and causal disease models: paving the ground for in silico clinical trials. Drug Discov Today 2023; 28:103605. [PMID: 37146963 DOI: 10.1016/j.drudis.2023.103605] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2022] [Revised: 03/22/2023] [Accepted: 04/27/2023] [Indexed: 05/07/2023]
Abstract
Computational models are being explored to simulate in silico the efficacy and safety of drug candidates and medical devices. Disease models that are based on patients' profiling data are being produced to represent interactomes of genes or proteins and to infer causality in the pathophysiology {AuQ: Edit OK?}, which makes it possible to mimic the impact of drugs on relevant targets. Virtual patients designed from medical records as well as digital twins were generated to simulate specific organs and to predict treatment efficacy at the individual patient level {AuQ: Edit OK?}. As the acceptance of digital evidence by regulators grows, predictive artificial intelligence (AI)-based models will support the design of confirmatory trials in humans and will accelerate the development of efficient drugs and medical devices.
Collapse
|
5
|
Esnault C, Rollot M, Guilmin P, Zucker JD. Qluster: An easy-to-implement generic workflow for robust clustering of health data. Front Artif Intell 2023; 5:1055294. [PMID: 36814808 PMCID: PMC9939832 DOI: 10.3389/frai.2022.1055294] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2022] [Accepted: 12/22/2022] [Indexed: 02/08/2023] Open
Abstract
The exploration of heath data by clustering algorithms allows to better describe the populations of interest by seeking the sub-profiles that compose it. This therefore reinforces medical knowledge, whether it is about a disease or a targeted population in real life. Nevertheless, contrary to the so-called conventional biostatistical methods where numerous guidelines exist, the standardization of data science approaches in clinical research remains a little discussed subject. This results in a significant variability in the execution of data science projects, whether in terms of algorithms used, reliability and credibility of the designed approach. Taking the path of parsimonious and judicious choice of both algorithms and implementations at each stage, this article proposes Qluster, a practical workflow for performing clustering tasks. Indeed, this workflow makes a compromise between (1) genericity of applications (e.g. usable on small or big data, on continuous, categorical or mixed variables, on database of high-dimensionality or not), (2) ease of implementation (need for few packages, few algorithms, few parameters, ...), and (3) robustness (e.g. use of proven algorithms and robust packages, evaluation of the stability of clusters, management of noise and multicollinearity). This workflow can be easily automated and/or routinely applied on a wide range of clustering projects. It can be useful both for data scientists with little experience in the field to make data clustering easier and more robust, and for more experienced data scientists who are looking for a straightforward and reliable solution to routinely perform preliminary data mining. A synthesis of the literature on data clustering as well as the scientific rationale supporting the proposed workflow is also provided. Finally, a detailed application of the workflow on a concrete use case is provided, along with a practical discussion for data scientists. An implementation on the Dataiku platform is available upon request to the authors.
Collapse
Affiliation(s)
| | | | | | - Jean-Daniel Zucker
- Sorbonne University, IRD, UMMISCO, Bondy, France
- Sorbonne University, INSERM, NUTRIOMICS, Paris, France
| |
Collapse
|