1
|
Ghofrani-Jahromi M, Poudel GR, Razi A, Abeyasinghe PM, Paulsen JS, Tabrizi SJ, Saha S, Georgiou-Karistianis N. Prognostic enrichment for early-stage Huntington's disease: An explainable machine learning approach for clinical trial. Neuroimage Clin 2024; 43:103650. [PMID: 39142216 PMCID: PMC11367643 DOI: 10.1016/j.nicl.2024.103650] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2024] [Revised: 07/11/2024] [Accepted: 07/31/2024] [Indexed: 08/16/2024]
Abstract
BACKGROUND In Huntington's disease clinical trials, recruitment and stratification approaches primarily rely on genetic load, cognitive and motor assessment scores. They focus less on in vivo brain imaging markers, which reflect neuropathology well before clinical diagnosis. Machine learning methods offer a degree of sophistication which could significantly improve prognosis and stratification by leveraging multimodal biomarkers from large datasets. Such models specifically tailored to HD gene expansion carriers could further enhance the efficacy of the stratification process. OBJECTIVES To improve stratification of Huntington's disease individuals for clinical trials. METHODS We used data from 451 gene positive individuals with Huntington's disease (both premanifest and diagnosed) from previously published cohorts (PREDICT, TRACK, TrackON, and IMAGE). We applied whole-brain parcellation to longitudinal brain scans and measured the rate of lateral ventricular enlargement, over 3 years, which was used as the target variable for our prognostic random forest regression models. The models were trained on various combinations of features at baseline, including genetic load, cognitive and motor assessment score biomarkers, as well as brain imaging-derived features. Furthermore, a simplified stratification model was developed to classify individuals into two homogenous groups (low risk and high risk) based on their anticipated rate of ventricular enlargement. RESULTS The predictive accuracy of the prognostic models substantially improved by integrating brain imaging features alongside genetic load, cognitive and motor biomarkers: a 24 % reduction in the cross-validated mean absolute error, yielding an error of 530 mm3/year. The stratification model had a cross-validated accuracy of 81 % in differentiating between moderate and fast progressors (precision = 83 %, recall = 80 %). CONCLUSIONS This study validated the effectiveness of machine learning in differentiating between low- and high-risk individuals based on the rate of ventricular enlargement. The models were exclusively trained using features from HD individuals, which offers a more disease-specific, simplified, and accurate approach for prognostic enrichment compared to relying on features extracted from healthy control groups, as done in previous studies. The proposed method has the potential to enhance clinical utility by: i) enabling more targeted recruitment of individuals for clinical trials, ii) improving post-hoc evaluation of individuals, and iii) ultimately leading to better outcomes for individuals through personalized treatment selection.
Collapse
Affiliation(s)
| | - Govinda R Poudel
- Mary MacKillop Institute for Health Research, Australian Catholic University, Melbourne VIC3000, Australia
| | - Adeel Razi
- Turner Institute for Brain and Mental Health, Monash University, Clayton VIC3800, Australia
| | - Pubu M Abeyasinghe
- Turner Institute for Brain and Mental Health, Monash University, Clayton VIC3800, Australia
| | - Jane S Paulsen
- Department of Neurology, University of Wisconsin-Madison, 1685 Highland Avenue, Madison, WI, USA
| | - Sarah J Tabrizi
- UCL Huntington's Disease Centre, UCL Queen Square Institute of Neurology, UK Dementia Research Institute, Department of Neurodegenerative Diseases, University College London, London, UK
| | - Susmita Saha
- Turner Institute for Brain and Mental Health, Monash University, Clayton VIC3800, Australia
| | | |
Collapse
|
2
|
Yoshioka H, Jin R, Hisaka A, Suzuki H. Disease progression modeling with temporal realignment: An emerging approach to deepen knowledge on chronic diseases. Pharmacol Ther 2024; 259:108655. [PMID: 38710372 DOI: 10.1016/j.pharmthera.2024.108655] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2024] [Revised: 04/22/2024] [Accepted: 05/01/2024] [Indexed: 05/08/2024]
Abstract
The recent development of the first disease-modifying drug for Alzheimer's disease represents a major advancement in dementia treatment. Behind this breakthrough is a quarter century of research efforts to understand the disease not by a particular symptom at a given moment, but by long-term sequential changes in multiple biomarkers. Disease progression modeling with temporal realignment (DPM-TR) is an emerging computational approach proposed with this biomarker-based disease concept. By integrating short-term clinical observations of multiple disease biomarkers in a data-driven manner, DPM-TR provides a way to understand the progression of chronic diseases over decades and predict individual disease stages more accurately. DPM-TR has been developed primarily in the area of neurodegenerative diseases but has recently been extended to non-neurodegenerative diseases, including chronic obstructive pulmonary, autoimmune, and ophthalmologic diseases. This review focuses on opportunities for DPM-TR in clinical practice and drug development and discusses its current status and challenges.
Collapse
Affiliation(s)
- Hideki Yoshioka
- Office of Regulatory Science Research, Pharmaceuticals and Medical Devices Agency, Tokyo, Japan; Laboratory of Clinical Pharmacology and Pharmacometrics, Graduate School of Pharmaceutical Sciences, Chiba University, Chiba, Japan
| | - Ryota Jin
- Laboratory of Clinical Pharmacology and Pharmacometrics, Graduate School of Pharmaceutical Sciences, Chiba University, Chiba, Japan
| | - Akihiro Hisaka
- Laboratory of Clinical Pharmacology and Pharmacometrics, Graduate School of Pharmaceutical Sciences, Chiba University, Chiba, Japan.
| | - Hiroshi Suzuki
- Executive Director, Pharmaceuticals and Medical Devices Agency, Tokyo, Japan; Department of Pharmacy, The University of Tokyo Hospital, Faculty of Medicine, The University of Tokyo, Tokyo, Japan
| |
Collapse
|
3
|
Raschka T, Li Z, Gaßner H, Kohl Z, Jukic J, Marxreiter F, Fröhlich H. Unraveling progression subtypes in people with Huntington's disease. EPMA J 2024; 15:275-287. [PMID: 38841617 PMCID: PMC11148000 DOI: 10.1007/s13167-024-00368-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2024] [Accepted: 05/09/2024] [Indexed: 06/07/2024]
Abstract
Background Huntington's disease (HD) is a progressive neurodegenerative disease caused by a CAG trinucleotide expansion in the huntingtin gene. The length of the CAG repeat is inversely correlated with disease onset. HD is characterized by hyperkinetic movement disorder, psychiatric symptoms, and cognitive deficits, which greatly impact patient's quality of life. Despite this clear genetic course, high variability of HD patients' symptoms can be observed. Current clinical diagnosis of HD solely relies on the presence of motor signs, disregarding the other important aspects of the disease. By incorporating a broader approach that encompasses motor as well as non-motor aspects of HD, predictive, preventive, and personalized (3P) medicine can enhance diagnostic accuracy and improve patient care. Methods Multisymptom disease trajectories of HD patients collected from the Enroll-HD study were first aligned on a common disease timescale to account for heterogeneity in disease symptom onset and diagnosis. Following this, the aligned disease trajectories were clustered using the previously published Variational Deep Embedding with Recurrence (VaDER) algorithm and resulting progression subtypes were clinically characterized. Lastly, an AI/ML model was learned to predict the progression subtype from only first visit data or with data from additional follow-up visits. Results Results demonstrate two distinct subtypes, one large cluster (n = 7122) showing a relative stable disease progression and a second, smaller cluster (n = 411) showing a dramatically more progressive disease trajectory. Clinical characterization of the two subtypes correlates with CAG repeat length, as well as several neurobehavioral, psychiatric, and cognitive scores. In fact, cognitive impairment was found to be the major difference between the two subtypes. Additionally, a prognostic model shows the ability to predict HD subtypes from patients' first visit only. Conclusion In summary, this study aims towards the paradigm shift from reactive to preventive and personalized medicine by showing that non-motor symptoms are of vital importance for predicting and categorizing each patients' disease progression pattern, as cognitive decline is oftentimes more reflective of HD progression than its motor aspects. Considering these aspects while counseling and therapy definition will personalize each individuals' treatment. The ability to provide patients with an objective assessment of their disease progression and thus a perspective for their life with HD is the key to improving their quality of life. By conducting additional analysis on biological data from both subtypes, it is possible to gain a deeper understanding of these subtypes and uncover the underlying biological factors of the disease. This greatly aligns with the goal of shifting towards 3P medicine. Supplementary Information The online version contains supplementary material available at 10.1007/s13167-024-00368-2.
Collapse
Affiliation(s)
- Tamara Raschka
- Department of Bioinformatics, Fraunhofer Institute for Algorithms and Scientific Computing (SCAI), Schloss Birlinghoven, 53757 Sankt Augustin, Germany
| | - Zexin Li
- Department of Bioinformatics, Fraunhofer Institute for Algorithms and Scientific Computing (SCAI), Schloss Birlinghoven, 53757 Sankt Augustin, Germany
- Bonn-Aachen International Center for IT, University of Bonn, Friedrich-Hirzebruch-Allee 6, 53115 Bonn, Germany
| | - Heiko Gaßner
- Department of Molecular Neurology, University Hospital Erlangen, Friedrich-Alexander-Universität Erlangen-Nürnberg, 91054 Erlangen, Germany
- Fraunhofer IIS, Fraunhofer Institute for Integrated Circuits IIS, Am Wolfsmantel 33, 91058 Erlangen, Germany
| | - Zacharias Kohl
- Department of Neurology, University of Regensburg, Regensburg, Germany
| | - Jelena Jukic
- Department of Molecular Neurology, University Hospital Erlangen, Friedrich-Alexander-Universität Erlangen-Nürnberg, 91054 Erlangen, Germany
- Center for Rare Diseases Erlangen (ZSEER), University Hospital Erlangen, Friedrich-Alexander-Universität Erlangen-Nürnberg, 91054 Erlangen, Germany
| | - Franz Marxreiter
- Department of Molecular Neurology, University Hospital Erlangen, Friedrich-Alexander-Universität Erlangen-Nürnberg, 91054 Erlangen, Germany
- Center for Movement Disorders, Passauer Wolf, 93333 Bad Gögging, Germany
- Center for Rare Diseases Erlangen (ZSEER), University Hospital Erlangen, Friedrich-Alexander-Universität Erlangen-Nürnberg, 91054 Erlangen, Germany
| | - Holger Fröhlich
- Department of Bioinformatics, Fraunhofer Institute for Algorithms and Scientific Computing (SCAI), Schloss Birlinghoven, 53757 Sankt Augustin, Germany
- Bonn-Aachen International Center for IT, University of Bonn, Friedrich-Hirzebruch-Allee 6, 53115 Bonn, Germany
| |
Collapse
|
4
|
Cheng F, Wang F, Tang J, Zhou Y, Fu Z, Zhang P, Haines JL, Leverenz JB, Gan L, Hu J, Rosen-Zvi M, Pieper AA, Cummings J. Artificial intelligence and open science in discovery of disease-modifying medicines for Alzheimer's disease. Cell Rep Med 2024; 5:101379. [PMID: 38382465 PMCID: PMC10897520 DOI: 10.1016/j.xcrm.2023.101379] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2022] [Revised: 08/15/2023] [Accepted: 12/19/2023] [Indexed: 02/23/2024]
Abstract
The high failure rate of clinical trials in Alzheimer's disease (AD) and AD-related dementia (ADRD) is due to a lack of understanding of the pathophysiology of disease, and this deficit may be addressed by applying artificial intelligence (AI) to "big data" to rapidly and effectively expand therapeutic development efforts. Recent accelerations in computing power and availability of big data, including electronic health records and multi-omics profiles, have converged to provide opportunities for scientific discovery and treatment development. Here, we review the potential utility of applying AI approaches to big data for discovery of disease-modifying medicines for AD/ADRD. We illustrate how AI tools can be applied to the AD/ADRD drug development pipeline through collaborative efforts among neurologists, gerontologists, geneticists, pharmacologists, medicinal chemists, and computational scientists. AI and open data science expedite drug discovery and development of disease-modifying therapeutics for AD/ADRD and other neurodegenerative diseases.
Collapse
Affiliation(s)
- Feixiong Cheng
- Genomic Medicine Institute, Lerner Research Institute, Cleveland Clinic, Cleveland, OH 44195, USA; Cleveland Clinic Genome Center, Lerner Research Institute, Cleveland Clinic, Cleveland, OH 44195, USA; Department of Molecular Medicine, Cleveland Clinic Lerner College of Medicine, Case Western Reserve University, Cleveland, OH 44195, USA.
| | - Fei Wang
- Department of Population Health Sciences, Weill Cornell Medical College, Cornell University, New York, NY 10065, USA
| | - Jian Tang
- Mila-Quebec Institute for Learning Algorithms and CIFAR AI Research Chair, HEC Montreal, Montréal, QC H3T 2A7, Canada
| | - Yadi Zhou
- Genomic Medicine Institute, Lerner Research Institute, Cleveland Clinic, Cleveland, OH 44195, USA
| | - Zhimin Fu
- Genomic Medicine Institute, Lerner Research Institute, Cleveland Clinic, Cleveland, OH 44195, USA; College of Pharmacy, Northeast Ohio Medical University, Rootstown, OH 44272, USA
| | - Pengyue Zhang
- Department of Biostatistics and Health Data Science, Indiana University, Indianapolis, IN 46037, USA
| | - Jonathan L Haines
- Cleveland Institute for Computational Biology, and Department of Population & Quantitative Health Sciences, Case Western Reserve University, Cleveland, OH 44106, USA
| | - James B Leverenz
- Lou Ruvo Center for Brain Health, Neurological Institute, Cleveland Clinic, Cleveland, OH 44195, USA
| | - Li Gan
- Helen and Robert Appel Alzheimer's Disease Research Institute, Brain and Mind Research Institute, Weill Cornell Medicine, New York, NY 10021, USA
| | - Jianying Hu
- IBM Research, Yorktown Heights, New York, NY 10598, USA
| | - Michal Rosen-Zvi
- AI for Accelerated Healthcare and Life Sciences Discovery, IBM Research Labs, Haifa 3498825, Israel; Faculty of Medicine, The Hebrew University of Jerusalem, Jerusalem 9190500, Israel
| | - Andrew A Pieper
- Brain Health Medicines Center, Harrington Discovery Institute, University Hospitals Cleveland Medical Center, Cleveland, OH, 44106, USA; Department of Psychiatry, Case Western Reserve University, Cleveland, OH 44106, USA; Geriatric Psychiatry, GRECC, Louis Stokes Cleveland VA Medical Center, Cleveland, OH 44106, USA; Institute for Transformative Molecular Medicine, School of Medicine, Case Western Reserve University, Cleveland OH 44106, USA; Department of Pathology, Case Western Reserve University, School of Medicine, Cleveland, OH, 44106, USA; Department of Neurosciences, Case Western Reserve University, School of Medicine, Cleveland, OH 44106, USA
| | - Jeffrey Cummings
- Chambers-Grundy Center for Transformative Neuroscience, Department of Brain Health, School of Integrated Health Sciences, UNLV, Las Vegas, NV 89154, USA
| |
Collapse
|
5
|
Sun Z, Ware J, Dey S, Eyigoz E, Sathe S, Sampaio C, Hu J. Large-scale screening of clinical assessments to distinguish between states in the Integrated HD Progression Model (IHDPM). Front Aging Neurosci 2024; 16:1320755. [PMID: 38414632 PMCID: PMC10896990 DOI: 10.3389/fnagi.2024.1320755] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2023] [Accepted: 01/30/2024] [Indexed: 02/29/2024] Open
Abstract
Background Understanding the sensitivity and utility of clinical assessments across different HD stages is important for study/trial endpoint selection and clinical assessment development. The Integrated HD Progression Model (IHDPM) characterizes the complex symptom progression of HD and separates the disease into nine ordered disease states. Objective To generate a temporal map of discriminatory clinical measures across the IHDPM states. Methods We applied the IHDPM to all HD individuals in an integrated longitudinal HD dataset derived from four observational studies, obtaining disease state assignment for each study visit. Using large-scale screening, we estimated Cohen's effect sizes to rank the discriminative power of 2,472 clinical measures for separating observations in disease state pairs. Individual trajectories through IHDPM states were examined. Discriminative analyses were limited to individuals with observations in both states of the pairs compared (N = 3,790). Results Discriminative clinical measures were heterogeneous across the HD life course. UHDRS items were frequently identified as the best state pair discriminators, with UHDRS Motor items - most notably TMS - showing the highest discriminatory power between the early-disease states and early post-transition period states. UHDRS functional items emerged as strong discriminators from the transition period and on. Cognitive assessments showed good discriminative power between all state pairs examined, excepting state 1 vs. 2. Several non-UHDRS assessments were also flagged as excellent state discriminators for specific disease phases (e.g., SF-12). For certain state pairs, single assessment items other than total/summary scores were highlighted as having excellent discriminative power. Conclusion By providing ranked quantitative scores indicating discriminatory ability of thousands of clinical measures between specific pairs of IHDPM states, our results will aid clinical trial designers select the most effective outcome measures tailored to their study cohort. Our observations may also assist in the development of end points targeting specific phases in the disease life course, through providing specific conceptual foci.
Collapse
Affiliation(s)
- Zhaonan Sun
- IBM Research, Yorktown Heights, NY, United States
| | | | - Sanjoy Dey
- IBM Research, Yorktown Heights, NY, United States
| | - Elif Eyigoz
- IBM Research, Yorktown Heights, NY, United States
| | - Swati Sathe
- CHDI Management, Inc., Princeton, NJ, United States
| | | | - Jianying Hu
- IBM Research, Yorktown Heights, NY, United States
| |
Collapse
|
6
|
Zhang B, Zhang L, Chen Q, Jin Z, Liu S, Zhang S. Harnessing artificial intelligence to improve clinical trial design. COMMUNICATIONS MEDICINE 2023; 3:191. [PMID: 38129570 PMCID: PMC10739942 DOI: 10.1038/s43856-023-00425-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2022] [Accepted: 12/07/2023] [Indexed: 12/23/2023] Open
Abstract
Zhang et al. discuss how artificial intelligence (AI) can be used to optimize clinical trial design and potentially boost the success rate of clinical trials. AI has unparalleled potential to leverage real-world data and unlock valuable insights for innovative trial design.
Collapse
Affiliation(s)
- Bin Zhang
- The First Affiliated Hospital of Jinan University, Guangdong, Guangzhou, China
| | - Lu Zhang
- The First Affiliated Hospital of Jinan University, Guangdong, Guangzhou, China
| | - Qiuying Chen
- The First Affiliated Hospital of Jinan University, Guangdong, Guangzhou, China
| | - Zhe Jin
- The First Affiliated Hospital of Jinan University, Guangdong, Guangzhou, China
| | - Shuyi Liu
- The First Affiliated Hospital of Jinan University, Guangdong, Guangzhou, China
| | - Shuixing Zhang
- The First Affiliated Hospital of Jinan University, Guangdong, Guangzhou, China.
| |
Collapse
|
7
|
Ahmad M, Ríos-Anillo MR, Acosta-López JE, Cervantes-Henríquez ML, Martínez-Banfi M, Pineda-Alhucema W, Puentes-Rozo P, Sánchez-Barros C, Pinzón A, Patel HR, Vélez JI, Villarreal-Camacho JL, Pineda DA, Arcos-Burgos M, Sánchez-Rojas M. Uncovering the Genetic and Molecular Features of Huntington's Disease in Northern Colombia. Int J Mol Sci 2023; 24:16154. [PMID: 38003344 PMCID: PMC10671691 DOI: 10.3390/ijms242216154] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2023] [Revised: 10/26/2023] [Accepted: 10/30/2023] [Indexed: 11/26/2023] Open
Abstract
Huntington's disease (HD) is a genetic disorder caused by a CAG trinucleotide expansion in the huntingtin (HTT) gene. Juan de Acosta, Atlántico, a city located on the Caribbean coast of Colombia, is home to the world's second-largest HD pedigree. Here, we include 291 descendants of this pedigree with at least one family member with HD. Blood samples were collected, and genomic DNA was extracted. We quantified the HTT CAG expansion using an amplicon sequencing protocol. The genetic heterogeneity was measured as the ratio of the mosaicism allele's read peak and the slippage ratio of the allele's read peak from our sequence data. The statistical and bioinformatic analyses were performed with a significance threshold of p < 0.05. We found that the average HTT CAG repeat length in all participants was 21.91 (SD = 8.92). Of the 291 participants, 33 (11.3%, 18 females) had a positive molecular diagnosis for HD. Most affected individuals were adults, and the most common primary and secondary alleles were 17/7 (CAG/CCG) and 17/10 (CAG/CCG), respectively. The mosaicism increased with age in the participants with HD, while the slippage analyses revealed differences by the HD allele type only for the secondary allele. The slippage tended to increase with the HTT CAG repeat length in the participants with HD, but the increase was not statistically significant. This study analyzed the genetic and molecular features of 291 participants, including 33 with HD. We found that the mosaicism increased with age in the participants with HD, particularly for the secondary allele. The most common haplotype was 17/7_17/10. The slippage for the secondary allele varied by the HD allele type, but there was no significant difference in the slippage by sex. Our findings offer valuable insights into HD and could have implications for future research and clinical management.
Collapse
Affiliation(s)
- Mostapha Ahmad
- Facultad de Ciencias de la Salud, Universidad Simón Bolívar, Barranquilla 080002, Colombia
- Life Science Research Center, Universidad Simón Bolívar, Barranquilla 080002, Colombia
| | - Margarita R Ríos-Anillo
- Facultad de Ciencias de la Salud, Universidad Simón Bolívar, Barranquilla 080002, Colombia
- Life Science Research Center, Universidad Simón Bolívar, Barranquilla 080002, Colombia
- Médica Residente de Neurología, Universidad Simón Bolívar, Barranquilla 080002, Colombia
| | - Johan E Acosta-López
- Life Science Research Center, Universidad Simón Bolívar, Barranquilla 080002, Colombia
- Facultad de Ciencias Jurídicas y Sociales, Universidad Simón Bolívar, Barranquilla 080002, Colombia
| | - Martha L Cervantes-Henríquez
- Life Science Research Center, Universidad Simón Bolívar, Barranquilla 080002, Colombia
- Facultad de Ciencias Jurídicas y Sociales, Universidad Simón Bolívar, Barranquilla 080002, Colombia
| | - Martha Martínez-Banfi
- Life Science Research Center, Universidad Simón Bolívar, Barranquilla 080002, Colombia
- Facultad de Ciencias Jurídicas y Sociales, Universidad Simón Bolívar, Barranquilla 080002, Colombia
| | - Wilmar Pineda-Alhucema
- Life Science Research Center, Universidad Simón Bolívar, Barranquilla 080002, Colombia
- Facultad de Ciencias Jurídicas y Sociales, Universidad Simón Bolívar, Barranquilla 080002, Colombia
| | - Pedro Puentes-Rozo
- Facultad de Ciencias de la Salud, Universidad Simón Bolívar, Barranquilla 080002, Colombia
- Life Science Research Center, Universidad Simón Bolívar, Barranquilla 080002, Colombia
- Grupo de Neurociencias del Caribe, Universidad del Atlántico, Barranquilla 080001, Colombia
| | - Cristian Sánchez-Barros
- Facultad de Ciencias de la Salud, Universidad Simón Bolívar, Barranquilla 080002, Colombia
- Life Science Research Center, Universidad Simón Bolívar, Barranquilla 080002, Colombia
- Departamento de Neurofisiología Clínica Palma de Mallorca, Hospital Juaneda Miramar, Islas Baleares, 07011 Palma, Spain
| | - Andrés Pinzón
- Bioinformatics and Systems Biology Laboratory, Institute for Genetics, Universidad Nacional de Colombia, Bogota 111321, Colombia
| | - Hardip R Patel
- National Centre for Indigenous Genomics, John Curtin School of Medical Research, Australian National University, Canberra, ACT 2601, Australia
| | - Jorge I Vélez
- Department of Industrial Engineering, Universidad del Norte, Barranquilla 081007, Colombia
| | - José Luis Villarreal-Camacho
- Programa de Medicina, Facultad de Ciencias de la Salud, Universidad Libre Seccional Barranquilla, Barranquilla 081007, Colombia
| | - David A Pineda
- Grupo de Investigación en Neuropsicología y Conducta, Universidad de San Buenaventura, Medellin 050010, Colombia
- Grupo de Neurociencias de Antioquia, Universidad de Antioquia, Medellin 050010, Colombia
| | - Mauricio Arcos-Burgos
- Grupo de Investigación en Psiquiatría (GIPSI), Departamento de Psiquiatría, Instituto de Investigaciones Médicas, Facultad de Medicina, Universidad de Antioquia, Medellin 050010, Colombia
| | - Manuel Sánchez-Rojas
- Facultad de Ciencias de la Salud, Universidad Simón Bolívar, Barranquilla 080002, Colombia
| |
Collapse
|
8
|
Ouwerkerk J, Feleus S, van der Zwaan KF, Li Y, Roos M, van Roon-Mom WMC, de Bot ST, Wolstencroft KJ, Mina E. Machine learning in Huntington's disease: exploring the Enroll-HD dataset for prognosis and driving capability prediction. Orphanet J Rare Dis 2023; 18:218. [PMID: 37501188 PMCID: PMC10375780 DOI: 10.1186/s13023-023-02785-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2023] [Accepted: 06/18/2023] [Indexed: 07/29/2023] Open
Abstract
BACKGROUND In biomedicine, machine learning (ML) has proven beneficial for the prognosis and diagnosis of different diseases, including cancer and neurodegenerative disorders. For rare diseases, however, the requirement for large datasets often prevents this approach. Huntington's disease (HD) is a rare neurodegenerative disorder caused by a CAG repeat expansion in the coding region of the huntingtin gene. The world's largest observational study for HD, Enroll-HD, describes over 21,000 participants. As such, Enroll-HD is amenable to ML methods. In this study, we pre-processed and imputed Enroll-HD with ML methods to maximise the inclusion of participants and variables. With this dataset we developed models to improve the prediction of the age at onset (AAO) and compared it to the well-established Langbehn formula. In addition, we used recurrent neural networks (RNNs) to demonstrate the utility of ML methods for longitudinal datasets, assessing driving capabilities by learning from previous participant assessments. RESULTS Simple pre-processing imputed around 42% of missing values in Enroll-HD. Also, 167 variables were retained as a result of imputing with ML. We found that multiple ML models were able to outperform the Langbehn formula. The best ML model (light gradient boosting machine) improved the prognosis of AAO compared to the Langbehn formula by 9.2%, based on root mean squared error in the test set. In addition, our ML model provides more accurate prognosis for a wider CAG repeat range compared to the Langbehn formula. Driving capability was predicted with an accuracy of 85.2%. The resulting pre-processing workflow and code to train the ML models are available to be used for related HD predictions at: https://github.com/JasperO98/hdml/tree/main . CONCLUSIONS Our pre-processing workflow made it possible to resolve the missing values and include most participants and variables in Enroll-HD. We show the added value of a ML approach, which improved AAO predictions and allowed for the development of an advisory model that can assist clinicians and participants in estimating future driving capability.
Collapse
Affiliation(s)
- Jasper Ouwerkerk
- Department of Pathology and Clinical Bioinformatics, Erasmus Medical Center (EMC), Wytemaweg, 3015 CN, Rotterdam, The Netherlands
| | - Stephanie Feleus
- Department of Neurology, Leiden University Medical Center (LUMC), PO Box 9600, 2300 RC, Leiden, The Netherlands
- Department of Clinical Epidemiology, Leiden University Medical Center (LUMC), PO Box 9600, 2300 RC, Leiden, The Netherlands
| | - Kasper F van der Zwaan
- Department of Neurology, Leiden University Medical Center (LUMC), PO Box 9600, 2300 RC, Leiden, The Netherlands
| | - Yunlei Li
- Department of Pathology and Clinical Bioinformatics, Erasmus Medical Center (EMC), Wytemaweg, 3015 CN, Rotterdam, The Netherlands
| | - Marco Roos
- Department of Human Genetics, Leiden University Medical Center (LUMC), PO Box 9600, 2300 RC, Leiden, The Netherlands
| | - Willeke M C van Roon-Mom
- Department of Human Genetics, Leiden University Medical Center (LUMC), PO Box 9600, 2300 RC, Leiden, The Netherlands
| | - Susanne T de Bot
- Department of Neurology, Leiden University Medical Center (LUMC), PO Box 9600, 2300 RC, Leiden, The Netherlands
| | - Katherine J Wolstencroft
- Leiden Institute of Advanced Computer Science (LIACS), Leiden University, Niels Bohrweg 1, 2333 CA, Leiden, The Netherlands
| | - Eleni Mina
- Department of Human Genetics, Leiden University Medical Center (LUMC), PO Box 9600, 2300 RC, Leiden, The Netherlands.
| |
Collapse
|
9
|
Barrett JS, Goyal RK, Gobburu J, Baran S, Varshney J. An AI Approach to Generating MIDD Assets Across the Drug Development Continuum. AAPS J 2023; 25:70. [PMID: 37430126 DOI: 10.1208/s12248-023-00838-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2023] [Accepted: 06/22/2023] [Indexed: 07/12/2023] Open
Abstract
Model-informed drug development involves developing and applying exposure-based, biological, and statistical models derived from preclinical and clinical data sources to inform drug development and decision-making. Discrete models are generated from individual experiments resulting in a single model expression that is utilized to inform a single stage-gate decision. Other model types provide a more holistic view of disease biology and potentially disease progression depending on the appropriateness of the underlying data sources for that purpose. Despite this awareness, most data integration and model development approaches are still reliant on internal (within company) data stores and traditional structural model types. An AI/ML-based MIDD approach relies on more diverse data and is informed by past successes and failures including data outside a host company (external data sources) that may enhance predictive value and enhance data generated by the sponsor to reflect more informed and timely experimentation. The AI/ML methodology also provides a complementary approach to more traditional modeling efforts that support MIDD and thus yields greater fidelity in decision-making. Early pilot studies support this assessment but will require broader adoption and regulatory support for more evidence and refinement of this paradigm. An AI/ML-based approach to MIDD has the potential to transform regulatory science and the current drug development paradigm, optimize information value, and increase candidate and eventually product confidence with respect to safety and efficacy. We highlight early experiences with this approach using the AI compute platforms as representative examples of how MIDD can be facilitated with an AI/ML approach.
Collapse
Affiliation(s)
- Jeffrey S Barrett
- Aridhia Bioinformatics, 163 Bath Street, Glasgow, Scotland, G2 4SQ, UK.
| | - Rahul K Goyal
- Center for Translational Medicine, University of Maryland Baltimore, Baltimore, Maryland, USA
| | - Jogarao Gobburu
- Center for Translational Medicine, University of Maryland Baltimore, Baltimore, Maryland, USA
- Pumas-AI, Baltimore, Maryland, USA
| | | | | |
Collapse
|
10
|
Eskofier BM, Klucken J. Predictive Models for Health Deterioration: Understanding Disease Pathways for Personalized Medicine. Annu Rev Biomed Eng 2023; 25:131-156. [PMID: 36854259 DOI: 10.1146/annurev-bioeng-110220-030247] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/02/2023]
Abstract
Artificial intelligence (AI) and machine learning (ML) methods are currently widely employed in medicine and healthcare. A PubMed search returns more than 100,000 articles on these topics published between 2018 and 2022 alone. Notwithstanding several recent reviews in various subfields of AI and ML in medicine, we have yet to see a comprehensive review around the methods' use in longitudinal analysis and prediction of an individual patient's health status within a personalized disease pathway. This review seeks to fill that gap. After an overview of the AI and ML methods employed in this field and of specific medical applications of models of this type, the review discusses the strengths and limitations of current studies and looks ahead to future strands of research in this field. We aim to enable interested readers to gain a detailed impression of the research currently available and accordingly plan future work around predictive models for deterioration in health status.
Collapse
Affiliation(s)
- Bjoern M Eskofier
- Machine Learning and Data Analytics Lab, Department of Artificial Intelligence in Biomedical Engineering, Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen, Germany;
| | - Jochen Klucken
- Digital Medicine Group, Luxembourg Centre for Systems Biomedicine, Université du Luxembourg, Belvaux, Luxembourg
- Digital Medicine Group, Department of Precision Health, Luxembourg Institute of Health, Strassen, Luxembourg
- Centre Hospitalier de Luxembourg, Luxembourg City, Luxembourg
| |
Collapse
|
11
|
Sturchio A, Duker AP, Muñoz-Sanjuan I, Espay AJ. Subtyping monogenic disorders: Huntington disease. HANDBOOK OF CLINICAL NEUROLOGY 2023; 193:171-184. [PMID: 36803810 DOI: 10.1016/b978-0-323-85555-6.00003-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/18/2023]
Abstract
Huntington disease is a highly disabling neurodegenerative disease characterized by psychiatric, cognitive, and motor deficits. The causal genetic mutation in huntingtin (Htt, also known as IT15), located on chromosome 4p16.3, leads to an expansion of a triplet coding for polyglutamine. The expansion is invariably associated with the disease when >39 repeats. Htt encodes for the protein huntingtin (HTT), which carries out many essential biological functions in the cell, in particular in the nervous system. The precise mechanism of toxicity is not known. Based on a one-gene-one-disease framework, the prevailing hypothesis ascribes toxicity to the universal aggregation of HTT. However, the aggregation process into mutant huntingtin (mHTT) is associated with a reduction of the levels of wild-type HTT. A loss of wild-type HTT may plausibly be pathogenic, contributing to the disease onset and progressive neurodegeneration. Moreover, many other biological pathways are altered in Huntington disease, such as in the autophagic system, mitochondria, and essential proteins beyond HTT, potentially explaining biological and clinical differences among affected individuals. As one gene does not mean one disease, future efforts at identifying specific Huntington subtypes are important to design biologically tailored therapeutic approaches that correct the corresponding biological pathways-rather than continuing to exclusively target the common denominator of HTT aggregation for elimination.
Collapse
Affiliation(s)
- Andrea Sturchio
- James J. and Joan A. Gardner Family Center for Parkinson's disease and Movement Disorders, Department of Neurology, University of Cincinnati, Cincinnati, OH, United States; Department of Clinical Neuroscience, Neuro Svenningsson, Karolinska Institutet, Stockholm, Sweden.
| | - Andrew P Duker
- James J. and Joan A. Gardner Family Center for Parkinson's disease and Movement Disorders, Department of Neurology, University of Cincinnati, Cincinnati, OH, United States
| | | | - Alberto J Espay
- James J. and Joan A. Gardner Family Center for Parkinson's disease and Movement Disorders, Department of Neurology, University of Cincinnati, Cincinnati, OH, United States.
| |
Collapse
|
12
|
de Oliveira CM, Leotti VB, Cappelli AH, Rocha AG, Ecco G, Bolzan G, Kersting N, Saraiva-Pereira ML, Jardim LB. Progression of Clinical and Eye Movement Markers in Preataxic Carriers of Machado-Joseph Disease. Mov Disord 2023; 38:26-34. [PMID: 36129443 DOI: 10.1002/mds.29226] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2022] [Revised: 08/22/2022] [Accepted: 08/31/2022] [Indexed: 01/21/2023] Open
Abstract
BACKGROUND Little is known about preclinical stages of Machado-Joseph disease, a polyglutamine disorder characterized by progressive adult-onset ataxia. OBJECTIVE We aimed to describe the longitudinal progression of clinical and oculomotor variables in the preataxic phase of disease. METHODS Carriers and noncarriers were assessed at three visits. Preataxic carriers (Scale for Assessment and Rating of Ataxia score < 3) expected to start ataxia in ≤4 years were considered near onset (PAN). Progressions of ataxic and preataxic carriers, considering status at the end of the study, were described according to the start (or its prediction) of gait ataxia (TimeToAfterOnset) and according to the study time. RESULTS A total of 35 ataxics, 38 preataxics, and 22 noncarriers were included. The "TimeToAfterOnset" timeline showed that Neurological Examination Scale for Spinocerebellar Ataxias (NESSCA; effect size, 0.09), Inventory of Non-Ataxia Symptoms (INAS0.07), and the vestibulo-ocular reflex gain (0.12) progressed in preataxic carriers, and that most slopes accelerate in PAN, turning similar to those of ataxics. In the study time, NESSCA (1.36) and vertical pursuit gain (1.17) significantly worsened in PAN, and 6 of 11 PANs converted to ataxia. For a clinical trial with 80% power and 2-year duration, 57 PANs are needed in each study arm to detect a 50% reduction in the conversion rate. CONCLUSIONS NESSCA, INAS, vestibulo-ocular reflex, and vertical pursuit gains significantly worsened in the preataxic phase. The "TimeToAfterOnset" timeline unveiled that slopes of most variables are small in preataxics but increase and reach the ataxic slopes from 4 years before the onset of ataxia. For future trials in preataxic carriers, we recommend recruiting PANs and using the conversion rate as the primary outcome. © 2022 International Parkinson and Movement Disorder Society.
Collapse
Affiliation(s)
- Camila Maria de Oliveira
- Programa de Pós-Graduação em Ciências Médicas, Universidade Federal do Rio Grande do Sul, Porto Alegre, Brazil.,Centros de Pesquisa Clínica e Experimental, Hospital de Clínicas de Porto Alegre, Porto Alegre, Brazil
| | - Vanessa Bielefeldt Leotti
- Departamento de Estatística, Universidade Federal do Rio Grande do Sul, Porto Alegre, Brazil.,Programa de Pós-Graduação em Epidemiologia, Universidade Federal do Rio Grande do Sul, Porto Alegre, Brazil
| | - Amanda Henz Cappelli
- Faculdade de Medicina, Universidade Federal do Rio Grande do Sul, Porto Alegre, Brazil
| | | | - Gabriela Ecco
- Faculdade de Medicina, Universidade Federal do Rio Grande do Sul, Porto Alegre, Brazil
| | - Gabriela Bolzan
- Centros de Pesquisa Clínica e Experimental, Hospital de Clínicas de Porto Alegre, Porto Alegre, Brazil.,Programa de Pós-Graduação em Genética e Biologia Molecular, Universidade Federal do Rio Grande do Sul, Porto Alegre, Brazil
| | - Nathalia Kersting
- Programa de Pós-Graduação em Ciências Médicas, Universidade Federal do Rio Grande do Sul, Porto Alegre, Brazil.,Centros de Pesquisa Clínica e Experimental, Hospital de Clínicas de Porto Alegre, Porto Alegre, Brazil
| | - Maria-Luiza Saraiva-Pereira
- Centros de Pesquisa Clínica e Experimental, Hospital de Clínicas de Porto Alegre, Porto Alegre, Brazil.,Programa de Pós-Graduação em Genética e Biologia Molecular, Universidade Federal do Rio Grande do Sul, Porto Alegre, Brazil.,Serviço de Genética Médica, Hospital de Clínicas de Porto Alegre, Porto Alegre, Brazil.,Departamento de Bioquímica, Universidade Federal do Rio Grande do Sul, Porto Alegre, Brazil
| | - Laura Bannach Jardim
- Programa de Pós-Graduação em Ciências Médicas, Universidade Federal do Rio Grande do Sul, Porto Alegre, Brazil.,Centros de Pesquisa Clínica e Experimental, Hospital de Clínicas de Porto Alegre, Porto Alegre, Brazil.,Faculdade de Medicina, Universidade Federal do Rio Grande do Sul, Porto Alegre, Brazil.,Programa de Pós-Graduação em Genética e Biologia Molecular, Universidade Federal do Rio Grande do Sul, Porto Alegre, Brazil.,Serviço de Genética Médica, Hospital de Clínicas de Porto Alegre, Porto Alegre, Brazil.,Departamento de Medicina Interna, Universidade Federal do Rio Grande do Sul, Porto Alegre, Brazil
| |
Collapse
|
13
|
Mohan A, Sun Z, Ghosh S, Li Y, Sathe S, Hu J, Sampaio C. Corrections to “A Machine‐Learning Derived Huntington's Disease Progression Model: Insights for Clinical Trial Design”. Mov Disord 2022; 37:2468. [DOI: 10.1002/mds.29259] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2022] [Accepted: 09/20/2022] [Indexed: 12/23/2022] Open
Affiliation(s)
- Amrita Mohan
- CHDI Management/CHDI Foundation Princeton New Jersey USA
| | - Zhaonan Sun
- Center for Computational Health IBM Research Yorktown Heights New York USA
| | - Soumya Ghosh
- Center for Computational Health IBM Research Yorktown Heights New York USA
| | - Ying Li
- Center for Computational Health IBM Research Yorktown Heights New York USA
| | - Swati Sathe
- CHDI Management/CHDI Foundation Princeton New Jersey USA
| | - Jianying Hu
- Center for Computational Health IBM Research Yorktown Heights New York USA
| | - Cristina Sampaio
- CHDI Management/CHDI Foundation Princeton New Jersey USA
- Faculty of Medicine Laboratory of Clinical Pharmacology and Therapeutics Lisbon Portugal
| |
Collapse
|
14
|
Liu J, Barrett JS, Leonardi ET, Lee L, Roychoudhury S, Chen Y, Trifillis P. Natural History and Real-World Data in Rare Diseases: Applications, Limitations, and Future Perspectives. J Clin Pharmacol 2022; 62 Suppl 2:S38-S55. [PMID: 36461748 PMCID: PMC10107901 DOI: 10.1002/jcph.2134] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2022] [Accepted: 07/28/2022] [Indexed: 12/04/2022]
Abstract
Rare diseases represent a highly heterogeneous group of disorders with high phenotypic and genotypic diversity within individual conditions. Due to the small numbers of people affected, there are unique challenges in understanding rare diseases and drug development for these conditions, including patient identification and recruitment, trial design, and costs. Natural history data and real-world data (RWD) play significant roles in defining and characterizing disease progression, final patient populations, novel biomarkers, genetic relationships, and treatment effects. This review provides an introduction to rare diseases, natural history data, RWD, and real-world evidence, the respective sources and applications of these data in several rare diseases. Considerations for data quality and limitations when using natural history and RWD are also elaborated. Opportunities are highlighted for cross-sector collaboration, standardized and high-quality data collection using new technologies, and more comprehensive evidence generation using quantitative approaches such as disease progression modeling, artificial intelligence, and machine learning. Advanced statistical approaches to integrate natural history data and RWD to further disease understanding and guide more efficient clinical study design and data analysis in drug development in rare diseases are also discussed.
Collapse
Affiliation(s)
- Jing Liu
- Pfizer, Inc., Groton, Connecticut, USA
| | - Jeffrey S Barrett
- Critical Path Institute, Rare Disease Cures Accelerator Data Analytics Platform, Tucson, Arizona, USA
| | | | - Lucy Lee
- PTC Therapeutics, Inc., South Plainfield, New Jersey, USA
| | | | - Yong Chen
- Pfizer, Inc., Groton, Connecticut, USA
| | | |
Collapse
|
15
|
Mestre TA. Using Big Data in Movement Disorders: Disease States and Progression in Huntington's Disease. Mov Disord 2022; 37:441-443. [PMID: 35315555 DOI: 10.1002/mds.28943] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2022] [Accepted: 01/06/2022] [Indexed: 11/11/2022] Open
Affiliation(s)
- Tiago A Mestre
- University of Ottawa Brain and Mind Research Institute, Ottawa, Ontario, Canada.,Ottawa Hospital Research Institute, Ottawa, Ontario, Canada.,Division of Neurology, Department of Medicine, University of Ottawa, The Ottawa Hospital Ottawa, Ottawa, Ontario, Canada
| |
Collapse
|
16
|
Drew CJG, Busse M. Considerations for clinical trial design and conduct in the evaluation of novel advanced therapeutics in neurodegenerative disease. INTERNATIONAL REVIEW OF NEUROBIOLOGY 2022; 166:235-279. [PMID: 36424094 DOI: 10.1016/bs.irn.2022.09.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
The recent advances in the development of potentially disease modifying cell and gene therapies for neurodegenerative disease has resulted in the production of a number of promising novel therapies which are now moving forward to clinical evaluation. The robust evaluation of these therapies pose a significant number of challenges when compared to more traditional evaluations of pharmacotherapy, which is the current mainstay of neurodegenerative disease symptom management. Indeed, there is an inherent complexity in the design and conduct of these trials at multiple levels. Here we discuss specific aspects requiring consideration in the context of investigating novel cell and gene therapies for neurodegenerative disease. This extends to overarching trial designs that could be employed and the factors that underpin design choices such outcome assessments, participant selection and methods for delivery of cell and gene therapies. We explore methods of data collection that may improve efficiency in trials of cell and gene therapy to maximize data sharing and collaboration. Lastly, we explore some of the additional context beyond efficacy evaluations that should be considered to ensure implementation across relevant healthcare settings.
Collapse
Affiliation(s)
- Cheney J G Drew
- Centre For Trials Research, Cardiff University, Cardiff, United Kingdom; Brain Repair and Intracranial Neurotherapeutics Unit (BRAIN), College of Biomedical and Life Sciences, Cardiff University, Cardiff, United Kingdom.
| | - Monica Busse
- Centre For Trials Research, Cardiff University, Cardiff, United Kingdom; Brain Repair and Intracranial Neurotherapeutics Unit (BRAIN), College of Biomedical and Life Sciences, Cardiff University, Cardiff, United Kingdom
| |
Collapse
|
17
|
Lempriere S. A machine learning model of HD progression. Nat Rev Neurol 2021; 18:65. [PMID: 34931023 DOI: 10.1038/s41582-021-00608-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|