1
|
Chhoa H, Chabriat H, Chevret S, Biard L. Comparison of models for stroke-free survival prediction in patients with CADASIL. Sci Rep 2023; 13:22443. [PMID: 38105268 PMCID: PMC10725863 DOI: 10.1038/s41598-023-49552-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2023] [Accepted: 12/09/2023] [Indexed: 12/19/2023] Open
Abstract
Cerebral autosomal dominant arteriopathy with subcortical infarcts and leukoencephalopathy, which is caused by mutations of the NOTCH3 gene, has a large heterogeneous progression, presenting with declines of various clinical scores and occurrences of various clinical event. To help assess disease progression, this work focused on predicting the composite endpoint of stroke-free survival time by comparing the performance of Cox proportional hazards regression to that of machine learning models using one of four feature selection approaches applied to demographic, clinical and magnetic resonance imaging observational data collected from a study cohort of 482 patients. The quality of the modeling process and the predictive performance were evaluated in a nested cross-validation procedure using the time-dependent Brier Score and AUC at 5 years from baseline, the former measuring the overall performance including calibration and the latter highlighting the discrimination ability, with both metrics taking into account the presence of right-censoring. The best model for each metric was the componentwise gradient boosting model with a mean Brier score of 0.165 and the random survival forest model with a mean AUC of 0.773, both combined with the LASSO feature selection method.
Collapse
Affiliation(s)
- Henri Chhoa
- ECSTRRA Team, Université Paris Cité, UMR1153, INSERM, Paris, France
| | - Hugues Chabriat
- Centre NeuroVasculaire Translationnel - Centre de Référence CERVCO, DMU NeuroSciences, Hôpital Lariboisière, GHU APHP-Nord, Université Paris Cité, Paris, France
- INSERM NeuroDiderot UMR 1141, GenMedStroke Team, Paris, France
| | - Sylvie Chevret
- ECSTRRA Team, Université Paris Cité, UMR1153, INSERM, Paris, France
| | - Lucie Biard
- ECSTRRA Team, Université Paris Cité, UMR1153, INSERM, Paris, France.
| |
Collapse
|
2
|
Tharmakulasingam M, Gardner B, La Ragione R, Fernando A. Rectified Classifier Chains for Prediction of Antibiotic Resistance From Multi-Labelled Data With Missing Labels. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2023; 20:625-636. [PMID: 35130168 DOI: 10.1109/tcbb.2022.3148577] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
Predicting Antimicrobial Resistance (AMR) from genomic data has important implications for human and animal healthcare, and especially given its potential for more rapid diagnostics and informed treatment choices. With the recent advances in sequencing technologies, applying machine learning techniques for AMR prediction have indicated promising results. Despite this, there are shortcomings in the literature concerning methodologies suitable for multi-drug AMR prediction and especially where samples with missing labels exist. To address this shortcoming, we introduce a Rectified Classifier Chain (RCC) method for predicting multi-drug resistance. This RCC method was tested using annotated features of genomics sequences and compared with similar multi-label classification methodologies. We found that applying the eXtreme Gradient Boosting (XGBoost) base model to our RCC model outperformed the second-best model, XGBoost based binary relevance model, by 3.3% in Hamming accuracy and 7.8% in F1-score. Additionally, we note that in the literature machine learning models applied to AMR prediction typically are unsuitable for identifying biomarkers informative of their decisions; in this study, we show that biomarkers contributing to AMR prediction can also be identified using the proposed RCC method. We expect this can facilitate genome annotation and pave the path towards identifying new biomarkers indicative of AMR.
Collapse
|
3
|
Hu H, Laskin J. Emerging Computational Methods in Mass Spectrometry Imaging. ADVANCED SCIENCE (WEINHEIM, BADEN-WURTTEMBERG, GERMANY) 2022; 9:e2203339. [PMID: 36253139 PMCID: PMC9731724 DOI: 10.1002/advs.202203339] [Citation(s) in RCA: 20] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/07/2022] [Revised: 09/17/2022] [Indexed: 05/10/2023]
Abstract
Mass spectrometry imaging (MSI) is a powerful analytical technique that generates maps of hundreds of molecules in biological samples with high sensitivity and molecular specificity. Advanced MSI platforms with capability of high-spatial resolution and high-throughput acquisition generate vast amount of data, which necessitates the development of computational tools for MSI data analysis. In addition, computation-driven MSI experiments have recently emerged as enabling technologies for further improving the MSI capabilities with little or no hardware modification. This review provides a critical summary of computational methods and resources developed for MSI data analysis and interpretation along with computational approaches for improving throughput and molecular coverage in MSI experiments. This review is focused on the recently developed artificial intelligence methods and provides an outlook for a future paradigm shift in MSI with transformative computational methods.
Collapse
Affiliation(s)
- Hang Hu
- Department of ChemistryPurdue University560 Oval DriveWest LafayetteIN47907USA
| | - Julia Laskin
- Department of ChemistryPurdue University560 Oval DriveWest LafayetteIN47907USA
| |
Collapse
|
4
|
D’Amato Figueiredo MV, Alexiou GA, Vartholomatos G, Rehder R. Advances in Intraoperative Flow Cytometry. Int J Mol Sci 2022; 23:ijms232113430. [PMID: 36362215 PMCID: PMC9655491 DOI: 10.3390/ijms232113430] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2022] [Revised: 10/25/2022] [Accepted: 10/31/2022] [Indexed: 11/06/2022] Open
Abstract
Flow cytometry is the gold-standard laser-based technique to measure and analyze fluorescence levels of immunostaining and DNA content in individual cells. It provides a valuable tool to assess cells in the G0/G1, S, and G2/M phases, and those with polyploidy, which holds prognostic significance. Frozen section analysis is the standard intraoperative assessment for tumor margin evaluation and tumor resection. Here, we present flow cytometry as a promising technique for intraoperative tumor analysis in different pathologies, including brain tumors, leptomeningeal dissemination, breast cancer, head and neck cancer, pancreatic tumor, and hepatic cancer. Flow cytometry is a valuable tool that can provide substantial information on tumor analysis and, consequently, maximize cancer treatment and expedite patients’ survival.
Collapse
Affiliation(s)
- Marcos V. D’Amato Figueiredo
- Department of Neurosurgery, Hospital Estadual Mario Covas, Santo Andre 09190-615, Brazil
- Department of Neurosurgery, Hospital do Coracao, Sao Paulo 04004-030, Brazil
| | - George A. Alexiou
- Department of Neurosurgery, School of Medicine, University of Ioannina, 45500 Ioannina, Greece
- Neurosurgical Institute, University of Ioannina, 45500 Ioannina, Greece
- Correspondence: ; Tel.: +30-6948-525134
| | - George Vartholomatos
- Neurosurgical Institute, University of Ioannina, 45500 Ioannina, Greece
- Hematology Laboratory, Unit of Molecular Biology and Translational Flow Cytometry, University Hospital of Ioannina, 45500 Ioannina, Greece
| | - Roberta Rehder
- Department of Neurosurgery, Hospital do Coracao, Sao Paulo 04004-030, Brazil
- Department of Neurosurgery, Division of Pediatric Neurosurgery, Hospital Santa Marcelina, Sao Paulo 08270-070, Brazil
| |
Collapse
|
5
|
Kamboj S, Rajput A, Rastogi A, Thakur A, Kumar M. Targeting non-structural proteins of Hepatitis C virus for predicting repurposed drugs using QSAR and machine learning approaches. Comput Struct Biotechnol J 2022; 20:3422-3438. [PMID: 35832613 PMCID: PMC9271984 DOI: 10.1016/j.csbj.2022.06.060] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2022] [Revised: 06/27/2022] [Accepted: 06/27/2022] [Indexed: 11/24/2022] Open
Abstract
Hepatitis C virus (HCV) infection causes viral hepatitis leading to hepatocellular carcinoma. Despite the clinical use of direct-acting antivirals (DAAs) still there is treatment failure in 5-10% cases. Therefore, it is crucial to develop new antivirals against HCV. In this endeavor, we developed the "Anti-HCV" platform using machine learning and quantitative structure-activity relationship (QSAR) approaches to predict repurposed drugs targeting HCV non-structural (NS) proteins. We retrieved experimentally validated small molecules from the ChEMBL database with bioactivity (IC50/EC50) against HCV NS3 (454), NS3/4A (495), NS5A (494) and NS5B (1671) proteins. These unique compounds were divided into training/testing and independent validation datasets. Relevant molecular descriptors and fingerprints were selected using a recursive feature elimination algorithm. Different machine learning techniques viz. support vector machine, k-nearest neighbour, artificial neural network, and random forest were used to develop the predictive models. We achieved Pearson's correlation coefficients from 0.80 to 0.92 during 10-fold cross validation and similar performance on independent datasets using the best developed models. The robustness and reliability of developed predictive models were also supported by applicability domain, chemical diversity and decoy datasets analyses. The "Anti-HCV" predictive models were used to identify potential repurposing drugs. Representative candidates were further validated by molecular docking which displayed high binding affinities. Hence, this study identified promising repurposed drugs viz. naftifine, butalbital (NS3), vinorelbine, epicriptine (NS3/4A), pipecuronium, trimethaphan (NS5A), olodaterol and vemurafenib (NS5B) etc. targeting HCV NS proteins. These potential repurposed drugs may prove useful in antiviral drug development against HCV.
Collapse
Affiliation(s)
- Sakshi Kamboj
- Virology Unit and Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research (CSIR), Sector 39A, Chandigarh 160036, India
- Academy of Scientific and Innovative Research (AcSIR), Ghaziabad 201002, India
| | - Akanksha Rajput
- Virology Unit and Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research (CSIR), Sector 39A, Chandigarh 160036, India
| | - Amber Rastogi
- Virology Unit and Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research (CSIR), Sector 39A, Chandigarh 160036, India
- Academy of Scientific and Innovative Research (AcSIR), Ghaziabad 201002, India
| | - Anamika Thakur
- Virology Unit and Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research (CSIR), Sector 39A, Chandigarh 160036, India
- Academy of Scientific and Innovative Research (AcSIR), Ghaziabad 201002, India
| | - Manoj Kumar
- Virology Unit and Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research (CSIR), Sector 39A, Chandigarh 160036, India
- Academy of Scientific and Innovative Research (AcSIR), Ghaziabad 201002, India
| |
Collapse
|
6
|
Rajput A, Thakur A, Mukhopadhyay A, Kamboj S, Rastogi A, Gautam S, Jassal H, Kumar M. Prediction of repurposed drugs for Coronaviruses using artificial intelligence and machine learning. Comput Struct Biotechnol J 2021; 19:3133-3148. [PMID: 34055238 PMCID: PMC8141697 DOI: 10.1016/j.csbj.2021.05.037] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2021] [Revised: 05/18/2021] [Accepted: 05/20/2021] [Indexed: 02/06/2023] Open
Abstract
The world is facing the COVID-19 pandemic caused by Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2). Likewise, other viruses of the Coronaviridae family were responsible for causing epidemics earlier. To tackle these viruses, there is a lack of approved antiviral drugs. Therefore, we have developed robust computational methods to predict the repurposed drugs using machine learning techniques namely Support Vector Machine, Random Forest, k-Nearest Neighbour, Artificial Neural Network, and Deep Learning. We used the experimentally validated drugs/chemicals with anticorona activity (IC50/EC50) from 'DrugRepV' repository. The unique entries of SARS-CoV-2 (142), SARS (221), MERS (123), and overall Coronaviruses (414) were subdivided into the training/testing and independent validation datasets, followed by the extraction of chemical/structural descriptors and fingerprints (17968). The highly relevant features were filtered using the recursive feature selection algorithm. The selected chemical descriptors were used to develop prediction models with Pearson's correlation coefficients ranging from 0.60 to 0.90 on training/testing. The robustness of the predictive models was further ensured using external independent validation datasets, decoy datasets, applicability domain, and chemical analyses. The developed models were used to predict promising repurposed drug candidates against coronaviruses after scanning the DrugBank. Top predicted molecules for SARS-CoV-2 were further validated by molecular docking against the spike protein complex with ACE receptor. We found potential repurposed drugs namely Verteporfin, Alatrofloxacin, Metergoline, Rescinnamine, Leuprolide, and Telotristat ethyl with high binding affinity. These 'anticorona' computational models would assist in antiviral drug discovery against SARS-CoV-2 and other Coronaviruses.
Collapse
Affiliation(s)
- Akanksha Rajput
- Virology Unit and Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research (CSIR), Sector 39-A, Chandigarh 160036, India
| | - Anamika Thakur
- Virology Unit and Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research (CSIR), Sector 39-A, Chandigarh 160036, India
- Academy of Scientific and Innovative Research (AcSIR), Ghaziabad 201002, India
| | - Adhip Mukhopadhyay
- Virology Unit and Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research (CSIR), Sector 39-A, Chandigarh 160036, India
- Academy of Scientific and Innovative Research (AcSIR), Ghaziabad 201002, India
| | - Sakshi Kamboj
- Virology Unit and Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research (CSIR), Sector 39-A, Chandigarh 160036, India
- Academy of Scientific and Innovative Research (AcSIR), Ghaziabad 201002, India
| | - Amber Rastogi
- Virology Unit and Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research (CSIR), Sector 39-A, Chandigarh 160036, India
- Academy of Scientific and Innovative Research (AcSIR), Ghaziabad 201002, India
| | - Sakshi Gautam
- Virology Unit and Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research (CSIR), Sector 39-A, Chandigarh 160036, India
- Academy of Scientific and Innovative Research (AcSIR), Ghaziabad 201002, India
| | - Harvinder Jassal
- Virology Unit and Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research (CSIR), Sector 39-A, Chandigarh 160036, India
| | - Manoj Kumar
- Virology Unit and Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research (CSIR), Sector 39-A, Chandigarh 160036, India
- Academy of Scientific and Innovative Research (AcSIR), Ghaziabad 201002, India
| |
Collapse
|
7
|
Fan Z, Xue W, Li L, Zhang C, Lu J, Zhai Y, Suo Z, Zhao J. Identification of an early diagnostic biomarker of lung adenocarcinoma based on co-expression similarity and construction of a diagnostic model. J Transl Med 2018; 16:205. [PMID: 30029648 PMCID: PMC6053739 DOI: 10.1186/s12967-018-1577-5] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2018] [Accepted: 07/13/2018] [Indexed: 12/13/2022] Open
Abstract
Background The purpose of this study was to achieve early and accurate diagnosis of lung cancer and long-term monitoring of the therapeutic response. Methods We downloaded GSE20189 from GEO database as analysis data. We also downloaded human lung adenocarcinoma RNA-seq transcriptome expression data from the TCGA database as validation data. Finally, the expression of all of the genes underwent z test normalization. We used ANOVA to identify differentially expressed genes specific to each stage, as well as the intersection between them. Two methods, correlation analysis and co-expression network analysis, were used to compare the expression patterns and topological properties of each stage. Using the functional quantification algorithm, we evaluated the functional level of each significantly enriched biological function under different stages. A machine-learning algorithm was used to screen out significant functions as features and to establish an early diagnosis model. Finally, survival analysis was used to verify the correlation between the outcome and the biomarkers that we found. Results We screened 12 significant biomarkers that could distinguish lung cancer patients with diverse risks. Patients carrying variations in these 12 genes also presented a poor outcome in terms of survival status compared with patients without variations. Conclusions We propose a new molecular-based noninvasive detection method. According to the expression of the stage-specific gene set in the peripheral blood of patients with lung cancer, the difference in the functional level is quantified to realize the early diagnosis and prediction of lung cancer.
Collapse
Affiliation(s)
- Zhirui Fan
- Cancer Center, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, 450052, Henan, China
| | - Wenhua Xue
- Department of Pharmacy, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, 450052, Henan, China
| | - Lifeng Li
- Department of Pharmacy, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, 450052, Henan, China.,Cancer Center, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, 450052, Henan, China
| | - Chaoqi Zhang
- Cancer Center, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, 450052, Henan, China
| | - Jingli Lu
- Department of Pharmacy, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, 450052, Henan, China
| | - Yunkai Zhai
- Center of Telemedicine, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, 450052, Henan, China.,Engineering Laboratory for Digital Telemedicine Service, Zhengzhou, 450052, Henan, China
| | - Zhenhe Suo
- Cancer Center, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, 450052, Henan, China
| | - Jie Zhao
- Department of Pharmacy, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, 450052, Henan, China. .,Center of Telemedicine, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, 450052, Henan, China. .,Engineering Laboratory for Digital Telemedicine Service, Zhengzhou, 450052, Henan, China.
| |
Collapse
|
8
|
Cestarelli V, Fiscon G, Felici G, Bertolazzi P, Weitschek E. CAMUR: Knowledge extraction from RNA-seq cancer data through equivalent classification rules. Bioinformatics 2016; 32:697-704. [PMID: 26519501 PMCID: PMC4795614 DOI: 10.1093/bioinformatics/btv635] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2015] [Revised: 10/08/2015] [Accepted: 10/24/2015] [Indexed: 02/07/2023] Open
Abstract
MOTIVATION Nowadays, knowledge extraction methods from Next Generation Sequencing data are highly requested. In this work, we focus on RNA-seq gene expression analysis and specifically on case-control studies with rule-based supervised classification algorithms that build a model able to discriminate cases from controls. State of the art algorithms compute a single classification model that contains few features (genes). On the contrary, our goal is to elicit a higher amount of knowledge by computing many classification models, and therefore to identify most of the genes related to the predicted class. RESULTS We propose CAMUR, a new method that extracts multiple and equivalent classification models. CAMUR iteratively computes a rule-based classification model, calculates the power set of the genes present in the rules, iteratively eliminates those combinations from the data set, and performs again the classification procedure until a stopping criterion is verified. CAMUR includes an ad-hoc knowledge repository (database) and a querying tool.We analyze three different types of RNA-seq data sets (Breast, Head and Neck, and Stomach Cancer) from The Cancer Genome Atlas (TCGA) and we validate CAMUR and its models also on non-TCGA data. Our experimental results show the efficacy of CAMUR: we obtain several reliable equivalent classification models, from which the most frequent genes, their relationships, and the relation with a particular cancer are deduced. AVAILABILITY AND IMPLEMENTATION dmb.iasi.cnr.it/camur.php CONTACT emanuel@iasi.cnr.it SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Valerio Cestarelli
- Institute of Systems Analysis and Computer Science - National Research Council, 00185, Rome, Italy
| | - Giulia Fiscon
- Institute of Systems Analysis and Computer Science - National Research Council, 00185, Rome, Italy, Department of Computer, Control, and Management Engineering - Sapienza University, 00185, Rome, Italy and
| | - Giovanni Felici
- Institute of Systems Analysis and Computer Science - National Research Council, 00185, Rome, Italy
| | - Paola Bertolazzi
- Institute of Systems Analysis and Computer Science - National Research Council, 00185, Rome, Italy
| | - Emanuel Weitschek
- Institute of Systems Analysis and Computer Science - National Research Council, 00185, Rome, Italy, Department of Engineering - Uninettuno International University, Corso Vittorio Emanuele II, 39 - 00186 Rome, Italy
| |
Collapse
|