1
|
Vivek S, Faul J, Thyagarajan B, Guan W. Explainable variational autoencoder (E-VAE) model using genome-wide SNPs to predict dementia. J Biomed Inform 2023; 148:104536. [PMID: 37926392 PMCID: PMC11106718 DOI: 10.1016/j.jbi.2023.104536] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2023] [Revised: 10/30/2023] [Accepted: 11/02/2023] [Indexed: 11/07/2023]
Abstract
OBJECTIVE Alzheimer's disease (AD) and AD related dementias (ADRD) are complex multifactorial neurodegenerative diseases. The associations between genetic variants obtained from genome wide association studies (GWAS) are the most widely available and well documented variants associated with ADRD. Application of deep learning methods to analyze large scale GWAS data may be a powerful approach to elucidate the biological mechanisms in ADRD compared to penalized regression models that may lead to over-fitting. METHODS We developed a deep learning frame work explainable variational autoencoder (E-VAE) classifier model using genotype (GWAS SNPs = 5474) data from 2714 study participants in the Health and Retirement Study (HRS) to classify ADRD. We validated the generalizability of this model among 234 participants in the Religious Orders Study and Memory and Aging Project (ROSMAP). Utilizing a linear decoder approach we have extracted the weights associated with latent features for biological interpretation. RESULTS We obtained a predictive accuracy of 0.71 (95 % CI [0.59, 0.84]) with an AUC of 0.69 in the HRS test dataset and got an accuracy of 0.62 (95 % CI [0.56, 0.68]) with an AUC of 0.63 in the ROSMAP dataset. CONCLUSION This is the first study showing the generalizability of a deep learning prediction model for dementia using genetic variants in an independent cohort. The latent features identified using E-VAE can help us understand the biology of AD/ ADRD and better characterize disease status.
Collapse
Affiliation(s)
- Sithara Vivek
- Department of Laboratory Medicine and Pathology, University of Minnesota, Minneapolis, MN, United States
| | - Jessica Faul
- Institute for Social Research, Survey Research Center, University of Michigan, Ann Arbor, MI, United States
| | - Bharat Thyagarajan
- Department of Laboratory Medicine and Pathology, University of Minnesota, Minneapolis, MN, United States.
| | - Weihua Guan
- Division of Biostatistics, School of Public Health, University of Minnesota, Minneapolis MN, United States.
| |
Collapse
|
2
|
Handa K, Sakamoto S, Kageyama M, Iijima T. Development of a 2D-QSAR Model for Tissue-to-Plasma Partition Coefficient Value with High Accuracy Using Machine Learning Method, Minimum Required Experimental Values, and Physicochemical Descriptors. Eur J Drug Metab Pharmacokinet 2023:10.1007/s13318-023-00832-w. [PMID: 37266860 DOI: 10.1007/s13318-023-00832-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/09/2023] [Indexed: 06/03/2023]
Abstract
BACKGROUND The demand for physiologically based pharmacokinetic (PBPK) model is increasing currently. New drug application (NDA) of many compounds is submitted with PBPK models for efficient drug development. Tissue-to-plasma partition coefficient (Kp) is a key parameter for the PBPK model to describe differential equations. However, it is difficult to obtain the Kp value experimentally because the measurement of drug concentration in the tissue is much harder than that in plasma. OBJECTIVE Instead of experiments, many researchers have sought in silico methods. Today, most of the models for Kp prediction are using in vitro and in vivo parameters as explanatory variables. We thought of physicochemical descriptors that could improve the predictability. Therefore, we aimed to develop the two-dimensional quantitative structure-activity relationship (2D-QSAR) model for Kp using physicochemical descriptors instead of in vivo experimental data as explanatory variables. METHODS We compared our model with the conventional models using 20-fold cross-validation according to the published method (Yun et al. J Pharmacokinet Pharmacodyn 41:1-14, 2014). We used random forest algorithm, which is known to be one of the best predictors for the 2D-QSAR model. Finally, we combined minimum in vitro experimental values and physiochemical descriptors. Thus, the prediction method for Kp value using a few in vitro parameters and physicochemical descriptors was developed; this is a multimodal model. RESULTS Its accuracy was found to be superior to that of the conventional models. Results of this research suggest that multimodality is useful for the 2D-QSAR model [RMSE and % of two-fold error: 0.66 and 42.2% (Berezohkovsky), 0.52 and 52.2% (Rodgers), 0.65 and 34.6% (Schmitt), 0.44 and 61.1% (published model), 0.41 and 62.1% (traditional model), 0.39 and 64.5% (multimodal model)]. CONCLUSION We could develop a 2D-QSAR model for Kp value with the highest accuracy using a few in vitro experimental data and physicochemical descriptors.
Collapse
Affiliation(s)
- Koichi Handa
- Toxicology & DMPK Research Department, Teijin Institute for Bio-Medical Research, Teijin Pharma Limited, 4-3-2 Asahigaoka, Hino-shi, Tokyo, 191-8512, Japan.
| | - Seishiro Sakamoto
- Pharmaceutical Development Coordination Department, Teijin Pharma Limited, 3-2-1, Kasumigaseki Common Gate West Tower, Kasumigaseki Chiyoda-ku, Tokyo, 100-8585, Japan
| | - Michiharu Kageyama
- Toxicology & DMPK Research Department, Teijin Institute for Bio-Medical Research, Teijin Pharma Limited, 4-3-2 Asahigaoka, Hino-shi, Tokyo, 191-8512, Japan
| | - Takeshi Iijima
- Toxicology & DMPK Research Department, Teijin Institute for Bio-Medical Research, Teijin Pharma Limited, 4-3-2 Asahigaoka, Hino-shi, Tokyo, 191-8512, Japan
| |
Collapse
|
3
|
Deep neural network prediction of genome-wide transcriptome signatures - beyond the Black-box. NPJ Syst Biol Appl 2022; 8:9. [PMID: 35197482 PMCID: PMC8866467 DOI: 10.1038/s41540-022-00218-9] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2021] [Accepted: 01/24/2022] [Indexed: 11/28/2022] Open
Abstract
Prediction algorithms for protein or gene structures, including transcription factor binding from sequence information, have been transformative in understanding gene regulation. Here we ask whether human transcriptomic profiles can be predicted solely from the expression of transcription factors (TFs). We find that the expression of 1600 TFs can explain >95% of the variance in 25,000 genes. Using the light-up technique to inspect the trained NN, we find an over-representation of known TF-gene regulations. Furthermore, the learned prediction network has a hierarchical organization. A smaller set of around 125 core TFs could explain close to 80% of the variance. Interestingly, reducing the number of TFs below 500 induces a rapid decline in prediction performance. Next, we evaluated the prediction model using transcriptional data from 22 human diseases. The TFs were sufficient to predict the dysregulation of the target genes (rho = 0.61, P < 10−216). By inspecting the model, key causative TFs could be extracted for subsequent validation using disease-associated genetic variants. We demonstrate a methodology for constructing an interpretable neural network predictor, where analyses of the predictors identified key TFs that were inducing transcriptional changes during disease.
Collapse
|
4
|
Wang L, Xie H, Han W, Yang X, Shi L, Dong J, Jiang K, Wu H. Construction of a knowledge graph for diabetes complications from expert-reviewed clinical evidences. Comput Assist Surg (Abingdon) 2021; 25:29-35. [PMID: 33275462 DOI: 10.1080/24699322.2020.1850866] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
Abstract
A knowledge graph is a structured representation of data that can express entity and relational knowledge. More attention has been paid to the study of a clinical knowledge graph, especially in the field of chronic diseases. However, knowledge graph construction is based mainly on electronic medical records and other data sources, and the authority of the constructed knowledge graph presents some problems. Therefore, regarding the quality of evidence, this study, in combination with experimental research on system evaluation and meta-analysis presents some new information, On the basis of evidence-based medicine (EBM), the secondary results of systematic evaluation and meta-analyses of social, psychological, and behavioral aspects were extracted as data for the core nodes and edges of a knowledge graph to construct a graph of type 2 diabetes (T2D) and its complications. In this study, relevant life-style evidence that are factors for the risk of diabetic retinopathy (DR), diabetic nephropathy (DN), diabetic foot (DF), and diabetic depression (DD), and the results of several of the relevant clinical test, including bariatric surgery, myopia, lipid-lowering drugs, lipid-lowering drug duration, blood glucose control, disease course, glycosylated hemoglobin, fasting blood glucose, hypertension, sex, smoking and other common lifestyle characteristics were finally extracted. The evidence-based knowledge graph of the DM complications was constructed by extracting relevant disease, risk factors, risk outcomes, and other diabetes entities and the strength of the data for the odds ratio (OR) or relative risk (RR) correlations from clinical evidence. Moreover, the risk prediction models constructed using a logistic model were incorporated into the knowledge graph to visualize the risk score of DM complications for each user. In short, the EBM-powered construction of the knowledge graph could provide high-quality information to support decisions for the prevention and control of diabetes and its complications.
Collapse
Affiliation(s)
- Lei Wang
- Department of Medical Informatics, Medical School of Nantong University, Nantong, China
| | - Huimin Xie
- Department of Medical Informatics, Medical School of Nantong University, Nantong, China
| | - Wentao Han
- Department of Medical Informatics, Medical School of Nantong University, Nantong, China
| | - Xiao Yang
- Department of Medical Informatics, Medical School of Nantong University, Nantong, China
| | - Lili Shi
- Department of Medical Informatics, Medical School of Nantong University, Nantong, China
| | - Jiancheng Dong
- Department of Medical Informatics, Medical School of Nantong University, Nantong, China
| | - Kui Jiang
- Department of Medical Informatics, Medical School of Nantong University, Nantong, China
| | - Huiqun Wu
- Department of Medical Informatics, Medical School of Nantong University, Nantong, China
| |
Collapse
|
5
|
Planell N, Lagani V, Sebastian-Leon P, van der Kloet F, Ewing E, Karathanasis N, Urdangarin A, Arozarena I, Jagodic M, Tsamardinos I, Tarazona S, Conesa A, Tegner J, Gomez-Cabrero D. STATegra: Multi-Omics Data Integration - A Conceptual Scheme With a Bioinformatics Pipeline. Front Genet 2021; 12:620453. [PMID: 33747045 PMCID: PMC7970106 DOI: 10.3389/fgene.2021.620453] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2020] [Accepted: 01/20/2021] [Indexed: 12/13/2022] Open
Abstract
Technologies for profiling samples using different omics platforms have been at the forefront since the human genome project. Large-scale multi-omics data hold the promise of deciphering different regulatory layers. Yet, while there is a myriad of bioinformatics tools, each multi-omics analysis appears to start from scratch with an arbitrary decision over which tools to use and how to combine them. Therefore, it is an unmet need to conceptualize how to integrate such data and implement and validate pipelines in different cases. We have designed a conceptual framework (STATegra), aiming it to be as generic as possible for multi-omics analysis, combining available multi-omic anlaysis tools (machine learning component analysis, non-parametric data combination, and a multi-omics exploratory analysis) in a step-wise manner. While in several studies, we have previously combined those integrative tools, here, we provide a systematic description of the STATegra framework and its validation using two The Cancer Genome Atlas (TCGA) case studies. For both, the Glioblastoma and the Skin Cutaneous Melanoma (SKCM) cases, we demonstrate an enhanced capacity of the framework (and beyond the individual tools) to identify features and pathways compared to single-omics analysis. Such an integrative multi-omics analysis framework for identifying features and components facilitates the discovery of new biology. Finally, we provide several options for applying the STATegra framework when parametric assumptions are fulfilled and for the case when not all the samples are profiled for all omics. The STATegra framework is built using several tools, which are being integrated step-by-step as OpenSource in the STATegRa Bioconductor package.
Collapse
Affiliation(s)
- Nuria Planell
- Translational Bioinformatics Unit, Navarrabiomed, Complejo Hospitalario de Navarra (CHN), Universidad Pública de Navarra (UPNA), IdiSNA, Pamplona, Spain
| | - Vincenzo Lagani
- Institute of Chemical Biology, Ilia State University, Tbilisi, Georgia
- Gnosis Data Analysis P.C., Heraklion, Greece
| | - Patricia Sebastian-Leon
- Department of Genomic and Systems Reproductive Medicine, IVI-RMA (Instituto Valenciano de Infertilidad – Reproductive Medicine Associates) IVI Foundation, Valencia, Spain
| | - Frans van der Kloet
- Swammerdam Institute for Life Sciences, University of Amsterdam, Amsterdam, Netherlands
| | - Ewoud Ewing
- Department of Clinical Neuroscience, Karolinska Institutet, Center for Molecular Medicine, Karolinska University Hospital, Stockholm, Sweden
| | - Nestoras Karathanasis
- Institute of Computer Science, Foundation for Research and Technology-Hellas, Heraklion, Greece
- Computational Medicine Center, Thomas Jefferson University, Philadelphia, PA, United States
| | - Arantxa Urdangarin
- Translational Bioinformatics Unit, Navarrabiomed, Complejo Hospitalario de Navarra (CHN), Universidad Pública de Navarra (UPNA), IdiSNA, Pamplona, Spain
| | - Imanol Arozarena
- Cancer Signalling Unit, Navarrabiomed, Complejo Hospitalario de Navarra (CHN), Universidad Pública de Navarra (UPNA), Health Research Institute of Navarre (IdiSNA), Pamplona, Spain
| | - Maja Jagodic
- Department of Clinical Neuroscience, Karolinska Institutet, Center for Molecular Medicine, Karolinska University Hospital, Stockholm, Sweden
| | - Ioannis Tsamardinos
- Gnosis Data Analysis P.C., Heraklion, Greece
- Computer Science Department, University of Crete, Heraklion, Greece
| | - Sonia Tarazona
- Department of Applied Statistics, Operations Research and Quality, Universitat Politècnica de València, València, Spain
| | - Ana Conesa
- Microbiology and Cell Science, Institute for Food and Agricultural Sciences, University of Florida, Gainesville, FL, United States
- Genetics Institute, University of Florida, Gainesville, FL, United States
| | - Jesper Tegner
- Biological and Environmental Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, Saudi Arabia
- Unit of Computational Medicine, Department of Medicine, Center for Molecular Medicine, Karolinska Institutet, Karolinska University Hospital, Stockholm, Sweden
- Science for Life Laboratory, Solna, Sweden
| | - David Gomez-Cabrero
- Translational Bioinformatics Unit, Navarrabiomed, Complejo Hospitalario de Navarra (CHN), Universidad Pública de Navarra (UPNA), IdiSNA, Pamplona, Spain
- Biological and Environmental Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, Saudi Arabia
- Unit of Computational Medicine, Department of Medicine, Center for Molecular Medicine, Karolinska Institutet, Karolinska University Hospital, Stockholm, Sweden
- Mucosal & Salivary Biology DivisionKing’s College London Dental Institute, London, United Kingdom
| |
Collapse
|
6
|
Gu W, Hasan S, Rocca-Serra P, Satagopam VP. Road to effective data curation for translational research. Drug Discov Today 2021; 26:626-630. [PMID: 33338655 DOI: 10.1016/j.drudis.2020.12.007] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2020] [Revised: 07/02/2020] [Accepted: 12/09/2020] [Indexed: 10/22/2022]
Abstract
Translational research today is data-intensive and requires multi-stakeholder collaborations to generate and pool data together for integrated analysis. This leads to the challenge of harmonization of data from different sources with different formats and standards, which is often overlooked during project planning and thus becomes a bottleneck of the research progress. We report on our experience and lessons learnt about data curation for translational research garnered over the course of the European Translational Research Infrastructure & Knowledge management Services (eTRIKS) program (https://www.etriks.org), a unique, 5-year, cross-organizational, cross-cultural collaboration project funded by the Innovative Medicines Initiative of the EU. Here, we discuss the obstacles and suggest what steps are needed for effective data curation in translational research, especially for projects involving multiple organizations from academia and industry.
Collapse
Affiliation(s)
- Wei Gu
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg; ELIXIR Luxembourg, Esch-sur-Alzette, Luxembourg
| | | | | | - Venkata P Satagopam
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg; ELIXIR Luxembourg, Esch-sur-Alzette, Luxembourg.
| |
Collapse
|
7
|
MINERVA, A Platform for the Exploration of Disease Maps. SYSTEMS MEDICINE 2021. [DOI: 10.1016/b978-0-12-801238-3.11685-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
|
8
|
Cao X, Lee K, Huang Q. Bayesian variable selection in logistic regression with application to whole-brain functional connectivity analysis for Parkinson's disease. Stat Methods Med Res 2020; 30:826-842. [PMID: 33308007 DOI: 10.1177/0962280220978990] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Parkinson's disease is a progressive, chronic, and neurodegenerative disorder that is primarily diagnosed by clinical examinations and magnetic resonance imaging (MRI). In this paper, we propose a Bayesian model to predict Parkinson's disease employing a functional MRI (fMRI) based radiomics approach. We consider a spike and slab prior for variable selection in high-dimensional logistic regression models, and present an approximate Gibbs sampler by replacing a logistic distribution with a t-distribution. Under mild conditions, we establish model selection consistency of the induced posterior and illustrate the performance of the proposed method outperforms existing state-of-the-art methods through simulation studies. In fMRI analysis, 6216 whole-brain functional connectivity features are extracted for 50 healthy controls along with 70 Parkinson's disease patients. We apply our method to the resulting dataset and further show its benefits with a higher average prediction accuracy of 0.83 compared to other contenders based on 10 random splits. The model fitting procedure also reveals the most discriminative brain regions for Parkinson's disease. These findings demonstrate that the proposed Bayesian variable selection method has the potential to support radiological diagnosis for patients with Parkinson's disease.
Collapse
Affiliation(s)
- Xuan Cao
- Division of Statistics and Data Science, Department of Mathematical Sciences, University of Cincinnati
| | - Kyoungjae Lee
- Department of Statistics, Inha University, Incheon, Korea
| | - Qingling Huang
- Department of Radiology, Affiliated Brain Hospital of Nanjing Medical University, Nanjing, China
| |
Collapse
|
9
|
Johann A, Ehlert U. The study protocol: Neuroendocrinology and (epi-) genetics of female reproductive transition phase mood disorder - an observational, longitudinal study from pregnancy to postpartum. BMC Pregnancy Childbirth 2020; 20:609. [PMID: 33036563 PMCID: PMC7545379 DOI: 10.1186/s12884-020-03280-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2020] [Accepted: 09/23/2020] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Postpartum depression is considered to be one of the most common health threats during pregnancy and postpartum, affecting not only the woman herself but also the offspring and the whole family system. Evidence for a conclusive etiopathological model with distinct risk and resilience factors is still broadly lacking. Therefore, the aim of the present study is to investigate numerous health-related markers to obtain greater insight into which biopsychosocial profiles render women more vulnerable to PPD or facilitate a healthy transition from pregnancy to postpartum. METHODS The observational, longitudinal study aims to include a total of 288 physically healthy women, aged 20-45 years. A multitude of relevant parameters, of an (epi-) genetic, endocrinological, physiological and psychological nature, will be assessed over a period of 5 months, following the participants from the 3rd trimester until three months postpartum. DISCUSSION The ultimate goal of the present study is to ameliorate mental health care during pregnancy and postpartum, by gaining a better understanding of the underlying biopsychosocial mechanisms that women undergo during the transition from pregnancy to postpartum.
Collapse
Affiliation(s)
- Alexandra Johann
- Clinical Psychology and Psychotherapy, University of Zurich, Binzmühlestrasse 14, 8050, Zurich, Switzerland.,Swiss National Science Foundation (SNSF), Bern, Switzerland
| | - Ulrike Ehlert
- Clinical Psychology and Psychotherapy, University of Zurich, Binzmühlestrasse 14, 8050, Zurich, Switzerland.
| |
Collapse
|
10
|
Cao X, Wang X, Xue C, Zhang S, Huang Q, Liu W. A Radiomics Approach to Predicting Parkinson's Disease by Incorporating Whole-Brain Functional Activity and Gray Matter Structure. Front Neurosci 2020; 14:751. [PMID: 32760248 PMCID: PMC7373781 DOI: 10.3389/fnins.2020.00751] [Citation(s) in RCA: 32] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2020] [Accepted: 06/25/2020] [Indexed: 02/06/2023] Open
Abstract
Parkinson's disease (PD) is a progressive, chronic, and neurodegenerative disorder that is primarily diagnosed by clinical examinations and magnetic resonance imaging (MRI). In this study, we proposed a machine learning based radiomics method to predict PD. Fifty healthy controls (HC) along with 70 PD patients underwent resting-state magnetic resonance imaging (rs-fMRI). For all subjects, we extracted five types of 6664 features, including mean amplitude of low-frequency fluctuation (mALFF), mean regional homogeneity (mReHo), resting-state functional connectivity (RSFC), voxel-mirrored homotopic connectivity (VMHC) and gray matter (GM) volume. After conducting dimension reduction utilizing Least absolute shrinkage and selection operator (LASSO), fifty-three radiomic features including 46 RSFCs, 1 mALFF, 3 mReHos, 1 VMHC, 2 GM volumes and 1 clinical factor were retained. The selected features also indicated the most discriminative regions for PD. We further conducted model fitting procedure for classifying subjects in the training set employing random forest and support volume machine (SVM) to evaluate the performance of the two methods. After cross-validation, both methods achieved 100% accuracy and area under curve (AUC) for distinguishing between PD and HC in the training set. In the testing set, SVM performed better than random forest with the accuracy, true positive rate (TPR) and AUC being 85%, 1 and 0.97, respectively. These findings demonstrate the radiomics technique has the potential to support radiological diagnosis and to achieve high classification accuracy for clinical diagnostic systems for patients with PD.
Collapse
Affiliation(s)
- Xuan Cao
- Division of Statistics and Data Science, Department of Mathematical Sciences, University of Cincinnati, Cincinnati, OH, United States
| | - Xiao Wang
- Department of Radiology, Affiliated Brain Hospital, Nanjing Medical University, Nanjing, China
| | - Chen Xue
- Department of Radiology, Affiliated Brain Hospital, Nanjing Medical University, Nanjing, China
| | - Shaojun Zhang
- Department of Statistics, University of Florida, Gainesville, FL, United States
| | - Qingling Huang
- Department of Radiology, Affiliated Brain Hospital, Nanjing Medical University, Nanjing, China
| | - Weiguo Liu
- Department of Neurology, Affiliated Brain Hospital, Nanjing Medical University, Nanjing, China
| |
Collapse
|
11
|
Romero-Rosales BL, Tamez-Pena JG, Nicolini H, Moreno-Treviño MG, Trevino V. Improving predictive models for Alzheimer's disease using GWAS data by incorporating misclassified samples modeling. PLoS One 2020; 15:e0232103. [PMID: 32324812 PMCID: PMC7179850 DOI: 10.1371/journal.pone.0232103] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2019] [Accepted: 04/07/2020] [Indexed: 01/14/2023] Open
Abstract
Late-onset Alzheimer’s Disease (LOAD) is the most common form of dementia in the elderly. Genome-wide association studies (GWAS) for LOAD have open new avenues to identify genetic causes and to provide diagnostic tools for early detection. Although several predictive models have been proposed using the few detected GWAS markers, there is still a need for improvement and identification of potential markers. Commonly, polygenic risk scores are being used for prediction. Nevertheless, other methods to generate predictive models have been suggested. In this research, we compared three machine learning methods that have been proved to construct powerful predictive models (genetic algorithms, LASSO, and step-wise) and propose the inclusion of markers from misclassified samples to improve overall prediction accuracy. Our results show that the addition of markers from an initial model plus the markers of the model fitted to misclassified samples improves the area under the receiving operative curve by around 5%, reaching ~0.84, which is highly competitive using only genetic information. The computational strategy used here can help to devise better methods to improve classification models for AD. Our results could have a positive impact on the early diagnosis of Alzheimer’s disease.
Collapse
Affiliation(s)
| | - Jose-Gerardo Tamez-Pena
- Tecnologico de Monterrey, Escuela de Medicina y Ciencias de la Salud, Monterrey, Nuevo Leon, Mexico
| | - Humberto Nicolini
- Genomics of Psychiatric and Neurodegenerative Diseases Laboratory, National Institute of Genomic Medicine (INMEGEN), Mexico City, Mexico
| | | | - Victor Trevino
- Tecnologico de Monterrey, Escuela de Medicina y Ciencias de la Salud, Monterrey, Nuevo Leon, Mexico
- * E-mail:
| |
Collapse
|
12
|
New Frontiers in Parkinson's Disease: From Genetics to the Clinic. J Neurosci 2019; 38:9375-9382. [PMID: 30381429 DOI: 10.1523/jneurosci.1666-18.2018] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2018] [Revised: 09/15/2018] [Accepted: 09/18/2018] [Indexed: 12/30/2022] Open
Abstract
The greatest unmet therapeutic need in Parkinson's disease (PD) is a treatment that slows the relentless progression of the symptoms and the neurodegenerative process. This review highlights the utility of genetics to understand the pathogenic mechanisms and develop novel therapeutic approaches for PD. The focus is on strategies provided by genetic studies: notably via the reduction and clearance of α-synuclein, inhibition of LRRK2 kinase activity, and modulation of glucocerebrosidase-related substrates. In addition, the critical role of precompetitive public-private partnerships in supporting trial design optimization, overall drug development, and regulatory approvals is illustrated. With these great advances, the promise of developing transformative therapies that halt or slow disease progression is a tangible goal.
Collapse
|
13
|
Non-parametric combination analysis of multiple data types enables detection of novel regulatory mechanisms in T cells of multiple sclerosis patients. Sci Rep 2019; 9:11996. [PMID: 31427643 PMCID: PMC6700160 DOI: 10.1038/s41598-019-48493-7] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2019] [Accepted: 08/06/2019] [Indexed: 12/13/2022] Open
Abstract
Multiple Sclerosis (MS) is an autoimmune disease of the central nervous system with prominent neurodegenerative components. The triggering and progression of MS is associated with transcriptional and epigenetic alterations in several tissues, including peripheral blood. The combined influence of transcriptional and epigenetic changes associated with MS has not been assessed in the same individuals. Here we generated paired transcriptomic (RNA-seq) and DNA methylation (Illumina 450 K array) profiles of CD4+ and CD8+ T cells (CD4, CD8), using clinically accessible blood from healthy donors and MS patients in the initial relapsing-remitting and subsequent secondary-progressive stage. By integrating the output of a differential expression test with a permutation-based non-parametric combination methodology, we identified 149 differentially expressed (DE) genes in both CD4 and CD8 cells collected from MS patients. Moreover, by leveraging the methylation-dependent regulation of gene expression, we identified the gene SH3YL1, which displayed significant correlated expression and methylation changes in MS patients. Importantly, silencing of SH3YL1 in primary human CD4 cells demonstrated its influence on T cell activation. Collectively, our strategy based on paired sampling of several cell-types provides a novel approach to increase sensitivity for identifying shared mechanisms altered in CD4 and CD8 cells of relevance in MS in small sized clinical materials.
Collapse
|
14
|
Gaillard-Bigot F, Zendjidjian XY, Kheloufi F, Casse-Perrot C, Guilhaumou R, Micallef J, Fakra E, Azorin JM, Blin O. Quantitative System Pharmacology (QSP): An Integrative Framework for paradigm change in the treatment of the first-episode schizophrenia. Encephale 2019; 44:S34-S38. [PMID: 30935485 DOI: 10.1016/s0013-7006(19)30077-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
Despite the lack of progress in the curative treatment of mental illness, especially schizophrenia, the accumulation of neuroscience data over the past decade suggests the re-conceptualization of schizophrenia. With the advent of new biomarkers and cognitive tools, new neuroscience technologies such as functional dynamic connectivity and the identification of subtle clinical features; it is now possible to detect early stages at risk or prodromes of a first psychotic episode. Current concepts reconceptualizes schizophrenia as a neurodevelopmental disorder at early onset, with polygenic risk and only symptomatic treatment for positive symptoms at this time. The use of such technologies in the future suggests new diagnostic and therapeutic options. Next steps include new pharmacological perspectives and potential contributions of new technologies such as quantitative system pharmacology brain computational modeling approach.
Collapse
Affiliation(s)
- F Gaillard-Bigot
- Service de pharmacologie clinique et pharmacovigilance, CIC CPCET, assistance publique des hôpitaux de Marseille, Institut de neurosciences des systèmes, Inserm UMR 1106, université d'Aix-Marseille, France
| | - X-Y Zendjidjian
- Pôle psychiatrie centre, hôpital de la Conception, assistance publique des hôpitaux de Marseille, Marseille, France
| | - F Kheloufi
- Service de pharmacologie clinique et pharmacovigilance, CIC CPCET, assistance publique des hôpitaux de Marseille, Institut de neurosciences des systèmes, Inserm UMR 1106, université d'Aix-Marseille, France
| | - C Casse-Perrot
- Service de pharmacologie clinique et pharmacovigilance, CIC CPCET, assistance publique des hôpitaux de Marseille, Institut de neurosciences des systèmes, Inserm UMR 1106, université d'Aix-Marseille, France
| | - R Guilhaumou
- Service de pharmacologie clinique et pharmacovigilance, CIC CPCET, assistance publique des hôpitaux de Marseille, Institut de neurosciences des systèmes, Inserm UMR 1106, université d'Aix-Marseille, France
| | - J Micallef
- Service de pharmacologie clinique et pharmacovigilance, CIC CPCET, assistance publique des hôpitaux de Marseille, Institut de neurosciences des systèmes, Inserm UMR 1106, université d'Aix-Marseille, France
| | - E Fakra
- Department of Psychiatry, University Hospital of Saint-Etienne, Saint-Etienne, France, Inserm U1059, University of Lyon, Saint-Etienne F-42023, France
| | - J-M Azorin
- Department of Psychiatry, Sainte Marguerite University Hospital, Marseille, France
| | - O Blin
- Service de pharmacologie clinique et pharmacovigilance, CIC CPCET, assistance publique des hôpitaux de Marseille, Institut de neurosciences des systèmes, Inserm UMR 1106, université d'Aix-Marseille, France.
| |
Collapse
|
15
|
Saqi M, Lysenko A, Guo YK, Tsunoda T, Auffray C. Navigating the disease landscape: knowledge representations for contextualizing molecular signatures. Brief Bioinform 2019; 20:609-623. [PMID: 29684165 PMCID: PMC6556902 DOI: 10.1093/bib/bby025] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2017] [Revised: 02/05/2018] [Indexed: 12/14/2022] Open
Abstract
Large amounts of data emerging from experiments in molecular medicine are leading to the identification of molecular signatures associated with disease subtypes. The contextualization of these patterns is important for obtaining mechanistic insight into the aberrant processes associated with a disease, and this typically involves the integration of multiple heterogeneous types of data. In this review, we discuss knowledge representations that can be useful to explore the biological context of molecular signatures, in particular three main approaches, namely, pathway mapping approaches, molecular network centric approaches and approaches that represent biological statements as knowledge graphs. We discuss the utility of each of these paradigms, illustrate how they can be leveraged with selected practical examples and identify ongoing challenges for this field of research.
Collapse
Affiliation(s)
- Mansoor Saqi
- Mansoor Saqi Data Science Institute, Imperial College London, UK
| | - Artem Lysenko
- Artem Lysenko Laboratory for Medical Science Mathematics, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
| | - Yi-Ke Guo
- Yi-Ke Guo Data Science Institute, Imperial College London, UK
| | - Tatsuhiko Tsunoda
- Tatsuhiko Tsunoda Laboratory for Medical Science Mathematics, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan CREST, JST, Tokyo, Japan Department of Medical Science Mathematics, Medical Research Institute, Tokyo Medical and Dental University, Tokyo, Japan
| | - Charles Auffray
- Charles Auffray European Institute for Systems Biology and Medicine, Lyon, France
| |
Collapse
|
16
|
Hoyt CT, Domingo-Fernández D, Aldisi R, Xu L, Kolpeja K, Spalek S, Wollert E, Bachman J, Gyori BM, Greene P, Hofmann-Apitius M. Re-curation and rational enrichment of knowledge graphs in Biological Expression Language. Database (Oxford) 2019; 2019:baz068. [PMID: 31225582 PMCID: PMC6587072 DOI: 10.1093/database/baz068] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2019] [Revised: 04/03/2019] [Accepted: 04/29/2019] [Indexed: 12/23/2022]
Abstract
The rapid accumulation of new biomedical literature not only causes curated knowledge graphs (KGs) to become outdated and incomplete, but also makes manual curation an impractical and unsustainable solution. Automated or semi-automated workflows are necessary to assist in prioritizing and curating the literature to update and enrich KGs. We have developed two workflows: one for re-curating a given KG to assure its syntactic and semantic quality and another for rationally enriching it by manually revising automatically extracted relations for nodes with low information density. We applied these workflows to the KGs encoded in Biological Expression Language from the NeuroMMSig database using content that was pre-extracted from MEDLINE abstracts and PubMed Central full-text articles using text mining output integrated by INDRA. We have made this workflow freely available at https://github.com/bel-enrichment/bel-enrichment.
Collapse
Affiliation(s)
- Charles Tapley Hoyt
- Department of Bioinformatics, Fraunhofer Institute for Algorithms and Scientific Computing (SCAI), Sankt Augustin, Germany
- Bonn-Aachen International Center for Information Technology, Rheinische Friedrich-Wilhelms-Universität Bonn, Bonn, Germany
| | - Daniel Domingo-Fernández
- Department of Bioinformatics, Fraunhofer Institute for Algorithms and Scientific Computing (SCAI), Sankt Augustin, Germany
- Bonn-Aachen International Center for Information Technology, Rheinische Friedrich-Wilhelms-Universität Bonn, Bonn, Germany
| | - Rana Aldisi
- Department of Bioinformatics, Fraunhofer Institute for Algorithms and Scientific Computing (SCAI), Sankt Augustin, Germany
- Bonn-Aachen International Center for Information Technology, Rheinische Friedrich-Wilhelms-Universität Bonn, Bonn, Germany
| | - Lingling Xu
- Department of Bioinformatics, Fraunhofer Institute for Algorithms and Scientific Computing (SCAI), Sankt Augustin, Germany
- Bonn-Aachen International Center for Information Technology, Rheinische Friedrich-Wilhelms-Universität Bonn, Bonn, Germany
| | - Kristian Kolpeja
- Department of Bioinformatics, Fraunhofer Institute for Algorithms and Scientific Computing (SCAI), Sankt Augustin, Germany
| | - Sandra Spalek
- Department of Bioinformatics, Fraunhofer Institute for Algorithms and Scientific Computing (SCAI), Sankt Augustin, Germany
| | - Esther Wollert
- Department of Bioinformatics, Fraunhofer Institute for Algorithms and Scientific Computing (SCAI), Sankt Augustin, Germany
| | - John Bachman
- Laboratory of Systems Pharmacology, Harvard Medical School, 200 Longwood Ave, Boston, MA, USA
| | - Benjamin M Gyori
- Laboratory of Systems Pharmacology, Harvard Medical School, 200 Longwood Ave, Boston, MA, USA
| | - Patrick Greene
- Laboratory of Systems Pharmacology, Harvard Medical School, 200 Longwood Ave, Boston, MA, USA
| | - Martin Hofmann-Apitius
- Department of Bioinformatics, Fraunhofer Institute for Algorithms and Scientific Computing (SCAI), Sankt Augustin, Germany
- Bonn-Aachen International Center for Information Technology, Rheinische Friedrich-Wilhelms-Universität Bonn, Bonn, Germany
| |
Collapse
|
17
|
Schork NJ, Raghavachari N. Report: NIA workshop on translating genetic variants associated with longevity into drug targets. GeroScience 2018; 40:523-538. [PMID: 30374935 PMCID: PMC6294726 DOI: 10.1007/s11357-018-0046-7] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2018] [Accepted: 10/03/2018] [Indexed: 12/16/2022] Open
Abstract
To date, candidate gene and genome-wide association studies (GWAS) have led to the discovery of longevity-associated variants (LAVs) in genes such as FOXO3A and APOE. Unfortunately, translating variants into drug targets is challenging for any trait, and longevity is no exception. Interdisciplinary and integrative multi-omics approaches are needed to understand how LAVs affect longevity-related phenotypes at the molecular physiologic level in order to leverage their discovery to identify new drug targets. The NIA convened a workshop in August 2017 on emerging and novel in silico (i.e., bioinformatics and computational) approaches to the translation of LAVs into drug targets. The goal of the workshop was to identify ways of enabling, enhancing, and facilitating interactions among researchers from different disciplines whose research considers either the identification of LAVs or the mechanistic or causal pathway(s) and protective factors they influence for discovering drug targets. Discussions among the workshop participants resulted in the identification of critical needs for enabling the translation of LAVs into drug targets in several areas. These included (1) the initiation and better use of cohorts with multi-omics profiling on the participants; (2) the generation of longitudinal information on multiple individuals; (3) the collection of data from non-human species (both long and short-lived) for comparative biology studies; (4) the refinement of computational tools for integrative analyses; (5) the development of novel computational and statistical inference techniques for assessing the potential of a drug target; (6) the identification of available drugs that could modulate a target in a way that could potentially provide protection against age-related diseases and/or enhance longevity; and (7) the development or enhancement of databases and repositories of relevant information, such as the Longevity Genomics website ( https://www.longevitygenomics.org ), to enhance and help motivate future interdisciplinary studies. Integrative approaches that examine the influence of LAVs on molecular physiologic phenotypes that might be amenable to pharmacological modulation are necessary for translating LAVs into drugs to enhance health and life span.
Collapse
Affiliation(s)
- Nicholas J. Schork
- Department of Quantitative Medicine, The Translational Genomics Research Institute, Phoenix, AZ USA
| | | | | |
Collapse
|
18
|
Menke A. Precision pharmacotherapy: psychiatry's future direction in preventing, diagnosing, and treating mental disorders. Pharmgenomics Pers Med 2018; 11:211-222. [PMID: 30510440 PMCID: PMC6250105 DOI: 10.2147/pgpm.s146110] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
Mental disorders account for around one-third of disability worldwide and cause enormous personal and societal burden. Current pharmacotherapies and nonpharmacotherapies do help many patients, but there are still high rates of partial or no response, delayed effect, and unfavorable adverse effects. The current diagnostic taxonomy of mental disorders by the Diagnostic and Statistical Manual of Mental Disorders and the International Classification of Diseases relies on presenting signs and symptoms, but does not reflect evidence from neurobiological and behavioral systems. However, in the last decades, the understanding of biological mechanisms underlying mental disorders has grown and can be used for the development of precision medicine, that is, to deliver a patient-tailored individual treatment. Precision medicine may incorporate genetic variants contributing to the mental disorder and the response to pharmacotherapies, but also consider gene ¥ environment interactions, blood-based markers, neuropsychological tests, data from electronic health records, early life adversity, stressful life events, and very proximal factors such as lifestyle, nutrition, and sport. Methods such as artificial intelligence and the underlying machine learning and deep learning approaches provide the framework to stratify patients, initiate specific tailored treatments and thus increase response rates, reduce adverse effects and medical errors. In conclusion, precision medicine uses measurable health parameters to identify individuals at risk of a mental disorder, to improve the diagnostic process and to deliver a patient-tailored treatment.
Collapse
Affiliation(s)
- Andreas Menke
- Department of Psychiatry, Psychosomatics and Psychotherapy, University Hospital of Wuerzburg, Wuerzburg 97080, Germany,
- Comprehensive Heart Failure Center, University Hospital of Wuerzburg, Wuerzburg 97080, Germany,
- Interdisciplinary Center for Clinical Research, University of Wuerzburg, Wuerzburg 97080, Germany,
| |
Collapse
|
19
|
Vlietstra WJ, Vos R, Sijbers AM, van Mulligen EM, Kors JA. Using predicate and provenance information from a knowledge graph for drug efficacy screening. J Biomed Semantics 2018; 9:23. [PMID: 30189889 PMCID: PMC6127943 DOI: 10.1186/s13326-018-0189-6] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2018] [Accepted: 08/01/2018] [Indexed: 12/11/2022] Open
Abstract
Background Biomedical knowledge graphs have become important tools to computationally analyse the comprehensive body of biomedical knowledge. They represent knowledge as subject-predicate-object triples, in which the predicate indicates the relationship between subject and object. A triple can also contain provenance information, which consists of references to the sources of the triple (e.g. scientific publications or database entries). Knowledge graphs have been used to classify drug-disease pairs for drug efficacy screening, but existing computational methods have often ignored predicate and provenance information. Using this information, we aimed to develop a supervised machine learning classifier and determine the added value of predicate and provenance information for drug efficacy screening. To ensure the biological plausibility of our method we performed our research on the protein level, where drugs are represented by their drug target proteins, and diseases by their disease proteins. Results Using random forests with repeated 10-fold cross-validation, our method achieved an area under the ROC curve (AUC) of 78.1% and 74.3% for two reference sets. We benchmarked against a state-of-the-art knowledge-graph technique that does not use predicate and provenance information, obtaining AUCs of 65.6% and 64.6%, respectively. Classifiers that only used predicate information performed superior to classifiers that only used provenance information, but using both performed best. Conclusion We conclude that both predicate and provenance information provide added value for drug efficacy screening. Electronic supplementary material The online version of this article (10.1186/s13326-018-0189-6) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Wytze J Vlietstra
- Department of Medical Informatics, Erasmus University Medical Centre, Rotterdam, 3015, GE, the Netherlands.
| | - Rein Vos
- Department of Medical Informatics, Erasmus University Medical Centre, Rotterdam, 3015, GE, the Netherlands.,Department of Methodology and Statistics, Maastricht University, Maastricht, 6200, MD, the Netherlands
| | - Anneke M Sijbers
- Centre for Molecular and Biomolecular Informatics, Radboudumc, Nijmegen, 6525, GA, the Netherlands
| | - Erik M van Mulligen
- Department of Medical Informatics, Erasmus University Medical Centre, Rotterdam, 3015, GE, the Netherlands
| | - Jan A Kors
- Department of Medical Informatics, Erasmus University Medical Centre, Rotterdam, 3015, GE, the Netherlands
| |
Collapse
|
20
|
Domingo-Fernández D, Kodamullil AT, Iyappan A, Naz M, Emon MA, Raschka T, Karki R, Springstubbe S, Ebeling C, Hofmann-Apitius M. Multimodal mechanistic signatures for neurodegenerative diseases (NeuroMMSig): a web server for mechanism enrichment. Bioinformatics 2018. [PMID: 28651363 PMCID: PMC5870765 DOI: 10.1093/bioinformatics/btx399] [Citation(s) in RCA: 32] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Motivation The concept of a 'mechanism-based taxonomy of human disease' is currently replacing the outdated paradigm of diseases classified by clinical appearance. We have tackled the paradigm of mechanism-based patient subgroup identification in the challenging area of research on neurodegenerative diseases. Results We have developed a knowledge base representing essential pathophysiology mechanisms of neurodegenerative diseases. Together with dedicated algorithms, this knowledge base forms the basis for a 'mechanism-enrichment server' that supports the mechanistic interpretation of multiscale, multimodal clinical data. Availability and implementation NeuroMMSig is available at http://neurommsig.scai.fraunhofer.de/. Contact martin.hofmann-apitius@scai.fraunhofer.de. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Daniel Domingo-Fernández
- Department of Bioinformatics, Fraunhofer Institute for Algorithms and Scientific Computing, Sankt Augustin 53754, Germany.,Rheinische Friedrich-Wilhelms-Universität Bonn, Bonn-Aachen International Center for IT, Bonn 53113, Germany
| | - Alpha Tom Kodamullil
- Department of Bioinformatics, Fraunhofer Institute for Algorithms and Scientific Computing, Sankt Augustin 53754, Germany.,Rheinische Friedrich-Wilhelms-Universität Bonn, Bonn-Aachen International Center for IT, Bonn 53113, Germany
| | - Anandhi Iyappan
- Department of Bioinformatics, Fraunhofer Institute for Algorithms and Scientific Computing, Sankt Augustin 53754, Germany.,Rheinische Friedrich-Wilhelms-Universität Bonn, Bonn-Aachen International Center for IT, Bonn 53113, Germany
| | - Mufassra Naz
- Department of Bioinformatics, Fraunhofer Institute for Algorithms and Scientific Computing, Sankt Augustin 53754, Germany.,Rheinische Friedrich-Wilhelms-Universität Bonn, Bonn-Aachen International Center for IT, Bonn 53113, Germany
| | - Mohammad Asif Emon
- Department of Bioinformatics, Fraunhofer Institute for Algorithms and Scientific Computing, Sankt Augustin 53754, Germany.,Rheinische Friedrich-Wilhelms-Universität Bonn, Bonn-Aachen International Center for IT, Bonn 53113, Germany
| | - Tamara Raschka
- Department of Bioinformatics, Fraunhofer Institute for Algorithms and Scientific Computing, Sankt Augustin 53754, Germany.,Rheinische Friedrich-Wilhelms-Universität Bonn, Bonn-Aachen International Center for IT, Bonn 53113, Germany
| | - Reagon Karki
- Department of Bioinformatics, Fraunhofer Institute for Algorithms and Scientific Computing, Sankt Augustin 53754, Germany.,Rheinische Friedrich-Wilhelms-Universität Bonn, Bonn-Aachen International Center for IT, Bonn 53113, Germany
| | - Stephan Springstubbe
- Department of Bioinformatics, Fraunhofer Institute for Algorithms and Scientific Computing, Sankt Augustin 53754, Germany
| | - Christian Ebeling
- Department of Bioinformatics, Fraunhofer Institute for Algorithms and Scientific Computing, Sankt Augustin 53754, Germany
| | - Martin Hofmann-Apitius
- Department of Bioinformatics, Fraunhofer Institute for Algorithms and Scientific Computing, Sankt Augustin 53754, Germany.,Rheinische Friedrich-Wilhelms-Universität Bonn, Bonn-Aachen International Center for IT, Bonn 53113, Germany
| |
Collapse
|
21
|
Abstract
The prevalence of purifying selection in the nature suggests that larger organisms bear a higher number of slightly deleterious mutations because of smaller populations and therefore weaker selection. In this work redistribution of purifying selection in favor of information genes, pathways and processes was found in primates compared with treeshrew and rodents on the ground of genome-wide analysis. The genes which are more favored in primates belong mainly to regulation of gene expression and development, in treeshrew and rodents, to metabolism, transport, energetics, reproduction and olfaction. The former occur predominantly in the nucleus, the latter, in the cytoplasm and membranes. Thus, although purifying selection is on average weaker in the primates, it is stronger concentrated on the "information technology" of life (regulation of gene expression and development). Increased accuracy of information processes probably allows escaping "error catastrophes" in spite of more complex organization, larger body size and higher longevity.
Collapse
|
22
|
Dong X, Cong S. Identification of differentially expressed genes and regulatory relationships in Huntington's disease by bioinformatics analysis. Mol Med Rep 2018; 17:4317-4326. [PMID: 29328442 PMCID: PMC5802203 DOI: 10.3892/mmr.2018.8410] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2017] [Accepted: 12/20/2017] [Indexed: 12/14/2022] Open
Abstract
Huntington's disease (HD) is an inherited, progressive neurodegenerative disease caused by a CAG expansion in the huntingtin (HTT) gene; various dysfunctions of biological processes in HD have been proposed. However, at present the exact pathogenesis of HD is not fully understood. The present study aimed to explore the pathogenesis of HD using a computational bioinformatics analysis of gene expression. GSE11358 was downloaded from the Gene Expression Omnibus andthe differentially expressed genes (DEGs) in the mutant HTT knock-in cell model STHdhQ111/Q111 were predicted. DEGs between the HD and control samples were screened using the limma package in R. Functional and pathway enrichment analyses were conducted using the database for annotation, visualization and integrated discovery software. A protein-protein interaction (PPI) network was established by the search tool for the retrieval of interacting genes and visualized by Cytoscape. Module analysis of the PPI network was performed utilizing MCODE. A total of 471 DEGs were identified, including ribonuclease A family member 4 (RNASE4). In addition, 41 significantly enriched Kyoto Encyclopedia of Genes and Genomes pathways, as well as several significant Gene Ontology terms (including cytokine-cytokine receptor interaction and cytosolic DNA-sensing) were identified. A total of 18 significant modules were identified from the PPI network. Furthermore, a novel transcriptional regulatory relationship was identified, namely signal transducer and activator of transcription 3 (STAT3), which is regulated by miRNA-124 in HD. In conclusion, deregulation of 18 critical genes may contribute to the occurrence of HD. RNASE4, STAT3, and miRNA-124 may have a regulatory association with the pathological mechanisms in HD.
Collapse
Affiliation(s)
- Xiaoyu Dong
- Department of Neurology, Shengjing Hospital of China Medical University, Shenyang, Liaoning 110004, P.R. China
| | - Shuyan Cong
- Department of Neurology, Shengjing Hospital of China Medical University, Shenyang, Liaoning 110004, P.R. China
| |
Collapse
|
23
|
Renzi C, Provencal N, Bassil KC, Evers K, Kihlbom U, Radford EJ, Koupil I, Mueller-Myhsok B, Hansson MG, Rutten BP. From Epigenetic Associations to Biological and Psychosocial Explanations in Mental Health. PROGRESS IN MOLECULAR BIOLOGY AND TRANSLATIONAL SCIENCE 2018; 158:299-323. [DOI: 10.1016/bs.pmbts.2018.04.011] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
|
24
|
Gozes I. Specific protein biomarker patterns for Alzheimer's disease: improved diagnostics in progress. EPMA J 2017; 8:255-259. [PMID: 29021836 DOI: 10.1007/s13167-017-0110-x] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2017] [Accepted: 08/10/2017] [Indexed: 12/12/2022]
Abstract
This short review looks at Alzheimer's disease (AD) diagnosis through my own point of view, going from imaging through cerebrospinal fluid to blood proteins. Over the last couple of years, we have published two papers targeted at Alzheimer's diagnosis. In one paper, we took an approach of selecting a specific target, namely, activity-dependent neuroprotective protein (ADNP), and our results tightened the association of ADNP blood expression with intelligence. In another paper, we took an unbiased approach of analysis of all genes expressed in lymphoblastoid cells lines and discovered changes in expression of the regulator of G-protein signaling 2 (RGS2) as a potential AD predictor. This review will assess our data in comparison to selected independent studies focusing on blood protein biomarkers as well as assessing saliva and urine samples with potential predictive value for AD. Furthermore, the review will provide directions for a combination of innovative markers, stratifying the population toward disease prevention and personalized medicine.
Collapse
Affiliation(s)
- Illana Gozes
- The Lily and Avraham Gildor Chair for the Investigation of Growth Factors, Elton Laboratory for Neuroendocrinology, Department of Human Molecular Genetics and Biochemistry, Sackler Faculty of Medicine, Adams Super Center for Brain Studies and Sagol School of Neuroscience, Tel Aviv University, 69978 Tel Aviv, Israel
| |
Collapse
|
25
|
Geerts H, Spiros A, Roberts P, Carr R. Towards the virtual human patient. Quantitative Systems Pharmacology in Alzheimer's disease. Eur J Pharmacol 2017; 817:38-45. [PMID: 28583429 DOI: 10.1016/j.ejphar.2017.05.062] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2016] [Revised: 05/05/2017] [Accepted: 05/31/2017] [Indexed: 12/26/2022]
Abstract
Development of successful therapeutic interventions in Central Nervous Systems (CNS) disorders is a daunting challenge with a low success rate. Probable reasons include the lack of translation from preclinical animal models, the individual variability of many pathological processes converging upon the same clinical phenotype, the pharmacodynamical interaction of various comedications and last but not least the complexity of the human brain. This paper argues for a re-engineering of the pharmaceutical CNS Research & Development strategy using ideas focused on advanced computer modeling and simulation from adjacent engineering-based industries. We provide examples that such a Quantitative Systems Pharmacology approach based on computer simulation of biological processes and that combines the best of preclinical research with actual clinical outcomes can enhance translation to the clinical situation. We will expand upon (1) the need to go from Big Data to Smart Data and develop predictive and quantitative algorithms that are actionable for the pharma industry, (2) using this platform as a "knowledge machine" that captures community-wide expertise in an active hypothesis-testing approach, (3) learning from failed clinical trials and (4) the need to go beyond simple linear hypotheses and embrace complex non-linear hypotheses. We will propose a strategy for applying these concepts to the substantial individual variability of AD patient subgroups and the treatment of neuropsychiatric problems in AD. Quantitative Systems Pharmacology is a new 'humanized' tool for supporting drug discovery and development in general and CNS disorders in particular.
Collapse
Affiliation(s)
- Hugo Geerts
- In Silico Biosciences, Lexington, MA, USA; Perelman School of Medicine, Univ. of Pennsylvania, Philadelphia, PA, USA.
| | | | - Patrick Roberts
- Department of Biomedical Engineering, Oregon Health & Science University, Portland OR, USA
| | | |
Collapse
|
26
|
Automated extraction of potential migraine biomarkers using a semantic graph. J Biomed Inform 2017; 71:178-189. [PMID: 28579531 DOI: 10.1016/j.jbi.2017.05.018] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2016] [Revised: 04/03/2017] [Accepted: 05/23/2017] [Indexed: 01/20/2023]
Abstract
PROBLEM Biomedical literature and databases contain important clues for the identification of potential disease biomarkers. However, searching these enormous knowledge reservoirs and integrating findings across heterogeneous sources is costly and difficult. Here we demonstrate how semantically integrated knowledge, extracted from biomedical literature and structured databases, can be used to automatically identify potential migraine biomarkers. METHOD We used a knowledge graph containing more than 3.5 million biomedical concepts and 68.4 million relationships. Biochemical compound concepts were filtered and ranked by their potential as biomarkers based on their connections to a subgraph of migraine-related concepts. The ranked results were evaluated against the results of a systematic literature review that was performed manually by migraine researchers. Weight points were assigned to these reference compounds to indicate their relative importance. RESULTS Ranked results automatically generated by the knowledge graph were highly consistent with results from the manual literature review. Out of 222 reference compounds, 163 (73%) ranked in the top 2000, with 547 out of the 644 (85%) weight points assigned to the reference compounds. For reference compounds that were not in the top of the list, an extensive error analysis has been performed. When evaluating the overall performance, we obtained a ROC-AUC of 0.974. DISCUSSION Semantic knowledge graphs composed of information integrated from multiple and varying sources can assist researchers in identifying potential disease biomarkers.
Collapse
|
27
|
Parva E, Boostani R, Ghahramani Z, Paydar S. The Necessity of Data Mining in Clinical Emergency Medicine; A Narrative Review of the Current Literatrue. Bull Emerg Trauma 2017; 5:90-95. [PMID: 28507995 PMCID: PMC5406178] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2017] [Revised: 02/20/2017] [Accepted: 02/24/2017] [Indexed: 06/07/2023] Open
Abstract
Clinical databases can be categorized as big data, include large quantities of information about patients and their medical conditions. Analyzing the quantitative and qualitative clinical data in addition with discovering relationships among huge number of samples using data mining techniques could unveil hidden medical knowledge in terms of correlation and association of apparently independent variables. The aim of this research is using predictive algorithm for prediction of trauma patients on admission to hospital to be able to predict the necessary treatment for patients and provided the necessary measures for the trauma patients who are before entering the critical situation. This study provides a review on data mining in clinical medicine. The relevant, recently-published studies of data mining on medical data with a focus on emergency medicine were investigated to tackle pros and cons of such approaches. The results of this study can be used in prediction of trauma patient’s status at six hours after admission to hospital.
Collapse
Affiliation(s)
- Elahe Parva
- Technical and Vocational University, Shiraz, Iran
| | - Reza Boostani
- Biomedical Engineering Group, CSE & IT Dept., ECE Faculty, Shiraz University, Shiraz, Iran
| | - Zahra Ghahramani
- Trauma Research Center, Shahid Rajaee (Emtiaz) Trauma Hospital, Shiraz University of Medical Sciences, Shiraz, Iran
| | - Shahram Paydar
- Trauma Research Center, Shahid Rajaee (Emtiaz) Trauma Hospital, Shiraz University of Medical Sciences, Shiraz, Iran
| |
Collapse
|
28
|
Hahn T, Nierenberg AA, Whitfield-Gabrieli S. Predictive analytics in mental health: applications, guidelines, challenges and perspectives. Mol Psychiatry 2017; 22:37-43. [PMID: 27843153 DOI: 10.1038/mp.2016.201] [Citation(s) in RCA: 62] [Impact Index Per Article: 8.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/22/2016] [Revised: 08/16/2016] [Accepted: 09/22/2016] [Indexed: 01/23/2023]
Abstract
The emerging field of 'predictive analytics in mental health' has recently generated tremendous interest with the bold promise to revolutionize clinical practice in psychiatry paralleling similar developments in personalized and precision medicine. Here, we provide an overview of the key questions and challenges in the field, aiming to (1) propose general guidelines for predictive analytics projects in psychiatry, (2) provide a conceptual introduction to core aspects of predictive modeling technology, and (3) foster a broad and informed discussion involving all stakeholders including researchers, clinicians, patients, funding bodies and policymakers.
Collapse
Affiliation(s)
- T Hahn
- Department of Cognitive Psychology II, Goethe-University Frankfurt, Frankfurt am Main, Germany
| | - A A Nierenberg
- Bipolar Clinic and Research Program, Department of Psychiatry, Massachusetts General Hospital, Boston, MA, USA.,Department of Psychiatry, Massachusetts General Hospital, Boston, MA, USA
| | - S Whitfield-Gabrieli
- McGovern Institute for Brain Research and Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA, USA
| |
Collapse
|
29
|
Korbolina EE, Zhdankina AA, Fursova AZ, Kozhevnikova OS, Kolosova NG. Genes of susceptibility to early neurodegenerative changes in the rat retina and brain: analysis by means of congenic strains. BMC Genet 2016; 17:153. [PMID: 28105932 PMCID: PMC5249004 DOI: 10.1186/s12863-016-0461-7] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
BACKGROUND There has been considerable interest in discovery of the genetic architecture of complex traits, particularly age-related neurodegenerative disorders. To predict disease risk and to understand its genetic basis in humans, it is necessary to study animal models. Our previous research on the accelerated-senescence OXYS strain has revealed two quantitative trait loci (QTLs) on rat chromosome 1 that are associated with early cataract and/or retinopathy as well as with behavioral abnormalities. Each locus was partially mapped within the introgressed segments in a certain congenic strain: WAG/OXYS-1.1 or WAG/OXYS-1.2. Retinal transcriptome profiling of 20-day-old congenic and OXYS rats by high-throughput RNA sequencing uncovered relevant candidate genes and pathways. Nonetheless, the question remained open whether the same genetic components simultaneously have effects on various manifestations of the accelerated-senescence phenotype in OXYS rats. The present study was designed to analyze the genes of susceptibility to early neurodegenerative processes taking place in the OXYS rat retina and brain and to assess their potential functional clustering. The study was based on the findings from recent publications (including mapping of quantitative trait loci) and on comparative phenotyping of congenic rat strains. RESULTS The backcrossing of Wistar Albino Glaxo (WAG) and OXYS strains to generate the congenics resulted in two congenic strains with high susceptibility to cataract and retinopathy but with no obvious signs of Alzheimer's disease-like brain pathology that are specific for OXYS rats. Thus, the genes of susceptibility to brain neurodegeneration were not introgressed into the congenic strains or there is a strong effect of the genetic background on the disease phenotype. Moreover, the progression of retinopathy with age was relatively less severe in the WAG background compared to the OXYS background. A comparative analysis of previously defined QTLs and congenic segments led to identification of candidate genes with a suspected effect on brain neurodegeneration including the genes showing differential expression in the congenic strains. CONCLUSION Overall, our findings suggest that the cause of the cataract and the cause of retinopathy phenotypes in OXYS rats may be genetically linked to each other within the introgressed segments in the WAG/OXYS-1.1 and/or WAG/OXYS-1.2 congenic strains.
Collapse
Affiliation(s)
- Elena E Korbolina
- Institute of Cytology and Genetics, SB RAS, Novosibirsk, Russia.
- Novosibirsk State University, Novosibirsk, Russia.
| | | | | | | | - Natalia G Kolosova
- Institute of Cytology and Genetics, SB RAS, Novosibirsk, Russia
- Novosibirsk State University, Novosibirsk, Russia
| |
Collapse
|
30
|
Towards a 21st-century roadmap for biomedical research and drug discovery: consensus report and recommendations. Drug Discov Today 2016; 22:327-339. [PMID: 27989722 DOI: 10.1016/j.drudis.2016.10.011] [Citation(s) in RCA: 51] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2016] [Revised: 09/15/2016] [Accepted: 10/24/2016] [Indexed: 02/08/2023]
Abstract
Decades of costly failures in translating drug candidates from preclinical disease models to human therapeutic use warrant reconsideration of the priority placed on animal models in biomedical research. Following an international workshop attended by experts from academia, government institutions, research funding bodies, and the corporate and non-governmental organisation (NGO) sectors, in this consensus report, we analyse, as case studies, five disease areas with major unmet needs for new treatments. In view of the scientifically driven transition towards a human pathways-based paradigm in toxicology, a similar paradigm shift appears to be justified in biomedical research. There is a pressing need for an approach that strategically implements advanced, human biology-based models and tools to understand disease pathways at multiple biological scales. We present recommendations to help achieve this.
Collapse
|
31
|
Gawron P, Ostaszewski M, Satagopam V, Gebel S, Mazein A, Kuzma M, Zorzan S, McGee F, Otjacques B, Balling R, Schneider R. MINERVA-a platform for visualization and curation of molecular interaction networks. NPJ Syst Biol Appl 2016; 2:16020. [PMID: 28725475 PMCID: PMC5516855 DOI: 10.1038/npjsba.2016.20] [Citation(s) in RCA: 54] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2016] [Revised: 06/15/2016] [Accepted: 06/24/2016] [Indexed: 12/11/2022] Open
Abstract
Our growing knowledge about various molecular mechanisms is becoming increasingly more structured and accessible. Different repositories of molecular interactions and available literature enable construction of focused and high-quality molecular interaction networks. Novel tools for curation and exploration of such networks are needed, in order to foster the development of a systems biology environment. In particular, solutions for visualization, annotation and data cross-linking will facilitate usage of network-encoded knowledge in biomedical research. To this end we developed the MINERVA (Molecular Interaction NEtwoRks VisuAlization) platform, a standalone webservice supporting curation, annotation and visualization of molecular interaction networks in Systems Biology Graphical Notation (SBGN)-compliant format. MINERVA provides automated content annotation and verification for improved quality control. The end users can explore and interact with hosted networks, and provide direct feedback to content curators. MINERVA enables mapping drug targets or overlaying experimental data on the visualized networks. Extensive export functions enable downloading areas of the visualized networks as SBGN-compliant models for efficient reuse of hosted networks. The software is available under Affero GPL 3.0 as a Virtual Machine snapshot, Debian package and Docker instance at http://r3lab.uni.lu/web/minerva-website/. We believe that MINERVA is an important contribution to systems biology community, as its architecture enables set-up of locally or globally accessible SBGN-oriented repositories of molecular interaction networks. Its functionalities allow overlay of multiple information layers, facilitating exploration of content and interpretation of data. Moreover, annotation and verification workflows of MINERVA improve the efficiency of curation of networks, allowing life-science researchers to better engage in development and use of biomedical knowledge repositories.
Collapse
Affiliation(s)
- Piotr Gawron
- Luxembourg Centre for Systems Biomedicine, Université du Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Marek Ostaszewski
- Luxembourg Centre for Systems Biomedicine, Université du Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Venkata Satagopam
- Luxembourg Centre for Systems Biomedicine, Université du Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Stephan Gebel
- Luxembourg Centre for Systems Biomedicine, Université du Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Alexander Mazein
- European Institute for Systems Biology and Medicine, Université de Lyon, eTRIKS Consortium, Lyon, France
| | - Michal Kuzma
- Institute of Computing Science, Poznan University of Technology, Poznan, Poland
| | - Simone Zorzan
- Luxembourg Institute of Science and Technology, Belvaux, Luxembourg
| | - Fintan McGee
- Luxembourg Institute of Science and Technology, Belvaux, Luxembourg
| | - Benoît Otjacques
- Luxembourg Institute of Science and Technology, Belvaux, Luxembourg
| | - Rudi Balling
- Luxembourg Centre for Systems Biomedicine, Université du Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Reinhard Schneider
- Luxembourg Centre for Systems Biomedicine, Université du Luxembourg, Esch-sur-Alzette, Luxembourg
| |
Collapse
|
32
|
Auffray C, Balling R, Barroso I, Bencze L, Benson M, Bergeron J, Bernal-Delgado E, Blomberg N, Bock C, Conesa A, Del Signore S, Delogne C, Devilee P, Di Meglio A, Eijkemans M, Flicek P, Graf N, Grimm V, Guchelaar HJ, Guo YK, Gut IG, Hanbury A, Hanif S, Hilgers RD, Honrado Á, Hose DR, Houwing-Duistermaat J, Hubbard T, Janacek SH, Karanikas H, Kievits T, Kohler M, Kremer A, Lanfear J, Lengauer T, Maes E, Meert T, Müller W, Nickel D, Oledzki P, Pedersen B, Petkovic M, Pliakos K, Rattray M, I Màs JR, Schneider R, Sengstag T, Serra-Picamal X, Spek W, Vaas LAI, van Batenburg O, Vandelaer M, Varnai P, Villoslada P, Vizcaíno JA, Wubbe JPM, Zanetti G. Making sense of big data in health research: Towards an EU action plan. Genome Med 2016; 8:71. [PMID: 27338147 PMCID: PMC4919856 DOI: 10.1186/s13073-016-0323-y] [Citation(s) in RCA: 124] [Impact Index Per Article: 15.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023] Open
Abstract
Medicine and healthcare are undergoing profound changes. Whole-genome sequencing and high-resolution imaging technologies are key drivers of this rapid and crucial transformation. Technological innovation combined with automation and miniaturization has triggered an explosion in data production that will soon reach exabyte proportions. How are we going to deal with this exponential increase in data production? The potential of "big data" for improving health is enormous but, at the same time, we face a wide range of challenges to overcome urgently. Europe is very proud of its cultural diversity; however, exploitation of the data made available through advances in genomic medicine, imaging, and a wide range of mobile health applications or connected devices is hampered by numerous historical, technical, legal, and political barriers. European health systems and databases are diverse and fragmented. There is a lack of harmonization of data formats, processing, analysis, and data transfer, which leads to incompatibilities and lost opportunities. Legal frameworks for data sharing are evolving. Clinicians, researchers, and citizens need improved methods, tools, and training to generate, analyze, and query data effectively. Addressing these barriers will contribute to creating the European Single Market for health, which will improve health and healthcare for all Europeans.
Collapse
Affiliation(s)
- Charles Auffray
- European Institute for Systems Biology and Medicine, 1 avenue Claude Vellefaux, 75010, Paris, France.
- CIRI-UMR5308, CNRS-ENS-INSERM-UCBL, Université de Lyon, 50 avenue Tony Garnier, 69007, Lyon, France.
| | - Rudi Balling
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, 7 Avenue des Hauts Fourneaux, 4362, Esch-sur-Alzette, Luxembourg.
| | - Inês Barroso
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - László Bencze
- Health Services Management Training Centre, Faculty of Health and Public Services, Semmelweis University, Kútvölgyi út 2, 1125, Budapest, Hungary
| | - Mikael Benson
- Centre for Personalised Medicine, Linköping University, 581 85, Linköping, Sweden
| | - Jay Bergeron
- Translational & Bioinformatics, Pfizer Inc., 300 Technology Square, Cambridge, MA, 02139, USA
| | - Enrique Bernal-Delgado
- Institute for Health Sciences, IACS - IIS Aragon, San Juan Bosco 13, 50009, Zaragoza, Spain
| | - Niklas Blomberg
- ELIXIR, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Christoph Bock
- CeMM Research Center for Molecular Medicine of the Austrian Academy of Sciences, Lazarettgasse 14, AKH BT25.2, 1090, Vienna, Austria
- Department of Laboratory Medicine, Medical University of Vienna, Lazarettgasse 14, AKH BT25.2, 1090, Vienna, Austria
- Max Planck Institute for Informatics, Campus E1 4, 66123, Saarbrücken, Germany
| | - Ana Conesa
- Príncipe Felipe Research Center, C/ Eduardo Primo Yúfera 3, 46012, Valencia, Spain
- University of Florida, Institute of Food and Agricultural Sciences (IFAS), 2033 Mowry Road, Gainesville, FL, 32610, USA
| | | | - Christophe Delogne
- Technology, Data & Analytics, KPMG Luxembourg, Société Coopérative, 39 Avenue John F. Kennedy, 1855, Luxembourg, Luxembourg
| | - Peter Devilee
- Department of Human Genetics, Department of Pathology, Leiden University Medical Centre, Einthovenweg 20, 2333 ZC, Leiden, The Netherlands
| | - Alberto Di Meglio
- Information Technology Department, European Organization for Nuclear Research (CERN), 385 Route de Meyrin, 1211, Geneva 23, Switzerland
| | - Marinus Eijkemans
- Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Heidelberglaan 100, 3508 GA, Utrecht, The Netherlands
| | - Paul Flicek
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Norbert Graf
- Department of Pediatric Oncology/Hematology, Saarland University, Campus Homburg, Building 9, 66421, Homburg, Germany
| | - Vera Grimm
- Project Management Jülich, Forschungszentrum Jülich GmbH, Wilhelm-Johnen-Straße, 52428, Jülich, Germany
| | - Henk-Jan Guchelaar
- Department of Clinical Pharmacy & Toxicology, Leiden University Medical Center, Albinusdreef 2, 2333 ZA, Leiden, The Netherlands
| | - Yi-Ke Guo
- Data Science Institute, Imperial College London, South Kensington, London, SW7 2AZ, UK
| | - Ivo Glynne Gut
- CNAG-CRG, Center for Genomic Regulation, Barcelona Institute for Science and Technology (BIST), C/Baldiri Reixac 4, 08029, Barcelona, Spain
| | - Allan Hanbury
- Institute of Software Technology and Interactive Systems, TU Wien, Favoritenstrasse 9-11/188, 1040, Vienna, Austria
| | - Shahid Hanif
- The Association of the British Pharmaceutical Industry, 7th Floor, Southside, 105 Victoria Street, London, SW1E 6QT, UK
| | - Ralf-Dieter Hilgers
- Department of Medical Statistics, RWTH-Aachen University, Universitätsklinikum Aachen, Pauwelsstraße 30, 52074, Aachen, Germany
| | - Ángel Honrado
- SYNAPSE Research Management Partners, Diputació 237, Àtic 3ª, 08007, Barcelona, Spain
| | - D Rod Hose
- Department of Infection, Immunity and Cardiovascular Disease and Insigneo Institute for In-Silico Medicine, Medical School, University of Sheffield, Beech Hill Road, Sheffield, S10 2RX, UK
| | | | - Tim Hubbard
- Department of Medical & Molecular Genetics, King's College London, London, SE1 9RT, UK
- Genomics England, London, EC1M 6BQ, UK
| | - Sophie Helen Janacek
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Haralampos Karanikas
- National and Kapodistrian University of Athens, Medical School, Xristou Lada 6, 10561, Athens, Greece
| | - Tim Kievits
- Vitromics Healthcare Holding B.V., Onderwijsboulevard 225, 5223 DE, 's-Hertogenbosch, The Netherlands
| | - Manfred Kohler
- Fraunhofer Institute for Molecular Biology and Applied Ecology ScreeningPort, Schnackenburgallee 114, 22525, Hamburg, Germany
| | - Andreas Kremer
- ITTM S.A., 9 avenue des Hauts Fourneaux, 4362, Esch-sur-Alzette, Luxembourg
| | - Jerry Lanfear
- Research Business Technology, Pfizer Ltd, GP4 Building, Granta Park, Cambridge, CB21 6GP, UK
| | - Thomas Lengauer
- Max Planck Institute for Informatics, Campus E1 4, 66123, Saarbrücken, Germany
| | - Edith Maes
- Health Economics & Outcomes Research, Deloitte Belgium, Berkenlaan 8A, 1831, Diegem, Belgium
| | - Theo Meert
- Janssen Pharmaceutica N.V., R&D G3O, Turnhoutseweg 30, 2340, Beerse, Belgium
| | - Werner Müller
- Faculty of Life Sciences, University of Manchester, AV Hill Building, Oxford Road, Manchester, M13 9PT, UK
| | - Dörthe Nickel
- UMR3664 IC/CNRS, Institut Curie, Section Recherche, Pavillon Pasteur, 26 rue d'Ulm, 75248, Paris cedex 05, France
| | - Peter Oledzki
- Linguamatics Ltd, 324 Cambridge Science Park Milton Rd, Cambridge, CB4 0WG, UK
| | - Bertrand Pedersen
- PwC Luxembourg, 2 rue Gerhard Mercator, 2182, Luxembourg, Luxembourg
| | - Milan Petkovic
- Philips, HighTechCampus 36, 5656AE, Eindhoven, The Netherlands
| | - Konstantinos Pliakos
- Department of Public Health and Primary Care, KU Leuven Kulak, Etienne Sabbelaan 53, 8500, Kortrijk, Belgium
| | - Magnus Rattray
- Faculty of Life Sciences, University of Manchester, AV Hill Building, Oxford Road, Manchester, M13 9PT, UK
| | - Josep Redón I Màs
- INCLIVA Health Research Institute, University of Valencia, CIBERobn ISCIII, Avenida Menéndez Pelayo 4 accesorio, 46010, Valencia, Spain
| | - Reinhard Schneider
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, 7 Avenue des Hauts Fourneaux, 4362, Esch-sur-Alzette, Luxembourg
| | - Thierry Sengstag
- Swiss Institute of Bioinformatics (SIB) and University of Basel, Klingelbergstrasse 50/70, 4056, Basel, Switzerland
| | - Xavier Serra-Picamal
- Agency for Health Quality and Assessment of Catalonia (AQuAS), Carrer de Roc Boronat 81-95, 08005, Barcelona, Spain
| | - Wouter Spek
- EuroBioForum Foundation, Chrysantstraat 10, 3135 HG, Vlaardingen, The Netherlands
| | - Lea A I Vaas
- Fraunhofer Institute for Molecular Biology and Applied Ecology ScreeningPort, Schnackenburgallee 114, 22525, Hamburg, Germany
| | - Okker van Batenburg
- EuroBioForum Foundation, Chrysantstraat 10, 3135 HG, Vlaardingen, The Netherlands
| | - Marc Vandelaer
- Integrated BioBank of Luxembourg, 6 rue Nicolas-Ernest Barblé, 1210, Luxembourg, Luxembourg
| | - Peter Varnai
- Technopolis Group, 3 Pavilion Buildings, Brighton, BN1 1EE, UK
| | - Pablo Villoslada
- Hospital Clinic of Barcelona, Institute d'Investigacions Biomediques August Pi Sunyer (IDIBAPS), Rosello 149, 08036, Barcelona, Spain
| | - Juan Antonio Vizcaíno
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - John Peter Mary Wubbe
- European Platform for Patients' Organisations, Science and Industry (Epposi), De Meeûs Square 38-40, 1000, Brussels, Belgium
| | - Gianluigi Zanetti
- CRS4, Ed.1 POLARIS, 09129, Pula, Italy
- BBMRI-ERIC, Neue Stiftingtalstrasse 2/B/6, 8010, Graz, Austria
| |
Collapse
|
33
|
Satagopam V, Gu W, Eifes S, Gawron P, Ostaszewski M, Gebel S, Barbosa-Silva A, Balling R, Schneider R. Integration and Visualization of Translational Medicine Data for Better Understanding of Human Diseases. BIG DATA 2016; 4:97-108. [PMID: 27441714 PMCID: PMC4932659 DOI: 10.1089/big.2015.0057] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
Translational medicine is a domain turning results of basic life science research into new tools and methods in a clinical environment, for example, as new diagnostics or therapies. Nowadays, the process of translation is supported by large amounts of heterogeneous data ranging from medical data to a whole range of -omics data. It is not only a great opportunity but also a great challenge, as translational medicine big data is difficult to integrate and analyze, and requires the involvement of biomedical experts for the data processing. We show here that visualization and interoperable workflows, combining multiple complex steps, can address at least parts of the challenge. In this article, we present an integrated workflow for exploring, analysis, and interpretation of translational medicine data in the context of human health. Three Web services-tranSMART, a Galaxy Server, and a MINERVA platform-are combined into one big data pipeline. Native visualization capabilities enable the biomedical experts to get a comprehensive overview and control over separate steps of the workflow. The capabilities of tranSMART enable a flexible filtering of multidimensional integrated data sets to create subsets suitable for downstream processing. A Galaxy Server offers visually aided construction of analytical pipelines, with the use of existing or custom components. A MINERVA platform supports the exploration of health and disease-related mechanisms in a contextualized analytical visualization system. We demonstrate the utility of our workflow by illustrating its subsequent steps using an existing data set, for which we propose a filtering scheme, an analytical pipeline, and a corresponding visualization of analytical results. The workflow is available as a sandbox environment, where readers can work with the described setup themselves. Overall, our work shows how visualization and interfacing of big data processing services facilitate exploration, analysis, and interpretation of translational medicine data.
Collapse
Affiliation(s)
- Venkata Satagopam
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-Belval, Luxembourg
| | - Wei Gu
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-Belval, Luxembourg
| | - Serge Eifes
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-Belval, Luxembourg
- Information Technology for Translational Medicine (ITTM) S.A., Esch-Belval, Luxembourg
| | - Piotr Gawron
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-Belval, Luxembourg
| | - Marek Ostaszewski
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-Belval, Luxembourg
| | - Stephan Gebel
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-Belval, Luxembourg
| | - Adriano Barbosa-Silva
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-Belval, Luxembourg
| | - Rudi Balling
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-Belval, Luxembourg
| | - Reinhard Schneider
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-Belval, Luxembourg
| |
Collapse
|