1
|
Zhao Z, Cai Z, Zhang S, Yin X, Jiang T, Shen C, Yin Y, Sun H, Chen Z, Han J, Zhang B. Activation of the FOXM1/ASF1B/PRDX3 axis confers hyperproliferative and antioxidative stress reactivity to gastric cancer. Cancer Lett 2024; 589:216796. [PMID: 38537775 DOI: 10.1016/j.canlet.2024.216796] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2023] [Revised: 02/02/2024] [Accepted: 03/04/2024] [Indexed: 04/07/2024]
Abstract
Nucleosome assembly during DNA replication is dependent on histone chaperones. Recent studies suggest that dysregulated histone chaperones contribute to cancer progression, including gastric cancer (GC). Further studies are required to explore the prognostic and therapeutic implications of histone chaperones and their mechanisms of action in GC progression. Here we identified histone chaperone ASF1B as a potential biomarker for GC proliferation and prognosis. ASF1B was significantly upregulated in GC, which was associated with poor prognosis. In vitro and in vivo experiments demonstrated that the inhibition of ASF1B suppressed the malignant characteristics of GC, while overexpression of ASF1B had the opposite effect. Mechanistically, transcription factor FOXM1 directly bound to the ASF1B-promoter region, thereby regulating its transcription. Treatment with thiostrepton, a FOXM1 inhibitor, not only suppressed ASF1B expression, but also inhibited GC progression. Furthermore, ASF1B regulated the mitochondrial protein peroxiredoxin 3 (PRDX3) transcription in a FOXM1-dependent manner. The crucial role of ASF1B-regulated PRDX3 in GC cell proliferation and oxidative stress balance was also elucidated. In summary, our study suggests that the FOXM1-ASF1B-PRDX3 axis is a potential therapeutic target for treating GC.
Collapse
Affiliation(s)
- Zhou Zhao
- Gastric Cancer Center, Department of General Surgery, Research Laboratory of Tumor Epigenetics and Genomics, West China Hospital, Sichuan University, Chengdu, China; Gastrointestinal Cancer Center, Chongqing University Cancer Hospital, Chongqing, China
| | - Zhaolun Cai
- Gastric Cancer Center, Department of General Surgery, Research Laboratory of Tumor Epigenetics and Genomics, West China Hospital, Sichuan University, Chengdu, China
| | - Su Zhang
- State Key Laboratory of Biotherapy and Cancer Center, and Frontiers Science Center for Disease-related Molecular Network, West China Hospital, Sichuan University, Chengdu, China
| | - Xiaonan Yin
- Gastric Cancer Center, Department of General Surgery, Research Laboratory of Tumor Epigenetics and Genomics, West China Hospital, Sichuan University, Chengdu, China
| | - Tianxiang Jiang
- Gastric Cancer Center, Department of General Surgery, Research Laboratory of Tumor Epigenetics and Genomics, West China Hospital, Sichuan University, Chengdu, China
| | - Chaoyong Shen
- Gastric Cancer Center, Department of General Surgery, Research Laboratory of Tumor Epigenetics and Genomics, West China Hospital, Sichuan University, Chengdu, China
| | - Yuan Yin
- Gastric Cancer Center, Department of General Surgery, Research Laboratory of Tumor Epigenetics and Genomics, West China Hospital, Sichuan University, Chengdu, China
| | - Hao Sun
- Gastrointestinal Cancer Center, Chongqing University Cancer Hospital, Chongqing, China
| | - Zhixin Chen
- Gastric Cancer Center, Department of General Surgery, Research Laboratory of Tumor Epigenetics and Genomics, West China Hospital, Sichuan University, Chengdu, China
| | - Junhong Han
- State Key Laboratory of Biotherapy and Cancer Center, and Frontiers Science Center for Disease-related Molecular Network, West China Hospital, Sichuan University, Chengdu, China.
| | - Bo Zhang
- Gastric Cancer Center, Department of General Surgery, Research Laboratory of Tumor Epigenetics and Genomics, West China Hospital, Sichuan University, Chengdu, China.
| |
Collapse
|
2
|
Zogopoulos VL, Malatras A, Kyriakidis K, Charalampous C, Makrygianni EA, Duguez S, Koutsi MA, Pouliou M, Vasileiou C, Duddy WJ, Agelopoulos M, Chrousos GP, Iconomidou VA, Michalopoulos I. HGCA2.0: An RNA-Seq Based Webtool for Gene Coexpression Analysis in Homo sapiens. Cells 2023; 12:cells12030388. [PMID: 36766730 PMCID: PMC9913097 DOI: 10.3390/cells12030388] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2022] [Revised: 01/09/2023] [Accepted: 01/19/2023] [Indexed: 01/24/2023] Open
Abstract
Genes with similar expression patterns in a set of diverse samples may be considered coexpressed. Human Gene Coexpression Analysis 2.0 (HGCA2.0) is a webtool which studies the global coexpression landscape of human genes. The website is based on the hierarchical clustering of 55,431 Homo sapiens genes based on a large-scale coexpression analysis of 3500 GTEx bulk RNA-Seq samples of healthy individuals, which were selected as the best representative samples of each tissue type. HGCA2.0 presents subclades of coexpressed genes to a gene of interest, and performs various built-in gene term enrichment analyses on the coexpressed genes, including gene ontologies, biological pathways, protein families, and diseases, while also being unique in revealing enriched transcription factors driving coexpression. HGCA2.0 has been successful in identifying not only genes with ubiquitous expression patterns, but also tissue-specific genes. Benchmarking showed that HGCA2.0 belongs to the top performing coexpression webtools, as shown by STRING analysis. HGCA2.0 creates working hypotheses for the discovery of gene partners or common biological processes that can be experimentally validated. It offers a simple and intuitive website design and user interface, as well as an API endpoint.
Collapse
Affiliation(s)
- Vasileios L. Zogopoulos
- Centre of Systems Biology, Biomedical Research Foundation, Academy of Athens, 11527 Athens, Greece
- Section of Cell Biology and Biophysics, Department of Biology, National and Kapodistrian University of Athens, 15701 Athens, Greece
| | - Apostolos Malatras
- Biobank.cy Center of Excellence in Biobanking and Biomedical Research, University of Cyprus, 2029 Nicosia, Cyprus
| | - Konstantinos Kyriakidis
- Centre of Systems Biology, Biomedical Research Foundation, Academy of Athens, 11527 Athens, Greece
- School of Pharmacy, Aristotle University of Thessaloniki, 54124 Thessaloniki, Greece
| | - Chrysanthi Charalampous
- Centre of Basic Research, Biomedical Research Foundation, Academy of Athens, 11527 Athens, Greece
| | - Evanthia A. Makrygianni
- University Research Institute of Maternal and Child Health and Precision Medicine, National and Kapodistrian University of Athens, 11527 Athens, Greece
| | - Stéphanie Duguez
- Personalised Medicine Centre, School of Medicine, Ulster University, Derry-Londonderry BT47 6SB, UK
| | - Marianna A. Koutsi
- Centre of Basic Research, Biomedical Research Foundation, Academy of Athens, 11527 Athens, Greece
| | - Marialena Pouliou
- Centre of Basic Research, Biomedical Research Foundation, Academy of Athens, 11527 Athens, Greece
| | - Christos Vasileiou
- Centre of Systems Biology, Biomedical Research Foundation, Academy of Athens, 11527 Athens, Greece
- Engineering Design and Computing Laboratory, ETH Zurich, 8092 Zurich, Switzerland
| | - William J. Duddy
- Personalised Medicine Centre, School of Medicine, Ulster University, Derry-Londonderry BT47 6SB, UK
| | - Marios Agelopoulos
- Centre of Basic Research, Biomedical Research Foundation, Academy of Athens, 11527 Athens, Greece
| | - George P. Chrousos
- University Research Institute of Maternal and Child Health and Precision Medicine, National and Kapodistrian University of Athens, 11527 Athens, Greece
| | - Vassiliki A. Iconomidou
- Section of Cell Biology and Biophysics, Department of Biology, National and Kapodistrian University of Athens, 15701 Athens, Greece
| | - Ioannis Michalopoulos
- Centre of Systems Biology, Biomedical Research Foundation, Academy of Athens, 11527 Athens, Greece
- Correspondence:
| |
Collapse
|
3
|
Lan T, Hutvagner G, Zhang X, Liu T, Wong L, Li J. Density-based detection of cell transition states to construct disparate and bifurcating trajectories. Nucleic Acids Res 2022; 50:e122. [PMID: 36124665 PMCID: PMC9757071 DOI: 10.1093/nar/gkac785] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2022] [Revised: 08/22/2022] [Accepted: 09/01/2022] [Indexed: 12/24/2022] Open
Abstract
Tree- and linear-shaped cell differentiation trajectories have been widely observed in developmental biologies and can be also inferred through computational methods from single-cell RNA-sequencing datasets. However, trajectories with complicated topologies such as loops, disparate lineages and bifurcating hierarchy remain difficult to infer accurately. Here, we introduce a density-based trajectory inference method capable of constructing diverse shapes of topological patterns including the most intriguing bifurcations. The novelty of our method is a step to exploit overlapping probability distributions to identify transition states of cells for determining connectability between cell clusters, and another step to infer a stable trajectory through a base-topology guided iterative fitting. Our method precisely re-constructed various benchmark reference trajectories. As a case study to demonstrate practical usefulness, our method was tested on single-cell RNA sequencing profiles of blood cells of SARS-CoV-2-infected patients. We not only re-discovered the linear trajectory bridging the transition from IgM plasmablast cells to developing neutrophils, and also found a previously-undiscovered lineage which can be rigorously supported by differentially expressed gene analysis.
Collapse
Affiliation(s)
- Tian Lan
- Data Science Institute and School of Computer Science, University of Technology Sydney, Ultimo, NSW 2007, Australia
| | - Gyorgy Hutvagner
- School of Biomedical Engineering, University of Technology Sydney, Ultimo, NSW 2007, Australia
| | - Xuan Zhang
- Data Science Institute and School of Computer Science, University of Technology Sydney, Ultimo, NSW 2007, Australia
| | - Tao Liu
- Children’s Cancer Institute Australia for Medical Research, Randwick, NSW 2031, Australia
| | - Limsoon Wong
- School of Computing, National University of Singapore, 13 Computing Drive, 117417, Singapore
| | - Jinyan Li
- Data Science Institute and School of Computer Science, University of Technology Sydney, Ultimo, NSW 2007, Australia
| |
Collapse
|
4
|
Asadi MR, Talebi M, Gharesouran J, Sabaie H, Jalaiei A, Arsang-Jang S, Taheri M, Sayad A, Rezazadeh M. Analysis of ROQUIN, Tristetraprolin (TTP), and BDNF/miR-16/TTP regulatory axis in late onset Alzheimer’s disease. Front Aging Neurosci 2022; 14:933019. [PMID: 36016853 PMCID: PMC9397504 DOI: 10.3389/fnagi.2022.933019] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2022] [Accepted: 07/14/2022] [Indexed: 12/25/2022] Open
Abstract
Alzheimer’s disease (AD) is a heterogeneous degenerative disorder of the brain that is on the rise worldwide. One of the critical processes that might be disturbed in AD is gene expression regulation. Tristetraprolin (TTP) and RC3H1 gene (ROQUIN) are two RNA-binding proteins (RBPs) that target AU-rich elements (AREs) and constitutive decay elements (CDEs), respectively. TTP and ROQUIN, members of the CCCH zinc-finger protein family, have been demonstrated to fine-tune numerous inflammatory factors. In addition, miR-16 has distinct characteristics and may influence the target mRNA through the ARE site. Interestingly, BDNF mRNA has ARE sites in the 3’ untranslated region (UTR) and can be targeted by regulatory factors, such as TTP and miR-16 on MRE sequences, forming BDNF/miR-16/TTP regulatory axis. A number of two microarray datasets were downloaded, including information on mRNAs (GSE106241) and miRNAs (GSE157239) from individuals with AD and corresponding controls. R software was used to identify BDNF, TTP, ROQUIN, and miR-16 expression levels in temporal cortex (TC) tissue datasets. Q-PCR was also used to evaluate the expression of these regulatory factors and the expression of BDNF in the blood of 50 patients with AD and 50 controls. Bioinformatic evaluation showed that TTP and miR-16 overexpression might act as post-transcriptional regulatory factors to control BDNF expression in AD in TC samples. Instead, this expression pattern was not found in peripheral blood samples from patients with AD compared to normal controls. ROQUIN expression was increased in the peripheral blood of patients with AD. Hsa-miR-16-5p levels did not show significant differences in peripheral blood samples. Finally, it was shown that TTP and BDNF, based on evaluating the receiver operating characteristic (ROC), effectively identify patients with AD from healthy controls. This study could provide a new perspective on the molecular regulatory processes associated with AD pathogenic mechanisms linked to the BDNF growth factor, although further research is needed on the possible roles of these factors in AD.
Collapse
Affiliation(s)
- Mohammad Reza Asadi
- Molecular Medicine Research Center, Tabriz University of Medical Sciences, Tabriz, Iran
- Clinical Research Development Unit of Tabriz Valiasr Hospital, Tabriz University of Medical Sciences, Tabriz, Iran
| | - Mahnaz Talebi
- Neurosciences Research Center (NSRC), Tabriz University of Medical Sciences, Tabriz, Iran
| | - Jalal Gharesouran
- Clinical Research Development Unit of Tabriz Valiasr Hospital, Tabriz University of Medical Sciences, Tabriz, Iran
| | - Hani Sabaie
- Clinical Research Development Unit of Tabriz Valiasr Hospital, Tabriz University of Medical Sciences, Tabriz, Iran
| | - Abbas Jalaiei
- Clinical Research Development Unit of Tabriz Valiasr Hospital, Tabriz University of Medical Sciences, Tabriz, Iran
| | - Shahram Arsang-Jang
- Cancer Gene Therapy Research Center, Zanjan University of Medical Sciences, Zanjan, Iran
| | - Mohammad Taheri
- Institute of Human Genetics, Jena University Hospital, Jena, Germany
| | - Arezou Sayad
- Department of Medical Genetics, School of Medicine, Shahid Beheshti University of Medical Sciences, Tehran, Iran
- *Correspondence: Arezou Sayad,
| | - Maryam Rezazadeh
- Molecular Medicine Research Center, Tabriz University of Medical Sciences, Tabriz, Iran
- Clinical Research Development Unit of Tabriz Valiasr Hospital, Tabriz University of Medical Sciences, Tabriz, Iran
- Maryam Rezazadeh,
| |
Collapse
|
5
|
Approaches in Gene Coexpression Analysis in Eukaryotes. BIOLOGY 2022; 11:biology11071019. [PMID: 36101400 PMCID: PMC9312353 DOI: 10.3390/biology11071019] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/06/2022] [Revised: 06/28/2022] [Accepted: 07/04/2022] [Indexed: 11/22/2022]
Abstract
Simple Summary Genes whose expression levels rise and fall similarly in a large set of samples, may be considered coexpressed. Gene coexpression analysis refers to the en masse discovery of coexpressed genes from a large variety of transcriptomic experiments. The type of biological networks that studies gene coexpression, known as Gene Coexpression Networks, consist of an undirected graph depicting genes and their coexpression relationships. Coexpressed genes are clustered in smaller subnetworks, the predominant biological roles of which can be determined through enrichment analysis. By studying well-annotated gene partners, the attribution of new roles to genes of unknown function or assumption for participation in common metabolic pathways can be achieved, through a guilt-by-association approach. In this review, we present key issues in gene coexpression analysis, as well as the most popular tools that perform it. Abstract Gene coexpression analysis constitutes a widely used practice for gene partner identification and gene function prediction, consisting of many intricate procedures. The analysis begins with the collection of primary transcriptomic data and their preprocessing, continues with the calculation of the similarity between genes based on their expression values in the selected sample dataset and results in the construction and visualisation of a gene coexpression network (GCN) and its evaluation using biological term enrichment analysis. As gene coexpression analysis has been studied extensively, we present most parts of the methodology in a clear manner and the reasoning behind the selection of some of the techniques. In this review, we offer a comprehensive and comprehensible account of the steps required for performing a complete gene coexpression analysis in eukaryotic organisms. We comment on the use of RNA-Seq vs. microarrays, as well as the best practices for GCN construction. Furthermore, we recount the most popular webtools and standalone applications performing gene coexpression analysis, with details on their methods, features and outputs.
Collapse
|
6
|
Zogopoulos VL, Saxami G, Malatras A, Angelopoulou A, Jen CH, Duddy WJ, Daras G, Hatzopoulos P, Westhead DR, Michalopoulos I. Arabidopsis Coexpression Tool: a tool for gene coexpression analysis in Arabidopsis thaliana. iScience 2021; 24:102848. [PMID: 34381973 PMCID: PMC8334378 DOI: 10.1016/j.isci.2021.102848] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2021] [Revised: 06/23/2021] [Accepted: 07/08/2021] [Indexed: 02/08/2023] Open
Abstract
Gene coexpression analysis refers to the discovery of sets of genes which exhibit similar expression patterns across multiple transcriptomic data sets, such as microarray experiment data of public repositories. Arabidopsis Coexpression Tool (ACT), a gene coexpression analysis web tool for Arabidopsis thaliana, identifies genes which are correlated to a driver gene. Primary microarray data from ATH1 Affymetrix platform were processed with Single-Channel Array Normalization algorithm and combined to produce a coexpression tree which contains ∼21,000 A. thaliana genes. ACT was developed to present subclades of coexpressed genes, as well as to perform gene set enrichment analysis, being unique in revealing enriched transcription factors targeting coexpressed genes. ACT offers a simple and user-friendly interface producing working hypotheses which can be experimentally verified for the discovery of gene partnership, pathway membership, and transcriptional regulation. ACT analyses have been successful in identifying not only genes with coordinated ubiquitous expressions but also genes with tissue-specific expressions.
Collapse
Affiliation(s)
- Vasileios L. Zogopoulos
- Centre of Systems Biology, Biomedical Research Foundation, Academy of Athens, Athens 11527, Greece
| | - Georgia Saxami
- Centre of Systems Biology, Biomedical Research Foundation, Academy of Athens, Athens 11527, Greece
| | - Apostolos Malatras
- Center for Research in Myology, Sorbonne Université, Paris 75013, France
| | - Antonia Angelopoulou
- Department of Biotechnology, Agricultural University of Athens, Athens 11855, Greece
| | - Chih-Hung Jen
- Cold Spring Biotech Corp, Da Hu Science Park, New Taipei City, Taiwan
| | - William J. Duddy
- Center for Research in Myology, Sorbonne Université, Paris 75013, France
- Northern Ireland Centre for Stratified Medicine, Altnagelvin Hospital Campus, Ulster University, Londonderry BT52 1SJ, UK
| | - Gerasimos Daras
- Department of Biotechnology, Agricultural University of Athens, Athens 11855, Greece
| | | | - David R. Westhead
- School of Molecular and Cellular Biology, Faculty of Biological Sciences, University of Leeds, Leeds LS2 9JT, UK
| | - Ioannis Michalopoulos
- Centre of Systems Biology, Biomedical Research Foundation, Academy of Athens, Athens 11527, Greece
| |
Collapse
|
7
|
Baltoumas FA, Zafeiropoulou S, Karatzas E, Koutrouli M, Thanati F, Voutsadaki K, Gkonta M, Hotova J, Kasionis I, Hatzis P, Pavlopoulos GA. Biomolecule and Bioentity Interaction Databases in Systems Biology: A Comprehensive Review. Biomolecules 2021; 11:1245. [PMID: 34439912 PMCID: PMC8391349 DOI: 10.3390/biom11081245] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2021] [Revised: 08/16/2021] [Accepted: 08/18/2021] [Indexed: 02/06/2023] Open
Abstract
Technological advances in high-throughput techniques have resulted in tremendous growth of complex biological datasets providing evidence regarding various biomolecular interactions. To cope with this data flood, computational approaches, web services, and databases have been implemented to deal with issues such as data integration, visualization, exploration, organization, scalability, and complexity. Nevertheless, as the number of such sets increases, it is becoming more and more difficult for an end user to know what the scope and focus of each repository is and how redundant the information between them is. Several repositories have a more general scope, while others focus on specialized aspects, such as specific organisms or biological systems. Unfortunately, many of these databases are self-contained or poorly documented and maintained. For a clearer view, in this article we provide a comprehensive categorization, comparison and evaluation of such repositories for different bioentity interaction types. We discuss most of the publicly available services based on their content, sources of information, data representation methods, user-friendliness, scope and interconnectivity, and we comment on their strengths and weaknesses. We aim for this review to reach a broad readership varying from biomedical beginners to experts and serve as a reference article in the field of Network Biology.
Collapse
Affiliation(s)
- Fotis A. Baltoumas
- Institute for Fundamental Biomedical Research, Biomedical Sciences Research Center “Alexander Fleming”, 16672 Vari, Greece; (S.Z.); (E.K.); (M.K.); (F.T.); (K.V.); (M.G.); (J.H.); (I.K.); (P.H.)
| | - Sofia Zafeiropoulou
- Institute for Fundamental Biomedical Research, Biomedical Sciences Research Center “Alexander Fleming”, 16672 Vari, Greece; (S.Z.); (E.K.); (M.K.); (F.T.); (K.V.); (M.G.); (J.H.); (I.K.); (P.H.)
| | - Evangelos Karatzas
- Institute for Fundamental Biomedical Research, Biomedical Sciences Research Center “Alexander Fleming”, 16672 Vari, Greece; (S.Z.); (E.K.); (M.K.); (F.T.); (K.V.); (M.G.); (J.H.); (I.K.); (P.H.)
| | - Mikaela Koutrouli
- Institute for Fundamental Biomedical Research, Biomedical Sciences Research Center “Alexander Fleming”, 16672 Vari, Greece; (S.Z.); (E.K.); (M.K.); (F.T.); (K.V.); (M.G.); (J.H.); (I.K.); (P.H.)
- Novo Nordisk Foundation Center for Protein Research, University of Copenhagen, 2200 Copenhagen, Denmark
| | - Foteini Thanati
- Institute for Fundamental Biomedical Research, Biomedical Sciences Research Center “Alexander Fleming”, 16672 Vari, Greece; (S.Z.); (E.K.); (M.K.); (F.T.); (K.V.); (M.G.); (J.H.); (I.K.); (P.H.)
| | - Kleanthi Voutsadaki
- Institute for Fundamental Biomedical Research, Biomedical Sciences Research Center “Alexander Fleming”, 16672 Vari, Greece; (S.Z.); (E.K.); (M.K.); (F.T.); (K.V.); (M.G.); (J.H.); (I.K.); (P.H.)
| | - Maria Gkonta
- Institute for Fundamental Biomedical Research, Biomedical Sciences Research Center “Alexander Fleming”, 16672 Vari, Greece; (S.Z.); (E.K.); (M.K.); (F.T.); (K.V.); (M.G.); (J.H.); (I.K.); (P.H.)
| | - Joana Hotova
- Institute for Fundamental Biomedical Research, Biomedical Sciences Research Center “Alexander Fleming”, 16672 Vari, Greece; (S.Z.); (E.K.); (M.K.); (F.T.); (K.V.); (M.G.); (J.H.); (I.K.); (P.H.)
| | - Ioannis Kasionis
- Institute for Fundamental Biomedical Research, Biomedical Sciences Research Center “Alexander Fleming”, 16672 Vari, Greece; (S.Z.); (E.K.); (M.K.); (F.T.); (K.V.); (M.G.); (J.H.); (I.K.); (P.H.)
| | - Pantelis Hatzis
- Institute for Fundamental Biomedical Research, Biomedical Sciences Research Center “Alexander Fleming”, 16672 Vari, Greece; (S.Z.); (E.K.); (M.K.); (F.T.); (K.V.); (M.G.); (J.H.); (I.K.); (P.H.)
- Center for New Biotechnologies and Precision Medicine, School of Medicine, National and Kapodistrian University of Athens, 11527 Athens, Greece
| | - Georgios A. Pavlopoulos
- Institute for Fundamental Biomedical Research, Biomedical Sciences Research Center “Alexander Fleming”, 16672 Vari, Greece; (S.Z.); (E.K.); (M.K.); (F.T.); (K.V.); (M.G.); (J.H.); (I.K.); (P.H.)
- Center for New Biotechnologies and Precision Medicine, School of Medicine, National and Kapodistrian University of Athens, 11527 Athens, Greece
| |
Collapse
|
8
|
Zhu W, Lévy-Leduc C, Ternès N. A variable selection approach for highly correlated predictors in high-dimensional genomic data. Bioinformatics 2021; 37:2238-2244. [PMID: 33617644 DOI: 10.1093/bioinformatics/btab114] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2020] [Revised: 01/25/2021] [Accepted: 02/18/2021] [Indexed: 11/14/2022] Open
Abstract
MOTIVATION In genomic studies, identifying biomarkers associated with a variable of interest is a major concern in biomedical research. Regularized approaches are classically used to perform variable selection in high-dimensional linear models. However, these methods can fail in highly correlated settings. RESULTS We propose a novel variable selection approach called WLasso, taking these correlations into account. It consists in rewriting the initial high-dimensional linear model to remove the correlation between the biomarkers (predictors) and in applying the generalized Lasso criterion. The performance of WLasso is assessed using synthetic data in several scenarios and compared with recent alternative approaches. The results show that when the biomarkers are highly correlated, WLasso outperforms the other approaches in sparse high-dimensional frameworks. The method is also illustrated on publicly available gene expression data in breast cancer. AVAILABILITY Our method is implemented in the WLasso R package which is available from the Comprehensive R Archive Network (CRAN). SUPPLEMENTARY INFORMATION Supplementary material is available at Bioinformatics online.
Collapse
Affiliation(s)
- Wencan Zhu
- UMR MIA-Paris, AgroParisTech, INRAE-Université Paris-Saclay, Paris, 75005, France.,Biostatistics and Programming department, Sanofi R&D, Chilly Mazarin, 91380, France
| | - Céline Lévy-Leduc
- UMR MIA-Paris, AgroParisTech, INRAE-Université Paris-Saclay, Paris, 75005, France
| | - Nils Ternès
- Biostatistics and Programming department, Sanofi R&D, Chilly Mazarin, 91380, France
| |
Collapse
|
9
|
Zhai S, Xiao Y, Tang Y, Wan Q, Guo S. Transcriptome of Edwardsiella anguillarum in vivo and in vitro revealed two-component system, ABC transporter and flagellar assembly are three pathways pathogenic to European eel (Anguilla anguilla). Microb Pathog 2021; 153:104801. [PMID: 33610715 DOI: 10.1016/j.micpath.2021.104801] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2020] [Revised: 02/09/2021] [Accepted: 02/10/2021] [Indexed: 02/07/2023]
Abstract
Edwardsiella anguillarum is one of the common bacterial pathogens for the cultivated eels in China. The aim of this study was to reveal the cause of E. anguillarum pathogenic to European eel (Anguilla anguilla) from the perspective of the transcriptome. In this study, we first prepared E. anguillarum cultured in vitro and analysed the whole transcriptome after extracting the total RNA. Then, eels were i.p injected with E. anguillarum, and total RNA were extracted from the liver of European eels 48 h after the infection. After sequencing the transcriptome, we obtained average 1.97 × 108 clean reads cultured in vitro and 1.36 × 105 clean reads located in vivo after annotating all reads into the genome of E. anguillarum. The whole transcriptome showed, compared to the E. anguillarum cultured in vitro, 503 significantly up and 657 significantly down-regulated different expressed genes (DEGs) were observed. KEGG analysis showed that 38 DEGs of Two-Component System, 41 DEGs of ABC transporter, and 10 DEGs flagellar assembly pathways were highly upregulated in E. anguillarum located in vivo. Then, we designed primers to analyse the up-regulated DEGs through qRT-PCR and confirmed some up-regulated DEGs. The results of this study provide important reference for the further study of pathogen-host interaction between E. anguillarum and European eel.
Collapse
Affiliation(s)
- Shaowei Zhai
- Jimei University Fisheries College / Engineering Research Center of the Modern Industry Technology for Eel. Ministry of Education of PR China, Xiamen, 361021, China
| | - YiQun Xiao
- Jimei University Fisheries College / Engineering Research Center of the Modern Industry Technology for Eel. Ministry of Education of PR China, Xiamen, 361021, China
| | - YiJun Tang
- Department of Chemistry, University of Wisconsin Oshkosh, 800 Algoma Blvd., Oshkosh, WI, USA
| | - Qijuan Wan
- Jimei University Fisheries College / Engineering Research Center of the Modern Industry Technology for Eel. Ministry of Education of PR China, Xiamen, 361021, China
| | - Songlin Guo
- Jimei University Fisheries College / Engineering Research Center of the Modern Industry Technology for Eel. Ministry of Education of PR China, Xiamen, 361021, China.
| |
Collapse
|
10
|
Massi MC, Gasperoni F, Ieva F, Paganoni AM, Zunino P, Manzoni A, Franco NR, Veldeman L, Ost P, Fonteyne V, Talbot CJ, Rattay T, Webb A, Symonds PR, Johnson K, Lambrecht M, Haustermans K, De Meerleer G, de Ruysscher D, Vanneste B, Van Limbergen E, Choudhury A, Elliott RM, Sperk E, Herskind C, Veldwijk MR, Avuzzi B, Giandini T, Valdagni R, Cicchetti A, Azria D, Jacquet MPF, Rosenstein BS, Stock RG, Collado K, Vega A, Aguado-Barrera ME, Calvo P, Dunning AM, Fachal L, Kerns SL, Payne D, Chang-Claude J, Seibold P, West CML, Rancati T. A Deep Learning Approach Validates Genetic Risk Factors for Late Toxicity After Prostate Cancer Radiotherapy in a REQUITE Multi-National Cohort. Front Oncol 2020; 10:541281. [PMID: 33178576 PMCID: PMC7593843 DOI: 10.3389/fonc.2020.541281] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2020] [Accepted: 09/02/2020] [Indexed: 12/23/2022] Open
Abstract
Background: REQUITE (validating pREdictive models and biomarkers of radiotherapy toxicity to reduce side effects and improve QUalITy of lifE in cancer survivors) is an international prospective cohort study. The purpose of this project was to analyse a cohort of patients recruited into REQUITE using a deep learning algorithm to identify patient-specific features associated with the development of toxicity, and test the approach by attempting to validate previously published genetic risk factors. Methods: The study involved REQUITE prostate cancer patients treated with external beam radiotherapy who had complete 2-year follow-up. We used five separate late toxicity endpoints: ≥grade 1 late rectal bleeding, ≥grade 2 urinary frequency, ≥grade 1 haematuria, ≥ grade 2 nocturia, ≥ grade 1 decreased urinary stream. Forty-three single nucleotide polymorphisms (SNPs) already reported in the literature to be associated with the toxicity endpoints were included in the analysis. No SNP had been studied before in the REQUITE cohort. Deep Sparse AutoEncoders (DSAE) were trained to recognize features (SNPs) identifying patients with no toxicity and tested on a different independent mixed population including patients without and with toxicity. Results: One thousand, four hundred and one patients were included, and toxicity rates were: rectal bleeding 11.7%, urinary frequency 4%, haematuria 5.5%, nocturia 7.8%, decreased urinary stream 17.1%. Twenty-four of the 43 SNPs that were associated with the toxicity endpoints were validated as identifying patients with toxicity. Twenty of the 24 SNPs were associated with the same toxicity endpoint as reported in the literature: 9 SNPs for urinary symptoms and 11 SNPs for overall toxicity. The other 4 SNPs were associated with a different endpoint. Conclusion: Deep learning algorithms can validate SNPs associated with toxicity after radiotherapy for prostate cancer. The method should be studied further to identify polygenic SNP risk signatures for radiotherapy toxicity. The signatures could then be included in integrated normal tissue complication probability models and tested for their ability to personalize radiotherapy treatment planning.
Collapse
Affiliation(s)
- Michela Carlotta Massi
- Modelling and Scientific Computing Laboratory, Math Department, Politecnico di Milano, Milan, Italy
- Center for Analysis, Decisions and Society, Human Technopole, Milan, Italy
| | - Francesca Gasperoni
- Medical Research Council-Biostatistic Unit, University of Cambridge, Cambridge, United Kingdom
| | - Francesca Ieva
- Modelling and Scientific Computing Laboratory, Math Department, Politecnico di Milano, Milan, Italy
- Center for Analysis, Decisions and Society, Human Technopole, Milan, Italy
- CHRP-National Center for Healthcare Research and Pharmacoepidemiology, University of Milano-Bicocca, Milan, Italy
| | - Anna Maria Paganoni
- Modelling and Scientific Computing Laboratory, Math Department, Politecnico di Milano, Milan, Italy
- Center for Analysis, Decisions and Society, Human Technopole, Milan, Italy
- CHRP-National Center for Healthcare Research and Pharmacoepidemiology, University of Milano-Bicocca, Milan, Italy
| | - Paolo Zunino
- Modelling and Scientific Computing Laboratory, Math Department, Politecnico di Milano, Milan, Italy
| | - Andrea Manzoni
- Modelling and Scientific Computing Laboratory, Math Department, Politecnico di Milano, Milan, Italy
| | - Nicola Rares Franco
- Modelling and Scientific Computing Laboratory, Math Department, Politecnico di Milano, Milan, Italy
| | - Liv Veldeman
- Department of Human Structure and Repair, Ghent University, Ghent, Belgium
- Department of Radiation Oncology, Ghent University Hospital, Ghent, Belgium
| | - Piet Ost
- Department of Human Structure and Repair, Ghent University, Ghent, Belgium
- Department of Radiation Oncology, Ghent University Hospital, Ghent, Belgium
| | - Valérie Fonteyne
- Department of Human Structure and Repair, Ghent University, Ghent, Belgium
- Department of Radiation Oncology, Ghent University Hospital, Ghent, Belgium
| | - Christopher J. Talbot
- Leicester Cancer Research Centre, Department of Genetics and Genome Biology, University of Leicester, Leicester, United Kingdom
| | - Tim Rattay
- Leicester Cancer Research Centre, Department of Genetics and Genome Biology, University of Leicester, Leicester, United Kingdom
| | - Adam Webb
- Leicester Cancer Research Centre, Department of Genetics and Genome Biology, University of Leicester, Leicester, United Kingdom
| | - Paul R. Symonds
- Leicester Cancer Research Centre, Department of Genetics and Genome Biology, University of Leicester, Leicester, United Kingdom
| | - Kerstie Johnson
- Leicester Cancer Research Centre, Department of Genetics and Genome Biology, University of Leicester, Leicester, United Kingdom
| | - Maarten Lambrecht
- Department of Radiation Oncology, University Hospitals Leuven, Leuven, Belgium
| | - Karin Haustermans
- Department of Radiation Oncology, University Hospitals Leuven, Leuven, Belgium
| | - Gert De Meerleer
- Department of Radiation Oncology, University Hospitals Leuven, Leuven, Belgium
| | - Dirk de Ruysscher
- Maastricht University Medical Center, Maastricht, Netherlands
- Department of Radiation Oncology (Maastro), GROW Institute for Oncology and Developmental Biology, Maastricht, Netherlands
| | - Ben Vanneste
- Department of Radiation Oncology (Maastro), GROW Institute for Oncology and Developmental Biology, Maastricht, Netherlands
| | - Evert Van Limbergen
- Maastricht University Medical Center, Maastricht, Netherlands
- Department of Radiation Oncology (Maastro), GROW Institute for Oncology and Developmental Biology, Maastricht, Netherlands
| | - Ananya Choudhury
- Translational Radiobiology Group, Division of Cancer Sciences, Manchester Academic Health Science Centre, Christie Hospital, University of Manchester, Manchester, United Kingdom
| | - Rebecca M. Elliott
- Translational Radiobiology Group, Division of Cancer Sciences, Manchester Academic Health Science Centre, Christie Hospital, University of Manchester, Manchester, United Kingdom
| | - Elena Sperk
- Department of Radiation Oncology, Universitätsmedizin Mannheim, Medical Faculty Mannheim, Heidelberg University, Mannheim, Germany
| | - Carsten Herskind
- Department of Radiation Oncology, Universitätsmedizin Mannheim, Medical Faculty Mannheim, Heidelberg University, Mannheim, Germany
| | - Marlon R. Veldwijk
- Department of Radiation Oncology, Universitätsmedizin Mannheim, Medical Faculty Mannheim, Heidelberg University, Mannheim, Germany
| | - Barbara Avuzzi
- Department of Radiation Oncology 1, Fondazione IRCCS Istituto Nazionale dei Tumori, Milan, Italy
| | - Tommaso Giandini
- Department of Medical Physics, Fondazione IRCCS Istituto Nazionale dei Tumori, Milan, Italy
| | - Riccardo Valdagni
- Department of Radiation Oncology 1, Fondazione IRCCS Istituto Nazionale dei Tumori, Milan, Italy
- Department of Oncology and Haemato-Oncology, University of Milan, Milan, Italy
- Prostate Cancer Program, Fondazione IRCCS Istituto Nazionale dei Tumori, Milan, Italy
| | - Alessandro Cicchetti
- Prostate Cancer Program, Fondazione IRCCS Istituto Nazionale dei Tumori, Milan, Italy
| | - David Azria
- Department of Radiation Oncology, University Federation of Radiation Oncology, Montpellier Cancer Institute, Univ Montpellier MUSE, Grant INCa_Inserm_DGOS_12553, Inserm U1194, Montpellier, France
| | - Marie-Pierre Farcy Jacquet
- Department of Radiation Oncology, University Federation of Radiation Oncology, CHU Caremeau, Nîmes, France
| | - Barry S. Rosenstein
- Department of Radiation Oncology, Icahn School of Medicine at Mount Sinai, New York, NY, United States
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, United States
| | - Richard G. Stock
- Department of Radiation Oncology, Icahn School of Medicine at Mount Sinai, New York, NY, United States
| | - Kayla Collado
- Department of Radiation Oncology, Icahn School of Medicine at Mount Sinai, New York, NY, United States
| | - Ana Vega
- Fundación Pública Galega de Medicina Xenómica, Grupo de Medicina Xenómica (USC), Santiago de Compostela, Spain
- Instituto de Investigación Sanitaria de Santiago de Compostela, Santiago de Compostela, Spain
- Biomedical Network on Rare Diseases (CIBERER), Madrid, Spain
| | - Miguel Elías Aguado-Barrera
- Fundación Pública Galega de Medicina Xenómica, Grupo de Medicina Xenómica (USC), Santiago de Compostela, Spain
- Instituto de Investigación Sanitaria de Santiago de Compostela, Santiago de Compostela, Spain
| | - Patricia Calvo
- Instituto de Investigación Sanitaria de Santiago de Compostela, Santiago de Compostela, Spain
- Department of Radiation Oncology, Complexo Hospitalario Universitario de Santiago, SERGAS, Santiago de Compostela, Spain
| | - Alison M. Dunning
- Strangeways Research Labs, Department of Oncology, Centre for Cancer Genetic Epidemiology, University of Cambridge, Cambridge, United Kingdom
| | - Laura Fachal
- Strangeways Research Labs, Department of Oncology, Centre for Cancer Genetic Epidemiology, University of Cambridge, Cambridge, United Kingdom
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, United Kingdom
| | - Sarah L. Kerns
- Departments of Radiation Oncology and Surgery, University of Rochester Medical Center, Rochester, New York, NY, United States
| | - Debbie Payne
- Centre for Integrated Genomic Medical Research (CIGMR), University of Manchester, Manchester, United Kingdom
| | - Jenny Chang-Claude
- Division of Cancer Epidemiology, German Cancer Research Center (DKFZ), Heidelberg, Germany
- University Cancer Center Hamburg, University Medical Center Hamburg-Eppendorf, Hamburg, Germany
| | - Petra Seibold
- Division of Cancer Epidemiology, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Catharine M. L. West
- Translational Radiobiology Group, Division of Cancer Sciences, Manchester Academic Health Science Centre, Christie Hospital, University of Manchester, Manchester, United Kingdom
| | - Tiziana Rancati
- Prostate Cancer Program, Fondazione IRCCS Istituto Nazionale dei Tumori, Milan, Italy
| |
Collapse
|
11
|
Malatras A, Michalopoulos I, Duguez S, Butler-Browne G, Spuler S, Duddy WJ. MyoMiner: explore gene co-expression in normal and pathological muscle. BMC Med Genomics 2020; 13:67. [PMID: 32393257 PMCID: PMC7216615 DOI: 10.1186/s12920-020-0712-3] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2019] [Accepted: 04/13/2020] [Indexed: 12/15/2022] Open
Abstract
BACKGROUND High-throughput transcriptomics measures mRNA levels for thousands of genes in a biological sample. Most gene expression studies aim to identify genes that are differentially expressed between different biological conditions, such as between healthy and diseased states. However, these data can also be used to identify genes that are co-expressed within a biological condition. Gene co-expression is used in a guilt-by-association approach to prioritize candidate genes that could be involved in disease, and to gain insights into the functions of genes, protein relations, and signaling pathways. Most existing gene co-expression databases are generic, amalgamating data for a given organism regardless of tissue-type. METHODS To study muscle-specific gene co-expression in both normal and pathological states, publicly available gene expression data were acquired for 2376 mouse and 2228 human striated muscle samples, and separated into 142 categories based on species (human or mouse), tissue origin, age, gender, anatomic part, and experimental condition. Co-expression values were calculated for each category to create the MyoMiner database. RESULTS Within each category, users can select a gene of interest, and the MyoMiner web interface will return all correlated genes. For each co-expressed gene pair, adjusted p-value and confidence intervals are provided as measures of expression correlation strength. A standardized expression-level scatterplot is available for every gene pair r-value. MyoMiner has two extra functions: (a) a network interface for creating a 2-shell correlation network, based either on the most highly correlated genes or from a list of genes provided by the user with the option to include linked genes from the database and (b) a comparison tool from which the users can test whether any two correlation coefficients from different conditions are significantly different. CONCLUSIONS These co-expression analyses will help investigators to delineate the tissue-, cell-, and pathology-specific elements of muscle protein interactions, cell signaling and gene regulation. Changes in co-expression between pathologic and healthy tissue may suggest new disease mechanisms and help define novel therapeutic targets. Thus, MyoMiner is a powerful muscle-specific database for the discovery of genes that are associated with related functions based on their co-expression. MyoMiner is freely available at https://www.sys-myo.com/myominer.
Collapse
Affiliation(s)
- Apostolos Malatras
- Sorbonne Université, Inserm, Institut de Myologie, U974, Center for Research in Myology, 47 Boulevard de l’hôpital, 75013 Paris, France
| | - Ioannis Michalopoulos
- Centre of Systems Biology, Biomedical Research Foundation, Academy of Athens, 4 Soranou Ephessiou St., 11527 Athens, Greece
| | - Stéphanie Duguez
- Sorbonne Université, Inserm, Institut de Myologie, U974, Center for Research in Myology, 47 Boulevard de l’hôpital, 75013 Paris, France
- Northern Ireland Centre for Stratified Medicine, Biomedical Sciences Research Institute, C-TRIC, Altnagelvin Hospital Campus, Glenshane Road, Ulster University, Derry/Londonderry, BT47 6SB UK
| | - Gillian Butler-Browne
- Sorbonne Université, Inserm, Institut de Myologie, U974, Center for Research in Myology, 47 Boulevard de l’hôpital, 75013 Paris, France
| | - Simone Spuler
- Muscle Research Unit, Experimental and Clinical Research Center – a joint cooperation of the Charité Medical Faculty and the Max Delbrück Center for Molecular Medicine, Lindenberger Weg 80, 13125 Berlin, Germany
| | - William J. Duddy
- Sorbonne Université, Inserm, Institut de Myologie, U974, Center for Research in Myology, 47 Boulevard de l’hôpital, 75013 Paris, France
- Northern Ireland Centre for Stratified Medicine, Biomedical Sciences Research Institute, C-TRIC, Altnagelvin Hospital Campus, Glenshane Road, Ulster University, Derry/Londonderry, BT47 6SB UK
| |
Collapse
|
12
|
Wang H, Lengerich BJ, Aragam B, Xing EP. Precision Lasso: accounting for correlations and linear dependencies in high-dimensional genomic data. Bioinformatics 2020; 35:1181-1187. [PMID: 30184048 PMCID: PMC6449749 DOI: 10.1093/bioinformatics/bty750] [Citation(s) in RCA: 86] [Impact Index Per Article: 21.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2017] [Revised: 07/31/2018] [Accepted: 08/31/2018] [Indexed: 11/13/2022] Open
Abstract
MOTIVATION Association studies to discover links between genetic markers and phenotypes are central to bioinformatics. Methods of regularized regression, such as variants of the Lasso, are popular for this task. Despite the good predictive performance of these methods in the average case, they suffer from unstable selections of correlated variables and inconsistent selections of linearly dependent variables. Unfortunately, as we demonstrate empirically, such problematic situations of correlated and linearly dependent variables often exist in genomic datasets and lead to under-performance of classical methods of variable selection. RESULTS To address these challenges, we propose the Precision Lasso. Precision Lasso is a Lasso variant that promotes sparse variable selection by regularization governed by the covariance and inverse covariance matrices of explanatory variables. We illustrate its capacity for stable and consistent variable selection in simulated data with highly correlated and linearly dependent variables. We then demonstrate the effectiveness of the Precision Lasso to select meaningful variables from transcriptomic profiles of breast cancer patients. Our results indicate that in settings with correlated and linearly dependent variables, the Precision Lasso outperforms popular methods of variable selection such as the Lasso, the Elastic Net and Minimax Concave Penalty (MCP) regression. AVAILABILITY AND IMPLEMENTATION Software is available at https://github.com/HaohanWang/thePrecisionLasso. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Haohan Wang
- Language Technologies Institute, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA
| | - Benjamin J Lengerich
- Department of Computer Science, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA
| | - Bryon Aragam
- Department of Machine Learning, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA
| | - Eric P Xing
- Language Technologies Institute, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA.,Department of Computer Science, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA.,Department of Machine Learning, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA
| |
Collapse
|
13
|
Maj C, Azevedo T, Giansanti V, Borisov O, Dimitri GM, Spasov S, Lió P, Merelli I. Integration of Machine Learning Methods to Dissect Genetically Imputed Transcriptomic Profiles in Alzheimer's Disease. Front Genet 2019; 10:726. [PMID: 31552082 PMCID: PMC6735530 DOI: 10.3389/fgene.2019.00726] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2019] [Accepted: 07/10/2019] [Indexed: 12/12/2022] Open
Abstract
The genetic component of many common traits is associated with the gene expression and several variants act as expression quantitative loci, regulating the gene expression in a tissue specific manner. In this work, we applied tissue-specific cis-eQTL gene expression prediction models on the genotype of 808 samples including controls, subjects with mild cognitive impairment, and patients with Alzheimer's Disease. We then dissected the imputed transcriptomic profiles by means of different unsupervised and supervised machine learning approaches to identify potential biological associations. Our analysis suggests that unsupervised and supervised methods can provide complementary information, which can be integrated for a better characterization of the underlying biological system. In particular, a variational autoencoder representation of the transcriptomic profiles, followed by a support vector machine classification, has been used for tissue-specific gene prioritizations. Interestingly, the achieved gene prioritizations can be efficiently integrated as a feature selection step for improving the accuracy of deep learning classifier networks. The identified gene-tissue information suggests a potential role for inflammatory and regulatory processes in gut-brain axis related tissues. In line with the expected low heritability that can be apportioned to eQTL variants, we were able to achieve only relatively low prediction capability with deep learning classification models. However, our analysis revealed that the classification power strongly depends on the network structure, with recurrent neural networks being the best performing network class. Interestingly, cross-tissue analysis suggests a potentially greater role of models trained in brain tissues also by considering dementia-related endophenotypes. Overall, the present analysis suggests that the combination of supervised and unsupervised machine learning techniques can be used for the evaluation of high dimensional omics data.
Collapse
Affiliation(s)
- Carlo Maj
- Institute for Genomic Statistics and Bioinformatics, University Hospital Bonn, Bonn, Germany
| | - Tiago Azevedo
- Department of Computer Science and Technology, University of Cambridge, Cambridge, United Kingdom
| | - Valentina Giansanti
- National Research Council, Institute for Biomedical Technologies, Milan, Italy
| | - Oleg Borisov
- Institute for Genomic Statistics and Bioinformatics, University Hospital Bonn, Bonn, Germany
| | - Giovanna Maria Dimitri
- Department of Computer Science and Technology, University of Cambridge, Cambridge, United Kingdom
| | - Simeon Spasov
- Department of Computer Science and Technology, University of Cambridge, Cambridge, United Kingdom
| | | | - Pietro Lió
- Department of Computer Science and Technology, University of Cambridge, Cambridge, United Kingdom
| | - Ivan Merelli
- National Research Council, Institute for Biomedical Technologies, Milan, Italy
| |
Collapse
|
14
|
Omony J, de Jong A, Kok J, van Hijum SAFT. Reconstruction and inference of the Lactococcus lactis MG1363 gene co-expression network. PLoS One 2019; 14:e0214868. [PMID: 31116749 PMCID: PMC6530827 DOI: 10.1371/journal.pone.0214868] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2018] [Accepted: 03/21/2019] [Indexed: 01/30/2023] Open
Abstract
Lactic acid bacteria are Gram-positive bacteria used throughout the world in many industrial applications for their acidification, flavor and texture formation attributes. One of the species, Lactococcus lactis, is employed for the production of fermented milk products like cheese, buttermilk and quark. It ferments lactose to lactic acid and, thus, helps improve the shelf life of the products. Many physiological and transcriptome studies have been performed in L. lactis in order to comprehend and improve its biotechnological assets. Using large amounts of transcriptome data to understand and predict the behavior of biological processes in bacterial or other cell types is a complex task. Gene networks enable predicting gene behavior and function in the context of transcriptionally linked processes. We reconstruct and present the gene co-expression network (GCN) for the most widely studied L. lactis strain, MG1363, using publicly available transcriptome data. Several methods exist to generate and judge the quality of GCNs. Different reconstruction methods lead to networks with varying structural properties, consequently altering gene clusters. We compared the structural properties of the MG1363 GCNs generated by five methods, namely Pearson correlation, Spearman correlation, GeneNet, Weighted Gene Co-expression Network Analysis (WGCNA), and Sparse PArtial Correlation Estimation (SPACE). Using SPACE, we generated an L. lactis MG1363 GCN and assessed its quality using modularity and structural and biological criteria. The L. lactis MG1363 GCN has structural properties similar to those of the gold-standard networks of Escherichia coli K-12 and Bacillus subtilis 168. We showcase that the network can be used to mine for genes with similar expression profiles that are also generally linked to the same biological process.
Collapse
Affiliation(s)
- Jimmy Omony
- Top Institute Food and Nutrition (TIFN), Wageningen, The Netherlands
| | - Anne de Jong
- Top Institute Food and Nutrition (TIFN), Wageningen, The Netherlands
| | - Jan Kok
- Top Institute Food and Nutrition (TIFN), Wageningen, The Netherlands
- * E-mail:
| | | |
Collapse
|
15
|
Omony J, de Jong A, Krawczyk AO, Eijlander RT, Kuipers OP. Dynamic sporulation gene co-expression networks for Bacillus subtilis 168 and the food-borne isolate Bacillus amyloliquefaciens: a transcriptomic model. Microb Genom 2018; 4. [PMID: 29424683 PMCID: PMC5857382 DOI: 10.1099/mgen.0.000157] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
Sporulation is a survival strategy, adapted by bacterial cells in response to harsh environmental adversities. The adaptation potential differs between strains and the variations may arise from differences in gene regulation. Gene networks are a valuable way of studying such regulation processes and establishing associations between genes. We reconstructed and compared sporulation gene co-expression networks (GCNs) of the model laboratory strain Bacillus subtilis 168 and the food-borne industrial isolate Bacillus amyloliquefaciens. Transcriptome data obtained from samples of six stages during the sporulation process were used for network inference. Subsequently, a gene set enrichment analysis was performed to compare the reconstructed GCNs of B. subtilis 168 and B. amyloliquefaciens with respect to biological functions, which showed the enriched modules with coherent functional groups associated with sporulation. On basis of the GCNs and time-evolution of differentially expressed genes, we could identify novel candidate genes strongly associated with sporulation in B. subtilis 168 and B. amyloliquefaciens. The GCNs offer a framework for exploring transcription factors, their targets, and co-expressed genes during sporulation. Furthermore, the methodology described here can conveniently be applied to other species or biological processes.
Collapse
Affiliation(s)
- Jimmy Omony
- 1Laboratory of Molecular Genetics, University of Groningen, 9747 AG Groningen, The Netherlands.,2Top Institute Food and Nutrition (TIFN), Nieuwe Kanaal 9A, 6709 PA Wageningen, The Netherlands
| | - Anne de Jong
- 1Laboratory of Molecular Genetics, University of Groningen, 9747 AG Groningen, The Netherlands.,2Top Institute Food and Nutrition (TIFN), Nieuwe Kanaal 9A, 6709 PA Wageningen, The Netherlands
| | - Antonina O Krawczyk
- 1Laboratory of Molecular Genetics, University of Groningen, 9747 AG Groningen, The Netherlands.,2Top Institute Food and Nutrition (TIFN), Nieuwe Kanaal 9A, 6709 PA Wageningen, The Netherlands
| | - Robyn T Eijlander
- 1Laboratory of Molecular Genetics, University of Groningen, 9747 AG Groningen, The Netherlands.,2Top Institute Food and Nutrition (TIFN), Nieuwe Kanaal 9A, 6709 PA Wageningen, The Netherlands.,3NIZO Food Research, B.V., P.O. Box 20, Ede 6710 BA, Ede, The Netherlands
| | - Oscar P Kuipers
- 1Laboratory of Molecular Genetics, University of Groningen, 9747 AG Groningen, The Netherlands.,2Top Institute Food and Nutrition (TIFN), Nieuwe Kanaal 9A, 6709 PA Wageningen, The Netherlands
| |
Collapse
|
16
|
Wang P, Han W, Ma D. Virtual Sorting Has a Distinctive Advantage in Identification of Anticorrelated Genes and Further Negative Regulators of Immune Cell Subpopulations. THE JOURNAL OF IMMUNOLOGY 2017; 199:4155-4164. [PMID: 29093063 DOI: 10.4049/jimmunol.1700946] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/30/2017] [Accepted: 10/10/2017] [Indexed: 12/20/2022]
Abstract
Immune cells are highly plastic in both gene expression and cell phenotype. We have established a method of gene expressional plasticity and virtual sorting to evaluate immune cell subpopulations and their characteristic genes in human CD4+ T cells. In this study, we continued to investigate the informatics mechanism on the effectiveness of virtual sorting. We found that virtual sorting had an overall positive correlation to the Pearson correlation in the identification of positively correlated genes. However, owing to nonlinear biological anticorrelation, virtual sorting showed a distinctive advantage for anticorrelated genes, suggesting an important role in the identification of negative regulators. In addition, based on virtual sorting results, we identified two basic gene sets among highly plastic genes, i.e., highly plastic cell cycle-associated molecules and highly plastic immune and defense response-associated molecules. Genes within each set tended to be positively connected, but genes between two sets were often anticorrelated. Further analysis revealed preferential transcription factor binding motifs existed between highly plastic cell cycle-associated molecules and highly plastic immune and defense response-associated molecules. Our results strongly suggested predetermined regulation, which was called an immune cell internal phenotype, should exist and could be mined by virtual sorting analysis. This provided efficient functional clues to study immune cell phenotypes and their regulation. Moreover, the current substantial virtual sorting results in both CD4+ T cells and B cells provide a useful resource for big-data-driven experimental studies and knowledge discoveries.
Collapse
Affiliation(s)
- Pingzhang Wang
- Department of Immunology, School of Basic Medical Sciences, Peking University Health Science Center, Beijing 100191, China; .,Peking University Center for Human Disease Genomics, Beijing 100191, China; and .,Key Laboratory of Medical Immunology, Ministry of Health, Beijing 100191, China
| | - Wenling Han
- Department of Immunology, School of Basic Medical Sciences, Peking University Health Science Center, Beijing 100191, China.,Peking University Center for Human Disease Genomics, Beijing 100191, China; and.,Key Laboratory of Medical Immunology, Ministry of Health, Beijing 100191, China
| | - Dalong Ma
- Department of Immunology, School of Basic Medical Sciences, Peking University Health Science Center, Beijing 100191, China.,Peking University Center for Human Disease Genomics, Beijing 100191, China; and.,Key Laboratory of Medical Immunology, Ministry of Health, Beijing 100191, China
| |
Collapse
|
17
|
Pierce S, Coetzee GA. Parkinson's disease-associated genetic variation is linked to quantitative expression of inflammatory genes. PLoS One 2017; 12:e0175882. [PMID: 28407015 PMCID: PMC5391096 DOI: 10.1371/journal.pone.0175882] [Citation(s) in RCA: 42] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2016] [Accepted: 03/31/2017] [Indexed: 12/11/2022] Open
Abstract
Genome-wide association studies (GWAS) have linked dozens of single nucleotide polymorphisms (SNPs) with Parkinson’s disease (PD) risk. Ascertaining the functional and eventual causal mechanisms underlying these relationships has proven difficult. The majority of risk SNPs, and nearby SNPs in linkage disequilibrium (LD), are found in intergenic or intronic regions and confer risk through allele-dependent expression of multiple unknown target genes. Combining GWAS results with publicly available GTEx data, generated through eQTL (expression quantitative trait loci) identification studies, enables a direct association of SNPs to gene expression levels and aids in narrowing the large population of potential genetic targets for hypothesis-driven experimental cell biology. Separately, overlapping of SNPs with putative enhancer segmentations can strengthen target filtering. We report here the results of analyzing 7,607 PD risk SNPs along with an additional 23,759 high linkage disequilibrium-associated variants paired with eQTL gene expression. We found that enrichment analysis on the set of genes following target filtering pointed to a single large LD block at 6p21 that contained multiple HLA-MHC-II genes. These MHC-II genes remain associated with PD when the genes were filtered for correlation between GWAS significance and eQTL levels, strongly indicating a direct effect on PD etiology.
Collapse
Affiliation(s)
- Steven Pierce
- Center for Neurodegenerative Science, Van Andel Research Institute, Grand Rapids, MI, United States
| | - Gerhard A. Coetzee
- Center for Neurodegenerative Science, Van Andel Research Institute, Grand Rapids, MI, United States
- * E-mail:
| |
Collapse
|
18
|
Jia Z, Liu Y, Guan N, Bo X, Luo Z, Barnes MR. Cogena, a novel tool for co-expressed gene-set enrichment analysis, applied to drug repositioning and drug mode of action discovery. BMC Genomics 2016; 17:414. [PMID: 27234029 PMCID: PMC4884357 DOI: 10.1186/s12864-016-2737-8] [Citation(s) in RCA: 42] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2016] [Accepted: 05/11/2016] [Indexed: 01/10/2023] Open
Abstract
BACKGROUND Drug repositioning, finding new indications for existing drugs, has gained much recent attention as a potentially efficient and economical strategy for accelerating new therapies into the clinic. Although improvement in the sensitivity of computational drug repositioning methods has identified numerous credible repositioning opportunities, few have been progressed. Arguably the "black box" nature of drug action in a new indication is one of the main blocks to progression, highlighting the need for methods that inform on the broader target mechanism in the disease context. RESULTS We demonstrate that the analysis of co-expressed genes may be a critical first step towards illumination of both disease pathology and mode of drug action. We achieve this using a novel framework, co-expressed gene-set enrichment analysis (cogena) for co-expression analysis of gene expression signatures and gene set enrichment analysis of co-expressed genes. The cogena framework enables simultaneous, pathway driven, disease and drug repositioning analysis. Cogena can be used to illuminate coordinated changes within disease transcriptomes and identify drugs acting mechanistically within this framework. We illustrate this using a psoriatic skin transcriptome, as an exemplar, and recover two widely used Psoriasis drugs (Methotrexate and Ciclosporin) with distinct modes of action. Cogena out-performs the results of Connectivity Map and NFFinder webservers in similar disease transcriptome analyses. Furthermore, we investigated the literature support for the other top-ranked compounds to treat psoriasis and showed how the outputs of cogena analysis can contribute new insight to support the progression of drugs into the clinic. We have made cogena freely available within Bioconductor or https://github.com/zhilongjia/cogena . CONCLUSIONS In conclusion, by targeting co-expressed genes within disease transcriptomes, cogena offers novel biological insight, which can be effectively harnessed for drug discovery and repositioning, allowing the grouping and prioritisation of drug repositioning candidates on the basis of putative mode of action.
Collapse
Affiliation(s)
- Zhilong Jia
- Department of Chemistry and Biology, College of Science, National University of Defense Technology, Changsha, Hunan, 410073, People's Republic of China
- William Harvey Research Institute, Queen Mary University of London, Charterhouse Square, London, EC1M 6BQ, UK
| | - Ying Liu
- Hunan Key Laboratory of Medical Epigenomics, Department of Dermatology, Second Xiangya Hospital, Central South University, Changsha, Hunan, 410011, People's Republic of China
| | - Naiyang Guan
- College of Computer, National University of Defense Technology, Changsha, 410073, People's Republic of China
- National Laboratory for Parallel and Distributed Processing, National University of Defense Technology, Changsha, 410073, People's Republic of China
| | - Xiaochen Bo
- Beijing Institute of Radiation Medicine, Beijing, 100850, People's Republic of China
| | - Zhigang Luo
- College of Computer, National University of Defense Technology, Changsha, 410073, People's Republic of China.
- National Laboratory for Parallel and Distributed Processing, National University of Defense Technology, Changsha, 410073, People's Republic of China.
| | - Michael R Barnes
- William Harvey Research Institute, Queen Mary University of London, Charterhouse Square, London, EC1M 6BQ, UK.
| |
Collapse
|
19
|
Immuno-Navigator, a batch-corrected coexpression database, reveals cell type-specific gene networks in the immune system. Proc Natl Acad Sci U S A 2016; 113:E2393-402. [PMID: 27078110 DOI: 10.1073/pnas.1604351113] [Citation(s) in RCA: 46] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023] Open
Abstract
High-throughput gene expression data are one of the primary resources for exploring complex intracellular dynamics in modern biology. The integration of large amounts of public data may allow us to examine general dynamical relationships between regulators and target genes. However, obstacles for such analyses are study-specific biases or batch effects in the original data. Here we present Immuno-Navigator, a batch-corrected gene expression and coexpression database for 24 cell types of the mouse immune system. We systematically removed batch effects from the underlying gene expression data and showed that this removal considerably improved the consistency between inferred correlations and prior knowledge. The data revealed widespread cell type-specific correlation of expression. Integrated analysis tools allow users to use this correlation of expression for the generation of hypotheses about biological networks and candidate regulators in specific cell types. We show several applications of Immuno-Navigator as examples. In one application we successfully predicted known regulators of importance in naturally occurring Treg cells from their expression correlation with a set of Treg-specific genes. For one high-scoring gene, integrin β8 (Itgb8), we confirmed an association between Itgb8 expression in forkhead box P3 (Foxp3)-positive T cells and Treg-specific epigenetic remodeling. Our results also suggest that the regulation of Treg-specific genes within Treg cells is relatively independent of Foxp3 expression, supporting recent results pointing to a Foxp3-independent component in the development of Treg cells.
Collapse
|
20
|
MIrExpress: A Database for Gene Coexpression Correlation in Immune Cells Based on Mutual Information and Pearson Correlation. J Immunol Res 2015; 2015:140819. [PMID: 26881263 PMCID: PMC4735977 DOI: 10.1155/2015/140819] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2015] [Accepted: 11/09/2015] [Indexed: 01/08/2023] Open
Abstract
Most current gene coexpression databases support the analysis for linear correlation of gene pairs, but not nonlinear correlation of them, which hinders precisely evaluating the gene-gene coexpression strengths. Here, we report a new database, MIrExpress, which takes advantage of the information theory, as well as the Pearson linear correlation method, to measure the linear correlation, nonlinear correlation, and their hybrid of cell-specific gene coexpressions in immune cells. For a given gene pair or probe set pair input by web users, both mutual information (MI) and Pearson correlation coefficient (r) are calculated, and several corresponding values are reported to reflect their coexpression correlation nature, including MI and r values, their respective rank orderings, their rank comparison, and their hybrid correlation value. Furthermore, for a given gene, the top 10 most relevant genes to it are displayed with the MI, r, or their hybrid perspective, respectively. Currently, the database totally includes 16 human cell groups, involving 20,283 human genes. The expression data and the calculated correlation results from the database are interactively accessible on the web page and can be implemented for other related applications and researches.
Collapse
|
21
|
Skourti E, Logotheti S, Kontos CK, Pavlopoulou A, Dimoragka PT, Trougakos IP, Gorgoulis V, Scorilas A, Michalopoulos I, Zoumpourlis V. Progression of mouse skin carcinogenesis is associated with the orchestrated deregulation of mir-200 family members, mir-205 and their common targets. Mol Carcinog 2015; 55:1229-42. [PMID: 26527515 DOI: 10.1002/mc.22365] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2015] [Revised: 06/24/2015] [Accepted: 06/26/2015] [Indexed: 12/15/2022]
Abstract
MicroRNAs are small, non-coding RNAs which regulate post-transcriptionally hundreds of target mRNAs. Given that their expression is deregulated in several cancer types, they represent potential diagnostic, prognostic, and predictive biomarkers, as well as next-generation therapeutic targets. Nevertheless, the involvement of miRNAs in non-melanoma skin cancer, a cancer type with increasing prevalence, is not extensively studied, and their comprehensive characterization as regard to the initiation, promotion, and progression stages is missing. To this end, we exploited a well-established multistage mouse skin carcinogenesis model in order to identify miRNAs consistently implicated in different stages of skin carcinogenesis. The cell lines comprising this model were subjected to miRNA expression profiling using microarrays, followed by bioinformatics analysis and validation with Q-PCR, as well as treatment with miRNA modulators. We showed that among all deregulated miRNAs in our system, only a functionally coherent group consisting of the miR-200 family members and miR-205-5p displays a pattern of progressive co-downregulation from the early toward the most aggressive stages of carcinogenesis. Their overlapping, co-regulated putative targets are potentially inter-associated and, of these, the EMT-related Rap1a is overexpressed toward aggressive stages. Ectopic expression of miR-205-5p in spindle cancer cells reduces Rap1a, mitigates cell invasiveness, decreases proliferation, and delays tumor onset. We conclude that deregulation of this miRNA group is primarily associated with aggressive phenotypes of skin cancer cells. Restoration of the miR-205-5p member of this group in spindle cells reduces the expression of critical, co-regulated targets that favor cancer progression, thus reversing the EMT characteristics. © 2015 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- Elena Skourti
- Biomedical Applications Unit, Institute of Biology, Medicinal Chemistry and Biotechnology, National Hellenic Research Foundation, Athens, Greece
| | - Stella Logotheti
- Biomedical Applications Unit, Institute of Biology, Medicinal Chemistry and Biotechnology, National Hellenic Research Foundation, Athens, Greece
| | - Christos K Kontos
- Department of Biochemistry and Molecular Biology, Faculty of Biology, University of Athens, Athens, Greece
| | - Athanasia Pavlopoulou
- Computational Biology and Medicine, Centre of Systems Biology, Biomedical Research Foundation, Academy of Athens, Athens, Greece
| | - Paraskevi T Dimoragka
- Computational Biology and Medicine, Centre of Systems Biology, Biomedical Research Foundation, Academy of Athens, Athens, Greece
| | - Ioannis P Trougakos
- Department of Cell Biology and Biophysics, Faculty of Biology, University of Athens, Athens, Greece
| | - Vassilis Gorgoulis
- Laboratory of Histology-Embryology, Molecular Carcinogenesis Group, Medical School, National and Kapodistrian University of Athens, Athens, Greece
| | - Andreas Scorilas
- Department of Biochemistry and Molecular Biology, Faculty of Biology, University of Athens, Athens, Greece
| | - Ioannis Michalopoulos
- Computational Biology and Medicine, Centre of Systems Biology, Biomedical Research Foundation, Academy of Athens, Athens, Greece
| | - Vassilis Zoumpourlis
- Biomedical Applications Unit, Institute of Biology, Medicinal Chemistry and Biotechnology, National Hellenic Research Foundation, Athens, Greece
| |
Collapse
|
22
|
Zhu L, Malatras A, Thorley M, Aghoghogbe I, Mer A, Duguez S, Butler-Browne G, Voit T, Duddy W. CellWhere: graphical display of interaction networks organized on subcellular localizations. Nucleic Acids Res 2015; 43:W571-5. [PMID: 25883154 PMCID: PMC4489307 DOI: 10.1093/nar/gkv354] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2015] [Accepted: 04/02/2015] [Indexed: 01/18/2023] Open
Abstract
Given a query list of genes or proteins, CellWhere produces an interactive graphical display that mimics the structure of a cell, showing the local interaction network organized into subcellular locations. This user-friendly tool helps in the formulation of mechanistic hypotheses by enabling the experimental biologist to explore simultaneously two elements of functional context: (i) protein subcellular localization and (ii) protein–protein interactions or gene functional associations. Subcellular localization terms are obtained from public sources (the Gene Ontology and UniProt—together containing several thousand such terms) then mapped onto a smaller number of CellWhere localizations. These localizations include all major cell compartments, but the user may modify the mapping as desired. Protein–protein interaction listings, and their associated evidence strength scores, are obtained from the Mentha interactome server, or power-users may upload a pre-made network produced using some other interactomics tool. The Cytoscape.js JavaScript library is used in producing the graphical display. Importantly, for a protein that has been observed at multiple subcellular locations, users may prioritize the visual display of locations that are of special relevance to their research domain. CellWhere is at http://cellwhere-myology.rhcloud.com.
Collapse
Affiliation(s)
- Lu Zhu
- Bioinformatics Department, Bielefeld University, Bielefeld, D-33501, Germany Center for Research in Myology, Sorbonne Universités, UPMC Univ Paris 06, INSERM UMRS975, CNRS FRE3617, 47 Boulevard de l'hôpital, 75013 Paris, France
| | - Apostolos Malatras
- Center for Research in Myology, Sorbonne Universités, UPMC Univ Paris 06, INSERM UMRS975, CNRS FRE3617, 47 Boulevard de l'hôpital, 75013 Paris, France
| | - Matthew Thorley
- Center for Research in Myology, Sorbonne Universités, UPMC Univ Paris 06, INSERM UMRS975, CNRS FRE3617, 47 Boulevard de l'hôpital, 75013 Paris, France
| | - Idonnya Aghoghogbe
- Center for Research in Myology, Sorbonne Universités, UPMC Univ Paris 06, INSERM UMRS975, CNRS FRE3617, 47 Boulevard de l'hôpital, 75013 Paris, France Orthopaedics and Musculoskeletal Science, University College London, London, WC1E 6BT, UK
| | - Arvind Mer
- Medical Epidemiology and Biostatistics, Karolinska Institute, Stockholm, SE-17177, Sweden
| | - Stéphanie Duguez
- Center for Research in Myology, Sorbonne Universités, UPMC Univ Paris 06, INSERM UMRS975, CNRS FRE3617, 47 Boulevard de l'hôpital, 75013 Paris, France
| | - Gillian Butler-Browne
- Center for Research in Myology, Sorbonne Universités, UPMC Univ Paris 06, INSERM UMRS975, CNRS FRE3617, 47 Boulevard de l'hôpital, 75013 Paris, France
| | - Thomas Voit
- Center for Research in Myology, Sorbonne Universités, UPMC Univ Paris 06, INSERM UMRS975, CNRS FRE3617, 47 Boulevard de l'hôpital, 75013 Paris, France
| | - William Duddy
- Center for Research in Myology, Sorbonne Universités, UPMC Univ Paris 06, INSERM UMRS975, CNRS FRE3617, 47 Boulevard de l'hôpital, 75013 Paris, France
| |
Collapse
|
23
|
Wang P, Qi H, Song S, Li S, Huang N, Han W, Ma D. ImmuCo: a database of gene co-expression in immune cells. Nucleic Acids Res 2014; 43:D1133-9. [PMID: 25326331 PMCID: PMC4384033 DOI: 10.1093/nar/gku980] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022] Open
Abstract
Current gene co-expression databases and correlation networks do not support cell-specific analysis. Gene co-expression and expression correlation are subtly different phenomena, although both are likely to be functionally significant. Here, we report a new database, ImmuCo (http://immuco.bjmu.edu.cn), which is a cell-specific database that contains information about gene co-expression in immune cells, identifying co-expression and correlation between any two genes. The strength of co-expression of queried genes is indicated by signal values and detection calls, whereas expression correlation and strength are reflected by Pearson correlation coefficients. A scatter plot of the signal values is provided to directly illustrate the extent of co-expression and correlation. In addition, the database allows the analysis of cell-specific gene expression profile across multiple experimental conditions and can generate a list of genes that are highly correlated with the queried genes. Currently, the database covers 18 human cell groups and 10 mouse cell groups, including 20 283 human genes and 20 963 mouse genes. More than 8.6 × 108 and 7.4 × 108 probe set combinations are provided for querying each human and mouse cell group, respectively. Sample applications support the distinctive advantages of the database.
Collapse
Affiliation(s)
- Pingzhang Wang
- Department of Immunology, Key Laboratory of Medical Immunology, Ministry of Health, School of Basic Medical Sciences, Peking University Health Science Center, No. 38 Xueyuan Road, Beijing 100191, China Peking University Center for Human Disease Genomics, No. 38 Xueyuan Road, Beijing 100191, China
| | - Huiying Qi
- Department of Natural Science in Medicine, Peking University Health Science Center, No. 38 Xueyuan Road, Beijing 100191, China
| | - Shibin Song
- Information and Communication Center, Peking University Health Science Center, No. 38 Xueyuan Road, Beijing 100191, China
| | - Shuang Li
- Information and Communication Center, Peking University Health Science Center, No. 38 Xueyuan Road, Beijing 100191, China
| | - Ningyu Huang
- Information and Communication Center, Peking University Health Science Center, No. 38 Xueyuan Road, Beijing 100191, China
| | - Wenling Han
- Department of Immunology, Key Laboratory of Medical Immunology, Ministry of Health, School of Basic Medical Sciences, Peking University Health Science Center, No. 38 Xueyuan Road, Beijing 100191, China Peking University Center for Human Disease Genomics, No. 38 Xueyuan Road, Beijing 100191, China
| | - Dalong Ma
- Department of Immunology, Key Laboratory of Medical Immunology, Ministry of Health, School of Basic Medical Sciences, Peking University Health Science Center, No. 38 Xueyuan Road, Beijing 100191, China Peking University Center for Human Disease Genomics, No. 38 Xueyuan Road, Beijing 100191, China
| |
Collapse
|
24
|
Jayaram A, Pradeep ANR, Awasthi AK, Murthy GN, Ponnuvel KM, Sasibhushan S, Rao GC. Coregulation of host–response genes in integument: switchover of gene expression correlation pattern and impaired immune responses induced by dipteran parasite infection in the silkworm, Bombyx mori. J Appl Genet 2013; 55:209-21. [PMID: 24310719 DOI: 10.1007/s13353-013-0183-8] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2013] [Revised: 09/10/2013] [Accepted: 11/07/2013] [Indexed: 01/13/2023]
Affiliation(s)
- Anitha Jayaram
- Proteomics Division, Seribiotech Research Laboratory, Central Silk Board, Carmelaram. P.O., CSB-Kodathi Campus, Bangalore, 560035, Karnataka, India
| | | | | | | | | | | | | |
Collapse
|